
Not sure it qualifies, but I've started building an Android app that wraps bergamot[0] (the Firefox translation models) to provide on-device translation without relying on Google.

Bergamot is already used inside Firefox, but I also wanted translation outside the browser.

[0]: bergamot https://github.com/browsermt/bergamot-translator




I would be very interested if anyone is aware of small/tiny models that perform OCR, so the app can translate pictures as well.


MiniCPM-V 2.6 isn't that small (8B), but it can do this.

Here is a demo.

* https://i.imgur.com/pAuTeAf.jpeg

Using this script:

* https://github.com/jabberjabberjabber/LLMOCR/



