Show HN: I launched a super cheap and simple to use OCR tool for macOS

jedbrooke · 2024-11-04T01:05:08 1730682308

> Are there other solutions out there?

yes you can just do cmd+shift+4 to take a screen shot, then open the screenshot in the popup that appears and MacOS will automatically OCR it (orc button in the bottom right). This is a built in functionality in MacOS

gumboshoes · 2024-11-04T15:48:36 1730735316

I disabled that function because it gives the false illusion that docs and images can be saved with text and then will be indexable and searchable in the Finder and other apps; they are not. When I open a PDF, I need to know that it has native text actually saved in the file. If it doesn't, then I will OCR it so it is for sure indexable and searchable.

selcuka · 2024-11-04T01:25:28 1730683528

Interestingly the macOS one is not very accurate. I took a screenshot of your comment and macOS OCR read the "cmd+shift+4" as "cod+shift+4".

c0wb0yc0d3r · 2024-11-04T14:24:12 1730730252

I wonder why that is? Could it mean that Apple trained their ocr tool to favor nontechnical text. Meaning the tool determined that “cod” was more likely than “cmd”

Interestingly, iOS corrected “cmd” to “cod” when I first typed it out.

Wowfunhappy · 2024-11-04T01:29:22 1730683762

The thing is, if the linked app is using Apple's Vision API, it will perform the same.

selcuka · 2024-11-04T01:36:46 1730684206

Good point. From the list of supported languages [1] it looks like it is in fact using the Vision API in fast mode (as accurate mode seems to support more languages).

[1] https://www.textcapture.app/#faq

wingerlang · 2024-11-04T05:06:22 1730696782

It correctly OCR'd it for me.

blacksmith_tb · 2024-11-04T04:09:58 1730693398

I have been using this one for quite a while, it works well for me:

https://github.com/schappim/macOCR

(I'd say my number one use is snagging urls out of Zoom presentations, quicker and easier than a screenshot)

joshdavham · 2024-11-04T01:20:08 1730683208

Agreed. But I do wonder if this product provides a better enough UX to be worth it’s current price. In my case, it doesn’t support the languages I use so I’ll be sticking with the default Mac feature.

constantlm · 2024-11-04T02:05:37 1730685937

I've been doing this for a while and find that the OCR performance is fantastic.

jwells89 · 2024-11-04T01:12:51 1730682771

Works for images in Preview and even in Safari too. Super handy.

evilduck · 2024-11-04T01:23:45 1730683425

Works in Photos.app for searching for text in your photo albums too.

macOS OCR behavior extends to most similar things in iOS too.

bobnamob · 2024-11-04T01:46:07 1730684767

Which makes Photos.app a surprisingly good recipe book.

evilduck · 2024-11-04T15:02:01 1730732521

Also as a rolodex. I just take pictures of business cards and you can long press to OCR the phone number and dial from that immediately with no need to even create contact entries unless it becomes a repeat relationship, and if you do, you can usually insta-create the contact card in full with just a long press on the image.

frizlab · 2024-11-04T06:36:09 1730702169

You can even search for text in images in Safari. I was dumbfounded the first time I searched for some text in a page and Safari found it in an image on the page.

vundercind · 2024-11-04T02:07:05 1730686025

The moment I realized this was now a table-stakes feature for a GUI OS, for me, was when I’d been reading and copy-pasting from an image for a couple minutes before realizing it wasn’t a PDF.

scratchyone · 2024-11-04T00:59:20 1730681960

This looks wonderful! Just a small heads up, you have a meta tag listing @marc_louvion as the creator (assuming this landing page is built on one of his templates?). I figure you may want to update that so it has your info instead.

  <meta name="twitter:creator" content="@marc_louvion">

auden_pierce · 2024-11-04T23:21:35 1730762495

Thanks!

wodenokoto · 2024-11-04T05:19:03 1730697543

If you have installed Microsofts Power Toys on Windows [1], you can win+shift+T and select any area on screen and windows will OCR it and store it on your clipboard.

It's not SOTA AI powered OCR, but works great for copying a link on a streamed tech talk or text from an application / website that tries to make text not-selectable.

[1] https://learn.microsoft.com/en-us/windows/powertoys/

scosman · 2024-11-04T01:04:10 1730682250

What makes it better than screenshot and Preview? The built in OCR is pretty great on MacOS.

Aaron2222 · 2024-11-04T05:40:04 1730698804

If it's something you'd have to screenshot to use OCR on (i.e. it doesn't just let you directly select the text), this (and the other options) is a bit faster than having to take a screenshot then select the text from it (you select the region like when taking a screenshot and the text is OCRed and copied to the pasteboard in one go).

gumboshoes · 2024-11-04T13:34:08 1730727248

I use EasyDict, which also does translations with multiple services. Open source. https://github.com/tisfeng/Easydict

jitl · 2024-11-04T01:10:09 1730682609

The system does this automatically on macOS and iOS in screenshots and stuff.

eevmanu · 2024-11-04T02:43:45 1730688225

Any decent alternative for Linux or Ubuntu-based OS? Thanks.

danpla · 2024-11-04T07:19:58 1730704798

Try dpScreenOCR:

https://danpla.github.io/dpscreenocr/

rammer · 2024-11-04T01:10:54 1730682654

Anyone know of good windows alternatives for this?

crtasm · 2024-11-04T01:16:33 1730682993

PowerToys https://learn.microsoft.com/en-us/windows/powertoys/text-ext...

hu3 · 2024-11-04T01:17:52 1730683072

Windows 11 native screenshot tool does OCR for me.

You can also get it free with PowerToys from Microsoft and press WIN+SHIFT+T.

wodenokoto · 2024-11-04T05:24:11 1730697851

> Windows 11 native screenshot tool does OCR for me.

The win+shift+s command / snip & sketch tool? It doesn't appear to have any OCR option before or after capturing

hu3 · 2024-11-04T13:34:04 1730727244

After taking a screenshot with Win+Shift+S, it displays a notification with the preview of the image for me, on the lower right corner of the screen.

When I click that preview, it opens the "Snipping Tool":

https://support.microsoft.com/en-us/windows/use-snipping-too...

In this tool there's a button that does OCR: https://i.imgur.com/GtYUvSS.png

There's also Power Tools, which is another free option, also from Microsoft.

I hope it helps. Let me know if it worked for you or not.

mdrzn · 2024-11-04T07:59:35 1730707175

The screenshot and the OCR are two different commands and keybinds. Check Powertoys.

seltzered_ · 2024-11-04T01:14:33 1730682873

https://screenotate.com/ is one windows example

BOOSTERHIDROGEN · 2024-11-04T02:04:00 1730685840

Snipping Tool

danpla · 2024-11-04T07:22:24 1730704944

dpScreenOCR

counternotions · 2024-11-04T02:48:30 1730688510

Cleanshot has OCR built in as a feature too

dmitrygr · 2024-11-04T02:35:02 1730687702

This is built into the OS itself. I don’t get it. What am I missing? I can select text in any image or screenshot seamlessly and very accurately, for $0.00 up front and $0.00 per month.

eviks · 2024-11-04T06:14:15 1730700855

Native OS is limited to a few apps, so not seamless?

dmitrygr · 2024-11-04T07:19:08 1730704748

Screenshots auto-do OCR and you can screenshot anything.

eviks · 2024-11-04T07:52:08 1730706728

Yes, screenshots is one of those apps, but it's not seamless: seamless "auto-do OCR" is selecting an area on the screen and getting text in your clipboard without other side effects. So no extra screenshot files created, no need to navigate another interface to select text in the screenshot

lucasllinasm · 2024-11-04T00:20:25 1730679625

Neat and quick—what are the next couple of features you'd incorporate?

auden_pierce · 2024-11-04T23:24:52 1730762692

Thanks! Probably more languages, and barcode/QR code detection. I'm currently collecting feedback, would love to hear your suggestions -> https://insigh.to/b/textcapture