Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Not op, but I wrote something equivalent based on a linux script that I found [1]. Here's the mac version of the script [2]. You need tesseract [3] installed for the OCR. For launching it using a keyboard shortcut I use a hammerspoon [4]. Example of how I bind it [5]

[1] https://github.com/sdushantha/bin/blob/master/utils/ocr

[2] https://github.com/JJGO/dotfiles/blob/master/scripts/bin/ocr

[3] https://pypi.org/project/pytesseract/

[4] http://www.hammerspoon.org/

[5] https://github.com/JJGO/dotfiles/blob/master/mac/.hammerspoo...



Nice, I started with tesseract, but moved to Google Vision as the quality was better.

Azure's offering was better for hand written notes.


Interesting, that doesn't really work for me as I sometimes use it to OCR sensitive stuff from scanned documents, and it wouldn't work offline. I found tesseract to work pretty reliably for my use and only fail when scanning non-ASCII characters.


Yeah, I only use it for things where the context wouldn't matter. For example grabbing an account number from a document.

I've been experimenting w/ CoreImage detectors (mainly for categorisation of scanned documents based on QR codes).

When I clean things up today, I'll see what can be used.




Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: