logoalt Hacker News

InsideOutSantayesterday at 8:58 AM2 repliesview on HN

It's really good. I didn't do any type of statistical evaluation or comparison to other models, but it's so good that it doesn't matter to me if there's an option that might be even better.


Replies

potsandpansyesterday at 4:12 PM

This just dropped: https://huggingface.co/baidu/Unlimited-OCR

Which can run comfortably on 12gb of vram. I gave it a whirl and it does seem pretty competitive. I wonder how that compares for your usecase

nok22konyesterday at 9:09 AM

curious if you tried local LLM models for OCR, like a Gemma4, or your volume is too much for that

show 1 reply