It's really good. I didn't do any type of statistical evaluation or comparison to other models, but it's so good that it doesn't matter to me if there's an option that might be even better.
curious if you tried local LLM models for OCR, like a Gemma4, or your volume is too much for that
This just dropped: https://huggingface.co/baidu/Unlimited-OCR
Which can run comfortably on 12gb of vram. I gave it a whirl and it does seem pretty competitive. I wonder how that compares for your usecase