logoalt Hacker News

HPsquaredyesterday at 4:24 PM1 replyview on HN

You could maybe then do a second pass on the whole text (as plain text not OCR) to look for likely mistakes.


Replies

kergonathyesterday at 5:03 PM

This is not always easy. The models I tried were too helpful and rewrote too much instead of fixing simple typos. When I tried I ended up with huge prompts and I still found sentences where the LLM was too enthusiastic. I ended up applying regexes with common typos and accepted some residual errors. It might be better now, though. But since then I’ve moved to all-in-one solutions like Mathpix and Mistral-OCR which are quite good for my purpose.