Off the top of my head: for a lot of OCR tasks, it’s kind of worse for the model to be smart. I don’...

amluto • yesterday at 4:15 PM • 2 replies • view on HN

Off the top of my head: for a lot of OCR tasks, it’s kind of worse for the model to be smart. I don’t want my OCR to make stuff up or answer questions — I want to to recognize what is actually on the page.

Replies

retrac • yesterday at 9:11 PM

Sometimes what is on the page is ambiguous. Imagine a scan where the dot over the i is missing in a word like "this". What's on the page is "thls" but to transcribe it that way would be an error outside of forensic contexts.

I am reminded it's basically impossible to read cursive writing in a language you don't know even if it's the same alphabet.

rdos • yesterday at 4:26 PM

Interesting. Won't stuff like entity extraction suffer? Especially in multilingual use cases. My worry is that a smaller model might not realize some text is actually a persons name because it is very unusual.

➕ show 1 reply

alt Hacker News

Replies