logoalt Hacker News

vinzenzutoday at 4:10 PM0 repliesview on HN

Leaps and bounds better! I don't think I benchmarked it.

But the experience was that it was able to find small details in PDFs, in technical diagrams, and this was really not captured well at all with OCR.

In general, OCR I think should be used more as an add-on to retrieve data, not given to the generation model itself. Similar to retrieving based off a text description and then giving the generation model the image.