Hacker News

rdos yesterday at 4:11 PM

Is it possible for such a small model to outperform Gemini 3, or is this a case of benchmarks not reflecting reality? I would love to be hopeful, but so far no open-source model has been better than a closed one, even when benchmarks said it was.


Replies

amluto yesterday at 4:15 PM

Off the top of my head: for a lot of OCR tasks, it’s kind of worse for the model to be smart. I don’t want my OCR to make stuff up or answer questions; I want it to recognize what is actually on the page.

woeirua yesterday at 5:24 PM

No. Gemini is clearly the leader across the board: https://www.ocrarena.ai/leaderboard