Hacker News

RugnirViking today at 10:25 AM

What do you use vision for? I have failed to find a workflow with it that makes sense. When I ask it to review screenshots of websites or whatever, it misses extremely obvious details like text flowing out of its container or overlapping other text, things being in entirely the wrong place, etc.


Replies

bckr today at 2:04 PM

What models have you tried? Gemini 3.1 Pro has vision capable of reading my sloppy diaries from 10 years ago, down to small glyphs and doodles.
