
lrvick, yesterday at 10:50 AM

The Qwen models not only have good OCR, they will also describe pictures to you.


Replies

mapt, yesterday at 12:16 PM

Anyone wanna do a quick offline MVP of a general vision assistant for the blind? We've had things like Google Lens for a while, but it's a bit vision- and touchscreen-centric.