Hacker News

nunodonato today at 8:26 AM, 3 replies

it would be a really great option if it didn't lack vision


Replies

pizzafeelsright today at 5:14 PM

This is a job for MCP or a custom call to the lowest-cost model.

Someone built an agentic setup with a webcam: capture another computer's BIOS/boot screen -> upload to an image model -> pass the result back to the agent.
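
A rough sketch of that capture -> image model -> agent loop, not anyone's actual implementation. Everything named here (`VisionCall`, `image_to_text`, `agent_step`, `fake_vision_call`) is hypothetical; the image model is passed in as a plain function so no particular API or MCP server is assumed.

```python
import base64
from typing import Callable

# Hypothetical interface: takes a base64-encoded image and a prompt,
# returns the image model's text answer.
VisionCall = Callable[[str, str], str]


def image_to_text(image_bytes: bytes, prompt: str, vision_call: VisionCall) -> str:
    """Send one captured frame to an image-capable model and return its text."""
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return vision_call(encoded, prompt)


def agent_step(image_bytes: bytes, vision_call: VisionCall) -> str:
    """Build the plain-text message handed back to the text-only agent."""
    description = image_to_text(
        image_bytes,
        "Transcribe all text visible on this screen.",
        vision_call,
    )
    return f"Screen capture result:\n{description}"


def fake_vision_call(encoded_image: str, prompt: str) -> str:
    # Stand-in so the sketch runs without a network call.
    return f"(stub) received {len(encoded_image)} base64 chars"


if __name__ == "__main__":
    print(agent_step(b"abc", fake_vision_call))
```

Swapping `fake_vision_call` for an MCP tool or a direct call to a cheap image model would be the only part that changes; the agent itself only ever sees text.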

RugnirViking today at 10:25 AM

What do you use vision for? I have failed to find a workflow with it that makes sense. When I ask it to review screenshots of websites or whatever, it misses extremely obvious details like text flowing out of its container or overlapping other text, things being in entirely the wrong place, etc.

cromka today at 9:34 AM

For coding?