logoalt Hacker News

twobitshiftertoday at 11:10 AM0 repliesview on HN

Right, just give the text llm access to a vision specific agent and that problem can be solved. Or if you really want let it even call Opus with an image - seems like you’d still save money