Hacker News

FergusArgyll, 12/11/2025

It's very much a vision test. The only reason all the models don't pass it easily is the vision component. It doesn't have much to do with reasoning at all.