
culi, yesterday at 8:19 PM

Yes, but with a significant (log-scale) increase in cost per task. The ARC-AGI site is less misleading and shows that GPT and Claude are not actually far behind.

https://arcprize.org/leaderboard