logoalt Hacker News

BoorishBearsyesterday at 6:20 PM0 repliesview on HN

https://aibenchy.com/compare/anthropic-claude-opus-4-6-mediu...

That's not even the tip of the iceberg in how useless their benchmark is.