https://aibenchy.com/compare/anthropic-claude-opus-4-6-mediu...
That's not even the tip of the iceberg in how useless their benchmark is.