Check out my comparison too, it has some not-really-benchmarks too (between any two models actually, SVG generation test and CSS animation test):
https://aibenchy.com/compare/anthropic-claude-opus-4-8-mediu...