Since as per Anthropics own benchmarks Sonnet 4.5 is beaten by Opus 4.5 would it not suffice to infer the rest?
https://x.com/OpenAI/status/1999182104362668275