My take away is: it's roughly as good as Opus 4.5.
Now the question is: how much faster or cheaper is it?
How can you determine whether it's as good as Opus 4.5 within minutes of release? The quantitative metrics don't seem to mean much anymore. Noticing qualitative differences seems like it would take dozens of conversations and perhaps days to weeks of use before you can reliably determine the model's quality.
Given that the price remains the same as Sonnet 4.5, this is the first time I've been tempted to lower my default model choice.
If it maintains the same price (with Anthropic tends to do or undercuts themselves) then this would be 1/3rd of the price of Opus.
Edit: Yep, same price. "Pricing remains the same as Sonnet 4.5, starting at $3/$15 per million tokens."
> That's a long document.
Probably written by LLMs, for LLMs
40% cheaper: https://platform.claude.com/docs/en/about-claude/pricing