I ran an internal (oil and gas focused) benchmark yesterday and found Opus 4.7 was 50% cheaper than Opus 4.6, driven by significantly fewer output tokens for reasoning. It also scored 80% (vs. 60%).
That’s just adaptive reasoning, not related to the increased tokenizer costs.
That’s just adaptive reasoning, not related to the increased tokenizer costs.