Hacker News

alach11 yesterday at 9:15 PM

I ran an internal (oil-and-gas-focused) benchmark yesterday and found Opus 4.7 was 50% cheaper to run than Opus 4.6, driven by significantly fewer output tokens spent on reasoning. It also scored 80% (vs. 60% for Opus 4.6).


Replies

stingraycharles today at 12:23 AM

That’s just adaptive reasoning, not related to the increased tokenizer costs.
