logoalt Hacker News

louiereedersonyesterday at 6:32 PM1 replyview on HN

Chip costs strongly impact the economics of model serving.

It is entirely plausible to me that Opus 4.7 is designed to consume more tokens in order to artificially reduce the API cost/token, thereby obscuring the true operating cost of the model.

I agree though, I chose poor phrasing originally. Better to say that GB200 vs Tranium could contribute to the efficiency differential.


Replies

itemize123today at 3:17 AM

probably the wrong take - they are arm racing to a better model. it's not enshittification era for models just yet

show 1 reply