Hacker News

b65e8bee43c2ed0 yesterday at 9:20 PM

productivity (tokens per second per hardware unit) increases at the cost of output quality, but the price remains the same.

both Anthropic and OpenAI quantize their models a few weeks after release. they'd never admit it out loud, but it's more or less common knowledge now. no one has enough compute.


Replies

sthimons yesterday at 9:26 PM

Pretty bold claim - you have a source for that?

cebert yesterday at 9:23 PM

Do you have a source for that claim?
