Hacker News

phplovesong yesterday at 6:06 PM (4 replies)

How much power did it take to train the models?


Replies

freeqaz yesterday at 6:16 PM

I would honestly guess that this is just a small amount of tweaking on top of the Sonnet 4.x models. It seems like providers are rarely training new 'base' models anymore. We're at a point where the gains come more from modifying the model's architecture and doing post-training refinement. That's what we've been seeing for the past 12-18 months, iirc.

brutalc yesterday at 6:18 PM

[dead]

neural_thing yesterday at 6:18 PM

Does it matter? How much power does it take to run Duolingo? How much power did it take to manufacture 300,000 Teslas? Everything takes power.
