logoalt Hacker News

semiquaveryesterday at 6:38 PM4 repliesview on HN

The frontier labs distill their own base models all day long. It’s not just something done by nefarious Chinese copycats. The knowledge embodied by the internal base models that we never see is much more powerful and useful than the much sparser raw training data


Replies

coldteayesterday at 7:14 PM

>It’s not just something done by nefarious Chinese copycats

And even that would be rich as a accusation from SOTAs that depend on explicitly disregarding millions of training data intellectual property..

flosslyyesterday at 11:57 PM

> nefarious Chinese copycats

LLMs are themselves copy cats.

I say thanks for open sourcing and thereby promoting affordable innovation, instead of "nefarious". :)

manmalyesterday at 7:17 PM

But how? The training data is the unadulterated content those models are based on? I genuinely don’t understand, no snark.

show 1 reply
supern0vayesterday at 6:46 PM

I think you replied to the wrong parent.