logoalt Hacker News

anon7725yesterday at 5:56 AM1 replyview on HN

If the smarts came from post-training, we could show significant gains by doing that post-training again for previous generations of models. But we know that isn’t happening - effective post training is necessary but not sufficient for model performance.


Replies

otabdeveloper4yesterday at 5:48 PM

> we could show significant gains by doing that post-training again for previous generations of models

That's what Chinese models are doing, and beating Opus et al.