logoalt Hacker News

macleginnyesterday at 9:32 AM3 repliesview on HN

Unremarkable base model will remain an unremarkable fine-tuned model that memorised a couple thousand of input-output pairings.


Replies

ACCount37yesterday at 10:52 AM

Ha ha, as if.

Base models have a lot of capabilities - arranged in all the wrong ways for high performance reasoning and problem-solving. The power of fine tuning on "a couple thousand of input-output pairings" is that it can fix some of that. If your pairings are very well chosen, that is.

Laurel1234yesterday at 11:34 AM

If that were the case, Anthropic wouldn't be throwing a fit over distillation "attacks".

show 1 reply
danw1979yesterday at 9:42 AM

Yes, neural networks are famously poor at generalising.

show 1 reply