Unremarkable base model will remain an unremarkable fine-tuned model that memorised a couple thousan...

macleginn • yesterday at 9:32 AM • 3 replies • view on HN

Unremarkable base model will remain an unremarkable fine-tuned model that memorised a couple thousand of input-output pairings.

Replies

ACCount37 • yesterday at 10:52 AM

Ha ha, as if.

Base models have a lot of capabilities - arranged in all the wrong ways for high performance reasoning and problem-solving. The power of fine tuning on "a couple thousand of input-output pairings" is that it can fix some of that. If your pairings are very well chosen, that is.

Laurel1234 • yesterday at 11:34 AM

If that were the case, Anthropic wouldn't be throwing a fit over distillation "attacks".

➕ show 1 reply

danw1979 • yesterday at 9:42 AM

Yes, neural networks are famously poor at generalising.

➕ show 1 reply

alt Hacker News

Replies