Hacker News

jumploops, last Friday at 8:34 AM

Is that technically not a new pretrained model?

(Also not sure how that would work, but maybe I’ve missed a paper or two!)


Replies

redox99, last Friday at 9:16 AM

I'd say for it to be called a new pretrained model, it'd need to be trained from scratch (like Llama 1, 2, and 3).

But it's just semantics.