what's wild is they accidentally solved it — pretraining IS unsupervised learning at scale, RLH...

cold_harbor • today at 11:17 AM • 1 reply • view on HN

what's wild is they accidentally solved it — pretraining IS unsupervised learning at scale, RLHF IS reinforcement learning. they just didnt know the recipe yet

Replies

jmalicki • today at 12:06 PM

pretraining isn't unsupervised, it is self-supervised - meaning it is moderately more scale limited.

➕ show 2 replies

alt Hacker News

Replies