pretraining isn't unsupervised, it is self-supervised - meaning it is moderately more scale lim...

jmalicki • today at 12:06 PM • 2 replies • view on HN

pretraining isn't unsupervised, it is self-supervised - meaning it is moderately more scale limited.

Replies

What would unsupervised mean, would unsupervised be something like alphago playing against itself trillions of times?

Whereas self-supervised, allows learning without explicit annotation of data ; but it doesn't matter if the models already trained on the entire Internet, and it's not like a game where it can come up with effectively new training data for itself?

➕ show 1 reply

cold_harbor • today at 2:10 PM

fair point — OpenAI's original plan literally said "solve unsupervised learning". the self-supervised distinction wasnt really standard til after BERT/GPT popularized it

➕ show 1 reply

alt Hacker News

Replies