Hacker News

gmueckl, yesterday at 9:34 PM

This is obviously wrong. There is a bunch of knowledge embedded in those weights, and some of it can be recalled verbatim. So, by virtue of this recall alone, training is a form of lossy data compression.