logoalt Hacker News

losvediryesterday at 12:05 PM0 repliesview on HN

Er, then what is the "already trained" model? I thought pre-training was the gradient descent through the internet part of building foundational models.