logoalt Hacker News

rao-vyesterday at 11:08 PM0 repliesview on HN

Yes absolutely! I should have been more specific - I don’t believe people are using it to train 30B models from 300B models (and I’d love to learn that I’m off about this)