Hacker News

byzantinegene, today at 9:44 AM

we're already doing that. it's called distillation, and it's how models like DeepSeek are trained.
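For anyone unfamiliar with the term, here is a rough sketch of the classic soft-target form of distillation, where a small "student" model is trained to match a larger "teacher" model's output distribution. This is only an illustration: the function names, the example logits, and the temperature value are made up for this sketch, and nothing here is claimed to be DeepSeek's actual pipeline. LLM distillation is also often done by simply fine-tuning the student on text generated by the teacher, which needs no special loss at all.

```python
import math


def softmax(logits, temperature=1.0):
    """Turn raw logits into probabilities, softened by a temperature."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL(teacher || student) on temperature-softened distributions.

    Hypothetical helper for illustration. The T^2 factor is the usual
    scaling that keeps gradient magnitudes comparable across temperatures.
    """
    p = softmax(teacher_logits, temperature)  # teacher soft targets
    q = softmax(student_logits, temperature)  # student predictions
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    return kl * temperature ** 2


# Made-up logits for a 3-class toy problem.
teacher = [4.0, 1.0, 0.5]
student_far = [0.5, 1.0, 4.0]   # disagrees with the teacher
student_near = [3.8, 1.1, 0.6]  # close to the teacher

loss_far = distillation_loss(teacher, student_far)
loss_near = distillation_loss(teacher, student_near)
```

In real training the student's weights are updated to push this loss down, usually mixed with an ordinary cross-entropy loss on the true labels.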