logoalt Hacker News

zozbot234yesterday at 3:19 PM0 repliesview on HN

The whole notion of 'distillation' at a distance is extremely iffy anyway. You're just training on LLM chat logs, but that's nowhere near enough to even loosely copy or replicate the actual model. You need the weights for that.