logoalt Hacker News

_3u10today at 4:38 PM0 repliesview on HN

No, they need the same arch, but you can distill them into a single model. And yes, if you use the API directly Claude will often say it’s an open weight model (likely the ones it was distilled from)