logoalt Hacker News

nullcyesterday at 5:31 PM1 replyview on HN

In the case of the closed models too... Claude would happily tell you it was deepseek-v3 if you asked in chinese until it caught public attention and they papered over it.


Replies

dragonwriteryesterday at 11:41 PM

The word “openly” in my post there for a reason; the commercial models are not openly distilled from competitors: many open source models have in their model documentation that distillation was done from a dataset drawn from specific other models, including commercial models.

That distillation might be inferred from the behavior of commercial models is not the same as them openly doing it.

show 1 reply