This logic works only if distilling Claude is the only way to create another SOTA LLM, which is not the case.
How do you think the Qwen and MiniMax models perform so similarly to Anthropic frontier models? What is your take then?
How do you think the Qwen and MiniMax models perform so similarly to Anthropic frontier models? What is your take then?