logoalt Hacker News

DonsDiscountGastoday at 6:01 PM1 replyview on HN

Seemed to work when it comes to selling ads. I'm thinking training LLMs is harder than anthropic and openai make it look


Replies

cyanydeeztoday at 6:15 PM

I'm guessing both openai and anthropic have transitioned to prompt magic and fine tuning rather than try to keep building LLMs at scale. The fact that QWEN and other models are impressive, small and perfectly suitable for most work means every dollar you're spending on trying to train larger models is a losing prop.