Seemed to work when it comes to selling ads. I'm thinking training LLMs is harder than anthropi...

DonsDiscountGas • today at 6:01 PM • 1 reply • view on HN

Seemed to work when it comes to selling ads. I'm thinking training LLMs is harder than anthropic and openai make it look

Replies

cyanydeez • today at 6:15 PM

I'm guessing both openai and anthropic have transitioned to prompt magic and fine tuning rather than try to keep building LLMs at scale. The fact that QWEN and other models are impressive, small and perfectly suitable for most work means every dollar you're spending on trying to train larger models is a losing prop.

alt Hacker News

Replies