> "run a model like Gemma 4 31b, which is almost anthropic sonnet levels of performance"...

jwr • yesterday at 3:42 PM • 2 replies • view on HN

> "run a model like Gemma 4 31b, which is almost anthropic sonnet levels of performance"

I wish people stopped deluding themselves — I regularly try (and benchmark for my purposes) local models and they are NOWHERE near the huge models like Sonnet or Opus. Nowhere. Yes, you can sometimes get plausibly-looking output for simple tasks, but for anything even remotely requiring thinking there is simply no comparison.

Local models are useful. I use them for spam filtering, and soon intend to use them for image tagging and OCR. But let's stop saying they can get us "anthropic sonnet levels of performance", because that's just not true.

Replies

g-technology • yesterday at 10:05 PM

It all depends on use case. A local fine tuned model on a very specific use case can definitely out perform a much bigger cloud model that doesn’t have the training on your use case. But, that requires looking at the ai models as a means to end and not a Swiss Army knife that can do it all.

alt Hacker News

Replies