logoalt Hacker News

bugglebeetleyesterday at 5:44 AM1 replyview on HN

Anthropic serves quantized versions of their models and you can run q8 locally.


Replies

nicceyesterday at 7:43 AM

I don't even use Sonnet anymore. Current feels worse than Claude 3.5 couple years ago. They have quantized that much? Switched to GPT 5.5, let's see how long it will stay good.

show 1 reply