There's this[1]. Model providers have a strong incentive to switch (a part of) their inference ...

woadwarrior01 • yesterday at 11:23 AM • 1 reply • view on HN

There's this[1]. Model providers have a strong incentive to switch (a part of) their inference fleet to quantized models during peak loads. From a systems perspective, it's just another lever. Better to have slightly nerfed models than complete downtime.

[1]: https://marginlab.ai/trackers/claude-code/

Replies

nl • yesterday at 11:31 AM

So - as the charts say - no statistical difference?

Isn't this link am argument against the point you are making?

➕ show 1 reply

alt Hacker News

Replies