I have two A100s and have been playing with local models for years. There's definitely moments ...

root_axis • yesterday at 2:35 AM • 1 reply • view on HN

I have two A100s and have been playing with local models for years. There's definitely moments where they are quite impressive, but small context sizes and unreliability become immediately obvious.

> For those of us a bit crazy, we are running KimiK2.6, GLM5.1

Yes, those can compare to Opus, but you can't run those unquantized for less than $400k in hardware.

Replies

doctorpangloss • yesterday at 2:39 AM

Two Mac Studio M3 Ultra 512GB and 1 USB cable can run all those models - maybe about $30,000 in hardware - and based on my benchmarks, those Mac Studios were twice as fast as the A100s on Deepseek v4 Flash, which has a quantization but not really a lossy one.

➕ show 1 reply

alt Hacker News

Replies