DeepSeek V4, with its 1 million token context window, is pretty powerful, although it's still not quite at frontier level. There's hope that Opus 4.5-level performance running locally is not that far away.
Running DeepSeek V4 without extreme quantization locally requires a lot of hardware.
The IQ2 quants that fit into 128GB machines are very degraded.
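For a rough sense of why only the ~2-bit quants fit in 128GB, here is a back-of-the-envelope sketch. It only counts the weights (no KV cache, no activations, no runtime overhead), and the parameter count is a made-up placeholder, not DeepSeek V4's actual size. The bits-per-weight values are approximate too; plug in the real numbers for the model and quant you care about.

```python
def weight_memory_gb(num_params: float, bits_per_weight: float) -> float:
    """Approximate memory for the weights alone, in GB (10^9 bytes)."""
    return num_params * bits_per_weight / 8 / 1e9


def max_params_for_budget(budget_gb: float, bits_per_weight: float) -> float:
    """Largest parameter count whose weights alone fit in budget_gb."""
    return budget_gb * 1e9 * 8 / bits_per_weight


if __name__ == "__main__":
    # HYPOTHETICAL parameter count, for illustration only.
    hypothetical_params = 400e9

    # Approximate bits per weight for a few precision levels (assumed values).
    for label, bpw in [("~2.5-bit", 2.5), ("~4.5-bit", 4.5), ("8-bit", 8.0), ("16-bit", 16.0)]:
        gb = weight_memory_gb(hypothetical_params, bpw)
        print(f"{label:>9}: {gb:7.1f} GB for weights alone")

    # How many parameters fit in a 128 GB budget at ~2.5 bits per weight?
    print(f"{max_params_for_budget(128, 2.5) / 1e9:.1f}B params fit in 128 GB at 2.5 bpw")
```

In practice the usable budget is lower than the machine's total memory, since the OS, the KV cache (which grows with context length), and runtime buffers all need room as well.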
From what I've read, DeepSeek V4 is very close to Opus 4.6 in performance.