logoalt Hacker News

DeathArrowtoday at 4:21 AM2 repliesview on HN

>The blog post implies that it currently requires 96GB of VRAM.

From the Github page it seems it only supports Apple and DGX Spark. I have 128 GB of RAM and a 3090 but it probably won't work.


Replies

thomasm6m6today at 5:11 AM

FYI, llama.cpp (which antirez/ds4 is inspired by) supports system ram. E.g. [1] is a good guide for running a similar-sized model with 128gb ram and a 3090-sized GPU.

[1] https://unsloth.ai/docs/models/tutorials/minimax-m27

(Unsloth's deepseek-v4 support is still WIP)

show 1 reply
manmaltoday at 5:59 AM

It wouldn’t be useful with your setup, probably 3-4 token per second.

show 1 reply