logoalt Hacker News

karel-3dtoday at 8:59 AM1 replyview on HN

Can I... somehow run this locally? DeepSeek is opensource? Do I even need their API key?

(I have no experience with running anything locally, maybe it's a stupid question)


Replies

zozbot234today at 9:17 AM

Waiting for official support in llama.cpp. There is a fork that can run a lightly quantized (Q2 expert layers) DeepSeek V4 Flash in 128GB RAM without offloading weight fetches from disk.

show 1 reply