zozbot234 • yesterday at 5:18 PM

> But also, the latest DeepSeek is 1.6T parameters. “Choosing” to run this locally is a choice that comes with a seven digit price tag

Unless you're specifically thinking about running the model at stock precision in a datacenter environment and generating ~100 tok/s or more on a 24/7 basis (the equivalent of a >$1000/mo spend even on the cheapest third-party APIs), that's very likely off by multiple orders of magnitude. Even then, experimentation can be done with cheap neoclouds on a pay-as-you-go basis.

Replies

kube-system • yesterday at 5:21 PM

I’m aware. The context of the discussion here is choosing DeepSeek over a US-hosted model from Google, Anthropic, or OpenAI.

The equivalent comparison would be running it at full frontier quality.

If you want less than frontier quality, there are tons of great open-weight models other than DeepSeek.

> cheap neoclouds

Again, fails the compliance checkbox.
