We need another "Deepseek moment" or else it will become impossible for the regular dude t...

hei-lima • yesterday at 7:35 PM • 7 replies • view on HN

We need another "Deepseek moment" or else it will become impossible for the regular dude to use AI. It will become something that only big companies can afford.

Replies

SwellJoe • yesterday at 8:40 PM

We're having DeepSeek moments every couple of weeks.

Qwen 3.6 hit hard in the self-hosting space. It's incredibly capable for its size, really shaking up what's possible in 64GB or even 32GB of VRAM.

The Prism Bonsai ternary model crams a tremendous amount of capability into 1.75GB.

And, DeepSeek V4 is crazy good for the price. They're charging flash model prices for their top-tier Pro model, which is competitive with the frontier of a few months ago.

The winners in the AI war will be the companies that figure out how to run them efficiently, not the ones that eke out a couple percent better performance on a benchmark while spending ten times as much on inference (though the capability has to be there, I think we're seeing that capability alone isn't a strong moat...there's enough competent competition to insure there's always at least a few options even at the very frontier of capability).

➕ show 2 replies

squidbeak • yesterday at 7:49 PM

Deepseek had another moment a few weeks ago. V4 isn't far behind the US frontier, and so far its flash variant seems a very reliable coder and costs a pittance.

➕ show 1 reply

xbmcuser • yesterday at 8:31 PM

What we need is a deepseek moment in hardware ie China reaching parity on node size that is the only way latest computers let alone latest ai will be available to us in the future otherwise the profit margins will push most production to AI.

➕ show 2 replies

stared • yesterday at 11:12 PM

We have a "DeepSeek moment", https://github.com/antirez/ds4 (see https://news.ycombinator.com/item?id=48142108).

Or if you prefer smaller ones, Qwen3.6-35B-A3B, https://huggingface.co/bartowski/Qwen_Qwen3.6-35B-A3B-GGUF

segmondy • yesterday at 7:43 PM

You can use lots of open weight models today.

➕ show 2 replies

pianopatrick • yesterday at 9:10 PM

Maybe we can figure out better ways to use the models that can run on cheap hardware.

GeorgeOldfield • yesterday at 7:53 PM

gemini isn't even that good. just tested 3.5 on usual complex prompts to opus/chat 5.5. meh

➕ show 3 replies

alt Hacker News

Replies