logoalt Hacker News

hei-limayesterday at 7:35 PM7 repliesview on HN

We need another "Deepseek moment" or else it will become impossible for the regular dude to use AI. It will become something that only big companies can afford.


Replies

SwellJoeyesterday at 8:40 PM

We're having DeepSeek moments every couple of weeks.

Qwen 3.6 hit hard in the self-hosting space. It's incredibly capable for its size, really shaking up what's possible in 64GB or even 32GB of VRAM.

The Prism Bonsai ternary model crams a tremendous amount of capability into 1.75GB.

And, DeepSeek V4 is crazy good for the price. They're charging flash model prices for their top-tier Pro model, which is competitive with the frontier of a few months ago.

The winners in the AI war will be the companies that figure out how to run them efficiently, not the ones that eke out a couple percent better performance on a benchmark while spending ten times as much on inference (though the capability has to be there, I think we're seeing that capability alone isn't a strong moat...there's enough competent competition to insure there's always at least a few options even at the very frontier of capability).

show 2 replies
squidbeakyesterday at 7:49 PM

Deepseek had another moment a few weeks ago. V4 isn't far behind the US frontier, and so far its flash variant seems a very reliable coder and costs a pittance.

show 1 reply
xbmcuseryesterday at 8:31 PM

What we need is a deepseek moment in hardware ie China reaching parity on node size that is the only way latest computers let alone latest ai will be available to us in the future otherwise the profit margins will push most production to AI.

show 2 replies
segmondyyesterday at 7:43 PM

You can use lots of open weight models today.

show 2 replies
pianopatrickyesterday at 9:10 PM

Maybe we can figure out better ways to use the models that can run on cheap hardware.

GeorgeOldfieldyesterday at 7:53 PM

gemini isn't even that good. just tested 3.5 on usual complex prompts to opus/chat 5.5. meh

show 3 replies