logoalt Hacker News

selcukatoday at 1:04 AM5 repliesview on HN

It's a race to the bottom. DeepSeek beats all others (single-shot), and it is ~50% cheaper than the cost of local electricity only.

> DeepSeek V3.2 Reasoning 86.2% ~$0.002 API, single-shot

> ATLAS V3 (pass@1-v(k=3)) 74.6% ~$0.004 Local electricity only, best-of-3 + repair pipeline


Replies

strangescripttoday at 1:47 PM

I will "suffer" through .004 of electricity if I can run it on my own computer

sourcecodeplztoday at 4:04 AM

I've tested many open models, Deepseek 3.2 is the only SOTA similar.

show 1 reply
yogthostoday at 2:01 AM

You could use this approach with DeepSeek as well. The innovation here is that you can generate a bunch of solutions, use a small model to pick promising candidates and then test them. Then you feed errors back to the generator model and iterate. In a way, it's sort of like a genetic algorithm that converges on a solution.

show 1 reply
alifeinbinarytoday at 4:01 PM

All those parameters and it still won't answer questions about Tianenman Square in 1989... :(

show 1 reply
mikestorrenttoday at 1:15 AM

> cheaper than the cost of local electricity only.

Can you explain what that means?

show 3 replies