logoalt Hacker News

throwa356262today at 6:57 AM2 repliesview on HN

1024 Huawei Ascend superpods = 50K 910C chips.

That is a tiny tiny system. OpenAI uses _milions_ of GPUs for training

On the other hand, this probably reuses the existing deepseek v4 architecture and weights. Maybe didn't need that much compute.


Replies

mrngldtoday at 1:24 PM

I'm sure it also takes more compute effort to be at the frontier, rather than being able to distill and poach ideas from the frontier. No mistake that it's the same handful of labs taking turns at or near the frontier.

show 1 reply