Hacker News

renticulous today at 10:03 AM | 2 replies

How can something so obvious be overlooked by the team building the data centre? Can't the sharding be uneven, so that weaker GPUs still finish fast by taking on a smaller workload?
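
To make the question concrete, here is a rough sketch of what uneven sharding could look like: split a batch in proportion to each GPU's measured throughput so that all workers finish at about the same time. The function names and the throughput numbers are hypothetical, and this ignores memory limits, interconnect cost, and the gradient reweighting that unequal per-worker batches would need in synchronous training.

```python
def shard_sizes(total_items, throughputs):
    """Split total_items across workers in proportion to their throughput.

    throughputs: items per second for each worker (hypothetical measurements).
    Uses largest-remainder rounding so the sizes always sum to total_items.
    """
    total_speed = sum(throughputs)
    # Ideal (fractional) share for each worker.
    ideal = [total_items * t / total_speed for t in throughputs]
    sizes = [int(x) for x in ideal]
    # Hand leftover items to the workers with the largest fractional parts.
    leftover = total_items - sum(sizes)
    by_remainder = sorted(
        range(len(throughputs)),
        key=lambda i: ideal[i] - sizes[i],
        reverse=True,
    )
    for i in by_remainder[:leftover]:
        sizes[i] += 1
    return sizes


def step_time(sizes, throughputs):
    """A synchronous step finishes when the slowest worker finishes."""
    return max(s / t for s, t in zip(sizes, throughputs))


if __name__ == "__main__":
    speeds = [100, 100, 50]  # assumed: two fast GPUs, one at half speed
    uneven = shard_sizes(1000, speeds)
    print(uneven, step_time(uneven, speeds))
```

Under these assumed numbers the slow GPU gets half as many items as the fast ones, so no worker sits idle waiting for it. Whether this works in practice depends on things the sketch leaves out.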


Replies

Maken today at 11:15 AM

It's not like they had much of an option when everybody was hoarding every GPU they could. For the second Colossus they could book future production, but the first one had to be built ASAP so that xAI would look like a serious competitor in the AI space.

sjsdaiuasgdia today at 12:04 PM

I imagine it involved a petulant billionaire screaming "Fucking build it. Build it NOW!" in response to expert feedback.