Chance-Device • today at 1:01 PM • 4 replies

From the beginning of this I’ve wondered the same thing: how do these companies justify spending such massive amounts now (and 3 or 4 years ago) when software and hardware efficiencies will bring down the cost dramatically fairly soon?

They basically decided that scaling at any cost was the way to go. That only works as a strategy if efficiency gains are impossible, not if you simply haven’t tried for them. Otherwise, after a few breakthroughs and order-of-magnitude improvements, people are running equivalent models on their desktops, then their laptops, then their phones.

Arguably the costs involved mean that our existing hardware and software are simply non-viable for what they were and are trying to do, and a few iterations later the money will simply have been wasted. That is, if you consider funnelling everything to Nvidia shareholders wasting it, which I do.

Replies

Aperocky • today at 1:24 PM

The decision is the right one. Scaling at any cost is the right way to go.

You cannot find the efficiency if you haven't been experimenting at scale. This is true personally as well.

If someone hasn't been burning a few billion tokens per month, everything coming out of their mouth about AI is largely theory. It could be right or wrong, but they don't have the practice to validate what they're talking about.

Not everyone who scales to that degree will have the right answer or outcome; many will be wrong and go bust. But no one who didn't scale will have the right answer.

ap99 • today at 1:11 PM

They're not just betting on the current tech. They're building out infra like this because any future tech currently being researched will probably also require massive data centers.

It's like how the GPT LLMs were kind of a side project at OpenAI until someone showed how powerful they could be if you threw a lot more parameters at them.

There could be some other architecture in the works that makes GPTs look old. The first to build and train that new AI will be the winner.

phito • today at 1:09 PM

I think their current goal is to capture as much of the market as they can while they still have the best models, which are their only moat. Look at Anthropic: they are clearly trying to lock their users into their ecosystem by refusing to follow conventions (AGENTS.md etc.) and restricting their tools exclusively to their own services.

mrob • today at 1:08 PM

Because whoever wins the AI race (assuming they don't overshoot and trigger the hard takeoff scenario) becomes a living god. Everybody else becomes their slave, to be killed or exploited as they please. It's a risky gamble, but in the eyes of the participants the upside justifies it. If they don't go all in they're still exposed to all the downside risk but have no chance of winning.

I don't expect hardware prices to go down unless the third option (economic collapse) happens before somebody triggers the dystopia/extinction option.
