Without any insider knowledge of the economics of these companies, I suspect the reason is that the amount of infrastructure you have to build is determined by peak usage rather than average usage. If peak usage is much higher for a small part of one day a week (say, Monday morning, as software developers across the US get back to work), the cost of fulfilling demand at all times can be insane. That's why companies are implementing batch/standard/priority pricing for the API.
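To put rough numbers on the peak-vs-average point, here is a toy sketch. Every figure in it (a baseline of 100 GPUs, a 3-hour Monday-morning spike to 400) is made up for illustration and not taken from any provider's real data.

```python
# Toy model: capacity must cover the peak hour, but most hours run near baseline.
# All numbers below are hypothetical, chosen only to illustrate the effect.
HOURS_PER_WEEK = 7 * 24

def weekly_demand(baseline=100, peak=400, peak_hours=range(9, 12)):
    """Hourly GPU demand for one week; hour 0 is Monday 00:00."""
    demand = [baseline] * HOURS_PER_WEEK
    for hour in peak_hours:  # assumed Monday 09:00-12:00 spike
        demand[hour] = peak
    return demand

demand = weekly_demand()
capacity = max(demand)                 # what you have to build to never drop requests
average = sum(demand) / len(demand)    # what you actually use on average
utilization = average / capacity       # fraction of paid-for hardware doing work

print(f"capacity={capacity}, average={average:.1f}, utilization={utilization:.0%}")
```

In this made-up week you pay for 400 GPUs while using about 105 on average, so roughly three quarters of the fleet sits idle. Anything that moves work out of the spike (e.g. a cheaper batch tier) lowers the capacity you need without changing total usage.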
Check out this article from today [0].
It sounds like it's more about profit maximization (and not just demand), with GPU rental prices up 48% since Feb.
> Renting one of Nvidia’s most-advanced Blackwell generation of chips for one hour costs $4.08, up 48% from the $2.75 it cost two months ago, according to the Ornn Compute Price Index.
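Quick sanity check on the quoted figure, using only the two prices from the quote (the variable names are mine):

```python
# Prices per GPU-hour as quoted above.
old_price = 2.75
new_price = 4.08

# Percentage increase relative to the old price.
pct_increase = (new_price - old_price) / old_price * 100
print(f"{pct_increase:.1f}% increase")
```

That comes out to a bit over 48%, which matches the quote.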
[0] https://www.wsj.com/tech/ai/ai-is-using-so-much-energy-that-...