logoalt Hacker News

fallpeakyesterday at 4:19 PM0 repliesview on HN

On OpenRouter there are 11 third-party inference providers hosting DeepSeek V4 Pro right now, of which 8 are US-based and 7 of those have zero data-retention policies (which I mention to rule out any claims of "oh they're making up the money by logging all your data"). This is a 1.6T-A49B model, so a bit bigger than Sonnet (~2/3rds the size) and a bit smaller than Opus (~3x as large). These third parties are almost perfectly interchangeable via OpenRouter as a marketplace, so they have no incentive to offer any sort of "growth pricing" below costs, and they serve it at $3.48/Mtok out.

Kimi K2.6 is 1T-A32B with a slightly less computationally efficient architecture, and is served at around $3.50/Mtok out by 9 US ZDR providers.

Unless you think that either the generally accepted size estimates for Anthropic/OpenAI models are wildly off or those companies are a lot worse at serving models efficiently, Anthropic and OpenAI are probably making around 5-8x margins on their API costs.

The cost of training new models is of course a major factor not counted here. Depending on how you want to think about that this may or may not make them net profitable. I remember one of those CEOs gave an interview a while back where they described it as a series of independent investments, where each model they train is net-positive in revenue by EOL just from its own inference, but I don't know whether that's still true or not.

Regardless, the point is that if they stopped training new models today, both Anthropic and OpenAI are making incredibly generous profits on their API inference.