logoalt Hacker News

drakytheyesterday at 1:52 PM5 repliesview on HN

Anthropic also recently tweaked their usage limits to discourage use during peak hours. Why would they do that if inference was profitable?


Replies

infectoyesterday at 1:54 PM

Don’t confuse inference (api usage) with the consumer plan products. When people say inference is profitable they are referring to the cost to serve a token via the API. The consumer products are absolutely a question mark on profitability and as we see with most of the business and enterprise plans, going away for pure on demand use (api cost) full time.

strangegeckoyesterday at 2:42 PM

Profitability doesn't imply infinite ability to scale. Of course they will want to prioritize their most profitable customers when they hit capacity issues.

aurareturnyesterday at 7:35 PM

They do it because their demand is higher than the compute that they have available to them. Their GPUs must be melting during peak hours so they're encouraging people who move their workload to off peak hours if possible.

This is the opposite of an AI bubble burst.

paulddraperyesterday at 6:40 PM

Those are subscription plans. They tweaked the limits/periods included in the subscription. Having higher limits for subscription plans didn't give them any more revenue.

financltravstyyesterday at 5:04 PM

Their infra team is very understaffed and they are reacting to the public backlash of "no 9s?"