logoalt Hacker News

recitedropperyesterday at 7:18 PM1 replyview on HN

Streaming, caching, and tool calling can get pretty expensive with scale, even when you don't touch inference. Maybe they're doing something clever and are quite profitable.. or maybe they've already taken $40mm from VCs and are currently trying to raise $120mm at a 1.3B evaluation.

They also show headline prices for the cheapest provider of whatever model, but then need to hit different backends some of which may be more expensive. For now they absorb those costs, but the VCs always come knocking.

Just my opinion though. Totally agreed that they have one of the best positions amongst all AI providers from a financial standpoint.


Replies

vanviegenyesterday at 8:04 PM

> They also show headline prices for the cheapest provider of whatever model, but then need to hit different backends some of which may be more expensive. For now they absorb those costs, [..]

They do?? I was under the impression I was just playing the price for whatever provider they deemed 'best' for each completion.

show 1 reply