logoalt Hacker News

oa335yesterday at 10:21 PM2 repliesview on HN

> it costs >1000% to run inference

do you have a source for this claim? i thought LLM providers earn high margins from inference (charged by token). is this no longer the case?


Replies

vlian2088yesterday at 10:56 PM

if a $6000000 cabinet can generate 10000/s tokens of Opus but only 1000/s tokens of Mythos, then Mythos costs 1000% to run no matter the markup.

no one has a source, because no one knows closed model parameter counts. we have only heuristics which strongly indicate that Mythos is simply a big fucking model that any other lab could make an equivalent of.

3836293648yesterday at 10:41 PM

This was just theorised. The leaked OpenAI financials suggest otherwise (because of shady naming of losses)

The only ones who seem to profit are the ones running smaller Chinese models. Even NVIDIA seems to have to "reinvest" their profits into sponsoring companies to buy their cards now.