logoalt Hacker News

jerojeroyesterday at 1:49 PM2 repliesview on HN

Companies doing foundational models need to cover the cost of training which is much more expensive than training something like kimi.


Replies

wongarsuyesterday at 1:53 PM

Yes. I would not consider Kimi a particularly good model relative to its size, and making a SotA model is a lot more expensive. But training costs are explicitly excluded when talking about the cost to serve tokens

gruezyesterday at 1:52 PM

>Companies doing foundational models need to cover the cost of training [...]

But that's moving the goalposts? The original claim was on inference itself, not the whole company.

> The cost to serve tokens is absolutely profitable today and that’s been true for at least a year.

show 1 reply