jerojero • yesterday at 1:49 PM • 2 replies

Companies doing foundational models need to cover the cost of training, which is much more expensive than training something like Kimi.

Replies

wongarsu • yesterday at 1:53 PM

Yes. I would not consider Kimi a particularly good model relative to its size, and making a SotA model is a lot more expensive. But training costs are explicitly excluded when talking about the cost to serve tokens.

gruez • yesterday at 1:52 PM

>Companies doing foundational models need to cover the cost of training [...]

But that's moving the goalposts? The original claim was on inference itself, not the whole company.

> The cost to serve tokens is absolutely profitable today and that’s been true for at least a year.
