logoalt Hacker News

not_mathlast Saturday at 11:21 PM1 replyview on HN

Given that providers of open source models can offer Kimi K2.5 at input $0.60 and output $2.50 per million tokens, I think the cost of inference must be around that. We would still need to compare the tokens per second.


Replies

Computer0yesterday at 5:13 AM

Fair but we technically do not know the parameter count