Given that providers of open source models can offer Kimi K2.5 at input $0.60 and output $2.50 per m...

not_math • last Saturday at 11:21 PM • 1 reply • view on HN

Given that providers of open source models can offer Kimi K2.5 at input $0.60 and output $2.50 per million tokens, I think the cost of inference must be around that. We would still need to compare the tokens per second.

Replies

Computer0 • yesterday at 5:13 AM

Fair but we technically do not know the parameter count

alt Hacker News

Replies