logoalt Hacker News

harlequinetcietoday at 2:11 PM1 replyview on HN

Whenever someone figures out why it's consuming so many tokens lately, that's the post worth upvoting.


Replies

solidasparagustoday at 6:08 PM

What do you mean? Costs spiked with the introduction of the 1M context window I believe due to larger average cached input tokens, which dominate cost.

show 1 reply