logoalt Hacker News

lostmsuyesterday at 10:43 PM1 replyview on HN

The implementation details are irrelevant to the discussion of the true cost of running the models.


Replies

mzltoday at 6:04 AM

The cost of running things like prompt caching is defined by the implementation as that gives the infrastructure costs.