The implementation details are irrelevant to the discussion of the true cost of running the models.
The cost of running things like prompt caching is defined by the implementation as that gives the infrastructure costs.
The cost of running things like prompt caching is defined by the implementation as that gives the infrastructure costs.