It was easy to see before this started that the models were just padding their output with repetition.
It’s not so much that they should “scale back” their use as that the metric should be improvement per token. Tokens used belongs in the denominator of any calculation of worth.
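
To make the ratio concrete, here is a rough sketch of what I mean. The function name, the idea of scoring against a baseline, and all the numbers are my own assumptions for illustration, not measurements of any real model.

```python
def improvement_per_token(baseline_score: float, new_score: float, tokens_used: int) -> float:
    """Improvement over a baseline divided by tokens spent (hypothetical metric)."""
    if tokens_used <= 0:
        raise ValueError("tokens_used must be positive")
    return (new_score - baseline_score) / tokens_used


# Made-up numbers purely for illustration.
terse = improvement_per_token(baseline_score=0.50, new_score=0.60, tokens_used=200)
padded = improvement_per_token(baseline_score=0.50, new_score=0.62, tokens_used=2000)

# The padded run scores slightly higher in absolute terms,
# but is worth far less once tokens are in the denominator.
print(terse > padded)
```

On this view a run that spends ten times the tokens for a marginal gain looks worse, not better, which is the whole point of putting tokens in the denominator.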