logoalt Hacker News

zozbot234yesterday at 9:21 PM0 repliesview on HN

Ultimately the most sensible way of handling this is you end up with "surge pricing" for the highest-priority tokens whenever the inference platform is congested, over and above the base subscription (but perhaps ultimately making the subscription a bit cheaper).