logoalt Hacker News

case540yesterday at 4:43 PM0 repliesview on HN

I assume it’s time to first output token so it’s basically throughput. How fast can it output 8001 tokens