I’ve been using 1,000 t/s on a near frontier model for a month now. It’s very useful for agentic coding.
It does require new approaches for me personally since I get a lot less time to think or read its output.
Which model and how can you achieve that speed, if you don't mind me asking?
Which model and how can you achieve that speed, if you don't mind me asking?