What's time to first token? Raw throughput is usually not the problem in local setups in my experience.