> Batching multiple users up thus increases overall throughput at the cost of making users wait for the batch to be full.
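The writer seems not to have heard of continuous batching. This is no longer an issue: the server admits new requests into the running batch at each decoding step, as soon as a slot frees up, so nobody waits for a batch to fill. This is what makes Claude Code that affordable. https://huggingface.co/blog/continuous_batching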
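
For anyone who wants to see the difference, here is a toy sketch in Python. It is not how any real inference server is implemented; the function names, the `(arrival_step, tokens_to_generate)` request format, and the assumption that a partial final batch gets flushed are all made up for illustration. Each step stands for one decoding iteration that produces one token per active request.

```python
from collections import deque

# Hypothetical request format: (arrival_step, tokens_to_generate).
Request = tuple[int, int]


def static_batching(requests: list[Request], batch_size: int):
    """Wait-for-a-full-batch scheduling (toy model).

    A batch starts once its last member has arrived and the previous batch
    is done, and it holds every slot until its longest member finishes.
    Assumption: a partial final batch is flushed rather than waiting forever.
    """
    order = sorted(range(len(requests)), key=lambda i: requests[i][0])
    start, finish = {}, {}
    free_at = 0
    for k in range(0, len(order), batch_size):
        batch = order[k:k + batch_size]
        begin = max(free_at, max(requests[r][0] for r in batch))
        end = begin + max(requests[r][1] for r in batch)
        for r in batch:
            start[r], finish[r] = begin, end
        free_at = end
    return start, finish


def continuous_batching(requests: list[Request], max_slots: int):
    """Iteration-level scheduling (toy model).

    Before every decoding step, arrived requests are admitted into any free
    slot; a request leaves the batch the moment it emits its last token.
    """
    pending = deque(sorted(range(len(requests)), key=lambda i: requests[i][0]))
    active = {}  # request id -> tokens still to generate
    start, finish = {}, {}
    step = 0
    while pending or active:
        # Admit waiting requests that have arrived, while slots are free.
        while pending and len(active) < max_slots and requests[pending[0]][0] <= step:
            rid = pending.popleft()
            active[rid] = requests[rid][1]
            start[rid] = step
        # One decoding step: every active request produces one token.
        for rid in list(active):
            active[rid] -= 1
            if active[rid] == 0:
                del active[rid]
                finish[rid] = step + 1
        step += 1
    return start, finish


# Four made-up requests sharing two slots.
requests = [(0, 4), (1, 2), (5, 3), (6, 1)]
for name, schedule in (("static", static_batching), ("continuous", continuous_batching)):
    start, finish = schedule(requests, 2)
    waits = [start[i] - requests[i][0] for i in range(len(requests))]
    print(name, "waits:", waits, "finish:", [finish[i] for i in range(len(requests))])
```

In this toy run the continuous scheduler starts every request on the step it arrives, and short requests finish without being held up by the longest member of their batch. Real servers also have to deal with prefill cost and KV-cache memory, which this sketch ignores.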