I wouldn't be too harsh, scaling x10 YoY is a bit hard on the infra!
OpenAI managed it way better, but we might have Microsoft to thank for that.
isn't serving Claude embarrassingly parallel tho?
OpenAI managed it way better, but we might have Microsoft to thank for that.