winwang, yesterday at 5:28 PM

There's another (orthogonal) possible explanation: they're using more GPUs for stress-testing before a product launch.


Replies

zamadatix, today at 8:24 AM

That's less an orthogonal explanation and more an example of why they'd do something like serve a quantized model.