Hacker News

EGreg, yesterday at 8:58 PM

Why does ChatGPT slow down so much when the conversations get long, while Claude does compaction?

My best guess is that ChatGPT is running something in your browser to try to determine the best things to send down to the model API, when it should have been running quantized models on its own server.