I typically find myself using a context of between 150-500k with GPT models so local models are simp...

greenavocado • yesterday at 5:48 PM • 2 replies • view on HN

I typically find myself using a context of between 150-500k with GPT models so local models are simply not enough and I stopped using them.

stymaar • yesterday at 5:53 PM

That's way higher than their optimal ceiling (and absolutely suboptimal from a token cost point of view), why are you doing that?

➕ show 1 reply

c0rruptbytes • yesterday at 6:01 PM

large contexts degrade the performance - attention doesn't work will for large windows like that and cloud models are kind of hacking it

local models do involve some context engineering to get it okay, but it's not that rough

alt Hacker News