Hacker News

konaradd, today at 12:22 PM

> applying this compression algorithm at scale may significantly relax the memory bottleneck issue.

I don’t think they’re going to downsize, though. I think the big players will just use the freed-up memory for more workflows or larger models, because they want to scale up. It’s an arms race for the best models.


Replies

miohtama, today at 1:51 PM

It will also help with local inference, making AI possible without the big players.

Verdex, today at 12:46 PM

Known in the business as "pulling a Jevons."