Hacker News

oompydoompy74 (yesterday at 10:26 AM)

Remaining dependent on proprietary frontier models that you can only access via an API makes no sense whatsoever. My hope is that the future is open-weight models running on local hardware.


Replies

naasking (yesterday at 2:32 PM)

Eventually, yes. ParoQuant is hopefully the future here, 4-bit weights with no real degradation from FP16:

https://github.com/z-lab/paroquant