logoalt Hacker News

elliotbnvltoday at 3:08 PM1 replyview on HN

It is insane that we are comparing locally-hostable models to leading cloud providers, it is wild to me that this article even exists.

We have come a long way, and very clearly have a long way yet to go.


Replies

nijavetoday at 3:17 PM

Calling GLM-5.2 locally hostable is a bit of a stretch. It's 1.5Ti of weights at bf16. FP8 requires >800Gi of VRAM which is well into data center multi-GPU systems

show 1 reply