logoalt Hacker News

nijavetoday at 3:17 PM1 replyview on HN

Calling GLM-5.2 locally hostable is a bit of a stretch. It's 1.5Ti of weights at bf16. FP8 requires >800Gi of VRAM which is well into data center multi-GPU systems


Replies

elliotbnvltoday at 3:29 PM

It's more about the trajectory.