logoalt Hacker News

susliklast Wednesday at 4:18 AM1 replyview on HN

What is reasonable hardware in your case? Doesn’t this model require 50+ Gb vram?


Replies

kushalast Wednesday at 7:11 AM

Gemma-4-26B-A4B does not require 50+ Gb of vram. It is a MoE model so only 4B of active parameters at a time and not as GPU dependent. I can run it on 16gb of vram and ~20gb of DDR5 regular ram for a 8 bit quant.