simonw • today at 3:38 PM • 1 reply

I think gemma-4-26b-a4b and Qwen3.6-35B-A3B show that there's something very interesting about a local model that does mixture-of-experts (which helps a lot with performance) and has on the order of 30 billion parameters.

These models are very capable, and use around 20-30GB of RAM while they are running.

Provided you have 64GB of RAM, that leaves space for running other applications at the same time.
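
A rough back-of-the-envelope sketch of that arithmetic, in Python. The bits-per-weight values and the `overhead_gb` figure are illustrative assumptions, not measurements of either model, and the function names are hypothetical. It assumes that all experts of a mixture-of-experts model stay resident in memory, so the total parameter count (not the active count) drives RAM use.

```python
def estimate_weights_gb(total_params_billions: float, bits_per_weight: float) -> float:
    """Approximate size of the weights alone, in GB (10^9 bytes).

    Assumes every parameter is stored at the same width, which real
    quantization formats only approximate.
    """
    return total_params_billions * bits_per_weight / 8


def headroom_gb(system_ram_gb: float, total_params_billions: float,
                bits_per_weight: float, overhead_gb: float = 0.0) -> float:
    """RAM left for other applications after loading the model.

    overhead_gb is a placeholder for context cache and runtime buffers;
    the right value depends on the runtime and context length.
    """
    weights = estimate_weights_gb(total_params_billions, bits_per_weight)
    return system_ram_gb - weights - overhead_gb


# Illustrative widths only: 4, 6 and 8 bits per weight for a 30B-parameter model.
for bits in (4, 6, 8):
    weights = estimate_weights_gb(30, bits)
    left = headroom_gb(64, 30, bits, overhead_gb=4)
    print(f"{bits}-bit: ~{weights:.1f} GB weights, ~{left:.1f} GB left of 64 GB")
```

Under these assumptions, a model of about 30 billion parameters lands in the 20-30GB range at roughly 5 to 8 bits per weight, which is consistent with the figures above.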

Replies

chrisweekly • today at 3:50 PM

Obtaining that 64GB of RAM is a meaningful obstacle for many.
