logoalt Hacker News

Ladiosslast Friday at 9:22 AM0 repliesview on HN

My system has 16 Gb VRAM / 32 Gb RAM, and ollama runs qwen3.6:latest at decent speed just fine. The 35b model is a moe, so I guess the whole model is offloaded.