logoalt Hacker News

bachmeieryesterday at 8:52 PM2 repliesview on HN

I'm not much interested in vibe coding (for those who aren't aware that LLMs have other uses). The specific model I've been using with Ollama is hf.co/unsloth/Qwen3-Coder-30B-A3B-Instruct-GGUF:UD-Q4_K_XL and it's amazing how fast it is on 64 GB of RAM and i5-13400 CPU. No GPU on this computer. Gemma 4 E4B will think for a couple of minutes vs 3-5 seconds for Qwen. It's hard to believe how much you can do with such limited hardware using their models.


Replies

mailleyesterday at 10:44 PM

What are your use cases?

metalliqazyesterday at 11:25 PM

I have a much more powerful PC and I would not call Qwen3-Coder-30B-A3B "fast" on my machine by any stretch of the word. How are you running it?