
oceanplexian yesterday at 7:31 PM

If you want to do coding with a local LLM, your best bet is a 6-year-old Nvidia RTX 3090, which is substantially more powerful than the highest-end overhyped Apple product for 1/5th the price.


Replies

chorizo yesterday at 7:57 PM

That’s 24 GB of VRAM, which is not enough to run a 27B model at a useful quantization and context size.
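
A rough way to check a claim like this is to add up the weight memory and the KV cache. The sketch below is only back-of-the-envelope arithmetic: the layer count, KV head count, head dimension, bits per weight, and context length are hypothetical placeholders, not the specs of any particular 27B model, and real runtimes add overhead on top.

```python
GIB = 2**30

def weight_gib(params_billion: float, bits_per_weight: float) -> float:
    """Memory for the weights alone, in GiB."""
    return params_billion * 1e9 * bits_per_weight / 8 / GIB

def kv_cache_gib(n_layers: int, n_kv_heads: int, head_dim: int,
                 context_tokens: int, bytes_per_elem: int = 2) -> float:
    """KV cache for one sequence; the factor of 2 covers keys and values."""
    return 2 * n_layers * n_kv_heads * head_dim * context_tokens * bytes_per_elem / GIB

# Hypothetical numbers for illustration only (assumed, not from the thread):
# a 27B dense model at ~4.5 bits per weight, 64 layers, 8 KV heads,
# head dimension 128, fp16 cache, 32k tokens of context.
weights = weight_gib(27, 4.5)
cache = kv_cache_gib(n_layers=64, n_kv_heads=8, head_dim=128, context_tokens=32768)
print(f"weights ~ {weights:.1f} GiB, KV cache ~ {cache:.1f} GiB, "
      f"total ~ {weights + cache:.1f} GiB")
```

The total swings a lot with the assumed quantization, cache precision, and context length, so whether a given model fits is worth computing for the actual architecture rather than taken from this sketch.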

ThunderSizzle today at 10:15 AM

The cheapest 3090s I could find with any sort of guarantee were pushing $1500.

An AMD AI Pro R9700 32GB brand new is $1350 right now.

After some tweaking, I had it running faster than the 3090 on the models the 3090 could run, and it could obviously handle higher context limits and bigger models thanks to the extra VRAM.

iagooar yesterday at 8:23 PM

My problem is I won't accept anything lower than the 96GB the RTX Pro 6000 Blackwell has. My dream is a workstation with 2x Pro 6000 to run DeepSeek v4 Flash comfortably, possibly Qwen 3.6 / ornith at turbo speed.

But man, I have never purchased a computer which is more expensive than a decent family car.

jnovek yesterday at 8:04 PM

An M1 Ultra has 800 GB/s of unified memory bandwidth. It’s nothing to do with Apple; it’s their microarchitecture. They’re just about the only game in town with high-bandwidth memory if you want >24GB (for less than $10k, anyway).
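
Memory bandwidth matters here because token-by-token decoding on a dense model is commonly treated as bandwidth-bound: each generated token needs roughly one pass over the weights. A minimal sketch of that rule of thumb follows, using the 800 GB/s M1 Ultra figure and a hypothetical 16 GB weight footprint (an assumption, not a number from the thread). It gives a ceiling only and ignores compute limits, KV cache reads, and runtime overhead.

```python
def max_decode_tokens_per_s(bandwidth_gb_per_s: float, weights_gb: float) -> float:
    """Upper bound on decode speed if every token requires reading all weights once."""
    return bandwidth_gb_per_s / weights_gb

# 800 GB/s is the M1 Ultra bandwidth quoted above; 16 GB is a hypothetical
# quantized-model size chosen only to make the arithmetic easy to follow.
ceiling = max_decode_tokens_per_s(bandwidth_gb_per_s=800, weights_gb=16)
print(f"theoretical ceiling ~ {ceiling:.0f} tokens/s")
```

Measured throughput is typically below this ceiling, so the function is best used to compare devices against each other rather than to predict absolute speeds.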

dheera yesterday at 9:25 PM

32GB V100
