SlavikCA, last Thursday at 5:50 PM

I'm running it on my Intel Xeon W5 with 256GB of DDR5 and 72GB of Nvidia VRAM. Paid $7-8k for this system. It would probably cost twice as much now.

Using UD-IQ4_NL quants.

Getting 13 t/s. Using it with thinking disabled.


Replies

GrayShade, last Friday at 5:33 AM

I get 20 t/s on the UD-Q6_K_XL quant, Radeon 6800 XT.