logoalt Hacker News

christina97today at 2:29 AM0 repliesview on HN

Start with a quant, you can run the Qwen 27B model at 4-bit on one 3090, presumably 6/8-bit on 2x3090.