logoalt Hacker News

Patrick_Devineyesterday at 10:31 PM0 repliesview on HN

Try it with mxfp8 or bf16. It's a decent model for doing tool calling, but I wouldn't recommend using it with 4 bit quantization.