logoalt Hacker News

TheCycoONElast Monday at 8:32 PM1 replyview on HN

I have 32GB of RAM with 16GB VRAM and I haven't had a lot of luck running larger models like this. Are you able to expand on that?


Replies

slimlast Monday at 9:03 PM

use llama.cpp with cuda

show 1 reply