I have 32GB of RAM with 16GB VRAM and I haven't had a lot of luck running larger models like th...

TheCycoONE • last Monday at 8:32 PM • 1 reply • view on HN

I have 32GB of RAM with 16GB VRAM and I haven't had a lot of luck running larger models like this. Are you able to expand on that?

slim • last Monday at 9:03 PM

use llama.cpp with cuda

➕ show 1 reply

alt Hacker News