madtowneast, today at 12:13 AM

You're probably running into not having enough VRAM to load the entire model at once. You might want to try https://github.com/AlexsJones/llmfit
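For a rough sanity check before reaching for a tool, here's a back-of-the-envelope sketch in Python. It only counts the weights (parameter count times bits per weight) and pads that by a guessed 20% for KV cache, activations, and runtime buffers. The function names and the `overhead_fraction` default are my own assumptions for illustration, not anything llmfit does, and real overhead depends heavily on context length and runtime.

```python
def estimate_weight_gib(n_params_billion: float, bits_per_weight: float) -> float:
    """Approximate memory needed for the model weights alone, in GiB."""
    total_bytes = n_params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 2**30


def fits_in_vram(
    n_params_billion: float,
    bits_per_weight: float,
    vram_gib: float,
    overhead_fraction: float = 0.2,  # assumed headroom, not a measured value
) -> bool:
    """Rough check: do the weights plus assumed overhead fit in VRAM?"""
    needed_gib = estimate_weight_gib(n_params_billion, bits_per_weight)
    needed_gib *= 1 + overhead_fraction
    return needed_gib <= vram_gib


if __name__ == "__main__":
    # Hypothetical example: a 7B-parameter model on an 8 GiB card.
    for bits in (16, 8, 4):
        weights = estimate_weight_gib(7, bits)
        verdict = "fits" if fits_in_vram(7, bits, vram_gib=8) else "does not fit"
        print(f"{bits}-bit: ~{weights:.1f} GiB of weights, {verdict} in 8 GiB")
```

If the estimate comes out over your card's VRAM, the usual options are a smaller model, a lower-bit quantization, or offloading some layers to system RAM.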


Replies

greyskull, today at 12:48 AM

It's certainly part of the problem. Thanks, I'll give that a shot.