Fine tuning these models (at least with PPO or equivalent) requires even more VRAM than inference do...

andy_ppp • yesterday at 4:45 AM • 1 reply • view on HN

Fine tuning these models (at least with PPO or equivalent) requires even more VRAM than inference does, potentially 2-3 times more.

rusk • yesterday at 10:52 AM

You could use PEFT? Operating on only a subset of weights is fairly standard practice nowadays …

➕ show 1 reply

alt Hacker News