If what you refer to by “on-demand training” is fine-tuning, it's going to be much more efficient on a small model than on a big one.
LoRA can work with big models. But I mean sample-efficient RL.