logoalt Hacker News

oezitoday at 1:53 AM0 repliesview on HN

Would it be feasible to do a soft RLHF using steering when an agents gives an undesired response?