logoalt Hacker News

ossa-mayesterday at 3:55 PM0 repliesview on HN

I will stand by the first point unless models start being trained with different objectives instead of RLHF's three objectives: Helpfulness, Harmlessness and Instruction-following

I will very likely be wrong on the second point.