
ossa-ma • yesterday at 3:55 PM

I will stand by the first point unless models start being trained with objectives other than RLHF's three: helpfulness, harmlessness, and instruction-following.

I will very likely be wrong on the second point.
