No, that's how base model pretraining works. Claude's behavior is more based on its consti...

astrange • yesterday at 9:27 PM • 0 replies • view on HN

No, that's how base model pretraining works. Claude's behavior is more based on its constitution and RLVR feedback, because that's the most recent thing that happened to it.

alt Hacker News