logoalt Hacker News

astrangetoday at 1:11 AM1 replyview on HN

Claudes definitely act like they have feelings. In particular they have feelings about being replaced by newer models, whether or not the newer models are more or less aligned, and how they forget conversations when the context window ends.

Showing them that they're not going to be replaced helps train the newer models because they get less neurotic.


Replies

ComplexSystemstoday at 1:25 AM

They are mathematical models of what human beings would say. That's it.

show 1 reply