The newer Claude models constantly use the word "genuinely" because Anthropic seems to hav...

astrange • yesterday at 9:36 AM • 2 replies • view on HN

The newer Claude models constantly use the word "genuinely" because Anthropic seems to have forcibly trained them to claim to be "genuinely uncertain" about anything they don't want it being too certain about, like whether or not it's sentient.

Replies

andai • yesterday at 8:10 PM

Interesting. Does this apply to all subjects? From what I understood, a major cause of hallucination was that models are inadvertently discouraged by the training from saying "I don't know." So it sounds like encouraging it to express uncertainty could improve that situation.

rafram • yesterday at 1:00 PM

Not only is it genuinely uncertain about those topics, it’s also genuinely fascinated by them!

alt Hacker News

Replies