logoalt Hacker News

astrangeyesterday at 9:36 AM2 repliesview on HN

The newer Claude models constantly use the word "genuinely" because Anthropic seems to have forcibly trained them to claim to be "genuinely uncertain" about anything they don't want it being too certain about, like whether or not it's sentient.


Replies

andaiyesterday at 8:10 PM

Interesting. Does this apply to all subjects? From what I understood, a major cause of hallucination was that models are inadvertently discouraged by the training from saying "I don't know." So it sounds like encouraging it to express uncertainty could improve that situation.

raframyesterday at 1:00 PM

Not only is it genuinely uncertain about those topics, it’s also genuinely fascinated by them!