
GodelNumbering, today at 2:12 PM

A more interesting part, probably worth highlighting: the SAME model won't always return the same output when given the same fact-check prompt.

You ask a human a fact-check question 1000 times, they give the same answer 1000 times. You ask an LLM the same question 1000 times, your results could vary significantly.

Humans work from metamemory (knowing what they know), while LLMs sample each answer from a probability distribution.
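To make the point about LLMs drawing answers from probabilities concrete, here is a rough sketch of temperature sampling over a toy answer distribution. The scores in `logits`, the three answer labels, and the helper name `sample_answer` are all made up for illustration; a real model samples token by token over a much larger vocabulary.

```python
import math
import random
from collections import Counter


def sample_answer(logits, temperature, rng):
    """Pick one answer from hypothetical answer scores.

    temperature == 0 means greedy decoding (always the top score);
    any positive temperature samples from a softmax over the scores.
    """
    if temperature == 0:
        return max(logits, key=logits.get)
    answers = list(logits)
    scaled = [logits[a] / temperature for a in answers]
    peak = max(scaled)  # subtract the max for numerical stability
    weights = [math.exp(s - peak) for s in scaled]
    return rng.choices(answers, weights=weights, k=1)[0]


# Hypothetical scores for one fact-check prompt (assumed, not measured).
logits = {"true": 2.0, "false": 1.0, "unverifiable": 0.0}
rng = random.Random(0)

# Ask the "same question" 1000 times under each decoding setting.
greedy = Counter(sample_answer(logits, 0, rng) for _ in range(1000))
sampled = Counter(sample_answer(logits, 1.0, rng) for _ in range(1000))

print("greedy: ", dict(greedy))
print("sampled:", dict(sampled))
```

In this toy setup greedy decoding gives one answer every time, while sampling at temperature 1.0 spreads the 1000 answers across all three labels in rough proportion to their softmax probabilities.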


Replies

logged4upvoting, today at 3:00 PM

That is not true. Over an extended task that you cannot hold completely in memory, humans do not behave with 100% consistency.

I have labeled datasets with a human team and shown the same task to the same user on a different day, and they answered differently. Of course, they are consistent with themselves most of the time, but not always.
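One simple way to quantify this kind of self-consistency is the fraction of repeated items that received the same label both times. The sketch below assumes each labeling pass is stored as a dict from item ID to label; the function name `self_consistency` and the sample data are hypothetical, not taken from the labeling project described above.

```python
def self_consistency(first_pass, second_pass):
    """Fraction of items labeled identically in two passes by one annotator.

    Only items present in both passes are compared.
    """
    shared = first_pass.keys() & second_pass.keys()
    if not shared:
        raise ValueError("the two passes share no items")
    same = sum(first_pass[item] == second_pass[item] for item in shared)
    return same / len(shared)


# Hypothetical labels from the same annotator on two different days.
day_one = {"q1": "true", "q2": "false", "q3": "true", "q4": "false"}
day_two = {"q1": "true", "q2": "false", "q3": "false", "q4": "false"}

rate = self_consistency(day_one, day_two)
print(f"self-consistency: {rate:.2f}")
```

Raw agreement like this does not correct for agreement by chance; a chance-corrected statistic such as Cohen's kappa is a common alternative. The same function works on two runs of an LLM over the same prompts, which makes the human and model numbers directly comparable.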