logoalt Hacker News

bityardtoday at 3:46 PM1 replyview on HN

It's a surprisingly common misconception that models contain any metadata at all about themselves in their weights. If you ask them, "What model are you?" they either retrieve the answer from the system prompt, or they hallucinate an answer. Same goes for questions about knowledge cut-off, how many parameters they have, the source of their training data, etc.


Replies

hereme888today at 4:21 PM

Huh. That kinda makes sense. So you think it's hallucinating it's model name?

show 1 reply