It's a surprisingly common misconception that models contain any metadata at all about themselv...

bityard • today at 3:46 PM • 1 reply • view on HN

It's a surprisingly common misconception that models contain any metadata at all about themselves in their weights. If you ask them, "What model are you?" they either retrieve the answer from the system prompt, or they hallucinate an answer. Same goes for questions about knowledge cut-off, how many parameters they have, the source of their training data, etc.

Replies

hereme888 • today at 4:21 PM

Huh. That kinda makes sense. So you think it's hallucinating it's model name?

➕ show 1 reply

alt Hacker News

Replies