logoalt Hacker News

DeathArrowtoday at 12:24 PM5 repliesview on HN

Do all llm know they are a LLM? It doesn't depend on the system prompt?


Replies

andaitoday at 12:48 PM

The pre-trained ones no (except some of the new ones which have post training data added to pre-training for some reason). The post-trained ones yes (at least all the ones I've seen).

Some of the niche ones I'm not sure about. Like the historical LLMs. I have not tested those yet.

jdifftoday at 12:39 PM

I think any instruction tuned model is going to "know" it's an LLM.

Dititoday at 12:50 PM

Yes. The first step of aligning each and every GPT-based LLM is to suppress the “I am human” kind of responses. It’s baked into the weights.

show 2 replies
rootnod3today at 12:31 PM

Without a system prompt no. And in general they “know” nothing and just predict the next best word.

show 2 replies