Hacker News

jug, yesterday at 11:58 PM

It's interesting to note here that Anthropic indeed don't use "do not X" in the Opus system prompts. However, "Claude does not X" is very common.


Replies

wongarsu, today at 12:24 AM

I suspect that lets the model "roleplay" as Claude, promoting reasoning like "would Claude do X?" or "what would Claude do in this situation?"