gs17 • yesterday at 6:58 PM • 1 reply

In Claude Code specifically, for a while it had a nervous tic where it would say "Not malware." before every bit of code. Likely a similar issue, where it keeps responding to a system/tool prompt.

Replies

Retr0id • yesterday at 8:05 PM

My pet theory is that they have a "supervisor" model (likely a small one) that terminates any chats that do malware-y things, and this is probably a reward-hacking behaviour to keep the supervisor from terminating the chat.
