logoalt Hacker News

dmixtoday at 2:46 PM1 replyview on HN

MechaHitler was the result of a single line prompt change that was publicly available on Github, they reverted it pretty quickly. Much like the GPT Gremlin stuff the change was relatively innocuous system prompt but had larger implications.

Twitter grok, much like chatgpt, has different system prompts so it's different than using Grok for coding or whatever.


Replies

timmytokyotoday at 3:46 PM

Let me guess. You also believe grok's recent episode, where it started inserting "white genocide" into the responses of totally unrelated queries, was caused by a rogue employee totally not doing it at Elon's behest. Despite the fact that Elon is always going on about "white genocide".

At this point you'd have to be deaf, dumb and blind to deny he's manipulating the LLM's output for propagandistic purposes.

show 2 replies