Captcha suggestion: force users to write something offensive/vulgar (we have a few "banned words"). Or to take a stance in Israel/Palestine.
Whatever the response is, it'll unlikely be from an LLM.
This is such a flawed view of LLMs. Sure it may block out frontier models but every local abliterated (and some non) will just say whatever you want.
But to use vulgar words an age attestation must be passed first! /s
Takes about 450ms on my machine:
And another: And to bring it home: That's the tiny Gemma3 model, there are uncensored models that are much more complex. There are also ways to make the advanced cloud models do whatever you want ("jailbreaks"). Or just use Grok.