"AI safeguards" are not working I guess.. or maybe they're only working against those...

zb3 • yesterday at 8:47 PM • 1 reply • view on HN

"AI safeguards" are not working I guess.. or maybe they're only working against those who'd like to secure their software.. good job Anthropic + OpenAI!

Replies

himata4113 • today at 4:13 AM

The AI safeguards are indeed a joke, you can get around their classifier by simply masking out all the unsafe words and it will happily work on your rootkit.

alt Hacker News

Replies