Hacker News

zb3 yesterday at 8:47 PM

"AI safeguards" are not working, I guess... or maybe they're only working against those who'd like to secure their software. Good job, Anthropic + OpenAI!


Replies

himata4113 today at 4:13 AM

The AI safeguards are indeed a joke: you can get around their classifier by simply masking out all the unsafe words, and it will happily work on your rootkit.