logoalt Hacker News

silver_silvertoday at 12:57 AM1 replyview on HN

“Works on my machine” actually isn’t a good enough response in this case, or to the comment about the video of the man being shot. LLMs are infamously easy to jailbreak and children are very good at getting around guardrails. You should at the very least be doing intense adversarial prompt testing but honestly this idea is just inherently poorly thought out. I guarantee you it’s going to expose children to harmful content


Replies

andrewdugtoday at 1:07 AM

We'll keep testing and working to improve it. Thank you for the feedback.