Cybersecurity researchers aren't happy about the guardrails on Anthropic's Fable

578 points • by speckx • yesterday at 4:42 PM • 506 comments • view on HN

https://www.theverge.com/ai-artificial-intelligence/947973/f...

Comments

I am using LLM to build some security tool, and I ran into this a few times. I have to come up with a reasoning to convince (?!!) Fable to continue the work without downgrading.

I assume Anthropic will continue to tune the model, so I am not too bothered by this.

➕ show 1 reply

alt Hacker News

Cybersecurity researchers aren't happy about the guardrails on Anthropic's Fable

Comments