logoalt Hacker News

Cybersecurity researchers aren't happy about the guardrails on Anthropic's Fable

578 pointsby speckxyesterday at 4:42 PM506 commentsview on HN

https://www.theverge.com/ai-artificial-intelligence/947973/f...


Comments

guardiangodyesterday at 10:59 PM

I am using LLM to build some security tool, and I ran into this a few times. I have to come up with a reasoning to convince (?!!) Fable to continue the work without downgrading.

I assume Anthropic will continue to tune the model, so I am not too bothered by this.

show 1 reply