logoalt Hacker News

InsideOutSantatoday at 5:19 PM1 replyview on HN

Amazon did not remove any guardrails from Fable.


Replies

0xytoday at 5:26 PM

What? I personally experienced Fable outright refuse to do ANY security-related tasks, including hardening code or modifying security-related features. That was a guardrail. It was bypassed.

Anthropic themselves specifically called them safeguards. [1]

"When Fable’s classifiers detect a request related to cybersecurity, biology and chemistry, or distillation, the response is automatically handled by Claude Opus 4.8 instead"

This is exactly what was bypassed. They got Fable to work on security topics.

[1] https://www.anthropic.com/news/claude-fable-5-mythos-5

show 1 reply