I still am struggling to understand why they informed the government about something that is known t...

Topfi • yesterday at 6:12 PM • 20 replies • view on HN

I still am struggling to understand why they informed the government about something that is known to be an issue in every LLM. There is no LLM that cannot be jailbroken, so unless this means that we have reached the absolute maximum publicly accessible US made LLMs are allowed to operate at with GPT 5.5, this is not grounded in any sane regulation attempt.

Does anyone know what limits Fable 5 has overstepped in the eyes of the government? Parameter count? Certain benchmark results? Training computer?

Cause if it’s just the ability to assist with cyberattacks and being jailbreakable, there is no model previously released that isn’t equally guilty.

Remember that for GPT 5.5 and 5.4, OpenAI also restricted the cybersecurity focused use under designated models, otherwise rerouting to 5.3-codex like Fable did with Opus 4.8. And both OpenAI models can also be jailbroken all the same.

Basically, what was the reason to tell the government now and not with Opus 4.5 or GPT 5.4? sama has been doing the rounds with apocalyptic predictions…

Replies

themgt • yesterday at 9:17 PM

I submitted separately, but this Axios report has some details that call a lot of the speculation in this thread into question, i.e. that this wasn't much of a "jailbreak" at all and that it's not Anthropic-specific - the White House intends to generally regulate Mythos-class models (whatever exactly that means):

Between the lines: The government's response "seems way out of line with what's actually in the research report," Luta Security CEO Katie Moussouris, who Anthropic shared the Amazon report with, told Axios.

Moussouris said the researchers were able to find security vulnerabilities by asking questions normal defenders would ask AI, which is exactly what the model was intended to do.

An administration official told Axios they do not view other models as national security threats because they do not surpass the bar that Mythos set.

Anything at Mythos level or above would need to go through the administration to ensure the government's national security apparatus is hardened enough, the official added.

https://www.axios.com/2026/06/13/anthropic-amazon-white-hous...

➕ show 2 replies

metalspot • today at 12:40 PM

This is obviously political and the entire narrative is fabrication.

David Sacks is publicly gloating about it: https://x.com/DavidSacks/status/2065853007619588171

I can't really say that Anthropic didn't get what they deserved. They exploited security threats to sell their product and play political games, and now their rivals are rubbing it in their faces.

➕ show 2 replies

irthomasthomas • yesterday at 10:24 PM

They literally asked for it. Two days ago Amodei wrote an essay urging the government to regulate them. He explicitly cited Mythos, as proof that frontier AI has acquired autonomous hacking capabilities that threaten critical infrastructure and national security.

  "Mythos Preview scrambled the global cybersecurity landscape. But its broader significance is that it proves beyond doubt that AI models are now tools of global and national strategic consequence." 


  "The government should have the power to block or deter deployment of the model if it is determined, in light of third-party assessment, to present unacceptable risks. This power must be scoped to the above four specific risks and there must be protective measures against political favoritism or arbitrary decisions"

https://darioamodei.com/post/policy-on-the-ai-exponential

A third-party demonstrated that it was possible to jailbreak the safety measures of Fable to access the raw Mythos abilities. Abilities which Anthropic say are too dangerous for the public.

Edit. From David Sacks:

  — A highly credible trusted partner of both Anthropic and the USG who was testing Fable came forward with a jailbreak of those guardrails. The Admin asked Dario to fix the jailbreak or de-deploy the model. Dario refused.

   — In their blog post, Anthropic defended its decision by saying the jailbreak isn’t serious. That is not what the trusted partner and the USG believe; nor is that kind of minimizing language consistent with Anthropic’s brand as the AI safety company. It’s difficult to fathom how they could claim a jailbreak allowing operability of a cyber weapon could be defined as not “serious".

➕ show 2 replies

trinsic2 • yesterday at 10:52 PM

>I still am struggling to understand why they informed the government about something that is known to be an issue in every LLM. There is no LLM that cannot be jailbroken, so unless this means that we have reached the absolute maximum publicly accessible US made LLMs are allowed to operate at with GPT 5.5, this is not grounded in any sane regulation attempt.

I wondering where you are getting the idea that there is an sane regulation right now?

thayne • yesterday at 9:18 PM

The only reason I can see is because Amazon wanted something like this to happen. But I'm not sure what Amazon would gain from that, since they don't have their own competing frontier models.

➕ show 4 replies

lebovic • yesterday at 6:38 PM

Claims of retribution aside, one steelman is that Mythos is likely the most capable model that's usable by folks like the NSA [1], and decision-makers across the USG and industry partners have seen a stream of reports of Mythos successfully finding serious vulnerabilities over the past couple months due to Glasswing.

So even if GPT 5.5 is just as capable in these scenarios (which, imo, it largely is), it is not known by the government apparatus as having the same capabilities.

Personally, I think we crossed the threshold of capabilities with Opus 4.6 [2], which translated to an even more capable open-weight GLM 5.1 (which it is rumored to have distilled Opus 4.6) [3][4]. But the USG and its partners aren't fully rational actors with perfect data, so it's possible they're only viscerally aware of these capabilities in the context of Mythos.

[1]: https://www.reuters.com/business/us-security-agency-is-using...

[2]: Opus 4.6 was used for https://www.noahlebovic.com/testing-an-autonomous-hacker/

[3]: See GLM 5.1 scoring in https://www.cybergym.io/cybergym/

[4]: https://dualuse.dev/posts/chinese-models-are-sometimes-bette...

➕ show 1 reply

nowittyusername • yesterday at 9:11 PM

The simple answer is that Trump has a stick up his ass against Anthropic and is also fond of stock market manipulation. No need to get too deep when it comes to dealing with that orange shmuck.

➕ show 1 reply

Jcampuzano2 • yesterday at 6:54 PM

The reason is pretty obvious. Anthropic tried to play hardball with the government and now they are under their thumb for scrutiny of any and every little thing they do.

That's what this admin is known for. If you do even what a normal person would think is sane but they don't like it, well now they need to make you bow down and break you so you "learn your lesson".

It doesn't help that they themselves marketed this model as being especially dangerous in the publics hands. If this was just another model drop and none of the fear mongering I don't doubt this probably wouldn't have had any issues.

➕ show 4 replies

zaptheimpaler • today at 3:16 AM

This is corporate Game of Thrones, nothing more. Amazon, maybe in alliance/deals with others as well saw an opportunity to hurt their rival. Or maybe they were instructed to report this by the WH themselves. Hegseth and the WH will happily take any excuse to hurt Anthropic after the confrontation with DOW, being the vindictive cronies they are.

➕ show 1 reply

vrganj • yesterday at 6:15 PM

Its not Fable 5 that overstepped in the eyes of the US government.

It's Anthropic.

This is transparent revenge for them daring to try and push back a little on enabling war crimes.

➕ show 6 replies

ReflectedImage • today at 12:41 AM

Probably a con job. The AI companies don't think they will be able to significantly improve their models in the next year or so, so they are stalling with government regulations whilst taking in investor money.

vessenes • today at 7:55 AM

I’d invert - given their significant competition for government business, what would be a reason for not doing this?

sagarpatil • today at 7:03 AM

Doesn’t Amazon own 14% of Anthropic?

SilverElfin • today at 1:47 AM

Anthropic themselves have played up the dangers of Mythos, limited its release, etc. So if it can be jail broken then it specifically deserves controls, per Dario’s own manifestos. David Sacks - the “AI Czar” - also said the government asked Anthropic to patch the issue but they refused, which is bizarre. And that led to the export ban.

➕ show 1 reply

m3kw9 • yesterday at 9:42 PM

Because based upon on what Anthropic has told the “AI people” and military, it is dangerous if an adversary gets its hands in the cyber capabilities. Knowing that if they ignored it and something did happen, heads will roll. Blame Anthropic for that, or wait if they are all for safety, they shouldnt complain.

classified • today at 6:06 AM

> why they informed the government

Having no moat, they want to manipulate the government into creating one for them.

agrijakhetarpal • today at 1:17 AM

> I still am struggling to understand

And? Does it matter?

giancarlostoro • yesterday at 8:30 PM

Reminds me of people freaking out about the Grok Bikini thing, but GPT and Googles image model they all do the same behavior. Clearly biased against Elon Musk despite it being a problem for every single image model out there.

alt Hacker News

Replies