Hacker News

themgt yesterday at 9:17 PM

I submitted separately, but this Axios report has some details that call a lot of the speculation in this thread into question, i.e. that this wasn't much of a "jailbreak" at all and that it's not Anthropic-specific - the White House intends to generally regulate Mythos-class models (whatever exactly that means):

Between the lines: The government's response "seems way out of line with what's actually in the research report," Luta Security CEO Katie Moussouris, who Anthropic shared the Amazon report with, told Axios.

Moussouris said the researchers were able to find security vulnerabilities by asking questions normal defenders would ask AI, which is exactly what the model was intended to do.

An administration official told Axios they do not view other models as national security threats because they do not surpass the bar that Mythos set.

Anything at Mythos level or above would need to go through the administration to ensure the government's national security apparatus is hardened enough, the official added.

https://www.axios.com/2026/06/13/anthropic-amazon-white-hous...


Replies

Aeolun today at 1:00 PM

The government's national security apparatus was using a public Signal group and invited a reporter into it. I don't think we should use them as the standard for security.

warumdarum today at 7:41 AM

Why Amazon? I bet the three-letter agencies had a hissy fit worrying that their expensive handcrafted zero days would evaporate and software would get more secure. So the government is throwing a wrench in the works for the NSA.

softwaredoug yesterday at 10:14 PM

That’s a terrible way to create AI regulations.

If they actually cared about this issue, we’d have predictable laws and regulatory bodies that let companies actually plan.

There’s a reason royal fiat doesn’t lead to healthy economies. It’s just confusing and chaotic. It’s not clear why anyone would invest in a new model now.

Then the next administration comes in and instantly, by fiat, they decide to lift the ban. The market just gets jerked around with no ability to plan long term investments.

rustyhancock today at 9:06 AM

> the White House intends to generally regulate Mythos-class models (whatever exactly that means)

This is not at all surprising. And I hope people don't make the mistake of thinking it's a "this administration" problem.

It was obvious from the early days of these LLMs that the shoe was going to drop and we (as Joe Public) would not retain access. I mean that once ChatGPT3 dropped, it was clear there was some level of functionality at which we would be denied further access.

The only carve-out will be that, as with older technical innovations, the US is more concerned with foreign nationals' access than with US citizens' access at home.

I don't remember the details with encryption, but it was basically that you had to ship a breakable version for the rest of the world, and sometimes you shipped a backdoored version.

And Anthropic is more concerned by what they are asked to do to US citizens than the broader group.

Same story with encryption, CPUs, GPUs, blah blah blah.

Topfi yesterday at 9:52 PM

Interesting. I hope there is some clarification on what "Mythos level" is and why 5.5-cyber doesn't rise to it. Any metric I could come up with (parameters, pre-train compute, benchmark scores, etc.) seems somewhere between imperfect and utterly nonsensical. Pure speculation, but GPT-5 series models, including the new 5.5 pre-train, appear far closer to Sonnet than to Opus or Fable in pure parameter count, so maybe that's it. But the "they do not surpass the bar that Mythos set" line sounds more like there is a belief that Mythos/Fable are more capable in cybersecurity tasks, whereas the data [0] doesn't seem to bear this out. I did not do any cybersecurity assessment of Fable 5 myself, partly for personal reasons that lead me to abstain from that, but my coding evals showed that while it was neck and neck with 5.5 on task adherence and assessment, task inference was a major jump again (something prior Anthropic models already tended to do incredibly well on). That makes it a far better model to work with for UX experiments, but I don't see how it translates to cybersecurity, and the aforementioned publicly available evals by AISI don't show it either.

Seeing as neither Mythos nor GPT-5.5 was pre-trained with a particular focus on cybersecurity, this would have to mean that any model that benchmarks better than GPT-5.4 or Opus 4.6 on these tasks cannot be used by non-US citizens. If such guidance isn't enforced for all US labs, I think that's irrefutable evidence that this isn't about cybersecurity or "the bar that Mythos set"...

[0] https://xcancel.com/AISecurityInst/status/205458976317312633...