Anthropic’s Mythos AI Cracked Nearly All Classified U.S. Systems in Hours

NSA’s Gen. Joshua Rudd confirmed the Claude Mythos Preview breached nearly every sensitive network in a sanctioned U.S. intelligence exercise

Jun 24, 2026

2 min read

Key Takeaways

Anthropic’s Mythos AI breached nearly all classified U.S. systems within hours during testing.
Mythos uncovered over 10,000 critical flaws, including bugs missed for decades by automated tools.
Anthropic restricts Mythos access via Project Glasswing, channeling it through 40-plus vetted partners.

“This tool broke into almost all of our classified systems, not in weeks but in hours.” Those words came from Gen. Joshua Rudd of the NSA and U.S. Cyber Command, relayed by Sen. Mark Warner at a Senate Banking Committee hearing, according to CNBC. The tool: Anthropic’s unreleased Claude Mythos Preview, directed at highly sensitive government networks during a sanctioned testing exercise with U.S. intelligence agencies. The broader context of AI infrastructure investment — including the Stargate Project — makes clear how rapidly these capabilities are scaling.

An anonymous U.S. official confirmed to the Associated Press that Mythos rapidly identified vulnerabilities in “highly sensitive and secure” government systems. The official added a critical nuance: discovering weaknesses quickly does not mean exploiting them in the same timeframe. That distinction matters more than the headline suggests. This was a controlled, directed exercise — not an AI operating unsupervised inside Pentagon servers.

Still, the documented capabilities are sobering:

Thousands of previously unknown zero-day vulnerabilities found across every major operating system and browser
Working exploits produced on the first attempt in over 83% of cases, per ArmorCode’s analysis of Anthropic data
A 27-year-old OpenBSD bug and a 16-year-old FFmpeg flaw uncovered — both missed by decades of automated tools and human review
The UK AI Safety Institute confirmed Mythos can autonomously attack small, weakly defended enterprise systems once it has network access

The Skeleton Key Problem

Anthropic built a coalition to deploy Mythos defensively — because the alternative is letting equivalent capabilities emerge elsewhere without guardrails.

Anthropic has not released Mythos publicly. Instead, Project Glasswing channels access through vetted partners — AWS, Apple, Google, Microsoft, NVIDIA, CrowdStrike, JPMorgan Chase, and roughly 40 other organizations maintaining critical global software. Together, they’ve surfaced more than 10,000 high- or critical-severity flaws across their codebases. Crucially, Anthropic says those cyber capabilities weren’t specifically engineered into Mythos. They emerged as a side effect of raw capability improvements — an emergent property, not a designed feature.

Cloudflare CSO Grant Bourzikas called Mythos “a real step forward” for attacker-style capabilities, according to Politico. That same power, aimed defensively, patches vulnerabilities before adversaries find them. For a sense of how AI-powered websites are already reshaping productivity on the constructive side, the contrast is instructive. Aimed elsewhere, it hands nation-states a skeleton key to your infrastructure — and your infrastructure means everyone’s.

The governance vacuum is now the central problem. Your software stack, your bank’s systems, the grid powering your city — all of them potentially carry bugs that Mythos can find faster than any human team. Readers alarmed by these risks may find sobering historical context in documented tech scandals that exploited millions before accountability caught up. The real question isn’t whether AI can identify weaknesses in classified systems. That test delivered its answer in hours. The question is whether the institutions responsible for securing those systems — and the policies governing tools like Mythos — can move faster than the models designed to expose them. Concerns about digital overreach extend beyond AI capabilities alone; a recent report found the White House app secretly tracking users every four minutes, underscoring how government digital security failures cut in multiple directions.