The AI Too Dangerous to Release (They Are Releasing It)

Anthropic’s Claude Mythos discovers 6,202 critical bugs in open-source projects while company admits lacking security controls

Annemarije de Boer Avatar
Annemarije de Boer Avatar

By

Image: Deposit Photos

Key Takeaways

Key Takeaways

  • Claude Mythos automatically discovers 6,202 critical vulnerabilities across 1,000 open-source projects
  • Anthropic admits lacking safeguards to prevent weaponization yet promises future public access
  • Cybersecurity stocks drop 5-11% as AI vulnerability discovery outpaces human security teams

Finding software vulnerabilities used to require teams of security researchers months of painstaking analysis. Anthropic’s Claude Mythos does it automatically—and that’s exactly the problem.

The company admits no one, including itself, has built safeguards strong enough to prevent such models from being weaponized. Yet Anthropic simultaneously promises to make “Mythos-class models” publicly available once it develops “far stronger safeguards.”

This contradiction sits at the heart of AI’s cybersecurity revolution.

When AI Outpaces Human Security Teams

Mythos has already scanned more than 1,000 widely-used open-source projects, surfacing 6,202 high or critical-severity vulnerabilities. Among its discoveries: a 27-year-old bug in OpenBSD that survived decades of manual security review. The model doesn’t just find vulnerabilities—it can weaponize them, constructing working exploits that could enable convincing phishing sites or certificate forgery attacks.

Current access remains tightly controlled through Project Glasswing, limiting the model to vetted organizations like:

  • AWS
  • Apple
  • Microsoft
  • Major cybersecurity vendors

Even so, some open-source maintainers have asked Anthropic to slow down its disclosure rate because they lack resources to patch the flood of legitimate bugs Mythos keeps finding.

The Safeguards That Don’t Exist Yet

Here’s where things get complicated. Anthropic distinguishes between the current “Mythos Preview” (which will never go public) and future “Mythos-class models” that supposedly will. The company offers no concrete timeline beyond “near future” and no technical specifics about what “far stronger safeguards” would actually look like.

Meanwhile, unauthorized access has already occurred due to internal security lapses—raising questions about whether Anthropic can secure such powerful AI internally, let alone control its external distribution. The White House has intervened to block proposed access expansion from 50 to 120 organizations over national security concerns, creating a system of informal AI licensing through government pressure rather than legal frameworks.

The vulnerability discovery arms race has officially gone algorithmic. Cybersecurity stock prices dropped 5-11% when Mythos capabilities became public, while governments from Japan to India ordered emergency surveillance reviews. Your security team may soon need AI-powered tools just to keep pace with AI-powered attackers—assuming you can access them first.

Share this

At Gadget Review, our guides, reviews, and news are driven by thorough human expertise and use our Trust Rating system and the True Score. AI assists in refining our editorial process, ensuring that every article is engaging, clear and succinct. See how we write our content here →