The model demonstrates a level of autonomy comparable to that of a professional human researcher, particularly in its ability to execute long-range tasks. Unlike previous systems, which identified isolated syntax errors, this iteration excels at chaining vulnerabilities: it links seemingly innocuous flaws into sophisticated exploit sequences, bypassing traditional security layers that rely on the obscurity of complex code.
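To make "chaining" concrete, the idea can be sketched as a graph search: each low-severity flaw is an edge that moves an attacker from one capability to another, and a complete exploit is a path through those edges. The flaw names and states below are hypothetical illustrations, not findings from the model.

```python
from collections import deque

# Hypothetical vulnerability graph: each entry maps an attacker state to
# (flaw, resulting state) edges. Every flaw is individually low severity.
flaws = {
    "anonymous": [("verbose-error-leak", "knows-internal-paths")],
    "knows-internal-paths": [("path-traversal-read", "reads-config")],
    "reads-config": [("hardcoded-service-token", "service-account")],
    "service-account": [("unsafe-deserialization", "remote-code-exec")],
}

def find_chain(start, goal):
    """Breadth-first search for the shortest chain of flaws linking an
    initial foothold to a target capability."""
    queue = deque([(start, [])])
    seen = {start}
    while queue:
        state, path = queue.popleft()
        if state == goal:
            return path
        for flaw, nxt in flaws.get(state, []):
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, path + [flaw]))
    return None  # no chain exists

print(find_chain("anonymous", "remote-code-exec"))
# ['verbose-error-leak', 'path-traversal-read',
#  'hardcoded-service-token', 'unsafe-deserialization']
```

No single edge here would rate more than a minor advisory; the path as a whole is a full compromise, which is exactly what signature- and severity-based defenses tend to miss.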
Project Glasswing and the containment strategy
Recognizing that these capabilities could prove catastrophic in the wild, an industry-wide coalition has launched Project Glasswing, aiming to weaponize the AI for defense before it is co-opted by adversaries. The logic is simple yet desperate: give the defenders a head start with the very tools that could dismantle their infrastructure. This acknowledges a fundamental truth of modern security ethics: we can no longer assume a slow rollout will provide safety; we must actively pre-empt the inevitable exploitation of powerful code.
Resurrecting flaws in legacy infrastructure
The most startling revelation comes from the model's success against foundational software. It recently unearthed a bug in a piece of long-standing legacy infrastructure and demonstrated the ability to escalate user permissions to administrator level simply by running a binary. These are not just theoretical risks; they are cracks in the foundation of the global internet. The efficiency is unprecedented, with researchers reporting that they have found more bugs in weeks than in their entire careers combined.
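Privilege escalation by "simply running a binary" typically hinges on misconfigured executables, the classic example being the setuid bit, which lets a process assume the file owner's privileges (root, when the file is owned by root). As a minimal sketch of how a defender might audit for that legacy mechanism (this is an assumption about the flaw class, not the specific bug referenced above):

```python
import os
import stat

def find_setuid_binaries(root):
    """Walk a directory tree and flag regular files with the setuid bit
    set. Any such file runs with its owner's privileges, so a flaw in a
    root-owned setuid binary hands attackers administrator access."""
    hits = []
    for dirpath, _, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            try:
                st = os.lstat(path)
            except OSError:
                continue  # unreadable entry; skip rather than crash
            if stat.S_ISREG(st.st_mode) and st.st_mode & stat.S_ISUID:
                hits.append(path)
    return hits

# Example: audit a system binary directory.
# for path in find_setuid_binaries("/usr/bin"):
#     print(path)
```

A scan like this only inventories the attack surface; the hard part, and the part the model reportedly automates, is finding the exploitable flaw inside one of those binaries.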
The ethical mandate for collective defense
Software has effectively eaten the world, and by extension our vulnerabilities are now societal rather than technical. With the "can we" of capability largely settled, the focus must remain on the "should we" of deployment. We are entering an era where cybersecurity is the ultimate form of social security. Maintaining this digital fabric requires a transparency that many tech firms find uncomfortable, but as the capability gap closes, isolation is a luxury we can no longer afford.