NEWS

Anthropic Mythos Release Plan Exposes Patch Bottleneck

Published

2 months ago

May 27, 2026

Anthropic’s Claude Mythos Preview is still locked behind Project Glasswing, but the company’s May update makes the public release question sharper: the AI bug finder has shown it can find serious flaws faster than humans can verify, disclose and patch them. The company says Mythos-class models will reach general release only after stronger safeguards are ready.

That delay is a release gate, but the harder problem sits outside the model lab. The first Glasswing results show a security market moving from scarcity to overload, where the expensive work shifts from finding bugs to proving which ones matter and getting fixes installed.

The Gate Is Safeguards, Not Demand

In its Project Glasswing initial update, Anthropic said no company, including itself, has safeguards strong enough to prevent models with these cyber skills from being misused. That is the reason it gave for holding back a general release, even as it said it wants to make Mythos-class systems broadly available in the near future.

The company has not attached a date to that phrase. Instead, it plans to widen access first to more critical partners, including US and allied governments. That puts public access behind two tests: whether the safeguards can block harmful cyber outputs, and whether defenders can absorb the volume of findings without creating a backlog of known but unfixed weaknesses.

The April launch was already a compromise. In the Project Glasswing launch notice, Anthropic said partners including Amazon Web Services, Apple, Broadcom, Cisco, CrowdStrike, Google, JPMorganChase, the Linux Foundation, Microsoft, NVIDIA and Palo Alto Networks would get restricted access for defensive security work. The same notice committed up to $100 million in usage credits and $4 million in direct donations to open-source security groups.

Anthropic Mythos AI bug finder release safeguards and security risks.

A Vulnerability Flood Meets a Human Queue

The striking number is not just how many flaws the model found. It is how few have moved all the way through the repair chain. Anthropic said the system scanned more than 1,000 open-source projects and produced 23,019 total findings, including 6,202 it estimated as high or critical severity.

23,019 findings came from scans of more than 1,000 open-source projects.
6,202 findings were estimated by the model as high or critical severity.
90.6 percent of 1,752 assessed high or critical findings were confirmed as valid true positives.
75 of 530 reported high or critical bugs had been patched at the time of the update.

That last ratio is roughly 14 percent. Anthropic gives several reasons, including the early stage of the 90-day Coordinated Vulnerability Disclosure (CVD, private reporting before public detail) window and likely undercounting when projects fix quietly. Even so, the patch bottleneck is now visible in the numbers.

The company’s live coordinated vulnerability disclosure dashboard sharpened the point. As of May 22, it listed 1,596 disclosed vulnerabilities across 281 open-source projects, with 97 patched and 88 assigned a Common Vulnerabilities and Exposures record or GitHub Security Advisory.

wolfSSL Turns an Abstract Risk Into a Browser Problem

The wolfSSL case explains why this story is bigger than one AI product launch. wolfSSL is a cryptography library used in embedded systems, internet infrastructure and connected devices. When a certificate verification flaw lands there, the failure mode can look ordinary to a user: a site appears legitimate while trust checks have been weakened underneath.

The National Vulnerability Database entry for CVE-2026-5194 in wolfSSL describes missing hash, digest size and object identifier checks that could reduce the security of ECDSA certificate-based authentication. Common Vulnerabilities and Exposures (CVE, the public identifier system for disclosed security flaws) records can sound dry. This one sits close to the trust layer people rely on when they log in to a bank, email account or enterprise portal.

wolfSSL’s own security vulnerabilities page lists the issue as critical, credits Nicholas Carlini from Anthropic for the report, and says it was fixed in version 5.9.1 with a one-day time to fix. That quick repair matters. It also shows the unevenness of the new era: a mature vendor can move fast, while smaller projects may depend on volunteers who are already buried under low-quality AI generated reports.

Partners Are Building a Triage Machine

Project Glasswing is turning into a test of process design. The bug finder can generate candidate findings. Humans and specialist firms still have to reproduce them, judge severity, notify maintainers and help fixes land. Anthropic says six outside security research firms are helping with that triage: Ada Logics, Anvil, Calif.io, Doyensec, Ophion Security and Trail of Bits.

Stage	Main Actor	What Changes	Open Risk
Restricted preview	Anthropic and Glasswing partners	High-end vulnerability discovery stays inside selected defensive teams	Attackers may gain similar capability from other models
Open-source scanning	Security firms and maintainers	Large batches of findings move into coordinated disclosure	Volunteer projects may not have enough time or people
Enterprise tooling	Security teams using public models	Companies can scan and patch their own code faster	Poorly scoped scans can flood queues with noise
Public release	Future model users	Mythos-class capability reaches a wider market	Safeguards must block exploit abuse at scale

Cloudflare, the web infrastructure company, reached a similar conclusion after testing the model on more than 50 repositories. In its Project Glasswing technical post, the company said the model was better at producing proofs of exploitability and chaining small primitives into more serious attacks, while warning that generic coding agents remain a poor fit for broad vulnerability coverage.

The Public Version Will Not Be the Same Tool

Readers waiting for a simple consumer launch may be disappointed. Anthropic has already said it does not plan to make the preview model generally available. The target is a safer Mythos-class model, tested with stronger filters and paired with tools that steer security work into approved uses.

The distinction matters because the current preview was built for a controlled research setting. Cloudflare said the version it tested did not include the extra safeguards used in generally available models, and that the model’s own refusals were inconsistent. Sometimes it pushed back on legitimate proof of concept work. Sometimes a slightly different prompt changed the answer.

For defenders, the practical split is simple:

Shorten patch cycles so confirmed security fixes do not sit for weeks after disclosure.
Use public AI tools for narrower repair work, especially when a company owns the affected codebase.
Keep logs, multifactor authentication and hardened defaults in place, since not every critical bug will be patched before attackers learn about it.

Anthropic is already nudging customers in that direction. Its update says Claude Security, a public beta for Claude Enterprise customers, helped users patch more than 2,100 vulnerabilities in three weeks through Claude Opus 4.7. The company argues that enterprise-owned code can be fixed faster than open-source packages because the same organization controls discovery, approval and deployment.

The Release Clock Runs Through Governments

The public release decision now has a national security audience. The UK AI Security Institute, a government research body within the Department for Science, Innovation and Technology, said in its evaluation of Claude Mythos Preview that the model succeeded on expert-level capture-the-flag tasks 73 percent of the time and completed a 32-step corporate network attack simulation from start to finish in 3 of 10 attempts.

Those tests were controlled and easier than defended real networks, which the institute states plainly. Still, they explain why governments are now part of the access plan. A model that can automate parts of vulnerability research is useful to defenders, intelligence agencies, cloud providers and software vendors. The same ability is dangerous when pointed at weak systems without permission.

So the next move is not a product page going live. It is a test of whether Anthropic and its partners can prove three things at once: safeguards that stop misuse, triage systems that do not drown maintainers, and patch pipelines that move faster than future attackers. If those tests hold, the public gets a powerful security tool. If they fail, the restricted preview may look less like a delay and more like the last quiet phase before automated bug hunting becomes routine.