Anthropic Probes Unauthorized Access to Claude Mythos Cyber Model
Krasa AI
2026-04-25
5 minute read
Anthropic Probes Unauthorized Access to Claude Mythos Cyber Model
Anthropic is investigating reports that unauthorized users gained access to Claude Mythos, the company's restricted cybersecurity-focused model, on the same day it was made public. The model was supposed to be available only to roughly 50 vetted partners under Project Glasswing. Bloomberg and TechCrunch reported the breach this week, and Anthropic confirmed it is reviewing access logs.
The incident is awkward for a model whose entire premise is helping defenders stay ahead of attackers. It also lands at a moment when policymakers are watching closely how frontier labs control access to their most capable models.
What Claude Mythos Actually Does
Anthropic announced Claude Mythos Preview in early April as a general-purpose language model with unusually strong cybersecurity capabilities. Internal testing showed Mythos can identify and exploit zero-day vulnerabilities in every major operating system and every major web browser when directed by an authorized user.
Anthropic positioned Mythos as a defensive tool: software vendors, infrastructure operators, and security teams use it to find vulnerabilities before attackers do. The company kept the model out of its public APIs and tightly controlled access through Project Glasswing.
Why this matters: most frontier models are trained to refuse cyber-offensive tasks. Mythos is the opposite — it is trained to help with them, but only inside a permission boundary. The whole bet of the program rests on whether that boundary actually holds.
Project Glasswing: The 50-Company Wall
Project Glasswing is Anthropic's framework for sharing Mythos with trusted users. The named partners include Amazon Web Services, Apple, Broadcom, Cisco, CrowdStrike, Google, JPMorgan Chase, Microsoft, and Nvidia. Anthropic has also extended access to about 40 additional organizations responsible for building or maintaining critical software infrastructure.
Glasswing partners run Mythos against their own products and share findings with the rest of the cohort. The idea is that letting major vendors hunt for zero-days at scale, with a model trained for the job, will close vulnerabilities faster than waiting for attackers to find them first.
In practice, the model is hosted by Anthropic with API access gated by per-partner credentials. There is no public download. There is no consumer Claude product through which Mythos can be invoked. That tight control is what makes the reported breach so significant.
What's Reportedly Happened
According to reports from Bloomberg, TechCrunch, and SiliconANGLE, an unauthorized group accessed Mythos through a credential or token issued to one of the partner organizations. The exact vector has not been disclosed. Anthropic has not confirmed whether the access was achieved by stealing legitimate credentials, exploiting an integration, or some other method.
What Anthropic has said publicly is that it identified the unauthorized access shortly after launch and is investigating. The company has not yet specified how long the access lasted, what the unauthorized users were able to do with the model, or whether any specific exploits were generated and exfiltrated.
The incident is being treated as a model-access security event rather than a traditional data breach — there is no evidence that Anthropic's training data or Glasswing partners' shared findings were exposed. The concern is what the unauthorized users could have produced using Mythos before being cut off.
Industry Impact
The reported breach is a stress test for Anthropic's responsible-scaling policy. The company has long argued that gated access plus monitoring is sufficient mitigation for models with cyber-offensive capability. If a Glasswing token can be pulled and reused by outsiders within hours of launch, that argument gets harder to make.
For other frontier labs, the timing is uncomfortable. OpenAI, Google, and Meta have all been working on their own restricted-access programs for high-capability models. Each will now face questions from regulators, customers, and security researchers about how their gating compares.
For Glasswing partners, the immediate work is auditing how Mythos credentials are stored and used inside their organizations. Several of the named partners are also Anthropic's largest customers and biggest cloud providers — meaning Anthropic has powerful incentives to handle the investigation transparently.
Why this matters: dual-use models are no longer hypothetical. Mythos is in the wild, partly under control. How Anthropic and its partners handle this incident will shape the rules every lab follows when shipping the next generation of capable models.
Expert Perspectives
Security researchers note that the difficulty of policing model access is fundamentally different from traditional software. A leaked credential gives the unauthorized user the full capability of the model — there is no software patch that fixes that. The only mitigations are credential rotation, faster anomaly detection, and capability limits inside the model itself.
Some commentators argue the incident vindicates calls for more transparency about exactly which capabilities Mythos has. Others worry the opposite — that publicizing the breach will draw more attention to the model and accelerate parallel efforts by less responsible actors.
What's Next
Expect three things in the coming weeks. First, Anthropic will publish a more detailed post-mortem explaining how access was obtained and what the unauthorized users were able to do. Second, Glasswing partners will tighten internal controls on Mythos usage — likely including hardware-bound credentials and stricter logging. Third, U.S. and U.K. AI Safety Institutes will likely request formal briefings, given that Mythos sits squarely inside their jurisdiction over models with cyber-offensive capability.
Outside Anthropic, watch for parallel announcements from other frontier labs about how they will gate similarly capable future models. The industry has been moving toward more restrictive access for cyber-tuned models all year. This incident will accelerate that trend.
Bottom Line
A model designed to help defenders was reportedly accessed by people who weren't supposed to have it, hours after launch. Anthropic's Project Glasswing was the most ambitious gated-access program any frontier lab has tried. If it can't hold, the entire industry will need a new playbook for shipping dual-use AI — and policymakers will write one for them if they don't move fast.
Sources
Anthropic
Official
"Mythos Preview is a new general-purpose language model. This model performs strongly across the board, but it is strikingly capable at computer security tasks."
TechCrunch
News
"An unauthorized group has reportedly gained access to Anthropic's exclusive cyber tool Mythos."
Bloomberg
News
Fortune
News
Don't fall behind
Expert AI Implementation →Related Articles
NVIDIA Cosmos 3: First Open Physical AI Omnimodel Cuts Training Cycles to Days
NVIDIA's Cosmos 3 launches at Computex 2026 — a fully open foundation model that unifies vision, world generation, and action for robots and autonomous systems.
min read
Anthropic Adds Services Track and Partner Hub to Claude Network
Anthropic launches a 3-tier Services Track and a public Partner Hub. 40,000 firms have applied; 10,000 consultants are certified.
min read
Apoha Exits Stealth With $36M to Build 'Liquid Brain' AI for Materials
UK startup Apoha emerges with $36M Series A and a wild new data type: how materials vibrate in liquid. The pitch is AI for materials discovery.
min read