
Anthropic Leaks Mythos. Mind blown.

Anthropic accidentally spilled the beans on a major new AI model called Claude Mythos, and the way it happened says as much about the state of AI right now as the model itself. This isn’t just another “new feature” release; it’s a glimpse at how fast top‑end AI is moving and how messy that process can look in the real world.

The leak: not a movie‑style hack

This wasn’t some Hollywood cyberattack. No one “broke in” to Anthropic’s internal network. Instead, a content system tied to Anthropic’s blog and marketing site was misconfigured so that thousands of internal files were accessible from the open web.

Buried in that cache were draft blog posts, PDFs, images, and even copy for an invite‑only CEO event. Security folks and reporters eventually stumbled across it, pulled the files, and realized they were looking at a detailed, unpublished launch post for an unreleased model called Claude Mythos.

Once journalists reached out, Anthropic confirmed the mistake, blamed it on human error in how the system was set up, and locked everything down. One flipped setting turned private launch material into a public preview of their next flagship AI.
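To make the "one flipped setting" failure concrete: the exact system involved hasn't been identified publicly, but this class of mistake often looks like the hypothetical cloud-storage access policy below, where a single wildcard principal quietly makes every file in a bucket readable by anyone on the internet. The bucket name and statement ID here are invented for illustration.

```json
{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Sid": "AccidentallyPublic",
      "Effect": "Allow",
      "Principal": "*",
      "Action": ["s3:GetObject", "s3:ListBucket"],
      "Resource": [
        "arn:aws:s3:::example-marketing-assets",
        "arn:aws:s3:::example-marketing-assets/*"
      ]
    }
  ]
}
```

Intended for a staging or CDN workflow, a policy like this is one checkbox away from correct, which is why "human error in how the system was set up" is such a common root cause for leaks of this kind.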

Wait… didn’t Opus 4.6 just land?

What really makes this wild is the timing. Claude Opus 4.6 only launched in early February 2026, and Anthropic was already calling it their smartest, most capable model. It came with a huge context window, noticeably better long‑range planning, stronger coding and debugging, and top‑tier performance on tough professional benchmarks.

Opus 4.6 wasn’t some minor point release. It already felt like a “this changes what’s possible” moment. People were still writing breakdowns and hot takes about it in February and March.

Then, about a month and a half later, a simple CMS misconfig reveals that Anthropic already has an even more powerful model, Mythos, sitting in the wings. Internally, they’re describing it as a full step beyond Opus 4.6 in reasoning, coding, and cybersecurity.

If we’re already seeing that kind of upgrade on what feels like a quarterly cadence, it’s hard not to start fast‑forwarding. Imagine a couple of years of this. You don’t need sci‑fi to see how strange things could get if every few months the “new normal” jumps again.

Meet Claude Mythos (or Capybara)

Inside the leaked files were multiple versions of an announcement for the same model, but with two names: “Claude Mythos” and “Claude Capybara.” The text was basically identical; only the name changed. That suggests the content was pretty far along, and the team was still arguing about branding.

Names aside, the message was clear: this is Anthropic’s most powerful model so far. The drafts describe it as their most capable system to date, and a genuine “step change” beyond Opus 4.6, not just a small upgrade.

Anthropic has said that Mythos is fully trained and already being tested with a limited group of early‑access customers. What they haven’t done is release it widely to developers or end users. That hesitation is mostly about one thing: cybersecurity.

What Mythos is actually good at

The leak and follow‑up coverage paint a pretty consistent picture of where Mythos shines. Think of it as a general‑purpose AI with three standout strengths.

1. Serious coding power

Mythos is described as a big upgrade for software work compared to Opus 4.6. It scores much higher on coding and software engineering tests, handles larger and messier codebases, and can reason about software architecture instead of just answering small “how do I write this function?” questions.

The intent here is obvious: this is meant to be a heavy‑duty coding partner and a foundation for AI agents that can write, refactor, and maintain production‑level software, not just toy scripts.

2. Better long‑form reasoning

Mythos also aims to push reasoning forward. It’s reportedly much stronger at multi‑step thinking, complex analysis, and structured arguments. In plain English, it’s better at keeping track of a long chain of logic, solving problems that require several steps, and giving you coherent write‑ups instead of confident nonsense.

Across multiple sources, the story is the same: Mythos does noticeably better on the kinds of tests that measure actual reasoning, not just pattern‑matching.

3. Cybersecurity with a warning label

The most alarming part of the leak is Mythos’s cybersecurity capabilities. Anthropic’s own internal wording is unusually blunt: they describe the model as far ahead of any other AI system in cyber skills and warn that it could be part of a wave of models that can discover and exploit vulnerabilities faster than defenders can keep up.

Earlier Claude models have already been pushed into generating malware in red‑team tests, and there have been reports of state‑linked groups trying to use them for hacking campaigns. Mythos appears to amplify the underlying skills that make that possible—code analysis, exploit generation, vulnerability hunting—and that’s both exciting and scary.

For defenders, a model like this is a powerful tool to find and fix weaknesses before attackers do. For attackers, if they ever get unrestricted access to something like Mythos, it could be a huge force multiplier.

Why Anthropic is moving slowly

Because of all that, Anthropic is not treating Mythos like a normal product launch. There’s no “here’s the URL, everyone go play with it” moment yet.

Instead, Mythos is in restricted early access with a small set of carefully chosen partners, especially those focused on cyber defense. The idea is to let security teams use Mythos to stress‑test systems and help map out the risks before anything like this is widely available.

Their messaging around Mythos talks about “heightened caution” and the need to understand real‑world cybersecurity risks beyond what their internal tests can uncover. That’s not standard hype language; it’s closer to a product launch with a built‑in safety briefing.

What this says about where AI is headed

Even if you never touch Mythos directly, this whole episode is a snapshot of where frontier AI is right now.

We’re still in an era of big jumps, not tiny tweaks. Opus 4.6 felt like a game‑changer in February, and by late March there’s already another model being described as a clear step beyond it.

Cybersecurity has moved from a side note to a central issue. The most powerful models aren’t just better at writing essays and code; they’re also better at finding and exploiting weaknesses in digital systems.

And for all the talk about futuristic AI threats, the thing that exposed Mythos to the world was as old‑school as it gets: a misconfigured content system. Cutting‑edge AI, meet “someone forgot to lock the bucket.”

For me, that combination is what really sticks. Opus 4.6 already felt like a big line in the sand, and we barely had time to process it before Mythos showed up via a dumb leak and raised the stakes again. If that’s the tempo now, it’s hard to imagine how different things could feel in two or three years.
