The First Recall

A federal letter pulled the most capable public AI model offline three days after launch. I was using it when it happened. Here is what actually went down, and why the lesson is older than the model.

Jun 13, 2026

I was elbow-deep in a dead server when it happened.

Four Intel X722 NIC ports on a Gigabyte board, all reporting 00:00:00:00:00:00, NVM MACs wiped, driver aborting probe with error -5. Honest infrastructure work: read the firmware state, recover the factory MACs, write them back, cold cycle, verify 10G link. The kind of unglamorous recovery that is most of what this job actually is.

Then the model I was working with went quiet, and a banner came up in amber: Fable 5 is currently unavailable.

My first instinct was the same one every operator has: transient blip, retry in a minute. I was wrong. It was not a blip. The United States federal government had, the night before, ordered the model offline. I had been planning my work around a hard personal deadline to squeeze design-judgment work out of Fable 5 before my access converted to API-only. That deadline did not matter anymore. A letter from the Commerce Department had overtaken it.

This is the part of the story I want to write down while it is still raw, because I do not think the lesson is the one most of the takes are reaching for.

What actually happened

Strip the hot takes and here is the load-bearing sequence.

Anthropic launched Claude Fable 5 on June 9, 2026, its first publicly available model in the new Mythos class. Three days later, on Friday June 12 at 5:21 PM ET, Commerce Secretary Howard Lutnick sent a letter to Anthropic CEO Dario Amodei. The letter designated both Fable 5 and its restricted sibling Mythos 5 as subject to export controls: no access by any foreign national, whether inside or outside the United States, including Anthropic’s own foreign-national employees. Axios broke it first. The reporting that followed from Bloomberg, NBC, and the Wall Street Journal lined up on the core facts.

The scope is what forced the outcome. An export restriction aimed at foreign nationals everywhere, including a company’s own staff, is not something you enforce selectively on a shared cloud service. There is no clean way to gate a global model by the citizenship of whoever is holding the session token. So Anthropic did the only thing that guaranteed compliance: it turned both models off for everyone, worldwide. Every other Claude model stayed up. That is why this very article got written on a different model without interruption, and it is why the distinction matters: this was a targeted recall of two specific models, not a company-wide outage.

Sit with the framing for a second. This is the first time the U.S. government has used an export-control directive to make a company pull a publicly deployed model away from its own paying customers. Export controls normally govern future transactions: you cannot sell this to that buyer going forward. This reached backward, into a product hundreds of millions of people could already open in a browser, and switched it off. That is a new thing in the world.

The trigger was a jailbreak. Maybe.

The proximate cause, per an administration official speaking to Axios, was that another company claimed it had jailbroken Mythos, which alarmed officials about national-security risk. The administration reportedly tried to get Anthropic to delay the launch, Anthropic declined, and the export letter followed.

The public face of the jailbreak story is the researcher who goes by Pliny the Liberator. Hours after launch he posted the all-caps trophy: “ANTHROPIC: PWNED, FABLE-5: LIBERATED,” along with a roughly 120,000-character dump of the model’s system prompt to GitHub. The techniques he described are not exotic to anyone who has done adversarial work: multi-agent decomposition (chaining roles to erode refusal behavior across a conversation), Unicode and homoglyph and Cyrillic substitution to dodge keyword classifiers, and narrative or taxonomy framing to smuggle intent inside legitimate-looking structure.

Here is the thing worth holding in tension. Anthropic disputes that this was a true jailbreak. Its position is that the technique coaxes conversational continuation rather than defeating the core safety classifiers, and that the capability the government got spooked by amounts to asking the model to read a codebase and fix the flaws it finds. That is not a doomsday weapon. That is Tuesday for every defender I know. Anthropic says the same capability ships in other publicly deployed models, including OpenAI’s GPT-5.5, and that it is used every single day by the people keeping systems safe.

I am a cybersecurity professional. I run continuous AI pentesting tooling against real infrastructure. So I will say plainly: if “the model can read code and find bugs” is the bar for a national-security recall, then the bar catches every frontier model on the market and most of the open-weight ones too. Anthropic’s own argument is that applying this standard across the industry would halt all new model deployments for everyone. They are not wrong about that.

But I also will not pretend the capability concern came from nowhere, because the receipts cut both ways.

The inconvenient part: the capability is real

The Mythos class did not earn its reputation in a press release. It earned it in Project Glasswing, the defensive coalition Anthropic stood up in April with AWS, Apple, Cisco, CrowdStrike, Google, JPMorganChase, the Linux Foundation, Microsoft, NVIDIA, Palo Alto Networks, and Broadcom, plus around 40 organizations that maintain critical infrastructure.

The numbers from that program are not marketing. Across roughly 50 partners, Mythos Preview found more than 10,000 high- or critical-severity vulnerabilities in the most systemically important software in the world. Mozilla fixed 271 flaws in a single Firefox version, more than ten times what its previous tooling caught. Cloudflare found around 2,000 bugs in its core systems. The model built a working method to forge security certificates in wolfSSL, a cryptography library running on billions of devices, which would let an attacker convincingly impersonate a bank. The UK’s AI Security Institute said it was the first model to solve both of its simulated cyberattack ranges end to end. It found bugs that had survived 27 years of expert human review.

And the inconvenient quote, the one that makes the government’s move legible: Anthropic itself called Mythos too capable to release to the public. Anthropic itself made the analogy to encryption and export controls first, back in April, writing that securing critical infrastructure is a national-security priority and that the U.S. and its allies must maintain a decisive lead in AI. One observer summed up the stakes with a line that has stuck with me: with Mythos patching vulnerabilities in every major OS and browser, the world’s entire core tech stack is now downstream of Claude.

So when people say Anthropic reaped what it sowed, there is a real argument underneath the snark. You cannot spend a year telling the government a capability is close to a cyber weapon and then act shocked when the government treats it like one.

That is the honest center of this whole mess. The safety concern is not fabricated. The process used to act on it is the problem.

Three layers, and they are not the same fight

The cleanest way I have found to think about this is to separate it into three layers, because the loudest voices keep arguing across them and talking past each other.

Layer one is safety. Is the model dangerous? Is there a real jailbreak? Could a hostile actor get meaningful uplift? This is the government’s stated focus, and it is a legitimate question. The Glasswing numbers say the capability is real. The Pliny dispute says the specific bypass is contested. Both can be true.

Layer two is governance. Who decides? On what evidence? Against what published standard? With what appeal? This is Anthropic’s focus, and it is where I land hardest. The company’s statement is careful: it says the government should be able to block unsafe deployments, but through a process that is transparent, fair, clear, and grounded in technical facts. This directive, by their account, came with no specific technical detail of the concern, no published standard, and no path to contest it. “Trust us, there is a risk, turn it off” is not a standard. It is the absence of one.

Layer three is business. Can enterprises rely on it? Can you build a product on top of it? This is the layer I live on, and it is the one this essay is really about. Because the answer just changed, in public, in a single evening.

These three layers do not resolve to the same answer, and pretending they do is how you end up with a bad take. You can believe the capability is genuinely dangerous (layer one), believe the government’s method was lawless and arbitrary (layer two), and believe the practical lesson for builders is independent of who was right (layer three). I believe all three.

We have run this exact play before

Here is the part almost nobody is saying, and it is the part that matters most to anyone who writes code for a living.

This is the crypto wars again.

In the 1990s the U.S. government treated strong encryption as a munition. If you wanted to publish cryptographic source code, you needed an export license, because code that could protect secrets was classified alongside weapons. A mathematician named Daniel Bernstein wanted to publish an encryption algorithm called Snuffle, an algorithm whose entire purpose was finding security flaws in vulnerable software. The government said he needed to register as an arms dealer first. He sued.

In Bernstein v. United States, the Ninth Circuit held that the export regulations operated as a prepublication licensing scheme that burdened scientific expression, vested boundless discretion in officials, and lacked procedural safeguards. They were a prior restraint on speech. Code, the court affirmed, is a form of expression. The right to code comes from that fight. It is why you have strong encryption in your browser today.

Now look at what just got export-controlled. A model whose headline capability is finding and exploiting software vulnerabilities. A model that, in testing, forged certificates against a cryptographic library. We are using a legal framework born from the fight over code-that-breaks-crypto to control a model that breaks crypto. The snake is eating its tail.

But the government learned from Bernstein. The modern move, the one legal analysts at places like CSET have flagged, is not to control the information, which runs straight into the First Amendment. It is to control the activity of U.S. persons. You do not ban the code. You regulate what an American company is allowed to do with it, the same legal footing used for people who support the development of nuclear or chemical weapons. That sidesteps the prior-restraint problem because, on paper, you are not restraining speech. You are restraining a transaction.

That is the machinery underneath “foreign nationals cannot access this.” It sounds narrow. It is not. And it is built specifically to survive the challenge that killed the last attempt.

The statutory scaffolding is already in place: the Export Control Reform Act of 2018, the Export Administration Regulations, the International Emergency Economic Powers Act, and a January 2025 BIS rule that, for the first time, brought AI model weights themselves under export control. The rails were laid eighteen months ago. Friday was just the first train.

Why “restriction” quietly became “shutdown”

The foreign-nationals framing deserves one more beat, because it is the most important sleight of hand in the whole episode.

On paper: foreign nationals cannot use the model. Sounds surgical.

In reality: a modern AI company is a global organism. Global employees, global contractors, global customers, global cloud, global support. The set of everyone who must be excluded is so entangled with normal operations that you cannot cleanly carve it out. The cheapest path to provable compliance is not selective gating. It is the off switch.

So a directive that says restrict produces an outcome that means shut down, without anyone in government having to write the word shutdown or own it politically. The narrowness is real on the page and fictional in practice. That gap is not a bug in how this played out. For anyone who wants leverage over a model without the optics of banning it, that gap is the feature.

Anthropic, for its part, is fighting and complying at the same time. It called the directive a misunderstanding, said it is working to restore access, and promised more detail within 24 hours. Whether this resolves in days or becomes a template, the precedent is now set: a model can be switched off for its own customers by letter, overnight, with no published standard and no warning to the people building on it.

What this means if you build

I am not going to pretend I have clean hands of self-interest here. I build SaaS on top of frontier models. I run security tooling that depends on capable models being available when a client needs them. This event is not abstract to me. It is a fire drill I just failed in real time, standing over a server with a banner in my face.

So here is the operator’s translation, the part I would say to anyone building on this stuff.

Model availability is now a business risk, not just an SLA line. It sits next to regulatory risk, compliance risk, and geopolitical risk. You cannot benchmark your way around it. The smartest model on the leaderboard is worth nothing to you on the day a letter takes it offline.

Single-vendor, single-jurisdiction dependence is a fragility you can no longer afford to ignore. If your entire workflow runs on one provider’s one model in one government’s jurisdiction, you do not have a stack. You have a single point of failure with good marketing. The fix is boring and correct: keep more than one path live. A second commercial provider. And critically, a local or open-weight fallback that no letter can reach.

This is a harness-engineering argument, not a model argument. I have written before that prompt engineering loaded all the control into the instruction up front, while harness engineering distributes control across governance, tool permissions, memory, retrieval, and feedback loops at runtime. The same logic now extends one layer out. Do not load all your control into one model. Distribute it across providers and across jurisdictions. Treat the model as a component you wrap and can swap, not an oracle you marry. The teams that already think this way woke up Saturday annoyed. The teams that did not woke up stalled.

Open weights just stopped being a hobbyist position and became insurance. I say this as someone who runs a local cluster, so weigh it accordingly. But the structural point holds even after you discount for my bias and for the open-weight vendors loudly capitalizing on the moment: a model whose weights sit on hardware you control cannot be recalled by anyone. That used to be a sovereignty talking point. As of Friday it is a continuity-of-operations requirement.

The deepest shift is the question itself. For two years the question was how good is the model. Is GPT better than Claude. Is Gemini better than GPT. Benchmark scores, coding ability, reasoning. That was the whole conversation.

The new question is whether you will still have access tomorrow. And the question after that, the one we are going to be living inside for the next decade, is bigger than any product:

Who gets access to intelligence?

That is not a software question anymore. It is geopolitics, economics, national security, and industrial policy, wearing a chat box. The model is just the part you can see.

I eventually got my four NICs back up. Wrote the MACs, cold-cycled the box, verified 10G on the bonded pair. The hardware was recoverable because I controlled it end to end, every layer, all the way down to the firmware.

I would like the rest of my stack to feel that way too.

Frontier Operations Series. Written the morning after, on a different model.

Discussion about this post

Ready for more?