Anthropic’s safety warnings may have just backfired — the government has pulled the plug on its most powerful AI
Anthropic isn't hiding its frustration. "We disagree that the finding of a narrow potential jailbreak should be cause for recalling a commercial model deployed to hundreds of millions of people," the company wrote in a blog post.
The U.S. government on Friday ordered Anthropic to immediately shut off access to two of its most powerful AI models — Claude Fable 5 and Claude Mythos 5 — citing national security concerns. Anthropic announced on X that it has complied, but it made clear it thinks the government got this one wrong .
The directive, which Anthropic said it received on Friday at 5:21 pm ET, forces the company to disable both models for all users worldwide — not just the foreign nationals the government’s export control order was nominally aimed at. Access to Anthropic’s other models isn’t affected.
Why does any of this matter? Mythos is Anthropic’s most capable AI model, one the company previewed in early April and has kept tightly restricted ever since because of what Anthropic described as its exceptional ability to find security vulnerabilities in software. According to Anthropic, Mythos identified flaws in every major operating system and web browser it tested, so rather than release it broadly, the company launched a controlled program called Project Glasswing, sharing it with roughly 50 vetted organizations, including Amazon, Apple, Google, Microsoft, and CrowdStrike, to use for defensive cybersecurity work.
Fable 5, released just three days ago, was Anthropic’s answer to the obvious commercial pressure: a version of Mythos fitted with guardrails that block responses in high-risk areas like cybersecurity and biology, making it safe enough for general release, the company argued. It was immediately the most capable AI model available to the public, according to benchmark tests from Vals AI, a company that tracks AI tech performance.
The government’s directive is framed as an export control action, restricting foreign national access to the models. But in a lengthy blog post , Anthropic says its understanding is that the underlying concern is a claimed jailbreak of Fable 5. So far, the company says, the government has provided only verbal evidence of a “potential narrow, non-universal jailbreak” — one that, as Anthropic describes it, amounts to prompting the model to read a specific codebase and identify software flaws. And by the way, adds the company, it’s a “level of capability” that’s already widely available in other publicly accessible models, including OpenAI’s GPT-5.5. It’s also used routinely by cybersecurity professionals for defensive purposes, says Anthropic.
Anthropic’s broader argument is that its strongest safeguards operate through independent classifier systems that function separately from the model itself, meaning that even if someone convinces Fable to keep talking past a refusal, the underlying protections against the most dangerous outputs remain in place.
Clearly, none of that was enough to stop the government from acting, and Anthropic isn’t hiding its frustration. “We disagree that the finding of a narrow potential jailbreak should be cause for recalling a commercial model deployed to hundreds of millions of people,” the company wrote. “If this standard was applied across the industry, we believe it would essentially halt all new model deployments for all frontier model providers.”
Anthropic is widely expected to pursue an IPO this year and has staked much of its public identity on being the safety-conscious alternative to its rivals. The irony isn’t lost on observers that the very caution Anthropic displayed in restricting Mythos — which it promoted as a model so dangerous it couldn’t be released publicly — has now apparently attracted exactly the kind of government scrutiny that could disrupt its business most.
OpenAI’s Sam Altman must be enjoying this, at least. In April, he told podcaster Ashlee Vance that Anthropic’s handling of Mythos amounted to “ fear-based marketing .” “It is clearly incredible marketing to say, ‘We have built a bomb. We were about to drop it on your head. We will sell you a bomb shelter for $100 million,’” Altman said. Altman, whose company is also widely expected to pursue an IPO as soon as possible, didn’t predict a government shutdown, but he identified something that has come back to bite Anthropic for now, which is that when you spend months telling the world your AI is uniquely dangerous, the world — the U.S. government included — tends to listen.
When you purchase through links in our articles, we may earn a small commission . This doesn’t affect our editorial independence.
Get an inside look at what it takes to scale and succeed from leaders at Mach Industries, Founders Fund, and Shinkei Systems. Through candid fireside chats and high-impact networking, you’ll walk away with valuable insights and new connections.
Jeff Bezos’s Prometheus raises $12B to build an ‘artificial general engineer’ for the physical world Marina Temkin
Jeff Bezos’s Prometheus raises $12B to build an ‘artificial general engineer’ for the physical world
Jeff Bezos’s Prometheus raises $12B to build an ‘artificial general engineer’ for the physical world
Cybersecurity researchers aren’t happy about the guardrails on Anthropic’s Fable Lorenzo Franceschi-Bicchierai
Cybersecurity researchers aren’t happy about the guardrails on Anthropic’s Fable
Cybersecurity researchers aren’t happy about the guardrails on Anthropic’s Fable
Google just fired a warning shot in the AI subscription price wars Lucas Ropek Connie Loizos
Google just fired a warning shot in the AI subscription price wars
Google just fired a warning shot in the AI subscription price wars
WWDC 2026: Everything announced on Siri AI, iOS 27, Apple Intelligence, and more Morgan Little Aisha Malik
WWDC 2026: Everything announced on Siri AI, iOS 27, Apple Intelligence, and more
WWDC 2026: Everything announced on Siri AI, iOS 27, Apple Intelligence, and more
Anthropic’s Claude Fable 5 is a version of Mythos the public can access today Rebecca Bellan
Anthropic’s Claude Fable 5 is a version of Mythos the public can access today
Anthropic’s Claude Fable 5 is a version of Mythos the public can access today
It’s not FAANG anymore. It’s MANGOS. Julie Bort
Microsoft’s open source tools were hacked to steal passwords of AI developers Zack Whittaker
Microsoft’s open source tools were hacked to steal passwords of AI developers
Microsoft’s open source tools were hacked to steal passwords of AI developers
Key takeaways
- The suspension of Anthropic's AI models highlights the need for a balance between innovation and security in the regulation of emerging technologies.
- Brazil can draw inspiration from Anthropic's situation to develop a regulatory framework that promotes innovation without compromising national security.
- Anthropic's response suggests that government concerns may be based on outdated perceptions of AI capabilities.
Editorial analysis
The U.S. government's decision to suspend access to Anthropic's Claude Fable 5 and Claude Mythos 5 models raises crucial questions about AI regulation, especially at a time when Brazil is striving to establish its own guidelines in technology. This measure reflects a growing concern over national security regarding AI models with advanced capabilities for identifying vulnerabilities, which could directly impact Brazilian companies that utilize or develop similar technologies. Brazil, aiming to foster an innovation-friendly environment, may feel pressured to adopt stricter regulations to ensure cybersecurity and data protection, potentially affecting the pace of AI solution development in the country.
Moreover, Anthropic's response suggests a disconnect between government concerns and the reality of AI usage in the market. The claim that the potential for "jailbreak" is a non-universal concern already present in widely used AI models, such as OpenAI's GPT-5.5, indicates that regulation may be based on outdated or exaggerated perceptions. For the Brazilian tech ecosystem, this could serve as a warning about the need for closer dialogue between developers, regulators, and the cybersecurity community to ensure that policies are evidence-based rather than assumption-driven.
What to watch for next is how Anthropic and other AI companies will respond to this situation. The possibility of an appeal or reassessment of the government decision could open the door for discussions on transparency in AI regulation. For Brazil, keeping an eye on these developments may provide valuable insights into how to build a regulatory framework that not only protects national security but also promotes innovation and competitiveness in the tech sector. Additionally, the situation highlights the importance of having a clear understanding of AI models' capabilities and limitations to avoid hasty decisions that could stifle technological progress.
Finally, Anthropic's situation can be seen as a reflection of the tensions between innovation and security. As AI technologies continue to evolve, it will be essential for both regulators and companies to find a balance that allows for technological advancement without compromising public safety. This is a lesson that Brazil can take into account when developing its own AI policies, ensuring that it does not miss the opportunity to become a leader in technology in Latin America.
What this coverage includes
- Clear source attribution and link to the original publication.
- Editorial framing about relevance, impact, and likely next developments.
- Review for readability, context, and duplication before publication.
Original source:
TechCrunch AIAbout this article
This article was curated and published by AIDaily as part of our editorial coverage of artificial intelligence developments. The content is based on the original source cited below, enriched with editorial context and analysis. Automated tools may assist with translation and initial structuring, but publication decisions, factual review, and contextual framing remain editorial responsibilities.
Learn more about our editorial process