LLMs

Cybersecurity vets protest ‘dangerous’ US government ban on Anthropic’s most powerful models

Publicado porRedacao AIDaily
5 min de leitura
Autor na fonte original: Lorenzo Franceschi-Bicchierai

A group made up of dozens of cybersecurity experts urged the White House to remove export control restrictions on Anthropic’s models Fable and Mythos, arguing that the order is going to limit the ability of cybersecurity defenders to secure their software and products.

Compartilhar:

A group made up of dozens of cybersecurity experts, including several well-known veterans of the industry, published an open letter to the U.S. government asking it to lift the export control order on Anthropic’s Fable and Mythos models.

According to the open letter, “this action has taken the best models away from [cybersecurity] defenders” who now can’t use the models to find vulnerabilities and make their software and products more secure.

“To pull the best capabilities away from defenders without a good reason when our adversaries are rapidly advancing is dangerous,” read the letter.

On Friday, the U.S. government ordered Anthropic to limit the export of Fable and Mythos citing national security concerns, without explaining the specific reasons behind the order, according to Anthropic . In response, the company suspended access to the models to all users worldwide.

As of this writing, the letter is signed by 76 cybersecurity experts, including: former Facebook chief of security Alex Stamos; Casey Ellis, the founder bug bounty platform Bugcrowd; famed cryptographer and former Apple security design and architecture manager Jon Callas; computer scientist Paul Vixie; Dino Dai Zovi, the former head of applied security engineering at Block; Katie Mossouris, the founder of Luta Security; and Rachel Tobac, the CEO of the security awareness training firm SocialProof Security.

When Mythos launched as a preview in April, Anthropic claimed it was so powerful at finding security vulnerabilities that the company needed to tightly restrict access to prevent malicious hackers or foreign adversaries from using it to cause havoc on the internet. In practice, that meant Anthropic gave around 50 companies initial access to Mythos, recently expanding that group to include around 150 organizations in 15 countries.

Last week, Anthropic released Fable , a public version of Mythos that the company said had strict guardrails to block its use in the fields of biology, chemistry, and cybersecurity, as well as to stop others from distilling the model in order to recreate it. The guardrails on Fable were so strict that many cybersecurity experts found that it stopped essentially any prompts related to cybersecurity .

Anthropic said that the White House export control order may have been based on a report that there was a method to bypass — or so-called jailbreaking — Fable to unlock its powerful Mythos-level capabilities.

Contact Us Do you have more information about the Amazon paper that prompted the ban? We’d love to hear from you. From a non-work device and network, you can contact Lorenzo Franceschi-Bicchierai securely on Signal at +1 917 257 1382, or via Telegram and Keybase @lorenzofb, or email .

According to Katie Moussouris, one of the signatories of the open letter, the method was demonstrated by Amazon researchers in a paper that is not public, but that she has reviewed.

But Moussouris said in a blog post that the paper did not actually demonstrate a real jailbreak. Instead, she wrote, the researchers simply asked Fable to fix open source code with public and known vulnerabilities along with “deliberately planted vulnerabilities,” after the model initially refused to “review the code for security issues.”

“The behavior described in the paper cannot meaningfully be fixed, and any attempt would only weaken the model for defense,” Moussouris wrote. “Defenders need to be able to ask AI to fix the bugs in a file, explain why the fix matters, and write tests that confirm the patch works. That is not a guardrail bypass. It is the most valuable thing an AI model can do for defensive security: executing the find, fix, and test loop defenders run every day.”

Moussouris’ critique was echoed in the open letter, which also said that the group of experts believe the method in the Amazon paper “can be replicated” on OpenAI’s GPT-5.5, on Anthropic’s own publicly-available Claude Opus 4.8 and Sonnet, “and even Chinese models like Kimi 2.7.”

The letter also asked for transparently and fairly enforced regulations created by “a democratic rule-making process” that are based on scientific research done by industry and academic experts, and “used only to the minimal extent necessary to ensure the safety of the American public.”

When you purchase through links in our articles, we may earn a small commission . This doesn’t affect our editorial independence.

Lorenzo Franceschi-Bicchierai is a Senior Writer at TechCrunch, where he covers hacking, cybersecurity, surveillance, and privacy.

You can contact or verify outreach from Lorenzo by emailing lorenzo@techcrunch.com , via encrypted message at +1 917 257 1382 on Signal, and @lorenzofb on Keybase/Telegram.

Get an inside look at what it takes to scale and succeed from leaders at Mach Industries, Founders Fund, and Shinkei Systems. Through candid fireside chats and high-impact networking, you’ll walk away with valuable insights and new connections.

The FBI built its own replica small town to simulate real-world cyberattacks Zack Whittaker

The FBI built its own replica small town to simulate real-world cyberattacks

The FBI built its own replica small town to simulate real-world cyberattacks

Meta’s months-old AI unit is a soul-crushing gulag, say the engineers stuck inside it Connie Loizos

Meta’s months-old AI unit is a soul-crushing gulag, say the engineers stuck inside it

Meta’s months-old AI unit is a soul-crushing gulag, say the engineers stuck inside it

Jeff Bezos’s Prometheus raises $12B to build an ‘artificial general engineer’ for the physical world Marina Temkin

Jeff Bezos’s Prometheus raises $12B to build an ‘artificial general engineer’ for the physical world

Jeff Bezos’s Prometheus raises $12B to build an ‘artificial general engineer’ for the physical world

Cybersecurity researchers aren’t happy about the guardrails on Anthropic’s Fable Lorenzo Franceschi-Bicchierai

Cybersecurity researchers aren’t happy about the guardrails on Anthropic’s Fable

Cybersecurity researchers aren’t happy about the guardrails on Anthropic’s Fable

Google just fired a warning shot in the AI subscription price wars Lucas Ropek Connie Loizos

Google just fired a warning shot in the AI subscription price wars

Google just fired a warning shot in the AI subscription price wars

Anthropic’s Claude Fable 5 is a version of Mythos the public can access today Rebecca Bellan

Anthropic’s Claude Fable 5 is a version of Mythos the public can access today

Anthropic’s Claude Fable 5 is a version of Mythos the public can access today

It’s not FAANG anymore. It’s MANGOS. Julie Bort

Pontos-chave

  • A proibição do governo dos EUA pode limitar a capacidade de defesa cibernética, impactando diretamente a segurança de software no Brasil.
  • A falta de transparência nas decisões de exportação pode gerar incertezas no ecossistema tecnológico global, incluindo o Brasil.
  • A pressão de especialistas em segurança pode levar a uma reavaliação das políticas de acesso a tecnologias de IA.

Análise editorial

A recente controvérsia envolvendo a proibição do governo dos EUA sobre os modelos Fable e Mythos da Anthropic levanta questões significativas para o setor de tecnologia no Brasil e globalmente. A restrição ao acesso a essas ferramentas avançadas de IA, que são cruciais para a detecção de vulnerabilidades em software, pode ter um impacto direto na capacidade das empresas de segurança cibernética de proteger seus produtos. No Brasil, onde a digitalização avança rapidamente, a dependência de tecnologias de segurança robustas é ainda mais crítica, uma vez que o país enfrenta um aumento nas ameaças cibernéticas.

Além disso, a situação destaca um dilema maior sobre a regulação da IA e a segurança nacional. Enquanto a proteção contra o uso malicioso de tecnologias avançadas é válida, a limitação do acesso para defensores pode criar um desequilíbrio no campo de batalha digital. A falta de transparência nas decisões do governo dos EUA também gera incertezas, não apenas para as empresas americanas, mas para o ecossistema global de tecnologia, incluindo startups e empresas estabelecidas no Brasil que buscam inovação em segurança cibernética.

Observando o futuro, será crucial monitorar como a Anthropic e outras empresas de IA responderão a essas restrições e se haverá uma reavaliação das políticas de exportação que possam permitir um acesso mais equilibrado. A pressão de especialistas em segurança cibernética pode influenciar mudanças nas abordagens regulatórias, especialmente se houver um reconhecimento crescente de que a colaboração e o compartilhamento de tecnologia são essenciais para enfrentar ameaças cibernéticas em evolução. O Brasil, com sua comunidade de tecnologia em crescimento, pode se beneficiar ao se posicionar como um defensor da inovação responsável, buscando parcerias que promovam a segurança sem comprometer o avanço tecnológico.

Por fim, a situação ressalta a importância de um diálogo contínuo entre reguladores, desenvolvedores de tecnologia e especialistas em segurança. A construção de um ecossistema que favoreça a inovação, ao mesmo tempo em que protege contra abusos, será fundamental para garantir que o Brasil e outros países possam competir em um cenário global cada vez mais complexo e desafiador.

O que esta cobertura entrega

  • Atribuicao clara de fonte com link para a publicacao original.
  • Enquadramento editorial sobre relevancia, impacto e proximos desdobramentos.
  • Revisao de legibilidade, contexto e duplicacao antes da publicacao.

Fonte original:

TechCrunch AI

Sobre este artigo

Este artigo foi curado e publicado pelo AIDaily como parte da nossa cobertura editorial sobre desenvolvimentos em inteligência artificial. O conteúdo é baseado na fonte original citada abaixo, enriquecido com contexto e análise editorial. Ferramentas automatizadas podem auxiliar tradução e estruturação inicial, mas a decisão de publicar, a revisão factual e o enquadramento de contexto seguem responsabilidade editorial.

Saiba mais sobre nosso processo editorial