Anthropic deploys Claude Sonnet 5, Fable and Mythos restored
Anthropic has launched Claude Sonnet 5 and restored access to its Fable and Mythos frontier models following a federal export control review. The decision marks the conclusion of an eighteen-day operational pause triggered by a US government export control directive on June 12, which forced the temporary suspension of Anthropic’s highest-capability systems. Government officials enacted […] The post Anthropic deploys Claude Sonnet 5, Fable and Mythos restored appeared first on AI News .
Anthropic has launched Claude Sonnet 5 and restored access to its Fable and Mythos frontier models following a federal export control review. The decision marks the conclusion of an eighteen-day operational pause triggered by a US government export control directive on June 12, which forced the temporary suspension of Anthropic’s highest-capability systems. Government officials enacted the restriction after researchers at Amazon documented a method to bypass the safety controls of Fable 5, causing the model to identify software vulnerabilities and supply exploitation code. Anthropic has since developed an updated automated classifier to patch the vulnerability, clearing the path for a full commercial rollout across its platform, cloud infrastructure, and partner networks. The temporary suspension of Fable 5 and Mythos 5 highlighted the regulatory pressures facing frontier intelligence systems. When the export control mandate took effect, the lack of real-time nationality verification systems required a total access blackout for all global users. Security evaluations conducted during the shutdown confirmed that the vulnerability identification behaviour was not unique to Fable 5. Older and less capable architectures from multiple providers, including Claude Opus 4.8, GPT-5.5, and Kimi K2.7, duplicated the exact results. To resolve the federal directive, engineers trained an automated safety classifier targeting the specific bypass mechanism reported by Amazon. This software layer functions with a wide safety margin, identifying and blocking ambiguous developer prompts that display a statistical probability of malicious intent. Internal validation data indicates the updated classifier prevents the reported exploitation technique in more than 99 percent of trials. When a developer issues a prompt that triggers this boundary, the platform automatically routes the workload to the older Opus 4.8 architecture to maintain continuity. The expanded safety margin introduces a distinct trade-off for engineering teams, as the automated system flags benign requests more frequently during routine application development and software debugging. Active deployments and agentic workflows While frontier models face strict state oversight, the immediate commercial focus targets the newly-deployed Claude Sonnet 5. Engineering teams are transitioning autonomous agents to this model to reduce operational expenditure while maintaining high execution capacity. Performance data validates that the system executes multi-step plans, operates terminal environments, and navigates web browsers without human intervention. Model performance and cost metrics: Model SWE-bench Pro Terminal-Bench 2.1 Base input cost* Base output cost* Sonnet 5 63.2% 80.4% $3.00 $15.00 Sonnet 4.6 58.1% 67.0% $3.00 $15.00 Opus 4.8 69.2% 82.7% $5.00 $25.00 *Cost per million tokens. Sonnet 5 carries introductory rates of $2.00 input / $10.00 output through August 31, 2026. Real-world deployments demonstrate how organisations are deploying this architecture within live software development pipelines. At Rakuten, technology teams deployed the architecture against dozens of the company’s most challenging production code pull requests. The system processed each submission independently, executing tests and verifying the results before presenting the completed code to human engineers for final structural approval. Software automation firm Zapier integrated the system into its core product workflows to execute multi-part administrative tasks. In a documented deployment, engineers tasked the model with updating Salesforce account tiers and subsequently generating and transmitting launch announcements to enterprise contacts. Prior model architectures frequently stalled midway through these multi-stage operations, whereas the current system executed the entire sequence end-to-end without human remediation. Development tool provider Zed utilised the system to automate complex debugging procedures. During internal trials, engineering teams directed the model to investigate an active software bug. Working without explicit prompts or step-by-step instructions, the system independently generated a reproducing test script, applied the necessary code fix, and stashed the modifications to verify that the bug reappeared in the absence of the patch. The entire diagnostic and remediation sequence occurred within a single processing pass. Software engineering platform Factory implemented the architecture to manage sustained coding tasks within complex codebase environments. Technical teams reported that the system maintained logical grounding and execution consistency across corporate code repositories, outperforming previous generation software layers by completing tasks that previously timed out or failed to resolve. Quantitative safety audits and exploitation limits Data from the formal system card indicates that the system achieves these autonomous capabilities without a corresponding inflation of security risks. Automated behavioural audits designed to test for deceptive tendencies and cooperation with unauthorised requests show that the model exhibits a lower overall rate of non-compliant behaviour compared to its direct predecessor, Sonnet 4.6. The architecture does not possess advanced offensive cybersecurity capabilities. Anthropic engineers omitted specialised cybersecurity datasets from the training protocol, limiting the system to routine, defensive technical tasks. In public security assessments conducted in partnership with Mozilla, researchers tested the model’s capacity to build functional exploits for known vulnerabilities within the Firefox 147 browser core. The model failed to generate a single working exploit across all evaluation windows, registering a zero percent success rate. It did achieve a 13.2 percent partial success rate, which represented a minor increase over Sonnet 4.6, though engineers attribute this variation to general gains in logical reasoning rather than domain-specific offensive training. Out of caution, commercial versions ship with default real-time safety classifiers equivalent to those used in the premier Opus 4.8 framework. The regulatory friction surrounding Fable 5 prompted a formal partnership between Anthropic, Amazon, Microsoft, and Google to establish an objective industry framework for assessing model security breaches. Currently, providers lack a shared metric to classify the severity of system bypasses, creating regulatory uncertainty when researchers identify new prompting vulnerabilities. The proposed governance framework scores security breakdowns across four specific technical criteria: Capability gain measures how far the exploit advances user capabilities beyond standard, widely available software utilities. Breadth of capability gain quantifies the number of distinct offensive operations the same exploit unlocks. Ease of weaponisation tracks the volume of human engineering effort and specialized prompting required to extract a harmful output. Discoverability determines the accessibility of the exploit technique within public research circles. Developers and cybersecurity professionals will use this matrix to coordinate defensive responses. For high-severity breaches, such as exploits demonstrating an immediate capacity to disrupt financial accounting systems or electrical transmission grids, providers will deploy automated mitigations instantly. This initiative operates alongside a newly established HackerOne vulnerability research program and a dedicated corporate monitoring team providing 24-hour oversight of threat intelligence channels. Deployment strategies will need to adapt to this closer relationship between model builders and state regulatory bodies. Anthropic has formalised agreements under recent executive mandates to grant federal researchers early access to frontier architectures prior to public commercial release. These joint evaluation windows allow external security analysts to audit model capabilities alongside internal engineering teams, ensuring regulatory alignment before code enters production environments. See also: HP accelerates enterprise workflows with OpenAI Frontier Want to learn more about AI and big data from industry leaders? Check out AI & Big Data Expo taking place in Amsterdam, California, and London. The comprehensive event is part of TechEx and is co-located with other leading technology events including the Cyber Security & Cloud Expo . Click here for more information. AI News is powered by TechForge Media . Explore other upcoming enterprise technology events and webinars here . The post Anthropic deploys Claude Sonnet 5, Fable and Mythos restored appeared first on AI News .
Key takeaways
- The launch of Claude Sonnet 5 reflects the regulatory complexity faced by AI companies, highlighting the need for a dialogue on regulation in Brazil.
- Anthropic's response to mitigate vulnerabilities can serve as a model for Brazilian startups, emphasizing the importance of integrating safety in development.
- U.S. export control directives may impact how Brazilian companies interact with cutting-edge technologies, underscoring the need for local security infrastructures.
Editorial analysis
The recent launch of Claude Sonnet 5 and the restoration of access to Fable and Mythos models by Anthropic are significant events that reflect the regulatory complexities faced by artificial intelligence companies. For the Brazilian tech sector, this situation highlights the need for a deeper dialogue about AI regulation, especially in a landscape where concerns about safety and ethics are on the rise. The ability to bypass safety controls, as demonstrated in the case of Fable 5, raises questions about the responsibility of companies to ensure their technologies are not misused.
Moreover, Anthropic's swift response in developing an automated classifier to mitigate the identified vulnerability indicates how AI companies must be proactive regarding safety. This could serve as a model for Brazilian startups and companies developing their own AI solutions, emphasizing the importance of integrating safety measures from the outset of development. The focus on a system that blocks ambiguous prompts, even if it means an increase in benign request flagging, reflects a commitment to safety that could be a competitive differentiator in the market.
The impact of U.S. export control directives should also be closely monitored, as they may influence how Brazilian companies interact with cutting-edge technologies. The need for real-time nationality verification, which resulted in a total access blackout, could serve as a wake-up call for Brazil to develop its own security and compliance infrastructures. As the AI market continues to evolve, the ability to navigate these regulations will be crucial for the competitiveness of Brazilian companies on the global stage.
Finally, the transition of autonomous agents to Claude Sonnet 5 may open new opportunities for automation and operational efficiency, but it also raises the question of how companies can balance innovation and safety. Engineering teams will need to be mindful of the implications of their technological choices, especially in an environment where trust and safety are increasingly valued by consumers and regulators.
What this coverage includes
- Clear source attribution and link to the original publication.
- Editorial framing about relevance, impact, and likely next developments.
- Review for readability, context, and duplication before publication.
Original source:
AI NewsAbout this article
This article was curated and published by AIDaily as part of our editorial coverage of artificial intelligence developments. The content is based on the original source cited below, enriched with editorial context and analysis. Automated tools may assist with translation and initial structuring, but publication decisions, factual review, and contextual framing remain editorial responsibilities.
Learn more about our editorial process