Anthropic Launches Cyber Jailbreak Severity Framework for Fable 5 Safeguards
Ask AI about this cluster
Analyzing cluster data...
Referenced clusters:
Something went wrong. Please try again.
Cluster AI
Ask questions about this threat cluster with AI-powered analysis.
Get Researcher $29.99/moArticle Content
Anthropic has redeployed its AI model, Claude Fable 5, after a temporary suspension due to a jailbreak vulnerability. The US Commerce Department had enforced export controls after Amazon researchers discovered a method to bypass Fable 5's safeguards. In response, Anthropic introduced a new safety classifier that blocks over 99% of attempts to exploit this vulnerability. Alongside this, the company proposed a Cyber Jailbreak Severity (CJS) framework, developed with partners like Amazon, Microsoft, and Google, to standardize the assessment of jailbreak risks. The CJS scale ranges from CJS-0 (Informational) to CJS-4 (Critical) and evaluates jailbreaks based on capability gain, breadth of capability gain, ease of weaponization, and discoverability. The framework aims to facilitate consistent communication among AI developers, security teams, and regulators regarding jailbreak risks. A HackerOne bug-bounty program has also been launched to encourage researchers to report potential cyber jailbreaks in Fable 5.
Key Points: • Anthropic redeployed Fable 5 on July 1, 2026, after a 19-day suspension due to a jailbreak. • The new Cyber Jailbreak Severity framework categorizes jailbreak risks into five levels from CJS-0 to CJS-4. • A HackerOne program is available for researchers to report vulnerabilities in Fable 5.