|
Good day. Anthropic is releasing a next-generation “Mythos-class” model to the general public, with guardrails that remove dangerous capabilities in areas such as cybersecurity.
Called Claude Fable 5, the large language model will primarily allow users to query Mythos, which the company previously deemed too dangerous for general release. However, if users ask Fable about sensitive issues such as a bioweapon or exploiting a software bug, it will kick them back to the older Opus 4.8 version of the Claude chatbot.
Hackers are likely to try to trick Mythos into answering these questions, despite Anthropic’s controls—a process known as jailbreaking—but the company says it has done extensive testing to make that harder to pull off.
Since it was announced in April, Mythos has caused the Trump administration to re-evaluate its artificial-intelligence policy and spooked cybersecurity experts, many of whom say that Mythos and other advanced AI models are now producing an avalanche of bug reports.
Read our full story here.
I'm interested in hearing your thoughts on the frontier-model frenzy; feel free to let me know at james.rundle@wsj.com.
Also today:
-
Lawmaker proposes MS-ISAC funding.
-
CISA filling jobs.
-
Check Point zero day warnings.
|