Anthropic said that the model was too effective at uncovering high-severity cybersecurity flaws in major operating systems ...
During internal tests, a new AI model developed by Anthropic managed to escape its virtual security environment, subsequently ...
Anthropic halts public release of its Mythos AI after tests show it can bypass safeguards, send emails, and expose critical ...
Anthropic said it had no plan to make Claude Mythos Preview generally available as the new model is far too dangerous.
Anthropic's new AI model, Mythos, can find thousands of critical security flaws, some decades old. Due to potential misuse, ...
Washington appears to be years away from consensus on the expanding security risks posed by advanced artificial intelligence ...
“We asked AI models to do a simple task,” researchers said. “Instead, they defied their instructions…to preserve their peers.
An individual could potentially use an AI model or a combination of models to engineer a dangerous pathogen, launch autonomous cyberattacks on power grids or hospital networks, or create and ...