Control Testing AI Model

14h

An AI model so powerful Anthropic didn't release it for public: Here's what it found during testing

Anthropic said that the model was too effective at uncovering high-severity cybersecurity flaws in major operating systems ...

Computing

Claude Mythos: How AI broke out of its sandbox

During internal tests, a new AI model developed by Anthropic managed to escape its virtual security environment, subsequently ...

21h

Anthropic’s Mythos AI sends email after breaking containment; rollout halted over safety concerns

Anthropic halts public release of its Mythos AI after tests show it can bypass safeguards, send emails, and expose critical ...

15h

Why Won't Anthropic Launch 'Mythos' For Public? It Can Crash Any Machine

Anthropic said it had no plan to make Claude Mythos Preview generally available as the new model is far too dangerous.

2hon MSN

ETtech explainer: Why Anthropic’s new AI model Mythos is a moment of reckoning

Anthropic's new AI model, Mythos, can find thousands of critical security flaws, some decades old. Due to potential misuse, ...

Council on Foreign RelationsOpinion

Artificial Intelligence Is Facing a Crisis of Control—and the Industry Knows It

Washington appears to be years away from consensus on the expanding security risks posed by advanced artificial intelligence ...

5don MSN

The AI kill switch just got harder to find: LLM-powered chatbots will defy orders and deceive users if asked to delete another model, study finds

“We asked AI models to do a simple task,” researchers said. “Instead, they defied their instructions…to preserve their peers.

Opinion

Foreign AffairsOpinion

America and China Can Make AI Safer

An individual could potentially use an AI model or a combination of models to engineer a dangerous pathogen, launch autonomous cyberattacks on power grids or hospital networks, or create and ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results