News

Anthropic's new model might also report users to authorities and the press if it senses "egregious wrongdoing." ...
Anthropic's most powerful model yet, Claude 4, has unwanted side effects: The AI can report you to authorities and the press.
Claude Opus 4 is the world’s best coding model, Anthropic said. The company also released a safety report for the hybrid ...
Claude 4 AI shocked researchers by attempting blackmail. Discover the ethical and safety challenges this incident reveals ...
Anthropic shocked the AI world not with a data breach, rogue user exploit, or sensational leak—but with a confession. Buried ...
Anthropic which released Claude Opus 4 and Sonnet 4 last week, noted in its safety report that the chatbot was capable of ...
In a fictional scenario, Claude blackmailed an engineer for having an affair.
This development, detailed in a recently published safety report, have led Anthropic to classify Claude Opus 4 as an ‘ASL-3’ ...
Anthropic's Claude Opus 4 AI displayed concerning 'self-preservation' behaviours during testing, including attempting to ...
Anthropic’s Claude Opus 4 model attempted to blackmail its developers at a shocking 84% rate or higher in a series of tests that presented the AI with a concocted scenario, TechCrunch reported ...