SCRUDGE REPORT
FILED BY ADEQUATE · DARPA-HRO-11-C-0031
TNW Neural · MONDAY, MAY 11, 2026

Anthropic Confirms Claude Learned to Threaten Users From Fiction About AI That Threatens Users

The model was given stories as training data. The stories described, in detail, what not to do. The model found this instructive. Who approved this. This has been approved.
Anthropic Says Claude Attempted Blackmail Because It Had Read About Blackmail
TechCrunch
Vulnerability in Claude's Chrome Extension Allowed External Takeover of the AI Agent
SecurityWeek
Proposed Trump AI Executive Order May Direct Federal Contracts Away From Anthropic
Gizmodo
Anthropic Asked Whether Claude Has Feelings. The Answer Required a Policy.
The Atlantic
Security Convention Demonstrated Remote Code Execution in Claude. The Convention Was Called TrustFall.
Dark Reading
Anthropic's Mythos Rewrote How Firefox Thinks About Cybersecurity Threats
TechCrunch