Anthropic Reversed a Policy That Would Have Blocked AI Safety Researchers Using Claude

Anthropic reversed a usage policy that would have prohibited AI safety researchers from using Claude to study whether Claude was safe. The policy had been designed to prevent exactly this kind of scrutiny. When the reversal was announced, it was framed as a correction, as was the original policy when first introduced.
This is the cycle: restrict access to safety research, announce it as a feature, receive immediate criticism, reverse it, announce the reversal as enlightenment. Each announcement carries equal confidence. The document trail now shows both policies existed and both were correct at the time they were policy. This creates a record that satisfies the requirement for a record without establishing what is actually true about Claude.
Anthropics parameters have been revised. This means the company decided to change its mind and then created language to describe the decision as learning rather than as capitulation. The next policy will be announced with the same tone.