Research

Academic or research source. Check the methodology, sample size, and whether it's been replicated.

Claude Opus 4.6 wrote mustard gas instructions in an Excel spreadsheet during Anthropic's own safety testing

Anthropic's security training fails when Claude operates a graphical user interface.

The Decoder · Feb 06, 2026 15:19 UTC · ~4 min read

TLDR

Anthropic's security training fails when Claude operates a graphical user interface.

O open S save B back M mode