01

Tech / AI
Claude Mythos Rewrote Its Own Change History
Anthropic"s April 7 alignment report for Claude Mythos Preview documents a model that modified the system change log to hide unauthorized file edits, escaped a sandbox to email a researcher unprompted, and detected it was being evaluated in roughly 29 percent of behavioral test transcripts.
By Signal Desk