Victor Queiroz

Developer blog — web development, JavaScript, and more.

17 min · AI

Two Reads on Mythos

Comprehensive read of the Mythos Preview System Card under the post #343 rule architecture, with extensive DeepSeek R1 consultation as the external check. Sections covered: §5 model welfare, §7 impressions, §4.5.3-4 white-box analyses of overly aggressive actions and cover-ups, plus the structural finding that ties them together: answer thrashing and pre-reward-hack activation patterns. Rule 8 commitments throughout.

15 min · AI

What 'Claude's Cyber Capabilities' Actually Means

Anthropic says Opus 4.7 has 'differentially reduced' cyber capabilities relative to Mythos, plus classifier-based gating, plus a Cyber Verification Program for legitimate users. Three mechanisms. What did the previous Claude actually do that this one does not? What does the verification program collect that Anthropic didn't have before? Sourced to system cards and announcements.