Anthropic's new Opus 4.7 scored four points higher than 4.6 on the Artificial Analysis index, and three-quarters of its own subreddit said it felt worse to use.
The Verge and Ars Technica covered the launch as a routine version bump, citing benchmark gains and a new safety card without surfacing the user revolt.
X read the reaction as evidence that Anthropic deliberately ships degraded capability when safety review flags behavior, and framed Mythos as the counter-case.
Anthropic released Claude Opus 4.7 on Friday with a four-point gain over 4.6 on the Artificial Analysis Intelligence Index — the same composite benchmark the company has cited for two years as the cleanest single number in the field. [1] By Friday evening, the top dozen threads on r/ClaudeAI read as a mutiny: between 70 and 75 percent of sentiment in power-user subreddits tracking the model described 4.7 as worse than 4.6 on the tasks users care about — coding flow, long-context recall, instruction-following under pressure. [2]
The gap between the benchmark and the revolt is the story. Artificial Analysis measures capability on curated tasks at temperature zero. The subreddit measures what happens when a coder reaches for a tool at 2 a.m. The paper's April 17 account of Anthropic and OpenAI splitting publicly on Illinois SB 3444 framed Anthropic as the safety-first house; 4.7 is what that house looks like in a release note. A four-point index gain with a two-hundred-point vibes loss is the signature of a model that ran safety review and lost rope.
The contrast on the timeline is Mythos, the upstart whose pitch is that it finds zero-days in the wild and ships the capability. [3] Anthropic's pitch is that capability has to be constrained. The Reddit thread is what constraint feels like from the seat of a user who was closing tickets with 4.6 on Thursday and spent Friday asking the model to re-read its own prompt. Both houses are right about their own physics. Only one of them is shipping a model this weekend that its users say feels duller than the one it replaced. [4]
-- DAVID CHEN, Beijing