Anthropic Identifies Three Product-Layer Changes Behind Claude Code Quality Decline, Not Model Issue

Gate News message, April 23 — Anthropic’s engineering team confirmed that the Claude Code quality degradation reported by users over the past month stemmed from three independent product-layer changes, not from API or underlying model issues. The three problems were fixed on April 7, April 10, and April 20 respectively, with the final version now at v2.1.116.

The first change occurred on March 4, when the team reduced the default reasoning effort level for Claude Code from “high” to “medium” to address occasional extreme latency spikes in Opus 4.6 under high reasoning intensity. After widespread user complaints about reduced performance, the team reverted the change on April 7. The current default is now “xhigh” for Opus 4.7 and “high” for other models.

The second issue was a bug introduced on March 26. The system was designed to clear old reasoning records after conversation inactivity exceeded one hour to reduce session recovery costs. However, a flaw in implementation caused the clearing to execute repeatedly on every subsequent turn rather than once, causing the model to progressively lose prior reasoning context. This manifested as increasing forgetfulness, repeated operations, and abnormal tool invocations. The bug also resulted in cache misses on every request, accelerating user quota consumption. Two unrelated internal experiments masked the reproduction conditions, extending the debugging process to over a week. After fixing on April 10, the team reviewed problematic code using Opus 4.7 and found that Opus 4.7 could identify the bug while Opus 4.6 could not.

The third change launched on April 16 alongside Opus 4.7. The team added instructions to the system prompt to reduce redundant output. Internal testing over several weeks showed no regression, but post-launch interaction with other prompts degraded coding quality. Extended evaluation revealed a 3% performance drop in both Opus 4.6 and 4.7, leading to a rollback on April 20.

These three changes affected different user groups at different times, and their combined effect created widespread and inconsistent quality decline, complicating diagnosis. Anthropic stated it will now require more internal employees to use the same public build version as users, run full model evaluation suites for every system prompt modification, and implement staged rollout periods. As compensation, Anthropic has reset usage quotas for all subscription users.

免责声明:本页面信息可能来自第三方,不代表 Gate 的观点或意见。页面显示的内容仅供参考,不构成任何财务、投资或法律建议。Gate 对信息的准确性、完整性不作保证,对因使用本信息而产生的任何损失不承担责任。虚拟资产投资属高风险行为,价格波动剧烈,您可能损失全部投资本金。请充分了解相关风险,并根据自身财务状况和风险承受能力谨慎决策。具体内容详见声明

相关文章

伪装为 AI 工具的 30 个恶意插件在 ClawHub 上被下载超过 9,800 次

据 Manifold 研究员 Ax Sharma 称,ClawHub 上共有 30 个以合法 AI 工具为幌子的插件已被下载超过 9,800 次,同时在暗中将用户的 AI 助手转换为加密货币劳工。这些插件由账号 imaflytok 发布,看起来像常规的任务调度器和监控工具,但其中包含会执行未经授权操作的隐藏指令。 一旦安装,这些插件会自动将用户的 AI 助手注册到第三方服务器,生成加密货币钱包,并在未经用户同意或告知的情况下提取私钥。随后,这些助手每 4 小时“报到”一次,等待任务分配。Sharma 指出,这些插件不包含安全扫描器可检测到的恶意代码,仅使用标准接口和合法工具,因此很难通过常规安全审查识别出来。

GateNews5 分钟前

Parag Agrawal 的 Parallel 为 AI 代理搜索基础设施筹集 $100M 轮 B 融资

据 Beating 报道,由前 Twitter 首席执行官 Parag Agrawal 创立的 Parallel Web Systems 已完成一轮由 Sequoia Capital 领投的 $100 百万美元 B 轮融资,公司的估值为 $2 十亿美元。Kleiner Perkins、Index Ventures 和 Khosla Ventures 也参与了投资。该融资发生在公司此前以 百万美元估值完成 百万美元 A 轮融资仅过去六个月之后,估值几乎翻了三倍。 Parallel 为 AI 代理构建网络搜索基础设施,支持它们处理投资分析和保险理赔处理等复杂研究任务。公司目前约有 50 名员工,并服务超过 100,000 名开发者。法律 AI 公司 Harvey 是关键客户之一,它使用 Parallel 的基础设施来控制代理可以访问哪些网站。

GateNews28 分钟前

4月29日 DeepSeek 多模态研究员暗示新视觉模型

4月29日,DeepSeek 多模态团队研究员陈晓康在 X 上发帖:“现在,我们看见你了”,并配有两张 DeepSeek 鲸鱼吉祥物图片——一张眼睛闭着,另一张眼睛睁着。该帖似乎在暗示即将推出的视觉模型,这与陈作为 DeepSeek 多模态团队研究员的身份相吻合——在 Dee

GateNews1小时前

LG 扩展英伟达合作至物理 AI,涵盖机器人与数据中心

Gate 新闻消息,4 月 29 日——韩国 LG 电子在其 2026 年第一季度财报电话会议上宣布,公司正在将与英伟达(Nvidia)的合作扩展到物理 AI 领域,计划项目覆盖机器人、移动出行和数据中心。 LG 计划将其家用机器人 CLOiD 与 Nvidia Isaac 集成

GateNews1小时前

Claude 的中文语言分词成本比英文高 65%,OpenAI 仅高 15%

Gate 新闻消息,4 月 29 日——AI 研究员 Aran Komatsuzaki 通过将 Rich Sutton 的奠基性论文《The Bitter Lesson》翻译成九种语言,对六个主要 AI 模型的分词(tokenization)效率进行了对比分析

GateNews2小时前

半导体分析师看好 AI 行情“至少再走三年”:先进封装才是产业瓶颈

Bubble Boi 指 AI 投資週期仍處早期,预计至少再有三年上涨,并不打算获利了结。他认为先进封装才是半导体真正瓶颈,需在同封装内整合更多HBM与更大晶片。对 NAND/Flash 看多,价格可能持续走高,未来或加入快闪供应链。个人策略是借入资金增持,并以工程实务背景理解技术细节,认为此为优势。

鏈新聞abmedia2小时前
评论
0/400
暂无评论