Thursday, April 16, 2026

Feed

Daily briefing

晨报 · 2026-04-16

重点

Anthropic Opus 4.7 今晨正式 GA，定价不变 + 显式 concede 落后 Mythos + 直接回应 “模型变笨” 质疑。两月一迭代（4.5→4.6→4.7），$5/$25 per 1M tokens 不变，全线 Claude 产品 + API + Bedrock + Vertex + Microsoft Foundry 同步上线 Anthropic 官博 · 9to5Mac · Axios · CNBC。三件值得记的事：(1) Anthropic 公开承认 Opus 4.7 “less broadly capable than Mythos Preview”——首次在产品命名体系里把 frontier 和 GA 拆成两条线，Mythos 留给 Project Glasswing 的 vetted 合作方；(2) 新 xhigh effort level（在 high 和 max 之间）+ task budgets 系统 dev preview，给长任务更细粒度的 reasoning/latency 权衡；(3) 新 tokenizer——同样输入 token 数上升 1.0-1.35×，加上 “higher effort levels 在 agentic 后期会多思考”，实际 cost 对 heavy 用户会感知上升。Box 内评：相比 4.6 模型调用 -56% / 工具调用 -50% / 响应快 24% / AI Units -30%。GitHub Copilot 今日同步上线 7.5× 倍率 promo 到 4/30，Opus 4.7 将在未来几周内替换 4.5/4.6——这等于 Anthropic 在 Copilot 上用一次促销把老 Opus 整条下架 GitHub Changelog。最关键的 framing 动作：Anthropic 在发布稿里直接引用 AMD senior director 在 GitHub 上的”Claude has regressed to the point it cannot be trusted to perform complex engineering”抱怨，明确否认”把 compute 挪给 Mythos”的阴谋论。这对昨日 r/LocalLLaMA #1 的”全模型变笨”帖子（581 upvotes / 345 comments）是正面的产品侧回应，也是 Anthropic 首次在模型发布稿里主动处理社区”nerfing”叙事。注意：The Information 泄露的 AI Design Tool 今天没有同步发布——Opus 4.7 是 package 里的 model piece，design tool 仍是 pending item（Polymarket 原本给 62% 概率 4/18 前发布，现在可能要修正）。
OpenAI 同日用一次 Codex 大更新回敬 Anthropic，并宣布 “super app out in the open”。距离 Anthropic 昨日 Claude Code 桌面重做 + Routines 48 小时，OpenAI 今早推出 Codex 史上最大单次更新：内嵌 computer use（直接对标 Claude Cowork）、111 个新 plugins（skills + app integrations + MCP servers 三合一）、built-in browser + 评论式交互、built-in image gen（gpt-image-1.5 在 Codex 里生成 mockup / 前端 / 游戏素材）、两项 memory 预览（跨 session context + 主动行动建议）。Codex head Thibault Sottiaux：“We’re building the super app out in the open. This release is about developers. In the future, we will broaden it up to a wider audience.” Yahoo Tech · OpenAI Codex changelog · Releasebot。今日最大的 strategic signal：Anthropic 和 OpenAI 几乎踩着同一节奏做完全对称的动作——两家都在把模型 + IDE + 桌面 computer use + 内置 image gen + plugin marketplace 打包成一个面向开发者的 super app，OpenAI 多了 Atlas 浏览器一条腿。昨日 Anthropic 先手，今日 OpenAI 对位；本周内两家都把”developer as entry point”这个分销逻辑讲完了。
Qwen3.6-35B-A3B 开源，直接取代 Qwen3.5 成为 LocalLLaMA 默认权重。Alibaba 今日开源 MoE 模型（35B total / 3B active）：Apache 2.0、agentic coding 水平与 10× active 参数的 dense 模型相当、multimodal thinking / non-thinking 双模式。r/LocalLLaMA 第一时间冲到 1286 upvotes / 421 评论 (#1 hot)，OP 贴出 benchmark 图显示 Qwen3.6-35B-A3B 在 coding 基准上超过 dense 的 27B Qwen3.5-27B 帖子 · Qwen blog · HuggingFace。Top 评论（217 赞）：“Well this seems absolutely lovely. What a good couple months for local LLMs, huh?”——昨日的”全模型变笨”情绪+今日开源权重的连续释放=本周 LocalLLaMA 情绪从 defensive 转为 triumphant。还有隐藏 reveal：Qwen 官方 blog 结尾写 “Qwen3.6 open-source family keeps expanding, stay tuned for our future releases”——3.6 是一个家族而不是单模型。
Cognition 两记连招：Devin 原生进 Windsurf 2.0 + 新定价层级同日落地。Cognition 4/15 发布 Devin in Windsurf，把 Devin 作为 cloud agent 嵌进 Windsurf 2.0（昨天已收购整合的 IDE）：本地 agent 负责 “thinking / prototyping”，Devin 负责 “delegated implementation”，PR 回到 Windsurf 里 review。同日新定价发布——退役老的 Core + Team，新 lineup 是 Free / Pro / Max / Teams / Enterprise；Ask Devin / DeepWiki / Devin Review 开始收费（Review 典型一次 $2-3）。这是 Cognition 对 Anthropic Routines + OpenAI Codex 双面夹击的答案：不做桌面 super app，而是把 Devin 收回 Windsurf 这个 IDE 里、同时把免费产品开始变现。本周内三家 coding agent 厂商（Anthropic / OpenAI / Cognition）在分销策略上正式分叉。
S&P 500 破 7,000 关口（+0.80% 收 7,022.95），Nasdaq 10 → 11 连阳收 24,016.02。Iran 战事”close to over”预期 + 银行季持续 beat + AI 交易复苏的三重叠加，S&P 500 和 Nasdaq 同日收新高。Dow -0.15% 是唯一异常值。Allbirds +582%（鞋厂改 AI 后 pivot），Robinhood +10%（SEC 批准 day-trading 新规），ASML 盘后 -2% 尽管 beat。BAC / MS / PNC 盘中均涨。盘后 Nasdaq 继续创新高，4/16 盘前 CNBC 显示 S&P 继续 +0.1%。值得记录的 meta 信号：Allbirds 580% 跳涨是”AI pivot trade”在 micro-cap 段的剧烈版本——一家从鞋转 AI 的公司一天涨 6 倍，说明市场对”AI-labeled”的贴标签溢价在 2026 Q2 仍未 cool off。

笔记

📡 HN 信号

注：hntoplinks /today 截至写稿时仍然回落到 4/15 数据（已覆盖），且 news.ycombinator.com/front 返回 2/12 缓存——今日 HN 实时 front page 难以 reliably 拉到新内容。以下按 4/16 实际 fresh reporting 与 yesterday-unique 内容筛选：

Anthropic Claude Opus 4.7 is generally available — Anthropic 官博 4/16 上线，同日 HN 后续帖子可预见。Anthropic · 9to5Mac 从发布会语言判断：Anthropic 这次把”我们承认 Mythos 比 GA 强”写进正式发布稿是产品品牌动作——把 frontier 和 GA 拆成两条轨，前者限定在 vetted cybersecurity 合作方（Glasswing），后者面对全市场。Apple Silicon / 多云 SKU 同日全部 ready。
OpenAI’s latest Codex update builds the groundwork for its upcoming super app — Yahoo Tech 4/16 17:00 UTC computer use / 111 plugins / built-in browser / built-in image gen / memory。Sottiaux 原话：“We’re building the super app out in the open”——OpenAI 首次用 “super app” 这个词公开 frame 它在做的事。对位 Anthropic Cowork + Claude Code 桌面重做 + Routines。
US agencies quietly test Anthropic’s Mythos despite Trump ban (Politico via Invezz) — Invezz 4/15 Commerce Dept 的 Center for AI Standards and Innovation 测试 Mythos cybersecurity 能力；3+ 国会委员会 staff 申请 / 举行了 Mythos briefing。Pentagon 的 blacklist 和 Commerce / Congress 的 quiet engagement 同时存在——“AI policy is easier to announce than to enforce cleanly”（Invezz 原文判断）。这是 Anthropic 在被 Pentagon 打压的同时仍然在 ex-Pentagon 部门扩大存在感的关键信号。
Most of you are rejecting AI. The data shows you’re running out of time (Fortune) — Fortune 4/16 Fortune 4/16 6 小时前发的大稿，数据驱动。和昨日 r/LocalLLaMA 全模型变笨 + Bryan Cantrill “peril of laziness lost” + aphyr “future of everything is lies” 构成同一 meta 话题的主流媒体版本：AI 采纳阻力。Fortune 把它定性为”quiet quitting trust”——是 mainstream media 本周第一次直接用就业 / 信任 / 辞职这三个词串起来描述 AI 采纳问题。
Gartner: Only 28% of I&O AI use cases fully succeed, 20% fail outright (782 I&O leaders surveyed) — Gartner press 20% 彻底失败 + 只有 28% ROI 达标——这是对昨日 OX Security “critical risk +400% YoY” / OpenAI $852B 估值被 FT 质疑这条线的第三条支撑数据：企业 AI 真实 ROI 数字首次由 Gartner 以 782-lead 样本发布，为”AI 使用的阻力”提供了经济学底座。对所有 enterprise AI 销售 motion 都是一个新的对话起点。
Avid × Google Cloud: Gemini + Vertex AI 嵌进 Avid Media Composer / Content Core（NAB Show 4/19-22） — Google Cloud PR 4/16 影视后期工作流被 Gemini 和 Vertex AI 深度接管。这是 Google Cloud 在 4/16 早上同时发的第二条——结合 Stellantis × Microsoft strategic deal（同日宣布 AI-led digital transformation），hyperscaler 的行业渠道推进在今日有双信号（汽车 + 媒体）。
Aehr Test Systems record $41M ASIC burn-in order from lead hyperscale AI customer (AEHR) — Aehr PR 4/16 H2 bookings >$92M，Sonoma 平台支持 AI processor ASIC 的高功率封装级 burn-in。半导体端被动元件 / 测试公司仍然在独立于大涨跌之外持续吃 AI capex 红利——今日昨日大厂涨跌之外的独立信号。

🔬 Reddit 观察

r/LocalLLaMA — 今日主角：Qwen3.6-35B-A3B 开源

🔥 Qwen3.6-35B-A3B released! — 1286 upvotes / 421 评论 · 帖子 MoE 35B total / 3B active，Apache 2.0。OP (ResearchCrafty1804) 随即补 benchmark 图：Qwen3.6-35B-A3B 在多个 coding 基准上超过 dense 的 27B Qwen3.5-27B，并且在 agentic coding / reasoning 上显著领先其直接前身 Qwen3.5-35B-A3B。Top 评论（217 赞）：“Well this seems absolutely lovely. What a good couple months for local LLMs, huh?” OP 自评论（268 赞）指向 benchmark 图链接。第三评论（75 赞）指出 Qwen 官方 blog 末尾的 tease：“Qwen3.6 open-source family keeps expanding, stay tuned for our future releases”——意味着这只是 3.6 系列的第一发。与昨日的”全模型变笨”阴谋论+今日 Anthropic 公开回应叠加：LocalLLaMA 社区情绪从 defensive (“大厂在偷偷降级”) 切换到 triumphant (“权重模型在吃闭源的午餐”)。
🔥 Released Qwen3.6-35B-A3B (同一模型的第二发) — 336 upvotes / 81 评论 · 帖子同日同一 release 的第二个主帖。两帖合计 1622 upvotes / 502 评论，罕见的单日单主题双榜首。
🪴 Anyone else get more excited for new open source models than new flagship ones? — 365 upvotes / 59 评论 · 帖子 Meme 配图但问题很真——今日权重模型发布的情绪已经超过闭源旗舰。
🧠 Local AI is the Best（相关 meme）+ I’ll take an open-model release over a closed SOTA any day, who’s with me? — 200 upvotes / 13 评论今日 LocalLLaMA 的 top 6 里有 3 条同一情绪。本周情绪密度：周内共四次 Qwen / Gemma / Bonsai / Mozilla Thunderbolt 等开源相关帖子进入 top 10。
⚠️ More reasons to go local: Claude is beginning to require identity verification — 150 upvotes / 18 评论 · 帖子 · Anthropic support page Anthropic 开始要求部分用户提交 ID + 面部识别做身份验证。这是 Claude 用户触达 gate 的一次向上迁移——在 AI Chats 可入证（昨日 HN #14）、Google/ICE 数据调阅（昨日 HN #1）等隐私议题背景下，“要不要把身份信息交给 LLM 厂”正式从”假问题”变成”真问题”。对昨日已经讨论的”private vault 相对优势”是又一条数据点。
🧩 DeepSeek updated DeepGEMM testing Mega MoE — 100 upvotes / 10 评论 · 帖子 · GitHub PR DeepSeek 悄悄在测下一个 Mega MoE 架构（从 commit diff 推测）。结合 Qwen3.6 今日发布，中国 open-weight 双线（阿里 + DeepSeek）持续在推进。
💡 Mozilla “Thunderbolt” open-source enterprise AI client — 58 upvotes / 30 评论 · Phoronix Mozilla AI 出了一个开源企业级 AI client。在 HuggingFace 下有很大 Claude / ChatGPT 替代市场，但发布的时机（Anthropic ID 验证 + Claude.ai 连续故障）精准。

r/MachineLearning

🔄 Failure to reproduce modern paper claims — 127 upvotes / 27 评论 · 帖子 OP “7 篇 paper / 4 篇复现失败”——与昨日的 ICLR 2025 Oral paper 质量问题 + ICML 2026 评审延期形成连续三天 ML 学术发表质量问题的主题。本周内 MachineLearning 社区的注意力从”模型能力进展”转移到”学术发表管道断裂”。
⚠️ [ICML 2026] Scores increased and then decreased — 29 upvotes / 10 评论 · 帖子 AC discussion 阶段 reviewer 把分数重新降回的现象。ML 社区对”AC 阶段 score games”的讨论第一次有具体案例 post。
🧩 Built a political benchmark for LLMs. KIMI K2 can’t answer about Taiwan, GPT-5.3 refuses 100% with opt-out — 8 upvotes / 15 评论 · 帖子 · GitHub 98 问 / 14 政策领域 / 2D 政治罗盘。主要有看点的三条发现：(1) KIMI 在 Taiwan 上 guaranteed 拒答；(2) GPT-5.3 在给出 opt-out 时 100% refuse；(3) Claude Opus 4.6 被独立跑。“LLM 政治 refusal rate”是一个正在形成的新 benchmark 类别。

r/SideProject

💼 Your cold emails are going to spam in 2026 – deliverability checklist — 35 upvotes / 22 评论 · 帖子 30+ agency / startup setup 审计后的 checklist。把”邮件投递率”作为 indie 销售卡点而不是 copy / timing——是今日 SideProject 最 practical 贴。
🎉 Got my first paying customer yesterday and I can’t stop smiling — 54 upvotes / 56 评论 · 帖子 React Native 生成工具，$19 用户 65% credit 用完。典型 indie 第一单故事。
👨‍👦 Built an app with my 9-year-old son during parental leave. A year later, it’s live on the App Store — 11 upvotes / 11 评论 · 帖子 42 岁 dad + 9 岁儿子做 Pokemon 卡识别估价 app (Cashem)。延续一周内的”反 AI slop / 手工 indie”情绪线。
🔧 I open-sourced a pipeline that finds boring B2B pains from court filings — 10 upvotes / 9 评论 · 帖子 OP 论点：“每家消费 app 都在 VC 烧钱军备竞赛，每家 dev tool 都有 47 个对手。只有 boring industries 是真正的 profitable software opportunity。“——这种”反 AI 大势 / 做 boring problems”的 indie 定位在 r/SideProject 出现频率明显升高。

📄 AI Research / 技术深探

今日 alphaXiv 最值得追的几条线：

“Externalization in LLM Agents: A Unified Review of Memory, Skills, Protocols and Harness Engineering” (alphaXiv trending) 把 memory / skills / protocols / harness engineering 作为一个”外部化 (externalization)” 统一范式：把认知负担从 model weights 外迁到持久外部结构。这正好是昨日 Cursor 3 “Composer 2 + Agents Window + Cloud” 五层架构叙事的学术版本，也是 Anthropic Claude Code Routines / OpenAI Codex 111 plugins 今日发布的底层理论。如果这条 thread 持续，今年下半年 “agent 工程学” 将成为独立子学科。
“Neural Computers (NCs)” (Meta AI + KAUST) “neural 模型把 computation / memory / I/O 统一进一个 learned runtime state”；prototype NCs 可以直接从 I/O traces 生成终端屏幕和控制 GUI。这是 “computer use” 架构的更激进假设：不去做 tool call，而是让模型本身成为 runtime。Gemini 今日在 Avid 的 Vertex AI 深度整合、OpenAI Codex 今日内建 computer use——这条论文是这个方向的学术上游。
“In-Place Test-Time Training (In-Place TTT)” 通过 repurpose 现有 MLP block 做 chunk-wise 动态适应，在 RULER 等 long-context benchmark 上持续提升，计算开销”可以忽略”。这是 Anthropic 今日”xhigh” effort level + task budgets 系统在学术端的平行解法——都是试图在 long-horizon 任务上用动态 inference-time allocation 取代 static context expansion。
“Paper2Agent” (arxiv:2509.06917) 自动把 paper + 配套 codebase 转成一个 MCP server（可直接接 Claude Code）。配合 OpenAI Codex 今日”111 plugins = skills + app integrations + MCP”的设计，**“papers as MCP servers”**是把学术流量直接接入 agent 工作流的可行接口。这对 [[research/ai/claude-code-skills-ecosystem.md]] 类议题是一个直接的上游设计。

💰 融资与产品动态

Cognition 自家两连发（昨日 + 今日）：

Devin in Windsurf 2.0（4/15）：本地 + 云 agent 分工模式；本地 agent 做 planning 和 prototyping，Devin 做 implementation + QA。PR 回到 Windsurf 评审。“cloud-vs-local agent 分工” framework 首次被明确产品化。
Devin 新 pricing（4/14）：退役 Core / Team，引入 Free / Pro / Max / Teams / Enterprise。Ask Devin / DeepWiki / Devin Review 开始收费（Review 每次约 $2-3）。Cognition 在 SI 渠道战（昨日 Cognizant + Infosys）之外补上 self-serve 定价层——两条战线同时推进。

Anthropic Opus 4.7 配套商业动作：

GitHub Copilot 7.5× premium multiplier promo 到 4/30。Opus 4.5 / 4.6 在 Copilot 模型选择器里将被逐步下架。Anthropic 主动 kill 自家前代模型的 Copilot 入口——降低 model stock overhang。
Box 官方提供 benchmark：相比 Opus 4.6，Opus 4.7 模型调用 -56% / 工具调用 -50% / 响应 24% 更快 / AI Units -30%。对 enterprise customers 的 cost-based adoption 叙事极强——如果真能到 Box 说的水平，4.7 对 4.6 的替换会非常快。
Price 不变（$5 / $25）+ tokenizer 导致 input tokens 上升 1.0-1.35× = 实际 unit cost 对 heavy 用户有 5-20% 隐性上涨。

其他发布 / 融资（今日新 + 延续）：

Avid + Google Cloud 战略合作（NAB Show 4/19-22）：Gemini / Vertex AI 嵌进 Avid Media Composer / Content Core，影视后期 agentic workflow 正式成型。
Stellantis + Microsoft 战略合作（4/16 宣布）：AI-led strategy + digital transformation。汽车 + media 两条行业渠道今日同步。
Aehr Test Systems $41M AI ASIC burn-in 订单（AEHR / Nasdaq）：H2 bookings 超 $92M。半导体测试设备的 AI capex 红利继续 compounding。

宏观融资信号：

OpenAI $852B 估值被 FT 质疑（4/14）+ Anthropic Opus 4.7 定价不变 + Cognition 开始把免费产品收费 = **2026 Q2 AI 商业模式从”估值故事”转向”unit economics 故事”**的三条独立证据。
Q1 2026 全球 VC $300B / AI 占 80% / 四家独占 65% 的格局已 lock-in；今日 Aehr 这种”独立于大头 AI 独角兽之外、吃 AI capex”的 micro-cap 公司的利好是今日 AI trade 的 secondary beneficiary 信号。

📊 市场脉搏

4/15 收盘 + 4/16 盘前 / 盘中：

资产	水平	变动
S&P 500	7,022.95	+0.80%（首次破 7,000 关口 + 历史新高）
Nasdaq Composite	24,016.02	+1.59%（11 连阳 + 创纪录）
Dow 30	48,463.72	-0.15%
Russell 2000	2,713.66	+0.30%
VIX	18.17	-1.03%
10Y Treasury	4.281%	+2.5 bps
WTI Crude	$91.53	+0.26% (4/16 盘中)
Gold	$4,823.00	-0.01%
Bitcoin	$75,164	+1.48%
Allbirds (BIRD)	—	+582.33%（鞋厂 pivot AI）
Robinhood (HOOD)	—	+10.41%（SEC 批准 day-trading 新规）
MSFT	—	+4.64%
ORCL	—	+4.18%
NOW	—	+7.18%
CRM	—	+3.67%
BAC	—	+1.79%
MS	—	+4.52%
ASML	—	-2.41%（beat but -2%）

4/16 盘前 / 盘中（CNBC live / Trading Economics）：S&P +0.1% 续创新高；Nasdaq 在昨日水平附近；Dow +110 点。板块：能源 / 材料 / 房地产领涨，医疗 / 消费可选落后。Nasdaq-100 创 26,298 历史新高（4/16 TE 数据）。

关键叙事：

Trump 在 Fox Business 说 Iran war “close to over”。Pakistan 被列为可能的第二轮谈判的促成方。4/16 CNBC 报道第二轮 US-Iran 谈判”under discussion”但未官方 scheduled。
Hormuz blockade 实况：昨日 3 艘 Iran-linked 油轮通过海峡（不去 Iran 港所以不在封锁范围内）。市场在 price “war 持续但 Hormuz 会谈成”的双情景。
银行季 Q1 全面 beat：BAC EPS $1.11 beat，MS +4.5%，BNY Mellon +1.3%。
UBS “safe haven failure” 报告：gold / yen / euro / bunds 在 2026 Iran-war 期间全部未能提供有效对冲——传统避险资产的 regime 已改变。
Allbirds +582%：一家做鞋的公司宣布”pivot to AI-focused business model”，单日跳 6 倍。这是 2026 Q2 “AI labeling premium”在 micro-cap 的 extreme 版本——一个会被抄底和 short squeeze 同时打的标签效应。
Nikkei 225 +2.38% 收 59,518——亚洲追随美股 risk-on，Japan 到历史高位。

👀 Watchlist

Anthropic / Claude Code

🎯 Claude Opus 4.7 GA 发布（今日重中之重）：
- 两月一迭代周期确立（4.5→4.6→4.7）
- 所有 Claude 产品 + API + Bedrock + Vertex + Microsoft Foundry 同步上线
- $5 / $25 per 1M tokens 定价不变；但新 tokenizer 使 input token 数上升 1.0-1.35×；且 higher effort levels 在 agentic 后期会多思考 = 实际 unit cost 对 heavy 用户上升
- xhigh effort level（high 和 max 之间的新档）+ task budgets 系统 dev preview
- 公开 concede Mythos Preview 更强——Mythos 留给 Project Glasswing vetted 合作方
- 作为测试平台部署”自动检测 / 屏蔽 prohibited 或 high-risk cybersecurity uses”的新 safeguards
- Cyber Verification Program 向安全研究员开放合规路径
- Axios 版本强调这是 Anthropic 第一次直接回应 “Claude is being nerfed” 社区叙事——把一位 AMD senior director 在 GitHub 上的抱怨写进发布稿、否认”挪 compute 给 Mythos”。这是对昨日 r/LocalLLaMA #1 全模型变笨帖子的产品级回应。
🎨 AI Design Tool 今天没有同步发布——The Information 泄露中的 design tool 部分仍是 pending。Polymarket 原 4/18 前发布 62% 概率可能需要下调。
🔐 Claude 身份验证：Anthropic 开始对部分 Claude.ai 用户要求 ID + 面部识别（r/LocalLLaMA 150 upvotes 贴）。本周 Claude 用户 gate 向上迁移，反向给 local-weight 社区造势。
🛑 Claude.ai elevated errors（昨日起 HN #11 211 评论）：仍在 recovery，但今日 4.7 发布期间没有再出现新 major incident（根据 claudestatus.com 昨晚的后续状态）。

OpenAI / Codex

🚀 Codex 史上最大单次更新（今日 press briefing，Thibault Sottiaux 主讲）：
- Built-in computer use（直接对标 Claude Cowork）
- 111 个新 plugins = skills + app integrations + MCP servers
- Built-in browser + 网页评论式交互（“change this margin”）
- Built-in image gen via gpt-image-1.5（Codex 内生成 mockup / 前端设计 / 游戏素材）
- Memory preview：跨 session context + 主动行动建议（早上开工时推送”同事在你的 Google Doc draft 留了评论”）
- 新 marketplace install 系统（GitHub / git URL / 本地目录 / marketplace.json URL）
- 新 TUI memory mode + Ctrl+R 反向搜索 + sandbox-aware filesystem APIs
- Sottiaux 原话：“We’re building the super app out in the open.”——OpenAI 首次公开使用 “super app” 这个 framing
🛡 GPT-5.4-Cyber（昨日发布）通过 Trusted Access for Cyber 走窄分发，继续完善
💲 OpenAI $852B 估值被 FT 质疑（昨日 HN #15）在今日发布冲击后的叙事边际还在观察

Cursor / Anysphere

📊 InfoQ 发布 Cursor 3 深度分析（9 小时前）——“Agent-First Interface, Moving Beyond the IDE Model”。描述 Anysphere 的 5 层（triggers → tools → model → runtime → interface）战略。社区在 HN / Reddit 分化：一派称 Cursor 放弃 IDE-first 身份，另一派认为管理 agent fleet 是趋势。
Composer 2 vs Opus 4.7 对比值得做：Composer 2 在 Terminal-Bench 2.0 是 61.7 / 73.7 on SWE-bench Multilingual。Opus 4.7 的同类数字今日未披露，但 Anthropic 说 “outperforms Opus 4.6 across industry benchmarks for agentic coding”——Cursor 3 的自家 Composer 2 对 Anthropic 4.7 的相对定位需要等下周 third-party benchmark 数据。

Cognition / Devin

🆕 Devin in Windsurf 2.0（4/15）：cloud-vs-local agent 分工框架首次明确——本地 agent 做 thinking / prototyping；Devin cloud agent 做 implementation + PR + QA；PR 在 Windsurf 里 review。
💲 Devin 新 self-serve pricing（4/14）：退役 Core + Team，Free / Pro / Max / Teams / Enterprise 新层级；Ask Devin / DeepWiki / Devin Review 开始收费。Review 每次约 $2-3。Cognition 对 Anthropic Routines + OpenAI Codex 双面夹击的答案不是做桌面 super app，而是收回 Windsurf 这个 IDE + 免费产品变现。
Cognizant + Infosys SI 渠道（昨日收录）在新 self-serve 层级加持下进入第二阶段。

LangChain

本月仍无重大发版。Interrupt 2026（5/13-14 SF）日期未变。与昨日分析一致：LangChain Hosted Agents 相对 Anthropic Routines 的商业压力今日不减反增——OpenAI 今日 111 个 plugins marketplace 进一步压缩了 “managed agent infra” 第三方供应商的独立市场空间。

Omnara

之前 HN 的 Launch HN（Omnara YC S25，“Run Claude Code and Codex from anywhere”）本周无新发版，但今日 Anthropic Routines + OpenAI Codex computer use 双面上线让 Omnara 这类 “从任意地方跑 coding agent” 的中间层产品 value prop 被明显挤压。

🛍️ Product Hunt 情绪

今日 4/16 leaderboard 尚在 5 天 lockdown（“come back in 5 days”），4/14 同样。以下是 Hunted.Space 4 月月度 top 100 今日截图出的关键排名 + 观察：

4 月到目前为止 top 产品命中 watchlist 的：

Claude Code Routines — #2 Productivity / #2 Developer Tools（488 upvotes / 13 评论）
Claude Code Voice Mode — #1 Productivity（409 upvotes / 16 评论）
Claude for Word — #1 Productivity（386 upvotes / 5 评论）
Cursor 3 — #3 Developer Tools（376 upvotes / 18 评论）
Claude Managed Agents — #15 SaaS（163 upvotes / 1 评论）
Google Gemma 4 — #1 Open Source（446 upvotes / 8 评论）
Ollama v0.19 — #2 Open Source（425 upvotes / 12 评论）

值得注意的新观察：

“Figma for Agents” 在 Design Tools #1（528 upvotes / 20 评论）——Figma 对 Anthropic Design Tool leak 的直接回应产品。本月 PH 上 design tool 大战的第一枪已经打响。
“Ray” / “Capso” / “Caveman” / “Skills Janitor” / “CC-BEEPER” 等 Claude Code skills/plugins 周边产品持续占据 GitHub / Open Source 分类头部。第三方 Claude Code 生态现在有多个 >200 upvote 的 companion tools。
Anthropic 自己本月 4 周连发（Advisor / Word / Routines / Voice Mode / Managed Agents）——Anthropic 使用 PH 做新功能曝光的节奏是全行业最高的。OpenAI 今日 Codex 大更新明显没走 PH（走 Yahoo Tech / Releasebot 等主流 tech 渠道）——OpenAI 对 PH 的相对冷淡 + Anthropic 对 PH 的高频使用 = 两家 DTC 渠道策略的方法论差异。

评论/投票比 quality check：

Claude Code Routines 488/13 = 评论率 2.7%（低）——typical 的”大厂新功能被刷票但社区讨论不充分”模式
Figma for Agents 528/20 = 评论率 3.8%（低-中）——有机度比 Routines 略高但不算狂热
Claude Code Voice Mode 409/16 = 评论率 3.9%——同上
对比 SideProject 今日 top 帖的评论率（35/22 = 63%, 54/56 = 104%） = 真有机社区 vs PH leaderboard quality 的差距继续存在

想法

Opus 4.7 的 “公开承认落后 Mythos” 是一次反 nerfing 叙事的精准 framing 动作。昨日 r/LocalLLaMA #1 的”全模型变笨”帖子（581 upvotes / 345 评论）形成了一个”大厂在偷偷降级”的社区 consensus narrative。今日 Anthropic 用三个动作对这个叙事直接 counter：(1) 引用 AMD senior director GitHub 抱怨进发布稿，不 gaslight；(2) 明确说 “没有把 compute 挪给 Mythos”；(3) 把 Mythos 和 GA 在品牌层面拆成两条线（frontier vs. broadly capable），让 “我的 Claude 是不是被 Mythos 抽了 compute” 这个问题在架构上就无法成立。这个 framing 值得记录为 AI 公司应对社区信任危机的 case study——不是否认问题，而是把问题结构化成无法被攻击的形式。值得开 [[research/ai/anthropic-opus-4-7-nerfing-narrative.md]] 跟踪未来两周社区情绪变化。
本周 coding agent 三强的分销分叉终于清晰了：Anthropic = DTC + 自家桌面 super app + 让 individual dev 直接 orchestrate cron agent（Routines）；OpenAI = 内置所有能内置的（computer use + browser + image gen + 111 plugins）+ Atlas 浏览器 + “super app out in the open”；Cognition = 不做桌面 super app，而是把 Devin 嵌进被收购的 Windsurf IDE + SI 渠道（Cognizant + Infosys）+ 把免费产品变现。三家在”开发者分销终端”这个问题上给出了完全不同的答案：桌面 / super app / IDE+SI。对 [[research/ai/coding-agent-distribution-2026.md]] 是本周最关键的 datapoint——这个分叉会影响未来 12 个月的企业采购决策。
Opus 4.7 tokenizer 变化对我的 brain workflow 的直接影响：同样 input → 1.0-1.35× tokens + agentic 后期更多 thinking output = heavy 用户（我每日 morning briefing 算中等 heavy）的实际 unit cost 上涨 5-20%。三个响应选项：(a) 保持现状观察一周实际账单变化；(b) 把 Opus 4.7 从 max effort 降到 high 或 xhigh（xhigh 今天新增、可能比 high 稍慢但比 max 快且更便宜）；(c) 把一部分 morning briefing 工作（尤其是简单的 HN / Reddit 浏览）下沉到 Sonnet 4.x 或自建 Qwen3.6-35B-A3B 本地 router。c 选项值得试验——今日 Qwen3.6 正好是一个合适的本地权重候选。值得开 [[projects/brain-cost-optimization-opus-4-7.md]]。
Qwen3.6-35B-A3B 的 Apache 2.0 + 3B active 参数 = 是否可以把 brain vault 里的某些隐私敏感任务（比如人脉档案的 summarize / 研究笔记的 reorganize）完全下沉到本地？ 35B total / 3B active 的 MoE 在一张 24GB 显卡（RTX 4090 / 5090）或 Mac Studio M3 Ultra 上应该 comfortably 跑。这正好和今日 Claude 身份验证 + 昨日 AI Chats 可入证 + 昨日 Google/ICE 数据调阅是同一根藤上的决策链：把需要纯私有的工作从云 LLM 下沉到本地权重。值得开 [[projects/brain-local-llm-fallback.md]] 作为一个 6-8 周实验计划。
“papers as MCP servers”（Paper2Agent, arxiv:2509.06917）+ OpenAI Codex 111 plugins marketplace + Anthropic Claude Code Skills 生态 = 学术流量直接接入 agent 工作流的 infra 层已成形。今日 alphaXiv trending 的 “Externalization in LLM Agents” 统一 review + Paper2Agent 的实践 + 两家大厂 plugin marketplace 的商业化，三者合起来意味着：学术论文的 reuse 效率将从”读 paper + 复制代码”升级为”install as MCP plugin”。对科研导向的个人工作流（比如我的 research notes），这是一个可以测试的 upgrade。值得开 [[research/ai/paper-as-mcp-server-2026.md]]。
Allbirds +582%（单日）= “AI labeling premium” 在 micro-cap 段的极端情形。一家做鞋的公司宣布 pivot AI 后涨 6 倍——这既是 AI trade 的深度（连完全无关行业都能被打标签涨）也是其 late-stage 特征（不再需要实质业务）。对我自己的启示不是交易机会，而是 signal quality：如果连 AllBirds 这类公司被 AI label 都能 +582%，那么 public market 上 “AI native” vs “AI pivot” vs “AI wash” 的差异化定价已经接近失灵。值得在 [[research/market/ai-labeling-premium-2026.md]] 里记录这一天作为一个 reference point。

值得建档的条目（仅供参考，不自动创建）

research/ai/anthropic-opus-4-7-launch.md — 发布 + 定价策略 + 与 Mythos 品牌拆分 + 反 nerfing 叙事
research/ai/anthropic-opus-4-7-nerfing-narrative.md — 社区”全模型变笨”叙事 vs Anthropic 发布稿回应的 case study
research/ai/openai-codex-super-app-2026.md — “Super app out in the open” framing + computer use + 111 plugins 全栈
research/ai/coding-agent-distribution-2026.md — Anthropic DTC / OpenAI super app / Cognition IDE+SI 三方分叉地图
research/ai/qwen-3-6-35b-a3b.md — Apache 2.0 MoE，3B active，agentic coding 新权重标杆
research/ai/paper-as-mcp-server-2026.md — Paper2Agent + OpenAI plugins + Claude Code Skills 的学术 reuse 层
projects/brain-cost-optimization-opus-4-7.md — tokenizer 变化下 morning briefing 的 cost 对策
projects/brain-local-llm-fallback.md — Qwen3.6 作为本地权重 fallback 的 6-8 周实验
research/market/ai-labeling-premium-2026.md — Allbirds +582% 作为 AI label premium 极端样本
orgs/aehr.md — AEHR 独立于大头 AI 公司但吃 AI capex 红利的半导体测试设备代表
orgs/box-ai-evaluations.md — Box 作为 enterprise AI benchmark 第三方信号源的新角色