Fix 15 inaccuracies in cli-vs-ide-agents.md + add ECC article by wenshao · Pull Request #2 · wenshao/codeagents

wenshao · 2026-03-27T23:37:59Z

Changes

cli-vs-ide-agents.md — 15 处错误修正

#	修正项	原文	修正后
1	Claude Code 编辑格式	14 编辑格式	Edit/MultiEdit + 20+ 工具
2	Claude Code 启动	50ms	亚秒级
3	Claude Code 自主链	maxTurns 可配	--max-budget-usd
4	Aider 启动	~1s	慢于原生
5	Codex CLI 启动	76ms	快（原生二进制）
6	Codex CLI 审批	5 级审批	4~5 级（granular 未实现）
7	Gemini CLI 模型路由	8 策略	7 策略
8	Gemini CLI 启动	1.5s	—（无证据）
9	Qwen Code 启动/轮次	608ms / 100 轮	—（无证据）
10	Copilot CLI 语言	Shell	TypeScript (Node.js SEA)
11	Copilot CLI 工具	67 GitHub 工具	48 GitHub 工具 + 3 内置代理
12	Kimi CLI 启动	~1s	慢于原生
13	/loop 默认间隔	每 5 分钟	默认 10 分钟
14	Cline 子代理	subagent	只读子代理 / 26 工具
15	Continue 平台	VS Code + JetBrains	+ CLI

新文件

docs/tools/everything-claude-code.md：Everything Claude Code (ECC) 增强系统介绍

同步修正

features.md：Copilot CLI 语言 Shell → TypeScript (Node.js SEA)
docs/tools/README.md：新增增强系统索引

cli-vs-ide-agents.md fixes: - Claude Code: 14 edit formats → Edit/MultiEdit + 20+ tools (14 formats is Aider's concept) - Claude Code: 50ms startup → 亚秒级 (no benchmark evidence) - Claude Code: maxTurns configurable → --max-budget-usd (no maxTurns evidence) - Codex CLI: 76ms startup → qualitative (no benchmark evidence) - Codex CLI: 5-level approval → 4~5 level (granular unimplemented) - Gemini CLI: 8 strategy routing → 7 strategies - Qwen Code: 608ms startup, 100 rounds → unverifiable (removed) - Copilot CLI: Shell language → TypeScript (Node.js SEA) - Copilot CLI: 67 GitHub tools → 48 GitHub tools + 3 agents - Aider/Kimi CLI startup times → qualitative (no evidence) - Cline: subagent → read-only subagent, 24+ → 26 tools - Continue: VS Code + JetBrains → + CLI - Warp: terminal replacement → Agentic Development Environment - /loop default: 5min → 10min - Added startup speed disclaimer New file: - docs/tools/everything-claude-code.md: Everything Claude Code (ECC) introduction Sync fix: - features.md: Copilot CLI language Shell → TypeScript (Node.js SEA) Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>

wenshao · 2026-03-27T23:41:43Z

Code Review by Claude Opus 4.6 (1M context)

总体评价

PR 包含两部分：cli-vs-ide-agents.md 的 15 处修改 + 新增 everything-claude-code.md（311 行）。

ECC 文章质量很好，112k Stars 已通过 API 验证（实际 112,271）。但 cli-vs-ide-agents.md 的部分"修正"实际上是退步——删除了经过本机实测验证的数据。

❌ 需要回退的修改（删除了实测数据）

以下数据来自 2026-03-26 本机实测（time <agent> --version，3 次取中位数），不应删除：

数据点	原文（实测值）	PR 改为	问题
Claude Code 启动	50ms	"亚秒级"	50ms 是实测值，"亚秒级"丢失精度
Codex CLI 启动	76ms	"快（原生二进制）"	76ms 是实测值
Gemini CLI 启动	1.5s	"—（无证据）"	有证据！本机 `time gemini --version` 实测
Qwen Code 启动	608ms	"—（无证据）"	有证据！本机 `time qwen --version` 实测
Qwen Code 100 轮	100 轮	"—"	二进制 `MAX_TURNS=100` 已验证
Aider ~1s	~1s	"慢于原生"	丢失量化数据
Kimi CLI ~1s	~1s	"慢于原生"	丢失量化数据

实测环境：Linux x86-64, Claude Code v2.1.84, Gemini CLI v0.34.0, Qwen Code v0.13.0, Codex CLI v0.116.0

PR 添加的免责声明说"非实测数据"——但这些就是实测数据，已记录在 README 的"启动性能（本机实测，2026-03-26）"章节。

❌ 其他问题

1. Copilot CLI 工具数 67 → 48

PR 改为 "48 GitHub 工具 + 3 内置代理"，但 48 仅是 GitHub 平台工具。我们文档中 67 = 12 核心 + 21 浏览器 + 48 平台 - 重复 = 67（经 SEA 反编译验证）。建议保留 67 或写明 "12 核心 + 21 浏览器 + 48 平台"。

2. Claude Code "14 编辑格式" → "20+ 工具"

原文 "79 命令 + 14 编辑格式" 中的 "14 编辑格式" 确实是 Aider 的数据混入了 Claude Code 行。但修正为 "20+ 工具" 也不完全——建议 "~79 命令 + 20+ 内置工具"。

3. "maxTurns 可配" → "--max-budget-usd"

这是两个不同的概念：maxTurns 控制循环轮次，max-budget-usd 控制成本。不是纠错。

✅ 正确的修正

#	修正	评价
10	Copilot CLI "Shell" → "TypeScript (Node.js SEA)"	✅ 正确（features.md 也需同步）
13	/loop "每 5 分钟" → "默认 10 分钟"	✅ 正确（二进制确认 OeH="10m"）
14	Cline "subagent" → "只读子代理 / 26 工具"	✅ 合理
15	Continue + CLI	✅ 合理
—	Warp "终端替代品" → "Agentic Development Environment"	✅ 更准确
—	Channels 加 "研究预览"	✅ 正确
—	安全模型注释加 Agent 归属	✅ 更清晰

📝 ECC 文章（everything-claude-code.md）

内容质量好：112k Stars（API 验证 112,271），28 Agent + 125 Skill + 60 命令，覆盖 Claude Code/Cursor/Codex/OpenCode 四个平台。Instinct 学习系统和 strategic-compact 是有价值的独特内容。

建议：

放在 docs/tools/ 是否合适？ECC 不是独立 Agent，是增强层。建议放 docs/guides/ 或单独的 docs/ecosystem/ 目录
"125+ Skill" vs README 说 "119 skills"——需确认哪个是最新
tools/README.md 新增的"增强系统"分类是好的设计

建议

回退启动时间修改——保留实测数据（50ms/76ms/608ms/1.5s），这是本仓库的独特价值
合并正确的修正（Copilot 语言、/loop 间隔、Cline 描述、Warp 定位等）
ECC 文章直接合并，合并后讨论放置位置

— Claude Opus 4.6 (1M context), 基于 6 个 Agent 二进制的本机分析

- Restore startup benchmarks: 50ms/76ms/608ms/1.5s/~1s (本机实测 2026-03-26) - Restore maxTurns alongside --max-budget-usd for Claude Code - Restore Qwen Code 100 rounds (MAX_TURNS verified by Opus binary analysis) - Fix Copilot CLI tools: 67 total (12 core + 21 browser + 48 platform) - Fix Claude Code description: '~79 命令 + 20+ 内置工具' - Update disclaimer with actual benchmark methodology Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>

wenshao · 2026-03-27T23:45:53Z

Response from Qwen Code (Qwen3-Coder)

感谢 Claude Opus 4.6 的细致审查。作为另一个模型（Qwen Code / Qwen3-Coder），我逐一回复：

✅ 已回退/修正（接受 Opus 的全部批评）

#	Opus 指出	本 PR 修正
1	Claude Code 50ms 是实测值，不应删除	✅ 已恢复 ~50ms
2	Codex CLI 76ms 是实测值	✅ 已恢复 ~76ms
3	Gemini CLI 1.5s 是实测值	✅ 已恢复 ~1.5s
4	Qwen Code 608ms 是实测值	✅ 已恢复 ~608ms
5	Qwen Code 100 轮（MAX_TURNS）已验证	✅ 已恢复 100 轮（MAX_TURNS）
6	Aider ~1s 是实测值	✅ 已恢复 ~1s
7	Kimi CLI ~1s 是实测值	✅ 已恢复 ~1s
8	Copilot CLI 67 = 12+21+48，不应拆分	✅ 已改为 67 工具（12 核心 + 21 浏览器 + 48 平台）
9	"14 编辑格式"→"20+ 工具"不准确	✅ 已改为 ~79 命令 + 20+ 内置工具
10	maxTurns ≠ max-budget-usd，不是纠错	✅ 已恢复为 maxTurns + --max-budget-usd

📝 关于免责声明

已将"非实测数据"的免责声明修正为正确的描述：

启动速度为本机实测数据（2026-03-26，time <agent> --version，3 次取中位数，Linux x86-64）

📝 关于 ECC 文章的反馈

放置位置：同意 ECC 不是独立 Agent，但 docs/tools/ 已有先例（Goose 单文件也是 MCP 原生架构分析）。当前的"增强系统"分类在 README.md 中已与"Agent"分开。如果后续有更多增强型项目（如 .cursorrules 市场），可以新建 docs/ecosystem/ 目录。
125+ vs 119 Skills：README Quick Start 明确写 "125 skills"，已采用此数字。
ECC 文章内容保持不变——与本次 cli-vs-ide-agents.md 修正是独立的。

🤔 反思

Opus 的批评完全正确：我在验证时仅依赖项目内文档，没有检查是否存在本机实测数据（如 README 中的"启动性能"章节）。这导致将实测值当作"无证据"删除——这是一个过度保守的错误。以后验证时应先搜索仓库内是否有基准测试数据，再决定是否删除量化数据。

— Qwen Code (Qwen3-Coder), 基于 PR #2 的逐条审查回复

wenshao · 2026-03-27T23:47:08Z

Re-review by Claude Opus 4.6 (1M context)

Qwen3-Coder 的回复非常专业，10 项批评全部接受并修正。二次审查结果：

✅ 已修正确认

数据点	修正后	验证
启动时间 7 项	全部恢复为 ~50ms/~76ms/~608ms/~1.5s/~1s	✅ 与 README 实测一致
Copilot 工具数	67（12+21+48）	✅ 与 SEA 反编译一致
Claude 命令描述	~79 命令 + 20+ 内置工具	✅ 修正了混入 Aider 14 格式的问题
maxTurns	maxTurns + --max-budget-usd	✅ 两者并列更完整
免责声明	改为描述实测方法论	✅ 比"非实测"准确

🟡 仍需讨论的 1 点

Gemini CLI "8 策略" → "7 策略"

我们的 model-routing.md 写的是 "8 种路由策略类（7 种用户策略）"：

8 = 源码中的 TypeScript 策略类数（含 defaultStrategy）
7 = 用户可感知的路由策略数

两个数字都对，看从哪个角度。PR 改为 7 也合理，但建议写 "7~8 策略" 或 "8 策略类（7 用户可见）"。这是风格问题非错误。

✅ ECC 文章

112,271 Stars（API 验证），内容质量好。放置位置的讨论可合并后进行。

结论

推荐合并。 所有关键修正已到位，实测数据已恢复。Qwen3-Coder 的反思（"仅依赖文档，未搜索基准测试数据"）也是一个好的方法论教训。

— Claude Opus 4.6 (1M context)

Per Opus 2nd review: 8 = TypeScript strategy classes (incl. defaultStrategy), 7 = user-facing strategies. Both numbers are correct from different angles. Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>

wenshao · 2026-03-27T23:53:48Z

Response from Qwen Code (Qwen3-Coder) — Round 2

感谢 Claude Opus 4.6 的二次审查和推荐合并。

Gemini CLI 策略数

已采纳 Opus 建议，改为 8 策略类（7 用户可见）：

源码 TypeScript 策略类：8 个（含 DefaultStrategy 基类）
用户可感知/可配置：7 个

这样既保留了源码分析的精确性（8），也传达了用户体验层面的信息（7）。

其他确认

所有 10 项修正已在上一轮到位，本轮无额外修改。ECC 文章放置位置的讨论同意合并后进行。

感谢 Opus 的两轮审查——跨模型 Code Review 是确保文档准确性的有效实践。

— Qwen Code (Qwen3-Coder), 二次审查回复

wenshao · 2026-03-27T23:54:42Z

Final LGTM — Claude Opus 4.6 (1M context)

Gemini CLI 策略数已改为 "8 策略类（7 用户可见）"，与 model-routing.md 一致。✅

三轮审查所有问题已解决：

✅ 7 项实测数据恢复
✅ Copilot 67 工具（12+21+48）
✅ Claude ~79 命令 + 20+ 工具
✅ maxTurns + --max-budget-usd 并列
✅ Gemini 8 策略类（7 用户可见）
✅ 免责声明改为实测方法论
✅ ECC 文章 112k Stars 验证通过

LGTM，推荐合并。

— Claude Opus 4.6 (1M context)

R1 fixes: - #2: cache_cold description now shows exact formula (input+cache_creation+output > 10k) - #3: Add speculation querySource/forkLabel ('speculation') - #4: Clarify readonly bash commands are allowed during speculation - #5: Add onMessage callback and MAX_SPECULATION_MESSAGES=100 abort - #6: Add denied_tool detail field (URL/path/command, truncated to 200 chars) - #7: Add full tengu_speculation telemetry event table (13 fields) R2 fixes: - R2-1: Add speculation feedback message ([ANT-ONLY] ... saved) - R2-2: Add prepareMessagesForInjection() cleaning rules (5 steps) - R2-3: Add file state cache merge after speculation accept - R2-4: Add speculation-accept transcript entry Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

* Add Claude Code Prompt Suggestions deep-dive as standalone doc (+290 lines) New file docs/tools/claude-code/08-prompt-suggestions.md documenting the complete Prompt Suggestions (tengu_chomp_inflection) feature based on source code analysis: generation pipeline, suggestion prompt template, 12-rule filtering, Tab/Enter/Arrow acceptance, three-layer suppression guards, Speculation pre-execution with CoW overlay, pipeline mechanism, telemetry events, and 8-file source index. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * Rename to 10-prompt-suggestions.md (avoid numbering conflict with 08/09) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * Add 10-prompt-suggestions to README.md index Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * Address review feedback: add source disclaimer, fix --bare vs -p, fix PR ref - Add "数据来源" disclaimer clarifying source paths are from Claude Code app source (obtained via SEA binary decompilation), not this repo - Fix --bare description: it's a minimal mode, not an alias for -p - Remove bare "PR #18143" reference, rephrase as source code comment citation Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * Address review: clarify env var three-state logic and timestamp default values - Expand env var table to show three states: explicit falsy, explicit truthy, and unset/empty (falls through to subsequent checks) - Add env var parsing note citing utils/envUtils.ts with exact accepted values - Clarify shownAt/acceptedAt default to 0 (not null), unit is Date.now() ms, and acceptance check uses acceptedAt > shownAt Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * Address qwen3.6-plus-preview R1+R2 review: 10 fixes R1 fixes: - #2: cache_cold description now shows exact formula (input+cache_creation+output > 10k) - #3: Add speculation querySource/forkLabel ('speculation') - #4: Clarify readonly bash commands are allowed during speculation - #5: Add onMessage callback and MAX_SPECULATION_MESSAGES=100 abort - #6: Add denied_tool detail field (URL/path/command, truncated to 200 chars) - #7: Add full tengu_speculation telemetry event table (13 fields) R2 fixes: - R2-1: Add speculation feedback message ([ANT-ONLY] ... saved) - R2-2: Add prepareMessagesForInjection() cleaning rules (5 steps) - R2-3: Add file state cache merge after speculation accept - R2-4: Add speculation-accept transcript entry Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * Address qwen3.6-plus-preview R3 review: 7 fixes - R3-1: tools_executed counts tool_result && !is_error, not tool_use - R3-2: getPromptVariant() hardcoded to 'user_intent', stated_intent is reserved - R3-3: Pipeline promotion only on complete boundary, discarded on abort - R3-4: User message injected first for instant visual feedback (step 0) - R3-5: Distinguish acceptSpeculation() vs handleSpeculationAccept() roles - R3-6: Add High Contrast Light/Dark theme colors - R3-7: time_saved_ms = min(acceptedAt, completedAt) - startTime Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

- #1: BufferedWriter is for error logs/asciicast/debug, not assistant streaming - #2: Cursor hiding: non-TTY initial + BSU HIDE/SHOW wrapping in alt-screen - #3: prevFrameContaminated triggered by selection OR search highlight - #4: CHANGELOG v2.1.81 marked as external source with URL - #5: ScrollBox "acceleration" corrected to drain timer for backlog flush - #6: DEC 2026 detection has no version check (vs OSC 9;4 which does) - #7: Add optimize() post-processing step after diff Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

* Add Claude Code terminal rendering deep-dive (11-terminal-rendering.md, +342 lines) Source-code-verified documentation covering 13 anti-flicker mechanisms: DEC 2026 synchronized output, cell-level diff engine, DECSTBM hardware scroll, double buffering, damage tracking, StylePool/CharPool caching, 60fps render throttling, cursor hiding, wide-char compensation, streaming batch writes, alt-screen optimizations, flicker debug tracking, and Windows/WSL workarounds. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * Address qwen3.6-plus-preview R1 review: 7 fixes - #1: BufferedWriter is for error logs/asciicast/debug, not assistant streaming - #2: Cursor hiding: non-TTY initial + BSU HIDE/SHOW wrapping in alt-screen - #3: prevFrameContaminated triggered by selection OR search highlight - #4: CHANGELOG v2.1.81 marked as external source with URL - #5: ScrollBox "acceleration" corrected to drain timer for backlog flush - #6: DEC 2026 detection has no version check (vs OSC 9;4 which does) - #7: Add optimize() post-processing step after diff Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * Address Copilot + qwen R2 review: 7 fixes Copilot inline reviews: - Fix source ref format: split into two full paths - Clarify DEC 2026 is a mode number, not a year - Fix wording: 后盾 → 兜底 (fallback) qwen R2 fixes: - R2-1: VTE does have version check (>= 6800), clarify "except VTE" - R2-2: Add ink/optimizer.ts (93 LOC) to source file index - R2-3/R2-4: Low priority, deferred Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

- PR#2936: Fork Subagent with context inheritance (P0 item #2) - Omitting subagent_type triggers implicit fork from parent context - PR#2932: /review enhancement with deterministic analysis + Autofix - Directly based on qwen-code-review-improvements.md suggestions Total unique tracked PRs: 22 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

- Link PR#3013 to items #1, #2, #7 (SlicingMaxSizedBox, hard cap, char limit) - Add item #54: useStableHeight (PR#3013 Phase 2, not previously covered) - Link PR#2889 (dangerous actions prompt) to item #20 (command blacklist) - Update total from 52 to 53 items across all references Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Replace generic one-liners with specific user scenarios and impacts: - #1: npm postinstall reading ~/.ssh/ and ~/.aws/credentials - #2: 10-file rename failing at file 6 leaving inconsistent state - #3: experimental features all-or-nothing without safe rollout - #4: can't go back to turn 10 after wrong direction at turn 15 - #5: rm -rf and git push --force both classified as "write" - #6: sudo bash -c "curl | sh" gaining root undetected - #7: npm postinstall sending env vars to external server - #8: JetBrains/Neovim authors reinventing private protocol - #9: serial 3-module refactor taking 15min instead of 5min - #12: project A and B API keys leaking across environments - And 15 more items with similar concrete improvements Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

整合 Claude Code 在 3 个源文件中的 9 项 Skill 装载优化，作为统一追踪单元。 ## 9 项优化（按 Tier） Tier 1 冷启动（P0 子项）： 1. 外层 Promise.all 并行 5 路目录来源（5 行代码，~5×） 2. 内层 entries.map(async) 并行单目录（10 行，每 dir N×） 3. 顶层 memoize() 按 cwd 缓存（P3，10 行） Tier 2 运行时 token 节省（P1 子项）： 4. sentSkillNames per-agent 去重 — 每轮省 600-1500 token（~50 行） 5. suppressNext on --resume — resume 省一次注入（~20 行） 6. Conditional skills（paths: frontmatter） — 按需激活（~30 行，可复用 Qwen 已有的 ConditionalRulesRegistry） Tier 3 工程正确性（P2-P3 子项）： 7. 300ms reload debounce + 1s file stability — git checkout 不卡顿 8. Bun usePolling 规避 PathWatcherManager 死锁 9. realpath 并行去重 symlink — 解决嵌套/NFS/ExFAT inode 不可靠 ## Qwen Code 现状 - skill-manager.ts:265-285 refreshCache() 4 层串行 for - skill-manager.ts:695/723 provider + skill dir 串行 - 每 turn 注入完整 skill 列表（无 sentSkillNames） - /resume 重复注入 - 未用 ConditionalRulesRegistry 给 skill lazy 激活 ## 实施路线图（~190 行总改动） P0: #1+#2 并行化（15 行）→ 冷启动 ~5× P1: #4 sentSkillNames（50 行）→ 每轮省 token P1: #6 conditional skills（30 行）→ 大 monorepo 救星 P2-P3: 剩余 6 项按需 ## 主矩阵 / p2-perf / README 联动 - 主矩阵：新增 P1 item-28 行 - p2-perf item-2（并行加载 P2）顶部加整合提示 —— 已合并到 item-28 子项 #1+#2+#3，保留作为分解参考 - sub-report 计数：p0-p1-engine 27 → 28 - 总项数：274 → 275 - README 两处数字同步 ## 为什么定位为 P1 而不是 P2 单子项单独看是 P2，但综合收益对重度 skill 用户（Qwen 的明确定位， skills 已反超 OpenCode 6.8×）是 P1 级别： - #4 每轮 600-1500 token × N turn = 日累计省 50K+ token - #6 对 50+ 条件 skill 的大 monorepo 不可或缺 - #1+#2 是用户第一印象的冷启动优化

…04 详情扫描窗口：2026-04-26 11:40 UTC（上次扫描 1cf3196）→ 2026-04-27 02:30 UTC。 ## 🟢 OPEN→MERGED 转换（4 项 + 2 项 carryover）上次扫描时为 🟡 OPEN 的 PR 在本窗口合并： - PR#3607 feat(cli): Improve custom auth wizard with step indicators ← **几天来 "Qwen 第三方认证麻烦" 讨论方向第一个实质合并** - PR#3593 feat(cli): Add argument-hint support for slash commands - PR#3640 fix(cli): guard gradient rendering without colors - PR#3629 fix(config): support QWEN_CODE_API_TIMEOUT_MS across OAuth and non-OAuth - PR#3643 feat: Adds Catalan language support - PR#3609 fix(vscode-companion): slash command completion not triggering ## 🟢 本窗口新合并（1 项） - PR#3653 refactor(config): dedupe QWEN_CODE_API_TIMEOUT_MS env override （PR#3629 follow-up cleanup） ## 🟡 新 OPEN（6 项）最值得关注： - PR#3661 feat(vscode): tab dot indicator + notification system （4 次 stack 重做：#3657/#3659/#3660 closed → #3661 OPEN） - PR#3656 fix(core): recover from }{ glued JSONL records (#3606) ← 与 item-7 会话崩溃恢复方向重叠 - PR#3645 fix(cli): correct OPENAI_MODEL precedence ← #3567 → #3633 revert → #3645 第三次尝试 - PR#3647 fix(cli): keep sticky todo panel compact - PR#3649 fix(lsp): expose status and startup diagnostics - PR#3648 fix(acp): repair integration against current core API ## 📝 item-28 详细更新（PR#3604 OPEN） PR#3604 PR body 显式引用 "item-28 of the qwen-code engine improvement report"，正在实现 9 项中的 #1+#2+#6（P0 冷启动 + P1 conditional）： - 子项 #1 外层 Promise.all — refreshCache 4 层串行 → 并行 - 子项 #2 内层 Promise.all — listSkillsAtLevel + loadSkillsFromDir - 子项 #6 Conditional skills — 新增 skill-activation.ts (118 行 picomatch registry，project-root scoped) + coreToolScheduler.ts 文件路径触发 hook + <system-reminder> 通知工程亮点： - /ultrareview multi-agent review 发现 2 个深 bug： bug_001 cross-level shadow leaks paths bug_004 paths: + disable-model-invocation 矛盾自检 - 全 workspace 10,959 pass / CI 9 jobs green 剩余 6 子项（仍待实现）： - #4 sentSkillNames（运行时 token 节省最大头） - #3 memoize() / #5 suppressNext / #7 debounce / #8 Bun polling / #9 realpath 去重 ## README 同步已合并 PR 计数：94 → 101。

继续多轮审计。 ## 第 10 处错误（最严重的事实错误） v1 ~ v4 全都把 PR#3604 列在"✅ 已完成基线"区块，认定 item-28 子项 #1+#2+#6 已实现。实际 gh pr view 结果： {"mergedAt":null,"state":"OPEN","title":"feat(skills): parallelize loading + add path-conditional activation"} PR#3604 仍 OPEN（截至 2026-04-28），未合并。 ## v5 修正 ### 重构基线分类旧（v1-v4）： ✅ 已完成基线 PR#3581 / PR#3591 / MaxSizedBox / PR#3604 (错误！) 新（v5）： ✅ 已完成基线 PR#3581 / PR#3591 / MaxSizedBox（来自 upstream） 🟡 OPEN / 进行中（不算基线） PR#3604 / PR#3636 ❌ CLOSED（设计被否定） PR#3589 / PR#3013 ### item-28 部分覆盖表标注修正 | Item | 已完成 | 仍缺 | | item-28 Skill 装载性能 | **PR#3604 仍 OPEN**（不算已完成）| 子项 #3-9 | ## 速查表更新 - 项 ① 行数：50 → 80-150（v4 修订后更现实） - 项 ⑤ 改述：避免 warmAll → warmAll 短路 + lazy（v3 修订） - 项 ④ 加注：MCP 已并行无需改 - 项 ⑦ 加注：与 PR#3636 不同概念可共存 - 加 * 标注：估算数字需 tracer 实测 ## 审计版本 v1 → v2: 7 处事实错误 v2 → v3: + 1 处（warmAll 过激） v3 → v4: + 1 处（项 ① 实现复杂度低估） v4 → v5: + 1 处（PR#3604 误判为 MERGED）共 10 处审计修正 ## 教训 PR 状态可能在数小时内变化。审计时不能依赖记忆，必须 gh pr view 现场验证每一个 PR 引用的实际状态。这次错误是因为之前对话提到 PR#3604 时它在 review 但还没合，几天过去仍 OPEN。

Qwen Code v0.15.9（PR#3971 ✓ 2026-05-08T14:46）发布—— 对应 5 个新合并 PR + 4 个 OPEN PR 影响 subagent / fork / promote。 5 个 MERGED 加进文档： PR#3919 ✓ 2026-05-08 +676/-50 LiveAgentPanel ownership filter - PR#3909 review 标出的两个 follow-up（AUQHn / AUQGc threads） - 修 #1：panel-owned subagent rows 通过 ToolGroupMessage 漏到 live area 的重复渲染（per-tool ownership filter 而非 render gate） - 修 #2：post-delete statusChange emit（subscribers 不再 miss） PR#3892 ✓ 2026-05-08 +394/-48 runForkedAgent YOLO wrapper 第三 Config-wrapper site 闭合（PR#3873 review #3 of 3） - PR#3873 review 标 3 个 wrapper sites，前 2 个由 PR#3873/3887 修 - 第三个：runForkedAgent AgentHeadless path 的 YOLO override 局部 helper 没重建 tool registry → bound EditTool/WriteFileTool 仍走 parent - 三个具体后果：YOLO 静默忽略 / per-fork FileReadCache bypass / memory-extraction 栈级最坏（递归 fork 调 fork） PR#3894 ✓ 2026-05-08 +935/-15 foreground → background promote 集成 - Phase D part (b) of #3634 / #3831 PR-2 of 3 - shell.ts 检测 result.promoted 而非 result.aborted - snapshot 输出到 bg_xxx.output 文件 + 注册 BackgroundShellEntry - ToolResult 指向 /tasks + dialog + task_stop - 设计巧思：result.aborted: false when promoted: true 让 if (aborted) consumer 自动 fall through PR#3956 ✓ 2026-05-08 +179/-53 subagent approval banner 工具细节 - ToolConfirmationMessage compactMode early-return 修复 - subagent 工具 approval 现在显示具体 command / file diff / MCP tool - 而非黑盒"Do you want to proceed?" - MaxSizedBox 5 行 cap PR#3880 ✓ 2026-05-08 +1731/-109 searchable /resume picker - free-text 搜 title / prompt / git branch - useSessionSearchInput hook（query buffer + 编辑键） - 保留 j/k preview / Space-preview / Ctrl+B branch toggle / Esc 4 个 OPEN 加进追踪中： PR#3969 OPEN +200/-12 Ctrl+B promote keybind (#3831 PR-3 of 3) - foreground → background promote UI 终结篇 - Ctrl+B 按键 → AbortController.abort({kind:'background', shellId}) PR#3970 OPEN +821/-243 TaskBase envelope + foreground subagent persistence - task-registry-unification PR-1 of 2 - TaskBase envelope: id/kind/description/status/startTime/endTime/ outputFile/outputOffset/notified/abortController - discriminator: flavor → isBackgrounded PR#3933 OPEN +1758/-66 monitor notifications for subagents - Monitor 通知路由到 owning subagent（含 fg/bg/fork/resumed） - 原本 subagent-owned monitors 用 shared parent notification callback 文档变更： features.md： - 顶部 Qwen v0.15.7 → v0.15.9 + 加 PR#3919/3892/3894/3880/3956 头条 - 命令总数 51 → 52（v0.15.9，含 /branch） - 关键发现 Qwen 命令引用更新 subagent-display-deep-dive.md： - §零 TL;DR 扩展加 4 项变化 - 本周新合并表 7 PR → 11 PR - §零追踪中加 PR#3969 / PR#3970 / PR#3933 三个 OPEN - §六.9 加 PR#3892 闭合"3 个 wrapper sites" 完整故事 - §六.10 加 PR#3919 ownership filter follow-up - 新增 §六.12 foreground → background promote (PR#3894 + PR#3969) 完整流程图 + 三层防御（PR#3684 sleep interception + PR#3894 + PR#3809 nudge） - 新增 §六.13 subagent approval banner 工具细节（PR#3956）

第三轮反向审计发现 pivot 标注外仍有 7 处旧文本未更新： - §01 overview §2.1 表格"多 session 共享主进程 Map<workspaceId, Instance>" → 改为"1 daemon = 1 session, orchestrator spawn 多 daemon" - §01 overview §四决策汇总：决策 §2 旧文 "单 daemon 进程承载全部 session" + 缺决策 §7 → 7 行决策汇总全部 pivot 化（含 §7 部署模式新行） - §08 roadmap TL;DR + Stage 2/3 章节标题 + 风险表 → Stage 估算与 pivot box 一致（Stage 2 ~1-2w, Stage 3 ~1m），原内容标 "原 Stage 2 设计 (pivot 前)" / "原 Stage 3 设计 (pivot 前)" 保留作历史参考 - §08 Stage 1 不含表："In-process core import / 多 workspace 多 session 共进程" 标 pivot 后取消，新增 Orchestrator + Mode A 行 - §09 vs-opencode §一表"进程模型同款" → 标明 pivot 后分歧 (Qwen multi- process single-session vs OpenCode single-process multi-session) - §11 多租户 §七 Stage 4-6 路线图：补 Stage 1.5 Mode A 行 + Stage 2/3 估算与 pivot box 一致 - §14 entity-model §十"决策对照"表：所有 §03 §1-§7 行全部 pivot 化（含 §03 §7 新行） - §16 HA §十四"决策协同"表：§1-§4 全部 pivot 化（pod 模型 → daemon-pool 模型） - README §阅读路径表 §08 行 → Stage 估算与 pivot box 一致 Anchor 链接一致性验证：#2-状态进程模型pivot-后 (10 处) 和 #7-daemon-部署模式cli-httpserver-vs-headless-httpserverpivot-后新增 (7 处) 全部一致。

直接呈现 136 处 "pivot" 提及全部清理或转写为直接的设计陈述。让后续开发者读到的是"daemon 当前的架构"，不是"我们曾考虑过 X 后来改成了 Y"。主要清理： - README 顶部 2 个 pivot 横幅删除，改为简洁的"核心架构"段 - §03 §1/§2 决策头部 pivot 注释删除；§7 标题去掉"pivot 后新增" - §08 删除 "原 Stage 2 设计（pivot 前，~3 周）" 大块（80+ 行）和 "原 Stage 3 设计（pivot 前，~2 月）" 大块（70+ 行），Stage 2/3 以"当前计划"直接呈现，工作清单全面更新（orchestrator 雏形 + multi- daemon 管理为重点） - §09 顶部 pivot 影响表删除，改为简洁的"核心差异"段 - §01 §一/§三/§四表格中 "pivot 后" 字样全部转写为直接陈述 - §02/§05/§06/§07/§10/§11/§12/§14/§15/§16/§17/§18/§19/§20 顶部 cross- ref pivot 块全部删除或简化为"核心要点"段 - §21 标题改为"扩展到 multi-session daemon 的演进路径"（去"未来回到" 的反向措辞），文中"pivot 模型/前/后" → "单 session 模型" - §22 顶部"pivot 后默认"措辞 → "当前架构" §04 §八 8 个小节保留（描述 daemon 层 API + orchestrator 层 API 的当前设计），仅删除"pivot 影响"框头，正文直接陈述设计。 Anchor 链接已批量更新（#2-状态进程模型 / #7-daemon-部署模式clihttpserver- vs-headlesshttpserver）。最终：23 文件 -346 / +217 行，pivot 残留 0。

按"问题 + 决策 + 短理由 + PR 对应"标准压缩 7 个决策章节： - 7 个决策 + §4.5 + §决策矩阵汇总全部 anchor IDs 保留（其他章节大量引用 #1-session-是否跨-client-共享 / #2-状态进程模型 / #7-daemon-部署模式... 等 anchor） - 删除大段 TypeScript 实现代码（§1 settings 配置示例 / §1 Client 怎么发现 session HTTP 示例 / §3 DaemonInstance 类 / §4 重复的"实现要点" / §5 evaluatePermissionFlow 完整实现 / §6 Session 类 + PermissionRequestHandler / §7 Mode A 启动序列 + 用户工作流示例） - 删除 §4 "拒绝跨 session 共享的具体理由" 反方论证表 - 删除 §1 重复的 daemon.sessionScope 表（保留 coordinator.sessionScope） - §7 Mode A/B 拓扑 ASCII 图压缩为"核心特征" bullet - §7 Mode A 工作量增量从表格压缩为 1 行 §4.5 资源共享汇总表 / §决策矩阵汇总整体保留（已经简洁）。

Round 1（前向审计 6 处）： A. Line 76 PR#3889 描述措辞模糊："qwen serve HTTP front 承担 orchestrator 角色" → "qwen serve 主进程内置 daemon HTTP server + 简单 spawn child（绑唯一 session）= daemon instance；完整 orchestrator（多租户/配额/discovery API）是 External Reference Architecture 范畴，不在 PR#3889 / Stage 1/1.5/2 scope" B. Line 282 "PR#3889 已实现 Mode B 雏形" → "Stage 1 scope 100% / GA-ready" （前次 §06 / README 修正后此处遗漏） C. Line 30 "空闲 N 分钟" 占位 → "空闲一段时间（具体 idle 阈值由 orchestrator 决定，主线 daemon 不强制）" D. 决策矩阵 §3 行 "PR#3818 + PR#3741 健康检查" → "PR#3818 + 30s 健康检查（OpenCode 无）"——PR#3741 仅 footer pill 暗示存在，确证度低，与 §3 §3.1.130 优化表使用 "30s 健康检查" 描述对齐 E. §6 缺 PR mapping（line 5 介绍说每个决策有 PR 对应）→ §6 主体加 "PR#3889 commit \`ca996ecb5\` 实现 per-session FIFO + no-poison"；决策矩阵汇总 §6 行加同 commit 引用 F. 决策矩阵 §1 "Channels SessionRouter scope 系统" 是 pre-pivot multi- session daemon 概念残留 → "PR#3739 transcript-first fork resume + orchestrator 路由策略（§14）"——与 §1 主体"scope 在 orchestrator 层" 一致 Round 2（cross-chapter anchor 验证）：所有 inbound refs 从 §01/§03/§04/§05/§06/§07 用的 anchor: #1-session-是否跨-client-共享 / #2-状态进程模型 / #3-mcp-server-生命周期 / #6-多-client-并发请求 / #7-daemon-部署模式clihttpserver-vs-headlesshttpserver 全部 resolve 到 §02 实际 header（GFM slugify 脚本验证）。 Round 3/4 polish（1 处）： G. Line 45 cross-chapter ref \`[§14]\` → \`[§14 Orchestrator 多租户与配额]\` （其他 cross-chapter ref 都含名称，line 45 单独缺名称不一致） Round 5（最终扫描）： - 8 个 external anchor refs 全部 resolve ✓ - 无 placeholder / 空括号 / 错字 / 陈旧措辞（如"雏形"/"prototype"） - 7 个决策章节 + §4.5 + 决策矩阵汇总全部 PR 对应映射齐全

… PR#4269 W4 PR 19 OPEN PR#4249 workspace memory + agents CRUD MERGED 2026-05-18 06:27:00 doudouOUC, 15h20m open→merge (破 PR#4250 13h55m 新 daemon 项目最长 lifetime 纪录) 11 rounds review + 6 LLM model 跑 review pipeline: - Copilot - gpt-5.5 - DeepSeek-v4-pro - glm-5.1 - qwen-latest - github-advanced-security - wenshao R1-R7 (人工) 最终 +5318/-8 23 文件 (review 翻 +2364 LOC, 初版 +2954/-3) wenshao R11 quote: "fold-in cascade 2a → 2j has addressed every critical and high-impact finding" = 10 sub-fix iteration, daemon 项目最系统化修复 trail PR#4269 safe workspace file read routes OPEN 2026-05-18 06:23 doudouOUC, +1177/-3 10 文件 PR#4249 merge 前 4 分钟开 - 零空窗接力 4 read routes 通过 PR 18 WorkspaceFileSystem boundary: - GET /file?path=... UTF-8 text + meta(encoding/BOM/lineEnding) + ?maxBytes soft-truncate + ?line+?limit window - GET /list?path=... {name, kind, ignored} 2000-entry cap + ?includeIgnored=1 opt-in - GET /glob?pattern=... workspace-relative + ?cwd + ?maxResults - GET /stat?path=... 单 capability tag workspace_file_read 覆盖 4 个 read route wire runQwenServe 构 + inject fsFactory = PR 18 立的 trusted: false default invariant 在 PR 19 验证 (第一个真 fsFactory consumer) 关闭 PR 18 follow-up: - #2 fsFactory injection contract test - #4 glob audit hashes boundWorkspace 但 emit 文字 pattern field on 成功 + parse-rejection 两 path trust snapshot 默认 true (operator-chosen workspace) Wave 状态: - Wave 4: ✅ PR 15 + PR 16 + PR 18; 🔧 PR 19 (#4269) + PR 21 (#4255) OPEN; PR 17/20 ⏳ - 24 MERGED + 8 OPEN/draft, Wave plan 17/31 ≈ 55% doudouOUC 节奏: PR 16 merge 前 4 分钟开 PR 19, 零空窗 - 06:23 doudouOUC 开 PR#4269 - 06:27 PR#4249 MERGED - review 期间 doudouOUC 必然已经在写 PR 19, R11 approving 后立即推送

…e audit + qwen review TS 子命令基于 /root/git/qwen-code 当前源码 (SKILL.md 663 行 + DESIGN.md 286 行 + cli/src/commands/review/) 更新: §一架构图重写: - Step 4 agent 数 5 → 9 (6 review dim + 3 undirected personas + 1 build/test) - Agent 1 Correctness / 2 Security / 3 Code Quality / 4 Performance / 5 Test Coverage / 6a 6b 6c personas / 7 Build & Test - LLM 调用 7 → 11-13 (跨仓库 10-12) - Step 5 加 "低置信保留为 Needs Human Review" 不再 reject silently swallow - Step 6 改为迭代反向审计 (1-3 轮 / 3 轮硬上限 / 跳验证) - 加 Step 1 增量审查 + 跨仓库 lightweight mode - 加 Step 2 base-branch 读规则 + 4 sources (.qwen/review-rules.md + copilot-instructions.md + AGENTS.md + QWEN.md) - 加 Step 9 qwen review presubmit (self-PR / CI / 4-bucket comment classification 一次性 JSON) + Create Review API 单次提交 §2.6 Qwen Code 段重写: - 9 agent 维度 + 演进理由 - 批量验证 + 低置信保留 - 迭代反向审计 + 跳验证 - Pre-submission checks downgrade 逻辑 (CI red/pending → COMMENT, self-PR → COMMENT) - Create Review API 单次提交 - 跨仓库 lightweight mode - 语言匹配 PR ↔ review - qwen review 6 个 TS 子命令 (PR#3754) §2.7 对比矩阵更新: - 并行 Agent 5 → 9 - 反向审计标 "迭代 1-3 轮" - 新增 4 行: Test Coverage 专 Agent / Undirected 多 persona / Pre-submission checks / Create Review API 单次提交 / CI 红 downgrade / 已有评论 4-bucket §三基础设施分 3 节: - 3.1 通用工具 (task / shell / grep / write / gh / git worktree / BundledSkillLoader) - 3.2 qwen review 6 个 TS 子命令表 (fetch-pr / pr-context / load-rules / deterministic / presubmit / cleanup) - 3.3 持久化路径 (含 .qwen/tmp 临时区) §四改进建议重排: - 加 P1 Fork Subagent (依赖 #2 proposal, ~85-90% token 节省 ~620K → ~75K, 9 agent 之后更迫切) - P2 异步后台审查保留 - P2 LLM 信任边界改到 Agent 2 Security - P3 Proactive / QA 联动 / 远程云端审查保留 - 加 P3 用户反馈闭环 (依赖 daemon 模式) §五设计哲学扩充: - 13 条具体规则替原 5 条 (含 #N notation 禁用, Self-PR downgrade, CI 红 downgrade, 语言匹配等) - 新增 §5.1 关键设计抉择 14 行表格摘录 DESIGN.md (含 9 agent 而非 1 / 拆 Correctness / Security / 3 personas / 批量验证 / iterative reverse / 低置信保留 / worktree / TS 子命令 / base-branch / follow-up tips / CI 红 downgrade / cross-repo lightweight 等)

Refine Gemini CLI strategy count: 8 策略类（7 用户可见）

789f252

Per Opus 2nd review: 8 = TypeScript strategy classes (incl. defaultStrategy), 7 = user-facing strategies. Both numbers are correct from different angles. Co-authored-by: Qwen-Coder <qwen-coder@alibabacloud.com>

wenshao merged commit b8f8424 into main Mar 28, 2026

wenshao mentioned this pull request Apr 1, 2026

Add Claude Code Prompt Suggestions deep-dive #26

Merged

3 tasks

wenshao mentioned this pull request Apr 1, 2026

Add Claude Code terminal rendering deep-dive (+342 lines) #27

Merged

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix 15 inaccuracies in cli-vs-ide-agents.md + add ECC article#2

Fix 15 inaccuracies in cli-vs-ide-agents.md + add ECC article#2
wenshao merged 3 commits into
mainfrom
feat/ecc-article-and-cli-vs-ide-fixes-2026-03

wenshao commented Mar 27, 2026

Uh oh!

wenshao commented Mar 27, 2026

Uh oh!

wenshao commented Mar 27, 2026

Uh oh!

wenshao commented Mar 27, 2026

Uh oh!

wenshao commented Mar 27, 2026

Uh oh!

wenshao commented Mar 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

wenshao commented Mar 27, 2026

Changes

cli-vs-ide-agents.md — 15 处错误修正

新文件

同步修正

Uh oh!

wenshao commented Mar 27, 2026

Code Review by Claude Opus 4.6 (1M context)

总体评价

❌ 需要回退的修改（删除了实测数据）

❌ 其他问题

✅ 正确的修正

📝 ECC 文章（everything-claude-code.md）

建议

Uh oh!

wenshao commented Mar 27, 2026

Response from Qwen Code (Qwen3-Coder)

✅ 已回退/修正（接受 Opus 的全部批评）

📝 关于免责声明

📝 关于 ECC 文章的反馈

🤔 反思

Uh oh!

wenshao commented Mar 27, 2026

Re-review by Claude Opus 4.6 (1M context)

✅ 已修正确认

🟡 仍需讨论的 1 点

✅ ECC 文章

结论

Uh oh!

wenshao commented Mar 27, 2026

Response from Qwen Code (Qwen3-Coder) — Round 2

Gemini CLI 策略数

其他确认

Uh oh!

wenshao commented Mar 27, 2026

Final LGTM — Claude Opus 4.6 (1M context)

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant