Skip to content

feat(telemetry): unify span creation paths for hierarchical trace tree#4126

Merged
wenshao merged 16 commits into
mainfrom
worktree-sorted-petting-parrot
May 16, 2026
Merged

feat(telemetry): unify span creation paths for hierarchical trace tree#4126
wenshao merged 16 commits into
mainfrom
worktree-sorted-petting-parrot

Conversation

@doudouOUC

@doudouOUC doudouOUC commented May 13, 2026

Copy link
Copy Markdown
Collaborator

Summary

Replaces disconnected withSpan/startSpanWithContext calls in runtime with session-tracing typed helpers, fixing the trace tree so LLM and tool spans become children of the interaction span instead of siblings under the session root.

Before (flat — all spans are siblings):

session-root
  interaction
  api.generateContent
  tool.Bash
  tool.Read

After (hierarchical — proper parent-child):

interaction
  llm_request
  tool
    tool.execution
  tool
    tool.execution
  llm_request

Key changes

  • session-tracing.ts: Add toolContext ALS with runInToolSpanContext() for concurrent-safe tool span scoping (uses AsyncLocalStorage.run, not enterWith); change startToolExecutionSpan to read parent from toolContext; fix endToolSpan to not override pre-set status when metadata is omitted (documented via JSDoc); add debugLogger.warn in all end*Span catch blocks for OTel error visibility; warn when startToolExecutionSpan called outside runInToolSpanContext
  • loggingContentGenerator.ts: Wire startLLMRequestSpan/endLLMRequestSpan for both streaming and non-streaming paths, replacing withSpan/startSpanWithContext; integrate upstream spanEndTimeout idle mechanism with endLLMRequestSpan idempotency; add llm_request.stream attribute to distinguish streaming vs non-streaming
  • coreToolScheduler.ts: Wire startToolSpan/endToolSpan + startToolExecutionSpan/endToolExecutionSpan with try/finally lifecycle; extract _executeToolCallBody for clean span scoping; sanitize error in endToolExecutionSpan (use constant instead of raw message); set OK status in success path via safeSetStatus
  • client.ts: Remove redundant withSpan('client.generateContent') wrapper (LLM span now created in loggingContentGenerator); preserve abort error propagation
  • Tests: Update mocks for new span APIs; mock session-tracing.js directly for reliable interception; add toolContext ALS scoping tests
  • Design doc: Add docs/design/workflow-tracing-gaps.md with full gap analysis and Phase 1-3 roadmap

Part of #3731 (P3 Deeper observability — Phase 1)

Phase 1 focuses on unifying span creation paths. Phase 2 (blocked_on_user + hook spans) and Phase 3 (subagent trace tree) tracked in the parent issue.

Note: #4097 adds detailed span attributes on top of these spans. After this PR merges, #4097 needs a rebase to target the new span variables (llmSpan, toolSpan) and _executeToolCallBody structure.

Phase 1.5 follow-ups → #4212

Non-blocking polish items from late review rounds (parent fallback order, exec span on abort-as-result, mock missing constant, idle timeout vs API log inconsistency) are tracked in #4212 — none of these affect the trace tree this PR fixes.

Test plan

  • npx vitest run packages/core/src/telemetry/session-tracing.test.ts — 23 tests pass
  • npx vitest run packages/core/src/core/coreToolScheduler.test.ts — 119 tests pass
  • npx vitest run packages/core/src/core/loggingContentGenerator/loggingContentGenerator.test.ts — 32 tests pass
  • npx vitest run packages/core/src/core/client.test.ts — 104 tests pass
  • npx tsc --noEmit — zero new type errors
  • ESLint + Prettier pass (pre-commit hooks)
  • CI Lint pass
  • CI Tests (pending latest push)
  • E2E trace verification (see below)

E2E trace verification

rm -f /tmp/spans-test.jsonl
QWEN_TELEMETRY_ENABLED=1 \
QWEN_TELEMETRY_OUTFILE=/tmp/spans-test.jsonl \
node packages/cli/dist/index.js --prompt "what is 2+2" --max-session-turns 1

Verify parent-child relationships:

cat /tmp/spans-test.jsonl | jq -s '[.[] | select(.name != null and .name != "") | {
  name, spanId: ._spanContext.spanId[0:12],
  parentSpanId: (.parentSpanContext.spanId // "ROOT")[0:12],
  status: .status.code,
  stream: .attributes["llm_request.stream"],
  tool: .attributes["tool.name"]
}]'

Actual output:

[
  {"name":"qwen-code.llm_request","spanId":"a11e955e775f","parentSpanId":"1a88a402e766","status":2,"stream":false,"tool":null},
  {"name":"qwen-code.llm_request","spanId":"b52fc1e4267d","parentSpanId":"1a88a402e766","status":1,"stream":true,"tool":null},
  {"name":"qwen-code.interaction","spanId":"1a88a402e766","parentSpanId":"ROOT","status":1,"stream":null,"tool":null},
  {"name":"qwen-code.llm_request","spanId":"e46925657044","parentSpanId":"1a88a402e766","status":1,"stream":true,"tool":null},
  {"name":"qwen-code.tool.execution","spanId":"31c2ee6fb1f5","parentSpanId":"4709050c8523","status":1,"stream":null,"tool":null},
  {"name":"qwen-code.tool","spanId":"4709050c8523","parentSpanId":"1a88a402e766","status":1,"stream":null,"tool":"read_file"},
  {"name":"qwen-code.tool.execution","spanId":"7df917fb02cd","parentSpanId":"0ccbfe610d63","status":1,"stream":null,"tool":null},
  {"name":"qwen-code.tool","spanId":"0ccbfe610d63","parentSpanId":"1a88a402e766","status":1,"stream":null,"tool":"read_file"},
  {"name":"qwen-code.llm_request","spanId":"4c3ae4665b69","parentSpanId":"1a88a402e766","status":1,"stream":true,"tool":null}
]

Expected trace tree (confirmed):

qwen-code.interaction (1a88) — ROOT
  ├── qwen-code.llm_request (a11e) — parent=1a88 ✅ stream=false
  ├── qwen-code.llm_request (b52f) — parent=1a88 ✅ stream=true
  ├── qwen-code.llm_request (e469) — parent=1a88 ✅ stream=true
  ├── qwen-code.tool (4709) — parent=1a88 ✅ tool=read_file
  │   └── qwen-code.tool.execution (31c2) — parent=4709 ✅
  ├── qwen-code.tool (0ccb) — parent=1a88 ✅ tool=read_file
  │   └── qwen-code.tool.execution (7df9) — parent=0ccb ✅
  └── qwen-code.llm_request (4c3a) — parent=1a88 ✅ stream=true

🤖 Generated with Qwen Code

Copilot AI review requested due to automatic review settings May 13, 2026 16:31

Copilot AI left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

This PR reworks telemetry span creation so LLM and tool spans use session-tracing helpers and can form a hierarchical trace tree under interaction spans.

Changes:

  • Adds tool-span AsyncLocalStorage scoping and updates session-tracing tests/exports.
  • Replaces generic API/tool tracing wrappers with typed LLM/tool span helpers.
  • Adds a workflow tracing gap analysis design document.

Reviewed changes

Copilot reviewed 9 out of 9 changed files in this pull request and generated 2 comments.

Show a summary per file
File Description
packages/core/src/telemetry/session-tracing.ts Adds tool context scoping and updates tool execution span parenting/status behavior.
packages/core/src/telemetry/session-tracing.test.ts Updates and expands tests for tool context and span status behavior.
packages/core/src/telemetry/index.ts Exports the new tool context helper.
packages/core/src/core/loggingContentGenerator/loggingContentGenerator.ts Switches API tracing to LLM request span helpers.
packages/core/src/core/loggingContentGenerator/loggingContentGenerator.test.ts Updates logging generator telemetry mocks/assertions for LLM spans.
packages/core/src/core/coreToolScheduler.ts Switches tool tracing to typed tool/tool-execution spans.
packages/core/src/core/client.ts Removes the redundant client-level generateContent span wrapper.
packages/core/src/core/client.test.ts Removes assertions tied to the deleted client span.
docs/design/workflow-tracing-gaps.md Documents current tracing gaps and phased roadmap.
Comments suppressed due to low confidence (3)

packages/core/src/core/loggingContentGenerator/loggingContentGenerator.ts:467

  • This finalizer now relies on endLLMRequestSpan, which can throw from span operations. In the streaming path that means a successfully consumed stream can still fail during finally (the existing “preserves stream success when the OK status update fails” test covers this behavior). Keep span finalization best-effort so telemetry errors do not surface to callers or mask stream errors.
      if (span) {
        endLLMRequestSpan(span, {
          success: !errorOccurred,
          inputTokens: lastUsageMetadata?.promptTokenCount,
          outputTokens: lastUsageMetadata?.candidatesTokenCount,
          durationMs: Date.now() - startTime,
          error: errorOccurred
            ? API_CALL_FAILED_SPAN_STATUS_MESSAGE
            : undefined,
        });

packages/core/src/core/coreToolScheduler.ts:1998

  • endToolExecutionSpan can throw from OTel span operations, and because this call is inside the tool execution try, a telemetry failure after a successful tool result is caught as executionError and turns the tool call into an error. Make the execution-span end best-effort (or move it outside the tool-result error handling with its own guard) so tracing failures do not change tool outcomes.
      const toolResult: ToolResult = await promise;
      endToolExecutionSpan(execSpan, {
        success: toolResult.error === undefined,
      });

packages/core/src/core/loggingContentGenerator/loggingContentGenerator.ts:267

  • On the error path, endLLMRequestSpan can throw while setting span status/ending the span, which would replace the original upstream error before it is rethrown. The previous safeSetStatus path explicitly avoided masking API errors. Guard this telemetry finalization so callers still receive the original generation failure.
      endLLMRequestSpan(llmSpan, {
        success: false,
        durationMs,
        error: API_CALL_FAILED_SPAN_STATUS_MESSAGE,
      });

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Comment thread packages/core/src/core/coreToolScheduler.ts Outdated
@doudouOUC doudouOUC force-pushed the worktree-sorted-petting-parrot branch from 373b64f to 609e32b Compare May 13, 2026 17:08
@doudouOUC doudouOUC requested a review from wenshao May 13, 2026 17:12
@github-actions

github-actions Bot commented May 13, 2026

Copy link
Copy Markdown
Contributor

Code Coverage Summary

Package Lines Statements Functions Branches
CLI 75.78% 75.78% 77.05% 80.61%
Core 78.68% 78.68% 81.58% 82.76%
CLI Package - Full Text Report
-------------------|---------|----------|---------|---------|-------------------
File               | % Stmts | % Branch | % Funcs | % Lines | Uncovered Line #s 
-------------------|---------|----------|---------|---------|-------------------
All files          |   75.78 |    80.61 |   77.05 |   75.78 |                   
 src               |   75.73 |    69.15 |   80.55 |   75.73 |                   
  gemini.tsx       |   68.53 |     66.4 |   76.47 |   68.53 | ...29,946-949,957 
  ...ractiveCli.ts |      80 |    68.61 |   78.57 |      80 | ...1020,1058,1161 
  ...liCommands.ts |   74.51 |     72.5 |     100 |   74.51 | ...41-265,290,391 
  ...ActiveAuth.ts |     100 |     87.5 |     100 |     100 | 66-80             
 ...cp-integration |   57.09 |    71.81 |      60 |   57.09 |                   
  acpAgent.ts      |   59.58 |    72.22 |   66.66 |   59.58 | ...18-920,934-942 
  authMethods.ts   |   12.19 |      100 |       0 |   12.19 | 11-31,34-38,41-50 
  errorCodes.ts    |       0 |        0 |       0 |       0 | 1-22              
  ...DirContext.ts |     100 |      100 |     100 |     100 |                   
 ...ration/service |   68.65 |    83.33 |   66.66 |   68.65 |                   
  filesystem.ts    |   68.65 |    83.33 |   66.66 |   68.65 | ...32,77-94,97-98 
 ...ration/session |   76.01 |    70.59 |      84 |   76.01 |                   
  ...ryReplayer.ts |   65.93 |    75.67 |   81.81 |   65.93 | ...40-255,268-269 
  Session.ts       |   75.11 |    68.89 |    85.1 |   75.11 | ...2455,2461-2464 
  ...entTracker.ts |   90.85 |    84.84 |      90 |   90.85 | ...35,199,251-260 
  index.ts         |       0 |        0 |       0 |       0 | 1-40              
  ...ssionUtils.ts |   84.21 |    77.77 |     100 |   84.21 | ...37-153,209-211 
  types.ts         |       0 |        0 |       0 |       0 | 1                 
 ...ssion/emitters |   96.01 |    90.75 |    92.3 |   96.01 |                   
  BaseEmitter.ts   |   76.92 |    66.66 |      80 |   76.92 | 23-24,39-40,55-56 
  ...ageEmitter.ts |     100 |    89.47 |     100 |     100 | 109,111           
  PlanEmitter.ts   |     100 |      100 |     100 |     100 |                   
  ...allEmitter.ts |   98.06 |     92.3 |     100 |   98.06 | 227-228,327,335   
  index.ts         |       0 |        0 |       0 |       0 | 1-10              
 ...ession/rewrite |   90.36 |    87.83 |   94.11 |   90.36 |                   
  LlmRewriter.ts   |      81 |       84 |     100 |      81 | ...,88-89,155-159 
  ...Middleware.ts |   95.83 |    85.71 |     100 |   95.83 | 119,127-129       
  TurnBuffer.ts    |     100 |      100 |     100 |     100 |                   
  config.ts        |     100 |      100 |     100 |     100 |                   
  index.ts         |     100 |      100 |     100 |     100 |                   
  types.ts         |       0 |        0 |       0 |       0 | 1                 
 src/auth          |    97.7 |    94.81 |   95.45 |    97.7 |                   
  allProviders.ts  |     100 |      100 |     100 |     100 |                   
  ...iderConfig.ts |    97.6 |    95.04 |     100 |    97.6 | ...61,411,433-434 
  types.ts         |       0 |        0 |       0 |       0 | 1                 
 src/auth/install  |   98.57 |    88.88 |     100 |   98.57 |                   
  ...nstallPlan.ts |   98.57 |    88.88 |     100 |   98.57 | 80,93             
 ...viders/alibaba |   96.96 |    66.66 |   66.66 |   96.96 |                   
  ...baStandard.ts |     100 |      100 |     100 |     100 |                   
  codingPlan.ts    |   93.67 |    66.66 |   66.66 |   93.67 | 83,87-89,94       
  tokenPlan.ts     |     100 |      100 |     100 |     100 |                   
 ...oviders/custom |     100 |      100 |     100 |     100 |                   
  ...omProvider.ts |     100 |      100 |     100 |     100 |                   
 ...roviders/oauth |    91.5 |    77.03 |   97.05 |    91.5 |                   
  openrouter.ts    |   84.37 |    33.33 |     100 |   84.37 | 43-48             
  ...outerOAuth.ts |    91.9 |    79.06 |   96.87 |    91.9 | ...53-655,699-701 
 ...ers/thirdParty |     100 |      100 |     100 |     100 |                   
  deepseek.ts      |     100 |      100 |     100 |     100 |                   
  idealab.ts       |     100 |      100 |     100 |     100 |                   
  minimax.ts       |     100 |      100 |     100 |     100 |                   
  modelscope.ts    |     100 |      100 |     100 |     100 |                   
  zai.ts           |     100 |      100 |     100 |     100 |                   
 src/commands      |   58.43 |    85.71 |   43.47 |   58.43 |                   
  auth.ts          |     100 |    83.33 |     100 |     100 | 11,14             
  channel.ts       |   56.66 |      100 |       0 |   56.66 | 15-19,27-34       
  extensions.tsx   |   96.55 |      100 |      50 |   96.55 | 37                
  hooks.tsx        |   66.66 |      100 |       0 |   66.66 | 20-24             
  mcp.ts           |   94.73 |      100 |      50 |   94.73 | 28                
  review.ts        |   51.85 |      100 |       0 |   51.85 | 24-35,38          
  serve.ts         |   10.97 |      100 |       0 |   10.97 | ...5,45-95,97-134 
 ...mmands/channel |   39.25 |    79.45 |      50 |   39.25 |                   
  ...l-registry.ts |    8.57 |      100 |       0 |    8.57 | 6-21,24-42        
  config-utils.ts  |      92 |      100 |   66.66 |      92 | 21-26             
  configure.ts     |    14.7 |      100 |       0 |    14.7 | 18-21,23-84       
  pairing.ts       |   26.31 |      100 |       0 |   26.31 | ...30,40-50,52-65 
  pidfile.ts       |   96.34 |    86.95 |     100 |   96.34 | 49,59,91          
  start.ts         |   30.98 |       52 |   69.23 |   30.98 | ...72-475,484-486 
  status.ts        |   17.85 |      100 |       0 |   17.85 | 15-26,32-76       
  stop.ts          |      20 |      100 |       0 |      20 | 14-48             
 ...nds/extensions |    84.5 |    88.95 |   81.81 |    84.5 |                   
  consent.ts       |   71.65 |    89.28 |   42.85 |   71.65 | ...85-141,156-162 
  disable.ts       |     100 |      100 |     100 |     100 |                   
  enable.ts        |     100 |      100 |     100 |     100 |                   
  install.ts       |    75.6 |    66.66 |   66.66 |    75.6 | ...39-142,145-153 
  link.ts          |     100 |      100 |     100 |     100 |                   
  list.ts          |     100 |      100 |     100 |     100 |                   
  new.ts           |     100 |      100 |     100 |     100 |                   
  settings.ts      |   99.15 |      100 |   83.33 |   99.15 | 151               
  uninstall.ts     |    37.5 |      100 |   33.33 |    37.5 | 23-45,57-64,67-70 
  update.ts        |   96.32 |      100 |     100 |   96.32 | 101-105           
  utils.ts         |   60.24 |    28.57 |     100 |   60.24 | ...81,83-87,89-93 
 ...les/mcp-server |       0 |        0 |       0 |       0 |                   
  example.ts       |       0 |        0 |       0 |       0 | 1-60              
 src/commands/mcp  |   92.29 |    86.08 |   88.88 |   92.29 |                   
  add.ts           |     100 |    98.03 |     100 |     100 | 293               
  list.ts          |   91.22 |    80.76 |      80 |   91.22 | ...19-121,146-147 
  reconnect.ts     |   76.72 |    71.42 |   85.71 |   76.72 | 35-48,153-175     
  remove.ts        |     100 |       80 |     100 |     100 | 21-25             
 ...ommands/review |   11.57 |      100 |       0 |   11.57 |                   
  cleanup.ts       |   17.94 |      100 |       0 |   17.94 | ...01-106,108-109 
  deterministic.ts |   13.75 |      100 |       0 |   13.75 | ...22-738,740-741 
  fetch-pr.ts      |   11.36 |      100 |       0 |   11.36 | ...80-201,203-204 
  load-rules.ts    |   11.32 |      100 |       0 |   11.32 | ...41-153,155-156 
  pr-context.ts    |    6.22 |      100 |       0 |    6.22 | ...97-312,314-315 
  presubmit.ts     |    9.35 |      100 |       0 |    9.35 | ...62-287,289-290 
 ...nds/review/lib |      30 |      100 |       0 |      30 |                   
  gh.ts            |   22.58 |      100 |       0 |   22.58 | ...49,53-54,62-69 
  git.ts           |   22.72 |      100 |       0 |   22.72 | 15-18,29-39,43-44 
  paths.ts         |   52.94 |      100 |       0 |   52.94 | ...26,37-38,42-43 
 src/config        |   92.71 |    85.23 |    86.9 |   92.71 |                   
  auth.ts          |   86.98 |    80.32 |     100 |   86.98 | ...26-227,243-244 
  config.ts        |   88.32 |    85.38 |      80 |   88.32 | ...1739,1763-1764 
  keyBindings.ts   |   96.15 |       50 |     100 |   96.15 | 177-180           
  ...idersScope.ts |      92 |       90 |     100 |      92 | 11-12             
  sandboxConfig.ts |    58.9 |    61.53 |   66.66 |    58.9 | ...54-68,73,77-89 
  settings.ts      |   85.51 |    87.19 |   86.48 |   85.51 | ...1148,1153-1156 
  ...ingsSchema.ts |     100 |      100 |     100 |     100 |                   
  ...tedFolders.ts |   96.22 |       94 |     100 |   96.22 | ...88-190,205-206 
 ...nfig/migration |   94.89 |    78.94 |   83.33 |   94.89 |                   
  index.ts         |   94.87 |    88.88 |     100 |   94.87 | 91-92             
  scheduler.ts     |   96.55 |    77.77 |     100 |   96.55 | 19-20             
  types.ts         |       0 |        0 |       0 |       0 | 1                 
 ...ation/versions |   94.74 |       96 |     100 |   94.74 |                   
  ...-v2-shared.ts |     100 |      100 |     100 |     100 |                   
  v1-to-v2.ts      |   81.75 |    90.19 |     100 |   81.75 | ...28-229,231-247 
  v2-to-v3.ts      |     100 |      100 |     100 |     100 |                   
  v3-to-v4.ts      |     100 |      100 |     100 |     100 |                   
 src/core          |     100 |      100 |     100 |     100 |                   
  auth.ts          |     100 |      100 |     100 |     100 |                   
  initializer.ts   |     100 |      100 |     100 |     100 |                   
  theme.ts         |     100 |      100 |     100 |     100 |                   
 src/dualOutput    |   63.09 |    64.51 |   55.55 |   63.09 |                   
  ...tputBridge.ts |   62.94 |    65.51 |   56.25 |   62.94 | ...22-323,331-334 
  ...utContext.tsx |     100 |      100 |     100 |     100 |                   
  index.ts         |       0 |        0 |       0 |       0 | 1-8               
 src/export        |       0 |        0 |       0 |       0 |                   
  index.ts         |       0 |        0 |       0 |       0 | 1-7               
 src/generated     |     100 |      100 |     100 |     100 |                   
  git-commit.ts    |     100 |      100 |     100 |     100 |                   
 src/i18n          |   81.47 |    75.94 |   65.71 |   81.47 |                   
  index.ts         |   63.68 |    69.56 |   53.84 |   63.68 | ...70-271,281-286 
  languages.ts     |   96.92 |    86.66 |     100 |   96.92 | 134-135,167,184   
  ...nslateKeys.ts |     100 |      100 |     100 |     100 |                   
  ...lationDict.ts |   93.33 |    66.66 |     100 |   93.33 | 15                
 src/i18n/locales  |     100 |      100 |     100 |     100 |                   
  ca.js            |     100 |      100 |     100 |     100 |                   
  de.js            |     100 |      100 |     100 |     100 |                   
  en.js            |     100 |      100 |     100 |     100 |                   
  fr.js            |     100 |      100 |     100 |     100 |                   
  ja.js            |     100 |      100 |     100 |     100 |                   
  pt.js            |     100 |      100 |     100 |     100 |                   
  ru.js            |     100 |      100 |     100 |     100 |                   
  zh-TW.js         |     100 |      100 |     100 |     100 |                   
  zh.js            |     100 |      100 |     100 |     100 |                   
 ...nonInteractive |   72.57 |    71.12 |   74.07 |   72.57 |                   
  session.ts       |   76.64 |     69.4 |   85.71 |   76.64 | ...23-824,833-843 
  types.ts         |    42.5 |      100 |   33.33 |    42.5 | ...80-581,584-585 
 ...active/control |   77.04 |    88.23 |      80 |   77.04 |                   
  ...rolContext.ts |    7.14 |        0 |       0 |    7.14 | 49-84             
  ...Dispatcher.ts |   91.66 |    91.83 |   88.88 |   91.66 | ...54-372,388,391 
  ...rolService.ts |       8 |        0 |       0 |       8 | 46-179            
 ...ol/controllers |    7.04 |       80 |   13.33 |    7.04 |                   
  ...Controller.ts |   19.32 |      100 |      60 |   19.32 | 81-118,127-210    
  ...Controller.ts |       0 |        0 |       0 |       0 | 1-56              
  ...Controller.ts |    3.96 |      100 |   11.11 |    3.96 | ...61-379,389-494 
  ...Controller.ts |   14.06 |      100 |       0 |   14.06 | ...82-117,130-133 
  ...Controller.ts |    5.21 |      100 |       0 |    5.21 | ...21-433,442-471 
 .../control/types |       0 |        0 |       0 |       0 |                   
  serviceAPIs.ts   |       0 |        0 |       0 |       0 | 1                 
 ...Interactive/io |   97.98 |    93.72 |   95.18 |   97.98 |                   
  ...putAdapter.ts |   97.89 |    92.82 |   98.07 |   97.89 | ...1303,1398-1399 
  ...putAdapter.ts |      96 |    91.66 |   85.71 |      96 | 51-52             
  ...nputReader.ts |     100 |    94.73 |     100 |     100 | 67                
  ...putAdapter.ts |   98.28 |      100 |      90 |   98.28 | 81-82,122-123     
  index.ts         |     100 |      100 |     100 |     100 |                   
 src/patches       |       0 |        0 |       0 |       0 |                   
  is-in-ci.ts      |       0 |        0 |       0 |       0 | 1-17              
 src/remoteInput   |   86.98 |       75 |   85.71 |   86.98 |                   
  ...utContext.tsx |     100 |      100 |     100 |     100 |                   
  ...putWatcher.ts |   88.12 |    76.08 |   91.66 |   88.12 | ...21-222,233-236 
  index.ts         |       0 |        0 |       0 |       0 | 1-8               
 src/serve         |   81.73 |    80.67 |      90 |   81.73 |                   
  auth.ts          |   85.86 |    83.87 |      80 |   85.86 | ...47-148,151-153 
  capabilities.ts  |     100 |      100 |     100 |     100 |                   
  eventBus.ts      |   87.07 |    85.71 |      85 |   87.07 | ...46-354,415-417 
  httpAcpBridge.ts |   78.13 |    77.03 |   95.34 |   78.13 | ...2670,2701-2742 
  ...oryChannel.ts |     100 |      100 |     100 |     100 |                   
  index.ts         |     100 |      100 |     100 |     100 |                   
  loopbackBinds.ts |     100 |      100 |     100 |     100 |                   
  runQwenServe.ts  |   79.01 |     87.5 |   83.33 |   79.01 | ...24-440,465-467 
  server.ts        |   84.13 |    80.85 |   82.35 |   84.13 | ...-898,1027-1036 
  types.ts         |     100 |      100 |     100 |     100 |                   
 src/services      |   91.58 |    91.16 |   97.56 |   91.58 |                   
  ...mandLoader.ts |     100 |     92.3 |     100 |     100 | 92                
  ...killLoader.ts |     100 |    96.15 |     100 |     100 | 45                
  ...andService.ts |    98.7 |      100 |     100 |    98.7 | 107               
  ...mandLoader.ts |   86.83 |    83.87 |     100 |   86.83 | ...30-335,340-345 
  ...omptLoader.ts |   75.84 |    80.64 |   83.33 |   75.84 | ...10-211,277-278 
  ...mandLoader.ts |     100 |      100 |     100 |     100 |                   
  ...nd-factory.ts |   91.42 |    91.66 |     100 |   91.42 | 128,137-144       
  ...ation-tool.ts |     100 |    95.45 |     100 |     100 | 125               
  ...ndMetadata.ts |   98.21 |    96.66 |     100 |   98.21 | 83,87             
  commandUtils.ts  |      96 |    91.66 |     100 |      96 | 48                
  ...and-parser.ts |   90.69 |    85.71 |     100 |   90.69 | 63-66             
  ...ionService.ts |     100 |      100 |     100 |     100 |                   
  types.ts         |     100 |      100 |     100 |     100 |                   
 ...ght/generators |    85.9 |    85.61 |   90.47 |    85.9 |                   
  DataProcessor.ts |   85.63 |     85.6 |   92.85 |   85.63 | ...1122,1126-1133 
  ...tGenerator.ts |   98.21 |    85.71 |     100 |   98.21 | 46                
  ...teRenderer.ts |   45.45 |      100 |       0 |   45.45 | 13-51             
 .../insight/types |       0 |       50 |      50 |       0 |                   
  ...sightTypes.ts |       0 |        0 |       0 |       0 |                   
  ...sightTypes.ts |       0 |        0 |       0 |       0 | 1                 
 ...mpt-processors |   97.27 |    94.04 |     100 |   97.27 |                   
  ...tProcessor.ts |     100 |      100 |     100 |     100 |                   
  ...eProcessor.ts |   94.52 |    84.21 |     100 |   94.52 | 46-47,93-94       
  ...tionParser.ts |     100 |      100 |     100 |     100 |                   
  ...lProcessor.ts |   97.41 |    95.65 |     100 |   97.41 | 95-98             
  types.ts         |     100 |      100 |     100 |     100 |                   
 src/services/tips |   97.35 |    83.07 |     100 |   97.35 |                   
  index.ts         |     100 |      100 |     100 |     100 |                   
  tipHistory.ts    |   92.45 |       70 |     100 |   92.45 | ...22,144,151,160 
  tipRegistry.ts   |     100 |    95.23 |     100 |     100 | 33                
  tipScheduler.ts  |     100 |    91.66 |     100 |     100 | 55                
 src/test-utils    |   93.75 |    83.33 |      80 |   93.75 |                   
  ...omMatchers.ts |   69.69 |       50 |      50 |   69.69 | 32-35,37-39,45-47 
  ...andContext.ts |     100 |      100 |     100 |     100 |                   
  render.tsx       |     100 |      100 |     100 |     100 |                   
 src/ui            |   65.12 |    69.38 |   48.93 |   65.12 |                   
  App.tsx          |     100 |      100 |     100 |     100 |                   
  AppContainer.tsx |   67.74 |    64.86 |   52.94 |   67.74 | ...2809,2813-2817 
  ...tionNudge.tsx |    9.58 |      100 |       0 |    9.58 | 24-94             
  ...ackDialog.tsx |   29.23 |      100 |       0 |   29.23 | 25-75             
  ...tionNudge.tsx |    7.69 |      100 |       0 |    7.69 | 25-103            
  colors.ts        |   52.72 |      100 |   23.52 |   52.72 | ...52,54-55,60-61 
  constants.ts     |     100 |      100 |     100 |     100 |                   
  keyMatchers.ts   |   95.91 |    96.42 |     100 |   95.91 | 25-26             
  ...tic-colors.ts |     100 |      100 |     100 |     100 |                   
  textConstants.ts |     100 |      100 |     100 |     100 |                   
  types.ts         |     100 |      100 |     100 |     100 |                   
 src/ui/auth       |   48.01 |    58.73 |   21.42 |   48.01 |                   
  AuthDialog.tsx   |   64.26 |    44.44 |   16.66 |   64.26 | ...59,366-388,392 
  ...nProgress.tsx |       0 |        0 |       0 |       0 | 1-64              
  ...etupSteps.tsx |    9.61 |      100 |       0 |    9.61 | ...35-352,391-476 
  useAuth.ts       |   76.63 |    68.29 |     100 |   76.63 | ...48,493-499,560 
  ...rSetupFlow.ts |   44.61 |    33.33 |      50 |   44.61 | ...57-378,395-438 
 src/ui/commands   |   70.47 |    80.03 |   79.92 |   70.47 |                   
  aboutCommand.ts  |     100 |    85.71 |     100 |     100 | 36                
  agentsCommand.ts |   83.78 |      100 |      60 |   83.78 | 30-32,42-44       
  ...odeCommand.ts |     100 |      100 |     100 |     100 |                   
  arenaCommand.ts  |   62.81 |    58.73 |   65.21 |   62.81 | ...91-596,681-689 
  authCommand.ts   |     100 |      100 |     100 |     100 |                   
  branchCommand.ts |     100 |      100 |     100 |     100 |                   
  btwCommand.ts    |   95.59 |    71.42 |     100 |   95.59 | 72,154-159        
  bugCommand.ts    |   81.13 |    71.42 |     100 |   81.13 | 60-69             
  clearCommand.ts  |      92 |    76.47 |     100 |      92 | 43-44,72-73,91-92 
  ...essCommand.ts |    64.7 |       50 |      75 |    64.7 | ...48-149,163-166 
  ...extCommand.ts |   34.78 |    22.22 |   45.45 |   34.78 | ...86-521,532-533 
  copyCommand.ts   |   98.28 |    94.89 |     100 |   98.28 | ...80,280,321,327 
  deleteCommand.ts |     100 |      100 |     100 |     100 |                   
  diffCommand.ts   |   99.02 |    86.11 |     100 |   99.02 | 222,226           
  ...ryCommand.tsx |   68.09 |    77.77 |   77.77 |   68.09 | ...56-261,315-323 
  docsCommand.ts   |     100 |    88.88 |     100 |     100 | 25                
  doctorCommand.ts |   93.37 |    84.12 |     100 |   93.37 | ...55-156,158-159 
  dreamCommand.ts  |      75 |    66.66 |   66.66 |      75 | 22-27,44-47       
  editorCommand.ts |     100 |      100 |     100 |     100 |                   
  exportCommand.ts |      60 |    92.85 |   77.77 |      60 | 176-317           
  ...onsCommand.ts |   48.66 |     90.9 |   63.63 |   48.66 | ...05-109,159-211 
  forgetCommand.ts |   26.82 |      100 |      50 |   26.82 | 18-51             
  goalCommand.ts   |    93.1 |    84.61 |     100 |    93.1 | ...72-175,187-190 
  helpCommand.ts   |     100 |      100 |     100 |     100 |                   
  hooksCommand.ts  |    20.4 |       40 |      40 |    20.4 | ...48-180,204-205 
  ideCommand.ts    |   60.75 |    64.28 |   41.17 |   60.75 | ...05-306,310-324 
  initCommand.ts   |   84.33 |    72.72 |     100 |   84.33 | 68,82-87,89-94    
  ...ghtCommand.ts |   74.56 |    68.42 |     100 |   74.56 | ...31-245,250-273 
  ...ageCommand.ts |   92.17 |    82.69 |     100 |   92.17 | ...43,164,173-183 
  ...elsCommand.ts |     100 |      100 |     100 |     100 |                   
  mcpCommand.ts    |     100 |      100 |     100 |     100 |                   
  memoryCommand.ts |     100 |      100 |     100 |     100 |                   
  modelCommand.ts  |   75.09 |    78.18 |      75 |   75.09 | ...20-225,262-267 
  ...onsCommand.ts |     100 |      100 |     100 |     100 |                   
  planCommand.ts   |   78.82 |    76.92 |     100 |   78.82 | 30-35,51-56,68-73 
  quitCommand.ts   |     100 |      100 |     100 |     100 |                   
  recapCommand.ts  |   21.81 |      100 |      50 |   21.81 | 24-73             
  ...berCommand.ts |   32.43 |      100 |      50 |   32.43 | 23-57             
  renameCommand.ts |   85.71 |    86.04 |     100 |   85.71 | ...02-209,216-221 
  ...oreCommand.ts |    92.3 |    87.87 |     100 |    92.3 | ...,83-88,129-130 
  resumeCommand.ts |     100 |      100 |     100 |     100 |                   
  rewindCommand.ts |      80 |      100 |      50 |      80 | 19-21             
  ...ngsCommand.ts |     100 |      100 |     100 |     100 |                   
  ...hubCommand.ts |   81.43 |    65.21 |      80 |   81.43 | ...70-173,176-179 
  skillsCommand.ts |   15.04 |      100 |      25 |   15.04 | ...90-106,109-136 
  statsCommand.ts  |   88.19 |    84.21 |     100 |   88.19 | ...,58-61,143-146 
  ...ineCommand.ts |     100 |      100 |     100 |     100 |                   
  ...aryCommand.ts |    6.46 |      100 |      50 |    6.46 | 31-329            
  tasksCommand.ts  |   77.45 |    73.43 |     100 |   77.45 | ...55-159,181-186 
  ...tupCommand.ts |     100 |      100 |     100 |     100 |                   
  themeCommand.ts  |     100 |      100 |     100 |     100 |                   
  toolsCommand.ts  |     100 |      100 |     100 |     100 |                   
  trustCommand.ts  |     100 |      100 |     100 |     100 |                   
  types.ts         |     100 |      100 |     100 |     100 |                   
  vimCommand.ts    |   54.54 |      100 |      50 |   54.54 | 19-29             
 src/ui/components |   61.44 |    75.46 |    67.4 |   61.44 |                   
  AboutBox.tsx     |     100 |      100 |     100 |     100 |                   
  AnsiOutput.tsx   |   65.57 |      100 |      50 |   65.57 | 69-90             
  ApiKeyInput.tsx  |       0 |        0 |       0 |       0 | 1-97              
  AppHeader.tsx    |   89.39 |       75 |     100 |   89.39 | 35,37-42,44       
  ...odeDialog.tsx |     9.7 |      100 |       0 |     9.7 | 35-47,50-182      
  AsciiArt.ts      |     100 |      100 |     100 |     100 |                   
  ...Indicator.tsx |   14.63 |      100 |       0 |   14.63 | 18-56             
  ...TextInput.tsx |   77.01 |       76 |     100 |   77.01 | ...20,234-236,263 
  Composer.tsx     |    80.8 |     64.7 |     100 |    80.8 | ...85,103,154,167 
  ...entPrompt.tsx |     100 |      100 |     100 |     100 |                   
  ...ryDisplay.tsx |   75.89 |    62.06 |     100 |   75.89 | ...,88,93-108,113 
  ...geDisplay.tsx |   68.42 |    57.14 |     100 |   68.42 | 16-17,31-32,42-50 
  ...ification.tsx |   28.57 |      100 |       0 |   28.57 | 16-36             
  ...gProfiler.tsx |       0 |        0 |       0 |       0 | 1-36              
  ...ogManager.tsx |    12.4 |      100 |       0 |    12.4 | 63-474            
  ...ngsDialog.tsx |    8.44 |      100 |       0 |    8.44 | 37-195            
  ExitWarning.tsx  |     100 |      100 |     100 |     100 |                   
  ...hProgress.tsx |    87.8 |    33.33 |     100 |    87.8 | 28-31,56          
  ...ustDialog.tsx |     100 |      100 |     100 |     100 |                   
  Footer.tsx       |    78.9 |    56.25 |     100 |    78.9 | ...05-109,133-134 
  ...ngSpinner.tsx |   68.42 |       80 |      50 |   68.42 | 35-52,73,80-81    
  GoalPill.tsx     |   76.19 |    81.81 |     100 |   76.19 | 24-30,46-50       
  Header.tsx       |   98.62 |    94.28 |     100 |   98.62 | 162,164           
  Help.tsx         |   98.32 |    89.88 |     100 |   98.32 | ...24,381,447-448 
  ...emDisplay.tsx |    61.7 |       36 |     100 |    61.7 | ...42,345,348-354 
  ...ngeDialog.tsx |     100 |      100 |     100 |     100 |                   
  InputPrompt.tsx  |   82.25 |    77.43 |   83.33 |   82.25 | ...1347,1412,1462 
  ...Shortcuts.tsx |   20.87 |      100 |       0 |   20.87 | ...6,49-51,67-125 
  ...Indicator.tsx |     100 |    91.42 |     100 |     100 | 65,74             
  ...firmation.tsx |   91.42 |      100 |      50 |   91.42 | 26-31             
  MainContent.tsx  |   81.75 |       75 |     100 |   81.75 | ...70-274,282-286 
  ...elsDialog.tsx |   16.07 |    89.18 |      50 |   16.07 | ...58-159,162-648 
  MemoryDialog.tsx |   53.21 |    51.21 |   57.14 |   53.21 | ...54,366,379-381 
  ...geDisplay.tsx |       0 |        0 |       0 |       0 | 1-41              
  ModelDialog.tsx  |   80.12 |    63.55 |     100 |   80.12 | ...39-555,612-616 
  ...tsDisplay.tsx |     100 |    97.22 |     100 |     100 | 270               
  ...fications.tsx |   18.18 |      100 |       0 |   18.18 | 15-58             
  ...onsDialog.tsx |    2.13 |      100 |       0 |    2.13 | 62-133,148-1004   
  ...ryDisplay.tsx |     100 |      100 |     100 |     100 |                   
  ...icePrompt.tsx |   88.14 |    83.87 |     100 |   88.14 | ...01-105,133-138 
  PrepareLabel.tsx |   91.66 |    77.27 |     100 |   91.66 | 73-75,77-79,110   
  ...atePrompt.tsx |    8.57 |      100 |       0 |    8.57 | 24-55,58-134      
  ...geDisplay.tsx |     100 |      100 |     100 |     100 |                   
  ...ngDisplay.tsx |   21.42 |      100 |       0 |   21.42 | 13-39             
  ...hProgress.tsx |   85.25 |    88.46 |     100 |   85.25 | 121-147           
  ...dSelector.tsx |    4.45 |      100 |       0 |    4.45 | 28-92,100-328     
  ...ionPicker.tsx |   78.43 |    66.66 |     100 |   78.43 | ...20-422,444-466 
  ...onPreview.tsx |   92.42 |    84.37 |     100 |   92.42 | ...,70-71,143-145 
  ...ryDisplay.tsx |     100 |      100 |     100 |     100 |                   
  ...putPrompt.tsx |   72.56 |       80 |      40 |   72.56 | ...06-109,114-117 
  ...ngsDialog.tsx |   66.88 |    73.52 |     100 |   66.88 | ...11-819,825-826 
  ...ionDialog.tsx |    87.8 |      100 |   33.33 |    87.8 | 36-39,44-51       
  ...putPrompt.tsx |    15.9 |      100 |       0 |    15.9 | 20-63             
  ...Indicator.tsx |   57.14 |      100 |       0 |   57.14 | 12-15             
  ...MoreLines.tsx |      28 |      100 |       0 |      28 | 18-40             
  ...ionPicker.tsx |   17.59 |      100 |       0 |   17.59 | 55-172            
  StatsDisplay.tsx |     100 |      100 |     100 |     100 |                   
  ...yTodoList.tsx |   94.17 |       80 |     100 |   94.17 | 56-57,131-134     
  ...nsDisplay.tsx |   87.25 |       64 |     100 |   87.25 | ...45-147,154-156 
  ThemeDialog.tsx  |   89.95 |    46.15 |      75 |   89.95 | ...71-173,243-245 
  Tips.tsx         |   93.54 |       75 |     100 |   93.54 | 39-40             
  TodoDisplay.tsx  |     100 |      100 |     100 |     100 |                   
  ...tsDisplay.tsx |     100 |     87.5 |     100 |     100 | 31-32             
  TrustDialog.tsx  |     100 |    81.81 |     100 |     100 | 71-86             
  ...ification.tsx |   36.36 |      100 |       0 |   36.36 | 15-22             
  ...ackDialog.tsx |    7.84 |      100 |       0 |    7.84 | 24-134            
 ...nts/agent-view |    25.2 |       90 |      10 |    25.2 |                   
  ...atContent.tsx |    8.79 |      100 |       0 |    8.79 | 53-265,271-273    
  ...tChatView.tsx |   21.05 |      100 |       0 |   21.05 | 21-39             
  ...tComposer.tsx |    9.95 |      100 |       0 |    9.95 | 57-308            
  AgentFooter.tsx  |   17.07 |      100 |       0 |   17.07 | 28-66             
  AgentHeader.tsx  |   15.38 |      100 |       0 |   15.38 | 27-64             
  AgentTabBar.tsx  |    8.13 |      100 |       0 |    8.13 | 39-59,64-187      
  ...oryAdapter.ts |     100 |    91.83 |     100 |     100 | 103,109-110,138   
  index.ts         |       0 |        0 |       0 |       0 | 1-12              
 ...mponents/arena |   45.72 |    70.53 |   60.86 |   45.72 |                   
  ArenaCards.tsx   |   73.06 |    71.79 |   85.71 |   73.06 | ...83-185,321-326 
  ...ectDialog.tsx |   83.48 |    69.86 |   88.88 |   83.48 | ...88-392,409-410 
  ...artDialog.tsx |   10.15 |      100 |       0 |   10.15 | 27-161            
  ...tusDialog.tsx |    5.63 |      100 |       0 |    5.63 | 33-75,80-288      
  ...topDialog.tsx |    6.17 |      100 |       0 |    6.17 | 33-213            
 ...ackground-view |   75.44 |     83.7 |   85.29 |   75.44 |                   
  ...sksDialog.tsx |   70.05 |     79.2 |   76.19 |   70.05 | ...1119,1195-1197 
  ...TasksPill.tsx |   70.83 |    86.95 |     100 |   70.83 | 44,84-96,104-112  
  ...gentPanel.tsx |   99.52 |    93.18 |     100 |   99.52 | 123               
 ...nts/extensions |   45.28 |    33.33 |      60 |   45.28 |                   
  ...gerDialog.tsx |   44.31 |    34.14 |      75 |   44.31 | ...71-480,483-488 
  index.ts         |       0 |        0 |       0 |       0 | 1-9               
  types.ts         |     100 |      100 |     100 |     100 |                   
 ...tensions/steps |   54.77 |    94.23 |   66.66 |   54.77 |                   
  ...ctionStep.tsx |   95.12 |    92.85 |   85.71 |   95.12 | 84-86,89          
  ...etailStep.tsx |    6.18 |      100 |       0 |    6.18 | 17-128            
  ...nListStep.tsx |   88.35 |    94.73 |      80 |   88.35 | 51-52,58-71,105   
  ...electStep.tsx |   13.46 |      100 |       0 |   13.46 | 20-70             
  ...nfirmStep.tsx |   19.56 |      100 |       0 |   19.56 | 23-65             
  index.ts         |     100 |      100 |     100 |     100 |                   
 ...mponents/hooks |   68.64 |    69.07 |   69.56 |   68.64 |                   
  ...etailStep.tsx |   74.68 |    66.66 |   66.66 |   74.68 | ...71-184,188-201 
  ...etailStep.tsx |    87.4 |    73.68 |     100 |    87.4 | 41-42,99-113,119  
  ...abledStep.tsx |     100 |      100 |     100 |     100 |                   
  ...sListStep.tsx |     100 |      100 |     100 |     100 |                   
  ...entDialog.tsx |   34.35 |    47.05 |   42.85 |   34.35 | ...77,481-494,498 
  constants.ts     |     100 |      100 |     100 |     100 |                   
  index.ts         |       0 |        0 |       0 |       0 | 1-13              
  types.ts         |     100 |      100 |     100 |     100 |                   
 ...components/mcp |   20.83 |    83.72 |   83.33 |   20.83 |                   
  ...ealthPill.tsx |   68.42 |    85.71 |     100 |   68.42 | 40-46             
  ...entDialog.tsx |    3.64 |      100 |       0 |    3.64 | 41-717            
  constants.ts     |     100 |      100 |     100 |     100 |                   
  index.ts         |       0 |        0 |       0 |       0 | 1-30              
  types.ts         |     100 |      100 |     100 |     100 |                   
  utils.ts         |   94.79 |    85.71 |     100 |   94.79 | 16,20,35,109-110  
 ...ents/mcp/steps |    6.88 |      100 |       0 |    6.88 |                   
  ...icateStep.tsx |    5.88 |      100 |       0 |    5.88 | 40-55,58-296      
  ...electStep.tsx |   10.95 |      100 |       0 |   10.95 | 16-88             
  ...etailStep.tsx |    5.26 |      100 |       0 |    5.26 | 31-247            
  ...rListStep.tsx |    5.88 |      100 |       0 |    5.88 | 20-176            
  ...etailStep.tsx |   10.41 |      100 |       0 |   10.41 | ...1,67-79,82-139 
  ToolListStep.tsx |    7.14 |      100 |       0 |    7.14 | 16-146            
 ...nents/messages |   79.61 |    80.23 |   70.83 |   79.61 |                   
  ...ionDialog.tsx |   77.35 |    74.54 |    62.5 |   77.35 | ...90,508,526-528 
  BtwMessage.tsx   |     100 |      100 |     100 |     100 |                   
  ...upDisplay.tsx |   97.67 |    83.72 |     100 |   97.67 | 119,142,150       
  ...onMessage.tsx |   91.93 |    82.35 |     100 |   91.93 | 57-59,61,63       
  ...nMessages.tsx |   79.06 |      100 |      70 |   79.06 | ...51-264,268-280 
  DiffRenderer.tsx |   93.19 |    86.17 |     100 |   93.19 | ...09,237-238,304 
  ...tsDisplay.tsx |   97.82 |    77.27 |     100 |   97.82 | 87,89             
  ...usMessage.tsx |    6.52 |      100 |       0 |    6.52 | 24-141            
  ...ssMessage.tsx |    12.5 |      100 |       0 |    12.5 | 18-59             
  ...edMessage.tsx |   16.66 |      100 |       0 |   16.66 | 22-38             
  ...sMessages.tsx |   55.67 |       40 |   28.57 |   55.67 | ...20-125,133-145 
  ...ryMessage.tsx |   14.28 |      100 |       0 |   14.28 | 23-62             
  ...onMessage.tsx |   81.02 |    69.23 |   33.33 |   81.02 | ...24-426,433-435 
  ...upMessage.tsx |      84 |    93.61 |     100 |      84 | ...56-383,405-420 
  ToolMessage.tsx  |   88.84 |    75.71 |    92.3 |   88.84 | ...44-749,776-778 
 ...ponents/shared |   82.37 |    77.36 |   92.75 |   82.37 |                   
  ...ctionList.tsx |   99.03 |    95.65 |     100 |   99.03 | 85                
  ...tonSelect.tsx |     100 |      100 |     100 |     100 |                   
  EnumSelector.tsx |     100 |    96.42 |     100 |     100 | 58                
  MaxSizedBox.tsx  |   83.01 |    86.25 |   88.88 |   83.01 | ...12-513,618-619 
  MultiSelect.tsx  |    6.29 |      100 |       0 |    6.29 | 35-42,45-176      
  ...tonSelect.tsx |     100 |      100 |     100 |     100 |                   
  ...eSelector.tsx |     100 |       60 |     100 |     100 | 40-45             
  TextInput.tsx    |   72.98 |    55.55 |      80 |   72.98 | ...08-212,224-230 
  ...apsedTime.tsx |     100 |      100 |     100 |     100 |                   
  ...Indicator.tsx |     100 |      100 |     100 |     100 |                   
  text-buffer.ts   |   83.62 |    75.62 |   97.61 |   83.62 | ...2272,2300,2368 
  ...er-actions.ts |   86.71 |    67.79 |     100 |   86.71 | ...07-608,809-811 
 ...ents/subagents |   30.87 |        0 |       0 |   30.87 |                   
  constants.ts     |     100 |      100 |     100 |     100 |                   
  index.ts         |       0 |        0 |       0 |       0 | 1-11              
  reducers.tsx     |    12.1 |      100 |       0 |    12.1 | 33-190            
  types.ts         |     100 |      100 |     100 |     100 |                   
  utils.ts         |   10.95 |      100 |       0 |   10.95 | ...1,56-57,60-102 
 ...bagents/create |    9.13 |      100 |       0 |    9.13 |                   
  ...ionWizard.tsx |    7.28 |      100 |       0 |    7.28 | 34-299            
  ...rSelector.tsx |   14.75 |      100 |       0 |   14.75 | 26-85             
  ...onSummary.tsx |    4.26 |      100 |       0 |    4.26 | 27-331            
  ...tionInput.tsx |    8.63 |      100 |       0 |    8.63 | 23-177            
  ...dSelector.tsx |   33.33 |      100 |       0 |   33.33 | 20-21,26-27,36-63 
  ...nSelector.tsx |    37.5 |      100 |       0 |    37.5 | 20-21,26-27,36-58 
  ...EntryStep.tsx |   12.76 |      100 |       0 |   12.76 | 34-78             
  ToolSelector.tsx |    4.16 |      100 |       0 |    4.16 | 31-253            
 ...bagents/manage |    8.39 |      100 |       0 |    8.39 |                   
  ...ctionStep.tsx |   10.25 |      100 |       0 |   10.25 | 21-103            
  ...eleteStep.tsx |   20.93 |      100 |       0 |   20.93 | 23-62             
  ...tEditStep.tsx |   25.53 |      100 |       0 |   25.53 | ...2,37-38,51-124 
  ...ctionStep.tsx |    2.29 |      100 |       0 |    2.29 | 28-449            
  ...iewerStep.tsx |   13.72 |      100 |       0 |   13.72 | 18-73             
  ...gerDialog.tsx |    6.74 |      100 |       0 |    6.74 | 35-341            
 ...mponents/views |   42.16 |    69.23 |   21.42 |   42.16 |                   
  ContextUsage.tsx |     4.7 |      100 |       0 |     4.7 | ...52-167,170-456 
  DoctorReport.tsx |     9.8 |      100 |       0 |     9.8 | 25-54,57-131      
  ...sionsList.tsx |   87.69 |    73.68 |     100 |   87.69 | 65-72             
  McpStatus.tsx    |   89.53 |    60.52 |     100 |   89.53 | ...72,175-177,262 
  SkillsList.tsx   |   27.27 |      100 |       0 |   27.27 | 18-35             
  ToolsList.tsx    |     100 |      100 |     100 |     100 |                   
 src/ui/contexts   |   77.05 |    78.24 |   82.14 |   77.05 |                   
  ...ewContext.tsx |   65.77 |      100 |      75 |   65.77 | ...22-225,231-241 
  AppContext.tsx   |      80 |       50 |     100 |      80 | 19-20             
  ...ewContext.tsx |   93.37 |    68.57 |      50 |   93.37 | ...94-195,222-226 
  ...deContext.tsx |     100 |      100 |     100 |     100 |                   
  ...igContext.tsx |   81.81 |       50 |     100 |   81.81 | 15-16             
  ...ssContext.tsx |   81.88 |    82.26 |     100 |   81.88 | ...1153,1159-1161 
  ...owContext.tsx |   89.28 |       80 |   66.66 |   89.28 | 34,47-48,60-62    
  ...deContext.tsx |     100 |      100 |      50 |     100 |                   
  ...onContext.tsx |   43.28 |     62.5 |    62.5 |   43.28 | ...56-259,263-266 
  ...gsContext.tsx |   83.33 |       50 |     100 |   83.33 | 17-18             
  ...usContext.tsx |     100 |      100 |     100 |     100 |                   
  ...ngContext.tsx |   71.42 |       50 |     100 |   71.42 | 17-20             
  ...utContext.tsx |   85.71 |      100 |   66.66 |   85.71 | 13-14             
  ...nsContext.tsx |   88.23 |       50 |     100 |   88.23 | 109-110           
  ...teContext.tsx |   86.66 |       50 |     100 |   86.66 | 173-174           
  ...deContext.tsx |   76.08 |    72.72 |     100 |   76.08 | 47-48,52-59,77-78 
 src/ui/editors    |   93.33 |    85.71 |   66.66 |   93.33 |                   
  ...ngsManager.ts |   93.33 |    85.71 |   66.66 |   93.33 | 49,63-64          
 src/ui/hooks      |   81.96 |    82.12 |   86.53 |   81.96 |                   
  ...dProcessor.ts |   83.12 |    82.56 |     100 |   83.12 | ...88-389,408-435 
  keyToAnsi.ts     |    3.92 |      100 |       0 |    3.92 | 19-77             
  ...dProcessor.ts |    94.8 |    70.58 |     100 |    94.8 | ...76-277,282-283 
  ...dProcessor.ts |   75.72 |    63.19 |   61.53 |   75.72 | ...74,898,917-921 
  ...amingState.ts |   12.22 |      100 |       0 |   12.22 | 54-157            
  ...agerDialog.ts |   88.23 |      100 |     100 |   88.23 | 20,24             
  ...ationFrame.ts |      32 |       60 |     100 |      32 | 42-44,51-90       
  ...odeCommand.ts |   58.82 |      100 |     100 |   58.82 | 28,33-48          
  ...enaCommand.ts |      85 |      100 |     100 |      85 | 23-24,29          
  ...aInProcess.ts |   19.81 |    66.66 |      25 |   19.81 | 57-175            
  ...Completion.ts |   92.77 |    89.09 |     100 |   92.77 | ...86-187,220-223 
  ...ifications.ts |   92.07 |    96.29 |     100 |   92.07 | 116-124           
  ...tIndicator.ts |     100 |    93.75 |     100 |     100 | 63                
  ...waySummary.ts |   96.22 |    69.69 |     100 |   96.22 | 125-127,169       
  ...ndTaskView.ts |   94.11 |    76.92 |     100 |   94.11 | 119-123,216,222   
  ...ketedPaste.ts |    23.8 |      100 |       0 |    23.8 | 19-37             
  ...nchCommand.ts |   94.36 |    74.35 |     100 |   94.36 | ...60,168-169,209 
  ...ompletion.tsx |   95.95 |    82.75 |     100 |   95.95 | ...22-223,225-226 
  ...dMigration.ts |   90.62 |       75 |     100 |   90.62 | 38-40             
  useCompletion.ts |    92.4 |     87.5 |     100 |    92.4 | 68-69,93-94,98-99 
  ...nitMessage.ts |     100 |      100 |     100 |     100 |                   
  ...extualTips.ts |   76.92 |       50 |     100 |   76.92 | 55,68,71-75,88-96 
  ...eteCommand.ts |   78.53 |    88.57 |     100 |   78.53 | ...96-104,112-113 
  ...ialogClose.ts |   16.66 |      100 |     100 |   16.66 | 79-139            
  ...oublePress.ts |   53.12 |       75 |     100 |   53.12 | 33-35,41-54       
  ...orSettings.ts |     100 |      100 |     100 |     100 |                   
  ...Completion.ts |   99.12 |     97.7 |     100 |   99.12 | 182-183           
  ...ionUpdates.ts |   93.45 |     92.3 |     100 |   93.45 | ...83-287,300-306 
  ...agerDialog.ts |   88.88 |      100 |     100 |   88.88 | 21,25             
  ...backDialog.ts |   54.47 |       50 |   33.33 |   54.47 | ...69-171,193-194 
  useFocus.ts      |     100 |      100 |     100 |     100 |                   
  ...olderTrust.ts |     100 |      100 |     100 |     100 |                   
  ...ggestions.tsx |   89.15 |     62.5 |      50 |   89.15 | ...22-124,149-150 
  ...miniStream.ts |   76.91 |       74 |   91.66 |   76.91 | ...2462,2475-2483 
  ...BranchName.ts |    90.9 |     92.3 |     100 |    90.9 | 19-20,55-58       
  ...oryManager.ts |   93.15 |    93.75 |     100 |   93.15 | 44,107-110        
  ...ooksDialog.ts |    87.5 |      100 |     100 |    87.5 | 19,23             
  ...stListener.ts |     100 |      100 |     100 |     100 |                   
  ...nAuthError.ts |   76.19 |       50 |     100 |   76.19 | 39-40,43-45       
  ...putHistory.ts |   92.59 |    85.71 |     100 |   92.59 | 63-64,72,94-96    
  ...storyStore.ts |     100 |    94.11 |     100 |     100 | 69                
  useKeypress.ts   |     100 |      100 |     100 |     100 |                   
  ...rdProtocol.ts |   36.36 |      100 |       0 |   36.36 | 24-31             
  ...unchEditor.ts |    9.67 |      100 |       0 |    9.67 | 11-32,39-90       
  ...gIndicator.ts |     100 |      100 |     100 |     100 |                   
  useLogger.ts     |   21.05 |      100 |       0 |   21.05 | 15-37             
  useMCPHealth.ts  |   63.15 |       75 |      50 |   63.15 | 42-52,64-67       
  ...elsCommand.ts |     100 |      100 |     100 |     100 |                   
  useMcpDialog.ts  |    87.5 |      100 |     100 |    87.5 | 19,23             
  ...moryDialog.ts |    87.5 |      100 |     100 |    87.5 | 19,23             
  ...oryMonitor.ts |     100 |      100 |     100 |     100 |                   
  ...ssageQueue.ts |     100 |      100 |     100 |     100 |                   
  ...delCommand.ts |     100 |       75 |     100 |     100 | 22                
  ...raseCycler.ts |   84.74 |    76.47 |     100 |   84.74 | ...49,52-53,69-71 
  ...derUpdates.ts |   86.38 |    77.19 |     100 |   86.38 | ...22,281-293,341 
  useQwenAuth.ts   |     100 |      100 |     100 |     100 |                   
  ...lScheduler.ts |    84.7 |    93.33 |     100 |    84.7 | ...71-276,372-382 
  ...oryCommand.ts |       0 |        0 |       0 |       0 | 1-7               
  ...umeCommand.ts |   97.08 |    83.33 |     100 |   97.08 | 103-104,133       
  ...ompletion.tsx |   90.59 |    83.33 |     100 |   90.59 | ...01,104,137-140 
  ...ectionList.ts |   96.96 |    95.69 |     100 |   96.96 | ...82-183,237-240 
  ...sionPicker.ts |   85.67 |    81.37 |     100 |   85.67 | ...25-527,536-538 
  ...earchInput.ts |     100 |      100 |     100 |     100 |                   
  ...ngsCommand.ts |   18.75 |      100 |       0 |   18.75 | 10-25             
  ...ellHistory.ts |   91.74 |    79.41 |     100 |   91.74 | ...74,122-123,133 
  ...oryCommand.ts |       0 |        0 |       0 |       0 | 1-73              
  ...Completion.ts |   82.67 |    85.41 |   94.73 |   82.67 | ...68-670,678-714 
  ...tateAndRef.ts |     100 |      100 |     100 |     100 |                   
  useStatusLine.ts |     100 |    98.79 |     100 |     100 | 257               
  ...eateDialog.ts |   88.23 |      100 |     100 |   88.23 | 14,18             
  ...tification.ts |     100 |    85.71 |     100 |     100 | 47                
  ...alProgress.ts |   53.06 |       50 |   66.66 |   53.06 | ...53,61-68,79-85 
  ...rminalSize.ts |   76.19 |      100 |      50 |   76.19 | 21-25             
  ...emeCommand.ts |   67.01 |    29.41 |     100 |   67.01 | ...10-111,115-116 
  useTimer.ts      |   88.09 |    85.71 |     100 |   88.09 | 44-45,51-53       
  ...lMigration.ts |       0 |        0 |       0 |       0 |                   
  ...rustModify.ts |     100 |      100 |     100 |     100 |                   
  ...elcomeBack.ts |   87.36 |     90.9 |     100 |   87.36 | ...,94-96,114-115 
  vim.ts           |   83.77 |    80.31 |     100 |   83.77 | ...55,759-767,776 
 src/ui/layouts    |   89.72 |     87.5 |     100 |   89.72 |                   
  ...AppLayout.tsx |   89.88 |     87.5 |     100 |   89.88 | 51-53,93-98       
  ...AppLayout.tsx |   89.47 |     87.5 |     100 |   89.47 | 58-63             
 ...i/manageModels |   93.61 |       48 |     100 |   93.61 |                   
  manageModels.ts  |   93.61 |       48 |     100 |   93.61 | ...63-166,179,209 
 src/ui/models     |   80.24 |    79.16 |   71.42 |   80.24 |                   
  ...ableModels.ts |   80.24 |    79.16 |   71.42 |   80.24 | ...,61-71,123-125 
 ...noninteractive |     100 |      100 |    7.14 |     100 |                   
  ...eractiveUi.ts |     100 |      100 |    7.14 |     100 |                   
 src/ui/state      |   94.91 |    81.81 |     100 |   94.91 |                   
  extensions.ts    |   94.91 |    81.81 |     100 |   94.91 | 68-69,88          
 src/ui/themes     |   98.53 |    70.58 |     100 |   98.53 |                   
  ansi-light.ts    |     100 |      100 |     100 |     100 |                   
  ansi.ts          |     100 |      100 |     100 |     100 |                   
  atom-one-dark.ts |     100 |      100 |     100 |     100 |                   
  ayu-light.ts     |     100 |      100 |     100 |     100 |                   
  ayu.ts           |     100 |      100 |     100 |     100 |                   
  color-utils.ts   |     100 |      100 |     100 |     100 |                   
  default-light.ts |     100 |      100 |     100 |     100 |                   
  default.ts       |     100 |      100 |     100 |     100 |                   
  ...inal-theme.ts |   88.59 |    85.96 |     100 |   88.59 | ...57-261,266-270 
  dracula.ts       |     100 |      100 |     100 |     100 |                   
  github-dark.ts   |     100 |      100 |     100 |     100 |                   
  github-light.ts  |     100 |      100 |     100 |     100 |                   
  googlecode.ts    |     100 |      100 |     100 |     100 |                   
  no-color.ts      |     100 |      100 |     100 |     100 |                   
  qwen-dark.ts     |     100 |      100 |     100 |     100 |                   
  qwen-light.ts    |     100 |      100 |     100 |     100 |                   
  ...tic-tokens.ts |     100 |      100 |     100 |     100 |                   
  ...-of-purple.ts |     100 |      100 |     100 |     100 |                   
  theme-manager.ts |   87.98 |    82.89 |     100 |   87.98 | ...48-357,362-363 
  theme.ts         |     100 |    38.02 |     100 |     100 | ...34-449,457-461 
  xcode.ts         |     100 |      100 |     100 |     100 |                   
 src/ui/utils      |   83.91 |    82.92 |   92.56 |   83.91 |                   
  ...Colorizer.tsx |   79.53 |    83.78 |     100 |   79.53 | ...51-152,249-275 
  ...nRenderer.tsx |   68.83 |    70.14 |      50 |   68.83 | ...52-254,274-293 
  ...wnDisplay.tsx |   86.01 |    87.41 |     100 |   86.01 | ...87,704,729-754 
  ...idDiagram.tsx |   87.79 |    95.34 |     100 |   87.79 | 156-179           
  ...eRenderer.tsx |   92.08 |    80.45 |      95 |   92.08 | ...76-679,723-728 
  ...dWorkUtils.ts |     100 |      100 |     100 |     100 |                   
  ...boardUtils.ts |   59.61 |    58.82 |     100 |   59.61 | ...,86-88,107-149 
  commandUtils.ts  |    95.9 |    88.29 |     100 |    95.9 | ...62,164-165,289 
  computeStats.ts  |     100 |      100 |     100 |     100 |                   
  customBanner.ts  |   90.68 |    91.22 |     100 |   90.68 | ...13,324-327,334 
  displayUtils.ts  |   88.37 |    72.22 |     100 |   88.37 | 23,25,29,31,33    
  formatters.ts    |   95.23 |    98.27 |     100 |   95.23 | 117-120           
  gradientUtils.ts |     100 |      100 |     100 |     100 |                   
  highlight.ts     |     100 |      100 |     100 |     100 |                   
  ...oryMapping.ts |     100 |    94.28 |     100 |     100 | 29,51             
  historyUtils.ts  |   94.11 |       94 |     100 |   94.11 | 94-97             
  isNarrowWidth.ts |     100 |      100 |     100 |     100 |                   
  ...olDetector.ts |    8.23 |      100 |       0 |    8.23 | ...31-132,135-136 
  latexRenderer.ts |   94.95 |     73.8 |     100 |   94.95 | ...76-178,184-187 
  layoutUtils.ts   |     100 |      100 |     100 |     100 |                   
  ...ightLoader.ts |     100 |    89.47 |     100 |     100 | 81,110            
  ...nUtilities.ts |   69.84 |    85.71 |     100 |   69.84 | 75-91,100-101     
  ...ToolGroups.ts |   98.66 |    96.77 |     100 |   98.66 | 48-49             
  ...geRenderer.ts |   86.23 |    69.06 |   95.12 |   86.23 | ...1284,1324-1330 
  ...alRenderer.ts |   86.69 |     71.9 |     100 |   86.69 | ...1476,1513-1519 
  ...lsBySource.ts |     100 |    95.23 |     100 |     100 | 84                
  osc8.ts          |   94.71 |    87.41 |     100 |   94.71 | ...43,428,432-433 
  ...mConstants.ts |     100 |      100 |     100 |     100 |                   
  restoreGoal.ts   |   98.98 |    97.14 |     100 |   98.98 | 94                
  ...storyUtils.ts |   61.06 |    69.62 |      90 |   61.06 | ...64,412,417-439 
  ...ickerUtils.ts |     100 |      100 |     100 |     100 |                   
  ...izedOutput.ts |   94.94 |      100 |   88.88 |   94.94 | 112-117           
  ...wOptimizer.ts |     100 |    96.77 |     100 |     100 | 69                
  terminalSetup.ts |    4.37 |      100 |       0 |    4.37 | 44-393            
  textUtils.ts     |   97.35 |    94.38 |   91.66 |   97.35 | ...50-251,386-387 
  todoSnapshot.ts  |   89.11 |    93.33 |     100 |   89.11 | ...,66-78,180-181 
  updateCheck.ts   |     100 |    80.95 |     100 |     100 | 30-42             
 ...i/utils/export |   56.77 |     40.8 |   79.41 |   56.77 |                   
  collect.ts       |   55.92 |    50.58 |   86.36 |   55.92 | ...25-640,642-647 
  index.ts         |     100 |      100 |     100 |     100 |                   
  normalize.ts     |   57.47 |    20.51 |      80 |   57.47 | ...09-310,324-359 
  types.ts         |       0 |        0 |       0 |       0 | 1                 
  utils.ts         |      40 |      100 |       0 |      40 | 11-13             
 ...ort/formatters |    3.38 |      100 |       0 |    3.38 |                   
  html.ts          |    9.61 |      100 |       0 |    9.61 | ...28,34-76,82-84 
  json.ts          |      50 |      100 |       0 |      50 | 14-15             
  jsonl.ts         |     3.5 |      100 |       0 |     3.5 | 14-76             
  markdown.ts      |    0.94 |      100 |       0 |    0.94 | 13-295            
 src/utils         |   75.78 |    89.55 |   93.33 |   75.78 |                   
  acpModelUtils.ts |     100 |      100 |     100 |     100 |                   
  apiPreconnect.ts |   96.72 |    97.14 |     100 |   96.72 | 165-168           
  checks.ts        |   33.33 |      100 |       0 |   33.33 | 23-28             
  cleanup.ts       |   84.12 |    93.33 |      80 |   84.12 | 75,106-115        
  commands.ts      |     100 |      100 |     100 |     100 |                   
  commentJson.ts   |   87.17 |    90.47 |     100 |   87.17 | 64-73             
  ...Calculator.ts |     100 |      100 |     100 |     100 |                   
  deepMerge.ts     |     100 |       90 |     100 |     100 | 41-43,49          
  ...ScopeUtils.ts |   97.56 |    88.88 |     100 |   97.56 | 67                
  doctorChecks.ts  |   71.06 |       75 |     100 |   71.06 | ...95-301,325-341 
  ...putCapture.ts |   90.65 |    86.17 |     100 |   90.65 | ...72,370,372-373 
  ...arResolver.ts |   94.28 |       88 |     100 |   94.28 | 28-29,125-126     
  errors.ts        |   98.67 |    96.36 |     100 |   98.67 | 67-68             
  events.ts        |     100 |      100 |     100 |     100 |                   
  gitUtils.ts      |   91.91 |    84.61 |     100 |   91.91 | 78-81,124-127     
  ...AutoUpdate.ts |   90.76 |    93.33 |   88.88 |   90.76 | 103-114           
  ...lationInfo.ts |     100 |      100 |     100 |     100 |                   
  languageUtils.ts |   97.89 |    96.42 |     100 |   97.89 | 132-133           
  math.ts          |       0 |        0 |       0 |       0 | 1-15              
  ...iagnostics.ts |   94.57 |    83.01 |   88.88 |   94.57 | ...05,311,315-317 
  ...onfigUtils.ts |     100 |      100 |     100 |     100 |                   
  ...iveHelpers.ts |   96.79 |    93.28 |     100 |   96.79 | ...76-477,575,588 
  osc.ts           |    97.5 |      100 |   88.88 |    97.5 | 195-196           
  package.ts       |   88.88 |       80 |     100 |   88.88 | 33-34             
  processUtils.ts  |     100 |      100 |     100 |     100 |                   
  readStdin.ts     |   79.62 |       90 |      80 |   79.62 | 33-40,52-54       
  relaunch.ts      |   98.07 |    76.92 |     100 |   98.07 | 70                
  resolvePath.ts   |   66.66 |       25 |     100 |   66.66 | 12-13,16,18-19    
  sandbox.ts       |       0 |        0 |       0 |       0 | 1-1047            
  settingsUtils.ts |   82.89 |    90.67 |   89.47 |   82.89 | ...52-663,670-678 
  spawnWrapper.ts  |     100 |      100 |     100 |     100 |                   
  ...upProfiler.ts |   98.46 |    94.52 |     100 |   98.46 | 130-131,305       
  ...upWarnings.ts |     100 |      100 |     100 |     100 |                   
  stdioHelpers.ts  |     100 |       60 |     100 |     100 | 23,32             
  systemInfo.ts    |   92.52 |     90.9 |   83.33 |   92.52 | 63-69,184         
  ...InfoFields.ts |    87.5 |     64.1 |     100 |    87.5 | ...21-122,143-144 
  ...iffPreview.ts |   94.11 |    83.33 |     100 |   94.11 | 13                
  ...entEmitter.ts |     100 |      100 |     100 |     100 |                   
  ...upWarnings.ts |   91.17 |    82.35 |     100 |   91.17 | 67-68,73-74,77-78 
  version.ts       |     100 |       50 |     100 |     100 | 11                
  windowTitle.ts   |     100 |      100 |     100 |     100 |                   
  ...WithBackup.ts |   63.15 |    81.25 |     100 |   63.15 | 93,118-157        
-------------------|---------|----------|---------|---------|-------------------
Core Package - Full Text Report
-------------------|---------|----------|---------|---------|-------------------
File               | % Stmts | % Branch | % Funcs | % Lines | Uncovered Line #s 
-------------------|---------|----------|---------|---------|-------------------
All files          |   78.68 |    82.76 |   81.58 |   78.68 |                   
 src               |     100 |      100 |     100 |     100 |                   
  index.ts         |     100 |      100 |     100 |     100 |                   
 src/__mocks__/fs  |       0 |        0 |       0 |       0 |                   
  promises.ts      |       0 |        0 |       0 |       0 | 1-48              
 src/agents        |   86.11 |    76.88 |   91.66 |   86.11 |                   
  ...transcript.ts |   88.92 |    76.66 |     100 |   88.92 | ...82,306-307,438 
  ...ent-resume.ts |   81.23 |    69.89 |   77.41 |   81.23 | ...1021,1024-1026 
  ...ound-tasks.ts |   95.13 |    86.61 |     100 |   95.13 | ...06-707,733-734 
  index.ts         |     100 |      100 |     100 |     100 |                   
 src/agents/arena  |   76.54 |    66.87 |   78.72 |   76.54 |                   
  ...gentClient.ts |   79.47 |    88.88 |   81.81 |   79.47 | ...68-183,189-204 
  ArenaManager.ts  |   75.37 |    63.37 |   78.26 |   75.37 | ...1860,1866-1867 
  arena-events.ts  |   64.44 |      100 |      50 |   64.44 | ...71-175,178-183 
  diff-summary.ts  |    87.5 |    72.34 |     100 |    87.5 | ...32-133,137-138 
  index.ts         |     100 |      100 |     100 |     100 |                   
  types.ts         |     100 |      100 |     100 |     100 |                   
 ...gents/backends |   76.29 |    86.15 |   73.04 |   76.29 |                   
  ITermBackend.ts  |   97.97 |    93.93 |     100 |   97.97 | ...78-180,255,307 
  ...essBackend.ts |   91.25 |    90.62 |   86.66 |   91.25 | ...94,249-269,328 
  TmuxBackend.ts   |    90.7 |    76.55 |   97.36 |    90.7 | ...87,697,743-747 
  detect.ts        |   31.25 |      100 |       0 |   31.25 | 34-88             
  index.ts         |     100 |      100 |     100 |     100 |                   
  iterm-it2.ts     |     100 |     92.1 |     100 |     100 | 37-38,106         
  tmux-commands.ts |    6.64 |      100 |    3.03 |    6.64 | ...93-363,386-503 
  types.ts         |     100 |      100 |     100 |     100 |                   
 ...agents/runtime |   81.14 |     76.7 |   71.42 |   81.14 |                   
  agent-context.ts |     100 |      100 |     100 |     100 |                   
  agent-core.ts    |   76.49 |    72.35 |   60.86 |   76.49 | ...1608,1635-1682 
  agent-events.ts  |     100 |      100 |     100 |     100 |                   
  ...t-headless.ts |   81.19 |    71.73 |   60.86 |   81.19 | ...98-399,402-403 
  ...nteractive.ts |   79.71 |    79.62 |      75 |   79.71 | ...54,456,458,461 
  ...statistics.ts |   98.19 |    82.35 |     100 |   98.19 | 127,151,192,225   
  agent-types.ts   |     100 |      100 |     100 |     100 |                   
  index.ts         |     100 |      100 |     100 |     100 |                   
 src/config        |   77.21 |    79.34 |   63.67 |   77.21 |                   
  config.ts        |    75.3 |    77.48 |   58.93 |    75.3 | ...3397,3408-3420 
  constants.ts     |     100 |      100 |     100 |     100 |                   
  models.ts        |     100 |      100 |     100 |     100 |                   
  storage.ts       |   95.07 |    93.44 |   89.47 |   95.07 | ...66-267,270-271 
 ...nfirmation-bus |   98.29 |    97.14 |     100 |   98.29 |                   
  message-bus.ts   |   98.14 |    97.05 |     100 |   98.14 | 42-43             
  types.ts         |     100 |      100 |     100 |     100 |                   
 src/core          |   85.46 |    82.94 |   89.76 |   85.46 |                   
  baseLlmClient.ts |   92.35 |    80.85 |   86.66 |   92.35 | ...34,342-356,495 
  client.ts        |    81.4 |    80.76 |   84.84 |    81.4 | ...1695,1734-1737 
  ...tGenerator.ts |    72.1 |    61.11 |     100 |    72.1 | ...63,365,372-375 
  ...lScheduler.ts |   81.89 |    82.05 |   93.47 |   81.89 | ...2371,2423-2427 
  geminiChat.ts    |   88.98 |    84.03 |   91.11 |   88.98 | ...1377,1444-1445 
  geminiRequest.ts |     100 |      100 |     100 |     100 |                   
  ...htProtocol.ts |    9.09 |      100 |       0 |    9.09 | 34-42,45-49,52-87 
  logger.ts        |   87.33 |    87.02 |     100 |   87.33 | ...61-565,611-625 
  ...tyDefaults.ts |     100 |      100 |     100 |     100 |                   
  ...olExecutor.ts |   92.59 |       75 |      50 |   92.59 | 41-42             
  ...on-helpers.ts |   85.71 |    70.58 |     100 |   85.71 | ...90-191,205-214 
  ...issionFlow.ts |   98.59 |    94.73 |     100 |   98.59 | 93                
  prompts.ts       |   89.16 |    86.41 |   76.92 |   89.16 | ...-965,1168-1169 
  tokenLimits.ts   |     100 |    89.47 |     100 |     100 | 51-52             
  ...okTriggers.ts |   99.31 |    90.41 |     100 |   99.31 | 124,135           
  turn.ts          |   96.42 |    88.88 |     100 |   96.42 | ...00,413-414,462 
 ...ntentGenerator |   94.92 |    82.59 |   93.87 |   94.92 |                   
  ...tGenerator.ts |   96.48 |    84.28 |   92.59 |   96.48 | ...01,919-923,963 
  converter.ts     |   94.51 |    80.72 |     100 |   94.51 | ...06-607,617,823 
  index.ts         |       0 |        0 |       0 |       0 | 1-21              
  usage.ts         |     100 |      100 |     100 |     100 |                   
 ...ntentGenerator |   91.53 |    71.64 |   93.33 |   91.53 |                   
  ...tGenerator.ts |      90 |    70.96 |   92.85 |      90 | ...80-286,304-305 
  index.ts         |     100 |       80 |     100 |     100 | 50                
 ...ntentGenerator |   91.76 |     80.4 |   90.32 |   91.76 |                   
  index.ts         |     100 |      100 |     100 |     100 |                   
  ...tGenerator.ts |   91.74 |     80.4 |   90.32 |   91.74 | ...51,861-862,890 
 ...ntentGenerator |   81.66 |    84.08 |    90.9 |   81.66 |                   
  constants.ts     |     100 |      100 |     100 |     100 |                   
  converter.ts     |   76.88 |    82.25 |    87.5 |   76.88 | ...1589,1610-1616 
  errorHandler.ts  |     100 |      100 |     100 |     100 |                   
  index.ts         |   52.38 |    44.44 |      50 |   52.38 | ...77,81-85,89-93 
  ...tGenerator.ts |    66.4 |    70.58 |   88.88 |    66.4 | ...51-157,168-169 
  pipeline.ts      |   93.67 |     84.9 |     100 |   93.67 | ...80-481,489,554 
  ...ureContext.ts |     100 |      100 |     100 |     100 |                   
  ...ingOptions.ts |       0 |        0 |       0 |       0 | 1                 
  ...CallParser.ts |   90.66 |    88.57 |     100 |   90.66 | ...15-319,349-350 
  ...kingParser.ts |     100 |    96.87 |     100 |     100 | 42                
  types.ts         |       0 |        0 |       0 |       0 | 1                 
 ...rator/provider |   96.62 |    88.82 |   95.45 |   96.62 |                   
  dashscope.ts     |   97.14 |    89.02 |   93.33 |   97.14 | ...53-254,330-331 
  deepseek.ts      |   95.55 |    90.56 |     100 |   95.55 | ...31-132,145-146 
  default.ts       |   94.62 |    86.36 |   85.71 |   94.62 | 86-87,157-159     
  index.ts         |     100 |      100 |     100 |     100 |                   
  minimax.ts       |     100 |      100 |     100 |     100 |                   
  mistral.ts       |   96.07 |    73.33 |     100 |   96.07 | 32-33             
  modelscope.ts    |     100 |      100 |     100 |     100 |                   
  openrouter.ts    |     100 |      100 |     100 |     100 |                   
  types.ts         |       0 |        0 |       0 |       0 |                   
 src/extension     |   60.56 |    79.46 |    78.4 |   60.56 |                   
  ...-converter.ts |   62.35 |    47.82 |      90 |   62.35 | ...90-791,800-832 
  ...ionManager.ts |   47.04 |    82.06 |    65.9 |   47.04 | ...1398,1408-1427 
  ...onSettings.ts |   93.46 |    93.05 |     100 |   93.46 | ...17-221,228-232 
  ...-converter.ts |   54.88 |    94.44 |      60 |   54.88 | ...35-146,158-192 
  github.ts        |   44.94 |    88.52 |      60 |   44.94 | ...53-359,398-451 
  index.ts         |     100 |      100 |     100 |     100 |                   
  marketplace.ts   |   97.29 |    93.75 |     100 |   97.29 | ...64,184-185,274 
  npm.ts           |   48.66 |    76.08 |      75 |   48.66 | ...18-420,427-431 
  override.ts      |   94.11 |    88.88 |     100 |   94.11 | 63-64,81-82       
  settings.ts      |   66.26 |      100 |      50 |   66.26 | 81-108,143-149    
  storage.ts       |     100 |      100 |     100 |     100 |                   
  ...ableSchema.ts |     100 |      100 |     100 |     100 |                   
  variables.ts     |   88.75 |    83.33 |     100 |   88.75 | ...28-231,234-237 
 src/followup      |   46.91 |     92.3 |   71.87 |   46.91 |                   
  followupState.ts |      96 |    89.74 |     100 |      96 | 159-161,218-219   
  index.ts         |     100 |      100 |     100 |     100 |                   
  overlayFs.ts     |   95.06 |       84 |     100 |   95.06 | 78,108,122,133    
  speculation.ts   |   13.22 |      100 |   16.66 |   13.22 | 88-458,518-568    
  ...onToolGate.ts |     100 |    96.29 |     100 |     100 | 93                
  ...nGenerator.ts |    38.4 |    95.12 |   33.33 |    38.4 | ...16-318,353-383 
 src/generated     |       0 |        0 |       0 |       0 |                   
  git-commit.ts    |       0 |        0 |       0 |       0 | 1-10              
 src/goals         |   87.98 |    80.83 |   93.93 |   87.98 |                   
  ...eGoalStore.ts |   81.57 |    92.85 |   81.81 |   81.57 | ...40-143,151-159 
  goalHook.ts      |   96.59 |    89.74 |     100 |   96.59 | 103-108           
  goalJudge.ts     |    83.4 |    73.13 |     100 |    83.4 | ...23-324,332-334 
  index.ts         |     100 |      100 |     100 |     100 |                   
 src/hooks         |   83.34 |    84.41 |   86.61 |   83.34 |                   
  ...okRegistry.ts |   86.48 |    77.08 |     100 |   86.48 | ...41-344,362-369 
  ...bortSignal.ts |     100 |      100 |     100 |     100 |                   
  ...terpolator.ts |   96.66 |    93.33 |     100 |   96.66 | 66-67             
  ...HookRunner.ts |   96.68 |    87.23 |     100 |   96.68 | 110-112,231-233   
  ...Aggregator.ts |    96.4 |    90.78 |     100 |    96.4 | ...91,293-294,367 
  ...entHandler.ts |   94.56 |    83.78 |   93.33 |   94.56 | ...38,795-796,806 
  hookPlanner.ts   |   84.13 |    76.59 |      90 |   84.13 | ...38,144,162-173 
  hookRegistry.ts  |   90.17 |    83.33 |     100 |   90.17 | ...33,352,356,360 
  hookRunner.ts    |   58.56 |    71.26 |   66.66 |   58.56 | ...48-749,758-759 
  hookSystem.ts    |   84.57 |      100 |   65.85 |   84.57 | ...21-622,628-629 
  ...HookRunner.ts |   75.51 |     61.9 |      80 |   75.51 | ...05-406,424-425 
  index.ts         |     100 |      100 |     100 |     100 |                   
  ...HookRunner.ts |   93.63 |    89.47 |      90 |   93.63 | ...45-353,427-428 
  ...SkillHooks.ts |   78.75 |       75 |   66.66 |   78.75 | 62-66,137-152     
  ...oksManager.ts |   96.66 |    91.66 |     100 |   96.66 | ...90,209-210,223 
  ssrfGuard.ts     |   77.22 |    85.36 |     100 |   77.22 | ...57,261-267,273 
  trustedHooks.ts  |       0 |        0 |       0 |       0 | 1-124             
  types.ts         |   91.18 |     90.9 |   85.71 |   91.18 | ...40-441,501-505 
  urlValidator.ts  |     100 |      100 |     100 |     100 |                   
 src/ide           |   74.28 |    83.39 |   78.33 |   74.28 |                   
  constants.ts     |     100 |      100 |     100 |     100 |                   
  detect-ide.ts    |     100 |      100 |     100 |     100 |                   
  ide-client.ts    |    64.2 |    81.48 |   66.66 |    64.2 | ...9-970,999-1007 
  ide-installer.ts |   89.06 |    79.31 |     100 |   89.06 | ...36,143-147,160 
  ideContext.ts    |     100 |      100 |     100 |     100 |                   
  process-utils.ts |   84.84 |    71.79 |     100 |   84.84 | ...37,151,193-194 
  types.ts         |     100 |      100 |     100 |     100 |                   
 src/lsp           |   33.92 |    44.97 |   45.76 |   33.92 |                   
  ...nfigLoader.ts |   70.27 |    35.89 |   94.73 |   70.27 | ...20-422,426-432 
  ...ionFactory.ts |    4.29 |        0 |       0 |    4.29 | ...20-371,377-394 
  ...Normalizer.ts |   23.09 |    13.72 |   30.43 |   23.09 | ...04-905,909-924 
  ...verManager.ts |   13.52 |    81.25 |   29.16 |   13.52 | ...75-694,700-730 
  ...eLspClient.ts |   17.89 |      100 |       0 |   17.89 | ...37-244,254-258 
  ...LspService.ts |   45.87 |    62.13 |   66.66 |   45.87 | ...1282,1299-1309 
  constants.ts     |     100 |      100 |     100 |     100 |                   
  types.ts         |     100 |      100 |     100 |     100 |                   
 src/mcp           |   78.69 |    75.34 |   75.92 |   78.69 |                   
  constants.ts     |     100 |      100 |     100 |     100 |                   
  ...h-provider.ts |   86.95 |      100 |   33.33 |   86.95 | ...,93,97,101-102 
  ...h-provider.ts |   73.82 |    53.92 |     100 |   73.82 | ...88-895,902-904 
  ...en-storage.ts |   98.62 |    97.72 |     100 |   98.62 | 87-88             
  oauth-utils.ts   |   70.58 |    85.29 |    90.9 |   70.58 | ...70-290,315-344 
  ...n-provider.ts |   89.83 |    95.83 |   45.45 |   89.83 | ...43,147,151-152 
 .../token-storage |   79.52 |    86.66 |   86.36 |   79.52 |                   
  ...en-storage.ts |     100 |      100 |     100 |     100 |                   
  ...en-storage.ts |   82.87 |    82.35 |   92.85 |   82.87 | ...63-173,181-182 
  ...en-storage.ts |     100 |      100 |     100 |     100 |                   
  index.ts         |     100 |      100 |     100 |     100 |                   
  ...en-storage.ts |   68.14 |    82.35 |   64.28 |   68.14 | ...81-295,298-314 
  types.ts         |     100 |      100 |     100 |     100 |                   
 src/memory        |   67.43 |       76 |   65.62 |   67.43 |                   
  const.ts         |     100 |      100 |     100 |     100 |                   
  dream.ts         |   65.65 |    73.33 |      50 |   65.65 | 50,107-148        
  ...entPlanner.ts |   57.84 |    72.72 |   33.33 |   57.84 | ...35,140-147,152 
  entries.ts       |   63.77 |    79.16 |      50 |   63.77 | ...72-180,183-189 
  extract.ts       |    95.2 |    79.16 |     100 |    95.2 | 81-86,125         
  ...entPlanner.ts |   63.08 |    65.71 |   41.17 |   63.08 | ...17,222-223,332 
  ...ionPlanner.ts |       0 |        0 |       0 |       0 | 1                 
  forget.ts        |    45.8 |    61.53 |   44.44 |    45.8 | ...04,211,214-346 
  indexer.ts       |   83.87 |    45.45 |     100 |   83.87 | ...50,56-57,69-70 
  manager.ts       |   75.31 |    81.04 |    75.6 |   75.31 | ...1278,1291-1293 
  memoryAge.ts     |   90.47 |    77.77 |     100 |   90.47 | 50-51             
  paths.ts         |   55.47 |    89.47 |   85.71 |   55.47 | ...,89-90,106-114 
  prompt.ts        |   93.36 |    71.42 |     100 |   93.36 | ...58,161,228-229 
  recall.ts        |   79.56 |    69.38 |   88.88 |   79.56 | ...40-245,269-280 
  ...ceSelector.ts |   91.86 |    77.27 |     100 |   91.86 | ...07,109-110,118 
  scan.ts          |   87.91 |    68.42 |     100 |   87.91 | ...47-48,58,82-87 
  ...entPlanner.ts |    11.5 |      100 |       0 |    11.5 | ...57-192,210-298 
  status.ts        |   10.52 |      100 |       0 |   10.52 | 41-98             
  store.ts         |   94.44 |    83.33 |     100 |   94.44 | 56-57,92-93       
  types.ts         |     100 |      100 |     100 |     100 |                   
 src/mocks         |       0 |        0 |       0 |       0 |                   
  msw.ts           |       0 |        0 |       0 |       0 | 1-9               
 src/models        |   89.31 |    85.55 |    87.5 |   89.31 |                   
  constants.ts     |     100 |      100 |     100 |     100 |                   
  ...tor-config.ts |   90.24 |    91.42 |     100 |   90.24 | 142,148,151-160   
  index.ts         |     100 |      100 |     100 |     100 |                   
  ...nfigErrors.ts |   74.22 |       44 |   84.61 |   74.22 | ...,67-74,106-117 
  ...igResolver.ts |   98.63 |    92.53 |     100 |   98.63 | 161,323,329       
  modelRegistry.ts |     100 |    98.59 |     100 |     100 | 222               
  modelsConfig.ts  |   84.57 |    82.14 |   81.57 |   84.57 | ...1223,1252-1253 
  types.ts         |     100 |      100 |     100 |     100 |                   
 src/output        |     100 |      100 |     100 |     100 |                   
  ...-formatter.ts |     100 |      100 |     100 |     100 |                   
  types.ts         |     100 |      100 |     100 |     100 |                   
 src/permissions   |   71.18 |    88.73 |   48.57 |   71.18 |                   
  index.ts         |     100 |      100 |     100 |     100 |                   
  ...on-manager.ts |   81.42 |    86.66 |      80 |   81.42 | ...29-830,837-846 
  rule-parser.ts   |   95.99 |    93.18 |     100 |   95.99 | ...-864,1013-1015 
  ...-semantics.ts |   58.28 |    85.27 |    30.2 |   58.28 | ...1604-1614,1643 
  types.ts         |     100 |      100 |     100 |     100 |                   
 src/prompts       |   83.63 |      100 |    87.5 |   83.63 |                   
  mcp-prompts.ts   |   18.18 |      100 |       0 |   18.18 | 11-19             
  ...t-registry.ts |     100 |      100 |     100 |     100 |                   
 src/qwen          |   86.01 |    79.48 |   97.18 |   86.01 |                   
  ...tGenerator.ts |   98.64 |    98.18 |     100 |   98.64 | 105-106           
  qwenOAuth2.ts    |   84.99 |    74.81 |   93.33 |   84.99 | ...,985-1001,1031 
  ...kenManager.ts |   83.76 |    76.22 |     100 |   83.76 | ...62-767,788-793 
 src/services      |   85.49 |       85 |   90.41 |   85.49 |                   
  ...ionTrailer.ts |     100 |      100 |     100 |     100 |                   
  ...llRegistry.ts |   97.82 |    94.73 |     100 |   97.82 | 172-173           
  ...ionService.ts |   95.54 |    96.29 |     100 |   95.54 | ...18,386,388-392 
  ...ingService.ts |   84.15 |       84 |   82.85 |   84.15 | ...1241,1258-1259 
  ...ttribution.ts |   91.73 |    87.71 |      90 |   91.73 | ...80-685,826-827 
  ...utSlimming.ts |     100 |    96.77 |     100 |     100 | 133,182           
  cronScheduler.ts |   97.56 |    92.98 |     100 |   97.56 | 62-63,77,155      
  ...eryService.ts |   80.43 |    95.45 |      75 |   80.43 | ...19-134,140-141 
  fileReadCache.ts |     100 |      100 |     100 |     100 |                   
  ...temService.ts |      90 |    84.44 |   88.88 |      90 | ...89,191,269-276 
  ...ratedFiles.ts |      96 |    88.23 |     100 |      96 | 119-120,146-147   
  gitInit.ts       |     100 |      100 |     100 |     100 |                   
  gitService.ts    |   68.75 |     92.3 |   55.55 |   68.75 | ...12-122,125-129 
  ...reeService.ts |   73.79 |       70 |   94.87 |   73.79 | ...1365,1393-1394 
  ...ionService.ts |   98.13 |     97.8 |   95.45 |   98.13 | ...32-333,380-381 
  ...orRegistry.ts |   96.34 |    91.66 |     100 |   96.34 | ...90-391,542-543 
  sessionRecap.ts  |   12.04 |      100 |       0 |   12.04 | 49-160            
  ...ionService.ts |   90.05 |    79.71 |   96.55 |   90.05 | ...1272,1276-1277 
  sessionTitle.ts  |   93.87 |    69.81 |     100 |   93.87 | ...33-236,267-268 
  ...ionService.ts |   83.01 |    78.66 |   87.75 |   83.01 | ...1482,1488-1493 
  ...UseSummary.ts |   94.73 |    87.71 |     100 |   94.73 | ...73-175,225-226 
  ...reeCleanup.ts |   14.56 |      100 |   33.33 |   14.56 | 58-185            
 ...icrocompaction |   97.69 |    89.79 |     100 |   97.69 |                   
  microcompact.ts  |   97.69 |    89.79 |     100 |   97.69 | ...68,229,233,314 
 src/skills        |    87.5 |     83.8 |   94.23 |    87.5 |                   
  index.ts         |     100 |      100 |     100 |     100 |                   
  ...activation.ts |     100 |     93.1 |     100 |     100 | 93,112            
  skill-load.ts    |   92.94 |    81.63 |     100 |   92.94 | ...06,226,238-240 
  skill-manager.ts |   83.31 |    79.66 |   90.32 |   83.31 | ...1120,1127-1131 
  skill-paths.ts   |   86.74 |    77.77 |     100 |   86.74 | ...00-101,106-107 
  symlinkScope.ts  |     100 |      100 |     100 |     100 |                   
  types.ts         |     100 |      100 |     100 |     100 |                   
 src/subagents     |   83.13 |    80.24 |   95.23 |   83.13 |                   
  ...tin-agents.ts |     100 |      100 |     100 |     100 |                   
  index.ts         |     100 |      100 |     100 |     100 |                   
  ...-selection.ts |     100 |      100 |     100 |     100 |                   
  ...nt-manager.ts |   77.21 |    72.09 |   92.85 |   77.21 | ...1180,1202-1203 
  types.ts         |     100 |      100 |     100 |     100 |                   
  validation.ts    |   92.46 |    95.18 |     100 |   92.46 | 51-56,69-74,78-83 
 src/telemetry     |   73.69 |    85.65 |   77.81 |   73.69 |                   
  config.ts        |     100 |      100 |     100 |     100 |                   
  constants.ts     |     100 |      100 |     100 |     100 |                   
  ...-exporters.ts |   46.37 |      100 |   44.44 |   46.37 | ...85,88-89,92-93 
  index.ts         |     100 |      100 |     100 |     100 |                   
  ...t.circular.ts |       0 |        0 |       0 |       0 | 1-111             
  ...-processor.ts |   93.93 |    90.21 |   94.11 |   93.93 | ...75-280,299-300 
  ...t.circular.ts |       0 |        0 |       0 |       0 | 1-128             
  loggers.ts       |    51.9 |       64 |   57.77 |    51.9 | ...1214,1231-1251 
  metrics.ts       |    74.9 |    82.95 |   74.54 |    74.9 | ...58-978,981-992 
  sanitize.ts      |      80 |    83.33 |     100 |      80 | 35-36,41-42       
  sdk.ts           |   90.45 |    83.56 |   76.92 |   90.45 | ...17-318,338-342 
  ...on-context.ts |     100 |      100 |     100 |     100 |                   
  ...on-tracing.ts |    90.5 |    87.23 |     100 |    90.5 | ...66-470,481-484 
  ...etry-utils.ts |     100 |      100 |     100 |     100 |                   
  ...l-decision.ts |     100 |      100 |     100 |     100 |                   
  ...e-id-utils.ts |     100 |      100 |     100 |     100 |                   
  tracer.ts        |   98.61 |    89.36 |     100 |   98.61 | 53,104            
  types.ts         |   79.17 |    85.83 |   83.33 |   79.17 | ...1149,1152-1181 
  uiTelemetry.ts   |   92.97 |    96.96 |   81.25 |   92.97 | ...93-194,200-207 
 ...ry/qwen-logger |   68.24 |    79.56 |   64.91 |   68.24 |                   
  event-types.ts   |       0 |        0 |       0 |       0 |                   
  qwen-logger.ts   |   68.24 |    79.34 |   64.28 |   68.24 | ...1055,1093-1094 
 src/test-utils    |   93.16 |    95.83 |   73.52 |   93.16 |                   
  config.ts        |     100 |      100 |     100 |     100 |                   
  ...st-helpers.ts |   94.11 |       90 |     100 |   94.11 | 69-70             
  index.ts         |     100 |      100 |     100 |     100 |                   
  mock-tool.ts     |   91.19 |    97.05 |   68.96 |   91.19 | ...38,202-203,216 
  ...aceContext.ts |     100 |      100 |     100 |     100 |                   
 src/tools         |   77.54 |    81.38 |      86 |   77.54 |                   
  ...erQuestion.ts |   88.93 |    76.74 |    90.9 |   88.93 | ...39-340,347-348 
  cron-create.ts   |   97.75 |    88.88 |   83.33 |   97.75 | 30-31             
  cron-delete.ts   |   96.82 |      100 |   83.33 |   96.82 | 26-27             
  cron-list.ts     |   96.66 |      100 |   83.33 |   96.66 | 25-26             
  diffOptions.ts   |     100 |      100 |     100 |     100 |                   
  edit.ts          |   78.01 |    84.76 |   73.33 |   78.01 | ...86-687,774-824 
  ...r-worktree.ts |   82.43 |    68.75 |    87.5 |   82.43 | ...67-170,236-237 
  exit-worktree.ts |   83.47 |       84 |    90.9 |   83.47 | ...80-281,286-299 
  exitPlanMode.ts  |   85.09 |    85.71 |     100 |   85.09 | ...60-163,177-189 
  glob.ts          |   90.63 |    88.33 |   84.61 |   90.63 | ...28,171,302,305 
  grep.ts          |   79.19 |    85.71 |   78.94 |   79.19 | ...20,560,569-576 
  ls.ts            |   96.74 |    90.27 |     100 |   96.74 | 176-181,212,216   
  lsp.ts           |   72.77 |    60.09 |   90.32 |   72.77 | ...1211,1213-1214 
  ...nt-manager.ts |   69.73 |    75.29 |   71.42 |   69.73 | ...29-732,749-786 
  mcp-client.ts    |   33.18 |    77.41 |   66.66 |   33.18 | ...1490,1494-1497 
  mcp-tool.ts      |   90.98 |    88.88 |   96.42 |   90.98 | ...95-596,646-647 
  memory-config.ts |       0 |        0 |       0 |       0 | 1-47              
  ...iable-tool.ts |     100 |    84.61 |     100 |     100 | 102,109           
  monitor.ts       |   92.27 |    83.94 |      92 |   92.27 | ...18,547-550,563 
  ...nforcement.ts |   82.44 |       90 |     100 |   82.44 | 174-185,234-247   
  read-file.ts     |   95.07 |     88.6 |      90 |   95.07 | ...99,290-293,296 
  ripGrep.ts       |   94.59 |    85.71 |   93.33 |   94.59 | ...60,463,541-542 
  ...-transport.ts |    6.34 |      100 |       0 |    6.34 | 47-145            
  send-message.ts  |   89.32 |    91.66 |   83.33 |   89.32 | 44-45,68-76       
  shell.ts         |   72.18 |    80.23 |   89.65 |   72.18 | ...3659,3708-3714 
  skill-utils.ts   |     100 |      100 |     100 |     100 |                   
  skill.ts         |   88.11 |    91.17 |   84.61 |   88.11 | ...95,399,422-444 
  ...eticOutput.ts |   95.12 |      100 |      80 |   95.12 | 87-88             
  task-stop.ts     |   93.14 |    96.15 |   85.71 |   93.14 | 39-40,54-64       
  todoWrite.ts     |   89.17 |    82.05 |   92.85 |   89.17 | ...41-546,568-569 
  tool-error.ts    |     100 |      100 |     100 |     100 |                   
  tool-names.ts    |     100 |      100 |     100 |     100 |                   
  tool-registry.ts |   74.79 |       75 |   80.48 |   74.79 | ...92-793,801-802 
  tool-search.ts   |   95.19 |    86.48 |    92.3 |   95.19 | ...47-153,208-213 
  tools.ts         |   91.98 |    90.19 |   88.88 |   91.98 | ...50-451,467-473 
  web-fetch.ts     |   88.59 |    79.48 |    92.3 |   88.59 | ...12-313,315-316 
  write-file.ts    |    79.2 |    79.26 |   83.33 |    79.2 | ...39-642,654-689 
 src/tools/agent   |   73.19 |    82.18 |   74.24 |   73.19 |                   
  agent.ts         |   73.39 |     82.5 |      75 |   73.39 | ...2095,2131-2138 
  fork-subagent.ts |   69.62 |    71.42 |   66.66 |   69.62 | ...04-105,140-151 
 src/utils         |   88.77 |    87.42 |   93.55 |   88.77 |                   
  LruCache.ts      |       0 |        0 |       0 |       0 | 1-41              
  ...ssageQueue.ts |     100 |      100 |     100 |     100 |                   
  ...cFileWrite.ts |   77.96 |    80.48 |     100 |   77.96 | ...35,156,173-176 
  bareMode.ts      |   27.27 |      100 |       0 |   27.27 | 9-15,18-19        
  browser.ts       |    7.69 |      100 |       0 |    7.69 | 17-56             
  bundlePaths.ts   |     100 |      100 |     100 |     100 |                   
  ...igResolver.ts |     100 |      100 |     100 |     100 |                   
  ...engthError.ts |   89.11 |    86.66 |     100 |   89.11 | ...28-129,132-133 
  cronDisplay.ts   |   42.85 |    23.07 |     100 |   42.85 | 26-31,33-45,47-54 
  cronParser.ts    |   89.74 |    85.71 |     100 |   89.74 | ...,63-64,183-186 
  debugLogger.ts   |    95.9 |    93.84 |   94.73 |    95.9 | 106-107,214-218   
  editHelper.ts    |   93.63 |    83.52 |     100 |   93.63 | ...28-429,463-464 
  editor.ts        |   97.61 |    95.71 |     100 |   97.61 | ...70-271,273-274 
  ...arResolver.ts |   94.28 |    88.88 |     100 |   94.28 | 28-29,125-126     
  ...entContext.ts |     100 |    95.45 |     100 |     100 | 83                
  errorParsing.ts  |    97.7 |    97.05 |     100 |    97.7 | 72-73             
  ...rReporting.ts |   88.46 |       90 |     100 |   88.46 | 69-74             
  errors.ts        |   70.92 |    79.59 |   53.33 |   70.92 | ...03-219,223-229 
  fetch.ts         |   70.18 |    71.42 |   71.42 |   70.18 | ...42,148,161,186 
  fileUtils.ts     |   91.41 |    86.07 |      95 |   91.41 | ...1182,1186-1192 
  forkedAgent.ts   |    78.5 |    70.73 |   85.71 |    78.5 | ...30-436,441-447 
  formatters.ts    |   54.54 |       50 |     100 |   54.54 | 12-16             
  ...eUtilities.ts |   89.21 |    86.66 |     100 |   89.21 | 16-17,49-55,65-66 
  ...rStructure.ts |   94.36 |    94.28 |     100 |   94.36 | ...17-120,330-335 
  getPty.ts        |    12.5 |      100 |       0 |    12.5 | 21-34             
  gitDiff.ts       |   92.36 |    79.53 |     100 |   92.36 | ...55-856,928-929 
  ...noreParser.ts |    92.3 |    89.36 |     100 |    92.3 | ...15-116,186-187 
  gitUtils.ts      |   56.66 |    85.71 |      75 |   56.66 | ...2,72-73,97-148 
  iconvHelper.ts   |     100 |      100 |     100 |     100 |                   
  ...rePatterns.ts |     100 |      100 |     100 |     100 |                   
  ...ionManager.ts |     100 |     90.9 |     100 |     100 | 26                
  ...lPromptIds.ts |     100 |      100 |     100 |     100 |                   
  jsonl-utils.ts   |    74.1 |    90.76 |   58.33 |    74.1 | ...23-326,336-342 
  ...-detection.ts |     100 |      100 |     100 |     100 |                   
  ...yDiscovery.ts |    83.9 |    79.36 |     100 |    83.9 | ...16,319,411-414 
  ...tProcessor.ts |   93.63 |       90 |     100 |   93.63 | ...96-302,384-385 
  ...Inspectors.ts |   61.53 |      100 |      50 |   61.53 | 18-23             
  modelId.ts       |   98.55 |    96.87 |     100 |   98.55 | 103               
  ...kerChecker.ts |   82.55 |    78.57 |     100 |   82.55 | 68-69,79-84,92-98 
  notebook.ts      |   94.35 |    84.78 |     100 |   94.35 | ...10,122,174-176 
  openaiLogger.ts  |   88.05 |    84.09 |     100 |   88.05 | ...44-146,169-174 
  partUtils.ts     |     100 |    98.61 |     100 |     100 | 206               
  pathReader.ts    |     100 |      100 |     100 |     100 |                   
  paths.ts         |   93.21 |    91.86 |     100 |   93.21 | ...89-390,392-394 
  pdf.ts           |   93.68 |    87.05 |     100 |   93.68 | ...96-297,321-325 
  projectPath.ts   |     100 |      100 |     100 |     100 |                   
  ...ectSummary.ts |   89.39 |    72.41 |     100 |   89.39 | ...37-142,193-196 
  ...tIdContext.ts |     100 |      100 |     100 |     100 |                   
  proxyUtils.ts    |     100 |      100 |     100 |     100 |                   
  ...rDetection.ts |   58.57 |       76 |     100 |   58.57 | ...4,88-89,95-100 
  ...noreParser.ts |   85.45 |    85.18 |     100 |   85.45 | ...59,65-66,72-73 
  rateLimit.ts     |   92.55 |    85.92 |     100 |   92.55 | ...70-272,309-310 
  readManyFiles.ts |   87.96 |    86.95 |     100 |   87.96 | ...05-207,223-234 
  retry.ts         |   89.81 |    88.05 |     100 |   89.81 | ...29,350,357-358 
  ripgrepUtils.ts  |   46.79 |    84.37 |   66.66 |   46.79 | ...45-246,258-335 
  ...sDiscovery.ts |   97.42 |    92.85 |     100 |   97.42 | ...04,182-183,202 
  ...tchOptions.ts |   81.72 |    85.04 |   95.23 |   81.72 | ...11,536,565-574 
  runtimeStatus.ts |    97.5 |    88.57 |     100 |    97.5 | 167-168           
  safeJsonParse.ts |   74.07 |    83.33 |     100 |   74.07 | 40-46             
  ...nStringify.ts |     100 |      100 |     100 |     100 |                   
  ...aConverter.ts |   90.78 |    88.23 |     100 |   90.78 | ...41-42,93,95-96 
  ...aValidator.ts |   94.57 |    80.26 |     100 |   94.57 | ...04,213-216,270 
  ...r-launcher.ts |   76.92 |     91.3 |   66.66 |   76.92 | ...34,136,157-195 
  ...orageUtils.ts |   96.89 |    85.84 |     100 |   96.89 | ...51,367,447,466 
  shell-utils.ts   |   82.93 |    89.55 |     100 |   82.93 | ...1522,1529-1533 
  ...lAstParser.ts |   95.58 |    85.79 |     100 |   95.58 | ...1059-1061,1071 
  ...nlyChecker.ts |   95.75 |    92.39 |     100 |   95.75 | ...00-301,313-314 
  sideQuery.ts     |   98.73 |    94.59 |     100 |   98.73 | 111               
  ...pEventSink.ts |     100 |       80 |     100 |     100 | 61                
  ...tGenerator.ts |     100 |      100 |     100 |     100 |                   
  ...ameContext.ts |     100 |      100 |     100 |     100 |                   
  symlink.ts       |   77.77 |       50 |     100 |   77.77 | 44,54-59          
  ...emEncoding.ts |   96.36 |    91.17 |     100 |   96.36 | 59-60,124-125     
  terminalSafe.ts  |     100 |      100 |     100 |     100 |                   
  ...Serializer.ts |   98.72 |       90 |     100 |   98.72 | 42-43,134,201-203 
  testUtils.ts     |   53.33 |      100 |   33.33 |   53.33 | ...53,59-64,70-72 
  textUtils.ts     |      60 |      100 |   66.66 |      60 | 36-55             
  thoughtUtils.ts  |     100 |    92.85 |     100 |     100 | 71                
  ...-converter.ts |   94.59 |    85.71 |     100 |   94.59 | 35-36             
  tool-utils.ts    |    93.6 |     91.3 |     100 |    93.6 | ...58-159,162-163 
  truncation.ts    |     100 |       92 |     100 |     100 | 52,71             
  windowsPath.ts   |   89.47 |    79.31 |     100 |   89.47 | ...57-58,62,90-91 
  ...aceContext.ts |   93.71 |    89.28 |   93.33 |   93.71 | ...24-225,249-251 
  xml.ts           |     100 |      100 |     100 |     100 |                   
  yaml-parser.ts   |      92 |    84.31 |     100 |      92 | 49-53,65-69       
 ...ils/filesearch |   85.77 |    81.06 |   96.42 |   85.77 |                   
  crawlCache.ts    |     100 |      100 |     100 |     100 |                   
  crawler.ts       |   82.84 |    77.49 |   94.82 |   82.84 | ...1451,1485-1486 
  fileSearch.ts    |   93.58 |    87.32 |     100 |   93.58 | ...46-247,249-250 
  ignore.ts        |     100 |      100 |     100 |     100 |                   
  result-cache.ts  |     100 |     92.3 |     100 |     100 | 46                
 ...uest-tokenizer |   56.63 |    74.52 |   74.19 |   56.63 |                   
  ...eTokenizer.ts |   41.86 |    76.47 |   69.23 |   41.86 | ...70-443,453-507 
  index.ts         |     100 |      100 |     100 |     100 |                   
  ...tTokenizer.ts |   68.39 |    69.49 |    90.9 |   68.39 | ...24-325,327-328 
  ...ageFormats.ts |      76 |      100 |   33.33 |      76 | 45-48,55-56       
  textTokenizer.ts |     100 |      100 |     100 |     100 |                   
  types.ts         |       0 |        0 |       0 |       0 | 1                 
-------------------|---------|----------|---------|---------|-------------------

For detailed HTML reports, please see the 'coverage-reports-22.x-ubuntu-latest' artifact from the main CI run.

@wenshao wenshao left a comment

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

[Critical] [test] coreToolScheduler.test.ts: 13 telemetry span tests fail — The test mock intercepts withSpan from tracer.js, but production code now imports startToolSpan/endToolSpan/runInToolSpanContext from session-tracing.ts. toolSpanRecords stays empty; all span assertions fail. The refactored span lifecycle in executeSingleToolCall and _executeToolCallBody has zero test coverage.

[Critical] [test] loggingContentGenerator.test.ts: 3 tests fail — (1) Attribute name mismatch: test expects unprefixed model/prompt_id but mock uses prefixed qwen-code.model/qwen-code.prompt_id. (2) Mock endLLMRequestSpan calls span.setStatus() without try/catch; errors propagate from finally block. Production endLLMRequestSpan wraps everything in try/catch.

Suggested fixes for test failures:

  • Update coreToolScheduler test mock to intercept startToolSpan/endToolSpan/runInToolSpanContext/startToolExecutionSpan/endToolExecutionSpan from '../telemetry/session-tracing.js'
  • Fix attribute name assertions to use prefixed names
  • Wrap the mocked endLLMRequestSpan body in try/catch matching production behavior

Comment thread packages/core/src/core/client.ts
Comment thread packages/core/src/telemetry/session-tracing.ts
Comment thread packages/core/src/telemetry/session-tracing.ts Outdated
Comment thread packages/core/src/core/coreToolScheduler.ts
Comment thread packages/core/src/telemetry/session-tracing.ts

@wenshao wenshao left a comment

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

⚠️ CI is failing on all 3 test platforms — the same 11 telemetry span test failures reported below.

[Suggestion] endToolSpan and endLLMRequestSpan have asymmetric success attribute handling: endLLMRequestSpan unconditionally sets success on every call, while endToolSpan only sets it on success paths (no metadata → no success attribute). This means dashboard queries filtering on success = false will miss all non-success tool spans. Consider making them consistent — either always set success in endToolSpan too, or add { success: false } to error/cancel paths.

— DeepSeek/deepseek-v4-pro via Qwen Code /review

Comment thread packages/core/src/core/coreToolScheduler.test.ts
Comment thread packages/core/src/core/coreToolScheduler.ts
Comment thread packages/core/src/core/coreToolScheduler.ts
Comment thread packages/core/src/core/coreToolScheduler.test.ts
Comment thread packages/core/src/core/coreToolScheduler.ts
@doudouOUC doudouOUC force-pushed the worktree-sorted-petting-parrot branch from 3919ade to debe7f4 Compare May 14, 2026 08:26
@doudouOUC doudouOUC added the type/feature-request New feature or enhancement request label May 14, 2026

@wenshao wenshao left a comment

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

⚠️ CI tests are still pending (ubuntu-latest, macos-latest, windows-latest). Downgraded from Request Changes to Comment pending CI results.

Incremental review of new commits (since last review at 3919ade)

What changed

  • IDE context is now merged into the user prompt via <system-reminder> wrapper (no more separate addHistory calls)
  • escapeSystemReminderTags now used for both rule content and IDE text (replaces narrow </system-reminder> regex)
  • forceFullIdeContext set on Error and ChatCompressed events
  • State update deferral to first stream event, with tests covering throw-before-event case
  • Microcompact log format updated for per-kind counts

Assessment

The incremental changes are well-structured and well-tested. The IDE context merge + system-reminder escaping is a clear improvement. Key concerns from the previous review about:

  • Tool span naming regression
  • Redundant setStatus(OK)
  • Stream vs non-stream span naming
  • Unprotected code before try block in _executeToolCallBody

...are in files not touched by this incremental update and remain open.

— DeepSeek/deepseek-v4-pro via Qwen Code /review

doudouOUC added 8 commits May 15, 2026 23:59
#3731 P3 Phase 1)

Replace disconnected withSpan/startSpanWithContext calls in runtime with
session-tracing typed helpers so LLM and tool spans become children of
the interaction span instead of siblings under the session root.

- Add toolContext ALS with runInToolSpanContext() for concurrent-safe
  tool span scoping (uses AsyncLocalStorage.run, not enterWith)
- Wire startLLMRequestSpan/endLLMRequestSpan in loggingContentGenerator
  for both streaming and non-streaming paths
- Wire startToolSpan/endToolSpan + startToolExecutionSpan/endToolExecutionSpan
  in coreToolScheduler with proper try/finally lifecycle
- Remove redundant withSpan('client.generateContent') wrapper from client.ts
- Fix endToolSpan to not override pre-set status when metadata is omitted
- Change startToolExecutionSpan to read parent from toolContext ALS
- Update tests for new span creation APIs and remove dead test infrastructure
- Remove unused _toolSpan variable (TS6133)
- Use bracket notation for index signature property access (TS4111)
…test mocks

- coreToolScheduler.test.ts: mock startToolSpan/endToolSpan/runInToolSpanContext
  instead of withSpan; update cancellation tests for restored safeSetStatus call
- loggingContentGenerator.test.ts: fix attribute keys in mock, add try/catch in
  endLLMRequestSpan mock to match production best-effort behavior
- Add debugLogger.warn in catch blocks of endLLMRequestSpan/endToolSpan/
  endToolExecutionSpan instead of silent swallowing
- Add JSDoc on endToolSpan documenting intentional no-metadata-no-status
  contract with setToolSpanFailure/setToolSpanCancelled
- Add warning in startToolExecutionSpan when called outside
  runInToolSpanContext (no active toolContext)
- Sanitize error message in endToolExecutionSpan: use constant
  TOOL_SPAN_STATUS_TOOL_EXCEPTION instead of raw error message
…heduler tests

The full mock shadowed all re-exports (logToolCall, etc.) causing 49 test
failures. Use importActual to preserve other exports, only override span
functions.
startToolExecutionSpan mock also pushes to toolSpanRecords, so at(-1)
returns the execution sub-span instead of the tool span. Use findLast
to filter by name.
- Remove redundant safeSetStatus(span, OK) on success path — endToolSpan
  in finally already sets OK via metadata
- Add llm_request.stream attribute (true/false) to distinguish streaming
  vs non-streaming LLM requests in trace backends
Bypass span.setStatus() in mock to avoid potential interference from
vitest module resolution. Write to statusCalls/ended directly on the
ToolSpanRecord.
doudouOUC added 2 commits May 15, 2026 23:59
…/index.js

Mocking the barrel re-export (telemetry/index.js) with importActual was
unreliable — vitest's module resolution could bind production code to the
real endToolSpan before the mock override took effect. Mock the source
module (session-tracing.js) directly to guarantee interception.
…y in finally

Root cause: checkAndNotifyCompletion clears this.toolCalls before the
finally block in executeSingleToolCall runs, so the tc lookup always
returns undefined.

Fix: set OK status explicitly in _executeToolCallBody's success path
via safeSetStatus(span, OK), and call endToolSpan() without metadata
in finally (just ends the span, preserves pre-set status from any path).
@doudouOUC doudouOUC force-pushed the worktree-sorted-petting-parrot branch from 81a8869 to eef8ce7 Compare May 15, 2026 15:59
doudouOUC added 2 commits May 16, 2026 00:41
…n on failure

- Wrap non-stream generateContent API call + logging in context.with(spanContext)
  so nested OTel spans (HTTP instrumentation, log-bridge spans) parent to
  qwen-code.llm_request instead of session root (matches streaming path).
- runInToolSpanContext now also activates OTel context via otelContext.with,
  not just the custom toolContext ALS. Hooks/HTTP/IO during tool execution
  now correctly parent to qwen-code.tool span.
- Split end*Span helpers: span.end() runs in its own try/catch so a throwing
  setAttributes/setStatus can't leak unended spans.
…cution span timing

- start{LLMRequest,Tool,ToolExecution}Span now fall back to getSessionContext()
  when no parent context, instead of otelContext.active(). Side-query LLM calls
  (auto-title, recap) now stay in the session trace instead of starting a new
  detached trace.
- Move startToolExecutionSpan() to BEFORE invocation.execute(), matching
  claude-code. Previously the synchronous setup inside execute (shell command
  preprocessing, child_process.spawn) ran outside the execution span.
);
this.notifyToolCallsUpdate();
};
promise = invocation.execute(

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

[Critical] invocation.execute(...) now runs before the try block, so synchronous setup failures bypass the failure path. This PR intentionally starts the execution sub-span before execute() to bracket synchronous setup, but if execute() throws before returning a promise (for example during shell setup), the catch block below never runs: the execution span is not ended with failure metadata, failure hooks are skipped, and the tool call can miss the normal terminal error response. Move the try so it wraps both the execute() call and the await promise, and keep the existing catch/failure handling shared for synchronous and async failures.

— gpt-5.5 via Qwen Code /review

} catch {
// OTel errors must not interrupt the consumer.
}
endLLMRequestSpan(span, {

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

[Suggestion] After the idle timer fires, the helper has already ended and removed the span, but this wrapper does not keep a local spanEnded/timed-out flag. If the consumer resumes after a long idle gap, later chunks still update lastUsageMetadata, but the final endLLMRequestSpan(...) call is a no-op, so the exported span remains an error/timed-out span and loses the real terminal status/token counts. The previous code kept a local spanEnded guard for this timeout path. Consider restoring a local guard/timeout state so resumed streams do not silently discard their final telemetry, or only end once the generator is truly abandoned.

— gpt-5.5 via Qwen Code /review

// active via runInToolSpanContext, and tool implementations don't
// currently emit nested OTel spans of their own — the span boundary
// is purely for timing/attribution.
const execSpan = startToolExecutionSpan();

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

[Suggestion] The scheduler tests mock startToolExecutionSpan / endToolExecutionSpan, but they do not assert that this new execution sub-span is started and ended with the expected metadata on the concrete scheduler paths. A regression could drop the sub-span or mark ToolResult.error / thrown exceptions as successful without failing these tests. Please add scheduler-level assertions for success, ToolResult.error, thrown invocation exceptions, and pre-hook denial not starting an execution span.

— gpt-5.5 via Qwen Code /review

this.wrapped.generateContent(req, userPromptId),
const llmSpan = startLLMRequestSpan(req.model, userPromptId);
try {
llmSpan.setAttribute('llm_request.stream', false);

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

[Suggestion] This PR adds llm_request.stream to preserve stream vs non-stream observability after both paths moved to the shared qwen-code.llm_request span name, but the tests only check span creation/status and do not assert this attribute. Please have the span mock record setAttribute calls and assert generateContent() sets llm_request.stream to false while generateContentStream() sets it to true.

— gpt-5.5 via Qwen Code /review

Comment thread packages/core/src/core/coreToolScheduler.test.ts
Comment thread packages/core/src/telemetry/session-tracing.ts
Comment thread packages/core/src/telemetry/session-tracing.test.ts
Comment thread packages/core/src/core/coreToolScheduler.ts
Comment thread packages/core/src/core/coreToolScheduler.ts
Comment thread packages/core/src/telemetry/session-tracing.ts Outdated
…ce, test coverage

- coreToolScheduler.executeSingleToolCall: move try-block to wrap
  invocation.execute() so synchronous throws (e.g. shell setup failure)
  flow into the same catch path as async rejections. Previously a sync
  throw would leak the execution span and skip failure hooks.

- loggingStreamWrapper: track spanEndedByTimeout flag so a stream that
  resumes after the 5-min idle timeout does not run the final
  endLLMRequestSpan (which would no-op anyway, but the flag also stops
  resetSpanTimeout from queuing further timer callbacks).

- coreToolScheduler.test: add execution sub-span assertions for success,
  ToolResult.error, thrown invocation exceptions, and pre-hook denial.

- loggingContentGenerator.test: capture setAttribute calls into the mock
  span attributes record; assert llm_request.stream is false for non-stream
  and true for stream paths.
@doudouOUC doudouOUC requested a review from wenshao May 16, 2026 01:18
… gaps

- endLLMRequestSpan now uses spanCtx.span for mutations (matches
  endToolSpan/endToolExecutionSpan pattern). Same object, but consistent
  lookup pattern prevents future drift.

- Mocks capture endLLMRequestSpan and endToolSpan/endToolExecutionSpan
  metadata so tests can assert token counts, durationMs, success, error
  are forwarded correctly. Add assertions on:
    * Non-stream LLM: inputTokens, outputTokens, success on response path
    * Non-stream LLM: success: false + sanitized error on rejection
    * Stream LLM: final lastUsageMetadata reaches endLLMRequestSpan
    * Tool execution sub-span: success: true on happy path
    * Tool execution sub-span: success: false on ToolResult.error
    * Tool execution sub-span: success: false + sanitized error on throw

- Add OTel error resilience tests: when setAttributes or setStatus throws,
  span.end() must still run and the span must be removed from activeSpans.
  Covered for endLLMRequestSpan, endToolSpan, endToolExecutionSpan.

@wenshao wenshao left a comment

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

[Suggestion — Pattern: loggingStreamWrapper test gaps] Three test coverage gaps in the streaming wrapper:

  1. Idle timeout path completely untested — timer setup, reset on chunk arrival, cleanup on normal completion, spanEndedByTimeout gate logic, and timeout-triggered span end with metadata are not covered. Use vi.useFakeTimers().
  2. Stream error tests miss endMetadata assertions (L1037, L1087) — production code passes { success: false, inputTokens, outputTokens, durationMs, error } to endLLMRequestSpan, but tests only check statuses and ended.
  3. Stream error after usage metadata never tested — existing error tests throw before lastUsageMetadata is set, so inputTokens/outputTokens forwarding on the error path remains unverified.

Comment thread packages/core/src/telemetry/session-tracing.ts
Comment thread packages/core/src/telemetry/session-tracing.ts
Comment thread packages/core/src/core/coreToolScheduler.ts
}
endLLMRequestSpan(span, {
success: false,
durationMs: Date.now() - startTime,

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

[Suggestion] The idle timeout callback closes the span with success: false, but if the consumer later drains remaining buffered chunks after the timeout, safelyLogApiResponse still logs a "successful" API response. The span in the trace says "timed out / error" while the API log says "success" — contradictory signals that can cause false escalations during incident response.

Suggested change
durationMs: Date.now() - startTime,
// After the idle timeout fires, also skip the success log:
if (spanEndedByTimeout) return; // added at the start of the normal completion path

— DeepSeek/deepseek-v4-pro via Qwen Code /review

@wenshao wenshao left a comment

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

跑了真实 E2E(DeepSeek 后端 + OTel file exporter,触发 run_shell_command 工具调用,--approval-mode=yolo)。Trace 拓扑彻底修好

qwen-code.interaction (f8b3ffb…)         ← root
├── qwen-code.llm_request    (0600f75…)   ← turn 1
├── qwen-code.tool           (0ec9017…)
│   └── qwen-code.tool.execution (4a1618b…)
└── qwen-code.llm_request    (353fa37…)   ← turn 2

5 个 span 同一 traceId a185dc9…,proper parent-child。PR #4097 那次的拓扑分裂彻底解决 ✅

合入前唯一想先修的一处 —— success 属性在 qwen-code.tool 上完全缺失

实测导出(成功路径):

{"name":"qwen-code.tool","status":{"code":1},
 "attributes":["call_id","duration_ms","tool.name","tool_name"]}     // ← 无 success

{"name":"qwen-code.llm_request","status":{"code":1},
 "attributes":["success"]}                                          // ← 有 success

根因

coreToolScheduler.ts:1871endToolSpan 唯一 的调用点,且不带 metadata:

} finally {
  endToolSpan(toolSpan);
}

session-tracing.ts:360-364endToolSpan 仅在 metadata.success !== undefined 时写 success 属性。所以任何 tool span(不论成功/失败/取消)都不会带 success 属性

endLLMRequestSpan 的对应分支(session-tracing.ts:254)是无条件 endAttributes['success'] = metadata.success;,所以 qwen-code.llm_request 始终带 success

影响

后端用 success = false 过滤 tool 失败(最常见的失败率 dashboard 查询)会完全漏掉所有 tool 失败 —— 因为 tool span 上根本就没有 success 字段。tool.failure_kind 属性确实在失败路径写了,但需要后端单独识别这个非标 attr,跟 success boolean 不能复用同一套查询。

建议改法(二选一)

方案 1(推荐,最小改动):让 setToolSpanFailure / setToolSpanCancelledsuccess: false,并在成功路径补一行 success: true

// session-tracing.ts: setToolSpanFailure
function setToolSpanFailure(span: Span, failureKind: string, message: string): void {
  try {
    span.setAttribute(TOOL_FAILURE_KIND_ATTRIBUTE, failureKind);
    span.setAttribute('success', false);   // ← add
  } catch { /* … */ }
  safeSetStatus(span, { code: SpanStatusCode.ERROR, message });
}

// setToolSpanCancelled 同理加一行 span.setAttribute('success', false);

// coreToolScheduler.ts:2180
this.setStatusInternal(callId, 'success', successResponse);
safeSetStatus(span, { code: SpanStatusCode.OK });
span.setAttribute('success', true);        // ← add

方案 2:在 finally 处带 metadata 调 endToolSpan,但需要从 ALS 或 SpanContext 里读出 terminal 状态(this.toolCalls 在 finally 时已被 checkAndNotifyCompletion 清空,不能直接查)。

方案 1 更直白,符合现有 setToolSpan* 助手"封装一切状态副作用"的语义。

单测断言(防回归)

coreToolScheduler.test.ts:3401-3453 已有 4 条 tool.execution lifecycle 断言;麻烦在 qwen-code.tool 这边也加 3 条:

it('tool span: success=true attribute on success', async () => {
  await runSingleTool();
  const tool = getToolSpan();
  expect(tool!.spanAttributes).toHaveProperty('success', true);
});

it('tool span: success=false on ToolResult.error', async () => {  });
it('tool span: success=false on thrown invocation', async () => {  });

—— 改完这一处 + 测试,我就 approve。其它 5 条 Suggestion(abort 状态分辨、idle timeout vs api log 不一致、Phase 2 占位 union 等)可以新开 issue follow-up,不阻塞合入。

doudouOUC added 2 commits May 16, 2026 21:37
…metry

- session-tracing.ts SpanContext.type: comment 'tool.blocked_on_user' |
  'hook' as Phase 2 forward-declarations (no helpers wired yet).

- endToolExecutionSpan: align no-metadata-no-status behavior with
  endToolSpan. Currently no caller omits metadata, but the asymmetric
  default (OK vs preserve-pre-set) was a maintenance trap.

- loggingContentGenerator generateContent (non-stream) catch block:
  call endLLMRequestSpan BEFORE the logging block, mirroring the
  streaming path. Defense-in-depth against logging-side rejections.

- loggingContentGenerator: restore abort-specific span status message.
  All three LLM error paths (non-stream catch, stream eager-error catch,
  stream loggingStreamWrapper finally) now use
  API_CALL_ABORTED_SPAN_STATUS_MESSAGE when req.config.abortSignal.aborted,
  matching the original withSpan('client.generateContent') behavior.
  Trace backends can now distinguish cancellations from real failures.

- coreToolScheduler _executeToolCallBody catch: distinguish abort vs
  exception in execSpan error message. New constant
  TOOL_SPAN_STATUS_TOOL_CANCELLED prevents operators filtering exec
  spans for errors from seeing cancellation false positives.

- New test asserting exec span uses cancelled-by-user message when the
  invocation throws after abort.
E2E review found qwen-code.tool spans never carry the `success` boolean
attribute (the helper only writes it when metadata is passed, and the
finally block calls endToolSpan(toolSpan) without metadata). This breaks
the most common observability query — filtering tool failures with
`success = false` — because tool spans don't have that field at all.

Fix: setToolSpanFailure / setToolSpanCancelled now also call
span.setAttribute('success', false); the success path in
_executeToolCallBody adds span.setAttribute('success', true) after
safeSetStatus(span, OK). Mirrors the unconditional `success` attribute
on llm_request spans, so backends can use one query for both span types.

Add 4 scheduler-level tests asserting the success attribute on:
- success path
- ToolResult.error path
- thrown invocation path
- cancellation path
@doudouOUC

Copy link
Copy Markdown
Collaborator Author

@wenshao 已修复,commit 44b55ebe5

采用方案 1(最小改动):

  • setToolSpanFailure / setToolSpanCancelled 都加了 span.setAttribute('success', false)
  • 成功路径在 _executeToolCallBodysafeSetStatus(span, OK) 后加 span.setAttribute('success', true)

新增 4 条防回归断言(coreToolScheduler.test.ts):

  • tool span: success=true attribute on success
  • tool span: success=false attribute on ToolResult.error
  • tool span: success=false attribute on thrown invocation exception
  • tool span: success=false attribute on cancellation

后端用 success = false 过滤 tool 失败现在和 qwen-code.llm_request 走同一套查询。323 unit tests 全过,零 type error。

@wenshao

wenshao commented May 16, 2026

Copy link
Copy Markdown
Collaborator

本地真实测试报告

Head: 44b55ebe (13:48Z) "always write 'success' attribute on tool spans"

我 11:44Z 的 Critical(success 属性 tool span 缺失)已修

coreToolScheduler.ts:124-148

 function setToolSpanFailure(span, failureKind, message) {
   try {
     span.setAttribute(TOOL_FAILURE_KIND_ATTRIBUTE, failureKind);
+    // Always write `success: false` so trace backends can filter tool
+    // failures with the same query they use for llm_request spans —
+    // mirrors the unconditional `success` attribute on llm_request.
+    span.setAttribute('success', false);
   } catch { /* OTel errors must not block the failure status update. */ }
   safeSetStatus(span, { code: SpanStatusCode.ERROR, message });
 }

 function setToolSpanCancelled(span) {
   try {
     span.setAttribute(TOOL_FAILURE_KIND_ATTRIBUTE, TOOL_FAILURE_KIND_CANCELLED);
+    span.setAttribute('success', false);
   } catch { /* … */ }
   safeSetStatus(span, { code: SpanStatusCode.UNSET });
 }

 // _executeToolCallBody 成功路径
 this.setStatusInternal(callId, 'success', successResponse);
 safeSetStatus(span, { code: SpanStatusCode.OK });
+try { span.setAttribute('success', true); } catch { /* … */ }

3 处都加了 try/catch 防 OTel 异常阻塞 API 行为。

真实 E2E — 成功 + 失败两路径

$ # 成功路径:run_shell_command 'ls -1'
qwen-code.tool (bb4c5d0a…)  status=OK   success=true   tool_name=run_shell_command
qwen-code.tool.execution    status=OK   success=true
$ # 失败路径:read_file /this/path/does/not/exist.txt
qwen-code.tool              status=ERROR success=false  failure_kind=tool_error  tool_name=read_file
qwen-code.tool.execution    status=ERROR success=false

后端 success = false dashboard 查询能 catch 到 tool 失败了。

Trace 拓扑维持 hierarchical

成功路径的 5 个 span 同一个 traceId 498c68b0…

qwen-code.interaction         (8c25fdd6…)  root
├── qwen-code.llm_request     (073cbc71…)  turn 1
├── qwen-code.tool            (bb4c5d0a…)  tool_name=run_shell_command, success=true
│   └── qwen-code.tool.execution (b86df479…)  success=true
└── qwen-code.llm_request     (f5d78ec7…)  turn 2

9cf43709 顺手解了我 11:44Z 6 条 Suggestion 中的 5 条

  1. SpanContext.type 'tool.blocked_on_user' | 'hook' 加 Phase 2 占位注释
  2. endToolExecutionSpan 默认行为对齐 endToolSpan(无 metadata 不写 status)
  3. generateContent 非流 catch:endLLMRequestSpan 在 logging 之前调用,对齐流路径
  4. ✅ 三处 LLM error path 区分 abort(新增 API_CALL_ABORTED_SPAN_STATUS_MESSAGE)—— 后端能区分用户取消 vs 真实 API 失败
  5. tool.execution catch 区分 abort(新增 TOOL_SPAN_STATUS_TOOL_CANCELLED)+ 新增取消用例测试

第 6 条(stream idle timeout 后续 chunk 仍触发 success api log 的矛盾)commit msg 未提,是 stream race 边角,可 follow-up。

测试结果

测试套件 结果
telemetry/session-tracing.test.ts 27/27 pass
core/coreToolScheduler.test.ts 128/128 pass(新增 4 条 success attr 断言 + 1 条 cancelled 用例)
core/loggingContentGenerator.test.ts 37/37 pass
core/client.test.ts 131/131 pass
tsc --noEmit (core + cli) 0 错
GitHub CI 全绿(Lint, Test ubuntu/macos/windows, CodeQL, Coverage)

合计 323/323 单测过。

LGTM ✅

@wenshao wenshao left a comment

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

已发布详细测试报告(见 issue comment)。Critical(tool span 缺 success 属性)已修,11:44Z 6 条 Suggestion 也解了 5 条;成功 + 失败两路径真实 E2E 验证 success attr 写入正确,trace 拓扑保持 hierarchical;323/323 单测过,CI 全绿,tsc 0 错。LGTM ✅

@wenshao wenshao merged commit 8dfbdaa into main May 16, 2026
9 checks passed
wenshao pushed a commit that referenced this pull request May 19, 2026
…og/span consistency (#4302)

* fix(telemetry): Phase 1.5 polish — fallback order, abort-as-result, log/span consistency

Follow-up to #4126 (Phase 1: unify span creation paths). Four small fixes
surfaced in the late review rounds; tracked in #4212.

1. session-tracing fallback order — add resolveParentContext() helper that
   mirrors tracer.ts:getParentContext(): when interactionContext / toolContext
   ALS is empty, prefer the active OTel span over the synthetic session-root
   fallback. Without this, an LLM or tool call nested inside another OTel
   span (subagent inside a tool, instrumented HTTP path, etc.) re-parented
   to session root and flattened the trace tree. Three sites unified.

2. tool.execution span misreporting cancellation as success — when a tool
   observes signal.aborted and resolves with a normal ToolResult (no .error),
   the execution sub-span used to close as success while the parent tool
   span ended cancelled. Snapshot signal.aborted once after await, mirror it
   into endToolExecutionSpan, and stamp a sanitized error reason
   (Tool execution cancelled by user / Tool execution failed) so trace
   backends can distinguish the two without cross-referencing the parent.

3. loggingContentGenerator test mock missing API_CALL_ABORTED_SPAN_STATUS_MESSAGE
   — production code imports it; Vitest returned undefined and quietly
   masked the abort branch in tests.

4. Stream idle timeout vs api_response success log — when the 5-min idle
   timeout closes the LLM span as failed, skip the post-loop
   safelyLogApiResponse / safelyLogOpenAIInteraction / addModelOutputAttributes
   so telemetry doesn't carry a "success" api_response log alongside a
   timed-out span.

Closes #4212.

🤖 Generated with [Qwen Code](https://github.com/QwenLM/qwen-code)

* fix(telemetry): address #4302 review — catch-path gating, exec span cancelled status, sync tags

Four follow-ups from the #4302 review on top of the Phase 1.5 polish:

1. loggingContentGenerator.ts catch block — gate safelyLogApiError +
   safelyLogOpenAIInteraction on !spanEndedByTimeout. The success path
   was gated in the original PR but the error path wasn't, so a downstream
   throw after idle timeout would still emit api_error alongside a timed-out
   span — the same contradictory-telemetry pair the timeout fix targets.

2. endToolExecutionSpan now accepts a `cancelled` discriminator. Without
   it, success: false unconditionally sets SpanStatusCode.ERROR while the
   parent tool span uses setToolSpanCancelled (UNSET) — child shows ERROR,
   parent shows cancelled, trace backends filtering for errors false-positive
   on user cancels. Cancelled exec spans now keep status UNSET like the
   parent while still recording success: false / error reason attributes.

3. coreToolScheduler.ts catch block now snapshots signal.aborted into a
   local `aborted` constant and passes it through endToolExecutionSpan +
   the if-branch — same idiom as the success path (#4212), eliminating
   the divergent style.

4. SYNC tags on tracer.ts:getParentContext and session-tracing.ts:
   resolveParentContext so future drift between the two parent-resolution
   paths surfaces in grep before it flattens the trace tree.

Tests:
- session-tracing.test.ts — endToolExecutionSpan cancelled: true keeps
  status UNSET; sanity check that the non-cancelled path still maps
  success: false to ERROR.
- coreToolScheduler.test.ts — extends the live-output cancellation test
  and the catch-after-abort test to assert cancelled: true; updates the
  success/failed-result tests to expect cancelled: false; adds a
  cancelled-stays-falsy-on-real-exception test.
- loggingContentGenerator.test.ts — new test that fires the idle timeout
  then throws downstream and verifies neither logApiError nor logInteraction
  fires.

🤖 Generated with [Qwen Code](https://github.com/QwenLM/qwen-code)
doudouOUC added a commit that referenced this pull request May 21, 2026
…4321)

* feat(telemetry): Phase 2 — tool.blocked_on_user + hook spans

Adds two OTel span types under the existing hierarchical session-tracing
infrastructure (#3731 Phase 2; depends on Phase 1 #4126 and Phase 1.5 #4302):

1. `qwen-code.tool.blocked_on_user` — brackets the time a tool spends in
   awaiting_approval waiting for the user. Child of the tool span. Records
   decision (proceed_once / proceed_always / cancel / aborted /
   auto_approved) and source (cli / ide / hook / auto / system). Status
   stays UNSET — waiting is neither OK nor ERROR.

2. `qwen-code.hook` — wraps each pre/post-hook fire site so a slow hook can
   be told from a slow tool. Records hook_event (PreToolUse / PostToolUse /
   PostToolUseFailure), tool_name, shouldProceed, shouldStop, blockType,
   hasAdditionalContext. Status stays UNSET on intentional blocking
   decisions; ERROR only when the hook itself throws.

To make blocked_on_user a child of the tool span, the tool span lifecycle
moved from `executeSingleToolCall` to `_schedule`'s validating-loop —
covering validating → awaiting_approval → executing in one span. Two new
private Maps on CoreToolScheduler hold span refs across method boundaries
(callId-keyed). Centralized cleanup via `finalizeToolSpan` /
`finalizeBlockedSpan` private helpers ensures every terminal status path
also ends the corresponding span.

Eight terminal sites now finalize the tool span: signal.aborted at loop
entry, hard deny, plan-mode block, non-interactive deny, permission-hook
deny, background-agent deny, _schedule catch, executeSingleToolCall
finally. Five blocked_on_user end sites: handleConfirmationResponse cancel
and proceed branches, autoApproveCompatiblePendingTools, _schedule catch
under signal.aborted, and the global-error catch. ModifyWithEditor stays
inside one blocked_on_user span until the final proceed/cancel — the
duration_ms reflects total user think-time including editor side trips.

Six hook fire sites are wrapped: firePreToolUseHook, firePostToolUseHook,
and four safelyFirePostToolUseFailureHook variants (success-path
interrupt, toolResult.error path, catch-path interrupt, catch-path real
exception). fireNotificationHook is intentionally NOT wrapped — it's
fire-and-forget and the duration is meaningless.

Mirrors claude-code's session-tracing pattern but deliberately diverges on
one point: every end-helper takes the span object explicitly via
`getSpanId(span)` lookup instead of `findLast`-by-type. Under concurrent
tool calls, claude-code's findLast can end the wrong blocked span; passing
the ref directly is concurrency-safe.

Tests:
- session-tracing.test.ts: 11 new tests covering parent resolution
  (explicit parent for blocked_on_user, ALS-based for hook), idempotent
  end, NOOP behavior, error-status mapping, and a concurrency regression
  test (two parallel blocked spans ended in reverse order).
- coreToolScheduler.test.ts: mock extended with the four new helpers and
  two new metadata fields. New tests cover the tool span outliving a
  pre-hook deny path, blocked_on_user ending with cancel via the
  awaiting_approval flow, hook span recording shouldProceed=false /
  blockType='denied' on pre-hook block and shouldStop=true /
  blockType='stop' on post-hook stop, and a leak guard that asserts
  every recorded lifecycle span is ended after a successful tool call.

🤖 Generated with [Qwen Code](https://github.com/QwenLM/qwen-code)

* fix(telemetry): address #4321 review — Copilot inline + code-reviewer + silent-failure-hunter

Eight discrete fixes plus two new tests, all surfaced in the Phase 2 review
rounds. Grouped here because they touch the same handful of code paths.

Copilot inline (#4321 PR):
1. startToolSpan attrs naming: drop redundant `tool_name` (helper already
   sets `'tool.name'` from the first arg) and rename `call_id` to the
   namespaced `'tool.call_id'`. Two sites: `_schedule` validating-loop
   start, and the defensive fallback in executeSingleToolCall. Without
   this, traces emit non-namespaced `tool_name` / `call_id` attributes
   that consumers grepping for `tool.call_id` miss.
2. PreToolUse hook span: propagate the actual `preHookResult.blockType`
   ('denied' / 'ask' / 'stop') instead of collapsing every block to
   'denied'. Also record `hasAdditionalContext` for parity with the
   PostToolUse / failure-hook spans.
3. blocked_on_user `source` detection: use `config.getIdeMode()` (best-
   effort) so IDE-driven decisions don't all show up as `'cli'`.
   Centralized in a new `getBlockedSource()` helper.

silent-failure-hunter / code-reviewer:
4. Hook span error-tracking is dead code. firePreToolUseHook /
   firePostToolUseHook / safelyFirePostToolUseFailureHook all swallow
   throws internally — every `catch (e) { endMeta = { error, ... };
   throw e }` block in the scheduler was unreachable. Simplify all 6
   sites to `try { ... } finally { endHookSpan(...) }`. The default
   `endMeta = { success: false }` keeps the span sensible if a future
   hook impl decides to throw.
5. handleConfirmationResponse had no error handling. modifyWithEditor /
   _applyInlineModify / attemptExecutionOfScheduledCalls can throw and
   would otherwise leak both the tool span and the blocked_on_user span
   until the 30-min TTL fires. Wrap the body in a try/catch that
   finalizes both spans on rethrow. Extracted the body to
   `_handleConfirmationResponseInner` for clarity.
6. Add `'error'` to the `ToolBlockedDecision` union for system-error
   closes, so dashboards counting `decision: 'cancel'` don't get
   polluted by thrown exceptions.
7. _schedule's outer catch was labelling its non-aborted close as
   `'cancel'`. Switch to `'error'` (uses #6).
8. signal.aborted vs explicit user Cancel: when both are true, the old
   code reported `'aborted'/'system'` even though the user actually
   clicked Cancel. Reverse the precedence so `outcome === Cancel`
   wins, with `getBlockedSource()` for the source.

Tests:
- T1: extend the existing ProceedAlways auto-approve test to assert the
  two siblings' blocked spans end with `decision: 'auto_approved'`,
  `source: 'auto'`, while the first tool ends as `'proceed_always'`/cli.
- T2: existing cancel-during-confirmation test now also asserts exactly
  one blocked span is recorded for the lifecycle — the same invariant
  ModifyWithEditor's intentional preservation across editor side trips
  must not break.

🤖 Generated with [Qwen Code](https://github.com/QwenLM/qwen-code)

* fix(telemetry): close autoApprove blocked-span leak + cover three new behaviors

Two follow-ups from the post-#6767469b2 review pass on PR #4321:

1. autoApproveCompatiblePendingTools error path was logging-only and
   leaving the sibling tool's blocked_on_user span open until the 30-min
   TTL fires. Symmetric with the success branch's
   finalizeBlockedSpan('auto_approved', 'auto'), the catch now finalizes
   with ('error', 'system') so the trace deterministically explains why
   the sibling didn't auto-approve.

2. Three behaviors introduced by 6767469 had no test coverage:
   - decision='error' from _schedule's outer catch when
     getConfirmationDetails throws (asserts tool span ends, no blocked
     span ever opens since the throw happens pre-awaiting_approval).
   - source='ide' when getBlockedSource() honors getIdeMode (Cancel
     path with getIdeMode: () => true).
   - Explicit Cancel takes precedence over a concurrent signal.aborted
     in the decision label — the bug the precedence flip was meant to
     fix is now regression-tested.

Extracted a small `buildApprovalScheduler` helper for the two
awaiting_approval-flow tests; the throw-on-confirmation test reuses
StructuredErrorOnConfirmationTool.

🤖 Generated with [Qwen Code](https://github.com/QwenLM/qwen-code)

* fix(telemetry): revert autoApprove catch finalizeBlockedSpan (#4321 codex P3)

The previous commit 32f94d3 added a `finalizeBlockedSpan(callId, 'error',
'system')` to the autoApproveCompatiblePendingTools catch in the name of
"symmetry with the success branch". Codex review pointed out the bug:
that catch fires when evaluatePermissionFlow throws for a SIBLING tool,
but the sibling itself is still in `awaiting_approval` — the user can
still respond. By closing the blocked span at the catch, the eventual
handleConfirmationResponse → finalizeBlockedSpan call becomes a no-op
(Map.delete already cleared it), and the user's actual decision /
source attributes are lost from the trace.

Revert that line. The previous behavior was correct: log the error,
leave the span open, let the user's eventual decision close it
correctly. If the user never responds, the 30-min TTL in
session-tracing.ts cleans up the orphan span — same fallback that
already covered every other "user walks away" scenario.

The "leak" the original change was trying to fix was a phantom: the
span IS finalized once the user (or the abort signal) drives the tool
to a terminal state. The TTL is just the safety net.

🤖 Generated with [Qwen Code](https://github.com/QwenLM/qwen-code)

* fix(telemetry): split tool.failure_kind labels + cover proceed_once decision

Two #4321 review comments from wenshao, both Critical:

1. `TOOL_FAILURE_KIND_PRE_HOOK_BLOCKED` was being emitted for FIVE distinct
   non-PreToolUse-hook deny paths in `_schedule`:
   - finalPermission === 'deny' (hard deny)
   - plan-mode block
   - non-interactive deny
   - permission_request hook deny
   - background-agent deny
   Dashboards filtering by `failure_kind = 'pre_hook_blocked'` were
   silently picking up all of these, undermining the attribute. Add
   distinct constants + status messages for each path. The original
   PRE_HOOK_BLOCKED label is now used at exactly one site — the actual
   PreToolUse hook deny in `_executeToolCallBody`.

2. `decision: 'proceed_once'` was untested. Existing tests covered
   'cancel' and 'proceed_always' (auto-approve) but not the most common
   user interaction. Add a test that schedules an approval-required tool,
   confirms with ProceedOnce, and asserts the blocked span ends with
   `decision: 'proceed_once'`, `source: 'cli'`.

🤖 Generated with [Qwen Code](https://github.com/QwenLM/qwen-code)

* fix(telemetry): address #4321 wenshao Critical + bot summary nits

Three review items folded into one follow-up:

1. wenshao Critical (`coreToolScheduler.ts:1851`) — `ModifyWithEditor`
   path silently returned when `getPreferredEditor()` was undefined,
   leaking blocked + tool spans on user-walks-away. Add a
   `debugLogger.warn` so the silent failure is at least visible in debug
   telemetry. Deliberately do NOT finalize spans here, matching the
   Codex P3 / autoApprove decision: ModifyWithEditor stays inside one
   awaiting period, the user can still recover via Cancel/Proceed which
   closes the spans correctly, and the 30-min TTL is the safety net for
   give-up scenarios. Finalizing prematurely would make the user's
   eventual decision a no-op (Map already cleared) and lose the actual
   decision/source attributes.

2. Bot summary Medium (`session-tracing.ts:557-562`) — add a
   `debugLogger.debug` when `startToolBlockedOnUserSpan` falls back to
   `resolveParentContext` because the tool span isn't in `activeSpans`
   anymore. Helps diagnose unexpected ordering during development.

3. Bot summary Low (`constants.ts`) — JSDoc the two new span name
   constants.

🤖 Generated with [Qwen Code](https://github.com/QwenLM/qwen-code)

* refactor(telemetry): extract withHookSpan helper + drop dead finalizeToolSpan param

Two #4321 review Suggestions from wenshao:

1. The 6 hook fire sites (PreToolUse, PostToolUse, 4× PostToolUseFailure)
   each repeated the same try/finally + endMeta init + endHookSpan
   pattern. Future hook span protocol changes had to be made in lockstep.
   Extract a private generic helper:

       withHookSpan<T>(opts, fn, toEndMeta): Promise<T>

   Each fire site collapses from ~12 lines of try/finally scaffolding to
   ~3 lines passing in the fire callback + endMeta builder. The
   `let postHookResult!:` definite-assignment hack at the PostToolUse
   site is gone because the helper returns the awaited result directly.

2. `finalizeToolSpan(callId, metadata?)` had a dead `metadata`
   parameter — every caller pre-sets the span status via
   `setToolSpan{Failure,Cancelled}` and called `finalizeToolSpan` with no
   argument. Removed the parameter.

🤖 Generated with [Qwen Code](https://github.com/QwenLM/qwen-code)

* fix(telemetry): hook span error tracking + TTL cleanup safety + call_id back-compat

Three #4321 review threads from wenshao (#4321 codex P3-equivalent +
two structural concerns):

1. **[Critical] Hook spans reported success on swallowed hook failures.**
   firePreToolUseHook / firePostToolUseHook /
   firePostToolUseFailureHook (and the safelyFire wrapper in
   coreToolScheduler) all catch transport / dispatch errors internally
   and return safe defaults. Before this fix, withHookSpan's `toEndMeta`
   ran on the safe default and recorded `success: true` — a crashing
   hook was indistinguishable from one that allowed execution.
   Add a `hookError?: string` field to the three result types, populate
   it in each catch, and have all 6 toEndMeta callbacks return
   `{ success: false, error: hookError }` when present.
   Existing "graceful error" tests updated to expect the new field.

2. **[Suggestion] ensureCleanupInterval not kicked from new helpers.**
   The 30-min TTL cleanup safety net for leaked spans only starts when
   `startInteractionSpan` is first called. Sub-agent or side-query code
   paths that call `startToolBlockedOnUserSpan` / `startHookSpan`
   without an interaction span first never trigger cleanup. Both
   helpers now call the (idempotent) `ensureCleanupInterval()` early.

3. **[Suggestion] `call_id` → `'tool.call_id'` rename is breaking for
   downstream consumers.** Phase 1's `startToolSpan(name, { tool_name,
   call_id })` shipped non-namespaced attribute keys. My Phase 2 #4321
   review-fix dropped both. Dual-emit `call_id` (legacy alias) +
   `'tool.call_id'` for one release cycle so existing dashboards /
   alerts don't silently return zero. Comment notes the legacy key is
   removed in the next release.

🤖 Generated with [Qwen Code](https://github.com/QwenLM/qwen-code)

* fix(telemetry): close hookError plumbing gaps from final pre-merge audit

Final-pass review surfaced two gaps in the hookError contract added in
eafe688:

1. **Real bug (silent-failure-hunter HIGH)**: The three fire helpers
   (firePreToolUseHook / firePostToolUseHook /
   firePostToolUseFailureHook) populate `hookError` only in their catch
   blocks. But the `if (!response.success || !response.output)`
   short-circuit at lines 121 / 220 / 299 silently dropped
   `response.error` from the runner layer (URL validation failures, fn
   exceptions, prompt-runner crashes). Hooks that never even threw —
   just had a failing runner — surfaced as "successful allow" in
   telemetry. Forward `response.error?.message` into hookError on the
   short-circuit path so the operator sees the actual cause.

2. **Defensive default in withHookSpan**: the initial
   `endMeta = { success: false }` produced UNSET status (no `error`
   field, so endHookSpan skips the setStatus(ERROR) branch). Today the
   only path that hits this default is "fn() throws before toEndMeta",
   which is unreachable because all hook helpers catch internally — but
   the contract should still map to ERROR if the invariant ever
   changes. Default now carries an explanatory error string.

Test: new `coreToolScheduler.test.ts` case where messageBus.request
resolves with success:false + a real Error; asserts the PreToolUse hook
span's `hookMetadata.error` is the runner's message (instead of being
silently absent).

🤖 Generated with [Qwen Code](https://github.com/QwenLM/qwen-code)

* test(telemetry): cover #4321 rethrow path + 2 of the new failure_kind labels

Two test gaps surfaced by wenshao [Suggestion] threads:

1. **handleConfirmationResponse outer catch was untested.** The
   defensive recovery path that finalizes both spans on
   originalOnConfirm / modifyWithEditor / attemptExecution throws
   had no coverage. New test calls handleConfirmationResponse
   directly with a throwing onConfirm, asserts:
   - blocked span ends with `decision: 'error'`, `source: 'system'`
   - tool span carries `tool.failure_kind: 'tool_exception'`
   - the original error is rethrown to the caller

2. **5 new permission-flow failure_kind labels had zero
   coverage.** Add representative tests for the two highest-volume
   paths:
   - `permission_denied` — PM hard-deny via a tool whose
     getDefaultPermission returns 'deny'
   - `non_interactive_denied` — `isInteractive: () => false`
     scheduling an edit-tool that needs confirmation
   The other three (plan_mode_blocked / permission_hook_denied /
   background_agent_denied) are covered transitively via the
   existing pre_hook_blocked + plan-mode tests; if they regress,
   the same code path's existing assertions would notice.

🤖 Generated with [Qwen Code](https://github.com/QwenLM/qwen-code)

* fix(telemetry): adopt 4 wenshao Critical/Suggestion findings on PR #4321

Inline review findings:
- coreToolScheduler.ts: signal.abort drains scheduler-local
  toolSpans/blockedSpans Maps via deferred setTimeout(0) — bridges the
  gap between session-tracing's 30-min TTL (which ends underlying spans
  but cannot reach the Maps) and walk-away-during-awaiting_approval. The
  drain is deferred so explicit Cancel via handleConfirmationResponse
  and mid-execution setToolSpanCancelled paths still win the race and
  set canonical labels.
- coreToolScheduler.test.ts: regression test for permission_hook_denied
  (firePermissionRequestHook deny branch at _schedule:1683) and
  background_agent_denied (getShouldAvoidPermissionPrompts auto-deny at
  _schedule:1697). Both branches were untested — silently dropping
  setToolSpanFailure on either would lose attribution.
- coreToolScheduler.ts: defensive-fallback span in executeSingleToolCall
  uses canonicalToolName(toolName) so dashboards grouping by span name
  don't see two entries for migrated/MCP tools whose canonical and raw
  names differ.

Review-body finding:
- session-tracing.ts: TTL safety net stamps qwen-code.span.ttl_expired
  + qwen-code.span.duration_ms attributes and emits a debug log before
  ending stale spans. Operators can now distinguish "abandoned and
  garbage-collected by the safety net" from "deliberately ended without
  status/attrs". Refactored cleanup loop into sweepStaleSpans(now) and
  exposed runTTLSweepForTesting for unit coverage.

Tests: +3 scheduler tests (~220 LOC), +2 session-tracing tests (~36
LOC). 247/247 in affected files.

🤖 Generated with [Qwen Code](https://github.com/QwenLM/qwen-code)

* fix(telemetry): adopt 7 DeepSeek /review findings on PR #4321

Adopted ([Critical]):
- coreToolScheduler.ts: ModifyWithEditor `!editorType` path now sets
  `qwen-code.tool.modify_with_editor_unavailable: true` on the live tool
  span so operators can detect the silent-bail-out state in production
  traces without enabling debug logging.
- coreToolScheduler.test.ts: regression test for plan_mode_blocked
  failure_kind path (ApprovalMode.PLAN + non-read-only confirmation
  tool).
- coreToolScheduler.test.ts: regression test for the pre-aborted
  signal early-exit in `_schedule` — asserts
  setToolSpanCancelled (UNSET status) without entering execution.

Adopted ([Suggestion]):
- coreToolScheduler.ts: `withHookSpan` now `catch`-es and surfaces the
  actual thrown message instead of the hardcoded
  `'hook fn threw before toEndMeta'` sentinel. Currently unreachable
  (hook helpers swallow internally) but defensive against contract
  drift.
- coreToolScheduler.ts: re-add `tool_name` (non-namespaced) as a legacy
  alias on both startToolSpan call sites, mirroring the `call_id` /
  `tool.call_id` dual-emit window so pre-Phase-2 dashboards filtering
  on `tool_name` don't silently stop matching during the rollout.
- coreToolScheduler.test.ts: regression test for the
  `_schedule`-driven aborted decision label on the blocked_on_user
  span (companion to the existing tool-span drain test).
- coreToolScheduler.ts: PreToolUse / PostToolUse `toEndMeta` now
  include `shouldProceed: true` / `shouldStop: false` when `hookError`
  is set, mirroring the runtime's allow-on-hook-failure semantics.

Pushed back (separate PR-level reply):
- "sibling failure prematurely closes confirmed tool span" — not
  reachable: `_executeToolCallBody` swallows execution errors so the
  only paths into `handleConfirmationResponse`'s catch are
  `originalOnConfirm` / `modifyWithEditor` / `_applyInlineModify`,
  none of which run after `attemptExecutionOfScheduledCalls` started
  any sibling.
- "PostToolUseFailure hook spans not asserted" — broader scope, defer.
- "finalizeToolSpan accept required metadata" — invariant-redesign,
  out of scope for this PR.

Tests: +3 scheduler tests; 250/250 green in affected files
(coreToolScheduler 154 + session-tracing 49 + toolHookTriggers 47).

🤖 Generated with [Qwen Code](https://github.com/QwenLM/qwen-code)

* fix(telemetry): adopt 3 wenshao /review findings on PR #4321

- coreToolScheduler.ts: handleConfirmationResponse outer catch now
  branches on signal.aborted — a throw caused by the abort signal
  (e.g. ModifyWithEditor child interrupted by Ctrl+C) lands as
  decision:'aborted'/UNSET status instead of 'error'/tool_exception,
  matching the sister catch in `_schedule` and keeping dashboard
  abort-vs-error counts honest (Critical-shaped Suggestion).

- coreToolScheduler.ts: drop the per-batch abort listener at the end
  of `_schedule` when no batch entries remain in toolSpans /
  blockedSpans. Prevents Node's MaxListenersExceededWarning in
  long-lived sessions where the same AbortSignal sees many _schedule
  batches without a real abort. Listeners that still cover
  awaiting_approval entries stay attached — the user's eventual
  decision closes the spans, and the listener becomes a no-op when it
  later fires (or auto-removes via `{ once: true }` on real abort).

- coreToolScheduler.test.ts: 2 regression tests for PostToolUseFailure
  hook span variants — `is_interrupt:true` on user-abort vs
  `is_interrupt:false` on real-exception. Operators rely on this flag
  to separate user-initiated cancellations from system errors in
  dashboards; a copy-paste regression flipping the value across the 4
  PostToolUseFailure call sites was previously invisible.

Tests: 252/252 across affected files (coreToolScheduler 156 +
session-tracing 49 + toolHookTriggers 47).

🤖 Generated with [Qwen Code](https://github.com/QwenLM/qwen-code)

* fix(telemetry): adopt 7 wenshao /review round-3 findings on PR #4321

Adopted ([Critical]):

- coreToolScheduler.ts: full per-batch abort listener cleanup. Replaced
  the closure-local Set + end-of-_schedule cleanup with a class-level
  callIdToBatch Map keyed off a shared BatchAbortState. The listener is
  now released by `finalizeToolSpan` → `releaseBatchListenerIfDrained`
  whenever the last live batch entry drains, regardless of whether
  finalize happens synchronously inside _schedule, later via
  handleConfirmationResponse, or via executeSingleToolCall. Closes
  the awaiting_approval-batches-leak-listeners gap from the previous
  partial fix.

- coreToolScheduler.ts: re-check signal.aborted in the _schedule
  for-loop after `evaluatePermissionFlow`/`getConfirmationDetails`/
  `firePermissionRequestHook` and BEFORE setting awaiting_approval +
  starting the blocked span. Without this, a signal that aborts during
  one of those awaits opens a blocked span on an already-aborted
  signal whose drainSpansForBatch may have already fired, leaving the
  new entry permanently orphaned.

- session-tracing.ts: introduce truncateSpanError(s) (1KB cap) and
  apply it to every endXSpan site that writes metadata.error to span
  attributes / status messages (LLM, tool, tool execution, hook).
  Hook server responses, raw exception stacks, or hostile inputs can
  be unbounded; some OTel backends drop the entire span when any
  field exceeds their limit.

Adopted ([Suggestion]):

- coreToolScheduler.ts: per-callId try/catch inside drainSpansForBatch.
  One bad finalize no longer skips the rest of the batch; failures
  are logged via debugLogger.warn instead of bubbling up as an
  unhandled timer-callback exception.

- session-tracing.ts: TTL sweep robustness — wraps setAttributes and
  span.end() in separate try/catch blocks so a setAttributes throw
  can't leak the OTel span; stamps `decision: 'aborted'`/
  `source: 'system'` on TTL-expired blocked_on_user spans so
  dashboards filtering by decision count walk-aways consistently with
  explicit user aborts; includes tool.name + tool.call_id in the
  warn log so it's actionable in production without a trace-backend
  lookup.

- coreToolScheduler.ts: extract the 4 byte-identical PostToolUseFailure
  toEndMeta lambdas into a single `postToolUseFailureEndMeta` member.
  Future protocol changes only need to touch one place.

- coreToolScheduler.test.ts: 3 new tests
  * outer-catch aborted branch — pre-aborted signal + throwing
    onConfirm asserts decision='aborted'/source='system' and
    failure_kind='cancelled'.
  * ModifyWithEditor !editorType — uses a getModifyContext-shimmed
    MockEditTool to enter the modifiable branch and asserts
    qwen-code.tool.modify_with_editor_unavailable=true.
  * per-batch listener removed when batch drains synchronously —
    asserts AbortSignal listenerCount and `callIdToBatch` size.

Pushed back (deferred):

- "firePermissionRequestHook in withHookSpan + hookError field" —
  same as previous deferral. Touches the public PermissionRequestHookResult
  type re-exported from packages/core/src/index.ts; declined per the
  guardrail on public-API changes.

Tests: 255/255 across affected files (coreToolScheduler 159 +
session-tracing 49 + toolHookTriggers 47).

🤖 Generated with [Qwen Code](https://github.com/QwenLM/qwen-code)

* fix(telemetry): polish 2 wenshao /review round-4 nits on PR #4321

- session-tracing.ts: rename `SPAN_ERROR_MAX_BYTES` → `SPAN_ERROR_MAX_CHARS`
  and update the JSDoc to be honest that `truncateSpanError` truncates by
  UTF-16 code units rather than bytes. CJK/emoji-heavy errors land in the
  ~2-3KB UTF-8 range under the same code-unit cap, but that's still well
  under all major OTel backends' per-attribute limits (Jaeger/Honeycomb
  ~64KB, OTLP default ~32KB), so we keep the simpler char-count bound
  rather than paying the encoder cost on every endXSpan.

- coreToolScheduler.ts: move the `withHookSpan` JSDoc block to sit
  directly above the method. The previous order had two consecutive
  JSDoc blocks separated by `postToolUseFailureEndMeta`, which orphaned
  the `withHookSpan` doc — IDE hover tooltips would surface the wrong
  documentation.

Tests: 208/208 in affected files; tsc --noEmit clean.

🤖 Generated with [Qwen Code](https://github.com/QwenLM/qwen-code)

* fix(telemetry): adopt 4 wenshao /review round-5 findings on PR #4321

Adopted ([Suggestion]):

- coreToolScheduler.ts: `setToolSpanFailure` now applies
  `truncateSpanError` to the status message at this single ingress
  point. Many of its 10+ call sites pass raw `error.message` which can
  be unbounded — the same backend-drop risk that drove
  `truncateSpanError` for the endXSpan attribute writes. Static-
  constant callers see no change since their messages are well under
  the 1024-char cap. Required exporting `truncateSpanError` from
  `session-tracing.ts` and re-exporting from `telemetry/index.ts`.

- coreToolScheduler.ts: in `_schedule`, after the for-loop runs to
  completion, drop the abort listener if `batchState.callIds.size === 0`.
  Closes the all-error-batch leak path: if every newToolCall had
  `status !== 'validating'` (e.g., invalid params, tool not registered,
  queue full), no `finalizeToolSpan` ever fires for the batch and
  `releaseBatchListenerIfDrained` is never invoked. Without this drop,
  one dead listener accumulates per all-error batch.

- coreToolScheduler.ts: `handleConfirmationResponse` outer catch now
  emits a `debugLogger.warn` before rethrowing. Without it, if the
  caller (CLI confirmation UI layer) doesn't log the rejection, the
  error disappears from application logs entirely — operators
  grepping by callId would see nothing despite the trace backend
  showing `failure_kind: tool_exception`.

- session-tracing.test.ts: 4 new tests
  * `truncateSpanError` returns short strings unchanged
  * `truncateSpanError` truncates over 1024 chars + appends sentinel
  * `truncateSpanError` boundary at exactly 1024 chars
  * TTL sweep stamps `decision: 'aborted'` + `source: 'system'` on
    blocked_on_user spans (covers the branch added in review-3 round)

Pushed back ([Suggestion]):

- "TTL sweep can't reach scheduler-local Maps" — accurate but the fix
  is non-trivial: a parallel scheduler-side TTL sweep duplicates the
  session-tracing sweep's bookkeeping, and the practical impact is
  bounded (Maps die with the scheduler instance, which is per-session
  in CLI mode). The bigger leak (listener accumulation on shared
  signals) is already covered by `releaseBatchListenerIfDrained`.
  Marking as out-of-scope architectural follow-up.

Tests: 259/259 across affected files (coreToolScheduler 159 +
session-tracing 53 + toolHookTriggers 47). `tsc --noEmit` clean.

🤖 Generated with [Qwen Code](https://github.com/QwenLM/qwen-code)

* fix(telemetry): adopt 1 wenshao /review round-6 finding on PR #4321

- coreToolScheduler.test.ts: convert the `truncateSpanError` mock from
  an inline identity function to `vi.fn(identity)` so individual tests
  can substitute a sentinel return. Added regression test
  `setToolSpanFailure forwards the truncateSpanError result to the span
  status (#4321)` that overrides the spy with `<<TRUNCATED-SENTINEL>>`,
  drives the scheduler through the pre-hook deny path, and asserts the
  span's ERROR status message equals the sentinel — locks the
  integration so a regression dropping the `truncateSpanError(message)`
  call inside `setToolSpanFailure` is caught at the scheduler boundary
  rather than only at the utility's unit test.

Tests: 213/213 across `coreToolScheduler.test.ts` (160) +
`session-tracing.test.ts` (53). `tsc --noEmit` clean.

🤖 Generated with [Qwen Code](https://github.com/QwenLM/qwen-code)

* fix(telemetry): close 4 silent-failure + test-gap findings from final review on PR #4321

Comprehensive self-review (code-reviewer + silent-failure-hunter +
type-design-analyzer + pr-test-analyzer agents) after 6 rounds of bot
feedback turned up 4 remaining actionable items. Addressed:

[silent-failure-hunter HIGH-1] toolHookTriggers.ts: when the hook
runner returns `{ success: false }` (or missing output) with no
`error.message`, the 3 fire helpers used to silently return the safe
default — `{ shouldProceed: true }` / `{ shouldStop: false }` / `{}` —
producing a hook span that reads `success: true` and looked like a
clean allow in dashboards. Now synthesizes a sentinel hookError
describing the contract violation so the span records the failure.
Three existing test cases updated to assert the new sentinel-bearing
shape.

[silent-failure-hunter HIGH-2] coreToolScheduler.ts: synchronous
throws in `_executeToolCallBody`'s prelude (addToolInputAttributes,
getMessageBus, startToolExecutionSpan, etc.) propagated up to
`executeSingleToolCall`'s `finally` without ever hitting setToolSpan*,
so the tool span ended UNSET with no failure_kind AND the tool call
stayed in 'executing' forever (checkAndNotifyCompletion never sees
terminal state, scheduler hangs). Added a catch in
executeSingleToolCall that pre-sets failure status + an error response
before the finally finalizes — guards every prelude path the body's
own try/catch doesn't cover.

[silent-failure-hunter MEDIUM-3] session-tracing.ts: the empty catch
on `sweepStaleSpans` `setAttributes` lost the `ttl_expired` +
`decision: 'aborted'` sentinel attrs silently if setAttributes ever
threw. Now matches the sibling `span.end()` catch and surfaces via
`debugLogger.warn` — TTL-leaked blocked spans stay distinguishable
from deliberately-UNSET ones in dashboards.

[pr-test-analyzer Gap1, severity 7] coreToolScheduler.test.ts: the
`signal.aborted` re-check at `_schedule:1834` (round-3 fix that
prevents opening a blocked span on an already-aborted signal between
the for-loop's await points and the awaiting_approval transition) had
no regression test. Added one that uses a tool whose
`getConfirmationDetails` aborts the signal before returning — top of
loop check passes, getConfirmationDetails resolves and aborts, re-check
fires the cancel path. Asserts `tool.failure_kind === 'cancelled'` AND
that NO blocked_on_user span was ever started.

Tests: 261/261 across affected files (coreToolScheduler 161 +
session-tracing 53 + toolHookTriggers 47). `tsc --noEmit` clean.

🤖 Generated with [Qwen Code](https://github.com/QwenLM/qwen-code)

* fix(telemetry): adopt 3 wenshao /review round-8 findings on PR #4321

All three from the same /review run; all valid (the Critical is a real
bug in the SF-H2 fix from review-7 that this commit fixes).

[Critical] coreToolScheduler.ts:2407 — the `c.status === 'executing'`
guard on the prelude-throw catch was wrong. Prelude throws happen
BEFORE the `scheduled → executing` transition in `_executeToolCallBody`
(getMessageBus is called at line 2460, scheduled→executing flips at
line 2522). The `find(... 'executing')` skipped the setStatusInternal,
so the toolCall stayed in `scheduled` forever and
checkAndNotifyCompletion never fired — exactly the stall the SF-H2 fix
was supposed to prevent. Drop the guard; setStatusInternal already
no-ops on terminal states (success/error/cancelled) so the
unconditional call covers both scheduled-prelude and executing-body
paths. Added regression test that makes getMessageBus throw and
asserts onAllToolCallsComplete fires with status='error'.

[Suggestion] session-tracing.ts:222 — truncateSpanError used
`slice(0, 1024)` on UTF-16 code units, which splits surrogate pairs
when an emoji (e.g. 🚀) or rare CJK character sits at the boundary.
The result was a lone high surrogate followed by `'…[truncated]'` —
strict OTLP/gRPC collectors reject batches with invalid UTF-8 (a lone
high surrogate encodes to an invalid byte sequence). Back up one code
unit when the cut lands on a high surrogate. Added regression test
that constructs the boundary case (1023 'a' + 🚀 + padding) and
asserts the truncated string is valid UTF-16.

[Suggestion] toolHookTriggers.ts:133/240/319 — switched `||` to `??`
in the 3 hookError sentinel sites. `||` treats empty string as falsy
so a runner returning `{ error: { message: "" } }` triggered the
sentinel instead of preserving the (unhelpful but distinct) empty
message — a runner contract violation that's worth distinguishing
from a missing-message case. `??` synthesizes only when the message
is truly absent (undefined / null).

Tests: 263/263 across affected files (coreToolScheduler 162 +
session-tracing 54 + toolHookTriggers 47). `tsc --noEmit` clean.

🤖 Generated with [Qwen Code](https://github.com/QwenLM/qwen-code)

* fix(telemetry): adopt 3 wenshao /review round-9 findings on PR #4321

[Critical] coreToolScheduler.ts — `handleConfirmationResponse`'s catch
was misattributing sister-tool prelude throws to the confirmed tool's
span. The catch wrapped `_handleConfirmationResponseInner`, which
called `attemptExecutionOfScheduledCalls` at its tail. If the user
proceeds tool A with ProceedAlways, `autoApproveCompatiblePendingTools`
transitions sister tools B/C to `scheduled`, and B has a prelude
throw, the SF-H2 catch in `executeSingleToolCall` re-throws → the
throw propagates up through `attemptExecutionOfScheduledCalls` → into
the outer catch keyed on A.callId, where `setToolSpanFailure(A.span,
TOOL_EXCEPTION, B.error.message)` corrupts A's span and
`finalizeToolSpan(A.callId)` ends A's span prematurely. A's actual
result later disappears from telemetry. Fix: move
`attemptExecutionOfScheduledCalls` out of
`_handleConfirmationResponseInner` and into
`handleConfirmationResponse` after the try/catch. The catch now
covers only confirmation logic; each tool's
`executeSingleToolCall` already handles its own span lifecycle via
its own catch.

[Suggestion] toolHookTriggers.ts — reverted the round-8 `??` change
back to `||`. Downstream consumers in coreToolScheduler.ts gate on
`r.hookError ? ...`, so an empty-string `hookError` preserved by
`??` was silently dropped — the change defeated its own stated
intent. Empty-string runner error messages carry no operator value;
the sentinel ("hook runner returned ... without error detail") is
more actionable, and `||` matches existing downstream truthiness
semantics.

[Suggestion] session-tracing.test.ts — replaced the vacuous
`Buffer.from(truncated, 'utf16le')` assertion (which never throws
because Node's Buffer copies raw 16-bit code units without validating
surrogate pairs) with the suggested regex
`/[\uD800-\uDBFF](?![\uDC00-\uDFFF])/` that actually checks for
orphan high surrogates anywhere in the string.

Tests: 263/263 across affected files (coreToolScheduler 162 +
session-tracing 54 + toolHookTriggers 47). `tsc --noEmit` clean.

🤖 Generated with [Qwen Code](https://github.com/QwenLM/qwen-code)

* test(telemetry): pin empty-string runner error sentinel behavior on PR #4321

[Suggestion] gpt-5.5 review-10: the round-9 `??` → `||` revert was
correct, but the existing tests only covered the missing-error case
(`success: false` with no `error` field). A future regression back to
`??` would still pass those tests while reintroducing the silent-drop
behavior the revert was guarding against.

Add 3 explicit tests — one per fire helper (PreToolUse, PostToolUse,
PostToolUseFailure) — that pass `{ error: { message: '' } }` and
assert the sentinel hookError is synthesized (not the empty string).
Pins the `||` semantics so any future `??` change fails the suite.

Tests: 50/50 in toolHookTriggers.test.ts (47 → 50). `tsc --noEmit`
clean.

🤖 Generated with [Qwen Code](https://github.com/QwenLM/qwen-code)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

type/feature-request New feature or enhancement request

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants