feat(cli): add baseline /doctor memory diagnostics by qqqys · Pull Request #4180 · QwenLM/qwen-code

qqqys · 2026-05-15T14:01:02Z

Summary

Add /doctor memory as the first lightweight memory diagnostic slice.
Collect paste-safe process memory, V8 heap statistics/spaces, active handle/request counts, and process metadata.
Add targeted tests for the diagnostic formatter/collector and command routing.

Test Plan

npm test --workspace=packages/cli -- --run src/utils/memoryDiagnostics.test.ts src/ui/commands/doctorCommand.test.ts
npm run typecheck --workspace=packages/cli currently fails on pre-existing unrelated package/core API drift; no memoryDiagnostics or doctorCommand errors remain.

Part of #3000
Closes #4179

github-actions · 2026-05-15T14:02:56Z

📋 Review Summary

This PR adds a new /doctor memory subcommand that collects and displays process memory diagnostics, including V8 heap statistics, active handles/requests, and process metadata. The implementation is well-structured with comprehensive tests and follows existing patterns in the codebase. Overall, this is a solid addition to the diagnostics toolkit.

🔍 General Feedback

Good separation of concerns: The memory diagnostics logic is cleanly separated into its own module (memoryDiagnostics.ts) with clear interfaces and pure formatting functions.
Consistent patterns: The implementation follows the same patterns as doctorChecks.ts and the existing command structure.
Test coverage: Both the utility functions and command integration are tested, including interactive and non-interactive modes.
Paste-safe output: The formatter avoids ANSI codes, making the output safe to paste into bug reports.
Error resilience: The code gracefully handles cases where V8 APIs or process internals are unavailable.

🎯 Specific Feedback

🟡 High

File: memoryDiagnostics.ts:47-58 - The countProcessInternals function accesses undocumented Node.js internals (_getActiveHandles, _getActiveRequests). While the try-catch provides safety, these methods could change without notice in Node.js updates. Consider adding a comment documenting that these are internal APIs and should be reviewed when Node.js major versions are updated.
File: doctorCommand.ts:29-46 - The memory subcommand bypasses the abort signal check. If a user aborts during memory diagnostics collection (which could be slow for large heaps), the operation will continue. Consider adding if (abortSignal?.aborted) return; after collecting diagnostics.

🟢 Medium

File: memoryDiagnostics.ts:1 - Missing copyright year in the license header. The file has "Copyright 2025" but should use the current year 2026 for new files, matching the project's convention.
File: memoryDiagnostics.ts:96 - The formatBytes function returns 'unavailable' for non-finite numbers, but the MemoryDiagnostics interface types suggest these should always be numbers. Consider whether the type signature should reflect this possibility (e.g., number | undefined) or add an assertion.
File: doctorCommand.ts:28 - The subCommand variable uses args.trim().toLowerCase() but doesn't handle the case where args might be undefined (though the function signature suggests it's a string). Consider using args?.trim().toLowerCase() ?? '' for extra safety.
File: doctorCommand.test.ts:47-69 - The mock setup for memory diagnostics is duplicated in beforeEach. Consider extracting this to a helper function or using vi.mocked(...).mockImplementation() to reduce repetition if more tests are added.

🔵 Low

File: doctorCommand.ts:24-26 - The completion function hardcodes ['memory'] as candidates. As more subcommands are added, this will need to be updated. Consider extracting to a constant array that can be easily extended.
File: memoryDiagnostics.ts:108 - The output format uses "MiB" consistently, which is good. Consider adding a comment explaining why MiB (mebibytes) was chosen over MB (megabytes) for consistency with Node.js conventions.
File: memoryDiagnostics.test.ts:14 - The test name "collects baseline process and V8 memory fields" is descriptive but long. Consider shortening to "collects baseline memory fields" for consistency with other test names in the codebase.
File: doctorCommand.ts:1 - The import of HistoryItemInfo is only used for the type cast on line 33. Consider whether this type could be inferred to reduce imports.

✅ Highlights

Excellent test coverage: The tests cover both interactive and non-interactive modes, ensuring the command works correctly in all execution contexts.
Thoughtful error handling: The use of unavailable flags throughout the diagnostics allows the function to return partial data rather than failing completely.
Clean formatting function: The formatMemoryDiagnostics function is well-structured with clear sections and consistent formatting.
Good use of TypeScript interfaces: The MemoryDiagnostics interface clearly documents the shape of the data, making the code self-documenting.
Proper i18n consideration: While this PR doesn't add translations, the structure allows for easy internationalization of the output format in the future if needed.

qqqys · 2026-05-15T14:40:37Z

Addressed the review feedback in 928fd6187:

Added abort handling around /doctor memory collection/format/render, including pre-collection and post-collection checks.
Added regression tests for both abort paths.
Documented use of Node internal active handle/request APIs and kept them best-effort.
Made memory subcommand parsing defensive and extracted subcommand constants.
Removed the extra HistoryItemInfo import/type annotation.
Added MiB unit rationale and shortened the memory diagnostics test name.
Updated the new memory diagnostics utility header year.

Verification:

cd packages/cli
npx vitest run src/ui/commands/doctorCommand.test.ts src/utils/memoryDiagnostics.test.ts
npx eslint src/ui/commands/doctorCommand.ts src/ui/commands/doctorCommand.test.ts src/utils/memoryDiagnostics.ts src/utils/memoryDiagnostics.test.ts
npm run typecheck --workspace @qwen-code/qwen-code

All passed locally.

qqqys · 2026-05-15T14:53:44Z

Started the next /doctor memory phase and pushed 6f9fa3794.

Added an Assessment section to the memory report:

Status: ok / warn
Heap pressure: heapUsed / heap_size_limit
RSS vs heap-total gap for non-heap growth
Signals for high V8 heap pressure, high non-heap memory, and large active-handle counts
Actionable recommendations, including when to restart, capture heap snapshots, and inspect tool-result/native-buffer growth

Validation:

cd packages/cli
npx vitest run src/ui/commands/doctorCommand.test.ts src/utils/memoryDiagnostics.test.ts
npx eslint src/ui/commands/doctorCommand.ts src/ui/commands/doctorCommand.test.ts src/utils/memoryDiagnostics.ts src/utils/memoryDiagnostics.test.ts
npm run typecheck --workspace @qwen-code/qwen-code

Result: 13 tests passed, eslint passed, typecheck passed.

qqqys · 2026-05-15T15:22:49Z

Started the next /doctor memory phase and pushed 12d44b079.

Added opt-in heap snapshot capture:

/doctor memory --snapshot writes a V8 .heapsnapshot
Default output dir: ~/.qwen/memory-snapshots/
Stable filename shape: qwen-code-heap-<pid>-<timestamp>.heapsnapshot
The command appends the written snapshot path to the report
Snapshot capture is gated behind an explicit flag because it can be synchronous/heavy
Preserves abort checks before collection, after formatting, and before rendering

Validation:

cd packages/cli
npx vitest run src/ui/commands/doctorCommand.test.ts src/utils/memoryDiagnostics.test.ts
npx eslint src/ui/commands/doctorCommand.ts src/ui/commands/doctorCommand.test.ts src/utils/memoryDiagnostics.ts src/utils/memoryDiagnostics.test.ts
npm run typecheck --workspace @qwen-code/qwen-code

Result: 15 tests passed, eslint passed, typecheck passed.

qqqys · 2026-05-15T15:38:47Z

Next phase update: added opt-in memory pressure sampling for /doctor memory.

What changed:

Added /doctor memory --sample for a short in-process memory trend sample.
Captures 3 samples at 1s intervals by default.
Reports sample count, RSS delta, heap-used delta, and per-sample RSS / heap-used / external / array-buffer values.
Keeps the default /doctor memory fast and single-shot; sampling is opt-in because it waits briefly to observe trend direction.
Preserves abort handling after the async sampling step so cancelled commands do not render stale output.
Updated command hint/examples to include --sample and --snapshot.

Validation:

cd packages/cli && npx vitest run src/ui/commands/doctorCommand.test.ts src/utils/memoryDiagnostics.test.ts → 2 files passed, 19 tests passed
npx eslint src/ui/commands/doctorCommand.ts src/ui/commands/doctorCommand.test.ts src/utils/memoryDiagnostics.ts src/utils/memoryDiagnostics.test.ts → passed
npm run typecheck --workspace @qwen-code/qwen-code → passed

qqqys · 2026-05-15T16:10:42Z

Snapshot hardening update for /doctor memory --snapshot.

What changed:

Added an explicit warning that heap snapshots may contain prompts, file contents, tool results, and other sensitive data, so they should not be shared publicly without review.
Wrapped heap snapshot writing in error handling: if snapshot creation fails, /doctor memory --snapshot still returns the memory diagnostics report and appends Heap snapshot failed: ... instead of dropping the whole command.
Added regression tests for both the sensitive-data warning and snapshot failure path.

Validation:

cd packages/cli && npx vitest run src/ui/commands/doctorCommand.test.ts src/utils/memoryDiagnostics.test.ts → 2 files passed, 20 tests passed
npx eslint src/ui/commands/doctorCommand.ts src/ui/commands/doctorCommand.test.ts src/utils/memoryDiagnostics.ts src/utils/memoryDiagnostics.test.ts → passed
npm run typecheck --workspace @qwen-code/qwen-code → passed

wenshao · 2026-05-16T00:02:25Z

+  now?: () => Date;
+  memoryUsage?: () => NodeJS.MemoryUsage;
+  wait?: (ms: number) => Promise<void>;
+}


[Critical] v8.writeHeapSnapshot() is synchronous — blocks the event loop for seconds with zero progress feedback. On a process near its heap limit (the exact scenario /doctor memory is meant to diagnose), the snapshot itself can trigger OOM and crash the process.

Suggested fix: Show a progress message before the blocking call ("Writing heap snapshot, this may take a moment..."). In buildMemoryInsights, when heapPressure >= 0.85, warn the user or refuse the snapshot.

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T00:02:25Z

+  arrayBuffers: number;
+}
+
+export interface CollectMemoryPressureSamplesOptions {


[Critical] No disk space protection for heap snapshots. Each call writes a full heap dump (potentially hundreds of MB), files accumulate indefinitely in ~/.qwen/memory-snapshots/ with no size estimation, rate limiting, or auto-cleanup. A malicious script or agent-mode prompt injection could fill the user's disk.

Suggested fix: Add rate limiting (max 1/min), auto-cleanup (keep last N snapshots), and warn before writing if estimated size exceeds a threshold.

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T00:02:25Z

+
+    if (index < count) {
+      await wait(intervalMs);
+    }


[Critical] buildMemoryInsights ignores diagnostics.v8.unavailable. When v8.getHeapStatistics() throws, getMemoryDiagnostics sets v8.unavailable = true, but buildMemoryInsights downgrades it to an empty object via heapStatistics ?? {}, resulting in status: 'ok' with zero signals — while all V8 fields display "unavailable". The data says "broken", the assessment says "fine".

Suggested change

}

function buildMemoryInsights(diagnostics: MemoryDiagnostics): MemoryInsights {

if (diagnostics.v8.unavailable) {

return {

status: 'warn',

signals: ['V8 heap statistics are unavailable; pressure assessment skipped.'],

recommendations: ['The Node.js V8 heap statistics API failed. Verify the Node.js version is supported.'],

};

}

const heapStatistics = diagnostics.v8.heapStatistics ?? {};

// ...existing logic...

}

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T00:02:25Z

+        return;
+      }
+
+      const diagnostics = getMemoryDiagnostics();


[Critical] Interactive mode success paths for --sample and --snapshot are completely untested. Only abort/error paths exist for interactive mode; success tests only cover non-interactive. The primary UI code path at lines 97-103 (context.ui.addItem) is not verified for either flag in interactive mode.

Suggested fix: Add interactive-mode success tests for --sample, --snapshot, and --sample --snapshot combo, asserting that context.ui.addItem is called with the expected report text.

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T00:02:25Z

+          messageType = 'error';
+          report = `${report}\n\nHeap snapshot failed: ${formatErrorMessage(error)}`;
+        }
+      }


[Suggestion] In interactive mode, messageType is set to 'error' on snapshot failure (line 87) but addItem hardcodes type: 'info' (line 99). A failed snapshot displays identically to a successful one — the error state is invisible visually.

Suggested change

}

context.ui.addItem(

{

type: messageType === 'error' ? 'error' : 'info',

text: report,

},

Date.now(),

);

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T00:02:25Z

+export function formatMemoryPressureSamples(
+  samples: MemoryPressureSample[],
+): string {
+  const first = samples[0];


[Suggestion] formatMemoryPressureSamples uses first && last to compute deltas, but this is truthy for a single-element array (both reference the same object). The result displays "0.0 MiB" instead of "unavailable", misleadingly implying memory changed by exactly zero.

Suggested change

const first = samples[0];

const rssDelta =

first && last && samples.length > 1 ? last.rss - first.rss : undefined;

const heapUsedDelta =

first && last && samples.length > 1 ? last.heapUsed - first.heapUsed : undefined;

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T00:02:25Z

+) {
+  // These process methods are undocumented Node.js internals. They provide
+  // useful diagnostic counts, but may change across Node.js major versions; if
+  // unavailable or unstable, report `unavailable` instead of failing /doctor.


[Suggestion] countProcessInternals returns { count: 0, unavailable: false } when the internal API returns a non-array value. This silently reports a precise-but-wrong count of 0 instead of honestly marking it as unavailable. Since these are undocumented Node.js internals (_ prefixed), their return format may change across versions.

Suggested change

// unavailable or unstable, report `unavailable` instead of failing /doctor.

const entries = (getter as () => unknown[])();

if (!Array.isArray(entries)) {

return { count: 0, unavailable: true };

}

return { count: entries.length, unavailable: false };

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T00:08:58Z

+  wait?: (ms: number) => Promise<void>;
+}
+
+function defaultHeapSnapshotDir(): string {


[Suggestion] Heap snapshot files are written with default OS permissions (directory: 0755, file: 0644). Heap snapshots contain a full serialization of all in-memory JavaScript objects — including API keys held in environment variables, conversation prompts, authentication tokens, and file contents previously read into memory. On shared machines (CI runners, multi-user dev laptops, SSH bastion hosts), any local user can read ~/.qwen/memory-snapshots/*.heapsnapshot.

Suggested change

function defaultHeapSnapshotDir(): string {

fs.mkdirSync(outputDir, { recursive: true, mode: 0o700 });

try { fs.chmodSync(outputDir, 0o700); } catch { /* already exists */ }

And after writeSnapshot(filePath), tighten the file:

try { fs.chmodSync(result, 0o600); } catch { /* best effort */ }

— glm-5.1 via Qwen Code /review

wenshao · 2026-05-16T00:08:58Z

+        }
+      }
+
+      if (abortSignal?.aborted) {


[Suggestion] After a successful heap snapshot write, this abort check can silently discard the entire report — including the sensitive-data warning. The .heapsnapshot file persists on disk containing prompts, file contents, and tool results, but the user receives no notification that it was created. This undermines the purpose of the warning.

Move the abort check before the snapshot write, or skip it when a snapshot file exists on disk so the report (with the sensitive-data warning) is always delivered:

Suggested change

if (abortSignal?.aborted) {

if (shouldWriteHeapSnapshot && !abortSignal?.aborted) {

try {

const heapSnapshotPath = writeMemoryHeapSnapshot();

report = `${report}\n\nHeap snapshot written: ${heapSnapshotPath}\n${HEAP_SNAPSHOT_SENSITIVE_DATA_WARNING}`;

} catch (error) {

messageType = 'error';

report = `${report}\n\nHeap snapshot failed: ${formatErrorMessage(error)}`;

}

}

// Deliver report even if aborted — user may have a snapshot on disk.

— glm-5.1 via Qwen Code /review

qqqys · 2026-05-16T01:34:28Z

Addressed the latest review feedback in e0f04ac14:

Added per-directory heap snapshot rate limiting (default max 1/min) before the synchronous snapshot write.
Kept disk-space guard, snapshot cleanup, and private permissions; made chmod hardening best-effort so diagnostics remain usable on non-POSIX filesystems.
Added regression coverage for rate limiting.
Expanded interactive-mode success coverage for snapshot progress feedback and the --sample --snapshot combination.

Verification:

cd packages/cli && npx vitest run src/ui/commands/doctorCommand.test.ts src/utils/memoryDiagnostics.test.ts
cd packages/cli && npx eslint src/ui/commands/doctorCommand.ts src/ui/commands/doctorCommand.test.ts src/utils/memoryDiagnostics.ts src/utils/memoryDiagnostics.test.ts
cd packages/cli && npm run typecheck --workspace @qwen-code/qwen-code

Result: 32 tests passed; eslint passed; typecheck passed.

wenshao

⚠️ CI check "Test (windows-latest, Node 22.x)" is failing — please verify whether this is related to the PR changes or a pre-existing flake.

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T03:39:13Z

+          });
+        }
+
+        try {


[Critical] hasHighHeapPressure(diagnostics) uses stale data when --sample and --snapshot are combined.

getMemoryDiagnostics() is called at line 86, then collectMemoryPressureSamples runs for ~3 seconds. If heap pressure crosses the 85% threshold during sampling, the pre-sample diagnostics won't reflect it, and the OOM safeguard is bypassed.

Suggested change

try {

if (hasHighHeapPressure(getMemoryDiagnostics())) {

Re-fetch diagnostics after sampling completes so the check sees current heap state.

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T03:39:13Z

+        }
+      }
+
+      if (abortSignal?.aborted && !report.includes('Heap snapshot written:')) {


[Critical] The abort gate !report.includes('Heap snapshot written:') silently discards completed --sample results when --snapshot is not also specified.

When a user runs /doctor memory --sample (no --snapshot), the 3-second sampling completes and formats its output into report. But because the report never contains the string "Heap snapshot written:", the abort check at line 135 always returns undefined, throwing away the valid sample data.

Replace the string-based gate with an explicit boolean flag:

Suggested change

if (abortSignal?.aborted && !report.includes('Heap snapshot written:')) {

if (abortSignal?.aborted && shouldWriteHeapSnapshot && !snapshotWritten) {

return;

}

Set let snapshotWritten = false before the try block and snapshotWritten = true inside the success branch.

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T03:39:13Z

+    fs.chmodSync(writtenPath, 0o600);
+  } catch {
+    // Best-effort hardening; the report warns that snapshots are sensitive.
+  }


[Critical] If cleanupOldHeapSnapshots throws (e.g., transient filesystem error), the exception propagates and doctorCommand reports "Heap snapshot failed" — but the snapshot file was already successfully written at line 225.

Oncall engineers would waste time re-running the command and investigating a nonexistent failure.

Suggested change

}

recordHeapSnapshotWrite(outputDir, now);

try {

cleanupOldHeapSnapshots(outputDir, maxSnapshots);

} catch {

// Best-effort cleanup; snapshot already written successfully.

}

return writtenPath;

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T03:39:13Z

+        if (abortSignal?.aborted) {
+          return;
+        }
+


[Suggestion] setPendingItem('Writing heap snapshot...') is never cleared after the snapshot write completes (success or failure). The default doctor flow clears it in a finally block (lines 193-196), but the memory subcommand path lacks this cleanup.

Add context.ui.setPendingItem(null) after the try-catch block to prevent stale pending state in the UI.

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T03:39:14Z

+
+function hasHighHeapPressure(
+  diagnostics: ReturnType<typeof getMemoryDiagnostics>,
+): boolean {


[Suggestion] hasHighHeapPressure here and buildMemoryInsights in memoryDiagnostics.ts:322 independently compute the same heapUsed / heapSizeLimit >= 0.85 threshold. If the threshold is ever changed, the snapshot gate and the diagnostic report assessment would silently diverge.

Extract a shared isHighHeapPressure(diagnostics) function (or a constant) from memoryDiagnostics.ts so both call sites use a single source of truth.

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T03:39:14Z

+        name.startsWith('qwen-code-heap-') && name.endsWith('.heapsnapshot'),
+    )
+    .map((name) => path.join(outputDir, name))
+    .sort((a, b) => b.localeCompare(a));


[Suggestion] cleanupOldHeapSnapshots sorts snapshot files by full path using localeCompare. Filenames have format qwen-code-heap-{pid}-{timestamp}.heapsnapshot. PID "999" sorts after "12345" lexicographically, so newer snapshots from a low-numeric PID could be deleted while older snapshots from a high-numeric PID are retained.

Extract and compare only the timestamp portion (ISO-8601) for chronological ordering:

Suggested change

.sort((a, b) => b.localeCompare(a));

.sort((a, b) => {

const tsA = path.basename(a).match(/qwen-code-heap-\d+-(.+)\.heapsnapshot/)?.[1] ?? '';

const tsB = path.basename(b).match(/qwen-code-heap-\d+-(.+)\.heapsnapshot/)?.[1] ?? '';

return tsB.localeCompare(tsA);

})

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T03:39:14Z

+    return DOCTOR_SUBCOMMANDS.filter((candidate) =>
+      candidate.startsWith(trimmed),
+    );
+  },


[Suggestion] The new completion() function is completely untested. As a public API that drives tab-completion behavior, it should have tests covering: empty partial arg → returns ['memory'], partial match 'mem' → ['memory'], no match 'x' → [].

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T03:39:14Z

+  ) {
+    signals.push(
+      'Active handle count is high; long-lived timers, sockets, or file watchers may be accumulating.',
+    );


[Suggestion] The high active-handle-count signal path (activeHandles.count >= 1000) in buildMemoryInsights is untested. Add a test case with activeHandles: { count: 1500, unavailable: false } and assert the report includes both the "Active handle count is high" signal and the corresponding MCP/watcher recommendation.

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T03:39:14Z

+    diagnostics.memory.heapUsed / heapSizeLimit >= 0.85
+  );
+}
+


[Suggestion] hasHighHeapPressure edge cases are untested: heap_size_limit: 0, undefined, or heapStatistics entirely missing. Add parameterized tests to verify the function returns false (safe to snapshot) in these cases.

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao

⚠️ CI failing on Test (windows-latest, Node 22.x). Static review found 1 Critical and 7 Suggestions below.

wenshao · 2026-05-16T03:40:23Z

+function getAvailableBytes(outputDir: string): number {
+  const stats = fs.statfsSync(outputDir);
+  return stats.bavail * stats.bsize;
+}


[Critical] getAvailableBytes calls fs.statfsSync(outputDir) without try/catch. On filesystems that don't support statfs (certain FUSE mounts, network filesystems), this throws and crashes the entire writeMemoryHeapSnapshot call — the user gets an unhandled error instead of a graceful fallback.

Suggested change

}

function getAvailableBytes(outputDir: string): number {

try {

const stats = fs.statfsSync(outputDir);

return stats.bavail * stats.bsize;

} catch {

return Number.MAX_SAFE_INTEGER; // skip disk check on unsupported FS

}

}

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T03:40:23Z

+  for (const filePath of snapshots.slice(maxSnapshots)) {
+    fs.rmSync(filePath, { force: true });
+  }
+}


[Suggestion] cleanupOldHeapSnapshots uses .sort((a, b) => b.localeCompare(a)) on filenames like qwen-code-heap-{PID}-{timestamp}.heapsnapshot. Variable-length PIDs cause lexicographic sort to order incorrectly across process runs (e.g., PID 9 vs PID 12345), so newer snapshots may be deleted while older ones are preserved.

Suggested change

}

.sort((a, b) => {

const aStat = fs.statSync(a);

const bStat = fs.statSync(b);

return bStat.mtimeMs - aStat.mtimeMs;

});

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T03:40:23Z

+
+function hasHighHeapPressure(
+  diagnostics: ReturnType<typeof getMemoryDiagnostics>,
+): boolean {


[Suggestion] hasHighHeapPressure() independently computes heapUsed / heapSizeLimit >= 0.85, duplicating the identical logic in buildMemoryInsights() in memoryDiagnostics.ts. If the threshold changes in one place but not the other, snapshot gating and diagnostic reporting will diverge. Reuse buildMemoryInsights:

Suggested change

): boolean {

function hasHighHeapPressure(diagnostics: MemoryDiagnostics): boolean {

const insights = buildMemoryInsights(diagnostics);

return insights.heapPressure !== undefined && insights.heapPressure >= 0.85;

}

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T03:40:23Z

+            throw new Error(
+              'Heap snapshot skipped: V8 heap pressure is already high, and writing a synchronous heap snapshot could make the process unresponsive or trigger OOM. Restart Qwen Code first if it is unstable, or retry before memory pressure reaches the warning threshold.',
+            );
+          }


[Suggestion] The abort guard uses report.includes('Heap snapshot written:') — control flow is coupled to a formatted message string. If the success message text changes, the guard silently breaks and successfully-written snapshot reports are discarded on abort. Use an explicit boolean flag instead:

Suggested change

}

let heapSnapshotWritten = false;

// in try block after writeMemoryHeapSnapshot: heapSnapshotWritten = true;

// ...

if (abortSignal?.aborted && !heapSnapshotWritten) {

return;

}

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T03:40:23Z

+    .filter(
+      (name) =>
+        name.startsWith('qwen-code-heap-') && name.endsWith('.heapsnapshot'),
+    )


[Suggestion] cleanupOldHeapSnapshots calls readdirSync/rmSync without try/catch. If the directory is deleted between the successful snapshot write and cleanup, the exception kills the call and the user sees "Heap snapshot failed: ENOENT" — even though a valid .heapsnapshot file was already written to disk. Wrap cleanup in try/catch as best-effort.

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T03:40:23Z

+  const getter = (process as unknown as Record<string, unknown>)[name];
+  if (typeof getter !== 'function') {
+    return { count: 0, unavailable: true };
+  }


[Suggestion] process._getActiveHandles() is called unconditionally in getMemoryDiagnostics(). On processes with thousands of handles (MCP servers, watchers), this synchronous call blocks the event loop for hundreds of ms to seconds, causing a noticeable UI freeze during /doctor memory. Consider making handle counting opt-in behind a --verbose flag, or at minimum log elapsed time when it exceeds ~100ms.

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T03:40:23Z

+      if (shouldWriteHeapSnapshot) {
+        if (abortSignal?.aborted) {
+          return;
+        }


[Suggestion] In interactive mode, setPendingItem('Writing heap snapshot...') is called before the write but never cleared afterward. The "Writing..." message persists permanently in the UI alongside the final result. Add context.ui.setPendingItem(null) before addItem.

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T03:40:23Z

+const MEMORY_SUBCOMMAND = 'memory';
+const DOCTOR_SUBCOMMANDS = [MEMORY_SUBCOMMAND] as const;
+const HEAP_SNAPSHOT_SENSITIVE_DATA_WARNING =
+  'Heap snapshot may contain prompts, file contents, tool results, and other sensitive data. Do not share it publicly without reviewing it first.';


[Suggestion] HEAP_SNAPSHOT_SENSITIVE_DATA_WARNING and inline error messages in the action handler are hardcoded English strings, while other strings in the same file use t() for i18n. Wrap user-facing strings in t() for consistency with project i18n conventions.

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T03:44:29Z

+      candidate.startsWith(trimmed),
+    );
+  },
+  action: async (context, args) => {


[Critical] completion function has zero test coverage. Tab-completion logic (/doctor m → memory) has no test exercising it. If completion breaks, no test will catch it.

Suggested change

action: async (context, args) => {

// Add test for completion function:

// '' → ['memory'], 'm' → ['memory'], 'xyz' → []

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T03:44:29Z

+  }
+
+  if (
+    diagnostics.activeHandles.count >= 1000 &&


[Critical] High active-handle-count signal path (activeHandles.count >= 1000 → 'Active handle count is high') is untested. This is a key resource-leak diagnostic. No test fixture passes a high count.

Suggested change

diagnostics.activeHandles.count >= 1000 &&

// Add test with activeHandles: { count: 1000, unavailable: false }

// Assert output includes 'Active handle count is high'

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T03:44:29Z

+
+function estimateHeapSnapshotBytes(): number {
+  const heapStats = v8.getHeapStatistics();
+  return Math.max(heapStats.total_heap_size, process.memoryUsage().heapTotal);


[Critical] estimateHeapSnapshotBytes uses a 1x multiplier — real V8 heap snapshots are typically 2–5× larger (per-object metadata, edge tables, string tables). The disk-space guard at line 213 can pass when the actual snapshot would fill remaining disk. Additionally, the v8.getHeapStatistics() call lacks try/catch, unlike getMemoryDiagnostics() which wraps the same call. If V8 throws, the error propagates as unhelpful 'Heap snapshot failed: <raw V8 error>'.

Suggested change

return Math.max(heapStats.total_heap_size, process.memoryUsage().heapTotal);

const OVERHEAD_FACTOR = 3;

function estimateHeapSnapshotBytes(): number {

try {

const heapStats = v8.getHeapStatistics();

return Math.max(heapStats.total_heap_size, process.memoryUsage().heapTotal) * OVERHEAD_FACTOR;

} catch {

return process.memoryUsage().heapTotal * OVERHEAD_FACTOR;

}

}

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T03:44:29Z

+  );
+
+  const writtenPath = writeSnapshot(filePath);
+  try {


[Critical] When v8.writeHeapSnapshot(filePath) throws mid-write, a partial .heapsnapshot file remains on disk indefinitely. cleanupOldHeapSnapshots is only called on success (line 234) — failed writes skip cleanup. The corrupt partial file wastes disk space and occupies a maxSnapshots slot.

Suggested change

try {

let writtenPath: string;

try {

writtenPath = writeSnapshot(filePath);

} catch (err) {

try { fs.rmSync(filePath, { force: true }); } catch {}

throw err;

}

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T03:44:29Z

+  }
+}
+
+const lastHeapSnapshotWriteByDir = new Map<string, number>();


[Critical] Module-level Map lastHeapSnapshotWriteByDir is only set()-ed, never delete()-d. In long-running processes this is a slow memory leak (one entry per unique directory). Also, stale rate-limit state persists across test runs with no cleanup export, breaking test isolation.

Suggested change

const lastHeapSnapshotWriteByDir = new Map<string, number>();

export function clearHeapSnapshotRateLimit(): void {

lastHeapSnapshotWriteByDir.clear();

}

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T03:44:30Z

+  };
+}
+
+export interface WriteMemoryHeapSnapshotOptions {


[Suggestion] The getMemoryDiagnostics catch branch (where v8.getHeapStatistics() throws, setting v8Unavailable = true) has zero test coverage. Only the happy path is tested.

Suggested change

export interface WriteMemoryHeapSnapshotOptions {

// Add test: mock v8.getHeapStatistics to throw,

// assert v8.unavailable === true, heapStatistics === undefined

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T03:44:30Z

+
+  const writtenPath = writeSnapshot(filePath);
+  try {
+    fs.chmodSync(writtenPath, 0o600);


[Suggestion] cleanupOldHeapSnapshots is called after recordHeapSnapshotWrite without its own try/catch. If cleanup throws (e.g., readdirSync EACCES), the error propagates and masks the fact that the heap snapshot was successfully written and rate-limited. Users see 'Heap snapshot failed: <cleanup error>'.

Suggested change

fs.chmodSync(writtenPath, 0o600);

recordHeapSnapshotWrite(outputDir, now);

try {

cleanupOldHeapSnapshots(outputDir, maxSnapshots);

} catch {

// Best-effort cleanup; snapshot was already written successfully.

}

return writtenPath;

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T03:44:30Z

+      }
+
+      const diagnostics = getMemoryDiagnostics();
+


[Suggestion] When --sample --snapshot is combined, hasHighHeapPressure(diagnostics) uses the diagnostics captured before sampling. collectMemoryPressureSamples runs ~3s — heap usage can climb from safe (<85%) to dangerous (>85%) during this window, defeating the OOM-prevention guard.

Suggested change

if (shouldWriteHeapSnapshot) {

// Re-fetch diagnostics after sampling for current heap pressure

const latestDiagnostics = getMemoryDiagnostics();

if (hasHighHeapPressure(latestDiagnostics)) {

throw new Error('Heap snapshot skipped: heap pressure is high.');

}

// ... write snapshot

}

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T03:44:30Z

+  let v8Unavailable = false;
+
+  try {
+    heapStatistics = v8.getHeapStatistics() as unknown as Record<


[Suggestion] getMemoryDiagnostics wraps both v8.getHeapStatistics() and v8.getHeapSpaceStatistics() in a single try/catch. If getHeapStatistics() succeeds but getHeapSpaceStatistics() throws, v8Unavailable is set to true while heapStatistics is populated. Downstream buildMemoryInsights reports 'V8 unavailable' while formatMemoryDiagnostics still displays the heap data — a contradictory report.

Suggested change

heapStatistics = v8.getHeapStatistics() as unknown as Record<

// Use separate try/catch for each V8 call, or track per-statistic availability

let heapStatistics: v8.HeapStatistics | undefined;

try {

heapStatistics = v8.getHeapStatistics();

} catch { /* heapStatistics stays undefined */ }

let heapSpaceStatistics: v8.HeapSpaceStatistics[] | undefined;

try {

heapSpaceStatistics = v8.getHeapSpaceStatistics();

} catch { /* heapSpaceStatistics stays undefined */ }

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T03:44:30Z

+  heapUsed: number;
+  external: number;
+  arrayBuffers: number;
+}


[Suggestion] collectMemoryPressureSamples does not accept or check an abortSignal. The sampling loop (for + await wait(intervalMs)) runs ~3s without any abort check. doctorCommand.ts checks abortSignal before and after but cannot interrupt during sampling.

Suggested change

}

export async function collectMemoryPressureSamples(

options: CollectMemoryPressureSamplesOptions & { signal?: AbortSignal } = {}

): Promise<MemoryPressureSample[]> {

// ...

for (let i = 0; i < count; i++) {

if (signal?.aborted) break;

// ... sample

if (i < count - 1) {

await wait(intervalMs);

}

}

}

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T05:02:21Z

+    )
+    .map((name) => path.join(outputDir, name))
+    .sort((a, b) => {
+      const aStat = fs.statSync(a);


[Critical] fs.statSync follows symlinks — a broken symlink in ~/.qwen/memory-snapshots/ throws in the sort comparator, and the outer try/catch silently swallows it. From that point on, every future cleanup fails silently and snapshots accumulate without bound.

Suggested change

const aStat = fs.statSync(a);

.sort((a, b) => {

const getMtime = (p: string) => {

try {

return fs.lstatSync(p).mtimeMs;

} catch {

return 0;

}

};

const mtimeDelta = getMtime(b) - getMtime(a);

if (mtimeDelta !== 0) return mtimeDelta;

return (

extractHeapSnapshotTimestamp(b).localeCompare(

extractHeapSnapshotTimestamp(a),

) || path.basename(b).localeCompare(path.basename(a))

);

})

Also wrap individual fs.rmSync calls in the deletion loop so one stuck file doesn't block the rest.

— glm-5.1 via Qwen Code /review

wenshao · 2026-05-16T05:02:21Z

+      if (mtimeDelta !== 0) return mtimeDelta;
+
+      return (
+        extractHeapSnapshotTimestamp(b).localeCompare(


[Suggestion] The new tiebreaker branch (extractHeapSnapshotTimestamp + basename fallback) is the only behavioral change in this PR, but the existing test uses different mtimes so mtimeDelta !== 0 always short-circuits. Add a test with two snapshots having identical mtime to exercise the new code path:

it('resolves equal-mtime ties by filename timestamp', () => { const older = path.join(outputDir, 'qwen-code-heap-1-2026-05-15T11-00-00-000Z.heapsnapshot'); const newer = path.join(outputDir, 'qwen-code-heap-1-2026-05-15T12-00-00-000Z.heapsnapshot'); fs.writeFileSync(older, 'a'); fs.writeFileSync(newer, 'b'); const sameMtime = new Date('2026-05-15T12:00:00.000Z'); fs.utimesSync(older, sameMtime, sameMtime); fs.utimesSync(newer, sameMtime, sameMtime); writeMemoryHeapSnapshot({ outputDir, now: new Date('2026-05-15T13:00:00.000Z'), maxSnapshots: 2, writeSnapshot: (p) => { fs.writeFileSync(p, 'new'); return p; }, }); expect(fs.existsSync(older)).toBe(false); expect(fs.existsSync(newer)).toBe(true); });

— glm-5.1 via Qwen Code /review

wenshao · 2026-05-16T05:02:21Z

+
+function extractHeapSnapshotTimestamp(filePath: string): string {
+  return (
+    path


[Suggestion] The filename template at ~line 266 (qwen-code-heap-${pid}-${timestamp}.heapsnapshot) and this parsing regex are ~65 lines apart with no shared constant. A future change to the template (e.g., adding a session ID) would silently break sort semantics. Extract shared constants:

Suggested change

path

const SNAPSHOT_PREFIX = 'qwen-code-heap';

const SNAPSHOT_EXT = '.heapsnapshot';

// Filename construction:

`${SNAPSHOT_PREFIX}-${process.pid}-${formatSnapshotTimestamp(now)}${SNAPSHOT_EXT}`

// Parsing regex (derived from the same constants):

const SNAPSHOT_RE = new RegExp(`^${SNAPSHOT_PREFIX}-\\d+-(.+)\\${SNAPSHOT_EXT}$`);

— glm-5.1 via Qwen Code /review

wenshao · 2026-05-16T05:02:22Z

+function getAvailableBytes(outputDir: string): number {
+  try {
+    const stats = fs.statfsSync(outputDir);
+    return stats.bavail * stats.bsize;


[Suggestion] fs.statfsSync is unavailable on Windows. The catch returns MAX_SAFE_INTEGER, making the disk-space guard a complete no-op on that platform. Consider adding a Windows-specific disk query, or at minimum log a warning so the gap is observable:

Suggested change

return stats.bavail * stats.bsize;

} catch {

// statfsSync unavailable on this platform (e.g., Windows);

// disk-space guard is inactive.

return Number.MAX_SAFE_INTEGER;

}

— glm-5.1 via Qwen Code /review

wenshao · 2026-05-16T05:43:20Z

    return t('Run installation and environment diagnostics');
  },
  kind: CommandKind.BUILT_IN,
  supportedModes: ['interactive', 'non_interactive', 'acp'] as const,


[Critical] ACP mode allows external agents to trigger heap snapshot writes.

supportedModes includes 'acp', and the action handler has no mode-based restriction on --snapshot. An external MCP/ACP agent issuing /doctor memory --snapshot executes writeMemoryHeapSnapshot() unimpeded, dumping all process memory (including API keys, prompts, and file contents) to a predictable path on disk. The snapshot path is returned verbatim to the agent.

Suggested change

supportedModes: ['interactive', 'non_interactive', 'acp'] as const,

// In the action handler, before the --snapshot block:

if (executionMode === 'acp' && shouldWriteHeapSnapshot) {

return {

type: 'message' as const,

messageType: 'error',

content: t('Heap snapshots are not available in agent (ACP) mode for security reasons.'),

};

}

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T05:43:20Z

+  }
+}
+
+function getAvailableBytes(outputDir: string): number {


[Critical] getAvailableBytes silently bypasses disk-space protection when fs.statfsSync throws.

When fs.statfsSync fails (e.g., macOS where it's unavailable, permission errors, FUSE mounts), the catch block returns Number.MAX_SAFE_INTEGER, making the disk-space guard availableBytes - estimatedSnapshotBytes < minFreeBytesAfterSnapshot always evaluate to false. The protection is silently disabled with no log, no warning, and no user-visible signal.

Suggested change

function getAvailableBytes(outputDir: string): number {

function getAvailableBytes(outputDir: string): number | undefined {

try {

const stats = fs.statfsSync(outputDir);

return stats.bavail * stats.bsize;

} catch {

return undefined;

}

}

// At call site:

const availableBytes = getAvailableBytes(outputDir);

if (availableBytes === undefined) {

throw new Error(t('Cannot determine available disk space for heap snapshot. The filesystem may not support this operation.'));

}

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T05:43:20Z

+      }
+
+      if (shouldSampleMemory) {
+        const samples = await collectMemoryPressureSamples({


[Suggestion] Missing try/catch around collectMemoryPressureSamples await.

If collectMemoryPressureSamples throws (unlikely in practice, but the default wait/memoryUsage injectors could throw in edge cases), the report variable already containing formatMemoryDiagnostics(diagnostics) with RSS, heap usage, active handles, and process metadata is lost. The user sees only a raw error message instead of the base diagnostics that were successfully collected.

Suggested change

const samples = await collectMemoryPressureSamples({

let samples: MemoryPressureSample[] | undefined;

try {

samples = await collectMemoryPressureSamples({

sampleCount: 3,

intervalMs: 1000,

signal: abortSignal,

});

} catch (err) {

messageType = 'error';

report = `${report}\n\n${t('Memory pressure sampling failed:')} ${formatErrorMessage(err)}`;

}

if (samples) {

report = `${report}\n\n${formatMemoryPressureSamples(samples)}`;

}

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T05:43:20Z

+    '/doctor memory --sample',
+    '/doctor memory --snapshot',
+  ],
+  completion: async (_context, partialArg) => {


[Suggestion] trimStart() doesn't handle trailing whitespace, causing a completion dead-end.

When a user types /doctor memory (with trailing space after the full subcommand), partialArg.trimStart() produces 'memory ' with trailing space intact. The filter candidate.startsWith('memory ') returns [] — no completions are offered, and the user cannot discover --sample or --snapshot flags via Tab.

Suggested change

completion: async (_context, partialArg) => {

const trimmed = partialArg.trim();

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T05:43:20Z

+  return error instanceof Error ? error.message : String(error);
+}
+
+function formatHeapSnapshotErrorMessage(error: unknown): string {


[Suggestion] formatHeapSnapshotErrorMessage uses string prefix matching (message.startsWith('Heap snapshot')) that is coupled to another module's error message wording.

Errors from memoryDiagnostics.ts like 'Insufficient free disk space for heap snapshot' don't start with 'Heap snapshot', producing redundant output: "Heap snapshot failed: Insufficient free disk space for heap snapshot". If error messages in memoryDiagnostics.ts are reworded, this logic silently breaks. Consider using a structured error type instead.

Suggested change

function formatHeapSnapshotErrorMessage(error: unknown): string {

// Use a custom error class or discriminated result rather than string prefix matching:

class HeapSnapshotError extends Error {

constructor(message: string) {

super(message);

}

}

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T05:43:21Z

+
+  const estimatedSnapshotBytes = estimateSnapshotBytesOption();
+  const availableBytes = getAvailableBytesOption(outputDir);
+  if (availableBytes - estimatedSnapshotBytes < minFreeBytesAfterSnapshot) {


[Suggestion] fs.chmodSync catch-all silently swallows EACCES/EPERM without differentiation from benign failures.

Both fs.chmodSync(outputDir, 0o700) and fs.chmodSync(writtenPath, 0o600) use empty catch {} blocks. "Best-effort hardening" is the stated intent, but EACCES/EPERM (real permission errors the user should know about) are silenced identically to ENOSYS (filesystem doesn't support chmod). The user is told permissions are "private" when they may not be.

Suggested change

if (availableBytes - estimatedSnapshotBytes < minFreeBytesAfterSnapshot) {

// Log or warn on EACCES/EPERM, silently ignore ENOSYS/ENOTSUP

try {

fs.chmodSync(outputDir, 0o700);

} catch (err: any) {

if (err?.code === 'EACCES' || err?.code === 'EPERM') {

// Re-throw or return a warning — permissions are not as expected

throw err;

}

// ENOSYS, ENOTSUP — best-effort, silently skip

}

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T05:43:21Z

+    ...(heapSpaceLines.length > 0 ? heapSpaceLines : ['  - unavailable']),
+    '',
+    'Runtime internals',
+    `  Active handles: ${formatActiveCount(diagnostics.activeHandles)}`,


[Suggestion] Zero heap_size_limit renders as misleading "0.0 MiB".

When V8 reports heap_size_limit: 0 ("no explicit limit"), formatBytes renders it as "0.0 MiB", implying the heap has zero capacity — which is false and alarming. getHeapPressure already correctly handles this by returning undefined, but the raw display bypasses that guard.

Suggested change

` Active handles: ${formatActiveCount(diagnostics.activeHandles)}`,

const heapSizeLimit = asFiniteNumber(heapStatistics['heap_size_limit']);

const displayLimit = (heapSizeLimit !== undefined && heapSizeLimit > 0)

? formatBytes(heapSizeLimit)

: 'unavailable';

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T05:43:21Z

+        report = `${report}\n\n${formatMemoryPressureSamples(samples)}`;
+      }
+
+      if (shouldWriteHeapSnapshot) {


[Suggestion] ACP mode output format is inconsistent between /doctor and /doctor memory.

The original /doctor returns structured JSON via JSON.stringify({ checks, summary }). The new /doctor memory subcommand returns human-readable plain text in the content field of the same { type: 'message' } envelope. ACP consumers that parse /doctor output as JSON will silently break when receiving the plain-text memory report.

Suggested change

if (shouldWriteHeapSnapshot) {

// Wrap in JSON for consistency with /doctor:

return {

type: 'message' as const,

messageType,

content: JSON.stringify({ type: 'memory', report, heapSnapshotWritten }),

};

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T05:43:21Z

+    try {
+      fs.rmSync(filePath, { force: true });
+    } catch {
+      // Best-effort cleanup for partial snapshots after a failed write.


[Suggestion] Heap snapshot filename is fully predictable: qwen-code-heap-{process.pid}-{timestamp}.heapsnapshot.

PID and timestamp are easily obtainable. A local attacker who gains access to the machine can find snapshots in ~/.qwen/memory-snapshots/ without discovery effort — just wait for or trigger a /doctor memory --snapshot. Combined with best-effort chmod, this makes information gathering trivial.

Suggested change

// Best-effort cleanup for partial snapshots after a failed write.

// Add a random component to the filename:

const snapshotId = crypto.randomUUID().slice(0, 8);

const filePath = path.join(

outputDir,

`qwen-code-heap-${process.pid}-${formatSnapshotTimestamp(now)}-${snapshotId}.heapsnapshot`,

);

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T05:43:21Z

+  now = () => new Date(),
+  memoryUsage = process.memoryUsage,
+  wait = defaultWait,
+}: CollectMemoryPressureSamplesOptions = {}): Promise<MemoryPressureSample[]> {


[Suggestion] Invalid sampleCount values (negative, zero) are silently normalized to 1, inconsistent with NaN/Infinity treatment.

normalizeSampleCount uses Math.max(1, Math.floor(sampleCount)), silently converting sampleCount: 0 or sampleCount: -5 to 1 sample. But NaN and Infinity are normalized to the default of 3. A caller passing sampleCount: 0 (intending "no sampling") gets 1 sample with misleading results instead of an error or the default.

Suggested change

}: CollectMemoryPressureSamplesOptions = {}): Promise<MemoryPressureSample[]> {

function normalizeSampleCount(sampleCount: number): number {

if (!Number.isFinite(sampleCount) || sampleCount <= 0) {

return 3; // default

}

return Math.floor(sampleCount);

}

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao

⚠️ CI check "Test (windows-latest, Node 22.x)" is failing — please verify whether this is related to the PR changes or a pre-existing flake.

One untested coverage gap: getAvailableBytes catch fallback (returns MAX_SAFE_INTEGER) and estimateHeapSnapshotBytes catch fallback (returns heapTotal * 3) have zero test coverage — no test exercises statfsSync throwing or v8.getHeapStatistics() throwing within these functions.

wenshao · 2026-05-16T05:56:16Z

+  }
+}
+
+function getAvailableBytes(outputDir: string): number {


[Critical] getAvailableBytes silently bypasses disk-space protection when fs.statfsSync throws.

When fs.statfsSync(outputDir) throws (non-standard filesystems, FUSE mounts, permission issues), the function returns Number.MAX_SAFE_INTEGER instead of propagating the error. The disk-space guard (availableBytes - estimatedSnapshotBytes < minFreeBytesAfterSnapshot) becomes mathematically impossible to trigger — the protection is completely bypassed just when it's needed most.

The old code (before this PR) let statfsSync throw naturally (fail-closed). The new try/catch converts it to fail-open. Disk-space protection should fail-closed: if you can't verify available space, skip the snapshot rather than silently writing to a potentially full disk.

Suggested change

function getAvailableBytes(outputDir: string): number {

function getAvailableBytes(outputDir: string): number {

try {

const stats = fs.statfsSync(outputDir);

return stats.bavail * stats.bsize;

} catch (error) {

// If we can't verify available space, skip the snapshot to be safe.

throw new Error(

`Unable to check available disk space: ${formatErrorMessage(error)}`,

);

}

}

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T05:56:16Z

+        });
+
+        if (abortSignal?.aborted) {
+          return;


[Critical] Abort after --sample silently discards the already-generated diagnostics report.

When a user runs /doctor memory --sample, getMemoryDiagnostics() and formatMemoryDiagnostics() have already completed successfully. After collectMemoryPressureSamples() finishes (which may have taken ~3s), the abort check at this line returns with zero output — the fully-computed report is thrown away.

The user waited for sampling and pressed Ctrl+C, but gets absolutely nothing back. The abort should return the partial results that were already computed, not discard everything.

Suggested change

return;

if (abortSignal?.aborted) {

// Return the diagnostics we already computed, even if sampling was interrupted.

const result = shouldSampleMemory

? `${report}\n\n${formatMemoryPressureSamples(samples)}`

: report;

return {

type: 'doctor',

content: result,

messageType: 'info',

};

}

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T05:56:16Z

+
+function estimateHeapSnapshotBytes(): number {
+  const overheadFactor = 3;
+  try {


[Suggestion] Undocumented overheadFactor = 3 magic number in estimateHeapSnapshotBytes.

The 3x multiplier used to estimate heap snapshot size has no comment explaining its origin. Without documentation, future maintainers may reduce it (thinking it's overly conservative) and cause real OOMs in disk-constrained environments. Add a brief comment explaining the heuristic.

Suggested change

try {

// Conservative 3x estimate: V8 heap snapshots include per-object metadata,

// edge tables, and string tables, typically 1.5–3× the live heap size.

const overheadFactor = 3;

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T05:56:16Z

+}
+
+function formatHeapSnapshotErrorMessage(error: unknown): string {
+  const message = formatErrorMessage(error);


[Suggestion] formatHeapSnapshotErrorMessage uses hardcoded English startsWith('Heap snapshot') check that will break when i18n translations are added.

The check message.startsWith('Heap snapshot') assumes error messages always begin with English text. But throw new Error(t('Heap snapshot skipped: ...')) wraps the message through t(). If a non-English translation is ever added, the startsWith check fails, and the function prepends a translated t('Heap snapshot failed:') prefix — producing a double-prefixed message like "[translated failed:] [translated skipped: ...]".

Consider using a custom error type, an error code property, or checking against the untranslated key instead of the translated message.

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T05:56:16Z

+function defaultHeapSnapshotDir(): string {
+  return path.join(os.homedir(), '.qwen', 'memory-snapshots');
+}
+


[Suggestion] os.homedir() may throw without a fallback in defaultHeapSnapshotDir().

os.homedir() throws ERR_INVALID_ARG_VALUE when HOME is unset (headless Docker containers, restricted CI environments). The error propagates from writeMemoryHeapSnapshot()'s default parameter evaluation, and the user sees a confusing "Heap snapshot failed: ..." with no hint about the missing home directory.

The codebase has precedent for handling this: packages/core/src/core/prompts.ts wraps os.homedir() in try/catch with a fallback. Consider using os.homedir() || os.tmpdir() or wrapping in try/catch with a fallback to path.resolve('.qwen', 'memory-snapshots').

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T05:56:16Z

+function normalizeSampleCount(sampleCount: number): number {
+  if (!Number.isFinite(sampleCount)) {
+    return 3;
+  }


[Suggestion] collectMemoryPressureSamples does not respond to abort during await wait(intervalMs).

The abort signal is only checked at the start of each loop iteration. If abort arrives during await wait(intervalMs) (default 1000ms), the function doesn't respond until wait completes — up to 1s of unresponsiveness. While minor relative to the ~3s total sampling duration, it violates abort semantics (operations should stop promptly).

Consider using an abort-aware wait wrapper that races setTimeout against the signal's abort event, or reducing the sampling interval.

— DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao

No new high-confidence issues found. All previously identified Critical issues have been addressed — the code has improved significantly since the last review. 5 low-confidence findings noted for author consideration (TOCTOU window on snapshot file permissions, untested error branches in countProcessInternals, and minor duplication). Tests pass (44/44), CI green. LGTM! ✅ — DeepSeek/deepseek-v4-pro via Qwen Code /review

wenshao · 2026-05-16T09:19:21Z

Pre-merge Verification Report ✅

Date: 2026-05-16 | Tester: maintainer

Build & Quality

Check	Result
`npm run build`	✅
`npm run bundle`	✅
`tsc --noEmit` (typecheck)	✅ 0 errors
`eslint` (changed files)	✅ 0 errors, 0 warnings
Unit tests `memoryDiagnostics.test.ts`	✅ 23/23
Unit tests `doctorCommand.test.ts`	✅ 21/21

tmux Real-User TUI Testing

Scenario	Result
`/doctor memory` (basic diagnostics)	✅
`/doctor memory --sample` (pressure sampling)	✅ 3 samples, correct deltas
`/doctor memory --snapshot` (heap dump)	✅ 84MB snapshot, mode 0600, sensitivity warning shown
`/doctor memory --sample --snapshot` (combined)	✅ rate-limit triggered, RSS 503MB → `warn` status correctly surfaced
`/doctor` (no args, existing functionality)	✅ unaffected
`/doctor m` + Tab completion	✅ completes to `memory`
Non-interactive `-p "/doctor memory" --output-format json`	✅ `is_error: false`, 48ms

Key Observations

Heap snapshot written to ~/.qwen/memory-snapshots/ with private permissions (0600)
Sensitivity warning displayed after snapshot: "may contain prompts, file contents, tool results"
Rate-limiting enforced: second snapshot within cooldown window correctly blocked
High RSS/heap gap correctly triggers warn status with actionable recommendations
All 10 V8 heap spaces reported individually

Artifacts

tmp/doctor-memory-tmux-20260516-165601/tmux-readable-full.log — 334-line step-by-step readable log
tmp/doctor-memory-tmux-20260516-165601/report.md — full Chinese report

Ready to merge.

wenshao

LGTM. All tests pass (44/44 unit + full tmux real-user TUI matrix), typecheck/lint clean, heap snapshot permission safety and rate-limiting verified.

feat(cli): add baseline doctor memory diagnostics

e943cbb

fix(cli): address doctor memory review feedback

928fd61

feat(cli): add doctor memory assessment

6f9fa37

yiliang114 mentioned this pull request May 15, 2026

OOM in long sessions: V8 heap pressure can exceed limit before token-based compaction runs #4185

Closed

feat(cli): support doctor memory heap snapshots

12d44b0

yiliang114 mentioned this pull request May 15, 2026

fix(core): add heap-pressure auto-compaction safety net #4186

Merged

feat(cli): add doctor memory sampling

fb6cca1

fix(cli): harden doctor memory heap snapshots

f93c25f

wenshao requested changes May 16, 2026

View reviewed changes

wenshao reviewed May 16, 2026

View reviewed changes

github-actions Bot mentioned this pull request May 16, 2026

📊 AI CLI 工具社区动态日报 2026-05-16 zx0828/big_model_radar#58

Open

fix(cli): harden doctor memory heap snapshots

cadd4d6

github-actions Bot mentioned this pull request May 16, 2026

📊 AI CLI 工具社区动态日报 2026-05-16 gsscsd/big_model_radar#351

Open

fix(cli): harden memory heap snapshot diagnostics

e0f04ac

github-actions Bot mentioned this pull request May 16, 2026

📊 AI CLI 工具社区动态日报 2026-05-16 ivanweng2077/big_model_radar#49

Open

wenshao requested changes May 16, 2026

View reviewed changes

qqqys added 2 commits May 16, 2026 11:54

fix(cli): harden doctor memory snapshots

cfabcb7

fix(cli): stabilize heap snapshot cleanup ordering

b589155

wenshao requested changes May 16, 2026

View reviewed changes

fix(cli): harden heap snapshot cleanup

f4ea6f1

wenshao requested changes May 16, 2026

View reviewed changes

qqqys added 2 commits May 16, 2026 14:08

test(cli): cover memory snapshot fallbacks

98c0175

fix(cli): harden doctor memory abort and disk checks

f58a4eb

wenshao approved these changes May 16, 2026

View reviewed changes

wenshao merged commit 96b30ee into QwenLM:main May 16, 2026
10 checks passed

github-actions Bot mentioned this pull request May 18, 2026

fix(core): replace structuredClone with shallow copy to prevent OOM in long sessions #4286

Merged

yiliang114 mentioned this pull request May 21, 2026

[P3] Memory Diagnostics / 内存诊断 #3000

Open

qwen-code-ci-bot mentioned this pull request May 29, 2026

qwen --resume 子进程内存持续增长，最终 OOM #4624

Closed

-    }
+function buildMemoryInsights(diagnostics: MemoryDiagnostics): MemoryInsights {
+  if (diagnostics.v8.unavailable) {
+    return {
+      status: 'warn',
+      signals: ['V8 heap statistics are unavailable; pressure assessment skipped.'],
+      recommendations: ['The Node.js V8 heap statistics API failed. Verify the Node.js version is supported.'],
+    };
+  }
+  const heapStatistics = diagnostics.v8.heapStatistics ?? {};
+  // ...existing logic...
+}

-      }
+context.ui.addItem(
+  {
+    type: messageType === 'error' ? 'error' : 'info',
+    text: report,
+  },
+  Date.now(),
+);

-  const first = samples[0];
+const rssDelta =
+  first && last && samples.length > 1 ? last.rss - first.rss : undefined;
+const heapUsedDelta =
+  first && last && samples.length > 1 ? last.heapUsed - first.heapUsed : undefined;

-  // unavailable or unstable, report `unavailable` instead of failing /doctor.
+const entries = (getter as () => unknown[])();
+if (!Array.isArray(entries)) {
+  return { count: 0, unavailable: true };
+}
+return { count: entries.length, unavailable: false };

	function defaultHeapSnapshotDir(): string {
	fs.mkdirSync(outputDir, { recursive: true, mode: 0o700 });
	try { fs.chmodSync(outputDir, 0o700); } catch { /* already exists */ }

-      if (abortSignal?.aborted) {
+      if (shouldWriteHeapSnapshot && !abortSignal?.aborted) {
+        try {
+          const heapSnapshotPath = writeMemoryHeapSnapshot();
+          report = `${report}\n\nHeap snapshot written: ${heapSnapshotPath}\n${HEAP_SNAPSHOT_SENSITIVE_DATA_WARNING}`;
+        } catch (error) {
+          messageType = 'error';
+          report = `${report}\n\nHeap snapshot failed: ${formatErrorMessage(error)}`;
+        }
+      }
+      // Deliver report even if aborted — user may have a snapshot on disk.

+                        });
+                      }
+                      try {

-  }
+  recordHeapSnapshotWrite(outputDir, now);
+  try {
+    cleanupOldHeapSnapshots(outputDir, maxSnapshots);
+  } catch {
+    // Best-effort cleanup; snapshot already written successfully.
+  }
+  return writtenPath;

-    .sort((a, b) => b.localeCompare(a));
+    .sort((a, b) => {
+      const tsA = path.basename(a).match(/qwen-code-heap-\d+-(.+)\.heapsnapshot/)?.[1] ?? '';
+      const tsB = path.basename(b).match(/qwen-code-heap-\d+-(.+)\.heapsnapshot/)?.[1] ?? '';
+      return tsB.localeCompare(tsA);
+    })

-): boolean {
+function hasHighHeapPressure(diagnostics: MemoryDiagnostics): boolean {
+  const insights = buildMemoryInsights(diagnostics);
+  return insights.heapPressure !== undefined && insights.heapPressure >= 0.85;
+}

-          }
+let heapSnapshotWritten = false;
+// in try block after writeMemoryHeapSnapshot: heapSnapshotWritten = true;
+// ...
+if (abortSignal?.aborted && !heapSnapshotWritten) {
+  return;
+}

	action: async (context, args) => {
	// Add test for completion function:
	// '' → ['memory'], 'm' → ['memory'], 'xyz' → []

	diagnostics.activeHandles.count >= 1000 &&
	// Add test with activeHandles: { count: 1000, unavailable: false }
	// Assert output includes 'Active handle count is high'

-  try {
+let writtenPath: string;
+try {
+  writtenPath = writeSnapshot(filePath);
+} catch (err) {
+  try { fs.rmSync(filePath, { force: true }); } catch {}
+  throw err;
+}

-const lastHeapSnapshotWriteByDir = new Map<string, number>();
+export function clearHeapSnapshotRateLimit(): void {
+  lastHeapSnapshotWriteByDir.clear();
+}

	export interface WriteMemoryHeapSnapshotOptions {
	// Add test: mock v8.getHeapStatistics to throw,
	// assert v8.unavailable === true, heapStatistics === undefined

+if (shouldWriteHeapSnapshot) {
+  // Re-fetch diagnostics after sampling for current heap pressure
+  const latestDiagnostics = getMemoryDiagnostics();
+  if (hasHighHeapPressure(latestDiagnostics)) {
+    throw new Error('Heap snapshot skipped: heap pressure is high.');
+  }
+  // ... write snapshot
+}

-    heapStatistics = v8.getHeapStatistics() as unknown as Record<
+// Use separate try/catch for each V8 call, or track per-statistic availability
+let heapStatistics: v8.HeapStatistics | undefined;
+try {
+  heapStatistics = v8.getHeapStatistics();
+} catch { /* heapStatistics stays undefined */ }
+let heapSpaceStatistics: v8.HeapSpaceStatistics[] | undefined;
+try {
+  heapSpaceStatistics = v8.getHeapSpaceStatistics();
+} catch { /* heapSpaceStatistics stays undefined */ }

-}
+export async function collectMemoryPressureSamples(
+  options: CollectMemoryPressureSamplesOptions & { signal?: AbortSignal } = {}
+): Promise<MemoryPressureSample[]> {
+  // ...
+  for (let i = 0; i < count; i++) {
+    if (signal?.aborted) break;
+    // ... sample
+    if (i < count - 1) {
+      await wait(intervalMs);
+    }
+  }
+}

-    path
+const SNAPSHOT_PREFIX = 'qwen-code-heap';
+const SNAPSHOT_EXT = '.heapsnapshot';
+// Filename construction:
+`${SNAPSHOT_PREFIX}-${process.pid}-${formatSnapshotTimestamp(now)}${SNAPSHOT_EXT}`
+// Parsing regex (derived from the same constants):
+const SNAPSHOT_RE = new RegExp(`^${SNAPSHOT_PREFIX}-\\d+-(.+)\\${SNAPSHOT_EXT}$`);

Conversation

qqqys commented May 15, 2026

Summary

Test Plan

Uh oh!

github-actions Bot commented May 15, 2026

📋 Review Summary

🔍 General Feedback

🎯 Specific Feedback

🟡 High

🟢 Medium

🔵 Low

✅ Highlights

Uh oh!

qqqys commented May 15, 2026

Uh oh!

qqqys commented May 15, 2026

Uh oh!

qqqys commented May 15, 2026

Uh oh!

qqqys commented May 15, 2026

Uh oh!

qqqys commented May 15, 2026

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

qqqys commented May 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

wenshao left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

wenshao left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

qqqys commented May 16, 2026 •

edited

Loading

-    return stats.bavail * stats.bsize;
+  } catch {
+    // statfsSync unavailable on this platform (e.g., Windows);
+    // disk-space guard is inactive.
+    return Number.MAX_SAFE_INTEGER;
+  }

-  supportedModes: ['interactive', 'non_interactive', 'acp'] as const,
+// In the action handler, before the --snapshot block:
+if (executionMode === 'acp' && shouldWriteHeapSnapshot) {
+  return {
+    type: 'message' as const,
+    messageType: 'error',
+    content: t('Heap snapshots are not available in agent (ACP) mode for security reasons.'),
+  };
+}

-function getAvailableBytes(outputDir: string): number {
+function getAvailableBytes(outputDir: string): number | undefined {
+  try {
+    const stats = fs.statfsSync(outputDir);
+    return stats.bavail * stats.bsize;
+  } catch {
+    return undefined;
+  }
+}
+// At call site:
+const availableBytes = getAvailableBytes(outputDir);
+if (availableBytes === undefined) {
+  throw new Error(t('Cannot determine available disk space for heap snapshot. The filesystem may not support this operation.'));
+}

-        const samples = await collectMemoryPressureSamples({
+let samples: MemoryPressureSample[] | undefined;
+try {
+  samples = await collectMemoryPressureSamples({
+    sampleCount: 3,
+    intervalMs: 1000,
+    signal: abortSignal,
+  });
+} catch (err) {
+  messageType = 'error';
+  report = `${report}\n\n${t('Memory pressure sampling failed:')} ${formatErrorMessage(err)}`;
+}
+if (samples) {
+  report = `${report}\n\n${formatMemoryPressureSamples(samples)}`;
+}

	completion: async (_context, partialArg) => {
	const trimmed = partialArg.trim();

-function formatHeapSnapshotErrorMessage(error: unknown): string {
+// Use a custom error class or discriminated result rather than string prefix matching:
+class HeapSnapshotError extends Error {
+  constructor(message: string) {
+    super(message);
+  }
+}

-  if (availableBytes - estimatedSnapshotBytes < minFreeBytesAfterSnapshot) {
+// Log or warn on EACCES/EPERM, silently ignore ENOSYS/ENOTSUP
+try {
+  fs.chmodSync(outputDir, 0o700);
+} catch (err: any) {
+  if (err?.code === 'EACCES' || err?.code === 'EPERM') {
+    // Re-throw or return a warning — permissions are not as expected
+    throw err;
+  }
+  // ENOSYS, ENOTSUP — best-effort, silently skip
+}

-    `  Active handles: ${formatActiveCount(diagnostics.activeHandles)}`,
+const heapSizeLimit = asFiniteNumber(heapStatistics['heap_size_limit']);
+const displayLimit = (heapSizeLimit !== undefined && heapSizeLimit > 0)
+  ? formatBytes(heapSizeLimit)
+  : 'unavailable';

-      // Best-effort cleanup for partial snapshots after a failed write.
+// Add a random component to the filename:
+const snapshotId = crypto.randomUUID().slice(0, 8);
+const filePath = path.join(
+  outputDir,
+  `qwen-code-heap-${process.pid}-${formatSnapshotTimestamp(now)}-${snapshotId}.heapsnapshot`,
+);

-}: CollectMemoryPressureSamplesOptions = {}): Promise<MemoryPressureSample[]> {
+function normalizeSampleCount(sampleCount: number): number {
+  if (!Number.isFinite(sampleCount) || sampleCount <= 0) {
+    return 3; // default
+  }
+  return Math.floor(sampleCount);
+}