fix: filter delivery-mirror from all consumer paths (LLM context, webchat, API) by kiyoakii · Pull Request #40716 · openclaw/openclaw

kiyoakii · 2026-03-09T07:12:47Z

Problem

Internal delivery-mirror audit entries (provider=openclaw, model=delivery-mirror) leak into all three consumer paths, causing escalating duplicate assistant messages that degrade over the life of a session:

LLM context window pollution — duplicates compound from 2x to 6-8x over a conversation, wasting tokens and confusing the model
Webchat UI duplication — users see every assistant reply rendered twice (or more)
Thinking block API rejections — extra assistant entries shift message indices, causing Anthropic to reject thinking blocks with errors visible to end users (BUG: Control UI (webchat) double-records assistant messages in session JSONL #39469)
Cross-channel impact — reported on Telegram, BlueBubbles/iMessage, and Webchat ([Bug]: Telegram duplicate messages - text and audio sent twice #30316)

Fixes #33263, #38061, #39469.
Related: #30316, #39795.

Note: Other open PRs (#38075 etc.) only filter chat.history. This PR covers all three consumer paths — including LLM context (the most impactful) and the sessions API.

Changes

Path	File	What
LLM context	`tool-result-context-guard.ts`	Filter via shared predicate in `transformContext` pipeline
Webchat UI	`chat.ts`	Filter before slice/byte-budget so audit entries don't consume the bounded window
API	`sessions.ts`	Filter in `sessions.get` handler
Predicate	`transcript.ts`	New `isDeliveryMirrorMessage()` using `in` narrowing (zero type assertions)

The write path is intentionally unchanged — delivery-mirror entries remain in session JSONL as an audit trail. appendCustomEntry() was investigated but SessionManager._persist() defers writes until a type:"message" + role:"assistant" entry exists, making it unreliable for standalone entries. Existing session files already contain old-format entries, so consumer-side filters are needed regardless.

Tests

7 new tests across 2 files covering the predicate (positive match, role/provider/model mismatches, non-object input) and the transformContext integration (strips delivery-mirror; preserves array identity when none present).

Unrelated

Commit f5c618eb fixes a pre-existing oxfmt formatting issue in src/cli/daemon-cli/lifecycle.test.ts.

greptile-apps · 2026-03-09T07:18:08Z

Greptile Summary

This PR fixes internal delivery-mirror audit entries (written by appendAssistantMessageToSessionTranscript with provider=openclaw, model=delivery-mirror) from leaking into all three consumer paths — LLM context window, webchat history (chat.history), and the sessions API (sessions.get) — which was causing escalating duplicate assistant messages and Anthropic API rejections.

Key changes:

Introduces a shared isDeliveryMirrorMessage(msg: unknown): boolean predicate in transcript.ts using proper in-operator narrowing (zero type assertions), making it safe to call on raw unknown objects from JSONL reads.
Filters delivery-mirror entries in tool-result-context-guard.ts before enforceToolResultContextBudgetInPlace, so these internal audit entries no longer inflate the context budget estimate (a secondary correctness improvement beyond de-duplication).
A .some() pre-check in tool-result-context-guard.ts preserves array identity for the common case where no delivery-mirror messages are present, avoiding unnecessary allocations.
The limit/max comparisons in sessions.ts and chat.ts are correctly applied to the filtered length so the pagination window accounts only for visible messages.
The JSONL write path is intentionally untouched — delivery-mirror entries remain on disk as an audit trail.
7 new tests cover the predicate exhaustively (positive match, role/provider/model mismatches, null/undefined/non-object) and two integration tests validate the transformContext behaviour (strips delivery-mirror; preserves array identity when none present).

Confidence Score: 5/5

This PR is safe to merge — it is a targeted consumer-side filter with no write-path changes, correct predicate logic, and good test coverage.
The implementation is minimal and correct: the isDeliveryMirrorMessage predicate is robustly written with proper type-narrowing, all three identified consumer paths are covered, the ordering of filter-before-budget-enforcement is right, and pagination limits are applied to filtered lengths. Tests exhaustively cover the predicate contract and the transformContext integration. No production data is mutated or lost — delivery-mirror entries remain in JSONL. No significant logic gaps or edge-case risks were found.
No files require special attention.

_{Last reviewed commit: f5c618e}

kiyoakii · 2026-03-09T08:11:49Z

@byungsker Thanks! All CI checks succeeded. Good to merge?

openclaw-barnacle · 2026-04-25T04:25:21Z

This pull request has been automatically marked as stale due to inactivity.
Please add updates or it will be closed.

steipete · 2026-04-26T09:13:49Z

Dedupe pass: keeping this as the main PR for the Track B assistant-message-shape slice under #69208.

Why this one stays canonical:

focused file set around delivery-mirror filtering and consumer paths
narrower than Hide transcript-only OpenClaw history artifacts #69217, which carries the same relevant idea mixed with unrelated Microsoft/channel/fetch changes
maps cleanly to the delivery-mirror/gateway-injected assistant-artifact branch without trying to solve the broader replay/idempotency family

This does not imply the full umbrella is fixed; it just makes this PR the clean candidate for the assistant-artifact consumer-path work.

clawsweeper · 2026-04-26T09:20:50Z

Codex review: needs changes before merge.

Summary
The PR adds a shared delivery-mirror predicate/filter, applies it to LLM context, WebChat chat.history, and sessions.get, and updates docs, changelog, and regression tests.

Reproducibility: yes. from source: current main writes provider: "openclaw" / model: "delivery-mirror" assistant rows and the chat.history and sessions.get paths return/project recent transcript messages without this filter. I did not run a live gateway reproduction in this read-only sweep.

Next step before merge
A narrow repair can update the canonical PR/replacement to current main and fix the visible-window edge case without a product decision.

Security
Cleared: The diff is limited to TypeScript filtering, tests, docs, and changelog; it does not add dependencies, workflows, package scripts, secret handling, or new code-execution surfaces.

Review findings

[P3] Keep mirror rows out of the bounded history window — src/gateway/server-methods/chat.ts:1670

Review details

Best possible solution:

Refresh the canonical PR on current main, keep delivery-mirror rows as persisted audit data, and filter internal transcript artifacts from model-, WebChat-, and API-facing consumers without letting hidden rows consume visible history limits.

Do we have a high-confidence way to reproduce the issue?

Yes from source: current main writes provider: "openclaw" / model: "delivery-mirror" assistant rows and the chat.history and sessions.get paths return/project recent transcript messages without this filter. I did not run a live gateway reproduction in this read-only sweep.

Is this the best way to solve the issue?

Yes, with a rebase caveat: consumer-boundary filtering is the narrow maintainable fix because it preserves the audit write path while hiding internal artifacts. The updated branch should account for current main's bounded session-history readers so filtering happens before visible limits are finalized.

Full review comments:

[P3] Keep mirror rows out of the bounded history window — src/gateway/server-methods/chat.ts:1670
This filters after chat.history has already requested only max recent rows from the transcript. If those tail rows include delivery-mirror artifacts, hidden rows still consume the requested window and clients can receive fewer visible messages than requested; pull enough rows or make the recent reader count only visible messages before applying the limit.
Confidence: 0.78

Overall correctness: patch is correct
Overall confidence: 0.76

Acceptance criteria:

pnpm test src/agents/pi-embedded-runner/tool-result-context-guard.test.ts src/config/sessions/sessions.test.ts src/gateway/server.chat.gateway-server-chat-b.test.ts
pnpm test src/gateway/session-utils.fs.test.ts src/gateway/server-methods/server-methods.test.ts
pnpm check:changed

What I checked:

canonical PR context: A maintainer comment keeps this PR as the main delivery-mirror consumer-path candidate for Track B under Umbrella: duplicate transcript, replay, and context assembly across channels #69208, narrower than broader duplicate/replay work. (3b99a1a159da)
delivery-mirror write path: Current main still writes mirrored delivery audit rows as assistant messages with provider: "openclaw" and model: "delivery-mirror". (src/config/sessions/transcript.ts:190, 89a15fddaf84)
WebChat consumer path still lacks this filter: chat.history reads recent messages, augments them, and sends rawMessages into display projection without a delivery-mirror-specific filter on current main. (src/gateway/server-methods/chat.ts:1748, 89a15fddaf84)
sessions API still lacks this filter: sessions.get returns messages from readRecentSessionMessagesWithStatsAsync without filtering delivery-mirror artifacts on current main. (src/gateway/server-methods/sessions.ts:1765, 89a15fddaf84)
LLM replay overlap already exists: Current main already drops delivery-mirror and gateway-injected OpenClaw assistant turns from provider replay, so this PR's model-context slice overlaps newer replay sanitization. (src/agents/pi-embedded-runner/replay-history.ts:229, 89a15fddaf84)
docs still describe old API behavior: Current protocol docs still say sessions.get returns the full stored row and list chat.history normalization without delivery-mirror omission. Public docs: docs/gateway/protocol.md. (docs/gateway/protocol.md:413, 89a15fddaf84)

Likely related people:

vincentkoc: The provided timeline shows a narrow repair pushed by vincentkoc to keep this PR canonical, and current main also shows recent maintenance by Vincent Koc. (role: recent maintainer and branch repair owner; confidence: medium; commits: 3b99a1a159da, 89a15fddaf84; files: src/gateway/server-methods/chat.ts, src/gateway/server-methods/sessions.ts, src/config/sessions/transcript.ts)
steipete: A maintainer comment explicitly scoped this PR as the canonical Track B assistant-artifact consumer-path candidate under the duplicate transcript umbrella. (role: maintainer triage owner; confidence: medium; commits: 585ce38015ef; files: src/gateway/server-methods/chat.ts, src/gateway/server-methods/sessions.ts, src/agents/pi-embedded-runner/replay-history.ts)
andyylin: The current changelog records adjacent work on redundant delivery-mirror transcript appends, and the PR timeline later mentions/subscribes this person. (role: adjacent delivery-mirror maintainer; confidence: low; files: CHANGELOG.md, src/config/sessions/transcript.ts)

Remaining risk / open question:

The PR is reported mergeable: false, and current main's bounded async history readers changed the integration point this branch must adapt to.
Tests were not executed in this read-only review; the discussion reports prior targeted validation and pnpm check:changed, but not proof against the exact current main SHA.

Codex review notes: model gpt-5.5, reasoning high; reviewed against 89a15fddaf84.

…PI responses delivery-mirror entries (provider=openclaw, model=delivery-mirror) are internal cross-channel delivery audit records written by appendAssistantMessageToSessionTranscript(). They use appendMessage() which creates type:"message" entries indistinguishable from real LLM responses, causing them to leak into buildSessionContext() and API responses. This causes duplicate assistant messages in the LLM context window (escalating from 2x to 6-8x over a session), duplicate renders in webchat UI, and raw audit entries in API responses. Fix: filter delivery-mirror messages at the three consumer points: - transformContext pipeline (LLM context) - chat.history handler (webchat UI) - sessions.get handler (API) Add isDeliveryMirrorMessage() predicate co-located with the write path in transcript.ts. Add regression tests for all filter paths. The write path (appendMessage) is intentionally unchanged - delivery-mirror entries remain in the JSONL as an audit trail. appendCustomEntry() was considered but _persist() in SessionManager defers writes until an assistant message exists, making it unreliable for standalone audit entries. Fixes #33263, #38061, #39469 Related: #30316, #39795

- Move delivery-mirror filter before slice/byte-budget in chat.history so internal entries do not consume bounded window slots - Drop unnecessary .toLowerCase() on role check for consistent strict equality across all three predicate fields - Add symmetric test case for model match with provider mismatch

vincentkoc · 2026-04-28T18:53:23Z

ProjectClownfish pushed a narrow repair to this branch so the original contributor path can stay canonical.

Source PR: #40716
Validation: pnpm -s vitest run src/agents/pi-embedded-runner/tool-result-context-guard.test.ts src/config/sessions/sessions.test.ts; pnpm check:changed
Contributor credit is preserved in the branch history and PR context.

openclaw-barnacle Bot added app: web-ui App: web-ui gateway Gateway runtime agents Agent runtime and tooling size: S labels Mar 9, 2026

kiyoakii mentioned this pull request Mar 9, 2026

fix(test): fix broken Telegram DM auth assertion on main #40726

Closed

kiyoakii force-pushed the fix/filter-delivery-mirror-from-context branch from f5c618e to 1424398 Compare March 9, 2026 07:30

openclaw-barnacle Bot added the cli CLI command changes label Mar 9, 2026

This was referenced Mar 9, 2026

[Bug]: Webchat duplicates assistant replies via delivery-mirror transcript entry #33263

Closed

BUG: Control UI (webchat) double-records assistant messages in session JSONL #39469

Closed

kiyoakii force-pushed the fix/filter-delivery-mirror-from-context branch from f40da04 to 77a667d Compare March 9, 2026 07:48

openclaw-barnacle Bot removed the cli CLI command changes label Mar 9, 2026

This comment was marked as spam.

Sign in to view

kiyoakii force-pushed the fix/filter-delivery-mirror-from-context branch from 77a667d to a237940 Compare March 9, 2026 15:18

This was referenced Apr 20, 2026

[Bug]: MSTeams DM inbound messages appear duplicated in agent session context #67323

Closed

Umbrella: duplicate transcript, replay, and context assembly across channels #69208

Open

lukeboyett mentioned this pull request Apr 20, 2026

system-event-shape consumer-path leakage; propose per-event audience classification #69492

Closed

openclaw-barnacle Bot added the stale Marked as stale due to inactivity label Apr 25, 2026

clawsweeper Bot mentioned this pull request Apr 26, 2026

fix(gateway): filter delivery-mirror entries from chat.history #38075

Closed

5 tasks

steipete mentioned this pull request Apr 26, 2026

Hide transcript-only OpenClaw history artifacts #69217

Closed

openclaw-barnacle Bot removed the stale Marked as stale due to inactivity label Apr 27, 2026

kiyoakii and others added 3 commits April 28, 2026 18:53

fix: filter delivery-mirror entries from consumer history paths

3b99a1a

vincentkoc force-pushed the fix/filter-delivery-mirror-from-context branch from a237940 to 3b99a1a Compare April 28, 2026 18:53

openclaw-barnacle Bot added docs Improvements or additions to documentation size: M and removed size: S labels Apr 28, 2026

vincentkoc added clownfish:human-review clawsweeper Tracked by ClawSweeper automation and removed clownfish:merge-ready labels Apr 28, 2026

WT-WSL mentioned this pull request May 8, 2026

fix(gateway): hide transcript-only history artifacts #79172

Open

25 tasks

kiyoakii closed this by deleting the head repository May 8, 2026

luzhidong mentioned this pull request May 12, 2026

WebChat: new message replaced by previous message content — delivery-mirror replay race #80938

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: filter delivery-mirror from all consumer paths (LLM context, webchat, API)#40716

fix: filter delivery-mirror from all consumer paths (LLM context, webchat, API)#40716
kiyoakii wants to merge 3 commits into
openclaw:mainfrom
kiyoakii:fix/filter-delivery-mirror-from-context

kiyoakii commented Mar 9, 2026

Uh oh!

greptile-apps Bot commented Mar 9, 2026

Uh oh!

This comment was marked as spam.

kiyoakii commented Mar 9, 2026

Uh oh!

This comment was marked as spam.

openclaw-barnacle Bot commented Apr 25, 2026

Uh oh!

steipete commented Apr 26, 2026

Uh oh!

clawsweeper Bot commented Apr 26, 2026 •

edited

Loading

Uh oh!

vincentkoc commented Apr 28, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

kiyoakii commented Mar 9, 2026

Problem

Changes

Tests

Unrelated

Uh oh!

greptile-apps Bot commented Mar 9, 2026

Greptile Summary

Confidence Score: 5/5

Uh oh!

This comment was marked as spam.

kiyoakii commented Mar 9, 2026

Uh oh!

This comment was marked as spam.

openclaw-barnacle Bot commented Apr 25, 2026

Uh oh!

steipete commented Apr 26, 2026

Uh oh!

clawsweeper Bot commented Apr 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

vincentkoc commented Apr 28, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

clawsweeper Bot commented Apr 26, 2026 •

edited

Loading