Compaction/Safeguard: add summary quality audit retries by rodrigouroz · Pull Request #25556 · openclaw/openclaw

rodrigouroz · 2026-02-24T15:27:25Z

Summary

Describe the problem and fix in 2–5 bullets:

Problem: compaction summary quality could still regress (missing required headings, dropped asks/identifiers), and retries had edge-case behavior gaps.
Why it matters: degraded summaries can lose continuity, while unsafe retry feedback formatting can weaken prompt trust boundaries.
What changed:
- Added bounded quality audit + retry flow, then hardened it across follow-up commits.
- Quality checks now use exact heading-line matching, policy-aware identifier enforcement, and stronger Unicode-aware latest-ask overlap checks.
- Audit inputs are scoped to model-summarized content only (not verbatim preserved turns), and retry failures now keep the last successful summary instead of canceling compaction.
- Reused shared prompt sanitization utilities by adding wrapUntrustedPromptDataBlock in src/agents/sanitize-for-prompt.ts and consuming it from compaction safeguard (no local sanitizer duplication).
- Quality guard is now opt-in by default (qualityGuardEnabled defaults to false unless explicitly configured).
What did NOT change (scope boundary): no config-schema surface changes in this PR.

Change Type (select all)

Scope (select all touched areas)

Linked Issue/PR

Closes n/a (final close tracked in Compaction Runner: wire post-compaction memory sync #25561)
Related [Bug]: Compaction summarization does not preserve exact identifiers #19207, (fix): Compaction: preserve recent context and sync session memory post-compact #20038, Compaction/Safeguard: require structured summary headings #25555

User-visible / Behavior Changes

Compaction quality retries are available but now disabled by default unless runtime enables qualityGuardEnabled.
When enabled, retries are bounded and keep the last successful summary if a retry attempt errors.

Security Impact (required)

New permissions/capabilities? (No)
Secrets/tokens handling changed? (No)
New/changed network calls? (No)
Command/tool execution surface changed? (No)
Data access scope changed? (No)
If any Yes, explain risk + mitigation:

Repro + Verification

Environment

OS: macOS
Runtime/container: Node 22 + pnpm
Model/provider: n/a (unit tests)
Integration/channel (if any): n/a
Relevant config (redacted): compaction safeguard runtime defaults

Steps

pnpm test src/agents/pi-extensions/compaction-safeguard.test.ts src/agents/sanitize-for-prompt.test.ts
pnpm tsgo
pnpm format

Expected

Updated safeguard and prompt-sanitization tests pass.
Typecheck passes with strict test fixture casts.
Formatting is clean.

Actual

Passed locally.

Evidence

Attach at least one:

Failing test/log before + passing after
Trace/log snippets
Screenshot/recording
Perf numbers (if relevant)

Human Verification (required)

What you personally verified (not just CI), and how:

Verified quality audit semantics for sections, identifier-policy behavior, latest-ask overlap, and preserved-turn bypass protections.
Verified retry failure fallback keeps a usable summary.
Verified shared sanitizer wrapper is reused in compaction safeguard and covered by unit tests.
What you did not verify: production traffic/token cost impact under sustained load.

Compatibility / Migration

Backward compatible? (Yes)
Config/env changes? (No)
Migration needed? (No)
If yes, exact upgrade steps:

Failure Recovery (if this breaks)

How to disable/revert this change quickly: keep qualityGuardEnabled unset/false, or revert this PR.
Files/config to restore: src/agents/pi-extensions/compaction-safeguard.ts, src/agents/sanitize-for-prompt.ts
Known bad symptoms reviewers should watch for: increased compaction latency only when quality guard is explicitly enabled.

Risks and Mitigations

List only real risks for this PR. Add/remove entries as needed. If none, write None.

Risk: when operators enable quality guard, retries can still increase latency/token usage.
- Mitigation: strict retry clamp + fallback behavior on retry errors.
Risk: false-positive ask overlap checks in edge language mixes.
- Mitigation: Unicode-aware tokenization + stopword filtering + stronger overlap thresholds + tests.

Stack: 4/9, depends on #25555.

greptile-apps

_{6 files reviewed, 1 comment}

_{Edit Code Review Agent Settings | Greptile}

greptile-apps · 2026-02-24T15:29:36Z

src/agents/pi-extensions/compaction-safeguard.ts

+function extractOpaqueIdentifiers(text: string): string[] {
+  const matches =
+    text.match(
+      /([A-Fa-f0-9]{8,}|https?:\/\/\S+|\/[\w./-]+|[A-Za-z]:\\[\w\\.-]+|[A-Za-z0-9._-]+\.[A-Za-z0-9._/-]+:\d{1,5}|\b\d{6,}\b)/g,


regex pattern \/[\w./-]+ will match single-char paths like /a or /x, which are likely false positives

the pattern intends to match Unix-style paths but lacks a minimum length constraint; consider requiring at least 2 path segments or a minimum length to avoid matching single-letter paths that are unlikely to be meaningful identifiers

Prompt To Fix With AI

This is a comment left during a code review. Path: src/agents/pi-extensions/compaction-safeguard.ts Line: 294 Comment: regex pattern `\/[\w./-]+` will match single-char paths like `/a` or `/x`, which are likely false positives the pattern intends to match Unix-style paths but lacks a minimum length constraint; consider requiring at least 2 path segments or a minimum length to avoid matching single-letter paths that are unlikely to be meaningful identifiers How can I resolve this? If you propose a fix, please make it concise.

openclaw-barnacle · 2026-03-04T04:12:49Z

This pull request has been automatically marked as stale due to inactivity.
Please add updates or it will be closed.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: b16cd70531

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-03-04T19:08:39Z

src/agents/pi-extensions/compaction-safeguard.ts

+  const missingIdentifiers = params.identifiers.filter((id) => !params.summary.includes(id));
+  if (missingIdentifiers.length > 0) {
+    reasons.push(`missing_identifiers:${missingIdentifiers.slice(0, 3).join(",")}`);


Respect identifier policy during quality audit

auditSummaryQuality always treats omitted identifiers as a failure and the retry path then tells the model to “include every required section with exact identifiers preserved,” which overrides the existing identifierPolicy behavior from resolveExactIdentifierSectionInstruction (including off and redaction-oriented custom policies). In sessions configured to avoid strict identifier retention, this can force sensitive IDs/tokens back into persisted compaction summaries and changes configured behavior rather than enforcing it.

Useful? React with 👍 / 👎.

Addressed in 183e644. The quality audit is now policy-aware: identifier-presence checks run only when identifierPolicy is strict, and retry feedback no longer forces strict retention when policy is off/custom.

greptile-apps · 2026-03-04T19:15:44Z

Greptile Summary

This PR introduces a bounded quality-audit and retry flow for compaction summaries, along with several hardening fixes across follow-up commits. When qualityGuardEnabled is set (opt-in, defaults to false), the summarization loop audits the generated summary for required section headings, identifier retention, and latest-ask overlap; if the audit fails and retries remain, it feeds structured feedback into the next attempt and falls back to the last successful summary on retry errors.

Key changes:

Quality guard (auditSummaryQuality, extractOpaqueIdentifiers, hasAskOverlap): exact heading-line matching (consistent with hasRequiredSummarySections), policy-aware identifier enforcement, Unicode-aware latest-ask overlap with stopword filtering and double-match threshold for longer asks.
Scoped audit inputs: identifier seeding and latestUserAsk extraction now use only model-summarized content (messagesToSummarize + turnPrefixMessages), excluding preservedRecentMessages — preventing verbatim preserved turns from masking omissions in the generated summary.
Shared sanitization: local sanitizeUntrustedInstructionText removed; replaced by the new exported wrapUntrustedPromptDataBlock in src/agents/sanitize-for-prompt.ts.
Retry fallback: on retry failure, the last successful summary is kept rather than propagating the error and cancelling compaction.
Hex identifier normalization: pure-hex identifiers are normalized to uppercase and matched case-insensitively to prevent false-positive retention failures across case variations.
isQueryStopWordToken extracted from extractKeywords in query-expansion.ts for reuse.

All previously raised review concerns (section substring vs. line match, preserved-turn identifier bypass, hex case-sensitivity, unsanitized reasons) have been addressed and verified in this branch.

Confidence Score: 4/5

This PR is safe to merge; quality guard is opt-in (defaults off), all critical issues from prior review rounds are verified as fixed, and retry fallback behavior is sound.
The PR successfully hardens compaction summary quality with a bounded retry mechanism. All previously raised correctness concerns (section heading matching, identifier/ask extraction scope, hex normalization, prompt sanitization) have been verified as addressed in the code. The feature is gated behind qualityGuardEnabled: false by default, eliminating production risk without explicit opt-in. One minor observation remains: the qualityGuardMaxRetries = 0 edge case creates a silent no-op (audit runs but no retry occurs), which is worth documenting or warning about for operators who enable the feature. This is a configuration/UX concern, not a correctness or safety issue.
src/agents/pi-extensions/compaction-safeguard.ts — one configuration edge case noted in feedback; otherwise the complex retry loop is well-implemented and thoroughly hardened.

_{Last reviewed commit: 268b292}

greptile-apps · 2026-03-04T20:52:14Z

src/agents/pi-extensions/compaction-safeguard.ts

+        const canRegenerate =
+          messagesToSummarize.length > 0 ||
+          (preparation.isSplitTurn && turnPrefixMessages.length > 0);
+        if (!qualityGuardEnabled || !canRegenerate) {
+          summary = summaryWithPreservedTurns;
+          break;
+        }
+        const quality = auditSummaryQuality({
+          summary: summaryWithoutPreservedTurns,
+          identifiers,
+          latestAsk: latestUserAsk,
+          identifierPolicy,
        });
-        const splitTurnSection = `**Turn Context (split turn):**\n\n${prefixSummary}`;
-        summary = historySummary.trim()
-          ? `${historySummary}\n\n---\n\n${splitTurnSection}`
-          : splitTurnSection;
+        summary = summaryWithPreservedTurns;
+        if (quality.ok || attempt >= totalAttempts - 1) {
+          break;


When qualityGuardEnabled = true but qualityGuardMaxRetries = 0 (reachable via resolveQualityGuardMaxRetries(-1) → 0), totalAttempts = 1. The quality audit runs but the condition attempt >= totalAttempts - 1 evaluates to true on the first iteration (0 >= 0), causing an immediate break without any retry attempt. The quality feedback is generated but discarded, making the retry feature silently inactive.

This is a valid configuration path (the clamping test explicitly checks resolveQualityGuardMaxRetries(-1) === 0), but callers enabling the guard with 0 retries will see no retry behavior without any warning, making the feature a no-op.

Consider either:

Logging a warning when the guard is enabled with 0 retries, or

Documenting that qualityGuardMaxRetries: 0 disables the retry loop while keeping the audit flag semantically enabled

Prompt To Fix With AI

This is a comment left during a code review. Path: src/agents/pi-extensions/compaction-safeguard.ts Line: 917-932 Comment: When `qualityGuardEnabled = true` but `qualityGuardMaxRetries = 0` (reachable via `resolveQualityGuardMaxRetries(-1) → 0`), `totalAttempts = 1`. The quality audit runs but the condition `attempt >= totalAttempts - 1` evaluates to true on the first iteration (`0 >= 0`), causing an immediate break without any retry attempt. The quality feedback is generated but discarded, making the retry feature silently inactive. This is a valid configuration path (the clamping test explicitly checks `resolveQualityGuardMaxRetries(-1) === 0`), but callers enabling the guard with 0 retries will see no retry behavior without any warning, making the feature a no-op. Consider either: 1. Logging a warning when the guard is enabled with 0 retries, or 2. Documenting that `qualityGuardMaxRetries: 0` disables the retry loop while keeping the audit flag semantically enabled How can I resolve this? If you propose a fix, please make it concise.

rodrigouroz · 2026-03-05T13:25:57Z

@jalehman This is my next follow up, Thank you!

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 1c8fe6b7b9

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-03-05T19:25:33Z

src/agents/pi-extensions/compaction-safeguard.ts

+            summaryWithoutPreservedTurns = historySummary.trim()
+              ? `${historySummary}\n\n---\n\n${splitTurnSection}`
+              : splitTurnSection;


Audit history summary before appending split-turn section

When isSplitTurn is true, summaryWithoutPreservedTurns is built by concatenating historySummary with splitTurnSection, and the quality audit runs on that combined text. If the main historySummary omits required headings, but the split-turn prefix summary still contains them, auditSummaryQuality can incorrectly pass and skip retries. This makes the new quality guard ineffective for split-turn compactions, because headings from the prefix can mask structural regressions in the actual history summary.

Useful? React with 👍 / 👎.

…rigouroz

…rigouroz

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 4f37e30dc0

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

chatgpt-codex-connector · 2026-03-05T21:07:30Z

src/agents/pi-extensions/compaction-safeguard.ts

+            summaryWithoutPreservedTurns = historySummary.trim()
+              ? `${historySummary}\n\n---\n\n${splitTurnSection}`
+              : splitTurnSection;


Audit history summary before appending split-turn section

In split-turn compactions, summaryWithoutPreservedTurns is built by concatenating historySummary with splitTurnSection before auditSummaryQuality runs, so required-heading checks can be satisfied by headings that exist only in the prefix summary. This means a malformed historySummary (missing ## Decisions, etc.) can still pass quality audit whenever the split-turn prefix contains those headings, causing retries to be skipped and degraded history summaries to persist in split-turn sessions.

Useful? React with 👍 / 👎.

…rigouroz

…rigouroz

@jalehman

Merged via squash. Prepared head SHA: be473ef Co-authored-by: rodrigouroz <384037+rodrigouroz@users.noreply.github.com> Co-authored-by: jalehman <550978+jalehman@users.noreply.github.com> Reviewed-by: @jalehman

@jalehman

Merged via squash. Prepared head SHA: be473ef Co-authored-by: rodrigouroz <384037+rodrigouroz@users.noreply.github.com> Co-authored-by: jalehman <550978+jalehman@users.noreply.github.com> Reviewed-by: @jalehman

@jalehman

Merged via squash. Prepared head SHA: be473ef Co-authored-by: rodrigouroz <384037+rodrigouroz@users.noreply.github.com> Co-authored-by: jalehman <550978+jalehman@users.noreply.github.com> Reviewed-by: @jalehman

rodrigouroz mentioned this pull request Feb 24, 2026

Config/Compaction: expose safeguard preserve and quality settings #25557

Merged

18 tasks

openclaw-barnacle bot added agents Agent runtime and tooling size: L experienced-contributor labels Feb 24, 2026

greptile-apps bot reviewed Feb 24, 2026

View reviewed changes

rodrigouroz force-pushed the codex/pr20038-04 branch 2 times, most recently from 6db0640 to 8971441 Compare February 24, 2026 15:38

rodrigouroz mentioned this pull request Feb 24, 2026

(fix): Compaction: preserve recent context and sync session memory post-compact #20038

Closed

18 tasks

rodrigouroz force-pushed the codex/pr20038-04 branch from 8971441 to 94ab001 Compare February 24, 2026 15:52

openclaw-barnacle bot added the app: web-ui App: web-ui label Feb 24, 2026

rodrigouroz force-pushed the codex/pr20038-04 branch 5 times, most recently from f073ec7 to a35475c Compare February 24, 2026 16:39

rodrigouroz marked this pull request as draft February 24, 2026 16:59

rodrigouroz force-pushed the codex/pr20038-04 branch from a35475c to d783ffb Compare February 26, 2026 17:30

openclaw-barnacle bot removed the app: web-ui App: web-ui label Feb 26, 2026

openclaw-barnacle bot added the stale Marked as stale due to inactivity label Mar 4, 2026

rodrigouroz force-pushed the codex/pr20038-04 branch from d783ffb to b16cd70 Compare March 4, 2026 19:03

openclaw-barnacle bot added size: M and removed size: L labels Mar 4, 2026

rodrigouroz marked this pull request as ready for review March 4, 2026 19:04

chatgpt-codex-connector bot reviewed Mar 4, 2026

View reviewed changes

greptile-apps bot reviewed Mar 4, 2026

View reviewed changes

chatgpt-codex-connector bot reviewed Mar 5, 2026

View reviewed changes

jalehman self-assigned this Mar 5, 2026

jalehman force-pushed the codex/pr20038-04 branch from 1c8fe6b to 4f37e30 Compare March 5, 2026 21:01

jalehman added a commit to rodrigouroz/openclaw that referenced this pull request Mar 5, 2026

fix: keep safeguard quality guard opt-in (openclaw#25556) thanks @rod…

4f37e30

…rigouroz

openclaw-barnacle bot added size: XL and removed size: L labels Mar 5, 2026

chatgpt-codex-connector bot reviewed Mar 5, 2026

View reviewed changes

rodrigouroz and others added 13 commits March 5, 2026 13:38

Compaction/Safeguard: add summary quality audit retries

3c0039c

Compaction/Safeguard: align quality audit policy checks

5f1afa9

Compaction/Safeguard: harden summary quality checks

c0b22c0

Compaction/Safeguard: avoid preserved-turn audit bypass

6bc3178

Compaction/Safeguard: reduce quality-audit false positives

083787d

Compaction/Safeguard: harden ask overlap and retry fallback

7afc8ba

Compaction/Safeguard: fix strict test message casts

c7e5128

Compaction/Safeguard: opt-in quality retries and sanitize feedback

0c8c786

Compaction/Safeguard: reuse shared untrusted prompt wrapper

b11cf2e

Compaction/Safeguard: normalize hex identifier retention checks

73b8876

Compaction/Safeguard: reuse shared multilingual stopword logic

d780f62

Agents: wire safeguard quality guard + CJK overlap

566a795

fix: keep safeguard quality guard opt-in (openclaw#25556) thanks @rod…

be473ef

…rigouroz

jalehman force-pushed the codex/pr20038-04 branch from 4f37e30 to be473ef Compare March 5, 2026 21:38

jalehman merged commit 036c329 into openclaw:main Mar 5, 2026
12 checks passed

github-actions bot mentioned this pull request Mar 5, 2026

📡 Upstream Digest — 2026-03-05 22:24 UTC curtismercier/openclaw-mods#187

Open

alexyyyander mentioned this pull request Mar 7, 2026

fix/gateway token mismatch 38617 #38676

Closed

alexey-pelykh mentioned this pull request Mar 10, 2026

Cherry-pick: compaction safeguards, prompt hooks, config fixes remoteclaw/remoteclaw#823

Open

Uh oh!

Conversation

rodrigouroz commented Feb 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Change Type (select all)

Scope (select all touched areas)

Linked Issue/PR

User-visible / Behavior Changes

Security Impact (required)

Repro + Verification

Environment

Steps

Expected

Actual

Evidence

Human Verification (required)

Compatibility / Migration

Failure Recovery (if this breaks)

Risks and Mitigations

Uh oh!

greptile-apps bot left a comment

Choose a reason for hiding this comment

Uh oh!

greptile-apps bot Feb 24, 2026

Choose a reason for hiding this comment

Uh oh!

openclaw-barnacle bot commented Mar 4, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Mar 4, 2026

Choose a reason for hiding this comment

Uh oh!

rodrigouroz Mar 4, 2026

Choose a reason for hiding this comment

Uh oh!

greptile-apps bot commented Mar 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Greptile Summary

Confidence Score: 4/5

Uh oh!

greptile-apps bot Mar 4, 2026

Choose a reason for hiding this comment

Uh oh!

rodrigouroz commented Mar 5, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Mar 5, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Mar 5, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

rodrigouroz commented Feb 24, 2026 •

edited

Loading

greptile-apps bot commented Mar 4, 2026 •

edited

Loading