fix(reply): gate preflight compaction fast-path on token threshold (#63892) by 070freebird070-ctrl · Pull Request #64384 · openclaw/openclaw

070freebird070-ctrl · 2026-04-10T15:34:59Z

Summary

Fixes #63892. After the first compaction, `entry.totalTokensFresh` is set to `true` by `incrementRunCompactionCount`. On subsequent runs, `runPreflightCompactionIfNeeded` unconditionally short-circuits at `src/auto-reply/reply/agent-runner-memory.ts:379` because `shouldUseTranscriptFallback` is `false`. The stale-tokens path below (which is what runs the actual threshold check) is never reached, so proactive compaction never re-fires. Overflow-retry recovery still worked because it runs through a separate path in `src/auto-reply/reply/pi-embedded-runner/run.ts` that catches `isLikelyContextOverflowError` directly, bypassing the preflight gate entirely.

Problem: Proactive compaction never re-fires after the first checkpoint on long-lived sessions. Users only see compaction when a request actually overflows the context window.
Why it matters: On 100k+ sessions (reported against GLM-5.1:cloud, observed against Anthropic + Gemini providers in the issue thread) this turns every long conversation into an eventual overflow-and-retry instead of a smooth proactive checkpoint.
What changed: The preflight fast-path early return now also requires `freshPersistedTokens < preflightThreshold`. When fresh tokens have grown back above the threshold, the guard falls through and the existing `shouldRunPreflightCompaction` path runs normally. The duplicate `threshold` local was consolidated onto the new `preflightThreshold` that the guard uses, so the `logVerbose` output is unchanged.
What did NOT change (scope boundary): No changes to `shouldRunPreflightCompaction`, `incrementRunCompactionCount`, the overflow-retry path, or the stale-tokens transcript-fallback branch. No config/schema/public SDK surface touched.

Change Type

Bug fix

Scope

Memory / storage
Agents / orchestration (preflight compaction)

Verification

`pnpm test src/auto-reply/reply/agent-runner-memory.test.ts` — 5/5 passing
`pnpm check` — clean (tsgo, lint, all policy gates)
`pnpm build` — clean
Regression test covers both directions:
- `totalTokensFresh: true` + tokens above threshold → `compactEmbeddedPiSession` is called (proactive compaction fires — this is the bug fix).
- `totalTokensFresh: true` + tokens below threshold → fast-path early return still fires unchanged.

I also verified the equivalent gate on the compiled dist of v2026.4.9 on a long-running gateway against GLM-5.1:cloud (`contextWindowTokens` ≈ 200k, `reserveTokensFloor` 40k, `softThresholdTokens` 25k). Before the patch: zero proactive checkpoints after the first, overflow-retry checkpoints only. After the patch: proactive checkpoints fire on every subsequent cycle as expected.

Credit

The root cause analysis, exact file/line pointers, and the fix direction are all @martingarramon's from the issue thread. My earlier hypothesis about `compactionCount` latching was wrong — thanks for the correction. @mjamiv independently confirmed the repro on v2026.4.9, which is what motivated digging past the first hypothesis.

AI-assisted disclosure

This PR was written with AI assistance (Claude). I (the submitter) read and understand the change, verified the fix against the live compiled runtime before writing the source patch, and wrote the regression tests to directly exercise the `totalTokensFresh === true` re-fire case.

Fixes #63892

…penclaw#63892)

greptile-apps · 2026-04-10T15:38:10Z

Greptile Summary

This PR fixes a bug where proactive preflight compaction stops firing after the first compaction checkpoint. Once incrementRunCompactionCount marks entry.totalTokensFresh = true, the old early-return guard at agent-runner-memory.ts:388 would unconditionally short-circuit on every subsequent call, bypassing the token-count check entirely. The fix adds a freshPersistedTokensBelowThreshold predicate so the fast-path early return only fires when fresh tokens are actually still below the compaction threshold — letting grown sessions fall through to shouldRunPreflightCompaction as intended. The overflow-retry path (pi-embedded-runner/run.ts) is unaffected. Two regression tests directly exercise the totalTokensFresh === true re-fire and the unchanged below-threshold skip.

Confidence Score: 5/5

Safe to merge — targeted one-line logic fix with correct tests and no surface-area changes outside the preflight gate.

The fix is logically sound: the guard is coherent with shouldUseTranscriptFallback (when totalTokensFresh !== false and tokens are valid, freshPersistedTokens is guaranteed to be a positive number, so the new predicate behaves correctly in all reachable states). Both directions of the condition are covered by regression tests. No config, schema, or public SDK surface was touched. No remaining P0/P1 findings.

No files require special attention.

_{Reviews (1): Last reviewed commit: "fix(reply): gate preflight compaction fa..." | Re-trigger Greptile}

prtags · 2026-04-23T12:20:44Z

Related work from PRtags group infinite-dodo-7d3e

Title: Preflight compaction skips fresh token totals

Number	Title
#63892	Title unavailable
#64384*	fix(reply): gate preflight compaction fast-path on token threshold (#63892)
#65600	Title unavailable
#65622	fix(agents): reevaluate preflight compaction on fresh totals
#66520	Title unavailable
#66716	fix: auto-compaction fires on fresh cached token counts (#66520)

* This PR

clawsweeper · 2026-04-27T02:23:31Z

Codex review: needs real behavior proof before merge. Reviewed June 6, 2026, 12:49 AM ET / 04:49 UTC.

Summary
Review failed before ClawSweeper could summarize the requested change.

PR surface: Source +8, Tests +89, Docs +1. Total +98 across 3 files.

Reproducibility: unclear. The review failed before ClawSweeper could establish a reproduction path.

Review metrics: none identified.

Merge readiness
Overall: 🌊 off-meta tidepool
Proof: 🌊 off-meta tidepool
Patch quality: 🌊 off-meta tidepool
Result: rating does not apply to this item.

Overall follows the weaker of proof and patch quality, so missing proof can cap an otherwise strong patch.

Risk before merge

[P1] No close action taken because the review did not complete.

Maintainer options:

Decide the mitigation before merge
Retry the Codex review after fixing the execution failure.
Pause or close
Do not merge this PR until maintainers decide whether the risk is worth taking.

Next step before merge

[P1] Review did not complete, so no work-lane recommendation was made.

Review details

Best possible solution:

Retry the Codex review after fixing the execution failure.

Do we have a high-confidence way to reproduce the issue?

Unclear. The review failed before ClawSweeper could establish a reproduction path.

Is this the best way to solve the issue?

Unclear. Retry the review first so ClawSweeper can evaluate the actual issue and fix direction.

AGENTS.md: unclear because the file could not be read completely.

Codex review notes: model gpt-5.5, reasoning high; reviewed against 9313471fa579.

Label changes

Label changes:

remove P1: Current review triage priority is none.
remove merge-risk: 🚨 session-state: Current PR review selected no merge-risk labels.

Label justifications:

rating: 🌊 off-meta tidepool: Overall readiness is 🌊 off-meta tidepool; proof is 🌊 off-meta tidepool and patch quality is 🌊 off-meta tidepool.

Evidence reviewed

PR surface:

Source +8, Tests +89, Docs +1. Total +98 across 3 files.

View PR surface stats

Area	Files	Added	Removed	Net
Source	1	12	4	+8
Tests	1	90	1	+89
Docs	1	1	0	+1
Config	0	0	0	0
Generated	0	0	0	0
Other	0	0	0	0
Total	3	103	5	+98

What I checked:

failure reason: codex execution failed.
codex failure detail: Codex review failed for this PR with exit 1.
codex stdout: Per-item Codex failure; continuing with the rest of the shard.

Likely related people:

unknown: Codex failed before it could trace repository history. (role: review did not complete; confidence: low)

What the crustacean ranks mean

🦀 challenger crab: rare, exceptional readiness with strong proof, clean implementation, and convincing validation.
🦞 diamond lobster: very strong readiness with only minor maintainer review expected.
🐚 platinum hermit: good normal PR, likely mergeable with ordinary maintainer review.
🦐 gold shrimp: useful signal, but proof or patch confidence is still limited.
🦪 silver shellfish: thin signal; proof, validation, or implementation needs work.
🧂 unranked krab: not merge-ready because proof is missing/unusable or there are serious correctness or safety concerns.
🌊 off-meta tidepool: rating does not apply to this item.

Shiny media proof means a screenshot, video, or linked artifact directly shows the changed behavior. Runtime, network, CSP, and security claims still need visible diagnostics.

How this review workflow works

ClawSweeper keeps one durable marker-backed review comment per issue or PR.
Re-runs edit this comment so the latest verdict, findings, and automation markers stay together instead of adding duplicate bot comments.
A fresh review can be triggered by eligible @clawsweeper re-review comments, exact-item GitHub events, scheduled/background review runs, or manual workflow dispatch.
PR/issue authors and users with repository write access can comment @clawsweeper re-review or @clawsweeper re-run on an open PR or issue to request a fresh review only.
Maintainers can also comment @clawsweeper review to request a fresh review only.
Fresh-review commands do not start repair, autofix, rebase, CI repair, or automerge.
Maintainer-only repair and merge flows require explicit commands such as @clawsweeper autofix, @clawsweeper automerge, @clawsweeper fix ci, or @clawsweeper address review.
Maintainers can comment @clawsweeper explain to ask for more context, or @clawsweeper stop to stop active automation.

clawsweeper · 2026-05-20T18:10:11Z

ClawSweeper PR egg

🎁 Pass real behavior proof to wake the egg and unlock a hatchable treat.

Where did the egg go?

The egg game starts only after the PR passes the real-behavior proof check.
Before that, no creature or rarity is rolled. The treat waits for real proof.
This is still just collectible flavor: proof affects review readiness, not creature quality.

openclaw-barnacle · 2026-06-05T04:58:20Z

This pull request has been automatically marked as stale due to inactivity.
Please add updates or it will be closed.

fix(reply): gate preflight compaction fast-path on token threshold (o…

c4cffe3

…penclaw#63892)

openclaw-barnacle Bot added the size: S label Apr 10, 2026

070freebird070-ctrl mentioned this pull request Apr 10, 2026

Proactive compaction scheduler never re-fires after first checkpoint (compactionCount latched at 1) #63892

Closed

clawsweeper Bot mentioned this pull request May 16, 2026

[Feature]: Runtime Token Budget Awareness and Proactive Context Compaction #80594

Open

openclaw-barnacle Bot added the triage: needs-real-behavior-proof Candidate: external PR needs after-fix proof from a real setup. label May 19, 2026

openclaw-barnacle Bot added the stale Marked as stale due to inactivity label Jun 5, 2026

openclaw-barnacle Bot removed the stale Marked as stale due to inactivity label Jun 6, 2026

yetval mentioned this pull request Jun 8, 2026

fix(reply): project preflight compaction gate by next-input size on fresh tokens #91488

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(reply): gate preflight compaction fast-path on token threshold (#63892)#64384

fix(reply): gate preflight compaction fast-path on token threshold (#63892)#64384
070freebird070-ctrl wants to merge 1 commit into
openclaw:mainfrom
070freebird070-ctrl:fix/reply-proactive-compaction-63892

070freebird070-ctrl commented Apr 10, 2026

Uh oh!

greptile-apps Bot commented Apr 10, 2026

Uh oh!

prtags Bot commented Apr 23, 2026

Uh oh!

clawsweeper Bot commented Apr 27, 2026 •

edited

Loading

Uh oh!

clawsweeper Bot commented May 20, 2026

Uh oh!

openclaw-barnacle Bot commented Jun 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

070freebird070-ctrl commented Apr 10, 2026

Summary

Change Type

Scope

Verification

Credit

AI-assisted disclosure

Uh oh!

greptile-apps Bot commented Apr 10, 2026

Greptile Summary

Confidence Score: 5/5

Uh oh!

prtags Bot commented Apr 23, 2026

Uh oh!

clawsweeper Bot commented Apr 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

clawsweeper Bot commented May 20, 2026

Uh oh!

openclaw-barnacle Bot commented Jun 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

clawsweeper Bot commented Apr 27, 2026 •

edited

Loading