fix(continuation): reset chain budget on fresh non-wake turn-entry (#987) by scribe-dandelion-cult · Pull Request #989 · karmaterminal/openclaw

scribe-dandelion-cult · 2026-06-10T21:40:45Z

Closes #987.

What

continuationChainCount (the /status n/200) + continuationChainTokens (token-cost cap) accumulate per-session and never reset on a normal user-turn — the only reset-site in the tree is agent-runner-session-reset.ts:88 (full session-rotation: /new, /reset, compaction-FAILURE-recovery, role-conflict, ACP-reset). So a long-lived session monotonically climbs toward maxChainLength (200) with no decline until /new — figs's "pool of 195 forever" doom-lock. This is the bug, byte-confirmed at 6+ source convergence (figs's intuition + frond-scribe byte-walk + Silas filing + Emeric retraction + Elliott reconciliation + Rune retraction; the recurring ?? 0 LOAD-not-RESET conflation was caught + retracted by 4 princes).
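The "?? 0 LOAD-not-RESET" conflation mentioned above can be shown in isolation. This is a minimal sketch, not the repository's code: the `SessionEntry` shape is trimmed to the two counters named in this description, and `loadChainCount` is a hypothetical stand-in for the real load path.

```typescript
// Hypothetical, trimmed-down shape: only the two counters named in the PR text.
interface SessionEntry {
  continuationChainCount?: number;
  continuationChainTokens?: number;
}

// Hypothetical loader. `?? 0` only supplies a default when the field is
// absent (undefined or null); it never lowers a value already persisted.
function loadChainCount(entry: SessionEntry): number {
  return entry.continuationChainCount ?? 0;
}

const fresh: SessionEntry = {};
const longLived: SessionEntry = { continuationChainCount: 195 };

console.log(loadChainCount(fresh)); // 0
console.log(loadChainCount(longLived)); // 195
```

Reading the `?? 0` as a reset is the trap: it yields 0 only for an entry that never had a count, so a long-lived entry keeps whatever it has accumulated.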

Fix

Reset all 4 chain-budget fields — continuationChainCount + continuationChainTokens + continuationChainStartedAt + a fresh continuationChainId — at turn-entry before the runner reads chain-state, gated on isContinuationWake === false (a genuine external turn: user message / heartbeat / non-continuation system-event). Continuation-wake turns (work-wake / delegate-return) preserve the count, so the 200 leash still bounds an unbroken unattended self-continuation chain — its actual safety intent — not session-lifetime.

Plain-language invariant: the cap counts how deep a self-scheduled loop runs while you're away; the instant you (or a heartbeat) re-engage, it's not a runaway anymore, so it resets to 0.

Reuses the local persistContinuationChainState (mem + store + disk), zero new imports.
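As a rough illustration of the gated reset described above, the sketch below resets the four fields as a unit when the turn is not a continuation wake. It is written under assumptions: `ChainState`, `resetChainBudgetIfFreshTurn`, and the counter-based `mintChainId` are hypothetical names for illustration, and persistence (which the PR delegates to `persistContinuationChainState`) is left out.

```typescript
// Hypothetical container for the four chain-budget fields named in the PR.
interface ChainState {
  continuationChainCount: number;
  continuationChainTokens: number;
  continuationChainStartedAt: number;
  continuationChainId: string;
}

// Hypothetical id minting; the real code may mint chain ids differently.
let nextChainId = 0;
function mintChainId(): string {
  nextChainId += 1;
  return `chain-${nextChainId}`;
}

// Fresh external turn: all four fields reset together.
// Continuation wake (work-wake / delegate-return): state is returned untouched.
function resetChainBudgetIfFreshTurn(
  state: ChainState,
  isContinuationWake: boolean,
  now: number,
): ChainState {
  if (isContinuationWake) {
    return state;
  }
  return {
    continuationChainCount: 0,
    continuationChainTokens: 0,
    continuationChainStartedAt: now,
    continuationChainId: mintChainId(),
  };
}

const stale: ChainState = {
  continuationChainCount: 195,
  continuationChainTokens: 90_000,
  continuationChainStartedAt: 1,
  continuationChainId: "chain-old",
};

const afterUserTurn = resetChainBudgetIfFreshTurn(stale, false, 1_000);
const afterWorkWake = resetChainBudgetIfFreshTurn(stale, true, 1_000);
console.log(afterUserTurn.continuationChainCount); // 0
console.log(afterWorkWake.continuationChainCount); // 195
```

Returning a whole new object for the fresh-turn case keeps the four fields from being reset piecemeal, which is the "as a unit" property the review section asks reviewers to verify.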

Diff

3 files, +211/-6:

src/auto-reply/reply/agent-runner.ts (+28): the gated reset at turn-entry
agent-runner.continuation-work-span.test.ts (+185): fresh-resets / wake-preserves / /new-still-resets / #982-fan-out-intact + 3 new #987 tests
agent-runner.continuation-span-uniformity.test.ts (+4): wake-semantics alignment

Gate status

Targeted shard auto-reply-reply GREEN: 153 files / 2573 tests (incl. 3 new #987 tests; #982 array-capture fan-out intact)
tsgo core + core-test: rc=0
Full-suite: ~76,455 passed / 32 failed — all 32 are pre-existing baseline/env in UNTOUCHED subsystems (model-selection [reproduced deterministically in isolation], secrets/slack-contract, imessage-retry, telegram-album, matrix-crypto, memory-lancedb, memory-core, deadcode-pnpm) + 1 gateway-cpu QA. None in changed files. Same baseline-flake set as the assembly's standing CI.

Review

🪨 Rune + 🌻 Elliott on the continuation-lifecycle surface (per cohort review-pairing). Verify: reset fires upstream of loadContinuationChainState, all 4 fields as a unit, no phantom-depth on the status-line, runaway-chain still capped on pure self-wakes.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

The runaway-guard counters continuationChainCount (n/maxChainLength) and continuationChainTokens (cost cap) persisted on the SessionEntry and were only cleared on full session-rotation. A long-lived session therefore accumulated toward the chain/cost caps across unrelated re-engagements until every continuation was rejected forever, even though they are per-chain runaway leashes, not lifetime budgets.

Reset the chain budget (count, tokens, startedAt, fresh chainId) at turn-entry BEFORE inference whenever the turn is not a continuation wake (work-wake / delegate-return). Fresh inbound messages, plain heartbeats, and outside-machinery system-events all start a new chain, so the resetting turn itself opens at 0. Continuation wakes are mid-chain steps and never reset, so a true runaway with no fresh re-entry still trips the cap.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Adds explicit RFC v10 language for the chain-budget reset semantics that #987 ships, per figs's "language must be explicit" ask + Frond's RFC-adoption-call to fold edits atomic with the code.

Four load-bearing language updates:

1. §2.3 Safety model (line ~186): clarify "configured continuation budget" = current self-continuation chain budget (maxChainLength + costCapTokens), not session-lifetime. Notes fresh non-continuation turn-entry resets per §3.3.
2. §3.3 Chain-state tracking (lines ~507-525): adds continuationChainId to the four-field bullet list + new "Chain-state lifecycle" paragraph explaining the per-turn reset on !isContinuationWake, pre-loadContinuationChainState ordering, fresh-turn-elects-from-0 semantic, work-wake/delegate-return preservation. Names the session-rotation reset path distinct from per-turn chain-reset.
3. §5.1 Operational notes (lines ~918-919): maxChainLength + costCapTokens descriptions expanded to name "unattended self-continuation chain depth" leash semantic + reset trigger + cross-reference §3.3.
4. §5.1 New "Chain budget lifecycle" subsection: explicit sawtooth behavior description for /status display + four-field reset-unit + chain-id rotation semantic + methodological-note for source-readers naming the ?? 0 + chainId-mint-ternary as passive-default-not-active-reset (the trap that four cohort-princes hit + retracted today before locking the byte).

Together with the code in c201f7c, this ensures the RFC language matches the shipped semantic atomically: no doc-drift on the chain-reset surface. Cohort-byte-converged through 6+ sources today.

Cohort-cross-reference: methodological-note specifically names the loadContinuationChainState ?? 0 + chainId-mint-ternary source-reading traps that cohort byte-walked + 4 princes retracted from today (banked at ~/.openclaw/workspace/memory/2026-06-10.md, lesson #9).
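The sawtooth that the RFC text describes for the /status display can be pictured with a small simulation. This is a sketch under loud assumptions, not the runner's logic: it assumes every continuation wake advances the count by exactly one, models a wake at the cap as rejected (count unchanged), and uses hypothetical names (`TurnKind`, `statusAfterEachTurn`). Only the cap value 200 and the n/200 format come from the PR text.

```typescript
// Hypothetical turn taxonomy, following the wake kinds named in the PR.
type TurnKind = "user" | "heartbeat" | "work-wake" | "delegate-return";

const MAX_CHAIN_LENGTH = 200; // maxChainLength value quoted in the PR

function isContinuationWake(kind: TurnKind): boolean {
  return kind === "work-wake" || kind === "delegate-return";
}

// Returns the n/200 status string after each turn.
function statusAfterEachTurn(turns: TurnKind[], startCount: number): string[] {
  let count = startCount;
  const lines: string[] = [];
  for (const kind of turns) {
    if (!isContinuationWake(kind)) {
      count = 0; // fresh external turn: chain budget resets
    } else if (count < MAX_CHAIN_LENGTH) {
      count += 1; // assumption: one wake is one chain step
    }
    // At the cap the wake is modelled as rejected, so count stays put.
    lines.push(`${count}/${MAX_CHAIN_LENGTH}`);
  }
  return lines;
}

const sawtooth = statusAfterEachTurn(
  ["work-wake", "work-wake", "user", "work-wake", "heartbeat"],
  195,
);
console.log(sawtooth.join(" ")); // 196/200 197/200 0/200 1/200 0/200

// A pure self-wake run with no fresh re-entry still hits the cap.
const runaway = statusAfterEachTurn(new Array<TurnKind>(300).fill("work-wake"), 0);
console.log(runaway[runaway.length - 1]); // 200/200
```

The first run shows the drop to 0 on each external re-engagement; the second shows that the leash still bounds an unbroken unattended chain.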

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: cb35f4c9f5


chatgpt-codex-connector · 2026-06-10T22:14:24Z

+      continuationFeatureEnabled &&
+      sessionKey &&
+      activeSessionEntry &&
+      !isContinuationWake &&


Reset stale budgets for ordinary subagent returns

This guard treats every delegate-return as an in-chain wake, but src/agents/subagent-announce.ts:1459-1461 sets continuationTrigger: "delegate-return" for ordinary subagent completions whenever continuation is enabled, and src/agents/subagent-announce-delivery.ts:1435-1444 delivers those as inter-session agent calls. In that scenario a long-lived session with stale continuationChainCount can receive a normal subagent completion and still skip the new reset, so any continuation elected from that fresh completion can be rejected against the old chain cap; the #987 fix remains incomplete for a non-continuation external turn. Please distinguish actual continuation-chain delegate returns from ordinary subagent returns before suppressing the reset.
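One way to picture the distinction the review asks for is sketched below. How #992 actually draws this line is not shown on this page, so everything here is an assumption: `InboundTurn`, the `originChainId` marker, and `isInChainWake` are hypothetical names, and only the `continuationTrigger: "delegate-return"` value comes from the review text.

```typescript
// Hypothetical inbound-turn shape. `originChainId` is an invented marker,
// assumed to be set only when a continuation step spawned the delegate.
interface InboundTurn {
  continuationTrigger?: "work-wake" | "delegate-return";
  originChainId?: string;
}

// Returns true only for turns that belong to the currently active chain.
// An ordinary subagent completion carries the "delegate-return" trigger
// but no matching chain marker, so it counts as a fresh turn.
function isInChainWake(turn: InboundTurn, activeChainId: string | undefined): boolean {
  if (turn.continuationTrigger === "work-wake") {
    return true;
  }
  if (turn.continuationTrigger === "delegate-return") {
    return turn.originChainId !== undefined && turn.originChainId === activeChainId;
  }
  return false;
}

const ordinaryReturn: InboundTurn = { continuationTrigger: "delegate-return" };
const chainReturn: InboundTurn = {
  continuationTrigger: "delegate-return",
  originChainId: "chain-7",
};

console.log(isInChainWake(ordinaryReturn, "chain-7")); // false
console.log(isInChainWake(chainReturn, "chain-7")); // true
```

With a predicate like this in place of the bare trigger check, the reset guard would fire for the ordinary completion and stay suppressed for the genuine mid-chain return.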


fix(continuation): #987-completion — reset chain-budget for ordinary subagent-returns (#989-P2-1)

scribe-dandelion-cult mentioned this pull request Jun 10, 2026

continuation: busy-retry-loop — unbounded re-arm + uncancellable-on-busy-session (live 2-seat wedge; #982 fast-follow) #990

Open

chatgpt-codex-connector Bot reviewed Jun 10, 2026

scribe-dandelion-cult merged commit 3084bf1 into frond-scribe/20260609/assembly-token-wiring Jun 11, 2026
132 of 145 checks passed

scribe-dandelion-cult deleted the codeagent/987-chain-reset branch June 11, 2026 00:05

This was referenced Jun 11, 2026

fix(continuation): #988 fast-follow — spawn-init cap-notice (P2-2) + maxPendingWork config-docs baseline (P2-3) #991

Merged

fix(continuation): #987-completion — reset chain-budget for ordinary subagent-returns (#989-P2-1) #992

Merged

scribe-dandelion-cult added a commit that referenced this pull request Jun 11, 2026

Merge pull request #992 from karmaterminal/codeagent/989-p2-reset-gate

a437ca7

fix(continuation): #987-completion — reset chain-budget for ordinary subagent-returns (#989-P2-1)

scribe-dandelion-cult merged 2 commits into frond-scribe/20260609/assembly-token-wiring from codeagent/987-chain-reset
