feat(auto-reply): run-generation fence for stronger interruptibility (refs #70319) by darconadalabarga · Pull Request #70363 · openclaw/openclaw

darconadalabarga · 2026-04-22T22:14:12Z

Closes (partially) #70319.

Summary

Introduces a per-session run-generation counter that layers on top of the existing abort primitives (replyRunRegistry, abortEmbeddedPiRun, chat.abort, cancelScope) to provide a unified invalidation signal downstream code can consult before producing side effects.

Goal from the issue: after /stop or a new user message, the superseded run must not produce more output — tool calls, deltas, typing, final replies.

This PR ships the foundation (Pieces A + partial C from the issue's implementation sketch) and wires one real emission site so the fence is live.

What's in this PR

Piece A — Run generation registry (new)

src/auto-reply/reply/run-generation.ts: getCurrentGeneration, incrementGeneration, isCurrentGeneration, forgetGeneration. Global-singleton backed.

Piece A wiring — `ReplyOperation` carries the generation

ReplyOperation.runGeneration captured at createReplyOperation.
ReplyOperation.isCurrent() convenience wrapper over isCurrentGeneration(key, runGeneration).
abortWithReason in the registry bumps the generation, so any abortByUser / abortForRestart path invalidates the captured value.

Fast-abort integration

tryFastAbortFromMessage in abort.ts also bumps generation up-front so /stop fences late output even when the registry lookup misses (race between end-of-run and the stop message).

Piece C — Stale-output fence (partial, opt-in)

createBlockReplyDeliveryHandler accepts an optional isRunCurrent?: () => boolean parameter. When provided and returns false, the block reply is silently dropped before hitting the channel. Callers pass () => replyOperation.isCurrent().

Live wiring

agent-runner-execution.ts passes the fence callback so post-abort block replies are actually dropped in the real runtime.

Piece D — SIGTERM→SIGKILL escalation

Already covered by src/process/kill-tree.test.ts. No change needed.

What's explicitly deferred

Piece B (pre-tool gate): wiring into pi-embedded-runner's tool dispatch. Pattern demonstrated in tests; wiring-only follow-up.
Piece E (new-message takeover): dispatch.ts / queue.ts integration. The primitive (incrementGeneration) is in place.
More emission sites: typing.ts, reply-delivery for non-block path, followup-delivery.ts. Same opt-in shape.

Tests

src/auto-reply/reply/run-generation.test.ts — 17 tests covering:

Registry primitives.
ReplyOperation generation wiring (capture at begin, flip on abort, next run captures new gen).
Stale-output fence pattern (Tests 2, 3, 5 from the issue).
Integration with createBlockReplyDeliveryHandler.

pnpm check:changed  → exit 0
  typecheck core    → ok
  typecheck tests   → ok
  lint core         → ok
  import cycles     → ok
  guards            → ok
  auto-reply suite  → 102 files / 1174 tests passed

pnpm build also passes clean.

Constraints honored

No change to gateway protocol.
No modification to AGENTS.md / CONTRIBUTING.md / CLAUDE.md.
Existing abort behavior unchanged — the new system layers on top.
TypeScript ESM, strict types, no any.
American English in code/comments.

Happy to break into smaller commits or reshape if the approach lands differently than maintainers prefer.

Introduce a per-session generation counter that complements the existing AbortController-based cancellation in replyRunRegistry. Each ReplyOperation now captures the session's current generation at run begin and exposes isCurrent() so downstream emission and tool-dispatch sites can fence stale side effects after an abort or new-message takeover. - src/auto-reply/reply/run-generation.ts: new registry (get/increment/ isCurrent/forget), global-singleton backed so it survives split chunks. - src/auto-reply/reply/reply-run-registry.ts: capture generation on createReplyOperation; bump generation in abortWithReason so any abortByUser / abortForRestart path invalidates the captured value. - src/auto-reply/reply/abort.ts: bump generation in tryFastAbortFromMessage so /stop fences late output even if the registry lookup misses. - src/auto-reply/reply/run-generation.test.ts: 16 tests covering the registry, registry wiring, and the stale-output / pre-tool / typing fence patterns described in CLAUDE_CODE_PROMPT.md tests 1, 2, 3, 5. - src/auto-reply/reply/agent-runner-execution.test.ts: update the ReplyOperation mock to include the two new fields. Piece D (SIGTERM->SIGKILL escalation) is already covered by the existing src/process/kill-tree.test.ts suite; no change needed. Pieces B (pre-tool gate), C (emission-site fences) and E (new-message takeover) are deferred to follow-up commits: each invasive emission site needs dedicated wiring that deserves its own focused review. Inspired by patterns from Hermes (NousResearch/hermes-agent). AI-assisted: yes (Claude Code / Opus 4.7)

Adds an opt-in stale-output fence parameter to createBlockReplyDeliveryHandler so callers can bail on late block replies from superseded runs. Pass `() => replyOperation.isCurrent()` to wire the fence to the run-generation registry introduced in the previous commit. Pure opt-in — existing callers that don't pass isRunCurrent keep current behavior. New test covers that the fence drops a post-abort delivery.

Pass `() => replyOperation.isCurrent()` into createBlockReplyDeliveryHandler so the stale-output fence is now active in the real runtime, not just in tests. A block reply from a superseded run (after /stop or a new user message) is silently dropped before hitting the channel. This is the first site where the generation registry actually changes runtime behavior. All 1174 auto-reply tests continue to pass.

clawsweeper · 2026-04-29T04:27:02Z

Thanks for the context here. I swept through the related work, and this is now duplicate or superseded.

Keep this PR open: it contains meaningful interruptibility work, but it is still draft/dirty, lacks real behavior proof, and now needs to be reconciled with current main's adjacent foreground-delivery fence before merge.

Canonical path: Close this stale PR. The latest review rated it F, the branch still lacks merge-ready proof, and there has been no human follow-up after the durable review.

So I’m closing this here because the remaining work is already tracked in the canonical issue.

Review details

Best possible solution:

Close this stale PR. The latest review rated it F, the branch still lacks merge-ready proof, and there has been no human follow-up after the durable review.

Do we have a high-confidence way to reproduce the issue?

No. Source inspection shows current main lacks this PR's run-scoped block-reply fence, but it also has a newer foreground delivery fence, so the remaining stale-output gap still needs live reproduction or after-fix proof.

Is this the best way to solve the issue?

Unclear. The run-generation foundation is plausible, but current main already has dispatch-level foreground delivery fencing and this PR only wires block replies, so maintainers should align the contracts or require proof that a separate run-level counter is necessary.

Security review:

Security review cleared: The diff adds in-memory cancellation bookkeeping and tests without changing dependencies, CI, secrets, auth, downloaded artifacts, or package execution paths.

AGENTS.md: found and applied where relevant.

What I checked:

stale F-rated PR: PR was opened 2026-04-22T22:14:12Z, is older than 30 days, and the latest review rated it F.
proof blocker: real behavior proof is missing and proof tier is F, so this branch is not merge-ready without contributor follow-up.
no human follow-up: live comments and timeline hydrated by apply contain no non-automation activity after the ClawSweeper review.

Likely related people:

steipete: Recent commit history for reply-run-registry, reply-delivery, agent-runner-execution, and abort includes multiple auto-reply delivery and runner changes by this account. (role: recent area contributor; confidence: high; commits: e71e585969d6, 25c0699fe909, 82e5dd4da74f; files: src/auto-reply/reply/reply-run-registry.ts, src/auto-reply/reply/reply-delivery.ts, src/auto-reply/reply/agent-runner-execution.ts)
jalehman: Recent reply/Telegram dispatch work included active abort ownership, queue, and delivery changes in the same interruptibility family. (role: recent adjacent contributor; confidence: medium; commits: 62b51a6295ee; files: src/auto-reply/reply/reply-run-registry.ts, src/auto-reply/reply/abort.ts, src/auto-reply/dispatch.ts)
guanbear: Recent Slack DM steering work touched active ReplyOperation routing/thread state, adjacent to the run registry surface extended by this PR. (role: recent adjacent contributor; confidence: medium; commits: f1cb9f2f6a75; files: src/auto-reply/reply/reply-run-registry.ts)

Codex review notes: model gpt-5.5, reasoning high; reviewed against fa614d0907e8.

clawsweeper · 2026-05-22T18:14:37Z

ClawSweeper PR egg

🎁 Pass real behavior proof to wake the egg and unlock a hatchable treat.

Where did the egg go?

The egg game starts only after the PR passes the real-behavior proof check.
Before that, no creature or rarity is rolled. The treat waits for real proof.
This is still just collectible flavor: proof affects review readiness, not creature quality.

openclaw-barnacle · 2026-06-07T05:04:16Z

This pull request has been automatically marked as stale due to inactivity.
Please add updates or it will be closed.

clawsweeper · 2026-06-07T05:12:17Z

ClawSweeper applied the proposed close for this PR.

Action: closed this PR.
Close reason: duplicate or superseded.
Evidence: durable ClawSweeper review.

darconadalabarga added 3 commits April 22, 2026 22:24

openclaw-barnacle Bot added the size: M label Apr 22, 2026

darconadalabarga mentioned this pull request Apr 22, 2026

Stronger run interruptibility: unified generation invalidation and stale-output fencing #70319

Closed

coolmanns mentioned this pull request May 3, 2026

[Bug]: Telegram context-overflow retry replays same inbound message and delivers stale turn #76424

Closed

This was referenced May 14, 2026

fix(reply): steer active embedded Pi runs between streams #75226

Closed

Design: tool abort signal handling and framework-level execution interruption #65223

Closed

openclaw-barnacle Bot added the triage: needs-real-behavior-proof Candidate: external PR needs after-fix proof from a real setup. label May 22, 2026

openclaw-barnacle Bot added the stale Marked as stale due to inactivity label Jun 7, 2026

clawsweeper Bot closed this Jun 7, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(auto-reply): run-generation fence for stronger interruptibility (refs #70319)#70363

feat(auto-reply): run-generation fence for stronger interruptibility (refs #70319)#70363
darconadalabarga wants to merge 3 commits into
openclaw:mainfrom
darconadalabarga:feature/run-interruptibility

darconadalabarga commented Apr 22, 2026 •

edited

Loading

Uh oh!

clawsweeper Bot commented Apr 29, 2026 •

edited

Loading

Uh oh!

clawsweeper Bot commented May 22, 2026

Uh oh!

openclaw-barnacle Bot commented Jun 7, 2026

Uh oh!

clawsweeper Bot commented Jun 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

darconadalabarga commented Apr 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

What's in this PR

Piece A — Run generation registry (new)

Piece A wiring — ReplyOperation carries the generation

Fast-abort integration

Piece C — Stale-output fence (partial, opt-in)

Live wiring

Piece D — SIGTERM→SIGKILL escalation

What's explicitly deferred

Tests

Constraints honored

Uh oh!

clawsweeper Bot commented Apr 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

clawsweeper Bot commented May 22, 2026

Uh oh!

openclaw-barnacle Bot commented Jun 7, 2026

Uh oh!

clawsweeper Bot commented Jun 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

darconadalabarga commented Apr 22, 2026 •

edited

Loading

Piece A wiring — `ReplyOperation` carries the generation

clawsweeper Bot commented Apr 29, 2026 •

edited

Loading