openclaw
diff --git a/docs/.generated/config-baseline.sha256 b/docs/.generated/config-baseline.sha256
Lines changed: 2 additions & 2 deletions
diff --git a/docs/gateway/config-agents.md b/docs/gateway/config-agents.md
Lines changed: 2 additions & 0 deletions
diff --git a/docs/plugins/codex-harness-reference.md b/docs/plugins/codex-harness-reference.md
Lines changed: 20 additions & 0 deletions
diff --git a/docs/plugins/codex-harness-runtime.md b/docs/plugins/codex-harness-runtime.md
Lines changed: 59 additions & 0 deletions
@@ -1,4 +1,4 @@
-53b7621e99d75b98ecc8f4389d38900f84cf213f95dbcc877f36125d763c660d  config-baseline.json
-e92bbf45714e418383118098d4ff15d347fa8ffc7e7837b437b522d2b59ce9fe  config-baseline.core.json
+1bc41871069122543c0820a6beebcc085003750a7fffce27494573712b905d96  config-baseline.json
+35e51bc55d3169981d09b6a660e385e6048fceecd6786ff058dcf497430a9681  config-baseline.core.json
 b901fb766edfd9df630690281476fc4032c64772f69d1d8f7b2e0e913a90f229  config-baseline.channel.json
 5c214ab364011fd95735755f9fa4298aa4de8ad81144ae8dd08d969bb7ba318b  config-baseline.plugin.json
@@ -658,6 +658,7 @@ Periodic heartbeat runs.
         model: "openrouter/anthropic/claude-sonnet-4-6", // optional compaction-only model override
         truncateAfterCompaction: true, // rotate to a smaller successor JSONL after compaction
         maxActiveTranscriptBytes: "20mb", // optional preflight local compaction trigger
+        maxActiveTranscriptTokens: "120k", // optional Codex native thread token reuse guard
         notifyUser: true, // send brief notices when compaction starts and completes (default: false)
         memoryFlush: {
           enabled: true,
@@ -683,6 +684,7 @@ Periodic heartbeat runs.
 - `postCompactionSections`: optional AGENTS.md H2/H3 section names to re-inject after compaction. Reinjection is disabled when unset or set to `[]`. Explicitly setting `["Session Startup", "Red Lines"]` enables that pair and preserves the legacy `Every Session`/`Safety` fallback. Enable this only when the extra context is worth the risk of duplicating project guidance already captured in the compaction summary.
 - `model`: optional `provider/model-id` override for compaction summarization only. Use this when the main session should keep one model but compaction summaries should run on another; when unset, compaction uses the session's primary model.
 - `maxActiveTranscriptBytes`: optional byte threshold (`number` or strings like `"20mb"`) that triggers normal local compaction before a run when the active JSONL grows past the threshold. Requires `truncateAfterCompaction` so successful compaction can rotate to a smaller successor transcript. Disabled when unset or `0`.
+- `maxActiveTranscriptTokens`: optional Codex app-server native-thread token reuse guard (`number` or strings like `"120k"`). When unset, OpenClaw uses Codex's reported model context window, falling back to a 300000-token recovery fuse when Codex has not reported one. Set a positive value to override that default, or `0` to disable only this token guard while preserving byte limits and semantic binding invalidation.
 - `notifyUser`: when `true`, sends brief notices to the user when compaction starts and when it completes (for example, "Compacting context..." and "Compaction complete"). Disabled by default to keep compaction silent.
 - `memoryFlush`: silent agentic turn before auto-compaction to store durable memories. Set `model` to an exact provider/model such as `ollama/qwen3:8b` when this housekeeping turn should stay on a local model; the override does not inherit the active session fallback chain. Skipped when workspace is read-only.
 
 
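The resolution rules described for `maxActiveTranscriptTokens` can be sketched as follows. This is a minimal illustration, not OpenClaw's implementation: `parseTokenCount`, `resolveTokenGuard`, and `FALLBACK_TOKEN_FUSE` are hypothetical names, and reading `k` as 1,000 (and `m` as 1,000,000) is an assumption. Only the config key, the `"120k"` string form, and the 300000-token fallback come from the docs above.

```typescript
// Hypothetical constant name; the 300000 value is taken from the docs.
const FALLBACK_TOKEN_FUSE = 300_000;

type TokenGuardSetting = number | string | undefined;

// Assumption: "k" scales by 1,000 and "m" by 1,000,000.
function parseTokenCount(value: number | string): number {
  if (typeof value === "number") return value;
  const match = /^(\d+(?:\.\d+)?)\s*([km]?)$/i.exec(value.trim());
  if (!match) throw new Error(`Unrecognized token count: ${value}`);
  const suffix = (match[2] ?? "").toLowerCase();
  const scale = suffix === "k" ? 1_000 : suffix === "m" ? 1_000_000 : 1;
  return Math.round(Number(match[1] ?? "0") * scale);
}

// Returns the effective token threshold, or null when the token guard is disabled.
function resolveTokenGuard(
  setting: TokenGuardSetting,
  reportedContextWindow?: number,
): number | null {
  // Unset: use Codex's reported window, else the fallback recovery fuse.
  if (setting === undefined) return reportedContextWindow ?? FALLBACK_TOKEN_FUSE;
  const tokens = parseTokenCount(setting);
  // A positive value overrides the default; 0 disables only the token guard.
  return tokens > 0 ? tokens : null;
}
```

Under these assumptions, `resolveTokenGuard("120k")` yields `120000`, `resolveTokenGuard(undefined)` yields `300000`, and `resolveTokenGuard(0)` yields `null`, leaving byte limits and semantic binding checks to a separate code path.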
@@ -423,6 +423,26 @@ injected; heartbeat turns get a collaboration-mode pointer to read the file when
 it exists and is non-empty. `BOOTSTRAP.md` and `MEMORY.md` when present are
 forwarded as OpenClaw turn input reference context.
 
+These bootstrap bytes are not the same thing as the native reuse guard. The
+active-token guard is a warm-thread reuse threshold over the persisted/native
+Codex transcript and mirrored session totals; it is not simply "bootstrap size".
+When unset, OpenClaw uses Codex's reported model context window, falling back
+to a 300000-token recovery fuse when Codex has not reported one. Operators can
+set `maxActiveTranscriptTokens` to a lower threshold when legacy native threads
+should rotate earlier.
+
+The token-efficient context-engine path is `thread_bootstrap`. When the saved
+Codex binding still matches the active context-engine id, policy fingerprint,
+projection epoch/fingerprint, and dynamic-tool surface, OpenClaw treats the
+large bootstrap/projection payload as already present in the native thread and
+logs `thread-bootstrap-semantic-reuse` instead of reprojecting it every turn.
+Successful context-engine-owned compaction preserves that binding when the
+projection mode remains `thread_bootstrap`: same-file compaction leaves it in
+place, and successor-transcript rollover copies it to the successor before
+clearing the archived original. If those identities change, OpenClaw starts a
+fresh Codex thread and reprojects context once for the new epoch. This keeps
+long-running agents fast without pretending a stale bootstrap or stale
+context-engine projection is still valid.
+
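The semantic-reuse check described above can be pictured as a field-by-field comparison between the saved binding and the active projection. The sketch below is an assumption-laden illustration: the `BindingIdentity` shape, its field names, and the `firstMismatch` and `planThread` helpers are hypothetical and do not reflect OpenClaw's real types.

```typescript
// Hypothetical shape; field names are illustrative only.
interface BindingIdentity {
  contextEngineId: string;
  policyFingerprint: string;
  projectionEpoch: number;
  projectionFingerprint: string;
  dynamicToolSurface: string; // assumed to be a stable hash of the tool surface
}

const IDENTITY_FIELDS: (keyof BindingIdentity)[] = [
  "contextEngineId",
  "policyFingerprint",
  "projectionEpoch",
  "projectionFingerprint",
  "dynamicToolSurface",
];

// Returns the first identity field that differs, or null when all match.
function firstMismatch(
  saved: BindingIdentity,
  active: BindingIdentity,
): keyof BindingIdentity | null {
  for (const field of IDENTITY_FIELDS) {
    if (saved[field] !== active[field]) return field;
  }
  return null;
}

// "reuse" keeps the warm native thread; "fresh" starts a new thread and
// reprojects context once for the new epoch.
function planThread(
  saved: BindingIdentity | undefined,
  active: BindingIdentity,
): "reuse" | "fresh" {
  if (!saved) return "fresh";
  return firstMismatch(saved, active) === null ? "reuse" : "fresh";
}
```

Comparing every identity field, rather than only the projection fingerprint, mirrors the rule that any change in engine id, policy, epoch, or tool surface invalidates the warm thread.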
 ## Environment overrides
 
 Environment overrides remain available for local testing:
 
@@ -218,6 +218,12 @@ OpenClaw returns after starting that native operation. It does not wait for
 completion, impose a separate OpenClaw timeout, restart the shared Codex
 app-server, or record the operation as an OpenClaw-completed compaction.
 
+If context-engine overflow recovery rotates a transcript before a turn starts,
+OpenClaw preserves compatible `thread_bootstrap` bindings when their semantic
+identity still matches the projected context. Legacy, ownerless, and
+non-bootstrap bindings are abandoned so the next turn can rehydrate a fresh
+thread from engine-managed context.
+
 When a context engine requests Codex thread-bootstrap projection, OpenClaw
 projects tool-call names and ids, input shapes, and redacted tool-result content
 into the fresh Codex thread. It does not copy raw tool-call argument values into
@@ -233,6 +239,59 @@ Because Codex owns the canonical native thread, `tool_result_persist` does not
 currently rewrite Codex-native tool result records. It only applies when
 OpenClaw is writing an OpenClaw-owned session transcript tool result.
 
+## Troubleshooting token pressure
+
+Codex harness latency can come from three different pressure points. They look
+similar in logs, but they have different fixes.
+
+| Pressure point                    | Owner            | What it means                                                                                                                                                            |
+| --------------------------------- | ---------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------ |
+| Model or app-server context limit | Codex app-server | Codex accepted a native thread or turn and then rejected, compacted, or failed because the real model request could not fit.                                             |
+| OpenClaw assembly precheck        | OpenClaw         | OpenClaw's rendered turn prompt, developer instructions, context-engine projection, media, and reserves are too large before the turn is submitted to app-server.        |
+| Native-thread reuse rotation      | OpenClaw + Codex | OpenClaw has a saved Codex thread binding, but the persisted/native transcript is over the configured warm-thread reuse guard or the binding identity no longer matches. |
+
+The native-thread reuse guard is not the model context window. It is a
+proactive threshold for deciding whether an existing native Codex thread is
+still a good warm resume candidate. When `maxActiveTranscriptTokens` is unset,
+OpenClaw uses Codex's reported model context window, or a 300000-token fallback
+recovery fuse when Codex has not reported one. Setting
+`maxActiveTranscriptTokens` to `120k` preserves an 86000-token native rollout
+even on models with smaller reported windows, while setting it to `50k` rotates
+a 60000-token binding. Setting the token guard to `0` disables only proactive
+token rotation; byte guards and semantic binding checks can still rotate.
+
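The two numeric examples above reduce to a single comparison. The sketch below assumes the guard trips only when the active token count is strictly greater than the threshold (the docs say "over" the guard, so the boundary behaviour is an assumption), and `exceedsTokenGuard` is a hypothetical helper name.

```typescript
// Hypothetical helper: true when the token guard alone would rotate the thread.
// A guard of 0 means proactive token rotation is disabled.
function exceedsTokenGuard(activeTokens: number, guardTokens: number): boolean {
  if (guardTokens <= 0) return false; // disabled: byte and semantic checks still apply elsewhere
  return activeTokens > guardTokens; // assumption: strictly greater than the threshold
}

// 86000-token rollout under a 120k guard: kept as a warm resume candidate.
const keepsWarmThread = !exceedsTokenGuard(86_000, 120_000);

// 60000-token binding under a 50k guard: rotated to a fresh native thread.
const rotatesBinding = exceedsTokenGuard(60_000, 50_000);

// Guard set to 0: never rotates on tokens, however large the transcript.
const disabledGuardRotates = exceedsTokenGuard(500_000, 0);
```

Here `keepsWarmThread` and `rotatesBinding` are both `true`, and `disabledGuardRotates` is `false`. A `false` result does not mean the thread is reused, because byte guards and semantic binding checks can still rotate it.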
+For context-engine `thread_bootstrap`, the efficient path is a matching
+context-engine id, policy fingerprint, projection epoch, projection
+fingerprint, and dynamic-tool surface. In that case OpenClaw logs
+`thread-bootstrap-semantic-reuse` and skips the proactive token and byte guards,
+because the large projection was already bootstrapped into that Codex thread.
+Successful context-engine-owned compaction preserves that binding when the
+projection remains `thread_bootstrap`. If compaction keeps the same session
+file, the binding stays in place. If compaction rolls over to a successor
+session file, OpenClaw copies the binding to the successor and clears the
+archived original so the next turn can resume the warm Codex thread. Legacy,
+ownerless, or non-bootstrap bindings are still invalidated by compaction.
+When any semantic identity changes, OpenClaw starts a fresh native thread and
+emits `codex.native_thread.lifecycle` with a rotation reason such as
+`projection-mismatch`, `context-engine-binding-mismatch`,
+`dynamic-tools-mismatch`, `mcp-config-mismatch`,
+`environment-selection-mismatch`, `native-tool-surface-disabled`,
+`plugin-app-config-mismatch`, `auth-profile-mismatch`, `missing-thread-binding`,
+`app-server-rejected-thread`, `native-token-guard`, or `native-byte-guard`.
+Compaction itself emits either `context-engine-compaction-preserved-binding` or
+`context-engine-compaction-invalidated-binding` so the outcome is visible even
+when no fresh thread is started immediately.
+
+The trusted lifecycle diagnostic includes counts, hashed comparison
+fingerprints, and basename session-file labels for rollover, but not raw prompt
+text, bootstrap file contents, tool arguments, local absolute paths, or secrets.
+Its companion log entry is intentionally lower-cardinality and omits scoped
+thread/session ids and fingerprints. Use the trusted event to answer whether a
+slow Codex session is rebuilding context because the warm native thread
+exceeded a guard, because OpenClaw assembled too much prompt/context, or
+because Codex itself rejected the native thread or turn.
+
 ## Media and delivery
 
 OpenClaw continues to own media delivery and media provider selection. Image,