openclaw
diff --git a/‎CHANGELOG.md‎
Lines changed: 1 addition & 0 deletions b/‎CHANGELOG.md‎
Lines changed: 1 addition & 0 deletions
diff --git a/‎docs/.generated/config-baseline.sha256‎
Lines changed: 2 additions & 2 deletions b/‎docs/.generated/config-baseline.sha256‎
Lines changed: 2 additions & 2 deletions
diff --git a/‎docs/concepts/agent-loop.md‎
Lines changed: 1 addition & 1 deletion b/‎docs/concepts/agent-loop.md‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎docs/concepts/agent.md‎
Lines changed: 9 additions & 11 deletions b/‎docs/concepts/agent.md‎
Lines changed: 9 additions & 11 deletions
diff --git a/‎docs/concepts/messages.md‎
Lines changed: 6 additions & 6 deletions b/‎docs/concepts/messages.md‎
Lines changed: 6 additions & 6 deletions
diff --git a/‎docs/concepts/queue-steering.md‎
Lines changed: 37 additions & 41 deletions b/‎docs/concepts/queue-steering.md‎
Lines changed: 37 additions & 41 deletions
diff --git a/‎docs/concepts/queue.md‎
Lines changed: 14 additions & 18 deletions b/‎docs/concepts/queue.md‎
Lines changed: 14 additions & 18 deletions
diff --git a/‎docs/gateway/config-agents.md‎
Lines changed: 3 additions & 3 deletions b/‎docs/gateway/config-agents.md‎
Lines changed: 3 additions & 3 deletions
diff --git a/‎docs/gateway/configuration-examples.md‎
Lines changed: 8 additions & 8 deletions b/‎docs/gateway/configuration-examples.md‎
Lines changed: 8 additions & 8 deletions
@@ -81,6 +81,7 @@ Docs: https://docs.openclaw.ai
 - Telegram: support Mini App `web_app` buttons in generic message presentation payloads, allowing `openclaw message send --presentation` to render Telegram Web App inline buttons for private chats. (#81356) Thanks @jzakirov.
 - Scripts: add `OPENCLAW_HEAVY_CHECK_LOCK_SCOPE=worktree` so high-capacity local worktrees can use independent heavy-check locks while shared locks remain the default. Fixes #80729. (#80734) Thanks @samzong.
 - Agents/subagents: deliver native `sessions_spawn` tasks in the child session's first visible `[Subagent Task]` message instead of hiding the task in the sub-agent system prompt, keeping delegation auditable without duplicating tokens. Fixes #78592. Thanks @bradestes and @stainlu.
+- Messages/queue: make mid-turn prompts steer active runs by default via `/queue steer`, preserve `/queue followup` and `/queue collect` for users who want messages to queue by default, and make `/steer` continue as a normal prompt when steering is unavailable. (#77023) Thanks @fuller-stack-dev.
 - Voice Call/Telnyx: add realtime media-streaming call support for conversational voice calls. (#81024) Thanks @dynamite-bud.
 - Gateway/OpenAI HTTP: honor `max_completion_tokens` and `max_tokens` on inbound `/v1/chat/completions` requests so client-provided token caps reach the upstream provider via `streamParams.maxTokens`, with `max_completion_tokens` taking precedence when both are sent. Thanks @Lellansin.
 - Models/OpenAI CLI auth: make `openclaw models auth login --provider openai` start the ChatGPT/Codex account login by default, while `--method api-key` remains the explicit OpenAI API-key setup path.
 
@@ -1,4 +1,4 @@
-f95819d93e9bec5d059440ab54fb4ccb487425cb91d647c8688cd18ef1d4d848  config-baseline.json
-3325af3a6292959bb38166e9136c638dce5d2093d2339076742890848088a972  config-baseline.core.json
+bad30fbdd50ecdc6dd0e3dbbea0a1d7ed02a7e3e0cc30d7b1d4459832e4d1bd8  config-baseline.json
+932ca6c43b47dc342b6c9999815e5f03c5ff46f6372034a4eb507c629a4e49b1  config-baseline.core.json
 ad1d3cb596115d66c21e93de95e229c14c585f0dd4799b4ae3cc29b84761adc6  config-baseline.channel.json
 0dac8944a0d51ae96f97e3809907f8a04d08413434a1a1190240f7e13bb11c4d  config-baseline.plugin.json
@@ -46,7 +46,7 @@ wired end-to-end.
 
 - Runs are serialized per session key (session lane) and optionally through a global lane.
 - This prevents tool/session races and keeps session history consistent.
-- Messaging channels can choose queue modes (collect/steer/followup) that feed this lane system.
+- Messaging channels can choose queue modes (steer/followup/collect/interrupt) that feed this lane system.
   See [Command Queue](/concepts/queue).
 - Transcript writes are also protected by a session write lock on the session file. The lock is
   process-aware and file-based, so it catches writers that bypass the in-process queue or come from
 
@@ -84,17 +84,15 @@ Legacy session folders from other tools are not read.
 
 ## Steering while streaming
 
-When queue mode is `steer`, inbound messages are injected into the current run.
-Queued steering is delivered **after the current assistant turn finishes
-executing its tool calls**, before the next LLM call. Pi drains all pending
-steering messages together for `steer`; legacy `queue` drains one message per
-model boundary. Steering no longer skips remaining tool calls from the current
-assistant message.
-
-When queue mode is `followup` or `collect`, inbound messages are held until the
-current turn ends, then a new agent turn starts with the queued payloads. See
-[Queue](/concepts/queue) and [Steering queue](/concepts/queue-steering) for mode
-and boundary behavior.
+Inbound prompts that arrive mid-run are steered into the current run by default.
+Steering is delivered **after the current assistant turn finishes executing its
+tool calls**, before the next LLM call, and no longer skips remaining tool calls
+from the current assistant message.
+
+`/queue steer` is the default active-run behavior. `/queue followup` and
+`/queue collect` make messages wait for a later turn instead of steering.
+`/queue interrupt` aborts the active run instead. See [Queue](/concepts/queue)
+and [Steering queue](/concepts/queue-steering) for queue and boundary behavior.
 
 Block streaming sends completed assistant blocks as soon as they finish; it is
 **off by default** (`agents.defaults.blockStreamingDefault: "off"`).
 
@@ -125,14 +125,14 @@ default) and per-channel overrides like `channels.slack.historyLimit` or
 
 ## Queueing and followups
 
-If a run is already active, inbound messages can be queued, steered into the
-current run, or collected for a followup turn.
+If a run is already active, inbound messages are steered into the current run by
+default. `messages.queue` selects whether active-run messages steer, queue for
+later, collect into one later turn, or interrupt the active run.
 
 - Configure via `messages.queue` (and `messages.queue.byChannel`).
-- Default mode is `steer`, with a 500ms followup debounce when steering falls
-  back to queued followup delivery.
-- Modes: `steer`, `followup`, `collect`, `steer-backlog`, `interrupt`, and the
-  legacy one-at-a-time `queue` mode.
+- Default mode is `steer`, with a 500ms debounce for Codex steering batches and
+  followup/collect queues.
+- Modes: `steer`, `followup`, `collect`, and `interrupt`.
 
 Details: [Command queue](/concepts/queue) and [Steering queue](/concepts/queue-steering).
 
 
@@ -3,14 +3,15 @@ summary: "How active-run steering queues messages at runtime boundaries"
 read_when:
   - Explaining how steer behaves while an agent is using tools
   - Changing active-run queue behavior or runtime steering integration
-  - Comparing steer, queue, collect, and followup modes
+  - Comparing steering with followup, collect, and interrupt queue modes
 title: "Steering queue"
 ---
 
-When a message arrives while a session run is already streaming, OpenClaw can
-send that message into the active runtime instead of starting another run for
-the same session. The public modes are runtime-neutral; Pi and the native Codex
-app-server harness implement the delivery details differently.
+When a normal prompt arrives while a session run is already streaming, OpenClaw
+tries to send that prompt into the active runtime by default when the queue mode
+is `steer`. No config entry and no queue directive are required for that default
+behavior. Pi and the native Codex app-server harness implement the delivery
+details differently.
 
 ## Runtime boundary
 
@@ -27,44 +28,40 @@ This keeps tool results paired with the assistant message that requested them,
 then lets the next model call see the latest user input.
 
 The native Codex app-server harness exposes `turn/steer` instead of Pi's
-internal steering queue. OpenClaw adapts the same modes there:
-
-- `steer` batches queued messages for the configured quiet window, then sends a
-  single `turn/steer` request with all collected user input in arrival order.
-- `queue` keeps the legacy serialized shape by sending separate `turn/steer`
-  requests.
-- `followup`, `collect`, `steer-backlog`, and `interrupt` stay OpenClaw-owned
-  queue behavior around the active Codex turn.
+internal steering queue. OpenClaw batches queued prompts for the configured
+quiet window, then sends a single `turn/steer` request with all collected user
+input in arrival order.
 
 Codex review and manual compaction turns reject same-turn steering. When a
-runtime cannot accept steering, OpenClaw falls back to the followup queue where
-that mode allows it.
+runtime cannot accept steering in `steer` mode, OpenClaw waits for the active
+run to finish before starting the prompt.
 
-This page explains queue-mode steering for normal inbound messages. For the
-explicit `/steer <message>` command, see [Steer](/tools/steer).
+This page explains queue-mode steering for normal inbound messages when the mode
+is `steer`. If the mode is `followup` or `collect`, normal messages do not enter
+this steering path; they wait until the active run finishes. For the explicit
+`/steer <message>` command, see [Steer](/tools/steer).
 
 ## Modes
 
-| Mode            | Active-run behavior                                                                                                          | Later followup behavior                                                             |
-| --------------- | ---------------------------------------------------------------------------------------------------------------------------- | ----------------------------------------------------------------------------------- |
-| `steer`         | Injects all queued steering messages together at the next runtime boundary. This is the default.                             | Falls back to followup only when steering is unavailable.                           |
-| `queue`         | Legacy one-at-a-time steering. Pi injects one queued message per model boundary; Codex sends separate `turn/steer` requests. | Falls back to followup only when steering is unavailable.                           |
-| `steer-backlog` | Same active-run steering behavior as `steer`.                                                                                | Also keeps the same message for a later followup turn.                              |
-| `followup`      | Does not steer the current run.                                                                                              | Runs queued messages later.                                                         |
-| `collect`       | Does not steer the current run.                                                                                              | Coalesces compatible queued messages into one later turn after the debounce window. |
-| `interrupt`     | Aborts the active run, then starts the newest message.                                                                       | None.                                                                               |
+| Mode        | Active-run behavior                                    | Later behavior                                                                      |
+| ----------- | ------------------------------------------------------ | ----------------------------------------------------------------------------------- |
+| `steer`     | Steers the prompt into the active runtime when it can. | Waits for the active run to finish if steering is unavailable.                      |
+| `followup`  | Does not steer.                                        | Runs queued messages later after the active run ends.                               |
+| `collect`   | Does not steer.                                        | Coalesces compatible queued messages into one later turn after the debounce window. |
+| `interrupt` | Aborts the active run instead of steering it.          | Starts the newest message after aborting.                                           |
 
 ## Burst example
 
 If four users send messages while the agent is executing a tool call:
 
-- `steer`: the active runtime receives all four messages in arrival order before
-  its next model decision. Pi drains them at the next model boundary; Codex
-  receives them as one batched `turn/steer`.
-- `queue`: legacy serialized steering. Pi injects one queued message at a time;
-  Codex receives separate `turn/steer` requests.
-- `collect`: OpenClaw waits until the active run ends, then creates a followup
-  turn with compatible queued messages after the debounce window.
+- With default behavior, the active runtime receives all four messages in
+  arrival order before its next model decision. Pi drains them at the next model
+  boundary; Codex receives them as one batched `turn/steer`.
+- With `/queue collect`, OpenClaw does not steer. It waits until the active run
+  ends, then creates a followup turn with compatible queued messages after the
+  debounce window.
+- With `/queue interrupt`, OpenClaw aborts the active run and starts the newest
+  message instead of steering.
 
 ## Scope
 
@@ -73,18 +70,17 @@ session, change the active run's tool policy, or split messages by sender. In
 multi-user channels, inbound prompts already include sender and route context, so
 the next model call can see who sent each message.
 
-Use `collect` when you want OpenClaw to build a later followup turn that can
-coalesce compatible messages and preserve followup queue drop policy. Use
-`queue` only when you need the older one-at-a-time steering behavior.
+Use `followup` or `collect` when you want messages to queue by default instead
+of steering the active run. Use `interrupt` when the newest prompt should
+replace the active run.
 
 ## Debounce
 
-`messages.queue.debounceMs` applies to followup delivery, including `collect`,
-`followup`, `steer-backlog`, and `steer` fallback when active-run steering is not
-available. For Pi, active `steer` itself does not use the debounce timer because
-Pi naturally batches messages until the next model boundary. For the native
-Codex harness, OpenClaw uses the same debounce value as the quiet window before
-sending the batched `turn/steer`.
+`messages.queue.debounceMs` applies to queued `followup` and `collect` delivery.
+In `steer` mode with the native Codex harness, it also sets the quiet window
+before sending batched `turn/steer`. For Pi, active steering itself does not use
+the debounce timer because Pi naturally batches messages until the next model
+boundary.
 
 ## Related
 
 
@@ -30,25 +30,20 @@ When unset, all inbound channel surfaces use:
 - `cap: 20`
 - `drop: "summarize"`
 
-`steer` is the default because it keeps the active model turn responsive without
-starting a second session run. It drains all steering messages that arrived
-before the next model boundary. If the current run cannot accept steering,
-OpenClaw falls back to a followup queue entry.
+Same-turn steering is the default. A prompt that arrives mid-run is injected
+into the active runtime when the run can accept steering, so no second session
+run is started. If the active run cannot accept steering, OpenClaw waits for the
+active run to finish before starting the prompt.
 
 ## Queue modes
 
-Inbound messages can steer the current run, wait for a followup turn, or do both:
+`/queue` controls what normal inbound messages do while a session already has
+an active run:
 
-- `steer`: queue steering messages into the active runtime. Pi delivers all pending steering messages **after the current assistant turn finishes executing its tool calls**, before the next LLM call; Codex app-server receives one batched `turn/steer`. If the run is not actively streaming or steering is unavailable, OpenClaw falls back to a followup queue entry.
-- `queue` (legacy): old one-at-a-time steering. Pi delivers one queued steering message at each model boundary; Codex app-server receives separate `turn/steer` requests. Prefer `steer` unless you need the previous serialized behavior.
-- `followup`: enqueue each message for a later agent turn after the current run ends.
-- `collect`: coalesce queued messages into a **single** followup turn after the quiet window. If messages target different channels/threads, they drain individually to preserve routing.
-- `steer-backlog` (aka `steer+backlog`): steer now **and** preserve the same message for a followup turn.
-- `interrupt` (legacy): abort the active run for that session, then run the newest message.
-
-Steer-backlog means you can get a followup response after the steered run, so
-streaming surfaces can look like duplicates. Prefer `collect`/`steer` if you want
-one response per inbound message.
+- `steer`: inject messages into the active runtime. Pi delivers all pending steering messages **after the current assistant turn finishes executing its tool calls**, before the next LLM call; Codex app-server receives one batched `turn/steer`. If the run is not actively streaming or steering is unavailable, OpenClaw waits until the active run ends before starting the prompt.
+- `followup`: do not steer. Enqueue each message for a later agent turn after the current run ends.
+- `collect`: do not steer. Coalesce queued messages into a **single** followup turn after the quiet window. If messages target different channels/threads, they drain individually to preserve routing.
+- `interrupt`: abort the active run for that session, then run the newest message.
 
 For runtime-specific timing and dependency behavior, see
 [Steering queue](/concepts/queue-steering). For the explicit `/steer <message>`
@@ -72,9 +67,10 @@ Configure globally or per channel via `messages.queue`:
 
 ## Queue options
 
-Options apply to `followup`, `collect`, and `steer-backlog` (and to `steer` or legacy `queue` when steering falls back to followup):
+Options apply to queued delivery. `debounceMs` also sets the Codex steering
+quiet window in `steer` mode:
 
-- `debounceMs`: quiet window before draining queued followups. Bare numbers are milliseconds; units `ms`, `s`, `m`, `h`, and `d` are accepted by `/queue` options.
+- `debounceMs`: quiet window before draining queued followups or collect batches; in Codex `steer` mode, quiet window before sending batched `turn/steer`. Bare numbers are milliseconds; units `ms`, `s`, `m`, `h`, and `d` are accepted by `/queue` options.
 - `cap`: max queued messages per session. Values below `1` are ignored.
 - `drop: "summarize"`: default. Drop the oldest queued entries as needed, keep compact summaries, and inject them as a synthetic followup prompt.
 - `drop: "old"`: drop the oldest queued entries as needed, without preserving summaries.
@@ -99,7 +95,7 @@ keys.
 
 ## Per-session overrides
 
-- Send `/queue <mode>` as a standalone command to store the mode for the current session.
+- Send `/queue <steer|followup|collect|interrupt>` as a standalone command to store the queue mode for the current session.
 - Options can be combined: `/queue collect debounce:0.5s cap:25 drop:summarize`
 - `/queue default` or `/queue reset` clears the session override.
 
 
@@ -1280,13 +1280,13 @@ See [Multi-Agent Sandbox & Tools](/tools/multi-agent-sandbox-tools) for preceden
     ackReactionScope: "group-mentions", // group-mentions | group-all | direct | all
     removeAckAfterReply: false,
     queue: {
-      mode: "steer", // steer | queue (legacy one-at-a-time) | followup | collect | steer-backlog | steer+backlog | interrupt
+      mode: "followup", // steer | followup | collect | interrupt
       debounceMs: 500,
       cap: 20,
       drop: "summarize", // old | new | summarize
       byChannel: {
-        whatsapp: "steer",
-        telegram: "steer",
+        whatsapp: "followup",
+        telegram: "followup",
       },
     },
     inbound: {
 
@@ -113,18 +113,18 @@ Save to `~/.openclaw/openclaw.json` and you can DM the bot from that number.
       visibleReplies: "message_tool", // normal final replies stay private in groups/channels
     },
     queue: {
-      mode: "steer",
+      mode: "followup",
       debounceMs: 500,
       cap: 20,
       drop: "summarize",
       byChannel: {
-        whatsapp: "steer",
-        telegram: "steer",
-        discord: "steer",
-        slack: "steer",
-        signal: "steer",
-        imessage: "steer",
-        webchat: "steer",
+        whatsapp: "followup",
+        telegram: "followup",
+        discord: "collect",
+        slack: "collect",
+        signal: "followup",
+        imessage: "followup",
+        webchat: "followup",
       },
     },
   },