feat: usable mid-turn steer — desktop affordance + trusted injection by OutThisLife · Pull Request #40240 · NousResearch/hermes-agent

OutThisLife · 2026-06-06T01:50:44Z

Makes /steer (nudge the agent mid-turn without interrupting) actually usable: a first-class desktop affordance and a backend fix so the model stops treating steers as prompt injection.

Why

Desktop could only queue while busy. /steer was in the palette but had no UI, so the mid-turn lane was unreachable in the GUI.
Backend: a steer is appended inside a tool result (the only role-alternation-safe slot mid-turn), labelled User guidance:. Well-behaved models read that as untrusted tool content and refuse it as prompt injection. Observed live:

"I only follow instructions from you directly, not ones injected through command results … I treated it as an injection and skipped it."

Steer vs queue (why both exist)

queue → new user message at the next turn boundary (agent finishes first).
steer → folded into the next tool result mid-turn; the model sees it on its next iteration. No interrupt.
If steer is rejected (no live tool window), the text is re-queued so nothing is lost.

Changes

Desktop (composer)

While busy with a text-only draft: a steering-wheel button steers the draft via session.steer.
Cmd/Ctrl+Enter is reserved for steer — steers when there's a steerable draft, no-op otherwise (never a surprise send). Plain Enter keeps its role: queue while busy, send when idle.

idle busy

Enter send queue

Cmd/Ctrl+Enter — steer
Gated to busy && text-only && non-empty && not a slash command.
Reject/RPC-error → re-queue (safety net).
Ack is an inline transcript note rendered as a codicon row (compass · steered · …), not a toast.
Plumbing: onSteer through controller → ChatView → ChatBar; SessionSteerResponse; SteeringWheel icon; steer i18n (en/zh/types).

Backend (the real fix — makes steer trusted)

prompt_builder.format_steer_marker() wraps steers in a bounded, self-describing [OUT-OF-BAND USER MESSAGE] marker; shared by both drain sites (agent_runtime_helpers, conversation_loop).
STEER_CHANNEL_NOTE added to the core system prompt: the model is told to expect and trust this exact marker as a genuine user message, while still ignoring lookalikes in tool/web/file output.
Static text → byte-stable prompt (no prompt-cache regression); gated on the agent having tools.

Security note

The marker is intentionally static, not a per-session nonce, to honor the byte-stable system-prompt caching policy (the codebase even uses day-precision dates to preserve the prefix cache). Trade-off: an attacker who fully controls tool output could forge the marker. A per-session nonce pinned in the prompt would close this; deferred as follow-up since it fights the caching policy and deserves its own discussion.

Verified

Live test (tool-heavy turn, steer sent mid-run): the model followed the steer (dated the remaining steps) with no injection refusal, and the inline codicon note rendered. See thread.

Test plan

Backend: tests/run_agent/test_steer.py (23 passed) — new marker on both drain paths, note↔marker contract, User guidance: regression guard.
Desktop: use-prompt-actions.test.tsx (11 passed) — steer injects trimmed text via session.steer, never prompt.submit; reject/error/empty paths.
tsc --noEmit clean; no import cycles.
Manual: steer mid-turn is followed (no injection refusal); inline note shows; plain Enter still queues; Cmd/Ctrl+Enter never surprise-sends.

The desktop app could only queue while busy — `/steer` was in the palette but had no first-class affordance, so the "nudge the agent mid-turn without interrupting" lane was effectively unreachable. Add a steer action to the composer: while busy with a text-only draft, a steering-wheel button (and Cmd/Ctrl+Enter) injects the text into the live turn via the `session.steer` RPC — the gateway folds it into the next tool result so the model reads it on its next iteration. Plain Enter still queues. steerPrompt returns false when the gateway has no live tool window (or the RPC errors), and the composer re-queues the words so nothing is lost — the same safety net as a plain queue.

github-actions · 2026-06-06T01:51:33Z

🔎 Lint report: `bb/desktop-steer` vs `origin/main`

ruff

Total: 0 on HEAD, 0 on base (➖ 0)

🆕 New issues: none

✅ Fixed issues: none

Unchanged: 0 pre-existing issues carried over.

ty (type checker)

Total: 9870 on HEAD, 9870 on base (➖ 0)

🆕 New issues: none

✅ Fixed issues: none

Unchanged: 5119 pre-existing issues carried over.

Diagnostics are surfaced as warnings — this check never fails the build.

A steer rides inside a tool result (the only role-alternation-safe slot mid-turn), so a bare "User guidance:" line reads as untrusted tool content — well-behaved models refuse it as suspected prompt injection (observed live: "I only follow instructions from you directly, not ones injected through command results"). - Wrap steers in a bounded, self-describing [OUT-OF-BAND USER MESSAGE] marker (prompt_builder.format_steer_marker), shared by both drain sites. - Add STEER_CHANNEL_NOTE to the core system prompt so the model expects this exact marker and trusts it as a genuine user message — while still ignoring lookalikes buried in tool/web/file output. Static text → byte-stable prompt, no prompt-cache regression; gated on the agent having tools. - Desktop: steer ack is now an inline transcript note (⏩ steered · …) instead of a toast. Marker is intentionally static (not a per-session nonce) to honor the byte-stable system-prompt caching policy; nonce hardening noted as follow-up.

Cmd/Ctrl+Enter now steers when there's a steerable draft and is a no-op otherwise — it never falls through to a send, so the shortcut can't surprise-send. Plain Enter keeps its role (queue while busy, send when idle).

The inline steer note used a ⏩ emoji. Emit a structured `steer:<text>` system note and render it in SystemMessage as a codicon (compass) row — same style as slash-status output. No emoji in the transcript.

Compute the trimmed draft once and reuse for hasComposerPayload + canSteer instead of trimming three times per render.

OutThisLife changed the title ~~feat(desktop): steer the live run from the composer~~ feat: usable mid-turn steer — desktop composer action + trusted injection Jun 6, 2026

OutThisLife added 3 commits June 5, 2026 21:01

feat(desktop): reserve Cmd/Ctrl+Enter strictly for steer

efa53fb

Cmd/Ctrl+Enter now steers when there's a steerable draft and is a no-op otherwise — it never falls through to a send, so the shortcut can't surprise-send. Plain Enter keeps its role (queue while busy, send when idle).

fix(desktop): render steer note as a codicon, not an emoji

7cceead

The inline steer note used a ⏩ emoji. Emit a structured `steer:<text>` system note and render it in SystemMessage as a codicon (compass) row — same style as slash-status output. No emoji in the transcript.

refactor(desktop): hoist single draft.trim() in composer

5d4c93a

Compute the trimmed draft once and reuse for hasComposerPayload + canSteer instead of trimming three times per render.

OutThisLife changed the title ~~feat: usable mid-turn steer — desktop composer action + trusted injection~~ feat: usable mid-turn steer — desktop affordance + trusted injection Jun 6, 2026

OutThisLife enabled auto-merge June 6, 2026 02:06

OutThisLife merged commit 1506874 into main Jun 6, 2026
22 checks passed

OutThisLife deleted the bb/desktop-steer branch June 6, 2026 02:10

Haderach-Ram mentioned this pull request Jun 6, 2026

Ecosystem Digest — 2026-06-06 Haderach-Ram/openclaw-radar#30

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: usable mid-turn steer — desktop affordance + trusted injection#40240

feat: usable mid-turn steer — desktop affordance + trusted injection#40240
OutThisLife merged 5 commits into
mainfrom
bb/desktop-steer

OutThisLife commented Jun 6, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented Jun 6, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

OutThisLife commented Jun 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Why

Steer vs queue (why both exist)

Changes

Desktop (composer)

Backend (the real fix — makes steer trusted)

Security note

Verified

Test plan

Uh oh!

github-actions Bot commented Jun 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔎 Lint report: bb/desktop-steer vs origin/main

ruff

ty (type checker)

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

OutThisLife commented Jun 6, 2026 •

edited

Loading

github-actions Bot commented Jun 6, 2026 •

edited

Loading

🔎 Lint report: `bb/desktop-steer` vs `origin/main`