Skip to content

fix(search): enforce native web search tool policy#91750

Merged
eleqtrizit merged 4 commits into
openclaw:mainfrom
eleqtrizit:739
Jun 9, 2026
Merged

fix(search): enforce native web search tool policy#91750
eleqtrizit merged 4 commits into
openclaw:mainfrom
eleqtrizit:739

Conversation

@eleqtrizit

@eleqtrizit eleqtrizit commented Jun 9, 2026

Copy link
Copy Markdown
Contributor

Summary

Enforce OpenClaw tool policy before enabling provider-native Codex/OpenAI web_search.

Changes

  • Gate native web-search activation with global, provider, agent, profile, group, sender, sandbox, subagent, and inherited session tool policy layers.
  • Thread native web-search policy context through embedded attempts, compaction stream rebuilds, provider stream hooks, and the OpenAI native-search wrapper.
  • Keep managed web_search visible when native search is unavailable or blocked by policy.
  • Update the Plugin SDK API baseline hash for the intentional optional provider context addition.
  • Add regression coverage for agent/provider/sender/sandbox/inherited denies, managed-tool suppression parity, provider wrapper context propagation, and payload injection suppression.

Validation

  • node scripts/run-vitest.mjs src/agents/codex-native-web-search.test.ts src/llm/providers/stream-wrappers/openai.test.ts src/agents/embedded-agent-runner-extraparams.test.ts
  • corepack pnpm plugin-sdk:api:check
  • corepack pnpm tsgo:core
  • node scripts/check-src-extension-import-boundary.mjs --json
  • node scripts/check-sdk-package-extension-import-boundary.mjs --json
  • node scripts/check-test-helper-extension-import-boundary.mjs --json
  • corepack pnpm check:import-cycles
  • git diff --check
  • corepack pnpm build

Notes

  • This is hardening/policy parity; no secret exfiltration or external mutation was demonstrated in the report.

@openclaw-barnacle openclaw-barnacle Bot added agents Agent runtime and tooling extensions: openai size: M maintainer Maintainer-authored PR labels Jun 9, 2026
@clawsweeper

clawsweeper Bot commented Jun 9, 2026

Copy link
Copy Markdown
Contributor

Codex review: needs maintainer review before merge. Reviewed June 9, 2026, 7:12 PM ET / 23:12 UTC.

Summary
The PR gates native OpenAI/Codex web search behind OpenClaw tool-policy decisions across agent runs, compaction, provider stream wrapping, SDK context, and regression tests.

PR surface: Source +293, Tests +331, Generated 0. Total +624 across 16 files.

Reproducibility: yes. from source: current main gates native web-search activation on config/model/auth but not the layered tool-policy inputs that already govern managed web_search. I did not run tests because this was a read-only review.

Review metrics: 1 noteworthy metric.

  • Provider hook context: 2 optional fields added. The provider SDK context now exposes agentId and the computed native-search policy decision, so maintainers should notice the public plugin contract change before merge.

Merge readiness
Overall: 🐚 platinum hermit
Proof: 🐚 platinum hermit
Patch quality: 🐚 platinum hermit
Result: ready for maintainer review.

Overall follows the weaker of proof and patch quality, so missing proof can cap an otherwise strong patch.

Rank-up moves:

  • none.

Risk before merge

  • [P1] Merging adds two optional fields to provider hook context, so third-party provider plugins may observe or rely on the new native-search policy decision contract.
  • [P1] Native provider web search is a security boundary; a missed runtime caller could still inject hosted web_search despite tool policy, so maintainer review should verify all entry points stay covered.

Maintainer options:

  1. Land with the boolean-only seam (recommended)
    Maintainers can accept the additive provider context after confirming it stays limited to agentId plus the computed native-search policy decision and the posted proof matches the final head.
  2. Require broader upgrade proof
    Ask for an additional plugin SDK/API baseline check or provider-plugin smoke if maintainers want more confidence that external provider hooks tolerate the optional fields.
  3. Pause if the SDK context should not grow
    If maintainers do not want provider hooks to observe any native-search policy decision, pause this PR and rework the enforcement behind a purely core-owned wrapper path.

Next step before merge

  • No automated repair is needed; the remaining action is maintainer handling for a protected, security-sensitive provider/plugin SDK boundary change.

Security
Cleared: No concrete security or supply-chain regression was found in the final diff; the earlier raw identity exposure was removed and the remaining security sensitivity is tracked as merge risk.

Review details

Best possible solution:

Keep the redacted boolean decision seam and land only after maintainer review confirms the plugin SDK baseline, CI, and native-search allow/deny proof for the security-sensitive provider path.

Do we have a high-confidence way to reproduce the issue?

Yes from source: current main gates native web-search activation on config/model/auth but not the layered tool-policy inputs that already govern managed web_search. I did not run tests because this was a read-only review.

Is this the best way to solve the issue?

Yes, with maintainer approval: the final patch computes the full policy in core and passes a redacted boolean decision to provider hooks instead of raw sender/group identity. The additive provider SDK context still needs explicit maintainer acceptance before merge.

AGENTS.md: found and applied where relevant.

Codex review notes: model gpt-5.5, reasoning high; reviewed against 3a9ea1d85bef.

Label changes

Label changes:

  • add proof: sufficient: Contributor real behavior proof is sufficient. The contributor posted redacted live-output for commit 89d23b3 showing denied provider/Codex paths omit native web_search, allowed paths inject it, and provider hooks do not receive raw policy context.
  • add rating: 🐚 platinum hermit: Overall readiness is 🐚 platinum hermit; proof is 🐚 platinum hermit and patch quality is 🐚 platinum hermit.
  • add status: 👀 ready for maintainer look: ClawSweeper has no concrete contributor-facing blocker left for this PR. Sufficient (live_output): The contributor posted redacted live-output for commit 89d23b3 showing denied provider/Codex paths omit native web_search, allowed paths inject it, and provider hooks do not receive raw policy context.
  • remove rating: 🧂 unranked krab: Current PR rating is rating: 🐚 platinum hermit, so this older rating label is no longer current.
  • remove status: 📣 needs proof: Current PR status label is status: 👀 ready for maintainer look.

Label justifications:

  • P1: The PR changes security-sensitive native web-search policy enforcement where an incorrect merge could expose provider-hosted search despite OpenClaw tool policy.
  • merge-risk: 🚨 compatibility: The diff expands provider hook context and refreshes the Plugin SDK API baseline, which can affect third-party provider plugin expectations.
  • merge-risk: 🚨 security-boundary: The diff moves tool-policy decisions into native provider web-search injection paths, which is an authorization/security boundary for external web access.
  • rating: 🐚 platinum hermit: Overall readiness is 🐚 platinum hermit; proof is 🐚 platinum hermit and patch quality is 🐚 platinum hermit.
  • status: 👀 ready for maintainer look: ClawSweeper has no concrete contributor-facing blocker left for this PR. Sufficient (live_output): The contributor posted redacted live-output for commit 89d23b3 showing denied provider/Codex paths omit native web_search, allowed paths inject it, and provider hooks do not receive raw policy context.
  • proof: sufficient: Contributor real behavior proof is sufficient. The contributor posted redacted live-output for commit 89d23b3 showing denied provider/Codex paths omit native web_search, allowed paths inject it, and provider hooks do not receive raw policy context.
Evidence reviewed

PR surface:

Source +293, Tests +331, Generated 0. Total +624 across 16 files.

View PR surface stats
Area Files Added Removed Net
Source 10 304 11 +293
Tests 5 332 1 +331
Docs 0 0 0 0
Config 0 0 0 0
Generated 1 2 2 0
Other 0 0 0 0
Total 16 638 14 +624

What I checked:

  • Repository policy read: Root and scoped AGENTS.md files for agents, embedded runner, plugin SDK, plugins, extensions, and docs were read; they made provider/plugin SDK and security-boundary changes compatibility-sensitive review surfaces. (AGENTS.md:1, 3a9ea1d85bef)
  • Current-main behavior: Current main resolves native Codex search activation from global config, model API/provider, and auth only; it does not include agent/group/sender/sandbox/session tool policy in the native activation decision. (src/agents/codex-native-web-search-core.ts:95, 3a9ea1d85bef)
  • Current wrapper behavior: Current main's Codex native-search stream wrapper calls resolveCodexNativeSearchActivation with config, model provider, model API, and agentDir only, so native payload injection can bypass the richer runtime tool-policy context. (src/llm/providers/stream-wrappers/openai.ts:648, 3a9ea1d85bef)
  • Policy contract: OpenClaw tool policy expands groups such as group:web before allow/deny matching, and all active policies must allow a tool for it to remain available. (src/agents/tool-policy-match.ts:47, 3a9ea1d85bef)
  • Docs contract: The public docs define group:web as web_search, x_search, and web_fetch, while the web-search docs describe native OpenAI/Codex search as replacing or supplementing managed web_search under config. Public docs: docs/gateway/config-tools.md. (docs/gateway/config-tools.md:38, 3a9ea1d85bef)
  • Final PR diff: The final diff removes the earlier raw provider-hook policy context shape and exposes only agentId plus nativeWebSearchAllowedByToolPolicy on provider hook context while computing the full policy decision in core. (src/plugins/types.ts:706, 89d23b37803f)

Likely related people:

  • steipete: GitHub path history shows repeated recent work on Codex native-search docs, OpenAI provider identity, plugin SDK public helpers, and extension sources that overlap this PR's provider/native-search boundary. (role: plugin/provider SDK area owner by history; confidence: high; commits: e996956c294c, 4c33aaa86c16, a16c6ca94b23; files: src/agents/codex-native-web-search-core.ts, src/plugin-sdk/provider-stream.ts, src/plugins/types.ts)
  • vincentkoc: GitHub path history shows recent provider-runtime and provider-stream work, including provider fanout and OpenRouter/provider stream changes adjacent to the stream-wrapper and SDK surfaces touched here. (role: recent provider/runtime contributor; confidence: medium; commits: d07ba5f2656d, 34b3471f8553, 27b15a19e84c; files: src/plugin-sdk/provider-stream.ts, src/plugins/types.ts, src/llm/providers/stream-wrappers/openai.ts)
  • Jacob Tomlinson: The shallow local checkout blames the central native-search and provider-wrapper files to commit 8c3ba33, so this is a weak but current-main routing signal rather than deep authorship proof. (role: current checkout blame boundary; confidence: low; commits: 8c3ba3346311; files: src/agents/codex-native-web-search-core.ts, src/llm/providers/stream-wrappers/openai.ts, src/plugin-sdk/provider-stream.ts)
What the crustacean ranks mean
  • 🦀 challenger crab: rare, exceptional readiness with strong proof, clean implementation, and convincing validation.
  • 🦞 diamond lobster: very strong readiness with only minor maintainer review expected.
  • 🐚 platinum hermit: good normal PR, likely mergeable with ordinary maintainer review.
  • 🦐 gold shrimp: useful signal, but proof or patch confidence is still limited.
  • 🦪 silver shellfish: thin signal; proof, validation, or implementation needs work.
  • 🧂 unranked krab: not merge-ready because proof is missing/unusable or there are serious correctness or safety concerns.
  • 🌊 off-meta tidepool: rating does not apply to this item.

Shiny media proof means a screenshot, video, or linked artifact directly shows the changed behavior. Runtime, network, CSP, and security claims still need visible diagnostics.

How this review workflow works
  • ClawSweeper keeps one durable marker-backed review comment per issue or PR.
  • Re-runs edit this comment so the latest verdict, findings, and automation markers stay together instead of adding duplicate bot comments.
  • A fresh review can be triggered by eligible @clawsweeper re-review comments, exact-item GitHub events, scheduled/background review runs, or manual workflow dispatch.
  • PR/issue authors and users with repository write access can comment @clawsweeper re-review or @clawsweeper re-run on an open PR or issue to request a fresh review only.
  • Maintainers can also comment @clawsweeper review to request a fresh review only.
  • Fresh-review commands do not start repair, autofix, rebase, CI repair, or automerge.
  • Maintainer-only repair and merge flows require explicit commands such as @clawsweeper autofix, @clawsweeper automerge, @clawsweeper fix ci, or @clawsweeper address review.
  • Maintainers can comment @clawsweeper explain to ask for more context, or @clawsweeper stop to stop active automation.

@clawsweeper clawsweeper Bot added the rating: 🌊 off-meta tidepool PR readiness rating does not apply to this item. label Jun 9, 2026
@openclaw-barnacle openclaw-barnacle Bot added docs Improvements or additions to documentation size: L and removed size: M labels Jun 9, 2026
@clawsweeper clawsweeper Bot added rating: 🦪 silver shellfish Thin PR readiness signal; proof, validation, or implementation needs work. status: 📣 needs proof The PR needs real behavior proof before ClawSweeper can clear the contributor ask. P1 High-priority user-facing bug, regression, or broken workflow. merge-risk: 🚨 security-boundary 🚨 May affect sandboxing, authorization, credentials, or sensitive data. and removed rating: 🌊 off-meta tidepool PR readiness rating does not apply to this item. labels Jun 9, 2026
@eleqtrizit eleqtrizit self-assigned this Jun 9, 2026
@clawsweeper clawsweeper Bot added rating: 🧂 unranked krab Not merge-ready due to missing proof or serious correctness/safety concerns. and removed rating: 🦪 silver shellfish Thin PR readiness signal; proof, validation, or implementation needs work. labels Jun 9, 2026
@eleqtrizit

Copy link
Copy Markdown
Contributor Author

@clawsweeper re-review

Follow-up fixes pushed in 7f45e7bf90:

  • Direct OpenAI Responses native web_search now checks the shared native web-search policy decision before injecting { type: "web_search" }.
  • Added OpenAI provider regression coverage for agent-level web_search deny preserving the managed tool.
  • Fixed the compaction hook test expectation for nativeWebSearchPolicyContext.
  • Rebased onto current upstream/main.

Validation after rebase:

  • node scripts/run-vitest.mjs src/agents/codex-native-web-search.test.ts src/llm/providers/stream-wrappers/openai.test.ts src/agents/embedded-agent-runner-extraparams.test.ts extensions/openai/openai-provider.test.ts src/agents/embedded-agent-runner/compact.hooks.test.ts
  • corepack pnpm plugin-sdk:api:check
  • corepack pnpm tsgo:core
  • node scripts/check-src-extension-import-boundary.mjs --json
  • node scripts/check-sdk-package-extension-import-boundary.mjs --json
  • node scripts/check-test-helper-extension-import-boundary.mjs --json
  • git diff --check

@clawsweeper

clawsweeper Bot commented Jun 9, 2026

Copy link
Copy Markdown
Contributor

🦞🧹
ClawSweeper re-review requested.

I asked ClawSweeper to review this item again.
Action: item re-review queued (workflow sweep.yml, event repository_dispatch).
Result: the existing ClawSweeper review comment will be edited in place when the review finishes.

Re-review progress:

@clawsweeper clawsweeper Bot added the merge-risk: 🚨 compatibility 🚨 May break existing users, config, migrations, defaults, or upgrade paths. label Jun 9, 2026
@eleqtrizit

Copy link
Copy Markdown
Contributor Author

Behavioral proof

Commit tested: 89d23b3780

This proof uses the actual OpenAI provider stream wrapper and the core Codex native-search wrapper from the PR branch. The runtime input includes redacted sender/group values, but provider hooks receive only the final boolean policy decision.

Command run locally:

node --import tsx <redacted-runtime-payload-proof-script>

Observed redacted output:

{
  "provider_hook_denied_agent": {
    "allowedFlag": false,
    "leakedRawPolicyContext": false,
    "exposedIdentityKeys": [],
    "payloadTools": [
      { "type": "function", "name": "read" },
      { "type": "function", "name": "web_search" }
    ]
  },
  "provider_hook_allowed_agent": {
    "allowedFlag": true,
    "leakedRawPolicyContext": false,
    "exposedIdentityKeys": [],
    "payloadTools": [
      { "type": "function", "name": "read" },
      { "type": "web_search" }
    ]
  },
  "core_codex_denied_tools": [
    { "type": "function", "name": "read" }
  ],
  "core_codex_allowed_tools": [
    { "type": "function", "name": "read" },
    { "type": "web_search", "external_web_access": true }
  ]
}

What this proves:

  • Denied provider-hook path: native provider web_search is not injected; the managed web_search function remains available for normal OpenClaw policy handling.
  • Allowed provider-hook path: provider-native web_search is injected.
  • Provider-hook boundary: nativeWebSearchPolicyContext is not present, and no sender/group identity keys are exposed to provider hooks.
  • Denied core Codex wrapper path: native web_search is not injected.
  • Allowed core Codex wrapper path: native web_search is injected with live external access from config.

Validation also run after the provider-boundary fix:

node scripts/run-vitest.mjs src/agents/embedded-agent-runner-extraparams.test.ts extensions/openai/openai-provider.test.ts src/agents/codex-native-web-search.test.ts src/llm/providers/stream-wrappers/openai.test.ts src/agents/embedded-agent-runner/compact.hooks.test.ts
corepack pnpm plugin-sdk:api:check
corepack pnpm tsgo:core
git diff --check
node scripts/check-src-extension-import-boundary.mjs --json
node scripts/check-sdk-package-extension-import-boundary.mjs --json
node scripts/check-test-helper-extension-import-boundary.mjs --json
corepack pnpm build

Structured local review:

.agents/skills/autoreview/scripts/autoreview --mode local

Autoreview reported two findings that I checked and rejected:

  • The code-mode options finding does not apply because code-mode option rewriting returns before the policy-deny branch when visible code-mode tools are present.
  • The SDK compatibility finding does not apply to shipped API: the raw native-search policy context/helper were introduced only on this PR branch and were removed here to address the provider-hook privacy boundary before merge.

@eleqtrizit

Copy link
Copy Markdown
Contributor Author

@clawsweeper re-review

@clawsweeper

clawsweeper Bot commented Jun 9, 2026

Copy link
Copy Markdown
Contributor

🦞🧹
ClawSweeper re-review requested.

I asked ClawSweeper to review this item again.
Action: item re-review queued (workflow sweep.yml, event repository_dispatch).
Result: the existing ClawSweeper review comment will be edited in place when the review finishes.

Re-review progress:

@clawsweeper clawsweeper Bot added proof: sufficient ClawSweeper judged the real behavior proof convincing. and removed rating: 🧂 unranked krab Not merge-ready due to missing proof or serious correctness/safety concerns. labels Jun 9, 2026
@clawsweeper clawsweeper Bot added rating: 🐚 platinum hermit Good normal PR readiness with ordinary maintainer review expected. status: 👀 ready for maintainer look ClawSweeper has no concrete contributor-facing blocker left for this PR. and removed status: 📣 needs proof The PR needs real behavior proof before ClawSweeper can clear the contributor ask. labels Jun 9, 2026
@eleqtrizit

Copy link
Copy Markdown
Contributor Author

Maintainer verification: native web-search entry points

I checked ClawSweeper's note about missed native provider web_search runtime callers against current head 89d23b3780.

I do not see an uncovered hosted native web-search injection path:

  • src/plugin-sdk/provider-stream.ts:130 routes the shared OpenAI Responses family through createCodexNativeWebSearchWrapper() with nativeWebSearchAllowedByToolPolicy.
  • extensions/openai/shared.ts:89 routes the direct OpenAI API-key Responses wrapper through createOpenAINativeWebSearchWrapper() with the same boolean policy decision.
  • extensions/openai/native-web-search.ts:88 returns without payload patching when nativeWebSearchAllowedByToolPolicy === false, so direct OpenAI does not inject { type: "web_search" } on deny.
  • src/agents/embedded-agent-runner/extra-params.ts:1099 computes that boolean in core from the full session/group/sender/sandbox policy context before provider hooks see it.
  • src/agents/embedded-agent-runner/run/attempt.ts:2673 covers the code-mode direct wrapper path by passing the raw policy context into createCodexNativeWebSearchWrapper().
  • src/llm/providers/stream-wrappers/openai.ts:641 has an early code-mode branch, but that branch does not inject hosted search; it installs the payload filter from src/llm/providers/stream-wrappers/openai.ts:168, which strips non-code-mode payload tools including hosted web_search.

I also checked the other type: "web_search" hits. The notable non-OpenAI hit is extensions/xai/src/web-search-shared.ts:109, which is a managed web-search provider implementation invoked as the web_search tool, not a hidden model-call stream wrapper that can bypass OpenClaw tool policy.

Maintainer conclusion: the native hosted OpenAI/Codex web-search injection entry points are covered at 89d23b3780; I do not think ClawSweeper's maintainer-review note requires another code change. The separate live blocker is the unrelated checks-node-agentic-control-plane-agent-chat CI failure in src/gateway/server.chat.gateway-server-chat-b.test.ts:2262.

@eleqtrizit

Copy link
Copy Markdown
Contributor Author

Summary of All Verification Checks

Relevance

Confirmed in scope: the issue is a policy-consistency gap where native/provider Codex/OpenAI web_search tool injection bypassed agent-level tool deny policies. This is low-severity hardening (not CVE-class), with no demonstrated secret exfiltration or external mutation. The gap was verified unfixed in current main — no agentId, sessionKey, or tool policy parameters existed in the native search activation path, and no isToolAllowedByPolicies() check was performed before returning state: "native_active".

Compatibility

All changes are backward-compatible: every new parameter is optional, no config keys were removed or renamed, no CLI commands changed, no exported symbols removed, and no config schema changes were made. Existing users without deny policies see identical behavior. The Plugin SDK API baseline hash was updated for the intentional, source-compatible widening of ProviderPrepareExtraParamsContext. Import boundary checks (check-src-extension-import-boundary, check-sdk-package-extension-import-boundary, check-test-helper-extension-import-boundary) all pass with zero violations.

ClawSweeper

ClawSweeper reviewed the PR and flagged an infrastructure timeout (spawnSync codex ETIMEDOUT) — not a code quality rejection. The review was re-triggered after fixes and behavioral proof were posted. ClawSweeper also flagged the exposure of sender/group identity on provider hook context; this was addressed in a follow-up commit by narrowing the provider-facing surface to a resolved boolean (nativeWebSearchAllowedByToolPolicy) instead of raw identity data.

Code Reviews Completed

Multiple reviews were performed covering the full implementation. The core fix was iterated through several rounds: initial 6-layer policy enforcement was expanded to all 11 policy layers (profile, provider-profile, global, global-provider, agent, agent-provider, group, sender, sandbox, subagent, inherited). The direct OpenAI Responses wrapper was also fixed to check the same policy gate. All prior findings (TUI changes in diff, missing session-scoped layers, uncommitted working tree, SDK baseline hash) were addressed in subsequent commits.


Checks Completed

Check Status
Core implementation — 11 policy layer enforcement in native search activation
Direct OpenAI Responses wrapper policy gate
Managed tool suppression parity (managed tool shown when native blocked)
Compaction support (policy context preserved during session compaction)
Provider hook context narrowed to resolved boolean (no raw identity exposure)
Plugin SDK API baseline updated
TypeScript strict mode, no any types
Import boundary checks (3 scripts) — zero violations
Type checking (tsgo:core)
Build (pnpm build)
Import cycle check (pnpm check:import-cycles)
Unit tests — agent deny, session-derived group deny, provider deny, sender deny, sandbox deny, inherited session deny, managed tool suppression parity, OpenAI wrapper payload suppression
Behavioral proof — standalone runtime payload proof using actual provider wrappers with denied/allowed cases
Compatibility — optional params only, no config schema changes, no breaking changes
ClawSweeper review — infra timeout (not code rejection), re-triggered after fixes
CHANGELOG discipline — not edited (correct for contributor PR)
git diff --check — no whitespace errors

Abbreviations

Abbreviation Definition
SDK Software Development Kit
API Application Programming Interface
CI Continuous Integration
CVE Common Vulnerabilities and Exposures
GHSA GitHub Security Advisory
E2E End-to-End
RSS Resident Set Size (memory)
I/O Input/Output
LOC Lines of Code
TUI Terminal User Interface
PR Pull Request

@eleqtrizit eleqtrizit merged commit f0d8048 into openclaw:main Jun 9, 2026
184 of 191 checks passed
github-actions Bot pushed a commit to Desicool/openclaw that referenced this pull request Jun 10, 2026
* fix(search): enforce native web search tool policy

* fix(search): apply session policy to native web search

* fix(search): gate direct OpenAI native search

* fix(search): redact native web search provider context
eleboucher pushed a commit to eleboucher/homelab that referenced this pull request Jun 12, 2026
…26.6.6) (#1040)

This PR contains the following updates:

| Package | Update | Change |
|---|---|---|
| [ghcr.io/openclaw/openclaw](https://openclaw.ai) ([source](https://github.com/openclaw/openclaw)) | patch | `2026.6.5` → `2026.6.6` |

---

### Release Notes

<details>
<summary>openclaw/openclaw (ghcr.io/openclaw/openclaw)</summary>

### [`v2026.6.6`](https://github.com/openclaw/openclaw/blob/HEAD/CHANGELOG.md#202666)

[Compare Source](openclaw/openclaw@v2026.6.5...v2026.6.6)

##### Highlights

- Security boundaries are substantially tighter across transcripts, sandbox binds, host environment inheritance, MCP stdio, Codex HTTP access, native search policy, elevated sender checks, deleted-agent ACP bypasses, loopback tools, Discord moderation, and Teams group actions; exec approvals now fail closed on timeout. ([#&#8203;91529](openclaw/openclaw#91529), [#&#8203;91618](openclaw/openclaw#91618), [#&#8203;91615](openclaw/openclaw#91615), [#&#8203;91619](openclaw/openclaw#91619), [#&#8203;91741](openclaw/openclaw#91741), [#&#8203;91745](openclaw/openclaw#91745), [#&#8203;91746](openclaw/openclaw#91746), [#&#8203;91748](openclaw/openclaw#91748), [#&#8203;91749](openclaw/openclaw#91749), [#&#8203;91750](openclaw/openclaw#91750), [#&#8203;91751](openclaw/openclaw#91751), [#&#8203;91752](openclaw/openclaw#91752), [#&#8203;91763](openclaw/openclaw#91763), [#&#8203;89938](openclaw/openclaw#89938)) Thanks [@&#8203;joshavant](https://github.com/joshavant), [@&#8203;pgondhi987](https://github.com/pgondhi987), [@&#8203;mmaps](https://github.com/mmaps), [@&#8203;eleqtrizit](https://github.com/eleqtrizit), [@&#8203;shakkernerd](https://github.com/shakkernerd), and [@&#8203;drobison00](https://github.com/drobison00).
- Telegram delivery is safer and more coherent: account-scoped topics route to the right agent, streamed text survives tool calls, `/compact` works on generic ingress, callback handling uses concrete APIs, draft chunking is shared, durable dispatch dedupe moved into the SDK, and unauthorized DM text stays out of cache and prompt context. ([#&#8203;91189](openclaw/openclaw#91189), [#&#8203;88682](openclaw/openclaw#88682), [#&#8203;89588](openclaw/openclaw#89588), [#&#8203;90212](openclaw/openclaw#90212), [#&#8203;91876](openclaw/openclaw#91876), [#&#8203;91874](openclaw/openclaw#91874), [#&#8203;91904](openclaw/openclaw#91904), [#&#8203;91478](openclaw/openclaw#91478), [#&#8203;91915](openclaw/openclaw#91915)) Thanks [@&#8203;codysai001](https://github.com/codysai001), [@&#8203;alexzhu0](https://github.com/alexzhu0), [@&#8203;joelnishanth](https://github.com/joelnishanth), [@&#8203;snowzlm](https://github.com/snowzlm), [@&#8203;obviyus](https://github.com/obviyus), and [@&#8203;sallyom](https://github.com/sallyom).
- iMessage recovery and delivery now cover always-on inbound restart, durable echo markers, block streaming, idle approval discovery, hardened outbound transport, and actionable inbound startup diagnostics. ([#&#8203;91335](openclaw/openclaw#91335), [#&#8203;91449](openclaw/openclaw#91449), [#&#8203;88969](openclaw/openclaw#88969), [#&#8203;88530](openclaw/openclaw#88530), [#&#8203;91783](openclaw/openclaw#91783), [#&#8203;91785](openclaw/openclaw#91785)) Thanks [@&#8203;omarshahine](https://github.com/omarshahine), [@&#8203;jmissig](https://github.com/jmissig), and [@&#8203;colmbrogan](https://github.com/colmbrogan).
- Browser and MCP connectivity gained existing-session CDP support, discovered WebSocket validation, default-profile `cdpUrl` handling, safer browser-output boundaries, Streamable HTTP loopback transport, corrected OAuth/SSE authorization handling, and broader schema compatibility. ([#&#8203;91422](openclaw/openclaw#91422), [#&#8203;89851](openclaw/openclaw#89851), [#&#8203;91736](openclaw/openclaw#91736), [#&#8203;91747](openclaw/openclaw#91747), [#&#8203;91451](openclaw/openclaw#91451), [#&#8203;80143](openclaw/openclaw#80143)) Thanks [@&#8203;pgondhi987](https://github.com/pgondhi987), [@&#8203;anagnorisis2peripeteia](https://github.com/anagnorisis2peripeteia), [@&#8203;lifuyue](https://github.com/lifuyue), [@&#8203;eleqtrizit](https://github.com/eleqtrizit), [@&#8203;LiuwqGit](https://github.com/LiuwqGit), and [@&#8203;HemantSudarshan](https://github.com/HemantSudarshan).
- Control UI startup and first-reply latency are lower through cached model metadata, removal of the startup catalog wait, lazy slash-command loading, and first-event tracing with slow-reply diagnostics. ([#&#8203;91531](openclaw/openclaw#91531), [#&#8203;91538](openclaw/openclaw#91538), [#&#8203;91568](openclaw/openclaw#91568), [#&#8203;91583](openclaw/openclaw#91583), [#&#8203;91598](openclaw/openclaw#91598))
- Provider support expands with OpenRouter OAuth onboarding and Claude Fable 5 adaptive thinking, while Codex sessions keep correct compaction ownership, local models skip guardian review, dynamic tool progress normalizes cleanly, and Gemma 4 reasoning replay is preserved. ([#&#8203;91830](openclaw/openclaw#91830), [#&#8203;91882](openclaw/openclaw#91882), [#&#8203;91590](openclaw/openclaw#91590), [#&#8203;88630](openclaw/openclaw#88630), [#&#8203;88768](openclaw/openclaw#88768), [#&#8203;91696](openclaw/openclaw#91696)) Thanks [@&#8203;Patrick-Erichsen](https://github.com/Patrick-Erichsen), [@&#8203;joshavant](https://github.com/joshavant), [@&#8203;bdjben](https://github.com/bdjben), and [@&#8203;Coder-Wangyankun](https://github.com/Coder-Wangyankun).

##### Changes

- CLI progress: emit Claude CLI commentary progress events and bridge inter-tool commentary into channel progress without exposing internal protocol scaffolding. ([#&#8203;89834](openclaw/openclaw#89834), [#&#8203;90883](openclaw/openclaw#90883)) Thanks [@&#8203;anagnorisis2peripeteia](https://github.com/anagnorisis2peripeteia).
- Observability: allow trusted diagnostics channels to capture tool input/output content, add first-assistant-event traces, and warn on slow initial replies. ([#&#8203;91256](openclaw/openclaw#91256), [#&#8203;91568](openclaw/openclaw#91568), [#&#8203;91583](openclaw/openclaw#91583)) Thanks [@&#8203;amknight](https://github.com/amknight).
- Plugins/ClawHub: dogfood reusable package publishing, let dry runs skip publish approval, allow declared installed trusted hooks, report managed plugin version drift, and warn instead of failing on retired Skill Workshop configuration. ([#&#8203;91574](openclaw/openclaw#91574), [#&#8203;91591](openclaw/openclaw#91591), [#&#8203;90004](openclaw/openclaw#90004), [#&#8203;90927](openclaw/openclaw#90927), [#&#8203;90838](openclaw/openclaw#90838)) Thanks [@&#8203;Patrick-Erichsen](https://github.com/Patrick-Erichsen), [@&#8203;brokemac79](https://github.com/brokemac79), and [@&#8203;lonexreb](https://github.com/lonexreb).
- Memory/providers: move the local llama.cpp runtime into its provider plugin, batch embeddings across files, persist the agent model catalog cache, and keep QMD JSON search one-shot while filtering stale REM recall previews. ([#&#8203;91324](openclaw/openclaw#91324), [#&#8203;89138](openclaw/openclaw#89138), [#&#8203;90457](openclaw/openclaw#90457), [#&#8203;91837](openclaw/openclaw#91837), [#&#8203;91851](openclaw/openclaw#91851)) Thanks [@&#8203;osolmaz](https://github.com/osolmaz), [@&#8203;mushuiyu886](https://github.com/mushuiyu886), [@&#8203;ai-hpc](https://github.com/ai-hpc), and [@&#8203;TurboTheTurtle](https://github.com/TurboTheTurtle).
- Channels/mobile: add the QQBot group mention toggle, improve iPad and iPhone control surfaces, and expose the active connection host in the TUI footer. ([#&#8203;91423](openclaw/openclaw#91423), [#&#8203;91557](openclaw/openclaw#91557), [#&#8203;89909](openclaw/openclaw#89909)) Thanks [@&#8203;cxyhhhhh](https://github.com/cxyhhhhh), [@&#8203;Solvely-Colin](https://github.com/Solvely-Colin), and [@&#8203;baskduf](https://github.com/baskduf).
- Performance: prewarm TUI runtime plugins, deduplicate plugin auto-enable fanout, trim dense text-delta snapshots, and reuse prepared startup model metadata. ([#&#8203;90782](openclaw/openclaw#90782), [#&#8203;89978](openclaw/openclaw#89978), [#&#8203;91580](openclaw/openclaw#91580), [#&#8203;91531](openclaw/openclaw#91531)) Thanks [@&#8203;RomneyDa](https://github.com/RomneyDa) and [@&#8203;ai-hpc](https://github.com/ai-hpc).

##### Fixes

- Agent/session recovery: drop stale approval follow-ups after session rebind, remove drained reply-queue items by identity, recover stale main and visible replies, preserve Codex context-engine compaction ownership, lower the default compaction timeout to 180 seconds while respecting explicit configuration, and keep provider-failure terminal lifecycle state correct. ([#&#8203;85679](openclaw/openclaw#85679), [#&#8203;91450](openclaw/openclaw#91450), [#&#8203;91566](openclaw/openclaw#91566), [#&#8203;91840](openclaw/openclaw#91840), [#&#8203;91590](openclaw/openclaw#91590), [#&#8203;91361](openclaw/openclaw#91361), [#&#8203;91895](openclaw/openclaw#91895)) Thanks [@&#8203;openperf](https://github.com/openperf), [@&#8203;yetval](https://github.com/yetval), [@&#8203;joshavant](https://github.com/joshavant), [@&#8203;wangmiao0668000666](https://github.com/wangmiao0668000666), and [@&#8203;TurboTheTurtle](https://github.com/TurboTheTurtle).
- User-visible content boundaries: suppress Codex/Harmony protocol artifacts, neutralize browser and LanceDB memory media directives, redact transcript images, and preserve native `/compact` replies through source suppression. ([#&#8203;89151](openclaw/openclaw#89151), [#&#8203;91422](openclaw/openclaw#91422), [#&#8203;91425](openclaw/openclaw#91425), [#&#8203;91529](openclaw/openclaw#91529), [#&#8203;90212](openclaw/openclaw#90212)) Thanks [@&#8203;joelnishanth](https://github.com/joelnishanth), [@&#8203;pgondhi987](https://github.com/pgondhi987), [@&#8203;joshavant](https://github.com/joshavant), and [@&#8203;snowzlm](https://github.com/snowzlm).
- Channel delivery: keep WhatsApp captured replies attached to the successor controller after restart, retry Feishu rate limits, preserve Mattermost thread replies, canonicalize LINE webhook paths, restore Discord reply hydration and runtime timeout exports, and show OpenAI Realtime WebRTC assistant transcripts. ([#&#8203;85823](openclaw/openclaw#85823), [#&#8203;89659](openclaw/openclaw#89659), [#&#8203;91684](openclaw/openclaw#91684), [#&#8203;91649](openclaw/openclaw#91649), [#&#8203;90263](openclaw/openclaw#90263), [#&#8203;91686](openclaw/openclaw#91686), [#&#8203;90426](openclaw/openclaw#90426)) Thanks [@&#8203;itsuzef](https://github.com/itsuzef), [@&#8203;ladygege](https://github.com/ladygege), [@&#8203;jacobtomlinson](https://github.com/jacobtomlinson), [@&#8203;fuller-stack-dev](https://github.com/fuller-stack-dev), and [@&#8203;shushushv](https://github.com/shushushv).
- Cron: cancel active task runs cleanly, preserve terminal timeout/cancel state, and recover no-deliver tool warnings instead of silently losing the outcome. ([#&#8203;90666](openclaw/openclaw#90666), [#&#8203;90678](openclaw/openclaw#90678)) Thanks [@&#8203;ai-hpc](https://github.com/ai-hpc).
- Gateway/config/auth: share the approval runtime socket token, replace arrays explicitly in `config.patch`, skip the deleted-agent guard only for valid ACP harness sessions, surface headless LaunchAgent state, verify SQLite auth migration before cleanup, and arm QMD startup maintenance. ([#&#8203;87105](openclaw/openclaw#87105), [#&#8203;91551](openclaw/openclaw#91551), [#&#8203;91219](openclaw/openclaw#91219), [#&#8203;91614](openclaw/openclaw#91614), [#&#8203;91740](openclaw/openclaw#91740), [#&#8203;91978](openclaw/openclaw#91978)) Thanks [@&#8203;fuller-stack-dev](https://github.com/fuller-stack-dev) and [@&#8203;scotthuang](https://github.com/scotthuang).
- Providers/Codex: clarify quota errors, restore the Codex synthetic usage line, canonicalize Codex protocol assets, require API-key auth for realtime voice, normalize ACP model refs, preserve Gemma 4 `reasoning_content`, and avoid guardian review for local models. ([#&#8203;91390](openclaw/openclaw#91390), [#&#8203;91709](openclaw/openclaw#91709), [#&#8203;91507](openclaw/openclaw#91507), [#&#8203;91567](openclaw/openclaw#91567), [#&#8203;88630](openclaw/openclaw#88630), [#&#8203;91696](openclaw/openclaw#91696)) Thanks [@&#8203;hxy91819](https://github.com/hxy91819), [@&#8203;brokemac79](https://github.com/brokemac79), [@&#8203;RomneyDa](https://github.com/RomneyDa), [@&#8203;joshavant](https://github.com/joshavant), and [@&#8203;Coder-Wangyankun](https://github.com/Coder-Wangyankun).
- Updates/builds: recover package Gateway restarts after refresh failure, expose plugin convergence repair, fall back to Corepack in PATH-less pnpm environments, seed the correct Docker store packages, and keep ClawHub dry-run and publish paths reusable. ([#&#8203;91581](openclaw/openclaw#91581), [#&#8203;91599](openclaw/openclaw#91599), [#&#8203;91547](openclaw/openclaw#91547), [#&#8203;91591](openclaw/openclaw#91591)) Thanks [@&#8203;fuller-stack-dev](https://github.com/fuller-stack-dev), [@&#8203;sallyom](https://github.com/sallyom), and [@&#8203;Patrick-Erichsen](https://github.com/Patrick-Erichsen).
- UI: require explicit user intent before opening chat sessions and drain restored chat queues after session switches. ([#&#8203;91480](openclaw/openclaw#91480)) Thanks [@&#8203;TurboTheTurtle](https://github.com/TurboTheTurtle).
- Android: avoid the `dataSync` foreground-service type for persistent nodes. ([#&#8203;80082](openclaw/openclaw#80082)) Thanks [@&#8203;davelutztx](https://github.com/davelutztx).
- Native hooks: bound relay lifetimes so abandoned native hook connections cannot linger indefinitely. ([#&#8203;91550](openclaw/openclaw#91550)) Thanks [@&#8203;joshavant](https://github.com/joshavant).

</details>

---

### Configuration

📅 **Schedule**: Branch creation - At any time (no schedule defined), Automerge - At any time (no schedule defined).

🚦 **Automerge**: Disabled by config. Please merge this manually once you are satisfied.

♻ **Rebasing**: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox.

🔕 **Ignore**: Close this PR and you won't be reminded about these updates again.

---

 - [ ] <!-- rebase-check -->If you want to rebase/retry this PR, check this box

---

This PR has been generated by [Renovate Bot](https://github.com/renovatebot/renovate).
<!--renovate-debug:eyJjcmVhdGVkSW5WZXIiOiI0My4xMDEuMSIsInVwZGF0ZWRJblZlciI6IjQzLjEwMS4xIiwidGFyZ2V0QnJhbmNoIjoibWFpbiIsImxhYmVscyI6WyJyZW5vdmF0ZS9jb250YWluZXIiLCJ0eXBlL3BhdGNoIl19-->

Reviewed-on: https://git.erwanleboucher.dev/eleboucher/homelab/pulls/1040
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

agents Agent runtime and tooling docs Improvements or additions to documentation extensions: openai maintainer Maintainer-authored PR merge-risk: 🚨 compatibility 🚨 May break existing users, config, migrations, defaults, or upgrade paths. merge-risk: 🚨 security-boundary 🚨 May affect sandboxing, authorization, credentials, or sensitive data. P1 High-priority user-facing bug, regression, or broken workflow. proof: sufficient ClawSweeper judged the real behavior proof convincing. rating: 🐚 platinum hermit Good normal PR readiness with ordinary maintainer review expected. size: L status: 👀 ready for maintainer look ClawSweeper has no concrete contributor-facing blocker left for this PR.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant