feat(extensions): add Sondera policy guardrails [AI-assisted] by joshdevon · Pull Request #8448 · openclaw/openclaw

joshdevon · 2026-02-04T02:08:13Z

Summary

Adds Sondera, a Cedar-based policy guardrails extension that enforces deterministic security rules on tool calls before they execute.

103 rules across 3 policy packs: Sondera Base (41), OpenClaw System (24), OWASP Agentic (38)
Blocks dangerous commands (rm -rf, sudo, reverse shells), credential access, and data exfiltration
Redacts secrets (API keys, tokens) from session transcripts
Fail-closed evaluation: only explicit permit rules allow actions
Optional lockdown mode: block all tools unless explicitly permitted
User-configurable custom Cedar rules via Settings UI

Built by Sondera, powered by Cedar (AWS's policy language).
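
The fail-closed behaviour listed in the summary can be sketched roughly as follows. This is a minimal model of the decision logic, not the extension's evaluator: the `Rule`, `ToolCall`, and `decide` names are hypothetical, and real Cedar evaluation happens inside the Cedar engine. It only illustrates that a call is allowed when at least one permit matches and no forbid matches, so an empty rule set (the lockdown case) denies everything.

```typescript
// Hypothetical types for illustration only; not the extension's real API.
type Effect = "permit" | "forbid";
type Decision = "allow" | "deny";

interface ToolCall {
  tool: string;
  command: string;
}

interface Rule {
  id: string;
  effect: Effect;
  // Returns true when the rule applies to the given tool call.
  matches: (call: ToolCall) => boolean;
}

// Fail-closed: deny unless at least one permit matches and no forbid matches.
// An empty rule set therefore denies every call.
function decide(rules: Rule[], call: ToolCall): Decision {
  let permitted = false;
  for (const rule of rules) {
    if (!rule.matches(call)) continue;
    if (rule.effect === "forbid") return "deny"; // a matching forbid always wins
    permitted = true;
  }
  return permitted ? "allow" : "deny";
}

// Example rule set (made up for this sketch).
const rules: Rule[] = [
  { id: "allow-exec", effect: "permit", matches: (c) => c.tool === "exec" },
  { id: "block-sudo", effect: "forbid", matches: (c) => /\bsudo\b/.test(c.command) },
];
```

With these example rules, `decide(rules, { tool: "exec", command: "sudo whoami" })` is denied because the forbid rule matches, and `decide([], ...)` is denied because nothing permits the call.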

Related Work

This extension depends on plugin hooks landing. It is complementary to the ongoing security work:

feat(gateway): support modular guardrails extensions for securing against indirect prompt injections and other agentic threats #6095 — Modular Guardrails Extensions (adds before_tool_call/after_tool_call hooks)
feat: interceptor pipeline for tool, message, and params events #6569 — Interceptor Pipeline
Plugin hooks (agent_end, before_tool_call, etc.) are never invoked #5513 — Plugin hooks timing fix

AI Disclosure 🤖

AI-assisted
Lightly tested against local OpenClaw instance
Team reviewed and understands the code

Test Plan

Install extension: openclaw plugins install @openclaw/sondera
Verify blocking: run sudo whoami in a session → "Blocked by Sondera policy"
Toggle policy packs in Settings UI
Test lockdown mode with custom permit rules

Greptile Overview

Greptile Summary

This PR adds the new extensions/sondera plugin, which evaluates Cedar policies (via @cedar-policy/cedar-wasm) to block risky tool calls pre-execution and redact sensitive tool outputs when persisting results. It also wires tool-call enforcement into the embedded runner by wrapping tool execute() to run before_tool_call hooks and throw a dedicated ToolBlockedError on blocks.

The extension is integrated using the existing plugin hook system (before_tool_call, after_tool_call, tool_result_persist) and exposes configuration (policy packs, lockdown mode, custom rules/path) via openclaw.plugin.json schema/UI hints.
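
The `execute()` wrapping that the summary describes might look something like the sketch below. The `Tool`, `HookResult`, and `BeforeToolCall` shapes are assumptions made for illustration and do not reflect the actual OpenClaw types; only the names `wrapToolForHooks` and `ToolBlockedError` come from the PR discussion. The sketch also includes the `typeof tool.execute !== "function"` guard mentioned later in the thread.

```typescript
// All shapes below are illustrative stand-ins, not the real OpenClaw types.
class ToolBlockedError extends Error {
  readonly toolName: string;
  constructor(toolName: string, reason: string) {
    super(`Tool "${toolName}" blocked: ${reason}`);
    this.name = "ToolBlockedError";
    this.toolName = toolName;
  }
}

interface Tool {
  name: string;
  execute?: (params: Record<string, unknown>) => Promise<unknown>;
}

interface HookResult {
  block: boolean;
  reason?: string;
}

type BeforeToolCall = (
  toolName: string,
  params: Record<string, unknown>,
) => Promise<HookResult>;

function wrapToolForHooks(tool: Tool, beforeToolCall: BeforeToolCall): Tool {
  // Tools without an execute function are returned untouched.
  if (typeof tool.execute !== "function") return tool;
  const original = tool.execute.bind(tool);
  return {
    ...tool,
    execute: async (params) => {
      // Run the hook first; only call the real tool if it does not block.
      const verdict = await beforeToolCall(tool.name, params);
      if (verdict.block) {
        throw new ToolBlockedError(tool.name, verdict.reason ?? "policy");
      }
      return original(params);
    },
  };
}
```

Throwing a dedicated error type lets the caller tell a policy block apart from an ordinary tool failure.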

Main issues to address before merge are around Cedar policy parsing robustness (regex-based parsing can skip/truncate real policies), edge cases in lockdown mode policy construction, and a couple of logger calls that can throw when debug isn’t present.

Confidence Score: 2/5

This PR is not safe to merge as-is because policy parsing/initialization issues can silently disable intended guardrails.
The core security value of the PR depends on Cedar policies being parsed and evaluated correctly. The current regex-based policy extraction can skip/truncate real-world when { ... } blocks, and lockdown mode can feed comment-only text as a “policy,” both of which can lead to missing rules or inconsistent behavior. There are also a couple of runtime footguns (logger debug calls without optional chaining; tool wrapping assumes execute exists) that can break hooks/session runs.
extensions/sondera/evaluator.ts, extensions/sondera/index.ts, src/agents/pi-embedded-runner/run/attempt.ts

greptile-apps

3 files reviewed, 6 comments

extensions/sondera/evaluator.ts

extensions/sondera/index.ts

extensions/sondera/evaluator.ts

src/agents/pi-embedded-runner/run/attempt.ts

joshdevon · 2026-02-04T03:48:13Z

Greptile Issues Addressed

Fixed in commits ead6622 and 96cb792:

| Issue | Fix |
| --- | --- |
| P0 Lockdown mode comment-only | Use empty policy set (Cedar implicit deny) instead of invalid comment string |
| P1 Logger debug calls | Added `?.` optional chaining to all 6 `api.logger.debug` calls |
| P1 evaluatePostTool diagnostics | Use `isRedactionPolicy()` with naming convention (`sondera-redact-`, `owasp-redact-`) instead of special-casing `default-allow` |
| P1 wrapToolForHooks | Added `typeof tool.execute !== "function"` guard |

Also replaced regex-based Cedar policy parsing with native cedar.policySetTextToParts() and cedar.policyToJson() APIs to properly handle nested braces and complex expressions.
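
Two of the fixes listed in this comment are small enough to sketch. The prefixes `sondera-redact-` and `owasp-redact-` and the name `isRedactionPolicy` come from the comment; the function body, the `Logger` shape, and `logDecision` are assumptions made for illustration.

```typescript
// Prefixes taken from the comment; the implementation is an assumption.
const REDACTION_PREFIXES = ["sondera-redact-", "owasp-redact-"] as const;

function isRedactionPolicy(policyId: string): boolean {
  return REDACTION_PREFIXES.some((prefix) => policyId.startsWith(prefix));
}

// Hypothetical logger shape in which `debug` may be absent.
interface Logger {
  info: (msg: string) => void;
  debug?: (msg: string) => void;
}

function logDecision(logger: Logger, policyId: string): void {
  // `?.` turns the call into a no-op instead of a TypeError when debug is undefined.
  logger.debug?.(`matched ${policyId} (redaction: ${isRedactionPolicy(policyId)})`);
}
```

Identifying redaction policies by ID prefix avoids special-casing a single policy such as `default-allow`, at the cost of relying on policy authors to follow the naming convention.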

Adds the Sondera extension which provides deterministic Cedar policy guardrails for OpenClaw agents. Evaluates policies before tool execution (PRE_TOOL) and redacts sensitive output (POST_TOOL).
- 41-rule Sondera base pack (dangerous commands, RCE, sensitive files)
- 24-rule OpenClaw system protection pack
- 38-rule OWASP Agentic pack (opt-in)
- Lockdown mode for block-by-default operation
- Custom rules support via config UI or expert mode file

- Change evaluator to fail-closed: only explicit "allow" permits
- Handle lockdown mode with no policy packs (Cedar default-deny)
- Add debug logging for config initialization
- Fix syntax error in policy-sondera-base.cedar

Replace regex-based policy parsing with Cedar's native policySetTextToParts() and policyToJson() APIs. The regex [^}]* stops at the first } character, breaking policies with } in string patterns (e.g., like "*}*").
- evaluator.ts: Use policySetTextToParts() for robust parsing
- validate-cedar.ts: Use checkParsePolicySet() for validation
- Add error handling for parse failures
Fixes Greptile P0: regex truncation issue
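
The truncation described in this commit message can be reproduced with plain string matching. The policy text and the `context.params.command` attribute below are made up for illustration, and the regex is a guess at the style of the removed code rather than a copy of it. The sketch shows only the failure mode, not the Cedar-based replacement.

```typescript
// A made-up policy whose `like` pattern contains a literal closing brace.
const policyText = `@id("block-brace")
forbid(principal, action, resource)
when { context.params.command like "*}*" };`;

// Naive extraction: [^}]* cannot cross a "}", so it stops inside the string literal.
const naiveWhenBlock = /when\s*\{([^}]*)\}/;
const captured = naiveWhenBlock.exec(policyText)?.[1] ?? "";

// `captured` ends at the brace inside the quoted pattern, so the condition is cut short.
console.log(JSON.stringify(captured.trim()));
```

Because the character class has no notion of string literals, any `}` inside a quoted pattern ends the match early, which is why delegating to a real parser is the more robust choice.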

- Lockdown mode: use empty policy set (Cedar implicit deny) instead of comment
- Add optional chaining to all logger.debug calls
- Use naming convention to identify redaction policies (sondera-redact-*, owasp-redact-*)
- Guard wrapToolForHooks against tools without execute function

- Remove unused catch parameter `err` in index.ts (3 instances)
- Rename unused `ctx` parameters to `_ctx` in hook callbacks
- Add curly braces to single-line if statement in evaluator.ts
- Use proper TypeScript types instead of `any` in wrapToolForHooks
- Fix template literal with unknown error type

…errors The previous change to use `unknown` types broke type compatibility with AnyAgentTool and ToolDefinition. Using `any` with explicit oxlint-disable comments is appropriate here since we're wrapping tools with varying signatures.

On Windows, `new URL(import.meta.url).pathname` returns `/C:/path/...` with a leading slash. When used with path.resolve, this creates invalid paths like `C:\C:\...`. Using fileURLToPath from the url module correctly handles cross-platform path conversion.
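
A minimal sketch of the `fileURLToPath` approach this commit message describes. The helper name `resolveBeside` and the file names are hypothetical; `fileURLToPath`, `path.dirname`, and `path.resolve` are standard Node.js APIs.

```typescript
import { fileURLToPath } from "node:url";
import * as path from "node:path";

// Resolve a file that sits next to the module identified by `moduleUrl`.
// In an ES module, pass `import.meta.url` as `moduleUrl`.
function resolveBeside(moduleUrl: string, fileName: string): string {
  // fileURLToPath converts a file: URL into a platform-appropriate path,
  // unlike URL.pathname, which keeps the leading slash on Windows.
  const moduleDir = path.dirname(fileURLToPath(moduleUrl));
  return path.resolve(moduleDir, fileName);
}
```

A call such as `resolveBeside(import.meta.url, "policy-sondera-base.cedar")` would then yield a usable path on both POSIX systems and Windows.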

Add Sondera fork installation instructions with PR openclaw#8448 reference. Matches blog post and Sondera docs for consistent guidance until hooks are merged into mainline OpenClaw.

Resolve merge conflicts in pnpm-lock.yaml and src/agents/pi-embedded-runner/run/attempt.ts

- Remove duplicate hookRunner declaration in attempt.ts
- Cast EvaluationContext to Cedar's Context type in evaluator.ts
- Guard against AgentMessage variants without content in index.ts

The formal_conformance job fails on fork PRs because GitHub restricts the GITHUB_TOKEN to read-only for pull_request events from forks. Add continue-on-error to the comment step so the job succeeds gracefully — the drift artifact is still uploaded regardless.

The inferred return type of getSlackSlashMocks references @vitest/spy internals which are not portable under pnpm strict node_modules. Add an explicit return type annotation to prevent the error.

- exec-approvals: add stripUndefinedFields to send.shared mock
- discord actions: remove stale loadHandleDiscordMessageAction call
- web/media: pass explicit localRoots to avoid os.tmpdir() overlap

# Conflicts:
# src/discord/monitor/exec-approvals.test.ts
# src/slack/monitor/slash.test-harness.ts

On CI (Linux), file writes within the same second share the same mtime. The session store cache uses mtime to detect stale entries, so when a previous test caches the empty store and this test writes new data in the same second, loadSessionStore returns stale cached data. Clear the cache in beforeEach to ensure each test reads fresh data.
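
The staleness problem described here can be modelled in a few lines. Everything below is an in-memory stand-in written for illustration (`FakeFile`, `loadCached`, `clearCache`); it is not the real session store or `loadSessionStore`, and it assumes the cache compares modification times at one-second granularity, as the commit message implies.

```typescript
// Illustrative in-memory stand-ins; not the real session store.
interface FakeFile {
  mtimeSec: number; // modification time, truncated to whole seconds
  data: string;
}

const cache = new Map<string, { mtimeSec: number; data: string }>();

function loadCached(key: string, file: FakeFile): string {
  const hit = cache.get(key);
  // The entry is treated as fresh whenever the mtime is unchanged.
  if (hit && hit.mtimeSec === file.mtimeSec) return hit.data;
  cache.set(key, { mtimeSec: file.mtimeSec, data: file.data });
  return file.data;
}

function clearCache(): void {
  cache.clear();
}
```

If a file is rewritten within the same second, `loadCached` keeps returning the old contents until `clearCache()` runs, which is the situation that clearing the cache in `beforeEach` avoids.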

On Windows, path.resolve("/tmp/...") produces "C:\tmp\..." which doesn't match hardcoded Unix paths. Use path.resolve() in the expected values so the test passes on all platforms.

openclaw-barnacle · 2026-02-21T03:05:39Z

Please make this a third-party plugin that you maintain yourself in your own repo. Docs: https://docs.openclaw.ai/plugin. Feel free to open a PR afterwards to add it to our community plugins page: https://docs.openclaw.ai/plugins/community

openclaw-barnacle bot added the agents Agent runtime and tooling label Feb 4, 2026

greptile-apps bot reviewed Feb 4, 2026

joshdevon force-pushed the sondera-pr branch from 84f9fd6 to dfb333c Compare February 5, 2026 15:08

joshdevon force-pushed the sondera-pr branch from dfb333c to 73e846c Compare February 6, 2026 16:03

joshdevon added 10 commits February 6, 2026 14:45

docs(sondera): update community links to OpenClaw

00e0a28

docs(sondera): add pre-release installation instructions

f977faa

fix: update pnpm-lock.yaml after rebase onto upstream/main

96ee949

joshdevon force-pushed the sondera-pr branch from 73e846c to 96ee949 Compare February 6, 2026 19:46

joshdevon added 2 commits February 6, 2026 14:58

fix(sondera): remove duplicate hookRunner declaration after rebase

5642a9d

merge upstream/main to resolve pnpm-lock.yaml conflict

6d07122

Reapor-Yurnero mentioned this pull request Feb 9, 2026

feat(gateway): support modular guardrails extensions for securing against indirect prompt injections and other agentic threats #6095

Closed

Merge upstream/main and resolve conflicts

e261978

openclaw-barnacle bot added the size: XL label Feb 15, 2026

joshdevon added 4 commits February 15, 2026 10:51

fix: resolve type errors in sondera extension and attempt runner

fdbb322

Merge remote-tracking branch 'upstream/main' into sondera-pr

e1d5dbc

fix: annotate return type in slash.test-harness to avoid TS2742

1739e17

openclaw-barnacle bot added the channel: slack Channel integration: slack label Feb 15, 2026

fix(test): resolve test failures after upstream merge

0e6deb8

Merge remote-tracking branch 'upstream/main' into sondera-pr

389f2e8

openclaw-barnacle bot added channel: whatsapp-web Channel integration: whatsapp-web and removed channel: slack Channel integration: slack labels Feb 15, 2026

openclaw-barnacle bot added the channel: discord Channel integration: discord label Feb 15, 2026

joshdevon added 2 commits February 15, 2026 12:38

Merge remote-tracking branch 'upstream/main' into sondera-pr

6665e01

fix(test): use path.resolve in cleanup-utils test for Windows compat

393bd3e

openclaw-barnacle bot added the commands Command implementations label Feb 15, 2026

thewilloftheshadow force-pushed the main branch from bfc1ccb to f92900f Compare February 15, 2026 18:46

Merge upstream/main, resolve conflicts in test files

d60787f

openclaw-barnacle bot removed the commands Command implementations label Feb 16, 2026

Merge upstream/main, resolve import conflict in exec-approvals test

8093d06

gstudiogdesigns-max mentioned this pull request Feb 17, 2026

[Bug]: Security - before_tool_call hook return values ({ cancel: true }) are silently discarded — tool executes regardless #19231

Closed

joshdevon added 2 commits February 17, 2026 10:44

style: fix import order in validate-cedar.ts for oxfmt 0.33

5ab7056

fix(test): use local stateDir variable in media test

2034e39

joshdevon changed the title ~~feat(extensions): add Sondera Cedar policy guardrails [AI-assisted]~~ feat(extensions): add Sondera policy guardrails [AI-assisted] Feb 20, 2026

thewilloftheshadow added the r: third-party-extension label Feb 21, 2026

openclaw-barnacle bot closed this Feb 21, 2026

feat(extensions): add Sondera policy guardrails [AI-assisted] #8448
joshdevon wants to merge 26 commits into openclaw:main from sondera-ai:sondera-pr


2 participants
