fix(decision-engine): record per-candidate policy verdicts; close partial-block leak by jayzalowitz · Pull Request #82 · jayzalowitz/skytwin

jayzalowitz · 2026-04-26T22:09:57Z

Summary

Closes 2/3 of #80 (items 2 and 3). E2E coverage (item 1) deferred to a follow-up PR.

Item 2 — Partial-block leak in `whatWouldIDo` (the bigger fix)

`DecisionOutcome` gains an optional `policyVerdicts: Record<actionId, 'allowed' | 'requires-approval' | 'denied'>`. Populated by `evaluate()`, not persisted (DB layer ignores unknown fields).
`evaluate()` now evaluates every candidate against policy instead of breaking at the first allowed one. Selection is unchanged (first non-denied still wins), but lower-scored candidates now have verdicts so consumers can filter accurately. K-1 extra in-memory policy checks per decision in the worst case — acceptable given policy evaluation is cheap and decisions typically have <10 candidates.
`whatWouldIDo` filters `alternativeActions` using verdicts. Blocked candidates no longer leak as suggestions. Conservative fallback: when verdicts are unavailable (legacy or stale outcome), alternatives default to empty.

Item 3 — Blocked-by-policy explanation persistence

`ExplanationGenerator` now sets `escalationRationale` for the no-action case. Previously the audit log silently dropped the policy-block reason — Safety Invariant Make SkyTwin usable by a non-technical person end-to-end #2 violation.
`formatForUser` uses a context-aware label: "Why no action was taken" for blocked records, "Why approval was needed" for requires-approval records.

Test plan

`pnpm --filter @skytwin/decision-engine test` — 82/82 (was 79, +3)
`pnpm --filter @skytwin/explanations test` — 33/33 (was 30, +3)
`pnpm test` — 38/38 turbo tasks
`pnpm lint` — 30/30 turbo tasks
`pnpm build` — 19/19 packages

Refs #80

🤖 Generated with Claude Code

…tial-block leak Closes 2/3 of #80. Items 2 (whatWouldIDo partial-block leak) and 3 (blocked-by-policy explanation persistence). E2E coverage (item 1) deferred to a follow-up. What changed: - DecisionOutcome gains an optional policyVerdicts: Record<actionId, 'allowed' | 'requires-approval' | 'denied'>. Populated by evaluate(), not persisted (DB layer ignores unknown fields). - evaluate() now evaluates every candidate against policy instead of breaking at the first allowed one. The first non-denied candidate still wins (no behavior change for selection) but lower-scored candidates get verdicts so consumers can filter accurately. Adds K-1 extra in-memory policy checks per decision in the worst case. - whatWouldIDo filters alternativeActions using verdicts. Blocked candidates no longer leak as suggestions. Conservative fallback: when verdicts are unavailable (legacy code path or stale outcome), alternatives default to empty. - ExplanationGenerator now sets escalationRationale for the no-action case (every candidate blocked). Previously the audit log dropped the policy-block reason. Safety Invariant #2. - formatForUser uses a context-aware label: "Why no action was taken" for blocked-by-policy records, "Why approval was needed" for requires-approval records. Tests: +6 (decision-engine: +3, explanations: +3). All existing tests still pass: decision-engine 82/82, explanations 33/33, full repo 38/38 turbo tasks. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

Copilot

Pull request overview

This PR fixes the “partial-block leak” by recording a per-candidate policy verdict during decision evaluation and using those verdicts to filter whatWouldIDo alternatives, and it also ensures blocked-by-policy explanations persist the escalation rationale for audit fidelity.

Changes:

Extend DecisionOutcome with optional policyVerdicts and add the PolicyVerdict type.
Update DecisionMaker.evaluate() to evaluate policy for every candidate and populate policyVerdicts; update whatWouldIDo to conservatively filter alternatives using these verdicts.
Persist escalationRationale for the no-action (all-blocked) path and adjust user formatting labels accordingly; add unit tests for both fixes.

Reviewed changes

Copilot reviewed 6 out of 6 changed files in this pull request and generated no comments.

Show a summary per file

File	Description
packages/shared-types/src/index.ts	Re-exports the new `PolicyVerdict` type.
packages/shared-types/src/decision.ts	Adds `DecisionOutcome.policyVerdicts` and defines the `PolicyVerdict` union.
packages/decision-engine/src/decision-maker.ts	Records per-candidate policy verdicts in `evaluate()` and filters `whatWouldIDo` alternatives to non-denied candidates (with conservative fallback).
packages/decision-engine/src/tests/decision-maker.test.ts	Adds coverage for `policyVerdicts` population and partial/legacy filtering behavior in `whatWouldIDo`.
packages/explanations/src/explanation-generator.ts	Ensures `escalationRationale` is set for no-action outcomes and uses a context-aware label in `formatForUser`.
packages/explanations/src/tests/explanation-generator.test.ts	Adds tests for escalation rationale persistence and the new label behavior.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

…h work (#87) Captures eight PRs (#77, #78, #79, #81, #82, #83, #84, #85, #86) under an [Unreleased] section. No code changes — pure documentation. Bumping VERSION + cutting a tag is left as a separate decision so the release moment stays explicit. Co-authored-by: Claude Opus 4.7 <noreply@anthropic.com>

Copilot AI review requested due to automatic review settings April 26, 2026 22:09

Copilot started reviewing on behalf of jayzalowitz April 26, 2026 22:15 View session

Copilot AI reviewed Apr 26, 2026

View reviewed changes

jayzalowitz merged commit 7825524 into main Apr 26, 2026
12 checks passed

jayzalowitz deleted the jayzalowitz/policy-verdicts branch April 26, 2026 22:57

jayzalowitz mentioned this pull request Apr 27, 2026

docs: capture safety-kernel and dashboard-polish work in CHANGELOG #87

Merged

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(decision-engine): record per-candidate policy verdicts; close partial-block leak#82

fix(decision-engine): record per-candidate policy verdicts; close partial-block leak#82
jayzalowitz merged 1 commit into
mainfrom
jayzalowitz/policy-verdicts

jayzalowitz commented Apr 26, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

jayzalowitz commented Apr 26, 2026

Summary

Test plan

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants