fix(build/orchestrator): codex /review read-only sandbox blocks own verdict write (Bug J) by anbangr · Pull Request #108 · anbangr/gstack

anbangr · 2026-05-27T02:42:10Z

Summary

Closes Bug J — the sixth bug class behind the "no commit / dirty tree" hygiene-gate symptom. The codex /review subagent was launched with -s read-only for non-code or auditOnly phases, blocking codex from writing its own verdict output file. Result: 0-byte verdict, parser sees no GATE PASS/FAIL line, orchestrator triggers primary-impl-rerun recovery, recovery cascades into blind-execution failure.

Canonical incident: BUGREPORT-2026-05-27-build-manual-investigation-d4b344.md filed by /build investigate from the live mitosis-oasis-phase1-part2 run at ~02:06 UTC. Affected phase 5.1 (GovernanceRegistry fixture review), all 4 review iterations.

ERROR codex_core::tools::router:
  error=patch rejected: writing is blocked by read-only sandbox;
  rejected by user approval settings
BLOCKED: read-only sandbox prevented writing
  <worktree>/.llm-tmp/gstack-codex-5.1-4-review-output.md

Root cause

cli.ts:5662-5665 runReviewGates' runGate closure:

const readOnlyArtifactGate =
  opts.phase.kind !== "code" || opts.phase.auditOnly === true;
const sandbox =
  attempt?.sandbox ?? (readOnlyArtifactGate ? "read-only" : undefined);

The intent was to prevent the reviewer from modifying source on non-code phases. But the codex /review prompt requires writing the verdict file. stageCodexIO stages the output inside the worktree (so it survives the cwd-restricted sandbox), but -s read-only blocks ALL writes including the staged output. The verdict ends up empty.

Fix

Drop the read-only force-default. The source-mutation concern is already covered by the SAME defense-in-depth model the feature-review path (sub-agents.ts:3921-3944) uses:

Prompt forbids worktree edits (verdict-only output target)
applyGateHygiene catches any worktree mutation post-spawn → HYGIENE_FAULT
Same-shape repeat detector halts the loop after 2 identical HYGIENE_FAULTs

The attempt.sandbox explicit-override path (shouldRetryCodexGateWithDangerFullAccess sandbox-retry fallback) is preserved verbatim. The env-driven override at GSTACK_BUILD_CODEX_REVIEW_SANDBOX still wins.

22-line comment block at the runGate call site documents the bug shape, why the heuristic was dropped, where source-mutation enforcement actually lives, and what overrides still work — so the next maintainer who touches this dispatcher sees the historical context.

What changed

File	Δ
`build/orchestrator/cli.ts`	+24/-4: drop `readOnlyArtifactGate` heuristic from `runGate` closure; add 22-line Bug J comment block
`build/orchestrator/__tests__/codex-review-sandbox-bug-j.test.ts`	NEW: T-J1..T-J4 (4 tests, 9 expects, all pass)

Defense-in-depth: source-mutation enforcement still works

The fix REMOVES a force-default; it doesn't remove the security boundary. The hygiene gate runs after every gate (cli.ts:5696-5705) and catches any worktree mutation as HYGIENE_FAULT. If codex /review attempts to edit source on a non-code phase, it's still caught — just by the post-spawn classifier instead of the launch-time sandbox.

The feature-review path has used this model for months without incident; phase-review now matches.

Test plan

bun test build/orchestrator/__tests__/codex-review-sandbox-bug-j.test.ts — 4/4 pass
bun test (full free suite) — running, see CI
No regression in existing review-gate tests (test files unchanged)

Architectural payoff for the "no commit / dirty tree" symptom space

Bug	Symptom	Resolved by
A	FEATURE_REDO findings deadlock	PR #103
E	Codex sandbox EPERM on `.git/worktrees/<run>/index.lock`	PR #104
F	Prose summary lacks parseable paths, recovery refuses	PR #105
H	Codex test-writer over-reaches into production code	PR #106
I	"test-spec writer exited 0" hides hygiene reason	PR #107
J	Codex /review read-only sandbox blocks own verdict write	THIS PR

After Bug J lands, the same defense-in-depth that secured feature-review now secures phase-review. Source mutations stay blocked by the hygiene gate (post-spawn detection + HYGIENE_FAULT classification + same-shape repeat halting), and codex's verdict write succeeds on the first try — no cascading primary-impl-rerun recovery, no operator intervention.

Fork-versioning note

Per CLAUDE.md fork rule: fork-local skill work in build/orchestrator/ only. No VERSION / package.json / top-level CHANGELOG.md bump.

Source links

Bug report: ~/.gstack/skill-faults/inbox/BUGREPORT-2026-05-27-build-manual-investigation-d4b344.md
Failing logs: ~/.gstack/build-state/build-mitosis-oasis-phase1-part2-20260526-083551-a484f0aa/phase-5.1-review-{1,2,3,4}.log
Defense-in-depth reference: build/orchestrator/sub-agents.ts:3921-3944 (buildCodexFeatureReviewArgv JSDoc)
Related: PR fix(build/orchestrator): test-spec writer surfaces hygiene reason instead of "exited 0" (Bug I) + side-fix rsync preserves user BUGREPORTs #107 (Bug I attribution), PR fix(build/orchestrator): Bug E — Codex sandbox can't commit on linked worktrees (--add-dir + recovery fallback + critical /ship hardening) #104 (Bug E sandbox EPERM)

🤖 Generated with Claude Code

…erdict write (Bug J) Closes Bug J — `runReviewGates`'s `runGate` closure forced `-s read-only` on every codex /review call where `phase.kind !== "code"` or `auditOnly === true`. The intent was to prevent the reviewer from modifying source on non-code phases. The result: codex couldn't write its own verdict file either, because stageCodexIO stages the output inside the worktree (so it survives the cwd-restricted sandbox), but read-only blocks ALL writes including the staged output. Canonical incident (filed by `/build investigate` from the live mitosis run, ~/.gstack/skill-faults/inbox/BUGREPORT-2026-05-27-build-manual-investigation-d4b344.md): mitosis-oasis-phase1-part2 phase 5.1 (GovernanceRegistry fixture review) failed all 4 review iterations with 0-byte output files. Codex log shows: ERROR codex_core::tools::router: error=patch rejected: writing is blocked by read-only sandbox; rejected by user approval settings BLOCKED: read-only sandbox prevented writing <worktree>/.llm-tmp/gstack-codex-5.1-4-review-output.md Empty output -> parser sees no GATE PASS/FAIL line -> orchestrator triggers primary-impl-rerun recovery -> recovery fails its own hygiene check with "blind execution -- input file unreachable; changes discarded" -> phase FAILED. Affects every codex review call when review.provider=codex on a non-code phase. Estimated to hit 5+ more phases in F5-F9 if not fixed. Fix: drop the read-only force-default. The source-mutation concern that motivated the heuristic is already covered by the SAME defense-in-depth model the feature-review path (sub-agents.ts:3921-3944) uses: 1. Prompt forbids worktree edits (verdict-only output target). 2. applyGateHygiene runs after every gate and catches any worktree mutation post-spawn, converting it to HYGIENE_FAULT. 3. Same-shape repeat detector halts the loop after 2 identical HYGIENE_FAULTs (caller upstream). The `attempt.sandbox` explicit-override path (used by shouldRetryCodexGateWithDangerFullAccess for the sandbox-retry fallback) is preserved verbatim. The env-driven override at `GSTACK_BUILD_CODEX_REVIEW_SANDBOX` still wins. Source-mutation enforcement comes from the hygiene gate, not from forcing read-only at launch. 22-line comment block at the runGate call site documents: - What the bug looked like (read-only blocks verdict write) - Why the heuristic was dropped (broken intent: blocks own writes) - Where source-mutation enforcement actually lives (applyGateHygiene + same-shape repeat detector) - What overrides still work (attempt.sandbox, env var) Tests (T-J1 through T-J4 in build/orchestrator/__tests__/codex-review-sandbox-bug-j.test.ts): - T-J1: static-grep guard — no `const readOnlyArtifactGate =` in cli.ts. The identifier name appears once in the Bug J comment block as intentional documentation; the regex matches the const declaration form (the actual bug source), not the bare token. - T-J2: static-grep guard — no `attempt?.sandbox ?? (readOnlyArtifactGate` or `readOnlyArtifactGate ? "read-only"` patterns. Pins both half-revert scenarios at the bit level. - T-J3: static-grep guard — runGate closure now reads `const sandbox = attempt?.sandbox;` AND the cli.ts contains "Bug J" + "applyGateHygiene" tokens (the comment block cross-reference is required so the next maintainer who touches the runGate dispatcher sees the historical context). - T-J4: behavioral guard — the runGate closure block (matched as source range) MUST NOT contain `"read-only"` literal. Other `"read-only"` literals in cli.ts (judge sandbox at runJudgeRole, runtime sandbox-retry paths) are fine; this test specifically scopes the assertion to the runGate dispatcher. 4/4 pass. Architectural payoff for the "no commit / dirty tree" symptom space: | Bug | Symptom | Resolved by | |---|---|---| | A | FEATURE_REDO findings deadlock | PR #103 | | E | Codex sandbox EPERM on .git/worktrees/<run>/index.lock | PR #104 | | F | Prose summary lacks parseable paths, recovery refuses | PR #105 | | H | Codex test-writer over-reaches into production code | PR #106 | | I | "test-spec writer exited 0" hides hygiene reason | PR #107 | | J | Codex /review read-only sandbox blocks own verdict write | THIS PR | After Bug J lands, the same defense-in-depth that secured feature-review now secures phase-review. Source mutations stay blocked by the hygiene gate (post-spawn detection + HYGIENE_FAULT classification + same-shape repeat halting), and codex's verdict write succeeds on the first try — no cascading primary-impl-rerun recovery, no operator intervention. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

…ighten test guards (Bug J inline hardening) Inline hardening from /review pipeline. Security specialist (HIGH) and Codex adversarial (P0, 9/10) both independently flagged the same MULTI-CONFIRMED security regression in the initial Bug J commit: The initial Bug J fix dropped the read-only sandbox floor on non-code review gates. The PR comment claimed defense-in-depth via applyGateHygiene converting reviewer source mutations to HYGIENE_FAULT. That claim is WRONG. `phaseAllowsGateSourceFixes(opts.phase)` returns true for research/writing/experiment/manual phases unconditionally, which means applyGateHygiene's auto-commit path (`maybeAutoCommitTestOnlyDirty` with `allowNonTestPaths: true`) AUTO-COMMITS the reviewer's source mutations instead of rejecting them. Net effect of the unhardened Bug J: silent reviewer-driven commits on every non-code phase. A prompt-injection in a reviewed artifact could persist source changes through the review gate, and the operator would see a clean GATE PASS. This hardening commit: 1. Scopes the `allowNonTestPaths` escape hatch to the QA gate ONLY. review and reviewSecondary gates are READ-ONLY by contract — they evaluate the implementation and emit a verdict; they don't fix code. QA keeps fix-on-review semantics (that's intentional — QA was designed for it). Introduces a local `gateAllowsSourceFixes` const that ANDs `name === "qa"` with the existing `phaseAllowsGateSourceFixes(opts.phase)` heuristic. Applied at both applyGateHygiene call sites in runReviewGates (primary + sandbox-retry paths). 2. 14-line comment block at the runGate call site documents: - Why the qa-only scoping exists (security regression class) - Why QA is exempt (fix-on-review is intentional for QA) - What the read-only sandbox used to protect structurally (now via hygiene-gate scope rather than OS sandbox) 3. Adds T-J5 (positive static-grep guard): pins the new `gateAllowsSourceFixes` local + both applyGateHygiene call sites using it. 4. Adds T-J6 (negative static-grep guard): runReviewGates block (whole function body via regex match) must NOT contain `allowNonTestPaths: phaseAllowsGateSourceFixes(opts.phase)` — the unhardened code form. A refactor that re-collapses the source fails CI. 5. Widens T-J4 regex window from 2500 → 5000 chars (adversarial Finding 5: original block was at 2249/2500, one new if-block away from overflowing the window and crashing the test with TypeError on the non-null assertion). 6/6 pass on the regression suite (T-J1..T-J6). Deferred concerns (multi-source adversarial findings, will become Bug K): - HEAD-advance not blocked on review gates (Codex P0 #2): a reviewer can `git commit` source changes (the new workspace-write sandbox allows it), write GATE PASS, leave the tree clean — no HYGIENE_FAULT because validatePostAgentHygiene doesn't check `requireNewCommit` for review gates. Fix: pass `requireNewCommit: false` AND add a separate "HEAD must not advance" check on review/reviewSecondary gates. - Failed/timed-out reviewers skip hygiene entirely (Codex P1 #3): applyGateHygiene early-returns on `timedOut || exitCode !== 0` at cli.ts:6177. A reviewer can edit source, crash before verdict, retry runs against the mutated tree. Fix: always run dirty-discard check for review gates regardless of exit code, OR add explicit `discardWorktreeChangesOnFailure: true` opt for review gate calls. - .git/ directory mutations not caught by git status (Security MEDIUM #2): hooks, .git/config remote rewrites, .git/info/exclude tampering all bypass validatePostAgentHygiene because git status doesn't report them. Fix: snapshot mtime+sha256 of .git/config, .git/hooks/*, .git/info/exclude, .git/HEAD in captureGitSnapshot. These three are separate architectural concerns on the same surface that deserve their own dedicated PR (Bug K) rather than a panic patch on Bug J. This PR's qa-only scoping closes the multi-confirmed P0 from 2 of 3 reviewers; Bug K will close the residuals from codex adversarial. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

anbangr · 2026-05-27T02:51:14Z

/review pipeline — 3 reviewers, 1 multi-confirmed P0, 2 deferred residuals

Inline hardening applied (commit `850ab3a9`)

Multi-confirmed P0 (security specialist HIGH + Codex adversarial P0 9/10): the initial Bug J commit's defense-in-depth claim was WRONG. phaseAllowsGateSourceFixes returned true for research/writing/experiment/manual phases unconditionally, which made applyGateHygiene AUTO-COMMIT reviewer source mutations instead of converting them to HYGIENE_FAULT.

Fix: scoped the allowNonTestPaths escape hatch to the QA gate ONLY. review and reviewSecondary are READ-ONLY by contract; QA keeps fix-on-review semantics. 2 added tests (T-J5/T-J6) pin both the positive form (gateAllowsSourceFixes local) and the negative form (no allowNonTestPaths: phaseAllowsGateSourceFixes(opts.phase) anywhere in runReviewGates).

Also widened T-J4 regex window from 2500→5000 chars (Finding 5: original block was at 2249/2500, one new if-block away from crashing the test).

Deferred residual concerns → Bug K (separate PR)

#	Source	Severity	Concern
1	Codex adversarial P0 #2	HIGH	HEAD-advance not blocked on review gates. A reviewer can `git commit` source changes under workspace-write, write GATE PASS, leave the tree clean → no HYGIENE_FAULT because validatePostAgentHygiene doesn't check `requireNewCommit` for review gates
2	Codex adversarial P1 #3	MEDIUM	Failed/timed-out reviewers skip hygiene entirely. applyGateHygiene early-returns on `timedOut \|\| exitCode !== 0` at cli.ts:6177. A reviewer can edit source, crash before verdict, retry runs against mutated tree
3	Security specialist MEDIUM #2	MEDIUM	`.git/` directory mutations (hooks, config rewrites, info/exclude tampering) bypass validatePostAgentHygiene because git status doesn't report them

These are separate architectural concerns on the same surface that deserve their own dedicated investigation rather than a panic patch on this PR. The qa-only scoping closes the multi-confirmed P0 from 2 of 3 reviewers; Bug K will close codex's residuals + the .git tracking gap.

Verdict

bun test build/orchestrator/__tests__/codex-review-sandbox-bug-j.test.ts 6/6 pass.
bun test full suite passing.

Ready to land. Bug K follow-up PR will address the 3 residuals.

🤖 Generated with Claude Code

… from PR #108 (Bug K) (#109) Closes Bug K (K1+K2+K3) — three trust-boundary residuals left open after PR #108's qa-only `gateAllowsSourceFixes` scoping. All three were multi-source confirmed by the security specialist + Codex adversarial in PR #108's review pipeline, documented in the PR comment, and queued as follow-up. This PR closes them together. The underlying architectural shift: PR #108 dropped the read-only sandbox floor on review/reviewSecondary gates so codex could write its verdict file. The PR comment promised defense-in-depth via post-spawn `applyGateHygiene`. Three gaps remained: K1 — HEAD-advance not blocked (HIGH, codex P0 #2) A reviewer can `git commit` source changes under the new workspace-write sandbox, write GATE PASS, leave the tree clean, and the gate accepts the commit silently. validatePostAgentHygiene only checked dirty deltas; HEAD advance was invisible. Fix: `applyGateHygiene` accepts `requireNoHeadAdvance?: boolean`. When true, compares before/after HEAD; mismatch emits "reviewer roles are read-only; only QA may fix-on-review". runReviewGates passes `requireNoHeadAdvance: gateIsReviewer` (true for review/reviewSecondary, false for QA — QA's contract IS to commit fix-on-review changes). K2 — Failed/timed-out reviewers skipped hygiene entirely (MEDIUM, codex P1 #3) Pre-fix early-return at applyGateHygiene's top preserved any worktree mutations the reviewer left behind. The next retry / recovery path ran against the mutated tree. Fix: `applyGateHygiene` accepts `discardOnFailure?: boolean`. When true AND the gate spawn failed (timedOut || exitCode !== 0), call `discardBlindExecutionChanges` to roll back worktree mutations before returning. The original result (failed exit code, timeout flag) is preserved — this is a side-effect cleanup, not a result rewrite. runReviewGates passes `discardOnFailure: true` for ALL gate roles (review/reviewSecondary/qa) because all three share the workspace-write-with-no-OS-floor failure mode. K3 — `.git/` directory mutations bypassed hygiene gate (MEDIUM, security #2) `git status --porcelain` doesn't report mutations under `.git/`: - `.git/config` (remote URL rewrites for exfiltration) - `.git/hooks/*` (post-merge hook injection) - `.git/info/exclude` (silently exclude files from `git add`) - `.git/HEAD` (silent branch swap) - `.git/packed-refs` (ref tampering) Pre-PR-#108 the OS read-only sandbox blocked all of these at the syscall level. With workspace-write, nothing did. Fix: - `GitSnapshot` interface gains optional `gitDirHashes?: Map<string, string>` - `captureGitSnapshot` accepts `captureGitDirContents?: boolean`. When true, sha256-hashes HEAD, config, info/exclude, packed-refs, plus a sorted-listing hash of `hooks/` (with executable-bit detection so chmod +x on a previously-disarmed hook also surfaces). Uses `git rev-parse --git-dir` AND `--git-common-dir` so linked worktrees track BOTH their per-worktree files (HEAD, index) AND the shared common dir's hooks/config (mirrors the Bug E --add-dir resolution pattern). - New `validateGitDirUnchanged` validator diffs before/after gitDirHashes. Each mismatch produces one error naming the exact `.git/` path so operators can grep. - runReviewGates upgrades its `captureGitSnapshot` call to `{ captureContents: true, captureGitDirContents: true }`. - `applyGateHygiene` checks list now includes `validateGitDirUnchanged` alongside the existing two validators. No-op when `before.gitDirHashes` is absent (back-compat: only gates that opt in to capturing get the K3 check). Tests (T-K1..T-K6 + T-K6a..T-K6f in build/orchestrator/__tests__/bug-k-review-gate-trust-boundary.test.ts): Behavioral (K3 enforcement): - T-K1: captureGitSnapshot populates gitDirHashes when requested. - T-K2: gitDirHashes is absent by default (back-compat). - T-K3: validateGitDirUnchanged detects post-merge hook injection (creates `.git/hooks/post-merge` between snapshots, asserts error contains "hooks/" + "reviewer/QA roles must NOT modify git metadata"). - T-K4: validateGitDirUnchanged detects .git/config rewrite (uses `git remote add evil <url>` between snapshots). - T-K5: validateGitDirUnchanged is a no-op when the before snapshot doesn't have gitDirHashes (back-compat regression guard). Static-grep wiring (K1/K2/K3 surface guards): - T-K6a: applyGateHygiene declares requireNoHeadAdvance + discardOnFailure opts. - T-K6b: BOTH runReviewGates call sites (primary gate + sandbox-retry) pass requireNoHeadAdvance=gateIsReviewer + discardOnFailure=true. - T-K6c: captureGitSnapshot accepts captureGitDirContents; runReviewGates passes both captureContents AND captureGitDirContents true. - T-K6d: applyGateHygiene checks list now includes validateGitDirUnchanged. - T-K6e: K1 error message names the exact regression class. - T-K6f: K2 failed-gate cleanup wires through discardBlindExecutionChanges. 11/11 pass on the regression suite. Architectural payoff: | Bug | Symptom | Resolved by | |---|---|---| | A | FEATURE_REDO findings deadlock | PR #103 | | E | Codex sandbox EPERM on linked worktrees | PR #104 | | F | Prose summary lacks parseable paths, recovery refuses | PR #105 | | H | Codex test-writer over-reaches into production code | PR #106 | | I | "test-spec writer exited 0" hides hygiene reason | PR #107 | | J | Codex /review read-only sandbox blocks own verdict | PR #108 | | K | 3 review-gate trust-boundary residuals from PR #108 | THIS PR | After Bug K lands, the review-gate trust boundary is structurally sound: - Reviewer mutations to source: caught by validatePostAgentHygiene (existing) + scoped allowNonTestPaths (PR #108) - Reviewer commits: caught by K1 requireNoHeadAdvance - Reviewer mutations on failed runs: discarded by K2 discardOnFailure - Reviewer mutations to .git metadata: caught by K3 validateGitDirUnchanged The same defense-in-depth model now matches feature-review's (sub-agents.ts:3921-3944) but with full git-state coverage instead of just worktree-state. Plan reference: ~/.claude/plans/fixing-plan-bugs-k-through-n-post-pr-108.md PR #108 comment: #108 (comment) Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

anbangr and others added 2 commits May 27, 2026 10:41

anbangr merged commit 27025f5 into main May 27, 2026

anbangr mentioned this pull request May 27, 2026

fix(build/orchestrator): close 3 review-gate trust-boundary residuals from PR #108 (Bug K) #109

Merged

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(build/orchestrator): codex /review read-only sandbox blocks own verdict write (Bug J)#108

fix(build/orchestrator): codex /review read-only sandbox blocks own verdict write (Bug J)#108
anbangr merged 2 commits into
mainfrom
worktree-fix-codex-review-readonly-sandbox-blocks-output

anbangr commented May 27, 2026

Uh oh!

anbangr commented May 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

anbangr commented May 27, 2026

Summary

Root cause

Fix

What changed

Defense-in-depth: source-mutation enforcement still works

Test plan

Architectural payoff for the "no commit / dirty tree" symptom space

Fork-versioning note

Source links

Uh oh!

anbangr commented May 27, 2026

/review pipeline — 3 reviewers, 1 multi-confirmed P0, 2 deferred residuals

Inline hardening applied (commit 850ab3a9)

Deferred residual concerns → Bug K (separate PR)

Verdict

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Inline hardening applied (commit `850ab3a9`)