fix: retry transient Guardian reviewer failures by viyatb-oai · Pull Request #26334 · openai/codex

viyatb-oai · 2026-06-04T07:08:11Z

Why

A Guardian review can fail because the reviewer is temporarily unavailable, not because the requested action violates policy. Treating capacity, rate-limit, timeout, transport, or retryable server failures as immediate denials conflates infrastructure health with a safety decision, blocks otherwise valid work after one transient error, and presents misleading risk language to the user.

What changed

retry the Guardian review once when the first attempt times out or fails with a transient availability error, including capacity, rate-limit, connection, transport, overload, and retryable 5xx failures
disable nested provider request and stream retries for Guardian sessions so the single outer retry is the complete retry budget
add a cancellable full-jitter delay of up to three seconds before availability retries so concurrent failures do not immediately synchronize into a retry spike
retry model timeouts immediately with a shorter 30-second deadline, bounding the worst-case review time to about two minutes instead of three
emit a GuardianWarning before retrying so clients can distinguish reviewer availability from a policy denial
preserve any existing approval retry context and append a bounded, static reviewer-retry explanation
keep raw server and transport errors out of the second model-visible review prompt
fail closed if the retry also fails, while describing prompt construction, reviewer session, and response parsing failures as review failures rather than unacceptable-risk decisions
leave completed Guardian assessments and policy denials unchanged and non-retryable

Scope

This is the reviewer-availability slice split from #26128. It intentionally does not add denial kinds, approval reuse, or new TUI/app UX.

Related independent slices: #26231, #26333, and #26232.

Validation

added an end-to-end test_codex scenario covering a transient reviewer failure followed by a successful review
verifies warning emission, the second Guardian request, preservation of the action and justification, suppression of the raw reviewer error, and execution after approval
added focused coverage for availability and timeout retry plans, including their warning, delay, and deadline behavior

Co-authored-by: Codex noreply@openai.com

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: e014da9666

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

## Why Auto Review should remain the effective approval reviewer when settings cross runtime boundaries. A config or app-server round trip must not change the reviewer identity, and delegated work must not silently fall back to user review. This requires both a stable canonical serialized value and propagation of the effective setting. `auto_review` is the canonical value across protocol and app-server output, while `guardian_subagent` remains accepted as backward-compatible input. ## What changed - serialize `ApprovalsReviewer::AutoReview` consistently as `auto_review` across core protocol and app-server v2 - continue accepting `guardian_subagent` when reading existing config or client requests - carry the active turn's approval reviewer into spawned agents - update config/debug expectations and add delegated-task regression coverage ## Scope This does not change Guardian policy or remove compatibility with existing `guardian_subagent` inputs. It preserves the selected reviewer across serialization, config reloads, app-server settings, and delegated task setup. Related Guardian changes are split independently: - #26231 adds denials and soft denials - #26334 retries transient reviewer failures - #26333 reuses narrowly scoped low-risk approvals - #26232 adds TUI denial recovery ## Validation - `just test -p codex-app-server-protocol` (224 passed) - regression coverage for delegated task reviewer propagation - serialization coverage for canonical `auto_review` output and legacy `guardian_subagent` input --------- Co-authored-by: saud-oai <saud@openai.com>

Co-authored-by: Codex noreply@openai.com

## Why Auto Review should remain the effective approval reviewer when settings cross runtime boundaries. A config or app-server round trip must not change the reviewer identity, and delegated work must not silently fall back to user review. This requires both a stable canonical serialized value and propagation of the effective setting. `auto_review` is the canonical value across protocol and app-server output, while `guardian_subagent` remains accepted as backward-compatible input. ## What changed - serialize `ApprovalsReviewer::AutoReview` consistently as `auto_review` across core protocol and app-server v2 - continue accepting `guardian_subagent` when reading existing config or client requests - carry the active turn's approval reviewer into spawned agents - update config/debug expectations and add delegated-task regression coverage ## Scope This does not change Guardian policy or remove compatibility with existing `guardian_subagent` inputs. It preserves the selected reviewer across serialization, config reloads, app-server settings, and delegated task setup. Related Guardian changes are split independently: - openai#26231 adds denials and soft denials - openai#26334 retries transient reviewer failures - openai#26333 reuses narrowly scoped low-risk approvals - openai#26232 adds TUI denial recovery ## Validation - `just test -p codex-app-server-protocol` (224 passed) - regression coverage for delegated task reviewer propagation - serialization coverage for canonical `auto_review` output and legacy `guardian_subagent` input --------- Co-authored-by: saud-oai <saud@openai.com>

…-reviewer-retries

Co-authored-by: Codex noreply@openai.com

## Why Auto Review should remain the effective approval reviewer when settings cross runtime boundaries. A config or app-server round trip must not change the reviewer identity, and delegated work must not silently fall back to user review. This requires both a stable canonical serialized value and propagation of the effective setting. `auto_review` is the canonical value across protocol and app-server output, while `guardian_subagent` remains accepted as backward-compatible input. ## What changed - serialize `ApprovalsReviewer::AutoReview` consistently as `auto_review` across core protocol and app-server v2 - continue accepting `guardian_subagent` when reading existing config or client requests - carry the active turn's approval reviewer into spawned agents - update config/debug expectations and add delegated-task regression coverage ## Scope This does not change Guardian policy or remove compatibility with existing `guardian_subagent` inputs. It preserves the selected reviewer across serialization, config reloads, app-server settings, and delegated task setup. Related Guardian changes are split independently: - openai#26231 adds denials and soft denials - openai#26334 retries transient reviewer failures - openai#26333 reuses narrowly scoped low-risk approvals - openai#26232 adds TUI denial recovery ## Validation - `just test -p codex-app-server-protocol` (224 passed) - regression coverage for delegated task reviewer propagation - serialization coverage for canonical `auto_review` output and legacy `guardian_subagent` input --------- Co-authored-by: saud-oai <saud@openai.com>

Co-authored-by: Codex noreply@openai.com

## Why Auto Review should remain the effective approval reviewer when settings cross runtime boundaries. A config or app-server round trip must not change the reviewer identity, and delegated work must not silently fall back to user review. This requires both a stable canonical serialized value and propagation of the effective setting. `auto_review` is the canonical value across protocol and app-server output, while `guardian_subagent` remains accepted as backward-compatible input. ## What changed - serialize `ApprovalsReviewer::AutoReview` consistently as `auto_review` across core protocol and app-server v2 - continue accepting `guardian_subagent` when reading existing config or client requests - carry the active turn's approval reviewer into spawned agents - update config/debug expectations and add delegated-task regression coverage ## Scope This does not change Guardian policy or remove compatibility with existing `guardian_subagent` inputs. It preserves the selected reviewer across serialization, config reloads, app-server settings, and delegated task setup. Related Guardian changes are split independently: - openai#26231 adds denials and soft denials - openai#26334 retries transient reviewer failures - openai#26333 reuses narrowly scoped low-risk approvals - openai#26232 adds TUI denial recovery ## Validation - `just test -p codex-app-server-protocol` (224 passed) - regression coverage for delegated task reviewer propagation - serialization coverage for canonical `auto_review` output and legacy `guardian_subagent` input --------- Co-authored-by: saud-oai <saud@openai.com>

Co-authored-by: Codex noreply@openai.com

## Why Auto Review should remain the effective approval reviewer when settings cross runtime boundaries. A config or app-server round trip must not change the reviewer identity, and delegated work must not silently fall back to user review. This requires both a stable canonical serialized value and propagation of the effective setting. `auto_review` is the canonical value across protocol and app-server output, while `guardian_subagent` remains accepted as backward-compatible input. ## What changed - serialize `ApprovalsReviewer::AutoReview` consistently as `auto_review` across core protocol and app-server v2 - continue accepting `guardian_subagent` when reading existing config or client requests - carry the active turn's approval reviewer into spawned agents - update config/debug expectations and add delegated-task regression coverage ## Scope This does not change Guardian policy or remove compatibility with existing `guardian_subagent` inputs. It preserves the selected reviewer across serialization, config reloads, app-server settings, and delegated task setup. Related Guardian changes are split independently: - openai#26231 adds denials and soft denials - openai#26334 retries transient reviewer failures - openai#26333 reuses narrowly scoped low-risk approvals - openai#26232 adds TUI denial recovery ## Validation - `just test -p codex-app-server-protocol` (224 passed) - regression coverage for delegated task reviewer propagation - serialization coverage for canonical `auto_review` output and legacy `guardian_subagent` input --------- Co-authored-by: saud-oai <saud@openai.com>

fix: retry transient Guardian reviewer failures

e014da9

Co-authored-by: Codex noreply@openai.com

viyatb-oai requested a review from a team as a code owner June 4, 2026 07:08

chatgpt-codex-connector Bot reviewed Jun 4, 2026

View reviewed changes

Comment thread codex-rs/core/src/guardian/review.rs Outdated

Comment thread codex-rs/core/src/guardian/review.rs Outdated

Comment thread codex-rs/core/src/guardian/review.rs Outdated

Comment thread codex-rs/core/src/guardian/tests.rs Outdated

fix: clarify Guardian retry messages

518962f

Co-authored-by: Codex noreply@openai.com

viyatb-oai added 4 commits June 8, 2026 21:24

Merge remote-tracking branch 'origin/main' into codex/viyatb/guardian…

025e22c

…-reviewer-retries

test: cover Guardian reviewer retries end to end

e46d7f8

Co-authored-by: Codex noreply@openai.com

test: exercise Guardian retry layer

d0a0d4d

Co-authored-by: Codex noreply@openai.com

fix: jitter Guardian reviewer retries

71240fa

Co-authored-by: Codex noreply@openai.com

viyatb-oai added 3 commits June 8, 2026 22:32

fix: bound Guardian timeout retries

3b7fc81

Co-authored-by: Codex noreply@openai.com

fix: centralize Guardian retry budget

255d2b8

Co-authored-by: Codex noreply@openai.com

docs: explain Guardian retry invariants

bc83c8c

Co-authored-by: Codex noreply@openai.com

viyatb-oai added 2 commits June 9, 2026 10:10

fix: update Guardian timeout test callers

3266df7

Co-authored-by: Codex noreply@openai.com

test: expect Guardian-owned retry policy

d445c36

Co-authored-by: Codex noreply@openai.com

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: retry transient Guardian reviewer failures#26334

fix: retry transient Guardian reviewer failures#26334
viyatb-oai wants to merge 11 commits into
mainfrom
codex/viyatb/guardian-reviewer-retries

viyatb-oai commented Jun 4, 2026 •

edited

Loading

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

viyatb-oai commented Jun 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Why

What changed

Scope

Validation

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

viyatb-oai commented Jun 4, 2026 •

edited

Loading