Skip to content

Log rollout writer OS errors#24474

Merged
jif-oai merged 1 commit into
mainfrom
etraut/rollout-error-logging
May 26, 2026
Merged

Log rollout writer OS errors#24474
jif-oai merged 1 commit into
mainfrom
etraut/rollout-error-logging

Conversation

@etraut-openai

@etraut-openai etraut-openai commented May 25, 2026

Copy link
Copy Markdown
Collaborator

Why

Refs #24425.

We have seen rollout JSONL corruption that appears consistent with a rollout write failing after partially appending a line, followed by a retry that appends the same item again. The available user logs did not include the underlying OS error, so it is hard to tell whether the trigger was ENOSPC, quota exhaustion, a filesystem error, or something else.

This PR adds the missing diagnostics for future reports.

What changed

  • Include ErrorKind and raw_os_error() in rollout writer failure logs.
  • Preserve the existing append-only rollout write path; this PR is diagnostic-only.

Verification

  • just test -p codex-rollout

@jif-oai jif-oai merged commit e865151 into main May 26, 2026
31 checks passed
@jif-oai jif-oai deleted the etraut/rollout-error-logging branch May 26, 2026 08:33
@github-actions github-actions Bot locked and limited conversation to collaborators May 26, 2026
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants