Skip to content

[codex] Improve PR babysitter CI diagnostics and guardrails#20484

Merged
wiltzius-openai merged 3 commits into
mainfrom
wiltzius/codex/babysit-pr-faster-ci-guardrails
May 1, 2026
Merged

[codex] Improve PR babysitter CI diagnostics and guardrails#20484
wiltzius-openai merged 3 commits into
mainfrom
wiltzius/codex/babysit-pr-faster-ci-guardrails

Conversation

@wiltzius-openai

Copy link
Copy Markdown
Contributor

Summary

  • Surface failed GitHub Actions jobs in the PR babysitter watcher so Codex can fetch job logs as soon as a job fails, instead of waiting for the overall workflow run to complete.
  • Update babysit-pr skill instructions, GitHub API notes, and heuristics to prefer direct job log archives before falling back to gh run view --log-failed.
  • Add guardrails requiring explicit user confirmation before posting replies to human-authored review comments.
  • Add guardrails preventing Codex from patching unrelated flaky tests, CI infrastructure, runner issues, dependency outages, or other failures not caused by the PR branch.

Validation

  • python3 -m pytest .codex/skills/babysit-pr/scripts/test_gh_pr_watch.py

@wiltzius-openai wiltzius-openai marked this pull request as ready for review April 30, 2026 18:57
@wiltzius-openai wiltzius-openai merged commit c39824c into main May 1, 2026
35 of 36 checks passed
@wiltzius-openai wiltzius-openai deleted the wiltzius/codex/babysit-pr-faster-ci-guardrails branch May 1, 2026 02:58
@github-actions github-actions Bot locked and limited conversation to collaborators May 1, 2026
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants