fix #86077: keep fallback errors candidate scoped #86134
Codex review: needs maintainer review before merge. Reviewed May 25, 2026, 7:16 AM ET / 11:16 UTC.
Summary
PR surface: Source +39, Tests +161, Docs +1. Total +201 across 6 files.
Reproducibility: yes. Current main shows the source path: embedded runner can still use sessionLastAssistant when currentAttemptAssistant is absent, and model-fallback still prefers rawError over the candidate message.
Review metrics: 1 noteworthy metric.
Merge readiness: Overall follows the weaker of proof and patch quality, so missing proof can cap an otherwise strong patch.
Review details
Best possible solution: Land the candidate-scoped fallback attribution fix after maintainer confirmation that the diagnostics behavior change is acceptable and required checks remain green.
Do we have a high-confidence way to reproduce the issue? Yes. Current main shows the source path: embedded runner can still use sessionLastAssistant when currentAttemptAssistant is absent, and model-fallback still prefers rawError over the candidate message.
Is this the best way to solve the issue? Yes. The latest head is a narrow, maintainable fix because it scopes stale assistant/raw-error reuse to the active candidate while preserving same-candidate and PI-stamped compaction diagnostics.
AGENTS.md: found and applied where relevant.
Codex review notes: model gpt-5.5, reasoning high; reviewed against 0bb9b421f3e6.
@clawsweeper re-review
Merged via squash.
Thanks @zhangguiping-xydt!
Summary
Fixes #86077
Real behavior proof
Regression Test Plan
Root Cause
sessionLastAssistant was reused when currentAttemptAssistant was absent, even when the last assistant belonged to a previous provider/model; model-fallback then preferred that raw error text for the current candidate.
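The root cause above can be illustrated with a minimal TypeScript sketch. It is not the code from this PR: only the names sessionLastAssistant, currentAttemptAssistant, and rawError come from the discussion, while the Candidate and AssistantMessage shapes and the helpers sameCandidate, pickAssistantForAttempt, and fallbackErrorText are hypothetical, introduced only to show the idea of scoping reuse to the active candidate.

```typescript
// Hypothetical shapes for illustration; the real types in the repository may differ.
interface Candidate {
  provider: string;
  model: string;
}

interface AssistantMessage extends Candidate {
  // Raw provider error text attached to this assistant turn, if any.
  rawError?: string;
}

// Two records refer to the same candidate when provider and model both match.
function sameCandidate(a: Candidate, b: Candidate): boolean {
  return a.provider === b.provider && a.model === b.model;
}

// Prefer the assistant from the current attempt. Reuse the session's last
// assistant only when it was produced by the candidate being attempted now.
function pickAssistantForAttempt(
  candidate: Candidate,
  currentAttemptAssistant: AssistantMessage | undefined,
  sessionLastAssistant: AssistantMessage | undefined,
): AssistantMessage | undefined {
  if (currentAttemptAssistant) {
    return currentAttemptAssistant;
  }
  if (sessionLastAssistant && sameCandidate(sessionLastAssistant, candidate)) {
    return sessionLastAssistant;
  }
  // A last assistant from another provider/model is stale for this candidate.
  return undefined;
}

// Use raw error text only when a candidate-scoped assistant carries it;
// otherwise fall back to the candidate's own message.
function fallbackErrorText(
  assistant: AssistantMessage | undefined,
  candidateMessage: string,
): string {
  return assistant?.rawError ?? candidateMessage;
}
```

With this shape, a stale assistant left over from a previous provider/model yields undefined for the new candidate, so the reported error comes from the candidate message, while a same-candidate assistant still contributes its raw error text.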