fix(agent): prevent executor handoff role confusion / 防止执行器交接角色混淆 by SivanCola · Pull Request #3541 · esengine/DeepSeek-Reasonix

SivanCola · 2026-06-08T07:32:10Z

Summary

strengthen the planner-to-executor handoff so executor models ignore planner-only limitations
add an executor handoff guard that retries when the executor answers as the planner instead of using tools
cover the Chinese planner-style confusion case with a coordinator regression test

Testing

go test ./internal/agent ./internal/boot ./internal/config ./internal/cli
cd desktop && go test . -run 'TestSettings|TestSetAgent|Test.*Settings|TestProviderViewFromEntry'
npm run check:css && npm run test:typecheck

chatgpt-codex-connector · 2026-06-08T07:32:15Z

You have reached your Codex usage limits for code reviews. You can see your limits in the Codex usage dashboard.

Bare permission claims (no write access / 没有写入权限) are the exact vocabulary a correctly-executing model uses to report a real blocker; matching them mis-fired the planner-confusion guard and could hard-error a legitimately-blocked task. Drop those phrases, keep role-identity ones, and lock it with a regression test.

esengine

Solid fix for the planner→executor confusion (#3490), with a regression test. I tightened the confusion matcher to role-identity phrases only — bare permission claims like "no write access" / "没有写入权限" were also matching legitimate blocker reports (the exact vocabulary the handoff prompt asks the executor to use), which could hard-error a genuinely-blocked task. Added a regression test for that. Full go test ./internal/agent is green.

The keyword list matched on vocabulary, so it both missed paraphrases and false-flagged legitimate blocker reports ("no write access" is the exact phrasing the handoff prompt asks the executor to use). Replace it with a structural signal: if the executor reaches a final answer in handoff mode having called zero tools, nudge once with the executor instructions, then trust it — no keyword table, no hard error. Any tool use (including read-only) exempts it, so only a true punt triggers the nudge. Language-independent. Tests cover both the nudge and the no-nudge-when-acting path; verified end-to-end with a live deepseek-pro planner / deepseek-flash executor run that wrote and ran the file without tripping the guard.

…engine#3541) * fix(agent): keep executor from answering as planner * fix(agent): scope handoff-confusion match to role-identity phrases Bare permission claims (no write access / 没有写入权限) are the exact vocabulary a correctly-executing model uses to report a real blocker; matching them mis-fired the planner-confusion guard and could hard-error a legitimately-blocked task. Drop those phrases, keep role-identity ones, and lock it with a regression test. * fix(agent): detect executor punt by behavior, not keywords The keyword list matched on vocabulary, so it both missed paraphrases and false-flagged legitimate blocker reports ("no write access" is the exact phrasing the handoff prompt asks the executor to use). Replace it with a structural signal: if the executor reaches a final answer in handoff mode having called zero tools, nudge once with the executor instructions, then trust it — no keyword table, no hard error. Any tool use (including read-only) exempts it, so only a true punt triggers the nudge. Language-independent. Tests cover both the nudge and the no-nudge-when-acting path; verified end-to-end with a live deepseek-pro planner / deepseek-flash executor run that wrote and ran the file without tripping the guard. --------- Co-authored-by: reasonix <reasonix@deepseek.com>

fix(agent): keep executor from answering as planner

c4af2f4

SivanCola requested a review from esengine as a code owner June 8, 2026 07:32

github-actions Bot added v2 Go rewrite (1.x) — main-v2 branch, active development agent Core agent loop (internal/agent, internal/control) and removed v2 Go rewrite (1.x) — main-v2 branch, active development labels Jun 8, 2026

SivanCola mentioned this pull request Jun 8, 2026

[Bug]: 多模型只规划不执行 #3490

Open

github-actions Bot added the v2 Go rewrite (1.x) — main-v2 branch, active development label Jun 8, 2026

esengine approved these changes Jun 8, 2026

View reviewed changes

esengine enabled auto-merge (squash) June 8, 2026 09:27

esengine disabled auto-merge June 8, 2026 09:33

esengine merged commit 5cf486f into main-v2 Jun 8, 2026
9 checks passed

esengine deleted the codex/executor-handoff-guard branch June 8, 2026 10:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(agent): prevent executor handoff role confusion / 防止执行器交接角色混淆#3541

fix(agent): prevent executor handoff role confusion / 防止执行器交接角色混淆#3541
esengine merged 3 commits into
main-v2from
codex/executor-handoff-guard

SivanCola commented Jun 8, 2026 •

edited

Loading

Uh oh!

chatgpt-codex-connector Bot commented Jun 8, 2026

Uh oh!

esengine left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

SivanCola commented Jun 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Testing

Uh oh!

chatgpt-codex-connector Bot commented Jun 8, 2026

Uh oh!

esengine left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

SivanCola commented Jun 8, 2026 •

edited

Loading