fix: dedupe persistent toolset-failure notifications by dgageot · Pull Request #2943 · docker/docker-agent

dgageot · 2026-06-01T11:18:52Z

Toolset initialization and Tools() listing failures were stacking persistent notifications in the TUI when a toolset remained down across multiple conversation turns. The root cause was that the agent side had no mechanism to suppress duplicate warnings, so the TUI received the same error on every turn and piled them on-screen.

This fix adds a once-per-streak guard to the StartableToolSet so a failure warning is emitted only once until the streak clears (either via a successful Start/Tools or an explicit Stop). The failureStreak type encapsulates the streak-tracking logic and is reused for both Start and Tools listing failures, keeping the warning deduplication uniform and reliable across all callers—including non-TUI consumers like ACP, which persist warnings into the transcript.

The TUI-layer dedup logic is removed as it becomes redundant; the agent-side guard is the single source of truth for this behavior.

Closes #2884.

Collapse the two duplicated once-per-streak warning guards (Start and Tools listing) into a small reusable failureStreak type, removing four bool fields and the repeated fail/reset/shouldReport logic.

The agent-side once-per-streak guard already ensures a toolset-failure warning is emitted only once per streak, so the on-screen dedup in handleShow was redundant defense-in-depth. Dropping it restores the simpler notification component; an identical error reappearing after the previous one auto-hides (10s) is acceptable.

docker-agent

Assessment: 🟢 APPROVE

Reviewed pkg/tools/startable.go, pkg/tools/startable_test.go, pkg/agent/agent.go, and pkg/agent/agent_test.go.

Summary: The failureStreak helper is correctly implemented — fail() only sets active/pending on the first call of a streak, reset() clears both fields, and shouldReport() returns true exactly once per streak. Mutex discipline is sound across Start, Stop, ShouldReportFailure, and ShouldReportListFailure. The Tools() method calls the inner toolset outside the lock (correct — avoids holding the lock during a potentially slow RPC) and updates the streak under the lock before any caller can observe it in the sequential collectTools loop. The listFlappyToolSet test helper correctly increments callIdx unconditionally before the nil check, so error sequences are consumed faithfully. Test coverage addresses the once-per-streak guarantee, recovery/reset behavior, and the Stop path. No bugs found in the changed code.

aheritier

Approved. CI green across all checks; changes are sound with appropriate test coverage for the de-duplicated list warnings and failure-streak reset behavior. No blocking concerns.

dgageot added 3 commits June 1, 2026 12:09

fix: dedupe persistent toolset-failure notifications (docker#2884)

97cc545

refactor: extract failureStreak helper in StartableToolSet

c5af087

Collapse the two duplicated once-per-streak warning guards (Start and Tools listing) into a small reusable failureStreak type, removing four bool fields and the repeated fail/reset/shouldReport logic.

dgageot requested a review from a team as a code owner June 1, 2026 11:18

docker-agent reviewed Jun 1, 2026

View reviewed changes

aheritier added area/tui For features/issues/fixes related to the TUI area/mcp MCP protocol, MCP tool servers, integration kind/fix PR fixes a bug (maps to fix: commit prefix) labels Jun 1, 2026

aheritier approved these changes Jun 1, 2026

View reviewed changes

dgageot merged commit 0c2cf04 into docker:main Jun 1, 2026
8 checks passed

BrewTestBot mentioned this pull request Jun 1, 2026

docker-agent 1.70.2 Homebrew/homebrew-core#285692

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: dedupe persistent toolset-failure notifications#2943

fix: dedupe persistent toolset-failure notifications#2943
dgageot merged 3 commits into
docker:mainfrom
dgageot:board/9bf56e61f12ed8e5

dgageot commented Jun 1, 2026

Uh oh!

docker-agent left a comment

Uh oh!

aheritier left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

dgageot commented Jun 1, 2026

Uh oh!

docker-agent left a comment

Choose a reason for hiding this comment

Assessment: 🟢 APPROVE

Uh oh!

aheritier left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants