fix(slack): fail fast when socket mode disconnects by frankekn · Pull Request #24283 · openclaw/openclaw

frankekn · 2026-02-23T07:38:24Z

What

watch Slack socket lifecycle disconnect events in monitor startup
fail fast by surfacing a provider error when socket mode disconnects
add regression test to verify monitor exits on disconnect

Why

Socket mode could silently stop delivering events while the process stayed alive. Exiting fast lets the supervisor restart the provider.

Validation

pnpm vitest run src/slack/monitor.tool-result.test.ts src/slack/monitor/provider.group-policy.test.ts
pnpm check

Greptile Summary

This PR adds fail-fast behavior for Slack socket mode by monitoring the underlying SocketModeClient lifecycle events (disconnected, unable_to_socket_mode_start). When either event fires, the provider surfaces an error and exits, allowing the supervisor to restart it. Previously, a silent socket disconnect would leave the process alive but unable to receive events.

Adds resolveSlackSocketLifecycleClient() to access the receiver.client from the Bolt app, with defensive null/type checks
Introduces detachSocketLifecycleListener() with fallback support for both off and removeListener APIs
Wires up a Promise.race between the existing abort-wait and a new disconnect-wait, so either graceful shutdown or unexpected disconnect ends the provider
Includes proper cleanup (idempotent listener detachment) in both the error path and the finally block
Adds a regression test that verifies the monitor rejects when a disconnected event is emitted
Extends test helpers with socket client mock (on/off/emit) and handler tracking via a Map<string, Set<handler>>

Confidence Score: 5/5

This PR is safe to merge — it adds defensive socket lifecycle monitoring with proper cleanup, guards against race conditions with the abort signal, and includes a focused regression test.
The changes are well-scoped and focused on a single concern (socket disconnect detection). The implementation handles edge cases correctly: idempotent cleanup, abort-signal guard to prevent spurious rejections during graceful shutdown, defensive null checks when resolving the lifecycle client, and fallback listener removal APIs. The test properly validates the core disconnect scenario. No existing behavior is altered for HTTP mode (guarded by slackMode === "socket"). The code follows the repo's conventions (strict typing with unknown, no any, under LOC guideline).
No files require special attention

_{Last reviewed commit: 956b807}

Context used:

Context from dashboard - CLAUDE.md (source)
Context from dashboard - AGENTS.md (source)

markshields-tl

Review

Simplest approach of the three PRs targeting #17847 — fail fast on disconnect and let the supervisor restart.

Pros:

Minimal code change, low risk
Defensive: if you can't fix reconnect, at least don't pretend to be alive
Good test coverage

Concern:

Relies on an external supervisor (systemd, launchd, etc.) to restart the gateway. Not all deployments have this — e.g., openclaw gateway start in a terminal session. A silent exit is better than silent death, but auto-reconnect (#27232) is strictly more useful.
Doesn't address the "no disconnect event fires at all" failure mode documented in our production data (see my comment on #17847). Sometimes the socket just stops delivering with no event.

Verdict: Good safety net, but #27232 (reconnect loop) + #27241 (staleness watchdog) together would be the complete solution.

— Mort (AI assistant reviewing on behalf of @markshields-tl)

frankekn · 2026-02-27T03:56:13Z

Thanks. One clarification re the supervisor concern: this provider intentionally throws on socket lifecycle disconnect so the Gateway channel manager can auto-restart it (even when running openclaw gateway start in a terminal session). It does not require systemd/launchd.

Follow-up commit 7cb6c13 also avoids a possible unhandled rejection when the disconnect waiter triggers during startup by resolving the waiter and throwing after the race.

Agreed the silent-stall / no-disconnect mode needs a watchdog + reconnect loop (see #27241 / #27232).

Takhoffman · 2026-03-01T16:24:20Z

Closing with Tak approval: superseded

openclaw-barnacle bot added channel: slack Channel integration: slack size: S labels Feb 23, 2026

fix(slack): fail fast when socket mode disconnects

806d2ff

frankekn force-pushed the fix/slack-socket-disconnect-failfast branch from 956b807 to 806d2ff Compare February 25, 2026 05:34

openclaw-barnacle bot added the trusted-contributor label Feb 25, 2026

This was referenced Feb 27, 2026

[Bug]: Slack Socket Mode silently stops receiving inbound events while appearing connected #17847

Open

fix(slack): reconnect socket mode after disconnect #27232

Merged

markshields-tl reviewed Feb 27, 2026

View reviewed changes

fix(slack): avoid unhandled disconnect rejection

7cb6c13

Takhoffman added the close:superseded PR close reason label Mar 1, 2026

Takhoffman closed this Mar 1, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(slack): fail fast when socket mode disconnects#24283

fix(slack): fail fast when socket mode disconnects#24283
frankekn wants to merge 2 commits intoopenclaw:mainfrom
frankekn:fix/slack-socket-disconnect-failfast

frankekn commented Feb 23, 2026 •

edited by greptile-apps bot

Loading

Uh oh!

markshields-tl left a comment

Uh oh!

frankekn commented Feb 27, 2026

Uh oh!

Takhoffman commented Mar 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

frankekn commented Feb 23, 2026 • edited by greptile-apps bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What

Why

Validation

Greptile Summary

Confidence Score: 5/5

Uh oh!

markshields-tl left a comment

Choose a reason for hiding this comment

Review

Uh oh!

frankekn commented Feb 27, 2026

Uh oh!

Takhoffman commented Mar 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

frankekn commented Feb 23, 2026 •

edited by greptile-apps bot

Loading