Skip to content

fix(gateway): handle windows pid probe failures#6310

Closed
iacker wants to merge 1 commit into
NousResearch:mainfrom
iacker:fix/windows-pid-liveness-probe
Closed

fix(gateway): handle windows pid probe failures#6310
iacker wants to merge 1 commit into
NousResearch:mainfrom
iacker:fix/windows-pid-liveness-probe

Conversation

@iacker

@iacker iacker commented Apr 8, 2026

Copy link
Copy Markdown
Contributor

Summary

  • centralise gateway PID liveness probing in a cross-platform helper
  • treat Windows SystemError / OSError failures from os.kill(pid, 0) as stale PID records
  • reuse the same helper for profile gateway status checks

Testing

  • python -m pytest tests/gateway/test_status.py tests/hermes_cli/test_profiles.py -q
  • python -m pytest tests/gateway/ -q (fails on pre-existing unrelated test: tests/gateway/test_run_progress_topics.py::test_run_agent_progress_stays_in_originating_topic expecting 💻 terminal: "pwd" but current output is ⚙️ terminal: "pwd")

Closes #5760

@iacker iacker force-pushed the fix/windows-pid-liveness-probe branch from 5e3a372 to a246dd4 Compare April 8, 2026 22:07
@alt-glitch alt-glitch added type/bug Something isn't working P2 Medium — degraded but workaround exists comp/gateway Gateway runner, session dispatch, delivery labels Apr 30, 2026
@alt-glitch

Copy link
Copy Markdown
Collaborator

Related to #11575, #13198, #11156 — multiple competing PRs for the same Windows PID probe fix. See #7227 for the original issue.

@alt-glitch

Copy link
Copy Markdown
Collaborator

Related to #11575, #13198, #11156

@teknium1

teknium1 commented May 7, 2026

Copy link
Copy Markdown
Contributor

Thanks for this — appreciate the work. We're closing the entire cluster of open native-Windows PRs (44 of them spanning installer, terminal routing, file ops, gateway PID handling, encoding, docs, and more) because the surface area needs a designed, consolidated approach rather than piecemeal merges. Cherry-picking individual fixes keeps leaving inconsistencies and we'd rather land Windows support properly, in one coherent pass.\n\nYour PR is catalogued in our internal Windows support plan. When we pick this back up (soon), we'll mine every PR in the cluster for its fix shape and credit all contributors whose work informs the final patch via lines. Watch for the consolidating PR and feel free to chime in with context on the specific failure mode you were hitting.\n\nClosing for now, not as a rejection of the fix — just queueing it for the designed rollout. Thanks again.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

comp/gateway Gateway runner, session dispatch, delivery P2 Medium — degraded but workaround exists type/bug Something isn't working

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Windows: os.kill(pid, 0) raises SystemError/OSError in gateway/status.py

3 participants