fix(gateway): defer goal status notices until after response delivery by liquidchen · Pull Request #19160 · NousResearch/hermes-agent

liquidchen · 2026-05-03T09:30:39Z

Summary

route /goal status notices through the platform adapter send() API, preserving thread metadata
defer completed-goal notices via post-delivery callbacks so the final assistant response appears before ✓ Goal achieved
cancel queued synthetic goal continuations on /goal pause and /goal clear without dropping normal queued user messages
add gateway regression coverage for notice delivery, ordering, and pending-continuation cancellation

Why

The /goal gateway path currently tries to send status notices through adapter.send_message(...), but gateway adapters expose send(...). On Discord this can silently skip the ✓ Goal achieved notice.

Also, _post_turn_goal_continuation() runs after the final response is produced but before the platform adapter has delivered that response, so sending the goal status inline can reverse the visible ordering.

Test Plan

python -m py_compile gateway/run.py gateway/platforms/base.py tests/gateway/test_goal_status_notice.py
python -m pytest -q tests/gateway/test_goal_status_notice.py tests/hermes_cli/test_goals.py -o 'addopts='

Route goal status notices through the platform adapter send API and register post-delivery callbacks so completed-goal notices appear after the final assistant response. Also cancel queued synthetic goal continuations on /goal pause and /goal clear while preserving normal queued user messages.

Weak judge models (e.g. deepseek-v4-flash) return empty strings or prose when asked for the strict {done, reason} JSON verdict. The old code failed-open to continue on every such turn, burning the entire turn budget with log lines like judge returned empty response judge reply was not JSON: "Let me analyze whether the goal..." and /goal clear could not stop it mid-loop without /stop. After N=3 consecutive *parse* failures (transport/API errors don't count — those are transient), the loop auto-pauses and prints: ⏸ Goal paused — the judge model (3 turns) isn't returning the required JSON verdict. Route the judge to a stricter model in ~/.hermes/config.yaml: auxiliary: goal_judge: provider: openrouter model: google/gemini-3-flash-preview Then /goal resume to continue. The counter resets on any usable reply (both "done"/"continue" and API errors) and persists across GoalManager reloads so cross-session resumes carry the correct state. Also fixes test_goal_verdict_send.py sharing a hardcoded session_id across tests — the shared id only worked because the previous _post_turn_goal_continuation was a never-awaited coroutine. Now that PR #19160 made it properly awaited, the xdist test-leakage bug surfaced. Each test gets a unique session_id via uuid suffix.

teknium1 · 2026-05-08T00:33:23Z

Landed via PR #21576 — your commit was rebase-merged onto main with authorship preserved (commit 03ddff8). A follow-up commit (307c85e) auto-pauses the /goal loop after 3 consecutive unparseable judge replies with a config-pointer message, so users hitting the deepseek-v4-flash failure mode get told how to fix it instead of running to turn budget exhaustion. Thanks for the fix!

Weak judge models (e.g. deepseek-v4-flash) return empty strings or prose when asked for the strict {done, reason} JSON verdict. The old code failed-open to continue on every such turn, burning the entire turn budget with log lines like judge returned empty response judge reply was not JSON: "Let me analyze whether the goal..." and /goal clear could not stop it mid-loop without /stop. After N=3 consecutive *parse* failures (transport/API errors don't count — those are transient), the loop auto-pauses and prints: ⏸ Goal paused — the judge model (3 turns) isn't returning the required JSON verdict. Route the judge to a stricter model in ~/.hermes/config.yaml: auxiliary: goal_judge: provider: openrouter model: google/gemini-3-flash-preview Then /goal resume to continue. The counter resets on any usable reply (both "done"/"continue" and API errors) and persists across GoalManager reloads so cross-session resumes carry the correct state. Also fixes test_goal_verdict_send.py sharing a hardcoded session_id across tests — the shared id only worked because the previous _post_turn_goal_continuation was a never-awaited coroutine. Now that PR NousResearch#19160 made it properly awaited, the xdist test-leakage bug surfaced. Each test gets a unique session_id via uuid suffix.

alt-glitch added type/bug Something isn't working P2 Medium — degraded but workaround exists comp/gateway Gateway runner, session dispatch, delivery labels May 3, 2026

teknium1 mentioned this pull request May 8, 2026

fix(goals): salvage PR #19160 + auto-pause on consecutive judge parse failures #21576

Merged

teknium1 closed this in #21576 May 8, 2026

alt-glitch mentioned this pull request May 25, 2026

Bug: /goal ✓ Goal achieved notice silently dropped — generation mismatch in post-delivery callback store #31922

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(gateway): defer goal status notices until after response delivery#19160

fix(gateway): defer goal status notices until after response delivery#19160
liquidchen wants to merge 1 commit into
NousResearch:mainfrom
liquidchen:fix/goal-status-notice-ordering

liquidchen commented May 3, 2026

Uh oh!

teknium1 commented May 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

liquidchen commented May 3, 2026

Summary

Why

Test Plan

Uh oh!

teknium1 commented May 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants