fix(gateway): queue voice/audio messages instead of interrupting with empty text by chaizijun1 · Pull Request #8434 · NousResearch/hermes-agent

chaizijun1 · 2026-04-12T15:03:49Z

Summary

When a voice/audio message arrives while an agent is already running, the gateway interrupts the agent with event.text — but voice messages haven't been through STT transcription yet, so event.text is empty. This causes the agent to hang.
Photos already have dedicated queueing logic that avoids interrupting the running agent. This PR applies the same merge_pending_message_event pattern to MessageType.VOICE and MessageType.AUDIO.

Reproducer

Send a voice message to the Telegram bot
Before the agent finishes responding, send a second voice message
Expected: second voice is queued and processed after the first completes
Actual: agent hangs indefinitely (interrupted with empty text)

Changes

gateway/run.py: Add voice/audio check between the existing photo queueing block and the _AGENT_PENDING_SENTINEL check (+12 lines)

Test plan

Send two consecutive voice messages in quick succession — second should be queued and processed after the first
Send a voice message while agent is processing a text message — should queue without interrupt
Send a text message while agent is processing — should still interrupt normally (existing behavior preserved)
Verify photo queueing still works as before

🤖 Generated with Claude Code

… empty text When a voice or audio message arrives while an agent is already running, the gateway calls `running_agent.interrupt(event.text)`. However, `event.text` is empty at this point because STT transcription only happens later inside `_handle_message_with_agent`. The empty-text interrupt causes the agent to hang waiting for model response. Photos already have dedicated queueing logic that avoids this problem. Apply the same pattern to voice/audio messages: queue them via `merge_pending_message_event` so they are processed with full STT transcription after the current agent turn completes. Reproducer: 1. Send a voice message to the Telegram bot 2. Before the agent finishes responding, send a second voice message 3. The agent hangs indefinitely Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

chaizijun1 · 2026-04-15T09:22:18Z

Hi team, gentle bump on this one. This is a bug fix for voice/audio messages causing agent hangs when sent in quick succession. The fix mirrors the existing photo queueing pattern. Happy to adjust anything if needed!

alt-glitch added type/bug Something isn't working P1 High — major feature broken, no workaround comp/gateway Gateway runner, session dispatch, delivery tool/tts Text-to-speech and transcription labels Apr 28, 2026

alt-glitch mentioned this pull request Apr 30, 2026

fix(agent,gateway): voice interrupts + cascading interrupt hang #6600

Closed

alt-glitch mentioned this pull request May 24, 2026

fix(telegram): queue voice follow-ups instead of interrupting in-flight reply (#31328) #31342

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(gateway): queue voice/audio messages instead of interrupting with empty text#8434

fix(gateway): queue voice/audio messages instead of interrupting with empty text#8434
chaizijun1 wants to merge 1 commit into
NousResearch:mainfrom
chaizijun1:fix/queue-voice-messages-on-interrupt

chaizijun1 commented Apr 12, 2026

Uh oh!

chaizijun1 commented Apr 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

chaizijun1 commented Apr 12, 2026

Summary

Reproducer

Changes

Test plan

Uh oh!

chaizijun1 commented Apr 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants