Bug Description
The gateway ignores stt.enabled: false in config.yaml and always attempts to call OpenAI Whisper API for voice messages, causing 401 errors when no valid OpenAI key is configured.
Expected Behavior
When stt.enabled: false is set, the gateway should skip STT transcription entirely and pass voice messages through without processing, or use a configured alternative provider.
Actual Behavior
- Gateway receives voice message
- Attempts OpenAI Whisper API call regardless of
stt.enabled setting
- Returns "401 Incorrect API key" error to user
- Message flow is blocked
Environment
- Hermes Gateway (latest)
- Config:
stt:
enabled: false
# provider: openai # Tried with/without this
Logs
Processing audio...
Error transcribing audio: 401 Incorrect API key provided...
Workaround
Currently using manual Deepgram transcription via Python script on cached .ogg files from /root/.hermes/audio_cache/.
Suggested Fixes
- Immediate: Honor
stt.enabled: false - skip transcription entirely when disabled
- Better: Support pluggable STT providers (Deepgram, Whisper local, etc.) configured via YAML
- Best: Default to disabled if no valid API key is present, don't hardcode OpenAI
Happy to provide more details or test fixes!
Bug Description
The gateway ignores
stt.enabled: falseinconfig.yamland always attempts to call OpenAI Whisper API for voice messages, causing 401 errors when no valid OpenAI key is configured.Expected Behavior
When
stt.enabled: falseis set, the gateway should skip STT transcription entirely and pass voice messages through without processing, or use a configured alternative provider.Actual Behavior
stt.enabledsettingEnvironment
Logs
Workaround
Currently using manual Deepgram transcription via Python script on cached
.oggfiles from/root/.hermes/audio_cache/.Suggested Fixes
stt.enabled: false- skip transcription entirely when disabledHappy to provide more details or test fixes!