feat: allow custom endpoints to use responses api #1041
Closed
mxyhi wants to merge 1 commit into
Contributor
Is there a reason for this support? Which models support the Responses API? 🤔
9d22a09 to
6bb62eb
Compare
teknium1
pushed a commit
that referenced
this pull request
Mar 17, 2026
Add HERMES_API_MODE env var and model.api_mode config field to let custom OpenAI-compatible endpoints opt into codex_responses mode without requiring the OpenAI Codex OAuth provider path.

- _get_configured_api_mode() reads HERMES_API_MODE env (precedence) then model.api_mode from config.yaml; validates against whitelist
- Applied in both _resolve_openrouter_runtime() and _resolve_named_custom_runtime() (original PR only covered openrouter)
- Fix _dump_api_request_debug() to show /responses URL when in codex_responses mode instead of always showing /chat/completions
- Tests for config override, env override, invalid values, named custom providers, and debug dump URL for both API modes

Inspired by PR #1041 by @mxyhi.
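The precedence rule in this commit message (env var first, then config, then a whitelist check) can be sketched roughly as follows. This is a minimal illustration, not the code from the commit: the function signature, the `chat_completions` mode name, and the choice to treat an invalid value as unset are all assumptions; only `HERMES_API_MODE`, `model.api_mode`, and `codex_responses` come from the message itself.

```python
import os
from collections.abc import Mapping
from typing import Optional

# Assumed whitelist: only "codex_responses" is named in the commit message;
# "chat_completions" is a hypothetical name for the default mode.
VALID_API_MODES = frozenset({"chat_completions", "codex_responses"})


def get_configured_api_mode(
    config: Mapping, env: Optional[Mapping] = None
) -> Optional[str]:
    """Return a validated API mode override, or None if nothing valid is set."""
    env = os.environ if env is None else env

    # model.api_mode from the parsed config.yaml (passed in as a plain mapping).
    model_cfg = config.get("model")
    config_value = model_cfg.get("api_mode") if isinstance(model_cfg, Mapping) else None

    # The env var is checked first, so it takes precedence over the config file.
    for candidate in (env.get("HERMES_API_MODE"), config_value):
        if isinstance(candidate, str):
            mode = candidate.strip().lower()
            # Assumption: a value outside the whitelist is ignored, not an error.
            if mode in VALID_API_MODES:
                return mode
    return None
```

With this sketch, `HERMES_API_MODE=codex_responses` overrides a config file that sets `model.api_mode: chat_completions`, and an unrecognised value in either place falls through to the next source.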
teknium1
added a commit
that referenced
this pull request
Mar 17, 2026
…de (#1651)

Add HERMES_API_MODE env var and model.api_mode config field to let custom OpenAI-compatible endpoints opt into codex_responses mode without requiring the OpenAI Codex OAuth provider path.

- _get_configured_api_mode() reads HERMES_API_MODE env (precedence) then model.api_mode from config.yaml; validates against whitelist
- Applied in both _resolve_openrouter_runtime() and _resolve_named_custom_runtime() (original PR only covered openrouter)
- Fix _dump_api_request_debug() to show /responses URL when in codex_responses mode instead of always showing /chat/completions
- Tests for config override, env override, invalid values, named custom providers, and debug dump URL for both API modes

Inspired by PR #1041 by @mxyhi.

Co-authored-by: mxyhi <mxyhi@users.noreply.github.com>
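The debug-dump fix mentioned in this commit message comes down to picking the URL suffix from the active API mode. A rough sketch under stated assumptions: `debug_request_url` is a hypothetical helper, not the real `_dump_api_request_debug()`, and the base URL is taken from the PR's validation notes.

```python
def debug_request_url(base_url: str, api_mode: str) -> str:
    """Build the URL a debug dump would report for the given API mode."""
    # codex_responses targets the Responses endpoint; every other mode
    # is assumed to use chat completions.
    path = "/responses" if api_mode == "codex_responses" else "/chat/completions"
    # Strip a trailing slash so the joined URL never contains "//".
    return base_url.rstrip("/") + path


print(debug_request_url("http://127.0.0.1:9208/v1", "codex_responses"))
```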
Contributor
Merged via PR #1651. Reimplemented your feature onto current main (282 commits ahead) with authorship preserved. Improvements in the reimplementation:
Thanks for the contribution, @mxyhi!
teknium1
pushed a commit
that referenced
this pull request
Apr 22, 2026
…imeout

Upgrades agent-browser from 0.13.0 to 0.26.0, picking up 13 releases of daemon reliability fixes:

- Daemon hang on Linux from waitpid(-1) race in SIGCHLD handler (#1098)
- Chrome killed after ~10s idle due to PR_SET_PDEATHSIG thread tracking (#1157)
- Orphaned Chrome processes via process-group kill on shutdown (#1137)
- Stale daemon after upgrade via .version sidecar and auto-restart (#1134)
- Idle timeout not firing (sleep future recreated each loop) (#1110)
- Navigation hanging on lifecycle events that never fire (#1059, #1092)
- CDP attach hang on Chrome 144+ (#1133)
- Windows daemon TCP bind with Hyper-V port conflicts (#1041)
- Shadow DOM traversal in accessibility tree snapshots
- doctor command for user self-diagnosis

Also wires AGENT_BROWSER_IDLE_TIMEOUT_MS into the browser subprocess environment so the daemon self-terminates after our configured inactivity timeout (default 300s). This is the daemon-side counterpart to the Python-side inactivity reaper: the daemon kills itself and its Chrome children when no commands arrive, preventing orphan accumulation even when the Python process dies without running atexit handlers.

Addresses #7343 (daemon socket hangs, shadow DOM) and #13793 (orphan accumulation from force-killed sessions).
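Wiring the idle timeout into the browser subprocess environment might look roughly like this. It is a sketch under stated assumptions: `build_browser_env` and its parameters are hypothetical names, and the seconds-to-milliseconds conversion is inferred from the `_MS` suffix and the stated 300s default rather than taken from the commit.

```python
import os
from collections.abc import Mapping
from typing import Optional

DEFAULT_INACTIVITY_TIMEOUT_S = 300  # default stated in the commit message


def build_browser_env(
    inactivity_timeout_s: int = DEFAULT_INACTIVITY_TIMEOUT_S,
    base_env: Optional[Mapping] = None,
) -> dict:
    """Return a copy of the environment with the daemon idle timeout set."""
    # Copy so the caller's environment (or os.environ) is never mutated.
    env = dict(os.environ if base_env is None else base_env)
    # Assumed unit conversion: the variable name ends in _MS, so pass milliseconds.
    env["AGENT_BROWSER_IDLE_TIMEOUT_MS"] = str(int(inactivity_timeout_s) * 1000)
    return env
```

The resulting dict would be passed as `env=` to `subprocess.Popen` when launching the browser daemon. Because the timeout travels with the child process, the daemon can shut itself down even if the parent Python process is killed.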
Summary
- Opt into `codex_responses` via config or `HERMES_API_MODE`
- Use `/v1/responses` for custom endpoints
- Show `/responses` when the active API mode is `codex_responses`

Validation
- `/Users/langhuam/workspace/self/hermes-agent/venv/bin/pytest tests/test_runtime_provider_resolution.py tests/test_run_agent_codex_responses.py -q`
- `HERMES_DUMP_REQUESTS=1 HERMES_DUMP_REQUEST_STDOUT=1 hermes chat -Q -q "Reply with exactly OK."`
- `http://127.0.0.1:9208/v1/responses` with a custom endpoint configured in `~/.hermes/config.yaml`