
feat: add Codex fast mode toggle #6817

Closed
g-guthrie wants to merge 5 commits into NousResearch:main from g-guthrie:fix/codex-fast-mode

Conversation

@g-guthrie

Contributor

Summary

  • add a /fast CLI slash command with a Normal/Fast picker and status subcommand
  • only expose /fast for models listed in a central fast-mode backend registry (currently openai-codex + gpt-5.4)
  • send backend-specific request overrides from that registry when fast mode is enabled
  • pass service_tier through the Codex Responses request validator
  • omit max_output_tokens on the ChatGPT Codex backend because it rejects that parameter
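The registry-driven gating described above could look roughly like the following. This is a minimal sketch, not the code in this PR: the names `FAST_MODE_REGISTRY`, `supports_fast_mode`, and `resolve_fast_overrides` are hypothetical, and the only values taken from the description are the `openai-codex` + `gpt-5.4` pairing and the `service_tier: "priority"` override.

```python
from typing import Dict, Tuple

# Hypothetical central registry: (provider, model) -> request overrides to
# send when fast mode is enabled. Only this one entry is stated in the PR.
FAST_MODE_REGISTRY: Dict[Tuple[str, str], Dict[str, object]] = {
    ("openai-codex", "gpt-5.4"): {"service_tier": "priority"},
}


def supports_fast_mode(provider: str, model: str) -> bool:
    """Return True if /fast should be exposed for this provider/model pair."""
    return (provider, model) in FAST_MODE_REGISTRY


def resolve_fast_overrides(provider: str, model: str, fast_enabled: bool) -> Dict[str, object]:
    """Return the request overrides to merge into the outgoing request.

    An empty dict means "send the normal request unchanged".
    """
    if not fast_enabled:
        return {}
    # Copy so callers cannot mutate the registry entry by accident.
    return dict(FAST_MODE_REGISTRY.get((provider, model), {}))
```

Keeping the overrides next to the eligibility check means adding another fast-capable backend would be a one-line registry change in this sketch, with no new branching in the CLI.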

Testing

  • pytest tests/cli/test_fast_command.py tests/hermes_cli/test_commands.py tests/run_agent/test_provider_parity.py tests/run_agent/test_run_agent_codex_responses.py -q
  • python -m py_compile cli.py run_agent.py hermes_cli/commands.py hermes_cli/config.py hermes_cli/models.py tests/cli/test_fast_command.py tests/hermes_cli/test_commands.py tests/run_agent/test_provider_parity.py tests/run_agent/test_run_agent_codex_responses.py

Manual verification

  • live-verified against https://chatgpt.com/backend-api/codex using the local openai-codex OAuth session
  • normal request succeeds without service_tier
  • fast request succeeds with service_tier: "priority"
  • both paths omit max_output_tokens on the Codex backend
  • observed a faster wall-clock response for the fast request on gpt-5.4 in local spot checks
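The request shapes checked above can be illustrated with a small payload builder. This is a sketch under assumptions: `build_codex_request`, `ALLOWED_KEYS`, and `CODEX_BACKEND_UNSUPPORTED` are hypothetical names, and the allow-list is illustrative rather than the actual Codex Responses validator in this repository.

```python
from typing import Dict

# Illustrative allow-list (assumption): the real validator accepts more keys.
ALLOWED_KEYS = {"model", "input", "instructions", "service_tier", "max_output_tokens"}

# Per the PR description, the ChatGPT Codex backend rejects this parameter.
CODEX_BACKEND_UNSUPPORTED = {"max_output_tokens"}


def build_codex_request(
    base: Dict[str, object],
    overrides: Dict[str, object],
    is_chatgpt_codex_backend: bool,
) -> Dict[str, object]:
    """Merge fast-mode overrides into a request and drop unsupported keys."""
    merged = {**base, **overrides}

    unknown = set(merged) - ALLOWED_KEYS
    if unknown:
        raise ValueError(f"unsupported request keys: {sorted(unknown)}")

    if is_chatgpt_codex_backend:
        # Omit rather than send, since the backend rejects the parameter.
        for key in CODEX_BACKEND_UNSUPPORTED:
            merged.pop(key, None)
    return merged


base = {"model": "gpt-5.4", "input": "hello", "max_output_tokens": 1024}
normal_request = build_codex_request(base, {}, is_chatgpt_codex_backend=True)
fast_request = build_codex_request(base, {"service_tier": "priority"}, is_chatgpt_codex_backend=True)
```

Here `normal_request` carries no `service_tier`, `fast_request` carries `service_tier: "priority"`, and neither includes `max_output_tokens`, matching the three request-shape checks listed above.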

@g-guthrie force-pushed the fix/codex-fast-mode branch from cbc1dd0 to 779f20b on April 9, 2026 22:03
@g-guthrie

g-guthrie commented Apr 10, 2026

Contributor Author

Pushed a small UX follow-up to this branch.

What changed:

  • /fast status now reports whether the Codex fast backend is currently available
  • enabling /fast now warns when the fast backend cannot currently resolve, instead of silently falling back to the normal route
  • added focused CLI tests for the unavailable-backend path
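One way to picture the status and warning behavior listed above is the sketch below. The function names and message wording are hypothetical, and it assumes that enabling fast mode still records the preference when the backend is unavailable; the comment does not say whether the PR does that.

```python
from typing import Optional, Tuple


def fast_status(fast_enabled: bool, backend_available: bool) -> str:
    """Format a status line for a hypothetical `/fast status` handler."""
    mode = "Fast" if fast_enabled else "Normal"
    availability = "available" if backend_available else "unavailable"
    return f"Mode: {mode} (fast backend {availability})"


def enable_fast(backend_available: bool) -> Tuple[bool, Optional[str]]:
    """Enable fast mode and return (fast_enabled, warning_or_None).

    Assumption: the preference is stored either way, and the user is
    warned instead of being silently routed to the normal backend.
    """
    warning = None
    if not backend_available:
        warning = "Fast backend cannot currently be resolved; requests will use the normal route."
    return True, warning
```

Returning the warning as data rather than printing it keeps the handler easy to cover with focused CLI tests for the unavailable-backend path.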

Targeted verification run locally:

  • pytest tests/cli/test_fast_command.py tests/hermes_cli/test_commands.py tests/run_agent/test_provider_parity.py tests/run_agent/test_run_agent_codex_responses.py -q
  • Result: 225 passed

@teknium1

Contributor

Merged via PR #6875. Your commits were cherry-picked onto current main with your authorship preserved. Cleanups applied: removed simple_term_menu/input() (banned patterns) and simplified override resolution to a single source of truth. Thanks for the contribution @g-guthrie!

@teknium1 closed this on Apr 10, 2026

2 participants