fix: preserve configured context lengths across runtime paths by tongguang2 · Pull Request #14008 · NousResearch/hermes-agent

tongguang2 · 2026-04-22T12:55:25Z

Summary

This PR fixes a set of context length inconsistencies that still affect custom-provider runtimes across different execution paths.

What it fixes

- Normalize persistent context-length cache keys so /v1 and /v1/ resolve to the same provider endpoint
- Preserve configured context-length overrides when switching models
- Preserve configured context-length overrides when activating fallback models
- Avoid incorrectly reusing a per-model custom_providers context override for a different target model
- Add regression tests for cache normalization, switch-model handling, fallback handling, and configured override persistence

Root cause

There were two separate issues:

- Persistent context-length cache lookups treated .../v1 and .../v1/ as different keys, which caused cache misses for the same custom provider endpoint in different runtime paths.
- Runtime context refresh paths (switch_model() / fallback activation) did not consistently resolve the configured context-length override for the active target model and endpoint.

As a result, custom providers that were correctly configured in config.yaml could still fall back to an incorrect detected context length in some paths.
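
For illustration, the second issue can be sketched roughly as follows. This is not the code from this PR: the function names and the `base_url` / `model` / `context_length` entry keys are assumptions made for the example, and the real custom_providers schema in config.yaml may differ. The point is only that a configured override should be resolved against the runtime target (model plus endpoint), and that a per-model override should not leak to a different model.

```python
from typing import Any, Dict, List, Optional


def _norm(url: str) -> str:
    """Compare endpoints without regard to trailing slashes."""
    return url.strip().rstrip("/")


def resolve_configured_context_length(
    custom_providers: List[Dict[str, Any]],
    target_model: str,
    target_base_url: str,
) -> Optional[int]:
    """Return the configured override for this exact runtime target, if any.

    Hypothetical helper: entry keys are assumed, not taken from the repo.
    """
    for entry in custom_providers:
        if _norm(str(entry.get("base_url", ""))) != _norm(target_base_url):
            continue  # different endpoint
        entry_model = entry.get("model")
        # A per-model override applies only to that model.
        if entry_model is not None and entry_model != target_model:
            continue
        length = entry.get("context_length")
        if isinstance(length, int) and not isinstance(length, bool) and length > 0:
            return length
    return None


def effective_context_length(
    custom_providers: List[Dict[str, Any]],
    target_model: str,
    target_base_url: str,
    detected: int,
) -> int:
    """Prefer the configured override; otherwise keep the detected value."""
    configured = resolve_configured_context_length(
        custom_providers, target_model, target_base_url
    )
    return configured if configured is not None else detected
```

In this sketch, a switch or fallback to a model that has no matching entry keeps the detected value instead of inheriting the previous model's override.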

Implementation details

- Added base URL normalization for persistent context-length cache writes
- Made cache reads backward-compatible with legacy slash variants
- Added runtime-target-aware context-length resolution in AIAgent
- Reused that runtime-target-aware resolution when refreshing model metadata during model switch and fallback activation
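
A minimal sketch of the cache-key side, assuming an in-memory dict keyed by (base URL, model). The actual persistent cache format and helper names in the repo are not shown in this PR description, so everything below is hypothetical. Writes always use the normalized URL, while reads also try the trailing-slash form so entries written before normalization are still found.

```python
from typing import Dict, Optional, Tuple

CacheKey = Tuple[str, str]  # (base URL, model name); assumed key shape


def normalize_base_url(base_url: str) -> str:
    """Strip whitespace and trailing slashes so /v1 and /v1/ produce one key."""
    return base_url.strip().rstrip("/")


def save_context_length(
    cache: Dict[CacheKey, int], base_url: str, model: str, length: int
) -> None:
    """Always write under the normalized key."""
    cache[(normalize_base_url(base_url), model)] = length


def load_context_length(
    cache: Dict[CacheKey, int], base_url: str, model: str
) -> Optional[int]:
    """Read the normalized key first, then a legacy trailing-slash key."""
    normalized = normalize_base_url(base_url)
    for candidate in (normalized, normalized + "/"):
        value = cache.get((candidate, model))
        if value is not None:
            return value
    return None
```

Keeping the legacy lookup on the read path means existing cache files do not need a migration step.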

Tests

Verified with:

tests/agent/test_model_metadata.py
tests/run_agent/test_switch_model_context.py
tests/run_agent/test_fallback_model.py
tests/run_agent/test_provider_fallback.py
tests/run_agent/test_invalid_context_length_warning.py
tests/run_agent/test_compression_feasibility.py

Local result:

147 passed

Notes

This PR is intentionally scoped to the runtime-path inconsistency and cache-key normalization issue. It does not introduce broader changes to provider metadata discovery beyond what is needed to make configured context lengths behave consistently.

- normalize context cache keys so /v1 and /v1/ share the same entry
- resolve configured context overrides per runtime target during switch and fallback
- add regression coverage for cache normalization and per-model override handling

alt-glitch · 2026-04-22T13:07:29Z

Related to #11437 (context length display/runtime alignment), #8785 / #8786 (compression context overrides) — same family of context-length consistency bugs across different runtime paths.

alt-glitch · 2026-04-22T13:07:57Z

Related to #11437, #8785, #8786.

alt-glitch · 2026-04-22T13:08:43Z

Related to #11437, #8785, #8786.

alt-glitch added the labels type/bug (Something isn't working), P1 (High: major feature broken, no workaround), and comp/agent (Core agent loop, run_agent.py, prompt builder) on Apr 22, 2026

This was referenced Apr 22, 2026

fix: resolve context_length from custom_providers on model switch #13052

Closed

fix: reapply compression feasibility guard after model switch #15541

Open

This was referenced May 12, 2026

# Bug Report: model.context_length persists across /model provider switches #24072

Closed

fix(agent): forward custom_providers to context-length probe on fallback activation #25554

Closed

tongguang2 wants to merge 1 commit into NousResearch:main from tongguang2:fix/context-length-cache-and-runtime-overrides


2 participants
