fix: persist resolved context_length after /model switch by vanhoof · Pull Request #36199 · NousResearch/hermes-agent

vanhoof · 2026-06-01T02:26:29Z

Problem

After switch_model() clears _config_context_length to None (line 1411) so the new model can resolve its own context window, the display path in cli.py reads None via getattr(agent, "_config_context_length", None) and falls through every resolution step to the 256K default.

This affects all custom provider setups (LiteLLM, vertex-proxy, or any custom_providers/providers entry) where the model's context length resolves via DEFAULT_CONTEXT_LENGTHS or live probes rather than an explicit config_context_length.

Observed behavior: /model always shows Context: 256,000 tokens regardless of the actual model context window, even when switching between 200K and 1M models.

Fix

The context compressor already resolves the correct value via get_model_context_length() during the switch. Write it back to agent._config_context_length so the /model confirmation message in cli.py (and gateway /info) can read the resolved value instead of None.

7-line change, no new dependencies.

Testing

Manual testing with providers: config entries pointing at a local proxy (vertex-proxy on port 8788, Anthropic wire protocol). Before fix: /model always shows 256K. After fix: shows correct context length per model (1M for claude-opus-4.6, 200K for claude-sonnet-4, etc.).

Signed-off-by: Chris van Hoof vanhoof@ouwish.com

After switch_model() clears _config_context_length to None so the new model can resolve its own context window, the display path in cli.py reads None and falls through to the 256K default for custom providers (LiteLLM, vertex-proxy, etc.) that don't match any probing step. The context compressor already resolves the correct value via get_model_context_length() — write it back to _config_context_length so the /model confirmation message shows the actual context window instead of the fallback. Fixes the display for all custom provider setups where the model's context length resolves via DEFAULT_CONTEXT_LENGTHS or live probes rather than config_context_length. Signed-off-by: Chris van Hoof <vanhoof@ouwish.com>

mxnstrexgl

LGTM — automated review passed. No security, quality, or test coverage issues detected.

vanhoof · 2026-06-02T23:06:17Z

Superseded by #37712, which fixes the root cause (two additional bugs in the resolution chain that made the persistence here persist the wrong value).

alt-glitch added type/bug Something isn't working comp/agent Core agent loop, run_agent.py, prompt builder P2 Medium — degraded but workaround exists labels Jun 1, 2026

mxnstrexgl approved these changes Jun 1, 2026

View reviewed changes

vanhoof mentioned this pull request Jun 2, 2026

fix(config): resolve provider-level context_length on /model switch #37712

Closed

vanhoof closed this Jun 2, 2026

vanhoof mentioned this pull request Jun 2, 2026

fix(config): resolve provider-level context_length on /model switch #37716

Open

This was referenced Jun 2, 2026

[Feature]: Re-budget the context compressor when a router serves a different backend per request #37719

Open

feat(agent): re-budget context compressor when a router swaps the backend #37720

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: persist resolved context_length after /model switch#36199

fix: persist resolved context_length after /model switch#36199
vanhoof wants to merge 1 commit into
NousResearch:mainfrom
vanhoof:fix/model-switch-context-length-display

vanhoof commented Jun 1, 2026

Uh oh!

mxnstrexgl left a comment

Uh oh!

vanhoof commented Jun 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

vanhoof commented Jun 1, 2026

Problem

Fix

Testing

Uh oh!

mxnstrexgl left a comment

Choose a reason for hiding this comment

Uh oh!

vanhoof commented Jun 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants