fix(agent): clear stale config context_length on model switch by AgentArcLab · Pull Request #22387 · NousResearch/hermes-agent

AgentArcLab · 2026-05-09T07:51:18Z

Problem

When switching models (via /model or fallback), AIAgent._config_context_length is never cleared, so the new model inherits the previous model's context window instead of auto-detecting the correct one via get_model_context_length().

Root Cause

AIAgent._config_context_length is set once during __init__ from model.context_length in config.yaml. This value is never cleared in either:

switch_model() — the /model command handler
_try_activate_fallback() — the failover path on primary model failure

Since get_model_context_length() checks this value at resolution step 0 and returns it immediately, the new/fallback model inherits the old override instead of going through the full resolution chain (custom_providers per-model, endpoint metadata, models.dev, etc.).

Fix

Clear self._config_context_length = None in both code paths, before the runtime field swap. This allows get_model_context_length() to skip the stale step-0 override and properly resolve the context window for the newly selected model through the standard chain (step 0b: custom_providers per-model, then endpoint probe, models.dev, etc.).

Testing

Configure model.context_length: 1048576 in config.yaml for model A (1M context)
Start Hermes with model A — verify 1M context window
Switch to model B (e.g. 200K context) via /model — before fix: still shows 1M; after fix: correctly shows 200K
Switch back to model A — correctly shows 1M again
Trigger a fallback (e.g. rate-limit the primary model) — before fix: fallback model inherits primary's context window; after fix: fallback model resolves its own context window

Closes #21509

When switching models via /model, AIAgent._config_context_length was never cleared, so the new model inherited the previous model's context window instead of auto-detecting the correct one via get_model_context_length(). Clear _config_context_length to None before the runtime field swap so the full resolution chain (custom_providers per-model, endpoint probe, models.dev, etc.) is re-evaluated for the newly selected model. Closes NousResearch#21509

alt-glitch · 2026-05-09T08:14:46Z

Duplicate of #11438 — same root cause (stale _config_context_length not cleared on model switch). #11438 also covers the custom_providers per-model resolution path. See also closed #21509 (identical fix, never merged).

AgentArcLab · 2026-05-09T09:53:29Z

Thanks for the pointer to #11438! You're right that the root cause is the same — stale _config_context_length not cleared on model switch.

A couple of notes on where this PR differs:

Also covers the fallback path. This fix clears _config_context_length in both switch_model() (the /model handler) and _try_activate_fallback() (the failover path on primary model failure). fix(/model): respect per-model context_length from custom_providers config #11438 and the other related PRs (fix(model-switch): honor custom_providers per-model context_length on /model switch (#15779) #15787, fix: resolve context_length from custom_providers on model switch #13052) only address the /model switch path — when the agent falls back to a different model after a rate limit or error, the same stale context_length bug applies.
Minimal scope. This PR is intentionally narrow — only 2 insertion points in run_agent.py, no changes to model_switch.py, cli.py, or gateway/run.py. The confirmation display paths are a separate concern and can be addressed independently.

I noticed that #11438, #15787, and #13052 were all closed without merging, so the bug remains unfixed on main. Happy to adjust the approach if there's a preferred direction — just want to make sure this doesn't fall through the cracks again 🙂

teknium1 · 2026-05-13T01:50:11Z

Merged via PR #24724 (cherry-picked onto current main with your authorship preserved). Thanks for the contribution!

AgentArcLab force-pushed the fix/clear-config-context-length-on-model-switch branch from 738538b to e25ac0e Compare May 9, 2026 07:58

alt-glitch added type/bug Something isn't working P2 Medium — degraded but workaround exists comp/agent Core agent loop, run_agent.py, prompt builder area/config Config system, migrations, profiles duplicate This issue or pull request already exists labels May 9, 2026

This was referenced May 12, 2026

[Bug]: TUI shows wrong token limit when switching from local model to GPT-5.5 #24080

Closed

fix: clear stale _config_context_length when switching providers #24079

Closed

teknium1 mentioned this pull request May 13, 2026

fix(agent): clear stale config context_length on model switch #24724

Merged

teknium1 closed this May 13, 2026

ethernet8023 mentioned this pull request May 14, 2026

fix(ci): unblock shared PR checks #21012

Merged

25 tasks

briandevans mentioned this pull request May 15, 2026

fix(agent): forward fallback_providers[].extra_body during fallback activation (#26460) #26483

Closed

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(agent): clear stale config context_length on model switch#22387

fix(agent): clear stale config context_length on model switch#22387
AgentArcLab wants to merge 1 commit into
NousResearch:mainfrom
AgentArcLab:fix/clear-config-context-length-on-model-switch

AgentArcLab commented May 9, 2026 •

edited

Loading

Uh oh!

alt-glitch commented May 9, 2026

Uh oh!

AgentArcLab commented May 9, 2026

Uh oh!

teknium1 commented May 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

AgentArcLab commented May 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Problem

Root Cause

Fix

Testing

Uh oh!

alt-glitch commented May 9, 2026

Uh oh!

AgentArcLab commented May 9, 2026

Uh oh!

teknium1 commented May 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

AgentArcLab commented May 9, 2026 •

edited

Loading