fix: auto-invalidate stale context length cache when defaults change by Tranquil-Flow · Pull Request #1852 · NousResearch/hermes-agent

Tranquil-Flow · 2026-03-18T00:30:39Z

Summary

- Adds hash-based invalidation to the persistent context length cache (~/.hermes/context_length_cache.yaml)
- When a new hermes-agent version changes DEFAULT_CONTEXT_LENGTHS, stale cached values are automatically discarded
- Legacy cache files (without the hash field) are treated as stale and invalidated on first read

Problem

The context length cache stores discovered model limits across sessions with no expiration or versioning. If a hermes-agent update changes a model's default context length (e.g., a provider upgrades from 128K to 256K, or a new model is added with different defaults), existing cache entries silently override the new default. Users would be stuck on the old limit until they manually delete ~/.hermes/context_length_cache.yaml.

This affects all models and providers that go through the probing system — not just Anthropic models.
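The failure mode can be sketched in a few lines. This is an illustration only: the model name, the values, and the `get_context_length` helper are hypothetical, and the cache-first lookup order is an assumption inferred from the description above, not the actual code in `agent/model_metadata.py`.

```python
# Hypothetical data: an update raised the default, but an older session
# already cached the previous limit.
DEFAULT_CONTEXT_LENGTHS = {"example-model": 256_000}  # default after the update
cached_lengths = {"example-model": 128_000}           # written before the update


def get_context_length(model: str) -> int:
    """Cache-first lookup (assumed): a cached entry always wins over the default."""
    if model in cached_lengths:
        return cached_lengths[model]
    return DEFAULT_CONTEXT_LENGTHS[model]


print(get_context_length("example-model"))  # 128000, the stale value
```

With no versioning on the cache, nothing tells the lookup that the 128K entry predates the new default.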

Solution

A defaults_hash field (truncated SHA-256 of DEFAULT_CONTEXT_LENGTHS) is stored alongside cached entries. On every cache read, the stored hash is compared against the current hash:

- Match → cache is valid, entries preserved
- Mismatch → cache is stale, all entries discarded (forces re-probe)
- Missing → legacy file, treated as stale

The hash is recomputed from the sorted dict, so it's deterministic and changes only when actual model defaults change.
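A minimal sketch of the scheme described above, not the code from this PR. It works on already-parsed cache contents (plain dicts) rather than the YAML file itself. The 16-character truncation, the JSON canonicalization, the `context_lengths` key, and the function names `compute_defaults_hash`, `validate_cache`, and `build_cache` are all assumptions made for illustration; only the `defaults_hash` field name and the use of a truncated SHA-256 over the sorted defaults come from the description.

```python
import hashlib
import json

# Placeholder defaults; the real DEFAULT_CONTEXT_LENGTHS lives in hermes-agent.
DEFAULT_CONTEXT_LENGTHS = {"model-a": 128_000, "model-b": 200_000}


def compute_defaults_hash(defaults: dict) -> str:
    """Truncated SHA-256 over the sorted items, so key order does not matter."""
    canonical = json.dumps(sorted(defaults.items()))
    # Truncation length of 16 hex characters is an assumption.
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()[:16]


def validate_cache(raw: dict, defaults: dict) -> dict:
    """Return the cached entries if the stored hash matches, else an empty dict.

    A missing hash (legacy file) compares unequal and is treated as stale.
    """
    if raw.get("defaults_hash") != compute_defaults_hash(defaults):
        return {}
    return dict(raw.get("context_lengths", {}))


def build_cache(entries: dict, defaults: dict) -> dict:
    """Produce the structure to persist: entries plus the current defaults hash."""
    return {
        "defaults_hash": compute_defaults_hash(defaults),
        "context_lengths": dict(entries),
    }


cache = build_cache({"local-model": 32_768}, DEFAULT_CONTEXT_LENGTHS)
print(validate_cache(cache, DEFAULT_CONTEXT_LENGTHS))  # match: entries preserved

updated = dict(DEFAULT_CONTEXT_LENGTHS, **{"model-a": 256_000})
print(validate_cache(cache, updated))                  # mismatch: {}
print(validate_cache({"context_lengths": {"local-model": 32_768}}, updated))  # legacy: {}
```

Sorting the items before hashing is what makes the result independent of dict insertion order, so the hash only changes when a key or value in the defaults changes.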

Files changed

| File | Change |
| --- | --- |
| `agent/model_metadata.py` | New `_compute_defaults_hash()`; updated `_load_context_cache()` and `save_context_length()` |
| `tests/agent/test_model_metadata.py` | 7 new tests for hash storage, invalidation, legacy migration, determinism |

Tradeoff

Changing ANY model's default invalidates ALL cached entries, including correctly-probed local models (e.g., Ollama at 32K). This is acceptable because re-probing is automatic, transparent (one API call per model), and only happens once per hermes-agent update that modifies defaults.

Related

- Independent of #1849 (feat: auto-detect extended context for premium API tiers): this PR fixes a pre-existing cache issue that affects all models.
- Works well with #1849, which writes to this cache.

Test plan

- 112 tests passing (69 model_metadata + 29 context_compressor + 14 context overflow; no regressions)
- Manual: verify legacy cache file (no defaults_hash) is invalidated on first session
- Manual: verify cache survives across sessions when defaults haven't changed

The persistent context length cache (~/.hermes/context_length_cache.yaml) stores discovered model context limits across sessions. Previously, cached values lived forever — if a new hermes-agent version changed the default context length for a model (e.g., upgrading Claude from 200K to 1M for all users), existing cache entries would silently override the new default, leaving users stuck on the old limit. This adds a defaults_hash field to the cache file: a truncated SHA-256 of DEFAULT_CONTEXT_LENGTHS, recomputed on every read. When the hash doesn't match (new hermes-agent version with updated defaults), all cached entries are discarded and models re-probe their actual limits. Legacy cache files without the hash field are treated as stale and invalidated on first read — a one-time migration with no user action needed. Tradeoff: updating ANY model's default invalidates ALL cached entries, including correctly-probed local models. This is acceptable because re-probing is automatic and transparent (one API call per model).

teknium1 · 2026-03-30T01:17:09Z

Merged via PR #3802. Your cache invalidation implementation was cherry-picked onto current main with authorship preserved. Clean work — thanks!

Tranquil-Flow force-pushed the fix/cache-staleness-invalidation branch from de10701 to d81a53a on March 18, 2026 00:47

Tranquil-Flow mentioned this pull request Mar 18, 2026

feat: auto-detect extended context for premium API tiers #1849

Closed

3 tasks

teknium1 mentioned this pull request Mar 29, 2026

fix(setup): auto-install matrix-nio during hermes setup #3802

Merged

teknium1 closed this Mar 30, 2026

Tranquil-Flow wants to merge 1 commit into NousResearch:main from Tranquil-Flow:fix/cache-staleness-invalidation
