fix(agent): clamp compression threshold below context_length to ensure compression can trigger (#14690) by Tranquil-Flow · Pull Request #15431 · NousResearch/hermes-agent

Tranquil-Flow · 2026-04-25T00:17:49Z

What does this PR do?

When context_length == MINIMUM_CONTEXT_LENGTH (64000), the threshold calculation max(int(context_length * 0.50), MINIMUM_CONTEXT_LENGTH) produces max(32000, 64000) = 64000, setting the compression threshold to 100% of the context window. The API errors before tokens ever reach that value, so auto-compression never triggers for models at the minimum context length.

The fix clamps threshold_tokens to at most 95% of context_length after the existing floor, preserving the MINIMUM_CONTEXT_LENGTH floor for large-context models (where the clamp is a no-op) while ensuring compression can always trigger before the API limit.

Related Issue

Fixes #14690

Type of Change

🐛 Bug fix (non-breaking change that fixes an issue)
✨ New feature (non-breaking change that adds functionality)
🔒 Security fix
📝 Documentation update
✅ Tests (adding or improving test coverage)
♻️ Refactor (no behavior change)
🎯 New skill (bundled or hub)

Changes Made

agent/context_compressor.py: After the existing max() floor, clamp threshold_tokens to at most 95% of context_length. Applied in both __init__ and update_model(). Additionally, update_model() now recomputes all derived budgets (tail_token_budget, max_summary_tokens) — previously it only updated threshold_tokens, leaving stale budget values after model switches.

How to Test

Run:

python -m pytest tests/agent/test_context_compressor.py -v

4 new tests in TestThresholdClampAtMinimumContext:
- Threshold is below context_length at MINIMUM_CONTEXT_LENGTH
- should_compress() triggers correctly at threshold boundary
- update_model() produces a working threshold
- Large-context models are unaffected by the clamp

Tested on macOS (Python 3.11).

Checklist

Code

I've read the Contributing Guide
My commit messages follow Conventional Commits (fix(scope):, feat(scope):, etc.)
I searched for existing PRs to make sure this isn't a duplicate
My PR contains only changes related to this fix/feature (no unrelated commits)
I've run pytest tests/ -q and all tests pass
I've added tests for my changes (required for bug fixes, strongly encouraged for features)
I've tested on my platform: macOS 15 (Darwin 24.6.0)

Documentation & Housekeeping

I've updated relevant documentation (README, docs/, docstrings) — or N/A
I've updated cli-config.yaml.example if I added/changed config keys — or N/A
I've updated CONTRIBUTING.md or AGENTS.md if I changed architecture or workflows — or N/A
I've considered cross-platform impact (Windows, macOS) per the compatibility guide — or N/A
I've updated tool descriptions/schemas if I changed tool behavior — or N/A

Screenshots / Logs

python -m pytest tests/agent/test_context_compressor.py -v
# 4 new tests in TestThresholdClampAtMinimumContext pass

alt-glitch · 2026-04-25T00:21:13Z

Likely duplicate of #14878 — both fix the same issue (#14690): compression threshold unreachable at 64K context. Also overlaps with omnibus PR #14696.

alt-glitch added type/bug Something isn't working P2 Medium — degraded but workaround exists comp/agent Core agent loop, run_agent.py, prompt builder labels Apr 25, 2026

alt-glitch mentioned this pull request Apr 25, 2026

fix(compressor): prevent MINIMUM_CONTEXT_LENGTH floor from blocking compression at small contexts #15496

Closed

devilardis mentioned this pull request Apr 25, 2026

fix(compression): three bugs causing auto-compression to never trigger #14696

Closed

Tranquil-Flow force-pushed the fix/compress-threshold-14690 branch from 1484a55 to 72ca28d Compare May 18, 2026 21:21

fix(agent): recompute compressor budgets on context updates

387dffe

Tranquil-Flow force-pushed the fix/compress-threshold-14690 branch from 72ca28d to 387dffe Compare May 25, 2026 13:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(agent): clamp compression threshold below context_length to ensure compression can trigger (#14690)#15431

fix(agent): clamp compression threshold below context_length to ensure compression can trigger (#14690)#15431
Tranquil-Flow wants to merge 1 commit into
NousResearch:mainfrom
Tranquil-Flow:fix/compress-threshold-14690

Tranquil-Flow commented Apr 25, 2026 •

edited

Loading

Uh oh!

alt-glitch commented Apr 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Tranquil-Flow commented Apr 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Related Issue

Type of Change

Changes Made

How to Test

Checklist

Code

Documentation & Housekeeping

Screenshots / Logs

Uh oh!

alt-glitch commented Apr 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Tranquil-Flow commented Apr 25, 2026 •

edited

Loading