Fix/underestimates token count by SimbaKingjoe · Pull Request #16117 · NousResearch/hermes-agent

SimbaKingjoe · 2026-04-26T15:45:59Z

What does this PR do?

_find_tail_cut_by_tokens calls len(content) to estimate message tokens, but
when content is a multimodal list (e.g. [{"type": "text", "text": "..."}, {"type": "image_url", ...}]),
len() returns the number of blocks (~2) instead of total character count.
Every multimodal message is therefore estimated as ~10 tokens regardless of
actual size, causing tail protection to expand far beyond the token budget.

The fix sums text lengths across content blocks when content is a list —
mirroring the pattern already used correctly in the sibling method
_prune_old_tool_results (line 488).

Related Issue

Fixes #16087
Related: #14694 (anti-thrashing cascade), supersedes #14395 (closed without merge)

Type of Change

🐛 Bug fix (non-breaking change that fixes an issue)
✅ Tests (adding or improving test coverage)

Changes Made

agent/context_compressor.py — _find_tail_cut_by_tokens(): replace
len(content) with sum(len(p.get("text", "")) for p in content if isinstance(p, dict)) if isinstance(content, list) else len(content)
tests/agent/test_context_compressor.py — added
TestTokenBudgetTailProtection::test_multimodal_list_content_counted_correctly

How to Test

Run the new regression test:

uv run pytest tests/agent/test_context_compressor.py::TestTokenBudgetTailProtection::test_multimodal_list_content_counted_correctly -v

… tail protection and ineffective context compression

…nto main

alt-glitch · 2026-04-26T16:05:32Z

Note: PR #16113 also targets the same issue #16087 with an equivalent fix. Both replace len(content) with summed text lengths for multimodal list content in _find_tail_cut_by_tokens.

maxddai added 2 commits April 26, 2026 23:43

underestimates token count for multimodal messages, causing oversized…

96ae387

… tail protection and ineffective context compression

Merge branch 'main' of https://github.com/SimbaKingjoe/hermes-agent i…

07b26f7

…nto main

alt-glitch added type/bug Something isn't working P2 Medium — degraded but workaround exists comp/agent Core agent loop, run_agent.py, prompt builder labels Apr 26, 2026

briandevans mentioned this pull request Apr 26, 2026

fix(compressor): use text char sum for multimodal token estimation in _find_tail_cut_by_tokens #16113

Closed

teknium1 mentioned this pull request Apr 27, 2026

fix(compressor): use text char sum for multimodal token estimation in _find_tail_cut_by_tokens #16369

Merged

teknium1 closed this in #16369 Apr 27, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix/underestimates token count#16117

Fix/underestimates token count#16117
SimbaKingjoe wants to merge 2 commits into
NousResearch:mainfrom
SimbaKingjoe:fix/underestimates-token-count

SimbaKingjoe commented Apr 26, 2026

Uh oh!

alt-glitch commented Apr 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

SimbaKingjoe commented Apr 26, 2026

What does this PR do?

Related Issue

Type of Change

Changes Made

How to Test

Uh oh!

alt-glitch commented Apr 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants