fix(tools): isolate get_tool_definitions quiet_mode cache + dedup LCM injection (#17335) by teknium1 · Pull Request #17889 · NousResearch/hermes-agent

teknium1 · 2026-04-30T10:06:23Z

Salvages #17337 by @Sanjays2402 onto current main. Closes #17335.

Long-lived Gateway processes were sending duplicate tool names to providers that enforce uniqueness (DeepSeek, Xiaomi MiMo, Moonshot/Kimi → HTTP 400). TUI was unaffected because it runs with quiet_mode=False and skips the cache.

Root cause (two layered bugs)

model_tools.get_tool_definitions(quiet_mode=True) aliased its cached list on the first uncached call. The cache-hit path already returned list(cached), but the first call stored and returned the same object. run_agent then mutates self.tools in place, so agent init Terminal tool #1 poisoned the cache and every subsequent init re-appended LCM schemas.
run_agent.py LCM context-engine injection had no dedup, unlike the memory-tools injection right above it.

Fix (defense in depth)

model_tools.py — cache the result then return list(result) on the uncached branch, mirroring the cache-hit path
run_agent.py — build _existing_tool_names from self.tools and skip already-present schemas, mirroring memory-tools dedup

Validation

scripts/run_tests.sh tests/test_get_tool_definitions_cache_isolation.py tests/test_model_tools.py
29 passed in 3.98s

5 new regression tests pin the behavior; 23 existing test_model_tools.py tests still pass. Authorship preserved for @Sanjays2402.

… injection (#17335) Long-lived Gateway processes were sending duplicate tool names to providers that enforce uniqueness: - DeepSeek: 'Tool names must be unique.' - Xiaomi MiMo: 'tools contains duplicate names: lcm_expand' - Moonshot/Kimi: 'function name lcm_grep is duplicated' TUI was unaffected because TUI runs with quiet_mode=False and skips the cache entirely. Root cause (two layered bugs) - model_tools.get_tool_definitions(quiet_mode=True) memoizes its result in _tool_defs_cache. The cache-hit path returned list(cached) (safe), but the FIRST uncached call stored and returned the SAME object. run_agent.py mutates self.tools (memory + LCM context-engine schemas) in-place, so the very first agent init in a Gateway process poisoned the cache, and every subsequent init appended LCM schemas again on top of the already-polluted list. - run_agent.py's context-engine injection (lcm_grep / lcm_describe / lcm_expand) had no dedup, unlike the memory-tools injection right above it which already skips already-present names. Fix (defense in depth, per the issue's suggested fix) - model_tools.get_tool_definitions: on the uncached branch, cache the computed list but return list(result) to the caller. Same pattern as the cache-hit path. - run_agent.py: build _existing_tool_names from self.tools and skip schemas whose names are already present, mirroring the memory-tools block. This also defends against plugin paths that may register the same schemas via ctx.register_tool(). Tests (tests/test_get_tool_definitions_cache_isolation.py) - test_first_uncached_call_returns_fresh_list \u2014 pins the fix; without it, first-call alias caused all the symptoms. - test_cache_hit_returns_fresh_list \u2014 pre-existing behavior stays. - test_caller_mutation_does_not_poison_cache \u2014 simulates run_agent appending lcm_grep / lcm_expand to the returned list and asserts the next call doesn't see them. - test_repeated_caller_mutation_does_not_accumulate \u2014 reproduces the long-lived Gateway accumulation pattern across 5 agent inits. - test_non_quiet_mode_does_not_use_cache \u2014 sanity, explains why TUI was fine. 5/5 pass on the new file; 23/23 still pass on tests/test_model_tools.py.

alt-glitch added type/bug Something isn't working P1 High — major feature broken, no workaround comp/tools Tool registry, model_tools, toolsets comp/agent Core agent loop, run_agent.py, prompt builder labels Apr 30, 2026

teknium1 merged commit e0fa2cf into main Apr 30, 2026
12 checks passed

teknium1 deleted the hermes/hermes-a26af027 branch April 30, 2026 11:32

teknium1 mentioned this pull request Apr 30, 2026

fix(tools): isolate get_tool_definitions quiet_mode cache + dedup LCM injection (#17335) #17337

Closed

github-actions Bot mentioned this pull request May 1, 2026

chore: bump NousResearch/hermes-agent version from v2026.4.23 to v2026.4.30 Docker-Hub-sirmark/docker-hermes-agent#4

Merged

BrewTestBot mentioned this pull request May 7, 2026

hermes-agent 2026.5.7 Homebrew/homebrew-core#281437

Merged

1 task

github-actions Bot mentioned this pull request May 8, 2026

chore: bump NousResearch/hermes-agent version from v2026.4.30 to v2026.5.7 Docker-Hub-sirmark/docker-hermes-agent#5

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(tools): isolate get_tool_definitions quiet_mode cache + dedup LCM injection (#17335)#17889

fix(tools): isolate get_tool_definitions quiet_mode cache + dedup LCM injection (#17335)#17889
teknium1 merged 1 commit into
mainfrom
hermes/hermes-a26af027

teknium1 commented Apr 30, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

teknium1 commented Apr 30, 2026

Root cause (two layered bugs)

Fix (defense in depth)

Validation

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants