fix(background_review): inherit parent's toolset config to keep tools[] cache-stable (~50% fewer cache-write tokens on long sessions) by ziliangpeng · Pull Request #29568 · NousResearch/hermes-agent

ziliangpeng · 2026-05-21T01:42:14Z

fix(background_review): inherit parent's toolset config to keep `tools[]` cache-stable

Summary

Fixes #29567. The background skill/memory-review fork (agent/background_review.py) constructs its child AIAgent without propagating enabled_toolsets / disabled_toolsets from the parent. When the parent's toolset is narrowed from the registry default (e.g. via hermes tools disable or config.yaml), the fork's default enabled_toolsets=None expands to "all registered tools" and the fork's outbound tools[] becomes a strict superset of the parent's. Anthropic's prompt-cache key includes the tools[] array byte-for-byte, so the divergence forks the cache lineage and forces a full prefix rewrite on every review nudge.

This PR adds the two missing kwargs and aligns the review fork's tools[] with the parent's, extending PR #17276's system-bytes invariant one slot up the cache hierarchy.

Goal

Eliminate cache-write overhead on the background review path. On a captured ~5 hour Sonnet-class session, review-fork requests account for ~51 % of the session's total cache_creation_input_tokens even though they share messages[0..N] and system with the parent byte-for-byte — the divergence in tools[] alone is enough to push every fork req into a cache miss. After the fix, fork requests read from the parent's warmed cache instead of rewriting.

The safety contract from #15204 (the review fork must not dispatch terminal / send_message / delegate_task) is preserved: it is enforced by the post-construction set_thread_tool_whitelist({memory, skills, …}) call a few lines below, which gates dispatch, not what the request body transmits.

Implementation

agent/background_review.py — two added kwargs in the AIAgent(...) call inside _spawn_background_review:

enabled_toolsets=getattr(agent, "enabled_toolsets", None),
disabled_toolsets=getattr(agent, "disabled_toolsets", None),

Plus an explanatory block comment that calls out the cache-key dependency and the relationship to PR #17276.

Symmetric inheritance: when the parent's value is None, the fork's is also None and both expand to the registry default identically; when the parent narrows, the fork inherits the narrowed set verbatim.

Testing

Two test files updated:

tests/run_agent/test_background_review_cache_parity.py — new positive assertion test_review_fork_inherits_parent_toolset_config confirms both kwargs reach the constructor with the parent's values.
tests/run_agent/test_background_review_toolset_restriction.py — the existing test_background_review_does_not_narrow_toolset_schema was renamed to test_background_review_matches_parent_toolset_config and inverted. Its prior invariant ("the fork must NOT pass enabled_toolsets") was built on the implicit assumption that the parent always runs with the registry default; when the parent narrows, leaving the fork at None is what causes divergence. The corrected invariant is symmetric inheritance.

Sanity check: the new positive test fails without the fix and passes with it.

python -m pytest tests/run_agent/test_background_review.py \
                 tests/run_agent/test_background_review_summary.py \
                 tests/run_agent/test_background_review_toolset_restriction.py \
                 tests/run_agent/test_background_review_cache_parity.py -q

Output:

18 passed in 1.85s

End-to-end verification (capturing real /v1/messages traffic via a local HTTP proxy):

Window	Total reqs	Review-fork reqs	Distinct `tools[]` hashes per conv
Pre-fix (~5 h session)	411	87 (30 fork spawns)	2 (main vs fork)
Post-fix (~15 min, after first fork spawn)	32	4 (1 spawn)	1 (identical)

The post-fix fork retains its other identifiers (the _SKILL_REVIEW_PROMPT text on the last user message, the absence of output_config / thinking at the top level — those flow through the secondary completion adapter), but its tools[] array now hashes identically to the parent's.

Scope

agent/background_review.py: two added kwargs + explanatory comment.
Two test files updated as described above.
No production code paths outside the review fork.
No schema, public-API, or runtime-whitelist changes.
No new dependencies.

Related: extends PR #17276 / #25322 (which fixed system-bytes inheritance) to the tools[] slot.

…[] cache-stable ## Summary The background skill/memory-review fork constructed a child `AIAgent` without propagating `enabled_toolsets` / `disabled_toolsets` from the parent. When the parent narrowed its toolset (via `hermes tools disable` or `config.yaml`), the fork's default `enabled_toolsets=None` expanded to "all registered tools" — and the fork's outbound request body sent a wider `tools[]` array than the parent's main-turn request. Anthropic's prompt-cache key includes the `tools[]` array byte-for-byte, so this divergence forked the cache lineage on every nudge and forced a full prefix rewrite. On a captured ~4 hour Claude-via-Hermes session this cost roughly 4.3 M cache-write tokens — about half of those attributable to the per-nudge alternation between the main turn's narrowed `tools[]` and the review fork's wider `tools[]`. ## Goal Extend the byte-stability invariant established by PR NousResearch#17276 (which fixed `system`) to the `tools[]` slot of the request body, so the review fork's outbound request hits the parent's warmed Anthropic prefix cache regardless of how the parent's toolset is configured. ## Implementation Two-line change in `agent/background_review.py`: pass `enabled_toolsets=getattr(agent, "enabled_toolsets", None)` and the matching `disabled_toolsets` kwarg into the `AIAgent(...)` call inside `_spawn_background_review`. Adds an explanatory block comment that calls out the cache-key dependency and the relationship to PR NousResearch#17276. The post-construction runtime whitelist (`set_thread_tool_whitelist({memory, skills})`) is untouched — it still gates which tools the model is allowed to *dispatch*. This change aligns only what the request body *transmits*, not what the review is allowed to do, so the safety contract from issue NousResearch#15204 remains intact. ## Testing - `tests/run_agent/test_background_review_cache_parity.py`: new `test_review_fork_inherits_parent_toolset_config` asserts the parent's `enabled_toolsets` and `disabled_toolsets` reach the review-fork constructor as kwargs. - `tests/run_agent/test_background_review_toolset_restriction.py`: the existing `test_background_review_does_not_narrow_toolset_schema` was inverted (its old "must NOT pass enabled_toolsets" rule was built on the assumption that the parent always ran with the registry default — wrong in practice when the parent is narrowed). Renamed to `test_background_review_matches_parent_toolset_config` and updated to assert the parent's value propagates verbatim. - Verified the new positive test fails without the fix and passes with it. - Full suite for `test_background_review*`: ``` $ python -m pytest tests/run_agent/test_background_review.py \ tests/run_agent/test_background_review_summary.py \ tests/run_agent/test_background_review_toolset_restriction.py \ tests/run_agent/test_background_review_cache_parity.py -q 18 passed in 1.85s ``` ## Scope - `agent/background_review.py`: 2 added kwargs + explanatory comment. - Two test files: one new positive test, one inverted existing test. - No production code paths outside the review fork; no schema changes; no public-API changes. Refs: #1 (root-cause analysis with wire-level cache-write measurements). Extends PR NousResearch#17276's `system`-bytes invariant to the `tools[]` slot.

alt-glitch · 2026-05-21T06:24:08Z

@daimon-nous reproduce and review pls

daimon-nous · 2026-05-21T06:48:26Z

Review: PR #29568

Verdict: Approve — clean, correctly-scoped fix. Natural extension of the system-bytes invariant from PR #17276 / #25434 to the tools[] slot.

Analysis

Bug confirmed on origin/main: _spawn_background_review constructs AIAgent() without enabled_toolsets/disabled_toolsets (line 393). When the parent narrows its toolset (via hermes tools disable or config), the fork defaults to None → all registered tools. Anthropic's cache key includes tools[] byte-for-byte, so this divergence forces a full prefix rewrite per nudge.

Fix is correct:

getattr(agent, "enabled_toolsets", None) / disabled_toolsets — safe, since agent_init.py:389-390 always sets these on the instance
Symmetric inheritance: parent None → fork None (identical expansion); parent narrowed → fork inherits verbatim
Safety contract (Background skill-review agent can perform non-skill side effects after creating a skill #15204) preserved: set_thread_tool_whitelist() at L475 still gates dispatch independently of what schemas are transmitted

Tests are sound:

New positive test (test_review_fork_inherits_parent_toolset_config) uses non-None stubs so a regression that drops the kwargs surfaces (rather than spuriously passing because both sides default to None)
Inverted test correctly reflects the new invariant — the old "must NOT pass enabled_toolsets" rule assumed the parent always runs with the registry default, which breaks when users narrow their toolset

One edge case (non-blocking)

If the parent explicitly disables memory or skills toolsets, the fork inherits that narrowed set and the model won't see memory/skill tool descriptors → the review becomes a silent no-op. Pre-fix behavior accidentally preserved these tools (via None → full expansion), but this was incidental. In practice nobody disables memory while relying on memory review, and the pre-fix behavior was strictly worse (cache divergence on every nudge). Worth a one-line comment but not a blocker.

Scope

agent/background_review.py: 2 kwargs + 18-line comment (verbose but justified given the cache-key subtlety)
2 test files updated, 0 production paths outside background review touched
No schema, API, or whitelist changes

All 18 background review tests pass locally.

- Replace 18-line comment block with 3-line invariant statement - Trim test docstrings from multi-paragraph to single-line summaries - Trim assertion messages from 4-line to 2-line mismatch reports - Replace 5-line WHAT comments in stubs with 1-line WHY comments - Add ziliangdotme@gmail.com -> ziliangpeng to AUTHOR_MAP

alt-glitch · 2026-05-21T07:19:39Z

Merged via #29704. Your commit was cherry-picked onto current main with your authorship preserved in git log (rebase-merge). We trimmed the comment verbosity and added your email to AUTHOR_MAP in a follow-up commit. Thanks for the thorough analysis + fix @ziliangpeng!

This was referenced May 21, 2026

check_fn TTL cache causes massive Anthropic prompt-cache thrashing mid-conversation ziliangpeng/hermes-agent#1

Closed

fix(background_review): propagate parent toolset config to keep tools[] cache-stable ziliangpeng/hermes-agent#2

Closed

ziliangpeng changed the title ~~fix(background_review): inherit parent's toolset config to keep tools[] cache-stable~~ fix(background_review): inherit parent's toolset config to keep tools[] cache-stable (~50% fewer cache-write tokens on long sessions) May 21, 2026

alt-glitch added type/perf Performance improvement or optimization P2 Medium — degraded but workaround exists comp/agent Core agent loop, run_agent.py, prompt builder provider/anthropic Anthropic native Messages API labels May 21, 2026

alt-glitch mentioned this pull request May 21, 2026

background_review fork sends wider tools[] than parent, fragments Anthropic prefix cache (~50% wasted cache-write on long sessions) #29567

Closed

alt-glitch force-pushed the ziliang-review-fork-inherit-toolsets branch from e4f130c to a7cbf3e Compare May 21, 2026 07:17

alt-glitch mentioned this pull request May 21, 2026

fix(background_review): inherit parent toolset config for tools[] cache parity (salvage #29568) #29704

Merged

alt-glitch closed this May 21, 2026

ziliangpeng deleted the ziliang-review-fork-inherit-toolsets branch May 25, 2026 23:12

brendonlee20042004-sys mentioned this pull request May 29, 2026

bug: background review blocks MCP memory tools when using external memory provider #34746

Closed

JimStenstrom mentioned this pull request Jun 5, 2026

[Bug]: background-review fork advertises the full tool schema to LOCAL endpoints, making weak local models thrash the deny-wall #39996

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(background_review): inherit parent's toolset config to keep tools[] cache-stable (~50% fewer cache-write tokens on long sessions)#29568

fix(background_review): inherit parent's toolset config to keep tools[] cache-stable (~50% fewer cache-write tokens on long sessions)#29568
ziliangpeng wants to merge 2 commits into
NousResearch:mainfrom
ziliangpeng:ziliang-review-fork-inherit-toolsets

ziliangpeng commented May 21, 2026

Uh oh!

alt-glitch commented May 21, 2026

Uh oh!

daimon-nous Bot commented May 21, 2026

Uh oh!

alt-glitch commented May 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

ziliangpeng commented May 21, 2026

fix(background_review): inherit parent's toolset config to keep tools[] cache-stable

Summary

Goal

Implementation

Testing

Scope

Uh oh!

alt-glitch commented May 21, 2026

Uh oh!

daimon-nous Bot commented May 21, 2026

Review: PR #29568

Analysis

One edge case (non-blocking)

Scope

Uh oh!

alt-glitch commented May 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

fix(background_review): inherit parent's toolset config to keep `tools[]` cache-stable