fix(agent): exclude prior-history tool messages from background review summary (salvage #14967) by teknium1 · Pull Request #15057 · NousResearch/hermes-agent

teknium1 · 2026-04-24T10:09:05Z

Salvage of #14967 by @luyao618 onto current main. Chosen over the parallel #14969 for robustness.

What this PR does

Stops the background memory/skill review from re-surfacing stale tool results from the prior conversation as if they just happened. After e.g. creating a cron reminder, subsequent 💾 background-review notifications would include Cron job '<name>' created. again on every run, even though cron wasn't touched.

How

The review agent forks with conversation_history=messages_snapshot, so its _session_messages contains inherited tool messages. The scan that builds the 💾 summary walked the whole list and treated historical tool successes as new review actions.

@luyao618 extracts the scan into a testable AIAgent._summarize_background_review_actions staticmethod that:

Records every tool_call_id in the snapshot and skips review messages whose tool_call_id matches
Falls back to content-equality for tool messages that lack a tool_call_id
Hardens the data.get('success') branch against non-dict JSON payloads (latent bug — bare-string/list content previously raised)

Why this over #14969

#14969 used a slice approach (_session_messages[len(snapshot):]) which is smaller but brittle: if any future init step reorders, filters, or deduplicates the history (compression, prefix-cache replay, future hydration logic), the slice boundary silently drifts and stale results leak through again. ID-based matching is immune. #14967 also matches the issue author's explicit suggested approach verbatim and fixes the non-dict JSON crash.

Validation

	Before	After
Stale 'Cron created' surfaced in later review	yes	skipped
New 'User profile updated' action surfaced	yes	yes
Non-dict JSON tool content (bare string/list)	crash	gracefully skipped

tests/run_agent/test_background_review_summary.py — 8/8 pass (new file)
Full tests/run_agent/ — 940/940 pass (the 2 other failures are pre-existing on current main, unrelated)
E2E: reproduced the exact Background review notification includes stale tool results from conversation history #14944 scenario; fix filters stale cron + preserves new review action

Co-authored-by: @luyao618

@luyao618

…w summary Cherry-pick-of: 27b6a21 (PR #14967 by @luyao618) Co-authored-by: luyao618 <364939526@qq.com>

fix(agent): exclude prior-history tool messages from background revie…

02c587a

…w summary Cherry-pick-of: 27b6a21 (PR #14967 by @luyao618) Co-authored-by: luyao618 <364939526@qq.com>

teknium1 merged commit bc15f52 into main Apr 24, 2026
11 of 12 checks passed

teknium1 deleted the hermes/hermes-37dfb080 branch April 24, 2026 10:10

This was referenced Apr 24, 2026

fix(agent): exclude prior-history tool messages from background review summary #14967

Closed

fix(agent): scan only new tool results in bg review, skip snapshot history (#14944) #14969

Closed

alt-glitch added type/bug Something isn't working P2 Medium — degraded but workaround exists comp/agent Core agent loop, run_agent.py, prompt builder labels Apr 24, 2026

alt-glitch mentioned this pull request Apr 26, 2026

fix(agent): skip snapshot messages when scanning bg review tool results #9696

Closed

github-actions Bot mentioned this pull request May 1, 2026

chore: bump NousResearch/hermes-agent version from v2026.4.23 to v2026.4.30 Docker-Hub-sirmark/docker-hermes-agent#4

Merged

subinium mentioned this pull request May 11, 2026

feat(learning): self-improvement loop overhaul — rubric + active-update bias + runtime inheritance (Hermes v0.12 parity) subinium/CrowClaw#305

Open

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(agent): exclude prior-history tool messages from background review summary (salvage #14967)#15057

fix(agent): exclude prior-history tool messages from background review summary (salvage #14967)#15057
teknium1 merged 1 commit into
mainfrom
hermes/hermes-37dfb080

teknium1 commented Apr 24, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

teknium1 commented Apr 24, 2026

What this PR does

How

Why this over #14969

Validation

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants