feat(agent): per-turn file-mutation verifier footer by teknium1 · Pull Request #24498 · NousResearch/hermes-agent

teknium1 · 2026-05-12T18:49:44Z

Models can no longer silently over-claim file edits — every turn ends with an advisory footer listing any write_file / patch failures that were never superseded by a successful write to the same path.

Resurrected from #22149 (which sat stale because its Layer 1 commit — @briandevans's schema-description fix from #15673 — was independently salvaged and merged as 3adcc64 on Apr 25). This PR is just the substantive Layer 2 commit (verifier), rebuilt onto current main.

What it does

Detects when write_file / patch calls fail during a turn and surfaces them in a footer appended to the assistant's final response:

```
⚠️ File-mutation verifier: 3 file(s) were NOT modified this turn despite any wording above that may suggest otherwise. Run `git status` or `read_file` to confirm.
• concepts/automatic-organization.md — [patch] Could not find match for old_string
• concepts/lora.md — [patch] Could not find match for old_string
• concepts/rag-pipeline.md — [patch] Could not find match for old_string
```

Regression target: Ben Eng's llm-wiki session where grok-4.1-fast batched parallel patches, half failed with "Could not find old_string", and the model summarised the turn claiming every file was edited — forcing the user to manually run `git status` every turn.

Changes

`run_agent.py` — `_FILE_MUTATING_TOOLS` const, `_extract_file_mutation_targets` (handles write_file, patch-replace, patch-v4a single + multi-file via `*** Update File:` / `*** Add File:` / `*** Delete File:` headers), `_extract_error_preview`, `AIAgent._record_file_mutation_result`, `AIAgent._file_mutation_verifier_enabled`, `AIAgent._format_file_mutation_failure_footer`. Recording wired into both `_execute_tool_calls_concurrent` and `_execute_tool_calls_sequential`. Footer emission runs before `transform_llm_output` / `post_llm_call` so plugins still see (and can modify) the augmented text.
`hermes_cli/config.py` — `display.file_mutation_verifier` default `true`
`tests/run_agent/test_file_mutation_verifier.py` — 31 unit tests
`website/docs/user-guide/configuration.md` + `website/docs/reference/environment-variables.md` — `display.file_mutation_verifier` + `HERMES_FILE_MUTATION_VERIFIER`

Validation

	Before	After
Batch of parallel patches, half fail	Model summarises 'patched 5 files', user has to run `git status`	Footer lists every file that did NOT change
Model retries failed patch and recovers	Footer would have listed it	Success removes the path from state; no false positive
`tests/run_agent/test_file_mutation_verifier.py` + `tests/tools/test_file_tools.py`	N/A	60/60 passing

Notes

Zero cost: no LLM calls, no tool schema changes.
Prompt-cache safe: no messages injected into history; footer appended to returned string only.
Message-alternation safe: no synthetic user/tool messages introduced.
Opt-out: `display.file_mutation_verifier: false` or `HERMES_FILE_MUTATION_VERIFIER=0`.

Closes #15524. Supersedes #22149 (will close that PR pointing here).

Detect when write_file / patch calls fail during a turn and are never superseded by a successful write to the same path. When the final text response is delivered, append an advisory footer listing the files that did NOT change — so models that over-claim 'patched 5 files' after 4 silent failures can't hide the lie. Catches the failure mode reported in Ben Eng's llm-wiki session: grok-4.1-fast issued batches of parallel patches, half failed with 'Could not find old_string', and the agent summarised the turn claiming every file was edited. The user had to manually run 'git status' each turn to catch it. The verifier is a pure post-hoc check on tool results — no new LLM calls, no synthetic messages injected into history (prompt cache preserved), no changes to tool argument dispatch. Per-turn state is keyed by path; a later successful write to the same path clears the failure entry so single-file retry recovery is not flagged. Wired into both _execute_tool_calls_concurrent and _execute_tool_calls_sequential, so batched parallel patches and one-at- a-time edits are both covered. Footer emission happens after the agent loop exits, before transform_llm_output / post_llm_call plugin hooks run, so plugins still see (and can modify) the augmented text. Config: display.file_mutation_verifier (bool, default true) + HERMES_FILE_MUTATION_VERIFIER env override. 31 unit tests in tests/run_agent/test_file_mutation_verifier.py cover target extraction (write_file, patch-replace, patch-v4a single and multi-file), error-preview extraction (JSON .error field and plain string), per-turn state transitions (first-error-wins on repeated failure, success supersedes failure), footer rendering (truncation at 10 entries, user-actionable hint), and env/config precedence. Companion docs updated: user-guide/configuration.md + reference/environment-variables.md.

github-actions · 2026-05-12T18:51:04Z

🔎 Lint report: `hermes/hermes-263ad51b` vs `origin/main`

ruff

Total: 0 on HEAD, 0 on base (➖ 0)

🆕 New issues: none

✅ Fixed issues: none

Unchanged: 0 pre-existing issues carried over.

ty (type checker)

Total: 8193 on HEAD, 8191 on base (🆕 +2)

🆕 New issues (5):

Rule	Count
`invalid-argument-type`	3
`unresolved-import`	1
`unresolved-attribute`	1

First entries

run_agent.py:13733: [invalid-argument-type] invalid-argument-type: Argument to function `_is_oauth_token` is incorrect: Expected `str`, found `str | dict[Unknown, Unknown] | Any | ... omitted 3 union elements`
run_agent.py:13736: [invalid-argument-type] invalid-argument-type: Argument to function `len` is incorrect: Expected `Sized`, found `(str & ~AlwaysFalsy) | (dict[Unknown, Unknown] & ~AlwaysFalsy) | (Any & ~AlwaysFalsy) | ... omitted 3 union elements`
run_agent.py:7480: [invalid-argument-type] invalid-argument-type: Argument to function `build_anthropic_client` is incorrect: Expected `str`, found `str | dict[Unknown, Unknown] | Any | ... omitted 3 union elements`
tests/run_agent/test_file_mutation_verifier.py:24: [unresolved-import] unresolved-import: Cannot resolve imported module `pytest`
run_agent.py:15528: [unresolved-attribute] unresolved-attribute: Attribute `rstrip` is not defined on `None` in union `None | str | Unknown`

✅ Fixed issues (3):

Rule	Count
`invalid-argument-type`	3

First entries

run_agent.py:13539: [invalid-argument-type] invalid-argument-type: Argument to function `_is_oauth_token` is incorrect: Expected `str`, found `str | dict[Unknown | str, Unknown | str | dict[str, str]] | Any | ... omitted 3 union elements`
run_agent.py:13542: [invalid-argument-type] invalid-argument-type: Argument to function `len` is incorrect: Expected `Sized`, found `(str & ~AlwaysFalsy) | (dict[Unknown | str, Unknown | str | dict[str, str]] & ~AlwaysFalsy) | (Any & ~AlwaysFalsy) | ... omitted 3 union elements`
run_agent.py:7317: [invalid-argument-type] invalid-argument-type: Argument to function `build_anthropic_client` is incorrect: Expected `str`, found `str | dict[Unknown | str, Unknown | str | dict[str, str]] | Any | ... omitted 3 union elements`

Unchanged: 4306 pre-existing issues carried over.

Diagnostics are surfaced as warnings — this check never fails the build.

Detect when write_file / patch calls fail during a turn and are never superseded by a successful write to the same path. When the final text response is delivered, append an advisory footer listing the files that did NOT change — so models that over-claim 'patched 5 files' after 4 silent failures can't hide the lie. Catches the failure mode reported in Ben Eng's llm-wiki session: grok-4.1-fast issued batches of parallel patches, half failed with 'Could not find old_string', and the agent summarised the turn claiming every file was edited. The user had to manually run 'git status' each turn to catch it. The verifier is a pure post-hoc check on tool results — no new LLM calls, no synthetic messages injected into history (prompt cache preserved), no changes to tool argument dispatch. Per-turn state is keyed by path; a later successful write to the same path clears the failure entry so single-file retry recovery is not flagged. Wired into both _execute_tool_calls_concurrent and _execute_tool_calls_sequential, so batched parallel patches and one-at- a-time edits are both covered. Footer emission happens after the agent loop exits, before transform_llm_output / post_llm_call plugin hooks run, so plugins still see (and can modify) the augmented text. Config: display.file_mutation_verifier (bool, default true) + HERMES_FILE_MUTATION_VERIFIER env override. 31 unit tests in tests/run_agent/test_file_mutation_verifier.py cover target extraction (write_file, patch-replace, patch-v4a single and multi-file), error-preview extraction (JSON .error field and plain string), per-turn state transitions (first-error-wins on repeated failure, success supersedes failure), footer rendering (truncation at 10 entries, user-actionable hint), and env/config precedence. Companion docs updated: user-guide/configuration.md + reference/environment-variables.md.

teknium1 mentioned this pull request May 12, 2026

fix(patch-tool): per-mode required-param hints + per-turn file-mutation verifier footer #22149

Closed

teknium1 merged commit c594a23 into main May 12, 2026
14 of 17 checks passed

teknium1 deleted the hermes/hermes-263ad51b branch May 12, 2026 18:54

alt-glitch added type/feature New feature or request P2 Medium — degraded but workaround exists comp/agent Core agent loop, run_agent.py, prompt builder area/config Config system, migrations, profiles labels May 12, 2026

BrewTestBot mentioned this pull request May 16, 2026

hermes-agent 2026.5.16 Homebrew/homebrew-core#283141

Merged

1 task

github-actions Bot mentioned this pull request May 17, 2026

chore: bump NousResearch/hermes-agent version from v2026.5.7 to v2026.5.16 Docker-Hub-sirmark/docker-hermes-agent#6

Merged

This was referenced May 18, 2026

Sprint 1 — Foundation hardening (P0/P1, low-risk high-value) wesleysimplicio/hermes-turbo-agent#22

Closed

Cherry-pick upstream: per-turn file-mutation verifier footer (Hermes #24498) wesleysimplicio/hermes-turbo-agent#31

Closed

luisolave11 mentioned this pull request May 25, 2026

Slack: "is thinking..." status indicator gets stuck when agent ends without sending a reply #32295

Open

RyDoug mentioned this pull request Jun 6, 2026

TTS speaks the file-mutation verifier footer aloud (regression from #24498) #40772

Open

Elshayib mentioned this pull request Jun 7, 2026

fix(verifier): store file-mutation footer separately from final_response (#40772) #41048

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(agent): per-turn file-mutation verifier footer#24498

feat(agent): per-turn file-mutation verifier footer#24498
teknium1 merged 1 commit into
mainfrom
hermes/hermes-263ad51b

teknium1 commented May 12, 2026

Uh oh!

github-actions Bot commented May 12, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

teknium1 commented May 12, 2026

What it does

Changes

Validation

Notes

Uh oh!

github-actions Bot commented May 12, 2026

🔎 Lint report: hermes/hermes-263ad51b vs origin/main

ruff

ty (type checker)

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

🔎 Lint report: `hermes/hermes-263ad51b` vs `origin/main`