feat(file_tools): post-write delta lint on write_file + patch, add JSON/YAML/TOML/Python in-proc linters by teknium1 · Pull Request #20191 · NousResearch/hermes-agent

teknium1 · 2026-05-05T11:46:40Z

Summary

write_file now runs a post-write syntax check identical to what patch already did, and both tools filter out pre-existing lint errors so the agent only sees errors it actually introduced. Closes the gap where silently corrupt writes (bad quote escaping, truncated output) would persist on disk until a later read caught them.

Context

Found while working with Teknium today: batch-2 of a tips-corpus rewrite silently corrupted hermes_cli/tips.py because a manual string-escape substitution mashed inner quotes together, and write_file shipped the broken content with no signal. The patch tool already has _check_lint() (shell py_compile / node --check / tsc / go vet / rustfmt) but write_file skipped it.

Pattern cross-checked against three peer agents:

Cline (src/integrations/editor/DiffViewProvider.ts): captures preDiagnostics before edit, postDiagnostics after, returns only new errors
OpenCode (packages/opencode/src/tool/write.ts): runs lsp.diagnostics() post-write, appends 'LSP errors detected in this file, please fix: ...' to tool output
Claude Code (src/services/diagnosticTracking.ts): beforeFileEdited() stores baseline, getNewDiagnostics() returns delta

All three use the same shape: post-first, diff against pre. This PR ports that into Hermes's existing _check_lint machinery.

Changes

tools/file_operations.py

New in-process linters (LINTERS_INPROC): .py via ast.parse, .json via json.loads, .yaml via yaml.safe_load, .toml via tomllib. Microsecond cost, no subprocess, no toolchain dependency.
_check_lint(path, content=None) — accepts optional content, prefers in-process linter, falls back to shell linter.
New _check_lint_delta(path, pre_content, post_content) — post-first, pre-lazy: clean post → return immediately, one lint call; errors found + pre available → diff and surface only new errors.
WriteResult gains lint field (mirrors PatchResult.lint).
write_file() captures pre-content for in-process-lintable extensions, calls _check_lint_delta post-write.
patch_replace() switches from _check_lint to _check_lint_delta (already had pre-content in scope).

tools/file_tools.py

Schema description for write_file mentions the post-write lint.

tests/tools/test_file_operations_edge_cases.py

6 existing brace-path tests rewired to .js (still shell-linted) since .py now takes the in-process path.
+9 TestCheckLintInproc tests covering Python/JSON/YAML/TOML clean + error paths and explicit-content short-circuit.
+5 TestCheckLintDelta tests covering hot-path (post clean → no pre-lint), new-file path (pre=None → all errors surface), broken-becomes-clean, new-error-on-clean-base (delta filter works), and the single-error-parser caveat (pre-existing flagged but not silently dropped).

Single-error parser caveat (documented in code)

ast.parse and json.loads stop at the first syntax error. If a pre-existing error masks everything past the edit region, the delta filter can't prove the new content is clean. When pre and post errors match exactly, this PR reports success=False with message="Pre-existing lint errors — this edit didn't introduce new ones but the file is still broken." — transparent about state, no false negatives.

Performance

4.89ms average across 100 clean .py writes (in-process lint + shell write). Clean-write hot path matches main's cost for patch. Pre-content capture is skipped when no linter applies to the extension.

Validation

scripts/run_tests.sh tests/tools/test_file_operations*.py tests/tools/test_file_tools*.py tests/tools/test_file_write_safety.py tests/tools/test_write_deny.py tests/tools/test_patch_parser.py tests/tools/test_file_ops_cwd_tracking.py — 228/228 pass
Live E2E reproduced the exact tips.py corruption incident: broken Python written via write_file_tool, JSON output shows "lint": {"status": "error", "output": "SyntaxError: invalid syntax. Perhaps you forgot a comma? (line 6, column 5)"} — the agent gets the error on the very next turn, can self-correct.

The 18 failures in the broader tests/tools/ sweep (test_delegate.py, test_browser_chromium_check.py, test_daytona_environment.py, etc.) are pre-existing on main and unrelated — verified by stashing the change and re-running.

…ON/YAML/TOML/Python in-process linters Closes the gap where write_file skipped the post-edit syntax check that patch already ran, so silent file corruption (bad quote escaping, truncated writes, etc.) would persist on disk until a later read. ## Changes tools/file_operations.py: - Add in-process linters for .py, .json, .yaml, .toml (LINTERS_INPROC). Python uses ast.parse, JSON/YAML/TOML use stdlib/PyYAML parsers. Zero subprocess overhead; preferred over shell linters when both apply. - _check_lint() now accepts optional content and routes to in-process linter first. Shell linter (py_compile, node --check, tsc, go vet, rustfmt) remains the fallback for languages without an in-process equivalent. - New _check_lint_delta() implements the post-first/pre-lazy pattern borrowed from Cline and OpenCode: lint post-write state first; only if errors are found AND pre-content was captured does it lint the pre-state and diff. If the pre-existing file had the SAME errors the edit didn't introduce anything new, so the file is reported as 'still broken, pre-existing' with success=False but a message explaining the errors were pre-existing. If the edit introduced genuinely new errors, those are surfaced and pre-existing ones are filtered out. - WriteResult gains a lint field. - write_file() captures pre-content for in-process-lintable extensions and calls _check_lint_delta after a successful write. - patch_replace() switches from _check_lint to _check_lint_delta, reusing the pre-edit content it already has in scope. tools/file_tools.py: - Update write_file schema description to mention the post-write lint. tests/tools/test_file_operations_edge_cases.py: - Update existing brace-path tests to use .js (shell linter) now that .py is in-process. - Add TestCheckLintInproc (9 tests) covering Python/JSON/YAML/TOML in-process linters. - Add TestCheckLintDelta (5 tests) covering the post-first/pre-lazy short-circuit, new-file path, and the single-error-parser caveat. ## Performance In-process linters are microseconds per call (ast.parse, json.loads). The hot path (clean write) runs exactly one lint — matches main's cost for patch. Pre-state capture is skipped when the file has no applicable linter. Measured 4.89ms/write average over 100 .py writes including lint. ## Inspiration - Cline's DiffViewProvider.getNewDiagnosticProblems() — filters pre-write diagnostics from post-write diagnostics (src/integrations/editor/DiffViewProvider.ts). - OpenCode's WriteTool — runs lsp.diagnostics() after write and appends errors to tool output (packages/opencode/src/tool/write.ts). - Claude Code's DiagnosticTrackingService — captures baseline via beforeFileEdited() and returns new-diagnostics-only from getNewDiagnostics() (src/services/diagnosticTracking.ts). ## Validation - tests/tools/test_file_operations.py + test_file_operations_edge_cases.py + test_file_tools.py + test_file_tools_live.py + test_file_write_safety.py + test_write_deny.py + test_patch_parser.py + test_file_ops_cwd_tracking.py: 228 passed locally. - Live E2E reproduction of the tips.py corruption incident: broken content written; lint field surfaces 'SyntaxError: invalid syntax. Perhaps you forgot a comma? (line 6, column 5)' — the exact error that would have self-corrected the bug on the next turn.

…ON/YAML/TOML/Python in-process linters (#20191) Closes the gap where write_file skipped the post-edit syntax check that patch already ran, so silent file corruption (bad quote escaping, truncated writes, etc.) would persist on disk until a later read. ## Changes tools/file_operations.py: - Add in-process linters for .py, .json, .yaml, .toml (LINTERS_INPROC). Python uses ast.parse, JSON/YAML/TOML use stdlib/PyYAML parsers. Zero subprocess overhead; preferred over shell linters when both apply. - _check_lint() now accepts optional content and routes to in-process linter first. Shell linter (py_compile, node --check, tsc, go vet, rustfmt) remains the fallback for languages without an in-process equivalent. - New _check_lint_delta() implements the post-first/pre-lazy pattern borrowed from Cline and OpenCode: lint post-write state first; only if errors are found AND pre-content was captured does it lint the pre-state and diff. If the pre-existing file had the SAME errors the edit didn't introduce anything new, so the file is reported as 'still broken, pre-existing' with success=False but a message explaining the errors were pre-existing. If the edit introduced genuinely new errors, those are surfaced and pre-existing ones are filtered out. - WriteResult gains a lint field. - write_file() captures pre-content for in-process-lintable extensions and calls _check_lint_delta after a successful write. - patch_replace() switches from _check_lint to _check_lint_delta, reusing the pre-edit content it already has in scope. tools/file_tools.py: - Update write_file schema description to mention the post-write lint. tests/tools/test_file_operations_edge_cases.py: - Update existing brace-path tests to use .js (shell linter) now that .py is in-process. - Add TestCheckLintInproc (9 tests) covering Python/JSON/YAML/TOML in-process linters. - Add TestCheckLintDelta (5 tests) covering the post-first/pre-lazy short-circuit, new-file path, and the single-error-parser caveat. ## Performance In-process linters are microseconds per call (ast.parse, json.loads). The hot path (clean write) runs exactly one lint — matches main's cost for patch. Pre-state capture is skipped when the file has no applicable linter. Measured 4.89ms/write average over 100 .py writes including lint. ## Inspiration - Cline's DiffViewProvider.getNewDiagnosticProblems() — filters pre-write diagnostics from post-write diagnostics (src/integrations/editor/DiffViewProvider.ts). - OpenCode's WriteTool — runs lsp.diagnostics() after write and appends errors to tool output (packages/opencode/src/tool/write.ts). - Claude Code's DiagnosticTrackingService — captures baseline via beforeFileEdited() and returns new-diagnostics-only from getNewDiagnostics() (src/services/diagnosticTracking.ts). ## Validation - tests/tools/test_file_operations.py + test_file_operations_edge_cases.py + test_file_tools.py + test_file_tools_live.py + test_file_write_safety.py + test_write_deny.py + test_patch_parser.py + test_file_ops_cwd_tracking.py: 228 passed locally. - Live E2E reproduction of the tips.py corruption incident: broken content written; lint field surfaces 'SyntaxError: invalid syntax. Perhaps you forgot a comma? (line 6, column 5)' — the exact error that would have self-corrected the bug on the next turn.

…ON/YAML/TOML/Python in-process linters (NousResearch#20191) Closes the gap where write_file skipped the post-edit syntax check that patch already ran, so silent file corruption (bad quote escaping, truncated writes, etc.) would persist on disk until a later read. ## Changes tools/file_operations.py: - Add in-process linters for .py, .json, .yaml, .toml (LINTERS_INPROC). Python uses ast.parse, JSON/YAML/TOML use stdlib/PyYAML parsers. Zero subprocess overhead; preferred over shell linters when both apply. - _check_lint() now accepts optional content and routes to in-process linter first. Shell linter (py_compile, node --check, tsc, go vet, rustfmt) remains the fallback for languages without an in-process equivalent. - New _check_lint_delta() implements the post-first/pre-lazy pattern borrowed from Cline and OpenCode: lint post-write state first; only if errors are found AND pre-content was captured does it lint the pre-state and diff. If the pre-existing file had the SAME errors the edit didn't introduce anything new, so the file is reported as 'still broken, pre-existing' with success=False but a message explaining the errors were pre-existing. If the edit introduced genuinely new errors, those are surfaced and pre-existing ones are filtered out. - WriteResult gains a lint field. - write_file() captures pre-content for in-process-lintable extensions and calls _check_lint_delta after a successful write. - patch_replace() switches from _check_lint to _check_lint_delta, reusing the pre-edit content it already has in scope. tools/file_tools.py: - Update write_file schema description to mention the post-write lint. tests/tools/test_file_operations_edge_cases.py: - Update existing brace-path tests to use .js (shell linter) now that .py is in-process. - Add TestCheckLintInproc (9 tests) covering Python/JSON/YAML/TOML in-process linters. - Add TestCheckLintDelta (5 tests) covering the post-first/pre-lazy short-circuit, new-file path, and the single-error-parser caveat. ## Performance In-process linters are microseconds per call (ast.parse, json.loads). The hot path (clean write) runs exactly one lint — matches main's cost for patch. Pre-state capture is skipped when the file has no applicable linter. Measured 4.89ms/write average over 100 .py writes including lint. ## Inspiration - Cline's DiffViewProvider.getNewDiagnosticProblems() — filters pre-write diagnostics from post-write diagnostics (src/integrations/editor/DiffViewProvider.ts). - OpenCode's WriteTool — runs lsp.diagnostics() after write and appends errors to tool output (packages/opencode/src/tool/write.ts). - Claude Code's DiagnosticTrackingService — captures baseline via beforeFileEdited() and returns new-diagnostics-only from getNewDiagnostics() (src/services/diagnosticTracking.ts). ## Validation - tests/tools/test_file_operations.py + test_file_operations_edge_cases.py + test_file_tools.py + test_file_tools_live.py + test_file_write_safety.py + test_write_deny.py + test_patch_parser.py + test_file_ops_cwd_tracking.py: 228 passed locally. - Live E2E reproduction of the tips.py corruption incident: broken content written; lint field surfaces 'SyntaxError: invalid syntax. Perhaps you forgot a comma? (line 6, column 5)' — the exact error that would have self-corrected the bug on the next turn.

teknium1 merged commit 5168226 into main May 5, 2026
10 of 11 checks passed

teknium1 deleted the hermes/hermes-205797f8 branch May 5, 2026 11:54

alt-glitch added type/feature New feature or request P2 Medium — degraded but workaround exists tool/file File tools (read, write, patch, search) comp/tools Tool registry, model_tools, toolsets labels May 5, 2026

BrewTestBot mentioned this pull request May 7, 2026

hermes-agent 2026.5.7 Homebrew/homebrew-core#281437

Merged

1 task

github-actions Bot mentioned this pull request May 8, 2026

chore: bump NousResearch/hermes-agent version from v2026.4.30 to v2026.5.7 Docker-Hub-sirmark/docker-hermes-agent#5

Merged

subinium mentioned this pull request May 11, 2026

feat(tools): post-write linting on file writes — Python/JSON/YAML/TOML (Hermes v0.13 parity) subinium/CrowClaw#310

Closed

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(file_tools): post-write delta lint on write_file + patch, add JSON/YAML/TOML/Python in-proc linters#20191

feat(file_tools): post-write delta lint on write_file + patch, add JSON/YAML/TOML/Python in-proc linters#20191
teknium1 merged 1 commit into
mainfrom
hermes/hermes-205797f8

teknium1 commented May 5, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

teknium1 commented May 5, 2026

Summary

Context

Changes

Single-error parser caveat (documented in code)

Performance

Validation

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants