fix(agent): keep image tool results from poisoning text-only sessions by teknium1 · Pull Request #25925 · NousResearch/hermes-agent

teknium1 · 2026-05-14T19:59:00Z

Salvage of #25903 onto current main, original commit by @helix4u preserved.

Summary

When a multimodal tool result (computer_use, vision_analyze, browser_vision) comes back on a text-only provider/model, store a text-only fallback into canonical history instead of the raw image_url content. For computer_use specifically, write a clear "switch to a vision-capable model" error JSON; other multimodal tools fall back to the result's text_summary.
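The write-site fallback described above could look roughly like the sketch below. It is an illustration, not the code from this PR: the helper names (`has_image_parts`, `history_content_for`), the `supports_vision` flag, the shape of the result dict, and the error wording are all assumptions. Only the tool name `computer_use`, the `image_url` part type, and the `text_summary` field come from the description.

```python
import json
from typing import Any


def has_image_parts(content: Any) -> bool:
    """True if content is a list of parts that includes an image_url part."""
    if not isinstance(content, list):
        return False
    return any(isinstance(p, dict) and p.get("type") == "image_url" for p in content)


def history_content_for(tool_name: str, result: dict, supports_vision: bool) -> Any:
    """Pick what to store in canonical history for one tool result (hypothetical helper)."""
    content = result.get("content")
    # Vision-capable models and plain-text results are stored unchanged.
    if supports_vision or not has_image_parts(content):
        return content
    if tool_name == "computer_use":
        # Assumed wording; the PR only says the error tells the user to switch models.
        return json.dumps({
            "error": "computer_use returned an image, but the current model is "
                     "text-only; switch to a vision-capable model."
        })
    # Other multimodal tools fall back to their text summary.
    return result.get("text_summary") or "[image result omitted: model is text-only]"
```

Because the substitution happens before the message is stored, later operations that replay history never see the image part.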

Also adds DeepSeek's exact 400 wording (``unknown variant `image_url`, expected `text` ``) to the existing adaptive image-rejection recovery list so an already-poisoned session can self-heal on the next retry.
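As a rough sketch of how such a recovery list might be used: match the provider's 400 message against known phrases, then retry with image parts stripped. The names `IMAGE_REJECTION_PHRASES`, `is_image_rejection`, and `strip_image_parts` are hypothetical, and the list here holds only the DeepSeek phrase quoted above; the project's existing entries are not shown in this PR and are omitted.

```python
from typing import Any

# Only the DeepSeek wording quoted in this PR; other entries are omitted.
IMAGE_REJECTION_PHRASES = (
    "unknown variant `image_url`, expected `text`",
)


def is_image_rejection(status_code: int, message: str) -> bool:
    """True if a 400 response looks like the provider rejecting image content."""
    if status_code != 400:
        return False
    lowered = message.lower()
    return any(phrase in lowered for phrase in IMAGE_REJECTION_PHRASES)


def strip_image_parts(messages: list[dict[str, Any]]) -> list[dict[str, Any]]:
    """Return a copy of messages with list content collapsed to its text parts."""
    cleaned = []
    for msg in messages:
        content = msg.get("content")
        if isinstance(content, list):
            texts = [
                part.get("text", "")
                for part in content
                if isinstance(part, dict) and part.get("type") == "text"
            ]
            joined = "\n".join(t for t in texts if t)
            # New dict, so the caller's history is not mutated.
            msg = {**msg, "content": joined or "[image omitted]"}
        cleaned.append(msg)
    return cleaned
```

A caller would check `is_image_rejection` on a failed request and, if it matches, resend `strip_image_parts(messages)`.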

Root cause

_prepare_messages_for_non_vision_model runs on the legacy/codex_responses branches of _build_chat_kwargs but not on the provider-profile branch (registered providers like DeepSeek). Tracked broadly as #23733; this PR fixes it at the tool-result write site, which keeps history clean across /compress, /resume, /model.
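A toy model of the gap, under heavy assumptions: `build_kwargs` and `strip_images` below are stand-ins, not the real `_build_chat_kwargs` or `_prepare_messages_for_non_vision_model`, and the branch names are just labels taken from the description. It shows why cleaning history at the write site protects every branch, including one that skips request-time preprocessing.

```python
def strip_images(messages):
    """Stand-in for request-time preprocessing: replace non-string content."""
    return [
        m if isinstance(m["content"], str) else {**m, "content": "[image omitted]"}
        for m in messages
    ]


def build_kwargs(branch, messages):
    """Toy request builder: only two of the three branches preprocess."""
    if branch in ("legacy", "codex_responses"):
        messages = strip_images(messages)
    # In this toy model, "provider_profile" passes history through unchanged.
    return {"messages": messages}


def contains_image(kwargs):
    return any(isinstance(m["content"], list) for m in kwargs["messages"])


# History poisoned with a raw image part vs. history cleaned at write time.
poisoned = [{"role": "tool", "content": [{"type": "image_url", "image_url": {"url": "data:..."}}]}]
clean = [{"role": "tool", "content": "Screenshot shows a login form."}]
```

With `poisoned` history, the `provider_profile` branch still sends the image part; with `clean` history, no branch does.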

Validation

`scripts/run_tests.sh tests/tools/test_computer_use.py::TestRunAgentMultimodalHelpers tests/run_agent/test_vision_aware_preprocessing.py`: 19/19 pass

Credit

Closes #25903. Original commit ae3e79637 by @helix4u cherry-picked onto current main with authorship preserved.

Related: #23733, #23743, #23750, #24070.

github-actions · 2026-05-14T20:00:13Z

🔎 Lint report: `hermes/hermes-61c456c4` vs `origin/main`

ruff

Total: 0 on HEAD, 0 on base (➖ 0)

🆕 New issues: none

✅ Fixed issues: none

Unchanged: 0 pre-existing issues carried over.

ty (type checker)

Total: 8338 on HEAD, 8340 on base (✅ -2)

🆕 New issues (3):

| Rule | Count |
| --- | --- |
| `invalid-argument-type` | 3 |

First entries

run_agent.py:13751: [invalid-argument-type] invalid-argument-type: Argument to function `len` is incorrect: Expected `Sized`, found `(str & ~AlwaysFalsy) | (dict[Unknown, Unknown] & ~AlwaysFalsy) | (Any & ~AlwaysFalsy) | ... omitted 3 union elements`
run_agent.py:13748: [invalid-argument-type] invalid-argument-type: Argument to function `_is_oauth_token` is incorrect: Expected `str`, found `str | dict[Unknown, Unknown] | Any | ... omitted 3 union elements`
run_agent.py:7480: [invalid-argument-type] invalid-argument-type: Argument to function `build_anthropic_client` is incorrect: Expected `str`, found `str | dict[Unknown, Unknown] | Any | ... omitted 3 union elements`

✅ Fixed issues (4):

| Rule | Count |
| --- | --- |
| `invalid-argument-type` | 4 |

First entries

run_agent.py:7480: [invalid-argument-type] invalid-argument-type: Argument to function `build_anthropic_client` is incorrect: Expected `str`, found `str | dict[Unknown | str, Unknown | str | dict[str, str]] | Any | ... omitted 3 union elements`
run_agent.py:13711: [invalid-argument-type] invalid-argument-type: Argument to function `_is_oauth_token` is incorrect: Expected `str`, found `str | dict[Unknown | str, Unknown | str | dict[str, str]] | Any | ... omitted 3 union elements`
run_agent.py:11522: [invalid-argument-type] invalid-argument-type: Method `__getitem__` of type `Overload[(key: SupportsIndex | slice[SupportsIndex | None, SupportsIndex | None, SupportsIndex | None], /) -> LiteralString, (key: SupportsIndex | slice[SupportsIndex | None, SupportsIndex | None, SupportsIndex | None], /) -> str]` cannot be called with key of type `Literal["content"]` on object of type `str`
run_agent.py:13714: [invalid-argument-type] invalid-argument-type: Argument to function `len` is incorrect: Expected `Sized`, found `(str & ~AlwaysFalsy) | (dict[Unknown | str, Unknown | str | dict[str, str]] & ~AlwaysFalsy) | (Any & ~AlwaysFalsy) | ... omitted 3 union elements`

Unchanged: 4384 pre-existing issues carried over.

Diagnostics are surfaced as warnings — this check never fails the build.

fix(agent): keep image tool results from poisoning text-only sessions

c65d7d6

github-advanced-security AI found potential problems May 14, 2026


Comment thread run_agent.py Dismissed

Comment thread run_agent.py Dismissed

alt-glitch added labels May 14, 2026: type/bug (Something isn't working), P1 (High: major feature broken, no workaround), comp/agent (Core agent loop, run_agent.py, prompt builder), tool/vision (Vision analysis and image generation)

teknium1 merged commit a28add1 into main May 14, 2026
17 of 19 checks passed

teknium1 deleted the hermes/hermes-61c456c4 branch May 14, 2026 21:52

teknium1 mentioned this pull request May 14, 2026

fix(agent): keep image tool results from poisoning text-only sessions #25903

Closed

19 tasks

This was referenced May 15, 2026

fix: DeepSeek Chat Completions API rejects image_url content blocks #26364

Open

fix: strip image parts for non-vision models with provider profiles #26498

Closed

kshitijk4poor mentioned this pull request May 16, 2026

Bug: image routing bypassed on api_server /v1/chat/completions — non-vision models receive raw image_url (400) #23733

Closed

alt-glitch mentioned this pull request May 17, 2026

[Bug] computer_use multimodal tool message causes 400 error on providers that don't support multimodal tool content (e.g. Xiaomi MiMo) #27344

Closed

alt-glitch mentioned this pull request May 31, 2026

fix(run_agent): non-vision models get JSON error instead of text summary for computer_use captures #35817

Open
