fix(compressor): summary role can violate consecutive-role constraint#1720

Merged

teknium1 merged 1 commit into

mainfrom

fix/compressor-consecutive-role-violation

Mar 17, 2026

teknium1 commented Mar 17, 2026

Contributor

Summary

The context compressor's summary message role was determined only by the last head message's role, ignoring what the first tail message's role is. When the last head message was assistant and the first tail message was user, the summary role was set to user — creating consecutive user messages that Anthropic's API rejects.

What changed

agent/context_compressor.py: Now checks both the head and tail neighbors when choosing the summary role. Prioritizes not colliding with the head (already committed), then avoids the tail collision if possible without re-colliding with the head.

Test plan

python -m pytest tests/ -n0 -q -k compress → 61 passed, 14 skipped ✔
Existing test test_summary_role_avoids_consecutive_user_messages still passes
Edge case: when both neighbors conflict (impossible to avoid both), prefers head-safe


          fix(compressor): summary role can create consecutive same-role messages

344f377

The summary message role was determined only by the last head message,
ignoring the first tail message. This could create consecutive user
messages (rejected by Anthropic) when the tail started with 'user'.

Now checks both neighbors. Priority: avoid colliding with the head
(already committed). If the chosen role also collides with the tail,
flip it — but only if flipping wouldn't re-collide with the head.

teknium1 merged commit ec272ca into main

1 check passed

teknium1 added a commit that referenced this pull request


          feat: agent resilience — handle truncated tool calls, empty responses…

2ce9edc

…, tool error sanitization

Three resilience features ported from Ironclaw:

1. Discard incomplete tool calls (ironclaw#1632)
   When finish_reason='length' and tool calls are present, they're likely
   incomplete. Discard them, inject a summarize notice. After 3 consecutive
   occurrences, temporarily disable tools.

2. Empty response recovery (ironclaw#1677 + #1720)
   When the LLM returns empty (no content, no tool calls):
   - If meaningful output exists earlier, treat as completion
   - Otherwise nudge once, then fail gracefully
   Max 2 consecutive empties before giving up.

3. Sanitize tool error results (ironclaw#1639)
   Strip XML boundary markers, CDATA sections, and code fences from error
   messages before sending to LLM. Cap at 2000 chars. Prevents
   injection attacks via crafted tool error messages.

18 new tests.

teknium1 mentioned this pull request

feat: agent resilience — truncated tool calls, empty response recovery, error sanitization #3838

Closed

angelburgosrosado pushed a commit to angelburgosrosado/hermes-agent that referenced this pull request


          Merge pull request NousResearch#1720 from NousResearch/fix/compressor…

40668dd

…-consecutive-role-violation

fix(compressor): summary role can violate consecutive-role constraint

angelburgosrosado pushed a commit to angelburgosrosado/hermes-agent that referenced this pull request


          feat: agent resilience — handle truncated tool calls, empty responses…

89a7c17

…, tool error sanitization

Three resilience features ported from Ironclaw:

1. Discard incomplete tool calls (ironclaw#1632)
   When finish_reason='length' and tool calls are present, they're likely
   incomplete. Discard them, inject a summarize notice. After 3 consecutive
   occurrences, temporarily disable tools.

2. Empty response recovery (ironclaw#1677 + NousResearch#1720)
   When the LLM returns empty (no content, no tool calls):
   - If meaningful output exists earlier, treat as completion
   - Otherwise nudge once, then fail gracefully
   Max 2 consecutive empties before giving up.

3. Sanitize tool error results (ironclaw#1639)
   Strip XML boundary markers, CDATA sections, and code fences from error
   messages before sending to LLM. Cap at 2000 chars. Prevents
   injection attacks via crafted tool error messages.

18 new tests.

02356abc pushed a commit to 02356abc/hermes-agent that referenced this pull request


          Merge pull request NousResearch#1720 from NousResearch/fix/compressor…

56d47c9

…-consecutive-role-violation

fix(compressor): summary role can violate consecutive-role constraint

teknium1 mentioned this pull request

security: sanitize tool error strings before injecting into model context (salvage of #3838 piece 3/3) #26823

Merged

teknium1 added a commit that referenced this pull request


          security: sanitize tool error strings before injecting into model con…

627f8a5

…text (#26823)

Adds _sanitize_tool_error() in model_tools and routes both error paths
through it: registry.dispatch's try/except (the primary path for tool
exceptions) and handle_function_call's outer except (defense in depth).

Stripping targets structural framing tokens that the model itself can
react to even though json.dumps already handles wire-layer escaping:
XML role tags (tool_call, function_call, result, response, output,
input, system, assistant, user), CDATA sections, and markdown code
fences. Caps message body at 2000 chars and wraps with [TOOL_ERROR]
prefix.

Defense-in-depth: a tool exception carrying '<tool_call>...' won't
break message framing (json escapes it), but the model still reads
those tokens and they nudge it toward role-confusion framing.

Ported from ironclaw#1639 (one piece of #3838's three-feature scout).
The truncated-tool-call (#1632) and empty-response-recovery (#1677,
#1720) pieces are skipped because main now implements both far more
thoroughly (run_agent.py L8147/L12209/L13012 for truncation retry +
length rewrite; L4500/L15090+ for empty-response scaffolding stripper,
multi-stage nudge, fallback model activation).

olympus-terminal pushed a commit to olympus-terminal/hermes-agent that referenced this pull request


          Merge pull request NousResearch#1720 from NousResearch/fix/compressor…

fd47a41

…-consecutive-role-violation

fix(compressor): summary role can violate consecutive-role constraint

DIZ-admin pushed a commit to DIZ-admin/hermes-agent that referenced this pull request


          security: sanitize tool error strings before injecting into model con…

4fa2d45

…text (NousResearch#26823)

Adds _sanitize_tool_error() in model_tools and routes both error paths
through it: registry.dispatch's try/except (the primary path for tool
exceptions) and handle_function_call's outer except (defense in depth).

Stripping targets structural framing tokens that the model itself can
react to even though json.dumps already handles wire-layer escaping:
XML role tags (tool_call, function_call, result, response, output,
input, system, assistant, user), CDATA sections, and markdown code
fences. Caps message body at 2000 chars and wraps with [TOOL_ERROR]
prefix.

Defense-in-depth: a tool exception carrying '<tool_call>...' won't
break message framing (json escapes it), but the model still reads
those tokens and they nudge it toward role-confusion framing.

Ported from ironclaw#1639 (one piece of NousResearch#3838's three-feature scout).
The truncated-tool-call (NousResearch#1632) and empty-response-recovery (NousResearch#1677,
NousResearch#1720) pieces are skipped because main now implements both far more
thoroughly (run_agent.py L8147/L12209/L13012 for truncation retry +
length rewrite; L4500/L15090+ for empty-response scaffolding stripper,
multi-stage nudge, fallback model activation).

venyon2k pushed a commit to venyon2k/hermes-agent that referenced this pull request


          security: sanitize tool error strings before injecting into model con…

6e58a12

…text (NousResearch#26823)

Adds _sanitize_tool_error() in model_tools and routes both error paths
through it: registry.dispatch's try/except (the primary path for tool
exceptions) and handle_function_call's outer except (defense in depth).

Stripping targets structural framing tokens that the model itself can
react to even though json.dumps already handles wire-layer escaping:
XML role tags (tool_call, function_call, result, response, output,
input, system, assistant, user), CDATA sections, and markdown code
fences. Caps message body at 2000 chars and wraps with [TOOL_ERROR]
prefix.

Defense-in-depth: a tool exception carrying '<tool_call>...' won't
break message framing (json escapes it), but the model still reads
those tokens and they nudge it toward role-confusion framing.

Ported from ironclaw#1639 (one piece of NousResearch#3838's three-feature scout).
The truncated-tool-call (NousResearch#1632) and empty-response-recovery (NousResearch#1677,
NousResearch#1720) pieces are skipped because main now implements both far more
thoroughly (run_agent.py L8147/L12209/L13012 for truncation retry +
length rewrite; L4500/L15090+ for empty-response scaffolding stripper,
multi-stage nudge, fallback model activation).

clckmedia pushed a commit to clckmedia/hermes-agent that referenced this pull request


          security: sanitize tool error strings before injecting into model con…

5c17d60

…text (NousResearch#26823)

Adds _sanitize_tool_error() in model_tools and routes both error paths
through it: registry.dispatch's try/except (the primary path for tool
exceptions) and handle_function_call's outer except (defense in depth).

Stripping targets structural framing tokens that the model itself can
react to even though json.dumps already handles wire-layer escaping:
XML role tags (tool_call, function_call, result, response, output,
input, system, assistant, user), CDATA sections, and markdown code
fences. Caps message body at 2000 chars and wraps with [TOOL_ERROR]
prefix.

Defense-in-depth: a tool exception carrying '<tool_call>...' won't
break message framing (json escapes it), but the model still reads
those tokens and they nudge it toward role-confusion framing.

Ported from ironclaw#1639 (one piece of NousResearch#3838's three-feature scout).
The truncated-tool-call (NousResearch#1632) and empty-response-recovery (NousResearch#1677,
NousResearch#1720) pieces are skipped because main now implements both far more
thoroughly (run_agent.py L8147/L12209/L13012 for truncation retry +
length rewrite; L4500/L15090+ for empty-response scaffolding stripper,
multi-stage nudge, fallback model activation).

(cherry picked from commit 627f8a5)

CumulusService pushed a commit to Cumulus-Service-GmbH/hermes-agent that referenced this pull request


          feat: agent resilience — handle truncated tool calls, empty responses…

08416a3

…, tool error sanitization

Three resilience features ported from Ironclaw:

1. Discard incomplete tool calls (ironclaw#1632)
   When finish_reason='length' and tool calls are present, they're likely
   incomplete. Discard them, inject a summarize notice. After 3 consecutive
   occurrences, temporarily disable tools.

2. Empty response recovery (ironclaw#1677 + NousResearch#1720)
   When the LLM returns empty (no content, no tool calls):
   - If meaningful output exists earlier, treat as completion
   - Otherwise nudge once, then fail gracefully
   Max 2 consecutive empties before giving up.

3. Sanitize tool error results (ironclaw#1639)
   Strip XML boundary markers, CDATA sections, and code fences from error
   messages before sending to LLM. Cap at 2000 chars. Prevents
   injection attacks via crafted tool error messages.

18 new tests.

gweeteve pushed a commit to gweeteve/hermes-agent that referenced this pull request


          Merge pull request NousResearch#1720 from NousResearch/fix/compressor…

0a711d0

…-consecutive-role-violation

fix(compressor): summary role can violate consecutive-role constraint

gweeteve pushed a commit to gweeteve/hermes-agent that referenced this pull request


          security: sanitize tool error strings before injecting into model con…

db4f5be

…text (NousResearch#26823)

Adds _sanitize_tool_error() in model_tools and routes both error paths
through it: registry.dispatch's try/except (the primary path for tool
exceptions) and handle_function_call's outer except (defense in depth).

Stripping targets structural framing tokens that the model itself can
react to even though json.dumps already handles wire-layer escaping:
XML role tags (tool_call, function_call, result, response, output,
input, system, assistant, user), CDATA sections, and markdown code
fences. Caps message body at 2000 chars and wraps with [TOOL_ERROR]
prefix.

Defense-in-depth: a tool exception carrying '<tool_call>...' won't
break message framing (json escapes it), but the model still reads
those tokens and they nudge it toward role-confusion framing.

Ported from ironclaw#1639 (one piece of NousResearch#3838's three-feature scout).
The truncated-tool-call (NousResearch#1632) and empty-response-recovery (NousResearch#1677,
NousResearch#1720) pieces are skipped because main now implements both far more
thoroughly (run_agent.py L8147/L12209/L13012 for truncation retry +
length rewrite; L4500/L15090+ for empty-response scaffolding stripper,
multi-stage nudge, fallback model activation).

Egavasyug pushed a commit to Egavasyug/hermes-agent that referenced this pull request


          Merge pull request NousResearch#1720 from NousResearch/fix/compressor…

72c0da0

…-consecutive-role-violation

fix(compressor): summary role can violate consecutive-role constraint

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet