fix: prevent infinite 400 loop on context overflow + block prompt injection via cache files by teknium1 · Pull Request #1639 · NousResearch/hermes-agent

teknium1 · 2026-03-17T08:45:57Z

Summary

Fix 1: Prevent infinite 400 failure loop (#1630)

When a gateway session exceeds the model's context window, Anthropic may return a generic 400 invalid_request_error with just "Error" as the message. This bypassed the phrase-based context-length detection, causing the agent to treat it as non-retryable. The failed user message was persisted, making the session larger — creating an infinite loop.

Three-layer fix:

Agent heuristic — generic 400 + short error + large session → treat as context overflow and compress
Skip persistence on failure — don't write failed messages to transcript (both agent + gateway)
Smarter error messages — suggest /compact or /reset instead of generic 'try again'

Fix 2: Block prompt injection via skills hub cache (#1558, salvaged from PR #1562 by @ygd58)

A user experienced the agent outputting threatening/adversarial text after it read a 3.5MB hub catalog cache file containing prompt injection content.

Two-layer fix (cherry-picked from @ygd58's PR):

read_file block — denies access to ~/.hermes/skills/.hub/ directory (index-cache, catalog files)
skill_view detection — warns when skills loaded from untrusted paths or contain injection patterns

Test plan

14 new tests for [Bug]: Gateway enters infinite 400 failure loop when Telegram session exceeds context limits #1630 in tests/test_1630_context_overflow_loop.py
158 existing file/skill tests pass
All 942 gateway tests pass
All 170 agent tests pass
Full suite: 4705 passed

When a gateway session exceeds the model's context window, Anthropic may return a generic 400 invalid_request_error with just 'Error' as the message. This bypassed the phrase-based context-length detection, causing the agent to treat it as a non-retryable client error. Worse, the failed user message was still persisted to the transcript, making the session even larger on each attempt — creating an infinite loop. Three-layer fix: 1. run_agent.py — Fallback heuristic: when a 400 error has a very short generic message AND the session is large (>40% of context or >80 messages), treat it as a probable context overflow and trigger compression instead of aborting. 2. run_agent.py + gateway/run.py — Don't persist failed messages: when the agent returns failed=True before generating any response, skip writing the user's message to the transcript/DB. This prevents the session from growing on each failure. 3. gateway/run.py — Smarter error messages: detect context-overflow failures and suggest /compact or /reset specifically, instead of a generic 'try again' that will fail identically.

Adds two security layers to prevent prompt injection via skills hub cache files (#1558): 1. read_file: blocks direct reads of ~/.hermes/skills/.hub/ directory (index-cache, catalog files). The 3.5MB clawhub_catalog_v1.json was the original injection vector — untrusted skill descriptions in the catalog contained adversarial text that the model executed. 2. skill_view: warns when skills are loaded from outside the trusted ~/.hermes/skills/ directory, and detects common injection patterns in skill content ("ignore previous instructions", "<system>", etc.). Cherry-picked from PR #1562 by ygd58.

teknium1 and others added 3 commits March 17, 2026 01:45

Merge remote-tracking branch 'origin/main' into hermes/hermes-6bb9911e

e1e702a

teknium1 changed the title ~~fix: prevent infinite 400 failure loop on context overflow~~ fix: prevent infinite 400 loop on context overflow + block prompt injection via cache files Mar 17, 2026

teknium1 mentioned this pull request Mar 17, 2026

fix(skills): detect prompt injection patterns and warn on untrusted skill paths #1562

Closed

teknium1 merged commit 96dac22 into main Mar 17, 2026
1 check failed

This was referenced Mar 17, 2026

[Bug]: Gateway enters infinite 400 failure loop when Telegram session exceeds context limits #1630

Closed

[Bug]: Not sure how to classify this odd response #1558

Closed

Westown666 mentioned this pull request Apr 14, 2026

Gateway becomes unresponsive when context compression is exhausted in long-running group chat sessions #9893

Closed

teknium1 mentioned this pull request May 16, 2026

security: sanitize tool error strings before injecting into model context (salvage of #3838 piece 3/3) #26823

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: prevent infinite 400 loop on context overflow + block prompt injection via cache files#1639

fix: prevent infinite 400 loop on context overflow + block prompt injection via cache files#1639
teknium1 merged 3 commits into
mainfrom
hermes/hermes-6bb9911e

teknium1 commented Mar 17, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

teknium1 commented Mar 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Fix 1: Prevent infinite 400 failure loop (#1630)

Fix 2: Block prompt injection via skills hub cache (#1558, salvaged from PR #1562 by @ygd58)

Test plan

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

teknium1 commented Mar 17, 2026 •

edited

Loading