[Bug]: Context compression failure uses static placeholder instead of preserved message tail — context permanently lost

**Bug Description**
When a conversation triggers context compression and the summarisation call fails (HTTP 529/500/timeout, etc.), Hermes injects a static fallback text into the conversation instead of the genuinely preserved message tail. The model then completely loses the current task context and responds to stale topics from several rounds ago.

**Specific symptoms**
- User is working on a task/topic
- Conversation hits compression trigger; Hermes calls the LLM for summarisation
- Summarisation call fails (observed: MiniMax HTTP 529)
- Model receives static placeholder text (\"Summary generation was unavailable. N conversation turns were removed...\") instead of the actual preserved message tail
- User asks model to continue the interrupted task
- Model starts responding to content from several rounds ago, not the interrupted task
- User must re-explain the task from scratch to recover

**Contrast with OpenClaw**
OpenClaw using the same MiniMax API does NOT have this problem with the same 529 scenario. This suggests the issue is in Hermes fallback handling, not the API provider itself.

**Root Cause (preliminary)**
Two-part failure:
1. Summarisation LLM call fails due to provider error (529/500/timeout)
2. Fallback mechanism injects a static placeholder message (\"context lost, continue from recent messages\") instead of concatenating the genuinely preserved message tail — model cannot recover the interrupted task from this

**Additional Observations**
- 529 errors appear to affect summarisation calls specifically (primary model calls may succeed while summarisation fails)
- OpenClaw does NOT have this problem with the same MiniMax API key — suggesting Hermes fallback logic is the root cause
- After compression failure, the model next response is consistently off-topic
- User must re-explain the task from scratch to recover

**Environment**
- Hermes agent with MiniMax-M2.7 as primary model (provider: minimax-cn)
- Context compression enabled
- Same MiniMax API key used by OpenClaw (OAuth auth) without this issue
- Issue reproduced multiple times in today's sessions

**Expected vs Actual Behavior**
- Expected: After context compression failure, model should continue naturally from the preserved message tail with full awareness of the in-progress task
- Actual: Model loses the thread entirely and responds to stale topics

**Proposed Fix**
1. Compare context compression fallback logic between OpenClaw and Hermes — how does OpenClaw preserve context on summarisation failure?
2. Change fallback to concatenate actual preserved messages (the message tail that was explicitly kept) rather than static placeholder text
3. Ensure the preserved message tail is always accessible even when summarisation LLM call fails

**Related Issues**
- #11914 (same issue, different reporter, MiniMax provider)
- #12028 (related: token accounting fallback for reasoning models)
- #11821 (related: compression Pass 3 JSON safety fix)
- #12072 (related: streaming stall recovery, merged)


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Bug]: Context compression failure uses static placeholder instead of preserved message tail — context permanently lost #12131

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

[Bug]: Context compression failure uses static placeholder instead of preserved message tail — context permanently lost #12131

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions