Bug: Compaction failure leaves session in permanent failed state with no automatic recovery
Summary
When session compaction fails (for any reason: timeout, API error, quota exceeded, model not supporting reasoning, etc.), the session enters status=failed permanently. There is no watchdog or automatic recovery mechanism — the channel becomes completely unresponsive (messages show as read but never get a reply), and the only workaround is manual intervention.
Steps to reproduce
- Have a session that grows large enough to trigger compaction (~500KB+ JSONL)
- Compaction fails (e.g., model returns error, or times out)
- Session status becomes "failed"
- All subsequent messages to that channel receive no response
- No automatic escalation, no fallback session, no user notification
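The failure mode described above can be illustrated with a minimal sketch (hypothetical `Session` class and field names — the real gateway's session model may differ):

```python
from dataclasses import dataclass, field

@dataclass
class Session:
    """Minimal stand-in for a gateway session."""
    id: str
    status: str = "active"  # "active" | "failed"
    replies: list = field(default_factory=list)

    def handle_message(self, text: str):
        # Once a failed compaction flips the status, every subsequent
        # message is silently dropped -- this is the reported bug.
        if self.status == "failed":
            return None
        reply = f"ack: {text}"
        self.replies.append(reply)
        return reply

s = Session(id="e1bc0eb2")
assert s.handle_message("hello") == "ack: hello"
s.status = "failed"                            # compaction failure flips the flag...
assert s.handle_message("still there?") is None  # ...and the channel goes silent
```

Nothing ever resets `status`, so without external intervention the session stays dead forever.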
Observed log
Session e1bc0eb2 at 04:11-04:14:
- Compaction triggered on large session
- Fallback model returned 400 (reasoning required but disabled in system prompt)
- Multiple retry attempts → 429 rate limit
- Session grew even larger from error messages
- Compaction failed → status=failed → no further responses
- Auto-recovery via external heartbeat watchdog was the only way out
Expected behavior
Compaction failure should have automatic escalation:
- Try alternative compaction model
- If all models fail, create a new session automatically and notify the user
- Never leave a session permanently dead with no response
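The escalation chain above could be sketched roughly as follows (a sketch only: `compact`, `create_new_session`, and `notify_user` are assumed hooks, not real gateway APIs):

```python
def compact_with_escalation(session, models, create_new_session, notify_user):
    """Try each compaction model in order; if every model fails,
    rotate the session instead of leaving the channel dead."""
    for model in models:
        try:
            return model.compact(session)  # success: compacted session
        except Exception:
            continue                       # escalate to the next model
    # All models failed: never leave the session permanently unresponsive.
    fresh = create_new_session(session["channel"])
    notify_user(session["channel"],
                "Compaction failed on all models; a new session was started. "
                "Your previous history was backed up.")
    return fresh
```

The key property is the unconditional fallback at the end: the function always returns a usable session, so a compaction failure can degrade the experience but never silence the channel.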
Impact
- High: Users experience "read but no reply" with no explanation
- Perceived data loss: users don't know their history was preserved in .bak.recovered.* files
- Depends on external heartbeat watchdog to recover — not a proper fix
References
- status field values ("failed", "timeout", "done") mislead agents into spawning duplicate sessions #64103
Suggested fix
Add a compaction-failure watchdog in the gateway that:
- Detects when compaction has failed
- Automatically creates a new session for the channel
- Optionally preserves a backup of the old session file
- Notifies the user that a new session was started
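A single pass of such a watchdog might look like this (hypothetical hook names throughout — `rotate_session`, `backup`, and `notify` stand in for whatever the gateway actually exposes); the gateway would run it on a timer, e.g. every 30 seconds:

```python
def sweep_failed_sessions(sessions, rotate_session, backup, notify):
    """One watchdog pass: back up, rotate, and notify for every session
    stuck in status='failed'. Returns (old_id, new_id) pairs recovered."""
    recovered = []
    for session in sessions:
        if session.status == "failed":
            backup(session)  # preserve the old JSONL, e.g. as *.bak.recovered.*
            fresh = rotate_session(session.channel)
            notify(session.channel,
                   f"Previous session {session.id} failed during compaction; "
                   f"started new session {fresh.id}. History was backed up.")
            recovered.append((session.id, fresh.id))
    return recovered
```

Making this part of the gateway itself (rather than an external heartbeat watchdog) keeps recovery self-contained and guarantees the user is told what happened.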