fix(compression): use extract_content_or_reasoning for reasoning model summaries by airudotsh · Pull Request #4603 · NousResearch/hermes-agent

airudotsh · 2026-04-02T14:56:18Z

What changed

Use extract_content_or_reasoning() (from auxiliary_client) instead of raw response.choices[0].message.content in the context compressor's _generate_summary().

Why

Reasoning models (DeepSeek-R1, Qwen-QwQ, glm-5-turbo) sometimes put all output inside think/reasoning blocks with an empty content field. The compressor was reading raw content directly, getting an empty string, and silently dropping middle conversation turns without a meaningful summary — causing context continuity loss.

How it fixes it

extract_content_or_reasoning() already handles:

Empty content + structured reasoning field fallback — reads message.reasoning / message.reasoning_content when content is empty
XML-style think tag stripping — removes <think/>, <thinking/>, <reasoning/> blocks from content

Also normalizes dict content (llama.cpp style responses) before extraction to prevent type errors.

Tests

3 new test cases in TestReasoningOnlyExtraction:

Reasoning field extracted when content is empty
Normal content passed through without modification
XML think tags stripped from content

All 37 tests pass.

…l summaries Reasoning models (DeepSeek-R1, Qwen-QwQ, glm-5-turbo) sometimes put all output inside think/reasoning blocks with an empty content field. The compressor was reading raw response.choices[0].message.content directly, getting an empty string, and silently dropping middle turns without a meaningful summary. Use the existing extract_content_or_reasoning() helper (from auxiliary_client) which already handles: - Empty content + structured reasoning field fallback - XML-style think/thinking/reasoning tag stripping Also normalize dict content (llama.cpp) before extraction to prevent type errors. Tests: 3 new cases covering reasoning-only, think-tag, and normal content extraction paths.

alt-glitch · 2026-05-01T20:53:48Z

Likely duplicate of #14847 — same fix: use extract_content_or_reasoning() in context compressor _generate_summary() for reasoning-only model responses.

alt-glitch · 2026-05-01T20:54:18Z

Likely duplicate of #14847 — same fix: use extract_content_or_reasoning() in context compressor _generate_summary() for reasoning-only model responses.

airudotsh mentioned this pull request Apr 2, 2026

fix(compression): include reasoning tokens in context tracking #4614

Closed

Tranquil-Flow mentioned this pull request Apr 8, 2026

fix(auxiliary): guard extract_content_or_reasoning when choices are missing #5969

Open

19 tasks

This was referenced Apr 24, 2026

Use reasoning content for context summaries #14847

Closed

[BUG] DeepSeek V4 thinking mode fails with reasoning_content error #14933

Closed

alt-glitch added type/bug Something isn't working P1 High — major feature broken, no workaround comp/agent Core agent loop, run_agent.py, prompt builder duplicate This issue or pull request already exists labels May 1, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(compression): use extract_content_or_reasoning for reasoning model summaries#4603

fix(compression): use extract_content_or_reasoning for reasoning model summaries#4603
airudotsh wants to merge 1 commit into
NousResearch:mainfrom
airudotsh:fix/compressor-reasoning-extract

airudotsh commented Apr 2, 2026

Uh oh!

alt-glitch commented May 1, 2026

Uh oh!

alt-glitch commented May 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

airudotsh commented Apr 2, 2026

What changed

Why

How it fixes it

Related

Tests

Uh oh!

alt-glitch commented May 1, 2026

Uh oh!

alt-glitch commented May 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants