fix: accept reasoning-only responses without retries — set content to "(empty)" by teknium1 · Pull Request #5278 · NousResearch/hermes-agent

teknium1 · 2026-04-05T18:30:44Z

Summary

Replaces the 120-line retry/classify/compress/salvage cascade for reasoning-only responses with a simple 15-line acceptance path.

Before

Model returns reasoning but no visible content → retry 3 times → classify → maybe compress → maybe salvage reasoning as content → maybe return error. Wastes 3+ API calls.

After

Model returns reasoning but no visible content → keep reasoning in reasoning field → set content to "(empty)" → done. 1 API call.

What this fixes

Bug: Empty assistant message from reasoning-only fix leaks into Chat Completions API, causing prefill rejection #2128: Empty assistant messages (content: "") no longer leak into session history. Content is "(empty)" — a valid non-empty string that every provider accepts, preventing prefill rejection errors.
Wasted API calls: 1 call instead of 3-4 per reasoning-only response.
Semantic correctness: Reasoning stays in the reasoning field, not stuffed into content.

Changes

run_agent.py: -122 lines, +18 lines. Removed _classify_empty_content_response retry cascade. Replaced with _build_assistant_message + content = "(empty)" + break.
tests/test_run_agent.py: Updated 4 tests to expect the new behavior (no retries, no compression triggers, "(empty)" content).

Test results

tests/test_run_agent.py + tests/test_run_agent_codex_responses.py: 256 passed
E2E: 6/6 passed (structured reasoning, inline think blocks, truly empty, session continuation, zero retries, normal response regression)

Closes #2128

LLMs frequently return numbers as strings ("42" instead of 42) and booleans as strings ("true" instead of true). This causes silent failures with MCP tools and any tool with strictly-typed parameters. Added coerce_tool_args() in model_tools.py that runs before every tool dispatch. For each argument, it checks the tool registry schema and attempts safe coercion: - "42" → 42 when schema says "type": "integer" - "3.14" → 3.14 when schema says "type": "number" - "true"/"false" → True/False when schema says "type": "boolean" - Union types tried in order - Original values preserved when coercion fails or is not applicable Inspired by Block/goose tool argument coercion system.

… "(empty)" Previously, when a model returned reasoning/thinking but no visible content, we entered a 120-line retry/classify/compress/salvage cascade that wasted 3+ API calls trying to "fix" the response. The model was done thinking — retrying with the same input just burned money. Now reasoning-only responses are accepted immediately: - Reasoning stays in the `reasoning` field (semantically correct) - Content set to "(empty)" — valid non-empty string every provider accepts - No retries, no compression triggers, no salvage logic - Session history contains "(empty)" not "" — prevents #2128 session poisoning where empty assistant content caused prefill rejections Removes ~120 lines, adds ~15. Saves 2-3 API calls per reasoning-only response. Fixes #2128.

github-actions · 2026-04-05T18:31:01Z

⚠️ Supply Chain Risk Detected

This PR contains patterns commonly associated with supply chain attacks. This does not mean the PR is malicious — but these patterns require careful human review before merging.

⚠️ WARNING: Install hook files modified

These files can execute code during package installation or interpreter startup.

Files:

hermes_cli/setup.py

Automated scan triggered by supply-chain-audit. If this is a false positive, a maintainer can approve after manual review.

… "(empty)" (NousResearch#5278) * feat: coerce tool call arguments to match JSON Schema types LLMs frequently return numbers as strings ("42" instead of 42) and booleans as strings ("true" instead of true). This causes silent failures with MCP tools and any tool with strictly-typed parameters. Added coerce_tool_args() in model_tools.py that runs before every tool dispatch. For each argument, it checks the tool registry schema and attempts safe coercion: - "42" → 42 when schema says "type": "integer" - "3.14" → 3.14 when schema says "type": "number" - "true"/"false" → True/False when schema says "type": "boolean" - Union types tried in order - Original values preserved when coercion fails or is not applicable Inspired by Block/goose tool argument coercion system. * fix: accept reasoning-only responses without retries — set content to "(empty)" Previously, when a model returned reasoning/thinking but no visible content, we entered a 120-line retry/classify/compress/salvage cascade that wasted 3+ API calls trying to "fix" the response. The model was done thinking — retrying with the same input just burned money. Now reasoning-only responses are accepted immediately: - Reasoning stays in the `reasoning` field (semantically correct) - Content set to "(empty)" — valid non-empty string every provider accepts - No retries, no compression triggers, no salvage logic - Session history contains "(empty)" not "" — prevents NousResearch#2128 session poisoning where empty assistant content caused prefill rejections Removes ~120 lines, adds ~15. Saves 2-3 API calls per reasoning-only response. Fixes NousResearch#2128.

teknium1 added 2 commits April 5, 2026 10:43

teknium1 merged commit a0a1b86 into main Apr 5, 2026

livingghost mentioned this pull request Apr 8, 2026

fix(gateway): suppress "(empty)" visible replies on chat surfaces #6166

Closed

2 tasks

teknium1 mentioned this pull request Apr 9, 2026

fix: retry 3 times with nudge when model returns truly empty response #6488

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: accept reasoning-only responses without retries — set content to "(empty)"#5278

fix: accept reasoning-only responses without retries — set content to "(empty)"#5278
teknium1 merged 2 commits into
mainfrom
hermes/hermes-bae00f49

teknium1 commented Apr 5, 2026

Uh oh!

github-actions Bot commented Apr 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

teknium1 commented Apr 5, 2026

Summary

Before

After

What this fixes

Changes

Test results

Uh oh!

github-actions Bot commented Apr 5, 2026

⚠️ Supply Chain Risk Detected

⚠️ WARNING: Install hook files modified

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant