fix(agent): preserve reasoning_content replay on DeepSeek v4 + Kimi/Moonshot thinking by teknium1 · Pull Request #18045 · NousResearch/hermes-agent

teknium1 · 2026-04-30T17:49:35Z

Summary

DeepSeek v4 thinking mode (and Kimi / Moonshot thinking) stop 400'ing on multi-turn tool-call replays with "The reasoning_content in the thinking mode must be passed back to the API." Fixes #17400.

Root cause

run_agent.py::_build_assistant_message had a pad branch guarded by msg.get("tool_calls"), which was always falsy because tool_calls were assigned ~60 lines later in the same method. When DeepSeek returned reasoning_content=None on a tool-call turn and streaming captured no thinking text, the turn was persisted bare; the next replay hit the 400. Same enforcement exists on Kimi / Moonshot, reachable through the same code path. A secondary hole: when the OpenAI SDK doesn't know a provider's schema (aggregator passthrough like OpenCode Go → DeepSeek), reasoning_content lands in model.model_extra instead of a typed attribute and the builder never sees it.

Changes

Salvages two open PRs:

fix(deepseek): preserve chat completions reasoning_content replay #16855 (@lsdsjy): captures assistant_tool_calls at method entry so the pad check reads the SDK source of truth, falls back to model.model_extra["reasoning_content"] when the typed attr is absent (covers aggregator paths like OpenCode Go), and mirrors the model_extra fallback in the chat_completions transport normalizer. Uses reasoning_text or "" so captured streaming reasoning is preserved when padding.
fix(agent): pad reasoning_content on DeepSeek/Kimi tool-call turns #17489 (@season179): extends the pad to Kimi / Moonshot via a shared _needs_thinking_reasoning_pad() helper that's reused in _copy_reasoning_content_for_api (dedupes the deepseek or kimi predicate across both sites).

Follow-ups added here:

scripts/release.py: AUTHOR_MAP entries for lsdsjy and season179.
Test helpers (_ATTR_ABSENT, _EXPECT_NOT_PRESENT, _sdk_tool_call, _build_sdk_message) from fix(agent): pad reasoning_content on DeepSeek/Kimi tool-call turns #17489 added alongside fix(deepseek): preserve chat completions reasoning_content replay #16855's TestBuildAssistantMessageDeepSeekReasoningContent.

Closes #16855. Closes #17489. Closes #17400.

Validation

	Targeted tests	Run on
Before fix (stash run_agent.py)	2 Kimi/Moonshot parametrized cases FAIL	`test_deepseek_reasoning_content_echo.py`
After fix	34 pass	`test_deepseek_reasoning_content_echo.py`
After fix	95 pass	`test_deepseek_reasoning_content_echo.py` + `test_chat_completions.py`
Wider sweep	1339 passed, 17 skipped	`tests/run_agent/ tests/agent/transports/`

The targeted empirical check (stash + rerun) proves the new Kimi/Moonshot cases exercise the extension on top of #16855, not trivially pass. The 3 DeepSeek parametrized cases pass in both scenarios because they were already fixed by the #16855 cherry-pick.

Credits

@lsdsjy — original DeepSeek v4 + model_extra fix (fix(deepseek): preserve chat completions reasoning_content replay #16855)
@season179 — Kimi/Moonshot extension, shared predicate (fix(agent): pad reasoning_content on DeepSeek/Kimi tool-call turns #17489)

Co-authored-by: lsdsjy luwinyang@deepseek.com
Co-authored-by: season179 season.saw@gmail.com

@lsdsjy

Builds on #16855 (@lsdsjy) which fixed DeepSeek v4 reasoning_content replay via model_extra fallback + capturing tool_calls at method entry. Kimi / Moonshot thinking mode enforces the same echo-back contract and hits the same 400 when a tool-call turn is persisted without reasoning_content. - _build_assistant_message: pad branch now uses _needs_thinking_reasoning_pad() (DeepSeek OR Kimi) instead of _needs_deepseek_tool_reasoning() alone. - Extract _needs_thinking_reasoning_pad() and reuse it in _copy_reasoning_content_for_api so both sites share one predicate. - tests/run_agent/test_deepseek_reasoning_content_echo.py: add TestBuildAssistantMessagePadsStrictProviders parametrized over DeepSeek (attr=None, attr-absent), Kimi (attr=None), Moonshot (via base_url), and an OpenRouter negative control that must NOT pad. Proven to fail 2/5 cases on Kimi/Moonshot without this change. - scripts/release.py: add AUTHOR_MAP entries for lsdsjy and season179. Refs #17400. Co-authored-by: season179 <season.saw@gmail.com>

lsdsjy and others added 2 commits April 30, 2026 10:46

fix(deepseek): preserve v4 reasoning_content on replay

6762a17

alt-glitch added type/bug Something isn't working P1 High — major feature broken, no workaround comp/agent Core agent loop, run_agent.py, prompt builder provider/deepseek DeepSeek API provider/kimi Kimi / Moonshot labels Apr 30, 2026

teknium1 merged commit 76edc40 into main Apr 30, 2026
9 of 11 checks passed

teknium1 deleted the hermes/hermes-4d0d9f34 branch April 30, 2026 18:18

This was referenced Apr 30, 2026

fix(deepseek): preserve chat completions reasoning_content replay #16855

Closed

fix(agent): pad reasoning_content on DeepSeek/Kimi tool-call turns #17489

Closed

reasoning_content dropped in multi-turn tool calls with DeepSeek v4 (causes HTTP 400) #17400

Closed

github-actions Bot mentioned this pull request May 1, 2026

chore: bump NousResearch/hermes-agent version from v2026.4.23 to v2026.4.30 Docker-Hub-sirmark/docker-hermes-agent#4

Merged

Svtter mentioned this pull request May 4, 2026

Empty tool_calls array sent to provider API, causing 400 on strict validators (DeepSeek, NVIDIA NIM) zeroclaw-labs/zeroclaw#6298

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(agent): preserve reasoning_content replay on DeepSeek v4 + Kimi/Moonshot thinking#18045

fix(agent): preserve reasoning_content replay on DeepSeek v4 + Kimi/Moonshot thinking#18045
teknium1 merged 2 commits into
mainfrom
hermes/hermes-4d0d9f34

teknium1 commented Apr 30, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

teknium1 commented Apr 30, 2026

Summary

Root cause

Changes

Validation

Credits

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants