fix(deepseek): preserve chat completions reasoning_content replay by lsdsjy · Pull Request #16855 · NousResearch/hermes-agent

lsdsjy · 2026-04-28T06:20:58Z

Summary

Preserve DeepSeek reasoning_content from OpenAI SDK model_extra during response normalization and assistant message persistence.
Backfill reasoning_content for DeepSeek V4 thinking tool-call assistant turns so replay payloads include content, reasoning_content, and tool_calls.
Add regression coverage plus an opt-in live DeepSeek V4 thinking replay smoke test for both deepseek-v4-flash and deepseek-v4-pro.

Basis

DeepSeek official thinking-mode tool-call guidance requires full replay of reasoning_content after tool-call turns in all subsequent requests:

https://api-docs.deepseek.com/guides/thinking_mode#tool-calls:~:text=Please%20note%20that%2C%20unlike%20turns%20in%20thinking%20mode%20that%20do%20not%20involve%20tool%20calls%2C%20for%20turns%20that%20do%20perform%20tool%20calls%2C%20the%20reasoning_content%20must%20be%20fully%20passed%20back%20to%20the%20API%20in%20all%20subsequent%20requests.

Scope

This PR fixes the OpenAI-compatible Chat Completions path (/chat/completions) used by DeepSeek V4. Anthropic-format compatibility is intentionally out of scope here and is covered separately by #16781:

#16781

Testing

/Users/hfadmin/.hermes/hermes-agent/venv/bin/python -m py_compile run_agent.py agent/transports/chat_completions.py tests/run_agent/test_deepseek_reasoning_content_echo.py tests/run_agent/test_deepseek_v4_thinking_live.py tests/agent/transports/test_chat_completions.py
/Users/hfadmin/.hermes/hermes-agent/venv/bin/python -m pytest tests/run_agent/test_deepseek_reasoning_content_echo.py tests/agent/transports/test_chat_completions.py tests/run_agent/test_deepseek_v4_thinking_live.py -q (69 passed, 2 skipped)
HERMES_LIVE_TESTS=1 pytest tests/run_agent/test_deepseek_v4_thinking_live.py -q -s -n0 (2 passed, verified deepseek-v4-flash and deepseek-v4-pro against the live DeepSeek API)
git diff --check

@lsdsjy

Builds on #16855 (@lsdsjy) which fixed DeepSeek v4 reasoning_content replay via model_extra fallback + capturing tool_calls at method entry. Kimi / Moonshot thinking mode enforces the same echo-back contract and hits the same 400 when a tool-call turn is persisted without reasoning_content. - _build_assistant_message: pad branch now uses _needs_thinking_reasoning_pad() (DeepSeek OR Kimi) instead of _needs_deepseek_tool_reasoning() alone. - Extract _needs_thinking_reasoning_pad() and reuse it in _copy_reasoning_content_for_api so both sites share one predicate. - tests/run_agent/test_deepseek_reasoning_content_echo.py: add TestBuildAssistantMessagePadsStrictProviders parametrized over DeepSeek (attr=None, attr-absent), Kimi (attr=None), Moonshot (via base_url), and an OpenRouter negative control that must NOT pad. Proven to fail 2/5 cases on Kimi/Moonshot without this change. - scripts/release.py: add AUTHOR_MAP entries for lsdsjy and season179. Refs #17400. Co-authored-by: season179 <season.saw@gmail.com>

teknium1 · 2026-04-30T18:20:31Z

Merged via PR #18045. Your commit (b9b9ee3e6) was cherry-picked onto current main with your authorship preserved in git log — the DeepSeek v4 + model_extra fallback + transport-layer normalizer fix is in exactly as you wrote it, plus the live replay smoke test. Thanks for the thorough fix.

Follow-up commit 76edc40ab extends the same pad to Kimi / Moonshot thinking mode (salvaged from #17489) since they enforce the same echo-back contract.

#18045

@lsdsjy

Builds on NousResearch#16855 (@lsdsjy) which fixed DeepSeek v4 reasoning_content replay via model_extra fallback + capturing tool_calls at method entry. Kimi / Moonshot thinking mode enforces the same echo-back contract and hits the same 400 when a tool-call turn is persisted without reasoning_content. - _build_assistant_message: pad branch now uses _needs_thinking_reasoning_pad() (DeepSeek OR Kimi) instead of _needs_deepseek_tool_reasoning() alone. - Extract _needs_thinking_reasoning_pad() and reuse it in _copy_reasoning_content_for_api so both sites share one predicate. - tests/run_agent/test_deepseek_reasoning_content_echo.py: add TestBuildAssistantMessagePadsStrictProviders parametrized over DeepSeek (attr=None, attr-absent), Kimi (attr=None), Moonshot (via base_url), and an OpenRouter negative control that must NOT pad. Proven to fail 2/5 cases on Kimi/Moonshot without this change. - scripts/release.py: add AUTHOR_MAP entries for lsdsjy and season179. Refs NousResearch#17400. Co-authored-by: season179 <season.saw@gmail.com>

@lsdsjy

Builds on NousResearch#16855 (@lsdsjy) which fixed DeepSeek v4 reasoning_content replay via model_extra fallback + capturing tool_calls at method entry. Kimi / Moonshot thinking mode enforces the same echo-back contract and hits the same 400 when a tool-call turn is persisted without reasoning_content. - _build_assistant_message: pad branch now uses _needs_thinking_reasoning_pad() (DeepSeek OR Kimi) instead of _needs_deepseek_tool_reasoning() alone. - Extract _needs_thinking_reasoning_pad() and reuse it in _copy_reasoning_content_for_api so both sites share one predicate. - tests/run_agent/test_deepseek_reasoning_content_echo.py: add TestBuildAssistantMessagePadsStrictProviders parametrized over DeepSeek (attr=None, attr-absent), Kimi (attr=None), Moonshot (via base_url), and an OpenRouter negative control that must NOT pad. Proven to fail 2/5 cases on Kimi/Moonshot without this change. - scripts/release.py: add AUTHOR_MAP entries for lsdsjy and season179. Refs NousResearch#17400. Co-authored-by: season179 <season.saw@gmail.com>

@lsdsjy

Builds on NousResearch#16855 (@lsdsjy) which fixed DeepSeek v4 reasoning_content replay via model_extra fallback + capturing tool_calls at method entry. Kimi / Moonshot thinking mode enforces the same echo-back contract and hits the same 400 when a tool-call turn is persisted without reasoning_content. - _build_assistant_message: pad branch now uses _needs_thinking_reasoning_pad() (DeepSeek OR Kimi) instead of _needs_deepseek_tool_reasoning() alone. - Extract _needs_thinking_reasoning_pad() and reuse it in _copy_reasoning_content_for_api so both sites share one predicate. - tests/run_agent/test_deepseek_reasoning_content_echo.py: add TestBuildAssistantMessagePadsStrictProviders parametrized over DeepSeek (attr=None, attr-absent), Kimi (attr=None), Moonshot (via base_url), and an OpenRouter negative control that must NOT pad. Proven to fail 2/5 cases on Kimi/Moonshot without this change. - scripts/release.py: add AUTHOR_MAP entries for lsdsjy and season179. Refs NousResearch#17400. Co-authored-by: season179 <season.saw@gmail.com>

@lsdsjy

Builds on NousResearch#16855 (@lsdsjy) which fixed DeepSeek v4 reasoning_content replay via model_extra fallback + capturing tool_calls at method entry. Kimi / Moonshot thinking mode enforces the same echo-back contract and hits the same 400 when a tool-call turn is persisted without reasoning_content. - _build_assistant_message: pad branch now uses _needs_thinking_reasoning_pad() (DeepSeek OR Kimi) instead of _needs_deepseek_tool_reasoning() alone. - Extract _needs_thinking_reasoning_pad() and reuse it in _copy_reasoning_content_for_api so both sites share one predicate. - tests/run_agent/test_deepseek_reasoning_content_echo.py: add TestBuildAssistantMessagePadsStrictProviders parametrized over DeepSeek (attr=None, attr-absent), Kimi (attr=None), Moonshot (via base_url), and an OpenRouter negative control that must NOT pad. Proven to fail 2/5 cases on Kimi/Moonshot without this change. - scripts/release.py: add AUTHOR_MAP entries for lsdsjy and season179. Refs NousResearch#17400. Co-authored-by: season179 <season.saw@gmail.com>

@lsdsjy

Builds on NousResearch#16855 (@lsdsjy) which fixed DeepSeek v4 reasoning_content replay via model_extra fallback + capturing tool_calls at method entry. Kimi / Moonshot thinking mode enforces the same echo-back contract and hits the same 400 when a tool-call turn is persisted without reasoning_content. - _build_assistant_message: pad branch now uses _needs_thinking_reasoning_pad() (DeepSeek OR Kimi) instead of _needs_deepseek_tool_reasoning() alone. - Extract _needs_thinking_reasoning_pad() and reuse it in _copy_reasoning_content_for_api so both sites share one predicate. - tests/run_agent/test_deepseek_reasoning_content_echo.py: add TestBuildAssistantMessagePadsStrictProviders parametrized over DeepSeek (attr=None, attr-absent), Kimi (attr=None), Moonshot (via base_url), and an OpenRouter negative control that must NOT pad. Proven to fail 2/5 cases on Kimi/Moonshot without this change. - scripts/release.py: add AUTHOR_MAP entries for lsdsjy and season179. Refs NousResearch#17400. Co-authored-by: season179 <season.saw@gmail.com>

@lsdsjy

Builds on NousResearch#16855 (@lsdsjy) which fixed DeepSeek v4 reasoning_content replay via model_extra fallback + capturing tool_calls at method entry. Kimi / Moonshot thinking mode enforces the same echo-back contract and hits the same 400 when a tool-call turn is persisted without reasoning_content. - _build_assistant_message: pad branch now uses _needs_thinking_reasoning_pad() (DeepSeek OR Kimi) instead of _needs_deepseek_tool_reasoning() alone. - Extract _needs_thinking_reasoning_pad() and reuse it in _copy_reasoning_content_for_api so both sites share one predicate. - tests/run_agent/test_deepseek_reasoning_content_echo.py: add TestBuildAssistantMessagePadsStrictProviders parametrized over DeepSeek (attr=None, attr-absent), Kimi (attr=None), Moonshot (via base_url), and an OpenRouter negative control that must NOT pad. Proven to fail 2/5 cases on Kimi/Moonshot without this change. - scripts/release.py: add AUTHOR_MAP entries for lsdsjy and season179. Refs NousResearch#17400. Co-authored-by: season179 <season.saw@gmail.com>

@lsdsjy

Builds on NousResearch#16855 (@lsdsjy) which fixed DeepSeek v4 reasoning_content replay via model_extra fallback + capturing tool_calls at method entry. Kimi / Moonshot thinking mode enforces the same echo-back contract and hits the same 400 when a tool-call turn is persisted without reasoning_content. - _build_assistant_message: pad branch now uses _needs_thinking_reasoning_pad() (DeepSeek OR Kimi) instead of _needs_deepseek_tool_reasoning() alone. - Extract _needs_thinking_reasoning_pad() and reuse it in _copy_reasoning_content_for_api so both sites share one predicate. - tests/run_agent/test_deepseek_reasoning_content_echo.py: add TestBuildAssistantMessagePadsStrictProviders parametrized over DeepSeek (attr=None, attr-absent), Kimi (attr=None), Moonshot (via base_url), and an OpenRouter negative control that must NOT pad. Proven to fail 2/5 cases on Kimi/Moonshot without this change. - scripts/release.py: add AUTHOR_MAP entries for lsdsjy and season179. Refs NousResearch#17400. Co-authored-by: season179 <season.saw@gmail.com>

fix(deepseek): preserve v4 reasoning_content on replay

86449b4

alt-glitch added type/bug Something isn't working P2 Medium — degraded but workaround exists comp/agent Core agent loop, run_agent.py, prompt builder provider/deepseek DeepSeek API labels Apr 28, 2026

lsdsjy changed the title ~~fix(deepseek): preserve v4 reasoning_content on replay~~ fix(deepseek): preserve chat completions reasoning_content replay Apr 28, 2026

teknium1 mentioned this pull request Apr 30, 2026

fix(agent): preserve reasoning_content replay on DeepSeek v4 + Kimi/Moonshot thinking #18045

Merged

teknium1 closed this in #18045 Apr 30, 2026

teknium1 mentioned this pull request Apr 30, 2026

fix(agent): pad reasoning_content on DeepSeek/Kimi tool-call turns #17489

Closed

13 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(deepseek): preserve chat completions reasoning_content replay#16855

fix(deepseek): preserve chat completions reasoning_content replay#16855
lsdsjy wants to merge 1 commit into
NousResearch:mainfrom
lsdsjy:fix/deepseek-v4-reasoning-content

lsdsjy commented Apr 28, 2026 •

edited

Loading

Uh oh!

teknium1 commented Apr 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

lsdsjy commented Apr 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Basis

Scope

Testing

Uh oh!

teknium1 commented Apr 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

lsdsjy commented Apr 28, 2026 •

edited

Loading