fix(agent): comprehensive DeepSeek V4 support — context windows, thinking mode, reasoning replay by Tranquil-Flow · Pull Request #15446 · NousResearch/hermes-agent

Tranquil-Flow · 2026-04-25T00:34:03Z

Summary

Unifies 5 fragmented DeepSeek V4 PRs into a single cohesive implementation:

Context windows: Add 1M entries for deepseek-v4-pro, deepseek-v4-flash, deepseek-chat, deepseek-reasoner (128K fallback preserved for older models)
Thinking mode toggle: Plumb thinking.type and reasoning_effort for native DeepSeek API — maps effort values to DeepSeek's supported "high"/"max" pair, strips incompatible sampling params (temperature, top_p, etc.) when thinking is enabled
reasoning_content replay: Inject reasoning_content="" on all assistant messages for DeepSeek replay, scoped to api.deepseek.com and OpenRouter deepseek/ prefix. Respects enabled: false to skip injection
_extract_reasoning guards: Use isinstance(str) checks instead of truthy checks, preventing crashes on non-string reasoning values
reasoning_content normalization: Preserve empty string "" in normalize_response (semantically valid for DeepSeek, was being dropped by truthy check)
_handle_max_iterations: Add missing _copy_reasoning_content_for_api call so the max-iterations summary path doesn't produce 400s
deepseek-chat preserved: deepseek-chat (the non-thinking alias) is NOT forced into thinking mode by default — only deepseek-v4-* and deepseek-reasoner models, or when the user explicitly opts in via reasoning_config

Test plan

34 tests across 5 test classes (context windows, thinking mode, replay, isinstance guards, normalization)
Verified deepseek-chat does NOT force thinking mode by default
Verified deepseek-chat CAN opt-in to thinking with explicit config
Verified temperature stripping only happens when thinking is enabled
Verified non-DeepSeek models are not affected by any of these changes
Verified explicit reasoning_content is preserved (not overwritten with "")

Fixes #15353. Supersedes #14952, #14958, #15325, #15228, #15354.

…king mode, reasoning replay Unifies approaches from PRs NousResearch#14952, NousResearch#14958, NousResearch#15325, NousResearch#15228, NousResearch#15354 into a single cohesive implementation: - Add 1M context window entries for V4 models (deepseek-v4-pro, deepseek-v4-flash, deepseek-chat, deepseek-reasoner) - Plumb thinking.type toggle and reasoning_effort mapping for native DeepSeek API (only "high" and "max" are valid) - Strip incompatible sampling params when thinking is enabled - Inject reasoning_content="" on all assistant messages for DeepSeek replay (scoped to api.deepseek.com and OpenRouter) - Fix _extract_reasoning isinstance checks for empty strings - Preserve empty-string reasoning_content in normalize_response - Add _copy_reasoning_content_for_api call in _handle_max_iterations Fixes NousResearch#15353. Supersedes NousResearch#14952, NousResearch#14958, NousResearch#15325, NousResearch#15228, NousResearch#15354.

ukint-vs · 2026-04-26T16:38:06Z

Superseded by #15478 (merged 2026-04-26) which landed a more comprehensive fix covering both DeepSeek and Kimi, with the regression follow-up in #16097. Closing — upstream main now has the fix.

teknium1 · 2026-04-27T10:38:58Z

Closing as redundant — the DeepSeek reasoning_content thinking-mode 400 and cross-provider leak chain of issues is now fully covered on main:

Direct provider detection + same-provider tool-call pad: commits 93a2d6b, d58b305, ad0ac89, 5ae6081
All-assistant-messages pad rule: commit ad0ac89
Ordering / cross-provider isolation: commit 9daa062 + regression guard 63bf7a2
Cross-provider leak (MiniMax reasoning → DeepSeek): PR fix(agent): block cross-provider reasoning leak to DeepSeek/Kimi (#15748) #16500 (merging shortly)

21 regression tests in tests/run_agent/test_deepseek_reasoning_content_echo.py + 2 new tests for the cross-provider scenario exercise every known path. Thanks for the submission — appreciate the digging on this area.

Tranquil-Flow added 2 commits April 25, 2026 10:16

fix: preserve deepseek-chat default thinking behavior

44dd879

alt-glitch added type/bug Something isn't working P1 High — major feature broken, no workaround comp/agent Core agent loop, run_agent.py, prompt builder provider/deepseek DeepSeek API labels Apr 25, 2026

alt-glitch mentioned this pull request Apr 25, 2026

feat: add DeepSeek-V4 thinking mode via unified thinking_mode parameter #15577

Open

ukint-vs mentioned this pull request Apr 25, 2026

fix(deepseek): inject empty reasoning_content on replay for OpenRouter DeepSeek #15325

Closed

This was referenced Apr 25, 2026

[Bug]: DeepSeek reasoning models not supported (reasoning_content missing) #15679

Closed

fix: ensure reasoning_content consistency for DeepSeek-compatible APIs #15982

Closed

teknium1 closed this Apr 27, 2026

alt-glitch mentioned this pull request Apr 28, 2026

DeepSeek /anthropic (V4 thinking): stripped thinking blocks cause HTTP 400 on replay #16748

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(agent): comprehensive DeepSeek V4 support — context windows, thinking mode, reasoning replay#15446

fix(agent): comprehensive DeepSeek V4 support — context windows, thinking mode, reasoning replay#15446
Tranquil-Flow wants to merge 2 commits into
NousResearch:mainfrom
Tranquil-Flow:fix/deepseek-v4-comprehensive-support

Tranquil-Flow commented Apr 25, 2026

Uh oh!

ukint-vs commented Apr 26, 2026

Uh oh!

teknium1 commented Apr 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

Tranquil-Flow commented Apr 25, 2026

Summary

Test plan

Uh oh!

ukint-vs commented Apr 26, 2026

Uh oh!

teknium1 commented Apr 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants