fix(deepseek): add native thinking/reasoning_effort support by asdlem · Pull Request #22218 · NousResearch/hermes-agent

asdlem · 2026-05-09T03:00:05Z

Summary

Enable DeepSeek V4+ native thinking mode via extra_body.thinking + top-level reasoning_effort
Previously, the generic code path never sent these parameters for the deepseek provider, so /reasoning had zero effect

Motivation

DeepSeek uses its own protocol for reasoning (extra_body.thinking={"type":"enabled"} + reasoning_effort="high"/"max"), NOT OpenRouter-style extra_body.reasoning. The codebase only emitted the OpenRouter protocol. This meant /reasoning low/medium/high/xhigh were all no-ops for the deepseek provider.
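The difference between the two request shapes can be sketched as follows. This is an illustration only: the function names are hypothetical and not part of the codebase, and the OpenRouter-style shape assumes the `reasoning` object carries an `effort` field.

```python
from typing import Any, Dict


def openrouter_style_kwargs(effort: str) -> Dict[str, Any]:
    """OpenRouter-style: reasoning config nested under extra_body.reasoning."""
    return {"extra_body": {"reasoning": {"effort": effort}}}


def deepseek_native_kwargs(effort: str) -> Dict[str, Any]:
    """DeepSeek-native: thinking toggle in extra_body, effort at the top level."""
    return {
        "extra_body": {"thinking": {"type": "enabled"}},
        "reasoning_effort": effort,
    }


if __name__ == "__main__":
    # Print both shapes side by side for comparison.
    print(openrouter_style_kwargs("high"))
    print(deepseek_native_kwargs("high"))
```

A provider that only understands the second shape would see no thinking-related parameters at all when sent the first, which is consistent with /reasoning being a no-op.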

Closes #15700, #21577

Changes

Introduce DeepSeekProfile (subclass of ProviderProfile) in plugins/model-providers/deepseek/__init__.py

Add build_api_kwargs_extras() that emits the correct parameter shape per effort level:

| Hermes `/reasoning` | `extra_body.thinking` | `reasoning_effort` |
| --- | --- | --- |
| default / unset | `{type: enabled}` | `high` |
| `none` (disabled) | `{type: disabled}` | (omitted) |
| `low`, `medium`, `high` | `{type: enabled}` | `high` |
| `xhigh` | `{type: enabled}` | `max` |

Follows the same pattern as the existing Kimi provider
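The mapping in the table above could be sketched as a standalone function. This is a minimal sketch, not the PR's actual code: the real method lives on DeepSeekProfile and its signature is not shown here, so the `effort` argument is an assumption, as is the choice to treat unknown levels like the default.

```python
from typing import Any, Dict, Optional

# Hermes effort level -> DeepSeek reasoning_effort, per the table above.
_EFFORT_MAP = {"low": "high", "medium": "high", "high": "high", "xhigh": "max"}


def build_api_kwargs_extras(effort: Optional[str] = None) -> Dict[str, Any]:
    """Return the extra request kwargs for a given Hermes effort level.

    Hypothetical standalone version; `effort=None` means no reasoning config.
    """
    if effort == "none":
        # Disabled: send the thinking marker only, omit reasoning_effort.
        return {"extra_body": {"thinking": {"type": "disabled"}}}
    return {
        "extra_body": {"thinking": {"type": "enabled"}},
        # Assumption: unset or unrecognized levels fall back to "high".
        "reasoning_effort": _EFFORT_MAP.get(effort, "high"),
    }
```

For example, `build_api_kwargs_extras("xhigh")` yields an enabled thinking block with `reasoning_effort` set to `max`, while `build_api_kwargs_extras("none")` yields only the disabled thinking marker.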

Test Plan

Unit-level: Python snippet directly testing build_api_kwargs_extras() for all effort levels confirmed correct parameter shape
End-to-end: /reasoning xhigh in Feishu with deepseek-v4-pro confirmed thinking mode active (response detail length increased from ~30 to ~185 chars)
/reasoning high and /reasoning max both confirmed through Feishu handler
No regressions: default behavior (no reasoning config) adds thinking+effort=high, matching DeepSeek defaults

Notes for Reviewers

This is the exact same pattern as plugins/model-providers/kimi/__init__.py; Z.AI/Kimi uses the same protocol
Issue #16533 (Z.AI/GLM missing thinking: the zai provider never returns reasoning_content because Hermes sends OpenRouter-style extra_body.reasoning, but Z.AI expects extra_body.thinking={"type":"enabled"}) has the same root cause and can use the same approach

DeepSeek V4+ uses extra_body.thinking + top-level reasoning_effort (native protocol, same as Kimi/Z.AI), not OpenRouter-style extra_body.reasoning. Previously the generic code path never sent these parameters for the deepseek provider, so /reasoning commands had zero effect.

Changes:
- Introduce DeepSeekProfile (subclass of ProviderProfile) with build_api_kwargs_extras() that emits the correct parameter shape
- Map Hermes effort levels: low/medium/high → reasoning_effort=high, xhigh → reasoning_effort=max
- Default (no reasoning config) enables thinking with effort=high
- Disabled reasoning sends thinking.type=disabled (no effort param)

This follows the same pattern as the existing Kimi provider.

Fixes: the deepseek provider previously had no reasoning support at all; /reasoning low/medium/high/xhigh were all no-ops.

tuobi2 · 2026-05-13T13:09:19Z

@asdlem — closed my #25000 as duplicate, per @alt-glitch's consolidation guidance. Two pieces from our diff that might be worth adding here:

1. Transport-layer is_deepseek fallback

When get_provider_profile() fails or the profile isn't loaded, the Kimi path has a transport-level is_kimi safety net (chat_completions.py L341-349). DeepSeek should too:

run_agent.py: _is_deepseek detection + param forwarding (2 lines)
chat_completions.py: mirror the is_kimi block for is_deepseek (11 lines)

This ensures thinking works even if the profile subsystem has an issue.
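A transport-level fallback of this kind might look roughly like the sketch below. All names here are hypothetical (the actual is_kimi block in chat_completions.py is not reproduced), and the fallback values simply reuse the defaults from the PR description: thinking enabled with reasoning_effort=high.

```python
from typing import Any, Dict, Optional
from urllib.parse import urlparse


def is_deepseek_base_url(base_url: Optional[str]) -> bool:
    """Detect the DeepSeek API by host name (needs a scheme, e.g. https://)."""
    host = urlparse(base_url or "").hostname or ""
    return host == "api.deepseek.com"


def apply_thinking_fallback(
    api_kwargs: Dict[str, Any],
    base_url: Optional[str],
    profile_extras: Optional[Dict[str, Any]] = None,
) -> Dict[str, Any]:
    """Prefer profile-provided extras; otherwise fall back on URL detection."""
    merged = dict(api_kwargs)
    if profile_extras:
        # Normal path: the provider profile supplied the parameters.
        merged.update(profile_extras)
        return merged
    if is_deepseek_base_url(base_url):
        # Fallback path: the profile was unavailable, so add defaults here.
        extra_body = dict(merged.get("extra_body") or {})
        extra_body.setdefault("thinking", {"type": "enabled"})
        merged["extra_body"] = extra_body
        merged.setdefault("reasoning_effort", "high")
    return merged
```

The design trade-off is that the fallback duplicates provider knowledge in the transport layer, in exchange for keeping thinking mode working when profile lookup fails.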

2. Transport parity tests

tests/providers/test_transport_parity.py::TestDeepSeekParity — verifies ChatCompletionsTransport correctly integrates DeepSeekProfile (same pattern as existing TestKimiParity).

Both diffs are visible at https://github.com/NousResearch/hermes-agent/pull/25000/files — feel free to cherry-pick or ignore. Happy to help if you want a separate PR against your branch with just these two additions.

- run_agent.py: detect api.deepseek.com and forward is_deepseek param (mirrors existing is_kimi pattern)
- chat_completions.py: handle is_deepseek in extra_body.thinking (same protocol as Kimi; fallback when profile subsystem fails)
- tests: TestDeepSeekParity (transport) + TestDeepSeekProfile (profile) (5 tests, all passing)

Refs: NousResearch#22218, closes NousResearch#25000

teknium1 · 2026-06-11T15:06:04Z

Automated hermes-sweeper review: this DeepSeek reasoning support is already implemented on current main.

Evidence:

plugins/model-providers/deepseek/__init__.py:47 defines DeepSeekProfile.build_api_kwargs_extras(), emitting DeepSeek's native extra_body.thinking plus top-level reasoning_effort for DeepSeek V4+/reasoner models.
agent/transports/chat_completions.py:526 wires provider-profile extras into the actual chat-completions request kwargs, so the DeepSeek profile path is active in transport.
tests/plugins/model_providers/test_deepseek_profile.py:39 covers the requested wire shape: default thinking enabled, disabled thinking marker, low/medium/high passthrough, and xhigh/max -> max; tests/plugins/model_providers/test_deepseek_profile.py:155 verifies full transport kwargs for deepseek-v4-pro.
Implementing commit: cd9470f41638bd515db096cd934c463205790110 (fix(deepseek): wire thinking-mode via DeepSeekProfile, not legacy fallback), shipped in v2026.5.16.

I also checked the discussion about adding an is_deepseek transport fallback. Main implemented this through the active ProviderProfile path instead; the fixing commit notes the legacy fallback route was not the path DeepSeek uses.

alt-glitch added labels on May 9, 2026: type/feature (New feature or request), P3 (Low: cosmetic, nice to have), comp/plugins (Plugin system and bundled plugins), provider/deepseek (DeepSeek API)

This was referenced May 14, 2026

fix(deepseek): subclass ProviderProfile so reasoning_effort + thinking reach the API #25301 (Closed)

fix(deepseek): wire thinking-mode via DeepSeekProfile (closes #15700, #17212, #17825) #26648 (Merged)

alt-glitch mentioned this pull request May 23, 2026

fix(deepseek): map low/medium reasoning_effort to high per API spec #30883 (Closed)

teknium1 closed this Jun 11, 2026

teknium1 added the sweeper:implemented-on-main label (Sweeper: behavior already present on current main) Jun 11, 2026

asdlem wants to merge 2 commits into NousResearch:main from asdlem:fix/deepseek-reasoning-effort
