fix: fallback content to reasoning_content when DeepSeek returns empty content field by Starfie1d1272 · Pull Request #428 · HKUDS/DeepTutor

Starfie1d1272 · 2026-04-30T18:38:25Z

Problem

DeepSeek v4-flash and v4-pro default to thinking mode enabled (per API docs), returning responses in reasoning_content while leaving content empty. This caused cascade failures across multiple layers:

Symptom	Root cause
`LLM returned empty response`	`_parse()` / `_parse_chunks()` only fell back to `m.reasoning` (DashScope), not `m.reasoning_content` (DeepSeek)
`No JSON object found` in book blocks	`CodeGenerator` / `FlashCardsGenerator` / `TimelineGenerator` / `DeepDiveGenerator` used fragile `json.loads()` directly
Streaming empty response	`factory.py` wrapped reasoning in `<think>` tags stripped by `clean_thinking_tags()`
`'list' object has no attribute 'get'`	`IdeaAgent` didn't guard against array-type JSON responses
`LLM_REASONING_EFFORT` env var silently ignored	resolver path takes priority over `.env`; env override not applied

Fix

1. Content fallback + precedence alignment

_parse() / _parse_chunks() in both provider_core and tutorbot: fall back reasoning_content → content, check reasoning_content before reasoning

2. Streaming path

factory.py _runner: when only reasoning chunks are emitted (no _on_content_delta), emit response.content as fallback

3. Book block JSON parsing

code.py, flash_cards.py, timeline.py, deep_dive.py: replace raw json.loads() → parse_json_response() which handles markdown fences, preamble, and json_repair fallback

4. `LLM_REASONING_EFFORT` env var

config.py: read from env and apply as override regardless of resolver path
factory.py: pass config.reasoning_effort to chat_with_retry / chat_stream_with_retry
For DeepSeek: high/max enables thinking, minimal sends thinking: {type: "disabled"}, empty → auto-detect
.env.example / .env.example_CN: documented

5. idea_agent robustness

Guard against JSON array and non-dict responses

Scope note

This PR is a bug fix — it prevents crashes and empty responses when thinking mode is active. Full thinking mode support (leveraging reasoning_content to improve generation quality, token budget allocation, animation pipeline adaptation) is tracked in #430.

Also tracked in #430: book generation UX improvements (force regenerate, failure diagnostics, prompt leakage prevention, i18n).

Test plan

deepseek-v4-flash + LLM_REASONING_EFFORT=minimal: all book blocks generate without errors
deepseek-v4-flash + LLM_REASONING_EFFORT=high: reasoning_content falls back correctly, no empty responses
deepseek-chat / gpt-4o regression: normal operation unchanged
Idea generation handles malformed LLM JSON

…y content field DeepSeek models (v4-flash, reasoner, etc.) return the actual response in the reasoning_content field while leaving content empty when thinking mode is enabled. The _parse() method only fell back to m.reasoning (DashScope-style), causing "LLM returned empty response" errors. Also fix idea_agent to handle LLM responses that are JSON arrays instead of objects with an "ideas" key.

Copilot

Pull request overview

Fixes DeepSeek “thinking mode” responses and improves idea generation robustness by handling alternate response shapes across OpenAI-compatible providers and agents.

Changes:

Update OpenAI-compat _parse() (services provider_core + tutorbot) to fall back to reasoning_content when content is empty.
Update IdeaAgent to accept JSON array payloads by normalizing them into an {"ideas": ...} object.

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 3 comments.

File	Description
deeptutor/tutorbot/providers/openai_compat_provider.py	Adds `reasoning_content` → `content` fallback during response parsing.
deeptutor/services/llm/provider_core/openai_compat_provider.py	Adds the same `reasoning_content` → `content` fallback in the services-layer provider.
deeptutor/agents/question/agents/idea_agent.py	Wraps array-style JSON responses into an object with an `"ideas"` key to prevent crashes.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

…dence - Swap reasoning_content/reasoning fallback order in _parse() to match the precedence used when extracting reasoning_content (per Copilot review) - Add reasoning_content → content fallback in _parse_chunks() for both provider_core and tutorbot providers - Fix streaming path in factory.py: when only reasoning chunks are emitted (no direct content), fall back to response.content so downstream consumers receive non-empty responses - Harden idea_agent against non-dict JSON payloads

- Read LLM_REASONING_EFFORT from environment in config.py - Pass config.reasoning_effort to chat_with_retry/chat_stream_with_retry in both complete() and stream() (was previously dropped) - Set to "low"/"medium"/"high" to enable thinking, leave empty to use automatic detection based on reasoning_model_patterns

DeepSeek only supports high/max for reasoning_effort; minimal is a DeepTutor convention that maps to thinking.type=disabled. Update comments to be accurate for each provider.

Replace fragile json.loads() with parse_json_response() in code, flash_cards, timeline, and deep_dive generators. Handles LLM responses with markdown fences, preamble text, and malformed JSON.

get_llm_config() prefers the resolver path over .env, so the env var was silently ignored. Apply it as an override regardless of which path produced the config.

…-content-fallback # Conflicts: # deeptutor/book/blocks/code.py # deeptutor/book/blocks/deep_dive.py # deeptutor/book/blocks/flash_cards.py # deeptutor/book/blocks/timeline.py

pancacake · 2026-05-04T04:08:37Z

Thanks for your contribution!

Copilot AI review requested due to automatic review settings April 30, 2026 18:38

Copilot started reviewing on behalf of Starfie1d1272 April 30, 2026 18:39 View session

Copilot AI reviewed Apr 30, 2026

View reviewed changes

Comment thread deeptutor/agents/question/agents/idea_agent.py

Comment thread deeptutor/tutorbot/providers/openai_compat_provider.py Outdated

Comment thread deeptutor/services/llm/provider_core/openai_compat_provider.py Outdated

Starfie1d1272 added 6 commits May 1, 2026 03:00

docs: add LLM_REASONING_EFFORT to env example templates

3e0de96

docs: fix LLM_REASONING_EFFORT value descriptions per DeepSeek API docs

44686ab

DeepSeek only supports high/max for reasoning_effort; minimal is a DeepTutor convention that maps to thinking.type=disabled. Update comments to be accurate for each provider.

fix: use robust JSON parser in book block generators

ed9e810

Replace fragile json.loads() with parse_json_response() in code, flash_cards, timeline, and deep_dive generators. Handles LLM responses with markdown fences, preamble text, and malformed JSON.

fix: apply LLM_REASONING_EFFORT env var via resolver path as well

66d0f62

get_llm_config() prefers the resolver path over .env, so the env var was silently ignored. Apply it as an override regardless of which path produced the config.

Starfie1d1272 mentioned this pull request Apr 30, 2026

DeepSeek thinking mode full support & book generation UX improvements #430

Closed

5 tasks

Merge remote-tracking branch 'origin/dev' into fix/deepseek-reasoning…

d23ea03

…-content-fallback # Conflicts: # deeptutor/book/blocks/code.py # deeptutor/book/blocks/deep_dive.py # deeptutor/book/blocks/flash_cards.py # deeptutor/book/blocks/timeline.py

pancacake merged commit b1faa72 into HKUDS:dev May 4, 2026

Starfie1d1272 deleted the fix/deepseek-reasoning-content-fallback branch May 4, 2026 05:28

Starfie1d1272 mentioned this pull request May 6, 2026

fix(llm): stop sending reasoning_effort=minimal as top-level param to providers that reject it #453

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: fallback content to reasoning_content when DeepSeek returns empty content field#428

fix: fallback content to reasoning_content when DeepSeek returns empty content field#428
pancacake merged 8 commits into
HKUDS:devfrom
Starfie1d1272:fix/deepseek-reasoning-content-fallback

Starfie1d1272 commented Apr 30, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

pancacake commented May 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

Starfie1d1272 commented Apr 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Problem

Fix

1. Content fallback + precedence alignment

2. Streaming path

3. Book block JSON parsing

4. LLM_REASONING_EFFORT env var

5. idea_agent robustness

Scope note

Test plan

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

pancacake commented May 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Starfie1d1272 commented Apr 30, 2026 •

edited

Loading

4. `LLM_REASONING_EFFORT` env var