feat(agent): add z.ai/GLM-5 preserved thinking support by neuneu2k · Pull Request #11494 · NousResearch/hermes-agent

neuneu2k · 2026-04-17T08:32:29Z

Enable z.ai/Zhipu GLM-5.x and GLM-4.7 preserved thinking mode for multi-turn agent loops.

Three changes in run_agent.py:

_is_zai_direct() helper — detects zai provider or known z.ai/bigmodel endpoint URLs (api.z.ai, open.bigmodel.cn).
_build_api_kwargs() — injects thinking parameter in extra_body for GLM-5/4.7 models:
- Default: {type: enabled, compact_history: false} (preserved thinking)
- reasoning_config.enabled=false → {type: disabled}
- GLM-4.6/4.5 excluded (they auto-determine thinking)
Message sanitization — re-injects reasoning_content on assistant messages for z.ai so multi-turn reasoning continuity works with compact_history=false.

Response-side extraction was already handled by the generic _extract_reasoning() method (checks reasoning_content field).

Tests: 19 new tests covering detection, parameter injection, config gating, and multi-turn passthrough.

What does this PR do?

The GLM 5 family, and to a lesser degree the 4.7 line, has been trained on preserved interleaved thinking, It's supposed to improve chained tool calling by keeping the reasoning steps in context instead as a short term memory.

This PR enables preserved thinking mode on z.ai models if and only if they are served directly from their inference endpoints.

Related Issue

Fixes Preserved thinking for GLM models when the inference provider supports it.

Type of Change

🐛 Bug fix (non-breaking change that fixes an issue)
✨ New feature (non-breaking change that adds functionality)
🔒 Security fix
📝 Documentation update
✅ Tests (adding or improving test coverage)
♻️ Refactor (no behavior change)
🎯 New skill (bundled or hub)

Code

I've read the Contributing Guide
My commit messages follow Conventional Commits (fix(scope):, feat(scope):, etc.)
I searched for existing PRs to make sure this isn't a duplicate
My PR contains only changes related to this fix/feature (no unrelated commits)
I've run pytest tests/ -q and all tests pass
I've added tests for my changes (required for bug fixes, strongly encouraged for features)
I've tested on my platform: Debian GNU/Linux 12 (bookworm)

Documentation & Housekeeping

I've updated relevant documentation (README, docs/, docstrings) — or N/A
I've updated cli-config.yaml.example if I added/changed config keys — or N/A
I've updated CONTRIBUTING.md or AGENTS.md if I changed architecture or workflows — or N/A
I've considered cross-platform impact (Windows, macOS) per the compatibility guide — or N/A
I've updated tool descriptions/schemas if I changed tool behavior — or N/A

neuneu2k · 2026-04-17T08:52:05Z

I haven't done a pull request in github in ages, my apologies for the quality of the paperwork.

Enable z.ai/Zhipu GLM-5.x and GLM-4.7 preserved thinking mode for multi-turn agent loops. Three changes in run_agent.py: 1. _is_zai_direct() helper — detects zai provider or known z.ai/bigmodel endpoint URLs (api.z.ai, open.bigmodel.cn). 2. _build_api_kwargs() — injects thinking parameter in extra_body for GLM-5/4.7 models: - Default: {type: enabled, compact_history: false} (preserved thinking) - reasoning_config.enabled=false → {type: disabled} - GLM-4.6/4.5 excluded (they auto-determine thinking) 3. Message sanitization — re-injects reasoning_content on assistant messages for z.ai so multi-turn reasoning continuity works with compact_history=false. Response-side extraction was already handled by the generic _extract_reasoning() method (checks reasoning_content field). Tests: 19 new tests covering detection, parameter injection, config gating, and multi-turn passthrough.

neuneu2k marked this pull request as ready for review April 17, 2026 08:49

neuneu2k force-pushed the feature/glm-preserved-thinking branch from cab1736 to cfa2fd0 Compare April 22, 2026 16:55

alt-glitch added type/feature New feature or request P2 Medium — degraded but workaround exists comp/agent Core agent loop, run_agent.py, prompt builder provider/zai ZAI provider labels Apr 22, 2026

This was referenced Apr 27, 2026

Z.AI / GLM via zai provider never returns reasoning_content — Hermes sends extra_body.reasoning (OpenRouter-style) but Z.AI expects extra_body.thinking={"type":"enabled"} #16533

Open

fix(agent): enable reasoning_content for Z.AI/GLM models #16592

Open

nibzard mentioned this pull request May 13, 2026

fix(zai): comprehensive Z.AI/GLM provider support #24915

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(agent): add z.ai/GLM-5 preserved thinking support#11494

feat(agent): add z.ai/GLM-5 preserved thinking support#11494
neuneu2k wants to merge 1 commit into
NousResearch:mainfrom
neuneu2k:feature/glm-preserved-thinking

neuneu2k commented Apr 17, 2026 •

edited

Loading

Uh oh!

neuneu2k commented Apr 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

neuneu2k commented Apr 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Related Issue

Type of Change

Code

Documentation & Housekeeping

Uh oh!

neuneu2k commented Apr 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

neuneu2k commented Apr 17, 2026 •

edited

Loading