fix(vision): Z.AI (智谱 GLM) vision model compatibility by agilejava · Pull Request #19346 · NousResearch/hermes-agent

agilejava · 2026-05-03T20:13:10Z

Problem

Z.AI (智谱 GLM) vision models (glm-4v-flash, glm-4v-plus, glm-4v, etc.) fail when used as the auxiliary.vision provider due to two compatibility issues:

Issue 1: Error 1210 — max_tokens rejected on multimodal calls

Z.AI rejects the max_tokens parameter for vision model requests with error code 1210 ("API 调用参数有误"). The error string does not contain "max_tokens", so the existing unsupported-parameter retry logic in call_llm() never fires. This results in a hard failure with no recovery.

Issue 2: Wrong endpoint inheritance (silent failure)

When the main runtime provider uses Z.AI's Anthropic-compatible endpoint (open.bigmodel.cn/api/anthropic), the vision client inherits this endpoint via resolve_vision_provider_client(). Z.AI's Anthropic wire cannot properly handle image content — models silently fail with responses like "I can't see the image" (received base64 data but couldn't process it through the Anthropic translation layer). This is a silent failure with no HTTP error, making it extremely hard to diagnose.

Changes

resolve_vision_provider_client() — Force Z.AI vision to use the OpenAI-compatible endpoint (open.bigmodel.cn/api/paas/v4) instead of inheriting the Anthropic wire from the main runtime provider.
_build_call_kwargs() — Skip max_tokens for Z.AI vision models (detected by 4v/5v/-v suffix in model name).
_AnthropicCompletionsAdapter — Support _skip_zai_max_tokens flag for cases where the adapter is still used.
_to_openai_base_url() — Rewrite Z.AI Anthropic URLs to OpenAI-compatible path (handles edge case URL rewriting).
call_llm() retry — Detect Z.AI-specific error 1210 and strip max_tokens before retrying.

Testing

Verified with glm-4v-flash on Z.AI free tier — vision analysis works correctly
Main runtime (GLM-4/GLM-5 text models) unaffected — changes only activate for Z.AI vision models
vision_analyze tool returns correct descriptions of test images

Config used for testing

auxiliary:
  vision:
    provider: zai
    model: glm-4v-flash

Main provider: zai with base_url: https://open.bigmodel.cn/api/anthropic

…ax_tokens handling Z.AI (智谱 GLM) vision models (glm-4v-flash, glm-4v-plus, etc.) have two compatibility issues when used through the Anthropic-compatible endpoint: 1. **Error 1210 — max_tokens rejected on multimodal calls**: Z.AI rejects the max_tokens parameter for vision model requests with error code 1210 ("API 调用参数有误"). The error string does not contain "max_tokens", so the existing unsupported-parameter retry logic never fires. 2. **Wrong endpoint inheritance**: When the main runtime provider uses Z.AI's Anthropic-compatible endpoint (open.bigmodel.cn/api/anthropic), the vision client inherits this endpoint. But Z.AI's Anthropic wire cannot properly handle image content — models silently fail ("I can't see the image") or reject max_tokens. Changes: - resolve_vision_provider_client(): force Z.AI vision to use OpenAI-compatible endpoint (open.bigmodel.cn/api/paas/v4) instead of inheriting Anthropic wire - _build_call_kwargs(): skip max_tokens for Z.AI vision models (4v/5v/-v suffix) - _AnthropicCompletionsAdapter: support _skip_zai_max_tokens flag - _to_openai_base_url(): rewrite Z.AI Anthropic URLs to OpenAI-compatible path - call_llm() retry: detect Z.AI error 1210 and strip max_tokens before retry

alt-glitch added type/bug Something isn't working P2 Medium — degraded but workaround exists comp/agent Core agent loop, run_agent.py, prompt builder provider/zai ZAI provider tool/vision Vision analysis and image generation labels May 3, 2026

teknium1 mentioned this pull request May 7, 2026

fix(vision): Z.AI (智谱 GLM) vision model compatibility (salvage #19346) #21268

Merged

teknium1 closed this in #21268 May 7, 2026

This was referenced May 16, 2026

vision_analyze tool fails with GLM (glm-4v-flash) provider: error 1210 request format mismatch #26827

Closed

fix(auxiliary): handle custom glm vision parameter mismatch #26928

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(vision): Z.AI (智谱 GLM) vision model compatibility#19346

fix(vision): Z.AI (智谱 GLM) vision model compatibility#19346
agilejava wants to merge 1 commit into
NousResearch:mainfrom
agilejava:fix/zai-vision-compat

agilejava commented May 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

agilejava commented May 3, 2026

Problem

Issue 1: Error 1210 — max_tokens rejected on multimodal calls

Issue 2: Wrong endpoint inheritance (silent failure)

Changes

Testing

Config used for testing

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants