
server : preserve anthropic thinking blocks in conversion #20120

Merged
ngxson merged 2 commits into ggml-org:master from
T0mSIlver:fix/20090-anthropic-thinking-conversion
Mar 6, 2026

Conversation

@T0mSIlver
Contributor

@T0mSIlver T0mSIlver commented Mar 5, 2026

Fix Anthropic /v1/messages conversion to preserve assistant thinking blocks as reasoning_content when converting to internal OpenAI-compatible chat messages.

Fixes #20090.
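
The behavior being fixed can be sketched in Python (the server implementation is C++; this is an illustrative model of what `convert_anthropic_to_oai` should do, not the actual patch):

```python
def convert_anthropic_to_oai(messages):
    """Convert Anthropic-style messages to OpenAI-compatible chat messages,
    carrying 'thinking' blocks over as reasoning_content (illustrative sketch)."""
    out = []
    for msg in messages:
        content = msg["content"]
        if isinstance(content, str):
            # plain-string content needs no block handling
            out.append({"role": msg["role"], "content": content})
            continue
        text_parts = []
        reasoning_parts = []
        for block in content:
            if block["type"] == "text":
                text_parts.append(block["text"])
            elif block["type"] == "thinking":
                # before the fix, these blocks were silently dropped
                reasoning_parts.append(block["thinking"])
        oai_msg = {"role": msg["role"], "content": "".join(text_parts)}
        if reasoning_parts:
            oai_msg["reasoning_content"] = "".join(reasoning_parts)
        out.append(oai_msg)
    return out

msgs = [{"role": "assistant", "content": [
    {"type": "thinking", "thinking": "Let me check the docs."},
    {"type": "text", "text": "Here is the answer."},
]}]
converted = convert_anthropic_to_oai(msgs)
```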

AI usage disclosure:

  • AI was used in an assistive role for code review suggestions, small implementation adjustments, and command execution support.
  • I manually reviewed the final patch and validation results.

@T0mSIlver T0mSIlver changed the title server : preserve anthropic thinking blocks in conversion (#20090) server : preserve anthropic thinking blocks in conversion Mar 5, 2026
@T0mSIlver T0mSIlver force-pushed the fix/20090-anthropic-thinking-conversion branch from e01f9e3 to 09b3429 Compare March 5, 2026 00:13
Collaborator

@ngxson ngxson left a comment

please add a test case with /apply-template to prove that this works

@T0mSIlver T0mSIlver force-pushed the fix/20090-anthropic-thinking-conversion branch from 2650560 to 330df34 Compare March 5, 2026 13:46
@T0mSIlver
Contributor Author

Added two test cases:

  • test_anthropic_thinking_history_in_count_tokens : sends Anthropic-format messages with interleaved thinking + tool use through /v1/messages/count_tokens and verifies thinking blocks increase the token count (i.e. they're not silently dropped by convert_anthropic_to_oai).
  • test_anthropic_thinking_history_in_template : uses /apply-template with the Qwen3 template to verify the converted reasoning_content renders inside <think> tags in the prompt.

Both use the Qwen3 template which natively handles reasoning_content.
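
The template-side check leans on Qwen3's ChatML-style rendering. A minimal local sketch of that behavior (the real template is Jinja embedded in the model file; this simplified renderer is an assumption for illustration):

```python
def render_qwen3_like(messages):
    """Rough model of how a Qwen3-style chat template renders messages:
    ChatML <|im_start|>/<|im_end|> framing, with assistant reasoning_content
    wrapped in <think>...</think> before the visible text."""
    parts = []
    for m in messages:
        body = m["content"]
        if m["role"] == "assistant" and m.get("reasoning_content"):
            body = f"<think>\n{m['reasoning_content']}\n</think>\n\n{body}"
        parts.append(f"<|im_start|>{m['role']}\n{body}<|im_end|>\n")
    return "".join(parts)

prompt = render_qwen3_like([{
    "role": "assistant",
    "content": "Final answer.",
    "reasoning_content": "Step-by-step reasoning.",
}])
```

If the conversion dropped the thinking blocks, no reasoning_content would reach the template and the `<think>` section would be absent from the rendered prompt, which is exactly what the test detects.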

@T0mSIlver T0mSIlver requested a review from ngxson March 5, 2026 13:48
@github-actions github-actions bot added the python python script changes label Mar 5, 2026
@florianbrede-ayet

@T0mSIlver just wanted to open a PR for the same issue when I came across yours. I've been struggling with the same problems with qwen35 models. The core fix is basically identical to mine; I compiled it against ROCm and it solves the issue with Claude Code.

On a side note, it also fixes the autoparser branch, where after some turns you'd previously get 500 {"error":{"code":500,"message":"Failed to parse input at pos 67: ","type":"server_error"}} (or failures at other low positions within malformed thinking blocks).

@ngxson ngxson merged commit e68f2fb into ggml-org:master Mar 6, 2026
73 of 81 checks passed

Labels

examples python python script changes server

Development

Successfully merging this pull request may close these issues.

Eval bug: Anthropic Messages API drops thinking content blocks during conversion

3 participants