Support Kimi-K2.5 model by yhyang201 · Pull Request #17789 · sgl-project/sglang

yhyang201 · 2026-01-27T01:18:48Z

Motivation

Modifications

Accuracy Tests

Benchmarking and Profiling

Checklist

Format your code according to the Format code with pre-commit.
Add unit tests according to the Run and add unit tests.
Update documentation according to Write documentations.
Provide accuracy and speed benchmark results according to Test the accuracy and Benchmark the speed.
Follow the SGLang code style guidance.

Review Process

Ping Merge Oncalls to start the PR flow. See the PR Merge Process.
Get approvals from CODEOWNERS and other reviewers.
Trigger CI tests with comments or contact authorized users to do so.
- /tag-run-ci-label, /rerun-failed-ci, /tag-and-rerun-ci
After green CI and required approvals, ask Merge Oncalls to merge.

yhyang201 · 2026-01-27T01:19:00Z

/tag-and-rerun-ci

mickqian · 2026-01-27T01:27:42Z

/rerun-failed-ci

mickqian · 2026-01-27T01:28:25Z

/rerun-failed-ci

yhyang201 · 2026-01-27T01:55:24Z

/rerun-failed-ci

yhyang201 · 2026-01-27T02:25:25Z

/rerun-failed-ci

yhyang201 · 2026-01-27T02:55:26Z

/rerun-failed-ci

mickqian · 2026-01-27T02:55:58Z

already tested on a previous PR, bypassing

yayoimizuha · 2026-01-27T15:14:39Z

Tested with 547e2d0

It doesn't work well with moonshotai/Kimi-K2.5 when using --reasoning-parser kimi_k2, but it seems to work fine when using --reasoning-parser deepseek-r1 instead.

yayoimizuha · 2026-01-27T15:59:55Z

I’m launching it with --tool-call-parser kimi_k2, but the tool calling is always fails. Could there be an issue here as well?
By the way, the results from K2-Vendor-Verifier seem to be fine. This script naively parses the responses on the script side, and it appears to behave differently from the typical OpenAI-compatible API function calling.

{
  "model": "moonshotai/Kimi-K2.5",
  "success_count": 1999,
  "failure_count": 1,
  "finish_stop": 902,
  "finish_tool_calls": 1079,
  "finish_others": 18,
  "finish_others_detail": {
    "length": 18
  },
  "schema_validation_error_count": 7,
  "successful_tool_call_count": 1072,
  "usage": {
    "prompt_tokens": 0,
    "completion_tokens": 0,
    "total_tokens": 0
  },
  "eval_started_at": "2026-01-27T15:32:35.895384",
  "eval_finished_at": "2026-01-27T15:50:13.615195",
  "eval_duration_ms": 1057719
}

yhyang201 · 2026-01-29T02:49:10Z

yayoimizuha

Your feedback is very important. I’ll look into it.

JustinTong0323 · 2026-01-29T08:20:46Z

Hi @yayoimizuha, we sincerely appreciate your feedback. Regarding the two issues you pointed out, namely the reasoning parser and the tool call parser, they should have been resolved in the latest main. Would you kindly give it another try? We are truly grateful for your support and understanding.

Co-authored-by: Mick <mickjagger19@icloud.com>

yayoimizuha · 2026-01-31T01:58:59Z

Thank you. Upon checking the latest commit(https://github.com/sgl-project/sglang/tree/5d00150e9965f467399236ebf7819bbff5e385bb), it seems that the Reasoning part now works correctly.

There are still issues with tool calls that need to be considered and addressed.
Regarding tool calls, there's still a possibility that my own settings are incorrect, so I'm continuing to investigate.
Roo-Code's output(The model did not provide any response content. This may indicate an issue with the API or the model's output. is outputting infinitely.):

Date/time: 2026-01-31T01:51:57.066Z
Extension version: 3.46.0
Provider: openai (proxy)
Model: moonshotai/Kimi-K2.5

Unexpected API Response: The language model did not provide any assistant messages. This may indicate an issue with the API or the model's output.

Doesn't work correctly in OpenCode either.

{
    "$schema": "https://opencode.ai/config.json",
    "provider": {
        "llm-provider-mizuha-dev-com": {
            "npm": "@ai-sdk/openai-compatible",
            "name": "llm-provider.mizuha-dev.com",
            "options": {
                "baseURL": "https://llm-provider.mizuha-dev.com/v1"
            },
            "models": {
                "moonshotai/Kimi-K2.5": {
                    "name": "Kimi K2.5"
                }
            }
        }
    }
}

As noted, the following pull request sets xgrammar==0.1.31 & SGLANG_TOOL_STRICT_LEVEL=2.
#17914

eraser00 · 2026-02-03T09:44:48Z

I’m launching it with --tool-call-parser kimi_k2, but the tool calling is always fails. Could there be an issue here as well? By the way, the results from K2-Vendor-Verifier seem to be fine. This script naively parses the responses on the script side, and it appears to behave differently from the typical OpenAI-compatible API function calling.
{
  "model": "moonshotai/Kimi-K2.5",
  "success_count": 1999,
  "failure_count": 1,
  "finish_stop": 902,
  "finish_tool_calls": 1079,
  "finish_others": 18,
  "finish_others_detail": {
    "length": 18
  },
  "schema_validation_error_count": 7,
  "successful_tool_call_count": 1072,
  "usage": {
    "prompt_tokens": 0,
    "completion_tokens": 0,
    "total_tokens": 0
  },
  "eval_started_at": "2026-01-27T15:32:35.895384",
  "eval_finished_at": "2026-01-27T15:50:13.615195",
  "eval_duration_ms": 1057719
}

If --reasoning-parser kimi_k2 and --tool-call-parser kimi_k2 are both enabled, the schema_validation_error_count should be 0 in K2-Vendor-Verifier with the sglang latest main.

Could you please help to provide all these 7 schema_validation failing cases?

yayoimizuha · 2026-02-04T05:55:33Z

I didn't quite understand how to extract those seven items, so I'm sending you the entire results.jsonl file. It's compressed with zstd, so thank you for your help.

https://mizuha-dev.com/files/results.jsonl.zst

Co-authored-by: Mick <mickjagger19@icloud.com>

mickqian and others added 24 commits January 26, 2026 16:17

init

6b94d49

add config

933aae1

model file init

645b396

update

611089e

load failed

40163d2

load succeed

d3a3cef

warmup succeed

1e5e1ae

add K2VLForConditionalGeneration

449311b

upd

c3fdbb2

upd

3a32397

clean

069ea10

fix

609a030

clean

8c9fd34

upd

34eb4ba

update

0565e31

fix non-thinking bugs

7a686f1

clean

474e8fc

fix mm dp attention bug

644a2c4

fixed launch errors in other models.

8cb404c

fix rebase

656003f

fix

267e8a8

fix

b8a190b

remove

cddf230

remove wrong copyright

791d6cc

yhyang201 requested review from Fridge003, JustinTong0323, Qiaolin-Yu, ispobock, merrymercy and mickqian as code owners January 27, 2026 01:18

yhyang201 requested review from CatherineSue, hebiao064 and slin1237 as code owners January 27, 2026 01:18

github-actions Bot added the Multi-modal multi-modal language model label Jan 27, 2026

github-actions Bot added the run-ci label Jan 27, 2026

sgl-project deleted a comment from gemini-code-assist Bot Jan 27, 2026

This comment was marked as resolved.

Sign in to view

mickqian added the high priority label Jan 27, 2026

yhyang201 changed the title ~~wip~~ Support Kimi-K2.5 model Jan 27, 2026

mickqian approved these changes Jan 27, 2026

View reviewed changes

mickqian merged commit 479ab7a into sgl-project:main Jan 27, 2026
189 of 217 checks passed

Chen-0210 pushed a commit to Chen-0210/sglang that referenced this pull request Jan 30, 2026

model: support Kimi-K2.5 (sgl-project#17789)

ddcb446

Co-authored-by: Mick <mickjagger19@icloud.com>

yayoimizuha mentioned this pull request Jan 31, 2026

[Bug] Kimi-k2.5 Reasoning Parser Not Working #17873

Closed

5 tasks

Johnsonms pushed a commit to Johnsonms/sglang that referenced this pull request Feb 14, 2026

model: support Kimi-K2.5 (sgl-project#17789)

1fa9c50

Co-authored-by: Mick <mickjagger19@icloud.com>

ch45er mentioned this pull request Mar 7, 2026

[Bug] Clarify Kimi K2.5 model support #20096

Open

5 tasks

yhyang201 deleted the new_branch branch April 16, 2026 07:15

Conversation

yhyang201 commented Jan 27, 2026

Motivation

Modifications

Accuracy Tests

Benchmarking and Profiling

Checklist

Review Process

Uh oh!

yhyang201 commented Jan 27, 2026

Uh oh!

This comment was marked as resolved.

Uh oh!

mickqian commented Jan 27, 2026

Uh oh!

mickqian commented Jan 27, 2026

Uh oh!

yhyang201 commented Jan 27, 2026

Uh oh!

yhyang201 commented Jan 27, 2026

Uh oh!

yhyang201 commented Jan 27, 2026

Uh oh!

mickqian commented Jan 27, 2026

Uh oh!

Uh oh!

yayoimizuha commented Jan 27, 2026

Uh oh!

yayoimizuha commented Jan 27, 2026

Uh oh!

yhyang201 commented Jan 29, 2026

Uh oh!

JustinTong0323 commented Jan 29, 2026

Uh oh!

yayoimizuha commented Jan 31, 2026

Uh oh!

eraser00 commented Feb 3, 2026

Uh oh!

yayoimizuha commented Feb 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

yayoimizuha commented Feb 4, 2026 •

edited

Loading