[GLM-4.7] GLM Model support for GLM-Lite by zRzRzRzRzRzRzR · Pull Request #31386 · vllm-project/vllm

zRzRzRzRzRzRzR · 2025-12-26T10:06:30Z

using with transformers 5.0.0 with GLM-Lite model， transformers PR here

Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>

chatgpt-codex-connector · 2025-12-26T10:06:38Z

Codex usage limits have been reached for code reviews. Please check with the admins of this repo to increase the limits by adding credits.

gemini-code-assist

Code Review

This pull request adds support for the GLM-Lite model and its MTP (Multi-Token Prediction) variant for speculative decoding. The changes include new model implementation files, updates to model registries, and modifications to benchmark configurations. The implementation appears to leverage existing patterns from models like DeepseekV2. My review has identified a critical issue in the test configuration that could lead to incorrect testing, and a high-severity type hint error in the new model implementation.

Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>

mergify · 2026-01-05T17:42:52Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @zRzRzRzRzRzRzR.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

Signed-off-by: Yuxuan Zhang <2448370773@qq.com>

Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>

zRzRzRzRzRzRzR · 2026-01-17T07:43:16Z

like this?

Signed-off-by: Roger Wang <hey@rogerw.io>

Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com> Signed-off-by: Yuxuan Zhang <2448370773@qq.com>

zRzRzRzRzRzRzR added 5 commits December 26, 2025 15:35

draft

8da8276

Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>

format

89ffdc6

Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>

test

d40e2f0

Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>

test model

1458fdf

Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>

mtp

d4984b6

Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>

zRzRzRzRzRzRzR requested review from DarkLight1337, ProExpertProg, WoosukKwon, hmellor, houseroad, mgoin, robertgshaw2-redhat, tlrmchlsmth, yewentao256, youkaichao and ywang96 as code owners December 26, 2025 10:06

zRzRzRzRzRzRzR marked this pull request as draft December 26, 2025 10:06

mergify Bot added new-model Requests to new models performance Performance-related issues labels Dec 26, 2025

gemini-code-assist Bot reviewed Dec 26, 2025

View reviewed changes

Comment thread tests/models/registry.py Outdated

Comment thread vllm/model_executor/models/glm4_moe_lite.py

mtp

25e0748

Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>

zRzRzRzRzRzRzR marked this pull request as ready for review December 27, 2025 06:18

zRzRzRzRzRzRzR added 3 commits December 27, 2025 14:18

Merge branch 'main' into glm

f238639

Merge branch 'vllm-project:main' into glm

aa188f9

Merge branch 'main' into glm

bb5daa3

mergify Bot added the needs-rebase label Jan 5, 2026

zRzRzRzRzRzRzR changed the title ~~GLM Testing~~ [GLM-4.7] GLM Model support for GLM-Lite Jan 15, 2026

Merge branch 'main' into glm

ef38ed6

Signed-off-by: Yuxuan Zhang <2448370773@qq.com>

mergify Bot added the tool-calling label Jan 16, 2026

github-project-automation Bot added this to Tool Calling Jan 16, 2026

DarkLight1337 reviewed Jan 17, 2026

View reviewed changes

Comment thread tests/models/registry.py Outdated

add is_available_online

8fa6a2b

Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>

DarkLight1337 reviewed Jan 17, 2026

View reviewed changes

Comment thread tests/models/registry.py Outdated

add is_available_online

394c00f

Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>

DarkLight1337 approved these changes Jan 17, 2026

View reviewed changes

DarkLight1337 enabled auto-merge (squash) January 17, 2026 07:47

github-actions Bot added the ready ONLY add when PR is ready to merge/full CI is needed label Jan 17, 2026

ywang96 added 4 commits January 18, 2026 21:28

Merge branch 'main' into glm

72290cd

typo

3d5ffd6

fix import

bb4f7e0

Signed-off-by: Roger Wang <hey@rogerw.io>

typing

c1393c5

Signed-off-by: Roger Wang <hey@rogerw.io>

ywang96 disabled auto-merge January 19, 2026 09:18

ywang96 merged commit 71832ba into vllm-project:main Jan 19, 2026
27 of 56 checks passed

github-project-automation Bot moved this to Done in Tool Calling Jan 19, 2026

DocShotgun mentioned this pull request Jan 19, 2026

Feature Request: Support Glm4MoeLiteForCausalLM ggml-org/llama.cpp#18931

Closed

4 tasks

gopalsarda pushed a commit to gopalsarda/vllm that referenced this pull request Jan 20, 2026

[GLM-4.7] GLM Model support for GLM-Lite (vllm-project#31386)

5d3235b

Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com> Signed-off-by: Yuxuan Zhang <2448370773@qq.com>

zRzRzRzRzRzRzR deleted the glm branch January 21, 2026 05:37

ai-infos mentioned this pull request Jan 25, 2026

[Doc]: Share Working / Failed Models nlzy/vllm-gfx906#29

Open

mystous pushed a commit to mystous/vllm_hybrid that referenced this pull request May 10, 2026

[GLM-4.7] GLM Model support for GLM-Lite (vllm-project#31386)

e2519b0

Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com> Signed-off-by: Yuxuan Zhang <2448370773@qq.com>

my-other-github-account pushed a commit to my-other-github-account/vllm that referenced this pull request May 15, 2026

[GLM-4.7] GLM Model support for GLM-Lite (vllm-project#31386)

069617f

Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com> Signed-off-by: Yuxuan Zhang <2448370773@qq.com>

my-other-github-account pushed a commit to my-other-github-account/vllm that referenced this pull request May 15, 2026

[GLM-4.7] GLM Model support for GLM-Lite (vllm-project#31386)

cb7d3bb

Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com> Signed-off-by: Yuxuan Zhang <2448370773@qq.com>

0826joyce pushed a commit to 0826joyce/vllm-serving-optimization that referenced this pull request May 19, 2026

[GLM-4.7] GLM Model support for GLM-Lite (vllm-project#31386)

41c94ff

Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com> Signed-off-by: Yuxuan Zhang <2448370773@qq.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[GLM-4.7] GLM Model support for GLM-Lite#31386

[GLM-4.7] GLM Model support for GLM-Lite#31386
ywang96 merged 21 commits into
vllm-project:mainfrom
zRzRzRzRzRzRzR:glm

zRzRzRzRzRzRzR commented Dec 26, 2025 •

edited

Loading

Uh oh!

chatgpt-codex-connector Bot commented Dec 26, 2025

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

Uh oh!

mergify Bot commented Jan 5, 2026

Uh oh!

Uh oh!

Uh oh!

zRzRzRzRzRzRzR commented Jan 17, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

zRzRzRzRzRzRzR commented Dec 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

chatgpt-codex-connector Bot commented Dec 26, 2025

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

mergify Bot commented Jan 5, 2026

Uh oh!

Uh oh!

Uh oh!

zRzRzRzRzRzRzR commented Jan 17, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

zRzRzRzRzRzRzR commented Dec 26, 2025 •

edited

Loading