[New Model] GLM4.7-Flash #17247

Merged
ispobock merged 21 commits into main from temp on Jan 20, 2026

Qiaolin-Yu (Collaborator) commented Jan 17, 2026

Motivation

Coauthored with Xinyuan

Modifications

Usage

Install sglang with this PR:

uv pip install sglang==0.3.2.dev9039+pr-17247.g90c446848 \
  --extra-index-url https://sgl-project.github.io/whl/pr/

Transformers must be installed from this commit: huggingface/transformers@76732b4

uv pip install git+https://github.com/huggingface/transformers.git@76732b4e7120808ff989edbd16401f61fa6a0afa

On Blackwell GPUs, use --speculative-draft-attention-backend triton --attention-backend triton
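
As a rough sketch of how the flags above might be assembled into a server launch command, the helper below builds an argv list for `python -m sglang.launch_server`. The function name `build_launch_cmd`, the `blackwell` switch, and port 8000 (taken from the base_url used in the accuracy test) are illustrative assumptions, not part of this PR; nothing is actually launched.

```python
import shlex


def build_launch_cmd(model="zai-org/GLM-4.7-Flash", port=8000, blackwell=False):
    """Build an argv list for launching an SGLang server (hypothetical helper)."""
    cmd = [
        "python", "-m", "sglang.launch_server",
        "--model-path", model,
        "--port", str(port),
    ]
    if blackwell:
        # Flags suggested in this PR description for Blackwell GPUs.
        cmd += [
            "--speculative-draft-attention-backend", "triton",
            "--attention-backend", "triton",
        ]
    return cmd


if __name__ == "__main__":
    # Print a copy-pasteable command line; the server is not started here.
    print(shlex.join(build_launch_cmd(blackwell=True)))
```

Keeping the command as a list rather than a single string avoids shell-quoting problems if it is later passed to `subprocess.run`.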

Accuracy Tests

lm_eval \
  --model local-completions \
  --model_args model=zai-org/GLM-4.7-Flash,base_url=http://127.0.0.1:8000/v1/completions,tokenized_requests=False,trust_remote_code=True \
  --tasks gsm8k \
  --output_path ./ --batch_size 512
|Tasks|Version|     Filter     |n-shot|  Metric   |   |Value |   |Stderr|
|-----|------:|----------------|-----:|-----------|---|-----:|---|-----:|
|gsm8k|      3|flexible-extract|     5|exact_match|↑  |0.8332|±  |0.0103|
|     |       |strict-match    |     5|exact_match|↑  |0.8340|±  |0.0102|
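
The reported standard errors can be sanity-checked with a few lines of arithmetic. The sketch below assumes the GSM8K test split contains 1,319 problems and that the standard error is the sample standard deviation of per-example 0/1 scores divided by sqrt(n); neither assumption is stated in this PR, and `binary_stderr` is a hypothetical helper name.

```python
import math


def binary_stderr(accuracy, n):
    """Standard error of the mean of n 0/1 scores (sample std / sqrt(n))."""
    return math.sqrt(accuracy * (1.0 - accuracy) / (n - 1))


N_GSM8K_TEST = 1319  # assumption: size of the GSM8K test split

for name, acc in [("flexible-extract", 0.8332), ("strict-match", 0.8340)]:
    print(f"{name}: {acc:.4f} +/- {binary_stderr(acc, N_GSM8K_TEST):.4f}")
```

Under these assumptions the recomputed values round to 0.0103 and 0.0102, matching the table, and the 0.0008 gap between the two filters is well inside one standard error.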

Benchmarking and Profiling

Checklist

Review Process

  1. Ping Merge Oncalls to start the PR flow. See the PR Merge Process.
  2. Get approvals from CODEOWNERS and other reviewers.
  3. Trigger CI tests with comments or contact authorized users to do so.
    • /tag-run-ci-label, /rerun-failed-ci, /tag-and-rerun-ci
  4. After green CI and required approvals, ask Merge Oncalls to merge.

@Qiaolin-Yu Qiaolin-Yu marked this pull request as draft January 17, 2026 06:06
@JustinTong0323 JustinTong0323 self-assigned this Jan 18, 2026
Signed-off-by: Xinyuan Tong <xinyuantong.cs@gmail.com>
JustinTong0323 (Collaborator) commented Jan 18, 2026

Transformers must be installed from this commit: huggingface/transformers@76732b4

uv pip install git+https://github.com/huggingface/transformers.git@76732b4e7120808ff989edbd16401f61fa6a0afa

On Blackwell GPUs, use --flashinfer-mla-disable-ragged

@JustinTong0323 JustinTong0323 changed the title from "wip: Temp" to "[New Model] GLM4.7-Flash" Jan 18, 2026
JustinTong0323 and others added 5 commits January 18, 2026 06:34
Signed-off-by: Xinyuan Tong <xinyuantong.cs@gmail.com>
Signed-off-by: Xinyuan Tong <xinyuantong.cs@gmail.com>
Signed-off-by: Xinyuan Tong <xinyuantong.cs@gmail.com>
@JustinTong0323 JustinTong0323 marked this pull request as ready for review January 19, 2026 16:52
JustinTong0323 (Collaborator) commented Jan 19, 2026

/tag-and-rerun-ci again

Fridge003 and others added 2 commits January 20, 2026 01:10
Signed-off-by: Xinyuan Tong <xinyuantong.cs@gmail.com>
Signed-off-by: Xinyuan Tong <xinyuantong.cs@gmail.com>
JustinTong0323 (Collaborator) commented

/tag-and-rerun-ci

Signed-off-by: Xinyuan Tong <xinyuantong.cs@gmail.com>
Signed-off-by: Xinyuan Tong <xinyuantong.cs@gmail.com>
@b8zhong b8zhong added the format Auto Format Code label Jan 19, 2026
@github-actions github-actions Bot removed the format Auto Format Code label Jan 19, 2026
JustinTong0323 (Collaborator) commented

/tag-and-rerun-ci

@ispobock ispobock merged commit 76b06be into main Jan 20, 2026
305 of 329 checks passed
@ispobock ispobock deleted the temp branch January 20, 2026 15:44
Todobe pushed a commit to Todobe/sgl-sglang that referenced this pull request Mar 3, 2026
Co-authored-by: zRzRzRzRzRzRzR <2448370773@qq.com>
Co-authored-by: JustinTong0323 <justinning0323@gmail.com>
Co-authored-by: Xinyuan Tong <xinyuantong.cs@gmail.com>
Co-authored-by: Baizhou Zhang <sobereddiezhang@gmail.com>