Skip to content

[RL] Fix weight update for mxfp8 flashinfer_cutlass gemm backend#22484

Merged
b8zhong merged 3 commits intosgl-project:mainfrom
zianglih:fi-ctlass-fix
Apr 12, 2026
Merged

[RL] Fix weight update for mxfp8 flashinfer_cutlass gemm backend#22484
b8zhong merged 3 commits intosgl-project:mainfrom
zianglih:fi-ctlass-fix

Conversation

@zianglih
Copy link
Copy Markdown
Contributor

@zianglih zianglih commented Apr 10, 2026

Motivation

@HumansAnd

#21576 refactors mxfp8 scaling factor swizzling to in-place style. However, on the flashinfer_cutlass mxfp8 code path, block_scale_interleave may pad the scales, violating the shape contract for weight update. Therefore, we revert to the previous original & swizzled dual buffer approach.

For the full mxfp8 DeepSeek 671B model, all the duplicate ue8m0 accounts for less than 1gb, thus the overhead is negligible.

In the future we should rely on a restore_weights_before_loading api which is still under development.

Modifications

Accuracy Tests

Speed Tests and Profiling

Checklist

Review and Merge Process

  1. Ping Merge Oncalls to start the process. See the PR Merge Process.
  2. Get approvals from CODEOWNERS and other reviewers.
  3. Trigger CI tests with comments or contact authorized users to do so.
    • Common commands include /tag-and-rerun-ci, /tag-run-ci-label, /rerun-failed-ci
  4. After green CI and required approvals, ask Merge Oncalls or people with Write permission to merge the PR.

@gemini-code-assist
Copy link
Copy Markdown
Contributor

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

@zianglih
Copy link
Copy Markdown
Contributor Author

Hi @b8zhong , could you take a look at the tiny reverting fix? Thank you!

@b8zhong
Copy link
Copy Markdown
Collaborator

b8zhong commented Apr 11, 2026

/tag-and-rerun-ci

@b8zhong b8zhong enabled auto-merge (squash) April 11, 2026 03:24
@zianglih
Copy link
Copy Markdown
Contributor Author

/rerun-failed-ci

2 similar comments
@zianglih
Copy link
Copy Markdown
Contributor Author

/rerun-failed-ci

@zianglih
Copy link
Copy Markdown
Contributor Author

/rerun-failed-ci

@b8zhong b8zhong merged commit 31453bb into sgl-project:main Apr 12, 2026
108 of 113 checks passed
@zianglih zianglih deleted the fi-ctlass-fix branch April 24, 2026 17:38
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants