Revert "[Diffusion] Add FLUX.1-dev ModelOpt NVFP4 support (#22574)"#22649
Revert "[Diffusion] Add FLUX.1-dev ModelOpt NVFP4 support (#22574)"#22649
Conversation
This reverts commit 03a1a7b
/tag-and-rerun-ci
Code Review
This pull request simplifies the ModelOpt quantization workflow by removing the NVFP4 mixed-precision builder, hardcoding nibble swapping, and renaming the FP8 conversion tool to convert_modelopt_fp8_checkpoint. It also removes the prefix argument from several linear layer initializations within the FLUX model. Feedback indicates that the removal of the prefix argument is inconsistent in FluxAttention, which could lead to runtime errors, and suggests removing the now-unused prefix parameter from the FluxSingleTransformerBlock constructor.
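For context on the nibble-swapping step that the revert hardcodes, the following is a minimal, illustrative sketch of swapping the two 4-bit halves of each packed NVFP4 byte; the function name and tensor layout are assumptions for illustration, not the checkpoint converter's actual code:

import torch

def swap_nibbles(packed: torch.Tensor) -> torch.Tensor:
    # Two 4-bit values share each uint8 byte; exchange the high and low nibbles.
    assert packed.dtype == torch.uint8
    return ((packed << 4) & 0xFF) | (packed >> 4)

packed = torch.tensor([0x12, 0xAB], dtype=torch.uint8)
print([hex(v) for v in swap_nibbles(packed).tolist()])  # ['0x21', '0xba']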
    bias=bias,
    gather_output=True,
    quant_config=quant_config,
    prefix=f"{prefix}.to_q" if prefix else "to_q",
While this revert correctly removes the prefix argument from this ColumnParallelLinear call, other calls within FluxAttention (e.g., for to_out and to_add_out) still use the prefix argument. This suggests an incomplete revert. If ColumnParallelLinear no longer accepts prefix after this revert, those calls will cause a runtime error. Please ensure the prefix argument is removed from all ColumnParallelLinear calls consistently.
    bias=True,
    gather_output=True,
    quant_config=quant_config,
    prefix=f"{prefix}.proj_mlp" if prefix else "proj_mlp",
This reverts commit 03a1a7b (#22574).
Failed CI run: https://github.com/sgl-project/sglang/actions/runs/24322506533/job/71011288001?pr=22633
/tag-and-rerun-ci,/tag-run-ci-label,/rerun-failed-ci