Skip to content

[Diffusion][CI] Fix nunchaku unit test broken by #22365#22560

Merged
hnyls2002 merged 1 commit intomainfrom
fix-nunchaku-unit-test-mock
Apr 11, 2026
Merged

[Diffusion][CI] Fix nunchaku unit test broken by #22365#22560
hnyls2002 merged 1 commit intomainfrom
fix-nunchaku-unit-test-mock

Conversation

@alisonshao
Copy link
Copy Markdown
Collaborator

Summary

  • Mock maybe_download_model in test_resolve_transformer_quant_load_spec_keeps_nunchaku_hook to prevent it from trying to download a fake local path (/tmp/svdq-int4_r32.safetensors) as an HF repo
  • [Diffusion] modelopt diffusion fp8 support for flux1/flux2 and wan2.2 #22365 added _resolve_quant_config_from_transformer_override which calls maybe_download_model on the transformer weights path, but the test uses a non-existent /tmp path that fails HF Hub validation
  • CI failure example

Test plan

  • multimodal-gen-unit-test CI job passes

Mock maybe_download_model in test_resolve_transformer_quant_load_spec_keeps_nunchaku_hook
to prevent it from trying to download a fake local path as an HF repo.

#22365 added _resolve_quant_config_from_transformer_override which calls
maybe_download_model on the transformer_weights_path, but the test uses
a non-existent /tmp path that fails HF Hub validation.
@gemini-code-assist
Copy link
Copy Markdown
Contributor

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

@github-actions github-actions Bot added quant LLM Quantization diffusion SGLang Diffusion labels Apr 10, 2026
@alisonshao
Copy link
Copy Markdown
Collaborator Author

/rerun-stage multimodal-gen-unit-test

1 similar comment
@alisonshao
Copy link
Copy Markdown
Collaborator Author

/rerun-stage multimodal-gen-unit-test

@github-actions
Copy link
Copy Markdown
Contributor

❌ Stage multimodal-gen-unit-test doesn't support isolated runs yet.

NVIDIA stages:

  • stage-a-test-1-gpu-small
  • stage-a-test-cpu
  • stage-b-test-1-gpu-small
  • stage-b-test-1-gpu-large
  • stage-b-test-2-gpu-large
  • stage-b-test-4-gpu-b200
  • stage-c-test-4-gpu-h100
  • stage-c-test-8-gpu-h200
  • stage-c-test-8-gpu-h20
  • stage-c-test-4-gpu-b200
  • stage-c-test-4-gpu-gb200
  • stage-c-test-deepep-4-gpu-h100
  • stage-c-test-deepep-8-gpu-h200
  • multimodal-gen-test-1-gpu
  • multimodal-gen-test-2-gpu
  • multimodal-gen-component-accuracy-1-gpu
  • multimodal-gen-component-accuracy-2-gpu
  • multimodal-gen-test-1-b200

AMD stages:

  • sgl-kernel-unit-test-amd
  • sgl-kernel-unit-test-2-gpu-amd
  • stage-a-test-1-gpu-small-amd
  • stage-b-test-1-gpu-small-amd
  • stage-b-test-1-gpu-small-amd-nondeterministic
  • stage-b-test-1-gpu-small-amd-mi35x
  • stage-b-test-1-gpu-large-amd
  • stage-b-test-2-gpu-large-amd
  • multimodal-gen-test-1-gpu-amd
  • multimodal-gen-test-2-gpu-amd
  • stage-c-test-large-8-gpu-amd
  • stage-c-test-large-8-gpu-amd-mi35x

Other stages will be added soon. For now, use /rerun-failed-ci for those stages.

1 similar comment
@github-actions
Copy link
Copy Markdown
Contributor

❌ Stage multimodal-gen-unit-test doesn't support isolated runs yet.

NVIDIA stages:

  • stage-a-test-1-gpu-small
  • stage-a-test-cpu
  • stage-b-test-1-gpu-small
  • stage-b-test-1-gpu-large
  • stage-b-test-2-gpu-large
  • stage-b-test-4-gpu-b200
  • stage-c-test-4-gpu-h100
  • stage-c-test-8-gpu-h200
  • stage-c-test-8-gpu-h20
  • stage-c-test-4-gpu-b200
  • stage-c-test-4-gpu-gb200
  • stage-c-test-deepep-4-gpu-h100
  • stage-c-test-deepep-8-gpu-h200
  • multimodal-gen-test-1-gpu
  • multimodal-gen-test-2-gpu
  • multimodal-gen-component-accuracy-1-gpu
  • multimodal-gen-component-accuracy-2-gpu
  • multimodal-gen-test-1-b200

AMD stages:

  • sgl-kernel-unit-test-amd
  • sgl-kernel-unit-test-2-gpu-amd
  • stage-a-test-1-gpu-small-amd
  • stage-b-test-1-gpu-small-amd
  • stage-b-test-1-gpu-small-amd-nondeterministic
  • stage-b-test-1-gpu-small-amd-mi35x
  • stage-b-test-1-gpu-large-amd
  • stage-b-test-2-gpu-large-amd
  • multimodal-gen-test-1-gpu-amd
  • multimodal-gen-test-2-gpu-amd
  • stage-c-test-large-8-gpu-amd
  • stage-c-test-large-8-gpu-amd-mi35x

Other stages will be added soon. For now, use /rerun-failed-ci for those stages.

@alisonshao
Copy link
Copy Markdown
Collaborator Author

@hnyls2002 hnyls2002 merged commit 75223c5 into main Apr 11, 2026
86 of 93 checks passed
@hnyls2002 hnyls2002 deleted the fix-nunchaku-unit-test-mock branch April 11, 2026 00:49
pyc96 pushed a commit to pyc96/sglang that referenced this pull request Apr 14, 2026
…gl-project#22560)

Co-authored-by: Alison Shao <alison.shao@MacBook-Pro-D2W773R9CD.local>
yhyang201 pushed a commit to yhyang201/sglang that referenced this pull request Apr 22, 2026
…gl-project#22560)

Co-authored-by: Alison Shao <alison.shao@MacBook-Pro-D2W773R9CD.local>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

diffusion SGLang Diffusion high priority quant LLM Quantization

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants