Skip to content

[DSA] Enable all reduce fusion for DSA models #22390

Merged
Fridge003 merged 2 commits intomainfrom
dsa-allreduce-fusion
Apr 9, 2026
Merged

[DSA] Enable all reduce fusion for DSA models #22390
Fridge003 merged 2 commits intomainfrom
dsa-allreduce-fusion

Conversation

@Fridge003
Copy link
Copy Markdown
Collaborator

Motivation

Including DeepSeek V3.2 and GLM-5

Modifications

Accuracy Tests

Speed Tests and Profiling

Checklist

Review and Merge Process

  1. Ping Merge Oncalls to start the process. See the PR Merge Process.
  2. Get approvals from CODEOWNERS and other reviewers.
  3. Trigger CI tests with comments or contact authorized users to do so.
    • Common commands include /tag-and-rerun-ci, /tag-run-ci-label, /rerun-failed-ci
  4. After green CI and required approvals, ask Merge Oncalls or people with Write permission to merge the PR.

@gemini-code-assist
Copy link
Copy Markdown
Contributor

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

@Fridge003
Copy link
Copy Markdown
Collaborator Author

/rerun-stage stage-c-test-8-gpu-h200

@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented Apr 9, 2026

✅ Triggered stage-c-test-8-gpu-h200 to run independently (skipping dependencies). View workflow run

@Fridge003
Copy link
Copy Markdown
Collaborator Author

/rerun-test test_dsa_models_basic.py

@Fridge003
Copy link
Copy Markdown
Collaborator Author

/rerun-test test_dsa_models_mtp.py

@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented Apr 9, 2026

8-gpu-h200 (1 test): View workflow run

cd test/ && python3 registered/8-gpu-models/test_dsa_models_basic.py

@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented Apr 9, 2026

8-gpu-h200 (1 test): View workflow run

cd test/ && python3 registered/8-gpu-models/test_dsa_models_mtp.py

@Fridge003
Copy link
Copy Markdown
Collaborator Author

/rerun-test test_deepseek_v32_fp4_4gpu.py

@Fridge003
Copy link
Copy Markdown
Collaborator Author

/rerun-test test_deepseek_v32_fp4_mtp_4gpu.py

@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented Apr 9, 2026

4-gpu-b200 (1 test): View workflow run

cd test/ && python3 registered/quant/test_deepseek_v32_fp4_4gpu.py

@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented Apr 9, 2026

4-gpu-b200 (1 test): View workflow run

cd test/ && python3 registered/quant/test_deepseek_v32_fp4_mtp_4gpu.py

@Fridge003
Copy link
Copy Markdown
Collaborator Author

/rerun-test test/registered/cp/test_deepseek_v32_cp_single_node.py

@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented Apr 9, 2026

8-gpu-h200 (1 test): View workflow run

cd test/ && python3 registered/cp/test_deepseek_v32_cp_single_node.py

@Fridge003 Fridge003 merged commit 606aa11 into main Apr 9, 2026
85 of 97 checks passed
@Fridge003 Fridge003 deleted the dsa-allreduce-fusion branch April 9, 2026 19:42
yhyang201 pushed a commit to yhyang201/sglang that referenced this pull request Apr 22, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants