[Bugfix] fix fuse_allreduce_rms when tp =1 by ZJY0516 · Pull Request #30178 · vllm-project/vllm

ZJY0516 · 2025-12-06T07:01:37Z

Purpose

Fix #24252 (comment)

cc @ProExpertProg @hjjq

Test Plan

Test Result

Signed-off-by: zjy0516 <riverclouds.zhu@qq.com>

gemini-code-assist

Code Review

This pull request introduces a bugfix in AllReduceFusionPass to prevent a potential AttributeError in is_applicable_for_range. The change adds checks to ensure the pass is not applied if it has been disabled during initialization or if a required attribute (max_token_num) is missing. This correctly handles cases where the pass initialization exits early. While the fix is effective, I've noted a small redundancy in the added checks that could be simplified for better code clarity and maintainability.

gemini-code-assist · 2025-12-06T07:03:12Z

vllm/compilation/collective_fusion.py

+        if not hasattr(self, "max_token_num"):
+            logger.warning_once(
+                "AllReduce fusion pass missing max token bound; skipping",
+            )
+            return False


This check for max_token_num appears to be redundant. A review of the __init__ method for AllReduceFusionPass shows that self.disabled is only False if the initialization completes successfully, which includes setting self.max_token_num. In all early-exit scenarios, self.disabled remains True, and self.max_token_num is not set. Therefore, the initial check for self.disabled on line 1191 is sufficient to guard against the AttributeError. Removing this redundant block will make the code cleaner and easier to reason about.
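The reasoning in this comment can be illustrated with a minimal sketch. It is not the actual `AllReduceFusionPass` code: the class name `RangeGatedPass`, the constructor parameters, the `(start, end)` signature, and the `tp_size <= 1` early exit are all assumptions made for illustration. It only shows why a `disabled` flag that starts as `True` and is cleared on the last line of `__init__` is enough to guard every later read of `max_token_num`.

```python
class RangeGatedPass:
    """Hypothetical stand-in for a pass whose __init__ may exit early."""

    def __init__(self, tp_size: int, max_token_num: int = 1024) -> None:
        # Start disabled; only a fully successful init flips this to False.
        self.disabled = True

        if tp_size <= 1:
            # Assumed early exit: with a single rank there is nothing to
            # all-reduce, so max_token_num is never assigned.
            return

        self.max_token_num = max_token_num
        # Cleared last, so disabled == False implies max_token_num exists.
        self.disabled = False

    def is_applicable_for_range(self, start: int, end: int) -> bool:
        # Checking `disabled` first means `max_token_num` is only read
        # after init is known to have completed.
        if self.disabled:
            return False
        return end <= self.max_token_num


if __name__ == "__main__":
    single_rank = RangeGatedPass(tp_size=1)
    print(single_rank.is_applicable_for_range(1, 8))

    multi_rank = RangeGatedPass(tp_size=2, max_token_num=1024)
    print(multi_rank.is_applicable_for_range(1, 512))
```

In this sketch the single-rank instance never reaches the `max_token_num` read, which is the invariant the comment relies on when calling the extra `hasattr` check redundant.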

ProExpertProg · 2025-12-07T16:30:01Z

Thanks, merging!

fix

5d32927

Signed-off-by: zjy0516 <riverclouds.zhu@qq.com>

ZJY0516 requested review from ProExpertProg, youkaichao and zou3519 as code owners December 6, 2025 07:01

ZJY0516 changed the title ~~[Bugfix]~~ [Bugfix] fix fuse_allreduce_rms when tp =1 Dec 6, 2025

gemini-code-assist bot reviewed Dec 6, 2025

fix

d5e7599

Signed-off-by: zjy0516 <riverclouds.zhu@qq.com>

ZJY0516 mentioned this pull request Dec 6, 2025

[Compile] Conditional compilation. Introduce compile_ranges #24252

Merged

3 tasks

ProExpertProg approved these changes Dec 7, 2025

ProExpertProg added the ready label (ONLY add when PR is ready to merge/full CI is needed) Dec 7, 2025

ZJY0516 added 2 commits December 7, 2025 23:49

update

6149fa6

Signed-off-by: zjy0516 <riverclouds.zhu@qq.com>

update

1471462

Signed-off-by: zjy0516 <riverclouds.zhu@qq.com>

ProExpertProg enabled auto-merge (squash) December 7, 2025 16:29

Merge branch 'main' into fix_fuse_allreduce

66f28d7

ProExpertProg disabled auto-merge December 8, 2025 04:07

ProExpertProg enabled auto-merge (squash) December 8, 2025 04:07

ProExpertProg merged commit d143271 into vllm-project:main Dec 8, 2025
48 checks passed

ZJY0516 deleted the fix_fuse_allreduce branch December 8, 2025 07:34

dsuhinin pushed a commit to dsuhinin/vllm that referenced this pull request Jan 21, 2026

[Bugfix] fix fuse_allreduce_rms when tp =1 (vllm-project#30178)

e222d7f

Signed-off-by: zjy0516 <riverclouds.zhu@qq.com> Signed-off-by: dsuhinin <suhinin.dmitriy@gmail.com>

ProExpertProg merged 5 commits into vllm-project:main from ZJY0516:fix_fuse_allreduce