Skip to content

[MoE] Move flashinfer cutlass moe to fused_moe/experts/#41977

Closed
bnellnm wants to merge 6 commits into
vllm-project:mainfrom
neuralmagic:move-flashinfer-cutlass-moe-to-experts
Closed

[MoE] Move flashinfer cutlass moe to fused_moe/experts/#41977
bnellnm wants to merge 6 commits into
vllm-project:mainfrom
neuralmagic:move-flashinfer-cutlass-moe-to-experts

Conversation

@bnellnm

@bnellnm bnellnm commented May 7, 2026

Copy link
Copy Markdown
Collaborator

Purpose

Rename flashinfer_cutlass_moe.py to experts/flashinfer_cutlass_moe.py and update all references (source, tests, docs).

Forked from #40571

cc: @Jackmin801 , @robertgshaw2-redhat

Test Plan

Test Result


Essential Elements of an Effective PR Description Checklist
  • The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
  • The test plan, such as providing test command.
  • The test results, such as pasting the results comparison before and after, or e2e results
  • (Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.

Jackmin801 and others added 6 commits April 22, 2026 02:34
Rename flashinfer_cutlass_moe.py to experts/flashinfer_cutlass_moe.py
and update all references (source, tests, docs).

Co-Authored-By: Claude <noreply@anthropic.com>
Signed-off-by: Jackmin801 <ongjackm@gmail.com>
Signed-off-by: Jackmin801 <56836461+Jackmin801@users.noreply.github.com>

@claude claude Bot left a comment

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Claude Code Review

This pull request is from a fork — automated review is disabled. A repository maintainer can comment @claude review to run a one-time review.

@mergify

mergify Bot commented May 7, 2026

Copy link
Copy Markdown
Contributor

Documentation preview: https://vllm--41977.org.readthedocs.build/en/41977/

@mergify mergify Bot added documentation Improvements or additions to documentation nvidia labels May 7, 2026
@bnellnm bnellnm changed the title Move flashinfer cutlass moe to experts [MoE] Move flashinfer cutlass moe to fused_moe/experts/ May 7, 2026

@gemini-code-assist gemini-code-assist Bot left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request relocates the FlashInferExperts implementation to the experts subdirectory within the fused_moe layers. The changes involve updating import paths across multiple test files, oracle modules, and the MoE kernel features documentation to reflect the new directory structure. I have no feedback to provide.

@bnellnm

bnellnm commented May 8, 2026

Copy link
Copy Markdown
Collaborator Author

Combined into one PR #41979

@bnellnm bnellnm closed this May 8, 2026
@github-project-automation github-project-automation Bot moved this to Done in NVIDIA May 8, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

documentation Improvements or additions to documentation nvidia

Projects

Status: Done

Development

Successfully merging this pull request may close these issues.

2 participants