[MoE] Move marlin moe to fused_moe/experts/#40572
Conversation
Rename fused_marlin_moe.py to experts/marlin_moe.py and update all references across the codebase (source, tests, docs). Co-Authored-By: Claude <noreply@anthropic.com> Signed-off-by: Jackmin801 <ongjackm@gmail.com>
|
Documentation preview: https://vllm--40572.org.readthedocs.build/en/40572/ |
There was a problem hiding this comment.
Code Review
This pull request refactors the project structure by moving the Marlin MoE kernel implementation from vllm/model_executor/layers/fused_moe/fused_marlin_moe.py to vllm/model_executor/layers/fused_moe/experts/marlin_moe.py. All associated import statements in the documentation, tests, and various quantization layers have been updated to reflect this new path. I have no feedback to provide as the changes are consistent and correctly implement the structural reorganization.
|
This pull request has merge conflicts that must be resolved before it can be |
Signed-off-by: Robert Shaw <robertgshaw2@gmail.com>
…experts Signed-off-by: Jackmin801 <ongjackm@gmail.com> # Conflicts: # vllm/lora/layers/fused_moe.py
|
This pull request has merge conflicts that must be resolved before it can be |
Rename fused_marlin_moe.py to experts/marlin_moe.py and update all references across the codebase (source, tests, docs).
Test plan
pytest -vs tests/kernels/moe