Skip to content

add flashinfer mxfp4#8847

Merged
zhyncs merged 17 commits intomainfrom
add_flashinfer_mxfp4
Aug 6, 2025
Merged

add flashinfer mxfp4#8847
zhyncs merged 17 commits intomainfrom
add_flashinfer_mxfp4

Conversation

@BBuf
Copy link
Copy Markdown
Collaborator

@BBuf BBuf commented Aug 6, 2025

Motivation

Modifications

Accuracy Test

Benchmark & Profiling

Checklist

@gemini-code-assist
Copy link
Copy Markdown
Contributor

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

Base automatically changed from gpt-oss-mxfp4 to main August 6, 2025 07:05
)

def forward(self, hidden_states: torch.Tensor, topk_output: StandardTopKOutput):
og_hidden_states = hidden_states.shape[-1]
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

what is og_hidden_states? can we make it more clear?

Copy link
Copy Markdown
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

origin_hidden_states_dim, renamed.

Copy link
Copy Markdown
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Must do padding here because the trl-llm kernel limit

@zhyncs zhyncs merged commit 4373df5 into main Aug 6, 2025
62 of 67 checks passed
@zhyncs zhyncs deleted the add_flashinfer_mxfp4 branch August 6, 2025 23:23
fzyzcjy added a commit to fzyzcjy/sglang that referenced this pull request Aug 7, 2025
This reverts commit 4373df5.

# Conflicts:
#	python/sglang/srt/server_args.py
fzyzcjy added a commit to fzyzcjy/sglang that referenced this pull request Aug 7, 2025
This reverts commit 4373df5.

# Conflicts:
#	python/sglang/srt/layers/moe/fused_moe_triton/layer.py
#	python/sglang/srt/layers/quantization/mxfp4.py
#	python/sglang/srt/server_args.py
narutolhy pushed a commit to narutolhy/sglang that referenced this pull request Aug 17, 2025
MahmoudAshraf97 pushed a commit to MahmoudAshraf97/sglang that referenced this pull request Sep 8, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants