Skip to content

[Fix] Fix accuracy bug in Flashmla sparse MLA kernel#22723

Merged
Fridge003 merged 3 commits intomainfrom
fix-flashmla
Apr 15, 2026
Merged

[Fix] Fix accuracy bug in Flashmla sparse MLA kernel#22723
Fridge003 merged 3 commits intomainfrom
fix-flashmla

Conversation

@Fridge003
Copy link
Copy Markdown
Collaborator

Motivation

Close #21291

Modifications

Accuracy Tests

Speed Tests and Profiling

Checklist

Review and Merge Process

  1. Ping Merge Oncalls to start the process. See the PR Merge Process.
  2. Get approvals from CODEOWNERS and other reviewers.
  3. Trigger CI tests with comments or contact authorized users to do so.
    • Common commands include /tag-and-rerun-ci, /tag-run-ci-label, /rerun-failed-ci
  4. After green CI and required approvals, ask Merge Oncalls or people with Write permission to merge the PR.

@gemini-code-assist
Copy link
Copy Markdown
Contributor

Warning

You have reached your daily quota limit. Please wait up to 24 hours and I will start processing your requests again!

@Fridge003
Copy link
Copy Markdown
Collaborator Author

/tag-and-rerun-ci

@Fridge003 Fridge003 merged commit 113d654 into main Apr 15, 2026
95 of 113 checks passed
@Fridge003 Fridge003 deleted the fix-flashmla branch April 15, 2026 20:40
jmamou pushed a commit to jmamou/sglang that referenced this pull request Apr 20, 2026
yhyang201 pushed a commit to yhyang201/sglang that referenced this pull request Apr 22, 2026
zhangying098 pushed a commit to zhangying098/sglang that referenced this pull request Apr 23, 2026
kyx1999 pushed a commit to KMSorSMS/sglang that referenced this pull request Apr 27, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[Bug] GLM-5 accuracy drop on B200 with flash_mla_with_kvcache kernel

1 participant