[diffusion] model: Properly validate device for Mistral 3 attention by avjves · Pull Request #22690 · sgl-project/sglang

avjves · 2026-04-13T12:05:55Z

Motivation

PR #22423 changed it so that Mistral 3 (Used in Flux2) uses cuDNN attention by default, if the device type is cuda. AMD HW however also reports the device type as cuda, but does not support cuDNN attention. This change broke AMD support for Flux2.

Modifications

Changes the check in Mistral 3 to make the decision to use cuDNN be also based on the current detected platform, not just the device of the tensor.

Accuracy Tests

Speed Tests and Profiling

Checklist

Format your code according to the Format code with pre-commit.
Add unit tests according to the Run and add unit tests.
Update documentation according to Write documentations.
Provide accuracy and speed benchmark results according to Test the accuracy and Benchmark the speed.
Follow the SGLang code style guidance.

Review and Merge Process

Ping Merge Oncalls to start the process. See the PR Merge Process.
Get approvals from CODEOWNERS and other reviewers.
Trigger CI tests with comments or contact authorized users to do so.
- Common commands include /tag-and-rerun-ci, /tag-run-ci-label, /rerun-failed-ci
After green CI and required approvals, ask Merge Oncalls or people with Write permission to merge the PR.

gemini-code-assist

Code Review

This pull request updates the Mistral 3 encoder to utilize current_platform.is_cuda() for hardware detection. A review comment points out that removing the explicit tensor device type check could incorrectly trigger the CUDA backend for CPU-resident tensors, suggesting a combined check instead.

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

yhyang201 · 2026-04-13T12:42:33Z

/tag-and-rerun-ci

yhyang201 · 2026-04-13T14:34:25Z

/rerun-failed-ci

yhyang201 · 2026-04-13T15:48:55Z

@mickqian All CI (Nvidia + AMD) passed and PR is approved, ready for merge

— SGLDHelper bot

…gl-project#22690)

[diffusion] model: Properly validate device in Mistral SDPA

ab893a8

avjves requested review from mickqian, ping1jing2 and yhyang201 as code owners April 13, 2026 12:05

github-actions Bot added the diffusion SGLang Diffusion label Apr 13, 2026

gemini-code-assist Bot reviewed Apr 13, 2026

View reviewed changes

Comment thread python/sglang/multimodal_gen/runtime/models/encoders/mistral_3.py Outdated

avjves and others added 2 commits April 13, 2026 15:08

Apply suggestion from @gemini-code-assist[bot]

5407a5d

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>

Fix styling

6b4c1dd

mickqian approved these changes Apr 13, 2026

View reviewed changes

github-actions Bot added the run-ci label Apr 13, 2026

avjves mentioned this pull request Apr 14, 2026

[diffusion] [AMD] model: allow AITER backends in Flux 2 pipeline #22802

Merged

5 tasks

HaiShaw merged commit aaa6823 into sgl-project:main Apr 16, 2026
148 of 158 checks passed

avjves deleted the fix/mistral_cudnn_sdpa branch April 17, 2026 17:34

jmamou pushed a commit to jmamou/sglang that referenced this pull request Apr 20, 2026

[diffusion] model: Properly validate device for Mistral 3 attention (s…

feee531

…gl-project#22690)

yhyang201 pushed a commit to yhyang201/sglang that referenced this pull request Apr 22, 2026

[diffusion] model: Properly validate device for Mistral 3 attention (s…

f3383b4

…gl-project#22690)

zhangying098 pushed a commit to zhangying098/sglang that referenced this pull request Apr 23, 2026

[diffusion] model: Properly validate device for Mistral 3 attention (s…

afcf1b2

…gl-project#22690)

kyx1999 pushed a commit to KMSorSMS/sglang that referenced this pull request Apr 27, 2026

[diffusion] model: Properly validate device for Mistral 3 attention (s…

d35a568

…gl-project#22690)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[diffusion] model: Properly validate device for Mistral 3 attention#22690

[diffusion] model: Properly validate device for Mistral 3 attention#22690
HaiShaw merged 3 commits intosgl-project:mainfrom
avjves:fix/mistral_cudnn_sdpa

avjves commented Apr 13, 2026 •

edited

Loading

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

yhyang201 commented Apr 13, 2026

Uh oh!

yhyang201 commented Apr 13, 2026

Uh oh!

yhyang201 commented Apr 13, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

avjves commented Apr 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Modifications

Accuracy Tests

Speed Tests and Profiling

Checklist

Review and Merge Process

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

yhyang201 commented Apr 13, 2026

Uh oh!

yhyang201 commented Apr 13, 2026

Uh oh!

yhyang201 commented Apr 13, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

avjves commented Apr 13, 2026 •

edited

Loading