[FlexFlash] Blackwell fwd support #167040
drisspg wants to merge 26 commits into gh/drisspg/218/base from
Conversation
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/167040
Note: Links to docs will display an error until the docs builds have been completed.
✅ You can merge normally! (2 unrelated failures) As of commit 9308f29 with merge base 39ebab1.
BROKEN TRUNK - The following job failed but was present on the merge base. 👉 Rebase onto the `viable/strict` branch to avoid these failures.
UNSTABLE - The following job is marked as unstable, possibly due to flakiness on trunk.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
```diff
 def _supports_nontrivial_mask_graphs() -> bool:
     """Currently only supported on Hopper (SM90) GPUs."""
-    return torch.cuda.get_device_capability()[0] == 9
+    return torch.cuda.get_device_capability()[0] in [9, 10]
```
What about consumer Blackwell, i.e. 12? Guessing no, since CUDA8 isn't supported either.
Just these 2; I thought about allowing A100.
@driss btw CUDNN_FRONTEND just came out with proper block mask bindings, so we may end up supporting that soon too
@pytorchbot merge
Merge started: Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
ghstack-source-id: 1db64e0 Pull-Request: pytorch/pytorch#167040
Stack from ghstack (oldest at bottom):
Need to land: Dao-AILab/flash-attention#1985
^^First^^
cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov @coconutruben