[flex attention][triton pin] use new TMA API #155771
Closed
davidberard98 wants to merge 3 commits into gh/davidberard98/374/base from …
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/155771
Note: Links to docs will display an error until the docs builds have been completed.
✅ No Failures as of commit 9c7b915 with merge base 132babe.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
mandroid6 approved these changes on Jun 13, 2025.
davidberard98 (Contributor, Author):
@pytorchbot merge
Collaborator (merge bot):
Merge failed. Reason: Approvers from one of the following sets are needed: …
davidberard98 (Contributor, Author):
@pytorchbot merge
Collaborator (merge bot):
Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Stack from ghstack (oldest at bottom):
Triton 3.4 will remove the experimental TMA APIs: triton-lang/triton#6488. Ahead of this, we are **replacing the experimental TMA API usage with the stable TMA API** in flex attention. This means that **flex attention TMA will stop working with Triton 3.2 or Triton 3.3/3.3.1** for now (but it should work with Triton 3.4 in the PyTorch 2.8 release, and with Meta-internal Triton 3.3.1fb, both of which have the new TMA API).
This PR does the following:

* replace the experimental TMA APIs with the stable TMA APIs (sketched below)
* remove the workspace args
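For context, here is a minimal sketch of what the API change looks like in a Triton kernel, assuming a Triton build that ships the stable TMA API (e.g. Triton 3.4 on an H100). The kernel and tensor names are illustrative, not the actual flex attention code. The old experimental path built descriptors on the host (`triton.tools.experimental_descriptor.create_2d_tma_descriptor`) and threaded a workspace buffer through the kernel arguments; the stable path builds descriptors on-device with `tl.make_tensor_descriptor` and gets descriptor storage from an allocator registered via `triton.set_allocator`, which is why the workspace args can go away:

```python
from typing import Optional

import torch
import triton
import triton.language as tl


@triton.jit
def copy_tile_kernel(x_ptr, y_ptr, M, N,
                     BLOCK_M: tl.constexpr, BLOCK_N: tl.constexpr):
    # Stable TMA API: descriptors are created inside the kernel.
    # (The old path instead called tl._experimental_descriptor_load on a
    # host-built descriptor and needed an extra workspace argument.)
    x_desc = tl.make_tensor_descriptor(
        x_ptr, shape=[M, N], strides=[N, 1], block_shape=[BLOCK_M, BLOCK_N])
    y_desc = tl.make_tensor_descriptor(
        y_ptr, shape=[M, N], strides=[N, 1], block_shape=[BLOCK_M, BLOCK_N])
    tile = x_desc.load([0, 0])   # TMA load of one [BLOCK_M, BLOCK_N] tile
    y_desc.store([0, 0], tile)   # TMA store of the same tile


def alloc_fn(size: int, alignment: int, stream: Optional[int]):
    # The stable API requests descriptor storage from the runtime rather
    # than taking an explicit workspace kernel argument.
    return torch.empty(size, device="cuda", dtype=torch.int8)


triton.set_allocator(alloc_fn)

x = torch.randn(128, 64, device="cuda", dtype=torch.float16)
y = torch.empty_like(x)
copy_tile_kernel[(1,)](x, y, 128, 64, BLOCK_M=128, BLOCK_N=64)
```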
Testing: I ran test/inductor/test_flex_attention.py on an H100 with @mandroid6's PR #153662 patched in to turn on TMA. [TODO: confirm results once all the local tests pass; of the first 100 tests I ran locally, every failure also reproduced on #153662 alone.]
Note: when #153662 lands and turns on TMA support by default, it should check specifically for stable TMA API support (commented on that PR).
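A hypothetical sketch of such a check, assuming the stable API is exposed as `tl.make_tensor_descriptor` (the helper name below is illustrative, not the actual #153662 code):

```python
import triton.language as tl


def has_stable_tma_api() -> bool:
    # Gate TMA on the stable API being present, rather than on the
    # experimental one (tl._experimental_descriptor_load), which
    # Triton 3.4 removes.
    return hasattr(tl, "make_tensor_descriptor")
```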
cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov