Add torch.compile compatibility to FP8 SDPA using FA3 #172622
howardzhang-cv wants to merge 4 commits into gh/howardzhang-cv/7/base
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/172622
Note: Links to docs will display an error until the docs builds have been completed.
✅ No Failures
As of commit 7cb5426 with merge base 32642ba.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
    sdpa_constraint,
    warn=False,
)
make_fallback(
Does the constraint work for the FP8 V layout?
I think there is actually a section in this file on the layout constraints needed for inductor; we should also apply it for this overload.
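For context, a minimal sketch of the registration under discussion, mirroring the existing flash-attention entries in torch/_inductor/lowering.py; the .low_p overload name comes from this PR, and wiring sdpa_constraint to it reflects the review suggestion above rather than a confirmed diff:

```python
# Sketch only: register the new overload as an inductor fallback with the
# same SDPA layout constraint the default overload uses, so inductor
# realizes inputs with the strides the FA3 kernel expects.
make_fallback(
    aten._scaled_dot_product_flash_attention.low_p,  # overload added by this PR
    sdpa_constraint,  # layout constraint suggested in the review above
    warn=False,
)
```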
    )
    # Directly call the internal flash attention operator which has descale support
    result = torch._scaled_dot_product_flash_attention(
    # Use the .low_p OpOverload directly for better torch.compile compatibility
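As a hedged sketch of the call-site change described here, assuming the new overload takes the FP8 query/key/value plus per-tensor descale factors (the full signature is not visible in this excerpt):

```python
# Sketch only: invoke the OpOverload directly instead of a python builtin
# function call, so torch.compile traces a single ATen op with descale
# support. Every argument name after value is an assumption, not the
# merged signature.
result = torch.ops.aten._scaled_dot_product_flash_attention.low_p(
    query_fp8,
    key_fp8,
    value_fp8,
    descale_q,  # hypothetical FP8 descale scalars
    descale_k,
    descale_v,
)
```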
    seed = torch.empty((2,), dtype=torch.uint64, device="meta")
    offset = torch.empty((), dtype=torch.uint64, device="meta")

    return (
How much of this is reusable from the other meta funcs? Can we just call them directly here?
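One hedged answer to the reuse question: delegate the .low_p meta registration to the existing flash-attention meta function instead of duplicating its shape logic; the decorator follows the torch/_meta_registrations.py pattern, and the delegate's name is assumed:

```python
# Sketch only: reuse the default overload's meta function (which already
# produces the meta seed/offset tensors shown in the diff above) for the
# new .low_p overload.
@register_meta(aten._scaled_dot_product_flash_attention.low_p)
def meta__scaled_dot_product_flash_attention_low_p(query, key, value, *args, **kwargs):
    return meta__scaled_dot_product_flash_attention(query, key, value, *args, **kwargs)
```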
    seqused_k: Tensor | None = None,
    alibi_slopes: Tensor | None = None,
):
    print(
Summary:

- Added meta registration for new scaled_dot_product_flash_attention.low_p overload
- Added inductor lowering fallback for new overload
- Directly call op overload in _scaled_dot_product_attention_fp8 instead of python builtin function call

ghstack-source-id: 9605c0f
Pull-Request: pytorch/pytorch#172622
@pytorchbot merge
Merge started
Your change will be merged once all checks pass (ETA 0-4 Hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Summary:

- Added meta registration for new scaled_dot_product_flash_attention.low_p overload
- Added inductor lowering fallback for new overload
- Directly call op overload in _scaled_dot_product_attention_fp8 instead of python builtin function call

ghstack-source-id: baa029c
Pull-Request: pytorch/pytorch#172622
Stack from ghstack (oldest at bottom):
Summary:

- Added meta registration for new scaled_dot_product_flash_attention.low_p overload
- Added inductor lowering fallback for new overload
- Directly call op overload in _scaled_dot_product_attention_fp8 instead of python builtin function call
cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @kadeng @muchulee8 @amjames @chauhang @aakhundov @coconutruben @jataylo
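For completeness, a minimal sketch of exercising this path end to end under torch.compile; the top-level location of _scaled_dot_product_attention_fp8, the shapes, and the dtype conversion are illustrative assumptions, and FA3 additionally requires a Hopper-class GPU:

```python
import torch

def attn(q, k, v):
    # assumed entry point modified by this PR; the actual module path may differ
    return torch._scaled_dot_product_attention_fp8(q, k, v)

# With the meta registration and inductor fallback from this PR, this should
# compile without graph breaks at the SDPA call.
compiled_attn = torch.compile(attn, fullgraph=True)

q, k, v = (
    torch.randn(2, 8, 1024, 128, device="cuda", dtype=torch.bfloat16).to(
        torch.float8_e4m3fn
    )
    for _ in range(3)
)
out = compiled_attn(q, k, v)
```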