Add stride check for attn_mask on non-cpu device by CaoE · Pull Request #158424 · pytorch/pytorch

CaoE · 2025-07-16T06:51:19Z

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov @coconutruben

pytorch-bot · 2025-07-16T06:51:23Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/158424

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (2 Unrelated Failures)

As of commit ebc237b with merge base a5e6881 ():

BROKEN TRUNK - The following job failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

pull / linux-jammy-cuda12.8-py3.10-gcc11-sm89 / test (default, 5, 5, linux.g6.4xlarge.experimental.nvidia.gpu) (gh) (trunk failure)
inductor/test_inplace_padding.py::InplacePaddingTest::test_linear_and_cel

UNSTABLE - The following job is marked as unstable, possibly due to flakiness on trunk:

pull / cuda12.8-py3.10-gcc9-sm75 / test (pr_time_benchmarks, 1, 1, linux.g4dn.metal.nvidia.gpu) (gh) (#153987)
MISSING REGRESSION TEST

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Copilot

Pull Request Overview

This PR adds a stride check for attention mask tensors on non-CPU devices to fix issue #158374. The change ensures that fused attention kernels properly validate that the attention mask has a stride of 1 in the last dimension when running on GPU devices, while allowing more flexibility on CPU.

Updates the stride validation logic to include attention mask stride checking for non-CPU devices
Adds comprehensive test coverage for attention masks with non-contiguous strides
Improves error messaging to include attention mask stride information in debug output

Reviewed Changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated no comments.

File	Description
aten/src/ATen/native/transformers/sdp_utils_cpp.h	Adds device-specific stride validation for attention masks and enhances error messaging
test/inductor/test_fused_attention.py	Adds test case for attention mask with non-unit stride in last dimension

Comments suppressed due to low confidence (2)

aten/src/ATen/native/transformers/sdp_utils_cpp.h:511

[nitpick] The variable name 'mask_stride_check' is ambiguous. Consider renaming to 'mask_stride_valid' or 'is_mask_stride_compatible' to better indicate it represents a boolean validation result.

  bool mask_stride_check = is_cpu ? true : mask_stride_equal_1;

aten/src/ATen/native/transformers/sdp_utils_cpp.h:514

[nitpick] The variable name 'epilogue_message' is unclear. Consider renaming to 'additional_error_info' or 'mask_error_details' to better describe its purpose of providing additional error message content.

      std::ostringstream epilogue_message;

aten/src/ATen/native/transformers/sdp_utils_cpp.h

drisspg

Can you add a test here:

pytorch/test/test_transformers.py

Line 1498 in 4805a6e

class TestSDPAFailureModes(NNTestCase):

and ensure that this error is raised

CaoE · 2025-07-17T02:13:41Z

Can you add a test here:

pytorch/test/test_transformers.py

Line 1498 in 4805a6e

class TestSDPAFailureModes(NNTestCase):

and ensure that this error is raised

Added a test for this error message.

drisspg · 2025-07-17T03:41:08Z

test/test_transformers.py


+    @onlyCUDA
+    @unittest.skipIf(not PLATFORM_SUPPORTS_MEM_EFF_ATTENTION, "Efficient Attention was not built for this system")
+    @parametrize("kernel", [SDPBackend.EFFICIENT_ATTENTION])


can you also add cudnn

CaoE · 2025-07-18T01:03:25Z

@pytorchbot merge

pytorchmergebot · 2025-07-18T01:05:17Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

Fixes pytorch#158374 Pull Request resolved: pytorch#158424 Approved by: https://github.com/Valentine233, https://github.com/drisspg, https://github.com/atalman

Add stride check for attn_mask on non-cpu device (#158424) Fixes #158374 Pull Request resolved: #158424 Approved by: https://github.com/Valentine233, https://github.com/drisspg, https://github.com/atalman

Add stride check for attn_mask on non-cpu device (pytorch#158424) Fixes pytorch#158374 Pull Request resolved: pytorch#158424 Approved by: https://github.com/Valentine233, https://github.com/drisspg, https://github.com/atalman

pytorch-bot bot added ciflow/inductor module: inductor labels Jul 16, 2025

CaoE added ciflow/trunk Trigger trunk jobs on your pull request topic: not user facing topic category labels Jul 16, 2025

pytorchbot added the open source label Jul 16, 2025

CaoE requested review from Valentine233 and Copilot July 16, 2025 08:00

Copilot AI reviewed Jul 16, 2025

View reviewed changes

add stride check for attn_mask on non-cpu device

f8cd555

CaoE force-pushed the fix_sdpa branch from 17cd74f to f8cd555 Compare July 16, 2025 08:02

Valentine233 reviewed Jul 16, 2025

View reviewed changes

aten/src/ATen/native/transformers/sdp_utils_cpp.h Outdated Show resolved Hide resolved

CaoE force-pushed the fix_sdpa branch from cc25923 to 7b608ed Compare July 16, 2025 08:42

CaoE requested a review from Valentine233 July 16, 2025 08:42

CaoE marked this pull request as ready for review July 16, 2025 08:43

Valentine233 approved these changes Jul 16, 2025

View reviewed changes

CaoE added the ciflow/slow label Jul 16, 2025

CaoE mentioned this pull request Jul 16, 2025

torch 2.8 RC regression - part 1 #158374

Closed

CaoE requested a review from drisspg July 16, 2025 08:58

modify message

517ed18

CaoE force-pushed the fix_sdpa branch from 7b608ed to 517ed18 Compare July 16, 2025 11:50

drisspg requested changes Jul 16, 2025

View reviewed changes

add invalid last dim test case on cuda

a14c5d0

drisspg reviewed Jul 17, 2025

View reviewed changes

CaoE force-pushed the fix_sdpa branch from 3835f4e to 80c00cf Compare July 17, 2025 13:44

add CUDNN_ATTENTION in test

ebc237b

CaoE force-pushed the fix_sdpa branch from 80c00cf to ebc237b Compare July 17, 2025 14:11

drisspg approved these changes Jul 17, 2025

View reviewed changes

atalman approved these changes Jul 17, 2025

View reviewed changes

pytorchmergebot added the merging label Jul 18, 2025

pytorchmergebot added the Merged label Jul 18, 2025

pytorchmergebot closed this in ef38edb Jul 18, 2025

pytorchmergebot removed the merging label Jul 18, 2025

This was referenced Jul 18, 2025

Add stride check for attn_mask on non-cpu device #158618

Merged

[v.2.8.0] Release Tracker #156745

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add stride check for attn_mask on non-cpu device#158424

Add stride check for attn_mask on non-cpu device#158424
CaoE wants to merge 4 commits intopytorch:mainfrom
CaoE:fix_sdpa

CaoE commented Jul 16, 2025 •

edited by pytorch-bot bot

Loading

Uh oh!

pytorch-bot bot commented Jul 16, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

drisspg left a comment

Uh oh!

CaoE commented Jul 17, 2025

Uh oh!

drisspg Jul 17, 2025

Uh oh!

CaoE Jul 17, 2025

Uh oh!

CaoE commented Jul 18, 2025

Uh oh!

pytorchmergebot commented Jul 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

Conversation

CaoE commented Jul 16, 2025 • edited by pytorch-bot bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Jul 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/158424

✅ You can merge normally! (2 Unrelated Failures)

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

drisspg left a comment

Choose a reason for hiding this comment

Uh oh!

CaoE commented Jul 17, 2025

Uh oh!

drisspg Jul 17, 2025

Choose a reason for hiding this comment

Uh oh!

CaoE Jul 17, 2025

Choose a reason for hiding this comment

Uh oh!

CaoE commented Jul 18, 2025

Uh oh!

pytorchmergebot commented Jul 18, 2025

Merge started

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

CaoE commented Jul 16, 2025 •

edited by pytorch-bot bot

Loading

pytorch-bot bot commented Jul 16, 2025 •

edited

Loading