
[varlen_attn for inference] add test to aot_inductor #175936

Closed
liangel-02 wants to merge 19 commits into gh/liangel-02/16/base from gh/liangel-02/16/head

Conversation

[ghstack-poisoned]

pytorch-bot bot commented Feb 27, 2026

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/175936

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (4 Unrelated Failures)

As of commit 2e18591 with merge base 4bc9d7f:

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

BROKEN TRUNK - The following job failed but was already failing on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

UNSTABLE - The following job is marked as unstable, possibly due to flakiness on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

liangel-02 added a commit that referenced this pull request Feb 27, 2026
ghstack-source-id: d78ddbb
Pull Request resolved: #175936
```python
@unittest.skipIf(not SM90OrLater, "FA3 requires SM90+")
@unittest.skipIf("FA3" not in list_flash_attention_impls(), "FA3 not available")
def test_varlen_attn_paged_kv_cache(self):
    if self.device != GPU_TYPE:
```
liangel-02 (Contributor Author) commented:
This guard is needed instead of `@requires_gpu` because otherwise the CPU variant of the test would still run.

cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy kadeng muchulee8 amjames chauhang aakhundov coconutruben jataylo

[ghstack-poisoned]
liangel-02 added a commit that referenced this pull request Feb 27, 2026
ghstack-source-id: 4d5593d
Pull Request resolved: #175936
liangel-02 added a commit that referenced this pull request Feb 27, 2026
ghstack-source-id: 4cd7118
Pull Request resolved: #175936
@liangel-02 liangel-02 requested a review from drisspg February 27, 2026 00:22
liangel-02 added a commit that referenced this pull request Feb 27, 2026
ghstack-source-id: bff1fea
Pull Request resolved: #175936
liangel-02 added a commit that referenced this pull request Feb 27, 2026
ghstack-source-id: 8aa6652
Pull Request resolved: #175936
@liangel-02 liangel-02 changed the title add test to aot_inductor [varlen_attn for inference] add test to aot_inductor Mar 4, 2026
@liangel-02 liangel-02 added the ciflow/trunk Trigger trunk jobs on your pull request label Mar 5, 2026
pytorchmergebot added a commit that referenced this pull request Mar 7, 2026
This reverts commit cc142e7.

Reverted #175936 on behalf of https://github.com/zou3519: "sorry I think this broke inductor rocm" (see comment on #175897)
pytorchmergebot (Collaborator):
@liangel-02 your PR has been reverted as part of the stack under #175897.

@pytorchmergebot pytorchmergebot added Reverted ci-no-td Do not run TD on this PR labels Mar 7, 2026
pytorchmergebot (Collaborator):

Starting merge as part of PR stack under #176723

1 similar comment

pytorchmergebot pushed a commit that referenced this pull request Mar 8, 2026
`aten/src/ATen/native/transformers/cuda/attention.cu`

- Renamed `_flash_attention_forward` to `_flash_attention_forward_impl`; this is now the core logic and takes an `optional<Tensor> out`.
- `_flash_attention_forward` is the non-out variant: a thin wrapper that calls `_flash_attention_forward_impl` with `out=std::nullopt`.
- `_flash_attention_forward_no_dropout_inplace` is the out variant and calls `_flash_attention_forward_impl` with a `Tensor& out`.

`aten/src/ATen/native/native_functions.yaml`

- Registered a new op, `_flash_attention_forward_no_dropout_inplace`.

`torch/_meta_registrations.py`

- Added a meta registration that calls `meta__flash_attention_forward` but doesn't return the out tensor.

`torch/nn/attention/varlen.py`

- Added the public `varlen_attn_out` and a private custom op `_varlen_attn_out` with `mutates_args={"out"}`.

`test/test_varlen_attention.py`

- Added out-variant coverage to the existing tests.

Pull Request resolved: #176015
Approved by: https://github.com/drisspg
ghstack dependencies: #175897, #175924, #175936
pytorchmergebot pushed a commit that referenced this pull request Mar 8, 2026
pytorchmergebot added a commit that referenced this pull request Mar 10, 2026
This reverts commit 388d61e.

Reverted #175936 on behalf of https://github.com/huydhn: "Sorry for reverting your change but a bunch of internal builds need to be updated to unblock this change D95758397" (see comment on #175924)
pytorchmergebot (Collaborator):
@liangel-02 your PR has been reverted as part of the stack under #175924.

liangel-02 (Contributor Author):
@liangel-02 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx ipiszy kadeng muchulee8 amjames chauhang aakhundov coconutruben jataylo

Differential Revision: [D95996398](https://our.internmc.facebook.com/intern/diff/D95996398)

[ghstack-poisoned]
pytorchmergebot (Collaborator):

Starting merge as part of PR stack under #176723

1 similar comment

pytorchmergebot pushed a commit that referenced this pull request Mar 11, 2026
Pull Request resolved: #176015
Approved by: https://github.com/drisspg
ghstack dependencies: #175924, #175936
pytorchmergebot pushed a commit that referenced this pull request Mar 11, 2026
sandy-gags pushed a commit to sandy-gags/pytorch that referenced this pull request Mar 12, 2026
ghstack-source-id: 968c9eb
Pull Request resolved: pytorch/pytorch#175936

3 participants