[Inductor][Intel GPU] Save `threads_per_warp` from tirton compiled kernel for launching kernel correctly in cpp wrapper. by pytorchbot · Pull Request #163388 · pytorch/pytorch

pytorchbot · 2025-09-20T00:02:49Z

Stack from ghstack (oldest at bottom):

-> [Inductor][Intel GPU] Save threads_per_warp from tirton compiled kernel for launching kernel correctly in cpp wrapper. #163315

On the Inductor XPU backend, threads_per_warp is not always 32. For Intel GEMM Triton kernels, it can be 16. This information must be preserved for XPU so that the Cpp wrapper can launch the kernel with the correct configuration.

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov @coconutruben

…rnel for launching kernel correctly in cpp wrapper. (#163315) On the Inductor XPU backend, `threads_per_warp` is not always 32. For Intel GEMM Triton kernels, it can be 16. This information must be preserved for XPU so that the Cpp wrapper can launch the kernel with the correct configuration. Pull Request resolved: #163315 Approved by: https://github.com/EikanWang, https://github.com/desertfire (cherry picked from commit 9f8a311)

pytorch-bot · 2025-09-20T00:02:53Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/163388

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (1 Unrelated Failure)

As of commit 1f213fa with merge base 4840a1a ():

FLAKY - The following job failed but was likely due to flakiness present on trunk:

xpu / linux-jammy-xpu-n-py3.10 / test (default, 7, 8, linux.idc.xpu) (gh) (disabled by #163159)
inductor/test_torchinductor.py::TritonCodeGenTests::test_graph_partition_default_device_context

This comment was automatically generated by Dr. CI and updates every 15 minutes.

etaf · 2025-09-23T06:16:26Z

Hi, @atalman This cherry-pick for release/2.9 is to fix the new feature that support flex attention on Inductor Intel GPU backend. Could you kindly help have a review?

This was referenced Sep 20, 2025

[v.2.9.0] Release Tracker #162497

Closed

[Inductor][Intel GPU] Save threads_per_warp from tirton compiled kernel for launching kernel correctly in cpp wrapper. #163315

Closed

pytorch-bot Bot added ciflow/inductor module: inductor release notes: inductor (aoti) labels Sep 20, 2025

etaf added the ciflow/xpu Run XPU CI tasks label Sep 20, 2025

etaf requested a review from EikanWang September 20, 2025 00:10

pytorchbot added the open source label Sep 20, 2025

etaf requested a review from desertfire September 21, 2025 00:49

EikanWang approved these changes Sep 22, 2025

View reviewed changes

etaf requested a review from atalman September 23, 2025 06:14

desertfire approved these changes Sep 23, 2025

View reviewed changes

atalman approved these changes Sep 26, 2025

View reviewed changes

atalman merged commit 7cadf8a into release/2.9 Sep 26, 2025
146 of 150 checks passed

github-actions Bot deleted the cherry-pick-163315-by-pytorch_bot_bot_ branch October 27, 2025 02:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Inductor][Intel GPU] Save `threads_per_warp` from tirton compiled kernel for launching kernel correctly in cpp wrapper.#163388

[Inductor][Intel GPU] Save `threads_per_warp` from tirton compiled kernel for launching kernel correctly in cpp wrapper.#163388
atalman merged 1 commit intorelease/2.9from
cherry-pick-163315-by-pytorch_bot_bot_

pytorchbot commented Sep 20, 2025

Uh oh!

pytorch-bot Bot commented Sep 20, 2025 •

edited

Loading

Uh oh!

etaf commented Sep 23, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

pytorchbot commented Sep 20, 2025

Uh oh!

pytorch-bot Bot commented Sep 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/163388

✅ You can merge normally! (1 Unrelated Failure)

Uh oh!

etaf commented Sep 23, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

pytorch-bot Bot commented Sep 20, 2025 •

edited

Loading