
[Inductor] Set the default value of min_chunk_size to 512#150762

Closed
jiayisunx wants to merge 12 commits intogh/jiayisunx/63/basefrom
gh/jiayisunx/63/head

Conversation

@jiayisunx (Collaborator) commented Apr 7, 2025

Stack from ghstack (oldest at bottom):

Change the default value of min_chunk_size from 4096 to 512 so that more for-loops can be parallelized.
I tested the Inductor benchmarks with this PR on CPU and saw a ~10% improvement in the torchbench geomean speedup, with no change in huggingface/timm_models. About 15 torchbench models improved to varying degrees; functorch_dp_cifar10, opacus_cifar10, hf_Reformer, and pyhpc_turbulent_kinetic_energy each improved by more than 50%.
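The effect of the threshold can be illustrated with a minimal sketch (not Inductor's actual implementation; `should_parallelize` and its signature are assumptions for illustration): a loop is only split across threads when each thread would receive at least min_chunk_size iterations, so lowering the default lets smaller loops qualify.

```python
# Hypothetical sketch of a min_chunk_size-style parallelization gate
# (illustrative only; Inductor's real heuristic differs in detail).
def should_parallelize(num_iters: int, num_threads: int, min_chunk_size: int) -> bool:
    # Parallelize only if every thread gets at least min_chunk_size iterations.
    # With min_chunk_size = 4096 and 16 threads, a loop needs >= 65536
    # iterations; lowering the threshold to 512 drops that bar to 8192.
    return num_iters // num_threads >= min_chunk_size

assert not should_parallelize(10000, 16, 4096)  # stays serial under the old default
assert should_parallelize(10000, 16, 512)       # parallelized under the new default
```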

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov @coconutruben @mlazos


pytorch-bot bot commented Apr 7, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/150762

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 81030e5 with merge base 70b4a88:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

jiayisunx added a commit that referenced this pull request Apr 7, 2025
@jiayisunx jiayisunx marked this pull request as draft April 7, 2025 09:32
jiayisunx added a commit that referenced this pull request Apr 16, 2025
jiayisunx added a commit that referenced this pull request May 27, 2025
jiayisunx added a commit that referenced this pull request Jun 10, 2025
jiayisunx added a commit that referenced this pull request Jun 18, 2025
jiayisunx added a commit that referenced this pull request Jun 19, 2025
@jiayisunx jiayisunx marked this pull request as ready for review July 1, 2025 01:21
@leslie-fang-intel leslie-fang-intel requested a review from CaoE July 1, 2025 06:48
f"{local_buf_dtype}* {local_buffer_name} = buf_{local_buffer_name}.get();"
)
gen_loop_nest(loop_nest)
worksharing.close()
Collaborator

May I know for which case we need this line of change?

@jiayisunx (Collaborator, Author) Jul 1, 2025

Many UTs in test/inductor/test_cpu_select_algorithm.py failed after parallelization: without this line, worksharing was not closed and the generated code was missing a closing }.
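The brace mismatch can be illustrated with a minimal sketch of a code-emitting worksharing helper (hypothetical; not the actual Inductor WorkSharing class): the open emits `{`, the close emits `}`, so skipping close() leaves the generated C++ unbalanced.

```python
# Hypothetical sketch (illustrative only) of why a missing close() call
# drops a closing brace from the emitted C++ code.
class WorkSharing:
    def __init__(self):
        self.lines = []
        self._open = False

    def parallel(self):
        # Opening a worksharing region emits the pragma and an opening brace.
        self.lines.append("#pragma omp parallel")
        self.lines.append("{")
        self._open = True

    def close(self):
        # Closing the region emits the matching closing brace.
        if self._open:
            self.lines.append("}")
            self._open = False

ws = WorkSharing()
ws.parallel()
ws.lines.append("    // kernel body")
ws.close()  # without this call, the emitted code is missing a "}"
code = "\n".join(ws.lines)
assert code.count("{") == code.count("}")
```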

Collaborator

Would like to understand more about the difference. Will these UTs also fail if we keep the chunk size but increase the loop size?

@leslie-fang-intel (Collaborator) left a comment

Thanks for the PR. Please elaborate on why the worksharing.close() change is needed.

jiayisunx added a commit that referenced this pull request Jul 9, 2025
jiayisunx added a commit that referenced this pull request Jul 9, 2025
jiayisunx added a commit that referenced this pull request Jul 9, 2025
@jiayisunx jiayisunx added the ciflow/trunk Trigger trunk jobs on your pull request label Jul 11, 2025
@jiayisunx jiayisunx requested a review from jansel July 11, 2025 07:22
@jiayisunx (Collaborator, Author)

@pytorchbot merge

@pytorchmergebot (Collaborator)

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status here

@huydhn (Contributor) commented Jul 14, 2025

@pytorchbot revert -m 'Sorry for reverting your change, but an inductor compilation error shows up in trunk' -c nosignal

https://github.com/pytorch/pytorch/actions/runs/16255653984/job/45891797651#step:25:5755

@pytorchmergebot (Collaborator)

@pytorchbot successfully started a revert job. Check the current status here.
Questions? Feedback? Please reach out to the PyTorch DevX Team

pytorchmergebot added a commit that referenced this pull request Jul 14, 2025
…50762)"

This reverts commit 3321acc.

Reverted #150762 on behalf of https://github.com/huydhn due to Sorry for reverting your change, but an inductor compilation error shows up in trunk ([comment](#150762 (comment)))
@pytorchmergebot (Collaborator)

@jiayisunx your PR has been successfully reverted.

@pytorchmergebot pytorchmergebot added Reverted ci-no-td Do not run TD on this PR labels Jul 14, 2025
jiayisunx added a commit that referenced this pull request Jul 16, 2025
Pull Request resolved: #150762
Approved by: https://github.com/leslie-fang-intel, https://github.com/jansel
ghstack-source-id: b524b9e
jiayisunx added a commit that referenced this pull request Jul 21, 2025
Pull Request resolved: #150762
Approved by: https://github.com/leslie-fang-intel, https://github.com/jansel
ghstack-source-id: adf05f5
@jiayisunx (Collaborator, Author)

@pytorchbot merge

@pytorchmergebot (Collaborator)

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team


@github-actions github-actions bot deleted the gh/jiayisunx/63/head branch August 21, 2025 02:14

6 participants