[cpu] Modify inductor opt flag --- ftree-loop-vectorize#121782
Valentine233 wants to merge 6 commits into main
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/121782
Note: Links to docs will display an error until the docs builds have been completed.
❌ 1 New Failure, 1 Unrelated Failure as of commit ae50a74 with merge base e7141d1.
NEW FAILURE - The following job has failed.
BROKEN TRUNK - The following job failed but was already present on the merge base. 👉 Rebase onto the `viable/strict` branch to avoid these failures.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
jgong5
left a comment
Please share performance numbers to make sure there is no regression.
    if not config.cpp.enable_tree_loop_vec_opt_flag:
        base_flags += " -fno-tree-loop-vectorize"
Please add the issue links as a comment to explain why we have to disable this by default.
Thanks and added!
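The diff under review gates the flag on a config option. A minimal sketch of that pattern (the `CppConfig` class, default value, and base flags below are illustrative, not the actual `torch._inductor.config` definitions):

```python
from dataclasses import dataclass


@dataclass
class CppConfig:
    # When False (the proposed default), tree-loop vectorization is
    # explicitly disabled to avoid known correctness issues
    # (see #115261 and #113017).
    enable_tree_loop_vec_opt_flag: bool = False


def build_cpp_flags(cfg: CppConfig) -> str:
    # Base optimization flags are placeholders for this sketch.
    base_flags = "-O3 -ffast-math"
    if not cfg.enable_tree_loop_vec_opt_flag:
        base_flags += " -fno-tree-loop-vectorize"
    return base_flags


print(build_cpp_flags(CppConfig()))
```

Keeping the opt-in behind a config knob lets users who are unaffected by the functional issues re-enable the vectorization pass without patching the flag string.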
resnet50,float32,dynamic,default,2.28794694
timm_efficientnet,float32,static,cpp,2.72195686
mobilenet_v3_large,float32,static,cpp,3.02274304
mobilenet_v3_large,float32,static,cpp,2.9000000
To pass CI, the expected speedup is modified. According to the validation, this model doesn't have a perf regression.
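The CI check in question compares measured speedups against expected values recorded in a CSV. A hedged sketch of such a check (field order taken from the rows above; the relative-tolerance value is an assumption, not the actual CI threshold):

```python
import csv
import io

# Expected-speedup rows in the same shape as the snippet above:
# model,dtype,mode,backend,expected_speedup
EXPECTED = "mobilenet_v3_large,float32,static,cpp,2.9000000\n"


def passes(measured: float, expected: float, rel_tol: float = 0.1) -> bool:
    # A model "passes" if the measured speedup is no more than
    # rel_tol below the recorded expectation (tolerance assumed).
    return measured >= expected * (1.0 - rel_tol)


rows = list(csv.reader(io.StringIO(EXPECTED)))
model, dtype, mode, backend, exp = rows[0]
print(model, passes(2.95, float(exp)))
```

Lowering the recorded expectation, as done in the diff above, widens the margin so a model whose measured speedup dipped slightly still passes.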
3612edd to ae50a74
@Valentine233 please help to check whether we can enable the vectorization for the regression models.
Updated in PR description.
We may wait until all the regressions are fixed, if they can be solved in the short term.
Looks like this PR hasn't been updated in a while, so we're going to go ahead and mark this as
Reopen #121782, as more optimizations have landed. Fixes #115261, #113017.

For the CPU inductor path, remove `-ftree-loop-vectorize` from the optimization flags to fix functional issues.

### Validation on 3 benchmark suites

#### FP32

[speedup chart image]

Outlier models (speedup < 0.8, single socket): none.

#### BF16

[speedup chart image]

Outlier models (speedup < 0.8, single socket, multi-threaded):

- functorch_dp_cifar10: 0.58
- opacus_cifar10: 0.57

Pull Request resolved: #136827
Approved by: https://github.com/jansel, https://github.com/jgong5
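The outlier criterion used in the validation (speedup < 0.8) is straightforward to apply to a results table. A sketch using the two BF16 outliers reported in the description plus one invented non-outlier entry:

```python
# Filter benchmark results down to outlier models (speedup < 0.8),
# mirroring the criterion stated in the PR description. The two
# outlier values come from the description; resnet50 is illustrative.
results = {
    "functorch_dp_cifar10": 0.58,
    "opacus_cifar10": 0.57,
    "resnet50": 2.29,  # invented non-outlier for contrast
}

outliers = {model: s for model, s in results.items() if s < 0.8}

for model, speedup in sorted(outliers.items()):
    print(f"{model}: {speedup}")
```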
Fixes #115261, #113017.

For the CPU inductor path, remove `-ftree-loop-vectorize` from the optimization flags to fix functional issues.

### Validation on 3 benchmark suites

#### FP32

Outlier models (speedup < 0.8, single socket):

- atomic_add (scatter_add) @CaoE
- index_expr (batch_norm): expected to be fixed by [inductor][cpp] complete vectorization for int32/int64 #122961

#### BF16

Outlier models (speedup < 0.8, single socket):

- atomic_add (scatter_add) @CaoE
- index_expr (batch_norm): expected to be fixed by [inductor][cpp] complete vectorization for int32/int64 #122961
cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @peterbell10 @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @aakhundov @ColinPeppler @amjames @desertfire @chauhang