[ROCm] enabling miopen_batch_norm lowering in inductor by jataylo · Pull Request #105740 · pytorch/pytorch

jataylo · 2023-07-21T14:31:52Z

Enabling miopen_batch_norm lowering for inductor only.

This is to avoid errors observed in some models and perf difference is very close from initial benchmarks.

LoweringException: RuntimeError: Expected contiguous tensor, but got non-contiguous tensor for argument #1 'input' (while checking arguments for miopen_batch_norm)
  target: aten.miopen_batch_norm.default

cc @jeffdaily @sunway513 @jithunnair-amd @pruthvistony @ROCmSupport @dllehr-amd @hongxiayang @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @Xia-Weiwen @wenzhe-nrv @jiayisunx @peterbell10 @ipiszy @ngimel @yf225 @chenyang78 @kadeng @muchulee8 @aakhundov

pytorch-bot · 2023-07-21T14:31:55Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/105740

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure, 4 Unrelated Failures

As of commit 82faf5b:

NEW FAILURE - The following job has failed:

linux-focal-rocm5.6-py3.8 / test (slow, 1, 1, linux.rocm.gpu) (gh)

BROKEN TRUNK - The following job failed but were present on the merge base 29f856e:

👉 Rebase onto the `viable/strict` branch to avoid these failures

linux-focal-py3.8-gcc7 / test (distributed, 2, 2, linux.2xlarge) (gh)

UNSTABLE - The following jobs failed but were likely due to flakiness present on trunk and has been marked as unstable:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

jithunnair-amd · 2023-07-21T15:17:32Z

@jataylo Are the perf issues ( due to which the lowering was disabled) resolved in ROCm5.6 specifically?

jataylo · 2023-07-21T16:17:59Z

@jataylo Are the perf issues ( due to which the lowering was disabled) resolved in ROCm5.6 specifically?

@jithunnair-amd Not from ROCm5.6 specifically seems to mostly be python/triton updates that is closing the gap here.

The primary motivation for pushing this change is a failure faced in a few models that use batch_norm from some pytorch change.

LoweringException: RuntimeError: Expected contiguous tensor, but got non-contiguous tensor for argument #1 'input' (while checking arguments for miopen_batch_norm)
  target: aten.miopen_batch_norm.default

We will keep tabs on performance and revert if deemed necessary cc: @dllehr-amd

jataylo · 2023-07-26T15:37:26Z

Hey @malfet enabling lowering for miopen_batch_norm on inductor to avoid some failures and unskipped the affected UTs. Could you help us approve this?

UTs failures are unrelated to this change.

The batch norm related UTs pass e.g.
2023-07-26T10:08:31.3428967Z inductor/test_torchinductor.py::CudaTests::test_batch_norm_2d_cuda PASSED [2.4299s] [ 58%]

cc: @jithunnair-amd

jataylo · 2023-08-01T13:52:17Z

Hi @malfet @huydhn @ngimel this is causing some pt2 model breakages for us until merged if you have some time to help us review.

The failures are unrelated but please let me know if I should rebase to get this green.

malfet · 2023-08-01T22:37:24Z

@pytorchbot merge -i

pytorchmergebot · 2023-08-01T22:39:07Z

Merge started

Your change will be merged while ignoring the following 5 checks: pull / linux-focal-py3.8-gcc7 / test (distributed, 2, 2, linux.2xlarge), slow / linux-focal-rocm5.6-py3.8 / test (slow, 1, 1, linux.rocm.gpu), periodic / linux-focal-rocm5.6-py3.8 / test (distributed, 1, 2, linux.rocm.gpu, unstable), trunk / linux-focal-rocm5.6-py3.8 / test (default, 1, 3, linux.rocm.gpu, unstable), inductor / cuda11.8-py3.10-gcc7-sm86 / test (inductor_torchbench, 1, 1, linux.g5.4xlarge.nvidia.gpu, unstable)

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

[ROCm] enabling miopen_batch_norm lowering in inductor

0f10ac6

github-actions bot added the module: inductor label Jul 21, 2023

jataylo added topic: not user facing topic category keep-going Don't stop on first failure, keep running tests until the end labels Jul 21, 2023

pytorchbot added the open source label Jul 21, 2023

jataylo and others added 10 commits July 24, 2023 10:22

Remove unused TEST_WITH_ROCM import

b1dd41d

Fix syntax error

de68b6a

Remove miopen_batch_norm from expected failures in test_decomp.py

cb88a0f

Update decompositions.py

99f885e

Add miopen_batch_norm_backward to decomp init

f4db2b6

Linting issue

aed3ad7

Removed miopen_batch_norm_backward registration

2d115c3

Remove miopen_batch_norm_backward lowering

1a316c4

Remove miopen_batch_norm_backward lowering

fa8920f

Move from _decomp/decomposition.py to _inductor/decomposition.py

55341aa

jataylo removed the keep-going Don't stop on first failure, keep running tests until the end label Jul 26, 2023

typing import

82faf5b

jataylo requested a review from malfet July 26, 2023 15:33

jataylo marked this pull request as ready for review July 26, 2023 15:33

jataylo added the rocm priority high priority ROCm PRs from performance or other aspects label Jul 26, 2023

jataylo requested a review from jithunnair-amd July 26, 2023 15:36

jataylo requested a review from ngimel July 27, 2023 13:26

jithunnair-amd approved these changes Jul 31, 2023

View reviewed changes

jataylo requested a review from huydhn August 1, 2023 13:48

malfet approved these changes Aug 1, 2023

View reviewed changes

pytorchmergebot added the merging label Aug 1, 2023

pytorchmergebot added Merged and removed merging labels Aug 1, 2023

pytorchmergebot closed this in 40184b2 Aug 1, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ROCm] enabling miopen_batch_norm lowering in inductor#105740

[ROCm] enabling miopen_batch_norm lowering in inductor#105740
jataylo wants to merge 12 commits intopytorch:mainfrom
jataylo:miopen_batch_norm_lowering

jataylo commented Jul 21, 2023 •

edited

Loading

Uh oh!

pytorch-bot bot commented Jul 21, 2023 •

edited

Loading

Uh oh!

jithunnair-amd commented Jul 21, 2023

Uh oh!

jataylo commented Jul 21, 2023

Uh oh!

jataylo commented Jul 26, 2023

Uh oh!

jataylo commented Aug 1, 2023

Uh oh!

malfet commented Aug 1, 2023

Uh oh!

pytorchmergebot commented Aug 1, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

jataylo commented Jul 21, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Jul 21, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/105740

❌ 1 New Failure, 4 Unrelated Failures

Uh oh!

jithunnair-amd commented Jul 21, 2023

Uh oh!

jataylo commented Jul 21, 2023

Uh oh!

jataylo commented Jul 26, 2023

Uh oh!

jataylo commented Aug 1, 2023

Uh oh!

malfet commented Aug 1, 2023

Uh oh!

pytorchmergebot commented Aug 1, 2023

Merge started

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

jataylo commented Jul 21, 2023 •

edited

Loading

pytorch-bot bot commented Jul 21, 2023 •

edited

Loading