[ROCm] Adjust elementwise_kernel settings on ROCm#32609

Closed
iotamudelta wants to merge 5 commits into pytorch:master from iotamudelta:adjust_loop_bounds

Conversation

@iotamudelta (Contributor)

Recent PR #31974 and the upcoming PR #32383 change the behavior of the elementwise_kernel infrastructure on CUDA.

To stay in sync, change the ROCm nd-loop behavior to match CUDA for now. Once the full rework is done, the ROCm settings will likely diverge again.
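As an illustration of the kind of settings involved (the names and constant values below are assumptions for illustration, not the actual PyTorch ones): an elementwise launch typically fixes a threads-per-block count and a per-thread work factor, and derives the grid size from them, so changing either constant changes occupancy and the loop structure of the kernel.

```python
# Illustrative sketch only: how elementwise launch settings such as
# threads-per-block and work-per-thread determine the grid size.
# The PR adjusts constants of this kind so ROCm matches the CUDA
# values until the loop rework lands; these numbers are hypothetical.

def ceil_div(a: int, b: int) -> int:
    """Integer division rounded up."""
    return (a + b - 1) // b

def grid_size(numel: int, block_threads: int = 256, work_per_thread: int = 4) -> int:
    """Number of blocks needed when each thread handles work_per_thread elements."""
    return ceil_div(numel, block_threads * work_per_thread)

print(grid_size(1_000_000))  # -> 977 blocks at 256 threads x 4 elements each
```

With 256 threads and 4 elements per thread, each block covers 1024 elements, so a million-element tensor needs 977 blocks; halving either constant doubles the grid.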

@iotamudelta added the module: rocm (AMD GPU support for Pytorch) and open source labels Jan 25, 2020
@kostmo (Member) commented Jan 25, 2020

💊 CircleCI build failures summary and remediations

As of commit d64e167:

  • 1/1 failures introduced in this PR

Detailed failure analysis

One may explore the probable reasons each build failed interactively on the Dr. CI website.

🕵️ 1 new failure recognized by patterns

The following build failures do not appear to be due to upstream breakage:

See CircleCI build pytorch_xla_linux_xenial_py3_6_clang7_build (1/1)

Step: "Build" (full log | pattern match details)

Jan 25 03:58:46 Applying patch file: /var/lib/jenkins/workspace/xla/torch_patches/X10-clip_grad.diff
Jan 25 03:58:46 Applying patch file: /var/lib/jenkins/workspace/xla/torch_patches/X10-device_test.diff
Jan 25 03:58:46 File to patch:
Jan 25 03:58:46 Skip this patch? [y]
Jan 25 03:58:46 4 out of 4 hunks ignored
Jan 25 03:58:46 Traceback (most recent call last):
Jan 25 03:58:46   File "/var/lib/jenkins/workspace/xla/scripts/cond_patch.py", line 67, in <module>
Jan 25 03:58:46     patch_repo(args)
Jan 25 03:58:46   File "/var/lib/jenkins/workspace/xla/scripts/cond_patch.py", line 49, in patch_repo
Jan 25 03:58:46     raise RuntimeError('Failed to apply patch: {}'.format(ppath))
Jan 25 03:58:46 RuntimeError: Failed to apply patch: /var/lib/jenkins/workspace/xla/torch_patches/X10-device_test.diff
Jan 25 03:58:46 + cleanup 
Jan 25 03:58:46 + retcode=1 
Jan 25 03:58:46 + set +x 
Jan 25 03:58:46 =================== sccache compilation log =================== 
Jan 25 03:58:46 =========== If your build fails, please take a look at the log above for possible reasons =========== 
Jan 25 03:58:46 Compile requests                 1 
Jan 25 03:58:46 Compile requests executed        0 
Jan 25 03:58:46 Cache hits                       0 
Jan 25 03:58:46 Cache misses                     0 
Jan 25 03:58:46 Cache timeouts                   0 
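For context on the traceback above, here is a minimal sketch of the failing logic, assuming nothing about the real cond_patch.py beyond what the log shows: patches are applied in order, and a RuntimeError is raised for the first one that fails. The apply_fn callback is a hypothetical stand-in for invoking the patch tool.

```python
# Hypothetical reconstruction of the logic in the traceback above.
# Only the patch_repo name and the RuntimeError message come from the
# log; apply_fn and the loop shape are assumptions for illustration.

def patch_repo(patch_paths, apply_fn):
    """Apply patches in order; stop at the first one that fails to apply."""
    for ppath in patch_paths:
        if not apply_fn(ppath):
            # Matches the error seen in the CI log when all hunks are rejected
            raise RuntimeError('Failed to apply patch: {}'.format(ppath))
```

In the log, X10-clip_grad.diff applies but X10-device_test.diff has all 4 hunks rejected, so the loop raises and the build exits with retcode 1 before any compilation happens (hence the zero sccache compile requests).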

This comment was automatically generated by Dr. CI.

@ngimel added the triaged label (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module) Jan 25, 2020
@facebook-github-bot (Contributor) left a comment

@ezyang is landing this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@facebook-github-bot (Contributor)

@ezyang merged this pull request in 5ac2593.


wuhuikx pushed a commit to wuhuikx/pytorch that referenced this pull request Jan 30, 2020
Summary:
Recent PR pytorch#31974 and the upcoming PR pytorch#32383 change the behavior of the elementwise_kernel infrastructure on CUDA.

To stay in sync, change the ROCm nd-loop behavior to match CUDA for now. Once the full rework is done, the ROCm settings will likely diverge again.
Pull Request resolved: pytorch#32609

Differential Revision: D19580121

Pulled By: ezyang

fbshipit-source-id: 4c8dcf6db3ac973e48ece6a665615cfe7d7cb764
ttumiel pushed a commit to ttumiel/pytorch that referenced this pull request Mar 4, 2020
(same commit message, Differential Revision, and fbshipit-source-id as above)

Labels

Merged · module: rocm (AMD GPU support for Pytorch) · open source · triaged (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module)

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants