
[Tensorexpr] Fix and improve handling multiple gpu devices #38365

Closed
protonu wants to merge 3 commits into pytorch:master from protonu:fixmultigpu

Conversation

@protonu protonu commented May 12, 2020

These commits fix a bug that was exposed when we removed the fallback path. The fix is to set the appropriate device before setting the CUDA stream.
The improvement: when compiling, set the device to the new device only if it differs from the prior device, and remove a redundant call to cudaFree.
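
For reference, here is a minimal sketch of the pattern described above. The `call_kernel` entry point and the `c10::Device` argument are hypothetical stand-ins; the real change lives in the tensorexpr CUDA codegen, and the removal of the redundant cudaFree call is not shown.

```cpp
#include <ATen/cuda/CUDAContext.h>
#include <c10/core/Device.h>
#include <cuda_runtime.h>

// Sketch only: illustrates "set the device before picking the stream"
// and "switch devices only when they differ", not the actual codegen.
void call_kernel(const c10::Device& device) {
  const auto prior_device = at::cuda::current_device();
  // Switch to the target GPU only if it differs from the current one.
  if (device.index() != prior_device) {
    cudaSetDevice(device.index());
  }
  // The "current" CUDA stream is per-device, so it must be queried
  // after the device switch; grabbing it first targets the wrong GPU.
  cudaStream_t stream = at::cuda::getCurrentCUDAStream(device.index());
  // ... launch the compiled kernel on `stream` ...
  (void)stream; // placeholder: the launch itself is omitted here
  // Restore the caller's device before returning.
  if (device.index() != prior_device) {
    cudaSetDevice(prior_device);
  }
}
```

Skipping the cudaSetDevice call when the target already matches the current device avoids an unnecessary runtime round-trip on the common single-GPU path.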

@protonu protonu requested review from ZolotukhinM and zheng-xq May 12, 2020 23:00
@protonu protonu requested a review from apaszke as a code owner May 12, 2020 23:00
@facebook-github-bot facebook-github-bot added the oncall: jit (Add this issue/PR to JIT oncall triage queue) label May 12, 2020
@protonu protonu requested a review from Krovatkin May 12, 2020 23:01
@protonu protonu changed the title from "Fix and improve handling multiple gpu devices" to "[Tensorexpr] Fix and improve handling multiple gpu devices" May 12, 2020

@facebook-github-bot facebook-github-bot left a comment

@protonu has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

ptr_to_args[buffer_args.size() + 1] = &rand_offset;
}

// Record the device that was current before the launch so it can be restored.
const auto prior_device = at::cuda::current_device();

Good catch!

Since you are now using this->device(), it might be a good idea to check somewhere in "initialize()" that device() can only be kCuda.
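
A sketch of the suggested guard, assuming a hypothetical free function check_is_cuda; the actual check would live in the codegen's initialize():

```cpp
#include <c10/core/Device.h>
#include <c10/util/Exception.h>

// Hypothetical guard per the review suggestion: fail fast during
// initialization if the codegen was built for a non-CUDA device.
void check_is_cuda(const c10::Device& device) {
  TORCH_CHECK(
      device.is_cuda(),
      "CudaCodeGen expects a CUDA device, got: ", device);
}
```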


dr-ci Bot commented May 13, 2020

💊 CI failures summary and remediations

As of commit c47e560 (more details on the Dr. CI page):


  • 1/1 failures possibly* introduced in this PR
    • 1/1 non-CircleCI failure(s)

ci.pytorch.org: 1 failed


This comment was automatically generated by Dr. CI.

@facebook-github-bot

@protonu merged this pull request in d1eeb3b.

laurentdupin pushed a commit to laurentdupin/pytorch that referenced this pull request Apr 24, 2026
[Tensorexpr] Fix and improve handling multiple gpu devices (pytorch#38365)

Summary:
These commits fix a bug that was exposed when we removed the fallback path. The fix is to set the appropriate device before setting the CUDA stream.
The improvement: when compiling, set the device to the new device only if it differs from the prior device, and remove a redundant call to cudaFree.
Pull Request resolved: pytorch#38365

Reviewed By: zheng-xq

Differential Revision: D21537469

Pulled By: protonu

fbshipit-source-id: b9662dd623b5c7cfd23eb6894e992a43665641e4

Labels

Merged · oncall: jit (Add this issue/PR to JIT oncall triage queue)

4 participants