fix aliasing bug in pixel shuffle/unshuffle by bdhirsh · Pull Request #86608 · pytorch/pytorch

bdhirsh · 2022-10-10T15:29:34Z

cc @albanD - `at::pixel_shuffle` and `at::pixel_unshuffle` advertise themselves as non-aliasing, but they have a C++ decomposition that internally uses `reshape()`, which means they might return an alias of the input.
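For illustration, the view-or-copy behaviour of `reshape()` can be observed from Python. This is a minimal sketch, not code from the PR: the helper name `same_start` is made up here, and comparing `data_ptr()` values is only a rough proxy for aliasing that happens to be sufficient for these two cases.

```python
import torch

def same_start(a, b):
    # Hypothetical helper: True when both tensors start at the same memory address.
    return a.data_ptr() == b.data_ptr()

x = torch.arange(12.0).reshape(3, 4)

# Contiguous input: reshape() can reinterpret the strides, so it returns a view.
viewed = x.reshape(4, 3)

# Transposed (non-contiguous) input: no stride layout can express the flattened
# result, so reshape() has to copy.
copied = x.t().reshape(12)

# Writing through the view also changes x, because the two share memory.
viewed[0, 0] = 100.0
```

An op that promises a non-aliasing output cannot rely on `reshape()` alone, since which of the two paths is taken depends on the input's sizes and strides.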

I happened to notice this because a bunch of tests in `test/test_ops.py` failed when I ran them locally with a `DEBUG=1` build.

(P.S.: when are we finally gonna get a debug build test in CI? 😃)

I fixed this by adding an extra clone, which... is going to be an unnecessary perf hit in the case where the `reshape()` already copied the input. My hope is that this is fine, because it only impacts the composite kernel: we already have a "fast" CPU kernel that does the right thing. Is `pixel_shuffle`/`pixel_unshuffle` commonly used with CUDA? Maybe we should just add a fast CUDA kernel for it if that's the case.

Alternatively, it seems like it would be nice if `reshape()` accepted an optional argument to unconditionally return a copy. That seems like a rabbit hole that isn't worth going down for now though - I remember a discussion a while ago about making `reshape()` copy-on-write.

pytorch-bot · 2022-10-10T15:29:36Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/86608

❌ 1 Failures, 3 Pending

As of commit d51903b:

The following jobs have failed:

macos-12-py3-x86-64 / build

This comment was automatically generated by Dr. CI and updates every 15 minutes.

albanD · 2022-10-10T16:01:16Z

> (P.S.: when are we finally gonna get a debug build test in CI? 😃)

There should be some:
periodic / linux-bionic-cuda11.6-py3.7-gcc7-debug
periodic / linux-bionic-cuda11.7-py3.7-gcc7-debug

Not sure why this isn't caught... cc @malfet any known issues left with the debug build?

albanD · 2022-10-10T16:03:09Z

aten/src/ATen/native/PixelShuffle.cpp


-  return input_permuted.reshape(final_shape);
+  // pixel_shuffle expects to *never* return an alias of the input.
+  return input_permuted.reshape(final_shape).clone();


Just do: `input_permuted.clone().view(final_shape)` to avoid the extra copy ;)

bdhirsh · 2022-10-10

oh you're totally right haha
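A Python sketch of the two variants discussed in this review thread, assuming a 4-D `(N, C*r*r, H, W)` input. The function name `pixel_shuffle_sketch` and the `fixed` flag are illustrative and not part of PyTorch; the real change lives in the C++ composite kernel. One assumption worth flagging: `clone()` defaults to `torch.preserve_format`, which keeps the permuted strides of a dense tensor, so this sketch requests a contiguous clone explicitly to make the following `view()` valid. The `r = 1` input is simply a convenient case where the old path aliases; it is not claimed to be the case that failed in `test/test_ops.py`.

```python
import torch
import torch.nn.functional as F

def pixel_shuffle_sketch(x, r, fixed=True):
    n, c, h, w = x.shape
    oc = c // (r * r)
    # (N, oc, r, r, H, W) -> (N, oc, H, r, W, r)
    permuted = x.reshape(n, oc, r, r, h, w).permute(0, 1, 4, 2, 5, 3)
    final_shape = (n, oc, h * r, w * r)
    if fixed:
        # Exactly one copy into fresh contiguous memory, then a free view of it.
        return permuted.clone(memory_format=torch.contiguous_format).view(final_shape)
    # Old behaviour: reshape() copies only when it must, so it may alias x.
    return permuted.reshape(final_shape)

x = torch.arange(2 * 4 * 3 * 3, dtype=torch.float32).reshape(2, 4, 3, 3)

# With r = 1 the permutation only moves size-1 dims, so reshape() can return a view.
old_out = pixel_shuffle_sketch(x, 1, fixed=False)
new_out = pixel_shuffle_sketch(x, 1, fixed=True)
```

Compared with `reshape(final_shape).clone()`, this ordering never copies twice: the clone is the only copy, and the `view()` on its contiguous result is metadata-only.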

Fixes #82235 cc albanD - `at::pixel_shuffle` and `at::pixel_unshuffle` advertise as being non-aliasing, but they have a C++ decomposition that internally uses reshape(), which means that it might return an alias. I happened to notice this because a bunch of tests in `test/test_ops.py` failed when I ran locally with a `DEBUG=1` build. (P.S.: when are we finally gonna get a debug build test in CI? 😃) I fixed by adding an extra clone, which... is going to be an unnecessary perf hit in the case where the `reshape()` already properly cloned the input. My hope is that this is fine, because this only impacts the composite kernel- we already have a "fast" CPU kernel that does the right thing. Is `pixel_shuffle/unshuffle` commonly used with cuda? Maybe we should just add a fast cuda kernel for it if that's the case. Alternatively, it seems like it would be nice if `reshape()` accepted an optional argument to unconditionally return a copy. That seems like a rabbit hole that isn't worth going down for now though - I remember a discussion a while ago about making `reshape()` copy-on-write [ghstack-poisoned]

albanD

SGTM

eellison

Cool! Would you remove the skips here? https://github.com/pytorch/pytorch/blob/master/test/test_ops.py#L1778

vadimkantorov · 2022-12-12T23:25:53Z

About the extra copy:
there's a common theme that non-strict, reshape-style "aliasing is possible" semantics are sometimes preferable, because they elide copies in cases where it's known that no in-place op will happen. E.g. one in-the-wild example is the torch.cat wrapper in detectron2:
https://github.com/facebookresearch/detectron2/blob/48b598b4f61fbb24182a69b521b2a0ba3252b842/detectron2/layers/wrappers.py#L39

which shortcuts if the number of passed tensors is 1

(similar cases may be multiplying by 1 / adding 0, but they might be less realistic...)

So I'm not sure how to control for this, if it's deemed an important aspect. Maybe allow copy=True/copy=None arguments to such ops, with copy=True by default.
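The `copy` idea could look roughly like the wrapper below. This is a hypothetical sketch: `cat_maybe_alias` and its `copy` parameter are invented here for illustration and exist neither in PyTorch nor in detectron2.

```python
import torch

def cat_maybe_alias(tensors, dim=0, copy=True):
    # Hypothetical wrapper around torch.cat with an opt-out of the defensive copy.
    if len(tensors) == 1:
        # copy=True keeps the "never aliases its inputs" contract;
        # any other value lets the caller accept an alias and skip the copy.
        return tensors[0].clone() if copy else tensors[0]
    return torch.cat(tensors, dim=dim)

t = torch.ones(2, 3)
safe = cat_maybe_alias([t])             # fresh memory
fast = cat_maybe_alias([t], copy=None)  # the input tensor itself, no copy
```

With `copy=True` as the default, existing callers keep the strict semantics, and only code that knows no in-place op follows would opt in to aliasing.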

vadimkantorov · 2022-12-13T15:10:53Z

One way could be to allow expressing "no in-place ops are allowed on this tensor or its views", or at least "assume no in-place ops will happen on this tensor or its views"; then the executor engine could easily optimize out the unneeded copies. @albanD

fix aliasing bug in pixel shuffle/unshuffle

13d66fb

[ghstack-poisoned]

github-actions bot requested review from Chillee, Krovatkin, albanD, anjali411, antoniojkim, ezyang, miladm and wconstab October 10, 2022 15:29

albanD reviewed Oct 10, 2022

albanD approved these changes Oct 11, 2022

pytorch-bot bot added the ciflow/trunk label (Trigger trunk jobs on your pull request) Oct 11, 2022

bdhirsh requested review from mruberry and ngimel as code owners October 11, 2022 15:40

eellison reviewed Oct 11, 2022

bdhirsh added 5 commits October 11, 2022 13:55

bdhirsh added 2 commits October 12, 2022 13:51

pytorchmergebot closed this in 0feccda Oct 13, 2022

jxtps mentioned this pull request Dec 12, 2022

PixelShuffle/Unshuffle Channels Last Support on GPU #90708

Open

bdhirsh wants to merge 10 commits into gh/bdhirsh/322/base from gh/bdhirsh/322/head


4 participants
