
Get matrix multiply with unbacked SymInt working #95218

Closed
ezyang wants to merge 6 commits into gh/ezyang/1837/base from gh/ezyang/1837/head

Conversation

Contributor

@ezyang ezyang commented Feb 21, 2023

Stack from ghstack (oldest at bottom):

This PR gets `reflect @ R @ reflect` working, where R has an unbacked batch size. This pattern occurred in CrystalDPR. Summary of changes:

  • torch.broadcast_shapes avoids guarding on unbacked SymInts when testing for broadcastable dims. I extracted this to "Rewrite torch.broadcast_shapes to be unbacked SymInt friendly" (#95217) for separate review; it is repeated in this PR as it is necessary for the E2E test.
  • I disable matrix multiply folding when there is an unbacked SymInt on any input. Folding is strictly a performance optimization and can be omitted. Also, I believe export would prefer to get matmul (rather than bmm/etc), so we should eventually get "[POC] Don't decompose matmul" (#91081) going.
  • I add a direct Python transcription of the reshape composite, adapted from "reshape in python" (#84584). I cannot use the PrimTorch composite as it has problems when I register it pre-autograd. It has the same implementation as regular reshape, except that at the beginning there is one more test for trivial reshapes, which is sufficient for the matmul example.
  • I hand-write a meta function for expand, rather than using the PrimTorch decomposition. I couldn't really figure out how to make the PrimTorch decomposition guard free, but with the hand-written meta it is clear where the divergence lies: we cannot easily choose the correct stride for the unbacked dim, as we need to know whether the size is one (in which case we give the predicted stride) or non-one (in which case we MUST give zero). In composability sync, we agreed that changes to striding behavior are fair game with unbacked SymInts, so I just unconditionally give these dims stride zero.
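The stride rule in the last bullet can be sketched in plain Python. This is an illustrative helper, not PyTorch's actual meta function: `expand_meta_strides` and its `unbacked` parameter are hypothetical names, and real SymInts are stand-ins here for ordinary ints whose indices we mark explicitly.

```python
def expand_meta_strides(in_sizes, in_strides, out_sizes, unbacked=()):
    """Sketch of stride computation for expand(tensor, out_sizes).

    unbacked: indices of output dims whose size is an unbacked SymInt.
    """
    ndim_off = len(out_sizes) - len(in_sizes)
    strides = []
    for i, out_size in enumerate(out_sizes):
        j = i - ndim_off
        if j < 0:
            # New leading dim introduced by expand: always stride 0.
            strides.append(0)
        elif i in unbacked:
            # Cannot guard on size == 1 without forcing the symbol,
            # so (per the PR) unconditionally pick stride 0.
            strides.append(0)
        elif in_sizes[j] == 1 and out_size != 1:
            # Dim broadcast from size 1: stride must be 0.
            strides.append(0)
        else:
            # Ordinary dim: keep the input stride.
            strides.append(in_strides[j])
    return strides
```

For a backed size-1 dim this reproduces the usual broadcast stride; the only behavior change is that an unbacked dim gets stride 0 even when its runtime size turns out to be 1, which composability sync agreed is acceptable.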

Signed-off-by: Edward Z. Yang <ezyang@meta.com>

@pytorch-bot

pytorch-bot bot commented Feb 21, 2023

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/95218

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 0860fd3:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@ezyang ezyang added the release notes: composability and topic: not user facing labels Feb 21, 2023
ezyang added a commit that referenced this pull request Feb 21, 2023
Signed-off-by: Edward Z. Yang <ezyang@meta.com>

ghstack-source-id: e230b4f
Pull Request resolved: #95218
@ezyang ezyang requested review from ngimel and suo February 21, 2023 15:39
# NOTE: shape is a vararg because Tensor.reshape can be called as either
# Tensor.reshape(a, b, c) or Tensor.reshape((a, b, c)). The function
# torch.reshape doesn't support unpacked shapes.
@aten.reshape.default.py_impl(DispatchKey.CompositeImplicitAutograd)
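The two calling conventions mentioned in the NOTE are typically handled by normalizing the vararg. This is an illustrative helper, not PyTorch source; `normalize_shape` is a hypothetical name for the pattern such a `py_impl` would use internally.

```python
def normalize_shape(*shape):
    # A single tuple/list argument is unpacked into the shape;
    # otherwise the varargs themselves are the shape, so that
    # reshape(a, b, c) and reshape((a, b, c)) behave identically.
    if len(shape) == 1 and isinstance(shape[0], (tuple, list)):
        return tuple(shape[0])
    return tuple(shape)
```

With this normalization, the rest of the reshape implementation only ever sees a flat tuple of dimension sizes.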
Contributor Author


This appears to be deeply problematic. I'll probably figure out another way to do this.

@ezyang ezyang requested a review from mruberry as a code owner February 21, 2023 17:15
@ezyang ezyang added the ciflow/trunk Trigger trunk jobs on your pull request label Feb 22, 2023
@ezyang ezyang closed this Feb 23, 2023
@facebook-github-bot facebook-github-bot deleted the gh/ezyang/1837/head branch June 8, 2023 16:51

Labels

  • ciflow/trunk (Trigger trunk jobs on your pull request)
  • release notes: composability (release notes category)
  • release notes: fx (release notes category)
  • topic: not user facing (topic category)
