Get boolean masking to work with unbacked SymInts by ezyang · Pull Request #94523 · pytorch/pytorch

ezyang · 2023-02-09T17:21:26Z

Stack from ghstack (oldest at bottom):

This adds support for unbacked SymInts from nonzero, and then gets this working end-to-end with advanced indexing and boolean masking. I needed to apply a variety of fixes to get this to work; these fixes are emblematic of the kind of enablement work we will need to do for unbacked SymInts as they are used more heavily.

Feature guard tracing through nonzero with capture_dynamic_output_shape_ops because Inductor generally can't handle it
Allow boolean tensors into the index.Tensor meta, and redirect fake tensor to call the meta function directly (so that when it calls nonzero, it decomposes to the fake tensor nonzero)
Refactor compute_elementwise_output_strides to return the output permutation. I then use this to rewrite the empty_like reference to avoid calling empty_strided; instead, it allocates calls empty_permuted.
Refactor _broadcast_shapes to test for more common cases (shapes are equal) before broadcasting case, which is enough to get the broadcasting in advanced indexing going without adding an unnecessary == 1 guard.
Make nonzero return an unbacked symint in size, and put enough assumptions on it so that the boolean masking end to end works
Don't record memory_format on TensorMeta if there are unbacked SymInts. Actually, we can probably do a bit better here; sometimes a tensor will have unbacked SymInt but is obviously contiguous and we ought to report that. Leaving that for future work.

Signed-off-by: Edward Z. Yang ezyang@meta.com

cc @soumith @voznesenskym @yanboliang @penguinwu @anijain2305 @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @Xia-Weiwen @wenzhe-nrv @jiayisunx @desertfire

Signed-off-by: Edward Z. Yang <ezyang@meta.com> [ghstack-poisoned]

This adds support for unbacked SymInts from nonzero, and then gets this working end-to-end with advanced indexing and boolean masking. I needed to apply a variety of fixes to get this to work; these fixes are emblematic of the kind of enablement work we will need to do for unbacked SymInts as they are used more heavily. * When we allocate a contiguous tensor, hard code relevant fields (e.g., `is_contiguous_`) true, instead of keeping the complicated guardless implementation. As Horace notes, all guardless implementations really do is push off the evaluation problem until later. We do poke contiguity reasonably often, but we also typically allocate these with torch.empty so we know statically they are contiguous. * Allow boolean tensors into the index.Tensor meta, and redirect fake tensor to call the meta function directly (so that when it calls nonzero, it decomposes to the fake tensor nonzero) * Don't rewrap tensors in TensorMeta; it does nothing but hits `empty_strided` which is generally not unbacked SymInt friendly * Refactor `compute_elementwise_output_strides` to return the output permutation. I then use this to rewrite the `empty_like` reference to avoid calling `empty_strided`; instead, it allocates a contiguous tensor and permutes it. * Refactor `_broadcast_shapes` to test for more common cases (shapes are equal) before broadcasting case, which is enough to get the broadcasting in advanced indexing going without adding an unnecessary == 1 guard. * Make nonzero return an unbacked symint in size, and put enough assumptions on it so that the boolean masking end to end works * Don't remove leading 1s in setitem unless the value tensor is too big for the input tensor. Submitted separately at #94521 * Don't record `memory_format` on TensorMeta if there are unbacked SymInts. Actually, we can probably do a bit better here; sometimes a tensor will have unbacked SymInt but is obviously contiguous and we ought to report that. Leaving that for future work. Signed-off-by: Edward Z. Yang <ezyangmeta.com> [ghstack-poisoned]

pytorch-bot · 2023-02-09T21:28:31Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/94523

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Signed-off-by: Edward Z. Yang <ezyangmeta.com> ghstack-source-id: 894aec9 Pull Request resolved: #94523

This adds support for unbacked SymInts from nonzero, and then gets this working end-to-end with advanced indexing and boolean masking. I needed to apply a variety of fixes to get this to work; these fixes are emblematic of the kind of enablement work we will need to do for unbacked SymInts as they are used more heavily. * When we allocate a contiguous tensor, hard code relevant fields (e.g., `is_contiguous_`) true, instead of keeping the complicated guardless implementation. As Horace notes, all guardless implementations really do is push off the evaluation problem until later. We do poke contiguity reasonably often, but we also typically allocate these with torch.empty so we know statically they are contiguous. * Allow boolean tensors into the index.Tensor meta, and redirect fake tensor to call the meta function directly (so that when it calls nonzero, it decomposes to the fake tensor nonzero) * Don't rewrap tensors in TensorMeta; it does nothing but hits `empty_strided` which is generally not unbacked SymInt friendly * Refactor `compute_elementwise_output_strides` to return the output permutation. I then use this to rewrite the `empty_like` reference to avoid calling `empty_strided`; instead, it allocates a contiguous tensor and permutes it. * Refactor `_broadcast_shapes` to test for more common cases (shapes are equal) before broadcasting case, which is enough to get the broadcasting in advanced indexing going without adding an unnecessary == 1 guard. * Make nonzero return an unbacked symint in size, and put enough assumptions on it so that the boolean masking end to end works * Don't remove leading 1s in setitem unless the value tensor is too big for the input tensor. Submitted separately at #94521 * Don't record `memory_format` on TensorMeta if there are unbacked SymInts. Actually, we can probably do a bit better here; sometimes a tensor will have unbacked SymInt but is obviously contiguous and we ought to report that. Leaving that for future work. Signed-off-by: Edward Z. Yang <ezyangmeta.com> [ghstack-poisoned]

Signed-off-by: Edward Z. Yang <ezyangmeta.com> ghstack-source-id: 4f2cf74 Pull Request resolved: #94523

Extracted from #94523 Signed-off-by: Edward Z. Yang <ezyang@meta.com> [ghstack-poisoned]

Extracted from #94523 Signed-off-by: Edward Z. Yang <ezyangmeta.com> ghstack-source-id: 45a7e62 Pull Request resolved: #95004

…nts" Extracted from #94523 which has E2E test Signed-off-by: Edward Z. Yang <ezyangmeta.com> [ghstack-poisoned]

Extracted from #94523 which has E2E test Signed-off-by: Edward Z. Yang <ezyangmeta.com> ghstack-source-id: ed0f8a0 Pull Request resolved: #95003

Extracted from #94523 which has E2E test Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: #95003 Approved by: https://github.com/voznesenskym, https://github.com/ngimel

Extracted from #94523 Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: #95004 Approved by: https://github.com/voznesenskym, https://github.com/ngimel, https://github.com/Skylion007

This adds support for unbacked SymInts from nonzero, and then gets this working end-to-end with advanced indexing and boolean masking. I needed to apply a variety of fixes to get this to work; these fixes are emblematic of the kind of enablement work we will need to do for unbacked SymInts as they are used more heavily. * When we allocate a contiguous tensor, hard code relevant fields (e.g., `is_contiguous_`) true, instead of keeping the complicated guardless implementation. As Horace notes, all guardless implementations really do is push off the evaluation problem until later. We do poke contiguity reasonably often, but we also typically allocate these with torch.empty so we know statically they are contiguous. * Allow boolean tensors into the index.Tensor meta, and redirect fake tensor to call the meta function directly (so that when it calls nonzero, it decomposes to the fake tensor nonzero) * Don't rewrap tensors in TensorMeta; it does nothing but hits `empty_strided` which is generally not unbacked SymInt friendly * Refactor `compute_elementwise_output_strides` to return the output permutation. I then use this to rewrite the `empty_like` reference to avoid calling `empty_strided`; instead, it allocates a contiguous tensor and permutes it. * Refactor `_broadcast_shapes` to test for more common cases (shapes are equal) before broadcasting case, which is enough to get the broadcasting in advanced indexing going without adding an unnecessary == 1 guard. * Make nonzero return an unbacked symint in size, and put enough assumptions on it so that the boolean masking end to end works * Don't remove leading 1s in setitem unless the value tensor is too big for the input tensor. Submitted separately at #94521 * Don't record `memory_format` on TensorMeta if there are unbacked SymInts. Actually, we can probably do a bit better here; sometimes a tensor will have unbacked SymInt but is obviously contiguous and we ought to report that. Leaving that for future work. Signed-off-by: Edward Z. Yang <ezyangmeta.com> [ghstack-poisoned]

This adds support for unbacked SymInts from nonzero, and then gets this working end-to-end with advanced indexing and boolean masking. I needed to apply a variety of fixes to get this to work; these fixes are emblematic of the kind of enablement work we will need to do for unbacked SymInts as they are used more heavily. * When we allocate a contiguous tensor, hard code relevant fields (e.g., `is_contiguous_`) true, instead of keeping the complicated guardless implementation. As Horace notes, all guardless implementations really do is push off the evaluation problem until later. We do poke contiguity reasonably often, but we also typically allocate these with torch.empty so we know statically they are contiguous. * Allow boolean tensors into the index.Tensor meta, and redirect fake tensor to call the meta function directly (so that when it calls nonzero, it decomposes to the fake tensor nonzero) * Don't rewrap tensors in TensorMeta; it does nothing but hits `empty_strided` which is generally not unbacked SymInt friendly * Refactor `compute_elementwise_output_strides` to return the output permutation. I then use this to rewrite the `empty_like` reference to avoid calling `empty_strided`; instead, it allocates a contiguous tensor and permutes it. * Refactor `_broadcast_shapes` to test for more common cases (shapes are equal) before broadcasting case, which is enough to get the broadcasting in advanced indexing going without adding an unnecessary == 1 guard. * Make nonzero return an unbacked symint in size, and put enough assumptions on it so that the boolean masking end to end works * Don't remove leading 1s in setitem unless the value tensor is too big for the input tensor. Submitted separately at #94521 * Don't record `memory_format` on TensorMeta if there are unbacked SymInts. Actually, we can probably do a bit better here; sometimes a tensor will have unbacked SymInt but is obviously contiguous and we ought to report that. Leaving that for future work. Signed-off-by: Edward Z. Yang <ezyangmeta.com> cc soumith voznesenskym yanboliang penguinwu anijain2305 EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx desertfire [ghstack-poisoned]

This adds support for unbacked SymInts from nonzero, and then gets this working end-to-end with advanced indexing and boolean masking. I needed to apply a variety of fixes to get this to work; these fixes are emblematic of the kind of enablement work we will need to do for unbacked SymInts as they are used more heavily. * Feature guard tracing through nonzero with capture_dynamic_output_shape_ops because Inductor generally can't handle it * Allow boolean tensors into the index.Tensor meta, and redirect fake tensor to call the meta function directly (so that when it calls nonzero, it decomposes to the fake tensor nonzero) * Refactor `compute_elementwise_output_strides` to return the output permutation. I then use this to rewrite the `empty_like` reference to avoid calling `empty_strided`; instead, it allocates calls `empty_permuted`. * Refactor `_broadcast_shapes` to test for more common cases (shapes are equal) before broadcasting case, which is enough to get the broadcasting in advanced indexing going without adding an unnecessary == 1 guard. * Make nonzero return an unbacked symint in size, and put enough assumptions on it so that the boolean masking end to end works * Don't record `memory_format` on TensorMeta if there are unbacked SymInts. Actually, we can probably do a bit better here; sometimes a tensor will have unbacked SymInt but is obviously contiguous and we ought to report that. Leaving that for future work. Signed-off-by: Edward Z. Yang <ezyangmeta.com> cc soumith voznesenskym yanboliang penguinwu anijain2305 EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx desertfire [ghstack-poisoned]

…h#95003) Extracted from pytorch#94523 which has E2E test Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: pytorch#95003 Approved by: https://github.com/voznesenskym, https://github.com/ngimel

Extracted from pytorch#94523 Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: pytorch#95004 Approved by: https://github.com/voznesenskym, https://github.com/ngimel, https://github.com/Skylion007

Get boolean masking to work with unbacked SymInts

7ee1cc6

Signed-off-by: Edward Z. Yang <ezyang@meta.com> [ghstack-poisoned]

ezyang requested review from albanD and soulitzer as code owners February 9, 2023 17:21

ezyang mentioned this pull request Feb 9, 2023

sym_max/sym_min introduce guard if hinted #94400

Closed

pytorch-bot bot added the release notes: fx release notes category label Feb 9, 2023

github-actions bot requested review from Chillee, SherlockNoMad, antoniojkim, bdhirsh, jbschlosser, miladm, voznesenskym and wconstab February 9, 2023 17:21

github-actions bot added the ciflow/inductor label Feb 9, 2023

ezyang added a commit that referenced this pull request Feb 9, 2023

Get boolean masking to work with unbacked SymInts

ce8c0ff

Signed-off-by: Edward Z. Yang <ezyangmeta.com> ghstack-source-id: 894aec9 Pull Request resolved: #94523

ezyang added a commit that referenced this pull request Feb 10, 2023

Get boolean masking to work with unbacked SymInts

9ba1afe

Signed-off-by: Edward Z. Yang <ezyangmeta.com> ghstack-source-id: 4f2cf74 Pull Request resolved: #94523

albanD removed their request for review February 10, 2023 18:21

ezyang mentioned this pull request Feb 16, 2023

Remove unnecessary TensorMeta rewrap #95004

Closed

ezyang added a commit that referenced this pull request Feb 16, 2023

Remove unnecessary TensorMeta rewrap

480f66a

Extracted from #94523 Signed-off-by: Edward Z. Yang <ezyang@meta.com> [ghstack-poisoned]

ezyang added a commit that referenced this pull request Feb 16, 2023

Remove unnecessary TensorMeta rewrap

78de187

Extracted from #94523 Signed-off-by: Edward Z. Yang <ezyangmeta.com> ghstack-source-id: 45a7e62 Pull Request resolved: #95004

ezyang added a commit that referenced this pull request Feb 16, 2023

Update on "Hard code known true contiguity settings for unbacked SymI…

99e25b2

…nts" Extracted from #94523 which has E2E test Signed-off-by: Edward Z. Yang <ezyangmeta.com> [ghstack-poisoned]

ezyang added a commit that referenced this pull request Feb 16, 2023

Hard code known true contiguity settings for unbacked SymInts

7e5b8ee

Extracted from #94523 which has E2E test Signed-off-by: Edward Z. Yang <ezyangmeta.com> ghstack-source-id: ed0f8a0 Pull Request resolved: #95003

ezyang mentioned this pull request Feb 20, 2023

Don't truncate leading 1s if they are unbacked #95141

Closed

github-actions bot added the module: dynamo label Feb 20, 2023

ezyang added 2 commits February 20, 2023 13:32

ezyang mentioned this pull request Feb 20, 2023

Fix convit_base #95174

Closed

ezyang removed the keep-going Don't stop on first failure, keep running tests until the end label Feb 20, 2023

soulitzer removed their request for review February 20, 2023 23:58

ezyang added 2 commits February 20, 2023 16:30

This was referenced Feb 21, 2023

Reland "Add torch.empty_permuted (#95069)" #95208

Closed

Reland "Introduce constrain_range; remove old expr_subs (#95063)" #95209

Closed

ezyang added the ciflow/trunk Trigger trunk jobs on your pull request label Feb 21, 2023

ezyang added 2 commits February 21, 2023 06:45

ezyang mentioned this pull request Feb 21, 2023

Get matrix multiply with unbacked SymInt working #95218

Closed

ezyang requested a review from suo February 21, 2023 15:39

ezyang mentioned this pull request Feb 21, 2023

Make it possible to trace resnet with unbacked batch size #95222

Closed

ezyang closed this Feb 23, 2023

facebook-github-bot deleted the gh/ezyang/1806/head branch June 8, 2023 16:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Get boolean masking to work with unbacked SymInts#94523

Get boolean masking to work with unbacked SymInts#94523
ezyang wants to merge 16 commits intogh/ezyang/1806/basefrom
gh/ezyang/1806/head

ezyang commented Feb 9, 2023 •

edited

Loading

Uh oh!

pytorch-bot bot commented Feb 9, 2023 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

ezyang commented Feb 9, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Feb 9, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/94523

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

ezyang commented Feb 9, 2023 •

edited

Loading

pytorch-bot bot commented Feb 9, 2023 •

edited

Loading