rationalize specialize_int_float by avikchaudhuri · Pull Request #95099 · pytorch/pytorch

avikchaudhuri · 2023-02-17T22:15:20Z

Stack from ghstack (oldest at bottom):

-> rationalize specialize_int_float #95099

cc @soumith @voznesenskym @yanboliang @penguinwu @anijain2305 @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @Xia-Weiwen @wenzhe-nrv @jiayisunx @desertfire

Differential Revision: [D43408128](https://our.internmc.facebook.com/intern/diff/D43408128/) [ghstack-poisoned]

pytorch-bot · 2023-02-17T22:15:22Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/95099

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

❌ 9 Failures

As of commit e8b01af:

NEW FAILURES - The following jobs have failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

torch/_dynamo/config.py

Differential Revision: [D43408128](https://our.internmc.facebook.com/intern/diff/D43408128/) cc soumith voznesenskym yanboliang penguinwu anijain2305 EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx desertfire [ghstack-poisoned]

Pull Request resolved: #95099 ghstack-source-id: 180611874 Differential Revision: [D43408128](https://our.internmc.facebook.com/intern/diff/D43408128/)

torch/_dynamo/eval_frame.py

voznesenskym · 2023-02-17T22:33:21Z

Hmmm, if I set the config to specialize_int_float one way, but then export, and it silently changes it, thats (a) really hard to patch (b) ignores user config.

Can we instead start by just:

Removing the implication that setting dynamic disables it?
Asserting a specific config state for this flag if we are in export?

voznesenskym

Back to you, good first pass :)

ezyang · 2023-02-22T23:48:39Z

related #94640

Differential Revision: [D43408128](https://our.internmc.facebook.com/intern/diff/D43408128/) cc soumith voznesenskym yanboliang penguinwu anijain2305 EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx desertfire [ghstack-poisoned]

Pull Request resolved: #95099 ghstack-source-id: 181045688 Differential Revision: [D43408128](https://our.internmc.facebook.com/intern/diff/D43408128/)

torch/_dynamo/eval_frame.py

ezyang

you need to update the xpass. There's also some unrelated refactoring (enable_dynamic to set_dynamic) which I am not exactly sure the motivation of. Also please use patch to update config for export.

ezyang · 2023-02-28T15:33:16Z

It seems like we should delete test_unspec.py now

OK, so this PR used to be about reducing the number of constants we specialize on, but it turns out that unspecialization was ~essentially never used (because we still constant specialized way too aggressively) and I ended up having to fix a bunch of issues to actually get tests to pass. So this PR is now "make int unspecialization actually work". The general strategy is that an unspecialized int is represented as a SymInt. Representing it as a 0d tensor (which is what the code used to do) is untenable: (1) we often need unspecialized ints to participate in size computations, but we have no way of propagating sympy expressions through tensor compute, and (2) a lot of APIs work when passed SymInt, but not when passed a Tensor. However, I continue to represent Numpy scalars as Tensors, as they are rarely used for size computation and they have an explicit dtype, so they are more accurately modeled as 0d tensors. * I folded in the changes from #95099 as I cannot represent unspecialized ints as SymInts without also turning on dynamic shapes. This also eliminates the necessity for test_unspec.py, as toggling specialization without dynamic shapes doesn't do anything. As dynamic shapes defaults to unspecializing, I just deleted this entirely; for the specialization case, I rely on regular static shape tests to catch it. (Hypothetically, we could also rerun all the tests with dynamic shapes, but WITH int/float specialization, but this seems... not that useful? I mean, I guess export wants it, but I'd kind of like our Source heuristic to improve enough that export doesn't have to toggle this either.) * Only 0/1 integers get specialized by default now * A hodgepodge of fixes. I'll comment on the PR about them. Fixes #95469 Signed-off-by: Edward Z. Yang <ezyangmeta.com> cc soumith voznesenskym yanboliang penguinwu anijain2305 EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx desertfire [ghstack-poisoned]

Differential Revision: [D43408128](https://our.internmc.facebook.com/intern/diff/D43408128/) cc soumith voznesenskym yanboliang penguinwu anijain2305 EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx desertfire [ghstack-poisoned]

ezyang · 2023-03-02T03:42:16Z

@pytorchbot merge -f "unrelated failure"

pytorchmergebot · 2023-03-02T03:44:03Z

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

pytorchmergebot · 2023-03-02T03:44:08Z

Merge failed

Reason: This PR has internal changes and must be landed via Phabricator

Details for Dev Infra team

Raised by workflow job

ezyang · 2023-03-02T14:26:10Z

@pytorchbot merge -f "unrelated failure"

pytorchmergebot · 2023-03-02T14:30:03Z

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

pytorchmergebot · 2023-03-02T14:30:09Z

Merge failed

Reason: Command git -C /home/runner/work/pytorch/pytorch cherry-pick -x 901942680986cde32de7fcd81bc9a08159c726ee returned non-zero exit code 1

Auto-merging torch/_dynamo/eval_frame.py
CONFLICT (content): Merge conflict in torch/_dynamo/eval_frame.py
Auto-merging torch/_dynamo/variables/builder.py
error: could not apply 90194268098... rationalize specialize_int_float
hint: After resolving the conflicts, mark them with
hint: "git add/rm <pathspec>", then run
hint: "git cherry-pick --continue".
hint: You can instead skip this commit with "git cherry-pick --skip".
hint: To abort and get back to the state before "git cherry-pick",
hint: run "git cherry-pick --abort".

Details for Dev Infra team

Raised by workflow job

cc soumith voznesenskym yanboliang penguinwu anijain2305 EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx desertfire [ghstack-poisoned]

Pull Request resolved: #95099 ghstack-source-id: f85d0d8 Differential Revision: [D43408128](https://our.internmc.facebook.com/intern/diff/D43408128/)

ezyang · 2023-03-02T16:46:23Z

since @tugsbayasgalan landed an unblock, I'm just folding this into #95621

OK, so this PR used to be about reducing the number of constants we specialize on, but it turns out that unspecialization was ~essentially never used (because we still constant specialized way too aggressively) and I ended up having to fix a bunch of issues to actually get tests to pass. So this PR is now "make int unspecialization actually work". The general strategy is that an unspecialized int is represented as a SymInt. Representing it as a 0d tensor (which is what the code used to do) is untenable: (1) we often need unspecialized ints to participate in size computations, but we have no way of propagating sympy expressions through tensor compute, and (2) a lot of APIs work when passed SymInt, but not when passed a Tensor. However, I continue to represent Numpy scalars as Tensors, as they are rarely used for size computation and they have an explicit dtype, so they are more accurately modeled as 0d tensors. * I folded in the changes from #95099 as I cannot represent unspecialized ints as SymInts without also turning on dynamic shapes. This also eliminates the necessity for test_unspec.py, as toggling specialization without dynamic shapes doesn't do anything. As dynamic shapes defaults to unspecializing, I just deleted this entirely; for the specialization case, I rely on regular static shape tests to catch it. (Hypothetically, we could also rerun all the tests with dynamic shapes, but WITH int/float specialization, but this seems... not that useful? I mean, I guess export wants it, but I'd kind of like our Source heuristic to improve enough that export doesn't have to toggle this either.) * Only 0/1 integers get specialized by default now * A hodgepodge of fixes. I'll comment on the PR about them. Fixes #95469 Signed-off-by: Edward Z. Yang <ezyangmeta.com> cc soumith voznesenskym yanboliang penguinwu anijain2305 EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 desertfire [ghstack-poisoned]

OK, so this PR used to be about reducing the number of constants we specialize on, but it turns out that unspecialization was ~essentially never used (because we still constant specialized way too aggressively) and I ended up having to fix a bunch of issues to actually get tests to pass. So this PR is now "make int unspecialization actually work". As part of this, I have to turn off unspecialization by default, as there are still latent bugs in inductor. The general strategy is that an unspecialized int is represented as a SymInt. Representing it as a 0d tensor (which is what the code used to do) is untenable: (1) we often need unspecialized ints to participate in size computations, but we have no way of propagating sympy expressions through tensor compute, and (2) a lot of APIs work when passed SymInt, but not when passed a Tensor. However, I continue to represent Numpy scalars as Tensors, as they are rarely used for size computation and they have an explicit dtype, so they are more accurately modeled as 0d tensors. * I folded in the changes from #95099 as I cannot represent unspecialized ints as SymInts without also turning on dynamic shapes. This also eliminates the necessity for test_unspec.py, as toggling specialization without dynamic shapes doesn't do anything. As dynamic shapes defaults to unspecializing, I just deleted this entirely; for the specialization case, I rely on regular static shape tests to catch it. (Hypothetically, we could also rerun all the tests with dynamic shapes, but WITH int/float specialization, but this seems... not that useful? I mean, I guess export wants it, but I'd kind of like our Source heuristic to improve enough that export doesn't have to toggle this either.) * Only 0/1 integers get specialized by default now * A hodgepodge of fixes. I'll comment on the PR about them. Fixes #95469 Signed-off-by: Edward Z. Yang <ezyangmeta.com> cc soumith voznesenskym yanboliang penguinwu anijain2305 EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx peterbell10 desertfire [ghstack-poisoned]

OK, so this PR used to be about reducing the number of constants we specialize on, but it turns out that unspecialization was ~essentially never used (because we still constant specialized way too aggressively) and I ended up having to fix a bunch of issues to actually get tests to pass. So this PR is now "make int unspecialization actually work". As part of this, I have to turn off unspecialization by default, as there are still latent bugs in inductor. The general strategy is that an unspecialized int is represented as a SymInt. Representing it as a 0d tensor (which is what the code used to do) is untenable: (1) we often need unspecialized ints to participate in size computations, but we have no way of propagating sympy expressions through tensor compute, and (2) a lot of APIs work when passed SymInt, but not when passed a Tensor. However, I continue to represent Numpy scalars as Tensors, as they are rarely used for size computation and they have an explicit dtype, so they are more accurately modeled as 0d tensors. * I folded in the changes from #95099 as I cannot represent unspecialized ints as SymInts without also turning on dynamic shapes. This also eliminates the necessity for test_unspec.py, as toggling specialization without dynamic shapes doesn't do anything. As dynamic shapes defaults to unspecializing, I just deleted this entirely; for the specialization case, I rely on regular static shape tests to catch it. (Hypothetically, we could also rerun all the tests with dynamic shapes, but WITH int/float specialization, but this seems... not that useful? I mean, I guess export wants it, but I'd kind of like our Source heuristic to improve enough that export doesn't have to toggle this either.) * Only 0/1 integers get specialized by default now * A hodgepodge of fixes. I'll comment on the PR about them. Fixes #95469 Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: #95621 Approved by: https://github.com/jansel, https://github.com/Chillee

OK, so this PR used to be about reducing the number of constants we specialize on, but it turns out that unspecialization was ~essentially never used (because we still constant specialized way too aggressively) and I ended up having to fix a bunch of issues to actually get tests to pass. So this PR is now "make int unspecialization actually work". As part of this, I have to turn off unspecialization by default, as there are still latent bugs in inductor. The general strategy is that an unspecialized int is represented as a SymInt. Representing it as a 0d tensor (which is what the code used to do) is untenable: (1) we often need unspecialized ints to participate in size computations, but we have no way of propagating sympy expressions through tensor compute, and (2) a lot of APIs work when passed SymInt, but not when passed a Tensor. However, I continue to represent Numpy scalars as Tensors, as they are rarely used for size computation and they have an explicit dtype, so they are more accurately modeled as 0d tensors. * I folded in the changes from pytorch/pytorch#95099 as I cannot represent unspecialized ints as SymInts without also turning on dynamic shapes. This also eliminates the necessity for test_unspec.py, as toggling specialization without dynamic shapes doesn't do anything. As dynamic shapes defaults to unspecializing, I just deleted this entirely; for the specialization case, I rely on regular static shape tests to catch it. (Hypothetically, we could also rerun all the tests with dynamic shapes, but WITH int/float specialization, but this seems... not that useful? I mean, I guess export wants it, but I'd kind of like our Source heuristic to improve enough that export doesn't have to toggle this either.) * Only 0/1 integers get specialized by default now * A hodgepodge of fixes. I'll comment on the PR about them. Fixes pytorch/pytorch#95469 Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: pytorch/pytorch#95621 Approved by: https://github.com/jansel, https://github.com/Chillee

OK, so this PR used to be about reducing the number of constants we specialize on, but it turns out that unspecialization was ~essentially never used (because we still constant specialized way too aggressively) and I ended up having to fix a bunch of issues to actually get tests to pass. So this PR is now "make int unspecialization actually work". As part of this, I have to turn off unspecialization by default, as there are still latent bugs in inductor. The general strategy is that an unspecialized int is represented as a SymInt. Representing it as a 0d tensor (which is what the code used to do) is untenable: (1) we often need unspecialized ints to participate in size computations, but we have no way of propagating sympy expressions through tensor compute, and (2) a lot of APIs work when passed SymInt, but not when passed a Tensor. However, I continue to represent Numpy scalars as Tensors, as they are rarely used for size computation and they have an explicit dtype, so they are more accurately modeled as 0d tensors. * I folded in the changes from pytorch#95099 as I cannot represent unspecialized ints as SymInts without also turning on dynamic shapes. This also eliminates the necessity for test_unspec.py, as toggling specialization without dynamic shapes doesn't do anything. As dynamic shapes defaults to unspecializing, I just deleted this entirely; for the specialization case, I rely on regular static shape tests to catch it. (Hypothetically, we could also rerun all the tests with dynamic shapes, but WITH int/float specialization, but this seems... not that useful? I mean, I guess export wants it, but I'd kind of like our Source heuristic to improve enough that export doesn't have to toggle this either.) * Only 0/1 integers get specialized by default now * A hodgepodge of fixes. I'll comment on the PR about them. Fixes pytorch#95469 Signed-off-by: Edward Z. Yang <ezyang@meta.com> Pull Request resolved: pytorch#95621 Approved by: https://github.com/jansel, https://github.com/Chillee

rationalize specialize_int_float

bae8340

Differential Revision: [D43408128](https://our.internmc.facebook.com/intern/diff/D43408128/) [ghstack-poisoned]

github-actions bot added ciflow/inductor module: dynamo labels Feb 17, 2023

ezyang reviewed Feb 17, 2023

View reviewed changes

torch/_dynamo/config.py Outdated Show resolved Hide resolved

avikchaudhuri added a commit that referenced this pull request Feb 17, 2023

rationalize specialize_int_float

0e470ed

Pull Request resolved: #95099 ghstack-source-id: 180611874 Differential Revision: [D43408128](https://our.internmc.facebook.com/intern/diff/D43408128/)

voznesenskym reviewed Feb 17, 2023

View reviewed changes

torch/_dynamo/eval_frame.py Outdated Show resolved Hide resolved

voznesenskym reviewed Feb 17, 2023

View reviewed changes

torch/_dynamo/eval_frame.py Outdated Show resolved Hide resolved

voznesenskym requested changes Feb 17, 2023

View reviewed changes

avikchaudhuri added a commit that referenced this pull request Feb 23, 2023

rationalize specialize_int_float

0546391

Pull Request resolved: #95099 ghstack-source-id: 181045688 Differential Revision: [D43408128](https://our.internmc.facebook.com/intern/diff/D43408128/)

avikchaudhuri requested a review from voznesenskym February 23, 2023 17:12

ezyang reviewed Feb 28, 2023

View reviewed changes

torch/_dynamo/eval_frame.py Outdated Show resolved Hide resolved

ezyang approved these changes Feb 28, 2023

View reviewed changes

ezyang mentioned this pull request Feb 28, 2023

Make int unspecialization actually work #95621

Closed

avikchaudhuri added the topic: not user facing topic category label Feb 28, 2023

Update on "rationalize specialize_int_float"

e8b01af

cc soumith voznesenskym yanboliang penguinwu anijain2305 EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng Xia-Weiwen wenzhe-nrv jiayisunx desertfire [ghstack-poisoned]

ezyang added a commit that referenced this pull request Mar 2, 2023

rationalize specialize_int_float

7643a57

Pull Request resolved: #95099 ghstack-source-id: f85d0d8 Differential Revision: [D43408128](https://our.internmc.facebook.com/intern/diff/D43408128/)

ezyang closed this Mar 2, 2023

facebook-github-bot deleted the gh/avikchaudhuri/8/head branch June 8, 2023 15:21

Conversation

avikchaudhuri commented Feb 17, 2023 • edited by ezyang Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Feb 17, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/95099

❌ 9 Failures

Uh oh!

Uh oh!

Uh oh!

Uh oh!

voznesenskym commented Feb 17, 2023

Uh oh!

voznesenskym left a comment

Choose a reason for hiding this comment

Uh oh!

ezyang commented Feb 22, 2023

Uh oh!

Uh oh!

ezyang left a comment

Choose a reason for hiding this comment

Uh oh!

ezyang commented Feb 28, 2023

Uh oh!

ezyang commented Mar 2, 2023

Uh oh!

pytorchmergebot commented Mar 2, 2023

Merge started

Uh oh!

pytorchmergebot commented Mar 2, 2023

Merge failed

Uh oh!

ezyang commented Mar 2, 2023

Uh oh!

pytorchmergebot commented Mar 2, 2023

Merge started

Uh oh!

pytorchmergebot commented Mar 2, 2023

Merge failed

Uh oh!

ezyang commented Mar 2, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

avikchaudhuri commented Feb 17, 2023 •

edited by ezyang

Loading

pytorch-bot bot commented Feb 17, 2023 •

edited

Loading