
[pytorch][mobile] make sure mobile build works with dynamic dispatch #34038

Closed
ljk53 wants to merge 4 commits into gh/ljk53/109/base from gh/ljk53/109/head

Conversation

@ljk53
Contributor

@ljk53 ljk53 commented Mar 1, 2020

Stack from ghstack:

Summary:
The mobile build doesn't include the autograd/VariableType dispatch. As a
result, AutoNonVariableTypeMode needs to be set in the mobile runtime.

With static dispatch this work is done inside the generated jit-dispatch
code - AutoNonVariableTypeMode needs to be set on a per-op basis. Setting
it globally, or setting it for the wrong ops, might break some `is_variable()`
checks in the codebase.

Thanks to the unification of the Variable and Tensor classes, all
`is_variable()` checks have been removed, so AutoNonVariableTypeMode can
now be set globally.

We never tested the inference-only mobile build with dynamic dispatch. It
seems that dynamic dispatch also requires setting AutoNonVariableTypeMode
for our mobile build (where VariableType functions are not registered).

Verified that the end-to-end test works with this change:
```
TEST_CUSTOM_BUILD_DYNAMIC=1 test/mobile/custom_build/build.sh
```
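
For context, a minimal sketch (not part of this change) of what a mobile inference callsite can look like once the guard is safe to set globally - the model path and input shape are placeholders, and the exact headers/namespaces are assumptions:

```
#include <torch/script.h>

int main() {
  // Inference-only mobile build: no VariableType kernels are registered,
  // so the non-variable-type guard is set once, globally, before any op runs.
  at::AutoNonVariableTypeMode non_var_type_mode(true);

  // Placeholder model path and input shape - for illustration only.
  torch::jit::script::Module module = torch::jit::load("model.pt");
  std::vector<torch::jit::IValue> inputs{torch::ones({1, 3, 224, 224})};
  at::Tensor output = module.forward(inputs).toTensor();
  return output.defined() ? 0 : 1;
}
```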

Differential Revision: [D20193329](https://our.internmc.facebook.com/intern/diff/D20193329)

ljk53 added a commit that referenced this pull request Mar 1, 2020
ghstack-source-id: bf23a44
Pull Request resolved: #34038
@ljk53 ljk53 requested a review from ezyang March 1, 2020 07:32
@dr-ci

dr-ci Bot commented Mar 1, 2020

💊 CircleCI build failures summary and remediations

As of commit d67b04d (more details on the Dr. CI page):


Commit d67b04d was recently pushed. Waiting for builds...



ljk53 added 2 commits March 2, 2020 18:21
@facebook-github-bot
Contributor

@ljk53 merged this pull request in 0cf34cf.

ttumiel pushed a commit to ttumiel/pytorch that referenced this pull request Mar 4, 2020
…ytorch#34038)

Test Plan: Imported from OSS

Differential Revision: D20193329

Pulled By: ljk53

fbshipit-source-id: cc98414d89d12463dc82b0cdde0b6160dafc0349
@facebook-github-bot facebook-github-bot deleted the gh/ljk53/109/head branch March 7, 2020 15:18
ljk53 added a commit to ljk53/pytorch that referenced this pull request Mar 9, 2020
…ytorch#34038)

Test Plan: Imported from OSS

Differential Revision: D20193329

Pulled By: ljk53

fbshipit-source-id: cc98414d89d12463dc82b0cdde0b6160dafc0349
ljk53 added a commit that referenced this pull request Mar 18, 2020
… mobile callsites

There are three guards related to the mobile build:
* AutoGradMode
* AutoNonVariableTypeMode
* GraphOptimizerEnabledGuard

Today we need to set some of these guards before calling libtorch APIs because we customized the mobile build to only support inference (for both OSS and most FB use cases) to optimize binary size.

Several changes have been made since the 1.3 release, so there are already inconsistent uses of these guards in the codebase. I did a sweep of all mobile-related model loading & forward() call sites, trying to unify the use of these guards:

Full JIT: still set all three guards (see the sketch after this list). More specifically:
* OSS: Fixed a bug where the guard was not set correctly at model load time in Android JNI.
* FB: Not covered by this diff (as we are using the mobile interpreter for most internal builds).
Lite JIT (mobile interpreter): only needs the AutoNonVariableTypeMode guard (see the sketch after this list). AutoGradMode doesn't seem to be relevant (so it was removed from a few places) and GraphOptimizerEnabledGuard is definitely not relevant (only the full JIT has a graph optimizer). More specifically:
* OSS: At this point we are not committed to supporting the Lite JIT. For Android it shares the same code with FB JNI callsites.
* FB:
  * JNI callsites: Use the unified LiteJITCallGuard.
  * For iOS/C++: manually set AutoNonVariableTypeMode for _load_for_mobile() & forward() callsites.
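
And a corresponding sketch of the lite-interpreter convention, where only the non-variable-type guard is set around _load_for_mobile()/forward(); the helper name run_lite_jit and the header paths are assumptions:

```
#include <ATen/core/LegacyTypeDispatch.h>   // at::AutoNonVariableTypeMode (assumed location)
#include <torch/csrc/jit/mobile/import.h>   // torch::jit::_load_for_mobile
#include <torch/csrc/jit/mobile/module.h>

// Hypothetical helper mirroring the lite-JIT callsite convention described above.
at::Tensor run_lite_jit(const std::string& model_path,
                        std::vector<c10::IValue> inputs) {
  // Only this guard: AutoGradMode is not relevant here and the mobile
  // interpreter has no graph optimizer.
  at::AutoNonVariableTypeMode non_var_type_mode(true);
  torch::jit::mobile::Module module = torch::jit::_load_for_mobile(model_path);
  return module.forward(std::move(inputs)).toTensor();
}
```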

Ideally we should avoid having to set AutoNonVariableTypeMode for the mobile interpreter. It's currently needed for the dynamic dispatch + inference-only mobile build (where variable kernels are not registered) - without the guard it will try to run `variable_fallback_kernel` and crash (PR #34038). The proper fix will take some time, so we're using this workaround to unblock the selective BUCK build, which depends on dynamic dispatch.

PS. The current status (of having to set AutoNonVariableTypeMode) should not block running an FL model + mobile interpreter - if all necessary variable kernels are registered, then it can call _load_for_mobile()/forward() against the FL model without setting the AutoNonVariableTypeMode guard. It's still inconvenient for Java callsites as it's set unconditionally inside JNI methods.

Differential Revision: [D20498017](https://our.internmc.facebook.com/intern/diff/D20498017/)

**NOTE FOR REVIEWERS**: This PR has internal Facebook specific changes or comments, please review them on [Phabricator](https://our.internmc.facebook.com/intern/diff/D20498017/)!

[ghstack-poisoned]
ljk53 added a commit that referenced this pull request Mar 18, 2020
… mobile callsites

ghstack-source-id: 100385440
Pull Request resolved: #34958
facebook-github-bot pushed a commit that referenced this pull request Mar 19, 2020
… mobile callsites

Test Plan: - CI

Reviewed By: xta0

Differential Revision: D20498017

fbshipit-source-id: ba6740f66839a61790873df46e8e66e4e141c728
laurentdupin pushed a commit to laurentdupin/pytorch that referenced this pull request Apr 24, 2026
…ytorch#34038)

Test Plan: Imported from OSS

Differential Revision: D20193329

Pulled By: ljk53

fbshipit-source-id: cc98414d89d12463dc82b0cdde0b6160dafc0349
laurentdupin pushed a commit to laurentdupin/pytorch that referenced this pull request Apr 24, 2026
… mobile callsites

Test Plan: - CI

Reviewed By: xta0

Differential Revision: D20498017

fbshipit-source-id: ba6740f66839a61790873df46e8e66e4e141c728