Support >2G model export - alternative implementation | torchlib(feat) #1004
justinchuby wants to merge 15 commits into main from
Conversation
@BowenBao Speed seems reasonably good. There are errors:
Codecov Report — Attention: Patch coverage is

@@           Coverage Diff            @@
##             main    #1004    +/-  ##
==========================================
+ Coverage   78.64%   78.65%   +0.01%
==========================================
  Files         118      118
  Lines       15441    15435       -6
  Branches     2424     2422       -2
==========================================
- Hits        12144    12141       -3
+ Misses       2899     2896       -3
  Partials      398      398

☔ View full report in Codecov by Sentry.
Adding back infer_shapes and check_model made the tests pass locally.
Let me test again |
Nice, thanks! I updated this PR for review |
Needs validation with real models |
@BowenBao should we move this under a flag?
    )
    tensor_proto = onnx.helper.make_tensor(
        name=name,
        data_type=_type_utils.JitScalarType.from_dtype(tensor.dtype).onnx_type(),

Check failure — Code scanning / lintrunner: PYLINT/E0602, MYPY/name-defined, RUFF/F821 (all reported on the `data_type=...` line)
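The three checks above are the undefined-name lints from pylint, mypy, and ruff. For illustration only, here is a stdlib-only sketch of what a dtype-to-ONNX-type lookup like `JitScalarType.from_dtype(...).onnx_type()` resolves to; the enum values come from the ONNX `TensorProto.DataType` spec, and the helper name and string keys are hypothetical, not the PR's code.

```python
# Illustrative sketch (not the PR's code): a dtype-name -> ONNX data-type lookup.
# Enum values are from the ONNX TensorProto.DataType specification.
FLOAT, INT64, BOOL = 1, 7, 9  # onnx.TensorProto data-type enum values

_DTYPE_TO_ONNX = {
    "float32": FLOAT,
    "int64": INT64,
    "bool": BOOL,
}

def onnx_type_for(dtype_name: str) -> int:
    """Map a dtype name to its ONNX data-type enum value (hypothetical helper)."""
    return _DTYPE_TO_ONNX[dtype_name]

print(onnx_type_for("float32"))  # 1
print(onnx_type_for("int64"))    # 7
```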
    return self._graph.add_function_call(function, inputs, attributes)

def _add_initializers(
fyi @kunal-vaishnavi: creating an ONNX TensorProto from a torch tensor's data pointer
Support >2G model export, an alternative implementation where we build the initializers ourselves. This is a follow-up to #1003. We now build each TensorProto directly from the PyTorch tensors. This circumvents the 2 GB protobuf serialization limit in torchscript's _export_onnx and avoids an extra serialization pass, because everything is kept in memory.
A >2G model is no longer a special case, because initializers are added in a separate step.
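Since the description hinges on populating tensor data directly rather than round-tripping through torchscript serialization, here is a minimal stdlib-only sketch of the byte layout involved: a TensorProto's `raw_data` field carries the values as little-endian, fixed-width bytes. The `to_raw_data` helper is hypothetical and stands in for the PR's actual initializer-building code.

```python
import struct

def to_raw_data(values, fmt="f"):
    """Pack values as little-endian bytes ('f' = float32), the layout
    TensorProto.raw_data expects. Hypothetical helper, not the PR's code."""
    return struct.pack(f"<{len(values)}{fmt}", *values)

raw = to_raw_data([1.0, 2.0, 3.5])
print(len(raw))  # 12: three float32 values, 4 bytes each

# Round-trip to confirm the layout
print(struct.unpack("<3f", raw))  # (1.0, 2.0, 3.5)
```

Building the proto from these bytes in memory, instead of re-serializing the whole graph, is what removes the 2 GB special case.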
TODO