Conversation

@skottmckay
Contributor

Description

Update the usability checker and related infrastructure to support checking models > 2GB.

  • Add the ability to set a flag that keeps initializers as external data
    • we optimize the model as part of the checking, so we need to write out a new copy.
  • Handle an issue with ONNX shape inferencing silently failing on large models
    • use the API that supports large models but requires writing the model to a new file
    • automate cleanup of that copy of the model

Motivation and Context

Allow analysis of LLMs to determine gaps for mobile usage.

- Add ability to set flag to keep initializers as external data
- Handle issue with ONNX shape inferencing silently failing
@skottmckay skottmckay requested a review from edgchen1 November 9, 2023 01:52
@thiagocrepaldi
Copy link
Contributor

@skottmckay do you think this PR would also fix #14697?

@skottmckay
Copy link
Contributor Author

> @skottmckay do you think this PR would also fix #14697?

It will get closer, in that it would reach the point of attempting to create the flatbuffer for the ORT format model. However, the flatbuffer offsets are unsigned 32-bit ints, so at most 4GB of data can be written out. That's still better than the 2GB protobuf limit, though.

It's not clear what the scenario is where you'd need to use an ORT format model here: that implies using a minimal build to save a few MB while loading a model that is multiple GB. Because of that, we haven't prioritized supporting these models in ORT format.

skottmckay and others added 3 commits November 15, 2023 16:35
Co-authored-by: Edward Chen <18449977+edgchen1@users.noreply.github.com>
@skottmckay skottmckay merged commit e7a524f into main Nov 16, 2023
@skottmckay skottmckay deleted the skottmckay/SupportLargeModelsInUsabilityChecker branch November 16, 2023 21:20
kleiti pushed a commit to kleiti/onnxruntime that referenced this pull request Mar 22, 2024