Refine constant folding size limit heuristic #2025

gramalingam · 2025-01-20T21:48:20Z

Refine the size-limit heuristics used to control constant-folding. This refinement allows some common cases to be handled automatically, such as Transpose(weight) which is typically generated by the exporter. The refinement looks at the increase in model-size that would be caused replacing a node by a constant, by accounting for inputs of the node that would be eliminated as a result of the replacement.

codecov · 2025-01-20T21:52:51Z

❌ 2 Tests Failed:

Tests completed	Failed	Passed	Skipped
13022	2	13020	2454

View the top 1 failed tests by shortest run time

onnxscript.backend.onnx_export_test.TestOnnxBackEnd::test_export2python_produces_correct_onnx_script_model_0827_test_reduce_l2_default_axes_keepdims_example

Stack Traces | 0.005s run time

onnxscript\backend\onnx_export_test.py:137: in extract_functions
    mod = importlib.import_module(import_name)
C:\hostedtoolcache\windows\Python\3.11.9\x64\Lib\importlib\__init__.py:126: in import_module
    return _bootstrap._gcd_import(name[level:], package, level)
E   ModuleNotFoundError: No module named 'tests.onnx_backend_test_code.test_reduce_l2_default_axes_keepdims_example'

The above exception was the direct cause of the following exception:
.nox\test_torch_nightly\Lib\site-packages\parameterized\parameterized.py:620: in standalone_func
    return func(*(a + p.args), **p.kwargs, **kw)
onnxscript\backend\onnx_export_test.py:271: in test_export2python_produces_correct_onnx_script_model
    functions = extract_functions(backend_test.name, code, self.test_folder)
onnxscript\backend\onnx_export_test.py:139: in extract_functions
    raise AssertionError(
E   AssertionError: Unable to import 'tests.onnx_backend_test_code.test_reduce_l2_default_axes_keepdims_example' (e=No module named 'tests.onnx_backend_test_code.test_reduce_l2_default_axes_keepdims_example') (file: 'D:\\a\\onnxscript\\onnxscript\\tests\\onnx_backend_test_code\\test_reduce_l2_default_axes_keepdims_example.py', absolute path: 'D:\\a\\onnxscript\\onnxscript\\tests\\onnx_backend_test_code\\test_reduce_l2_default_axes_keepdims_example.py', current folder: D:\a\onnxscript\onnxscript
E   ---- CONTENT --
E   import numpy
E   from onnx import TensorProto
E   from onnx.helper import make_tensor
E   from onnxscript import script, external_tensor
E   from onnxscript.values import Opset
E   from onnxscript.onnx_types import FLOAT, INT64
E   from onnxscript.onnx_opset import opset18
E   
E   @script()
E   def bck_test_reduce_l2_default_axes_keepdims_example(data: FLOAT[3,2,2], axes: INT64[0]) -> (FLOAT[1,1,1]):
E       reduced = opset18.ReduceL2(data, axes, keepdims=1)
E       return reduced

View the full list of 1 ❄️ flaky tests

onnxscript.backend.onnx_export_test.TestOnnxBackEnd::test_export2python_produces_correct_onnx_script_model_0385_test_gathernd_example_int32

Flake rate in main: 9.09% (Passed 20 times, Failed 2 times)

Stack Traces | 0.01s run time

onnxscript\backend\onnx_export_test.py:137: in extract_functions
    mod = importlib.import_module(import_name)
C:\hostedtoolcache\windows\Python\3.11.9\x64\Lib\importlib\__init__.py:126: in import_module
    return _bootstrap._gcd_import(name[level:], package, level)
E   ModuleNotFoundError: No module named 'tests.onnx_backend_test_code.test_gathernd_example_int32'

The above exception was the direct cause of the following exception:
.nox\test_ort_nightly\Lib\site-packages\parameterized\parameterized.py:620: in standalone_func
    return func(*(a + p.args), **p.kwargs, **kw)
onnxscript\backend\onnx_export_test.py:271: in test_export2python_produces_correct_onnx_script_model
    functions = extract_functions(backend_test.name, code, self.test_folder)
onnxscript\backend\onnx_export_test.py:139: in extract_functions
    raise AssertionError(
E   AssertionError: Unable to import 'tests.onnx_backend_test_code.test_gathernd_example_int32' (e=No module named 'tests.onnx_backend_test_code.test_gathernd_example_int32') (file: 'D:\\a\\onnxscript\\onnxscript\\tests\\onnx_backend_test_code\\test_gathernd_example_int32.py', absolute path: 'D:\\a\\onnxscript\\onnxscript\\tests\\onnx_backend_test_code\\test_gathernd_example_int32.py', current folder: D:\a\onnxscript\onnxscript
E   ---- CONTENT --
E   import numpy
E   from onnx import TensorProto
E   from onnx.helper import make_tensor
E   from onnxscript import script, external_tensor
E   from onnxscript.values import Opset
E   from onnxscript.onnx_types import INT32, INT64
E   from onnxscript.onnx_opset import opset13
E   
E   @script()
E   def bck_test_gathernd_example_int32(data: INT32[2,2], indices: INT64[2,2]) -> (INT32[2]):
E       output = opset13.GatherND(data, indices)
E       return output

To view more test analytics, go to the Test Analytics Dashboard
📢 Thoughts on this report? Let us know!

onnxscript/optimizer/_constant_folding.py

shubhambhokare1

LGTM, needs rebase

### Description This PR adds fusions for [Google's SigLIP model](https://huggingface.co/google/siglip-base-patch16-224/) and Microsoft's internal conformer-encoder model. Here is an example of how to run the ORT transformer optimizer for the SigLIP model. ``` $ git clone https://github.com/microsoft/onnxruntime $ cd onnxruntime/onnxruntime/python/tools/transformers $ python3 optimizer.py --input /path/to/model.onnx --output /path/to/model_opt.onnx --model_type clip --num_heads 16 --hidden_size 1152 --use_external_data_format --opt_level 0 --disable_shape_inference ``` Here is an example of how to run the ORT transformer optimizer for the conformer-encoder model. ``` $ git clone https://github.com/microsoft/onnxruntime $ cd onnxruntime/onnxruntime/python/tools/transformers $ python3 optimizer.py --input /path/to/model.onnx --output /path/to/model_opt.onnx --model_type conformer --num_heads 16 --hidden_size 1024 --use_external_data_format --opt_level 0 --disable_shape_inference --convert_attribute ``` ### Motivation and Context This PR helps optimize multi-modal models that use SigLIP for the vision encoder and conformer-encoder for the speech encoder. This PR uses changes from the following PRs: - pytorch/pytorch#144801 - microsoft/onnxscript#2018 - microsoft/onnxscript#2019 - microsoft/onnxscript#2020 - microsoft/onnxscript#2021 - microsoft/onnxscript#2022 - microsoft/onnxscript#2024 - microsoft/onnxscript#2025 - microsoft/onnxscript#2029 - microsoft/onnxscript#2033 ### Introduction of ONNX Script This PR introduces [ONNX Script](https://github.com/microsoft/onnxscript) into the ORT transformer optimizer as an optional step via the `fold_transpose_initializers()` method of the `DynamoOnnxHelper` class.

Refine constant folding size limit heuristic

3595815

gramalingam enabled auto-merge (squash) January 21, 2025 00:30

justinchuby reviewed Jan 21, 2025

View reviewed changes

onnxscript/optimizer/_constant_folding.py Show resolved Hide resolved

shubhambhokare1 approved these changes Jan 21, 2025

View reviewed changes

gramalingam merged commit 0447822 into main Jan 21, 2025
26 of 29 checks passed

gramalingam deleted the rama/cp-init-large branch January 21, 2025 19:05

justinchuby added the module: optimizer label Jan 21, 2025

kunal-vaishnavi mentioned this pull request Jan 29, 2025

Add fusions for SigLIP and Conformer-Encoder microsoft/onnxruntime#23528

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Refine constant folding size limit heuristic #2025

Refine constant folding size limit heuristic #2025

Uh oh!

gramalingam commented Jan 20, 2025

Uh oh!

codecov bot commented Jan 20, 2025 •

edited

Loading

Uh oh!

Uh oh!

shubhambhokare1 left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Refine constant folding size limit heuristic #2025

Refine constant folding size limit heuristic #2025

Uh oh!

Conversation

gramalingam commented Jan 20, 2025

Uh oh!

codecov bot commented Jan 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

❌ 2 Tests Failed:

Uh oh!

Uh oh!

shubhambhokare1 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

codecov bot commented Jan 20, 2025 •

edited

Loading