fix: Update lowering passes in aten tracer FX #1708

gs-olive wants to merge 1 commit into pytorch:main

Conversation
- Enable translation to `reshape` from `view`, which was causing failures when compiling BERT model due to memory layout of Tensors
- Default to `matmul` within `compose_bmm` lowering pass when the dimension of inputs exceeds 3
```python
for n in module.graph.nodes:
    if n.op == "call_function" and n.target in (
        torch.ops.aten._unsafe_view.default,
        torch.ops.aten.view.default,
```
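The excerpt above walks the traced graph looking for view-type call targets. As a rough illustration of this style of FX rewrite (a hypothetical helper that matches `call_method` nodes for simplicity, not the PR's actual `remove_ops` pass, which matches `torch.ops.aten` `call_function` targets):

```python
import torch
import torch.fx


class ViewModule(torch.nn.Module):
    def forward(self, x):
        return x.view(-1)


def replace_view_with_reshape(module: torch.fx.GraphModule) -> torch.fx.GraphModule:
    # Rewrite every Tensor.view call in the traced graph to Tensor.reshape,
    # which tolerates non-contiguous inputs by copying when necessary
    for n in module.graph.nodes:
        if n.op == "call_method" and n.target == "view":
            n.target = "reshape"
    module.recompile()
    return module


gm = replace_view_with_reshape(torch.fx.symbolic_trace(ViewModule()))
print(gm.code)
```

After the rewrite, the generated `forward` calls `reshape` instead of `view`, while producing the same values for contiguous inputs.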
It is not necessary to remove `aten.view`, since the `reshape` operation is decomposed into `aten.view` (which is safe) and we have a converter to support `aten.view`.
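The distinction this thread hinges on can be seen directly in eager PyTorch: `view` requires a stride-compatible memory layout, while `reshape` falls back to a copy when the layout is incompatible (a minimal sketch for context, not part of the PR):

```python
import torch

x = torch.randn(2, 4, 3, 5)
# permute reorders strides without moving data, leaving the tensor
# non-contiguous, so view() cannot reinterpret the memory in place
y = x.permute(0, 2, 1, 3)
new_shape = y.size()[:-2] + (-1,)  # (2, 3, -1)

try:
    y.view(new_shape)
except RuntimeError as e:
    print("view failed:", e)

# reshape() copies when the layout is incompatible, so it succeeds
z = y.reshape(new_shape)
print(z.shape)  # torch.Size([2, 3, 20])
```

Calling `.contiguous()` before `.view()`, as the GPT2 code below does, also avoids the error; the failure appears when the traced graph no longer preserves that contiguity.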
I see - thank you for the clarification on that. The reason I had removed the view operator was for cases like this:

```python
def forward(self, x):
    x = x.permute(0, 2, 1, 3).contiguous()
    new_shape = x.size()[:-2] + (-1,)
    return x.view(new_shape)
```

These show up in the GPT2 code, and when using the aten tracer, they result in the following error (though they run fine in Torch):
```
  File "~/TensorRT/py/torch_tensorrt/fx/tracer/dispatch_tracer/aten_tracer.py", line 161, in opt_trace
    fx_module(*args)
  File "/usr/local/lib/python3.8/dist-packages/torch/fx/graph_module.py", line 662, in call_wrapped
    return self._wrapped_call(self, *args, **kwargs)
  File "/usr/local/lib/python3.8/dist-packages/torch/fx/graph_module.py", line 281, in __call__
    raise e
  File "/usr/local/lib/python3.8/dist-packages/torch/fx/graph_module.py", line 271, in __call__
    return super(self.cls, obj).__call__(*args, **kwargs)  # type: ignore[misc]
  File "/usr/local/lib/python3.8/dist-packages/torch/nn/modules/module.py", line 1501, in _call_impl
    return forward_call(*args, **kwargs)
  File "<eval_with_key>.15", line 9, in forward
  File "/usr/local/lib/python3.8/dist-packages/torch/_ops.py", line 329, in __call__
    return self._op(*args, **kwargs or {})
RuntimeError: view size is not compatible with input tensor's size and stride (at least one dimension spans across two contiguous subspaces). Use .reshape(...) instead.
```

```diff
-    if len(real_other.meta["val"].size()) == 3:
+    elif len(real_other.meta["val"].size()) == 3:
         new_func = aten_compose_bmm_3d
     else:
```
Not clear why we need this `new_func = torch.ops.aten.matmul` - any example or unit test?
This addition is related to an issue in the `compose_bmm` lowering pass. I noticed that `input_n` can have a different shape than `real_input`, which causes the batch matrix multiply to have 4 dimensions instead of 3, reaching this `else` statement. I don't yet have a minimal reproducing example, as #1789 would likely need to be addressed first.
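The rationale for the fallback can be illustrated in eager PyTorch: `torch.bmm` strictly requires 3-D inputs, while `torch.matmul` broadcasts over any extra leading batch dimensions (a minimal sketch of the operator behavior, not a reproduction of the lowering-pass issue):

```python
import torch

a = torch.randn(2, 3, 4, 5)
b = torch.randn(2, 3, 5, 6)

# bmm rejects anything other than 3-D inputs
try:
    torch.bmm(a, b)
except RuntimeError as e:
    print("bmm failed:", e)

# matmul treats the leading dimensions as batch dimensions and broadcasts,
# so it handles the 4-D case that reaches the else branch above
out = torch.matmul(a, b)
print(out.shape)  # torch.Size([2, 3, 4, 6])
```

This is why defaulting to `aten.matmul` is a safe choice when the input rank exceeds 3 and the specialized `aten_compose_bmm_3d` path does not apply.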
Description

- Enable translation to `reshape` from `view`, which was causing failures when compiling BERT model due to memory layout of Tensors
- Default to `matmul` within `compose_bmm` lowering pass when the dimension of inputs exceeds 3

Error displayed prior to `remove_ops` view fix (BERT model from Issue #1673):

Error displayed prior to `compose_bmm` fix:

Note: `test_reshape_aten` is currently failing since the `aten.view.default` ops are being converted to `aten.reshape`

Fixes #1673
Type of change
Please delete options that are not relevant and/or add your own.
Checklist: