
Avoid fallback for avg_pool #6409

Merged
qihqi merged 2 commits into master from qihqi/core_aten_ops
Jan 31, 2024

Conversation

@qihqi
Collaborator

@qihqi qihqi commented Jan 29, 2024

By supporting divisor overrides and the ceil_mode + count_include_pad combination properly.

When count_include_pad is True, the padded elements are also counted in the denominator. But if ceil_mode is also true, then rounding the output size up can introduce extra padding, and **that extra padding is NOT counted in the denominator**. Therefore, when ceil_mode is true, we need to pad manually so we can distinguish padding that should count toward the denominator from padding that should not (i.e. padding introduced by ceil_mode).
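
A minimal PyTorch sketch of that behavior (the shapes here are arbitrary, chosen so that ceil_mode actually adds an extra window):

```python
import torch
import torch.nn.functional as F

x = torch.ones(1, 1, 6, 6)

# count_include_pad=True: the explicit zero padding is counted in the
# denominator, so a full 3x3 window always divides by 9.
floor_out = F.avg_pool2d(x, kernel_size=3, stride=2, padding=1,
                         ceil_mode=False, count_include_pad=True)

# ceil_mode=True rounds the output size up, adding one more window per
# spatial dim. Positions that exist only because of that rounding are
# NOT counted in the denominator, unlike the explicit padding above.
ceil_out = F.avg_pool2d(x, kernel_size=3, stride=2, padding=1,
                        ceil_mode=True, count_include_pad=True)

print(floor_out.shape)  # torch.Size([1, 1, 3, 3])
print(ceil_out.shape)   # torch.Size([1, 1, 4, 4])
```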

@qihqi qihqi requested a review from wonjoo-wj January 29, 2024 22:08
@qihqi qihqi force-pushed the qihqi/core_aten_ops branch 3 times, most recently from 49fb09a to 791fa27 Compare January 30, 2024 00:12
By supporting divisor overrides properly.
@qihqi qihqi force-pushed the qihqi/core_aten_ops branch from 791fa27 to 1edf387 Compare January 30, 2024 00:14
@wonjoo-wj
Collaborator

Changes LGTM, but seems like CI failed:

======================================================================
FAIL: test_aten_avg_pool2d_3 (__main__.AtenOpTest) [torch_xla_diff:0.001]
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/tmp/pytorch/xla/test/test_core_aten_ops.py", line 72, in run_export_and_compare
    diff_output(
  File "/tmp/pytorch/xla/test/test_core_aten_ops.py", line 33, in diff_output
    testcase.assertTrue(
AssertionError: False is not true

Maybe just need to adjust rtol/atol?
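
A hypothetical sketch of what that could look like (the args/kwargs below are illustrative, not the actual test inputs, and this assumes `run_export_and_compare` accepts `atol`/`rtol` keyword arguments):

```python
# Hypothetical: loosen the comparison tolerance for this test only.
def test_aten_avg_pool2d_3(self):
    args = (torch.randn(1, 3, 6, 6), [2, 2])  # illustrative inputs
    kwargs = dict(ceil_mode=True, count_include_pad=True)
    run_export_and_compare(self, torch.ops.aten.avg_pool2d, args, kwargs,
                           atol=0.01, rtol=0.01)
```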

@qihqi qihqi force-pushed the qihqi/core_aten_ops branch 3 times, most recently from 39e4282 to 3a15b4a Compare January 31, 2024 05:10
@qihqi qihqi force-pushed the qihqi/core_aten_ops branch from 3a15b4a to 2e1d1b4 Compare January 31, 2024 17:13
@qihqi qihqi merged commit a0bae82 into master Jan 31, 2024
@qihqi qihqi deleted the qihqi/core_aten_ops branch January 31, 2024 22:23
@wonjoo-wj
Collaborator

wonjoo-wj commented Jan 31, 2024

@qihqi, thanks! Could you link the github issues that this PR fixes, if there are any?

yeounoh added commits that referenced this pull request Feb 1, 2024
@qihqi
Collaborator Author

qihqi commented Feb 2, 2024

It looks like the performance of some vision models improved from Jan 31 to Feb 1; this PR might have helped (as it avoids the CPU fallback):

[image: vision model performance dashboard, Jan 31 vs Feb 1]

amithrm pushed a commit to amithrm/xla that referenced this pull request Mar 1, 2024
bhavya01 pushed a commit that referenced this pull request Apr 22, 2024
LPanosTT added a commit to tenstorrent/tt-mlir that referenced this pull request Jun 18, 2025
…tir.full` (#3826)

**The changes/additions in this PR pertain to the effort to use xla for compiling PyTorch models in tt-torch**

### Ticket
No tt-mlir issue. There is a metal issue I've filed, but this is not solely a workaround for an op which will not run in tt-metal:
- Metal issue: tenstorrent/tt-metal#23617
- Another metal issue: tenstorrent/tt-metal#23581

### Problem description
torch-xla decomposes avg_pool2d into a sum-pool on the input tensor divided by the size of the window. **However**, the denominator is not a constant. Instead, it is computed as the result of another sum-pool, applied to a constant tensor containing only `1.0`. That sum-pool produces a tensor with the same spatial dimensions as the pooled activation, and an element-wise division is performed to get the same end result:

```
%60 = "ttir.full" (full tensor of 1.0 with shape 56x56)
...
%704 = "ttir.pooling"(%702, %703) <{base_dilations = array<i64: 1, 1, 1, 1>, operandSegmentSizes = array<i32: 1, 1>, padding = array<i64: 0, 0, 0, 0, 0, 0, 0, 0>, pooling_method = #ttir<pooling_method Sum>, window_dilations = array<i64: 1, 1, 1, 1>, window_dimensions = array<i64: 1, 1, 2, 2>, window_strides = array<i64: 1, 1, 2, 2>}> : (tensor<1x128x56x56xbf16>, tensor<1x128x28x28xbf16>) -> tensor<1x128x28x28xbf16>
%705 = ttir.empty() : tensor<28x28xbf16>
%706 = "ttir.pooling"(%60, %705) <{base_dilations = array<i64: 1, 1>, operandSegmentSizes = array<i32: 1, 1>, padding = array<i64: 0, 0, 0, 0>, pooling_method = #ttir<pooling_method Sum>, window_dilations = array<i64: 1, 1>, window_dimensions = array<i64: 2, 2>, window_strides = array<i64: 2, 2>}> : (tensor<56x56xbf16>, tensor<28x28xbf16>) -> tensor<28x28xbf16>
... reshape the denominator (to unsqueeze)
... broadcast the denominator (so the channel dim is identical)
%712 = "ttir.div" (%704, %706) -> ...
```

The reason torch-xla does this is seemingly to handle an edge case where the kwargs `count_include_pad = True` and `ceil_mode = True` are both set. However, it is actually applied across the board.

- torch-xla PR where this was made: pytorch/xla#6409
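
For illustration, the same decomposition can be reproduced in plain PyTorch (a sketch using `divisor_override=1` to emulate a sum-pool; shapes follow the IR above):

```python
import torch
import torch.nn.functional as F

x = torch.randn(1, 128, 56, 56)

# Numerator: sum-pool over the activation (divisor_override=1 turns the
# average into a plain window sum).
num = F.avg_pool2d(x, kernel_size=2, stride=2, divisor_override=1)

# Denominator: sum-pool over a tensor of ones with the input's spatial
# shape. Each output element is the number of input positions in that
# window; with zero padding that is simply the constant 2 * 2 = 4.
ones = torch.ones(1, 1, 56, 56)
den = F.avg_pool2d(ones, kernel_size=2, stride=2, divisor_override=1)

# Element-wise division (broadcast over batch/channel) recovers avg_pool2d.
out = num / den
expected = F.avg_pool2d(x, kernel_size=2, stride=2)
assert torch.allclose(out, expected)
```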

**Furthermore**, this sum-pool itself is not even a valid pooling operation in PyTorch or ttnn, as the input tensor is 2D. PyTorch expects at least a channels dim, and ttnn expects both a channel dim and a batch dim. So if we were to instead rely on ttnn to compute this properly and consteval the result, the lowering pattern for this `ttir.pooling` op in `TTIRToTTIRDecomposition` would require reshapes to be placed on the input and output, which isn't necessarily a blocker. However, a future fusing pattern to convert `div(sum_pool, const)` to `avg_pool` would be needlessly more complex if it also needed to match the case where the denominator is the result of another `sum_pool` of a constant with reshapes on its input and output.
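
A quick check of the rank restriction mentioned above (PyTorch's avg_pool2d expects a 3D or 4D tensor, so a bare 2D ones tensor like the one in the decomposition is rejected):

```python
import torch
import torch.nn.functional as F

ones_2d = torch.ones(56, 56)
try:
    F.avg_pool2d(ones_2d, kernel_size=2, stride=2)
except RuntimeError as err:
    # PyTorch requires at least a channel dim: (C, H, W) or (N, C, H, W).
    print("2D input rejected:", err)
```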

### What's changed
Added a TTIRToTTIRDecomposition pattern for `ttir.pooling` that replaces the operation's result with a `ttir.full` containing the correct values. No computation is required, since the result of such a pattern is straightforward: summing a tensor of ones over each window just yields the number of input positions covered by that window.

### Checklist
- [X] New/Existing tests provide coverage for changes