Fix contiguity of expanded tensor #2049

Merged
zasdfgbnm merged 8 commits into devel from fix-expand-contiguity on Oct 12, 2022

Conversation

@zasdfgbnm
Collaborator

@zasdfgbnm zasdfgbnm commented Oct 11, 2022

  • Make TensorViewBuilder able to generate expanded tensor.
  • Fixes a contiguity problem found while working on Fix vec validation on expanded broadcasting #2044. Note that an expanded dimension and the dimension before it can never be contiguous. Without this fix, the added test generates silently wrong results.
  • Validate contiguity of expanded dimensions during lowering

@zasdfgbnm zasdfgbnm requested review from csarofeen and naoyam October 11, 2022 08:30
Val* expanded_extent = nullptr;
Val** shape_extent = &extent;
if (!expanded_.empty()) {
is_expanded = expanded_[i];
Collaborator

Just in case, please use expanded_.at(i) to avoid silent memory errors

Collaborator Author

fixed

if (i == -1) {
shape_.emplace_back(IrBuilder::create<Int>());
} else if (i == 1) {
shape_.emplace_back(FusionGuard::getCurFusion()->oneVal());
Collaborator

Is there any specific reason to use the one val? Do we also want to use the zero val?

Collaborator Author

Just trying to save an instance creation; I don't think it will make much difference here. And yes, we should also use zero val, because why not.

domain[i] =
IterDomainBuilder(FusionGuard::getCurFusion()->zeroVal(), shape_[i])
.build();
*shape_extent = shape_[i];
Collaborator

Same here

Collaborator Author

fixed

@naoyam
Collaborator

naoyam commented Oct 11, 2022

Contiguity and expanded broadcasts still make me uneasy, as I haven't yet got a clear idea of what they mean in PyTorch. Can you please explain why expanded domains and the domains to their left should never be contiguous?

@zasdfgbnm
Collaborator Author

@naoyam For example, if you have a contiguous tensor of shape (4, 5, 1, 6), its stride could be (30, 6, whatever, 1), but if you expand this tensor to (4, 5, 10, 6), the stride has to be (30, 6, 0, 1). By definition, true contiguity at the expanded dimension (dim 2) would require its stride to be stride[3] * 6 = 6, but an expanded dimension's stride is always 0. Similarly, true contiguity at the dimension before it (dim 1) would require its stride to be stride[2] * 10 = 0, which is impossible for a dimension that actually advances through memory (here it is 6).

@naoyam
Collaborator

naoyam commented Oct 11, 2022


Thanks for the explanation. Can you please add this as a comment to the code? Maybe at TensorDomain::getContiguousContiguity?

@zasdfgbnm
Collaborator Author

@naoyam I have resolved all review comments

Collaborator

@naoyam naoyam left a comment

LGTM.
