Kernel IR refactoring: part 4 (#274)
This reverts commit fc09c1b5a7240701da093406753908eba6f41e1d.
```cpp
  const std::vector<bool> contiguity_;
};

class TORCH_CUDA_API TensorView : public Val {
```
Contrast the lowered `kir::TensorView` with the `fuser::TensorView` definition.
```cpp
const auto in1 = lowerOperand(top->in1(), top->out());
const auto in2 = lowerOperand(top->in2(), top->out());
const auto in3 = lowerOperand(top->in3(), top->out());
const auto out = lowerOutput(top);
```
```cpp
std::vector<Statement*> inds;
for (auto* ind : ti->indices())
  inds.push_back(mutateAsVal(ind));

bool changed = false;
for (decltype(inds.size()) i{0}; i < inds.size(); i++) {
  TORCH_INTERNAL_ASSERT(inds[i]->isVal() && inds[i]->asVal()->isAnInt());
  if (!inds[i]->sameAs(ti->index(i)))
    changed = true;
}

if (!changed)
  return ti;

std::vector<Val*> valInds(inds.size(), nullptr);
for (decltype(inds.size()) i{0}; i < inds.size(); i++)
  valInds[i] = inds[i]->asVal();

Val* mutated_val = new kir::TensorIndex(ti->view(), valInds);
registerMutation(ti, mutated_val);
return mutated_val;
```
Doesn't seem to be needed. We may be able to eliminate the need for `OptOutMutator` completely after we refactor the lowering code.
Why are only the bodies of the overloads for the kernel IR nodes removed? Wouldn't the class become inconsistent? Wouldn't it make more sense to remove the class entirely if it's not needed?
That would be the end goal, although I believe that removing the class completely would require additional changes.
I've been chipping away slowly to get to the current state, where we can have some confidence that it can be removed (when I started, this was not as clear).
I opened #276 to track this.
csarofeen left a comment:
Looks like a good step in the right direction.
```cpp
// Open a new innermost for loop
kir::ForLoop* openFor(Expr* scope, IterDomain* id) {
  const auto kir_id = new kir::IterDomain(id);
```
```diff
 if (dim->isReduction())
   continue;
-ids.push_back(dim);
+ids.push_back(new kir::IterDomain(dim));
```
This iteration completes the definition of all the nodes present in the Kernel IR. In this PR they are kept in sync with the original definitions as much as possible, but the goal is to specialize the Kernel IR nodes to fit their lowered roles, for example:

- `kir::TensorView` -> `kir::Array`?
- `kir::IterDomain` -> ??? range
- `kir::TensorDomain` -> ??? dimension

These are not plan of record, just illustrations that we'll have the opportunity to define simpler and more specific abstractions in the Kernel IR.
Note that the Fusion IR / Kernel IR split is still not complete. Some Kernel IR nodes may still point to Fusion IR nodes, although this is becoming limited to a few cases, which will be the subject of the next iteration.