Use blocks machinery to simplify bookkeeping in autodiff #5036
Merged
ezyang merged 3 commits into pytorch:master on Feb 5, 2018
Conversation
Using @ezyang's suggestion, this change uses a block rather than staging annotations to represent the reverse pass. This allows us to reuse the machinery for copying graphs/blocks to extract the reverse pass concisely. This also changes the input order of Gradient's df to: [output vjps][temporary vjps][captures]. In addition to being simpler to generate in this order, it will also allow ExecutionPlan to append the captures onto the already-existing input list of vjps that are given by the autograd, rather than having to prepend them, which should be slightly cheaper.
This changes the Gradient struct to enforce that input captures appear before output captures in the capture list, which makes it easier to use in ExecutionPlan.
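The cost argument above can be sketched in a few lines. This is a hypothetical illustration (the function name and element types are not from the PR): with captures last, df's input list is built by appending to the vjp list the autograd already produced, which avoids shifting every existing element the way a prepend would.

```cpp
#include <vector>

// Hypothetical sketch: assemble df's inputs in the order
// [output vjps][temporary vjps][captures].
// vector::insert at end() is amortized O(#captures); inserting at
// begin() would be O(#vjps + #captures) due to element shifting.
std::vector<int> assembleDfInputs(std::vector<int> vjps,
                                  const std::vector<int>& captures) {
    vjps.insert(vjps.end(), captures.begin(), captures.end());
    return vjps;
}
```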
71b4c33 to 6f27a41
Contributor
I did only a cursory look but it all seems fine.
Contributor
@ezyang thanks for letting me take a look :P
apaszke reviewed Feb 6, 2018
// note: reverse_node is intentionally not inserted to avoid
// accidentally acting on it (e.g. in eliminate dead code);
// use std::cout << *reverse_node to view its state.
auto reverse_node = graph.create("Reverse"_sym, 0);
primal_outputs.emplace_back(capture_val);
grad_desc.df_input_captures.emplace_back(Capture::Kind::Output,
                                         primal_outputs.size() - 1);
// we need to create a new temporary output for this capture because it wasn't available.
// XXX: Take care when handling outputs - they can be duplicated!
Gradient grad_desc;
WithInsertPoint guard(*grad_desc.f, grad_desc.f->block());
laurentdupin pushed a commit to laurentdupin/pytorch that referenced this pull request on Apr 24, 2026
* Remove addValues and use WithInsertPoint
* Use blocks to simplify differentiate
Using @ezyang's suggestion, this change uses a block rather than staging annotations to represent the reverse pass. This allows us to reuse the machinery for copying graphs/blocks to extract the reverse pass concisely. This also changes the input order of Gradient's df to: [output vjps][temporary vjps][captures]. In addition to being simpler to generate in this order, it will also allow ExecutionPlan to append the captures onto the already-existing input list of vjps that are given by the autograd, rather than having to prepend them, which should be slightly cheaper.
* Enforce that input captures are before outputs
This changes the Gradient struct to enforce that input captures appear before output captures in the capture list, which makes it easier to use in ExecutionPlan.
Using @ezyang's suggestion, this change uses a block rather than
staging annotations to represent the reverse pass. This allows us to reuse the machinery to copy graphs/blocks to extract the reverse pass concisely, eliminating ~50 lines of code.
This also changes the input order of Gradient's df to:
[output vjps][temporary vjps][captures]
In addition to being simpler to generate in this order, it also
will allow ExecutionPlan to append the captures onto the already-
existing input list of vjps that are given by the autograd,
rather than have to prepend them, which should be slightly cheaper.
This also changes the Gradient struct to enforce that input
captures appear before output captures in the capture list,
which makes it easier to use in ExecutionPlan.
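The ordering invariant described above can be sketched as a small check. This is an assumed illustration, not the PR's actual implementation: the Capture and Kind names mirror the diff, but the struct layout and the helper function are hypothetical.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical sketch of the Gradient capture-list invariant:
// every Input capture must appear before every Output capture.
struct Capture {
    enum class Kind { Input, Output };
    Kind kind;
    std::size_t offset;  // index into the primal inputs or outputs
};

// Returns true iff no Input capture appears after an Output capture,
// i.e. the list is [Input...][Output...].
bool inputsPrecedeOutputs(const std::vector<Capture>& captures) {
    bool seen_output = false;
    for (const auto& c : captures) {
        if (c.kind == Capture::Kind::Output) {
            seen_output = true;
        } else if (seen_output) {
            return false;  // Input found after an Output: invariant broken
        }
    }
    return true;
}
```

Keeping the list in this fixed shape is what lets ExecutionPlan consume input captures and output captures as two contiguous runs rather than filtering by kind.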