Proposal: replace Variable.volatile with global switch #3627

`Variable.volatile` forces outputs to not require gradients if any of the inputs are marked volatile. This works fine in the forward pass, but we're forced to [change the meaning](https://github.com/pytorch/pytorch/blob/b06c59e543aa26586087c19fb7b713f8872105bb/torch/csrc/autograd/functions/accumulate_grad.cpp#L38) in the backward pass. Gradients are sometimes volatile and sometimes not, which is awkward when they are added back to parameters, as optimizers do.

We should replace `volatile` with a context manager in Python. (Chainer already did this with [`no_backprop_mode()`](http://docs.chainer.org/en/stable/reference/core/generated/chainer.no_backprop_mode.html#chainer.no_backprop_mode).)

At the C++ level, we should replace it with a thread-local global switch.
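To make the idea concrete, here is a minimal sketch of a context manager backed by a thread-local flag. It is written in pure Python to model the behavior only; the actual switch would live in C++. The names `_state`, `is_grad_enabled`, and `no_backprop_mode` are hypothetical and chosen for illustration (the last one borrows Chainer's name); none of them is an existing PyTorch API.

```python
import contextlib
import threading

# Hypothetical thread-local storage standing in for the C++ switch.
_state = threading.local()


def is_grad_enabled():
    """Return the current thread's flag; gradients are on by default."""
    return getattr(_state, "grad_enabled", True)


@contextlib.contextmanager
def no_backprop_mode():
    """Disable graph construction for the current thread only."""
    prev = is_grad_enabled()
    _state.grad_enabled = False
    try:
        yield
    finally:
        # Restore the previous value so nested uses behave correctly.
        _state.grad_enabled = prev
```

Because the flag is thread-local, entering `no_backprop_mode()` in one thread leaves other threads unaffected, and the previous value is restored on exit even if the body raises.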

This will simplify the logic in the backward pass: by default, `backward()` will set "no-backprop mode" unless `create_graph` is True.
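A rough sketch of how the backward pass could use such a switch, assuming a thread-local flag like the one proposed above. Everything here is hypothetical: `set_grad_enabled`, `is_grad_enabled`, and the `run_backward_fn` callable are illustrative stand-ins, not the real autograd engine.

```python
import contextlib
import threading

# Hypothetical thread-local flag; gradients are enabled by default.
_state = threading.local()


def is_grad_enabled():
    return getattr(_state, "grad_enabled", True)


@contextlib.contextmanager
def set_grad_enabled(enabled):
    """Temporarily set the flag, restoring the old value on exit."""
    prev = is_grad_enabled()
    _state.grad_enabled = enabled
    try:
        yield
    finally:
        _state.grad_enabled = prev


def backward(run_backward_fn, create_graph=False):
    """Run the backward computation with graph construction turned off,
    unless the caller asks for a graph (e.g. for higher-order gradients)."""
    with set_grad_enabled(create_graph):
        return run_backward_fn()
```

With this shape, gradient computations no longer need a per-`Variable` volatile flag: whether they record a graph depends only on the mode that `backward` sets for the duration of the call.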
