I ran into unexpected behaviour with copy.deepcopy applied to a Variable. The gradient buffer of the Variable is not copied.
import copy
import torch

a = torch.autograd.Variable(torch.ones(1))
a.grad = torch.autograd.Variable(torch.ones(1))
b = copy.deepcopy(a)
print(b.grad)  # prints None: the gradient buffer was not copied
I think it would be a good idea to copy the gradient buffer during a deep copy. My use case is recording the gradient of a model's parameter space for optimization research. This would also be useful for debugging/development of complex models that involve atypical gradient operations.
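In the meantime, a possible workaround for this use case (a minimal, untested sketch, not an official API) is to copy the grad buffer by hand after the deepcopy:

import copy
import torch

a = torch.autograd.Variable(torch.ones(1))
a.grad = torch.autograd.Variable(torch.ones(1))

b = copy.deepcopy(a)
# Copy the gradient buffer manually, since deepcopy currently skips it.
if a.grad is not None:
    b.grad = copy.deepcopy(a.grad)
print(b.grad)  # now holds a copy of a.grad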
This is handled here, in pytorch/torch/autograd/variable.py (lines 89 to 97 at 5760b03):
def __deepcopy__(self, memo):
    if not self.is_leaf:
        raise RuntimeError("Only Variables created explicitly by the user "
                           "(graph leaves) support the deepcopy protocol at the moment")
    result = type(self)(self.data.clone())
    result.requires_grad = self.requires_grad
    result.volatile = self.volatile
    memo[id(self)] = result
    return result
A solution would be to also copy the grad attribute of the current Variable, which would require recursing into the deep copy, since the grad attribute is itself a Variable.
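A minimal sketch of how the existing __deepcopy__ above could be extended (the grad handling is my own untested suggestion, and it assumes the copy module is available in that file):

def __deepcopy__(self, memo):
    if not self.is_leaf:
        raise RuntimeError("Only Variables created explicitly by the user "
                           "(graph leaves) support the deepcopy protocol at the moment")
    result = type(self)(self.data.clone())
    result.requires_grad = self.requires_grad
    result.volatile = self.volatile
    # Register the copy before recursing so any back-references to this
    # Variable reachable from .grad resolve to the new copy via memo.
    memo[id(self)] = result
    # Proposed addition: recursively deep-copy the gradient buffer as well.
    if self.grad is not None:
        result.grad = copy.deepcopy(self.grad, memo)
    return result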
cc @ezyang @gchanan @zou3519 @bdhirsh @jbschlosser @albanD @gqchen @pearu @nikitaved @soulitzer @ssnl