init._calculate_fan_in_and_fan_out needlessly indexing the weight tensor instead of using its shape. #53540

@JeanKossaifi

Description

🐛 Bug

Currently, init._calculate_fan_in_and_fan_out computes the receptive field by directly indexing the weight tensor instead of just using the shape:

receptive_field_size = tensor[0][0].numel()

This means that, when extending PyTorch's tensor class, e.g. for lazy access, explicitly indexing the weight forces the full tensor to be reconstructed (or its elements to be explicitly accessed).

Since the init sub-module doesn't check for __torch_function__, it is not possible to override the init functions. Simply using the shape avoids the issue.
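A minimal sketch of the proposed fix, written as a standalone helper (the name `fan_in_and_fan_out_from_shape` is hypothetical, for illustration): the receptive field and both fan values can be computed from the shape alone, so no tensor data is ever touched.

```python
import math

def fan_in_and_fan_out_from_shape(shape):
    # Hypothetical helper illustrating the proposal: everything
    # _calculate_fan_in_and_fan_out needs is in the shape.
    if len(shape) < 2:
        raise ValueError(
            "Fan in and fan out can not be computed for a tensor "
            "with fewer than 2 dimensions"
        )
    num_output_fmaps, num_input_fmaps = shape[0], shape[1]
    # Equivalent to tensor[0][0].numel(), but without indexing the
    # tensor; math.prod of an empty tuple is 1, which covers 2-D weights.
    receptive_field_size = math.prod(shape[2:])
    fan_in = num_input_fmaps * receptive_field_size
    fan_out = num_output_fmaps * receptive_field_size
    return fan_in, fan_out
```

For a Conv2d-style weight of shape `(8, 4, 3, 3)` this gives `fan_in = 4 * 9 = 36` and `fan_out = 8 * 9 = 72`, matching what the indexing-based code computes, while a lazy tensor subclass would only need to expose its shape.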

cc @albanD @mruberry @jbschlosser
