nn.Orthogonal #42243
Closed
Labels
feature: A request for a proper, new feature.
module: nn: Related to torch.nn
needs research: We need to decide whether or not this merits inclusion, based on research world
triaged: This issue has been looked at by a team member, and triaged and prioritized into an appropriate module
🚀 Feature
A module nn.Orthogonal, similar to nn.Linear, where the weight matrix W is constrained to be orthogonal, i.e., W^T W = I.
Motivation
There has been a growing interest in orthogonal parameterization of neural networks, see, e.g., [1,2,3,4,5].
To use orthogonal parameterization with PyTorch, one currently has to implement it oneself or use third-party code.
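For concreteness, such a layer could be sketched by parameterizing the weight as the matrix exponential of a skew-symmetric matrix, one of the standard constructions from the references below. This is only a minimal sketch: the class name Orthogonal, its constructor signature, and the absence of a bias term are assumptions for illustration, not an existing PyTorch API.

```python
import torch
import torch.nn as nn

class Orthogonal(nn.Module):
    # Hypothetical sketch of an nn.Orthogonal layer: a square linear
    # map whose weight W satisfies W^T W = I. Orthogonality holds by
    # construction, because exp(A) is orthogonal whenever A is
    # skew-symmetric (A^T = -A).
    def __init__(self, features: int):
        super().__init__()
        # Unconstrained parameter; only its skew-symmetric part matters.
        self.param = nn.Parameter(0.01 * torch.randn(features, features))

    @property
    def weight(self) -> torch.Tensor:
        A = self.param - self.param.T      # skew-symmetric part
        return torch.matrix_exp(A)         # orthogonal by construction

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return x @ self.weight.T


layer = Orthogonal(4)
W = layer.weight
# W^T W should equal the identity up to float32 round-off.
print(torch.allclose(W.T @ W, torch.eye(4), atol=1e-5))  # prints True
```

Gradients flow through torch.matrix_exp into the unconstrained parameter, so the layer can be trained with any standard optimizer while the weight stays exactly orthogonal.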
It would be convenient if PyTorch had a built-in module nn.Orthogonal that handles everything automatically. In particular, nn.Orthogonal could support different methods via, e.g., method={fasth, cayley, exp}.
Pitch
During ICML it was suggested that I make a pull request to PyTorch with FastH [5] as nn.Orthogonal. I want nn.Orthogonal to support three methods: the Cayley transform, the matrix exponential, and FastH.
Additional context
The contribution instructions (see screenshot below) state that algorithms from recently published research are generally not accepted, but they suggest opening an issue, as I have now done.
FastH is up to 20 times faster than the previous sequential algorithm (see the image at the bottom of the page).
Note that this is an algorithmic speed-up: it computes exactly the same thing as the previous algorithm, just faster.
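To make the three candidate methods concrete, the sketch below builds an orthogonal matrix from an unconstrained one via the Cayley transform, the matrix exponential, and a sequential product of Householder reflections. The Householder loop is the O(d) sequential baseline that FastH accelerates; FastH itself is not reproduced here, and all variable names are illustrative.

```python
import torch

torch.manual_seed(0)
d = 6
M = torch.randn(d, d)
A = M - M.T                    # skew-symmetric: A^T = -A
I = torch.eye(d)

# 1) Cayley transform: Q = (I + A)^{-1} (I - A) is orthogonal for skew A.
Q_cayley = torch.linalg.solve(I + A, I - A)

# 2) Matrix exponential: Q = exp(A) is orthogonal for skew A.
Q_exp = torch.matrix_exp(A)

# 3) Sequential product of Householder reflections
#    H_i = I - 2 v_i v_i^T / (v_i^T v_i).
# Any product of reflections is orthogonal; FastH computes the same
# product, just with more parallelism.
Q_hh = torch.eye(d)
for v in torch.randn(d, d):        # one reflection per row
    v = v.unsqueeze(1)             # column vector of shape (d, 1)
    Q_hh = Q_hh - 2.0 * v @ (v.T @ Q_hh) / (v.T @ v)

for Q in (Q_cayley, Q_exp, Q_hh):
    print(torch.allclose(Q.T @ Q, I, atol=1e-4))   # True for each
```

All three constructions are differentiable in PyTorch, which is what would let a single nn.Orthogonal module expose them behind one method argument.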
References
[1] Efficient Orthogonal Parametrisation of Recurrent Neural Networks Using Householder Reflections (ICML 2017)
[2] A Simple Parametrization of the Orthogonal and Unitary Group (ICML 2019)
[3] Stabilizing Gradients for Deep Neural Networks via Efficient SVD Parameterization (ICML 2018)
[4] Trivializations for Gradient-Based Optimization on Manifolds (NeurIPS 2019)
[5] Faster Orthogonal Parameterization with Householder Matrices (ICML Workshop 2020)
cc @albanD @mruberry