The Statistical Recurrent Unit

authors: Junier B. Oliva, Barnabas Poczos, Jeff Schneider
arxiv: https://arxiv.org/abs/1703.00381
Pytorch implemention of the experiment of SRU with pixel-by-pixel sequential MNIST.
Powered by DL HACKS

Requirements

I choose Adam for optimization, though SGD is used in the paper. (It might converge faster)
weight_decay is used. (The paper doesn't refer to it)

Name		Name	Last commit message	Last commit date
Latest commit History 70 Commits
trained_models		trained_models
.gitignore		.gitignore
README.md		README.md
main.py		main.py
models.py		models.py
sru_tutorial.ipynb		sru_tutorial.ipynb
tune_params.py		tune_params.py