Default initial hidden states for recurrent layers (Issue #434, PR #605)
apaszke merged 7 commits into pytorch:master from goelhardik:issue-434
Conversation
torch/nn/modules/rnn.py (Outdated)

```diff
-    def forward(self, input, hx):
+    def forward(self, input, hx=None):
+        if (hx == None):
+            batch_sz = input.size()[0] if self.batch_first else input.size()[1]
+            hx = torch.autograd.Variable(torch.Tensor(self.num_layers, batch_sz,
+                                                      self.input_size).zero_())
+            if (self.mode == 'LSTM'):
+                hx = (torch.autograd.Variable(hx.data),
+                      torch.autograd.Variable(hx.data))
```
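The review comments on this hunk were hidden, but two fixes the reviewers likely flagged are visible in the diff itself: `hx == None` should be `hx is None`, and the zero state should be sized by `hidden_size`, not `input_size`. A minimal sketch of the corrected logic, written against the modern PyTorch API (plain tensors instead of the old `torch.autograd.Variable` wrapper) with an assumed helper name `default_hidden`:

```python
import torch

def default_hidden(input, num_layers, hidden_size, mode, batch_first=False):
    """Build a zero initial hidden state matching the input's batch size.

    Hypothetical helper illustrating the PR's logic with the draft's two
    bugs fixed: identity comparison with None, and hidden_size (not
    input_size) as the last state dimension.
    """
    batch_sz = input.size(0) if batch_first else input.size(1)
    hx = torch.zeros(num_layers, batch_sz, hidden_size)
    if mode == 'LSTM':
        # An LSTM takes a (h_0, c_0) pair; both start at zero.
        hx = (hx, hx.clone())
    return hx
```

Because the logic lives in one place, the same helper serves RNN, GRU, and LSTM, with only the LSTM branch differing.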
One last thing. Can you please add a test that exercises this change? Just instantiate one of each kind of RNN we have and pass a batch through it, once without passing the hidden state and once with a manually constructed one. Then use …
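The requested test might look like the sketch below. It is an assumption about what the reviewer had in mind (the comment is cut off), written against the modern `torch.nn` API: run the same input through each recurrent module with no hidden state and with an explicit zero state, then check the outputs match.

```python
import torch
import torch.nn as nn

def check_default_hidden(rnn_cls, mode):
    """Hypothetical test: the default hidden state should behave like
    an explicitly constructed zero state."""
    torch.manual_seed(0)
    rnn = rnn_cls(input_size=4, hidden_size=6, num_layers=2)
    x = torch.randn(5, 3, 4)  # (seq_len, batch, input_size)

    # Forward once without a hidden state (exercises the default)...
    out_default, _ = rnn(x)

    # ...and once with a manually constructed zero state.
    h0 = torch.zeros(2, 3, 6)
    hx = (h0, h0.clone()) if mode == 'LSTM' else h0
    out_manual, _ = rnn(x, hx)

    assert torch.allclose(out_default, out_manual)
```

Calling `check_default_hidden` for `nn.RNN`, `nn.GRU`, and `nn.LSTM` covers all three module types the PR touches.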
I think I did a merge while trying to rebase my branch, which is why it shows the last commit 722c407. Is this okay? Should I try to revert it, or just go ahead with adding the test case?
Go ahead and add the test case. We'll squash it down before merging.
This sets a default initial hidden state of zeros when the user does not provide one. The change is made in the RNNBase class, so it works for all three recurrent modules: RNN, GRU, and LSTM.