Remove clone in fused rnn double back quick fix #1683
csarofeen wants to merge 7 commits into pytorch:master
Conversation
What's the memory footprint of these changes? It seems to hold quite a large workspace. Can't it e.g. save

That goes out of scope after the forward call and is not particularly big. But there were some larger memory-usage issues that I got rid of.
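For illustration of the workspace being discussed: the diff below allocates a buffer of `hx.numel() * 5` elements. A minimal pure-Python sketch of that sizing (the helper name and the byte arithmetic are hypothetical; the factor of 5 is taken from the `resize_(hx.numel() * 5)` line in the diff):

```python
def fused_gru_workspace_numel(batch_size, hidden_size, factor=5):
    # The diff allocates input_gate.new().resize_(hx.numel() * 5),
    # i.e. `factor` hidden-state-sized slabs per forward call.
    hx_numel = batch_size * hidden_size
    return hx_numel * factor

# e.g. batch 32, hidden 512, float32 (4 bytes per element)
elems = fused_gru_workspace_numel(32, 512)
bytes_needed = elems * 4
```

As the comment above notes, this buffer goes out of scope after the forward call, so it is transient rather than held for the lifetime of the module.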
apaszke left a comment
Looks good, but the linter is unhappy
```cpp
*oghn = ghn;
DEVICE_LINEAR_GET(hidden, offset+0*hsz) = grg;
DEVICE_LINEAR_GET(hidden, offset+1*hsz) = gig;
DEVICE_LINEAR_GET(hidden, offset+2*hsz) = ghn;
```
```python
ibias = ibias.view(1, -1)
if hbias.dim() == 1:
    hbias.unsqueeze_(0)
    hbias = hbias.view(1, -1)
```
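For context on the snippet above: on a 1-D bias of `n` elements, both `unsqueeze_(0)` and `view(1, -1)` produce a `(1, n)` row. A pure-Python sketch of that shape change, using a nested list in place of a tensor (the helper name is hypothetical):

```python
def as_row(vec):
    # Mimic tensor.view(1, -1) / tensor.unsqueeze(0) on a 1-D sequence:
    # wrap the n elements in an outer list, giving shape (1, n).
    return [list(vec)]

row = as_row([0.1, 0.2, 0.3])
# one row, three columns
```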
```python
self.backend = type2backend[type(input_gate)]

hy = input_gate.new()
storage = input_gate.new().resize_(hx.numel() * 5)
```
```python
igc = input_gate.clone()
hgc = hidden_gate.clone()
gradInputHx = gradOutput.new()
gradInInput = gradOutput.new().resize_(*self.igate_size)
```
```python
return igc, hgc, gradInput, gb1, gb2
if self.hasBias:
    gb1 = gradInInput.sum(0).squeeze()
    gb2 = gradInHidden.sum(0).squeeze()
```
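The `gb1`/`gb2` lines above reduce the gate gradients over the batch dimension (`sum(0)`) to obtain bias gradients. A minimal pure-Python equivalent of that reduction, with plain nested lists standing in for tensors (the helper name is hypothetical):

```python
def bias_grad(grad_gates):
    # Equivalent of gradInInput.sum(0): sum each column over the
    # batch (row) dimension, leaving one value per gate unit.
    return [sum(col) for col in zip(*grad_gates)]

g = bias_grad([[1.0, 2.0],
               [3.0, 4.0],
               [5.0, 6.0]])
# g == [9.0, 12.0]
```

Since every batch element shares the same bias, its gradient is the sum of the per-element gradients, which is what `sum(0)` computes.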
```python
gradInInput = gradOutput.new().resize_(*self.igate_size)
gradInHidden = gradOutput.new().resize_(*self.hgate_size)

storage = self.buffer
```
```python
hy = input_gate.new()
storage = input_gate.new().resize_(hx.numel() * 5)

self.hasBias = False
```
Merged into master.
Better fix for #1532