CuDNNConvolutionLayer accumulate gradients #3254
Conversation

@ronghanghu (original issue):

After #1977, all layers should accumulate gradients for parameters. These two lines introduced in #3160 seem to be a bug.

@shelhamer @slayton58 can you take a look?
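For context, a minimal sketch of the accumulation contract, assuming the suspect lines zero the parameter diffs inside Backward_gpu (variable names here are illustrative, not the exact CuDNNConvolutionLayer members, and the call uses the cudnnConvolutionBackwardFilter signature from cuDNN v4+):

```cpp
// Since #1977 the solver zeroes parameter diffs once per iteration, so a
// layer's Backward must *add* its gradient into the existing diff. This
// is what makes weight sharing and gradient accumulation work.
//
// WRONG -- zeroing the diff inside Backward discards gradients already
// accumulated there by shared weights or earlier backward passes:
//   caffe_gpu_set(this->blobs_[0]->count(), Dtype(0), weight_diff);
//
// RIGHT -- let cuDNN accumulate by passing beta = 1:
const Dtype alpha = 1.0, beta = 1.0;  // beta = 1 => dW += gradient
CUDNN_CHECK(cudnnConvolutionBackwardFilter(
    handle,
    &alpha,
    bottom_desc, bottom_data,    // input x
    top_desc, top_diff,          // dL/dy from the layer above
    conv_desc, bwd_filter_algo,
    workspace, workspace_size,
    &beta,                       // 1.0 accumulates; 0.0 would overwrite
    filter_desc, weight_diff));  // dL/dW added into the existing diff
```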
Good catch @ronghanghu -- this definitely looks wrong. I'll leave open for the moment since I haven't read this code before, but we should take care of this immediately. We should really have better testing for this situation... if this is bugged as it appears, this breaks weight sharing and gradient accumulation.
Yep, this does look like a bug -- when I wrote the initial support on our fork and merged it across, it looks like this got dragged in (it's still present in our code).
Good catch @ronghanghu and sorry for missing this in my review. While there …
We need to adapt the gradient checker to gradient accumulation in order to prevent future mistakes like this one and #2532.
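One way to do that, sketched below: run Backward twice and assert the parameter diff doubles. This assumes Caffe's Layer/Blob API and gtest; the helper name CheckDiffAccumulation is hypothetical, and this is not @tnarihi's actual patch (linked in the next comment):

```cpp
#include <vector>

#include "caffe/blob.hpp"
#include "caffe/layer.hpp"
#include "caffe/util/math_functions.hpp"
#include "gtest/gtest.h"

namespace caffe {

// Hypothetical helper: verify that a layer accumulates into its first
// parameter's diff instead of overwriting it.
template <typename Dtype>
void CheckDiffAccumulation(Layer<Dtype>* layer,
    const std::vector<Blob<Dtype>*>& bottom,
    const std::vector<Blob<Dtype>*>& top) {
  Blob<Dtype>* param = layer->blobs()[0].get();
  std::vector<bool> propagate_down(bottom.size(), true);

  // Pass 1: zero the diff, run backward once, snapshot the gradient.
  caffe_set(param->count(), Dtype(0), param->mutable_cpu_diff());
  layer->Forward(bottom, top);
  layer->Backward(top, propagate_down, bottom);
  std::vector<Dtype> grad_once(param->cpu_diff(),
                               param->cpu_diff() + param->count());

  // Pass 2: backward again *without* zeroing. An accumulating layer now
  // holds 2 * grad_once; a layer that overwrites still holds grad_once.
  layer->Backward(top, propagate_down, bottom);
  for (int i = 0; i < param->count(); ++i) {
    EXPECT_NEAR(param->cpu_diff()[i], Dtype(2) * grad_once[i], 1e-4)
        << "param diff was overwritten, not accumulated, at index " << i;
  }
}

}  // namespace caffe
```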
👍 I agree that the gradient checker needs to be updated. A simple fix was proposed earlier by @tnarihi: tnarihi@7d45526. See also #3036.