Small optimization for adam #12107

Closed

jma127 wants to merge 1 commit into pytorch:master from jma127:master


Conversation

@jma127 (Contributor) commented Sep 26, 2018

Apply weight decay for Adam in-place instead of via copy.

Synced offline with @soumith, who mentioned that it should be OK. This is also consistent with other optimizers, e.g.:

d_p.add_(weight_decay, p.data)
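For readers comparing the two variants, here is a minimal, hypothetical sketch of the difference (the helper name is mine, and it uses the keyword `alpha=` spelling of the same `add`/`add_` call quoted above; it is not the actual diff):

import torch

def apply_weight_decay(p: torch.nn.Parameter, weight_decay: float, inplace: bool) -> torch.Tensor:
    """Illustrative helper, not part of torch.optim: fold L2 weight decay
    into the gradient either via a copy (pre-PR Adam) or in-place (this PR)."""
    grad = p.grad
    if weight_decay != 0:
        if inplace:
            # This PR, mirroring the SGD line quoted above: reuse the gradient buffer.
            grad.add_(p.data, alpha=weight_decay)
        else:
            # Pre-PR Adam: allocate a new tensor; p.grad is left untouched.
            grad = grad.add(p.data, alpha=weight_decay)
    return grad

The in-place variant saves one temporary tensor the size of the parameter per step, at the cost of mutating p.grad, which is what the discussion below is about.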

@facebook-github-bot (Contributor) left a comment


soumith is landing this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@ssnl (Collaborator) commented Sep 27, 2018

well... I would certainly expect .grad to not change after optimizer step.
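For concreteness, a small check of that invariant; whether it prints True or False depends on whether the installed build applies Adam's weight decay via a copy or in place (as this PR does):

import torch

p = torch.nn.Parameter(torch.ones(3))
opt = torch.optim.Adam([p], lr=1e-1, weight_decay=1e-2)

p.sum().backward()              # raw autograd gradient: all ones
grad_before = p.grad.clone()
opt.step()

# True if Adam applied weight decay via a copy (p.grad untouched),
# False if it folded weight_decay * p into p.grad in place, as this PR does.
print(torch.equal(p.grad, grad_before))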

@jma127 (Contributor, Author) commented Sep 27, 2018

Hmm, then the SGD implementation should be fixed to satisfy that invariant.

I'll leave it to you guys to determine whether or not this is a necessary invariant -- feel free to revert as you see fit.
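For reference, the fix being suggested for SGD would be the mirror image of this PR, swapping the in-place call for a copying one; a sketch only, not an actual patch, reusing the variable names from the SGD snippet quoted earlier:

# Current SGD (mutates p.grad):
#     d_p.add_(weight_decay, p.data)
# Invariant-preserving alternative (allocates a new tensor, p.grad untouched):
d_p = d_p.add(weight_decay, p.data)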

@ezyang added the merged label Jun 26, 2019