Conversation
Note that this PR implements formulas only for ops that are supported by OpInfo.
@albanD has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
Note that this PR implements formulas only for ops that are supported by OpInfo. Slow gradcheck also passes for this PR and can be found here: #57976 Differential Revision: [D28387766](https://our.internmc.facebook.com/intern/diff/D28387766)
```yaml
self: handle_r_to_c(self.scalar_type(), grad)
tensor1: handle_r_to_c(tensor1.scalar_type(), grad * (value / tensor2).conj())
tensor2: handle_r_to_c(tensor2.scalar_type(), -grad * (value * tensor1 / (tensor2 * tensor2)).conj())
result: self_t + maybe_multiply(tensor1_t / tensor2_p, value) - maybe_multiply(tensor2_t * (tensor1_p / tensor2_p) / tensor2_p, value)
```
(no action required) Could you actually "auto-elementwise" this and other pointwise operations? It looks like the formula (at least in the real case) is just the sum of the backward formulas for self, tensor1, and tensor2, with each grad replaced by the corresponding tangent.
Yes we could do it here. Will add it if it shows up again.
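The tangent formula being discussed can be sanity-checked outside of PyTorch. Below is a hedged, illustrative NumPy sketch (not the actual codegen or autograd machinery) that compares the forward-mode tangent for `addcdiv`, as written in the `result:` line above, against a finite-difference JVP of `f(self, tensor1, tensor2) = self + value * tensor1 / tensor2`:

```python
# Illustrative sketch only: numerically check the forward-mode formula
#   result_t = self_t + value * tensor1_t / tensor2_p
#            - value * tensor2_t * (tensor1_p / tensor2_p) / tensor2_p
# against a finite-difference JVP of f(s, t1, t2) = s + value * t1 / t2.
import numpy as np

rng = np.random.default_rng(0)
value = 0.7
s = rng.normal(size=3)
t1 = rng.normal(size=3)
t2 = np.abs(rng.normal(size=3)) + 1.0  # keep the denominator away from zero

s_t, t1_t, t2_t = rng.normal(size=3), rng.normal(size=3), rng.normal(size=3)

def f(s, t1, t2):
    return s + value * t1 / t2

# Analytic tangent, transcribed from the derivatives.yaml entry above
result_t = s_t + value * t1_t / t2 - value * t2_t * (t1 / t2) / t2

# Finite-difference JVP along the tangent direction
h = 1e-6
fd = (f(s + h * s_t, t1 + h * t1_t, t2 + h * t2_t) - f(s, t1, t2)) / h

# The two should agree up to finite-difference error
print(np.max(np.abs(result_t - fd)))
```

This is exactly the "sum of per-input rules with grads replaced by tangents" structure the comment above points out for elementwise ops.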
```yaml
self: maybe_multiply(grad, beta.conj())
mat1: mm_mat1_backward(grad, mat2, mat1.sizes(), mat1.strides(), alpha)
mat2: mm_mat2_backward(grad, mat1, mat2.sizes(), mat2.strides(), alpha)
result: maybe_multiply(self_t, beta) + maybe_multiply(mat1_t.mm(mat2_p), alpha) + maybe_multiply(mat1_p.mm(mat2_t), alpha)
```
It's interesting to note that this is just maybe_multiply(self_t, beta) added to maybe_multiply(formula_for_mm, alpha). Is there any chance we would want to dedup code between this and the mm formula in the future?
The main problem with such formulas is that they are not element-wise, so simply adding the formulas won't work.
They are also affine (not linear), so we would need to pass some arguments to a smarter auto_affine to handle this, which feels like a dangerous step to take.
Yeah, I agree the design for this would be tricky.
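The affine structure of the `addmm` tangent noted above can also be checked numerically. This is a hedged NumPy sketch (not PyTorch code): the tangent is `beta * self_t` plus `alpha` times the `mm` forward formula `mat1_t @ mat2_p + mat1_p @ mat2_t`, compared against a finite-difference JVP:

```python
# Illustrative sketch only: check the addmm forward-mode tangent
#   result_t = beta * self_t + alpha * (mat1_t @ mat2_p + mat1_p @ mat2_t)
# against a finite-difference JVP of
#   f(self, mat1, mat2) = beta * self + alpha * mat1 @ mat2.
import numpy as np

rng = np.random.default_rng(1)
alpha, beta = 0.5, 2.0
self_p = rng.normal(size=(3, 4))
mat1_p, mat2_p = rng.normal(size=(3, 5)), rng.normal(size=(5, 4))
self_t = rng.normal(size=(3, 4))
mat1_t, mat2_t = rng.normal(size=(3, 5)), rng.normal(size=(5, 4))

def f(s, m1, m2):
    return beta * s + alpha * (m1 @ m2)

# beta * self_t plus alpha * (the mm product-rule tangent)
result_t = beta * self_t + alpha * (mat1_t @ mat2_p + mat1_p @ mat2_t)

h = 1e-6
fd = (f(self_p + h * self_t, mat1_p + h * mat1_t, mat2_p + h * mat2_t)
      - f(self_p, mat1_p, mat2_p)) / h

print(np.max(np.abs(result_t - fd)))
```

Because `mat1 @ mat2` is bilinear, the only finite-difference error here is the `h * alpha * mat1_t @ mat2_t` cross term, which vanishes as `h → 0`; this is the product-rule structure that makes the formula "affine, not linear" in the inputs.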
zou3519 left a comment
The formulas lgtm from a real-numbers perspective, but I am not sure how to derive them for complex numbers.
Offline, Alban walked me through a derivation of the complex forward-mode AD derivatives for torch.sin and torch.conj, and those helped me understand enough to derive the complex formulas as well.
NB: we should update the one example above that wasn't updated for this PR, but other than that things lgtm
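The two examples mentioned in the comment above illustrate the key complex-AD distinction: sin is holomorphic, so its tangent is just `cos(z) * z_t`, while conj is not holomorphic and its tangent is `conj(z_t)`. A hedged NumPy sketch (illustrative only, not the PyTorch derivation itself) checking both rules by stepping along the tangent direction with a small real step:

```python
# Illustrative sketch: complex forward-mode rules for sin and conj.
# For holomorphic sin: d/dh sin(z + h * z_t) at h = 0 is cos(z) * z_t.
# conj is not holomorphic; its tangent is conj(z_t).
import numpy as np

rng = np.random.default_rng(2)
z = rng.normal(size=4) + 1j * rng.normal(size=4)
z_t = rng.normal(size=4) + 1j * rng.normal(size=4)
h = 1e-6

# sin: finite-difference JVP vs. the holomorphic chain rule
fd_sin = (np.sin(z + h * z_t) - np.sin(z)) / h
assert np.max(np.abs(fd_sin - np.cos(z) * z_t)) < 1e-4

# conj: the map is real-linear, so the finite difference is exact
fd_conj = (np.conj(z + h * z_t) - np.conj(z)) / h
assert np.max(np.abs(fd_conj - np.conj(z_t))) < 1e-10
```

The conj case is why the tangent of a general (non-holomorphic) op cannot be written as a single complex derivative times `z_t`.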
Summary: Pull Request resolved: pytorch#57768 Note that this PR implements formulas only for ops that are supported by OpInfo. Test Plan: Imported from OSS Reviewed By: zou3519, malfet Differential Revision: D28387766 Pulled By: albanD fbshipit-source-id: b4ba1cf1ac1dfd46cdd889385c9c2d5df3cf7a71