Add tag to define differentiable/non-differentiable variables #2723

wschin merged 16 commits into onnx:master
Conversation
```python
# Doc generation: map each formal parameter's differentiation category
# to a human-readable tag in the operator's signature.
if differentiable == formal_parameter.differentiationCategory:
    tags.append('differentiable')
elif non_differentiable == formal_parameter.differentiationCategory:
    tags.append('non-differentiable')
```
I'm wondering whether a separate "training" op doc should be generated; that would make the docs easier to understand, given that ONNX may support more scenarios than DNN. Thoughts?
Unknown would be an empty string. Would it be better to use "differentiability unknown"?
For the second comment, I don't feel splitting this tag from Operator.md is a good idea. To understand why an input/output is differentiable and how to compute its gradient, the reader must read that operator's entire document.
I don't believe that the training algorithms used in deep learning can be applied to many traditional ML models. Hence, we probably don't need to add differentiability tags to ops in ONNX-ML.
@wschin I'm OK with "differentiability - unknown" or "undefined" to make the 3 cases very clear ("differentiable", "non-differentiable", "undefined"), if we don't want to put the training ops into a separate doc.
Or maybe add one line of "statement" at the beginning of operators.md: inputs without a specified "differentiability" have it as "undefined"?
@linkerzhang, training ops cannot be put in another MD because this newly added attribute will be added to ALL existing operators.
Because each operator has its own differentiability, I am not sure how to create a one-line statement at the beginning of operator.md. Do you mean creating one line for each operator?
Yes, I mean one line at the beginning of operator.md, to save clarifying it in each operator :).
@linkerzhang, I think I get your point. Is 1f1fc67 what we want?
```cpp
// Tail of an Input(...) declaration in the op schema; the final
// OpSchema::Differentiable argument marks this input as differentiable.
OpSchema::Single,
true,
1,
OpSchema::Differentiable)
```
How do we prove that an input is differentiable? A few options:

- PyTorch/TensorFlow treats it as differentiable.
- Add the math of the backward pass to the operator's document and ask reviewers to review the math.
- Implement a backward pass using an existing auto-diff library (see the sketch below).
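Along the lines of the third option, here is a minimal sketch (assuming PyTorch is available; the function name `concat_fn` is illustrative) that uses `torch.autograd.gradcheck` to verify that concatenation is differentiable:

```python
import torch

# gradcheck numerically verifies differentiability: it compares analytical
# gradients against finite-difference estimates and fails on a mismatch.
def concat_fn(x, y):
    return torch.cat((x, y), dim=0)

x = torch.randn(3, dtype=torch.double, requires_grad=True)
y = torch.randn(2, dtype=torch.double, requires_grad=True)
assert torch.autograd.gradcheck(concat_fn, (x, y))
```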
Add tag to define differentiable/non-differentiable variables (#2723)

* Draft
* Polish code and fix bugs
* One line to explain how we specify undefined differentiability
* Work on entire def.cc in tensor folder
* Revert some operators' changes to cut PR's size
* Clean unused changes

Co-authored-by: Ke Zhang <kezhan@microsoft.com>
We are adding a tag to explicitly describe whether an input is differentiable or not. This tag, if specified, may be automatically added to the operator's signature in Operator.md.
There are two example uses of this new tag, one for defining the `Split` spec and the other for defining the `Reshape` spec. You can see that in `Split`, both its input and outputs are differentiable. For `Reshape`, its 2nd input is not differentiable. This PR uses the new tag to define differentiability for 5 operators. Their differentiability is explained below.
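For illustration, the generated input list in Operator.md could then render the tags like this (a hypothetical sketch of the output, based on the doc-generation snippet above, using `Reshape` as the example):

```
Inputs

data (differentiable) : T
shape (non-differentiable) : tensor(int64)
```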
Reshape:
This operator has been discussed in #2794.
Shape:
The output is not a differentiable function of the input because a shape is a vector of discrete values. Thus, the Jacobian matrix needed for the backward pass doesn't exist.
Size:
The output is not a differentiable function of the input because a size is a discrete scalar. Thus, the Jacobian matrix needed for the backward pass doesn't exist.
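To make the intuition concrete, here is a tiny sketch (plain NumPy; names are illustrative) showing that the outputs of `Shape` and `Size` are unaffected by any perturbation of the input values, so they provide no gradient signal:

```python
import numpy as np

x = np.random.randn(2, 3)
eps = 1e-6 * np.random.randn(2, 3)

# Shape and Size depend only on the tensor's structure, not its values:
# perturbing x leaves both outputs unchanged.
assert (x + eps).shape == x.shape
assert (x + eps).size == x.size
```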
Concat:
It's straightforward to derive the Jacobian matrix in this case. Let's consider a concatenation of two one-element vectors, [x] and [y]. If axis = 0, the corresponding output is z = [x, y]. Because dL/dz = [dL/dx, dL/dy], the Jacobian matrix

```
[[dx/dx dx/dy]
 [dy/dx dy/dy]]
```

is just an identity matrix. The same idea can be extended to the concatenation of any tensors; the backward operator just copies the right parts of its input (dL/dz) to its outputs (dL/dx and dL/dy).
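As a sketch of that copy operation (plain NumPy; `concat_backward` is a hypothetical helper, not an ONNX API), the backward of a Concat along axis 0 just slices the upstream gradient back into the shapes of the inputs:

```python
import numpy as np

def concat_backward(dL_dz, input_lengths):
    """Slice dL/dz back into per-input gradients (axis 0)."""
    grads, offset = [], 0
    for n in input_lengths:
        grads.append(dL_dz[offset:offset + n])
        offset += n
    return grads

dL_dz = np.array([0.1, 0.2])         # gradient w.r.t. z = [x, y]
dL_dx, dL_dy = concat_backward(dL_dz, [1, 1])
print(dL_dx, dL_dy)                  # [0.1] [0.2]
```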
Split:
The computation of Split's Jacobian is very similar to Concat's; it just inversely maps output elements back to their original input locations. Assume that an input w = [x, y, z] is split into [x], [y], [z]. The corresponding backward operator maps the 3 inputs [dL/dx], [dL/dy], [dL/dz] into [dL/dx, dL/dy, dL/dz].
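Correspondingly, a sketch of Split's backward (again plain NumPy; `split_backward` is a hypothetical helper) just concatenates the pieces' gradients back into the shape of the original input:

```python
import numpy as np

def split_backward(piece_grads):
    """Concatenate the gradients of the split pieces (axis 0)."""
    return np.concatenate(piece_grads, axis=0)

dL_dw = split_backward([np.array([0.1]), np.array([0.2]), np.array([0.3])])
print(dL_dw)  # [0.1 0.2 0.3]
```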