Update Dropout and BatchNorm to be Training Friendly #2568

Merged
wschin merged 23 commits into onnx:master from lara-hdr:lahaidar/update_training_ops
Feb 6, 2020
Conversation

@lara-hdr
Contributor

@lara-hdr lara-hdr commented Jan 23, 2020

Initial PR: #1887
This PR updates Dropout and BatchNormalization to be training friendly.

For Dropout:
The "ratio" is now an input rather than an attribute of the operator, and a new attribute "seed" is introduced.
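The Dropout change above can be illustrated with a minimal NumPy sketch of inverted dropout, where `ratio` arrives as a runtime value (an input, not a fixed attribute) and `seed` makes the random mask reproducible. This is an illustrative reference, not the ONNX spec's exact signature; the `training_mode` flag here mirrors the training/inference distinction discussed in this PR.

```python
import numpy as np

def dropout(x, ratio=0.5, training_mode=True, seed=None):
    """Inverted-dropout sketch: `ratio` is a runtime value and `seed`
    makes the random keep/drop mask reproducible."""
    if not training_mode or ratio == 0.0:
        # Inference (or ratio 0): identity, mask is all-True.
        return x, np.ones(x.shape, dtype=bool)
    rng = np.random.default_rng(seed)
    mask = rng.random(x.shape) >= ratio  # keep each element with prob. 1 - ratio
    scale = 1.0 / (1.0 - ratio)          # rescale kept activations
    return x * mask * scale, mask
```

Because `ratio` is an input, a runtime can feed different values per step (e.g. annealing schedules) without rebuilding the graph, and a fixed `seed` makes runs reproducible.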

For BatchNormalization:
The operator already has support for mean and var as inputs/outputs, and for saved_mean and saved_var as model outputs. The input mean and var are the running values when in training mode; otherwise they are the estimated values. The optional outputs mean/var/saved_mean/saved_var are only used in training mode.
A new optional input "training_mode" is introduced and defaults to False, since in most cases ONNX models are exported in inference mode; it is an input rather than an attribute because it needs to be modified at runtime.
The new input "training_mode" allows stating explicitly whether the operator is in training or inference mode. However, the backend engine running the ONNX model could infer this information from the model itself, so we could potentially remove it entirely and let the engine decide how to compute this operator's output.
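The BatchNormalization behavior described above can be sketched in NumPy: in training mode the batch statistics normalize the input and the running mean/var are updated and returned as extra outputs; in inference mode the stored estimates are used. The function name, argument order, and the `momentum` update convention are illustrative assumptions, not the exact ONNX operator signature.

```python
import numpy as np

def batch_norm(x, scale, bias, mean, var, training_mode=False,
               momentum=0.9, eps=1e-5):
    """Per-channel batch norm over NCHW-like input (channel axis 1).
    Training mode: normalize with batch statistics, update running
    mean/var. Inference mode: normalize with the stored estimates."""
    axes = (0,) + tuple(range(2, x.ndim))  # reduce over all axes but channel
    if training_mode:
        batch_mean = x.mean(axis=axes)
        batch_var = x.var(axis=axes)
        # Updated running statistics, returned as the extra outputs.
        mean = momentum * mean + (1.0 - momentum) * batch_mean
        var = momentum * var + (1.0 - momentum) * batch_var
        use_mean, use_var = batch_mean, batch_var
    else:
        use_mean, use_var = mean, var
    shape = (1, -1) + (1,) * (x.ndim - 2)  # broadcast along the channel axis
    x_hat = (x - use_mean.reshape(shape)) / np.sqrt(use_var.reshape(shape) + eps)
    return x_hat * scale.reshape(shape) + bias.reshape(shape), mean, var
```

This makes concrete why `training_mode` must be switchable at runtime: the same graph node computes two different functions (and produces meaningful extra outputs only in training mode).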

@lara-hdr lara-hdr requested a review from a team as a code owner January 23, 2020 18:24
@lara-hdr
Contributor Author

@SherlockNoMad @wschin for review

@lara-hdr lara-hdr requested a review from a team as a code owner January 24, 2020 23:25
@wschin wschin requested a review from a team January 30, 2020 17:32
@lara-hdr
Contributor Author

@gramalingam for review

Contributor

@SherlockNoMad SherlockNoMad left a comment

:shipit:

@lara-hdr
Contributor Author

lara-hdr commented Jan 31, 2020

@prasanthpul prasanthpul added the topic: operator Issues related to ONNX operators label Feb 1, 2020
@lara-hdr
Contributor Author

lara-hdr commented Feb 5, 2020

@ebarsoum CI is green. Thanks!

@wschin wschin merged commit 8b3f7e2 into onnx:master Feb 6, 2020
@winnietsang winnietsang added this to the 1.7 milestone Feb 13, 2020
facebook-github-bot pushed a commit to pytorch/pytorch that referenced this pull request Mar 27, 2020
Summary:
- Update Dropout and Batchnorm in opset 12 : onnx/onnx#2568
- Update api logic for exporting to ONNX training amenable models
Pull Request resolved: #32950

Reviewed By: hl475

Differential Revision: D19710370

Pulled By: houseroad

fbshipit-source-id: e5e79d38552936966662c41d39ddf33be1ba3e35
lara-hdr added a commit to lara-hdr/pytorch that referenced this pull request Mar 27, 2020

Summary:
- Update Dropout and Batchnorm in opset 12 : onnx/onnx#2568
- Update api logic for exporting to ONNX training amenable models
Pull Request resolved: pytorch#32950

Reviewed By: hl475

Differential Revision: D19710370

Pulled By: houseroad

fbshipit-source-id: e5e79d38552936966662c41d39ddf33be1ba3e35
jcwchen pushed a commit to jcwchen/onnx that referenced this pull request Sep 23, 2020
* Update Dropout and BatchNorm to be Training Friendly

* fix test name

* update ref implementation

* merge with master and re-generate docs

* fix eliminate dropout test

* missing type annotation

* update doc + shape inference

* update doc

* re-gen doc

* update doc

* update doc

* fxitest

* add hasInputShape check

* rename outputs + update doc

* static_cast for stricter CI

Co-authored-by: Wei-Sheng Chin <wechi@microsoft.com>
Co-authored-by: Emad Barsoum <ebarsoum@gmail.com>
laurentdupin pushed a commit to laurentdupin/pytorch that referenced this pull request Apr 24, 2026

Summary:
- Update Dropout and Batchnorm in opset 12 : onnx/onnx#2568
- Update api logic for exporting to ONNX training amenable models
Pull Request resolved: pytorch#32950

Reviewed By: hl475

Differential Revision: D19710370

Pulled By: houseroad

fbshipit-source-id: e5e79d38552936966662c41d39ddf33be1ba3e35

Labels

topic: operator Issues related to ONNX operators

7 participants