[JIT] Fix the clamp special case and gradient problem on None, add None to JIT#9596
wanchaol wants to merge 7 commits into pytorch:master
Conversation
wanchaol force-pushed from 18c9009 to 9e9d917.
facebook-github-bot left a comment:
@wanchaol has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
wanchaol force-pushed from e085b8e to da2ac8b.
Does the following run correctly?

@zou3519 Yes, here is the output: I will also add more subtle test cases like this one to try to capture every corner case.
zdevito left a comment:
This looks good! I have just minor comments. If it is possible to remove the special-casing of clamp in the python frontend then we should.
wanchaol force-pushed from 7610648 to bd853b1.
wanchaol force-pushed from bd9a1cc to fad128b.
…JIT (#9596)
Summary:
Supersedes #8925
This PR fixes #8502: it fixes the gradient problem for clamp when None is passed to the function, and adds support for NoneLiteral and NoneType in script to enable the clamp tests. Corner cases like the following now work:
```python
@torch.jit.script
def func():
    x = torch.randn(3, 3, requires_grad=True)
    y = torch.clamp(x, None, 0)  # max = 0
    y = torch.clamp(x, min=None, max=0)
```
In both the JIT and ATen, we use Scalar(NAN) as a sentinel value when None is passed to clamp; this is how we currently support the None type in the JIT, and it solves the gradient problem when the user explicitly passes None to clamp.
On the JIT side, we create a tensor(NAN) and an undefined tensor when we encounter None while matching the function schema; later, the interpreter translates it to Scalar(NAN) if needed.
Ideally we would not need clamp_min and clamp_max in ATen native/autograd and could support only clamp after this change, but since a bunch of other operators (e.g. Activation.cpp, Loss.cpp) use clamp_min in several places, the functions remain available; all Python invocations, however, will call only clamp instead of clamp_min/clamp_max (with clamp calling the underlying th_max/th_min).
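The NaN-sentinel pattern described above can be sketched in plain, torch-free Python (the helper names `to_sentinel` and `clamp` here are hypothetical, for illustration only): an explicit None bound is replaced by NaN, and the clamp kernel treats a NaN bound as "no bound" because every comparison against NaN evaluates to False:

```python
NAN = float("nan")

def to_sentinel(bound):
    # Mirrors the schema-matching step: an explicit None becomes Scalar(NAN).
    return NAN if bound is None else float(bound)

def clamp(x, min=None, max=None):
    # Comparisons against NaN are always False, so a NaN bound never
    # triggers clamping -- the element passes through unchanged, which
    # is exactly the "no bound" behavior we want for None.
    lo, hi = to_sentinel(min), to_sentinel(max)
    if x < lo:   # False when lo is NaN
        return lo
    if x > hi:   # False when hi is NaN
        return hi
    return x

print(clamp(5.0, None, 0.0))   # max-only clamp -> 0.0
print(clamp(-2.0, 0.0, None))  # min-only clamp -> 0.0
print(clamp(0.5, 0.0, 1.0))    # both bounds    -> 0.5
```

This also hints at why the gradient works out: with a NaN bound, the element is never replaced by the bound, so the pass-through branch (gradient 1) is taken for every element on that side.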
zdevito jamesr66a
Pull Request resolved: pytorch/pytorch#9596
Reviewed By: zdevito
Differential Revision: D8940839
Pulled By: wanchaol
fbshipit-source-id: c543a867b82e0ab8c99384773b173fdde2605d28