Add complex support for torch.nn.L1Loss #49912

Closed
soulitzer wants to merge 20 commits into pytorch:master from soulitzer:l1-complex-support

Conversation

@soulitzer (Contributor) commented Dec 29, 2020

Building on top of the work of @anjali411 (#46640)

Things added in this PR:

  1. Modify backward and double-backward formulas
  2. Add complex support for new module tests and criterion tests (and add complex tests for L1)
  3. Modify some existing tests to support complex
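As a quick illustration of what this enables, here is a minimal sketch (my own example, not from the PR; it assumes a build that includes this change, and the shapes/values are arbitrary):

    import torch
    import torch.nn as nn

    # L1Loss on complex inputs: |input - target| is real-valued, so the
    # loss is a real scalar even though the inputs are complex.
    loss_fn = nn.L1Loss()
    inp = torch.randn(4, 3, dtype=torch.complex64, requires_grad=True)
    tgt = torch.randn(4, 3, dtype=torch.complex64)

    loss = loss_fn(inp, tgt)
    loss.backward()                    # backward uses sgn(input - target)
    print(loss.dtype, inp.grad.dtype)  # torch.float32 torch.complex64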

@soulitzer added the module: nn (Related to torch.nn) and module: complex (Related to complex number support in PyTorch) labels on Dec 29, 2020
@facebook-github-bot (Contributor) commented Dec 29, 2020

💊 CI failures summary and remediations

As of commit 8aee998 (more details on the Dr. CI page):


  • 1/1 failures possibly* introduced in this PR
    • 1/1 non-CircleCI failure(s)

ci.pytorch.org: 1 failed


This comment was automatically generated by Dr. CI.

@codecov (bot) commented Dec 29, 2020

Codecov Report

Merging #49912 (8aee998) into master (f10e7aa) will increase coverage by 10.22%.
The diff coverage is 96.55%.

@@             Coverage Diff             @@
##           master   #49912       +/-   ##
===========================================
+ Coverage   70.48%   80.71%   +10.22%     
===========================================
  Files        1904     1904               
  Lines      206632   206633        +1     
===========================================
+ Hits       145653   166789    +21136     
+ Misses      60979    39844    -21135     

Comment thread test/test_nn.py Outdated
Comment thread torch/testing/_internal/common_nn.py
@anjali411 (Contributor) left a comment:

We should also update the documentation to indicate that L1Loss now supports complex numbers.

@soulitzer requested a review from anjali411 on December 30, 2020
type_map = {}
if isinstance(obj, torch.Tensor):
    assert obj.is_leaf
    t = type_map.get(obj.type(), get_gpu_type(obj.type()))
@anjali411 (Contributor) commented Jan 4, 2021:

get_gpu_type is only used in one other place, so it would be awesome if you could update that too and get rid of the get_gpu_type method:
https://github.com/pytorch/pytorch/blob/master/test/test_cuda.py#L770

Contributor:

Also, a nit: let's just change this to t = type_map.get(obj.type(), obj.type()) and change the line below to res = obj.clone().type(t).cuda().

Contributor Author:

Might as well get rid of this test in that case, since

    def test_is_tensor(self):
        for t in types:
            tensor = get_gpu_type(t)()
            self.assertTrue(torch.is_tensor(tensor))
        self.assertTrue(torch.is_tensor(torch.cuda.HalfTensor()))

becomes something like

    for t in types:
        tensor = torch.tensor(data, dtype=t).cuda()

Contributor:

Yeah right, makes sense! Let's do that.
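For reference, a hedged sketch of the helper after the suggested change (the function name to_gpu and the surrounding structure are my reconstruction from the quoted snippet, not the exact patch):

    import torch

    def to_gpu(obj, type_map=None):
        # Per the review: look up the type in type_map, falling back to
        # the tensor's own type instead of calling get_gpu_type, then
        # clone and move the result to the GPU.
        type_map = type_map or {}
        if isinstance(obj, torch.Tensor):
            assert obj.is_leaf
            t = type_map.get(obj.type(), obj.type())
            return obj.clone().type(t).cuda()
        return obj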

Comment thread torch/testing/_internal/common_nn.py Outdated
Comment thread torch/testing/_internal/common_nn.py Outdated
Comment thread aten/src/ATen/native/Loss.cpp Outdated
Comment thread aten/src/ATen/native/Loss.cpp Outdated
@soulitzer requested a review from albanD as a code owner on January 5, 2021
Comment thread test/test_nn.py Outdated
if gradOutput is None:
    gradOutput = torch.ones(())
criterion(*args).backward(gradOutput.to(input_tuple[0]))  # old
criterion(*args).backward(gradOutput.to(output))          # new
@soulitzer (Contributor Author) commented Jan 5, 2021:

For C-to-R functions, the input's dtype is not equal to the output's dtype. In general, we'd like gradOutput to be the same dtype as the output anyway.

Contributor:

Hmm, what if the output is a tuple? I think we should add a similar check for output as for input:
output_tuple = output if isinstance(output, tuple) else (output,)

Contributor Author:

Can output be a tuple though? Input might be a tuple only because, when we backward, we might want to populate the grads of multiple inputs. I'm curious which functions return tuples.

Contributor:

Not sure about torch.nn module functions, but some torch functions that come to mind are triangular_solve and qr.

Contributor:

@soulitzer did you look into this?

Contributor Author:

There are functions like nn.AdaptiveMaxPool2d that do return a tuple, so I ended up adding the check for the tuple case.
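To make the tuple case concrete, a small runnable sketch (module choice and shapes are mine, for illustration only):

    import torch
    import torch.nn as nn

    # nn.AdaptiveMaxPool2d with return_indices=True returns a tuple,
    # which is why the harness needs an output_tuple check.
    pool = nn.AdaptiveMaxPool2d((2, 2), return_indices=True)
    inp = torch.randn(1, 1, 4, 4, requires_grad=True)
    output = pool(inp)

    output_tuple = output if isinstance(output, tuple) else (output,)
    # Match gradOutput's dtype/device to the first output, as discussed:
    grad_output = torch.ones(()).to(output_tuple[0])
    output_tuple[0].backward(grad_output.expand_as(output_tuple[0]))
    print(inp.grad.shape)  # torch.Size([1, 1, 4, 4])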

auto norm = reduction == Reduction::Mean ? grad_output / input.numel() : grad_output;
// old:
at::sub_out(grad_input, input, target).sign_().mul_(norm);
return grad_input;
// new:
return at::sub_out(grad_input, input, target).sgn_().mul_(norm);
Contributor:

looks good!
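For context on the sign_ to sgn_ change: for complex z, the gradient of |z| is sgn(z) = z/|z|, and sgn reduces to sign on real inputs. A small sanity check of the formula above (my example):

    import torch

    z = torch.tensor([3 + 4j, -2 + 0j], requires_grad=True)
    z.abs().sum().backward()
    # The backward of |z| multiplies grad_output by sgn(z); for the
    # real entry -2+0j this is just sign(-2) = -1.
    print(z.grad)                 # tensor([ 0.6000+0.8000j, -1.0000+0.0000j])
    print(torch.sgn(z.detach()))  # same values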

@anjali411 (Contributor) left a comment:

Thanks @soulitzer, the PR changes look good to me overall. Let's rebase on master and check the CI tests.

Could you also remove get_gpu_type and test_is_tensor?

@soulitzer requested a review from gchanan on January 7, 2021
target_fn=lambda: torch.randn((2, 3, 4), requires_grad=True),
reference_fn=lambda i, t, _: 1. / i.numel() *
    sum((a - b).abs().sum() for a, b in zip(i, t)),
check_complex=True,
Contributor:

Is the target in this case supposed to be complex or real? The math makes it look like it should be complex, but the target created is real?

Contributor Author:

target_fn is only used by test_jit, which basically just tries to see whether the scripted module behaves the same as the Python module. I don't see it handling check_bfloat16 or check_half either.

Comment thread aten/src/ATen/native/Loss.cpp Outdated
// old:
} else {
  at::sub_out(result, input, target).abs_();
// new:
Tensor& l1_loss_out(Tensor& result, const Tensor& input, const Tensor& target, int64_t reduction) {
  auto diff = at::sub_out(result, input, target);
Contributor:

Does this cause warnings? We usually warn when the result is resized (unless it started out 0-sized).

Contributor Author:

Yep

Contributor Author:

Should be fixed in the latest update. When the shape of result matches the post-reduction shape, a warning no longer appears.
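For reference, the warning in question is PyTorch's generic out= resize warning; a small sketch of that behavior (shown with torch.add, since the Python-level functional l1_loss has no out= parameter):

    import torch
    import warnings

    x, y = torch.randn(3), torch.randn(3)

    with warnings.catch_warnings(record=True) as caught:
        warnings.simplefilter("always")
        torch.add(x, y, out=torch.empty(5))  # non-empty, wrong shape: warns
        torch.add(x, y, out=torch.empty(0))  # 0-sized out: resized silently
    print(len(caught))  # 1 -- only the first call warned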

Contributor Author:

@gchanan do you want to take another look at this?

@facebook-github-bot (Contributor) left a comment:

@soulitzer has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@soulitzer (Contributor Author) commented Jan 11, 2021:

Fixes the l1_loss case for #50382.

@facebook-github-bot (Contributor) left a comment:

@soulitzer has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@soulitzer (Contributor Author):

@anjali411 @albanD Made a non-trivial change to the code. In the latest commit, everything is now routed through the out variant instead of having two separate code paths.

Comment thread aten/src/ATen/native/Loss.cpp Outdated
  Tensor result = at::empty({0}, input.options().dtype(float_type));
  return at::l1_loss_out(result, input, target, reduction);
}
Tensor result = at::empty({0}, input.options());
Contributor:

This should go in an else branch:

Tensor result;
if (input.is_complex()) {
  ...
} else {
  result = at::empty({0}, input.options());
}

Contributor:

Oh wait, I just saw you have a return statement inside the if branch. I still think it might be cleaner to change it to an if-else statement with a common return.

Contributor Author:

Hmm, then you'd have to declare result before the if-else; otherwise it would go out of scope by the time you try to return it.

Contributor Author:

is that necessarily cleaner? :P I feel like it could be good either way.

Contributor Author:

Wait, if you 'declare' it before, you are technically doing an extra default initialization and then copy-assigning, instead of simply copy-initializing.

Collaborator:

A nice trick for this that @ezyang showed me is to use a lambda:

const auto float_type = [&]() {
  if (input.is_complex()) {
    return c10::toValueType(input.scalar_type());
  } else {
    return input.scalar_type();
  }
}();
Tensor result = at::empty({0}, input.options().dtype(float_type));
return at::l1_loss_out(result, input, target, reduction);

But beyond that, what happens if you call c10::toValueType on a non-complex dtype? Is it just returned as-is? If so, you don't need branching in this function at all!

Contributor Author:

Ahh, you're right: c10::toValueType does handle non-complex dtypes by just returning them as-is.
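A quick way to see the equivalent behavior from Python (an illustrative analogue of c10::toValueType, not the C++ API itself; it relies on Tensor.real returning the tensor unchanged for real dtypes):

    import torch

    def value_type(dtype):
        # Complex dtypes map to their real value type; real dtypes come
        # back unchanged, mirroring c10::toValueType.
        return torch.empty(0, dtype=dtype).real.dtype

    print(value_type(torch.complex64))   # torch.float32
    print(value_type(torch.complex128))  # torch.float64
    print(value_type(torch.float32))     # torch.float32 (as-is)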

@albanD (Collaborator) left a comment:

LGTM, just a small potential simplification of the composite l1_loss.

@facebook-github-bot (Contributor) left a comment:

@soulitzer has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@anjali411 (Contributor):

@soulitzer thanks again! This PR looks good to me with the current changes. Is there anything blocking it?

@soulitzer (Contributor Author):

@anjali411 I actually almost landed this yesterday, but held off due to the CI issues. One thing I wanted to check again was whether there were any updates to gen_variable_type that caused conflicts the last time I pulled, but it doesn't seem like there are, so it's on its way now!

@facebook-github-bot (Contributor):

@soulitzer merged this pull request in 6e3e570.

@soulitzer deleted the l1-complex-support branch on April 14, 2021
@AhmedBoin:

For the second time: you can implement your own

def complex_mse_loss(output, target):
    # Note: this returns a complex scalar; a real variant would use (output - target).abs() ** 2.
    return (0.5 * (output - target) ** 2).mean(dtype=torch.complex64)

You can also implement layers or any custom utilities you need:

class CLinear(nn.Module):
    def __init__(self, size_in, size_out):
        super().__init__()
        self.weights = nn.Parameter(torch.randn(size_in, size_out, dtype=torch.complex64))
        self.bias = nn.Parameter(torch.zeros(size_out, dtype=torch.complex64))

    def forward(self, x):
        if x.dtype != torch.complex64:
            x = x.type(torch.complex64)
        return x @ self.weights + self.bias
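A hedged usage sketch for the snippets above (torch and torch.nn assumed imported; shapes arbitrary). Note the complex-valued loss needs a real reduction such as .abs() before backward(), since autograd requires a real scalar loss:

    layer = CLinear(4, 2)
    x = torch.randn(3, 4)            # real input is cast to complex64
    out = layer(x)                   # shape (3, 2), dtype complex64

    loss = complex_mse_loss(out, torch.zeros_like(out)).abs()
    loss.backward()
    print(layer.weights.grad.dtype)  # torch.complex64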

laurentdupin pushed a commit to laurentdupin/pytorch that referenced this pull request Apr 24, 2026
Summary:
Building on top of the work of anjali411 (pytorch#46640)

Things added in this PR:
1. Modify backward and double-backward formulas
2. Add complex support for `new module tests` and criterion tests (and add complex tests for L1)
3. Modify some existing tests to support complex

Pull Request resolved: pytorch#49912

Reviewed By: zhangguanheng66

Differential Revision: D25853036

Pulled By: soulitzer

fbshipit-source-id: df619f1b71c450ab2818eb17804e0c55990aa8ad

Labels

cla signed, complex_autograd, Merged, module: complex (Related to complex number support in PyTorch), module: nn (Related to torch.nn)

6 participants