Fix #11752: fix numerical issue in log_softmax#21672
wolegechu wants to merge 13 commits into pytorch:master from wolegechu:fix_numerical_issue_in_log_softmax
Conversation
test/test_nn.py
Outdated
    def test_log_softmax(self):
        x_small = torch.ones(1, 2, dtype=torch.float32)
        x_big = x_small * 1e16
softmax and log_softmax should be invariant under addition, but not multiplication, right? (The multiplicative factor is the inverse temperature, right?)
Should this line read x_big = x_small + 1e16?
Oops..., this test only happens to work in this special case; it's a mistake. I've pushed a new commit.
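The corrected check can be illustrated outside of PyTorch with a small NumPy sketch. The `log_softmax` helper below is illustrative only, not the PyTorch kernel; it just uses the same max-then-sum subtraction order as the fix, so the additive shift leaves the result unchanged:

```python
import numpy as np

def log_softmax(x):
    # Numerically stable log_softmax: subtract the row max, then the
    # log-sum-exp, keeping the left-to-right subtraction order.
    m = x.max(axis=-1, keepdims=True)
    s = np.log(np.exp(x - m).sum(axis=-1, keepdims=True))
    return x - m - s

x_small = np.ones((1, 2), dtype=np.float32)
x_big = x_small + np.float32(1e16)  # additive shift, not multiplicative scaling

# log_softmax is invariant under adding a constant to every logit,
# so the hugely shifted input must give the same output.
assert np.allclose(log_softmax(x_small), log_softmax(x_big))
```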
      scalar_t max_input = max_input_arr[j];
      vec256::map(
-         [tmp_sum](Vec x) { return x - Vec(tmp_sum); },
+         [tmp_sum, max_input](Vec x) { return x - Vec(max_input) - Vec(tmp_sum); },
Does the compiler optimize this into computing max_input + tmp_sum before the map?
It shouldn't be optimized into computing the sum beforehand.
In some cases the inputs are large values and the differences are small, so a precomputed offset would swallow the small correction term (as happened in the old code). You can see the example here: #11752 (comment)
I have also looked at the log_softmax implementations in MXNet and TensorFlow. They likewise write x - Vec(max_input) - Vec(tmp_sum) together to fix the order of evaluation.
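Why the subtraction order matters can be shown with a tiny float32 sketch (the concrete values here are illustrative assumptions, chosen to mimic the big-logit example in #11752):

```python
import numpy as np

x = np.float32(1e16)                # a big logit
max_input = np.float32(1e16)        # the row max
tmp_sum = np.float32(np.log(2.0))   # log-sum-exp after max subtraction

# Subtracting max_input first cancels the huge terms, so the small
# correction tmp_sum survives:
ordered = (x - max_input) - tmp_sum   # -log(2)

# Folding max_input and tmp_sum together first loses tmp_sum entirely:
# float32 cannot represent 1e16 + 0.693 distinctly from 1e16.
folded = x - (max_input + tmp_sum)    # 0.0

assert ordered != folded
```

This is why the lambda must compute `x - Vec(max_input) - Vec(tmp_sum)` left to right rather than subtracting a precomputed `max_input + tmp_sum`.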
Could you add a comment here so people won't "optimize" this away in the future? Thanks!
…echu/pytorch into fix_numerical_issue_in_log_softmax
wolegechu
left a comment
@VitalyFedyunin So strange... I merged the test/test_nn.py file from the upstream master branch into my branch. Now GitHub highlights code in green that doesn't belong to me.
…echu/pytorch into fix_numerical_issue_in_log_softmax
The merge pulled in some new commits that had landed on master after I created this PR. @VitalyFedyunin emmmmm... Anyway, I have removed all the unrelated changes.
facebook-github-bot
left a comment
@VitalyFedyunin has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
Summary: pytorch/pytorch#11866 corrected this issue in the function `host_softmax` (aten/src/ATen/native/SoftMax.cpp). But when I tried the example proposed in pytorch/pytorch#11752, `log_softmax` was still not working for big logits. I looked into the source code and found that the example had called `vec_host_softmax_lastdim`, not `host_softmax`. This code fixes the issue in `_vec_log_softmax_lastdim` and adds a test for `log_softmax`.
Pull Request resolved: pytorch/pytorch#21672
Differential Revision: D15856327
Pulled By: VitalyFedyunin
fbshipit-source-id: 7a1fd3c0a03d366c99eb873e235361e4fcfa7567
@VitalyFedyunin merged this pull request in b403b10.