Fix print precision and match numpy behavior by ailzhang · Pull Request #12746 · pytorch/pytorch

ailzhang · 2018-10-17T03:41:58Z

Fix and simplify print logic
Follow numpy print rule https://github.com/numpy/numpy/blob/eb2bd11870731ea19a0eee72e616c7deb00f6c54/numpy/core/arrayprint.py#L859

scientific notation is used when absolute value of the smallest number is < 1e-4 or maximum > 1e8 or the ratio of the maximum absolute value to the minimum is > 1e3

I hope I didn't break anything since there seems to be a lot of edge cases here... Here are some easy sanity checks.

# For int tensor, we just print, never use scientific.
In [5]: torch.tensor(1)
Out[5]: tensor(1)
Out[2]: array(1) # numpy

In [6]: torch.tensor(10)
Out[6]: tensor(10)
Out[3]: array(10) # numpy

In [8]: torch.tensor(99000000)
Out[8]: tensor(99000000)
Out[5]: array(99000000) # numpy

In [9]: torch.tensor(100000000)
Out[9]: tensor(100000000)
Out[6]: array(100000000) # numpy

In [10]: torch.tensor(100000001)
Out[10]: tensor(100000001)
Out[7]: array(100000001) # numpy

In [11]: torch.tensor(1000000000)
Out[11]: tensor(1000000000)
Out[8]: array(1000000000) # numpy

In [12]: torch.tensor([1, 1000])
Out[12]: tensor([   1, 1000])
Out[9]: array([   1, 1000]) # numpy

In [13]: torch.tensor([1, 1010])
Out[13]: tensor([   1, 1010])
Out[10]: array([   1, 1010]) # numpy

For floating points, we use scientific when max/min > 1000 || max > 1e8 || min < 1e-4
Lines with "old" are old behaviors that either has precision issue, or not aligned with numpy

In [14]: torch.tensor(0.01)
Out[14]: tensor(0.0100)
Out[11]: array(0.01) # numpy

In [15]: torch.tensor(0.1)
Out[15]: tensor(0.1000)
Out[12]: array(0.1) # numpy

In [16]: torch.tensor(0.0001)
Out[16]: tensor(0.0001)
Out[14]: array(0.0001) # numpy

In [17]: torch.tensor(0.00002)
Out[17]: tensor(2.0000e-05)
Out[15]: array(2e-05) # numpy
Out[5]: tensor(0.0000) # old

In [18]: torch.tensor(1e8)
Out[18]: tensor(100000000.)
Out[16]: array(100000000.0) # numpy

In [19]: torch.tensor(1.1e8)
Out[19]: tensor(1.1000e+08)
Out[17]: array(1.1e8) # numpy 1.14.5, In <= 1.13 this was not using scientific print
Out[10]: tensor(110000000.) # old

In [20]: torch.tensor([0.01, 10.])
Out[20]: tensor([ 0.0100, 10.0000])
Out[18]: array([  0.01,  10.  ]) # numpy

In [21]: torch.tensor([0.01, 11.])
Out[21]: tensor([1.0000e-02, 1.1000e+01])
Out[19]: array([  1.00000000e-02,   1.10000000e+01]) # numpy
Out[7]: tensor([ 0.0100, 11.0000]) # old

When print floating number in int mode, we still need to respect rules to use scientific mode first

In [22]: torch.tensor([1., 1000.])
Out[22]: tensor([   1., 1000.])
Out[20]: array([    1.,  1000.]) # numpy

In [23]: torch.tensor([1., 1010.])
Out[23]: tensor([1.0000e+00, 1.0100e+03])
Out[21]: array([  1.00000000e+00,   1.01000000e+03]) # numpy
Out[9]: tensor([   1., 1010.]) # old

soumith

the test cases in the issue description, where you match numpy behavior,, make them test cases.

Sign in to view

        self.max_width = 1

+        # use tensor_view for 0-dim tensor iteration
+        tensor_view = tensor.view(tensor.nelement())


ailzhang · 2018-10-17T23:53:25Z

@soumith I added a few tests. There is one thing left todo:
As you can see from examples above, Numpy does automatic trimming and padding while we use a fixed precision of 4. Numpy default is 8. But this causes a problem when we try to print a really long tensor, we actually force the print precision to be 4:

>>> np.array([123456789.])
array([1.23456789e+08])
>>> torch.tensor([123456789.])
tensor([1.2346e+08])

I actually think we should port the dragon4_scientific code from Numpy here to make the print prettier. But it should be in a separate PR and depends on the priority. I opened #12797 for it.
https://github.com/numpy/numpy/blob/f36d2d4d3f622f7901e3d5ade13e04fc05062948/numpy/core/src/multiarray/multiarraymodule.c#L3530

soumith

lgtm. Instead of separate expect files per each string, it feels like it's better to inline the expected strings in the tests. Making a separate file doesn't add a lot of value here, and one has to jump 1 extra file to find out what the expected value is supposed to be.

facebook-github-bot

ailzhang has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

ezyang · 2018-10-18T16:35:12Z

I implemented inline expect tests in my spare time. Let me put it in PyTorch.

facebook-github-bot

ailzhang has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

facebook-github-bot

ailzhang has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

facebook-github-bot

ailzhang has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

Summary: Fixes pytorch#12578 pytorch#9395. * Fix and simplify print logic * Follow numpy print rule https://github.com/numpy/numpy/blob/eb2bd11870731ea19a0eee72e616c7deb00f6c54/numpy/core/arrayprint.py#L859 > scientific notation is used when absolute value of the smallest number is < 1e-4 or maximum > 1e8 or the ratio of the maximum absolute value to the minimum is > 1e3 I hope I didn't break anything since there seems to be a lot of edge cases here... Here are some easy sanity checks. ``` In [5]: torch.tensor(1) Out[5]: tensor(1) Out[2]: array(1) # numpy In [6]: torch.tensor(10) Out[6]: tensor(10) Out[3]: array(10) # numpy In [8]: torch.tensor(99000000) Out[8]: tensor(99000000) Out[5]: array(99000000) # numpy In [9]: torch.tensor(100000000) Out[9]: tensor(100000000) Out[6]: array(100000000) # numpy In [10]: torch.tensor(100000001) Out[10]: tensor(100000001) Out[7]: array(100000001) # numpy In [11]: torch.tensor(1000000000) Out[11]: tensor(1000000000) Out[8]: array(1000000000) # numpy In [12]: torch.tensor([1, 1000]) Out[12]: tensor([ 1, 1000]) Out[9]: array([ 1, 1000]) # numpy In [13]: torch.tensor([1, 1010]) Out[13]: tensor([ 1, 1010]) Out[10]: array([ 1, 1010]) # numpy ``` For floating points, we use scientific when `max/min > 1000 || max > 1e8 || min < 1e-4` Lines with "old" are old behaviors that either has precision issue, or not aligned with numpy ``` In [14]: torch.tensor(0.01) Out[14]: tensor(0.0100) Out[11]: array(0.01) # numpy In [15]: torch.tensor(0.1) Out[15]: tensor(0.1000) Out[12]: array(0.1) # numpy In [16]: torch.tensor(0.0001) Out[16]: tensor(0.0001) Out[14]: array(0.0001) # numpy In [17]: torch.tensor(0.00002) Out[17]: tensor(2.0000e-05) Out[15]: array(2e-05) # numpy Out[5]: tensor(0.0000) # old In [18]: torch.tensor(1e8) Out[18]: tensor(100000000.) Out[16]: array(100000000.0) # numpy In [19]: torch.tensor(1.1e8) Out[19]: tensor(1.1000e+08) Out[17]: array(1.1e8) # numpy 1.14.5, In <= 1.13 this was not using scientific print Out[10]: tensor(110000000.) # old In [20]: torch.tensor([0.01, 10.]) Out[20]: tensor([ 0.0100, 10.0000]) Out[18]: array([ 0.01, 10. ]) # numpy In [21]: torch.tensor([0.01, 11.]) Out[21]: tensor([1.0000e-02, 1.1000e+01]) Out[19]: array([ 1.00000000e-02, 1.10000000e+01]) # numpy Out[7]: tensor([ 0.0100, 11.0000]) # old ``` When print floating number in int mode, we still need to respect rules to use scientific mode first ``` In [22]: torch.tensor([1., 1000.]) Out[22]: tensor([ 1., 1000.]) Out[20]: array([ 1., 1000.]) # numpy In [23]: torch.tensor([1., 1010.]) Out[23]: tensor([1.0000e+00, 1.0100e+03]) Out[21]: array([ 1.00000000e+00, 1.01000000e+03]) # numpy Out[9]: tensor([ 1., 1010.]) # old ``` Pull Request resolved: pytorch#12746 Differential Revision: D10443800 Pulled By: ailzhang fbshipit-source-id: f5e4e3fe9bf0b44af2c64c93a9ed42b73fa613f5

Ailing Zhang added 3 commits October 16, 2018 17:14

fix printing precision issue

e044a1c

use better var name

1d202a1

sci_mode has higher pri than int_mode

9f61426

soumith requested changes Oct 17, 2018

View reviewed changes

vishwakftw reviewed Oct 17, 2018

View reviewed changes

Comment thread torch/_tensor_str.py Outdated

self.max_width = 1

# use tensor_view for 0-dim tensor iteration

tensor_view = tensor.view(tensor.nelement())

This comment was marked as off-topic.

Sign in to view

Ailing Zhang added 3 commits October 17, 2018 13:40

fix failed tests

f22e586

int_mode for float only use sci_mode when max/min > 1000

d4c5f60

add tests to match numpy behavior

e92f38c

ailzhang force-pushed the fix_print_prec branch from abc306f to e92f38c Compare October 17, 2018 23:29

Ailing Zhang added 2 commits October 17, 2018 16:32

Merge remote-tracking branch 'upstream/master' into fix_print_prec

e237bba

our default precision is actually 4 instead of 8

131df0a

ailzhang mentioned this pull request Oct 17, 2018

Port dragon4_scientific for pretty float tensor print. #12797

Open

Ailing Zhang added 2 commits October 17, 2018 19:04

fix sparse test

5400cbf

Merge remote-tracking branch 'upstream/master' into fix_print_prec

9851b75

soumith approved these changes Oct 18, 2018

View reviewed changes

use inline comparison except for one large print

1454193

facebook-github-bot reviewed Oct 18, 2018

View reviewed changes

fix lint

4510750

facebook-github-bot reviewed Oct 18, 2018

View reviewed changes

Ailing Zhang added 3 commits October 22, 2018 20:01

Merge remote-tracking branch 'upstream/master' into fix_print_prec

cfb5b17

use assertExpectedInline

3fe7e65

Merge remote-tracking branch 'upstream/master' into fix_print_prec

a361b6a

facebook-github-bot reviewed Oct 23, 2018

View reviewed changes

Merge remote-tracking branch 'upstream/master' into fix_print_prec

8a5add8

facebook-github-bot reviewed Oct 24, 2018

View reviewed changes

facebook-github-bot closed this in 478886b Oct 25, 2018

vadimkantorov mentioned this pull request Oct 26, 2018

[pytorch] Strange tensor printing behavior #9395

Closed

ezyang added the merged label Jun 25, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix print precision and match numpy behavior#12746

Fix print precision and match numpy behavior#12746
ailzhang wants to merge 16 commits intopytorch:masterfrom
ailzhang:fix_print_prec

ailzhang commented Oct 17, 2018 •

edited

Loading

Uh oh!

soumith left a comment

Uh oh!

This comment was marked as off-topic.

Uh oh!

ailzhang commented Oct 17, 2018 •

edited

Loading

Uh oh!

soumith left a comment

Uh oh!

facebook-github-bot left a comment

Uh oh!

ezyang commented Oct 18, 2018

Uh oh!

facebook-github-bot left a comment

Uh oh!

facebook-github-bot left a comment

Uh oh!

facebook-github-bot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

ailzhang commented Oct 17, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

soumith left a comment

Choose a reason for hiding this comment

Uh oh!

This comment was marked as off-topic.

Uh oh!

ailzhang commented Oct 17, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

soumith left a comment

Choose a reason for hiding this comment

Uh oh!

facebook-github-bot left a comment

Choose a reason for hiding this comment

Uh oh!

ezyang commented Oct 18, 2018

Uh oh!

facebook-github-bot left a comment

Choose a reason for hiding this comment

Uh oh!

facebook-github-bot left a comment

Choose a reason for hiding this comment

Uh oh!

facebook-github-bot left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

ailzhang commented Oct 17, 2018 •

edited

Loading

ailzhang commented Oct 17, 2018 •

edited

Loading