BUG: Fix `np.einsum` errors on Power9 Linux and z/Linux by jwoehr · Pull Request #14693 · numpy/numpy

jwoehr · 2019-10-14T02:13:44Z

BUG: fixes #14692 on Power 9 and z/Linux

seberg · 2019-10-14T03:02:05Z

So clang-tidy warns about: out_labels[ndim++] = label; (line 1946) and i = out_label - output_labels; (line 2055) and axes[i] = match - labels; (line 2232) (narrowing casts). I have no idea if those are actually particularly problematic, but thought I would mention.

jwoehr · 2019-10-14T03:08:50Z

So clang-tidy warns about: out_labels[ndim++] = label; (line 1946) and i = out_label - output_labels; (line 2055) and axes[i] = match - labels; (line 2232) (narrowing casts). I have no idea if those are actually particularly problematic, but thought I would mention.

I will explore this tomorrow. Thank you for all your help!

…_buglet

mattip · 2019-10-19T19:41:05Z

Every fix should have a test case. I am not sure we need 4d4cc4d, can you add a test that fails before and passes after to prove it is required? The fix in 68861de is definitely needed for np.einsum('abab', x).

mattip · 2019-10-19T19:47:30Z

numpy/core/src/multiarray/einsum.c.src

         * need it to be signed here.
         */
-        label = (signed char)labels[idim];
+        label = labels[idim];


Why this change?

mattip · 2019-10-19T19:47:42Z

numpy/core/src/multiarray/einsum.c.src

         * need it to be signed here.
         */
-        int label = (signed char)labels[idim];
+        int label = labels[idim];


Why this change?

@mattip we did that because it made it work on s390x and power9.
Of course it broke x86_64!
I thought I had reverted to a clean copy of master and made your change.
Now I think I made a MisTeAk :)
I will look at what I did and push again.

mattip · 2019-10-19T19:51:23Z

The test should be added to numpy/core/tests/test_einsum.py, maybe next to test_einsum_fixed_collapsingbug. Note that random.rand(10, 10, 10, 10) should be spelled random.random_sample((10, 10, 10, 10)). The test should mention the issues gh-14692 and gh-12689.

jwoehr · 2019-10-19T19:57:43Z

The test should be added to numpy/core/tests/test_einsum.py, maybe next to test_einsum_fixed_collapsingbug. Note that random.rand(10, 10, 10, 10) should be spelled random.random_sample((10, 10, 10, 10)). The test should mention the issues gh-14692 and gh-12689.

If it passes testing, I'll add that, thanks, @mattip

jwoehr · 2019-10-19T20:12:53Z

Note that random.rand(10, 10, 10, 10) should be spelled random.random_sample((10, 10, 10, 10)).

Hmm, this does not work:

tensor = np.random.random_sample(10, 10, 10, 10)
  File "mtrand.pyx", line 370, in numpy.random.mtrand.RandomState.random_sample
TypeError: random_sample() takes at most 1 positional argument (4 given)

mattip · 2019-10-19T20:17:40Z

Are you sure you used my code? It looks different to me.

mattip · 2019-10-19T20:18:02Z

The documented developer workflow is to add tests as part of the development process. I find it helpful to:

reproduce a failure locally from the CLI with python runtests.py --python which will compile and use a HEAD version of numpy
add a test that fails when run locally with python runtests.py -t path/to/changed-file
add a fix and verify that the test suite runs locally with python runtests.py
push the test and the fix together.

This makes sure your fix is correct and prevents unneeded CI runs. In this case, I needed to dive in with gdb to find out exactly what was going wrong so a test was critical to find the missing cast.

jwoehr · 2019-10-19T20:25:20Z

Are you sure you used my code? It looks different to me.

aha :) Lots of parentheses

jwoehr · 2019-10-19T20:32:11Z

w/r/t the test case, does it have to test some kind of result, or is it enough that the failure is that an exception is thrown?

mattip · 2019-10-19T20:55:08Z

numpy/core/tests/test_einsum.py

+        # Bug with signed vs unsigned char errored on power9 and s390x Linux
+        tensor = np.random.random_sample((10, 10, 10, 10))
+        print(np.einsum('ijij->', tensor))
+


Maybe (please check locally before pushing)

x = np.einsum('ijij->', tensor) y = tensor.trace(axis1=0, axis2=2).trace() assert_equal(x, y)

With floating point, you can't be sure that x and y will be exactly equal. Use assert_allclose with an appropriately small tolerance, or use an integer array for tensor (assuming the test doesn't actually depend on the data type of the array).

Done; tested on x86_64, power9, and s390x; and pushed.

mattip · 2019-10-20T03:23:52Z

Thanks @jwoehr

Backport of numpy#14693. Fixes numpy#14692 on Power 9 and z/Linux

fixes #14692 np.einsum errors on Power9 Linux and z/Linux

4d4cc4d

seberg added 00 - Bug component: numpy.einsum labels Oct 14, 2019

seberg changed the title ~~fixes #14692 np.einsum errors on Power9 Linux and z/Linux~~ BUG: Fix np.einsum errors on Power9 Linux and z/Linux Oct 15, 2019

jwoehr added 3 commits October 18, 2019 15:25

Merge branch 'master' of https://github.com/numpy/numpy into einsum_c…

cc3da40

…_buglet

Merge branch 'master' of https://github.com/numpy/numpy into einsum_c…

2af95c4

…_buglet

change suggested by mattip

68861de

mattip reviewed Oct 19, 2019

View reviewed changes

oops removed an (signed char) ... fixed

d9ecdea

charris added 06 - Regression 09 - Backport-Candidate PRs tagged should be backported labels Oct 19, 2019

charris added this to the 1.17.4 release. milestone Oct 19, 2019

added test case test_einsum_failed_on_p9_and_s390x(self)

0906162

mattip reviewed Oct 19, 2019

View reviewed changes

changed test to assert_allclose() the output values

263f616

mattip merged commit 9c8e904 into numpy:master Oct 20, 2019

mattip mentioned this pull request Oct 20, 2019

einsum error for mixed subscripts on s390x #12689

Closed

jwoehr deleted the einsum_c_buglet branch October 20, 2019 04:08

charris added a commit to charris/numpy that referenced this pull request Nov 8, 2019

BUG: Fix np.einsum errors on Power9 Linux and z/Linux

927af22

Backport of numpy#14693. Fixes numpy#14692 on Power 9 and z/Linux

charris mentioned this pull request Nov 8, 2019

BUG: Fix np.einsum errors on Power9 Linux and z/Linux #14855

Merged

charris added a commit to charris/numpy that referenced this pull request Nov 8, 2019

BUG: Fix np.einsum errors on Power9 Linux and z/Linux

566f9eb

Backport of numpy#14693. Fixes numpy#14692 on Power 9 and z/Linux

charris mentioned this pull request Nov 8, 2019

BUG: Fix np.einsum errors on Power9 Linux and z/Linux #14856

Merged

charris removed the 09 - Backport-Candidate PRs tagged should be backported label Nov 8, 2019

charris removed this from the 1.17.4 release. milestone Nov 8, 2019

seberg mentioned this pull request Jan 16, 2020

BUILD: use standard build of OpenBLAS for aarch64, ppc64le, s390x #15279

Merged

Uh oh!

Conversation

jwoehr commented Oct 14, 2019 • edited by mattip Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

seberg commented Oct 14, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jwoehr commented Oct 14, 2019

Uh oh!

mattip commented Oct 19, 2019

Uh oh!

mattip Oct 19, 2019

Choose a reason for hiding this comment

Uh oh!

mattip Oct 19, 2019

Choose a reason for hiding this comment

Uh oh!

jwoehr Oct 19, 2019

Choose a reason for hiding this comment

Uh oh!

mattip commented Oct 19, 2019

Uh oh!

jwoehr commented Oct 19, 2019

Uh oh!

jwoehr commented Oct 19, 2019

Uh oh!

mattip commented Oct 19, 2019

Uh oh!

mattip commented Oct 19, 2019

Uh oh!

jwoehr commented Oct 19, 2019

Uh oh!

jwoehr commented Oct 19, 2019

Uh oh!

mattip Oct 19, 2019

Choose a reason for hiding this comment

Uh oh!

WarrenWeckesser Oct 19, 2019

Choose a reason for hiding this comment

Uh oh!

jwoehr Oct 19, 2019

Choose a reason for hiding this comment

Uh oh!

mattip commented Oct 20, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

jwoehr commented Oct 14, 2019 •

edited by mattip

Loading

seberg commented Oct 14, 2019 •

edited

Loading