CI: Add new workflow/action for testing universal intrinsics on armv7 by seiko2plus · Pull Request #20206 · numpy/numpy

seiko2plus · 2021-10-27T09:01:03Z

The new action uses qemu to emulate a full container of armv7/ubuntu on x86_64.

To speed up compilation, a bridge has been set up between the host and the
emulated container to ship x86_64 binaries of the GCC cross-compiler, which
are executed natively during the build.

seiko2plus · 2021-10-27T13:00:18Z

+          g++ --version &&
+          pip3 --version &&
+          python3 --version &&
+          cd /numpy && pip3 install -v ./


Suggested change

-          cd /numpy && pip3 install -v ./
+          cd /numpy && python3 setup.py install

I wonder if there's a way to filter the pip3 log so that only the build log is printed.

Using bare pip3 is dangerous when several Pythons may be installed; python -m pip is safer.

I agree, but this armv7 container is guaranteed to contain only one version of python3. Either way, it wouldn't reduce the output of pip3; it seems there's no way to filter the verbose mode. I just wanted to keep pip3/python3 -m pip since it was part of the #19820 investigation.

Would it be fine to just use python3 setup.py install instead? It seems to me that building NumPy via pip3 works just fine on armv7; only one bug was discovered, related to static casting and affected by #19713.

> would it be fine to just use python3 setup.py install

I think these days the recommendation is "python3 -m pip install .". That should be future-proof even if we change the build system. As of now, of course, it ends up calling setup.py underneath.

Alright, I stuck with python3 setup.py install to expose only the build log, since the main goal of this test is to sanity-check the universal intrinsics.

mattip · 2021-10-27T18:06:27Z

Nice. The runs I saw take about 18 minutes, which is around the time of the pypy38 run. The longest CI run is the "full" one at 27 minutes. It seems all the CI jobs run in parallel, so this is not slowing down the overall CI time.

mattip · 2021-10-27T18:10:34Z

Would it be worthwhile to use the system blas libraries via apt?

These errors/warnings occur when sizeof(long double) == sizeof(double).

seiko2plus · 2021-10-28T06:44:47Z

> Would it be worthwhile to use the system blas libraries via apt?

I'm not sure that would change anything. Keep in mind that only the SIMD tests are running at the moment; we can expand it later to cover other tests such as umath, but qemu is extremely slow and not reliable. The build is fairly fast since the compiler runs natively (mixing x86_64 and armhf binaries).

mattip · 2021-10-28T07:13:56Z

Thanks @seiko2plus. I missed that this is only running the SIMD tests, which makes sense.

matthew-brett · 2021-11-10T12:32:16Z

-        typedef long double npy_longdouble;
        #define NPY_LONGDOUBLE_FMT "Lg"
 #endif
+typedef long double npy_longdouble;


Can I revisit this one? @carlkl pointed out that this is a rather significant change in behavior. Previously, for platforms and compilers where double == long double, at Numpy compile time, such as MSVC, this would give npy_longdouble as double. This is useful in the case when we are compiling using Numpy, but with a compiler where it is not true that double == long double. One example is compiling Scipy with mingw-w64, where, by default long double is Float80. Could y'all say more about why this change is needed? Why do we need to force npy_longdouble to be the compiler long double, rather than Numpy long double?
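For readers who want to see which situation a given machine is in, here is a minimal sketch using only the standard library. It is an assumption-laden illustration, not something from the PR: ctypes reports the sizes for the C compiler and ABI that the running Python interpreter was built with, which may differ from the compiler later used to build an extension (exactly the mismatch discussed here). The function name longdouble_is_double is hypothetical.

```python
import ctypes


def longdouble_is_double():
    """Return True when C long double has the same size as double.

    The answer reflects the ABI of the running interpreter only; another
    compiler on the same machine may disagree.
    """
    return ctypes.sizeof(ctypes.c_longdouble) == ctypes.sizeof(ctypes.c_double)


if __name__ == "__main__":
    print("sizeof(double)      =", ctypes.sizeof(ctypes.c_double))
    print("sizeof(long double) =", ctypes.sizeof(ctypes.c_longdouble))
    print("long double == double:", longdouble_is_double())
```

Equal size is used here as the criterion because that is the condition named in this thread (sizeof(long double) == sizeof(double)); it says nothing about what a different toolchain would report.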

Sorry, I think this was my mistake in not looking carefully enough at the PR. I agree we should back out this change.

This change was needed for the C++ build to pass when double == long double, since the undefined cast causes compile errors in C++; it also removes the massive number of cast warnings in the C sources.

This PR is also linked with #20210.

I completely believe that this change fixed the C++ build, but this is a backwards-incompatible change in the Numpy include headers used by all projects compiling against Numpy, so I do think we need to devote some serious thought to whether this is the right way to fix the C++ build.

> Previously, for platforms and compilers where double == long double, at Numpy compile time, such as MSVC, this would give npy_longdouble as double. This is useful in the case when we are compiling using Numpy, but with a compiler where it is not true that double == long double.

Can we spell the broken workflow(s) out a bit more (sorry, this stuff is confusing)? Is the broken case "build NumPy with MSVC, then SciPy with Mingw"? And does "build both NumPy and SciPy with Mingw" still work?

What we need is an additional branch like this:

#if defined __MINGW32__ && __LDBL_DIG__ != __DBL_DIG__
    typedef double npy_longdouble;
#else
    typedef long double npy_longdouble;
#endif

Explanation: this works with the mingw-w64 toolchain regardless of whether gcc is invoked with the additional -mlong-double-64 switch. It would also work correctly for the Msys2 toolchains, where long double is defined as extended precision (FLOAT80).
EDIT: this has to be put inside the #if NPY_SIZEOF_LONGDOUBLE == NPY_SIZEOF_DOUBLE branch, of course:

#if NPY_SIZEOF_LONGDOUBLE == NPY_SIZEOF_DOUBLE
    #if defined __MINGW32__ && __LDBL_DIG__ != __DBL_DIG__
        typedef double npy_longdouble;
    #else
        typedef long double npy_longdouble;
    #endif
    #define NPY_LONGDOUBLE_FMT "g"
#else
    typedef long double npy_longdouble;
    #define NPY_LONGDOUBLE_FMT "Lg"
#endif
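To help spell out the cases, the proposed branch can be modelled as a small Python function. This is a sketch of the decision logic only, under stated assumptions: the function name and parameters are hypothetical stand-ins (sizeof_longdouble and sizeof_double for NPY_SIZEOF_LONGDOUBLE and NPY_SIZEOF_DOUBLE, which the discussion describes as fixed at NumPy compile time; is_mingw, ldbl_dig and dbl_dig for __MINGW32__, __LDBL_DIG__ and __DBL_DIG__ as seen by the compiler that includes the header). The example digit counts 18 and 15 assume x87 extended precision and IEEE double.

```python
def npy_longdouble_choice(sizeof_longdouble, sizeof_double,
                          is_mingw=False, ldbl_dig=15, dbl_dig=15):
    """Model which typedef and format string the proposed branch selects.

    Returns a (typedef target, NPY_LONGDOUBLE_FMT) pair. This mirrors the
    preprocessor conditions from the comment above; it is not NumPy code.
    """
    if sizeof_longdouble == sizeof_double:
        # NumPy itself was built where long double is the same size as double.
        if is_mingw and ldbl_dig != dbl_dig:
            # The including compiler has a wider long double: fall back to double.
            return ("double", "g")
        return ("long double", "g")
    return ("long double", "Lg")


if __name__ == "__main__":
    cases = {
        "NumPy built 8 == 8, extension built with default mingw-w64":
            npy_longdouble_choice(8, 8, is_mingw=True, ldbl_dig=18, dbl_dig=15),
        "NumPy built 8 == 8, mingw-w64 with -mlong-double-64":
            npy_longdouble_choice(8, 8, is_mingw=True, ldbl_dig=15, dbl_dig=15),
        "NumPy built 8 == 8, same non-mingw compiler":
            npy_longdouble_choice(8, 8),
        "NumPy built with a wider long double":
            npy_longdouble_choice(16, 8),
    }
    for label, choice in cases.items():
        print(f"{label}: {choice}")
```

Only the first case maps npy_longdouble back to double, which is the mixed-compiler situation raised in the review.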

Given that it fixed the universal2 build, we can't just back out this change. It'd be great to figure out the right thing to do here, though, because we'd rather not release 1.22.0 with something that's going to prevent building SciPy with Mingw.

Ah, @seberg opened a fresh issue for this one, maybe best to continue there.

matthew-brett

A worry about the Numpy include change here.

seiko2plus · 2021-11-10T13:41:15Z

@matthew-brett,

> A worry about the Numpy include change here.

This change was needed for this workflow to pass, so at the time I thought it was fine to include it here.

seiko2plus changed the title from "CI: Add new test for armv7 to test universal intrinsics" to "CI: Add new action for armv7 to test universal intrinsics" Oct 27, 2021

seiko2plus force-pushed the ci_armv7_simd_test branch 12 times, most recently from 60d60b6 to e2a0023 October 27, 2021 11:44

seiko2plus marked this pull request as ready for review October 27, 2021 11:46

seiko2plus changed the title from "CI: Add new action for armv7 to test universal intrinsics" to "CI: Add new workflow/action for testing universal intrinsics on armv7" Oct 27, 2021

seiko2plus added the "36 - Build" (Build related PR) and "05 - Testing" labels Oct 27, 2021

seiko2plus force-pushed the ci_armv7_simd_test branch from e2a0023 to 9872f76 October 27, 2021 12:26

seiko2plus requested a review from mattip October 27, 2021 12:54

seiko2plus force-pushed the ci_armv7_simd_test branch from 9872f76 to b545183 October 28, 2021 06:29

seiko2plus added 2 commits October 28, 2021 08:30

BUG, BLD: Fix cast long double and double warnings(C)/errors(C++)

f559951

These errors/warnings occur when sizeof(long double) == sizeof(double).

seiko2plus force-pushed the ci_armv7_simd_test branch from b545183 to f559951 October 28, 2021 06:30

seiko2plus mentioned this pull request Oct 28, 2021

BUG: Build failure for macos universal2 wheels #20210

Closed

mattip merged commit 85d15ef into numpy:main Oct 28, 2021

seiko2plus linked an issue Oct 28, 2021 that may be closed by this pull request

BUG: Build failure for macos universal2 wheels #20210

Closed

This was referenced Nov 1, 2021

BUG: min/max is slow, re-implement using NEON (#17989) #20131

Merged

CI: Add new workflow for Intel SDE #20232

Merged

seiko2plus deleted the ci_armv7_simd_test branch November 5, 2021 16:56

matthew-brett reviewed Nov 10, 2021

seberg mentioned this pull request Nov 10, 2021

BUG: typedef long double npy_longdouble; creates problems on platforms with double == longdouble #20348

Closed

rgommers added the "component: SIMD" (Issues in SIMD (fast instruction sets) code or machinery) label Jul 12, 2022
