Conversation
|
What arguments should be used for CMake run / flags enabled? (please share some best known configuration) |
|
I used this CMake option to build and execute. |
|
Thank you for update! What are arguments used for Did you see these lines from CMake? If so, please try to fix check (ARM code without NEON optimizations is slow). |
|
Thank you for review.
Use I'm trying to enable NEON, but the build doesn't pass. |
|
Another problem here is that CMake is not running in cross-compilation mode ( I tried to fix check_neon.cpp but got this error:
However Details |
|
Thanks you! Attach Did you encounter a problem with a previously shared cmake command? It was a build error when NEON was enabled, but I pushed the branch I worked on halfway. If you build after running cmake, you should get an error in |
|
I added "custom" builder with ARM64 configuration. Please take a look. Consider pushing changes with updates on source branch of this PR ( |
|
Pushed NEON changes from branch "arm64-neon-windows10" here. |
Resolved by upgrading MSVS 2019.
Looks like a MSVC compiler bug which aliased "int8x16_t" and "uint8x16_t" and don't accept overloads. This simple code: Emits these errors: Any ideas how to workaround that? (without massive code refactoring) |
|
In MSVC, I found same issue: https://developercommunity.visualstudio.com/content/problem/335464/arm64-vector-intrinsics-typedefs-float32x4-t-and-i.html I think this is a very difficult problem to solve. At least I couldn't find a workaround... |
|
Bug fixes on the Visual Studio side will take time. The performance will drop because NEON cannot be used, but I would like to make the build pass for ARM / ARM64. |
This pullrequest changes
ARM64archtecture_M_ARM64preprocessor