Fix compilation on arm64 with FP16 when disabled by thesamesam · Pull Request #24203 · opencv/opencv

thesamesam · 2023-08-28T03:22:52Z

When building with -mcpu=native, or any other setting that implies the target CPU has FP16, while FP16 intrinsics are disabled in the build, we mistakenly try to use the FP16 code path. convolution.hpp conditionally defines it correctly, based on whether we should use it, but convolution.cpp was mismatched: it tried to use FP16 whenever the CPU supported it, even if it was not enabled in the build system.

Make the guards match.

Bug: https://bugs.gentoo.org/913031
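
For readers unfamiliar with the failure mode, here is a minimal sketch of it, not the actual OpenCV code. DEMO_ENABLE_FP16, DEMO_CONV_FP16, conv_fp16, conv_fp32 and pick_kernel are hypothetical names made up for illustration; only __ARM_FEATURE_FP16_VECTOR_ARITHMETIC is a real compiler-provided macro. The "header" part declares the FP16 kernel only when both the CPU feature and the build switch are on, so the "source" part has to test the same condition.

```cpp
#include <string>

// Hypothetical stand-in for the build-system switch (assumption: 0 means
// "FP16 intrinsics disabled in the build").
#ifndef DEMO_ENABLE_FP16
#define DEMO_ENABLE_FP16 0
#endif

// "Header" side: the FP16 kernel exists only if the CPU advertises FP16
// arithmetic AND the build switch is on.
#if defined(__ARM_FEATURE_FP16_VECTOR_ARITHMETIC) && DEMO_ENABLE_FP16
#define DEMO_CONV_FP16 1
inline std::string conv_fp16() { return "fp16"; }
#endif

inline std::string conv_fp32() { return "fp32"; }

// "Source" side: guard on the same flag as the header. Guarding on
// __ARM_FEATURE_FP16_VECTOR_ARITHMETIC alone would call conv_fp16() in a
// build where it was never declared, which is a compile error.
inline std::string pick_kernel() {
#if defined(DEMO_CONV_FP16)
    return conv_fp16();
#else
    return conv_fp32();
#endif
}
```

With the build switch left at its default of 0, pick_kernel() falls back to the FP32 path on any CPU, including one where -mcpu=native makes the compiler define the FP16 feature macro.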

Pull Request Readiness Checklist

See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request

I agree to contribute to the project under Apache 2 License.
To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV
The PR is proposed to the proper branch
There is a reference to the original bug report and related work
There is accuracy test, performance test and test data in opencv_extra repository, if applicable
Patch to opencv_extra has the same branch name.
The feature is well documented and sample code can be built with the project CMake

zihaomu · 2023-08-28T11:48:54Z

Thanks for your contribution. How about doing the following:
https://github.com/opencv/opencv/blob/4.x/modules/dnn/src/layers/cpu_kernels/convolution.hpp#L17

#if defined(__ARM_FEATURE_FP16_VECTOR_ARITHMETIC) && CV_FP16 // check FP16 FMA.
#define CONV_ARM_FP16 1
#endif

That way, a single flag determines whether the user wants FP16 enabled.
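
As a rough sketch of how such a single derived flag might be consumed, assuming a hypothetical build switch DEMO_FP16 in place of CV_FP16 (all DEMO_* names, kUseFp16Conv and conv_element_bytes are illustrative, not OpenCV API): this variation also defines the flag to 0 when FP16 is off, which differs from the snippet above, where the flag is simply left undefined. That lets the decision be surfaced as an ordinary constant that can be logged or tested.

```cpp
// Hypothetical build switch standing in for CV_FP16 (assumed to be 0 or 1).
#ifndef DEMO_FP16
#define DEMO_FP16 0
#endif

// Derived flag: CPU capability AND build switch, decided in one place.
#if defined(__ARM_FEATURE_FP16_VECTOR_ARITHMETIC) && DEMO_FP16
#define DEMO_CONV_ARM_FP16 1
#else
#define DEMO_CONV_ARM_FP16 0
#endif

// Expose the compile-time decision to regular C++ code.
constexpr bool kUseFp16Conv = (DEMO_CONV_ARM_FP16 != 0);

// Example consumer: bytes per element for the selected convolution path.
inline int conv_element_bytes() { return kUseFp16Conv ? 2 : 4; }
```

Every other file then tests only the derived flag, so the header and the source cannot drift apart again.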

If building with -mcpu=native or any other setting which implies the current CPU has FP16 but with intrinsics disabled, we mistakenly try to use it even though convolution.hpp conditionally defines it correctly based on whether we should *use it*. convolution.cpp on the other hand was mismatched and trying to use it if the CPU supported it, even if not enabled in the build system.

Make the guards match.

Bug: https://bugs.gentoo.org/913031
Signed-off-by: Sam James <sam@gentoo.org>

thesamesam · 2023-08-29T02:06:24Z

Ah, good idea, thank you! Done.

zihaomu

LGTM! 👍

thesamesam force-pushed the arm64-fp16 branch 2 times, most recently from 2f3a130 to f6211de Compare August 28, 2023 06:46

zihaomu requested review from vpisarev and zihaomu and removed request for vpisarev August 28, 2023 11:39

zihaomu added bug category: dnn labels Aug 28, 2023

thesamesam force-pushed the arm64-fp16 branch from f6211de to c20febd Compare August 29, 2023 02:28

zihaomu approved these changes Aug 29, 2023


asmorkalov assigned zihaomu Sep 4, 2023

asmorkalov added this to the 4.9.0 milestone Sep 4, 2023

asmorkalov merged commit c53b3c5 into opencv:4.x Sep 4, 2023

thesamesam deleted the arm64-fp16 branch September 4, 2023 06:46

asmorkalov mentioned this pull request Sep 11, 2023

(5.x) Merge 4.x #24254

Merged

zihaomu mentioned this pull request Sep 13, 2023

Cross compiling for arm64 on Intel mac fails in the DNN module, some problem with CV_FP16 #24257

Closed

4 tasks
