Update OpenCVDetectCUDA.cmake by CSBVision · Pull Request #22675 · opencv/opencv

CSBVision · 2022-10-21T07:23:39Z

Adds the option to enable delay loading of CUDA DLLs on Windows. This is particularly useful to use the same binary on systems with and without CUDA support without distributing the CUDA DLLs to systems that cannot use them at all due to missing CUDA-supported hardware. Resolves #13509

Pull Request Readiness Checklist

See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request

I agree to contribute to the project under Apache 2 License.
To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV
The PR is proposed to the proper branch
There is a reference to the original bug report and related work
There is accuracy test, performance test and test data in opencv_extra repository, if applicable
Patch to opencv_extra has the same branch name.
The feature is well documented and sample code can be built with the project CMake

asmorkalov · 2022-10-24T06:13:23Z

Thanks a lot for the great suggestion! The option looks very promising. I'll check several corner cases and come back with feedback.

stopmosk · 2022-12-12T10:13:41Z

Hi

I found some issues with this PR:

The build fails if the Ninja generator is used; only the build with the MSVC generator succeeds. I googled and found this.
I built OpenCV with and without CUDA_ENABLE_DELAYLOAD and also built test CUDA apps. I then ran them on two Windows machines (one with and one without CUDA installed) and found no difference in the apps' behaviour on either machine. Dependency checking shows that the only difference is the delayed loading of the CUDA DLLs by opencv_cudaarithm460.dll (see screenshots).

opencv_cudaarithm460.dll built without CUDA_ENABLE_DELAYLOAD:

opencv_cudaarithm460.dll built with CUDA_ENABLE_DELAYLOAD:

asmorkalov · 2022-12-12T11:03:47Z

@stopmosk Please show your test app code and the OpenCV build options before and after the patch.

CSBVision · 2022-12-12T11:33:13Z

Hi,

Thanks for your comment!

First, regarding the issue with Ninja: From the link you added, I suspect that Ninja mixes up the import libraries (.lib files) and the delay-loaded DLLs. Unfortunately, the /delayload flag requires the DLL names and not the LIBs, as already mentioned above. This can easily be fixed by only allowing MSVC inside the CMake scripts (i.e. adding an 'if not Ninja' condition), but we are open to investigating which additional flags Ninja requires so that the option can be supported there, too. However, we have never used Ninja, and a quick CMake setup using Ninja with explicitly specified C and C++ compilers terminates with the error message

  The C++ compiler

    "C:/Program Files/Microsoft Visual Studio/2022/Professional/VC/Tools/MSVC/14.34.31933/bin/Hostx64/x64/cl.exe"

  is not able to compile a simple test program.

for unknown reasons. The compiler works fine (e.g. in combination with MSVC CMake builds) and it's unclear how to debug this. Is there a reference on how to compile OpenCV using Ninja on Windows? Unfortunately, I found none myself.

Regarding your second comment: This is exactly the expected behavior; only the CUDA DLLs are delay-loaded. However, the cudaarithm DLL is not a good example of the benefit of this flag because, at least as far as I know, there is nothing inside this DLL that works without CUDA-supported hardware. I think the DNN module is a better example: Here, we can use the CPU backend (supported on all machines), the OpenVINO backend (supported on machines with an OpenVINO-supported CPU or GPU) or the CUDA backend. Obviously, the latter is supported only on machines equipped with CUDA-compatible hardware. Still, if CUDA is activated while compiling OpenCV, the CUDA libraries always have to be distributed to all machines at runtime, even to those that cannot use them at all. This is where the delay-load option shines: Activate it and the whole CUDA functionality is available to devices that can use it, while all non-CUDA functionality remains available on all machines, without needing a superfluous CUDA installation.

asmorkalov · 2022-12-13T08:33:11Z

cmake/OpenCVDetectCUDA.cmake

+  if(MSVC)
+    OCV_OPTION(CUDA_ENABLE_DELAYLOAD "Enable delayed loading of CUDA DLLs" OFF)
+  endif()


OCV_OPTION has a VISIBLE_IF argument for conditions. I propose replacing the "if" block around the option with a single line:

OCV_OPTION(CUDA_ENABLE_DELAYLOAD "Enable delayed loading of CUDA DLLs" OFF VISIBLE_IF MSVC AND (CMAKE_GENERATOR MATCHES "Visual Studio"))

CSBVision · 2022-12-13

Yes, of course, it's more readable and cleaner.
Maybe it's worth thinking about defaulting to ON? I just thought about this because the option does not really change anything if the DLLs are there, but it avoids runtime errors if they are missing and unused anyway.

asmorkalov · 2022-12-13

I prefer to keep it optional. It may affect the library loading order for users who work with other CUDA-based libraries like TensorRT or cuDNN, or with Python code that uses CUDA.

CSBVision · 2022-12-13

Alright, I just committed your proposal. So the PR can be merged now?

asmorkalov

👍 Looks good to me! Thanks for the contribution.

CSBVision · 2022-12-13T12:11:36Z

Great, thanks for approving 👍

asmorkalov · 2022-12-13T12:18:18Z

@CSBVision Could you squash the commits to have clear merge history?

CSBVision · 2022-12-13T12:28:27Z

Generally yes; however, I don't want to mess something up. The right command should be

git rebase -i HEAD~3

shouldn't it?

CSBVision · 2022-12-13T14:53:31Z

After git rebase -i HEAD~3, git opens a Vim window with

pick f5e9dfe64a Update OpenCVDetectCUDA.cmake
pick fdf67a4f5f Update OpenCVDetectCUDA.cmake
pick efc4f66e11 Update OpenCVDetectCUDA.cmake

Exiting it shows "Successfully rebased and updated refs/heads/patch-2.", but git push reports that everything is up to date. What am I missing?

alalek · 2022-12-13T16:13:24Z

Run again:

git rebase -i HEAD~3

Change 2nd and 3rd lines from pick to f (fixup):

pick f5e9dfe64a Update OpenCVDetectCUDA.cmake
f fdf67a4f5f Update OpenCVDetectCUDA.cmake
f efc4f66e11 Update OpenCVDetectCUDA.cmake

Save and exit.
Run git push ... with the --force flag.
The PR's patch should then be updated (1 commit is visible, +13 -0 lines changed).
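
The steps above can be tried safely in a throwaway repository before touching the real branch. The following is only a sketch under stated assumptions: the temporary directories, the file contents, the dummy identity and the local bare "remote" are all made up for illustration, and GIT_SEQUENCE_EDITOR is used to apply the pick-to-fixup change non-interactively instead of editing the todo list by hand.

```shell
#!/bin/sh
set -eu

# Throwaway repository and a local bare repository standing in for the fork.
# All paths, names and file contents here are illustrative assumptions.
repo=$(mktemp -d)
remote=$(mktemp -d)
git init -q --bare "$remote"
cd "$repo"
git init -q
git symbolic-ref HEAD refs/heads/patch-2   # name the branch like the one in the PR
git config user.email "dev@example.com"
git config user.name "Example Dev"
git config commit.gpgsign false

# One base commit followed by three small commits with identical messages.
echo "base" > OpenCVDetectCUDA.cmake
git add OpenCVDetectCUDA.cmake
git commit -q -m "Base"
for i in 1 2 3; do
  echo "change $i" >> OpenCVDetectCUDA.cmake
  git commit -q -am "Update OpenCVDetectCUDA.cmake"
done
git remote add origin "$remote"
git push -q origin patch-2

# Same effect as changing the 2nd and 3rd todo lines from "pick" to "f":
# the sequence editor rewrites the todo list, so no editor window opens.
GIT_SEQUENCE_EDITOR="sed -i -e '2,\$s/^pick /fixup /'" git rebase -i HEAD~3

# History was rewritten, so a plain push would be rejected; force is required.
git push -q --force origin patch-2
git log --oneline
```

Leaving all three lines as pick, as in the earlier attempt, replays the commits unchanged, which is why git push reported that everything was up to date. On a shared branch, git push --force-with-lease is a more cautious alternative to --force.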


CSBVision · 2022-12-13T18:08:24Z

Thanks @alalek , I hope it's correct now.

asmorkalov self-requested a review October 21, 2022 08:08

asmorkalov added the labels "category: build/install" and "category: gpu/cuda (contrib)" (OpenCV 4.0+: moved to opencv_contrib) Oct 24, 2022

asmorkalov self-assigned this Oct 24, 2022

asmorkalov requested a review from stopmosk December 2, 2022 07:53

asmorkalov added this to the 4.7.0 milestone Dec 12, 2022

asmorkalov reviewed Dec 13, 2022

asmorkalov approved these changes Dec 13, 2022

stopmosk approved these changes Dec 13, 2022

CSBVision force-pushed the patch-2 branch from efc4f66 to 332ff4b Compare December 13, 2022 16:42

asmorkalov merged commit 81aaca8 into opencv:4.x Dec 14, 2022

alalek mentioned this pull request Jan 8, 2023

(5.x) Merge 4.x #23113

Merged

CSBVision mentioned this pull request Jan 31, 2023

Two issues with CUDA_ENABLE_DELAYLOAD #23187

Closed

4 tasks

CSBVision mentioned this pull request Feb 10, 2023

Proposal: Delay load option for OpenCV modules? #23235

Closed
