`cuda`: update default PTX behaviour when `CUDA_ARCH_BIN` is unset by cudawarped · Pull Request #24131 · opencv/opencv

cudawarped · 2023-08-09T06:28:29Z

Currently when CUDA_ARCH_BIN isn't specified CMake attempts to build .cu files for the all architechtures it knows about, generating a list of architectures supported by the current CUDA Toolkit (see #17432). The idea being that if a user doesn't specify the arch then to be on the safe side it should generate for all architechtures for maximum compatibility. Additionally if a user just wants to specify a PTX architechture they have to pass CUDA_ARCH_BIN= (incurring the overhead of calling ocv_filter_available_architecture on all know architectures) in addition to CUDA_ARCH_PTX=<TAERGET ARCH> to CMake to avoid generating binary code for all supported architectures as well.

This PR proposes that:

When CUDA_ARCH_BIN and CUDA_ARCH_PTX are not specified, in addition to binary code for all compute capabilities, PTX code for the highest supported architecture is also generated. This would ensure that the CUDA device code will also run on newer GPU's than the installed CUDA Toolkit, increasing the level of compatibility futher.
When CUDA_ARCH_PTX is passed in isolation only PTX code for the desired CC is generated.

@tomoaki0705

Pull Request Readiness Checklist

See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request

I agree to contribute to the project under Apache 2 License.
To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV
The PR is proposed to the proper branch
There is a reference to the original bug report and related work
There is accuracy test, performance test and test data in opencv_extra repository, if applicable
Patch to opencv_extra has the same branch name.
The feature is well documented and sample code can be built with the project CMake

…CH_PTX to be passed in isolation

cudawarped · 2023-08-12T08:11:29Z

@asmorkalov I have updated the PR.

asmorkalov

👍

asmorkalov added category: build/install category: gpu/cuda (contrib) OpenCV 4.0+: moved to opencv_contrib labels Aug 9, 2023

asmorkalov self-requested a review August 9, 2023 06:52

cudawarped changed the title ~~cuda: add default PTX when CUDA_ARCH_BIN is missing~~ cuda: add default PTX when CUDA_ARCH_BIN is missing Aug 9, 2023

cuda: add default ptx when CUDA_ARCH_BIN is missing and allow CUDA_AR…

358e306

…CH_PTX to be passed in isolation

cudawarped force-pushed the cuda_add_default_ptx branch from ee154ea to 358e306 Compare August 12, 2023 08:10

cudawarped changed the title ~~cuda: add default PTX when CUDA_ARCH_BIN is missing~~ cuda: update default PTX behaviour when CUDA_ARCH_BIN is unset Aug 13, 2023

asmorkalov self-assigned this Sep 14, 2023

asmorkalov added this to the 4.9.0 milestone Sep 14, 2023

asmorkalov approved these changes Sep 14, 2023

View reviewed changes

asmorkalov merged commit ec1c060 into opencv:4.x Sep 14, 2023

asmorkalov mentioned this pull request Sep 28, 2023

(5.x) Merge 4.x #24338

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

`cuda`: update default PTX behaviour when `CUDA_ARCH_BIN` is unset#24131

`cuda`: update default PTX behaviour when `CUDA_ARCH_BIN` is unset#24131
asmorkalov merged 1 commit intoopencv:4.xfrom
cudawarped:cuda_add_default_ptx

cudawarped commented Aug 9, 2023 •

edited

Loading

Uh oh!

cudawarped commented Aug 12, 2023

Uh oh!

asmorkalov left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

cudawarped commented Aug 9, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Pull Request Readiness Checklist

Uh oh!

cudawarped commented Aug 12, 2023

Uh oh!

asmorkalov left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

cudawarped commented Aug 9, 2023 •

edited

Loading