CUDA: fix native detection on Jetson by tomoaki0705 · Pull Request #17671 · opencv/opencv

tomoaki0705 · 2020-06-26T07:41:26Z

relates #17526
resolves #17598

Pull Request Readiness Checklist

See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request

I agree to contribute to the project under OpenCV (BSD) License.
To the best of my knowledge, the proposed patch is not based on a code under GPL or other license that is incompatible with OpenCV
The PR is proposed to proper branch
There is reference to original bug report and related work
There is accuracy test, performance test and test data in opencv_extra repository, if applicable
Patch to opencv_extra has the same branch name.
The feature is well documented and sample code can be built with the project CMake

alalek · 2020-06-26T11:04:55Z

/cc @cyyever @nglee Could you please take a look at this?

asmorkalov

👍 Build tested on a Jetson Nano.

tomoaki0705 · 2020-06-26T12:49:49Z

@asmorkalov could you double-check the value of CUDA_ARCH_BIN after running cmake, please?
This patch won't break the build, but on the Nano it is supposed to limit CUDA_ARCH_BIN to 5.3 only.

asmorkalov · 2020-06-26T19:11:39Z

Yes, 5.3 is reported in the console and I see CUDA_ARCH_BIN:STRING=5.3 in CMakeCache.txt.
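
For anyone repeating this check, here is a minimal sketch of reading a cached variable back out of CMakeCache.txt. The helper name `cache_value` is hypothetical and is not part of OpenCV or CMake; it only relies on the `NAME:TYPE=VALUE` layout of cache entries, as in the `CUDA_ARCH_BIN:STRING=5.3` line quoted above.

```shell
# Print the value of a cached CMake variable from CMakeCache.txt.
# Usage: cache_value <build-dir> <variable-name>
# (cache_value is a hypothetical helper name used for illustration.)
cache_value() {
    # Cache entries look like NAME:TYPE=VALUE; print everything after the first '='.
    grep -E "^$2:[A-Z]+=" "$1/CMakeCache.txt" | cut -d= -f2-
}

# Example, run from the build directory after cmake has finished:
#   cache_value . CUDA_ARCH_BIN
```

On a Jetson Nano with this patch applied, the expectation stated in this thread is that the call prints 5.3.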

cyyever · 2020-06-27T07:12:59Z

It seems this PR just wraps CUDA_HOST_COMPILER in quotes. I am OK with it, but it seems unnecessary to introduce a new variable.
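
As a general illustration of why quoting a variable can change what a command receives, here is a small shell sketch. It is an analogy only: the PR itself changes CMake code, not shell, and the variable `compiler`, the helper `count_args`, and the `-ccbin` usage here are assumptions made for the example rather than the patch's actual content.

```shell
# Count how many arguments a command actually receives.
# (count_args is a hypothetical helper used for illustration.)
count_args() { echo "$#"; }

compiler=""                      # assumption: the host compiler value is empty
count_args -ccbin $compiler      # unquoted: the empty value vanishes, prints 1
count_args -ccbin "$compiler"    # quoted: an empty argument is still passed, prints 2
```

An unquoted empty value silently shifts the argument list, which is the kind of difference quoting guards against.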

nglee · 2020-07-09T16:44:44Z

In my case with a TX2, using the following cmake command:

cmake -DWITH_CUDA=On -DBUILD_PERF_TESTS=Off -DOPENCV_EXTRA_MODULES_PATH=../opencv_contrib/modules ../opencv

I get the following output:

--   NVIDIA CUDA:                   YES (ver 10.0, CUFFT CUBLAS)
--     NVIDIA GPU arch:             53 62 72 70
--     NVIDIA PTX archs:

The compute capability of the TX2 is 6.2, i.e. 62 in the list above.
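
To compare the detected architectures against the expected one, the list can be pulled out of a saved configure log. This is a rough sketch: the helper name `gpu_archs` and the idea of saving the cmake output to a file are assumptions, and the pattern only matches the summary line format quoted above.

```shell
# Extract the "NVIDIA GPU arch" list from a saved cmake summary.
# Usage: gpu_archs <log-file>
# (gpu_archs is a hypothetical helper name used for illustration.)
gpu_archs() {
    # Strip the "--   NVIDIA GPU arch:" prefix and print the remaining list.
    sed -n 's/^--[[:space:]]*NVIDIA GPU arch:[[:space:]]*//p' "$1"
}

# Example: save the configure output first, then inspect it.
#   cmake -DWITH_CUDA=On ../opencv | tee configure.log
#   gpu_archs configure.log
```

If native detection worked on a TX2, the list would be expected to contain only 62. Passing `-DCUDA_ARCH_BIN=6.2` to cmake sets the architecture explicitly instead of relying on detection.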

fix native detection on Jetson

4cec9e5

tomoaki0705 mentioned this pull request Jun 26, 2020

respect CUDA_HOST_COMPILER when detecting CUDA arch #17526

Merged

asmorkalov self-assigned this Jun 26, 2020

asmorkalov added category: build/install category: gpu/cuda (contrib) OpenCV 4.0+: moved to opencv_contrib labels Jun 26, 2020

asmorkalov self-requested a review June 26, 2020 08:29

asmorkalov approved these changes Jun 26, 2020

opencv-pushbot merged commit c62e639 into opencv:master Jun 26, 2020

tomoaki0705 deleted the fixCUDANativeDetection branch June 26, 2020 23:40

alalek mentioned this pull request Jul 3, 2020

cmake(cuda): repair ccbin, re-implement execute_process() cache #17745

Merged
