Skip to content

CUDA: fix native detection on Jetson#17671

Merged
opencv-pushbot merged 1 commit intoopencv:masterfrom
tomoaki0705:fixCUDANativeDetection
Jun 26, 2020
Merged

CUDA: fix native detection on Jetson#17671
opencv-pushbot merged 1 commit intoopencv:masterfrom
tomoaki0705:fixCUDANativeDetection

Conversation

@tomoaki0705
Copy link
Copy Markdown
Contributor

relates #17526
resolves #17598

Pull Request Readiness Checklist

See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request

  • I agree to contribute to the project under OpenCV (BSD) License.
  • To the best of my knowledge, the proposed patch is not based on a code under GPL or other license that is incompatible with OpenCV
  • The PR is proposed to proper branch
  • There is reference to original bug report and related work
  • There is accuracy test, performance test and test data in opencv_extra repository, if applicable
    Patch to opencv_extra has the same branch name.
  • The feature is well documented and sample code can be built with the project CMake

@alalek
Copy link
Copy Markdown
Member

alalek commented Jun 26, 2020

/cc @cyyever @nglee Could you please take a look on this?

Copy link
Copy Markdown
Contributor

@asmorkalov asmorkalov left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

👍 Build tested with jetson nano.

@tomoaki0705
Copy link
Copy Markdown
Contributor Author

@asmorkalov could you double check on CUDA_ARCH_BIN after cmake, please ?
This patch won't break the build but it's supposed to limit CUDA_ARCH_BIN to 5.3 only on Nano

@asmorkalov
Copy link
Copy Markdown
Contributor

Yes, 5.3 is reported in console and I see CUDA_ARCH_BIN:STRING=5.3 in CMakeCache.txt.

@opencv-pushbot opencv-pushbot merged commit c62e639 into opencv:master Jun 26, 2020
@tomoaki0705 tomoaki0705 deleted the fixCUDANativeDetection branch June 26, 2020 23:40
@cyyever
Copy link
Copy Markdown
Contributor

cyyever commented Jun 27, 2020

It seems this PR just wraps CUDA_HOST_COMPILER in quota. I am OK with it, but it seems unnecessary to introduce a new variable.

@nglee
Copy link
Copy Markdown
Contributor

nglee commented Jul 9, 2020

For my case with TX2, with following cmake command:

cmake -DWITH_CUDA=On -DBUILD_PERF_TESTS=Off -DOPENCV_EXTRA_MODULES_PATH=../opencv_contrib/modules ../opencv

I get the following output:

--   NVIDIA CUDA:                   YES (ver 10.0, CUFFT CUBLAS)
--     NVIDIA GPU arch:             53 62 72 70
--     NVIDIA PTX archs:

The compute capability number of TX2 is 62.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

CUDA: detection of CUDA on Jetson TX1 and TX2 fail

6 participants