Miscellaneous updates for CUDA 10#12017

Closed
syed-ahmed wants to merge 2 commits intopytorch:masterfrom
syed-ahmed:cuda-10-updates
Conversation

@syed-ahmed
Collaborator

This PR has some updates related to CUDA 10.

syed-ahmed and others added 2 commits September 24, 2018 08:16
Co-authored-by: Christian Sarofeen <csarofeen@nvidia.com>
Contributor

@facebook-github-bot facebook-github-bot left a comment


soumith is landing this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

zdevito pushed a commit to zdevito/ATen that referenced this pull request Sep 24, 2018
Summary:
This PR has some updates related to CUDA 10.

- pytorch/pytorch@c2195e9 ensures that the repo builds successfully on CUDA 10. Addresses pytorch/pytorch#11888
- pytorch/pytorch@423d8d3 follows up on the cuFFT max plan number bug: pytorch/pytorch#11089, which has been fixed in CUDA 10.
Pull Request resolved: pytorch/pytorch#12017

Differential Revision: D10013405

Pulled By: soumith

fbshipit-source-id: 5bc6d7f71d5133f7821b407b1ac6c51bef0f6fa8
// bug related to cuFFT plan cache max size has been fixed
// in CUDA 10. Hence, when compiling with CUDA 10, just
// don't do the erase.
#if CUDA_VERSION < 10000


@@ -346,6 +346,7 @@ class CuFFTConfig {
// be fine for now.
// TODO: When CUDA 10 comes out, check if the bug is fixed or if we need another


facebook-github-bot pushed a commit that referenced this pull request Oct 12, 2018
Summary:
@SsnL As per your review in #12017, I added a max plan number for the CUDA 10 path. Our internal cuFFT team couldn't suggest a number since the limit depends on host/device memory: a plan allocates buffers on the device and also creates objects for the plan on the host side. I raised this number to 4x arbitrarily, per your suggestion.
Pull Request resolved: #12553

Differential Revision: D10320832

Pulled By: SsnL

fbshipit-source-id: 3148d45cd280dffb2039756e2f6a74fbc7aa086d
zdevito pushed a commit to zdevito/ATen that referenced this pull request Oct 12, 2018