[CUDA][CUDA 12] CUDA 12 Support Tracking Issue #91122
Closed
Labels
feature (A request for a proper, new feature)
module: cuda (Related to torch.cuda, and CUDA support in general)
triaged (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module)
CUDA 12 has been released, but we've identified several blocking issues (some code/API compatibility, some API functionality) that need to be addressed before a PyTorch + CUDA 12 build/environment could be considered usable by mainstream users. We're creating this issue to hopefully avoid duplicating work in identifying and resolving issues.
Required code changes for build-breaking changes:
- cudaGraphInstantiate, as the API has changed ([CUDA12] Make PyTorch compatible with CUDA 12 #91118)
- cudaProfilerInitialize ([CUDA12] Make PyTorch compatible with CUDA 12 #91118, Follow up on CUDA 12 support for PyTorch/Caffe2 #95582)
- const cuSPARSE descriptors (Update cuSPARSE usage for CUDA 12.0 #90765, [CUDA 12] Fix the endif guard position for cusparse const descriptors #90897, [CUDA12] Clean up deprecated APIs #91050)

Known API functionality issues:
- cudaSetDevice now eagerly creates a CUDA context when called, which is a significant departure from previous behavior. This can create many unused contexts that waste GPU memory (especially in DDP workloads), and would require either framework or CUDA toolkit code changes to resolve. ([CUDA12] Conditionally set device in autograd engine #91191, [DO NOT MERGE] [CUDA12] Conditionally set device in device guard #91219, [CUDA12] set_device change #94864)

Nightly CUDA 12.1 builds are now available for the adventurous:
@ https://pytorch.org/get-started/locally/
e.g.,
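The selector on that page generates an install command roughly along these lines; the exact package list and nightly index URL below are an assumption, so confirm against the site for your platform:

```shell
# Hypothetical example of the generated nightly command (verify the exact
# form at https://pytorch.org/get-started/locally/ before running):
pip3 install --pre torch torchvision torchaudio \
    --index-url https://download.pytorch.org/whl/nightly/cu121
```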
CC @ptrblck @crcrpar @IvanYashchuk @xwang233 @Aidyn-A @Fuzzkatt @syed-ahmed @puririshi98 @ngimel @malfet
Please feel free to update this issue as new items are discovered/resolved.
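To make the cudaSetDevice item above concrete, here is a toy simulation of why eager context creation wastes memory. Nothing here is the real CUDA or PyTorch API: the class, the init functions, and the 300 MB per-context figure are all invented for illustration; the sketch only models "setting a device now creates a context on it".

```python
class CudaRuntimeSim:
    """Toy stand-in (NOT the real CUDA runtime) where set_device eagerly
    creates a context, as observed under CUDA 12."""
    CONTEXT_MB = 300  # illustrative per-context memory overhead, not measured

    def __init__(self):
        self.current = 0
        self.contexts = set()  # devices that have a context allocated

    def set_device(self, dev):
        self.current = dev
        self.contexts.add(dev)  # CUDA 12 surprise: context created eagerly

    def wasted_mb(self, devices_needed):
        # Memory spent on contexts for devices this process never computes on.
        return self.CONTEXT_MB * len(self.contexts - set(devices_needed))


def naive_init(rt, num_devices):
    # Pre-fix pattern: unconditionally touch every visible device
    # (e.g. an engine initializing per-device state), paying the
    # context cost on all of them.
    for d in range(num_devices):
        rt.set_device(d)


def conditional_init(rt, devices_used):
    # Direction of the workaround PRs: only set devices that will
    # actually run work, so unused devices stay context-free.
    for d in devices_used:
        rt.set_device(d)


# A DDP-style rank that only ever computes on device 3, on an 8-GPU box:
rt_naive = CudaRuntimeSim()
naive_init(rt_naive, 8)
print(len(rt_naive.contexts), rt_naive.wasted_mb([3]))  # 8 2100

rt_cond = CudaRuntimeSim()
conditional_init(rt_cond, [3])
print(len(rt_cond.contexts), rt_cond.wasted_mb([3]))  # 1 0
```

Under the eager behavior, the unconditional loop leaves a context on every device, so with 8 GPUs a rank pinned to one device carries 7 unused contexts; setting the device only where work will actually run avoids that, which is the approach sketched in the linked PRs.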
cc @ngimel