[CI] Update NVIDIA driver to `580.82.07` by malfet · Pull Request #163111 · pytorch/pytorch

malfet · 2025-09-16T21:49:22Z

Stack from ghstack (oldest at bottom):

-> [CI] Update NVIDIA driver to 580.82.07 #163111

Updates the NVIDIA driver so that CI machines can run CUDA 13 tests. Unfortunately, this upgrade regresses Numba integration, so Numba is live-patched in CI with the fix from NVIDIA/numba-cuda@6e08c9d

This fix was suggested in #162878 (comment)

[ghstack-poisoned]
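As a rough illustration of the kind of check a CI job could run before scheduling CUDA 13 tests, the sketch below compares a dotted driver version string against a minimum. The function names and the `"580.0"` threshold are hypothetical and are not taken from this PR; the actual minimum should come from NVIDIA's release notes.

```python
from __future__ import annotations


def parse_driver_version(version: str) -> tuple[int, ...]:
    """Turn a dotted string such as '580.82.07' into (580, 82, 7)."""
    return tuple(int(part) for part in version.strip().split("."))


def driver_at_least(installed: str, minimum: str) -> bool:
    """Return True if `installed` is the same as or newer than `minimum`.

    Both tuples are zero-padded to the same length so that '580' and
    '580.0.0' compare as equal.
    """
    a = parse_driver_version(installed)
    b = parse_driver_version(minimum)
    width = max(len(a), len(b))
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    return a >= b


# Hypothetical threshold, used only for illustration.
print(driver_at_least("580.82.07", "580.0"))
```

On a real runner the installed version string could be obtained with `nvidia-smi --query-gpu=driver_version --format=csv,noheader`; that step is left out here so the sketch stays runnable without a GPU.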

pytorch-bot · 2025-09-16T21:49:26Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/163111

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki

Note: Links to docs will display an error until the docs builds have been completed.

⏳ No Failures, 102 Pending

As of commit 5df64e3 with merge base cfc539f:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

huydhn · 2025-09-17T01:51:31Z

.github/workflows/_runner-determinator.yml

      - name: Get the workflow type for the current user
        id: set-condition
        run: |
-          curr_branch="${{ inputs.curr_branch }}"


I guess this is just a temp change to bypass the recent issue with no-runner-experiment?

huydhn

Stamped to unblock; the PR needs to be cleaned up before landing.

malfet · 2025-09-17T14:40:38Z

@pytorchbot merge -f "Lint is green, signal has been green previously"

pytorchmergebot · 2025-09-17T14:43:55Z

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status here

malfet · 2025-09-17T17:34:41Z

@pytorchbot merge -f "Take two"

pytorchmergebot · 2025-09-17T17:36:55Z

Merge started

nWEIdia · 2025-09-17T18:07:32Z

I suppose we need to lock numba version for a while for this patch to successfully apply? [Assuming a different numba version may have slight line number changes for the driver.py file]
Could you please note the numba version that the patch is done against?
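One way to address the concern above is to apply the patch only when the installed package version matches the version the patch was written against. The following is a minimal sketch of that idea, not the logic used in this PR; the helper names are hypothetical, and `"0.0.0"` is a placeholder because the thread does not state which numba version the patch targets.

```python
from __future__ import annotations

from importlib import metadata


def installed_version(package: str) -> str | None:
    """Return the installed version of `package`, or None if it is absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def should_apply_patch(installed: str | None, patched_against: str) -> bool:
    """Apply the patch only on an exact version match.

    A line-based patch can fail or land in the wrong place when the
    target file has shifted between releases, so any other version
    (or a missing package) is treated as "do not patch".
    """
    return installed is not None and installed == patched_against


# Placeholder version: the real value is not given in this thread.
PATCHED_AGAINST = "0.0.0"
print(should_apply_patch(installed_version("numba"), PATCHED_AGAINST))
```

Pinning the package in the CI requirements file would achieve the same goal more directly; the check above is only a guard for the case where the pin drifts.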

To make CI machines capable of running CUDA-13 tests. Unfortunately, this upgrade regresses NUMBA integration, so live patch it with NVIDIA/numba-cuda@6e08c9d This fix was suggested in pytorch#162878 (comment) Pull Request resolved: pytorch#163111 Approved by: https://github.com/huydhn

This reverts commit 16475a8. Reverted pytorch#163111 on behalf of https://github.com/malfet due to It started to fail now, but worked just fine in PR CI ([comment](pytorch#163111 (comment)))

To make CI machines capable of running CUDA-13 tests. Unfortunately, this upgrade regresses NUMBA integration, so live patch it with NVIDIA/numba-cuda@6e08c9d This fix was suggested in pytorch#162878 (comment) Pull Request resolved: pytorch#163111 Approved by: https://github.com/huydhn

atalman · 2025-09-22T15:23:59Z

@pytorchbot cherry-pick --onto release/2.9 --fixes "Critical CI fix" -c critical

pytorchbot · 2025-09-22T15:29:40Z

Cherry picking #163111

The cherry pick PR is at #163522 and it is linked with issue Critical CI fix. The following tracker issues are updated:

[v.2.9.0] Release Tracker #162497 (comment)

Details for Dev Infra team

Raised by workflow job

[CI] Update NVIDIA driver to `580.82.07` (#163111) To make CI machines capable of running CUDA-13 tests. Unfortunately, this upgrade regresses NUMBA integration, so live patch it with NVIDIA/numba-cuda@6e08c9d This fix was suggested in #162878 (comment) Pull Request resolved: #163111 Approved by: https://github.com/huydhn (cherry picked from commit 8dbac62) Co-authored-by: Nikita Shulga <nikita.shulga@gmail.com>

The live patch for numba.cuda introduced in pytorch#163111 causes issues in ROCm CI jobs, which do not use CUDA. This change restricts the patching logic to only run when $BUILD_ENVIRONMENT contains 'cuda'.
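The condition described above can be sketched in a few lines. The real change presumably lives in the CI shell scripts; this Python version only mirrors the stated rule (patch when `BUILD_ENVIRONMENT` contains `cuda`). The function names and the example environment strings are hypothetical.

```python
import os


def is_cuda_build(build_environment: str) -> bool:
    """True when the build environment name contains 'cuda'."""
    return "cuda" in build_environment


def should_patch_numba(env=None) -> bool:
    """Decide whether the numba live patch should run.

    `env` defaults to the process environment; a plain dict can be
    passed instead, which keeps the function easy to test. A missing
    BUILD_ENVIRONMENT is treated as a non-CUDA build.
    """
    env = os.environ if env is None else env
    return is_cuda_build(env.get("BUILD_ENVIRONMENT", ""))


# Illustrative environment names, not taken from the PR.
print(should_patch_numba({"BUILD_ENVIRONMENT": "linux-cuda-example"}))
print(should_patch_numba({"BUILD_ENVIRONMENT": "linux-rocm-example"}))
```

A substring match is deliberately loose: it skips ROCm jobs without needing a list of every CUDA job name, at the cost of also matching any future non-CUDA job whose name happens to contain `cuda`.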

The patch introduced in #163111 caused issues in ROCm environments. This change guards the patching logic to CUDA environments only, thus ameliorating test failures in ROCm environments. Pull Request resolved: #164607 Approved by: https://github.com/jeffdaily Co-authored-by: Jeff Daily <jeff.daily@amd.com>

Update

6728482

[ghstack-poisoned]

malfet requested a review from a team as a code owner September 16, 2025 21:49

malfet added a commit that referenced this pull request Sep 16, 2025

[DoNotMerge] Test new driver

2f408ed

ghstack-source-id: 1458bee Pull Request resolved: #163111

pytorch-bot bot added the topic: not user facing topic category label Sep 16, 2025

malfet added the ciflow/trunk Trigger trunk jobs on your pull request label Sep 16, 2025

malfet marked this pull request as draft September 16, 2025 21:53

malfet added the ciflow/periodic Trigger jobs ran periodically on master (periodic.yml) on the PR label Sep 16, 2025

Update

b3ab1b4

[ghstack-poisoned]

malfet added a commit that referenced this pull request Sep 16, 2025

[DoNotMerge] Test new driver

dadd001

ghstack-source-id: c9afa12 Pull Request resolved: #163111

Update

15a6713

[ghstack-poisoned]

malfet added a commit that referenced this pull request Sep 17, 2025

[DoNotMerge] Test new driver

97f4e3e

ghstack-source-id: 2479fa4 Pull Request resolved: #163111

Update

fba76e2

[ghstack-poisoned]

malfet added a commit that referenced this pull request Sep 17, 2025

[DoNotMerge] Test new driver

d8e9542

ghstack-source-id: 8670b7d Pull Request resolved: #163111

malfet marked this pull request as ready for review September 17, 2025 01:37

malfet changed the title ~~[DoNotMerge] Test new driver~~ [CI] Update NVIDIA driver to 580.82.07 Sep 17, 2025

huydhn reviewed Sep 17, 2025

View reviewed changes

huydhn approved these changes Sep 17, 2025

View reviewed changes

Update on "[CI] Update NVIDIA driver to 580.82.07"

5d2b59a

To make CI machines capable of running CUDA-13 tests. Unfortunately, this upgrade regresses NUMBA integration, so live patch it with NVIDIA/numba-cuda@6e08c9d [ghstack-poisoned]

malfet added a commit that referenced this pull request Sep 17, 2025

[DoNotMerge] Test new driver

ba00465

ghstack-source-id: 6726486 Pull Request resolved: #163111

malfet mentioned this pull request Sep 17, 2025

Update Nvidia driver to CUDA 13.0 compatible 580.82.07 #162531

Closed

Update on "[CI] Update NVIDIA driver to 580.82.07"

4e037b2

To make CI machines capable of running CUDA-13 tests. Unfortunately, this upgrade regresses NUMBA integration, so live patch it with NVIDIA/numba-cuda@6e08c9d This fix was suggested in #162878 (comment) [ghstack-poisoned]

malfet added a commit that referenced this pull request Sep 17, 2025

[DoNotMerge] Test new driver

1624293

ghstack-source-id: 86160e8 Pull Request resolved: #163111

pytorchmergebot added the merging label Sep 17, 2025

pytorchmergebot closed this in 16475a8 Sep 17, 2025

pytorchmergebot added Merged and removed merging labels Sep 17, 2025

pytorchmergebot added the merging label Sep 17, 2025

pytorchmergebot closed this in 8dbac62 Sep 17, 2025

pytorchmergebot removed the merging label Sep 17, 2025

pytorchbot mentioned this pull request Sep 22, 2025

[CI] Update NVIDIA driver to 580.82.07 #163522

Merged

pytorchbot mentioned this pull request Sep 22, 2025

[v.2.9.0] Release Tracker #162497

Closed

CSkmd added a commit to CSkmd/pytorch that referenced this pull request Sep 30, 2025

[CI] Scope Numba CUDA-13 patch to CUDA environments onl

b29b665

The patch introduced in pytorch#163111 causes issues in ROCm CI jobs. This change restricts the patching logic to CUDA environments only.

CSkmd mentioned this pull request Oct 3, 2025

[CI] Limit Numba CUDA-13 patch to CUDA environments only #164607

Closed

github-actions bot deleted the gh/malfet/523/head branch October 23, 2025 02:12

malfet wants to merge 7 commits into gh/malfet/523/base from gh/malfet/523/head
