Skip to content

Add a periodic test for older NVIDIA driver#158300

Closed
huydhn wants to merge 7 commits intopytorch:mainfrom
huydhn:cover-older-nvidia-driver
Closed

Add a periodic test for older NVIDIA driver#158300
huydhn wants to merge 7 commits intopytorch:mainfrom
huydhn:cover-older-nvidia-driver

Conversation

@huydhn
Copy link
Contributor

@huydhn huydhn commented Jul 14, 2025

This is needed because of the botched landing of #156097 which crashed on older NVIDIA drivers 525.*. I add a periodic job to install the 525.105.17 on CI, then run:

  1. A smoke to make sure that CUDA can be initialized
  2. And the whole the test suite on the older driver

Signed-off-by: Huy Do <huydhn@gmail.com>
@huydhn huydhn requested a review from ngimel July 14, 2025 23:54
@huydhn huydhn requested a review from a team as a code owner July 14, 2025 23:54
@huydhn huydhn added ciflow/periodic Trigger jobs ran periodically on master (periodic.yml) on the PR test-config/default test-config/legacy_nvidia_driver labels Jul 14, 2025
@pytorch-bot pytorch-bot bot added the release notes: releng release notes category label Jul 14, 2025
@pytorch-bot
Copy link

pytorch-bot bot commented Jul 14, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/158300

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure, 1 Pending

As of commit 59b5e2d with merge base 8c3f206 (image):

NEW FAILURE - The following job has failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

huydhn added 3 commits July 14, 2025 18:22
Signed-off-by: Huy Do <huydhn@gmail.com>
Signed-off-by: Huy Do <huydhn@gmail.com>
@huydhn huydhn added the no-runner-experiments Bypass Meta/LF runner determinator label Jul 15, 2025
huydhn added 3 commits July 15, 2025 11:37
Signed-off-by: Huy Do <huydhn@gmail.com>
Signed-off-by: Huy Do <huydhn@gmail.com>
@huydhn
Copy link
Contributor Author

huydhn commented Jul 16, 2025

@pytorchbot merge -f 'Legacy driver tests are running now'

@pytorchmergebot
Copy link
Collaborator

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status
here

@huydhn huydhn deleted the cover-older-nvidia-driver branch December 4, 2025 23:24
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ciflow/periodic Trigger jobs ran periodically on master (periodic.yml) on the PR Merged no-runner-experiments Bypass Meta/LF runner determinator release notes: releng release notes category

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants