Skip to content

[CD] Simplify NVIDIA driver installation step#163349

Closed
malfet wants to merge 1 commit intomainfrom
malfet-patch-4
Closed

[CD] Simplify NVIDIA driver installation step#163349
malfet wants to merge 1 commit intomainfrom
malfet-patch-4

Conversation

@malfet
Copy link
Contributor

@malfet malfet commented Sep 19, 2025

Undo changes introduced in #160956 as driver has been updated to 580 for both fleets

Fixes #163342

Undo changes introduced in #160956 as driver has been updated to 580 for both fleets
@malfet malfet requested a review from a team as a code owner September 19, 2025 15:54
@pytorch-bot pytorch-bot bot added the topic: not user facing topic category label Sep 19, 2025
@pytorch-bot
Copy link

pytorch-bot bot commented Sep 19, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/163349

Note: Links to docs will display an error until the docs builds have been completed.

❌ 14 New Failures, 2 Unrelated Failures

As of commit 8cab9c5 with merge base f8f230a (image):

NEW FAILURES - The following jobs have failed:

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@malfet malfet added the ciflow/binaries_wheel Trigger binary build and upload jobs for wheel on the PR label Sep 19, 2025
@malfet
Copy link
Contributor Author

malfet commented Sep 19, 2025

CUDA-13 failures are due to #162590 which has been reverted on trunk

@malfet
Copy link
Contributor Author

malfet commented Sep 19, 2025

@pytorchbot merge -f "Seems to be working"

@pytorchmergebot
Copy link
Collaborator

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status
here

mansiag05 pushed a commit to mansiag05/pytorch that referenced this pull request Sep 22, 2025
Undo changes introduced in pytorch#160956 as driver has been updated to 580 for both fleets

Fixes pytorch#163342
Pull Request resolved: pytorch#163349
Approved by: https://github.com/seemethere
cleonard530 pushed a commit to cleonard530/pytorch that referenced this pull request Sep 22, 2025
Undo changes introduced in pytorch#160956 as driver has been updated to 580 for both fleets

Fixes pytorch#163342
Pull Request resolved: pytorch#163349
Approved by: https://github.com/seemethere
@atalman
Copy link
Contributor

atalman commented Sep 24, 2025

@pytorchbot cherry-pick --onto release/2.9 --fixes "Critical CI fix" -c critical

@pytorchbot
Copy link
Collaborator

Cherry picking #163349

Command git -C /home/runner/work/pytorch/pytorch cherry-pick -x b8c5ec582f9fe303d6525b9263eb8b738125e571 returned non-zero exit code 1

Auto-merging .github/workflows/_binary-test-linux.yml
CONFLICT (content): Merge conflict in .github/workflows/_binary-test-linux.yml
error: could not apply b8c5ec582f9... [CD] Simplify NVIDIA driver installation step (#163349)
hint: After resolving the conflicts, mark them with
hint: "git add/rm <pathspec>", then run
hint: "git cherry-pick --continue".
hint: You can instead skip this commit with "git cherry-pick --skip".
hint: To abort and get back to the state before "git cherry-pick",
hint: run "git cherry-pick --abort".
hint: Disable this message with "git config set advice.mergeConflict false"
Details for Dev Infra team Raised by workflow job

atalman pushed a commit to atalman/pytorch that referenced this pull request Sep 24, 2025
Undo changes introduced in pytorch#160956 as driver has been updated to 580 for both fleets

Fixes pytorch#163342
Pull Request resolved: pytorch#163349
Approved by: https://github.com/seemethere
atalman added a commit that referenced this pull request Sep 25, 2025
Undo changes introduced in #160956 as driver has been updated to 580 for both fleets

Fixes #163342
Pull Request resolved: #163349
Approved by: https://github.com/seemethere

Co-authored-by: Nikita Shulga <2453524+malfet@users.noreply.github.com>
dsashidh pushed a commit to dsashidh/pytorch that referenced this pull request Sep 26, 2025
Undo changes introduced in pytorch#160956 as driver has been updated to 580 for both fleets

Fixes pytorch#163342
Pull Request resolved: pytorch#163349
Approved by: https://github.com/seemethere
@github-actions github-actions bot deleted the malfet-patch-4 branch October 25, 2025 02:12
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ciflow/binaries_wheel Trigger binary build and upload jobs for wheel on the PR Merged topic: not user facing topic category

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[CD] - Manywheel CUDA builds failing since Sept 18

5 participants