Skip to content

Hold all nvidia software to the same version (fix to develop)#4459

Merged
nick-stroud merged 1 commit into
GoogleCloudPlatform:developfrom
samskillman:fix/version-compatibility
Jul 25, 2025
Merged

Hold all nvidia software to the same version (fix to develop)#4459
nick-stroud merged 1 commit into
GoogleCloudPlatform:developfrom
samskillman:fix/version-compatibility

Conversation

@samskillman

Copy link
Copy Markdown
Collaborator

Cherry-pick of #4458

Without this, during any combination of "update & upgrade", parts of the nvidia software stack are liable to be upgraded and become out of sync. While only libnvidia-compute-570-server causes immediate errors, it is best to keep everything in sync with the image until a point where an upgrade across all instances can be done.

Submission Checklist

NOTE: Community submissions can take up to 2 weeks to be reviewed.

Please take the following actions before submitting this pull request.

  • Fork your PR branch from the Toolkit "develop" branch (not main)
  • Test all changes with pre-commit in a local branch #
  • Confirm that "make tests" passes all tests
  • Add or modify unit tests to cover code changes
  • Ensure that unit test coverage remains above 80%
  • Update all applicable documentation
  • Follow Cluster Toolkit Contribution guidelines #

Without this, during any combination of "update & upgrade", parts
of the nvidia software stack are liable to be upgraded and become
out of sync. While only libnvidia-compute-570-server causes immediate
errors, it is best to keep everything in sync with the image until
a point where an upgrade across all instances can be done.
@samskillman samskillman requested a review from nick-stroud July 25, 2025 22:26
@samskillman samskillman requested a review from a team as a code owner July 25, 2025 22:26
@samskillman samskillman added the release-bugfix Added to release notes under the "Bug fixes" heading. label Jul 25, 2025
@samskillman samskillman changed the title Hold all nvidia software to the same version Hold all nvidia software to the same version (fix to develop) Jul 25, 2025
@nick-stroud nick-stroud merged commit 61aec72 into GoogleCloudPlatform:develop Jul 25, 2025
15 of 68 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

release-bugfix Added to release notes under the "Bug fixes" heading.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants