Fix build-service-image nvidia/kernel/lustre mismatch #4454

Merged
nick-stroud merged 1 commit into GoogleCloudPlatform:develop from samskillman:fix/build-service-image-kernel-mismatch on Jul 25, 2025

Conversation

@samskillman

Collaborator

Submission Checklist

NOTE: Community submissions can take up to 2 weeks to be reviewed.

Please take the following actions before submitting this pull request.

  • Fork your PR branch from the Toolkit "develop" branch (not main)
  • Test all changes with pre-commit in a local branch
  • Confirm that "make tests" passes all tests
  • Add or modify unit tests to cover code changes
  • Ensure that unit test coverage remains above 80%
  • Update all applicable documentation
  • Follow Cluster Toolkit Contribution guidelines

@samskillman requested a review from nick-stroud July 25, 2025 17:31
@samskillman added the release-bugfix label (added to release notes under the "Bug fixes" heading) Jul 25, 2025
@samskillman requested a review from a team as a code owner July 25, 2025 17:31
@samskillman changed the title from "Fix nvidia/kernel/lustre mismatch" to "Fix build-service-image nvidia/kernel/lustre mismatch" Jul 25, 2025
@nick-stroud

Collaborator

Booted standalone A3U from built image:

nickstroud@new-build-a3u:~$ nvidia-smi
Fri Jul 25 18:28:42 2025       
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 570.158.01             Driver Version: 570.158.01     CUDA Version: 12.8     |
|-----------------------------------------+------------------------+----------------------+
# Looked good
nickstroud@new-build-a3u:~$ sudo modprobe lustre
nickstroud@new-build-a3u:~$ uname -r
6.8.0-1033-gcp
nickstroud@new-build:~$ dpkg -l | grep 1032
nickstroud@new-build:~$ dpkg -l | grep 1033
hi  linux-gcp                                           6.8.0-1033.35~22.04.1                   amd64        Complete Google Cloud Platform (GCP) Linux kernel and headers
ii  linux-gcp-6.8-headers-6.8.0-1033                    6.8.0-1033.35~22.04.1                   amd64        Header files related to Linux kernel version 6.8.0
ii  linux-gcp-6.8-tools-6.8.0-1033                      6.8.0-1033.35~22.04.1                   amd64        Linux kernel version specific tools for version 6.8.0-1033
ii  linux-headers-6.8.0-1033-gcp                        6.8.0-1033.35~22.04.1                   amd64        Linux kernel headers for version 6.8.0 on 64 bit x86 SMP
hi  linux-headers-gcp                                   6.8.0-1033.35~22.04.1                   amd64        Google Cloud Platform (GCP) Linux kernel headers
ii  linux-image-6.8.0-1033-gcp                          6.8.0-1033.35~22.04.1                   amd64        Signed kernel image gcp
hi  linux-image-gcp                                     6.8.0-1033.35~22.04.1                   amd64        Google Cloud Platform (GCP) Linux kernel image
ii  linux-modules-6.8.0-1033-gcp                        6.8.0-1033.35~22.04.1                   amd64        Linux kernel extra modules for version 6.8.0 on 64 bit x86 SMP
ii  linux-modules-nvidia-570-server-open-6.8.0-1033-gcp 6.8.0-1033.35~22.04.1+1                 amd64        Linux kernel nvidia modules for version 6.8.0-1033
hi  linux-modules-nvidia-570-server-open-gcp            6.8.0-1033.35~22.04.1+1                 amd64        Extra drivers for nvidia-570-server-open for the gcp flavour
ii  linux-tools-6.8.0-1033-gcp                          6.8.0-1033.35~22.04.1                   amd64        Linux kernel version specific tools for version 6.8.0-1033
ii  lustre-client-modules-6.8.0-1033-gcp                2.14.0-ddn214-1                         amd64        Lustre Linux kernel module (kernel 6.8.0-1033-gcp)
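The manual check above (compare `uname -r` against the kernel-versioned packages in `dpkg -l`) could be scripted roughly as follows. This is only a sketch and not part of the change in this PR: the function names, the `MODULE_PREFIXES` list, and the choice of which package families to check are assumptions drawn from the listing above.

```python
import re

# Package-name prefixes taken from the dpkg listing above. Which package
# families matter for a given image is an assumption of this sketch.
MODULE_PREFIXES = (
    "linux-image-",
    "linux-modules-nvidia-",
    "lustre-client-modules-",
)

# Matches a trailing kernel release such as "6.8.0-1033-gcp".
KERNEL_SUFFIX = re.compile(r"(\d+\.\d+\.\d+-\d+-[a-z0-9]+)$")

# Matches `dpkg -l` rows whose second status character is "i" (installed),
# which covers both "ii" and held "hi" rows; header rows do not match.
INSTALLED_ROW = re.compile(r"[uirph]i[ R]\s*(\S+)")


def installed_packages(dpkg_output):
    """Return names of installed packages from `dpkg -l` text."""
    names = []
    for line in dpkg_output.splitlines():
        match = INSTALLED_ROW.match(line)
        if match:
            # Drop a multiarch qualifier such as ":amd64" if present.
            names.append(match.group(1).split(":")[0])
    return names


def check_kernel_modules(dpkg_output, kernel_release):
    """Return (missing, stale) for the given kernel release.

    missing: prefixes with no installed package built for kernel_release.
    stale:   installed packages under those prefixes built for another kernel.
    """
    missing, stale = [], []
    names = installed_packages(dpkg_output)
    for prefix in MODULE_PREFIXES:
        found = False
        for name in names:
            if not name.startswith(prefix):
                continue
            match = KERNEL_SUFFIX.search(name)
            if match is None:
                continue  # metapackage such as linux-image-gcp
            if match.group(1) == kernel_release:
                found = True
            else:
                stale.append(name)
        if not found:
            missing.append(prefix)
    return missing, stale
```

On a booted instance, `dpkg_output` would come from `dpkg -l` and `kernel_release` from `uname -r` (or `platform.release()` in Python). Two empty lists mean every checked package family matches the running kernel.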

@nick-stroud merged commit aa819ed into GoogleCloudPlatform:develop Jul 25, 2025
15 of 68 checks passed
Labels

release-bugfix Added to release notes under the "Bug fixes" heading.