Skip to content

Release candidate : v1.83.0#5277

Merged
sudheer-quad merged 101 commits into
mainfrom
release-candidate
Mar 5, 2026
Merged

Release candidate : v1.83.0#5277
sudheer-quad merged 101 commits into
mainfrom
release-candidate

Conversation

@sudheer-quad

Copy link
Copy Markdown
Contributor

Release candidate: v1.83.0

Submission Checklist

NOTE: Community submissions can take up to 2 weeks to be reviewed.

Please take the following actions before submitting this pull request.

  • Fork your PR branch from the Toolkit "develop" branch (not main)
  • Test all changes with pre-commit in a local branch #
  • Confirm that "make tests" passes all tests
  • Add or modify unit tests to cover code changes
  • Ensure that unit test coverage remains above 80%
  • Update all applicable documentation
  • Follow Cluster Toolkit Contribution guidelines #

AdarshK15 and others added 30 commits February 5, 2026 03:55
Bumps [cryptography](https://github.com/pyca/cryptography) from 46.0.4 to 46.0.5.
- [Changelog](https://github.com/pyca/cryptography/blob/main/CHANGELOG.rst)
- [Commits](pyca/cryptography@46.0.4...46.0.5)

---
updated-dependencies:
- dependency-name: cryptography
  dependency-version: 46.0.5
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>
@gemini-code-assist

Copy link
Copy Markdown
Contributor

Summary of Changes

Hello @sudheer-quad, I'm Gemini Code Assist1! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request prepares the toolkit for release v1.83.0 by updating core dependencies, introducing new GKE examples for advanced hardware and provisioning models, and enhancing the internal validation framework for blueprints. It also refines existing modules for better functionality and maintainability.

Highlights

  • Version Bump: The toolkit has been updated to version v1.83.0.
  • Terraform Version Update: The minimum required Terraform version has been updated to 1.12.2 across various modules and examples.
  • New GKE A4X-Max Bare Metal Example: A new example blueprint for deploying GKE clusters with A4X-Max Bare Metal nodes has been added, including support for ASAPd Lite and NCCL JobSet.
  • New GKE TPU DWS Queued Provisioning Examples: New examples for GKE TPU 7x and TPU v6e with Dynamic Workload Scheduler (DWS) Queued Provisioning have been introduced, demonstrating dynamic scaling and job orchestration.
  • Enhanced Blueprint Validation: New required and conditional validators have been added to the metadata validation system, along with documentation, to enforce more robust blueprint configurations.
  • Slurm Cluster Name Validation Update: The regex for slurm_cluster_name has been updated to allow hyphens and extend the maximum length to 20 characters.
  • Kubectl Apply Module Improvements: The kubectl-apply module now leverages Helm for installing Kueue and the NCCL gIB plugin, and includes support for ASAPd Lite installation.
  • GKE Node Pool Linux Configuration: The gke-node-pool module now supports linux_node_config for specifying sysctls and hugepages, and automatically applies the cloud.google.com/gke-queued label for DWS.
  • Deprecated Example Removal: The a3u-slurm-ubuntu-gcs example blueprint has been removed due to deprecation.
Changelog
  • .yamllint
    • Added an ignore path for raw-config-chart templates.
  • Makefile
    • Updated the minimum Terraform version to 1.12.2.
  • cmd/create.go
    • Refactored blueprint validator error messages for clarity.
  • cmd/root.go
    • Updated the toolkit version string to v1.83.0.
  • community/examples/hpc-build-slurm-image.yaml
    • Updated Python to 3.12 and Ansible to 8.7.0 for Slurm builds on Rocky8.
  • community/front-end/ofe/infrastructure_files/vpc_tf/GCP/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/front-end/ofe/infrastructure_files/vpc_tf/GCP/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • community/front-end/ofe/infrastructure_files/workbench_tf/google/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/front-end/ofe/infrastructure_files/workbench_tf/google/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • community/front-end/ofe/infrastructure_files/workbench_tf/google/wait-for-startup/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/front-end/ofe/infrastructure_files/workbench_tf/google/wait-for-startup/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • community/front-end/ofe/requirements.txt
    • Updated cryptography package version to 46.0.5.
  • community/front-end/ofe/tf/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/front-end/ofe/tf/network/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/front-end/ofe/tf/network/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • community/front-end/ofe/tf/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/compute/gke-nodeset/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/compute/gke-nodeset/variables.tf
    • Updated validation regex for 'slurm_cluster_name' to allow up to 19 characters after the initial letter.
  • community/modules/compute/gke-nodeset/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module version to v1.83.0.
  • community/modules/compute/gke-partition/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/compute/gke-partition/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module version to v1.83.0.
  • community/modules/compute/htcondor-execute-point/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/compute/htcondor-execute-point/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module version to v1.83.0.
  • community/modules/compute/mig/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/compute/mig/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module version to v1.83.0.
  • community/modules/compute/notebook/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/compute/notebook/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/compute/schedmd-slurm-gcp-v6-nodeset-dynamic/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/compute/schedmd-slurm-gcp-v6-nodeset-dynamic/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module version to v1.83.0.
  • community/modules/compute/schedmd-slurm-gcp-v6-nodeset-tpu/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/compute/schedmd-slurm-gcp-v6-nodeset-tpu/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module version to v1.83.0.
  • community/modules/compute/schedmd-slurm-gcp-v6-nodeset/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/compute/schedmd-slurm-gcp-v6-nodeset/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module version to v1.83.0.
  • community/modules/compute/schedmd-slurm-gcp-v6-partition/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/compute/schedmd-slurm-gcp-v6-partition/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module version to v1.83.0.
  • community/modules/container/artifact-registry/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/container/artifact-registry/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/database/bigquery-dataset/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/database/bigquery-dataset/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/database/bigquery-table/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/database/bigquery-table/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/database/slurm-cloudsql-federation/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/database/slurm-cloudsql-federation/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module versions to v1.83.0.
  • community/modules/file-system/nfs-server/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/file-system/nfs-server/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module version to v1.83.0.
  • community/modules/file-system/weka-client/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/file-system/weka-client/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/files/fsi-montecarlo-on-batch/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/files/fsi-montecarlo-on-batch/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module versions to v1.83.0.
  • community/modules/internal/slurm-gcp/instance/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/internal/slurm-gcp/instance/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/internal/slurm-gcp/instance_template/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/internal/slurm-gcp/instance_template/variables.tf
    • Updated validation regex for 'slurm_cluster_name' to allow up to 19 characters after the initial letter.
  • community/modules/internal/slurm-gcp/instance_template/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/internal/slurm-gcp/internal_instance_template/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/internal/slurm-gcp/internal_instance_template/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/internal/slurm-gcp/login/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/internal/slurm-gcp/login/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module version to v1.83.0.
  • community/modules/internal/slurm-gcp/nodeset_tpu/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/internal/slurm-gcp/nodeset_tpu/metadata.yaml
    • Added required validator for 'zone'.
    • Added conditional validator for 'subnetwork' and 'enable_public_ip'.
  • community/modules/internal/slurm-gcp/nodeset_tpu/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/management/dependencies-installer/README.md
    • Updated Terraform version requirement to 1.12.2.
    • Simplified description for 'gke_cluster_exists'.
  • community/modules/management/dependencies-installer/helm_install/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/management/dependencies-installer/helm_install/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/management/dependencies-installer/kubernetes_manifest/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/management/dependencies-installer/kubernetes_manifest/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/management/dependencies-installer/variables.tf
    • Simplified description for 'gke_cluster_exists'.
  • community/modules/management/dependencies-installer/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/project/service-enablement/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/project/service-enablement/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module version to v1.83.0.
  • community/modules/pubsub/bigquery-sub/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/pubsub/bigquery-sub/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module versions to v1.83.0.
  • community/modules/pubsub/topic/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/pubsub/topic/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module version to v1.83.0.
  • community/modules/remote-desktop/chrome-remote-desktop/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/remote-desktop/chrome-remote-desktop/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/scheduler/htcondor-access-point/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/scheduler/htcondor-access-point/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module version to v1.83.0.
  • community/modules/scheduler/htcondor-central-manager/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/scheduler/htcondor-central-manager/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module version to v1.83.0.
  • community/modules/scheduler/htcondor-pool-secrets/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/scheduler/htcondor-pool-secrets/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module version to v1.83.0.
  • community/modules/scheduler/htcondor-service-accounts/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/scheduler/htcondor-service-accounts/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/scheduler/htcondor-setup/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/scheduler/htcondor-setup/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/README.md
    • Updated Terraform version requirement to 1.12.2.
    • Added input variable 'subnetwork_stack_type'.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/controller.tf
    • Added 'stack_type' to the network interface configuration.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/metadata.yaml
    • Added deprecated validator for 'enable_smt' and 'disable_smt'.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/modules/cleanup_compute/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/modules/cleanup_compute/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/modules/cleanup_tpu/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/modules/cleanup_tpu/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/modules/slurm_files/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/modules/slurm_files/metadata.yaml
    • Added conditional validator for 'enable_hybrid' and 'slurm_control_host'.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/modules/slurm_files/variables.tf
    • Updated validation regex for 'slurm_cluster_name' to allow up to 19 characters after the initial letter.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/modules/slurm_files/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/variables.tf
    • Updated validation regex for 'slurm_cluster_name' to allow up to 19 characters after the initial letter.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/variables_controller_instance.tf
    • Added new variable 'subnetwork_stack_type'.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module version to v1.83.0.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-login/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-login/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module version to v1.83.0.
  • community/modules/scheduler/slinky/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/scheduler/slinky/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/scripts/gcloud/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/scripts/gcloud/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/scripts/htcondor-install/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/scripts/htcondor-install/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/scripts/ramble-execute/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/scripts/ramble-execute/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/scripts/ramble-setup/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/scripts/ramble-setup/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/scripts/spack-execute/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/scripts/spack-execute/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/scripts/spack-setup/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/scripts/spack-setup/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/scripts/wait-for-startup/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/scripts/wait-for-startup/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module version to v1.83.0.
  • community/modules/scripts/windows-startup-script/README.md
    • Updated Terraform version requirement to 1.12.2.
  • community/modules/scripts/windows-startup-script/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module version to v1.83.0.
  • docs/blueprint-validation.md
    • Added documentation for 'Allowed Enum Validator'.
    • Added documentation for 'Required Validator'.
    • Added documentation for 'Conditional Validator'.
  • examples/README.md
    • Removed 'ps-slurm.yaml' from the examples list.
    • Updated links and descriptions for GKE consumption options, including A3 Ultra, TPU v6e, and TPU 7x.
  • examples/gke-a3-megagpu/README.md
    • Added a section for additional consumption options (Spot VMs, DWS Flex-start).
  • examples/gke-a3-ultragpu/gke-a3-ultragpu.yaml
    • Removed 'wait: false' setting for Kueue installation.
  • examples/gke-a4/gke-a4.yaml
    • Removed 'wait: false' setting for Kueue installation.
  • examples/gke-a4x-max-bm/README.md
    • Added new README for GKE A4X-Max Bare Metal example, including requirements, creation, NCCL testing, and cleanup instructions.
  • examples/gke-a4x-max-bm/asapd-lite-installer.yaml
    • Added new YAML file for ASAPd Lite DaemonSet installation.
  • examples/gke-a4x-max-bm/gke-a4x-max-bm-deployment.yaml
    • Added new deployment YAML for GKE A4X-Max Bare Metal cluster.
  • examples/gke-a4x-max-bm/gke-a4x-max-bm.yaml
    • Added new blueprint YAML for GKE A4X-Max Bare Metal cluster.
  • examples/gke-a4x-max-bm/nccl-jobset-example.yaml
    • Added new YAML file for NCCL JobSet example on A4X-Max.
  • examples/gke-a4x-max-bm/nvidia-dra-driver.yaml
    • Added new YAML file for NVIDIA DRA driver.
  • examples/gke-a4x/gke-a4x.yaml
    • Corrected the source of truth link for GCS Fuse CSI in comments.
  • examples/gke-consumption-options/dws-flex-start-queued-provisioning/README.md
    • Updated formatting and wording for clarity.
    • Added hardware-specific guides for TPU v6e and TPU 7x.
  • examples/gke-consumption-options/dws-flex-start-queued-provisioning/gke-tpu-7x/README.md
    • Added new README for TPU 7x DWS Queued Provisioning example.
  • examples/gke-consumption-options/dws-flex-start-queued-provisioning/gke-tpu-7x/gke-tpu-7x-deployment.yaml
    • Added new deployment YAML for TPU 7x DWS Queued Provisioning.
  • examples/gke-consumption-options/dws-flex-start-queued-provisioning/gke-tpu-7x/gke-tpu-7x.yaml
    • Added new blueprint YAML for TPU 7x DWS Queued Provisioning.
  • examples/gke-consumption-options/dws-flex-start-queued-provisioning/gke-tpu-7x/tpu-7x-test-job-gcs.yaml
    • Added new YAML for TPU 7x GCS FIO benchmark job.
  • examples/gke-consumption-options/dws-flex-start-queued-provisioning/gke-tpu-7x/tpu-7x-test-job.yaml
    • Added new YAML for TPU 7x simple test job.
  • examples/gke-consumption-options/dws-flex-start-queued-provisioning/gke-tpu-v6e/README.md
    • Added new README for TPU v6e DWS Queued Provisioning example.
  • examples/gke-consumption-options/dws-flex-start-queued-provisioning/gke-tpu-v6e/gke-tpu-v6e-deployment.yaml
    • Added new deployment YAML for TPU v6e DWS Queued Provisioning.
  • examples/gke-consumption-options/dws-flex-start-queued-provisioning/gke-tpu-v6e/gke-tpu-v6e.yaml
    • Added new blueprint YAML for TPU v6e DWS Queued Provisioning.
  • examples/gke-consumption-options/dws-flex-start-queued-provisioning/gke-tpu-v6e/tpu-v6e-test-job-gcs.yaml
    • Added new YAML for TPU v6e GCS FIO benchmark job.
  • examples/gke-consumption-options/dws-flex-start-queued-provisioning/gke-tpu-v6e/tpu-v6e-test-job.yaml
    • Added new YAML for TPU v6e simple test job.
  • examples/gke-consumption-options/dws-flex-start-queued-provisioning/tpu-dws-queues.yaml.tftpl
    • Added new template for TPU DWS Kueue queues configuration.
  • examples/gke-g4/README.md
    • Updated blog post link for G4 VMs.
    • Added instructions for creating a GCS bucket for Terraform state.
  • examples/gke-tpu-7x/README.md
    • Added links to new DWS Queued Provisioning examples for TPU 7x.
  • examples/gke-tpu-7x/gke-tpu-7x-advanced.yaml
    • Added Kueue configuration path variable.
  • examples/gke-tpu-7x/gke-tpu-7x.yaml
    • Added Kueue configuration path variable and integrated Kueue installation settings.
  • examples/gke-tpu-v6e/README.md
    • Added links to new DWS Queued Provisioning examples for TPU v6e.
  • examples/gke-tpu-v6e/gke-tpu-v6e-advanced.yaml
    • Added Kueue configuration path variable.
  • examples/gke-tpu-v6e/gke-tpu-v6e.yaml
    • Added Kueue configuration path variable and integrated Kueue installation settings.
  • examples/hypercompute_clusters/a3u-slurm-ubuntu-gcs/README.md
    • Removed deprecated example blueprint.
  • examples/hypercompute_clusters/a3u-slurm-ubuntu-gcs/a3u-slurm-ubuntu-gcs.yaml
    • Removed deprecated example blueprint.
  • examples/hypercompute_clusters/a3u-slurm-ubuntu-gcs/deployment.yaml
    • Removed deprecated example deployment file.
  • examples/hypercompute_clusters/a3u-slurm-ubuntu-gcs/system_benchmarks/run-hpl-via-ramble.sh
    • Removed deprecated benchmark script.
  • examples/hypercompute_clusters/a3u-slurm-ubuntu-gcs/system_benchmarks/run-nemo-via-ramble.sh
    • Removed deprecated benchmark script.
  • examples/machine-learning/a3-highgpu-8g/README.md
    • Updated gcluster deploy commands to use full paths for blueprints.
  • examples/machine-learning/a3-highgpu-8g/a3high-slurm-deployment.yaml
    • Updated description for slurm_cluster_name to allow 20 characters and hyphens.
  • examples/machine-learning/a3-megagpu-8g/a3mega-slurm-blueprint.yaml
    • Updated base image version to v20260210.
    • Updated DGCM package installation to hold specific versions.
  • examples/machine-learning/a3-megagpu-8g/a3mega-slurm-deployment.yaml
    • Updated variable descriptions for clarity.
    • Updated slurm_cluster_name validation description.
  • examples/machine-learning/a3-ultragpu-8g/README.md
    • Added section for additional provisioning models (DWS Flex-Start, Spot VMs, reservations).
    • Added instructions for deploying A3-Ultra VMs.
  • examples/machine-learning/a3-ultragpu-8g/a3ultra-slurm-blueprint.yaml
    • Updated base image version to v20260210.
    • Updated DGCM package installation to hold specific versions.
  • examples/machine-learning/a3-ultragpu-8g/a3ultra-vm-deployment.yaml
    • Added new deployment YAML for A3 Ultra VMs.
  • examples/machine-learning/a3-ultragpu-8g/a3ultra-vm.yaml
    • Updated region and zone variables to be supplied by user.
    • Updated instance count to use number_of_vms variable.
  • examples/machine-learning/a4-highgpu-8g/README.md
    • Added section for additional provisioning models (DWS Flex-Start, Spot VMs, reservations).
    • Updated VM deployment instructions and cleanup section.
  • examples/machine-learning/a4-highgpu-8g/a4high-slurm-blueprint.yaml
    • Updated base image version to v20260210.
    • Updated DGCM package installation to hold specific versions.
  • examples/machine-learning/a4-highgpu-8g/a4high-vm-deployment.yaml
    • Added comment for a4h_provisioning_model.
  • examples/machine-learning/a4-highgpu-8g/system_benchmarks/run-nccl-tests-via-ramble.sh
    • Updated Ramble git clone to not use --depth 1 and added a specific commit checkout.
    • Added overwrite_inventories: true to Ramble config.
    • Updated environment variables for NCCL tests, including OMPI_MCA_btl_tcp_if_include, UCX_NET_DEVICES, NCCL_SOCKET_IFNAME, NCCL_IB_HCA, LD_LIBRARY_PATH, NCCL_IB_GID_INDEX, NCCL_DEBUG, and NCCL_DEBUG_SUBSYS.
    • Updated --gpus-per-node to 4 and added ulimit -s 8192 to sbatch script.
    • Formatted output with column -t.
  • examples/machine-learning/a4x-highgpu-4g/README.md
    • Updated overview to include standalone VM group deployment.
    • Added new section for A4X-High VM Deployment, including configuration, deployment, and destruction instructions.
  • examples/machine-learning/a4x-highgpu-4g/a4x-vm-deployment.yaml
    • Added new deployment YAML for A4X-High VMs.
  • examples/machine-learning/a4x-highgpu-4g/a4x-vm.yaml
    • Updated base image to ubuntu-accelerator-2404-arm64-with-nvidia-580-v20260218.
    • Added gpu_topology: "1x72" to placement policy.
  • examples/machine-learning/a4x-highgpu-4g/a4xhigh-slurm-blueprint.yaml
    • Updated base image to ubuntu-accelerator-2404-arm64-with-nvidia-580-v20260218.
  • examples/machine-learning/a4x-highgpu-4g/system_benchmarks/run-nccl-tests-via-ramble.sh
    • Updated Ramble git clone to not use --depth 1 and added a specific commit checkout.
    • Added overwrite_inventories: true to Ramble config.
    • Updated environment variables for NCCL tests, including OMPI_MCA_btl_tcp_if_include, UCX_NET_DEVICES, NCCL_SOCKET_IFNAME, NCCL_IB_HCA, LD_LIBRARY_PATH, NCCL_IB_GID_INDEX, NCCL_DEBUG, and NCCL_DEBUG_SUBSYS.
    • Updated --gpus-per-node to 4 and added ulimit -s 8192 to sbatch script.
    • Formatted output with column -t.
  • examples/machine-learning/a4x-maxgpu-4g-metal/README.md
    • Added new README for A4X Max Bare Metal Slurm cluster, detailing overview, components, custom image scripts, and deployment instructions.
  • examples/machine-learning/a4x-maxgpu-4g-metal/a4xmax-bm-slurm-blueprint.yaml
    • Added new blueprint for A4X Max Bare Metal Slurm cluster.
  • examples/machine-learning/a4x-maxgpu-4g-metal/a4xmax-bm-slurm-deployment.yaml
    • Added new deployment YAML for A4X Max Bare Metal Slurm cluster.
  • examples/machine-learning/a4x-maxgpu-4g-metal/system_benchmarks/README.md
    • Renamed from examples/hypercompute_clusters/a3u-slurm-ubuntu-gcs/system_benchmarks/README.md.
    • Updated content to reflect A4X Max benchmarks, removing HPL and NeMo specific sections.
  • examples/machine-learning/a4x-maxgpu-4g-metal/system_benchmarks/run-nccl-tests-via-ramble.sh
    • Renamed from examples/hypercompute_clusters/a3u-slurm-ubuntu-gcs/system_benchmarks/run-nccl-tests-via-ramble.sh.
    • Updated script for A4X Max environment, including new container image, GPU count, and network interface settings.
  • examples/machine-learning/build-service-images/a3m/blueprint.yaml
    • Updated source image version to v20260210.
  • examples/machine-learning/build-service-images/common/blueprint.yaml
    • Updated source image version to v20260210.
  • modules/README.md
    • Added 'kubectl-apply' module to the Management section.
    • Removed 'kubernetes-operations' from the Scripts section.
  • modules/compute/gke-job-template/README.md
    • Updated Terraform version requirement to 1.12.2.
  • modules/compute/gke-job-template/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • modules/compute/gke-node-pool/README.md
    • Updated Terraform version requirement to 1.12.2.
    • Added input description for 'linux_node_config'.
  • modules/compute/gke-node-pool/main.tf
    • Added 'cloud.google.com/gke-queued' label for DWS Queued Provisioning.
    • Updated 'linux_node_config' usage for sysctls and hugepages.
    • Refined 'placement_policy' precondition for DWS Queued Provisioning with TPUs.
  • modules/compute/gke-node-pool/metadata.yaml
    • Added conditional validators for 'enable_flex_start' and 'enable_queued_provisioning'.
  • modules/compute/gke-node-pool/variables.tf
    • Added new variable 'linux_node_config' for Linux node configurations.
  • modules/compute/gke-node-pool/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module version to v1.83.0.
  • modules/compute/resource-policy/README.md
    • Updated Terraform version requirement to 1.12.2.
  • modules/compute/resource-policy/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module version to v1.83.0.
  • modules/compute/vm-instance/README.md
    • Updated Terraform version requirement to 1.12.2.
  • modules/compute/vm-instance/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module version to v1.83.0.
  • modules/file-system/cloud-storage-bucket/README.md
    • Updated Terraform version requirement to 1.12.2.
  • modules/file-system/cloud-storage-bucket/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module version to v1.83.0.
  • modules/file-system/filestore/README.md
    • Updated Terraform version requirement to 1.12.2.
  • modules/file-system/filestore/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module version to v1.83.0.
  • modules/file-system/gke-persistent-volume/README.md
    • Updated Terraform version requirement to 1.12.2.
  • modules/file-system/gke-persistent-volume/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module version to v1.83.0.
  • modules/file-system/gke-storage/README.md
    • Updated Terraform version requirement to 1.12.2.
  • modules/file-system/gke-storage/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module version to v1.83.0.
  • modules/file-system/managed-lustre/README.md
    • Updated Terraform version requirement to 1.12.2.
  • modules/file-system/managed-lustre/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module version to v1.83.0.
  • modules/file-system/netapp-storage-pool/README.md
    • Updated Terraform version requirement to 1.12.2.
  • modules/file-system/netapp-storage-pool/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module version to v1.83.0.
  • modules/file-system/netapp-volume/README.md
    • Updated Terraform version requirement to 1.12.2.
  • modules/file-system/netapp-volume/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module version to v1.83.0.
  • modules/file-system/parallelstore/README.md
    • Updated Terraform version requirement to 1.12.2.
  • modules/file-system/parallelstore/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • modules/file-system/pre-existing-network-storage/README.md
    • Updated Terraform version requirement to 1.12.2.
  • modules/file-system/pre-existing-network-storage/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • modules/internal/gpu-definition/README.md
    • Updated Terraform version requirement to 1.12.2.
  • modules/internal/gpu-definition/main.tf
    • Added A4X-Max GPU machine types ('a4x-maxgpu-4g-metal', 'a4x-maxgpu-4g-metal-nolssd').
  • modules/internal/gpu-definition/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • modules/internal/instance_validations/README.md
    • Updated Terraform version requirement to 1.12.2.
  • modules/internal/instance_validations/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • modules/internal/network-attachment/README.md
    • Updated Terraform version requirement to 1.12.2.
  • modules/internal/network-attachment/main.tf
    • Updated Terraform version requirement to 1.12.2.
  • modules/internal/tpu-definition/README.md
    • Updated Terraform version requirement to 1.12.2.
    • Added 'kubernetes_label' output description.
  • modules/internal/tpu-definition/main.tf
    • Updated Terraform version requirement to 1.12.2.
  • modules/internal/tpu-definition/outputs.tf
    • Added 'kubernetes_label' output for TPU discovery.
  • modules/internal/vpc_peering/README.md
    • Updated Terraform version requirement to 1.12.2.
  • modules/internal/vpc_peering/main.tf
    • Updated Terraform version requirement to 1.12.2.
  • modules/management/kubectl-apply/README.md
    • Updated description to include NCCL gIB plugin installation.
    • Added example for installing the NCCL gIB plugin.
  • modules/management/kubectl-apply/helm_install/README.md
    • Updated Terraform version requirement to 1.12.2.
    • Added 'dependencies' input description.
  • modules/management/kubectl-apply/helm_install/main.tf
    • Added 'dependencies' as an implicit dependency anchor.
  • modules/management/kubectl-apply/helm_install/variables.tf
    • Added new variable 'dependencies'.
  • modules/management/kubectl-apply/helm_install/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • modules/management/kubectl-apply/kubectl/README.md
    • Updated Terraform version requirement to 1.12.2.
  • modules/management/kubectl-apply/kubectl/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • modules/management/kubectl-apply/main.tf
    • Refactored Kueue configuration to use a Helm chart.
    • Added install_asapd_lite local variable.
    • Changed install_gib to use the Helm install module.
    • Added install_asapd_lite module for ASAPd Lite DaemonSet.
  • modules/management/kubectl-apply/raw-config-chart/Chart.yaml
    • Added new Helm chart for applying raw Kubernetes manifests.
  • modules/management/kubectl-apply/raw-config-chart/templates/manifests.yaml
    • Added new template for rendering raw Kubernetes manifests.
  • modules/management/kubectl-apply/raw-config-chart/values.yaml
    • Added new values file for raw config Helm chart.
  • modules/management/kubectl-apply/variables.tf
    • Removed 'wait' option from Kueue variable.
    • Simplified description for 'gke_cluster_exists'.
    • Added 'accelerator_type' to 'nvidia_dra_driver' variable.
    • Added new variable 'system_node_pool_id'.
    • Added new variable 'asapd_lite' for ASAPd Lite DaemonSet installation.
  • modules/management/kubectl-apply/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module version to v1.83.0.
  • modules/monitoring/dashboard/README.md
    • Updated Terraform version requirement to 1.12.2.
  • modules/monitoring/dashboard/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module version to v1.83.0.
  • modules/network/firewall-rules/README.md
    • Updated Terraform version requirement to 1.12.2.
  • modules/network/firewall-rules/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module version to v1.83.0.
  • modules/network/gpu-rdma-vpc/README.md
    • Updated Terraform version requirement to 1.12.2.
  • modules/network/gpu-rdma-vpc/main.tf
    • Updated logic for GKE output subnets and NIC types for RoCE metal configurations.
  • modules/network/gpu-rdma-vpc/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • modules/network/multivpc/README.md
    • Updated Terraform version requirement to 1.12.2.
  • modules/network/multivpc/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • modules/network/pre-existing-subnetwork/README.md
    • Updated Terraform version requirement to 1.12.2.
  • modules/network/pre-existing-subnetwork/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module version to v1.83.0.
  • modules/network/pre-existing-vpc/README.md
    • Updated Terraform version requirement to 1.12.2.
  • modules/network/pre-existing-vpc/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module version to v1.83.0.
  • modules/network/private-service-access/README.md
    • Updated Terraform version requirement to 1.12.2.
  • modules/network/private-service-access/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module version to v1.83.0.
  • modules/network/vpc/README.md
    • Added input descriptions for 'subnetwork_stack_type' and 'subnetwork_ipv6_access_type'.
    • Updated Terraform version requirement to 1.12.2.
  • modules/network/vpc/main.tf
    • Added 'stack_type' and 'ipv6_access_type' to default subnetwork configuration.
    • Added 'output_primary_subnetwork_stack_type' to local outputs.
  • modules/network/vpc/metadata.yaml
    • Added deprecated validators for 'primary_subnetwork', 'additional_subnetworks', and 'subnetwork_size'.
  • modules/network/vpc/outputs.tf
    • Added new output 'subnetwork_stack_type'.
  • modules/network/vpc/variables.tf
    • Added new variables 'subnetwork_stack_type' and 'subnetwork_ipv6_access_type'.
  • modules/network/vpc/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • modules/project/service-account/README.md
    • Updated Terraform version requirement to 1.12.2.
  • modules/project/service-account/metadata.yaml
    • Added deprecated validators for 'descriptions', 'names', and 'prefix'.
  • modules/project/service-account/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • modules/scheduler/batch-job-template/README.md
    • Updated Terraform version requirement to 1.12.2.
  • modules/scheduler/batch-job-template/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • modules/scheduler/batch-login-node/README.md
    • Updated Terraform version requirement to 1.12.2.
  • modules/scheduler/batch-login-node/metadata.yaml
    • Added deprecated validators for 'job_template_contents', 'job_filename', and 'job_id'.
  • modules/scheduler/batch-login-node/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module version to v1.83.0.
  • modules/scheduler/gke-cluster/README.md
    • Updated Terraform version requirement to 1.12.2.
    • Added input description for 'enable_shielded_nodes'.
  • modules/scheduler/gke-cluster/main.tf
    • Used 'var.enable_shielded_nodes' instead of hardcoded 'true' for shielded nodes.
  • modules/scheduler/gke-cluster/outputs.tf
    • Simplified description for 'gke_cluster_exists'.
    • Added new output 'system_node_pool_id'.
  • modules/scheduler/gke-cluster/variables.tf
    • Added new variable 'enable_shielded_nodes'.
  • modules/scheduler/gke-cluster/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module version to v1.83.0.
  • modules/scheduler/pre-existing-gke-cluster/README.md
    • Updated Terraform version requirement to 1.12.2.
  • modules/scheduler/pre-existing-gke-cluster/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module version to v1.83.0.
  • modules/scripts/startup-script/README.md
    • Updated Terraform version requirement to 1.12.2.
  • modules/scripts/startup-script/metadata.yaml
    • Added deprecated validators for 'enable_docker_world_writable' and 'install_docker'.
  • modules/scripts/startup-script/versions.tf
    • Updated Terraform version requirement to 1.12.2.
    • Updated module version to v1.83.0.
  • pkg/config/config.go
    • Updated matchSlurmClusterNameExp regex to allow hyphens and increase max length to 20 characters.
  • pkg/config/config_test.go
    • Updated tests for slurm_cluster_name regex to reflect new length and character allowance.
  • pkg/config/deprecation_validate.go
    • Added new file for deprecation validation logic, including warnings for deprecated modules.
  • pkg/config/deprecation_validate_test.go
    • Added new file for deprecation validation tests.
  • pkg/config/errors.go
    • Updated errMsgSlurmClusterNameReqs to reflect the new regex for Slurm cluster names.
  • pkg/config/expand.go
    • Updated default Google provider versions to >= 6.9.0, <= 7.18.0.
  • pkg/config/expand_test.go
    • Updated tests for default Google provider versions.
  • pkg/config/validate.go
    • Added validateDeprecation to the module validation process.
  • pkg/logging/logging.go
    • Added timestamping and colored output for log messages.
  • pkg/modulereader/metadata.go
    • Added DeprecationDate and AlternativeModule fields to MetadataGhpc struct for deprecation management.
  • pkg/modulereader/modules/test_role/test_module/versions.tf
    • Updated Terraform version requirement to 1.12.2.
  • pkg/modulewriter/modulewriter_test.go
    • Updated Terraform version requirement in test output to 1.12.2.
  • pkg/modulewriter/tfwriter.go
    • Updated Terraform required version to 1.12.2.
  • pkg/shell/packer.go
    • Integrated newTimestampWriter for Packer command output.
  • pkg/shell/terraform.go
    • Integrated newTimestampWriter for Terraform command output.
  • pkg/shell/writer.go
    • Added new file for timestampWriter utility to prepend timestamps to log lines.
  • pkg/shell/writer_test.go
    • Added new file for timestampWriter tests.
  • pkg/sourcereader/modules/network/vpc/main.tf
    • Updated Terraform version requirement to 1.12.2.
  • pkg/validators/cloud.go
    • Added reservationNameRegex and resKeyRegex for parsing reservation names.
    • Added findReservationInOtherZones function to search for reservations.
    • Added TestReservationExists function to validate reservation existence.
    • Added testReservationExists function for blueprint validation.
  • pkg/validators/metadata_validator_helpers.go
    • Improved error handling in getModuleSettingValues.
    • Added parseString helper function.
    • Added convertToCty, ValuesMatch, and formatValue helper functions for cty.Value manipulation.
  • pkg/validators/metadata_validators.go
    • Added RequiredValidator implementation for enforcing variable presence/absence.
    • Added ConditionalValidator implementation for enforcing dependent variable conditions.
  • pkg/validators/metadata_validators_test.go
    • Added comprehensive tests for ConditionalValidator and RequiredValidator.
    • Added tests for isVarSet, convertToCty, and ValuesMatch helper functions.
  • pkg/validators/registry.go
    • Registered RequiredValidator and ConditionalValidator.
  • pkg/validators/validators.go
    • Added testReservationExistsName to the list of available validators.
    • Integrated automatic reservation validation for detected reservation variables.
  • pkg/validators/validators_test.go
    • Added tests for default reservation validators.
Ignored Files
  • Ignored by pattern: .github/workflows/** (2)
    • .github/workflows/label-external.yml
    • .github/workflows/pr-precommit.yml
Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature Command Description
Code Review /gemini review Performs a code review for the current pull request in its current state.
Pull Request Summary /gemini summary Provides a summary of the current pull request in its current state.
Comment @gemini-code-assist Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help /gemini help Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Footnotes

  1. Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution.

@gemini-code-assist

Copy link
Copy Markdown
Contributor

Summary of Changes

Hello @sudheer-quad, I'm Gemini Code Assist1! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request prepares the toolkit for release v1.83.0 by updating core dependencies, expanding GKE cluster deployment options to include advanced hardware like A4X-Max Bare Metal and DWS Queued Provisioning for TPUs, and significantly enhancing the blueprint validation framework. It also includes various quality-of-life improvements such as clearer error messages and timestamped logging.

Highlights

  • Version Update: The toolkit's version has been bumped to v1.83.0 across various modules and the main application.
  • Terraform Version Alignment: The minimum required Terraform version has been updated to 1.12.2 in the Makefile and consistently applied to required_version declarations across numerous Terraform modules.
  • Enhanced GKE Cluster Capabilities: New examples and modules have been introduced to support A4X-Max Bare Metal GKE clusters, including specific configurations for NVIDIA GB300 GPUs, MRDMA networking, and ASAPd-Lite daemonset installation. Additionally, DWS Queued Provisioning support has been added for TPU v6e and TPU 7x nodes, enabling dynamic scaling and all-or-nothing job scheduling.
  • Improved Blueprint Validation: New Allowed Enum, Required, and Conditional validators have been added to the blueprint validation system, enhancing the ability to enforce complex input rules and deprecate variables. Automatic reservation checks are now included in default validators.
  • Refactored Error Messaging and Logging: Error messages in cmd/create.go have been refactored for clarity using a multi-line string constant. The logging system now includes timestamped and colored output for better readability and debugging.
  • Slurm Cluster Name Flexibility: The validation regex for slurm_cluster_name has been updated to allow hyphens and extend the maximum length to 20 characters, providing more flexibility for cluster naming.
  • Kueue and NCCL gIB Plugin Integration: The kubectl-apply module now supports installing the NCCL gIB plugin and refactors Kueue configuration application using a new Helm chart for raw manifests, improving deployment flexibility and robustness.
Changelog
  • .yamllint
    • Added a new ignore path for raw-config-chart templates.
  • Makefile
    • Updated MIN_TERRAFORM_VERSION from 1.5.7 to 1.12.2.
  • cmd/create.go
    • Refactored multi-line error messages into a single constant string for improved readability.
  • cmd/root.go
    • Updated the application version string to v1.83.0.
  • community/examples/hpc-build-slurm-image.yaml
    • Updated Python version to 3.12 and Ansible version to 8.7.0 for Slurm build on Rocky8.
  • community/front-end/ofe/infrastructure_files/vpc_tf/GCP/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/front-end/ofe/infrastructure_files/vpc_tf/GCP/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • community/front-end/ofe/infrastructure_files/workbench_tf/google/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/front-end/ofe/infrastructure_files/workbench_tf/google/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • community/front-end/ofe/infrastructure_files/workbench_tf/google/wait-for-startup/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/front-end/ofe/infrastructure_files/workbench_tf/google/wait-for-startup/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • community/front-end/ofe/requirements.txt
    • Updated cryptography package version from 46.0.4 to 46.0.5.
  • community/front-end/ofe/tf/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/front-end/ofe/tf/network/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/front-end/ofe/tf/network/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • community/front-end/ofe/tf/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • community/modules/compute/gke-nodeset/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/compute/gke-nodeset/variables.tf
    • Modified slurm_cluster_name validation regex to allow up to 19 characters (total 20) and hyphens.
  • community/modules/compute/gke-nodeset/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module version to v1.83.0.
  • community/modules/compute/gke-partition/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/compute/gke-partition/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module version to v1.83.0.
  • community/modules/compute/htcondor-execute-point/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/compute/htcondor-execute-point/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module version to v1.83.0.
  • community/modules/compute/mig/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/compute/mig/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module version to v1.83.0.
  • community/modules/compute/notebook/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/compute/notebook/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • community/modules/compute/schedmd-slurm-gcp-v6-nodeset-dynamic/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/compute/schedmd-slurm-gcp-v6-nodeset-dynamic/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module version to v1.83.0.
  • community/modules/compute/schedmd-slurm-gcp-v6-nodeset-tpu/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/compute/schedmd-slurm-gcp-v6-nodeset-tpu/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module version to v1.83.0.
  • community/modules/compute/schedmd-slurm-gcp-v6-nodeset/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/compute/schedmd-slurm-gcp-v6-nodeset/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module version to v1.83.0.
  • community/modules/compute/schedmd-slurm-gcp-v6-partition/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/compute/schedmd-slurm-gcp-v6-partition/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module version to v1.83.0.
  • community/modules/container/artifact-registry/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/container/artifact-registry/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • community/modules/database/bigquery-dataset/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/database/bigquery-dataset/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • community/modules/database/bigquery-table/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/database/bigquery-table/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • community/modules/database/slurm-cloudsql-federation/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/database/slurm-cloudsql-federation/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module versions to v1.83.0.
  • community/modules/file-system/nfs-server/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/file-system/nfs-server/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module version to v1.83.0.
  • community/modules/file-system/weka-client/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/file-system/weka-client/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • community/modules/files/fsi-montecarlo-on-batch/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/files/fsi-montecarlo-on-batch/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module versions to v1.83.0.
  • community/modules/internal/slurm-gcp/instance/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/internal/slurm-gcp/instance/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • community/modules/internal/slurm-gcp/instance_template/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/internal/slurm-gcp/instance_template/variables.tf
    • Modified slurm_cluster_name validation regex to allow up to 19 characters (total 20) and hyphens.
  • community/modules/internal/slurm-gcp/instance_template/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • community/modules/internal/slurm-gcp/internal_instance_template/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/internal/slurm-gcp/internal_instance_template/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • community/modules/internal/slurm-gcp/login/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/internal/slurm-gcp/login/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module version to v1.83.0.
  • community/modules/internal/slurm-gcp/nodeset_tpu/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/internal/slurm-gcp/nodeset_tpu/metadata.yaml
    • Added a required validator for the zone variable.
    • Added a conditional validator for subnetwork requiring enable_public_ip to be true if subnetwork is null.
  • community/modules/internal/slurm-gcp/nodeset_tpu/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • community/modules/management/dependencies-installer/README.md
    • Updated required Terraform version to = 1.12.2.
    • Clarified description for gke_cluster_exists variable.
  • community/modules/management/dependencies-installer/helm_install/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/management/dependencies-installer/helm_install/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • community/modules/management/dependencies-installer/kubernetes_manifest/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/management/dependencies-installer/kubernetes_manifest/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • community/modules/management/dependencies-installer/variables.tf
    • Clarified description for gke_cluster_exists variable.
  • community/modules/management/dependencies-installer/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • community/modules/project/service-enablement/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/project/service-enablement/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module version to v1.83.0.
  • community/modules/pubsub/bigquery-sub/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/pubsub/bigquery-sub/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module versions to v1.83.0.
  • community/modules/pubsub/topic/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/pubsub/topic/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module version to v1.83.0.
  • community/modules/remote-desktop/chrome-remote-desktop/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/remote-desktop/chrome-remote-desktop/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • community/modules/scheduler/htcondor-access-point/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/scheduler/htcondor-access-point/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module version to v1.83.0.
  • community/modules/scheduler/htcondor-central-manager/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/scheduler/htcondor-central-manager/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module version to v1.83.0.
  • community/modules/scheduler/htcondor-pool-secrets/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/scheduler/htcondor-pool-secrets/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module version to v1.83.0.
  • community/modules/scheduler/htcondor-service-accounts/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/scheduler/htcondor-service-accounts/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • community/modules/scheduler/htcondor-setup/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/scheduler/htcondor-setup/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/README.md
    • Updated required Terraform version to = 1.12.2.
    • Added subnetwork_stack_type input variable description.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/controller.tf
    • Added stack_type to the network interface configuration for the controller instance.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/metadata.yaml
    • Added a required validator with deprecated: true for enable_smt and disable_smt.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/modules/cleanup_compute/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/modules/cleanup_compute/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/modules/cleanup_tpu/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/modules/cleanup_tpu/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/modules/slurm_files/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/modules/slurm_files/metadata.yaml
    • Added a conditional validator for slurm_control_host when enable_hybrid is true.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/modules/slurm_files/variables.tf
    • Modified slurm_cluster_name validation regex to allow up to 19 characters (total 20) and hyphens.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/modules/slurm_files/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/variables.tf
    • Modified slurm_cluster_name validation regex to allow up to 19 characters (total 20) and hyphens.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/variables_controller_instance.tf
    • Added subnetwork_stack_type variable with a default of IPV4_ONLY.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module version to v1.83.0.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-login/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-login/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module version to v1.83.0.
  • community/modules/scheduler/slinky/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/scheduler/slinky/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • community/modules/scripts/gcloud/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/scripts/gcloud/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • community/modules/scripts/htcondor-install/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/scripts/htcondor-install/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • community/modules/scripts/ramble-execute/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/scripts/ramble-execute/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • community/modules/scripts/ramble-setup/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/scripts/ramble-setup/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • community/modules/scripts/spack-execute/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/scripts/spack-execute/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • community/modules/scripts/spack-setup/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/scripts/spack-setup/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • community/modules/scripts/wait-for-startup/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/scripts/wait-for-startup/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module version to v1.83.0.
  • community/modules/scripts/windows-startup-script/README.md
    • Updated required Terraform version to = 1.12.2.
  • community/modules/scripts/windows-startup-script/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module version to v1.83.0.
  • docs/blueprint-validation.md
    • Added documentation for the Allowed Enum validator.
    • Added documentation for the Required validator, including deprecated flag usage.
    • Added documentation for the Conditional validator.
  • examples/README.md
    • Removed ps-slurm.yaml from the list of examples.
    • Added detailed provisioning and cleanup instructions for serverless-batch.yaml.
    • Updated PVC naming convention in gke-managed-lustre instructions.
    • Expanded gke-consumption-options section to detail A3 Ultra, TPU 7x, and TPU v6e examples.
  • examples/gke-a3-megagpu/README.md
    • Added a section on additional consumption options like Spot VMs and DWS Flex-start.
  • examples/gke-a3-ultragpu/gke-a3-ultragpu.yaml
    • Removed wait: false from Kueue installation settings.
  • examples/gke-a4/gke-a4.yaml
    • Removed wait: false from Kueue installation settings.
  • examples/gke-a4x-max-bm/README.md
    • Added a new README for A4X-Max Bare Metal GKE cluster deployment, including requirements, creation, NCCL testing, and cleanup instructions.
  • examples/gke-a4x-max-bm/asapd-lite-installer.yaml
    • Added a new Kubernetes DaemonSet manifest for installing asapd-lite on A4X-Max Bare Metal nodes.
  • examples/gke-a4x-max-bm/gke-a4x-max-bm-deployment.yaml
    • Added a new deployment configuration file for A4X-Max Bare Metal GKE clusters.
  • examples/gke-a4x-max-bm/gke-a4x-max-bm.yaml
    • Added a new blueprint for deploying A4X-Max Bare Metal GKE clusters with multiple VPCs, RDMA networking, service accounts, and GCS Fuse integration.
  • examples/gke-a4x-max-bm/nccl-jobset-example.yaml
    • Added a new JobSet example for running NCCL all-gather performance tests on A4X-Max Bare Metal GKE clusters.
  • examples/gke-a4x-max-bm/nvidia-dra-driver.yaml
    • Added a new Kubernetes manifest for NVIDIA DRA driver namespace and resource quota.
  • examples/gke-a4x/gke-a4x.yaml
    • Corrected the source of truth link for GCS Fuse CSI in the gcs-training module comments.
  • examples/gke-consumption-options/dws-flex-start-queued-provisioning/README.md
    • Updated storage bucket creation instructions to use 'Cloud Storage bucket'.
    • Clarified variable replacement instructions.
    • Added a note about the blueprint creating VPC networks, Cloud Storage buckets, etc.
    • Added a new section for Hardware-Specific Guides for TPU v6e and TPU 7x.
  • examples/gke-consumption-options/dws-flex-start-queued-provisioning/gke-tpu-7x/README.md
    • Added a new README for TPU 7x DWS Queued Provisioning, including cluster creation, job submission, monitoring, and validation instructions.
  • examples/gke-consumption-options/dws-flex-start-queued-provisioning/gke-tpu-7x/gke-tpu-7x-deployment.yaml
    • Added a new deployment configuration file for TPU 7x DWS Queued Provisioning.
  • examples/gke-consumption-options/dws-flex-start-queued-provisioning/gke-tpu-7x/gke-tpu-7x.yaml
    • Added a new blueprint for deploying TPU 7x GKE clusters with DWS Queued Provisioning, multiple VPCs, GCS Fuse, Kueue, and JobSet.
  • examples/gke-consumption-options/dws-flex-start-queued-provisioning/gke-tpu-7x/tpu-7x-test-job-gcs.yaml
    • Added a new JobSet example for running FIO benchmarks on GCS buckets with TPU 7x DWS Queued Provisioning.
  • examples/gke-consumption-options/dws-flex-start-queued-provisioning/gke-tpu-7x/tpu-7x-test-job.yaml
    • Added a new simple JobSet example for testing TPU 7x DWS Queued Provisioning.
  • examples/gke-consumption-options/dws-flex-start-queued-provisioning/gke-tpu-v6e/README.md
    • Added a new README for TPU v6e DWS Queued Provisioning, including cluster creation, job submission, monitoring, and validation instructions.
  • examples/gke-consumption-options/dws-flex-start-queued-provisioning/gke-tpu-v6e/gke-tpu-v6e-deployment.yaml
    • Added a new deployment configuration file for TPU v6e DWS Queued Provisioning.
  • examples/gke-consumption-options/dws-flex-start-queued-provisioning/gke-tpu-v6e/gke-tpu-v6e.yaml
    • Added a new blueprint for deploying TPU v6e GKE clusters with DWS Queued Provisioning, multiple VPCs, GCS Fuse, Kueue, and JobSet.
  • examples/gke-consumption-options/dws-flex-start-queued-provisioning/gke-tpu-v6e/tpu-v6e-test-job-gcs.yaml
    • Added a new JobSet example for running FIO benchmarks on GCS buckets with TPU v6e DWS Queued Provisioning.
  • examples/gke-consumption-options/dws-flex-start-queued-provisioning/gke-tpu-v6e/tpu-v6e-test-job.yaml
    • Added a new simple JobSet example for testing TPU v6e DWS Queued Provisioning.
  • examples/gke-consumption-options/dws-flex-start-queued-provisioning/tpu-dws-queues.yaml.tftpl
    • Added a new Terraform template for Kueue configuration with DWS for TPUs.
  • examples/gke-g4/README.md
    • Updated blog post link for G4 VMs.
    • Added instructions for creating a Cloud Storage bucket for Terraform state.
    • Clarified usage of CLUSTER-NAME in cleanup instructions.
  • examples/gke-tpu-7x/README.md
    • Added links to new DWS Flex with Queued Provisioning examples for TPU 7x.
  • examples/gke-tpu-7x/gke-tpu-7x-advanced.yaml
    • Added kueue_configuration_path variable for Kueue configuration.
  • examples/gke-tpu-7x/gke-tpu-7x.yaml
    • Added kueue_configuration_path variable for Kueue configuration.
    • Updated workload-manager-install module to include Kueue installation and configuration.
  • examples/gke-tpu-v6e/README.md
    • Added links to new DWS Flex with Queued Provisioning examples for TPU v6e.
  • examples/gke-tpu-v6e/gke-tpu-v6e-advanced.yaml
    • Added kueue_configuration_path variable for Kueue configuration.
  • examples/gke-tpu-v6e/gke-tpu-v6e.yaml
    • Added kueue_configuration_path variable for Kueue configuration.
    • Updated workload-manager-install module to include Kueue installation and configuration.
  • examples/hypercompute_clusters/a3u-slurm-ubuntu-gcs/README.md
    • Removed deprecated blueprint and its associated documentation.
  • examples/hypercompute_clusters/a3u-slurm-ubuntu-gcs/a3u-slurm-ubuntu-gcs.yaml
    • Removed deprecated blueprint configuration file.
  • examples/hypercompute_clusters/a3u-slurm-ubuntu-gcs/deployment.yaml
    • Removed deprecated deployment configuration file.
  • examples/hypercompute_clusters/a3u-slurm-ubuntu-gcs/system_benchmarks/run-hpl-via-ramble.sh
    • Removed deprecated benchmark script.
  • examples/hypercompute_clusters/a3u-slurm-ubuntu-gcs/system_benchmarks/run-nemo-via-ramble.sh
    • Removed deprecated benchmark script.
  • examples/machine-learning/a3-highgpu-8g/README.md
    • Updated gcluster deploy commands to use full paths for blueprint and deployment files.
  • examples/machine-learning/a3-highgpu-8g/a3high-slurm-deployment.yaml
    • Updated slurm_cluster_name variable description to allow up to 20 characters and hyphens.
  • examples/machine-learning/a3-megagpu-8g/a3mega-slurm-blueprint.yaml
    • Updated base image version to ubuntu-accelerator-2204-amd64-with-nvidia-570-v20260210.
    • Added apt-mark hold commands for NVIDIA datacenter-gpu-manager components during image build.
  • examples/machine-learning/a3-megagpu-8g/a3mega-slurm-deployment.yaml
    • Updated variable descriptions and removed default values for GCS bucket, deployment name, project ID, region, zone, slurm cluster name, cluster size, and reservation name.
  • examples/machine-learning/a3-ultragpu-8g/README.md
    • Added a new section detailing additional provisioning models (DWS Flex-Start, Spot VMs) and how to configure them.
  • examples/machine-learning/a3-ultragpu-8g/a3ultra-slurm-blueprint.yaml
    • Updated base image version to ubuntu-accelerator-2204-amd64-with-nvidia-570-v20260210.
    • Added apt-mark hold commands for NVIDIA datacenter-gpu-manager components during image build.
  • examples/machine-learning/a3-ultragpu-8g/a3ultra-vm-deployment.yaml
    • Added a new deployment configuration file for A3-Ultra VMs.
  • examples/machine-learning/a3-ultragpu-8g/a3ultra-vm.yaml
    • Updated region and zone to be variables.
    • Set instance_count to use the number_of_vms variable.
  • examples/machine-learning/a4-highgpu-8g/README.md
    • Added a new section detailing additional provisioning models (Spot VMs) for A4-High VMs.
  • examples/machine-learning/a4-highgpu-8g/a4high-slurm-blueprint.yaml
    • Updated base image version to ubuntu-accelerator-2204-amd64-with-nvidia-570-v20260210.
    • Added apt-mark hold commands for NVIDIA datacenter-gpu-manager components during image build.
  • examples/machine-learning/a4-highgpu-8g/a4high-vm-deployment.yaml
    • Added a commented-out line for a4h_provisioning_model: SPOT.
  • examples/machine-learning/a4-highgpu-8g/system_benchmarks/run-nccl-tests-via-ramble.sh
    • Updated ramble clone command to not use --depth 1 and checkout a specific commit.
    • Added config.overwrite_inventories: true to ramble configuration.
    • Updated gpus-per-node from 8 to 4.
    • Modified LD_LIBRARY_PATH and NCCL_SOCKET_IFNAME environment variables.
    • Added ulimit -s 8192 to the Slurm sbatch script template.
    • Added column -t to the jq output for better formatting.
  • examples/machine-learning/a4x-highgpu-4g/README.md
    • Updated the blueprint overview to include standalone VM group deployment.
    • Added a new section for A4X-High VM Deployment, including configuration, deployment, and destruction instructions.
  • examples/machine-learning/a4x-highgpu-4g/a4x-vm-deployment.yaml
    • Added a new deployment configuration file for A4X-High VMs.
  • examples/machine-learning/a4x-highgpu-4g/a4x-vm.yaml
    • Updated base_image to use a specific version ubuntu-accelerator-2404-arm64-with-nvidia-580-v20260218.
    • Changed source_image_family and source_image_project_id to source_image and source_image_project_id respectively in the custom image module.
    • Added gpu_topology: "1x72" to the placement policy for VM instances.
  • examples/machine-learning/a4x-highgpu-4g/a4xhigh-slurm-blueprint.yaml
    • Updated base_image to use a specific version ubuntu-accelerator-2404-arm64-with-nvidia-580-v20260218.
    • Changed source_image_family and source_image_project_id to source_image and source_image_project_id respectively in the custom image module.
  • examples/machine-learning/a4x-highgpu-4g/system_benchmarks/run-nccl-tests-via-ramble.sh
    • Updated ramble clone command to not use --depth 1 and checkout a specific commit.
    • Added config.overwrite_inventories: true to ramble configuration.
  • examples/machine-learning/a4x-maxgpu-4g-metal/README.md
    • Added a new README for A4X Max Bare Metal Slurm cluster deployment, including an overview, key components, custom image scripts, and deployment/cleanup instructions.
  • examples/machine-learning/a4x-maxgpu-4g-metal/a4xmax-bm-slurm-blueprint.yaml
    • Added a new blueprint for deploying A4X Max Bare Metal Slurm clusters with multi-NIC, RDMA, custom image build, GCS Fuse, and IMEX prolog/epilog scripts.
  • examples/machine-learning/a4x-maxgpu-4g-metal/a4xmax-bm-slurm-deployment.yaml
    • Added a new deployment configuration file for A4X Max Bare Metal Slurm clusters.
  • examples/machine-learning/a4x-maxgpu-4g-metal/system_benchmarks/README.md
    • Renamed from examples/hypercompute_clusters/a3u-slurm-ubuntu-gcs/system_benchmarks/README.md.
    • Removed HPL-NVIDIA and NeMo benchmarks from the documentation.
  • examples/machine-learning/a4x-maxgpu-4g-metal/system_benchmarks/run-nccl-tests-via-ramble.sh
    • Renamed from examples/hypercompute_clusters/a3u-slurm-ubuntu-gcs/system_benchmarks/run-nccl-tests-via-ramble.sh.
    • Updated SOFTWARE_INSTALL to accept an argument.
    • Added ENROOT_CONFIG_PATH export.
    • Updated ramble setup command to use "${SOFTWARE_INSTALL}"/ramble/share/ramble/setup-env.sh.
    • Updated gpus_per_node from 8 to 4.
    • Modified OMPI_MCA_btl_tcp_if_include, UCX_NET_DEVICES, NCCL_SOCKET_IFNAME, NCCL_IB_HCA, LD_LIBRARY_PATH, NCCL_IB_GID_INDEX, NCCL_DEBUG, NCCL_DEBUG_SUBSYS environment variables.
    • Added ulimit -s 8192 to the Slurm sbatch script template.
    • Added column -t to the jq output for better formatting.
  • examples/machine-learning/build-service-images/a3m/blueprint.yaml
    • Updated source_image to ubuntu-accelerator-2204-amd64-with-nvidia-570-v20260210.
  • examples/machine-learning/build-service-images/common/blueprint.yaml
    • Updated source_image to ubuntu-accelerator-2204-amd64-with-nvidia-570-v20260210.
  • modules/README.md
    • Added kubectl-apply module to the Management section.
    • Removed kubernetes-operations from the Scripts section.
  • modules/compute/gke-job-template/README.md
    • Updated required Terraform version to = 1.12.2.
  • modules/compute/gke-job-template/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • modules/compute/gke-node-pool/README.md
    • Updated required Terraform version to = 1.12.2.
    • Added linux_node_config input variable description.
    • Clarified error message for enable_flex_start with reservation_affinity.
  • modules/compute/gke-node-pool/main.tf
    • Merged kubernetes_labels to include cloud.google.com/gke-queued when enable_queued_provisioning is true.
    • Updated labels to use local.kubernetes_labels.
    • Updated linux_node_config usage to configure sysctls and hugepages.
    • Modified precondition for placement_policy when enable_queued_provisioning is true to allow compact placement for TPUs.
  • modules/compute/gke-node-pool/metadata.yaml
    • Added conditional validators for enable_flex_start and enable_queued_provisioning to ensure correct usage with static_node_count, auto_repair, and autoscaling_total_min_nodes.
  • modules/compute/gke-node-pool/variables.tf
    • Added linux_node_config variable for Linux node-specific configurations.
  • modules/compute/gke-node-pool/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module versions to v1.83.0.
  • modules/compute/resource-policy/README.md
    • Updated required Terraform version to = 1.12.2.
  • modules/compute/resource-policy/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module version to v1.83.0.
  • modules/compute/vm-instance/README.md
    • Updated required Terraform version to = 1.12.2.
  • modules/compute/vm-instance/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module versions to v1.83.0.
  • modules/file-system/cloud-storage-bucket/README.md
    • Updated required Terraform version to = 1.12.2.
  • modules/file-system/cloud-storage-bucket/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module versions to v1.83.0.
  • modules/file-system/filestore/README.md
    • Updated required Terraform version to = 1.12.2.
  • modules/file-system/filestore/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module versions to v1.83.0.
  • modules/file-system/gke-persistent-volume/README.md
    • Updated required Terraform version to = 1.12.2.
  • modules/file-system/gke-persistent-volume/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module version to v1.83.0.
  • modules/file-system/gke-storage/README.md
    • Updated required Terraform version to = 1.12.2.
  • modules/file-system/gke-storage/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module version to v1.83.0.
  • modules/file-system/managed-lustre/README.md
    • Updated required Terraform version to = 1.12.2.
  • modules/file-system/managed-lustre/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module versions to v1.83.0.
  • modules/file-system/netapp-storage-pool/README.md
    • Updated required Terraform version to = 1.12.2.
  • modules/file-system/netapp-storage-pool/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module versions to v1.83.0.
  • modules/file-system/netapp-volume/README.md
    • Updated required Terraform version to = 1.12.2.
  • modules/file-system/netapp-volume/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module versions to v1.83.0.
  • modules/file-system/parallelstore/README.md
    • Updated required Terraform version to = 1.12.2.
  • modules/file-system/parallelstore/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • modules/file-system/pre-existing-network-storage/README.md
    • Updated required Terraform version to = 1.12.2.
  • modules/file-system/pre-existing-network-storage/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • modules/internal/gpu-definition/README.md
    • Updated required Terraform version to = 1.12.2.
  • modules/internal/gpu-definition/main.tf
    • Added a4x-maxgpu-4g-metal and a4x-maxgpu-4g-metal-nolssd machine types with nvidia-gb300 GPUs.
  • modules/internal/gpu-definition/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • modules/internal/instance_validations/README.md
    • Updated required Terraform version to = 1.12.2.
  • modules/internal/instance_validations/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • modules/internal/network-attachment/README.md
    • Updated required Terraform version to = 1.12.2.
  • modules/internal/network-attachment/main.tf
    • Updated required Terraform version to = 1.12.2.
  • modules/internal/tpu-definition/README.md
    • Updated required Terraform version to = 1.12.2.
    • Added kubernetes_label output description.
  • modules/internal/tpu-definition/main.tf
    • Updated required Terraform version to = 1.12.2.
  • modules/internal/tpu-definition/outputs.tf
    • Added kubernetes_label output to provide Kubernetes labels for TPU discovery.
  • modules/internal/vpc_peering/README.md
    • Updated required Terraform version to = 1.12.2.
  • modules/internal/vpc_peering/main.tf
    • Updated required Terraform version to = 1.12.2.
  • modules/management/kubectl-apply/README.md
    • Updated description to include deploying NCCL gIB plugin.
    • Updated module table to reflect changes in kubectl-apply submodules.
  • modules/management/kubectl-apply/helm_install/README.md
    • Updated required Terraform version to = 1.12.2.
    • Added dependencies input description.
  • modules/management/kubectl-apply/helm_install/main.tf
    • Added dependencies input to the helm_release resource for implicit dependency anchoring.
  • modules/management/kubectl-apply/helm_install/variables.tf
    • Added dependencies variable to allow forcing Terraform dependencies.
  • modules/management/kubectl-apply/helm_install/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • modules/management/kubectl-apply/kubectl/README.md
    • Updated required Terraform version to = 1.12.2.
  • modules/management/kubectl-apply/kubectl/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • modules/management/kubectl-apply/main.tf
    • Refactored Kueue configuration content handling to support .tftpl files and template variables.
    • Added install_asapd_lite local variable.
    • Changed install_gib to use the helm_install submodule instead of kubectl.
  • modules/management/kubectl-apply/raw-config-chart/Chart.yaml
    • Added a new Helm chart definition for applying raw Kubernetes manifests.
  • modules/management/kubectl-apply/raw-config-chart/templates/manifests.yaml
    • Added a new Helm template to iterate and apply multiple Kubernetes manifests.
  • modules/management/kubectl-apply/raw-config-chart/values.yaml
    • Added a new Helm values file with an empty manifests array.
  • modules/management/kubectl-apply/variables.tf
    • Removed wait option from the kueue variable.
    • Clarified description for gke_cluster_exists variable.
    • Added accelerator_type to nvidia_dra_driver variable.
    • Added system_node_pool_id variable.
    • Added asapd_lite variable for installing the asapd-lite daemonset.
  • modules/management/kubectl-apply/versions.tf
    • Updated provider_meta module version to v1.83.0.
  • modules/monitoring/dashboard/README.md
    • Updated required Terraform version to = 1.12.2.
  • modules/monitoring/dashboard/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module version to v1.83.0.
  • modules/network/firewall-rules/README.md
    • Updated required Terraform version to = 1.12.2.
  • modules/network/firewall-rules/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module version to v1.83.0.
  • modules/network/gpu-rdma-vpc/README.md
    • Updated required Terraform version to = 1.12.2.
  • modules/network/gpu-rdma-vpc/main.tf
    • Added logic to determine subnet_count, nic_type_val, and stack_type_val based on is_roce_metal for GKE output.
  • modules/network/gpu-rdma-vpc/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • modules/network/multivpc/README.md
    • Updated required Terraform version to = 1.12.2.
  • modules/network/multivpc/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • modules/network/pre-existing-subnetwork/README.md
    • Updated required Terraform version to = 1.12.2.
  • modules/network/pre-existing-subnetwork/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module version to v1.83.0.
  • modules/network/pre-existing-vpc/README.md
    • Updated required Terraform version to = 1.12.2.
  • modules/network/pre-existing-vpc/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module version to v1.83.0.
  • modules/network/private-service-access/README.md
    • Updated required Terraform version to = 1.12.2.
  • modules/network/private-service-access/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module versions to v1.83.0.
  • modules/network/vpc/README.md
    • Added subnetwork_stack_type and subnetwork_ipv6_access_type to the list of inputs.
  • modules/network/vpc/main.tf
    • Added stack_type and ipv6_access_type to the default primary subnetwork configuration.
    • Added output_primary_subnetwork_stack_type to local outputs.
  • modules/network/vpc/metadata.yaml
    • Added required validators with deprecated: true for primary_subnetwork, additional_subnetworks, and subnetwork_size.
  • modules/network/vpc/outputs.tf
    • Added subnetwork_stack_type output for the primary subnetwork.
  • modules/network/vpc/variables.tf
    • Added subnetwork_stack_type variable with default IPV4_ONLY.
    • Added subnetwork_ipv6_access_type variable.
  • modules/network/vpc/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • modules/project/service-account/README.md
    • Updated required Terraform version to = 1.12.2.
  • modules/project/service-account/metadata.yaml
    • Added required validators with deprecated: true for descriptions, names, and prefix.
  • modules/project/service-account/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • modules/scheduler/batch-job-template/README.md
    • Updated required Terraform version to = 1.12.2.
  • modules/scheduler/batch-job-template/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • modules/scheduler/batch-login-node/README.md
    • Updated required Terraform version to = 1.12.2.
  • modules/scheduler/batch-login-node/metadata.yaml
    • Added required validators with deprecated: true for job_template_contents, job_filename, and job_id.
  • modules/scheduler/batch-login-node/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module version to v1.83.0.
  • modules/scheduler/gke-cluster/README.md
    • Updated required Terraform version to = 1.12.2.
    • Added enable_shielded_nodes input description.
  • modules/scheduler/gke-cluster/main.tf
    • Configured enable_shielded_nodes to use the variable value instead of a hardcoded true.
  • modules/scheduler/gke-cluster/outputs.tf
    • Clarified description for gke_cluster_exists output.
    • Added system_node_pool_id output.
  • modules/scheduler/gke-cluster/variables.tf
    • Added enable_shielded_nodes variable with a default of true.
  • modules/scheduler/gke-cluster/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module versions to v1.83.0.
  • modules/scheduler/pre-existing-gke-cluster/README.md
    • Updated required Terraform version to = 1.12.2.
  • modules/scheduler/pre-existing-gke-cluster/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module version to v1.83.0.
  • modules/scripts/startup-script/README.md
    • Updated required Terraform version to = 1.12.2.
  • modules/scripts/startup-script/metadata.yaml
    • Added required validators with deprecated: true for enable_docker_world_writable and install_docker.
  • modules/scripts/startup-script/versions.tf
    • Updated required Terraform version to = 1.12.2.
    • Updated provider_meta module version to v1.83.0.
  • pkg/config/config.go
    • Modified matchSlurmClusterNameExp regex to allow hyphens and increase max length to 19 characters (total 20).
  • pkg/config/config_test.go
    • Updated tests for slurm_cluster_name regex to reflect new length and allowed characters.
  • pkg/config/deprecation_validate.go
    • Added new file to implement deprecation validation logic, including warnings for upcoming deprecations and errors for past deprecations.
  • pkg/config/deprecation_validate_test.go
    • Added new file with tests for deprecation validation scenarios.
  • pkg/config/errors.go
    • Updated errMsgSlurmClusterNameReqs to reflect the new allowed characters and length for Slurm cluster names.
  • pkg/config/expand.go
    • Updated default Google provider versions to <= 7.18.0.
  • pkg/config/expand_test.go
    • Updated tests for default Google provider versions to <= 7.18.0.
  • pkg/config/validate.go
    • Added validateDeprecation call to the module validation process.
  • pkg/logging/logging.go
    • Implemented timestamping for all log outputs.
    • Added color formatting for timestamps and warning messages.
  • pkg/modulereader/metadata.go
    • Added DeprecationDate and AlternativeModule fields to MetadataGhpc struct for deprecation management.
  • pkg/modulereader/modules/test_role/test_module/versions.tf
    • Updated required Terraform version to = 1.12.2.
  • pkg/modulewriter/modulewriter_test.go
    • Updated expected Terraform required_version to = 1.12.2 in tests.
  • pkg/modulewriter/tfwriter.go
    • Updated Terraform required_version to = 1.12.2.
  • pkg/shell/packer.go
    • Integrated newTimestampWriter for Packer command output to include timestamps.
  • pkg/shell/terraform.go
    • Integrated newTimestampWriter for Terraform command output to include timestamps.
  • pkg/shell/writer.go
    • Added new file implementing a timestampWriter for prepending timestamps to log lines.
  • pkg/shell/writer_test.go
    • Added new file with tests for the timestampWriter functionality.
  • pkg/sourcereader/modules/network/vpc/main.tf
    • Updated required Terraform version to = 1.12.2.
  • pkg/validators/cloud.go
    • Added reservationNameRegex and resKeyRegex for parsing reservation names.
    • Implemented findReservationInOtherZones to search for reservations across zones.
    • Implemented TestReservationExists to validate reservation existence, including handling shared VPCs and providing hints for incorrect zones.
  • pkg/validators/metadata_validator_helpers.go
    • Improved error handling in getModuleSettingValues when a setting is not found.
    • Added parseString helper function for parsing string inputs.
    • Added convertToCty, ValuesMatch, and formatValue helper functions for cty.Value manipulation and comparison.
  • pkg/validators/metadata_validators.go
    • Implemented RequiredValidator to enforce presence or absence of variables, supporting deprecated flags.
    • Implemented ConditionalValidator to enforce dependent variable conditions based on trigger variable values.
  • pkg/validators/metadata_validators_test.go
    • Added comprehensive tests for ConditionalValidator and RequiredValidator covering various scenarios, including trigger conditions, dependent requirements, deprecated flags, and custom error messages.
    • Added tests for isVarSet, convertToCty, and ValuesMatch helper functions.
  • pkg/validators/registry.go
    • Registered RequiredValidator and ConditionalValidator in the validator registry.
  • pkg/validators/validators.go
    • Added testReservationExistsName constant.
    • Registered testReservationExists in the implementations map.
    • Updated defaults function to automatically add testReservationExists validator for any detected reservation variables.
  • pkg/validators/validators_test.go
    • Added tests for default validators to include testReservationExists when reservation variables are present.
Ignored Files
  • Ignored by pattern: .github/workflows/** (2)
    • .github/workflows/label-external.yml
    • .github/workflows/pr-precommit.yml
Activity
  • The pull request is a release candidate for v1.83.0.
  • The PR description includes a standard submission checklist, indicating a formal review and release process.
Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature Command Description
Code Review /gemini review Performs a code review for the current pull request in its current state.
Pull Request Summary /gemini summary Provides a summary of the current pull request in its current state.
Comment @gemini-code-assist Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help /gemini help Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Footnotes

  1. Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution.

@gemini-code-assist

Copy link
Copy Markdown
Contributor

Summary of Changes

Hello @sudheer-quad, I'm Gemini Code Assist1! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request prepares for the v1.83.0 release by updating core dependencies, introducing new GKE examples for advanced hardware and provisioning models, and enhancing internal validation and logging mechanisms. Key updates include a bump in the minimum Terraform version, the addition of A4X-Max Bare Metal and TPU DWS Queued Provisioning examples, and improvements to GKE node pool and kubectl apply modules for better configurability and support for new features. Furthermore, the PR refines Slurm cluster naming conventions and introduces robust metadata validators and a module deprecation warning system to guide users and maintain code quality.

Highlights

  • Version Bump: The project version has been updated to v1.83.0, reflecting a new release candidate.
  • Terraform Version Update: The minimum required Terraform version has been updated to 1.12.2 across various modules and examples.
  • New GKE A4X-Max Bare Metal Examples: Added new example blueprints for deploying GKE clusters with A4X-Max Bare Metal machines, including configurations for NCCL tests and GCS storage benchmarks.
  • New GKE TPU DWS Queued Provisioning Examples: Introduced new example blueprints for GKE TPU 7x and TPU v6e clusters utilizing Dynamic Workload Scheduler (DWS) with Queued Provisioning, demonstrating dynamic scaling and job orchestration.
  • Enhanced GKE Node Pool Configuration: The GKE node pool module now supports Linux node configurations (sysctls, hugepages) and automatically applies GKE Queued Provisioning labels when enabled.
  • Improved Kubectl Apply Module: The kubectl-apply module has been enhanced to support installation of asapd-lite and nvidia-dra-driver, and features refactored Kueue configuration logic.
  • Slurm Cluster Name Validation Update: The validation regex for Slurm cluster names has been updated to allow hyphens and extend the maximum length to 20 characters.
  • New Metadata Validators: Implemented and documented new required and conditional validators for blueprint metadata, allowing for more robust input validation and deprecation warnings.
  • Module Deprecation Mechanism: A new mechanism has been added to provide deprecation warnings for modules, including future removal dates and alternative module suggestions.
  • Timestamped Command Output: Packer and Terraform command outputs now include timestamps for improved logging and debugging.
Changelog
  • .yamllint
    • Added a new ignore path for raw-config-chart templates.
  • Makefile
    • Updated the minimum required Terraform version to 1.12.2.
  • cmd/create.go
    • Refactored blueprint validation error messages into a constant string for clarity and consistency.
  • cmd/root.go
    • Updated the application version string to v1.83.0.
  • community/examples/hpc-build-slurm-image.yaml
    • Updated Python version to 3.12 and Ansible version to 8.7.0 for Slurm build on Rocky8.
  • community/front-end/ofe/infrastructure_files/vpc_tf/GCP/README.md
    • Updated required Terraform version to 1.12.2.
  • community/front-end/ofe/infrastructure_files/vpc_tf/GCP/versions.tf
    • Updated required Terraform version to 1.12.2.
  • community/front-end/ofe/infrastructure_files/workbench_tf/google/README.md
    • Updated required Terraform version to 1.12.2.
  • community/front-end/ofe/infrastructure_files/workbench_tf/google/versions.tf
    • Updated required Terraform version to 1.12.2.
  • community/front-end/ofe/infrastructure_files/workbench_tf/google/wait-for-startup/README.md
    • Updated required Terraform version to 1.12.2.
  • community/front-end/ofe/infrastructure_files/workbench_tf/google/wait-for-startup/versions.tf
    • Updated required Terraform version to 1.12.2.
  • community/front-end/ofe/requirements.txt
    • Updated cryptography dependency to version 46.0.5.
  • community/front-end/ofe/tf/README.md
    • Updated required Terraform version to 1.12.2.
  • community/front-end/ofe/tf/network/README.md
    • Updated required Terraform version to 1.12.2.
  • community/front-end/ofe/tf/network/versions.tf
    • Updated required Terraform version to 1.12.2.
  • community/front-end/ofe/tf/versions.tf
    • Updated required Terraform version to 1.12.2.
  • community/modules/compute/gke-nodeset/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/compute/gke-nodeset/variables.tf
    • Updated the regex for slurm_cluster_name validation to allow up to 19 characters and hyphens.
  • community/modules/compute/gke-nodeset/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0.
  • community/modules/compute/gke-partition/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/compute/gke-partition/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0.
  • community/modules/compute/htcondor-execute-point/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/compute/htcondor-execute-point/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0.
  • community/modules/compute/mig/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/compute/mig/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0.
  • community/modules/compute/notebook/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/compute/notebook/versions.tf
    • Updated required Terraform version to 1.12.2.
  • community/modules/compute/schedmd-slurm-gcp-v6-nodeset-dynamic/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/compute/schedmd-slurm-gcp-v6-nodeset-dynamic/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0.
  • community/modules/compute/schedmd-slurm-gcp-v6-nodeset-tpu/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/compute/schedmd-slurm-gcp-v6-nodeset-tpu/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0.
  • community/modules/compute/schedmd-slurm-gcp-v6-nodeset/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/compute/schedmd-slurm-gcp-v6-nodeset/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0.
  • community/modules/compute/schedmd-slurm-gcp-v6-partition/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/compute/schedmd-slurm-gcp-v6-partition/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0.
  • community/modules/container/artifact-registry/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/container/artifact-registry/versions.tf
    • Updated required Terraform version to 1.12.2.
  • community/modules/database/bigquery-dataset/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/database/bigquery-dataset/versions.tf
    • Updated required Terraform version to 1.12.2.
  • community/modules/database/bigquery-table/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/database/bigquery-table/versions.tf
    • Updated required Terraform version to 1.12.2.
  • community/modules/database/slurm-cloudsql-federation/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/database/slurm-cloudsql-federation/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0 for both google and google-beta providers.
  • community/modules/file-system/nfs-server/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/file-system/nfs-server/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0.
  • community/modules/file-system/weka-client/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/file-system/weka-client/versions.tf
    • Updated required Terraform version to 1.12.2.
  • community/modules/files/fsi-montecarlo-on-batch/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/files/fsi-montecarlo-on-batch/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0 for both google and google-beta providers.
  • community/modules/internal/slurm-gcp/instance/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/internal/slurm-gcp/instance/versions.tf
    • Updated required Terraform version to 1.12.2.
  • community/modules/internal/slurm-gcp/instance_template/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/internal/slurm-gcp/instance_template/variables.tf
    • Updated the regex for slurm_cluster_name validation to allow up to 19 characters and hyphens.
  • community/modules/internal/slurm-gcp/instance_template/versions.tf
    • Updated required Terraform version to 1.12.2.
  • community/modules/internal/slurm-gcp/internal_instance_template/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/internal/slurm-gcp/internal_instance_template/versions.tf
    • Updated required Terraform version to 1.12.2.
  • community/modules/internal/slurm-gcp/login/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/internal/slurm-gcp/login/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0.
  • community/modules/internal/slurm-gcp/nodeset_tpu/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/internal/slurm-gcp/nodeset_tpu/metadata.yaml
    • Added a required validator for the zone variable.
    • Added a conditional validator for subnetwork and enable_public_ip.
  • community/modules/internal/slurm-gcp/nodeset_tpu/versions.tf
    • Updated required Terraform version to 1.12.2.
  • community/modules/management/dependencies-installer/README.md
    • Updated required Terraform version to 1.12.2.
    • Simplified the description for the gke_cluster_exists input variable.
  • community/modules/management/dependencies-installer/helm_install/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/management/dependencies-installer/helm_install/versions.tf
    • Updated required Terraform version to 1.12.2.
  • community/modules/management/dependencies-installer/kubernetes_manifest/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/management/dependencies-installer/kubernetes_manifest/versions.tf
    • Updated required Terraform version to 1.12.2.
  • community/modules/management/dependencies-installer/variables.tf
    • Simplified the description for the gke_cluster_exists variable.
  • community/modules/management/dependencies-installer/versions.tf
    • Updated required Terraform version to 1.12.2.
  • community/modules/project/service-enablement/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/project/service-enablement/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0.
  • community/modules/pubsub/bigquery-sub/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/pubsub/bigquery-sub/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0 for both google and google-beta providers.
  • community/modules/pubsub/topic/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/pubsub/topic/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0.
  • community/modules/remote-desktop/chrome-remote-desktop/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/remote-desktop/chrome-remote-desktop/versions.tf
    • Updated required Terraform version to 1.12.2.
  • community/modules/scheduler/htcondor-access-point/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/scheduler/htcondor-access-point/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0.
  • community/modules/scheduler/htcondor-central-manager/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/scheduler/htcondor-central-manager/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0.
  • community/modules/scheduler/htcondor-pool-secrets/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/scheduler/htcondor-pool-secrets/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0.
  • community/modules/scheduler/htcondor-service-accounts/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/scheduler/htcondor-service-accounts/versions.tf
    • Updated required Terraform version to 1.12.2.
  • community/modules/scheduler/htcondor-setup/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/scheduler/htcondor-setup/versions.tf
    • Updated required Terraform version to 1.12.2.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/README.md
    • Updated required Terraform version to 1.12.2.
    • Added subnetwork_stack_type input variable description.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/controller.tf
    • Added stack_type to the network interface configuration for the controller instance.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/metadata.yaml
    • Added a deprecated validator for enable_smt and disable_smt variables.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/modules/cleanup_compute/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/modules/cleanup_compute/versions.tf
    • Updated required Terraform version to 1.12.2.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/modules/cleanup_tpu/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/modules/cleanup_tpu/versions.tf
    • Updated required Terraform version to 1.12.2.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/modules/slurm_files/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/modules/slurm_files/metadata.yaml
    • Added a new conditional validator for slurm_control_host when enable_hybrid is true.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/modules/slurm_files/variables.tf
    • Updated the regex for slurm_cluster_name validation to allow up to 19 characters and hyphens.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/modules/slurm_files/versions.tf
    • Updated required Terraform version to 1.12.2.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/variables.tf
    • Updated the regex for slurm_cluster_name validation to allow up to 19 characters and hyphens.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/variables_controller_instance.tf
    • Added a new variable subnetwork_stack_type with a default of 'IPV4_ONLY'.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-controller/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-login/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/scheduler/schedmd-slurm-gcp-v6-login/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0.
  • community/modules/scheduler/slinky/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/scheduler/slinky/versions.tf
    • Updated required Terraform version to 1.12.2.
  • community/modules/scripts/gcloud/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/scripts/gcloud/versions.tf
    • Updated required Terraform version to 1.12.2.
  • community/modules/scripts/htcondor-install/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/scripts/htcondor-install/versions.tf
    • Updated required Terraform version to 1.12.2.
  • community/modules/scripts/ramble-execute/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/scripts/ramble-execute/versions.tf
    • Updated required Terraform version to 1.12.2.
  • community/modules/scripts/ramble-setup/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/scripts/ramble-setup/versions.tf
    • Updated required Terraform version to 1.12.2.
  • community/modules/scripts/spack-execute/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/scripts/spack-execute/versions.tf
    • Updated required Terraform version to 1.12.2.
  • community/modules/scripts/spack-setup/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/scripts/spack-setup/versions.tf
    • Updated required Terraform version to 1.12.2.
  • community/modules/scripts/wait-for-startup/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/scripts/wait-for-startup/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0.
  • community/modules/scripts/windows-startup-script/README.md
    • Updated required Terraform version to 1.12.2.
  • community/modules/scripts/windows-startup-script/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0.
  • docs/blueprint-validation.md
    • Added documentation for the new Allowed Enum validator.
    • Added documentation for the new Required validator, including examples for mandatory and deprecated variables.
    • Added documentation for the new Conditional validator, with examples for trigger-dependent variable settings.
  • examples/README.md
    • Removed the ps-slurm.yaml example from the list.
    • Updated links and descriptions for GKE consumption options, including new hardware support.
  • examples/gke-a3-megagpu/README.md
    • Added a section detailing additional consumption options like Spot VMs and DWS Flex-start for GKE-A3M clusters.
  • examples/gke-a3-ultragpu/gke-a3-ultragpu.yaml
    • Removed the wait: false setting for Kueue installation, allowing it to wait for resources by default.
  • examples/gke-a4/gke-a4.yaml
    • Removed the wait: false setting for Kueue installation, allowing it to wait for resources by default.
  • examples/gke-a4x-max-bm/README.md
    • Added a new README for the GKE A4X-Max Bare Metal example, detailing requirements, cluster creation, NCCL tests, and cleanup.
  • examples/gke-a4x-max-bm/asapd-lite-installer.yaml
    • Added a new Kubernetes DaemonSet manifest for installing asapd-lite on A4X-Max Bare Metal nodes.
  • examples/gke-a4x-max-bm/gke-a4x-max-bm-deployment.yaml
    • Added a new deployment configuration file for the GKE A4X-Max Bare Metal example.
  • examples/gke-a4x-max-bm/gke-a4x-max-bm.yaml
    • Added a new blueprint for deploying a GKE A4X-Max Bare Metal cluster, including network, service accounts, node pools, and workload management installations.
  • examples/gke-a4x-max-bm/nccl-jobset-example.yaml
    • Added a new Kubernetes JobSet example for running NCCL tests on GKE A4X-Max Bare Metal clusters.
  • examples/gke-a4x-max-bm/nvidia-dra-driver.yaml
    • Added a new Kubernetes manifest for NVIDIA DRA driver namespace and resource quota.
  • examples/gke-a4x/gke-a4x.yaml
    • Corrected the source of truth URL for GCS Fuse CSI documentation in the gcs-training module comment.
  • examples/gke-consumption-options/dws-flex-start-queued-provisioning/README.md
    • Updated text for creating a Cloud Storage bucket and replacing variables.
    • Added a new section for Hardware-Specific Guides, linking to TPU v6e and TPU 7x examples.
  • examples/gke-consumption-options/dws-flex-start-queued-provisioning/gke-tpu-7x/README.md
    • Added a new README for the GKE TPU 7x DWS Queued Provisioning example, detailing setup, job submission, monitoring, and validation.
  • examples/gke-consumption-options/dws-flex-start-queued-provisioning/gke-tpu-7x/gke-tpu-7x-deployment.yaml
    • Added a new deployment configuration file for the GKE TPU 7x DWS Queued Provisioning example.
  • examples/gke-consumption-options/dws-flex-start-queued-provisioning/gke-tpu-7x/gke-tpu-7x.yaml
    • Added a new blueprint for deploying a GKE TPU 7x cluster with DWS Queued Provisioning.
  • examples/gke-consumption-options/dws-flex-start-queued-provisioning/gke-tpu-7x/tpu-7x-test-job-gcs.yaml
    • Added a new Kubernetes JobSet example for GCS storage benchmarking on TPU 7x with Queued Provisioning.
  • examples/gke-consumption-options/dws-flex-start-queued-provisioning/gke-tpu-7x/tpu-7x-test-job.yaml
    • Added a new simple Kubernetes JobSet example for testing TPU 7x with Queued Provisioning.
  • examples/gke-consumption-options/dws-flex-start-queued-provisioning/gke-tpu-v6e/README.md
    • Added a new README for the GKE TPU v6e DWS Queued Provisioning example, detailing setup, job submission, monitoring, and validation.
  • examples/gke-consumption-options/dws-flex-start-queued-provisioning/gke-tpu-v6e/gke-tpu-v6e-deployment.yaml
    • Added a new deployment configuration file for the GKE TPU v6e DWS Queued Provisioning example.
  • examples/gke-consumption-options/dws-flex-start-queued-provisioning/gke-tpu-v6e/gke-tpu-v6e.yaml
    • Added a new blueprint for deploying a GKE TPU v6e cluster with DWS Queued Provisioning.
  • examples/gke-consumption-options/dws-flex-start-queued-provisioning/gke-tpu-v6e/tpu-v6e-test-job-gcs.yaml
    • Added a new Kubernetes JobSet example for GCS storage benchmarking on TPU v6e with Queued Provisioning.
  • examples/gke-consumption-options/dws-flex-start-queued-provisioning/gke-tpu-v6e/tpu-v6e-test-job.yaml
    • Added a new simple Kubernetes JobSet example for testing TPU v6e with Queued Provisioning.
  • examples/gke-consumption-options/dws-flex-start-queued-provisioning/tpu-dws-queues.yaml.tftpl
    • Added a new Terraform template for Kueue configuration with DWS for TPUs.
  • examples/gke-g4/README.md
    • Updated the blog post link for G4 machines.
    • Added instructions for creating a Cloud Storage bucket for Terraform state.
    • Clarified the replacement of CLUSTER-NAME with deployment_name in cleanup instructions.
  • examples/gke-tpu-7x/README.md
    • Added links to the new GKE TPU 7x DWS Flex with Queued Provisioning blueprints and READMEs.
  • examples/gke-tpu-7x/gke-tpu-7x-advanced.yaml
    • Added kueue_configuration_path variable for Kueue setup.
  • examples/gke-tpu-7x/gke-tpu-7x.yaml
    • Added kueue_configuration_path variable for Kueue setup.
    • Integrated Kueue installation settings into the workload-manager-install module.
  • examples/gke-tpu-v6e/README.md
    • Added links to the new GKE TPU v6e DWS Flex with Queued Provisioning blueprints and READMEs.
  • examples/gke-tpu-v6e/gke-tpu-v6e-advanced.yaml
    • Added kueue_configuration_path variable for Kueue setup.
  • examples/gke-tpu-v6e/gke-tpu-v6e.yaml
    • Added kueue_configuration_path variable for Kueue setup.
    • Integrated Kueue installation settings into the workload-manager-install module.
  • examples/hypercompute_clusters/a3u-slurm-ubuntu-gcs/README.md
    • Removed the deprecated A3U Slurm + Ubuntu + GCS example blueprint and its documentation.
  • examples/hypercompute_clusters/a3u-slurm-ubuntu-gcs/a3u-slurm-ubuntu-gcs.yaml
    • Removed the deprecated A3U Slurm + Ubuntu + GCS example blueprint.
  • examples/hypercompute_clusters/a3u-slurm-ubuntu-gcs/deployment.yaml
    • Removed the deprecated deployment file for the A3U Slurm + Ubuntu + GCS example.
  • examples/hypercompute_clusters/a3u-slurm-ubuntu-gcs/system_benchmarks/run-hpl-via-ramble.sh
    • Removed the deprecated HPL benchmark script.
  • examples/hypercompute_clusters/a3u-slurm-ubuntu-gcs/system_benchmarks/run-nemo-via-ramble.sh
    • Removed the deprecated NeMo benchmark script.
  • examples/machine-learning/a3-highgpu-8g/README.md
    • Updated gcluster deploy commands to use full relative paths for blueprints.
  • examples/machine-learning/a3-highgpu-8g/a3high-slurm-deployment.yaml
    • Updated the description and validation for slurm_cluster_name to allow up to 20 characters and hyphens.
  • examples/machine-learning/a3-megagpu-8g/a3mega-slurm-blueprint.yaml
    • Updated the base image version to ubuntu-accelerator-2204-amd64-with-nvidia-570-v20260210.
    • Modified the datacenter-gpu-manager installation to specify version and use apt-mark hold.
  • examples/machine-learning/a3-megagpu-8g/a3mega-slurm-deployment.yaml
    • Updated variable descriptions to be placeholders for user input.
    • Updated the description and validation for slurm_cluster_name to allow up to 20 characters and hyphens.
  • examples/machine-learning/a3-ultragpu-8g/README.md
    • Added a new section on 'Additional ways to provision' for DWS Flex-Start and Spot VMs.
    • Added a new section for 'A3-ultra VMs' with instructions for deploying standalone A3 Ultra VMs.
  • examples/machine-learning/a3-ultragpu-8g/a3ultra-slurm-blueprint.yaml
    • Updated the base image version to ubuntu-accelerator-2204-amd64-with-nvidia-570-v20260210.
    • Modified the datacenter-gpu-manager installation to specify version and use apt-mark hold.
  • examples/machine-learning/a3-ultragpu-8g/a3ultra-vm-deployment.yaml
    • Added a new deployment file for A3-Ultra VMs.
  • examples/machine-learning/a3-ultragpu-8g/a3ultra-vm.yaml
    • Updated region and zone variables to be user-supplied.
    • Set default number_of_vms to 2.
    • Updated instance_count to use the number_of_vms variable.
  • examples/machine-learning/a4-highgpu-8g/README.md
    • Added a new section on 'Additional ways to provision' for Spot VMs for A4-High VMs.
    • Added cleanup instructions for A4-High VMs.
  • examples/machine-learning/a4-highgpu-8g/a4high-slurm-blueprint.yaml
    • Updated the base image version to ubuntu-accelerator-2204-amd64-with-nvidia-570-v20260210.
    • Modified the datacenter-gpu-manager installation to specify version and use apt-mark hold.
  • examples/machine-learning/a4-highgpu-8g/a4high-vm-deployment.yaml
    • Added a commented-out line for a4h_provisioning_model: SPOT.
  • examples/machine-learning/a4-highgpu-8g/system_benchmarks/run-nccl-tests-via-ramble.sh
    • Updated ramble repository clone to not use --depth 1 and checked out a specific commit.
    • Added overwrite_inventories: true to ramble config.
    • Updated OMPI_MCA_btl_tcp_if_include and added PMIX_MCA_gds, UCX_NET_DEVICES, PMIX_MCA_psec, UCX_IB_FORK_INIT, NCCL_SOCKET_IFNAME, NCCL_IB_HCA, LD_LIBRARY_PATH, NCCL_IB_GID_INDEX, NCCL_DEBUG, NCCL_DEBUG_SUBSYS environment variables.
    • Updated gpus_per_node to 4 and added ulimit -s 8192 to the sbatch script.
    • Modified jq output formatting for better readability.
  • examples/machine-learning/a4x-highgpu-4g/README.md
    • Updated the overview to include standalone VM group deployment.
    • Added a new section for 'A4X-High VM Deployment' with instructions for modifying deployment files, deploying, and destroying VM groups.
  • examples/machine-learning/a4x-highgpu-4g/a4x-vm-deployment.yaml
    • Added a new deployment file for A4X-High VMs.
  • examples/machine-learning/a4x-highgpu-4g/a4x-vm.yaml
    • Updated source_image_project_id and source_image_family to use a base_image object with a specific image version ubuntu-accelerator-2404-arm64-with-nvidia-580-v20260218.
    • Added gpu_topology: "1x72" to the placement policy.
  • examples/machine-learning/a4x-highgpu-4g/a4xhigh-slurm-blueprint.yaml
    • Updated build_from_image_family and build_from_image_project to use a base_image object with a specific image version ubuntu-accelerator-2404-arm64-with-nvidia-580-v20260218.
  • examples/machine-learning/a4x-highgpu-4g/system_benchmarks/run-nccl-tests-via-ramble.sh
    • Updated ramble repository clone to not use --depth 1 and checked out a specific commit.
    • Added overwrite_inventories: true to ramble config.
    • Updated OMPI_MCA_btl_tcp_if_include and added PMIX_MCA_gds, UCX_NET_DEVICES, PMIX_MCA_psec, UCX_IB_FORK_INIT, NCCL_SOCKET_IFNAME, NCCL_IB_HCA, LD_LIBRARY_PATH, NCCL_IB_GID_INDEX, NCCL_DEBUG, NCCL_DEBUG_SUBSYS environment variables.
    • Updated gpus_per_node to 4 and added ulimit -s 8192 to the sbatch script.
    • Modified jq output formatting for better readability.
  • examples/machine-learning/a4x-maxgpu-4g-metal/README.md
    • Added a new README for the A4X Max Bare Metal Slurm blueprint, detailing overview, key components, custom image scripts, deployment, and GCS Fuse mounts.
  • examples/machine-learning/a4x-maxgpu-4g-metal/a4xmax-bm-slurm-blueprint.yaml
    • Added a new blueprint for deploying an A4X Max Bare Metal Slurm cluster, including extensive network, image build, and Slurm configurations.
  • examples/machine-learning/a4x-maxgpu-4g-metal/a4xmax-bm-slurm-deployment.yaml
    • Added a new deployment file for the A4X Max Bare Metal Slurm blueprint.
  • examples/machine-learning/a4x-maxgpu-4g-metal/system_benchmarks/README.md
    • Renamed from examples/hypercompute_clusters/a3u-slurm-ubuntu-gcs/system_benchmarks/README.md.
    • Removed HPL-NVIDIA and NeMo benchmarks from the supported list and documentation.
  • examples/machine-learning/a4x-maxgpu-4g-metal/system_benchmarks/run-nccl-tests-via-ramble.sh
    • Renamed from examples/hypercompute_clusters/a3u-slurm-ubuntu-gcs/system_benchmarks/run-nccl-tests-via-ramble.sh.
    • Updated SOFTWARE_INSTALL to accept an argument.
    • Added ENROOT_CONFIG_PATH export.
    • Updated ramble setup command to use "${SOFTWARE_INSTALL}"/ramble/share/ramble/setup-env.sh.
    • Updated container_name and container_uri for ARM64 NCCL diagnostic image.
    • Updated gpus_per_node to 4 and processes_per_node comment.
    • Updated OMPI_MCA_btl_tcp_if_include to eth0,eth1.
    • Added PMIX_MCA_gds, UCX_NET_DEVICES, PMIX_MCA_psec, UCX_IB_FORK_INIT, NCCL_IB_HCA, NCCL_IB_GID_INDEX, NCCL_DEBUG, NCCL_DEBUG_SUBSYS environment variables.
    • Added ulimit -s 8192 to the sbatch script.
    • Modified jq output formatting for better readability and added column -t.
  • examples/machine-learning/build-service-images/a3m/blueprint.yaml
    • Updated the source image for the custom image build to ubuntu-accelerator-2204-amd64-with-nvidia-570-v20260210.
  • examples/machine-learning/build-service-images/common/blueprint.yaml
    • Updated the source image for the custom image build to ubuntu-accelerator-2204-amd64-with-nvidia-570-v20260210.
  • modules/README.md
    • Added kubectl-apply module to the Management section.
    • Removed kubernetes-operations from the Scripts section.
  • modules/compute/gke-job-template/README.md
    • Updated required Terraform version to 1.12.2.
  • modules/compute/gke-job-template/versions.tf
    • Updated required Terraform version to 1.12.2.
  • modules/compute/gke-node-pool/README.md
    • Updated required Terraform version to 1.12.2.
    • Added linux_node_config input variable description.
  • modules/compute/gke-node-pool/main.tf
    • Merged Kubernetes labels to include TPU-specific labels and cloud.google.com/gke-queued when Queued Provisioning is enabled.
    • Updated linux_node_config block to use variable inputs for sysctls and hugepages_config.
    • Modified precondition for enable_queued_provisioning and placement_policy to allow compact placement for TPUs.
    • Updated error message for enable_flex_start to be more precise.
  • modules/compute/gke-node-pool/metadata.yaml
    • Added a conditional validator for enable_flex_start requiring static_node_count to be null.
    • Added a conditional validator for enable_flex_start requiring auto_repair to be false.
    • Added a conditional validator for enable_queued_provisioning requiring autoscaling_total_min_nodes to be 0.
  • modules/compute/gke-node-pool/variables.tf
    • Added a new linux_node_config variable to configure sysctls and hugepages.
  • modules/compute/gke-node-pool/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0 for both google and google-beta providers.
  • modules/compute/resource-policy/README.md
    • Updated required Terraform version to 1.12.2.
  • modules/compute/resource-policy/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0.
  • modules/compute/vm-instance/README.md
    • Updated required Terraform version to 1.12.2.
  • modules/compute/vm-instance/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0 for both google and google-beta providers.
  • modules/file-system/cloud-storage-bucket/README.md
    • Updated required Terraform version to 1.12.2.
  • modules/file-system/cloud-storage-bucket/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0 for both google and google-beta providers.
  • modules/file-system/filestore/README.md
    • Updated required Terraform version to 1.12.2.
  • modules/file-system/filestore/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0 for both google and google-beta providers.
  • modules/file-system/gke-persistent-volume/README.md
    • Updated required Terraform version to 1.12.2.
  • modules/file-system/gke-persistent-volume/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0.
  • modules/file-system/gke-storage/README.md
    • Updated required Terraform version to 1.12.2.
  • modules/file-system/gke-storage/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0.
  • modules/file-system/managed-lustre/README.md
    • Updated required Terraform version to 1.12.2.
  • modules/file-system/managed-lustre/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0 for both google and google-beta providers.
  • modules/file-system/netapp-storage-pool/README.md
    • Updated required Terraform version to 1.12.2.
  • modules/file-system/netapp-storage-pool/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0 for both google and google-beta providers.
  • modules/file-system/netapp-volume/README.md
    • Updated required Terraform version to 1.12.2.
  • modules/file-system/netapp-volume/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0 for both google and google-beta providers.
  • modules/file-system/parallelstore/README.md
    • Updated required Terraform version to 1.12.2.
  • modules/file-system/parallelstore/versions.tf
    • Updated required Terraform version to 1.12.2.
  • modules/file-system/pre-existing-network-storage/README.md
    • Updated required Terraform version to 1.12.2.
  • modules/file-system/pre-existing-network-storage/versions.tf
    • Updated required Terraform version to 1.12.2.
  • modules/internal/gpu-definition/README.md
    • Updated required Terraform version to 1.12.2.
  • modules/internal/gpu-definition/main.tf
    • Added a4x-maxgpu-4g-metal and a4x-maxgpu-4g-metal-nolssd machine types to accelerator_machines.
  • modules/internal/gpu-definition/versions.tf
    • Updated required Terraform version to 1.12.2.
  • modules/internal/instance_validations/README.md
    • Updated required Terraform version to 1.12.2.
  • modules/internal/instance_validations/versions.tf
    • Updated required Terraform version to 1.12.2.
  • modules/internal/network-attachment/README.md
    • Updated required Terraform version to 1.12.2.
  • modules/internal/network-attachment/main.tf
    • Updated required Terraform version to 1.12.2.
  • modules/internal/tpu-definition/README.md
    • Updated required Terraform version to 1.12.2.
    • Added kubernetes_label output description.
  • modules/internal/tpu-definition/main.tf
    • Updated required Terraform version to 1.12.2.
  • modules/internal/tpu-definition/outputs.tf
    • Added kubernetes_label output to provide TPU-specific Kubernetes labels.
  • modules/internal/vpc_peering/README.md
    • Updated required Terraform version to 1.12.2.
  • modules/internal/vpc_peering/main.tf
    • Updated required Terraform version to 1.12.2.
  • modules/management/kubectl-apply/README.md
    • Updated description to include NCCL gIB plugin as a deployable infrastructure.
    • Added example configurations for installing gib and asapd_lite.
  • modules/management/kubectl-apply/helm_install/README.md
    • Updated required Terraform version to 1.12.2.
    • Added dependencies input variable description.
  • modules/management/kubectl-apply/helm_install/main.tf
    • Added dependencies input to the helm_release resource to force Terraform dependencies.
  • modules/management/kubectl-apply/helm_install/variables.tf
    • Added a new dependencies variable for managing Terraform dependencies.
  • modules/management/kubectl-apply/helm_install/versions.tf
    • Updated required Terraform version to 1.12.2.
  • modules/management/kubectl-apply/kubectl/README.md
    • Updated required Terraform version to 1.12.2.
  • modules/management/kubectl-apply/kubectl/versions.tf
    • Updated required Terraform version to 1.12.2.
  • modules/management/kubectl-apply/main.tf
    • Refactored Kueue configuration logic to use a Helm chart for applying raw manifests.
    • Added install_asapd_lite local variable.
    • Changed install_gib to use the new helm_install module instead of kubectl.
  • modules/management/kubectl-apply/raw-config-chart/Chart.yaml
    • Added a new Helm chart definition for applying raw Kubernetes manifests.
  • modules/management/kubectl-apply/raw-config-chart/templates/manifests.yaml
    • Added a new Helm template for iterating and applying raw Kubernetes manifests.
  • modules/management/kubectl-apply/raw-config-chart/values.yaml
    • Added a new values file for the raw-config-chart Helm chart.
  • modules/management/kubectl-apply/variables.tf
    • Removed the wait option from the kueue variable.
    • Simplified the description for gke_cluster_exists.
    • Added accelerator_type to the nvidia_dra_driver variable.
    • Added new system_node_pool_id and asapd_lite variables.
  • modules/management/kubectl-apply/versions.tf
    • Updated the module version to v1.83.0.
  • modules/monitoring/dashboard/README.md
    • Updated required Terraform version to 1.12.2.
  • modules/monitoring/dashboard/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0.
  • modules/network/firewall-rules/README.md
    • Updated required Terraform version to 1.12.2.
  • modules/network/firewall-rules/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0.
  • modules/network/gpu-rdma-vpc/README.md
    • Updated required Terraform version to 1.12.2.
  • modules/network/gpu-rdma-vpc/main.tf
    • Updated output_subnets_gke logic to dynamically determine subnetwork name, NIC type, and stack type based on is_roce_metal.
  • modules/network/gpu-rdma-vpc/versions.tf
    • Updated required Terraform version to 1.12.2.
  • modules/network/multivpc/README.md
    • Updated required Terraform version to 1.12.2.
  • modules/network/multivpc/versions.tf
    • Updated required Terraform version to 1.12.2.
  • modules/network/pre-existing-subnetwork/README.md
    • Updated required Terraform version to 1.12.2.
  • modules/network/pre-existing-subnetwork/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0.
  • modules/network/pre-existing-vpc/README.md
    • Updated required Terraform version to 1.12.2.
  • modules/network/pre-existing-vpc/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0.
  • modules/network/private-service-access/README.md
    • Updated required Terraform version to 1.12.2.
  • modules/network/private-service-access/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0 for both google and google-beta providers.
  • modules/network/vpc/README.md
    • Updated required Terraform version to 1.12.2.
    • Added subnetwork_stack_type and subnetwork_ipv6_access_type to the list of inputs.
  • modules/network/vpc/main.tf
    • Added stack_type and ipv6_access_type to the default primary subnetwork configuration.
    • Added output_primary_subnetwork_stack_type to local outputs.
  • modules/network/vpc/metadata.yaml
    • Added deprecated validators for primary_subnetwork, additional_subnetworks, and subnetwork_size.
  • modules/network/vpc/outputs.tf
    • Added subnetwork_stack_type output for the primary subnetwork.
  • modules/network/vpc/variables.tf
    • Added new subnetwork_stack_type and subnetwork_ipv6_access_type variables.
  • modules/network/vpc/versions.tf
    • Updated required Terraform version to 1.12.2.
  • modules/project/service-account/README.md
    • Updated required Terraform version to 1.12.2.
  • modules/project/service-account/metadata.yaml
    • Added deprecated validators for descriptions, names, and prefix variables.
  • modules/project/service-account/versions.tf
    • Updated required Terraform version to 1.12.2.
  • modules/scheduler/batch-job-template/README.md
    • Updated required Terraform version to 1.12.2.
  • modules/scheduler/batch-job-template/versions.tf
    • Updated required Terraform version to 1.12.2.
  • modules/scheduler/batch-login-node/README.md
    • Updated required Terraform version to 1.12.2.
  • modules/scheduler/batch-login-node/metadata.yaml
    • Added deprecated validators for job_template_contents, job_filename, and job_id.
  • modules/scheduler/batch-login-node/versions.tf
    • Updated the module version to v1.83.0.
  • modules/scheduler/gke-cluster/README.md
    • Updated required Terraform version to 1.12.2.
    • Added enable_shielded_nodes input variable description.
  • modules/scheduler/gke-cluster/main.tf
    • Configured enable_shielded_nodes to use the variable input instead of a hardcoded value.
  • modules/scheduler/gke-cluster/outputs.tf
    • Simplified the description for gke_cluster_exists.
    • Added system_node_pool_id output.
  • modules/scheduler/gke-cluster/variables.tf
    • Added a new enable_shielded_nodes variable with a default of true.
  • modules/scheduler/gke-cluster/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0 for both google and google-beta providers.
  • modules/scheduler/pre-existing-gke-cluster/README.md
    • Updated required Terraform version to 1.12.2.
  • modules/scheduler/pre-existing-gke-cluster/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0.
  • modules/scripts/startup-script/README.md
    • Updated required Terraform version to 1.12.2.
  • modules/scripts/startup-script/metadata.yaml
    • Added deprecated validators for enable_docker_world_writable and install_docker.
  • modules/scripts/startup-script/versions.tf
    • Updated required Terraform version to 1.12.2.
    • Updated the module version to v1.83.0.
  • pkg/config/config.go
    • Updated matchSlurmClusterNameExp regex to allow hyphens and increase the maximum length to 20 characters.
  • pkg/config/config_test.go
    • Updated tests for slurm_cluster_name validation to reflect the new regex allowing hyphens and a max length of 20 characters.
  • pkg/config/deprecation_validate.go
    • Added a new file containing logic for validating module deprecation dates and providing warnings.
  • pkg/config/deprecation_validate_test.go
    • Added a new test file for the validateDeprecation function.
  • pkg/config/errors.go
    • Updated errMsgSlurmClusterNameReqs to reflect the new validation rules for Slurm cluster names.
  • pkg/config/expand.go
    • Updated the default Google provider versions to >= 6.9.0, <= 7.18.0.
  • pkg/config/expand_test.go
    • Updated expected Google provider versions in tests to >= 6.9.0, <= 7.18.0.
  • pkg/config/validate.go
    • Added a call to validateDeprecation within the module validation process.
  • pkg/logging/logging.go
    • Added timestamping to log outputs and introduced colored output for warnings.
  • pkg/modulereader/metadata.go
    • Added DeprecationDate and AlternativeModule fields to MetadataGhpc struct for module deprecation management.
  • pkg/modulereader/modules/test_role/test_module/versions.tf
    • Updated required Terraform version to 1.12.2.
  • pkg/modulewriter/modulewriter_test.go
    • Updated the expected required Terraform version in tests to 1.12.2.
  • pkg/modulewriter/tfwriter.go
    • Updated the required_version for Terraform to "= 1.12.2".
  • pkg/shell/packer.go
    • Integrated newTimestampWriter to add timestamps to Packer command outputs.
  • pkg/shell/terraform.go
    • Integrated newTimestampWriter to add timestamps to Terraform command outputs.
  • pkg/shell/writer.go
    • Added a new file defining a timestampWriter utility for prepending timestamps to log lines.
  • pkg/shell/writer_test.go
    • Added a new test file for the timestampWriter utility.
  • pkg/sourcereader/modules/network/vpc/main.tf
    • Updated required Terraform version to 1.12.2.
  • pkg/validators/cloud.go
    • Added reservationNameRegex and resKeyRegex for parsing reservation names.
    • Implemented findReservationInOtherZones to search for reservations across zones.
    • Added TestReservationExists and testReservationExists functions for validating reservation existence and location.
  • pkg/validators/metadata_validator_helpers.go
    • Improved error handling in getModuleSettingValues for missing settings.
    • Added parseString helper function for string parsing.
    • Added convertToCty, ValuesMatch, and formatValue helper functions for cty.Value manipulation and comparison.
  • pkg/validators/metadata_validators.go
    • Implemented RequiredValidator to enforce presence or absence of variables.
    • Implemented ConditionalValidator to enforce dependent variable conditions based on trigger variables.
  • pkg/validators/metadata_validators_test.go
    • Added comprehensive tests for ConditionalValidator and RequiredValidator covering various scenarios.
    • Added tests for isVarSet, convertToCty, and ValuesMatch helper functions.
  • pkg/validators/registry.go
    • Registered the new RequiredValidator and ConditionalValidator.
  • pkg/validators/validators.go
    • Added testReservationExistsName to the list of validator names.
    • Integrated automatic reservation existence checks into the defaults function for blueprint validation.
  • pkg/validators/validators_test.go
    • Added tests for default validators, including the new reservation existence checks.
Ignored Files
  • Ignored by pattern: .github/workflows/** (2)
    • .github/workflows/label-external.yml
    • .github/workflows/pr-precommit.yml
Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature Command Description
Code Review /gemini review Performs a code review for the current pull request in its current state.
Pull Request Summary /gemini summary Provides a summary of the current pull request in its current state.
Comment @gemini-code-assist Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help /gemini help Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Footnotes

  1. Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution.

@gemini-code-assist gemini-code-assist Bot left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request introduces release candidate v1.83.0, which brings a substantial set of updates and new features. Key changes include upgrading the minimum Terraform version to 1.12.2 across all modules, bumping various dependency versions, and updating the slurm_cluster_name validation to be more flexible. The release adds extensive new examples for A4X and TPU hardware, including demonstrations of Dynamic Workload Scheduler (DWS) with Queued Provisioning. It also introduces powerful new blueprint validation capabilities, such as required and conditional validators, automatic checks for reservation existence, and warnings for deprecated modules. Logging has been enhanced with timestamps for better readability of long-running operations. The changes are well-implemented and consistent, significantly improving the toolkit's functionality and user experience. The overall quality of the release is excellent.

@gemini-code-assist gemini-code-assist Bot left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request is a significant release, bumping the version to v1.83.0. It includes widespread updates to pin the Terraform version, dependency upgrades, and support for new hardware like A4X Max bare metal instances. There are also great improvements to the validation framework with new validators, and enhanced logging for a better user experience. I've found a couple of points for improvement: one is a critical documentation update required by the style guide, and the other is a suggestion to improve code consistency.

Comment thread examples/README.md

@gemini-code-assist gemini-code-assist Bot left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request prepares for the v1.83.0 release. It includes a wide range of changes such as bumping the toolkit version, updating dependencies, and introducing significant new features. Key additions include comprehensive examples for A4X Max Bare Metal and DWS with Queued Provisioning for TPUs. The validation framework has been enhanced with new validators for required/conditional fields, deprecation checks, and reservation existence, improving blueprint robustness. The logging has also been improved with timestamps for better traceability. My review identified a critical issue with an invalid Terraform version being set across the project, which will need to be corrected, along with a recommendation to review the naming convention for the Terraform version variable.

Comment thread Makefile

@SwarnaBharathiMantena SwarnaBharathiMantena left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

  • ml-a4-highgpu-slurm test --- failed --- reservation expired.
  • hpc-enterprise slurm --- failed --- managed lustre issue
  • The remaining three failing tests are failing in daily integration tests, and being tracked by team.

@SwarnaBharathiMantena SwarnaBharathiMantena left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Lets ensure G4 Slurm test is passing.

@agrawalkhushi18

Copy link
Copy Markdown
Contributor

@SwarnaBharathiMantena SwarnaBharathiMantena left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@sudheer-quad sudheer-quad merged commit 09301db into main Mar 5, 2026
73 of 82 checks passed
@aslam-quad aslam-quad deleted the release-candidate branch March 9, 2026 08:53
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

release-chore To not include into release notes

Projects

None yet

Development

Successfully merging this pull request may close these issues.