Skip to content

Add DRANET support in Cluster Toolkit#5418

Merged
SwarnaBharathiMantena merged 9 commits into
GoogleCloudPlatform:developfrom
FIoannides:feature/dranet-support
Apr 28, 2026
Merged

Add DRANET support in Cluster Toolkit#5418
SwarnaBharathiMantena merged 9 commits into
GoogleCloudPlatform:developfrom
FIoannides:feature/dranet-support

Conversation

@FIoannides

Copy link
Copy Markdown
Contributor

Automatically enabled automated networking for supported GPUs and TPUs (https://docs.cloud.google.com/kubernetes-engine/docs/how-to/config-auto-net-for-accelerators#limitations)

Submission Checklist

NOTE: Community submissions can take up to 2 weeks to be reviewed.

Please take the following actions before submitting this pull request.

  • Fork your PR branch from the Toolkit "develop" branch (not main)
  • Test all changes with pre-commit in a local branch #
  • Confirm that "make tests" passes all tests
  • Add or modify unit tests to cover code changes
  • Ensure that unit test coverage remains above 80%
  • Update all applicable documentation
  • Follow Cluster Toolkit Contribution guidelines #

- Add major/minor version checking to ensure robustness (GKE >= 1.34.1)
- Restrict DRANET automatic enablement to specific hardware architectures (A3/A4/CT6E)
- Add validation block to enforce hardware and GKE version compatibility when DRANET is manually enabled
…works

- Prevent automatic DRANET enablement if additional_networks are provided
- Add validation check to prevent enable_dranet=true alongside additional_networks
- Bypass minimum additional networks check when DRANET is actively managing network profiles
@github-actions github-actions Bot added the external PR from external contributor label Mar 27, 2026
@gemini-code-assist

Copy link
Copy Markdown
Contributor

Summary of Changes

Hello, I'm Gemini Code Assist1! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request introduces support for DRANET within the GKE node pool module. It adds the necessary configuration parameters and logic to enable automated networking for supported accelerator-optimized machine types, ensuring that the feature is correctly applied based on GKE version compatibility and machine type requirements.

Highlights

  • DRANET Support: Added support for GKE managed Dynamic Resource Allocation (DRA) driver for networking (DRANET) and Accelerator Network Profile (ANP).
  • Automated Configuration: Implemented logic to automatically enable DRANET for supported GPU/TPU nodes on compatible GKE versions (1.34.1-gke.1829001 or later).
  • Validation Checks: Introduced Terraform checks to ensure DRANET requirements are met and to prevent conflicts with additional network configurations.
Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature Command Description
Code Review /gemini review Performs a code review for the current pull request in its current state.
Pull Request Summary /gemini summary Provides a summary of the current pull request in its current state.
Comment @gemini-code-assist Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help /gemini help Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for GitHub and other Google products, sign up here.

Footnotes

  1. Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution.

@gemini-code-assist gemini-code-assist Bot left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request introduces support for GKE managed Dynamic Resource Allocation (DRA) for networking (DRANET) and Accelerator Network Profiles (ANP) in the GKE node pool module. Changes include a new enable_dranet variable, automatic enablement logic based on GKE version and machine type compatibility, and validation checks to prevent configuration conflicts. Feedback suggests improving documentation consistency in the README and refactoring the version comparison logic in gpu_direct.tf for better readability by extracting version components into local variables.

Comment thread modules/compute/gke-node-pool/README.md Outdated
Comment thread modules/compute/gke-node-pool/gpu_direct.tf Outdated
FIoannides and others added 3 commits March 27, 2026 12:18
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
@FIoannides FIoannides marked this pull request as ready for review March 27, 2026 14:03
@FIoannides FIoannides requested review from a team and samskillman as code owners March 27, 2026 14:03
Comment thread modules/compute/gke-node-pool/main.tf
Comment thread modules/compute/gke-node-pool/main.tf
@SwarnaBharathiMantena SwarnaBharathiMantena added the release-module-improvements Added to release notes under the "Module Improvements" heading. label Apr 8, 2026
@SwarnaBharathiMantena

Copy link
Copy Markdown
Contributor

/gcbrun

Comment thread modules/compute/gke-node-pool/main.tf Outdated
@SwarnaBharathiMantena

Copy link
Copy Markdown
Contributor

/gcbrun

Comment thread modules/compute/gke-node-pool/gpu_direct.tf Outdated
@FIoannides FIoannides force-pushed the feature/dranet-support branch from a52543f to 199413a Compare April 22, 2026 16:04
@SwarnaBharathiMantena

Copy link
Copy Markdown
Contributor

/gcbrun

@FIoannides FIoannides force-pushed the feature/dranet-support branch from 199413a to dfded64 Compare April 23, 2026 07:02
@SwarnaBharathiMantena

Copy link
Copy Markdown
Contributor

/gcbrun

@SwarnaBharathiMantena SwarnaBharathiMantena left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

Comment thread modules/compute/gke-node-pool/main.tf Outdated
@FIoannides FIoannides force-pushed the feature/dranet-support branch from 0df7565 to 3be5df8 Compare April 24, 2026 10:12
@FIoannides FIoannides force-pushed the feature/dranet-support branch from 3be5df8 to 24bea5b Compare April 27, 2026 08:26
@SwarnaBharathiMantena

Copy link
Copy Markdown
Contributor

/gcbrun

@SwarnaBharathiMantena SwarnaBharathiMantena left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

@SwarnaBharathiMantena SwarnaBharathiMantena added release-key-new-features Added to release notes under the "Key New Features" heading. and removed release-module-improvements Added to release notes under the "Module Improvements" heading. labels Apr 28, 2026
@SwarnaBharathiMantena SwarnaBharathiMantena merged commit 95b778a into GoogleCloudPlatform:develop Apr 28, 2026
32 of 77 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

external PR from external contributor release-key-new-features Added to release notes under the "Key New Features" heading.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants