Add DRANET support in Cluster Toolkit#5418
Conversation
- Add major/minor version checking to ensure robustness (GKE >= 1.34.1) - Restrict DRANET automatic enablement to specific hardware architectures (A3/A4/CT6E) - Add validation block to enforce hardware and GKE version compatibility when DRANET is manually enabled
…works - Prevent automatic DRANET enablement if additional_networks are provided - Add validation check to prevent enable_dranet=true alongside additional_networks - Bypass minimum additional networks check when DRANET is actively managing network profiles
Summary of ChangesHello, I'm Gemini Code Assist1! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed! This pull request introduces support for DRANET within the GKE node pool module. It adds the necessary configuration parameters and logic to enable automated networking for supported accelerator-optimized machine types, ensuring that the feature is correctly applied based on GKE version compatibility and machine type requirements. Highlights
Using Gemini Code AssistThe full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips. Invoking Gemini You can request assistance from Gemini at any point by creating a comment using either
Customization To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a Limitations & Feedback Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for GitHub and other Google products, sign up here. Footnotes
|
There was a problem hiding this comment.
Code Review
This pull request introduces support for GKE managed Dynamic Resource Allocation (DRA) for networking (DRANET) and Accelerator Network Profiles (ANP) in the GKE node pool module. Changes include a new enable_dranet variable, automatic enablement logic based on GKE version and machine type compatibility, and validation checks to prevent configuration conflicts. Feedback suggests improving documentation consistency in the README and refactoring the version comparison logic in gpu_direct.tf for better readability by extracting version components into local variables.
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
…atible for better readability
|
/gcbrun |
|
/gcbrun |
a52543f to
199413a
Compare
|
/gcbrun |
199413a to
dfded64
Compare
|
/gcbrun |
0df7565 to
3be5df8
Compare
3be5df8 to
24bea5b
Compare
|
/gcbrun |
95b778a
into
GoogleCloudPlatform:develop
Automatically enabled automated networking for supported GPUs and TPUs (https://docs.cloud.google.com/kubernetes-engine/docs/how-to/config-auto-net-for-accelerators#limitations)
Submission Checklist
NOTE: Community submissions can take up to 2 weeks to be reviewed.
Please take the following actions before submitting this pull request.