Support vGPU (fractional GPU) for G4 GKE#5399
Conversation
Summary of ChangesHello, I'm Gemini Code Assist1! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed! This pull request introduces support for fractional GPUs (vGPU) on G4 virtual machines within Google Kubernetes Engine. The changes involve updating the GKE cluster's version and release channel in the provided example, expanding the internal GPU definitions to recognize new G4 machine types, and updating documentation to inform users of the necessary GKE version for vGPU functionality. This enhancement allows for more flexible and efficient utilization of GPU resources in GKE environments. Highlights
Using Gemini Code AssistThe full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips. Invoking Gemini You can request assistance from Gemini at any point by creating a comment using either
Customization To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a Limitations & Feedback Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for GitHub and other Google products, sign up here. Footnotes
|
There was a problem hiding this comment.
Code Review
This pull request updates the GKE G4 example to support G4 vGPU, requiring GKE version 1.35.2-gke.1485000 and changing the release channel to RAPID. It also adds new g4-standard machine types to the GPU definitions. Feedback suggests using min_master_version for specific GKE versions in the example blueprint for clarity and defining the repeated GPU type string as a local variable in the Terraform module for better maintainability.
2056cc5
into
GoogleCloudPlatform:develop
Added support for G4 vGPU (fractional GPU) in GKE.
Changes:
1.35.2-gke.1485000) andRAPIDrelease channel to accommodate vGPU features.g4-standard-6,g4-standard-12,g4-standard-24).README.mdto reflect the new minimum GKE version required for G4 vGPU support.Testing:
nvidia-smijob run was successfully completed.