Skip to content

Hotfix: Update the region and zone(s) to overcome Lustre instance creation issue#5290

Merged
SwarnaBharathiMantena merged 1 commit into
GoogleCloudPlatform:release-candidatefrom
SwarnaBharathiMantena:swarnabm/fix_lustre_res_cre_issue
Feb 26, 2026
Merged

Hotfix: Update the region and zone(s) to overcome Lustre instance creation issue#5290
SwarnaBharathiMantena merged 1 commit into
GoogleCloudPlatform:release-candidatefrom
SwarnaBharathiMantena:swarnabm/fix_lustre_res_cre_issue

Conversation

@SwarnaBharathiMantena

@SwarnaBharathiMantena SwarnaBharathiMantena commented Feb 26, 2026

Copy link
Copy Markdown
Contributor

The PR-test-hpc-enterprise-slurm PR test has been failing with the error below:

Step #1 - "hpc-enterprise-slurm-v6": 2026-02-26T05:10:29Z Error: Error waiting to create Instance: Error waiting for Creating Instance: Error code 13, message: an internal error has occurred
Step #1 - "hpc-enterprise-slurm-v6": 
Step #1 - "hpc-enterprise-slurm-v6":   with module.lustre-gcp.google_lustre_instance.lustre_instance,
Step #1 - "hpc-enterprise-slurm-v6":   on modules/embedded/modules/file-system/managed-lustre/main.tf line 61, in resource "google_lustre_instance" "lustre_instance":
Step #1 - "hpc-enterprise-slurm-v6":   61: resource "google_lustre_instance" "lustre_instance" {

One such failed build: https://pantheon.corp.google.com/cloud-build/builds;region=global/220fe844-9432-4d57-8820-82c6d32e001a?e=13803378&mods=monitoring_api_prod&project=hpc-toolkit-dev

Workaround: Update the region and zone inputs.
Callouts: The test should be improved to make a check similar to how spot tests run for identifying an appropriate region / zone.

@SwarnaBharathiMantena SwarnaBharathiMantena added the release-chore To not include into release notes label Feb 26, 2026
@gemini-code-assist

Copy link
Copy Markdown
Contributor

Summary of Changes

Hello @SwarnaBharathiMantena, I'm Gemini Code Assist1! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request provides a hotfix to address persistent build failures related to Lustre instance creation within the daily HPC Enterprise Slurm tests. The core of the solution involves reconfiguring the geographical parameters, shifting the test environment from a European region to a US central region to circumvent the underlying issue.

Highlights

  • Region and Zone Update: The region and zone parameters for the HPC Enterprise Slurm daily test configuration have been updated from europe-west4 and europe-west4-c to us-central1 and us-central1-a respectively.
  • GPU Zones Configuration: The list of gpu_zones has been modified to reflect the new us-central1 region, now including us-central1-a, us-central1-b, us-central1-c, and us-central1-f.
  • Addressing Lustre Instance Issue: These changes are implemented as a hotfix to resolve an ongoing issue with Lustre instance creation that was causing daily build failures.
Changelog
  • tools/cloud-build/daily-tests/tests/hpc-enterprise-slurm.yml
    • Updated the default zone from europe-west4-c to us-central1-a.
    • Modified the region in cli_deployment_vars from europe-west4 to us-central1.
    • Adjusted the gpu_zones list to [us-central1-a, us-central1-b, us-central1-c, us-central1-f].
Activity
  • No specific activity (comments, reviews, etc.) has been recorded for this pull request yet.
Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature Command Description
Code Review /gemini review Performs a code review for the current pull request in its current state.
Pull Request Summary /gemini summary Provides a summary of the current pull request in its current state.
Comment @gemini-code-assist Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help /gemini help Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Footnotes

  1. Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution.

@gemini-code-assist gemini-code-assist Bot left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request provides a hotfix by updating the region and zone configurations in the hpc-enterprise-slurm.yml test file. These changes are intended to resolve issues with Lustre instance creation by moving to a more stable region.

@SwarnaBharathiMantena SwarnaBharathiMantena merged commit 80f0b3b into GoogleCloudPlatform:release-candidate Feb 26, 2026
16 of 83 checks passed
kadupoornima pushed a commit to kadupoornima/cluster-toolkit that referenced this pull request Mar 2, 2026
kadupoornima pushed a commit to kadupoornima/cluster-toolkit that referenced this pull request Mar 2, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

release-chore To not include into release notes

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants