Skip to content

Fixes to nodeset dynamic#4125

Merged
cboneti merged 1 commit into
GoogleCloudPlatform:developfrom
cboneti:nodeset-dyn
May 15, 2025
Merged

Fixes to nodeset dynamic#4125
cboneti merged 1 commit into
GoogleCloudPlatform:developfrom
cboneti:nodeset-dyn

Conversation

@cboneti

@cboneti cboneti commented May 15, 2025

Copy link
Copy Markdown
Member
  • Added universe_domain (main problem that prevented nodes from running Slurm startup)
  • Added startup_script variable
  • Added network_storage variable

Submission Checklist

NOTE: Community submissions can take up to 2 weeks to be reviewed.

Please take the following actions before submitting this pull request.

  • Fork your PR branch from the Toolkit "develop" branch (not main)
  • Test all changes with pre-commit in a local branch #
  • Confirm that "make tests" passes all tests
  • Add or modify unit tests to cover code changes
  • Ensure that unit test coverage remains above 80%
  • Update all applicable documentation
  • Follow Cluster Toolkit Contribution guidelines #

* Added universe_domain (main problem that prevented nodes from running
Slurm startup)
* Added startup_script variable
* Added network_storage variable
@cboneti cboneti requested review from a team and samskillman as code owners May 15, 2025 00:40
@cboneti cboneti requested a review from mr0re1 May 15, 2025 00:40

@gemini-code-assist gemini-code-assist Bot left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hello @cboneti, I'm Gemini Code Assist1! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

Summary of Changes

Hello! Gemini here, providing a summary of this pull request. This PR, titled "Fixes to nodeset dynamic", aims to address issues and add new configuration capabilities to the dynamic nodeset module for Slurm on GCP. The primary goal, as stated in the description, is to fix a problem preventing nodes from running the Slurm startup by adding the universe_domain variable. Additionally, it introduces variables for specifying a custom startup_script and configuring network_storage mounts on the nodes.

Highlights

  • Fix Node Startup Issue: Adds the universe_domain variable to the module inputs and includes it in the instance metadata, which is intended to resolve a problem preventing Slurm nodes from starting up correctly.
  • Custom Startup Script: Introduces a startup_script input variable allowing users to provide a custom script to be executed when the VM instances in the nodeset start.
  • Network Storage Configuration: Adds a network_storage input variable to define network attached storage mounts (like NFS) that should be configured on the nodes.
  • Documentation Updates: Updates the README.md file to document the newly added universe_domain, startup_script, and network_storage input variables.

Changelog

Click here to see the changelog
  • community/modules/compute/schedmd-slurm-gcp-v6-nodeset-dynamic/README.md
    • Added documentation for the network_storage input variable (around line 116).
    • Added documentation for the startup_script input variable (around line 127).
    • Added documentation for the universe_domain input variable (around line 130).
  • community/modules/compute/schedmd-slurm-gcp-v6-nodeset-dynamic/main.tf
    • Added universe_domain to the instance metadata configuration (around line 34-38).
    • Included startup_script and network_storage in the nodeset local variable definition (around line 46-47).
    • Defined a ghpc_startup_script local variable based on the startup_script input (around line 70-73).
  • community/modules/compute/schedmd-slurm-gcp-v6-nodeset-dynamic/variables.tf
    • Added the definition for the startup_script input variable with type and default (around line 379).
    • Added the definition for the universe_domain input variable with type, default, and nullable constraint (around line 385).
    • Added the definition for the network_storage input variable with type and default (around line 392).
Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point in your pull request via creating an issue comment (i.e. comment on the pull request page) using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands.

Feature Command Description
Code Review /gemini review Performs a code review for the current pull request in its current state.
Pull Request Summary /gemini summary Provides a summary of the current pull request in its current state.
Comment @gemini-code-assist Responds in comments when explicitly tagged, both in issue comments and review comments.
Help /gemini help Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist is currently in preview and may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments to provide feedback.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.


Terraform code flows,
Variables define the cloud,
Fixing startup woes.

Footnotes

  1. Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution.

@cboneti cboneti added the release-bugfix Added to release notes under the "Bug fixes" heading. label May 15, 2025

@gemini-code-assist gemini-code-assist Bot left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

The pull request introduces several new features to the nodeset dynamic module, including the addition of universe_domain, startup_script, and network_storage variables. The changes seem well-structured and address the identified problem of preventing nodes from running Slurm startup. However, some improvements can be made to enhance code clarity and maintainability.

Merge Readiness

The pull request is mostly in good shape and introduces valuable functionality. However, addressing the comment regarding the nullable attribute in variables.tf would improve the code's clarity. I am unable to directly approve this pull request, and recommend that others review and approve this code before merging.

@cboneti cboneti merged commit 618cc7d into GoogleCloudPlatform:develop May 15, 2025
13 of 68 checks passed
@cboneti cboneti deleted the nodeset-dyn branch May 15, 2025 16:54
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

release-bugfix Added to release notes under the "Bug fixes" heading.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants