Update ramble NCCL scripts to be fully noninteractive #5480
Summary of Changes
Hello, I'm Gemini Code Assist! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!
This pull request improves the reliability of automated NCCL benchmark scripts by ensuring that prerequisite installations are fully non-interactive. By managing the Munge service lifecycle and configuring apt-get options, the scripts now avoid common pitfalls related to service locks and interactive configuration prompts during deployment.
Code Review
This pull request updates NCCL test scripts to handle Munge service conflicts during package installation by stopping the service before running apt-get and restarting it afterward. The review feedback identifies logic issues where the service might be started unnecessarily or left stopped upon failure, and suggests tracking the initial service state and removing inconsistent step numbering.
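The pattern the review asks for can be sketched roughly as follows. This is a minimal illustration, not the code from the PR: the function name `install_noninteractive`, the overridable `SYSTEMCTL` and `APT_GET` variables, and the specific `Dpkg::Options` flags are all assumptions chosen for the example. It records whether Munge was running beforehand, stops it only in that case, and restarts it afterward even if the install fails.

```shell
# Hypothetical sketch: names and apt options are illustrative assumptions,
# not taken from the scripts in this PR.

# Commands are overridable so the logic can be dry-run without root.
SYSTEMCTL="${SYSTEMCTL:-systemctl}"
APT_GET="${APT_GET:-apt-get}"

install_noninteractive() {
  local service="munge"
  local was_active=0
  local rc=0

  # Track the initial state so we only restart a service we stopped.
  if "$SYSTEMCTL" is-active --quiet "$service"; then
    was_active=1
    "$SYSTEMCTL" stop "$service"
  fi

  # Suppress debconf prompts and keep existing config files on conflict.
  DEBIAN_FRONTEND=noninteractive "$APT_GET" install -y \
    -o Dpkg::Options::=--force-confdef \
    -o Dpkg::Options::=--force-confold \
    "$@" || rc=$?

  # Restore the service even when the install failed.
  if [ "$was_active" -eq 1 ]; then
    "$SYSTEMCTL" start "$service"
  fi

  return "$rc"
}
```

Capturing the install's exit status instead of exiting immediately is what keeps the service from being left stopped on failure; the caller still sees the original error code.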
9c7b9ea to 6b49869
6b49869 to a5c6e39
SwarnaBharathiMantena left a comment
LGTM!
@RachaelSTamakloe please ensure that these changes are tested.
35e1172 into GoogleCloudPlatform:develop
Update slurm ramble a4h, a4x-high, a4x-max NCCL scripts to be fully noninteractive
Submission Checklist
NOTE: Community submissions can take up to 2 weeks to be reviewed.
Please take the following actions before submitting this pull request.