Bring parity of functionality to both A3U and A4#4023
Merged
tpdownes merged 3 commits intoMay 7, 2025
Conversation
Collaborator
Author
|
/gcbrun |
da16d6b to
bcd6972
Compare
tpdownes
previously approved these changes
May 1, 2025
Manual testing has shown problems with NVIDIA library version mismatches
* Accelerator images for A4 * persistenced enabled for A4 * NVIDIA repo pinning * SocketsPerBoard 2 for A3U * Enroot Config for A3U * A4 ResumeTimeout Match 1200 from A3U * Disk sizes 100GB for compute nodes * Add DWS Flex for A3U * Incorpriate accelerator image patch for A4
bcd6972 to
4e35202
Compare
tpdownes
suggested changes
May 6, 2025
Co-authored-by: Tom Downes <tpdownes@users.noreply.github.com>
tpdownes
previously approved these changes
May 7, 2025
tpdownes
left a comment
Contributor
There was a problem hiding this comment.
Minor change request. Please wait until the PR-test-ml-a4-highgpu-slurm test passes to merge.
….yaml Co-authored-by: Tom Downes <tpdownes@users.noreply.github.com>
tpdownes
approved these changes
May 7, 2025
tpdownes
left a comment
Contributor
There was a problem hiding this comment.
Tests passed before the final commit which only changed whitespace in a comment.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Marking as a breaking change as we are switching the base image being used in A4.
Submission Checklist
NOTE: Community submissions can take up to 2 weeks to be reviewed.
Please take the following actions before submitting this pull request.