[ci] fix h100-distributed #157826

Closed
d4l3k wants to merge 1 commit into main from d4l3k/fix_h100_ci

Conversation

d4l3k (Member) commented Jul 8, 2025

This was broken by #157341

This should resolve the permission issue

d4l3k requested a review from fduwjj July 8, 2025 18:03
d4l3k requested a review from a team as a code owner July 8, 2025 18:03
pytorch-bot bot commented Jul 8, 2025

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/157826

Note: Links to docs will display an error until the docs builds have been completed.

⏳ No Failures, 1 Pending

As of commit 8514b64 with merge base 7381c77:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

pytorch-bot bot added the "topic: not user facing" label (topic category) Jul 8, 2025
fduwjj (Contributor) commented Jul 8, 2025

Interesting. This CI used to work; is there a recent change that broke it? It looks like both test-h100.yml and h100-symm-mem.yml have this line as well.

fduwjj (Contributor) commented Jul 8, 2025

Never mind, I saw your comment in the post. Thanks for the quick fix!

d4l3k (Member, Author) commented Jul 8, 2025

@pytorchbot merge

pytorch-bot bot added the ciflow/trunk label (Trigger trunk jobs on your pull request) Jul 8, 2025
pytorchmergebot (Collaborator)

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging: check the merge workflow status here.

pytorchmergebot (Collaborator)

The merge job was canceled or timed out. This most often happens if two merge requests were issued for the same PR, or if the merge job waited more than 6 hours for tests to finish. In the latter case, please do not hesitate to reissue the merge command.
For more information, see the pytorch-bot wiki.

huydhn (Contributor) commented Jul 9, 2025

@pytorchbot merge -f 'No need to wait for ROCm jobs'

pytorchmergebot (Collaborator)

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as a last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

github-actions bot deleted the d4l3k/fix_h100_ci branch August 8, 2025 02:22

5 participants