[ROCM][CUDA][NCCL] Disable test_lowering_one_shot_all_reduce on ROCM#139414

Closed
eqy wants to merge 2 commits into main from eqy-patch-6

Conversation

@eqy
Collaborator

@eqy eqy commented Oct 31, 2024

eqy added the labels module: cuda, module: nccl, open source, topic: not user facing, module: inductor, ciflow/inductor, rocm, ciflow/rocm on Oct 31, 2024
pytorch-bot bot added the labels module: rocm, oncall: distributed on Oct 31, 2024
@pytorch-bot

pytorch-bot bot commented Oct 31, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/139414

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (4 Unrelated Failures)

As of commit e7b3b91 with merge base 547d921:

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Contributor

@huydhn huydhn left a comment

Thank you for the fix!

@yifuwang
Collaborator

Thanks for the fix!

@eqy
Collaborator Author

eqy commented Oct 31, 2024

@pytorchmergebot merge

pytorch-bot bot added the ciflow/trunk label on Oct 31, 2024
@pytorchmergebot
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status here

@pytorchmergebot
Collaborator

Merge failed

Reason: 1 mandatory check(s) failed. The first few are:

Dig deeper by viewing the failures on hud

Details for Dev Infra team Raised by workflow job

Failing merge rule: Core Maintainers

@eqy
Collaborator Author

eqy commented Nov 1, 2024

@pytorchmergebot rebase

@pytorchmergebot
Collaborator

@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here

@pytorchmergebot
Collaborator

Tried to rebase and push PR #139414, but it was already up to date. Try rebasing against main by issuing:
@pytorchbot rebase -b main

@eqy
Collaborator Author

eqy commented Nov 1, 2024

@pytorchmergebot merge

@pytorchmergebot
Collaborator

Merge started


rahulsingh-intel pushed a commit to rahulsingh-intel/pytorch that referenced this pull request Nov 5, 2024
pytorch#139414)

I'm not sure this is expected to run if it requires buffer-registration support. CC @yifuwang @huydhn @syed-ahmed pytorch#138029

Pull Request resolved: pytorch#139414
Approved by: https://github.com/huydhn, https://github.com/yifuwang
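
For readers unfamiliar with how a test can be disabled on a single platform, here is a minimal sketch. It is not the actual change from this PR, whose diff is not shown in this conversation. It assumes that `torch.version.hip` is non-None on ROCm builds and relies only on the standard library's `unittest.skipIf`. The class name `ExampleLoweringTest`, the flag `IS_ROCM`, and the helper `run_example` are hypothetical names introduced for illustration.

```python
import unittest

try:
    import torch

    # torch.version.hip is a version string on ROCm builds and None otherwise.
    IS_ROCM = getattr(torch.version, "hip", None) is not None
except ImportError:
    # Assumption for this sketch: without torch installed, treat as non-ROCm.
    IS_ROCM = False


class ExampleLoweringTest(unittest.TestCase):  # hypothetical test class
    @unittest.skipIf(IS_ROCM, "requires buffer-registration support; skipped on ROCm")
    def test_lowering_one_shot_all_reduce(self):
        # Placeholder body; the real test is not part of this conversation.
        self.assertTrue(True)


def run_example():
    """Run the single example test and return the unittest result object."""
    suite = unittest.defaultTestLoader.loadTestsFromTestCase(ExampleLoweringTest)
    return unittest.TextTestRunner(verbosity=0).run(suite)
```

Calling `run_example()` reports the test as skipped on a ROCm build and as passed elsewhere. PyTorch's own test suite has dedicated helpers for this purpose, so the plain `unittest` decorator here is only a stand-in.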
@github-actions github-actions bot deleted the eqy-patch-6 branch December 2, 2024 02:13

Labels

ciflow/inductor, ciflow/rocm, ciflow/trunk, Merged, module: cuda, module: inductor, module: nccl, module: rocm, oncall: distributed, open source, rocm, topic: not user facing

4 participants