[ROCM][CI] Introduce tests-to-include as rocm-test workflow input#110511
[ROCM][CI] Introduce tests-to-include as rocm-test workflow input#110511jithunnair-amd wants to merge 10 commits intopytorch:mainfrom
Conversation
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/110511
Note: Links to docs will display an error until the docs builds have been completed. ✅ No FailuresAs of commit ac4ae2f with merge base 0fd856c ( This comment was automatically generated by Dr. CI and updates every 15 minutes. |
|
Seeing a failure |
|
@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here |
|
Successfully rebased |
37181f5 to
84a7a0c
Compare
|
ROCm CI ran 1 shard as expected: https://github.com/pytorch/pytorch/actions/runs/6552121852/job/17795441333 Since the aim of this PR is to introduce a way to run smaller set of core unit tests, we should exclude the cpp tests. |
b21f82a to
8e934ef
Compare
|
@huydhn @clee2000 The ROCm CI as part of
|
|
Ping @jithunnair-amd to see if there is any update on this one. The context is that we start to see some ROCm failures landing in trunk, i.e. https://hud.pytorch.org/pytorch/pytorch/commit/7ccca60927cdccde63d6a1d40480950f24e9877a, because the PR didn't have |
Just updated this PR to use ROCm5.7, but otherwise it looks good from my end if all ROCm tests pass. Requesting @jeffdaily to also take a look in case I'm missing something. |
|
@pytorchbot rebase |
|
@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here |
|
Successfully rebased |
f17f649 to
f39847d
Compare
|
@jithunnair-amd are the CI failures real? |
The torchvision build failure is real, but is due to some unsupported compiler flags: The previous CI run succeeded and used a different torchvision commit. There was another torchvision commit bump after the one in the most recent failing CI run. I'm assessing if that'll resolve this issue. |
|
@pytorchbot rebase |
|
@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here |
|
Successfully rebased |
f39847d to
ac4ae2f
Compare
Actually, 0a7eef9 fixed the issue wrt unsupported compiler flags, so expecting a rebase to help. |
|
ROCm CI passing with rebase: https://github.com/pytorch/pytorch/actions/runs/6854687390/job/18638805996 Merging as pre-rebase commit had all CI checks passing except ROCm. @pytorchbot merge -f "ROCm CI check passed post-rebase; all other CI checks passed pre-rebase already" |
Merge startedYour change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team |
…torch#110511) Fixes pytorch#110181 Pull Request resolved: pytorch#110511 Approved by: https://github.com/huydhn
Fixes #110181
cc @jeffdaily @sunway513 @pruthvistony @ROCmSupport @dllehr-amd @jataylo @hongxiayang