remove unnecessary registration #164481
eellison wants to merge 2 commits into gh/eellison/835/base
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/164481
Note: Links to docs will display an error until the docs builds have been completed.
✅ No Failures
As of commit 72fde2f with merge base d4e4f70.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
@pytorchbot merge
Merge started
Your change will be merged once all checks pass (ETA 0-4 Hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team
scaled_mm already had `needs_exact_strides` in its op registration. Also added a test showing these strides are being respected.
Pull Request resolved: pytorch#164481
Approved by: https://github.com/drisspg, https://github.com/mlazos
PR #164481 added unit test test_scaled_mm_preserves_strides in test/inductor/test_fp8.py. It was missing the adjustment for ROCm's F8 types on MI300.
Pull Request resolved: #164578
Approved by: https://github.com/jeffdaily
Co-authored-by: Jeff Daily <jeff.daily@amd.com>
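The kind of adjustment described in the follow-up commit can be sketched roughly as below. This is only an illustration, not the code from #164578: the helper name `fp8_dtype_for_platform`, the string-based dtype names, and the `gfx94` architecture prefix check are all assumptions made for this example. It relies on the understanding that MI300-class hardware uses the `fnuz` FP8 variants rather than the `float8_e4m3fn` / `float8_e5m2` types a CUDA-oriented test would pick.

```python
# Hypothetical mapping from the FP8 dtype names a CUDA-oriented test would use
# to the "fnuz" variants assumed here for MI300-class ROCm devices.
FNUZ_EQUIVALENTS = {
    "float8_e4m3fn": "float8_e4m3fnuz",
    "float8_e5m2": "float8_e5m2fnuz",
}


def fp8_dtype_for_platform(dtype_name, is_rocm, gcn_arch=""):
    """Return the FP8 dtype name a test should use on the given platform.

    `is_rocm` and `gcn_arch` are plain inputs in this sketch; a real test
    would query them from the runtime. The "gfx94" prefix check is an
    assumption standing in for "this is an MI300-class device".
    """
    if is_rocm and gcn_arch.startswith("gfx94"):
        # Swap to the fnuz variant when one is defined, otherwise keep as-is.
        return FNUZ_EQUIVALENTS.get(dtype_name, dtype_name)
    return dtype_name


cuda_choice = fp8_dtype_for_platform("float8_e4m3fn", is_rocm=False)
mi300_choice = fp8_dtype_for_platform("float8_e4m3fn", is_rocm=True, gcn_arch="gfx942")
```

A test that hard-codes the CUDA-side dtype skips this step, which is consistent with the `HIPBLAS_STATUS_NOT_SUPPORTED` failure reported on ROCm in this thread.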
It seems a ROCm test started failing after this PR:
2025-10-03T03:26:21.3422505Z FAILED [0.5430s] inductor/test_fp8.py::TestFP8Lowering::test_scaled_mm_preserves_strides - RuntimeError: CUDA error: HIPBLAS_STATUS_NOT_SUPPORTED when calling
This seems related, so I am about to revert this PR.
@pytorchbot revert -c nosignal -m "failed test in trunk https://hud.pytorch.org/pytorch/pytorch/commit/4691fe60700ac51a878775fd23a8f7c4548c6757#51853528524-box"
@pytorchbot successfully started a revert job. Check the current status here.
Reverting PR 164481 failed
Reason: Command
Details for Dev Infra team
Raised by workflow job
Stack from ghstack (oldest at bottom):
scaled_mm already had `needs_exact_strides` in its op registration. also added a test showing these strides are being respected.
cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov @coconutruben
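To make "strides are being respected" concrete, here is a small torch-free sketch of the property such a test checks. It is not the test added in this PR; the helper names (`contiguous_strides`, `transpose_2d`, `strides_preserved`) and the sizes are made up for illustration. The idea is that a column-major operand, as produced by transposing a row-major buffer, has the same shape as a freshly laid-out contiguous tensor but different strides, so comparing shapes alone would miss a silent relayout.

```python
def contiguous_strides(shape):
    """Row-major (C-contiguous) strides, in elements, for a given shape."""
    strides = []
    step = 1
    for dim in reversed(shape):
        strides.append(step)
        step *= dim
    return tuple(reversed(strides))


def transpose_2d(shape, strides):
    """Swap the two dimensions of a 2-D layout without moving any data."""
    return (shape[1], shape[0]), (strides[1], strides[0])


def strides_preserved(expected, actual):
    """True only if the strides match exactly, not merely the shape."""
    return tuple(expected) == tuple(actual)


# A row-major (N, K) buffer viewed as (K, N) is column-major.
n, k = 16, 32
b_shape, b_strides = transpose_2d((n, k), contiguous_strides((n, k)))

# If a compiler pass silently made the operand contiguous, the shape would be
# unchanged but the strides would differ, and the check below would fail.
relaid_strides = contiguous_strides(b_shape)
kept = strides_preserved(b_strides, b_strides)
changed = strides_preserved(b_strides, relaid_strides)
```

Here `kept` is true and `changed` is false, which is the distinction an exact-stride requirement on an op is meant to enforce.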