[ROCm][CI] fix test_max_autotune.py:test_max_autotune_exhaustive() by AmdSampsa · Pull Request #176162 · pytorch/pytorch

AmdSampsa · 2026-03-02T14:32:11Z

test_max_autotune.py:test_max_autotune_exhaustive() was using the CUDA template config heuristics, but on ROCm it needs to use the ROCm template heuristics instead.
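
A minimal sketch of the kind of backend-dependent selection described here, not the actual change in this PR. The class names `CudaTemplateHeuristics` and `RocmTemplateHeuristics` and the helper `pick_template_heuristics` are hypothetical stand-ins; the only real PyTorch detail assumed is that `torch.version.hip` is a version string on ROCm builds and `None` otherwise.

```python
from typing import Optional


class CudaTemplateHeuristics:
    """Hypothetical stand-in for the CUDA template config heuristics."""

    backend = "cuda"


class RocmTemplateHeuristics:
    """Hypothetical stand-in for the ROCm template config heuristics."""

    backend = "rocm"


def pick_template_heuristics(hip_version: Optional[str]) -> type:
    """Return the heuristics class matching the build's GPU backend.

    hip_version plays the role of torch.version.hip; it is passed in
    explicitly so the sketch runs without torch installed.
    """
    if hip_version is not None:
        # A non-None HIP version indicates a ROCm build.
        return RocmTemplateHeuristics
    return CudaTemplateHeuristics
```

In a test, the caller would pass `torch.version.hip` so that the expected configs are derived from the same backend the build actually targets.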

cc @jeffdaily @sunway513 @jithunnair-amd @pruthvistony @ROCmSupport @jataylo @hongxiayang @naromero77amd @pragupta @jerrymannil @xinyazhang @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @kadeng @muchulee8 @amjames @chauhang @aakhundov @coconutruben

pytorch-bot · 2026-03-02T14:32:15Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/176162

❌ 1 Cancelled Job, 2 Unrelated Failures

As of commit f0d16ed with merge base 0569e4a:

CANCELLED JOB - The following job was cancelled. Please retry:

Limited CI on H100 / linux-jammy-cuda12_8-py3_10-gcc11-sm90-FA3-ABI-stable-test / test (gh)
##[error]The operation was canceled.

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

rocm-mi300 / linux-noble-rocm-py3.12-mi300 / test (default, 2, 6, linux.rocm.gpu.gfx942.1) (gh) (similar failure)
test/dynamo/test_structured_trace.py::StructuredTraceTest::test_ddp_graphs
trunk / macos-py3-arm64 / test (default, 1, 3, macos-m1-stable) (gh) (similar failure)
test/test_indexing.py::TestIndexingMPS::test_index_reduce_reduce_amax_mps_float32

This comment was automatically generated by Dr. CI and updates every 15 minutes.

AmdSampsa · 2026-03-03T11:14:43Z

The remaining failing CI test:

FAILED CONSISTENTLY: test/dynamo/test_structured_trace.py::StructuredTraceTest::test_ddp_graphs
Stopping at first consistent failure
The following tests failed consistently: ['test/dynamo/test_structured_trace.py::StructuredTraceTest::test_ddp_graphs']

This failure seems unrelated to our (atomic) unit test: when I run that dynamo test manually on my test MI350, it passes.
Please force-merge.

jeffdaily · 2026-03-12T14:06:55Z

@pytorchbot merge

pytorchmergebot · 2026-03-12T14:09:14Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

pytorchmergebot · 2026-03-12T14:09:36Z

Merge failed

Reason: 1 jobs have failed, first few of them are: Limited CI on H100 / linux-jammy-cuda12_8-py3_10-gcc11-sm90-FA3-ABI-stable-test / test

jeffdaily · 2026-03-12T17:44:01Z

@pytorchbot merge -f "all remaining failure are unrelated"

pytorchmergebot · 2026-03-12T17:51:02Z

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

…ytorch#176162)

test_max_autotune.py:test_max_autotune_exhaustive() was using cuda template config heuristics, but it needed to use rocm template heuristics instead

Pull Request resolved: pytorch#176162
Approved by: https://github.com/jeffdaily

Commit f0d16ed: test_max_autotune.py:test_max_autotune_exhaustive() was using cuda template config heuristics, but it needed to use rocm template heuristics instead

pytorch-bot bot added the ciflow/b200, ciflow/h100, ciflow/inductor, ciflow/rocm-mi300, module: inductor, module: rocm, and topic: not user facing labels Mar 2, 2026

AmdSampsa requested review from jataylo and jithunnair-amd March 2, 2026 14:32

pytorchbot added the open source label Mar 2, 2026

jataylo requested a review from jeffdaily March 3, 2026 11:16

jeffdaily approved these changes Mar 6, 2026

jeffdaily marked this pull request as ready for review March 6, 2026 00:05

jeffdaily changed the title ~~fix test_max_autotune.py:test_max_autotune_exhaustive() for ROCm~~ [ROCm] fix test_max_autotune.py:test_max_autotune_exhaustive() Mar 12, 2026

jeffdaily changed the title ~~[ROCm] fix test_max_autotune.py:test_max_autotune_exhaustive()~~ [ROCm][CI] fix test_max_autotune.py:test_max_autotune_exhaustive() Mar 12, 2026

pytorch-bot bot added the ciflow/trunk label Mar 12, 2026

pytorchmergebot added the merging label Mar 12, 2026

pytorchmergebot removed the merging label Mar 12, 2026

pytorchmergebot added the merging label Mar 12, 2026

pytorchmergebot closed this in c1d407d Mar 12, 2026

pytorchmergebot added the Merged label Mar 12, 2026

pytorchmergebot removed the merging label Mar 12, 2026

This was referenced Mar 18, 2026

TestModule: test/inductor/test_max_autotune.py::TestMaxAutotune #168608

Closed

Test: TestMaxAutotune.test_max_autotune_exhaustive #168618

Closed

AmdSampsa wants to merge 1 commit into pytorch:main from AmdSampsa:test-max-autotune_exhaustive-fix-clean
