Skip to content

[ROCm][CI] fix test_max_autotune.py:test_max_autotune_exhaustive()#176162

Closed
AmdSampsa wants to merge 1 commit intopytorch:mainfrom
AmdSampsa:test-max-autotune_exhaustive-fix-clean
Closed

[ROCm][CI] fix test_max_autotune.py:test_max_autotune_exhaustive()#176162
AmdSampsa wants to merge 1 commit intopytorch:mainfrom
AmdSampsa:test-max-autotune_exhaustive-fix-clean

Conversation

@AmdSampsa
Copy link
Copy Markdown
Collaborator

@AmdSampsa AmdSampsa commented Mar 2, 2026

…mplate config heuristics, but it needed to use rocm template heuristics instead
@pytorch-bot
Copy link
Copy Markdown

pytorch-bot bot commented Mar 2, 2026

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/176162

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 Cancelled Job, 2 Unrelated Failures

As of commit f0d16ed with merge base 0569e4a (image):

CANCELLED JOB - The following job was cancelled. Please retry:

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@AmdSampsa
Copy link
Copy Markdown
Collaborator Author

The remaining failing CI test:

FAILED CONSISTENTLY: test/dynamo/test_structured_trace.py::StructuredTraceTest::test_ddp_graphs
Stopping at first consistent failure
The following tests failed consistently: ['test/dynamo/test_structured_trace.py::StructuredTraceTest::test_ddp_graphs']

Seems to have nothing to do with our (atomic) unit test. When I run it manually on my test MI350, that dynamo test passes ok.
-> please force-merge.

@jataylo jataylo requested a review from jeffdaily March 3, 2026 11:16
@jeffdaily jeffdaily marked this pull request as ready for review March 6, 2026 00:05
@jeffdaily jeffdaily changed the title fix test_max_autotune.py:test_max_autotune_exhaustive() for ROCm [ROCm] fix test_max_autotune.py:test_max_autotune_exhaustive() Mar 12, 2026
@jeffdaily jeffdaily changed the title [ROCm] fix test_max_autotune.py:test_max_autotune_exhaustive() [ROCm][CI] fix test_max_autotune.py:test_max_autotune_exhaustive() Mar 12, 2026
@jeffdaily
Copy link
Copy Markdown
Collaborator

@pytorchbot merge

@pytorch-bot pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Mar 12, 2026
@pytorchmergebot
Copy link
Copy Markdown
Collaborator

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status
here

@pytorchmergebot
Copy link
Copy Markdown
Collaborator

Merge failed

Reason: 1 jobs have failed, first few of them are: Limited CI on H100 / linux-jammy-cuda12_8-py3_10-gcc11-sm90-FA3-ABI-stable-test / test

Details for Dev Infra team Raised by workflow job

@jeffdaily
Copy link
Copy Markdown
Collaborator

@pytorchbot merge -f "all remaining failure are unrelated"

@pytorchmergebot
Copy link
Copy Markdown
Collaborator

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status
here

EmanueleCoradin pushed a commit to EmanueleCoradin/pytorch that referenced this pull request Mar 30, 2026
…ytorch#176162)

test_max_autotune.py:test_max_autotune_exhaustive() was using cuda template config heuristics, but it needed to use rocm template heuristics instead

Pull Request resolved: pytorch#176162
Approved by: https://github.com/jeffdaily
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ciflow/b200 ciflow/h100 ciflow/inductor ciflow/rocm-mi300 Trigger "default" config CI on ROCm MI300 ciflow/trunk Trigger trunk jobs on your pull request Merged module: inductor module: rocm AMD GPU support for Pytorch open source topic: not user facing topic category

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants