[AMD][gfx1100] test_decompose_mem_bound_mm.py tolerance increase#165625
[AMD][gfx1100] test_decompose_mem_bound_mm.py tolerance increase#165625k-artem wants to merge 5 commits intopytorch:mainfrom
Conversation
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/165625
Note: Links to docs will display an error until the docs builds have been completed. ❌ 2 New FailuresAs of commit ccba83d with merge base 1009790 ( NEW FAILURES - The following jobs have failed:
This comment was automatically generated by Dr. CI and updates every 15 minutes. |
c1a7876 to
62cd2ac
Compare
|
@pytorchbot label "topic: not user facing" |
|
@pytorchbot label "release notes: rocm" |
|
Didn't find following labels among repository labels: label:ciflow/rocm |
|
@pytorchbot label "ciflow/rocm" |
|
To add these label(s) (ciflow/rocm) to the PR, please first approve the workflows that are awaiting approval (scroll to the bottom of this page). This helps ensure we don't trigger CI on this PR until it is actually authorized to do so. Please ping one of the reviewers if you do not have access to approve and run workflows. |
| def setup_tolerance(self, rtol=None, atol=None): | ||
| if rtol is None: | ||
| rtol = self.rtol | ||
| if atol is None: | ||
| atol = self.rtol |
There was a problem hiding this comment.
How does this work? rtol and atol local to the function. Modifying them here should not affect call sites like line 88 below. Right?
There was a problem hiding this comment.
- it works because
setattr(self, member, value)inside decorator set correct values. - One more isssue found
atol = self.rtolinstead ofatol = self.atol
I keep setup_tolerance (which actually can be removed) in case we need to update tolerance value at calls of compare_* functions.
Please let me know if I need to cp in back to ROCm fork or we will wait and get it via sync w/ upstream
|
/easycla |
|
\easycla |
This should allow us to move gfx1100 workflow to a lower frequency and also allow it to be triggered on PRs via a dedicated label, for any PRs that target Navi fixes such as [this](#165630) or [this](#165625). Pull Request resolved: #165699 Approved by: https://github.com/jeffdaily
|
@pytorchbot merge -f "lint failure unrelated; CLA signed but not reflecting on PR; trying to see if it is updated internally" |
Merge startedYour change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team |
Merge failedReason: 1 mandatory check(s) failed. The first few are: Dig deeper by viewing the failures on hud |
|
\easycla |
This should allow us to move gfx1100 workflow to a lower frequency and also allow it to be triggered on PRs via a dedicated label, for any PRs that target Navi fixes such as [this](pytorch#165630) or [this](pytorch#165625). Pull Request resolved: pytorch#165699 Approved by: https://github.com/jeffdaily
|
\easycla |
|
/easycla |
|
@pytorchbot rebase |
|
@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here |
…navi3x(gfx11x) (cherry picked from commit 03c7da0) Signed-off-by: Artem Kuzmitckii <artem.kuzmitckii@amd.com>
Bug introduced by ROCm@03c7da0 (cherry picked from commit bbd0112) Signed-off-by: Artem Kuzmitckii <artem.kuzmitckii@amd.com>
Bug introduced by ROCm@03c7da0 Signed-off-by: Artem Kuzmitckii <artem.kuzmitckii@amd.com>
|
Successfully rebased |
df7c4c4 to
861ca48
Compare
|
@pytorchbot merge -f "navi3 test changes only, lint is passing. too many failures still to make sense of the signal" |
Merge startedYour change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team |
This should allow us to move gfx1100 workflow to a lower frequency and also allow it to be triggered on PRs via a dedicated label, for any PRs that target Navi fixes such as [this](pytorch#165630) or [this](pytorch#165625). Pull Request resolved: pytorch#165699 Approved by: https://github.com/jeffdaily
…orch#165625) test_decompose_mem_bound_mm.py tolerance increase for navi3x(gfx11x) (cherry picked from commit 03c7da0) from Fixes for CI HUD for gfx1100 Signed-off-by: Artem Kuzmitckii <artem.kuzmitckii@amd.com> Pull Request resolved: pytorch#165625 Approved by: https://github.com/jeffdaily Co-authored-by: iupaikov-amd <Iurii.Paikov@amd.com> Co-authored-by: Dmitry Nikolaev <139769634+dnikolaev-amd@users.noreply.github.com> Co-authored-by: Jeff Daily <jeff.daily@amd.com>
test_decompose_mem_bound_mm.py tolerance increase for navi3x(gfx11x)
(cherry picked from commit 03c7da0) from
Fixes for CI HUD for gfx1100
Signed-off-by: Artem Kuzmitckii artem.kuzmitckii@amd.com
cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov @coconutruben