[ROCm] Indexing perf optimization via Unroll/WideFetch/IdxReuse/OneDupOpt#146448
amd-hhashemi wants to merge 2 commits into pytorch:main from amd-hhashemi:main
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/146448
Note: Links to docs will display an error until the docs builds have been completed.
✅ No Failures
As of commit ebfe443 with merge base 98c8927.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
Closed #146196 because the commit author was incorrect. Opened this one with that fixed.
…pOpt (#1897) cherry-pick of pytorch#146448 Co-author: @amd-hhashemi
Looks like this PR hasn't been updated in a while so we're going to go ahead and mark this as
Fixes #ISSUE_NUMBER
cc @jeffdaily @sunway513 @jithunnair-amd @pruthvistony @ROCmSupport @dllehr-amd @jataylo @hongxiayang @naromero77amd