[CP][BE] Cosmetic refactors for CP code base#163115
[CP][BE] Cosmetic refactors for CP code base#163115fegin wants to merge 2 commits intogh/fegin/319/basefrom
Conversation
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/163115
Note: Links to docs will display an error until the docs builds have been completed. ✅ No FailuresAs of commit 15134ea with merge base 7a0f933 ( This comment was automatically generated by Dr. CI and updates every 15 minutes. |
|
Curious for CP, could also explicit variants / args be added to F.sdpa (e.g. a new backend and a new dict arg |
CP currently only supports what F.sdpa and FlexAttention, we basically just forward the arguments to the underlying SDPA or FlexAttention. Also there is a backend selection API for SDPA. Can you describe more your requirement if I misunderstood your request? @vadimkantorov |
|
@pytorchbot merge |
Merge startedYour change will be merged once all checks pass (ETA 0-4 Hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team |
Well, I was proposing to add an explicit way of invoking CP-SDPA via |
Pull Request resolved: #163131 Approved by: https://github.com/tianyu-l, https://github.com/XilunWu ghstack dependencies: #162539, #162540, #162541, #163115
Summary: This PR is extracted from pytorch#162542, to make the original PR easier to review. This PR only contains cosmetic changes. Pull Request resolved: pytorch#163115 Approved by: https://github.com/tianyu-l ghstack dependencies: pytorch#162539, pytorch#162540, pytorch#162541
Pull Request resolved: pytorch#163131 Approved by: https://github.com/tianyu-l, https://github.com/XilunWu ghstack dependencies: pytorch#162539, pytorch#162540, pytorch#162541, pytorch#163115
We should only unsqueeze if necessary. Fix pytorch#162743 Pull Request resolved: pytorch#163231 Approved by: https://github.com/eqy ghstack dependencies: pytorch#162539, pytorch#162540, pytorch#162541, pytorch#163115, pytorch#163131
Summary: This PR is extracted from pytorch#162542, to make the original PR easier to review. This PR only contains cosmetic changes. Pull Request resolved: pytorch#163115 Approved by: https://github.com/tianyu-l ghstack dependencies: pytorch#162539, pytorch#162540, pytorch#162541
Pull Request resolved: pytorch#163131 Approved by: https://github.com/tianyu-l, https://github.com/XilunWu ghstack dependencies: pytorch#162539, pytorch#162540, pytorch#162541, pytorch#163115
We should only unsqueeze if necessary. Fix pytorch#162743 Pull Request resolved: pytorch#163231 Approved by: https://github.com/eqy ghstack dependencies: pytorch#162539, pytorch#162540, pytorch#162541, pytorch#163115, pytorch#163131
Summary: This PR is extracted from pytorch#162542, to make the original PR easier to review. This PR only contains cosmetic changes. Pull Request resolved: pytorch#163115 Approved by: https://github.com/tianyu-l ghstack dependencies: pytorch#162539, pytorch#162540, pytorch#162541
Pull Request resolved: pytorch#163131 Approved by: https://github.com/tianyu-l, https://github.com/XilunWu ghstack dependencies: pytorch#162539, pytorch#162540, pytorch#162541, pytorch#163115
We should only unsqueeze if necessary. Fix pytorch#162743 Pull Request resolved: pytorch#163231 Approved by: https://github.com/eqy ghstack dependencies: pytorch#162539, pytorch#162540, pytorch#162541, pytorch#163115, pytorch#163131
Stack from ghstack (oldest at bottom):
Summary:
This PR is extracted from #162542, to make the original PR
easier to review. This PR only contains cosmetic changes.
cc @H-Huang @awgu @wanchaol @fduwjj @wz337 @wconstab @d4l3k @pragupta @ezyang @msaroufim @dcci