Always unspecialize float in OSS#138922
Always unspecialize float in OSS#138922bobrenjc93 wants to merge 83 commits intogh/bobrenjc93/90/basefrom
Conversation
[ghstack-poisoned]
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/138922
Note: Links to docs will display an error until the docs builds have been completed. ❌ 30 New Failures, 1 Unrelated FailureAs of commit 794f6a4 with merge base c3fbec7 ( NEW FAILURES - The following jobs have failed:
FLAKY - The following job failed but was likely due to flakiness present on trunk:
This comment was automatically generated by Dr. CI and updates every 15 minutes. |
|
@pytorchbot rebase -b main |
|
@pytorchbot started a rebase job onto refs/remotes/origin/main. Check the current status here |
|
Successfully rebased |
cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx chenyang78 kadeng chauhang amjames rec [ghstack-poisoned]
cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx chenyang78 kadeng chauhang amjames rec [ghstack-poisoned]
cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx chenyang78 kadeng chauhang amjames rec [ghstack-poisoned]
cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx chenyang78 kadeng chauhang amjames rec [ghstack-poisoned]
cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx chenyang78 kadeng chauhang amjames rec [ghstack-poisoned]
cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx chenyang78 kadeng chauhang amjames rec [ghstack-poisoned]
cc voznesenskym penguinwu EikanWang jgong5 Guobing-Chen XiaobingSuper zhuhaozhe blzheng wenzhe-nrv jiayisunx chenyang78 kadeng chauhang amjames rec [ghstack-poisoned]
|
@pytorchbot revert -m 'Sorry for reverting your change but there is some slow tests failing after this land' -c nosignal inductor/test_efficient_conv_bn_eval.py::EfficientConvBNEvalCpuTests::test_basic_cpu GH job link HUD commit link |
|
@pytorchbot successfully started a revert job. Check the current status here. |
This reverts commit 6d779d0. Reverted #138922 on behalf of https://github.com/huydhn due to Sorry for reverting your change but there is some slow tests failing after this land ([comment](#138922 (comment)))
|
@bobrenjc93 your PR has been successfully reverted. |
Thanks for the reply.
|
|
@pytorchbot merge |
Merge startedYour change will be merged once all checks pass (ETA 0-4 Hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team |
|
@leslie-fang-intel I'd recommend reading this design doc https://docs.google.com/document/d/1HswUSp9H6mg8Vg27mhRk8YzC9q_uf63b6wz-gwx65BQ/edit?pli=1&tab=t.0#heading=h.xvyiqp8tuje6
Previously we just inline the float. There was no tensorfication.
If for whatever reason we need to guard, we'll actually fall back specialization. See #140346 |
|
@bobrenjc93 I believe this might be causing perf regression on torchbench: https://hud.pytorch.org/benchmark/torchbench/inductor_no_cudagraphs?dashboard=torchinductor&startTime=Mon,%2018%20Nov%202024%2022:00:43%20GMT&stopTime=Mon,%2025%20Nov%202024%2022:00:43%20GMT&granularity=hour&mode=training&dtype=amp&deviceName=cuda%20(a100)&lBranch=gh/bobrenjc93/90/head&lCommit=e67d26992e8a473aec6ede13b8cb928208c3df34&rBranch=main&rCommit=1bdb92cbff6f8f6fd1842e8fd9969bca688eaea3 I also bisected on torchbench llama model (on A100 devvm): |
|
Discussed in Inductor group chat and the group agrees to revert this PR |
|
@pytorchbot revert -m "perf regression on torchbench" |
|
❌ 🤖 pytorchbot command failed: Try |
|
@pytorchbot revert -m "perf regression on torchbench" -c nosignal |
|
@pytorchbot successfully started a revert job. Check the current status here. |
|
@bobrenjc93 your PR has been successfully reverted. |
|
new benchmark runs as of 11/25 base: https://github.com/pytorch/pytorch/actions/runs/12023899968 |
|
Endorsing the NN module spec changes, although we will have to carefully check if this actually fixes all the benchmarks |
Stack from ghstack (oldest at bottom):
Fixes #107277
cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @ColinPeppler @amjames @desertfire @chauhang @aakhundov @rec