🐛 Describe the bug
https://github.com/pytorch/pytorch/pull/146992/files#r1953070064
during runtime estimation, we use this reverse map to lookup ops contained in ExternKernelSchedulerNode
|
# TODO(xmfan): reuse: an existing mapping for this if it exists, or formalize this into ir.py:ExternKernel |
|
kernel_name_to_op = { |
|
"extern_kernels.convolution": torch.ops.aten.convolution, |
|
"extern_kernels.mm": torch.ops.aten.mm, |
|
"extern_kernels.bmm": torch.ops.aten.bmm, |
|
"extern_kernels.addmm": torch.ops.aten.addmm, |
|
} |
i've been wanting to find an existing registration system, per @eellison one alternative can be FlopCounter registrations
Versions
main
cc @chauhang @penguinwu @voznesenskym @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @amjames @aakhundov
🐛 Describe the bug
https://github.com/pytorch/pytorch/pull/146992/files#r1953070064
during runtime estimation, we use this reverse map to lookup ops contained in ExternKernelSchedulerNode
pytorch/torch/_inductor/scheduler.py
Lines 924 to 930 in b0553ce
i've been wanting to find an existing registration system, per @eellison one alternative can be FlopCounter registrations
Versions
main
cc @chauhang @penguinwu @voznesenskym @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @amjames @aakhundov