Commit 40a02e2
Make out ops c10-full (with hacky-wrapper) (#48912)
Summary:
Pull Request resolved: #48912
ghstack-source-id: 118619234
(Note: this ignores all push blocking failures!)
Test Plan:
Benchmark:
---
Old (i.e. codegenerated unboxing wrapper + no hacky_wrapper):
```
<torch.utils.benchmark.utils.valgrind_wrapper.timer_interface.CallgrindStats object at 0x7f64d03ebcd0>
torch.absolute(t, out=o)
setup:
t = torch.empty([1])
o = torch.empty([1])
                   All    Noisy symbols removed
Instructions:   657204                   634396
Baseline:         4192                     3786
100 runs per measurement, 1 thread
```
New (i.e. templated unboxing wrapper + hacky_wrapper):
```
<torch.utils.benchmark.utils.valgrind_wrapper.timer_interface.CallgrindStats object at 0x7fa7de211cd0>
torch.absolute(t, out=o)
setup:
t = torch.empty([1])
o = torch.empty([1])
                   All    Noisy symbols removed
Instructions:   658160                   633996
Baseline:         4210                     3786
100 runs per measurement, 1 thread
```
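To put the two measurements above side by side, here is a minimal sketch that computes the change in instruction counts. The counts are copied from the benchmark output; the dictionary layout and the `delta` helper are illustrative names only, not part of the PR or of `torch.utils.benchmark`.

```python
# Instruction counts copied from the "Old" and "New" Callgrind outputs above.
# "all" and "noisy_removed" mirror the two columns of the report.
counts = {
    "all": {"old": 657204, "new": 658160},
    "noisy_removed": {"old": 634396, "new": 633996},
}


def delta(old, new):
    """Return the absolute and the relative (percent) change from old to new."""
    diff = new - old
    return diff, 100.0 * diff / old


# Hypothetical helper output: one (difference, percent) pair per column.
results = {name: delta(c["old"], c["new"]) for name, c in counts.items()}

for name, (diff, pct) in results.items():
    print(f"{name}: {diff:+d} instructions ({pct:+.3f}%)")
```

Under these numbers the two variants differ by well under 1% in either column, and the sign of the difference flips between the columns.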
Reviewed By: bhosmer
Differential Revision: D25363335
fbshipit-source-id: ab9c122491e4209a49254dad0f7b3adb677b2c53
1 parent 1133428 commit 40a02e2
2 files changed
Lines changed: 373 additions & 23 deletions
File tree
- aten/src/ATen
- core/op_registration
- native
Lines changed: 13 additions & 7 deletions