parallel norm operation for ATen on CPU #10535
Conversation
cc @colesbury, as I believe the TensorIterator reduction work can also be used for norm computation
@xhzhao TensorIterator with reduction is not yet merged into master, but it will allow more generic reductions on contiguous and non-contiguous tensors, possibly over multiple dimensions. I mentioned the TensorIterator to Sam so that we keep the
Maybe we can review this PR first, and update the norm operation after the TensorIterator PR is merged.
```cpp
static scalar_t norm_calc(const scalar_t* data, int64_t n, int64_t stride, float pval) {
  scalar_t result = 0.0;
```
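For context, a self-contained sketch of what a strided p-norm kernel along these lines computes: result = (Σᵢ |data[i·stride]|^p)^(1/p). The body below is my own illustrative completion, not the PR's actual implementation, and it does not handle special cases such as p = 0 or p = ∞.

```cpp
#include <cmath>
#include <cstdint>

// Illustrative strided p-norm reduction (hypothetical completion of the
// excerpted signature): accumulates |x|^p over n strided elements, then
// takes the 1/p root.
template <typename scalar_t>
static scalar_t norm_calc(const scalar_t* data, int64_t n, int64_t stride, float pval) {
  scalar_t result = 0.0;
  for (int64_t i = 0; i < n; ++i) {
    result += std::pow(std::abs(data[i * stride]), static_cast<scalar_t>(pval));
  }
  return std::pow(result, static_cast<scalar_t>(1.0f / pval));
}
```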
Merged in #11565
Optimize the norm operation for the ATen CPU path.
norm is a very heavy operation in RNN-related workloads; see the OpenNMT-py example.
Our profiling shows that norm takes about 8% of OpenNMT-py training time, which is not acceptable.
Currently, the code path from the TH module runs sequentially on CPU, see link.
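The general idea of parallelizing such a reduction can be sketched as follows: split the input into per-thread chunks, accumulate a partial sum of |x|^p per chunk, and combine. This is a minimal illustration using `std::thread`; the actual PR uses ATen/TH's internal threading, not shown here.

```cpp
#include <algorithm>
#include <cmath>
#include <cstdint>
#include <thread>
#include <vector>

// Hedged sketch of a parallel p-norm: each thread reduces its own chunk
// into a private slot (no data race), then partials are combined serially.
double parallel_norm(const std::vector<double>& data, double p, int num_threads = 4) {
  int64_t n = static_cast<int64_t>(data.size());
  std::vector<double> partial(num_threads, 0.0);
  std::vector<std::thread> workers;
  int64_t chunk = (n + num_threads - 1) / num_threads;
  for (int t = 0; t < num_threads; ++t) {
    workers.emplace_back([&, t] {
      int64_t begin = static_cast<int64_t>(t) * chunk;
      int64_t end = std::min(n, begin + chunk);
      double acc = 0.0;
      for (int64_t i = begin; i < end; ++i)
        acc += std::pow(std::abs(data[i]), p);
      partial[t] = acc;  // each thread writes only its own slot
    });
  }
  for (auto& w : workers) w.join();
  double total = 0.0;
  for (double s : partial) total += s;
  return std::pow(total, 1.0 / p);
}
```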
Norm performance comparison before and after our optimization: