T-API: changed optimal vector width for Intel by ilya-lavrenov · Pull Request #2893 · opencv/opencv

ilya-lavrenov · 2014-06-24T09:40:38Z

Description:

Changed the optimal vector width for the Intel platform, since it shows better performance in most cases.

Performance report:
http://ocl.itseez.com/intel/export/perf/pr/2893/report/

check_regression=OCL_AbsDiff:OCL_Add:OCL_Sub:OCL_Mul:OCL_Div:OCL_Bitwise:OCL_Compare:OCL_Min:OCL_Max:OCL_Flip:OCL_Repeat:OCL_AbsDiff:OCL_Sum:OCL_Count:OCL_Norm:OCL_Mean:_OCL_CalcHist*
test_filter=OCL_AbsDiff:OCL_Add:OCL_Sub:OCL_Mul:OCL_Div:OCL_Bitwise:OCL_Compare:OCL_Min:OCL_Max:OCL_Flip:OCL_Repeat:OCL_AbsDiff:OCL_Sum:OCL_Count:OCL_Norm:OCL_Mean:_OCL_CalcHist*
test_modules=core,imgproc
build_examples=OFF

ilya-lavrenov · 2014-06-24T11:00:18Z

@mostafahagog or @abatushi or @myshevts or @mletavin
What do you think about this change? In most cases it shows better performance. There is a slight performance degradation on some tests (even on 32f, although the patch affects only 8u), and I believe those are outliers.

mostafahagog · 2014-06-27T01:41:33Z

Is this the one that gets the best results? How about
int vectorWidthsIntel[] = { 8, 8, 4, 4, 1, 1, 1, -1 }; ?
or
int vectorWidthsIntel[] = { 8, 8, 8, 8, 1, 1, 1, -1 };?

ilya-lavrenov · 2014-06-27T07:07:11Z

I tried 8 as the vector width for uchar, and it performs worse compared to 4. My observations convinced me that a 32-bit vector is optimal in most cases.
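To make the "32 bits is optimal" observation concrete, here is a minimal sketch of how per-type vector widths follow from a fixed 32-bit (4-byte) budget per work item. The helper `widthForBudget` is hypothetical and is not part of OpenCV. The sketch also assumes the arrays quoted above are indexed by OpenCV depth (8U, 8S, 16U, 16S, 32S, 32F, 64F, user type). The thread does not state the exact values that were finally merged.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical helper (not an OpenCV function): how many elements of the
// given size fit into a fixed byte budget per work item. Elements that are
// as large as the budget, or larger, are processed one at a time.
constexpr int widthForBudget(std::size_t elemSizeBytes, std::size_t budgetBytes = 4)
{
    return elemSizeBytes >= budgetBytes
               ? 1
               : static_cast<int>(budgetBytes / elemSizeBytes);
}

// Assumption: entries are ordered by OpenCV depth, as in the arrays quoted
// in the thread; the trailing -1 marks the user-defined type slot.
constexpr int kWidths32Bit[] = {
    widthForBudget(1), widthForBudget(1),  // 8U, 8S   -> 4
    widthForBudget(2), widthForBudget(2),  // 16U, 16S -> 2
    widthForBudget(4), widthForBudget(4),  // 32S, 32F -> 1
    widthForBudget(8),                     // 64F      -> 1
    -1
};
```

Under this rule uchar gets a width of 4, which matches the value reported above as faster than 8.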

Daniil-Osokin · 2014-07-11T05:58:10Z

@mostafahagog Hi, please check this again.

krodyush · 2014-07-11T11:37:17Z

@ilya-lavrenov I see that you don't use vector loads for U8 data (vload4, vload8, vload16). That could be the reason why 16 and 8 are not the optimal vector sizes and 4 shows better performance for 8U. Could you try using vloadn/vstoren operations for uchar instead of tuning the vector size?

ilya-lavrenov · 2014-07-13T14:51:06Z

> could you try to use vloadn/vstoren operations for uchar instead of tuning vector size?

The result: uchar4 is better than uchar16 (see #2969).
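For readers unfamiliar with the `vloadn`/`vstoren` built-ins discussed here, the following is an illustrative sketch only, not the kernel from this PR or from #2969. It keeps a small OpenCL C kernel as a C++ raw string, the usual way to carry kernel source in host code. The kernel name `add_u8_vec4` and its parameters are hypothetical. Building and running it requires an OpenCL runtime, which is outside this sketch.

```cpp
#include <cassert>
#include <string>

// Illustrative OpenCL C source (hypothetical kernel, not taken from OpenCV).
// Each work item handles one group of four uchar values.
static const std::string kAddU8Vec4Source = R"CLC(
__kernel void add_u8_vec4(__global const uchar* a,
                          __global const uchar* b,
                          __global uchar* dst,
                          int n4)            // number of 4-element groups
{
    int i = get_global_id(0);
    if (i < n4)
    {
        uchar4 va = vload4(i, a);            // reads a[4*i] .. a[4*i + 3]
        uchar4 vb = vload4(i, b);            // reads b[4*i] .. b[4*i + 3]
        vstore4(add_sat(va, vb), i, dst);    // writes dst[4*i] .. dst[4*i + 3]
    }
}
)CLC";
```

Switching to `vload8`/`vstore8` or `vload16`/`vstore16` changes only the vector type and the group size, which is what makes a width comparison like the one reported above easy to run.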

SergeySivolgin · 2014-07-15T13:49:27Z

@ilya-lavrenov Ilya, could you please rerun the performance tests for this pull request to avoid the influence of other PRs? Thanks.

ilya-lavrenov · 2014-07-15T13:53:33Z

@SergeySivolgin, the process has been initiated.

vpisarev · 2014-07-25T11:50:38Z

@krodyush, can you please review the pull request? It has been open for quite a long time already.

krodyush · 2014-07-25T12:33:35Z

@vpisarev, I don't see a speedup according to the last measurement, so I don't see a reason for such changes.

ilya-lavrenov · 2014-07-25T13:09:08Z

@krodyush, the latest measurements are still in progress (the results you see were made with another driver).

krodyush · 2014-07-31T11:06:40Z

@ilya-lavrenov I looked into the last 2 perf reports, from 07.25 and 07.29, and see big deviations in the results. It looks like the perf test is not reliable enough. Could you rerun it several times to be sure that we see a real improvement and not just noise in the measurement?
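One way to make the "real improvement, not noise" question checkable is sketched below. It is a hypothetical acceptance rule, not what the OpenCV perf infrastructure does: a gain counts only if the median speedup across repeated runs is larger than the worst relative spread seen in either set of runs. All function names are illustrative.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Median of a non-empty sample; the vector is copied and sorted locally.
double median(std::vector<double> v)
{
    std::sort(v.begin(), v.end());
    const std::size_t n = v.size();
    return n % 2 ? v[n / 2] : 0.5 * (v[n / 2 - 1] + v[n / 2]);
}

// Relative spread of a non-empty sample: (max - min) / median.
double relativeSpread(const std::vector<double>& v)
{
    const auto mm = std::minmax_element(v.begin(), v.end());
    return (*mm.second - *mm.first) / median(v);
}

// Hypothetical rule: accept the gain only when the relative reduction of the
// median run time is larger than the larger of the two relative spreads.
bool gainExceedsNoise(const std::vector<double>& baselineMs,
                      const std::vector<double>& patchedMs)
{
    const double base = median(baselineMs);
    const double gain = (base - median(patchedMs)) / base;
    const double noise = std::max(relativeSpread(baselineMs),
                                  relativeSpread(patchedMs));
    return gain > noise;
}
```

The rule is deliberately conservative. With only a handful of reruns per configuration, a max-minus-min spread is a crude but honest noise estimate.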

ilya-lavrenov · 2014-08-25T14:21:57Z

@krodyush, see the latest performance report.

krodyush · 2014-08-25T18:53:16Z

What was changed?

ilya-lavrenov · 2014-08-25T18:58:26Z

Nothing. I've rerun the perf report generation, and we can see stable results that show a performance gain.

krodyush · 2014-08-26T06:43:03Z

Then could you make several perf reports, so that we can see the stability of the improvements?

ElenaGvozdeva · 2014-09-03T08:58:24Z

@ilya-lavrenov please resolve the merge conflict.

ilya-lavrenov · 2014-09-03T09:06:14Z

@ElenaGvozdeva, done.

ilya-lavrenov · 2014-09-04T08:52:14Z

@krodyush, I've made 3 performance reports, and each of them shows a stable performance gain for uchar. So, please review this PR once again.

krodyush · 2014-09-09T08:04:53Z

@ilya-lavrenov what was the reason for reducing the number of tests from ~3000 to ~900?

ilya-lavrenov · 2014-09-09T08:24:23Z

@krodyush, Sergey S. asked me to do that, because these are the functions mostly affected by the patch.

krodyush · 2014-09-09T08:50:30Z

👍

ilya-lavrenov assigned mostafahagog Jun 24, 2014

ilya-lavrenov mentioned this pull request Jul 1, 2014

T-API: changed base types for cv::memopTypeToStr #2921

Merged

vpisarev assigned krodyush and unassigned mostafahagog Jul 25, 2014

ilya-lavrenov force-pushed the tapi_vector_width_intel branch from 0ef70f6 to 970de35 Compare August 25, 2014 07:30

ilya-lavrenov force-pushed the tapi_vector_width_intel branch 2 times, most recently from 6c5dec2 to 28cd305 Compare September 3, 2014 09:05

ilya-lavrenov force-pushed the tapi_vector_width_intel branch from 28cd305 to 1f598b5 Compare September 3, 2014 12:30

changed optimal vector width for Intel

98e7d4c

ilya-lavrenov force-pushed the tapi_vector_width_intel branch from 1f598b5 to 98e7d4c Compare September 4, 2014 07:59

opencv-pushbot merged commit 98e7d4c into opencv:master Sep 18, 2014

vpisarev added a commit that referenced this pull request Sep 18, 2014

Merge pull request #2893 from ilya-lavrenov:tapi_vector_width_intel

06e55dd

ilya-lavrenov deleted the tapi_vector_width_intel branch September 18, 2014 12:10

asmorkalov mentioned this pull request Jun 18, 2014

Several fixes android related fixes #759

Merged

8 participants