Color Twist - Optimizations in sse/avx2 for tensor implementation by r-abishek · Pull Request #81 · ROCm/rpp

r-abishek · 2021-11-23T21:58:49Z

Adds color_twist optimizations in sse and avx2 for host
Uses a tensor based implementation
Support for NCHW<->NHWC, U8/F16/F32/I8
Adds the corresponding unit tests and performance tests

@rrawther This PR adds the new color_twist tensor based implementation.

…into ar/opt_color_twist_host

rrawther · 2021-11-30T23:20:40Z

src/modules/cpu/host_fused_functions.hpp

            x3 = _mm_shuffle_ps(x3,x3, _MM_SHUFFLE(0,3,2,1));

-            // Un-normalize
+#if 0


please remove the commented code.

rrawther · 2021-11-30T23:34:41Z

src/modules/cpu/host_tensor_augmentations.hpp

+        dstPtrChannel = dstPtrImage;
+
+#if __AVX2__
+        Rpp32u alignedLength = bufferLength & ~23;


this logic only works for alignment which are power of 2. Else you need to do x = x/align *align.
applies to all such math

Done for all occurrences of 47 or 23 in tensor_augmentations hpp.

rrawther · 2021-11-30T23:40:25Z

src/modules/rppt_tensor_augmentations.cpp

-//                       RpptRoiType roiType,
-//                       rppHandle_t rppHandle)
-// {
-// #ifdef OCL_COMPILE


Why is this commented?

I'll remove all the OCL_COMPILE for all the functions developed in a separate PR along with one other common API level change.

rrawther

please address the review comments

rrawther

@paveltc to confirm all the tests are passing for this PR

paveltc · 2021-12-03T19:41:45Z

@rrawther @r-abishek There are potential issues with this PR. I have contacted Abishek regarding them. I will get the images that look wrong and share them via email.

…k/rpp into ar/opt_color_twist_host

rrawther · 2021-12-06T18:12:02Z

src/modules/cpu/host_tensor_augmentations.hpp

+                    __m128 p[4];
+                    rpp_simd_load(rpp_load12_f32pkd3_to_f32pln3, srcPtrTemp_ps, p);    // simd loads
+                    compute_color_twist_12_host(p[0], p[1], p[2], pColorTwistParams);    // color_twist adjustment
+                    rpp_simd_store(rpp_store12_f32pln3_to_f32pkd3, dstPtrTemp_ps, p);    // simd stores


We need to convert the pixels to fp16 and store in the SIMD code to avoid extra conversion at the end. Applies to similar code below

@rrawther I am unable to use the _mm256_loadu_ph() that directly loads half precision pixels since its under AVX512 instructions -> https://www.intel.com/content/www/us/en/docs/intrinsics-guide/index.html#text=_mm256_loadu_&ig_expand=7353,340,383,4325,4324

Or the 128 vector length _mm_loadu_ph() seems to be under "CPUID Flags: AVX512_FP16 + AVX512VL" too

rrawther · 2021-12-06T18:24:18Z

utilities/rpp-performancetests/HOST_NEW/Tensor_host_pln1.cpp

-            // roiTypeSrc = RpptRoiType::LTRB;
-            // roiTypeDst = RpptRoiType::LTRB;
+            // Uncomment to run test case with an xywhROI override
+            /*for (i = 0; i < images; i++)


Please avoid commented code for enabling/disabling features. Instead use #deifnes or command_line arguments

rrawther

@r-abishek @paveltc : Are all the tests pass using this PR?

r-abishek · 2021-12-06T19:24:19Z

@rrawther I've increased dst static allocation by 1 due to the writes. This should fix the issue. @paveltc Please let me know if all the issues are fixed.

paveltc · 2021-12-06T20:21:38Z

@rrawther @r-abishek I ran the tests and I don't see the issues.

r-abishek · 2021-12-06T20:28:44Z

@paveltc Thanks for verifying, @rrawther I have added the minor fix here - bb0252d

rrawther · 2021-12-06T20:47:28Z

@kiritigowda: this is ready to merge

r-abishek and others added 15 commits November 15, 2021 18:35

Tensor color_twist initial commit for U8

784505a

Merge branch 'rr/color_twist_new' of https://github.com/rrawther/rpp …

b573f2c

…into ar/opt_color_twist_host

Remove two mul instructions

91b4fb8

Add cast to ps

8d20b21

Add AVX2 version for U8PKD3 color_twist

76fae17

Fix all variants under U8

68bdd89

Common formatting change

803338e

Add support for f32

bc11b7d

Add f16 support

6bf57b9

Add i8 support

9f0a8e4

Minor build fix

4ee98c6

Add tensor color_twist unittests

506cb94

Add tensor color_twist performancetests

4a7e037

Fix codacy issue

c6dc50b

Fix codacy issue

b070543

rrawther reviewed Nov 30, 2021

View reviewed changes

rrawther requested changes Nov 30, 2021

View reviewed changes

r-abishek added 2 commits November 30, 2021 21:05

Remove commented code

bccc1b0

Change alignedLength computation in tensor_augmentations

d04faff

kiritigowda assigned paveltc Dec 2, 2021

kiritigowda added the enhancement New feature or request label Dec 2, 2021

Modify comment style

4f15c3f

rrawther approved these changes Dec 3, 2021

View reviewed changes

r-abishek added 2 commits December 3, 2021 15:27

Fix for BatchPD PLN3 color_twist

71af531

Merge branch 'ar/opt_color_twist_host' of https://github.com/r-abishe…

697ae33

…k/rpp into ar/opt_color_twist_host

kiritigowda requested a review from rrawther December 6, 2021 15:34

rrawther reviewed Dec 6, 2021

View reviewed changes

Increase static allocation

f17c828

Fix for squiggly lines

bb0252d

kiritigowda merged commit 10aaf5b into ROCm:master Dec 6, 2021

Conversation

r-abishek commented Nov 23, 2021

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

rrawther Nov 30, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

rrawther left a comment

Choose a reason for hiding this comment

Uh oh!

rrawther left a comment

Choose a reason for hiding this comment

Uh oh!

paveltc commented Dec 3, 2021

Uh oh!

rrawther Dec 6, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

rrawther left a comment

Choose a reason for hiding this comment

Uh oh!

r-abishek commented Dec 6, 2021

Uh oh!

paveltc commented Dec 6, 2021

Uh oh!

r-abishek commented Dec 6, 2021

Uh oh!

rrawther commented Dec 6, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

rrawther Nov 30, 2021 •

edited

Loading

rrawther Dec 6, 2021 •

edited

Loading