Open
RooseweltMcW (Collaborator) commented on Aug 11, 2025
- Generate U8 input and golden outputs for 2D
- Regenerate the 4D F32 input file with values in the range 0 to 10
- Modify the test suite to support both F32 and U8 bit depths
- Generate U8 input files with values in the range 0 to 255 (2D, 4D)
- Generate golden outputs for U8/4D F32
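As a rough illustration of the input ranges listed above, the sketch below fills an F32 buffer with values in 0 to 10 and a U8 buffer with values in 0 to 255. The function names `generate_f32_input` and `generate_u8_input`, the seed parameter, and the use of `<random>` are assumptions for illustration; the PR does not show how the input files were actually produced.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <random>
#include <vector>

// Hypothetical helper: F32 test input drawn uniformly from [0, 10).
std::vector<float> generate_f32_input(std::size_t count, unsigned seed)
{
    assert(count > 0);
    std::mt19937 rng(seed);
    std::uniform_real_distribution<float> dist(0.0f, 10.0f);
    std::vector<float> values(count);
    for (float &v : values)
        v = dist(rng);
    return values;
}

// Hypothetical helper: U8 test input drawn uniformly from 0..255.
// The distribution is instantiated on int because the standard does not
// allow std::uniform_int_distribution over 8-bit character types.
std::vector<std::uint8_t> generate_u8_input(std::size_t count, unsigned seed)
{
    assert(count > 0);
    std::mt19937 rng(seed);
    std::uniform_int_distribution<int> dist(0, 255);
    std::vector<std::uint8_t> values(count);
    for (std::uint8_t &v : values)
        v = static_cast<std::uint8_t>(dist(rng));
    return values;
}
```

A fixed seed keeps the generated inputs reproducible, which matters when golden outputs are stored alongside them.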
…ated and QA generated
…n for U8 3Dim and golden output for U8 3Dim
| } | ||
| case 4: | ||
| { | ||
| numValues = 18630; |
| int meanStdDevOutputStride = 0, axisMaskStride = 0; | ||
| if(isMeanStd) | ||
| meanStdDevOutputStride = goldenOutputLength / (2 * sizeof(Rpp32f)); | ||
| meanStdDevOutputStride = goldenOutputLength / 2; |
| int sampleLength = bufferLength / batchSize; | ||
| int fileMatch = 0; | ||
| for(int i = 0; i < batchSize; i++) | ||
| for (int i = 0; i < batchSize; i++) |
Owner
remove the space you added
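For context on the loop in the diff above, here is a minimal sketch of a per-sample comparison against golden output. `count_matching_samples` and its `cutoff` parameter are hypothetical, and the absolute-difference check is an assumption; only the names `batchSize`, `sampleLength`, and `fileMatch` come from the diff.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>

// Hypothetical helper: counts how many samples in a batch match the golden
// data element by element within an absolute tolerance.
int count_matching_samples(const float *output, const float *golden,
                           int batchSize, int sampleLength, float cutoff)
{
    assert(output != nullptr && golden != nullptr);
    assert(batchSize > 0 && sampleLength > 0);

    int fileMatch = 0;
    for(int i = 0; i < batchSize; i++)
    {
        const float *out = output + static_cast<std::size_t>(i) * sampleLength;
        const float *ref = golden + static_cast<std::size_t>(i) * sampleLength;

        bool sampleMatches = true;
        for(int j = 0; j < sampleLength; j++)
        {
            if(std::fabs(out[j] - ref[j]) > cutoff)
            {
                sampleMatches = false;
                break;
            }
        }
        if(sampleMatches)
            fileMatch++;
    }
    return fileMatch;
}
```

The batch would count as passing when the returned value equals `batchSize`.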
| } | ||
| std::string bitDepthStr; | ||
| switch (bitDepth) |
Collaborator
Author
Removed the switch case because it is being used in the Main function
| funcName = funcName + "_" + bitDepthStr; | ||
| std::string status = funcName + ": "; | ||
| cout << std::endl << "Results for Test case: " << funcName << std::endl; | ||
| std::cout << "\nResults for Test case: " << funcName << std::endl; |
| } | ||
| if(DEBUG_MODE) | ||
| if(DEBUG_MODE && bitDepth == 2) |
Owner
it should just be DEBUG_MODE
| string func = funcName; | ||
| if (axisMaskCase) | ||
| std::string bitdepthStr; |
| inputSecond = static_cast<Rpp32f *>(calloc(iBufferSizeSecondInBytes, 1)); | ||
| output = static_cast<Rpp32f *>(calloc(oBufferSizeInBytes, 1)); | ||
| // allocate memory for input / output | ||
| // Host pointers (pinned memory) |
Owner
Don't allocate pinned memory for everything; input and output have to be HOST memory only.
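A minimal sketch of the host-only allocation the review comment asks for, mirroring the `calloc` calls in the diff. `alloc_host_buffer` and `FreeDeleter` are hypothetical helpers, not part of the test suite; pinned allocation (for example through HIP's `hipHostMalloc`) would be a separate path and is deliberately not used here.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdlib>
#include <memory>

// Hypothetical deleter so that calloc'd memory is released with free().
struct FreeDeleter
{
    void operator()(void *ptr) const { std::free(ptr); }
};

// Hypothetical helper: zero-initialized pageable host buffer.
// sizeInBytes is expected to be a whole number of T elements.
template <typename T>
std::unique_ptr<T[], FreeDeleter> alloc_host_buffer(std::size_t sizeInBytes)
{
    assert(sizeInBytes > 0 && sizeInBytes % sizeof(T) == 0);
    T *raw = static_cast<T *>(std::calloc(sizeInBytes, 1));
    return std::unique_ptr<T[], FreeDeleter>(raw);
}
```

Wrapping the pointer in `std::unique_ptr` is a design choice of this sketch; the diff itself uses raw pointers, which would need a matching `free` call.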
| } | ||
| // Convert inputs to corresponding bit depth specified by user | ||
| convert_input_bitdepth(inputF32, inputF32Second, input, inputSecond, bitDepth, iBufferSize, iBufferSizeSecond, iBufferSizeInBytes, iBufferSizeSecondInBytes, srcDescriptorPtrND, srcDescriptorPtrNDSecond, testCase); |
Owner
Please revert these changes; they will slow down the process.
| for(Rpp32u k = 0; k < length[level]; k++) | ||
| { | ||
| *dstPtrTemp = (((T2)*srcPtrTemp - meanPtr[idx]) * multiplierPtr[idx]) + shift; | ||
| if constexpr (std::is_same<T2, Rpp8u>::value) |
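The `if constexpr` branch in the diff above is cut off, so the following is only a sketch of one way a normalize step could specialize its output for U8. It assumes the U8 path rounds and clamps to 0..255, which the source does not confirm; `normalize_value` and `normalize_buffer` are hypothetical names, and `std::uint8_t` stands in for `Rpp8u`.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <cstdint>
#include <type_traits>

// Hypothetical helper: (src - mean) * multiplier + shift, with the result
// rounded and clamped only when the output type is 8-bit unsigned.
template <typename TIn, typename TOut>
TOut normalize_value(TIn src, float mean, float multiplier, float shift)
{
    float value = (static_cast<float>(src) - mean) * multiplier + shift;
    if constexpr (std::is_same<TOut, std::uint8_t>::value)
        value = std::clamp(std::round(value), 0.0f, 255.0f);  // assumed U8 handling
    return static_cast<TOut>(value);
}

// Hypothetical helper: applies normalize_value to a contiguous buffer.
template <typename TIn, typename TOut>
void normalize_buffer(const TIn *src, TOut *dst, std::size_t count,
                      float mean, float multiplier, float shift)
{
    assert(src != nullptr && dst != nullptr);
    for(std::size_t i = 0; i < count; i++)
        dst[i] = normalize_value<TIn, TOut>(src[i], mean, multiplier, shift);
}
```

Because the branch is resolved at compile time, the F32 instantiation carries no rounding or clamping cost.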
Misc test suite updates
* Initial changes for tensor sum for PLN1 and PKD3 variants U8
* Make changes for rest of the bit depths and layouts
* Remove unused pack functions
* Adjust comments
* Initial cleanup
* Further cleanup
* Remove unused functions
* Address copilot comments
* Further standardize comments
* Standardize comments spacing in functions affected
---------
Co-authored-by: Srihari-mcw <srihari@multicorewareinc.com>
Co-authored-by: Kiriti Gowda <kiritigowda@gmail.com>
* removed numbers for bitdepths and output toggle
* Add layout enum and replicate changes to HOST testsuite
* Fix copilot review comments
* resolve review comments
* Apply suggestion from @Copilot
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
* minor bugfix
* minor bugfix
* minor fix
---------
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Co-authored-by: Kiriti Gowda <kiritigowda@gmail.com>
HazarathKumarM pushed a commit to RooseweltMcW/rpp that referenced this pull request on Dec 22, 2025
HazarathKumarM added a commit that referenced this pull request on Feb 13, 2026
* Travis CI - key error fix
* Fix Bug in ColorTwist (#6) (#8) (#9)
* Added golden outputs and resolved HOST backend
* Updated bin files for median filter and resize crop mirror
* Updated bin files
* Updated bin files for the next set of kernel F32 QA
* Updated bin files for jpeg_compression_distortion
* Fixed resize QA failures
* Fix for Resize bilinear F32 QA HOST and HIP
* Fix for lens correction QA f32 for HOST and HIP for 1e-4 precision
* Fixed HIP rcm QA
* updates for warp Affine F32 QA
* Fix for RCM QA match for U8 and F32 updates AVX
* Fix for lens correction AVX
* Removed space
* Fixed warp affine for every other variant with the updated changes
* Add fixes to match precision in quantization
* Fix Precision mismatches
* Update default cutoff to 1e-5 and specialized cutoff to 1e-4
* F32 QA Fix
* Made Quality percentage as arg from testsuite
* Resolved copilot comments
* Resolved the copilot comments
* Resolved Codex comments
* HOST and HIP - pinned buffers for respective API (ROCm#628)
* Removed memcpy and used hipHostMalloc for allocation : blend
* Removed memcpy and used hipHostMalloc for allocation : brightness
* Removed memcpy and used hipHostMalloc for allocation : color cast
* Removed memcpy and used hipHostMalloc for allocation : color twist
* Removed memcpy and used hipHostMalloc for allocation : contrast
* Removed memcpy and used hipHostMalloc for allocation : crop mirror normalize
* Removed memcpy and used hipHostMalloc for allocation : Exposure
* Removed memcpy and used hipHostMalloc for allocation : Gamma correction
* Removed memcpy and used hipHostMalloc for allocation : gaussian filter
* Removed memcpy and used hipHostMalloc for allocation : Noise
* Removed memcpy and used hipHostMalloc for allocation : Non linear blend
* Removed memcpy and used hipHostMalloc for allocation : Resize mirror normalize
* Removed memcpy and used hipHostMalloc for allocation : Water
* Added hipHostFree for all kernels in test suite
* Added hipHostFree for all kernels in test suite
* Removed memcpy and used hipHostMalloc for allocation : Flip, spatter, rcm, color temperature
* Resolved copilot review comments
* Updated version
* Removed unused parameter
* Updated version in cmakeList
* removed the host to device mem copies for warp affine and rotate
* Updated version
* Removed comment
* Updated Changelog file
* Update patch version from 2.2.0 to 2.2.1
* Update CHANGELOG
* Address copilot comments for HIP HOST consistent allocation
* Documentation changes for updated memcpy changes
* Update ricap outer API to use pinned memory and remove mem copy
* Fix memory allocation and deallocation for permutationTensor
* Update api/rppt_tensor_effects_augmentations.h
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
* Fix spelling of noiseProbability and saltProbability
* Fix deallocation
---------
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
Co-authored-by: Kiriti Gowda <kiritigowda@gmail.com>
Co-authored-by: Srihari-mcw <srihari@multicorewareinc.com>
Co-authored-by: hmaddise <HazarathKumar.Maddisetty@amd.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
* resolved review comments
* minor comment change
* Resolved copilot review comments
* Update src/modules/tensor/cpu/kernel/resize.cpp
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
* Update src/modules/tensor/cpu/kernel/resize.cpp
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
* Update src/modules/tensor/hip/kernel/jpeg_compression_distortion.cpp
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
* Updated test suite and resolved review comments
* Updated HIP for F32 QA reduction function cases
---------
Co-authored-by: Kiriti Gowda <kiriti.nageshgowda@amd.com>
Co-authored-by: Lokesh Bonta <lokeswara@multicorewareinc.com>
Co-authored-by: sampath117 <snehaa@multicorewareinc.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
Co-authored-by: Srihari-mcw <srihari@multicorewareinc.com>
Co-authored-by: ManasaDattaT <tammisetti.manasadatta@multicorewareinc.com>
Co-authored-by: Kiriti Gowda <kiritigowda@gmail.com>
Co-authored-by: hmaddise <HazarathKumar.Maddisetty@amd.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Co-authored-by: Lakshmi Kumar <lakshmi.kumar@amd.com>
HazarathKumarM added a commit to RooseweltMcW/rpp that referenced this pull request on Feb 18, 2026
* add support for dilate in HOST backend
* minor fix in changelog
* added golden outputs
remove commented code
* resolve build errors
* Add padding changes in HIP backend
* fix SIGSEGV issues
* fix QA for 9x9 kernel
* Add if condition for pack function and template for unpack and signext function
* Fix the rename of preLoadRows and max Comments
* Fix rename of Loader and MorphVecLoader
* Fix empty space, dilate_row_hip_compute function, removed if & else and aligned indent R.
* Fix remove whitespace and restored all unnecessary changes.
* Fix remove precision line and reverted back to static cast.
* Fix remove empty line, rename of kernelSze & padPolicy and remove {} for single line condition
* Fix Indentation of IF condition.
* resolved review comments
* resolve review comments
* Test suite - Add QA pass/fail tests for F32 bit depth (ROCm#665)
* Travis CI - key error fix
* Fix Bug in ColorTwist (HazarathKumarM#6) (HazarathKumarM#8) (HazarathKumarM#9)
* Added golden outputs and resolved HOST backend
* Updated bin files for median filter and resize crop mirror
* Updated bin files
* Updated bin files for the next set of kernel F32 QA
* Updated bin files for jpeg_compression_distortion
* Fixed resize QA failures
* Fix for Resize bilinear F32 QA HOST and HIP
* Fix for lens correction QA f32 for HOST and HIP for 1e-4 precision
* Fixed HIP rcm QA
* updates for warp Affine F32 QA
* Fix for RCM QA match for U8 and F32 updates AVX
* Fix for lens correction AVX
* Removed space
* Fixed warp affine for every other variant with the updated changes
* Add fixes to match precision in quantization
* Fix Precision mismatches
* Update default cutoff to 1e-5 and specialized cutoff to 1e-4
* F32 QA Fix
* Made Quality percentage as arg from testsuite
* Resolved copilot comments
* Resolved the copilot comments
* Resolved Codex comments
* HOST and HIP - pinned buffers for respective API (ROCm#628)
* Removed memcpy and used hipHostMalloc for allocation : blend
* Removed memcpy and used hipHostMalloc for allocation : brightness
* Removed memcpy and used hipHostMalloc for allocation : color cast
* Removed memcpy and used hipHostMalloc for allocation : color twist
* Removed memcpy and used hipHostMalloc for allocation : contrast
* Removed memcpy and used hipHostMalloc for allocation : crop mirror normalize
* Removed memcpy and used hipHostMalloc for allocation : Exposure
* Removed memcpy and used hipHostMalloc for allocation : Gamma correction
* Removed memcpy and used hipHostMalloc for allocation : gaussian filter
* Removed memcpy and used hipHostMalloc for allocation : Noise
* Removed memcpy and used hipHostMalloc for allocation : Non linear blend
* Removed memcpy and used hipHostMalloc for allocation : Resize mirror normalize
* Removed memcpy and used hipHostMalloc for allocation : Water
* Added hipHostFree for all kernels in test suite
* Added hipHostFree for all kernels in test suite
* Removed memcpy and used hipHostMalloc for allocation : Flip, spatter, rcm, color temperature
* Resolved copilot review comments
* Updated version
* Removed unused parameter
* Updated version in cmakeList
* removed the host to device mem copies for warp affine and rotate
* Updated version
* Removed comment
* Updated Changelog file
* Update patch version from 2.2.0 to 2.2.1
* Update CHANGELOG
* Address copilot comments for HIP HOST consistent allocation
* Documentation changes for updated memcpy changes
* Update ricap outer API to use pinned memory and remove mem copy
* Fix memory allocation and deallocation for permutationTensor
* Update api/rppt_tensor_effects_augmentations.h
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
* Fix spelling of noiseProbability and saltProbability
* Fix deallocation
---------
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
Co-authored-by: Kiriti Gowda <kiritigowda@gmail.com>
Co-authored-by: Srihari-mcw <srihari@multicorewareinc.com>
Co-authored-by: hmaddise <HazarathKumar.Maddisetty@amd.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
* resolved review comments
* minor comment change
* Resolved copilot review comments
* Update src/modules/tensor/cpu/kernel/resize.cpp
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
* Update src/modules/tensor/cpu/kernel/resize.cpp
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
* Update src/modules/tensor/hip/kernel/jpeg_compression_distortion.cpp
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
* Updated test suite and resolved review comments
* Updated HIP for F32 QA reduction function cases
---------
Co-authored-by: Kiriti Gowda <kiriti.nageshgowda@amd.com>
Co-authored-by: Lokesh Bonta <lokeswara@multicorewareinc.com>
Co-authored-by: sampath117 <snehaa@multicorewareinc.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
Co-authored-by: Srihari-mcw <srihari@multicorewareinc.com>
Co-authored-by: ManasaDattaT <tammisetti.manasadatta@multicorewareinc.com>
Co-authored-by: Kiriti Gowda <kiritigowda@gmail.com>
Co-authored-by: hmaddise <HazarathKumar.Maddisetty@amd.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Co-authored-by: Lakshmi Kumar <lakshmi.kumar@amd.com>
* Erode - HOST and HIP update (ROCm#666)
* added initial api support for erode
* added support for U8 and I8 bitdepths for 3, 5, 7, 9 kernel sizes
* added F16 and F32 bitdepth support
* added generic kernel support
* added golden outputs
removed commented code
* fix build errors
* Fix build and test_suite errors
* revert padding changes
* updated erode HIP kernel with latest changes
* Add F32 QA
* minor formatting fixes
* minor comment fix
* resolve copilot comments
* resolve review comments
* resolved review comments
* Add unpack templating changes and fix segmentation issue
* Fix PKD to PKD kernel 9 for Pack and Unpack changes.
* Add and template signext function
* Fix min Comments
* Fix one min Comments
* Add unroll and rename of preLoadRows
* Fix rename of Loader and MorphVecLoader
* Add empty line before comment
* Fix remove empty line, rename of kernelSze & padPolicy and remove {} for single line condition
* resolved review comments
* fix build warnings
---------
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
Co-authored-by: Mukesh Jayakodi <mukesh.jayakodi@multicorewareinc.com>
Co-authored-by: Kiriti Gowda <kiritigowda@gmail.com>
Co-authored-by: HazarathKumarM <119284987+HazarathKumarM@users.noreply.github.com>
Co-authored-by: Lakshmi Kumar <lakshmi.kumar@amd.com>
* fix build errors
---------
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
Co-authored-by: Mukesh Jayakodi <mukesh.jayakodi@multicorewareinc.com>
Co-authored-by: Kiriti Gowda <kiritigowda@gmail.com>
Co-authored-by: HazarathKumarM <119284987+HazarathKumarM@users.noreply.github.com>
Co-authored-by: Kiriti Gowda <kiriti.nageshgowda@amd.com>
Co-authored-by: Lokesh Bonta <lokeswara@multicorewareinc.com>
Co-authored-by: sampath117 <snehaa@multicorewareinc.com>
Co-authored-by: Srihari-mcw <srihari@multicorewareinc.com>
Co-authored-by: ManasaDattaT <tammisetti.manasadatta@multicorewareinc.com>
Co-authored-by: hmaddise <HazarathKumar.Maddisetty@amd.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Co-authored-by: Lakshmi Kumar <lakshmi.kumar@amd.com>