RPP Fog - Enhancement of Fog Effect with Gray Tone Integration #340
r-abishek merged 7 commits into ar/opt_fog
sampath1117 commented on Sep 19, 2024
- A gray tone has been incorporated into the fog effect to achieve a more realistic and balanced visual output
@r-abishek Please don't merge this PR into the open-source PR until both of the above are checked.
I checked the performance with and without these additional changes, and it looks similar.
src/include/cpu/rpp_cpu_common.hpp
Outdated
{
    __m256 pAlphaMaskFactor[2], pIntensityMaskFactor[2], pGray[2], pOneMinusGrayFactor;
    pGray[0] = _mm256_add_ps(_mm256_mul_ps(_mm256_set1_ps(0.299f), p[0]), _mm256_add_ps(_mm256_mul_ps(_mm256_set1_ps(0.587f), p[2]), _mm256_mul_ps(_mm256_set1_ps(0.114f), p[4])));
    pGray[1] = _mm256_add_ps(_mm256_mul_ps(_mm256_set1_ps(0.299f), p[1]), _mm256_add_ps(_mm256_mul_ps(_mm256_set1_ps(0.587f), p[3]), _mm256_mul_ps(_mm256_set1_ps(0.114f), p[5])));
A couple of changes:
- Please store these three __m256 constants in an array in the fog kernel here, instead of calling _mm256_set1_ps each time:
https://github.com/r-abishek/rpp/blob/sr/fog_pr_changes/src/modules/cpu/kernel/fog.hpp#L84
__m256 pConversionFactor[3];
pConversionFactor[0] = _mm256_set1_ps(0.299f);
pConversionFactor[1] = _mm256_set1_ps(0.587f);
pConversionFactor[2] = _mm256_set1_ps(0.114f);
Pass this array to compute_fog_48_host and use it there.
- Use an fmadd operation for L3141 and L3142, since each is a multiplication followed by an addition.
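The two suggestions above (broadcast the weights once, then combine with fused multiply-add) could be sketched roughly as follows. This is only an illustration under stated assumptions, not the PR's code: GrayWeights, makeGrayWeights, rgbToGray, and rgbToGrayPlanar are hypothetical names, the planar float input layout is assumed, and a scalar std::fma fallback is used when the compiler is not targeting AVX2 with FMA.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#if defined(__AVX2__) && defined(__FMA__)
#include <immintrin.h>

// Hypothetical container for the three broadcast RGB-to-gray weights.
struct GrayWeights { __m256 r, g, b; };

// Broadcast each weight once, so the hot loop does no set1_ps calls.
inline GrayWeights makeGrayWeights()
{
    return { _mm256_set1_ps(0.299f), _mm256_set1_ps(0.587f), _mm256_set1_ps(0.114f) };
}

// gray = (wr * r) + ((wg * g) + (wb * b)), using two fused multiply-adds.
inline __m256 rgbToGray(const GrayWeights &w, __m256 r, __m256 g, __m256 b)
{
    return _mm256_fmadd_ps(w.r, r, _mm256_fmadd_ps(w.g, g, _mm256_mul_ps(w.b, b)));
}
#endif

// Converts n planar RGB float pixels to gray (hypothetical helper).
void rgbToGrayPlanar(const float *r, const float *g, const float *b, float *gray, std::size_t n)
{
    assert(r && g && b && gray);
    std::size_t i = 0;
#if defined(__AVX2__) && defined(__FMA__)
    const GrayWeights w = makeGrayWeights();    // hoisted out of the loop
    for (; i + 8 <= n; i += 8)
    {
        __m256 pGray = rgbToGray(w, _mm256_loadu_ps(r + i), _mm256_loadu_ps(g + i), _mm256_loadu_ps(b + i));
        _mm256_storeu_ps(gray + i, pGray);
    }
#endif
    // Scalar tail (and full fallback), mirroring the same fma nesting.
    for (; i < n; i++)
        gray[i] = std::fma(0.299f, r[i], std::fma(0.587f, g[i], 0.114f * b[i]));
}
```

In the actual kernel the weights would presumably be created once per call and passed into the compute helper, as the comment asks; the sketch only shows the shape of that change.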
@sampath1117 Shouldn't the diff also show the old fog image being removed from the ROCm docs and the new, improved image being added?
@sampath1117 I have added some more comments on this PR to address before merge. Some comments apply to multiple instances, so please check them all.
for (i = 0; i < batchSize; i++)
{
    intensityFactor[i] = 0;
    greyFactor[i] = 0;
Why is greyFactor 0 for HIP and 0.3 for HOST? Please make them match.
Does this have a grayFactor of 0? Please consistently use 0.3, or perhaps 0.35, everywhere: in the HOST/HIP test suites as well as for this ROCm docs image.
 * \param [out] dstPtr destination tensor in HOST memory
 * \param [in] dstDescPtr destination tensor descriptor (Restrictions - numDims = 4, offsetInBytes >= 0, dataType = U8/F16/F32/I8, layout = NCHW/NHWC, c = same as that of srcDescPtr)
 * \param [in] intensityFactor intensity factor values for fog calculation (1D tensor in HOST memory, of size batchSize, with 0 <= intensityFactor <= 0.5 for each image in batch)
 * \param [in] grayFactor gray factor values for fog calculation (1D tensor in HOST memory, of size batchSize, with 0 <= grayFactor <= 1 for each image in batch)
Change comment for both HOST and HIP to something like:
- \param [in] grayFactor gray factor values to introduce grayness in the image for fog calculation
src/modules/hip/kernel/fog.hpp
Outdated
float4 bMultiplier_f4 = static_cast<float4>(0.114f);
grey_f4[0] = r_f8->f4[0] * rMultiplier_f4 + g_f8->f4[0] * gMultiplier_f4 + b_f8->f4[0] * bMultiplier_f4;
grey_f4[1] = r_f8->f4[1] * rMultiplier_f4 + g_f8->f4[1] * gMultiplier_f4 + b_f8->f4[1] * bMultiplier_f4;
float4 oneMinusGreyFactor = static_cast<float4>(1.0f) - *greyFactor_f4;
src/modules/cpu/kernel/fog.hpp
Outdated
__m256 pConversionFactor[3];
pConversionFactor[0] = _mm256_set1_ps(0.299f);
pConversionFactor[1] = _mm256_set1_ps(0.587f);
pConversionFactor[2] = _mm256_set1_ps(0.114f);
In rppdefs.h, perhaps add a #define each for R_TO_GRAY_FACTOR, G_TO_GRAY_FACTOR and B_TO_GRAY_FACTOR. Use them in HOST and HIP, everywhere these three numbers are needed.
We seem to already have these but are not using them: https://github.com/ROCm/rpp/blob/6cc67130cd7cc15f66eac16c9d5da294a8433fc1/src/include/hip/rpp_hip_common.hpp#L132-L134
Either define them in hip common and cpu common separately, or keep a single definition in rppdefs.h.
Although not part of this PR, we should ideally be using those #defines for this line too: https://github.com/ROCm/rpp/blob/6cc67130cd7cc15f66eac16c9d5da294a8433fc1/src/include/cpu/rpp_cpu_common.hpp#L3222
src/modules/cpu/kernel/fog.hpp
Outdated
pixel.R = static_cast<Rpp32f>(srcPtrTemp[0]);
pixel.G = static_cast<Rpp32f>(srcPtrTemp[1]);
pixel.B = static_cast<Rpp32f>(srcPtrTemp[2]);
Rpp32f gray = 0.299 * pixel.R + 0.587 * pixel.G + 0.114 * pixel.B;
The same comment applies wherever hardcoded numbers exist.
And please add parentheses:
Rpp32f gray = (RGB_TO_GREY_WEIGHT_RED * pixel.R) + (RGB_TO_GREY_WEIGHT_GREEN * pixel.G) + (RGB_TO_GREY_WEIGHT_BLUE * pixel.B);
src/modules/cpu/kernel/fog.hpp
Outdated
Rpp32f oneMinusGrayValue = 1 - grayValue;
pixel.R = (pixel.R * oneMinusGrayValue + gray);
pixel.G = (pixel.G * oneMinusGrayValue + gray);
pixel.B = (pixel.B * oneMinusGrayValue + gray);
Perhaps consolidate all these lines into the version below. I've also changed the position of the closing parentheses on the last 3 lines.
RpptFloatRGB pixel = {static_cast<Rpp32f>(srcPtrTemp[0]),
                      static_cast<Rpp32f>(srcPtrTemp[1]),
                      static_cast<Rpp32f>(srcPtrTemp[2])};
Rpp32f gray = grayValue * ((RGB_TO_GREY_WEIGHT_RED * pixel.R) + (RGB_TO_GREY_WEIGHT_GREEN * pixel.G) + (RGB_TO_GREY_WEIGHT_BLUE * pixel.B));
Rpp32f oneMinusGrayValue = 1 - grayValue;
pixel.R = (pixel.R * oneMinusGrayValue) + gray;
pixel.G = (pixel.G * oneMinusGrayValue) + gray;
pixel.B = (pixel.B * oneMinusGrayValue) + gray;
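A self-contained sketch of the consolidated blend suggested above may help when checking its behaviour at the extremes. It assumes the weight values 0.299, 0.587 and 0.114 seen in the diff; FloatRGB is a hypothetical stand-in for RpptFloatRGB, and blendTowardsGray is an illustrative name, not an RPP function.

```cpp
#include <cassert>

// Assumed weight values, taken from the hardcoded numbers in the diff.
#define RGB_TO_GREY_WEIGHT_RED   0.299f
#define RGB_TO_GREY_WEIGHT_GREEN 0.587f
#define RGB_TO_GREY_WEIGHT_BLUE  0.114f

// Hypothetical stand-in for RpptFloatRGB.
struct FloatRGB { float R, G, B; };

// Linearly blends each channel towards the pixel's gray level:
// out = (channel * (1 - grayValue)) + (grayValue * gray).
inline FloatRGB blendTowardsGray(FloatRGB pixel, float grayValue)
{
    assert(grayValue >= 0.0f && grayValue <= 1.0f);
    float gray = grayValue * ((RGB_TO_GREY_WEIGHT_RED * pixel.R) +
                              (RGB_TO_GREY_WEIGHT_GREEN * pixel.G) +
                              (RGB_TO_GREY_WEIGHT_BLUE * pixel.B));
    float oneMinusGrayValue = 1.0f - grayValue;
    pixel.R = (pixel.R * oneMinusGrayValue) + gray;
    pixel.G = (pixel.G * oneMinusGrayValue) + gray;
    pixel.B = (pixel.B * oneMinusGrayValue) + gray;
    return pixel;
}
```

With grayValue = 0 the pixel is returned unchanged, and with grayValue = 1 all three channels collapse to the same gray level, which is why the test-suite default matters for the reference outputs.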
src/modules/cpu/kernel/fog.hpp
Outdated
dstPtrTemp[1] = static_cast<Rpp8u>(RPPPIXELCHECK(std::nearbyintf(pixel.G * alphaMaskFactor + intensityMaskFactor)));
dstPtrTemp[2] = static_cast<Rpp8u>(RPPPIXELCHECK(std::nearbyintf(pixel.B * alphaMaskFactor + intensityMaskFactor)));
srcPtrTemp+=3;
dstPtrTemp+=3;
Please ensure += has spaces before and after the operator
src/modules/cpu/kernel/fog.hpp
Outdated
Rpp32f intensityMaskFactor = ((*fogIntensityMaskPtrTemp++) * ((*fogAlphaMaskPtrTemp++) + intensityValue));
dstPtrTemp[0] = static_cast<Rpp8u>(RPPPIXELCHECK(std::nearbyintf(pixel.R * alphaMaskFactor + intensityMaskFactor)));
dstPtrTemp[1] = static_cast<Rpp8u>(RPPPIXELCHECK(std::nearbyintf(pixel.G * alphaMaskFactor + intensityMaskFactor)));
dstPtrTemp[2] = static_cast<Rpp8u>(RPPPIXELCHECK(std::nearbyintf(pixel.B * alphaMaskFactor + intensityMaskFactor)));
Please don't hesitate to add parentheses. For example, add a pair around (pixel.G * alphaMaskFactor).
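For reference, the per-channel write with the explicit parentheses requested above might look like the sketch below. clampToU8Range is a hypothetical stand-in for RPPPIXELCHECK and is assumed to clamp to [0, 255]; applyFogToChannel is an illustrative name, and the two mask factors are taken as inputs because their full derivation is not shown in this excerpt.

```cpp
#include <cassert>
#include <cmath>
#include <cstdint>

// Hypothetical stand-in for RPPPIXELCHECK: clamp to the U8 range [0, 255].
inline float clampToU8Range(float value)
{
    return (value < 0.0f) ? 0.0f : ((value > 255.0f) ? 255.0f : value);
}

// Applies the fog masks to one channel value, rounds, clamps, and converts to U8.
inline std::uint8_t applyFogToChannel(float channel, float alphaMaskFactor, float intensityMaskFactor)
{
    assert(std::isfinite(channel));
    // Parentheses make the multiply-then-add order explicit.
    float fogged = (channel * alphaMaskFactor) + intensityMaskFactor;
    return static_cast<std::uint8_t>(clampToU8Range(std::nearbyintf(fogged)));
}
```

The explicit grouping does not change the result, since multiplication already binds tighter than addition; it only makes the intent easier to read in review.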
src/modules/cpu/kernel/fog.hpp
Outdated
*dstPtrTempR++ = static_cast<Rpp16f>(RPPPIXELCHECKF32(pixel.R * alphaMaskFactor + intensityMaskFactor));
*dstPtrTempG++ = static_cast<Rpp16f>(RPPPIXELCHECKF32(pixel.G * alphaMaskFactor + intensityMaskFactor));
*dstPtrTempB++ = static_cast<Rpp16f>(RPPPIXELCHECKF32(pixel.B * alphaMaskFactor + intensityMaskFactor));
srcPtrTemp+=3;
* Update CMakeLists.txt
Version Upgrade
* Bump rocm-docs-core[api_reference] from 0.38.1 to 1.0.0 in /docs/sphinx (#337)
* Bump rocm-docs-core[api_reference] from 0.38.1 to 1.0.0 in /docs/sphinx
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.38.1 to 1.0.0.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.38.1...v1.0.0)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com>
* Use Python 3.10 in RTD config
---------
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Sam Wu <sam.wu2@amd.com>
* Bump rocm-docs-core[api_reference] from 1.0.0 to 1.1.0 in /docs/sphinx (#339)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 1.0.0 to 1.1.0.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v1.0.0...v1.1.0)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* RPP Gaussian Noise Voxel Tensor on HOST and HIP (#323)
* License - updates to 2024 and consistency changes (#298)
* Match all CMakeLists.txt license as per RPP's outermost LICENSE file
* Match all python files' license as per RPP's outermost LICENSE file
* Match all .hpp files' license as per RPP's outermost LICENSE file
* Match all .cpp files' license as per RPP's outermost LICENSE file
* Match all .h files' license as per RPP's outermost LICENSE file
* Remove all rights reserved as per LICENSE file
* Remove double space in "Copyright (c) 2019 - 2023 Advanced Micro Devices, Inc."
* Match all .cmake files' license as per RPP's outermost LICENSE file
* Match all .cpp.in files' license as per RPP's outermost LICENSE file
* Replace 283 occurrences in 282 files - 2023 to 2024
* Add "MIT License" title to 281 instances
* Add missing license
* Test - Update README.md for test_suite (#299)
* Bump rocm-docs-core[api_reference] from 0.33.0 to 0.33.1 in /docs/sphinx (#301)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.33.0 to 0.33.1.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.33.0...v0.33.1)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* Bump rocm-docs-core[api_reference] from 0.33.1 to 0.33.2 in /docs/sphinx (#302)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.33.1 to 0.33.2.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.33.1...v0.33.2)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* Update doc codeowners (#303)
* Documentation - Bump rocm-docs-core[api_reference] from 0.33.2 to 0.34.0 in /docs/sphinx (#304)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.33.2 to 0.34.0.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.33.2...v0.34.0)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* Test suite - upgrade 5 qa perf (#305)
* experimental changes for adding qa mode for performance tests
* made changes to display more information w.r.t. QA results summary for performance tests
* minor changes
* Add changes to dump qa results to excel file
* Add performance QA for three new tensor functions
* update prerequisites in readme
* added changes to handle unsupported cases
* removed threshold dictionary and added performance Noise threshold
add new dataset for performance QA
* RPP Test Suite Upgrade 4 - CSV to BIN conversions for file size reduction (#293)
* change golden outputs from .csv files to .bin files
* Changed comparison functions to use .bin files
* Address review comments
* minor change
* Address review comments
* minor change
---------
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* Changes to the performance summary dataframe
* minor changes
* Update CMakeLists.txt to add ${CMAKE_CURRENT_SOURCE_DIR} for CI
* Update CMakeLists.txt fix
* Update CMakeLists.txt fix
* remove tabulate dependency
* Update README.md to remove tabulate pip install
* Fix for CI machine failure
* Add note on performance
---------
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
Co-authored-by: Abishek <52214183+r-abishek@users.noreply.github.com>
Co-authored-by: Snehaa Giridharan <snehaa@multicorewareinc.com>
Co-authored-by: r-abishek <abishek@multicorewareinc.com>
* RPP Color Temperature on HOST and HIP (#271)
* Initial commit - Color Temperature HOST Tensor
* Initial commit - Color Temperature HIP Tensor
* Add color temperature golden outputs
* address review comments
* Use reinterpret_cast instead of static_cast
* Combine templated functions to support all datatypes into one
(got minor perf difference of order 3%)
Also fixes indentation
* Fix i8 datatype
* Cleanup
* RPP Test Suite Upgrade 4 - CSV to BIN conversions for file size reduction (#293)
* change golden outputs from .csv files to .bin files
* Changed comparison functions to use .bin files
* Address review comments
* minor change
* Address review comments
* minor change
---------
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* Fix PLN3 variant outputs
Also modifies reference outputs
* Update color_temperature.hpp license
* Delete color_temperature_u8_Tensor_PKD3.csv
* Delete color_temperature_u8_Tensor_PLN3.csv
---------
Co-authored-by: snehaa8 <snehaa@multicorewareinc.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
Co-authored-by: Snehaa-Giridharan <118163708+snehaa8@users.noreply.github.com>
* RPP Voxel 3D Tensor Add/Subtract scalar on HOST and HIP (#272)
* added HOST support for voxel add kernel
* added HIP support for voxel add kernel
* added test suite support for add scalar
* added Doxygen support and modified hip kernel function names as per new standard
* added HOST support for voxel subtract kernel
* added HIP support for voxel subtract kernel
* added test suite support
* updated the golden outputs for subtract with correct values
* removed unnecessary validation checks
* Remove double spaces
* Fix header
* Fix all retval docs
* Fix docs to add memory type
* Fix comment
* Add divider comment
* Use post-increment efficiently
* RPP Test Suite Upgrade 4 - CSV to BIN conversions for file size reduction (#293)
* change golden outputs from .csv files to .bin files
* Changed comparison functions to use .bin files
* Address review comments
* minor change
* Address review comments
* minor change
---------
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* converted add and subtract scalar golden outputs to bin files
* changed copyright from 2023 to 2024
* Update add_scalar.hpp license
* Update subtract_scalar.hpp license
---------
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* RPP Magnitude on HOST and HIP (#278)
* Initial commit - Magnitude HOST Tensor
* Add QA reference outputs
* Update runTests.py
* Initial commit - Magnitude HIP Tensor
* Add dual input support in testsuite
* Optimize HOST kernel further
* Optimize i8 datatype further
* Modify comments
* RPP Test Suite Upgrade 4 - CSV to BIN conversions for file size reduction (#293)
* change golden outputs from .csv files to .bin files
* Changed comparison functions to use .bin files
* Address review comments
* minor change
* Address review comments
* minor change
---------
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* Bump rocm-docs-core[api_reference] from 0.31.0 to 0.33.0 in /docs/sphinx (#294)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.31.0 to 0.33.0.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.31.0...v0.33.0)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* Update Copyright year
* Combine templated functions to support all datatypes
* Modify format of reference outputs
* Update rppi_arithmetic_operations.h license
* Update rppt_tensor_arithmetic_operations.h license
* Update host_tensor_arithmetic_operations.hpp
* Update magnitude.hpp license
* Update hip_tensor_arithmetic_operations.hpp license
* Delete magnitude_u8_Tensor_PKD3.csv
* Delete magnitude_u8_Tensor_PLN1.csv
* Delete magnitude_u8_Tensor_PLN3.csv
* Update rpp_test_suite_common.h license
* Update runTests.py license
* Update Tensor_hip.cpp license
* Update runTests.py license
* Update Tensor_host.cpp license
---------
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: Snehaa Giridharan <snehaa@multicorewareinc.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Snehaa-Giridharan <118163708+snehaa8@users.noreply.github.com>
* Bump rocm-docs-core[api_reference] from 0.34.0 to 0.34.2 in /docs/sphinx (#309)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.34.0 to 0.34.2.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.34.0...v0.34.2)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* RPP Tensor Audio Support - Down Mixing (#296)
* Initial commit - Non silent region detection
Includes unittest setup
* Initial commit - To Decibels
Includes unittest setup
* Initial commit - pre_emphasis_filter
* Initial commit - down_mixing
* Replace vectors with arrays
* Cleanup
* Minor cleanup
* Optimize downmixing Kernel
Includes cleanup
* Replace Rpp64s with Rpp32s
* Cleanup
* Optimize and precompute cutOff
* Fix buffer used
* Fix buffer used
* Additional Cleanup
* Optimize post increment operation
* Optimize post increment operation
* Update testsuite for Audio
* code cleanup
* Add Readme file for Audio test suite
* changes based on review comments
* minor change
* Remove unittest folders and updated README.md
* Remove unit tests
* minor change
* code cleanup
* added common header file for audio helper functions
* removed unnecessary audio wav files
fixed bug in ROI updation for audio test suite
resolved issue in summary generation for performance tests in python
* removed log file
* added doxygen support for audio
* added doxygen changes for to_decibels
* updated test suite support for to_decibels
* minor change
* added doxygen changes for preemphasis filter
* updated changes for preemphasis filter in test suite
* removed the usage of getMax function and used std::max_element
* modularized code in test suite
* merge with latest changes
* minor change
* minor change
* minor change
* resolved codacy warnings
* Codacy fix - Remove unused cpuTime
* CMakeLists - Version Update
1.5.0 - TOT Version
* CHANGELOG Updates
Version 1.5.0 placeholder
* resolved issue with file_system dependency in test suite
* Doxygen changes
changed malloc to new in NSR kernel
* RPP RICAP Tensor for HOST and HIP (#213)
* Initial commit - Ricap HOST Tensor
Includes testsuite changes
* Add QA tests for RICAP
Used three_images_224x224_src1 folder to create golden outputs
* Add three_images_224x224_src1 into TEST_IMAGES
* Support HIP Backend for RICAP
* Fix HIP pkd3->pkd3 variant
* regenerated golden outputs for RICAP
minor changes in HOST shell script for handling RICAP in QA mode
* minor bug fix in RICAP HIP kernels
* Improve readability and Cleanup
* Additional cleanup
* Cleanup testsuite
Includes new golden outputs
* Additional testsuite fixes
* Minor cleanup
* Fix codacy warnings
* Address other codacy warnings
* Update ricap.hpp with reference paper
* Add RICAP dataset path in readme
* Make changes to error codes returned
* Modify roi crop region for unit and perf tests
* RPP Tensor Water Augmentation on HOST and HIP (#181)
* added water HOST and HIP codes
* added water case in test suite
* added golden outputs for water
* added omp thread changes for water augmentation
* experimental changes
* fixed output issue with AVX2 instructions
* added AVX2 support for PKD3 load function
minor changes in PLN variant load functions
* nwc commit - added avx2 changes for u8 layout toggle variants but need to add store functions for completion
* Add Avx2 implementation for F32 and U8 toggle variants
* Add AVX2 support for u8 pkd3-pln3 and i8 pkd3-pln3 for water augmentation
* change F32 load and store logic
* optimized the store function for F32 PLN3-PKD3
* reverted back irrelevant changes
* minor change
* optimized load and store functions for water U8 and F32 variants in host
removed commented code
* removed golden outputs for water
* minor changes
* renamed few functions and removed unused functions
updated i8 pln1 load as per the optimized u8 pln1 load
* fixed bug in i8 load function
* changed cast to c++ style
resolved spacing issues and added comments for AVX codes for better understanding
made changes to handle cases where QA Tests are not supported
* added golden outputs for water
* updated golden outputs with latest changes
* modified the u8, i8 pkd3-pln3 function and added comments for the vectorized code
* fixed minor bug in I8 variants
* made changes to resolve codacy warnings
* changed cast to c++ style in hip kernel
* changed generic nn F32 loads using gather and setr instructions
* added comments for latest changes
* minor change
* added definition for storing 32 and 64 bits from a 128bit register
---------
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* Fix build error
* CMakeLists - Version Update
1.5.0 - TOT Version
* CHANGELOG Updates
Version 1.5.0 placeholder
* Boost deps fix for test suite
---------
Co-authored-by: Snehaa Giridharan <snehaa@multicorewareinc.com>
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
Co-authored-by: Snehaa-Giridharan <118163708+snehaa8@users.noreply.github.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
Co-authored-by: Kiriti Gowda <kiritigowda@gmail.com>
* Documentation - Readme & changelog updates (#251)
* readme and changelog updates for 6.0
* minor update
* added ctests for audio test suite for CI
made changes to add more clarity on the QA Tests results
* Cmake mods for ctest
* HOST-only build error bugfix
* added qa mode parameter to python audio script
added golden output map for QA testing of Non silent region detection
* minor change
* Documentation - Bump rocm-docs-core[api_reference] from 0.26.0 to 0.27.0 in /docs/sphinx (#253)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.26.0 to 0.27.0.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/RadeonOpenCompute/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.26.0...v0.27.0)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* RPP Resize Mirror Normalize Bugfix (#252)
* added fix for hipMemset
* remove pixel check for U8-F32 and U8-F16 for HOST codes
---------
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
* added example for MMS calculation in comments for better understanding
* Sphinx - updates (#257)
* Sphinx - updates
* Doxygen - Updates
* Docs - Remove index.md
* updated info used for running audio test suite
* removed bitdepth variable from audio test suite
* added more information on computing NSR outputs in the example added
* Fix doxygen for decibels
Also removes extra QA reference files
* move tensor_host_audio.cpp to host folder
* Fix build errors and qa tests in Audio Test suite
* Fix build errors and qa tests in Audio Test suite
* Add reference output and test samples for downmix
* Add down_mix in augmentation list and supported cases
* Remove auto-merge repeated funcs
* Improve clarity of header docs
* Remove blank line
* Improve clarity on header docs
* Add Doxygen comments
* minor change
* converted golden outputs to binary file for downmixing
* removed old golden output file for preemphasis and todecibels
* modified info for downmixing as per new changes
used handle memory for temporary buffers
* formatting changes
* moved the common code for SSE and AVX to outside
* Update down_mixing.hpp license
* Update rppt_tensor_audio_augmentations.h
* combined the srcLength and channels tensors into single tensor
---------
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: Snehaa Giridharan <snehaa@multicorewareinc.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
Co-authored-by: Kiriti Gowda <kiritigowda@gmail.com>
Co-authored-by: Snehaa-Giridharan <118163708+snehaa8@users.noreply.github.com>
Co-authored-by: Lisa <lisajdelaney@gmail.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Sundarrajan98 <sundarrajan@multicorewareinc.com>
* RPP Voxel 3D Tensor Multiply scalar on HOST and HIP (#306)
* added HIP support for voxel scalar multiply kernel
* added HOST support for voxel multiply kernel
added golden outputs for voxel multiply kernel
* merge with master
* RPP Test Suite Upgrade 4 - CSV to BIN conversions for file size reduction (#293)
* change golden outputs from .csv files to .bin files
* Changed comparison functions to use .bin files
* Address review comments
* minor change
* Address review comments
* minor change
---------
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* converted multiply scalar voxel golden outputs to bin files
* changed copyright from 2023 to 2024
---------
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* Test Suite Bugfix (#307)
* experimental changes for adding qa mode for performance tests
* made changes to display more information w.r.t. QA results summary for performance tests
* minor changes
* Add changes to dump qa results to excel file
* Add performance QA for three new tensor functions
* update prerequisites in readme
* added changes to handle unsupported cases
* removed threshold dictionary and added performance Noise threshold
add new dataset for performance QA
* RPP Test Suite Upgrade 4 - CSV to BIN conversions for file size reduction (#293)
* change golden outputs from .csv files to .bin files
* Changed comparison functions to use .bin files
* Address review comments
* minor change
* Address review comments
* minor change
---------
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* Changes to the performance summary dataframe
* minor changes
* Update CMakeLists.txt to add ${CMAKE_CURRENT_SOURCE_DIR} for CI
* Update CMakeLists.txt fix
* Update CMakeLists.txt fix
* remove tabulate dependency
* Update README.md to remove tabulate pip install
* Fix for CI machine failure
* Add note on performance
* Fix segmentation fault
* Revert QAmode to restrict HIP bitdepths
* Use Rpp64u for HOST while comparing outputs
* Fix ambiguous abs call
* Fix for SLES CI HIP fail - error: incompatible pointer types assigning to 'unsigned long *' from 'unsigned long long *' - refOutput = TensorSumReferenceOutputs[numChannels].data();
---------
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
Co-authored-by: Snehaa Giridharan <snehaa@multicorewareinc.com>
Co-authored-by: Pavel Tcherniaev <Pavel.Tcherniaev@amd.com>
* Initial commit
* Merge changes and fixes for gaussian noise 3d
* Test suite merge and fixes for gaussian noise 3d
* added initial support for gaussian noise HOST NDHWC variant
* added NCDHW support
* added u8 and i8 bitdepth support
* updated gaussian noise voxel host outer api to match with hip api
merged gaussian noise voxel kernel codes in 2d kernel codes
* resolved black pixels issue across border
* minor changes
* modified HIP kernel as per the latest changes
* modified the description as per the latest changes
* made changes in test suite
* added new host compute functions for gaussian noise 3d
* Bump rocm-docs-core[api_reference] from 0.35.0 to 0.35.1 in /docs/sphinx (#319)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.35.0 to 0.35.1.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.35.0...v0.35.1)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* moved the copy 3d function to rpp_cpu_common.hpp
* reverted incorrect changes happened with merge
* fix test suite issue with RMN
* revert incorrect merge changes
remove empty blank lines
* modify suffix from 3d to voxel for gaussian noise
added U8 support for gaussian noise HIP voxel kernel
* added separate copy kernel for copying input to output when mean and stddev passed is 0
* Bump rocm-docs-core[api_reference] from 0.35.1 to 0.36.0 in /docs/sphinx (#322)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.35.1 to 0.36.0.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.35.1...v0.36.0)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* fixed bug in test suite
* Docs - Bump rocm-docs-core[api_reference] from 0.36.0 to 0.37.0 in /docs/sphinx (#328)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.36.0 to 0.37.0.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.36.0...v0.37.0)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* Link cleanup (#326)
* link updates
* update tables
* pare down index
* API cleanup
* consistency
* verbiage
* Update notes
* change function name from CHECK to CHECK_RETURN_STATUS
* Docs - Bump rocm-docs-core[api_reference] from 0.37.0 to 0.37.1 in /docs/sphinx (#329)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.37.0 to 0.37.1.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.37.0...v0.37.1)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* RPP Voxel Flip on HIP and HOST (#285)
* added support for flip voxel
* added test suite support
* added golden outputs for flip voxel
made changes in test suite to run QA tests for flip
* updated golden outputs with correct values
* minor bug fix in the hip test suite
* made changes to variable names for better readability
fixed comments in test suite
minor cleanup
* combined the flip axis factor as ternary operator in HIP kernel
added new enum for error handling when source and destination layouts are not matching
* RPP Test Suite Upgrade 4 - CSV to BIN conversions for file size reduction (#293)
* change golden outputs from .csv files to .bin files
* Changed comparison functions to use .bin files
* Address review comments
* minor change
* Address review comments
* minor change
---------
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* converted flip voxel golden outputs to bin files
* changed copyright from 2023 to 2024
* Update flip_voxel.hpp license
* License - updates to 2024 and consistency changes (#298)
* Match all CMakeLists.txt license as per RPP's outermost LICENSE file
* Match all python files' license as per RPP's outermost LICENSE file
* Match all .hpp files' license as per RPP's outermost LICENSE file
* Match all .cpp files' license as per RPP's outermost LICENSE file
* Match all .h files' license as per RPP's outermost LICENSE file
* Remove all rights reserved as per LICENSE file
* Remove double space in "Copyright (c) 2019 - 2023 Advanced Micro Devices, Inc."
* Match all .cmake files' license as per RPP's outermost LICENSE file
* Match all .cpp.in files' license as per RPP's outermost LICENSE file
* Replace 283 occurrences in 282 files - 2023 to 2024
* Add "MIT License" title to 281 instances
* Add missing license
* Test - Update README.md for test_suite (#299)
* Bump rocm-docs-core[api_reference] from 0.33.0 to 0.33.1 in /docs/sphinx (#301)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.33.0 to 0.33.1.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.33.0...v0.33.1)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* Bump rocm-docs-core[api_reference] from 0.33.1 to 0.33.2 in /docs/sphinx (#302)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.33.1 to 0.33.2.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.33.1...v0.33.2)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* Update doc codeowners (#303)
* Documentation - Bump rocm-docs-core[api_reference] from 0.33.2 to 0.34.0 in /docs/sphinx (#304)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.33.2 to 0.34.0.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.33.2...v0.34.0)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* Test suite - upgrade 5 qa perf (#305)
* experimental changes for adding qa mode for performance tests
* made changes to display more information w.r.t QA results summary for performance tests
* minor changes
* Add changes to dump qa results to excel file
* Add performance QA for three new tensor functions
* update prerequisites in readme
* added changes to handle unsupported cases
* removed threshold dictionary and added performance Noise threshold
add new dataset for performance QA
* RPP Test Suite Upgrade 4 - CSV to BIN conversions for file size reduction (#293)
* change golden outputs from .csv files to .bin files
* Changed comparison functions to use .bin files
* Address review comments
* minor change
* Address review comments
* minor change
---------
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* Changes to the performance summary dataframe
* minor changes
* Update CMakeLists.txt to add ${CMAKE_CURRENT_SOURCE_DIR} for CI
* Update CMakeLists.txt fix
* Update CMakeLists.txt fix
* remove tabulate dependency
* Update README.md to remove tabulate pip install
* Fix for CI machine failure
* Add note on performance
---------
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
Co-authored-by: Abishek <52214183+r-abishek@users.noreply.github.com>
Co-authored-by: Snehaa Giridharan <snehaa@multicorewareinc.com>
Co-authored-by: r-abishek <abishek@multicorewareinc.com>
* RPP Color Temperature on HOST and HIP (#271)
* Initial commit - Color Temperature HOST Tensor
* Initial commit - Color Temperature HIP Tensor
* Add color temperature golden outputs
* address review comments
* Use reinterpret_cast instead of static_cast
* Combine templated functions to support all datatypes into one
(got minor perf difference of order 3%)
Also fixes indentation
* Fix i8 datatype
* Cleanup
* RPP Test Suite Upgrade 4 - CSV to BIN conversions for file size reduction (#293)
* change golden outputs from .csv files to .bin files
* Changed comparison functions to use .bin files
* Address review comments
* minor change
* Address review comments
* minor change
---------
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* Fix PLN3 variant outputs
Also modifies reference outputs
* Update color_temperature.hpp license
* Delete color_temperature_u8_Tensor_PKD3.csv
* Delete color_temperature_u8_Tensor_PLN3.csv
---------
Co-authored-by: snehaa8 <snehaa@multicorewareinc.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
Co-authored-by: Snehaa-Giridharan <118163708+snehaa8@users.noreply.github.com>
* RPP Voxel 3D Tensor Add/Subtract scalar on HOST and HIP (#272)
* added HOST support for voxel add kernel
* added HIP support for voxel add kernel
* added test suite support for add scalar
* added Doxygen support and modified hip kernel function names as per new standard
* added HOST support for voxel subtract kernel
* added HIP support for voxel subtract kernel
* added test suite support
* updated the golden outputs for subtract with correct values
* removed unnecessary validation checks
* Remove double spaces
* Fix header
* Fix all retval docs
* Fix docs to add memory type
* Fix comment
* Add divider comment
* Use post-increment efficiently
* RPP Test Suite Upgrade 4 - CSV to BIN conversions for file size reduction (#293)
* change golden outputs from .csv files to .bin files
* Changed comparison functions to use .bin files
* Address review comments
* minor change
* Address review comments
* minor change
---------
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* converted add and subtract scalar golden outputs to bin files
* changed copyright from 2023 to 2024
* Update add_scalar.hpp license
* Update subtract_scalar.hpp license
---------
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* RPP Magnitude on HOST and HIP (#278)
* Initial commit - Magnitude HOST Tensor
* Add QA reference outputs
* Update runTests.py
* Initial commit - Magnitude HIP Tensor
* Add dual input support in testsuite
* Optimize HOST kernel further
* Optimize i8 datatype further
* Modify comments
* RPP Test Suite Upgrade 4 - CSV to BIN conversions for file size reduction (#293)
* change golden outputs from .csv files to .bin files
* Changed comparison functions to use .bin files
* Address review comments
* minor change
* Address review comments
* minor change
---------
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* Bump rocm-docs-core[api_reference] from 0.31.0 to 0.33.0 in /docs/sphinx (#294)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.31.0 to 0.33.0.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.31.0...v0.33.0)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* Update Copyright year
* Combine templated functions to support all datatypes
* Modify format of reference outputs
* Update rppi_arithmetic_operations.h license
* Update rppt_tensor_arithmetic_operations.h license
* Update host_tensor_arithmetic_operations.hpp
* Update magnitude.hpp license
* Update hip_tensor_arithmetic_operations.hpp license
* Delete magnitude_u8_Tensor_PKD3.csv
* Delete magnitude_u8_Tensor_PLN1.csv
* Delete magnitude_u8_Tensor_PLN3.csv
* Update rpp_test_suite_common.h license
* Update runTests.py license
* Update Tensor_hip.cpp license
* Update runTests.py license
* Update Tensor_host.cpp license
---------
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: Snehaa Giridharan <snehaa@multicorewareinc.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Snehaa-Giridharan <118163708+snehaa8@users.noreply.github.com>
* Bump rocm-docs-core[api_reference] from 0.34.0 to 0.34.2 in /docs/sphinx (#309)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.34.0 to 0.34.2.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.34.0...v0.34.2)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* RPP Tensor Audio Support - Down Mixing (#296)
* Initial commit - Non silent region detection
Includes unittest setup
* Initial commit - To Decibels
Includes unittest setup
* Initial commit - pre_emphasis_filter
* Initial commit - down_mixing
* Replace vectors with arrays
* Cleanup
* Minor cleanup
* Optimize downmixing Kernel
Includes cleanup
* Replace Rpp64s with Rpp32s
* Cleanup
* Optimize and precompute cutOff
* Fix buffer used
* Fix buffer used
* Additional Cleanup
* Optimize post increment operation
* Optimize post increment operation
* Update testsuite for Audio
* code cleanup
* Add Readme file for Audio test suite
* changes based on review comments
* minor change
* Remove unittest folders and updated README.md
* Remove unit tests
* minor change
* code cleanup
* added common header file for audio helper functions
* removed unnecessary audio wav files
fixed bug in ROI update for audio test suite
resolved issue in summary generation for performance tests in python
* removed log file
* added doxygen support for audio
* added doxygen changes for to_decibels
* updated test suite support for to_decibels
* minor change
* added doxygen changes for preemphasis filter
* updated changes for preemphasis filter in test suite
* removed the usage of getMax function and used std::max_element
* modularized code in test suite
* merge with latest changes
* minor change
* minor change
* minor change
* resolved codacy warnings
* Codacy fix - Remove unused cpuTime
* CMakeLists - Version Update
1.5.0 - TOT Version
* CHANGELOG Updates
Version 1.5.0 placeholder
* resolved issue with file_system dependency in test suite
* Doxygen changes
changed malloc to new in NSR kernel
* RPP RICAP Tensor for HOST and HIP (#213)
* Initial commit - Ricap HOST Tensor
Includes testsuite changes
* Add QA tests for RICAP
Used three_images_224x224_src1 folder to create golden outputs
* Add three_images_224x224_src1 into TEST_IMAGES
* Support HIP Backend for RICAP
* Fix HIP pkd3->pkd3 variant
* regenerated golden outputs for RICAP
minor changes in HOST shell script for handling RICAP in QA mode
* minor bug fix in RICAP HIP kernels
* Improve readability and Cleanup
* Additional cleanup
* Cleanup testsuite
Includes new golden outputs
* Additional testsuite fixes
* Minor cleanup
* Fix codacy warnings
* Address other codacy warnings
* Update ricap.hpp with reference paper
* Add RICAP dataset path in readme
* Make changes to error codes returned
* Modify roi crop region for unit and perf tests
* RPP Tensor Water Augmentation on HOST and HIP (#181)
* added water HOST and HIP codes
* added water case in test suite
* added golden outputs for water
* added omp thread changes for water augmentation
* experimental changes
* fixed output issue with AVX2 instructions
* added AVX2 support for PKD3 load function
minor changes in PLN variant load functions
* nwc commit - added avx2 changes for u8 layout toggle variants but need to add store functions for completion
* Add Avx2 implementation for F32 and U8 toggle variants
* Add AVX2 support for u8 pkd3-pln3 and i8 pkd3-pln3 for water augmentation
* change F32 load and store logic
* optimized the store function for F32 PLN3-PKD3
* reverted back irrelevant changes
* minor change
* optimized load and store functions for water U8 and F32 variants in host
removed commented code
* removed golden outputs for water
* minor changes
* renamed few functions and removed unused functions
updated i8 pln1 load as per the optimized u8 pln1 load
* fixed bug in i8 load function
* changed cast to c++ style
resolved spacing issues and added comments for AVX codes for better understanding
made changes to handle cases where QA Tests are not supported
* added golden outputs for water
* updated golden outputs with latest changes
* modified the u8, i8 pkd3-pln3 function and added comments for the vectorized code
* fixed minor bug in I8 variants
* made changes to resolve codacy warnings
* changed cast to c++ style in hip kernel
* changed generic nn F32 loads using gather and setr instructions
* added comments for latest changes
* minor change
* added definition for storing 32 and 64 bits from a 128bit register
---------
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* Fix build error
* CMakeLists - Version Update
1.5.0 - TOT Version
* CHANGELOG Updates
Version 1.5.0 placeholder
* Boost deps fix for test suite
---------
Co-authored-by: Snehaa Giridharan <snehaa@multicorewareinc.com>
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
Co-authored-by: Snehaa-Giridharan <118163708+snehaa8@users.noreply.github.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
Co-authored-by: Kiriti Gowda <kiritigowda@gmail.com>
* Documentation - Readme & changelog updates (#251)
* readme and changelog updates for 6.0
* minor update
* added ctests for audio test suite for CI
made changes to add more clarity on the QA Tests results
* Cmake mods for ctest
* HOST-only build error bugfix
* added qa mode parameter to python audio script
added golden output map for QA testing of Non silent region detection
* minor change
* Documentation - Bump rocm-docs-core[api_reference] from 0.26.0 to 0.27.0 in /docs/sphinx (#253)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.26.0 to 0.27.0.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/RadeonOpenCompute/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.26.0...v0.27.0)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* RPP Resize Mirror Normalize Bugfix (#252)
* added fix for hipMemset
* remove pixel check for U8-F32 and U8-F16 for HOST codes
---------
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
* added example for MMS calculation in comments for better understanding
* Sphinx - updates (#257)
* Sphinx - updates
* Doxygen - Updates
* Docs - Remove index.md
* updated info used for running audio test suite
* removed bitdepth variable from audio test suite
* added more information on computing NSR outputs in the example added
* Fix doxygen for decibels
Also removes extra QA reference files
* move tensor_host_audio.cpp to host folder
* Fix build errors and qa tests in Audio Test suite
* Fix build errors and qa tests in Audio Test suite
* Add reference output and test samples for downmix
* Add down_mix in augmentation list and supported cases
* Remove auto-merge repeated funcs
* Improve clarity of header docs
* Remove blank line
* Improve clarity on header docs
* Add Doxygen comments
* minor change
* converted golden outputs to binary file for downmixing
* removed old golden output file for preemphasis and todecibels
* modified info for downmixing as per new changes
used handle memory for temporary buffers
* formatting changes
* moved the common code for SSE and AVX to outside
* Update down_mixing.hpp license
* Update rppt_tensor_audio_augmentations.h
* combined the srcLength and channels tensors into single tensor
---------
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: Snehaa Giridharan <snehaa@multicorewareinc.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
Co-authored-by: Kiriti Gowda <kiritigowda@gmail.com>
Co-authored-by: Snehaa-Giridharan <118163708+snehaa8@users.noreply.github.com>
Co-authored-by: Lisa <lisajdelaney@gmail.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Sundarrajan98 <sundarrajan@multicorewareinc.com>
* RPP Voxel 3D Tensor Multiply scalar on HOST and HIP (#306)
* added HIP support for voxel scalar multiply kernel
* added HOST support for voxel multiply kernel
added golden outputs for voxel multiply kernel
* merge with master
* RPP Test Suite Upgrade 4 - CSV to BIN conversions for file size reduction (#293)
* change golden outputs from .csv files to .bin files
* Changed comparison functions to use .bin files
* Address review comments
* minor change
* Address review comments
* minor change
---------
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* converted multiply scalar voxel golden outputs to bin files
* changed copyright from 2023 to 2024
---------
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* Test Suite Bugfix (#307)
* experimental changes for adding qa mode for performance tests
* made changes to display more information w.r.t QA results summary for performance tests
* minor changes
* Add changes to dump qa results to excel file
* Add performance QA for three new tensor functions
* update prerequisites in readme
* added changes to handle unsupported cases
* removed threshold dictionary and added performance Noise threshold
add new dataset for performance QA
* RPP Test Suite Upgrade 4 - CSV to BIN conversions for file size reduction (#293)
* change golden outputs from .csv files to .bin files
* Changed comparison functions to use .bin files
* Address review comments
* minor change
* Address review comments
* minor change
---------
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* Changes to the performance summary dataframe
* minor changes
* Update CMakeLists.txt to add ${CMAKE_CURRENT_SOURCE_DIR} for CI
* Update CMakeLists.txt fix
* Update CMakeLists.txt fix
* remove tabulate dependency
* Update README.md to remove tabulate pip install
* Fix for CI machine failure
* Add note on performance
* Fix segmentation fault
* Revert QAmode to restrict HIP bitdepths
* Use Rpp64u for HOST while comparing outputs
* Fix ambiguous abs call
* Fix for SLES CI HIP fail - error: incompatible pointer types assigning to 'unsigned long *' from 'unsigned long long *' - refOutput = TensorSumReferenceOutputs[numChannels].data();
---------
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
Co-authored-by: Snehaa Giridharan <snehaa@multicorewareinc.com>
Co-authored-by: Pavel Tcherniaev <Pavel.Tcherniaev@amd.com>
* Bump rocm-docs-core[api_reference] from 0.34.2 to 0.35.0 in /docs/sphinx (#313)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.34.2 to 0.35.0.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.34.2...v0.35.0)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* RPP Reduction - Tensor min and Tensor max on HOST and HIP (#260)
* Minor Change
* Add Validation check for DST_FOLDER path
* added water HOST and HIP codes
* added water case in test suite
* added golden outputs for water
* Add Validation checks for all options in testAllScript.sh
* Add sanity check for dual Input cases
Set Max Dimension and Max Image Dump
Replaced Fast DCT tag with Accurate DCT
* Regenerate golden outputs using accurate dct Flag
Add golden outputs for some new augmentations
* Fix Flip golden outputs mismatch
Fix PLN3 variants mismatch in QA mode
* Add MAX_BATCH_SIZE check
removed Augmentations function calls for failing Qa modes
code cleanup
* Add crop and gamma correction augmentations
code cleanup
* Add comments to functions in rpp_test_suite_common.h
* minor change
* code cleanup
* minor code changes
* Change roi and Image sizes for crop augmentation
* Change numIterations option to numRuns
Addressed PR comments
* added omp thread changes for water augmentation
* experimental changes
* fixed output issue with AVX2 instructions
* added AVX2 support for PKD3 load function
minor changes in PLN variant load functions
* Add turboJpeg header to update maxHeight and maxWidth values
* nwc commit - added avx2 changes for u8 layout toggle variants but need to add store functions for completion
* Change the performance Timings logic
* Add Avx2 implementation for F32 and U8 toggle variants
* minor change to support u8_f16 and u8_f32 cases
* Regenerate LUT golden outputs with ACCURATE_DCT tag
* Minor code changes
* Add AVX2 support for u8 pkd3-pln3 and i8 pkd3-pln3 for water augmentation
* Made changes to the runTests.py in Host to remove testAllScipts.sh
* Made changes to the runTests.py in HIP to remove testAllScipts.sh
* Initial commit - Image min and max Reduction kernel
Includes
* u8 datatype for both min and max HOST Tensor of all variants.
* Testsuite changes.
* NWC -initial code for min max PLN3 - PLN3
* made changes to split min and max kernels separately
* split kernels for min and max
* made changes to print final max/min in the R,G,B channels
* fixed inaccuracies in min/max computation
* made changes to typecast intermediate output to output requested by user
added comments for the code
code cleanup and minor changes in test suite
* fixed build issues
removed image folders used for min, max and sum
reverted unwanted file changes
* minor changes in test suite
* removed support for unwanted test case in Tensor_hip.cpp
* Adds new option roi
* remove testAllScripts.sh
* Adds roi Option in HIP backend
* Implement f32 variants
* Implement f16 and i8 datatype variants
* change F32 load and store logic
* Add build flags in CMakeLists.txt to set AVX/SSE flags based on the system configuration
* minor code changes
* Initial commit - Image sum Reduction kernel
Includes u8 PLN1 -> PLN1 conversion for HOST Tensor
* Implement PKD3 and PLN3 for Image sum Tensor HOST
* Support i8, f16 and f32 datatypes
* Initial commit - Image sum Reduction HIP kernel
Includes u8 PLN1 -> PLN1 conversion for Tensor
* Implement PKD3 and PLN3 for Image sum Tensor HIP
* Add support in testsuite
Revert normalization for i8 HOST Tensor variants
* Fix HIP testsuite
Remove additional blanks for 1 channel output
* Modify print statement in HIP testsuite
* Improve readability for testsuite outputs
* optimized the store function for F32 PLN3-PKD3
* reverted back irrelevant changes
* minor change
* Fix HIP to support larger inputs
* optimized load and store functions for water U8 and F32 variants in host
removed commented code
* Cleanup
* removed golden outputs for water
* minor changes
* Cleanup
Support Reduction QA test in testsuite
* renamed few functions and removed unused functions
updated i8 pln1 load as per the optimized u8 pln1 load
* fixed bug in i8 load function
* Remove unused variables and C style casting
* changed cast to c++ style
resolved spacing issues and added comments for AVX codes for better understanding
made changes to handle cases where QA Tests are not supported
* added golden outputs for water
* updated golden outputs with latest changes
* modified the u8, i8 pkd3-pln3 function and added comments for the vectorized code
* fixed minor bug in I8 variants
* Optimize u8 datatype further
* Fix static_cast
* made changes to resolve codacy warnings
* changed cast to c++ style in hip kernel
* Initial commit - Ricap HOST Tensor
Includes testsuite changes
* Add QA tests for RICAP
Used three_images_224x224_src1 folder to create golden outputs
* Add three_images_224x224_src1 into TEST_IMAGES
* added rotate case with golden outputs
changed generic bilinear HOST codes to match with HIP codes
* Add golden output for remaining all tensor augmentations
* fix python script issues
* Optimize u8 and i8 datatype
Uses uint and int internal processing instead of float
* Fix testsuite build errors
* minor change
* Fix QA check
* Modify api naming from image_sum to tensor_sum
Includes changes for both HOST and HIP
* Support HIP Backend for RICAP
* change rcm and rmn golden outputs
* Fix HIP pkd3->pkd3 variant
* changes based on review comments
* change test_suite folder to tests
* Optimize u8 and i8 datatype of HIP
Includes modification in naming of shared memory
* minor fix
* changed generic nn F32 loads using gather and setr instructions
* Optimize and cleanup U8 HIP
* regenerated golden outputs for RICAP
minor changes in HOST shell script for handling RICAP in QA mode
* minor bug fix in RICAP HIP kernels
* Fix i8 datatype variants
Includes cleanup
* Fix the issues with color_to_greyscale
* remove the empty folder creation
* reverting back the folder name change
* minor change
* added comments for latest changes
* minor change
* Improve readability and Cleanup
* Fix QA for HIP
Includes cleanup
* resolved review comments
* minor change
* Modify api naming from image_ to tensor_ for HOST
* Add support for QA tests
* removed range check for RMN U8-F32 and U8-F16 variants
changed from hipMemset to hipMemsetAsync for RMN HIP Kernel
removed multiplication by 255 for stdDev in RMN HOST U8-F16 and U8-F32 variants
* Modify naming of shared memory with _smem in HIP
Includes cleanup
* Typecast and reuse markArr for HIP U8 and I8
* Cleanup and minor optimization
* minor fix
* fix codacy warnings
* Additional cleanup
* Cleanup and move #define
* Changed the complexity of if statements in runTests.py
* Cleanup testsuite
Includes new golden outputs
* Additional testsuite fixes
* Minor cleanup
* Codacy fixes
* Fix codacy warnings
* Codacy fix
* Address other codacy warnings
* cleanup
* Change Image functions to generic
* Update ricap.hpp with reference paper
* resolved minor issues introduced by the merge
* minor changes
* fixed minor issue with getting profiler times
* minor formatting changes
* resolved build issues in test suite
renamed the min and max kernel file names
* RPP RICAP Tensor for HOST and HIP (#213)
* Initial commit - Ricap HOST Tensor
Includes testsuite changes
* Add QA tests for RICAP
Used three_images_224x224_src1 folder to create golden outputs
* Add three_images_224x224_src1 into TEST_IMAGES
* Support HIP Backend for RICAP
* Fix HIP pkd3->pkd3 variant
* regenerated golden outputs for RICAP
minor changes in HOST shell script for handling RICAP in QA mode
* minor bug fix in RICAP HIP kernels
* Improve readability and Cleanup
* Additional cleanup
* Cleanup testsuite
Includes new golden outputs
* Additional testsuite fixes
* Minor cleanup
* Fix codacy warnings
* Address other codacy warnings
* Update ricap.hpp with reference paper
* Add RICAP dataset path in readme
* Make changes to error codes returned
* Modify roi crop region for unit and perf tests
* RPP Tensor Water Augmentation on HOST and HIP (#181)
* added water HOST and HIP codes
* added water case in test suite
* added golden outputs for water
* added omp thread changes for water augmentation
* experimental changes
* fixed output issue with AVX2 instructions
* added AVX2 support for PKD3 load function
minor changes in PLN variant load functions
* nwc commit - added avx2 changes for u8 layout toggle variants but need to add store functions for completion
* Add Avx2 implementation for F32 and U8 toggle variants
* Add AVX2 support for u8 pkd3-pln3 and i8 pkd3-pln3 for water augmentation
* change F32 load and store logic
* optimized the store function for F32 PLN3-PKD3
* reverted back irrelevant changes
* minor change
* optimized load and store functions for water U8 and F32 variants in host
removed commented code
* removed golden outputs for water
* minor changes
* renamed few functions and removed unused functions
updated i8 pln1 load as per the optimized u8 pln1 load
* fixed bug in i8 load function
* changed cast to c++ style
resolved spacing issues and added comments for AVX codes for better understanding
made changes to handle cases where QA Tests are not supported
* added golden outputs for water
* updated golden outputs with latest changes
* modified the u8, i8 pkd3-pln3 function and added comments for the vectorized code
* fixed minor bug in I8 variants
* made changes to resolve codacy warnings
* changed cast to c++ style in hip kernel
* changed generic nn F32 loads using gather and setr instructions
* added comments for latest changes
* minor change
* added definition for storing 32 and 64 bits from a 128bit register
---------
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* Fix build error
* CMakeLists - Version Update
1.5.0 - TOT Version
* CHANGELOG Updates
Version 1.5.0 placeholder
* Boost deps fix for test suite
---------
Co-authored-by: Snehaa Giridharan <snehaa@multicorewareinc.com>
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
Co-authored-by: Snehaa-Giridharan <118163708+snehaa8@users.noreply.github.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
Co-authored-by: Kiriti Gowda <kiritigowda@gmail.com>
* Documentation - Readme & changelog updates (#251)
* readme and changelog updates for 6.0
* minor update
* Documentation - Bump rocm-docs-core[api_reference] from 0.26.0 to 0.27.0 in /docs/sphinx (#253)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.26.0 to 0.27.0.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/RadeonOpenCompute/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.26.0...v0.27.0)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* RPP Resize Mirror Normalize Bugfix (#252)
* added fix for hipMemset
* remove pixel check for U8-F32 and U8-F16 for HOST codes
---------
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
* Cmake fix to prevent warning
* Fix paths in new python scripts
* Sphinx - updates (#257)
* Sphinx - updates
* Doxygen - Updates
* Docs - Remove index.md
* Test suite fixes after tensor_min / tensor_max HOST merge
* Fix max case
* QA tests fix for hip and host
* naming convention changes as per new std
* Substitute imagePartial with partial
* Substitute imageMin/imageMax with min/max
* Replace hipMemset with hipMemsetAsync, and replace hipDeviceSynchronize with hipStreamSynchronize
* Use variable instead of batchCount*4
* Use post increment effectively
* Resolve codacy warnings
* Additional cleanup
* remove unused variable
* Documentation - Bump rocm-docs-core[api_reference] from 0.28.0 to 0.29.0 in /docs/sphinx (#265)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.28.0 to 0.29.0.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/RadeonOpenCompute/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.28.0...v0.29.0)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* Remove auto merge boost
* Spaces formatting
* Bump rocm-docs-core[api_reference] from 0.29.0 to 0.30.1 in /docs/sphinx (#268)
Bumps [rocm-docs-core[api_reference]](https://github.com/Rade…
* Update CMakeLists.txt
Version Upgrade
* Bump rocm-docs-core[api_reference] from 0.38.1 to 1.0.0 in /docs/sphinx (#337)
* Bump rocm-docs-core[api_reference] from 0.38.1 to 1.0.0 in /docs/sphinx
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.38.1 to 1.0.0.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.38.1...v1.0.0)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com>
* Use Python 3.10 in RTD config
---------
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Sam Wu <sam.wu2@amd.com>
* Bump rocm-docs-core[api_reference] from 1.0.0 to 1.1.0 in /docs/sphinx (#339)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 1.0.0 to 1.1.0.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v1.0.0...v1.1.0)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* RPP Gaussian Noise Voxel Tensor on HOST and HIP (#323)
* License - updates to 2024 and consistency changes (#298)
* Match all CMakeLists.txt license as per RPP's outermost LICENSE file
* Match all python files' license as per RPP's outermost LICENSE file
* Match all .hpp files' license as per RPP's outermost LICENSE file
* Match all .cpp files' license as per RPP's outermost LICENSE file
* Match all .h files' license as per RPP's outermost LICENSE file
* Remove all rights reserved as per LICENSE file
* Remove double space in "Copyright (c) 2019 - 2023 Advanced Micro Devices, Inc."
* Match all .cmake files' license as per RPP's outermost LICENSE file
* Match all .cpp.in files' license as per RPP's outermost LICENSE file
* Replace 283 occurrences in 282 files - 2023 to 2024
* Add "MIT License" title to 281 instances
* Add missing license
* Test - Update README.md for test_suite (#299)
* Bump rocm-docs-core[api_reference] from 0.33.0 to 0.33.1 in /docs/sphinx (#301)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.33.0 to 0.33.1.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.33.0...v0.33.1)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* Bump rocm-docs-core[api_reference] from 0.33.1 to 0.33.2 in /docs/sphinx (#302)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.33.1 to 0.33.2.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.33.1...v0.33.2)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* Update doc codeowners (#303)
* Documentation - Bump rocm-docs-core[api_reference] from 0.33.2 to 0.34.0 in /docs/sphinx (#304)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.33.2 to 0.34.0.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.33.2...v0.34.0)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* Test suite - upgrade 5 qa perf (#305)
* experimental changes for adding qa mode for performance tests
* made changes to display more information w.r.t QA results summary for performance tests
* minor changes
* Add changes to dump qa results to excel file
* Add performance QA for three new tensor functions
* update prerequisites in readme
* added changes to handle unsupported cases
* removed threshold dictionary and added performance Noise threshold
add new dataset for performance QA
* RPP Test Suite Upgrade 4 - CSV to BIN conversions for file size reduction (#293)
* change golden outputs from .csv files to .bin files
* Changed comparison functions to use .bin files
* Address review comments
* minor change
* Address review comments
* minor change
---------
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* Changes to the performance summary dataframe
* minor changes
* Update CMakeLists.txt to add ${CMAKE_CURRENT_SOURCE_DIR} for CI
* Update CMakeLists.txt fix
* Update CMakeLists.txt fix
* remove tabulate dependency
* Update README.md to remove tabulate pip install
* Fix for CI machine failure
* Add note on performance
---------
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
Co-authored-by: Abishek <52214183+r-abishek@users.noreply.github.com>
Co-authored-by: Snehaa Giridharan <snehaa@multicorewareinc.com>
Co-authored-by: r-abishek <abishek@multicorewareinc.com>
* RPP Color Temperature on HOST and HIP (#271)
* Initial commit - Color Temperature HOST Tensor
* Initial commit - Color Temperature HIP Tensor
* Add color temperature golden outputs
* address review comments
* Use reinterpret_cast instead of static_cast
* Combine templated functions to support all datatypes into one
(got minor perf difference of order 3%)
Also fixes indentation
* Fix i8 datatype
* Cleanup
* RPP Test Suite Upgrade 4 - CSV to BIN conversions for file size reduction (#293)
* change golden outputs from .csv files to .bin files
* Changed comparison functions to use .bin files
* Address review comments
* minor change
* Address review comments
* minor change
---------
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* Fix PLN3 variant outputs
Also modifies reference outputs
* Update color_temperature.hpp license
* Delete color_temperature_u8_Tensor_PKD3.csv
* Delete color_temperature_u8_Tensor_PLN3.csv
---------
Co-authored-by: snehaa8 <snehaa@multicorewareinc.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
Co-authored-by: Snehaa-Giridharan <118163708+snehaa8@users.noreply.github.com>
* RPP Voxel 3D Tensor Add/Subtract scalar on HOST and HIP (#272)
* added HOST support for voxel add kernel
* added HIP support for voxel add kernel
* added test suite support for add scalar
* added Doxygen support and modified hip kernel function names as per new standard
* added HOST support for voxel subtract kernel
* added HIP support for voxel subtract kernel
* added test suite support
* updated the golden outputs for subtract with correct values
* removed unnecessary validation checks
* Remove double spaces
* Fix header
* Fix all retval docs
* Fix docs to add memory type
* Fix comment
* Add divider comment
* Use post-increment efficiently
* RPP Test Suite Upgrade 4 - CSV to BIN conversions for file size reduction (#293)
* change golden outputs from .csv files to .bin files
* Changed comparison functions to use .bin files
* Address review comments
* minor change
* Address review comments
* minor change
---------
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* converted add and subtract scalar golden outputs to bin files
* changed copyright from 2023 to 2024
* Update add_scalar.hpp license
* Update subtract_scalar.hpp license
---------
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* RPP Magnitude on HOST and HIP (#278)
* Initial commit - Magnitude HOST Tensor
* Add QA reference outputs
* Update runTests.py
* Initial commit - Magnitude HIP Tensor
* Add dual input support in testsuite
* Optimize HOST kernel further
* Optimize i8 datatype further
* Modify comments
* RPP Test Suite Upgrade 4 - CSV to BIN conversions for file size reduction (#293)
* change golden outputs from .csv files to .bin files
* Changed comparison functions to use .bin files
* Address review comments
* minor change
* Address review comments
* minor change
---------
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* Bump rocm-docs-core[api_reference] from 0.31.0 to 0.33.0 in /docs/sphinx (#294)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.31.0 to 0.33.0.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.31.0...v0.33.0)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* Update Copyright year
* Combine templated functions to support all datatypes
* Modify format of reference outputs
* Update rppi_arithmetic_operations.h license
* Update rppt_tensor_arithmetic_operations.h license
* Update host_tensor_arithmetic_operations.hpp
* Update magnitude.hpp license
* Update hip_tensor_arithmetic_operations.hpp license
* Delete magnitude_u8_Tensor_PKD3.csv
* Delete magnitude_u8_Tensor_PLN1.csv
* Delete magnitude_u8_Tensor_PLN3.csv
* Update rpp_test_suite_common.h license
* Update runTests.py license
* Update Tensor_hip.cpp license
* Update runTests.py license
* Update Tensor_host.cpp license
---------
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: Snehaa Giridharan <snehaa@multicorewareinc.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Snehaa-Giridharan <118163708+snehaa8@users.noreply.github.com>
* Bump rocm-docs-core[api_reference] from 0.34.0 to 0.34.2 in /docs/sphinx (#309)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.34.0 to 0.34.2.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.34.0...v0.34.2)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* RPP Tensor Audio Support - Down Mixing (#296)
* Initial commit - Non silent region detection
Includes unittest setup
* Initial commit - To Decibels
Includes unittest setup
* Initial commit - pre_emphasis_filter
* Initial commit - down_mixing
* Replace vectors with arrays
* Cleanup
* Minor cleanup
* Optimize downmixing Kernel
Includes cleanup
* Replace Rpp64s with Rpp32s
* Cleanup
* Optimize and precompute cutOff
* Fix buffer used
* Fix buffer used
* Additional Cleanup
* Optimize post increment operation
* Optimize post increment operation
* Update testsuite for Audio
* code cleanup
* Add Readme file for Audio test suite
* changes based on review comments
* minor change
* Remove unittest folders and updated README.md
* Remove unit tests
* minor change
* code cleanup
* added common header file for audio helper functions
* removed unnecessary audio wav files
fixed bug in ROI update for audio test suite
resolved issue in summary generation for performance tests in python
* removed log file
* added doxygen support for audio
* added doxygen changes for to_decibels
* updated test suite support for to_decibels
* minor change
* added doxygen changes for preemphasis filter
* updated changes for preemphasis filter in test suite
* removed the usage of getMax function and used std::max_element
* modularized code in test suite
* merge with latest changes
* minor change
* minor change
* minor change
* resolved codacy warnings
* Codacy fix - Remove unused cpuTime
* CMakeLists - Version Update
1.5.0 - TOT Version
* CHANGELOG Updates
Version 1.5.0 placeholder
* resolved issue with file_system dependency in test suite
* Doxygen changes
changed malloc to new in NSR kernel
* RPP RICAP Tensor for HOST and HIP (#213)
* Initial commit - Ricap HOST Tensor
Includes testsuite changes
* Add QA tests for RICAP
Used three_images_224x224_src1 folder to create golden outputs
* Add three_images_224x224_src1 into TEST_IMAGES
* Support HIP Backend for RICAP
* Fix HIP pkd3->pkd3 variant
* regenerated golden outputs for RICAP
minor changes in HOST shell script for handling RICAP in QA mode
* minor bug fix in RICAP HIP kernels
* Improve readability and Cleanup
* Additional cleanup
* Cleanup testsuite
Includes new golden outputs
* Additional test suite fixes
* Minor cleanup
* Fix codacy warnings
* Address other codacy warnings
* Update ricap.hpp with reference paper
* Add RICAP dataset path in readme
* Make changes to error codes returned
* Modify roi crop region for unit and perf tests
* RPP Tensor Water Augmentation on HOST and HIP (#181)
* added water HOST and HIP codes
* added water case in test suite
* added golden outputs for water
* added omp thread changes for water augmentation
* experimental changes
* fixed output issue with AVX2 instructions
* added AVX2 support for PKD3 load function
minor changes in PLN variant load functions
* nwc commit - added avx2 changes for u8 layout toggle variants but need to add store functions for completion
* Add Avx2 implementation for F32 and U8 toggle variants
* Add AVX2 support for u8 pkd3-pln3 and i8 pkd3-pln3 for water augmentation
* change F32 load and store logic
* optimized the store function for F32 PLN3-PKD3
* reverted back irrelevant changes
* minor change
* optimized load and store functions for water U8 and F32 variants in host
removed commented code
* removed golden outputs for water
* minor changes
* renamed few functions and removed unused functions
updated i8 pln1 load as per the optimized u8 pln1 load
* fixed bug in i8 load function
* changed cast to c++ style
resolved spacing issues and added comments for AVX codes for better understanding
made changes to handle cases where QA Tests are not supported
* added golden outputs for water
* updated golden outputs with latest changes
* modified the u8, i8 pkd3-pln3 function and added comments for the vectorized code
* fixed minor bug in I8 variants
* made changes to resolve codacy warnings
* changed cast to c++ style in hip kernel
* changed generic nn F32 loads using gather and setr instructions
* added comments for latest changes
* minor change
* added definition for storing 32 and 64 bits from a 128bit register
---------
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* Fix build error
* CMakeLists - Version Update
1.5.0 - TOT Version
* CHANGELOG Updates
Version 1.5.0 placeholder
* Boost deps fix for test suite
---------
Co-authored-by: Snehaa Giridharan <snehaa@multicorewareinc.com>
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
Co-authored-by: Snehaa-Giridharan <118163708+snehaa8@users.noreply.github.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
Co-authored-by: Kiriti Gowda <kiritigowda@gmail.com>
* Documentation - Readme & changelog updates (#251)
* readme and changelog updates for 6.0
* minor update
* added ctests for audio test suite for CI
made changes to add more clarity on the QA Tests results
* Cmake mods for ctest
* HOST-only build error bugfix
* added qa mode parameter to python audio script
added golden output map for QA testing of Non silent region detection
* minor change
* Documentation - Bump rocm-docs-core[api_reference] from 0.26.0 to 0.27.0 in /docs/sphinx (#253)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.26.0 to 0.27.0.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/RadeonOpenCompute/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.26.0...v0.27.0)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* RPP Resize Mirror Normalize Bugfix (#252)
* added fix for hipMemset
* remove pixel check for U8-F32 and U8-F16 for HOST codes
---------
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
* added example for MMS calculation in comments for better understanding
* Sphinx - updates (#257)
* Sphinx - updates
* Doxygen - Updates
* Docs - Remove index.md
* updated info used for running audio test suite
* removed bitdepth variable from audio test suite
* added more information on computing NSR outputs in the example added
* Fix doxygen for decibels
Also removes extra QA reference files
* move tensor_host_audio.cpp to host folder
* Fix build errors and qa tests in Audio Test suite
* Fix build errors and qa tests in Audio Test suite
* Add reference output and test samples for downmix
* Add down_mix in augmentation list and supported cases
* Remove auto-merge repeated funcs
* Improve clarity of header docs
* Remove blank line
* Improve clarity on header docs
* Add Doxygen comments
* minor change
* converted golden outputs to binary file for downmixing
* removed old golden output file for preemphasis and todecibels
* modified info for downmixing as per new changes
used handle memory for temporary buffers
* formatting changes
* moved the common code for SSE and AVX to outside
* Update down_mixing.hpp license
* Update rppt_tensor_audio_augmentations.h
* combined the srcLength and channels tensors into single tensor
---------
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: Snehaa Giridharan <snehaa@multicorewareinc.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
Co-authored-by: Kiriti Gowda <kiritigowda@gmail.com>
Co-authored-by: Snehaa-Giridharan <118163708+snehaa8@users.noreply.github.com>
Co-authored-by: Lisa <lisajdelaney@gmail.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Sundarrajan98 <sundarrajan@multicorewareinc.com>
* RPP Voxel 3D Tensor Multiply scalar on HOST and HIP (#306)
* added HIP support for voxel scalar multiply kernel
* added HOST support for voxel multiply kernel
added golden outputs for voxel multiply kernel
* merge with master
* RPP Test Suite Upgrade 4 - CSV to BIN conversions for file size reduction (#293)
* change golden outputs from .csv files to .bin files
* Changed comparison functions to use .bin files
* Address review comments
* minor change
* Address review comments
* minor change
---------
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* converted multiply scalar voxel golden outputs to bin files
* changed copyright from 2023 to 2024
---------
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* Test Suite Bugfix (#307)
* experimental changes for adding qa mode for performance tests
* made changes to display more information w.r.t QA results summary for performance tests
* minor changes
* Add changes to dump qa results to excel file
* Add performance QA for three new tensor functions
* update prerequisites in readme
* added changes to handle unsupported cases
* removed threshold dictionary and added performance Noise threshold
add new dataset for performance QA
* RPP Test Suite Upgrade 4 - CSV to BIN conversions for file size reduction (#293)
* change golden outputs from .csv files to .bin files
* Changed comparison functions to use .bin files
* Address review comments
* minor change
* Address review comments
* minor change
---------
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* Changes to the performance summary dataframe
* minor changes
* Update CMakeLists.txt to add ${CMAKE_CURRENT_SOURCE_DIR} for CI
* Update CMakeLists.txt fix
* Update CMakeLists.txt fix
* remove tabulate dependency
* Update README.md to remove tabulate pip install
* Fix for CI machine failure
* Add note on performance
* Fix segmentation fault
* Revert QAmode to restrict HIP bitdepths
* Use Rpp64u for HOST while comparing outputs
* Fix ambiguous abs call
* Fix for SLES CI HIP fail - error: incompatible pointer types assigning to 'unsigned long *' from 'unsigned long long *' - refOutput = TensorSumReferenceOutputs[numChannels].data();
---------
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
Co-authored-by: Snehaa Giridharan <snehaa@multicorewareinc.com>
Co-authored-by: Pavel Tcherniaev <Pavel.Tcherniaev@amd.com>
* Initial commit
* Merge changes and fixes for gaussian noise 3d
* Test suite merge and fixes for gaussian noise 3d
* added initial support for gaussian noise HOST NDHWC variant
* added NCDHW support
* added u8 and i8 bitdepth support
* updated gaussian noise voxel host outer api to match with hip api
merged gaussian noise voxel kernel codes in 2d kernel codes
* resolved black pixels issue across border
* minor changes
* modified HIP kernel as per the latest changes
* modified the description as per the latest changes
* made changes in test suite
* added new host compute functions for gaussian noise 3d
* Bump rocm-docs-core[api_reference] from 0.35.0 to 0.35.1 in /docs/sphinx (#319)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.35.0 to 0.35.1.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.35.0...v0.35.1)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* moved the copy 3d function to rpp_cpu_common.hpp
* reverted incorrect changes happened with merge
* fix test suite issue with RMN
* revert incorrect merge changes
remove empty blank lines
* modify suffix from 3d to voxel for gaussian noise
added U8 support for gaussian noise HIP voxel kernel
* added separate copy kernel for copying input to output when mean and stddev passed is 0
* Bump rocm-docs-core[api_reference] from 0.35.1 to 0.36.0 in /docs/sphinx (#322)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.35.1 to 0.36.0.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.35.1...v0.36.0)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* fixed bug in test suite
* Docs - Bump rocm-docs-core[api_reference] from 0.36.0 to 0.37.0 in /docs/sphinx (#328)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.36.0 to 0.37.0.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.36.0...v0.37.0)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* Link cleanup (#326)
* link updates
* update tables
* pare down index
* API cleanup
* consistency
* verbiage
* Update notes
* change function name from CHECK to CHECK_RETURN_STATUS
* Docs - Bump rocm-docs-core[api_reference] from 0.37.0 to 0.37.1 in /docs/sphinx (#329)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.37.0 to 0.37.1.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.37.0...v0.37.1)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* RPP Voxel Flip on HIP and HOST (#285)
* added support for flip voxel
* added test suite support
* added golden outputs for flip voxel
made changes in test suite to run QA tests for flip
* updated golden outputs with correct values
* minor bug fix in the hip test suite
* made changes to variable names for better readability
fixed comments in test suite
minor cleanup
* combined the flip axis factor as ternary operator in HIP kernel
added new enum for error handling when source and destination layouts are not matching
* RPP Test Suite Upgrade 4 - CSV to BIN conversions for file size reduction (#293)
* change golden outputs from .csv files to .bin files
* Changed comparison functions to use .bin files
* Address review comments
* minor change
* Address review comments
* minor change
---------
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* converted flip voxel golden outputs to bin files
* changed copyright from 2023 to 2024
* Update flip_voxel.hpp license
* License - updates to 2024 and consistency changes (#298)
* Match all CMakeLists.txt license as per RPP's outermost LICENSE file
* Match all python files' license as per RPP's outermost LICENSE file
* Match all .hpp files' license as per RPP's outermost LICENSE file
* Match all .cpp files' license as per RPP's outermost LICENSE file
* Match all .h files' license as per RPP's outermost LICENSE file
* Remove all rights reserved as per LICENSE file
* Remove double space in "Copyright (c) 2019 - 2023 Advanced Micro Devices, Inc."
* Match all .cmake files' license as per RPP's outermost LICENSE file
* Match all .cpp.in files' license as per RPP's outermost LICENSE file
* Replace 283 occurrences in 282 files - 2023 to 2024
* Add "MIT License" title to 281 instances
* Add missing license
* Test - Update README.md for test_suite (#299)
* Bump rocm-docs-core[api_reference] from 0.33.0 to 0.33.1 in /docs/sphinx (#301)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.33.0 to 0.33.1.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.33.0...v0.33.1)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* Bump rocm-docs-core[api_reference] from 0.33.1 to 0.33.2 in /docs/sphinx (#302)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.33.1 to 0.33.2.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.33.1...v0.33.2)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* Update doc codeowners (#303)
* Documentation - Bump rocm-docs-core[api_reference] from 0.33.2 to 0.34.0 in /docs/sphinx (#304)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.33.2 to 0.34.0.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.33.2...v0.34.0)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* Test suite - upgrade 5 qa perf (#305)
* experimental changes for adding qa mode for performance tests
* made changes to display more information w.r.t QA results summary for performance tests
* minor changes
* Add changes to dump qa results to excel file
* Add performance QA for three new tensor functions
* update prerequisites in readme
* added changes to handle unsupported cases
* removed threshold dictionary and added performance Noise threshold
add new dataset for performance QA
* RPP Test Suite Upgrade 4 - CSV to BIN conversions for file size reduction (#293)
* change golden outputs from .csv files to .bin files
* Changed comparison functions to use .bin files
* Address review comments
* minor change
* Address review comments
* minor change
---------
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* Changes to the performance summary dataframe
* minor changes
* Update CMakeLists.txt to add ${CMAKE_CURRENT_SOURCE_DIR} for CI
* Update CMakeLists.txt fix
* Update CMakeLists.txt fix
* remove tabulate dependency
* Update README.md to remove tabulate pip install
* Fix for CI machine failure
* Add note on performance
---------
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
Co-authored-by: Abishek <52214183+r-abishek@users.noreply.github.com>
Co-authored-by: Snehaa Giridharan <snehaa@multicorewareinc.com>
Co-authored-by: r-abishek <abishek@multicorewareinc.com>
* RPP Color Temperature on HOST and HIP (#271)
* Initial commit - Color Temperature HOST Tensor
* Initial commit - Color Temperature HIP Tensor
* Add color temperature golden outputs
* address review comments
* Use reinterpret_cast instead of static_cast
* Combine templated functions to support all datatypes into one
(got minor perf difference of order 3%)
Also fixes indentation
* Fix i8 datatype
* Cleanup
* RPP Test Suite Upgrade 4 - CSV to BIN conversions for file size reduction (#293)
* change golden outputs from .csv files to .bin files
* Changed comparison functions to use .bin files
* Address review comments
* minor change
* Address review comments
* minor change
---------
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* Fix PLN3 variant outputs
Also modifies reference outputs
* Update color_temperature.hpp license
* Delete color_temperature_u8_Tensor_PKD3.csv
* Delete color_temperature_u8_Tensor_PLN3.csv
---------
Co-authored-by: snehaa8 <snehaa@multicorewareinc.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
Co-authored-by: Snehaa-Giridharan <118163708+snehaa8@users.noreply.github.com>
* RPP Voxel 3D Tensor Add/Subtract scalar on HOST and HIP (#272)
* added HOST support for voxel add kernel
* added HIP support for voxel add kernel
* added test suite support for add scalar
* added Doxygen support and modified hip kernel function names as per new standard
* added HOST support for voxel subtract kernel
* added HIP support for voxel subtract kernel
* added test suite support
* updated the golden outputs for subtract with correct values
* removed unnecessary validation checks
* Remove double spaces
* Fix header
* Fix all retval docs
* Fix docs to add memory type
* Fix comment
* Add divider comment
* Use post-increment efficiently
* RPP Test Suite Upgrade 4 - CSV to BIN conversions for file size reduction (#293)
* change golden outputs from .csv files to .bin files
* Changed comparison functions to use .bin files
* Address review comments
* minor change
* Address review comments
* minor change
---------
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* converted add and subtract scalar golden outputs to bin files
* changed copyright from 2023 to 2024
* Update add_scalar.hpp license
* Update subtract_scalar.hpp license
---------
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* RPP Magnitude on HOST and HIP (#278)
* Initial commit - Magnitude HOST Tensor
* Add QA reference outputs
* Update runTests.py
* Initial commit - Magnitude HIP Tensor
* Add dual input support in testsuite
* Optimize HOST kernel further
* Optimize i8 datatype further
* Modify comments
* RPP Test Suite Upgrade 4 - CSV to BIN conversions for file size reduction (#293)
* change golden outputs from .csv files to .bin files
* Changed comparison functions to use .bin files
* Address review comments
* minor change
* Address review comments
* minor change
---------
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* Bump rocm-docs-core[api_reference] from 0.31.0 to 0.33.0 in /docs/sphinx (#294)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.31.0 to 0.33.0.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.31.0...v0.33.0)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* Update Copyright year
* Combine templated functions to support all datatypes
* Modify format of reference outputs
* Update rppi_arithmetic_operations.h license
* Update rppt_tensor_arithmetic_operations.h license
* Update host_tensor_arithmetic_operations.hpp
* Update magnitude.hpp license
* Update hip_tensor_arithmetic_operations.hpp license
* Delete magnitude_u8_Tensor_PKD3.csv
* Delete magnitude_u8_Tensor_PLN1.csv
* Delete magnitude_u8_Tensor_PLN3.csv
* Update rpp_test_suite_common.h license
* Update runTests.py license
* Update Tensor_hip.cpp license
* Update runTests.py license
* Update Tensor_host.cpp license
---------
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: Snehaa Giridharan <snehaa@multicorewareinc.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Snehaa-Giridharan <118163708+snehaa8@users.noreply.github.com>
* Bump rocm-docs-core[api_reference] from 0.34.0 to 0.34.2 in /docs/sphinx (#309)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.34.0 to 0.34.2.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.34.0...v0.34.2)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* RPP Tensor Audio Support - Down Mixing (#296)
* Initial commit - Non silent region detection
Includes unittest setup
* Initial commit - To Decibels
Includes unittest setup
* Initial commit - pre_emphasis_filter
* Initial commit - down_mixing
* Replace vectors with arrays
* Cleanup
* Minor cleanup
* Optimize downmixing Kernel
Includes cleanup
* Replace Rpp64s with Rpp32s
* Cleanup
* Optimize and precompute cutOff
* Fix buffer used
* Fix buffer used
* Additional Cleanup
* Optimize post increment operation
* Optimize post increment operation
* Update testsuite for Audio
* code cleanup
* Add Readme file for Audio test suite
* changes based on review comments
* minor change
* Remove unittest folders and updated README.md
* Remove unit tests
* minor change
* code cleanup
* added common header file for audio helper functions
* removed unnecessary audio wav files
fixed bug in ROI update for audio test suite
resolved issue in summary generation for performance tests in python
* removed log file
* added doxygen support for audio
* added doxygen changes for to_decibels
* updated test suite support for to_decibels
* minor change
* added doxygen changes for preemphasis filter
* updated changes for preemphasis filter in test suite
* removed the usage of getMax function and used std::max_element
* modularized code in test suite
* merge with latest changes
* minor change
* minor change
* minor change
* resolved codacy warnings
* Codacy fix - Remove unused cpuTime
* CMakeLists - Version Update
1.5.0 - TOT Version
* CHANGELOG Updates
Version 1.5.0 placeholder
* resolved issue with file_system dependency in test suite
* Doxygen changes
changed malloc to new in NSR kernel
* RPP RICAP Tensor for HOST and HIP (#213)
* Initial commit - Ricap HOST Tensor
Includes testsuite changes
* Add QA tests for RICAP
Used three_images_224x224_src1 folder to create golden outputs
* Add three_images_224x224_src1 into TEST_IMAGES
* Support HIP Backend for RICAP
* Fix HIP pkd3->pkd3 variant
* regenerated golden outputs for RICAP
minor changes in HOST shell script for handling RICAP in QA mode
* minor bug fix in RICAP HIP kernels
* Improve readability and Cleanup
* Additional cleanup
* Cleanup testsuite
Includes new golden outputs
* Additional testsuite fixes
* Minor cleanup
* Fix codacy warnings
* Address other codacy warnings
* Update ricap.hpp with reference paper
* Add RICAP dataset path in readme
* Make changes to error codes returned
* Modify roi crop region for unit and perf tests
* RPP Tensor Water Augmentation on HOST and HIP (#181)
* added water HOST and HIP codes
* added water case in test suite
* added golden outputs for water
* added omp thread changes for water augmentation
* experimental changes
* fixed output issue with AVX2 instructions
* added AVX2 support for PKD3 load function
minor changes in PLN variant load functions
* nwc commit - added avx2 changes for u8 layout toggle variants but need to add store functions for completion
* Add Avx2 implementation for F32 and U8 toggle variants
* Add AVX2 support for u8 pkd3-pln3 and i8 pkd3-pln3 for water augmentation
* change F32 load and store logic
* optimized the store function for F32 PLN3-PKD3
* reverted back irrelevant changes
* minor change
* optimized load and store functions for water U8 and F32 variants in host
removed commented code
* removed golden outputs for water
* minor changes
* renamed few functions and removed unused functions
updated i8 pln1 load as per the optimized u8 pln1 load
* fixed bug in i8 load function
* changed cast to c++ style
resolved spacing issues and added comments for AVX codes for better understanding
made changes to handle cases where QA Tests are not supported
* added golden outputs for water
* updated golden outputs with latest changes
* modified the u8, i8 pkd3-pln3 function and added comments for the vectorized code
* fixed minor bug in I8 variants
* made changes to resolve codacy warnings
* changed cast to c++ style in hip kernel
* changed generic nn F32 loads using gather and setr instructions
* added comments for latest changes
* minor change
* added definition for storing 32 and 64 bits from a 128bit register
---------
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* Fix build error
* CMakeLists - Version Update
1.5.0 - TOT Version
* CHANGELOG Updates
Version 1.5.0 placeholder
* Boost deps fix for test suite
---------
Co-authored-by: Snehaa Giridharan <snehaa@multicorewareinc.com>
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
Co-authored-by: Snehaa-Giridharan <118163708+snehaa8@users.noreply.github.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
Co-authored-by: Kiriti Gowda <kiritigowda@gmail.com>
* Documentation - Readme & changelog updates (#251)
* readme and changelog updates for 6.0
* minor update
* added ctests for audio test suite for CI
made changes to add more clarity on the QA Tests results
* Cmake mods for ctest
* HOST-only build error bugfix
* added qa mode parameter to python audio script
added golden output map for QA testing of Non silent region detection
* minor change
* Documentation - Bump rocm-docs-core[api_reference] from 0.26.0 to 0.27.0 in /docs/sphinx (#253)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.26.0 to 0.27.0.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/RadeonOpenCompute/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.26.0...v0.27.0)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* RPP Resize Mirror Normalize Bugfix (#252)
* added fix for hipMemset
* remove pixel check for U8-F32 and U8-F16 for HOST codes
---------
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
* added example for MMS calculation in comments for better understanding
* Sphinx - updates (#257)
* Sphinx - updates
* Doxygen - Updates
* Docs - Remove index.md
* updated info used for running audio test suite
* removed bitdepth variable from audio test suite
* added more information on computing NSR outputs in the example added
* Fix doxygen for decibels
Also removes extra QA reference files
* move tensor_host_audio.cpp to host folder
* Fix build errors and qa tests in Audio Test suite
* Fix build errors and qa tests in Audio Test suite
* Add reference output and test samples for downmix
* Add down_mix in augmentation list and supported cases
* Remove auto-merge repeated funcs
* Improve clarity of header docs
* Remove blank line
* Improve clarity on header docs
* Add Doxygen comments
* minor change
* converted golden outputs to binary file for downmixing
* removed old golden output file for preemphasis and todecibels
* modified info for downmixing as per new changes
used handle memory for temporary buffers
* formatting changes
* moved the common code for SSE and AVX to outside
* Update down_mixing.hpp license
* Update rppt_tensor_audio_augmentations.h
* combined the srcLength and channels tensors into single tensor
---------
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: Snehaa Giridharan <snehaa@multicorewareinc.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
Co-authored-by: Kiriti Gowda <kiritigowda@gmail.com>
Co-authored-by: Snehaa-Giridharan <118163708+snehaa8@users.noreply.github.com>
Co-authored-by: Lisa <lisajdelaney@gmail.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Sundarrajan98 <sundarrajan@multicorewareinc.com>
* RPP Voxel 3D Tensor Multiply scalar on HOST and HIP (#306)
* added HIP support for voxel scalar multiply kernel
* added HOST support for voxel multiply kernel
added golden outputs for voxel multiply kernel
* merge with master
* RPP Test Suite Upgrade 4 - CSV to BIN conversions for file size reduction (#293)
* change golden outputs from .csv files to .bin files
* Changed comparison functions to use .bin files
* Address review comments
* minor change
* Address review comments
* minor change
---------
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* converted multiply scalar voxel golden outputs to bin files
* changed copyright from 2023 to 2024
---------
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* Test Suite Bugfix (#307)
* experimental changes for adding qa mode for performance tests
* made changes to display more information w.r.t QA results summary for performance tests
* minor changes
* Add changes to dump qa results to excel file
* Add performance QA for three new tensor functions
* update prerequisites in readme
* added changes to handle unsupported cases
* removed threshold dictionary and added performance Noise threshold
add new dataset for performance QA
* RPP Test Suite Upgrade 4 - CSV to BIN conversions for file size reduction (#293)
* change golden outputs from .csv files to .bin files
* Changed comparison functions to use .bin files
* Address review comments
* minor change
* Address review comments
* minor change
---------
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* Changes to the performance summary dataframe
* minor changes
* Update CMakeLists.txt to add ${CMAKE_CURRENT_SOURCE_DIR} for CI
* Update CMakeLists.txt fix
* Update CMakeLists.txt fix
* remove tabulate dependency
* Update README.md to remove tabulate pip install
* Fix for CI machine failure
* Add note on performance
* Fix segmentation fault
* Revert QAmode to restrict HIP bitdepths
* Use Rpp64u for HOST while comparing outputs
* Fix ambiguous abs call
* Fix for SLES CI HIP fail - error: incompatible pointer types assigning to 'unsigned long *' from 'unsigned long long *' - refOutput = TensorSumReferenceOutputs[numChannels].data();
---------
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
Co-authored-by: Snehaa Giridharan <snehaa@multicorewareinc.com>
Co-authored-by: Pavel Tcherniaev <Pavel.Tcherniaev@amd.com>
* Bump rocm-docs-core[api_reference] from 0.34.2 to 0.35.0 in /docs/sphinx (#313)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.34.2 to 0.35.0.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/ROCm/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.34.2...v0.35.0)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* RPP Reduction - Tensor min and Tensor max on HOST and HIP (#260)
* Minor Change
* Add Validation check for DST_FOLDER path
* added water HOST and HIP codes
* added water case in test suite
* added golden outputs for water
* Add Validation checks for all options in testAllScript.sh
* Add sanity check for dual Input cases
Set Max Dimension and Max Image Dump
Replaced Fast DCT tag with Accurate DCT
* Regenerate golden outputs using accurate dct Flag
Add golden outputs for some new augmentations
* Fix Flip golden outputs mismatch
Fix PLN3 variants mismatch in QA mode
* Add MAX_BATCH_SIZE check
removed Augmentations function calls for failing Qa modes
code cleanup
* Add crop and gamma correction augmentations
code cleanup
* Add comments to functions in rpp_test_suite_common.h
* minor change
* code cleanup
* minor code changes
* Change roi and Image sizes for crop augmentation
* Change numIterations option to numRuns
Addressed PR comments
* added omp thread changes for water augmentation
* experimental changes
* fixed output issue with AVX2 instructions
* added AVX2 support for PKD3 load function
minor changes in PLN variant load functions
* Add turboJpeg header to update maxHeight and maxWidth values
* nwc commit - added avx2 changes for u8 layout toggle variants but need to add store functions for completion
* Change the performance Timings logic
* Add Avx2 implementation for F32 and U8 toggle variants
* minor change to support u8_f16 and u8_f32 cases
* Regenerate LUT golden outputs with ACCURATE_DCT tag
* Minor code changes
* Add AVX2 support for u8 pkd3-pln3 and i8 pkd3-pln3 for water augmentation
* Made changes to the runTests.py in Host to remove testAllScripts.sh
* Made changes to the runTests.py in HIP to remove testAllScripts.sh
* Initial commit - Image min and max Reduction kernel
Includes
* u8 datatype for both min and max HOST Tensor of all variants.
* Testsuite changes.
* NWC -initial code for min max PLN3 - PLN3
* made changes to split min and max kernels separately
* split kernels for min and max
* made changes to print final max/min in the R,G,B channels
* fixed inaccuracies in min/max computation
* made changes to typecast intermediate output to output requested by user
added comments for the code
code cleanup and minor changes in test suite
* fixed build issues
removed image folders used for min, max and sum
reverted unwanted file changes
* minor changes in test suite
* removed support for unwanted test case in Tensor_hip.cpp
* Adds new option roi
* remove testAllScripts.sh
* Adds roi Option in HIP backend
* Implement f32 variants
* Implement f16 and i8 datatype variants
* change F32 load and store logic
* Add build flags in CMakeLists.txt to set AVX/SSE flags based on the system configuration
* minor code changes
* Initial commit - Image sum Reduction kernel
Includes u8 PLN1 -> PLN1 conversion for HOST Tensor
* Implement PKD3 and PLN3 for Image sum Tensor HOST
* Support i8, f16 and f32 datatypes
* Initial commit - Image sum Reduction HIP kernel
Includes u8 PLN1 -> PLN1 conversion for Tensor
* Implement PKD3 and PLN3 for Image sum Tensor HIP
* Add support in testsuite
Revert normalization for i8 HOST Tensor variants
* Fix HIP testsuite
Remove additional blanks for 1 channel output
* Modify print statement in HIP testsuite
* Improve readability for testsuite outputs
* optimized the store function for F32 PLN3-PKD3
* reverted back irrelevant changes
* minor change
* Fix HIP to support larger inputs
* optimized load and store functions for water U8 and F32 variants in host
removed commented code
* Cleanup
* removed golden outputs for water
* minor changes
* Cleanup
Support Reduction QA test in testsuite
* renamed few functions and removed unused functions
updated i8 pln1 load as per the optimized u8 pln1 load
* fixed bug in i8 load function
* Remove unused variables and C style casting
* changed cast to c++ style
resolved spacing issues and added comments for AVX codes for better understanding
made changes to handle cases where QA Tests are not supported
* added golden outputs for water
* updated golden outputs with latest changes
* modified the u8, i8 pkd3-pln3 function and added comments for the vectorized code
* fixed minor bug in I8 variants
* Optimize u8 datatype further
* Fix static_cast
* made changes to resolve codacy warnings
* changed cast to c++ style in hip kernel
* Initial commit - Ricap HOST Tensor
Includes testsuite changes
* Add QA tests for RICAP
Used three_images_224x224_src1 folder to create golden outputs
* Add three_images_224x224_src1 into TEST_IMAGES
* added rotate case with golden outputs
changed generic bilinear HOST codes to match with HIP codes
* Add golden output for remaining all tensor augmentations
* fix python script issues
* Optimize u8 and i8 datatype
Uses uint and int internal processing instead of float
* Fix testsuite build errors
* minor change
* Fix QA check
* Modify api naming from image_sum to tensor_sum
Includes changes for both HOST and HIP
* Support HIP Backend for RICAP
* change rcm and rmn golden outputs
* Fix HIP pkd3->pkd3 variant
* changes based on review comments
* change test_suite folder to tests
* Optimize u8 and i8 datatype of HIP
Includes modification in naming of shared memory
* minor fix
* changed generic nn F32 loads using gather and setr instructions
* Optimize and cleanup U8 HIP
* regenerated golden outputs for RICAP
minor changes in HOST shell script for handling RICAP in QA mode
* minor bug fix in RICAP HIP kernels
* Fix i8 datatype variants
Includes cleanup
* Fix the issues with color_to_greyscale
* remove the empty folder creation
* reverting back the folder name change
* minor change
* added comments for latest changes
* minor change
* Improve readability and Cleanup
* Fix QA for HIP
Includes cleanup
* resolved review comments
* minor change
* Modify api naming from image_ to tensor_ for HOST
* Add support for QA tests
* removed range check for RMN U8-F32 and U8-F16 variants
changed from hipMemset to hipMemsetAsync for RMN HIP Kernel
removed multiplication by 255 for stdDev in RMN HOST U8-F16 and U8-F32 variants
* Modify naming of shared memory with _smem in HIP
Includes cleanup
* Typecast and reuse markArr for HIP U8 and I8
* Cleanup and minor optimization
* minor fix
* fix codacy warnings
* Additional cleanup
* Cleanup and move #define
* Changed the complexity of if statements in runTests.py
* Cleanup testsuite
Includes new golden outputs
* Additional testsuite fixes
* Minor cleanup
* Codacy fixes
* Fix codacy warnings
* Codacy fix
* Address other codacy warnings
* cleanup
* Change Image functions to generic
* Update ricap.hpp with reference paper
* resolved minor issues that happened with merge
* minor changes
* fixed minor issue with getting profiler times
* minor formatting changes
* resolved build issues in test suite
renamed the min and max kernel file names
* RPP RICAP Tensor for HOST and HIP (#213)
* Initial commit - Ricap HOST Tensor
Includes testsuite changes
* Add QA tests for RICAP
Used three_images_224x224_src1 folder to create golden outputs
* Add three_images_224x224_src1 into TEST_IMAGES
* Support HIP Backend for RICAP
* Fix HIP pkd3->pkd3 variant
* regenerated golden outputs for RICAP
minor changes in HOST shell script for handling RICAP in QA mode
* minor bug fix in RICAP HIP kernels
* Improve readability and Cleanup
* Additional cleanup
* Cleanup testsuite
Includes new golden outputs
* Additional testsuite fixes
* Minor cleanup
* Fix codacy warnings
* Address other codacy warnings
* Update ricap.hpp with reference paper
* Add RICAP dataset path in readme
* Make changes to error codes returned
* Modify roi crop region for unit and perf tests
* RPP Tensor Water Augmentation on HOST and HIP (#181)
* added water HOST and HIP codes
* added water case in test suite
* added golden outputs for water
* added omp thread changes for water augmentation
* experimental changes
* fixed output issue with AVX2 instructions
* added AVX2 support for PKD3 load function
minor changes in PLN variant load functions
* nwc commit - added avx2 changes for u8 layout toggle variants but need to add store functions for completion
* Add Avx2 implementation for F32 and U8 toggle variants
* Add AVX2 support for u8 pkd3-pln3 and i8 pkd3-pln3 for water augmentation
* change F32 load and store logic
* optimized the store function for F32 PLN3-PKD3
* reverted back irrelevant changes
* minor change
* optimized load and store functions for water U8 and F32 variants in host
removed commented code
* removed golden outputs for water
* minor changes
* renamed few functions and removed unused functions
updated i8 pln1 load as per the optimized u8 pln1 load
* fixed bug in i8 load function
* changed cast to c++ style
resolved spacing issues and added comments for AVX codes for better understanding
made changes to handle cases where QA Tests are not supported
* added golden outputs for water
* updated golden outputs with latest changes
* modified the u8, i8 pkd3-pln3 function and added comments for the vectorized code
* fixed minor bug in I8 variants
* made changes to resolve codacy warnings
* changed cast to c++ style in hip kernel
* changed generic nn F32 loads using gather and setr instructions
* added comments for latest changes
* minor change
* added definition for storing 32 and 64 bits from a 128bit register
---------
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
* Fix build error
* CMakeLists - Version Update
1.5.0 - TOT Version
* CHANGELOG Updates
Version 1.5.0 placeholder
* Boost deps fix for test suite
---------
Co-authored-by: Snehaa Giridharan <snehaa@multicorewareinc.com>
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
Co-authored-by: Snehaa-Giridharan <118163708+snehaa8@users.noreply.github.com>
Co-authored-by: HazarathKumarM <hazarathkumar@multicorewareinc.com>
Co-authored-by: Kiriti Gowda <kiritigowda@gmail.com>
* Documentation - Readme & changelog updates (#251)
* readme and changelog updates for 6.0
* minor update
* Documentation - Bump rocm-docs-core[api_reference] from 0.26.0 to 0.27.0 in /docs/sphinx (#253)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.26.0 to 0.27.0.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/RadeonOpenCompute/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.26.0...v0.27.0)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* RPP Resize Mirror Normalize Bugfix (#252)
* added fix for hipMemset
* remove pixel check for U8-F32 and U8-F16 for HOST codes
---------
Co-authored-by: sampath1117 <sampath.rachumallu@multicorewareinc.com>
* Cmake fix to prevent warning
* Fix paths in new python scripts
* Sphinx - updates (#257)
* Sphinx - updates
* Doxygen - Updates
* Docs - Remove index.md
* Test suite fixes after tensor_min / tensor_max HOST merge
* Fix max case
* QA tests fix for hip and host
* naming convention changes as per new std
* Substitute imagePartial with partial
* Substitute imageMin/imageMax with min/max
* Replace hipMemset with hipMemsetAsync, and replace hipDeviceSynchronize with hipStreamSynchronize
* Use variable instead of batchCount*4
* Use post increment effectively
* Resolve codacy warnings
* Additional cleanup
* remove unused variable
* Documentation - Bump rocm-docs-core[api_reference] from 0.28.0 to 0.29.0 in /docs/sphinx (#265)
Bumps [rocm-docs-core[api_reference]](https://github.com/RadeonOpenCompute/rocm-docs-core) from 0.28.0 to 0.29.0.
- [Release notes](https://github.com/RadeonOpenCompute/rocm-docs-core/releases)
- [Changelog](https://github.com/RadeonOpenCompute/rocm-docs-core/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/RadeonOpenCompute/rocm-docs-core/compare/v0.28.0...v0.29.0)
---
updated-dependencies:
- dependency-name: rocm-docs-core[api_reference]
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
* Remove auto merge boost
* Spaces formatting
* Bump rocm-docs-core[api_reference] from 0.29.0 to 0.30.1 in /docs/sphinx (#268)
Bumps [rocm-docs-core[api_reference]](https://github.com/Rade…