RISC-V RVV 0.7: v_add/v_sub saturation and avoiding 64-bit VLEN by mshabunin · Pull Request #23198 · opencv/opencv

mshabunin · 2023-01-30T20:32:08Z

This PR includes two fixes for RISC-V RVV 0.7 intrinsics:

Use non-saturated add/sub instructions for 32 and 64 bit int
Apparently 32- and 64-bit types differ from 8- and 16-bit, e.g. NEON intrinsics have same pattern (vqadd - saturated, vadd - wraps on overflow):

opencv/modules/core/include/opencv2/core/hal/intrin_neon.hpp

Lines 473 to 493 in ff8af10

    
           OPENCV_HAL_IMPL_NEON_BIN_OP(+, v_uint8x16, vqaddq_u8) 
        
           OPENCV_HAL_IMPL_NEON_BIN_OP(-, v_uint8x16, vqsubq_u8) 
        
           OPENCV_HAL_IMPL_NEON_BIN_OP(+, v_int8x16, vqaddq_s8) 
        
           OPENCV_HAL_IMPL_NEON_BIN_OP(-, v_int8x16, vqsubq_s8) 
        
           OPENCV_HAL_IMPL_NEON_BIN_OP(+, v_uint16x8, vqaddq_u16) 
        
           OPENCV_HAL_IMPL_NEON_BIN_OP(-, v_uint16x8, vqsubq_u16) 
        
           OPENCV_HAL_IMPL_NEON_BIN_OP(+, v_int16x8, vqaddq_s16) 
        
           OPENCV_HAL_IMPL_NEON_BIN_OP(-, v_int16x8, vqsubq_s16) 
        
           OPENCV_HAL_IMPL_NEON_BIN_OP(+, v_int32x4, vaddq_s32) 
        
           OPENCV_HAL_IMPL_NEON_BIN_OP(-, v_int32x4, vsubq_s32) 
        
           OPENCV_HAL_IMPL_NEON_BIN_OP(*, v_int32x4, vmulq_s32) 
        
           OPENCV_HAL_IMPL_NEON_BIN_OP(+, v_uint32x4, vaddq_u32) 
        
           OPENCV_HAL_IMPL_NEON_BIN_OP(-, v_uint32x4, vsubq_u32) 
        
           OPENCV_HAL_IMPL_NEON_BIN_OP(*, v_uint32x4, vmulq_u32) 
        
           OPENCV_HAL_IMPL_NEON_BIN_OP(+, v_float32x4, vaddq_f32) 
        
           OPENCV_HAL_IMPL_NEON_BIN_OP(-, v_float32x4, vsubq_f32) 
        
           OPENCV_HAL_IMPL_NEON_BIN_OP(*, v_float32x4, vmulq_f32) 
        
           OPENCV_HAL_IMPL_NEON_BIN_OP(+, v_int64x2, vaddq_s64) 
        
           OPENCV_HAL_IMPL_NEON_BIN_OP(-, v_int64x2, vsubq_s64) 
        
           OPENCV_HAL_IMPL_NEON_BIN_OP(+, v_uint64x2, vaddq_u64) 
        
           OPENCV_HAL_IMPL_NEON_BIN_OP(-, v_uint64x2, vsubq_u64)

v_check_ intrinsics use 32-bit element size to avoid 64-bit operations which are not supported by HW

force_builders=Custom
Xbuild_image:Custom=riscv-gcc
Xbuild_image:Custom=riscv-gcc-rvv
build_image:Custom=riscv-gcc-rvv-128
Xbuild_image:Custom=riscv-clang
Xbuild_image:Custom=riscv-clang-rvv
Xbuild_image:Custom=riscv-clang-rvv-128
test_modules:Custom=core,imgproc,dnn
buildworker:Custom=linux-1,linux-4
test_timeout:Custom=1200
build_contrib:Custom=OFF

…n v_check_

RISC-V/RVV 0.7: v_add/v_sub saturation and avoiding 64-bit register i…

9efaa3c

…n v_check_

asmorkalov requested a review from alalek January 31, 2023 05:50

asmorkalov added the platform: riscv label Jan 31, 2023

alalek approved these changes Jan 31, 2023

View reviewed changes

opencv-pushbot merged commit a6b178a into opencv:4.x Jan 31, 2023

mshabunin deleted the fix-rvv-07 branch February 1, 2023 12:33

asmorkalov mentioned this pull request May 31, 2023

(5.x) Merge 4.x #23718

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

RISC-V RVV 0.7: v_add/v_sub saturation and avoiding 64-bit VLEN#23198

RISC-V RVV 0.7: v_add/v_sub saturation and avoiding 64-bit VLEN#23198
opencv-pushbot merged 1 commit intoopencv:4.xfrom
mshabunin:fix-rvv-07

mshabunin commented Jan 30, 2023 •

edited by alalek

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

	OPENCV_HAL_IMPL_NEON_BIN_OP(+, v_uint8x16, vqaddq_u8)
	OPENCV_HAL_IMPL_NEON_BIN_OP(-, v_uint8x16, vqsubq_u8)
	OPENCV_HAL_IMPL_NEON_BIN_OP(+, v_int8x16, vqaddq_s8)
	OPENCV_HAL_IMPL_NEON_BIN_OP(-, v_int8x16, vqsubq_s8)
	OPENCV_HAL_IMPL_NEON_BIN_OP(+, v_uint16x8, vqaddq_u16)
	OPENCV_HAL_IMPL_NEON_BIN_OP(-, v_uint16x8, vqsubq_u16)
	OPENCV_HAL_IMPL_NEON_BIN_OP(+, v_int16x8, vqaddq_s16)
	OPENCV_HAL_IMPL_NEON_BIN_OP(-, v_int16x8, vqsubq_s16)
	OPENCV_HAL_IMPL_NEON_BIN_OP(+, v_int32x4, vaddq_s32)
	OPENCV_HAL_IMPL_NEON_BIN_OP(-, v_int32x4, vsubq_s32)
	OPENCV_HAL_IMPL_NEON_BIN_OP(*, v_int32x4, vmulq_s32)
	OPENCV_HAL_IMPL_NEON_BIN_OP(+, v_uint32x4, vaddq_u32)
	OPENCV_HAL_IMPL_NEON_BIN_OP(-, v_uint32x4, vsubq_u32)
	OPENCV_HAL_IMPL_NEON_BIN_OP(*, v_uint32x4, vmulq_u32)
	OPENCV_HAL_IMPL_NEON_BIN_OP(+, v_float32x4, vaddq_f32)
	OPENCV_HAL_IMPL_NEON_BIN_OP(-, v_float32x4, vsubq_f32)
	OPENCV_HAL_IMPL_NEON_BIN_OP(*, v_float32x4, vmulq_f32)
	OPENCV_HAL_IMPL_NEON_BIN_OP(+, v_int64x2, vaddq_s64)
	OPENCV_HAL_IMPL_NEON_BIN_OP(-, v_int64x2, vsubq_s64)
	OPENCV_HAL_IMPL_NEON_BIN_OP(+, v_uint64x2, vaddq_u64)
	OPENCV_HAL_IMPL_NEON_BIN_OP(-, v_uint64x2, vsubq_u64)

Uh oh!

Conversation

mshabunin commented Jan 30, 2023 • edited by alalek Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

mshabunin commented Jan 30, 2023 •

edited by alalek

Loading