Make the implementation of optimization in DNN adjustable to different vector sizes with RVV intrinsics. by hanliutong · Pull Request #20521 · opencv/opencv

hanliutong · 2021-08-09T01:59:16Z

PR for GSoC'21 project on Optimize OpenCV DNN for RISC-V. Related PR #20287.

This PR will further optimize DNN on the basis of #20287, especially when VLEN>128.

In #20287, we have used RVV Intrinsic to optimize the 4 kernel functions in DNN. However, if RVV vectors are 256-bit wide or more longer, then the current implementation will use only a part of them. This PR tries to adjustable to different vector sizes.

Functions	Implement && Build	block size	Max used of vReg (v0-v31)
fastGEMM	⏱ Ready for review	4*7	32
fastGEMM1T	⏱ Ready for review	15*2	32
fastConv	⏱ Ready for review	3*8	28
fastDepthwiseConv	⏱ Ready for review	/	18

Pull Request Readiness Checklist

See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request

I agree to contribute to the project under Apache 2 License.
To the best of my knowledge, the proposed patch is not based on a code under GPL or other license that is incompatible with OpenCV
The PR is proposed to proper branch
There is reference to original bug report and related work
There is accuracy test, performance test and test data in opencv_extra repository, if applicable
Patch to opencv_extra has the same branch name.
The feature is well documented and sample code can be built with the project CMake

asmorkalov

👍 Tested with QEMU and vlen=128,256

modules/dnn/src/layers/layers_common.simd.hpp

asmorkalov

👍

Make the implementation of optimization in DNN adjustable to different vector sizes with RVV intrinsics. * Update fastGEMM for multi VLEN. * Update fastGEMM1T for multi VLEN. * Update fastDepthwiseConv for multi VLEN. * Update fastConv for multi VLEN. * Replace malloc with cv::AutoBuffer.

asmorkalov self-requested a review August 9, 2021 07:14

asmorkalov added category: dnn optimization platform: riscv labels Aug 9, 2021

hanliutong added 4 commits August 20, 2021 09:30

Update fastGEMM for multi VLEN.

32f2d9e

Update fastGEMM1T for multi VLEN.

e626dec

Update fastDepthwiseConv for multi VLEN.

daedfeb

Update fastConv for multi VLEN.

2ddb33a

hanliutong force-pushed the dev-rvv-multiVLEN branch from cd0b910 to 2ddb33a Compare August 20, 2021 07:13

hanliutong marked this pull request as ready for review August 24, 2021 04:48

hanliutong changed the title ~~WIP: Make the implementation of optimization in DNN adjustable to different vector sizes with RVV intrinsics.~~ Make the implementation of optimization in DNN adjustable to different vector sizes with RVV intrinsics. Aug 26, 2021

Merge branch 'opencv:master' into dev-rvv-multiVLEN

cd5baad

asmorkalov reviewed Oct 4, 2021

View reviewed changes

modules/dnn/src/layers/layers_common.simd.hpp Outdated Show resolved Hide resolved

Replace malloc with cv::AutoBuffer.

9508aa9

asmorkalov self-requested a review October 5, 2021 14:07

asmorkalov approved these changes Oct 5, 2021

View reviewed changes

alalek assigned asmorkalov Oct 5, 2021

alalek merged commit e5fb504 into opencv:master Oct 5, 2021

alalek mentioned this pull request Oct 15, 2021

(5.x) Merge 4.x #20886

Merged

hanliutong deleted the dev-rvv-multiVLEN branch November 9, 2021 07:50

hanliutong mentioned this pull request Nov 19, 2021

Further optimize DNN for RISC-V Vector. #21086

Merged

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Make the implementation of optimization in DNN adjustable to different vector sizes with RVV intrinsics.#20521

Make the implementation of optimization in DNN adjustable to different vector sizes with RVV intrinsics.#20521
alalek merged 6 commits intoopencv:masterfrom
hanliutong:dev-rvv-multiVLEN

hanliutong commented Aug 9, 2021 •

edited

Loading

Uh oh!

asmorkalov left a comment

Uh oh!

Uh oh!

asmorkalov left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

hanliutong commented Aug 9, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Pull Request Readiness Checklist

Uh oh!

asmorkalov left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

asmorkalov left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

hanliutong commented Aug 9, 2021 •

edited

Loading