cuda4dnn(MatMul): add MatMulOp by YashasSamaga · Pull Request #20138 · opencv/opencv

YashasSamaga · 2021-05-21T19:36:55Z

Adds a new MatMulOp which performs strided batched GEMM (or regular GEMM when effective tensor ranks are two or below) on runtime blobs.

resolves #19929

I agree to contribute to the project under Apache 2 License.
To the best of my knowledge, the proposed patch is not based on a code under GPL or other license that is incompatible with OpenCV
The PR is proposed to proper branch
There is reference to original bug report and related work
There is accuracy test, performance test and test data in opencv_extra repository, if applicable
Patch to opencv_extra has the same branch name.
The feature is well documented and sample code can be built with the project CMake

force_builders=Custom
buildworker:Custom=linux-4
build_image:Custom=ubuntu-cuda-cc52:18.04
Xbuild_image:Custom=ubuntu-cuda:18.04

alalek

Well done! Thank you for contribution 👍

add MatMulOp

32df5fa

YashasSamaga marked this pull request as ready for review June 5, 2021 08:16

alalek approved these changes Jun 7, 2021

View reviewed changes

opencv-pushbot merged commit 1c4d708 into opencv:master Jun 8, 2021

alalek mentioned this pull request Jun 13, 2021

(5.x) Merge 4.x #20266

Merged

alalek mentioned this pull request Oct 15, 2021

(5.x) Merge 4.x #20886

Merged

Provide feedback