dnn: avoid const layer forwarding in layer norm layer and attention layer by fengyuentau · Pull Request #25238 · opencv/opencv

fengyuentau · 2024-03-20T09:10:42Z

While profiling ViTs with dnn, I found ConstLayer can take a proportion of the inference time, which is weird. This comes from the data copy during the inference of ConstLayer. There is a chance that we can improve the efficiency of data copying but the easiest and most convenient way is to avoid ConstLayer. This PR change the way how we handle constants in layer normalization layer and attention layer, which is storing in the layer blobs instead of making constant layers for them.

Checklists:

Backend compatibility in layer normalization layer.

Pull Request Readiness Checklist

See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request

I agree to contribute to the project under Apache 2 License.
To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV
The PR is proposed to the proper branch
There is a reference to the original bug report and related work
There is accuracy test, performance test and test data in opencv_extra repository, if applicable
Patch to opencv_extra has the same branch name.
The feature is well documented and sample code can be built with the project CMake

force_builders=Linux OpenCL, Win64 OpenCL

dkurt · 2024-03-20T09:34:01Z

@fengyuentau, good catch, thanks! Once you can cover backends initialization, let's merge this.

…vino backend

fengyuentau · 2024-03-21T09:32:07Z

Backend intialization with constant blobs is done. Lets see whether all tests are passed or not.

modules/dnn/src/cuda4dnn/primitives/layer_norm.hpp

modules/dnn/src/layers/layer_norm.cpp

asmorkalov · 2024-03-26T11:50:39Z

Some perf numbers for AMD Ryzen 7 2700X, 64Gb RAM:

Geometric mean (ms)

               Name of Test                  4.x-1  patched-1 patched-1 
                                                                  vs    
                                                                4.x-1   
                                                              (x-factor)
VIT_B_32::DNNTestNetwork::OCV/CPU           122.025  114.232     1.07   
VIT_B_32::DNNTestNetwork::OCV/OCL           310.753  240.440     1.29   
VIT_B_32::DNNTestNetwork::OCV/OCL_FP16      307.549  239.036     1.29   
VisionTransformer::Layer_Attention::OCV/CPU  4.689    4.760      0.98   
VitTrack::DNNTestNetwork::OCV/CPU            6.730    6.600      1.02   
VitTrack::DNNTestNetwork::OCV/OCL           11.914   11.575      1.03   
VitTrack::DNNTestNetwork::OCV/OCL_FP16      11.870   11.442      1.04

asmorkalov

👍

dnn: avoid const layer forwarding in layer norm layer and attention layer opencv#25238 While profiling ViTs with dnn, I found `ConstLayer` can take a proportion of the inference time, which is weird. This comes from the data copy during the inference of `ConstLayer`. There is a chance that we can improve the efficiency of data copying but the easiest and most convenient way is to avoid `ConstLayer`. This PR change the way how we handle constants in layer normalization layer and attention layer, which is storing in the layer blobs instead of making constant layers for them. Checklists: - [x] Backend compatibility in layer normalization layer. ### Pull Request Readiness Checklist See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request - [x] I agree to contribute to the project under Apache 2 License. - [x] To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV - [x] The PR is proposed to the proper branch - [x] There is a reference to the original bug report and related work - [x] There is accuracy test, performance test and test data in opencv_extra repository, if applicable Patch to opencv_extra has the same branch name. - [x] The feature is well documented and sample code can be built with the project CMake

store constants in blobs

7b0bd99

fengyuentau added the category: dnn label Mar 20, 2024

fengyuentau added this to the 4.10.0 milestone Mar 20, 2024

fengyuentau requested a review from dkurt March 20, 2024 09:10

fengyuentau and others added 10 commits March 20, 2024 17:56

restore code in onnx importer

0e94934

fix opencl backend

1235400

fix cann backend

281f8c9

fix openvino backend

c7f3206

quick fixes for warnings in opencl backend and compile errors in open…

a29c7f0

…vino backend

fix cuda backend

2c6f5a4

fix bias in opencl backend

fd4f00e

opencl: align mat type

05a77f9

openvino: fix bias shape

a661d14

cann: support optional bias

989761d

fengyuentau requested a review from vpisarev March 21, 2024 09:30

dkurt approved these changes Mar 22, 2024

View reviewed changes

asmorkalov assigned dkurt Mar 25, 2024

asmorkalov reviewed Mar 25, 2024

View reviewed changes

resolve comments

6427b38

fengyuentau changed the title ~~dnn: store and use constants in blobs for layer normalization layer and attention layer~~ dnn: avoid const layer forwarding in layer norm layer and attention layer Mar 26, 2024

asmorkalov approved these changes Mar 26, 2024

View reviewed changes

asmorkalov merged commit accf200 into opencv:4.x Mar 26, 2024

fengyuentau deleted the optimized_const branch March 26, 2024 12:59

asmorkalov mentioned this pull request Apr 1, 2024

5.x merge 4.x #25305

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

dnn: avoid const layer forwarding in layer norm layer and attention layer#25238

dnn: avoid const layer forwarding in layer norm layer and attention layer#25238
asmorkalov merged 12 commits intoopencv:4.xfrom
fengyuentau:optimized_const

fengyuentau commented Mar 20, 2024 •

edited

Loading

Uh oh!

dkurt commented Mar 20, 2024

Uh oh!

fengyuentau commented Mar 21, 2024

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

asmorkalov commented Mar 26, 2024

Uh oh!

asmorkalov left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

fengyuentau commented Mar 20, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Pull Request Readiness Checklist

Uh oh!

dkurt commented Mar 20, 2024

Uh oh!

fengyuentau commented Mar 21, 2024

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

asmorkalov commented Mar 26, 2024

Uh oh!

asmorkalov left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

fengyuentau commented Mar 20, 2024 •

edited

Loading