Skip to content

DNN: add the Winograd fp16 support#23654

Merged
vpisarev merged 4 commits intoopencv:4.xfrom
zihaomu:wino_fp16
Nov 20, 2023
Merged

DNN: add the Winograd fp16 support#23654
vpisarev merged 4 commits intoopencv:4.xfrom
zihaomu:wino_fp16

Conversation

@zihaomu
Copy link
Copy Markdown
Member

@zihaomu zihaomu commented May 22, 2023

To add the winograd FP16 compute branch for convolution layer of 3x3 stride 1 case.

Test on M1 chip, 4 threads.

Model Name 4.x (Conv(FP16) + Wino(FP 32)) Conv(FP16) + Wino(FP 16)
ReseNet 50 18.5 ms 15.6 ms (18.5% speed up)

Pull Request Readiness Checklist

See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request

  • I agree to contribute to the project under Apache 2 License.
  • To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV
  • The PR is proposed to the proper branch
  • There is a reference to the original bug report and related work
  • There is accuracy test, performance test and test data in opencv_extra repository, if applicable
    Patch to opencv_extra has the same branch name.
  • The feature is well documented and sample code can be built with the project CMake

Loading
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants