Skip to content

Depthwise Convolution layer with 5x5 kernel much slower than 4.7.0 #23941

@WanliZhong

Description

@WanliZhong

System Information

macOS M2
OpenCV 4.7.0 and 4.8.0
python

Detailed description

mediapipe palm detection model from OpenCV zoo show the inference time is double than the 4.7.0 version.
reference: https://github.com/opencv/opencv_zoo/pull/190/files#r1252471348

Model file: https://github.com/opencv/opencv_zoo/blob/main/models/palm_detection_mediapipe/palm_detection_mediapipe_2023feb.onnx
Layer by Layer performance test results:
palm_4.7.txt
palm_4.8.txt

depth wise convolution inference time is 3 times than 4.7.0 version.

Steps to reproduce

just run benchmark in OpenCV zoo

Issue submission checklist

  • I report the issue, it's not a question
  • I checked the problem with documentation, FAQ, open issues, forum.opencv.org, Stack Overflow, etc and have not found any solution
  • I updated to the latest OpenCV version and the issue is still there
  • There is reproducer code and related data files (videos, images, onnx, etc)

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions