CUDA: Handle fusion of conv+eltwise in case of multi-output node (i.e. Split) by dkurt · Pull Request #27326 · opencv/opencv

dkurt · 2025-05-17T08:33:18Z

Pull Request Readiness Checklist

Enables YOLO11n with CUDA backend

resolves #26566

See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request

I agree to contribute to the project under Apache 2 License.
To the best of my knowledge, the proposed patch is not based on a code under GPL or another license that is incompatible with OpenCV
The PR is proposed to the proper branch
There is a reference to the original bug report and related work
There is accuracy test, performance test and test data in opencv_extra repository, if applicable
Patch to opencv_extra has the same branch name.
The feature is well documented and sample code can be built with the project CMake

dkurt · 2025-05-17T08:39:25Z

import numpy as np
import cv2 as cv

net = cv.dnn.readNet("yolo11n.onnx")

inp = np.random.rand(1, 3, 640, 640).astype(np.float32)
net.setInput(inp)
net.setPreferableTarget(cv.dnn.DNN_TARGET_CPU)
ref = net.forward()

net = cv.dnn.readNet("yolo11n.onnx")

net.setInput(inp)
net.setPreferableBackend(cv.dnn.DNN_BACKEND_CUDA)
net.setPreferableTarget(cv.dnn.DNN_TARGET_CUDA)
out = net.forward()

print("ref shape", ref.shape)
print("out shape", out.shape)
print("diff", np.max(np.abs(ref - out)))

ref shape (1, 84, 8400)
out shape (1, 84, 8400)
diff 0.0029296875

Handle fusion of conv+eltwise in case of multi-output node (i.e. Split)

fd5b33b

dkurt added bug category: dnn labels May 17, 2025

dkurt changed the title ~~Handle fusion of conv+eltwise in case of multi-output node (i.e. Split)~~ CUDA: Handle fusion of conv+eltwise in case of multi-output node (i.e. Split) May 17, 2025

dkurt added this to the 4.12.0 milestone May 17, 2025

dkurt requested a review from asmorkalov May 17, 2025 11:08

asmorkalov approved these changes May 19, 2025

View reviewed changes

asmorkalov self-assigned this May 19, 2025

asmorkalov merged commit 9d2d927 into opencv:4.x May 19, 2025
103 of 109 checks passed

dkurt deleted the handle_multi_output_eltwise_fusion branch May 19, 2025 07:48

asmorkalov mentioned this pull request May 27, 2025

5.x merge 4.x #27370

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

CUDA: Handle fusion of conv+eltwise in case of multi-output node (i.e. Split)#27326

CUDA: Handle fusion of conv+eltwise in case of multi-output node (i.e. Split)#27326
asmorkalov merged 1 commit intoopencv:4.xfrom
dkurt:handle_multi_output_eltwise_fusion

dkurt commented May 17, 2025 •

edited

Loading

Uh oh!

dkurt commented May 17, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

dkurt commented May 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Pull Request Readiness Checklist

Uh oh!

dkurt commented May 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

dkurt commented May 17, 2025 •

edited

Loading

dkurt commented May 17, 2025 •

edited

Loading