dnn: fix gather layer implementation by alalek · Pull Request #22993 · opencv/opencv

alalek · 2022-12-20T06:12:40Z

support FP16 data

Fixes failed tests:

[  FAILED  ] Test_ONNX_layers.Gather/1, where GetParam() = OCV/OCL_FP16
[  FAILED  ] Test_ONNX_layers.GatherMulti/1, where GetParam() = OCV/OCL_FP16
[  FAILED  ] Test_ONNX_layers.MatMul_init/1, where GetParam() = OCV/OCL_FP16

- support FP16 data

asmorkalov · 2022-12-21T08:56:17Z

@rogday Could you look at it too?

fengyuentau · 2022-12-21T09:46:04Z

modules/dnn/src/layers/gather_layer.cpp

+
        const int axis = normalize_axis(m_axis, shape(inp));

+        // FIXIT: why should we work with non-normalized input? it should be handled in importer or layers's output generator


It was originally normalized on the go:

for (size_t i = 0; i < outer_dims; ++i) { // ... for (size_t j = 0 ; j < indices.total(); ++j) { const size_t index = (static_cast<int>(idx[j]) + inp.size[axis]) % inp.size[axis]; // .. } }

I think in onnx importer we should load what it is without any extra operation on constant input & attributes. Operations like normalization should be done in the layer constructor or somewhere. We need the original information when it comes to build operators for other backend, like CANN and TIM-VX, who have operator primitives aligned with ONNX, TF ...

"On the go" normalization is not efficient as performs it multiple times (especially through division).
There is change to perform normalization once (per forward() call) before the main loop.

It was done because the elements of index tensor can be negative per spec and non-constant at the same time. We cannot perform normalization ahead of time since we don't have the data.

rogday · 2022-12-21T21:33:30Z

Why did you extract normalization into its own for loop?

alalek · 2022-12-22T02:50:06Z

Why should we do that multiple times in nested loop? Including conversions float->int on each iteration.

dnn: fix gather layer implementation

1102b7e

- support FP16 data

asmorkalov requested a review from fengyuentau December 20, 2022 13:34

fengyuentau approved these changes Dec 21, 2022

View reviewed changes

alalek assigned fengyuentau Dec 21, 2022

opencv-pushbot merged commit 6b4f3e5 into opencv:4.x Dec 21, 2022

alalek mentioned this pull request Jan 8, 2023

(5.x) Merge 4.x #23113

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

dnn: fix gather layer implementation#22993

dnn: fix gather layer implementation#22993
opencv-pushbot merged 1 commit intoopencv:4.xfrom
alalek:fixup_21738

alalek commented Dec 20, 2022 •

edited

Loading

Uh oh!

asmorkalov commented Dec 21, 2022

Uh oh!

fengyuentau Dec 21, 2022

Uh oh!

alalek Dec 21, 2022

Uh oh!

rogday Dec 21, 2022 •

edited

Loading

Uh oh!

rogday commented Dec 21, 2022

Uh oh!

alalek commented Dec 22, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants


		const int axis = normalize_axis(m_axis, shape(inp));

		// FIXIT: why should we work with non-normalized input? it should be handled in importer or layers's output generator

Uh oh!

Conversation

alalek commented Dec 20, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

asmorkalov commented Dec 21, 2022

Uh oh!

fengyuentau Dec 21, 2022

Choose a reason for hiding this comment

Uh oh!

alalek Dec 21, 2022

Choose a reason for hiding this comment

Uh oh!

rogday Dec 21, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

rogday commented Dec 21, 2022

Uh oh!

alalek commented Dec 22, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

alalek commented Dec 20, 2022 •

edited

Loading

rogday Dec 21, 2022 •

edited

Loading