[G-API] Support postprocessing for not argmaxed outputs#20476

Merged

alalek merged 7 commits intoopencv:masterfrom

TolyaTalamanov:at/support-unet-camvid-0001-segm-sample

Aug 6, 2021

Contributor

TolyaTalamanov commented Jul 29, 2021 •

edited

Loading

Pull Request Readiness Checklist

See details at https://github.com/opencv/opencv/wiki/How_to_contribute#making-a-good-pull-request

I agree to contribute to the project under Apache 2 License.
To the best of my knowledge, the proposed patch is not based on a code under GPL or other license that is incompatible with OpenCV
The PR is proposed to proper branch
There is reference to original bug report and related work
There is accuracy test, performance test and test data in opencv_extra repository, if applicable
Patch to opencv_extra has the same branch name.
The feature is well documented and sample code can be built with the project CMake

Overview

Some semantic segmentation networks such as unet-camvid-0001 from OMZ produce multi-plane output (1 x num_classesx H x W). In that case need to perform argmax operation for every pixel through channel plane in order to convert output to 1 x 1 x H x W representation where every pixel is class id.

Build configuration

force_builders=Custom,Custom Win,Custom Mac
build_gapi_standalone:Linux x64=ade-0.1.1f
build_gapi_standalone:Win64=ade-0.1.1f
build_gapi_standalone:Mac=ade-0.1.1f
build_gapi_standalone:Linux x64 Debug=ade-0.1.1f

build_image:Custom=centos:7
buildworker:Custom=linux-1
build_gapi_standalone:Custom=ade-0.1.1f

Xbuild_image:Custom=ubuntu-openvino-2020.3.0:16.04
build_image:Custom Win=openvino-2021.3.0
build_image:Custom Mac=openvino-2021.3.0

test_modules:Custom=gapi,python2,python3,java
test_modules:Custom Win=gapi,python2,python3,java
test_modules:Custom Mac=gapi,python2,python3,java

buildworker:Custom=linux-1
// disabled due high memory usage: test_opencl:Custom=ON
test_opencl:Custom=OFF
test_bigdata:Custom=1
test_filter:Custom=*


          Support postprocessing for not argmaxed outputs

e8d7386

TolyaTalamanov requested a review from mpashchenkov

July 29, 2021 21:20

TolyaTalamanov commented

View reviewed changes

modules/gapi/samples/semantic_segmentation.cpp Outdated Show resolved Hide resolved

modules/gapi/samples/semantic_segmentation.cpp Outdated Show resolved Hide resolved

modules/gapi/samples/semantic_segmentation.cpp Show resolved Hide resolved

mpashchenkov reviewed

View reviewed changes

modules/gapi/samples/semantic_segmentation.cpp Outdated Show resolved Hide resolved

TolyaTalamanov added 3 commits

July 30, 2021 17:41


          Fix typo

21a0253


          Add assert

926b97e


          Remove static cast

b81200d

TolyaTalamanov requested a review from mpashchenkov

July 30, 2021 14:54

TolyaTalamanov added 2 commits

July 30, 2021 17:55


          CamelCast to snake_case

80ac52b


          Fix windows warning

bd51b16

* Add static_cast to uint8_t

mpashchenkov approved these changes

View reviewed changes

modules/gapi/samples/semantic_segmentation.cpp Outdated Show resolved Hide resolved


          Add const to variables

b597ea6

asmorkalov added the category: g-api / gapi label

Contributor Author

TolyaTalamanov commented Aug 4, 2021

@dmatveev Could you have a look ?

dmatveev self-assigned this

dmatveev added this to the 4.5.4 milestone

dmatveev approved these changes

View reviewed changes

Contributor

dmatveev left a comment

LGTM if the existing case is not broken with this change.

modules/gapi/samples/semantic_segmentation.cpp

Comment on lines +51 to +69

+              void classesToColors(const cv::Mat &out_blob,
+                                         cv::Mat &mask_img) {
+                  const int H = out_blob.size[0];
+                  const int W = out_blob.size[1];
+                  mask_img.create(H, W, CV_8UC3);
+                  GAPI_Assert(out_blob.type() == CV_8UC1);
+                  const uint8_t* const classes = out_blob.ptr<uint8_t>();
+                  for (int rowId = 0; rowId < H; ++rowId) {
+                      for (int colId = 0; colId < W; ++colId) {
+                          uint8_t class_id = classes[rowId * W + colId];
+                          mask_img.at<cv::Vec3b>(rowId, colId) =
+                              class_id < colors.size()
+                              ? colors[class_id]
+                              : cv::Vec3b{0, 0, 0}; // NB: sample supports 20 classes
+                      }
+                  }
+              }

Contributor

dmatveev Aug 4, 2021

Can this be expressed with our graph operators? Just wondering

Contributor Author

TolyaTalamanov Aug 5, 2021

Do you mean call this function inside the user kernel ? Or express this algo by using already existing operations ?

modules/gapi/samples/semantic_segmentation.cpp

Comment on lines +121 to 123

+                      cv::resize(mask_img, out, in.size());
                       const float blending = 0.3f;
                       out = in * blending + out * (1 - blending);

Contributor

dmatveev Aug 4, 2021

can this be moved on the graph level, too? Not critical to do it right now but worth considering for the future.

Contributor Author

TolyaTalamanov Aug 5, 2021

On the graph level cv::Size parameter is unknown, isn't it ?
It's obviously can be custom resize operation

modules/gapi/samples/semantic_segmentation.cpp

Comment on lines +109 to +110

		// NB: If output has more than single plane, it contains probabilities
		// otherwise class id.

Contributor

dmatveev Aug 4, 2021

Is this robust enough? Maybe explicit enum flag is better? I just don't know.

Contributor Author

TolyaTalamanov Aug 5, 2021

What do you mean by enum flag ? In that case you need to match model name with postprocessing enum flag, right ?
I don't think that it's a great solution, just tried not to overdesign it.

https://github.com/openvinotoolkit/open_model_zoo/blob/master/demos/common/python/models/segmentation.py#L79

Contributor Author

TolyaTalamanov commented Aug 6, 2021

@alalek Can it be merged ?

alalek merged commit 24de676 into opencv:master

alalek mentioned this pull request

(5.x) Merge 4.x #20886

Merged

a-sajjad72 pushed a commit to a-sajjad72/opencv that referenced this pull request


          Merge pull request opencv#20476 from TolyaTalamanov:at/support-unet-c…

e2ee48a

…amvid-0001-segm-sample

[G-API] Support postprocessing for not argmaxed outputs

* Support postprocessing for not argmaxed outputs

* Fix typo

* Add assert

* Remove static cast

* CamelCast to snake_case

* Fix windows warning

* Add static_cast to uint8_t

* Add const to variables

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

category: g-api / gapi