Improvement of sequential connected components Wu's algorithm and pro… by MicheleCancilla · Pull Request #7705 · opencv/opencv

MicheleCancilla · 2016-11-22T12:06:10Z

This pullrequest changes

With this Pull Request we provide the following improvements on “cv::connectedComponents” and on “cv::connectedComponentsWithStats” functions:

We improve the speed of Wu’s algorithm (sequential version) providing a better implementation, both for 4 and 8-way connectivity.
We provide a parallel CPU implementation of both Wu and Grana’s algorithms that are the ones currently included in the OpenCV library. These parallel versions are automatically chose by the functions when the TBB variable is defined. The performance are shown in the charts below where acronyms are used to distinguish the different implementation/version of the algorithms. An explanation of the datasets used and how the tests were performed can be found here.

To conclude, this pull request reply to RFC #7270 and take into account the issue #7607 providing the same solution proposed by @sokian with pull request #7608 that is open yet.

SAUF_OLD: is the old implementation of the sequential version of the Wu’s algorithm;
SAUF_NEW: is the implementation of the sequential version of the Wu’s algorithm proposed with this pull request;
SAUF_PAR: is the parallel version of the Wu’s algorithm introduced by this pull request;
BBDT_SEQ: is the current sequential version of the Grana’s algorithm (currently included in the OpenCV);
BBDT_PAR: is the parallel version of the Grana’s algorithm introduced by this pull request;

Comparison of Wu’s algorithms with 8-way connectivity (without computing statistics)

3DPeS	Medical	Tobacco800

Comparison of Wu’s algorithms with 4-way connectivity (without computing statistics)

3DPeS	Medical	Tobacco800

Comparison of Grana’s algorithms without computing statistics

3DPeS	Medical	Tobacco800

Comparison of Grana’s algorithms with statistics computation

3DPeS	Medical	Tobacco800

Contributors:

Michele Cancilla @MicheleCancilla (cancilla.michele[at]gmail.com)
Federico Bolelli @prittt (federico.bolelli[at]unimore.it)
Costantino Grana @CostantinoGrana (costantino.grana[at]unimore.it)

…vide parallel version of both Wu's and Grana's algorithms (using TBB library)

mshabunin · 2016-11-22T13:04:54Z

Please, restore text indentation on lines 54-3957: it was 4 spaces, but you've changed it to 8 spaces.

mshabunin · 2016-11-22T13:09:06Z

Is it possible to change TBB to cv::parallel_for_? It has different backends (including TBB) and will work on more platforms.

prittt · 2016-11-22T16:17:46Z

Hi @mshabunin,

"Please, restore text indentation on lines 54-3957: it was 4 spaces, but you've changed it to 8 spaces."
Sure!!

"Is it possible to change TBB to cv::parallel_for_? It has different backends (including TBB) and will work on more platforms."
We are trying to do that and we are making some other performance tests to see the differences.

Extended parallel version to all frameworks supported by OpenCV; Added some documentation notes in modules/imgproc/include/opencv2/imgproc.hpp;

MicheleCancilla · 2016-11-24T14:16:08Z

Hi @mshabunin. You were right: we reverted the parallel code to cv::parallel_for_ and now it is obviously more flexible (tests revealed that also parallel frameworks different from TBB improve performances pretty well). We fixed the indentation issue and wrote down some notes in the documentation.

alalek · 2016-11-24T16:31:59Z

'_P' => 'P_'

To avoid conflicts with reserved variable names (started with underscore).

…h getNumberOfCPUs

mschoeneck · 2016-11-25T14:03:19Z

I tested your last commit a bit. I noticed, that when I use CV_16U label type, connectedComponents returns only the labels from first thread and the label image is broken. This seems to occure for both, GRANA and WU as well as for 4 and 8 connected.
With CV_32S it seems to be OK.
When I set number of threads to 1 the CV_16U output also returns correct number of found labels and label image looks correct.
I use OpenMP as parallel framework and Visual Studio 2013. The correct number of labels is about 10000.

MicheleCancilla · 2016-11-29T08:31:34Z

Hello @mschoeneck,
you are right. We try to explain the problem in the easiest possible way.

The CV_16U label type can be used only when the number of labels in the image can be fit a 16bit data type (i.e 65536). The problem with the parallel version of the algorithms (which divide the image to label in stripes and compute labeling on each stripe in parallel) is that each thread must start labeling from the maximum possible value reachable by the previous thread. This is necessary in order to avoid collision accesses to equivalence label array.

For example (considering an image with 8 rows and 2 thread): the first thread works with rows from 0 to 3 [0;3] and starts from label 1 (background is 0). The second one works with rows from 4 to 7 [4;7] and starts from maximum label reachable (at least in theory) by thread one incremented by one. So if the maximum label reachable from thread one is 5 thread two starts labeling from label 6.

Then, if you run connectedComponents function with CV_16U label type on an image that actually have only two labels, but the maximum number of different labels that image could contain is greater than 65536, parallel labeling will fail because of its nature.

For this reason we decided to call the parallel connectedComponents only with CV_32S label type.

@mshabunin @alalek The latest push failed under linux with strange errors (error closing /tmp/ccXGh37G.s: No space left on device), it's my fault or was there a problem with the buildbot?

alalek · 2016-11-29T08:37:22Z

there a problem with the buildbot

These builds are restarted.

MicheleCancilla · 2017-02-22T17:24:18Z

hello, do you have any updates? 😄

…mments

MicheleCancilla · 2017-04-27T10:55:59Z

The last commit includes the following changes/improvements:

Improvement of array of equivalences’ upper bound. The current upper bound in release 3.2.0 is undersized (wrong in some cases) but in the current pull request (as in the pull request REF) we have proposed a new upper bound that is correct but oversized in some cases (i.e. on images with even number of rows and/or cols). With the last commit, we definitely solve the problem (see following figure);
Specialized the first assigned label of each thread for parallel algorithms in 8-way-connectivity;
Fix some wrong comments (authors’ name) in other modules;

vpisarev · 2017-05-24T13:57:13Z

trying to fix doc builder complains, see #8789

Improvement of sequential connected components Wu's algorithm and pro…

0bc9a0d

…vide parallel version of both Wu's and Grana's algorithms (using TBB library)

Fixed unnecessary black spaces;

5b23c0b

Extended parallel version to all frameworks supported by OpenCV; Added some documentation notes in modules/imgproc/include/opencv2/imgproc.hpp;

Fixed _P reserved variable name problem and changed getNumThreads wit…

4b7fc59

…h getNumberOfCPUs

Removed parallel version for CV_16U label type

89a0a46

mshabunin and others added 2 commits February 26, 2017 07:46

Merge branch 'master' into master

0d7666a

Improvement of array of equivalences’ upper bound + fix some wrong co…

9405c6d

…mments

vpisarev self-assigned this May 24, 2017

vpisarev mentioned this pull request May 24, 2017

Parallel connected components #8789

Merged

vpisarev closed this May 24, 2017

alalek mentioned this pull request Jan 28, 2020

Fix bug 16410 and add test #16415

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Improvement of sequential connected components Wu's algorithm and pro…#7705

Improvement of sequential connected components Wu's algorithm and pro…#7705
MicheleCancilla wants to merge 6 commits intoopencv:masterfrom
MicheleCancilla:master

MicheleCancilla commented Nov 22, 2016

Uh oh!

mshabunin commented Nov 22, 2016

Uh oh!

mshabunin commented Nov 22, 2016

Uh oh!

prittt commented Nov 22, 2016

Uh oh!

MicheleCancilla commented Nov 24, 2016

Uh oh!

alalek commented Nov 24, 2016

Uh oh!

mschoeneck commented Nov 25, 2016 •

edited

Loading

Uh oh!

MicheleCancilla commented Nov 29, 2016

Uh oh!

alalek commented Nov 29, 2016

Uh oh!

MicheleCancilla commented Feb 22, 2017

Uh oh!

MicheleCancilla commented Apr 27, 2017

Uh oh!

vpisarev commented May 24, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Uh oh!

Conversation

MicheleCancilla commented Nov 22, 2016

This pullrequest changes

Contributors:

Uh oh!

mshabunin commented Nov 22, 2016

Uh oh!

mshabunin commented Nov 22, 2016

Uh oh!

prittt commented Nov 22, 2016

Uh oh!

MicheleCancilla commented Nov 24, 2016

Uh oh!

alalek commented Nov 24, 2016

Uh oh!

mschoeneck commented Nov 25, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

MicheleCancilla commented Nov 29, 2016

Uh oh!

alalek commented Nov 29, 2016

Uh oh!

MicheleCancilla commented Feb 22, 2017

Uh oh!

MicheleCancilla commented Apr 27, 2017

Uh oh!

vpisarev commented May 24, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

mschoeneck commented Nov 25, 2016 •

edited

Loading