[GSOC] Speeding-up AKAZE, part #3 by hrnr · Pull Request #9249 · opencv/opencv

hrnr · 2017-07-27T16:30:20Z

This is currently based on #8951. I will rebase on master, once it gets in.

Currently this contains a reworked version of the scale-space extrema search. The speedup for images with lots of keypoints is ~3.6x.


Geometric mean (times in ms; x-factor = speedup vs. the baseline commit 8200996b1)

Test: detectAndExtract::feature2d::(AKAZE_DEFAULT, <image>)
  img1 = "cv/detectors_descriptors_evaluation/images_datasets/leuven/img1.png"
  a3   = "stitching/a3.png"
  s2   = "stitching/s2.jpg"

Commit      img1 (ms)  x-factor   a3 (ms)  x-factor    s2 (ms)  x-factor
8200996b1      73.555         -    58.468         -    276.495         -
b12facf48      65.326      1.13    52.088      1.12    250.674      1.10
aa5a72b46      45.166      1.63    35.045      1.67    173.192      1.60
8cc0b286c      45.018      1.63    34.284      1.71    165.687      1.67
1d3f7fe9e      42.333      1.74    32.775      1.78    162.380      1.70
c13351891      39.708      1.85    30.440      1.92    148.727      1.86
76151e566      39.556      1.86    29.703      1.97    151.201      1.83
ea089a8ab      38.896      1.89    30.023      1.95    156.154      1.77
f9c2951fa      43.073      1.71    31.295      1.87    169.405      1.63
ba071d1ad      39.038      1.88    30.154      1.94    159.459      1.73
09c7288de      37.845      1.94    28.886      2.02    149.227      1.85
d71718dea      37.977      1.94    29.472      1.98    157.420      1.76
61a35d7a6      38.303      1.92    28.711      2.04    151.767      1.82
1d42c1fa8      31.364      2.35    23.899      2.45    119.365      2.32
70b66d1b6      33.183      2.22    26.021      2.25     76.862      3.60
1a4c8989d      33.253      2.21    26.826      2.18     77.646      3.56
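For readers checking the numbers: each x-factor appears to be the baseline time divided by the time measured for that commit. A minimal sketch that reproduces the 70b66d1b6 column from the timings above (the dictionary keys and the helper name `x_factor` are made up for illustration):

```python
# Timings in ms, copied from the table above.
baseline_ms = {"img1": 73.555, "a3": 58.468, "s2": 276.495}       # 8200996b1
commit_70b66d1b6_ms = {"img1": 33.183, "a3": 26.021, "s2": 76.862}


def x_factor(base_ms: float, new_ms: float) -> float:
    """Speedup of the new measurement relative to the baseline."""
    return base_ms / new_ms


speedups = {
    name: round(x_factor(baseline_ms[name], commit_70b66d1b6_ms[name]), 2)
    for name in baseline_ms
}
print(speedups)
```

The s2.jpg image is where the ~3.6x figure quoted in the description comes from; the two smaller images gain roughly 2.2x at the same commit.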

hrnr · 2017-07-27T16:32:03Z

Notice that 1a4c898 actually slows things down. I can't really explain that.

edit: Of course I have removed that hack; it is better to use L2, which is what the original algorithm uses. But I'm just curious why it runs slightly slower. Maybe it just has a negligible impact plus measurement fluctuations?

vpisarev · 2017-07-31T11:54:23Z

@hrnr, thank you, cool results! Are you going to add any more commits, or can it be reviewed and integrated?

hrnr · 2017-07-31T12:19:21Z

Yes, I'm planning to further optimize at least the non-linear diffusion and the Scharr kernels with nonstandard sigmas.

#8951 is ready for merge, it improves CPU perf by ~2.0x and implements basic OCL support.

vpisarev · 2017-07-31T12:49:54Z

ok, thanks! as we are preparing 3.3 now, perhaps, it makes sense to merge something now

vpisarev · 2017-07-31T12:50:00Z

👍

hrnr · 2017-07-31T12:54:14Z

Ok, this can go in too, but it should be rebased on #8951. When #8951 gets in I will rebase it, and it is good to go.

alalek · 2017-08-01T13:46:20Z

@hrnr #8951 is merged. Please rebase this patch.

* incorporate finding of extrema and subpixel refinement from Hideaki Suzuki's fast_akaze (https://github.com/h2suzuki/fast_akaze)
* use the OpenCV parallel framework
* do not search for keypoints near the border, where we can't compute sensible descriptors (bugs were fixed in ffd9ad9, 2c53895, but the descriptors were still not 100% correct). This is a better solution.

This version produces fewer keypoints with the same threshold. It is more effective in pruning similar keypoints (which do not bring any new information), so we have fewer keypoints, but of high quality. Accuracy is about the same.

* fix bug in subpixel refinement
* see commit db3dc22981e856ca8111f2f7fe57d9c2e0286efc in Pablo's repo

* store just keypoint positions
* store positions in a uchar mask for effective spatial search for neighbours
* construct keypoint structs at the very end
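The commit notes above describe the idea only briefly, so here is a rough, simplified sketch of it: collect bare positions, mark accepted ones in a byte mask so neighbours can be found by looking at nearby cells, and build keypoint structs last. This is not the code from the PR. It works on a single response map, and the names `find_extrema`, `KeyPoint`, `radius` and `border`, as well as the square search window, are assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class KeyPoint:
    """Hypothetical stand-in for a full keypoint struct."""
    x: int
    y: int
    response: float


def find_extrema(responses, threshold, radius, border):
    """responses: 2D list (rows) of detector responses."""
    h, w = len(responses), len(responses[0])

    # 1) Collect positions only, skipping a band of `border` pixels
    #    where a descriptor could not be computed sensibly.
    candidates = [
        (y, x)
        for y in range(border, h - border)
        for x in range(border, w - border)
        if responses[y][x] > threshold
    ]
    # Strongest first, so that weaker neighbours are the ones pruned.
    candidates.sort(key=lambda p: responses[p[0]][p[1]], reverse=True)

    # 2) One byte per pixel marks accepted positions; the neighbour
    #    search is a scan of the mask window around the candidate.
    mask = [bytearray(w) for _ in range(h)]
    for y, x in candidates:
        occupied = any(
            mask[ny][nx]
            for ny in range(max(0, y - radius), min(h, y + radius + 1))
            for nx in range(max(0, x - radius), min(w, x + radius + 1))
        )
        if not occupied:
            mask[y][x] = 1

    # 3) Construct the keypoint structs at the very end.
    return [
        KeyPoint(x, y, responses[y][x])
        for y in range(h)
        for x in range(w)
        if mask[y][x]
    ]


grid = [[0.0] * 7 for _ in range(7)]
grid[3][3] = 1.0   # strong response, kept
grid[3][4] = 0.8   # adjacent to (3, 3), pruned
grid[1][1] = 0.6   # isolated, kept
grid[0][0] = 5.0   # inside the border band, never considered
kps = find_extrema(grid, threshold=0.5, radius=1, border=1)
print([(k.x, k.y) for k in kps])
```

In the actual detector the comparison also spans neighbouring scale levels and the search radius depends on keypoint size; the sketch leaves both out.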

hrnr · 2017-08-01T14:06:33Z

Rebased and removed the fixup commit.

* win32 has lower accuracy

hrnr · 2017-08-02T09:12:35Z

The build failed on win32, which seems to have slightly lower accuracy. I have lowered the threshold a bit.

BTW, do you have any idea why buildbot scheduled a win32 build? These builders have never been scheduled for this PR before.

alalek · 2017-08-02T09:23:50Z

Some builders are optional on precommit checks.

We observed that:

* there is no testdata for kaze/akaze tests, so these tests don't check results, they just write them
* the valgrind builder detects uninitialized values in akaze algorithm results

hrnr · 2017-08-02T10:00:12Z

The missing testdata is an omission, but now I'm not sure if it is wise to have tests like these at all. For the descriptor test the exact number of keypoints must be the same, which might be problematic as everything in AKAZE is done in floats. The detector test seems to be quite fine though.

What is the approach to these magic values controlling tests? Shouldn't we drop them (at least for AKAZE) and rely only on invariance tests, which don't check any magic values?

I'll check the problem with valgrind.

alalek · 2017-08-02T10:30:11Z

Agreed, the current tests look strange and they are currently broken. I will take a look at them.

alalek · 2017-08-02T20:40:39Z

@hrnr Valgrind issue: the last two bits of the descriptor are left untouched: 61 bytes is 488 bits, but MLDB changes only 486 bits.
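The arithmetic behind that observation can be sketched as follows. The breakdown of 486 bits into 3 channels times pairwise comparisons over 2x2, 3x3 and 4x4 grids is an assumption about the default full-size MLDB descriptor; it is not stated in this thread, but it is consistent with the 486 figure. The helper name `mldb_bits` is hypothetical.

```python
import math


def mldb_bits(channels: int = 3, grids=(2, 3, 4)) -> int:
    # Assumed layout: for each n x n grid, one bit per pair of cells,
    # repeated for every channel.
    return channels * sum(math.comb(n * n, 2) for n in grids)


bits = mldb_bits()                  # bits actually written
n_bytes = math.ceil(bits / 8)       # bytes needed to store them
unused = n_bytes * 8 - bits         # padding bits in the last byte

# Zero-initialising the buffer before filling it gives the padding
# bits a defined value, which is one way to avoid such a report.
descriptor = bytearray(n_bytes)
print(bits, n_bytes, unused)
```

The two padding bits in the last byte are never written by the descriptor computation, so they keep whatever value the buffer had when it was allocated.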

alalek · 2017-08-03T08:34:28Z

Let's put it in and fix the remaining issues.
👍

hrnr force-pushed the akaze_part3 branch from 1a4c898 to aa6bb85 · July 28, 2017 09:34

sovrasov added the GSoC label Jul 31, 2017

vpisarev self-assigned this Jul 31, 2017

vpisarev added this to the 3.3 milestone Jul 31, 2017

hrnr added 3 commits August 1, 2017 16:00

incorporate bugfix from upstream

4fb7e57

rework finding of scale space extremas

642fff7

hrnr force-pushed the akaze_part3 branch from aa6bb85 to 642fff7 · August 1, 2017 14:01

lower inlier threshold in test

59c6438

alalek merged commit 3166d0c into opencv:master Aug 3, 2017

hrnr mentioned this pull request Aug 24, 2017

[GSOC] Speeding-up AKAZE, tracking tutorial #9444

Merged

shimat mentioned this pull request Aug 15, 2018

AKAZE keypoint output differs between 3.2 and 3.3 #12217

Closed
