Merged
Conversation
Use 4x FMA chains to sum on SIMD 128 FP64 targets. On x86 this showed about 1.4x improvement. For PPC, do a full multiply (32x32->64b), convert to DP then accumulate. This may be slightly less precise for some inputs. But is 1.5x faster than the above which is about 1.5x than the FMA above for ~2.5x speedup.
* removed private flann documentation * common tutorial images moved to doc/images * grouping issues
* Convert ImgWarp from SSE SIMD to HAL - 2.8x faster on Power (VSX) and 15% speedup on x86 * Change compile flag from CV_SIMD128 to CV_SIMD128_64F for use of v_float64x2 type * Changing WarpPerspectiveLine from class functions and dispatching to static functions. * Re-add dynamic runtime and dispatch execution. * RRestore SSE4_1 optimizations inside opt_SSE4_1 namespace
- error: variables in the local address space can only be declared in the outermost scope of a kernel function
* Wrap unsupported by IE layers as custom layers * Replace pointers to layers blobs to their shapes * Enable Faster R-CNN with IE backend on CPU
* fix 'chmod' is not recognized as an internal or external command in Windows opencv#14939 * cmake: update PCH command handling
* Adding all possible data type interactions to the perf tests since some use SIMD acceleration and others do not. * Disabling full tests by default. * Giving proper names, removing magic numbers and sanity checks of new performance tests for the integral function. * Giving proper names, making array static.
* Added mechanism to set/toggle NORMAL window to TOPMOST window. * highgui: remove unnecessary/legacy code
Member
Author
|
👍 |
Merged
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
#14872 from AhiyaHiya:feature/topmost_window
#15063 from dkurt:dnn_ie_ocv_layers
#15170 from sturkmen72:update_samples
#15339 from pmur:dotprod-32s-vsx
#15358 from ChipKerchner:imgwarpToHal
#15402 from ChipKerchner:normUnroll
#15424 from mshabunin:add-cmake-docs
#15433 from huihut:master
#15435 from alalek:update_version_3.4.8-pre
#15437 from devnexen:fbsd_opencl_build_fix
#15440 from everton1984:new_integral_tests
#15444 from alalek:ocl_fix_fft_kernel
Previous "Merge 3.4": #15423