Fix resize+concat optimization
* fix resize+concat optimization
* add comment and fix indentation
- drop CreateSubBuffer
- fix FUSED_CONV_ELTWISE mode

- if kernel code doesn't support that

- dropped usage of CreateSubBuffer()
- buffers lifetime management issue
- fixed elementwise offset
- avoid out of bounds read access
Add Normalize subgraph, fix Slice, Mul and Expand
* Add Normalize subgraph, support for starts<0 and axis<0 in Slice, Mul broadcasting in the middle and fix Expand's unsqueeze
* remove todos
* remove range-based for loop
* address review comments
* change >> to > > in template
* fix indexation
* fix expand that does nothing
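The handling of `starts<0` and `axis<0` in Slice mentioned above can be illustrated with a minimal Python sketch. It assumes ONNX-style semantics with positive steps: negative values count from the end, and indices are clamped to the dimension size. The helper names (`normalize_axis`, `normalize_index`, `slice_range`) are hypothetical, and this is not the code from the pull request.

```python
def normalize_axis(axis: int, rank: int) -> int:
    """Map an axis in [-rank, rank) to its non-negative equivalent."""
    if not -rank <= axis < rank:
        raise ValueError(f"axis {axis} is out of range for rank {rank}")
    return axis + rank if axis < 0 else axis


def normalize_index(index: int, dim: int) -> int:
    """Map a possibly negative start/end to [0, dim] (assumes a positive step)."""
    if index < 0:
        index += dim  # negative values count from the end of the dimension
    return max(0, min(index, dim))  # clamp out-of-range values


def slice_range(shape, axis, start, end):
    """Return (axis, start, end) with all values made non-negative."""
    axis = normalize_axis(axis, len(shape))
    dim = shape[axis]
    return axis, normalize_index(start, dim), normalize_index(end, dim)
```

For example, slicing the last axis of a `(2, 3, 10)` tensor with `start=-4` resolves to axis 2 and start index 6 under these assumptions.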
Fix PReLU negative slope access pattern
* fix prelu negative slope access pattern
* change begin() to ptr()
…mm_requirements core(OpenCL): fix intel_gpu_gemm kernel requirements
* core(ocl): fix intel_gpu_gemm integration - allow bailout to generic OpenCL kernel
* core(ocl): avoid failures of generic OpenCL gemm kernel
* core(ocl): define alignment requirements of intel_gpu_gemm kernels
* copy 4.x selectOpenCLDevice() -- it is compatible
* filter platforms rather than trying only first matching
* this works on 3.4 and 4.x master
DNN: BiasAdd tf_importer.cpp could load Const from First Place
#20605 from rogday:split_slice_shenanigans
#20647 from rogday:resize_concat_optimization
#20648 from alalek:issue_20615
#20651 from alalek:issue_18361
#20652 from alalek:ocl_debug
#20655 from alalek:dnn_ocl_fix_globalsize
#20657 from zchrissirhcz:upd-doc-for-py_fast
#20661 from alalek:dnn_ocl_fix_gemm_like_kernel
#20670 from alalek:core_ocl_fix_intel_gpu_gemm_requirements
#20674 from rogday:prelu_slope
#20675 from diablodale:fix2-20544
#20686 from zihaomu:tf_biasadd_bug
#20687 from alalek:core_ocl_fix_intel_gemm
#20689 from diablodale:fix20613-selectocldevice
#20690 from alalek:build_opencv_winpack_dldt_2021.4.1
Previous "Merge 3.4": #20642