-
-
Notifications
You must be signed in to change notification settings - Fork 56.5k
Winograd convolution fails with assertion on CPU without AVX #23760
Copy link
Copy link
Closed
Description
System Information
OpenCV: 4.x (before 4.8.0)
OS: Ubuntu Linux 22.04
CPU: Core 2 Duo 6600
Detailed description
CPU Features:
cat /proc/cpuinfo
processor : 0
vendor_id : GenuineIntel
cpu family : 6
model : 15
model name : Intel(R) Core(TM)2 CPU 6600 @ 2.40GHz
stepping : 6
microcode : 0xd0
cpu MHz : 1600.000
cache size : 4096 KB
physical id : 0
siblings : 2
core id : 0
cpu cores : 2
apicid : 0
initial apicid : 0
fpu : yes
fpu_exception : yes
cpuid level : 10
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ht tm pbe syscall nx lm constant_tsc arch_perfmon pebs bts rep_good nopl cpuid aperfmperf pni dtes64 monitor ds_cpl vmx est tm2 ssse3 cx16 xtpr pdcm lahf_lm pti tpr_shadow dtherm
vmx flags : tsc_offset vtpr
bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_unknown
bogomips : 4799.70
clflush size : 64
cache_alignment : 64
address sizes : 36 bits physical, 48 bits virtual
power management:
OpenCV Optimizations:
" CPU/HW features:\n"
" Baseline: SSE SSE2 SSE3\n"
" requested: SSE3\n"
" Dispatched code generation: SSE4_1 SSE4_2 FP16 AVX AVX2 AVX512_SKX\n"
" requested: SSE4_1 SSE4_2 AVX FP16 AVX2 AVX512_SKX\n"
" SSE4_1 (18 files): + SSSE3 SSE4_1\n"
" SSE4_2 (2 files): + SSSE3 SSE4_1 POPCNT SSE4_2\n"
" FP16 (1 files): + SSSE3 SSE4_1 POPCNT SSE4_2 FP16 AVX\n"
" AVX (8 files): + SSSE3 SSE4_1 POPCNT SSE4_2 AVX\n"
" AVX2 (36 files): + SSSE3 SSE4_1 POPCNT SSE4_2 FP16 FMA3 AVX AVX2\n"
" AVX512_SKX (8 files): + SSSE3 SSE4_1 POPCNT SSE4_2 FP16 FMA3 AVX AVX2 AVX_512F AVX512_COMMON AVX512_SKX\n"
Steps to reproduce
Tested with Core 2 Duo 6600 and Ubuntu 22.04 Found some issue with latest convolution optimizations in DNN:
./bin/opencv_test_dnn
CTEST_FULL_OUTPUT
OpenCV version: 4.7.0-dev
OpenCV VCS version: 4.7.0-396-gd3e7968927
Build type: Release
Compiler: /usr/bin/c++ (ver 11.3.0)
Parallel framework: pthreads (nthreads=2)
CPU features: SSE SSE2 SSE3 *SSE4.1? *SSE4.2? *FP16? *AVX? *AVX2? *AVX512-SKX?
Intel(R) IPP version: disabled
OpenCL is disabled
TEST: Skip tests with tags: 'mem_6gb', 'verylong', 'dnn_skip_opencv_backend', 'dnn_skip_cpu', 'dnn_skip_cpu_fp16', 'dnn_skip_ocl', 'dnn_skip_ocl_fp16', 'dnn_skip_onnx_conformance', 'dnn_skip_parser'
....
[----------] 20 tests from DNNTestNetwork
[ RUN ] DNNTestNetwork.AlexNet/0, where GetParam() = OCV/CPU
unknown file: Failure
C++ exception with description "OpenCV(4.7.0-dev) /home/arina/Projects/opencv/modules/dnn/src/layers/cpu_kernels/conv_winograd_f63.cpp:401: error: (-215:Assertion failed) CONV_WINO_IBLOCK == 3 && CONV_WINO_KBLOCK == 4 && CONV_WINO_ATOM_F32 == 4 in function 'winofunc_BtXB_8x8_f32'
" thrown in the test body.
[ FAILED ] DNNTestNetwork.AlexNet/0, where GetParam() = OCV/CPU (1216 ms)
[ RUN ] DNNTestNetwork.ResNet_50/0, where GetParam() = OCV/CPU
[ SKIP ] OpenCV tests: Can't find data file: dnn/ResNet-50-model.caffemodel
[ OK ] DNNTestNetwork.ResNet_50/0 (1 ms)
[ RUN ] DNNTestNetwork.SqueezeNet_v1_1/0, where GetParam() = OCV/CPU
unknown file: Failure
C++ exception with description "OpenCV(4.7.0-dev) /home/arina/Projects/opencv/modules/dnn/src/layers/cpu_kernels/conv_winograd_f63.cpp:401: error: (-215:Assertion failed) CONV_WINO_IBLOCK == 3 && CONV_WINO_KBLOCK == 4 && CONV_WINO_ATOM_F32 == 4 in function 'winofunc_BtXB_8x8_f32'
" thrown in the test body.
[ FAILED ] DNNTestNetwork.SqueezeNet_v1_1/0, where GetParam() = OCV/CPU (27 ms)
[ RUN ] DNNTestNetwork.GoogLeNet/0, where GetParam() = OCV/CPU
unknown file: Failure
C++ exception with description "OpenCV(4.7.0-dev) /home/arina/Projects/opencv/modules/dnn/src/layers/cpu_kernels/conv_winograd_f63.cpp:401: error: (-215:Assertion failed) CONV_WINO_IBLOCK == 3 && CONV_WINO_KBLOCK == 4 && CONV_WINO_ATOM_F32 == 4 in function 'winofunc_BtXB_8x8_f32'
" thrown in the test body.
[ FAILED ] DNNTestNetwork.GoogLeNet/0, where GetParam() = OCV/CPU (127 ms)
[ RUN ] DNNTestNetwork.Inception_5h/0, where GetParam() = OCV/CPU
unknown file: Failure
C++ exception with description "OpenCV(4.7.0-dev) /home/arina/Projects/opencv/modules/dnn/src/layers/cpu_kernels/conv_winograd_f63.cpp:401: error: (-215:Assertion failed) CONV_WINO_IBLOCK == 3 && CONV_WINO_KBLOCK == 4 && CONV_WINO_ATOM_F32 == 4 in function 'winofunc_BtXB_8x8_f32'
" thrown in the test body.
[ FAILED ] DNNTestNetwork.Inception_5h/0, where GetParam() = OCV/CPU (258 ms)
[ RUN ] DNNTestNetwork.ENet/0, where GetParam() = OCV/CPU
double free or corruption (!prev)
Segmentation fault (core dumped)
Issue submission checklist
- I report the issue, it's not a question
- I checked the problem with documentation, FAQ, open issues, forum.opencv.org, Stack Overflow, etc and have not found any solution
- I updated to the latest OpenCV version and the issue is still there
- There is reproducer code and related data files (videos, images, onnx, etc)
Reactions are currently unavailable