Dnn block layout v4 #17

Closed
vpisarev wants to merge 21 commits into 5.x from dnn_block_layout_v4


Owner

vpisarev commented Mar 1, 2026

test

…t fails (probably, the reference output needs to be updated).
2. properly extended depthwise convolution, maxpool and avgpool to support 3 spatial dimensions (i.e. 6-dimensional tensors NC1DHWC0)
…s (other than the 1st one) of some layers, such as convolution or batch norm, are 'embedded' into the layer itself.
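
For readers unfamiliar with the NC1DHWC0 notation used above, here is a minimal sketch of how a plain NCDHW buffer could be repacked into a channel-blocked layout. It assumes that C1 = ceil(C / C0), that channel `c` lands in block `c // C0` at lane `c % C0`, and that unused lanes of the last block are zero-filled. The function name `pack_ncdhw_to_block` and the flat-list representation are hypothetical and are not taken from this PR; the actual code may pad or order elements differently.

```python
def pack_ncdhw_to_block(data, shape, c0):
    """Repack a flat NCDHW buffer into a flat NC1DHWC0 buffer.

    Hypothetical helper for illustration only: channels are grouped into
    blocks of c0 lanes, and the tail of the last block is zero-padded.
    """
    n, c, d, h, w = shape
    c1 = (c + c0 - 1) // c0            # number of channel blocks, rounded up
    spatial = d * h * w                # D, H, W are kept in their original order
    out = [0.0] * (n * c1 * spatial * c0)
    for ni in range(n):
        for ci in range(c):
            blk, lane = divmod(ci, c0)  # block index and position inside the block
            for s in range(spatial):
                src = (ni * c + ci) * spatial + s
                dst = ((ni * c1 + blk) * spatial + s) * c0 + lane
                out[dst] = data[src]
    return out, (n, c1, d, h, w, c0)


# Usage: 3 channels, 2 spatial positions, block size 2 (so C1 = 2, one padded lane).
packed, packed_shape = pack_ncdhw_to_block(
    [0.0, 1.0, 2.0, 3.0, 4.0, 5.0], (1, 3, 1, 1, 2), 2
)
```

With this ordering the C0 values of one spatial position are contiguous in memory, which is the property SIMD kernels typically rely on.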

* added fusion of batch norm & activation

(fusion of residual connection addition is not implemented yet)
* temporarily disabled many optimizations in order to make tests pass; they all need to be restored now
…and reasonably fast! (and yet it's fully compatible with a weird-layout shufflenet!)
* added AVX2-optimized convolution kernel
* run resnet50 in debug mode just once to save time (benchmarking does not make sense in debug mode anyway)
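
As a rough illustration of the batch norm & activation fusion mentioned in the notes above, the sketch below folds the batch norm parameters into a per-channel scale and shift and applies the activation in the same pass. It assumes inference-mode batch norm with fixed statistics and uses ReLU as the activation; the names `fold_batch_norm` and `fused_bn_relu` are hypothetical, and the fusion in this PR may be structured differently (for example, folded into the convolution weights).

```python
import math


def fold_batch_norm(gamma, beta, mean, var, eps=1e-5):
    """Fold per-channel batch norm parameters into (scale, shift).

    gamma * (x - mean) / sqrt(var + eps) + beta  ==  x * scale + shift
    """
    scale = [g / math.sqrt(v + eps) for g, v in zip(gamma, var)]
    shift = [b - m * s for b, m, s in zip(beta, mean, scale)]
    return scale, shift


def fused_bn_relu(x, scale, shift):
    """Apply the folded batch norm and ReLU in a single pass.

    x is a list of channels, each channel a flat list of values
    (an assumed representation, chosen only to keep the sketch short).
    """
    return [
        [max(0.0, v * s + t) for v in channel]
        for channel, s, t in zip(x, scale, shift)
    ]


# Usage: one channel with gamma=2, beta=1, mean=3, var=4 and eps=0,
# which gives scale=1 and shift=-2.
scale, shift = fold_batch_norm([2.0], [1.0], [3.0], [4.0], eps=0.0)
y = fused_bn_relu([[1.0, 5.0]], scale, shift)
```

Folding once up front means the per-element work is a single multiply-add plus a max, instead of a subtraction, division, multiply-add and a separate activation pass.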
vpisarev closed this Mar 2, 2026