Skip to content

Dnn block layout v5#18

Closed
vpisarev wants to merge 23 commits into5.xfrom
dnn_block_layout_v5
Closed

Dnn block layout v5#18
vpisarev wants to merge 23 commits into5.xfrom
dnn_block_layout_v5

Conversation

@vpisarev
Copy link
Copy Markdown
Owner

@vpisarev vpisarev commented Mar 2, 2026

tmp2

vpisarev added 23 commits March 2, 2026 02:53
…t fails (probably, the reference output needs to be updated).
2. properly extended depthwise convolution, maxpool and avgpool to support 3 spatial dimensions (i.e. 6-dimensional tensors NC1DHWC0)
…s (other than the 1st one) of some layers, such as convolution or batch norm, are 'embedded' into the layer itself.

* added fusion of batch norm & activation

(fusion of residual connection addition is not implemented yet)
* temporarily disabled many optimizations in order to make tests pass — now need to restore them all
…and reasonably fast! (and yet it's fully compatible with a weird-layout shufflenet!)
* added AVX2-optimized convolution kernel
* run resnet50 in debug mode just once to save time (benchmarking does not make sense in debug mode anyway)
* (maybe temporarily) disabled graph simplifier test with resize + convolution; resize needs to be updated to support block layout.
@vpisarev vpisarev closed this Mar 2, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant