Implement DepthToSpace uint8_t and Enable DropQDQNodesRules #23352

yihonglyu · 2025-01-14T07:12:55Z

Description

Implemented the DepthToSpace uint8_t kernel.
Enabled DropQDQNodesRules for DepthToSpace.
Added unit tests for the DepthToSpace uint8_t kernel.
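
For readers unfamiliar with the operator, here is a minimal reference sketch of what a DepthToSpace kernel computes on uint8_t data. It is not the code from this PR: the function name `DepthToSpaceU8`, the NCHW layout, and the choice of the ONNX default `DCR` mode are assumptions made for illustration, and a production kernel would be written for throughput rather than as a plain loop nest.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical reference implementation (not the ONNX Runtime kernel).
// Rearranges an NCHW uint8_t tensor of shape [n, c, h, w] into
// [n, c / bs^2, h * bs, w * bs] using the ONNX "DCR" ordering.
std::vector<uint8_t> DepthToSpaceU8(const std::vector<uint8_t>& input,
                                    size_t n, size_t c, size_t h, size_t w,
                                    size_t blocksize) {
  assert(blocksize > 0 && c % (blocksize * blocksize) == 0);
  assert(input.size() == n * c * h * w);

  const size_t out_c = c / (blocksize * blocksize);
  const size_t out_h = h * blocksize;
  const size_t out_w = w * blocksize;
  std::vector<uint8_t> output(input.size());

  for (size_t b = 0; b < n; ++b) {
    for (size_t oc = 0; oc < out_c; ++oc) {
      for (size_t ih = 0; ih < h; ++ih) {
        for (size_t bh = 0; bh < blocksize; ++bh) {
          for (size_t iw = 0; iw < w; ++iw) {
            for (size_t bw = 0; bw < blocksize; ++bw) {
              // DCR: the block offset (bh, bw) selects the high part of the
              // input channel index, the output channel the low part.
              const size_t ic = (bh * blocksize + bw) * out_c + oc;
              const size_t src = ((b * c + ic) * h + ih) * w + iw;
              const size_t dst =
                  ((b * out_c + oc) * out_h + ih * blocksize + bh) * out_w +
                  iw * blocksize + bw;
              output[dst] = input[src];  // pure data movement, no arithmetic
            }
          }
        }
      }
    }
  }
  return output;
}
```

Because every output element is a copy of exactly one input element, the same loop structure works for any element type; only the element width changes.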

Motivation and Context

This change improves the performance of the Image Super-Resolution INT8 model (RFDN): its inferences per second (IPS) increase by 25%. With a uint8_t kernel available, the DQ and Q nodes around DepthToSpace (DQ -> DepthToSpace -> Q) can be dropped, so the operator runs directly on the quantized tensor.
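
The reason a DQ -> DepthToSpace -> Q group can be collapsed can be sketched as follows. DepthToSpace only moves elements, so dropping the pair is safe whenever quantizing a dequantized value returns the original value. The helpers below (`Dequantize`, `Quantize`, `RoundTripIsIdentity`) are hypothetical names, not ONNX Runtime functions, and the sketch assumes per-tensor uint8 quantization where the DQ and Q nodes share the same scale and zero point.

```cpp
#include <algorithm>
#include <cmath>
#include <cstdint>

// Hypothetical helper: per-tensor linear dequantization of one uint8 value.
float Dequantize(uint8_t q, float scale, uint8_t zero_point) {
  return static_cast<float>(static_cast<int>(q) - static_cast<int>(zero_point)) * scale;
}

// Hypothetical helper: per-tensor linear quantization with saturation to [0, 255].
uint8_t Quantize(float x, float scale, uint8_t zero_point) {
  const float v = std::nearbyint(x / scale) + static_cast<float>(zero_point);
  return static_cast<uint8_t>(std::min(255.0f, std::max(0.0f, v)));
}

// Checks that Q(DQ(q)) == q for every uint8 value when DQ and Q use the
// same scale and zero point. If this holds, an op that only permutes
// elements gives the same result with or without the surrounding DQ/Q pair.
bool RoundTripIsIdentity(float scale, uint8_t zero_point) {
  for (int q = 0; q <= 255; ++q) {
    const uint8_t u = static_cast<uint8_t>(q);
    if (Quantize(Dequantize(u, scale, zero_point), scale, zero_point) != u) {
      return false;
    }
  }
  return true;
}
```

If the DQ and Q parameters differ, the round trip is no longer an identity and the nodes cannot simply be removed.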

Copilot

Copilot reviewed 1 out of 5 changed files in this pull request and generated no comments.

Files not reviewed (4)

onnxruntime/core/optimizer/qdq_transformer/selectors_actions/qdq_selector_action_transformer.cc: Language not supported
onnxruntime/core/providers/cpu/tensor/space_depth_ops.cc: Language not supported
onnxruntime/test/providers/cpu/tensor/space_depth_ops_test.cc: Language not supported
onnxruntime/test/providers/provider_test_utils.h: Language not supported

yihonglyu added 5 commits January 9, 2025 22:31

Implement DepthToSpace uint8_t and Enable DropQDQNodesRules

e4be9a3

- Implemented the DepthToSpace uint8_t kernel.
- Enabled DropQDQNodesRules for DepthToSpace.
- Added unit tests for the DepthToSpace uint8_t kernel.

Add uint8_t support for DepthToSpace 11

84b7b3e

Regenerate docs/OperatorKernels.md

9d6ba50

Fix Lint/Python format error (CLANGFORMAT)

c36dea7

Do not drop DQ and Q (16 bit) in DQ -> DepthToSpace -> Q

8a6e540

yihonglyu requested a review from Copilot January 14, 2025 07:12

Copilot AI reviewed Jan 14, 2025

yihonglyu requested review from a team, fajin-corp, hariharans29, jchen351 and justinchuby January 15, 2025 23:17

jchen351 approved these changes Jan 15, 2025

fajin-corp approved these changes Jan 16, 2025

yihonglyu merged commit e51bcfb into main Jan 16, 2025
119 checks passed

yihonglyu deleted the yilyu/depth-space-u8 branch January 16, 2025 03:24
