[WebNN EP] Optimize model partitioning #23332

peishenyan · 2025-01-13T02:55:59Z

Description

The old GetCapability function of WebNN EP is just a very simple search for groups of nodes that can be handled. This doesn't work well in the following example graph, where A and D could be handled by the EP, but B is between them in the topological order, as you get two single node capabilities. However, it may also be advantageous if C and E could be handled by the EP, since they would be combined with D even though they are not connected.

    A  B  C
    | /   |
    D     E
    |     |

Therefore, we improve partitioning results by reusing utils::CreateSupportedPartitions, which walks the edges for each node that the EP can handle as they are iterated in topological order. This would guarantee that all connected nodes that can be handled are grouped together. Correspondingly, we modify the webnn::GetSupportedNodes function to return the supported nodes instead of the group of supported partitions.

Motivation and Context

onnxruntime/core/providers/webnn/webnn_execution_provider.cc

peishenyan · 2025-01-13T08:33:57Z

@fdwr, PTAL, thanks!

fdwr

Interesting - I didn't consider disconnected graph chunks to be considered potentially the same partition. I am good with it if Wanming is.

onnxruntime/core/providers/webnn/builders/helper.cc

fdwr · 2025-01-14T03:19:31Z

/azp run ONNX Runtime Web CI Pipeline,Windows GPU CI Pipeline,Linux Android Emulator QNN CI Pipeline

fdwr · 2025-01-14T03:19:34Z

/azp run Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,Windows ARM64 QNN CI Pipeline,Windows CPU CI Pipeline

fdwr · 2025-01-14T03:19:38Z

/azp run Windows GPU CUDA CI Pipeline,Windows GPU DML CI Pipeline,Windows GPU Doc Gen CI Pipeline

fdwr · 2025-01-14T03:19:40Z

/azp run Windows GPU TensorRT CI Pipeline,onnxruntime-binary-size-checks-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,Windows x64 QNN CI Pipeline,Big Models

azure-pipelines · 2025-01-14T03:19:46Z

Azure Pipelines successfully started running 2 pipeline(s).

azure-pipelines · 2025-01-14T03:19:52Z

Azure Pipelines successfully started running 3 pipeline(s).

azure-pipelines · 2025-01-14T03:19:59Z

Azure Pipelines successfully started running 4 pipeline(s).

azure-pipelines · 2025-01-14T03:20:10Z

Azure Pipelines successfully started running 9 pipeline(s).

Honry

LGTM % a nit.

onnxruntime/core/providers/webnn/builders/helper.h

fdwr · 2025-01-14T23:30:18Z

/azp run ONNX Runtime Web CI Pipeline,Windows GPU CI Pipeline,Linux Android Emulator QNN CI Pipeline

fdwr · 2025-01-14T23:30:21Z

/azp run Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,Windows ARM64 QNN CI Pipeline,Windows CPU CI Pipeline

fdwr · 2025-01-14T23:30:24Z

/azp run Windows GPU CUDA CI Pipeline,Windows GPU DML CI Pipeline,Windows GPU Doc Gen CI Pipeline,Win_TRT_Minimal_CUDA_Test_CI

fdwr · 2025-01-14T23:30:25Z

/azp run Windows GPU TensorRT CI Pipeline,onnxruntime-binary-size-checks-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,Windows x64 QNN CI Pipeline,Big Models

azure-pipelines · 2025-01-14T23:30:35Z

Azure Pipelines successfully started running 2 pipeline(s).

azure-pipelines · 2025-01-14T23:30:42Z

Azure Pipelines successfully started running 4 pipeline(s).

azure-pipelines · 2025-01-14T23:30:47Z

Azure Pipelines successfully started running 4 pipeline(s).

azure-pipelines · 2025-01-14T23:31:01Z

Azure Pipelines successfully started running 9 pipeline(s).

fdwr · 2025-01-16T00:11:14Z

Merge conflicts :/.

onnxruntime/core/providers/webnn/builders/helper.h

The old GetCapability function of WebNN EP is just a very simple search for groups of nodes that can be handled. This doesn't work well in the following example graph: A B | | \|/ \|/ C -> D This graph topological order is A, B, C, D, and WebNN EP supports only A and C. In the past, the partitioning result is {A}, {B}, {C}, {D}, four partitions. But the optimized result is {A, C} and {B, D}. Therefore, we improve partitioning results by reusing utils::CreateSupportedPartitions, which walks the edges for each node that the EP can handle as they are iterated in topological order. This would guarantee that all connected nodes that can be handled are grouped together. Correspondingly, we modify the webnn::GetSupportedNodes function to return the supported nodes instead of the group of supported partitions. Update onnxruntime/core/providers/webnn/builders/helper.cc Co-authored-by: Dwayne Robinson <fdwr@hotmail.com>

peishenyan · 2025-01-16T14:22:32Z

Oh...🤦‍ Just rebased code. Please help to re-trigger the tests. @fdwr, thanks.

fdwr · 2025-01-16T18:09:06Z

/azp run ONNX Runtime Web CI Pipeline,Windows GPU CI Pipeline,Linux Android Emulator QNN CI Pipeline

fdwr · 2025-01-16T18:09:08Z

/azp run Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,Windows ARM64 QNN CI Pipeline,Windows CPU CI Pipeline

fdwr · 2025-01-16T18:09:10Z

/azp run Windows GPU CUDA CI Pipeline,Windows GPU DML CI Pipeline,Windows GPU Doc Gen CI Pipeline,Win_TRT_Minimal_CUDA_Test_CI

fdwr · 2025-01-16T18:09:12Z

/azp run Windows GPU TensorRT CI Pipeline,onnxruntime-binary-size-checks-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,Windows x64 QNN CI Pipeline,Big Models

azure-pipelines · 2025-01-16T18:09:21Z

Azure Pipelines successfully started running 2 pipeline(s).

azure-pipelines · 2025-01-16T18:09:28Z

Azure Pipelines successfully started running 4 pipeline(s).

azure-pipelines · 2025-01-16T18:09:32Z

Azure Pipelines successfully started running 4 pipeline(s).

azure-pipelines · 2025-01-16T18:09:48Z

Azure Pipelines successfully started running 9 pipeline(s).

fdwr

✅ Reapproved.

### Description  The old `GetCapability` function of WebNN EP is just a very simple search for groups of nodes that can be handled. This doesn't work well in the following example graph, where A and D could be handled by the EP, but B is between them in the topological order, as you get two single node capabilities. However, it may also be advantageous if C and E could be handled by the EP, since they would be combined with D even though they are not connected. ``` A B C | / | D E | | ``` Therefore, we improve partitioning results by reusing `utils::CreateSupportedPartitions`, which walks the edges for each node that the EP can handle as they are iterated in topological order. This would guarantee that all connected nodes that can be handled are grouped together. Correspondingly, we modify the `webnn::GetSupportedNodes` function to return the supported nodes instead of the group of supported partitions. ### Motivation and Context  Co-authored-by: Dwayne Robinson <fdwr@hotmail.com>

peishenyan force-pushed the webnn_partition branch from 4dc1715 to 2558060 Compare January 13, 2025 06:54

Honry reviewed Jan 13, 2025

View reviewed changes

onnxruntime/core/providers/webnn/webnn_execution_provider.cc Show resolved Hide resolved

Honry reviewed Jan 13, 2025

View reviewed changes

onnxruntime/core/providers/webnn/webnn_execution_provider.cc Outdated Show resolved Hide resolved

Honry reviewed Jan 13, 2025

View reviewed changes

onnxruntime/core/providers/webnn/webnn_execution_provider.cc Show resolved Hide resolved

fdwr previously approved these changes Jan 14, 2025

View reviewed changes

onnxruntime/core/providers/webnn/builders/helper.cc Outdated Show resolved Hide resolved

onnxruntime/core/providers/webnn/builders/helper.cc Show resolved Hide resolved

peishenyan dismissed fdwr’s stale review via 7b8f570 January 14, 2025 07:08

Honry reviewed Jan 14, 2025

View reviewed changes

onnxruntime/core/providers/webnn/builders/helper.h Outdated Show resolved Hide resolved

peishenyan force-pushed the webnn_partition branch from 7b8f570 to 03df4f3 Compare January 14, 2025 08:06

guschmue added the ep:WebNN WebNN execution provider label Jan 14, 2025

fdwr previously approved these changes Jan 14, 2025

View reviewed changes

peishenyan dismissed fdwr’s stale review via 852690d January 16, 2025 14:08

peishenyan force-pushed the webnn_partition branch from 03df4f3 to 852690d Compare January 16, 2025 14:08

fdwr approved these changes Jan 16, 2025

View reviewed changes

fdwr merged commit 80f686e into microsoft:main Jan 16, 2025
76 checks passed

[WebNN EP] Optimize model partitioning #23332

[WebNN EP] Optimize model partitioning #23332

Uh oh!

Conversation

peishenyan commented Jan 13, 2025 • edited by fdwr Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Motivation and Context

Uh oh!

Uh oh!

Uh oh!

Uh oh!

peishenyan commented Jan 13, 2025

Uh oh!

fdwr left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

fdwr commented Jan 14, 2025

Uh oh!

fdwr commented Jan 14, 2025

Uh oh!

fdwr commented Jan 14, 2025

Uh oh!

fdwr commented Jan 14, 2025

Uh oh!

azure-pipelines bot commented Jan 14, 2025

Uh oh!

azure-pipelines bot commented Jan 14, 2025

Uh oh!

azure-pipelines bot commented Jan 14, 2025

Uh oh!

azure-pipelines bot commented Jan 14, 2025

Uh oh!

Honry left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

fdwr commented Jan 14, 2025

Uh oh!

fdwr commented Jan 14, 2025

Uh oh!

fdwr commented Jan 14, 2025

Uh oh!

fdwr commented Jan 14, 2025

Uh oh!

azure-pipelines bot commented Jan 14, 2025

Uh oh!

azure-pipelines bot commented Jan 14, 2025

Uh oh!

azure-pipelines bot commented Jan 14, 2025

Uh oh!

azure-pipelines bot commented Jan 14, 2025

Uh oh!

fdwr commented Jan 16, 2025

Uh oh!

peishenyan commented Jan 16, 2025

Uh oh!

fdwr commented Jan 16, 2025

Uh oh!

fdwr commented Jan 16, 2025

Uh oh!

fdwr commented Jan 16, 2025

Uh oh!

fdwr commented Jan 16, 2025

Uh oh!

azure-pipelines bot commented Jan 16, 2025

Uh oh!

azure-pipelines bot commented Jan 16, 2025

Uh oh!

azure-pipelines bot commented Jan 16, 2025

Uh oh!

azure-pipelines bot commented Jan 16, 2025

Uh oh!

fdwr left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

peishenyan commented Jan 13, 2025 •

edited by fdwr

Loading