Expose an API to query the CUDA compute stream to launch a custom kernel by hariharans29 · Pull Request #9141 · microsoft/onnxruntime

hariharans29 · 2021-09-21T19:38:55Z

Description:

Description as title.

Particularly useful for the scenario where-in custom ops compiled into shared libraries need to achieve implicit synchronization with ORT's CUDA kernels
- Also useful for the "regular" custom ops scenario where custom ops are not compiled into a shared library and directly registered used the APIs. But that had a work-around of creating a session with a user-created stream and just using that stream to launch custom kernels. This work-around is much harder to achieve when custom ops are compiled into shared libraries (See Update CUDA custom op unit tests to account for recent ORT change #6971)
Currently, we only have one compute stream per-session. So, this could be a session level API. But it is kept as an API at the OrtKernelContext level to keep the design flexible enough for the case where-in (in future) sessions could have multiple streams (one per host thread). When the sessions starts maintaining one stream per host thread, the API will start returning the stream corresponding to that thread.

Motivation and Context
#7068 (comment)

minrui-hust · 2021-09-26T16:38:57Z

amazing feature, I am just looking for it!

wangyems · 2021-10-08T18:10:03Z

Working as expected on my end. Thanks Hari!

hariharans29 added 4 commits September 20, 2021 18:47

Initial commit

5ce2c86

More changes

8eb680b

a

823b0e6

Merge remote-tracking branch 'origin/master' into hari/cuda_stream_api

236246e

hariharans29 commented Sep 21, 2021

View reviewed changes

Comment thread include/onnxruntime/core/session/onnxruntime_cxx_api.h Outdated

pranavsharma reviewed Sep 22, 2021

View reviewed changes

Comment thread include/onnxruntime/core/session/onnxruntime_c_api.h

wangyems self-requested a review October 8, 2021 18:10

wangyems previously approved these changes Oct 8, 2021

View reviewed changes

Merge remote-tracking branch 'origin/master' into hari/cuda_stream_api

a1b7384

pranavsharma reviewed Nov 3, 2021

View reviewed changes

Comment thread include/onnxruntime/core/session/onnxruntime_c_api.h Outdated

hariharans29 added 2 commits November 4, 2021 12:59

Merge remote-tracking branch 'origin/master' into hari/cuda_stream_api

0aee8e8

PR feedback

e1af542

hariharans29 dismissed wangyems’s stale review via e1af542 November 4, 2021 21:06

hariharans29 added 3 commits November 4, 2021 15:30

Resolve conflicts

4d6caf1

Fix bad merge

b99e820

Merge remote-tracking branch 'origin/master' into hari/cuda_stream_api

69daf8a

pranavsharma approved these changes Nov 9, 2021

View reviewed changes

hariharans29 merged commit 65590b0 into master Nov 9, 2021

hariharans29 deleted the hari/cuda_stream_api branch November 9, 2021 05:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Expose an API to query the CUDA compute stream to launch a custom kernel#9141

Expose an API to query the CUDA compute stream to launch a custom kernel#9141
hariharans29 merged 10 commits intomasterfrom
hari/cuda_stream_api

hariharans29 commented Sep 21, 2021

Uh oh!

Uh oh!

Uh oh!

minrui-hust commented Sep 26, 2021

Uh oh!

wangyems commented Oct 8, 2021

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

hariharans29 commented Sep 21, 2021

Uh oh!

Uh oh!

Uh oh!

minrui-hust commented Sep 26, 2021

Uh oh!

wangyems commented Oct 8, 2021

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants