Lazy load driver APIs using cudaGetDriverEntryPoint #4197
!test
Thank you for the fix! Why applying it only to |
It's no longer needed after #4197
Fixes #154073 Reference: NVIDIA/Fuser#4197 See PR #154097 @nWEIdia is currently out of the office, so I’ve temporarily taken over his work. Pull Request resolved: #156097 Approved by: https://github.com/ngimel, https://github.com/cyyever Co-authored-by: Wei Wang <weiwan@nvidia.com>
Fixes #154073 Reference: NVIDIA/Fuser#4197 See PR #154097 @nWEIdia is currently out of the office, so I’ve temporarily taken over his work. Pull Request resolved: #156097 Approved by: https://github.com/ngimel Co-authored-by: Wei Wang <weiwan@nvidia.com>
Fixes #154073 Reference: NVIDIA/Fuser#4197 See PR #154097 @nWEIdia is currently out of the office, so I’ve temporarily taken over his work. Pull Request resolved: #156097 Approved by: https://github.com/syed-ahmed, https://github.com/wujingyue, https://github.com/atalman Co-authored-by: Wei Wang <weiwan@nvidia.com>
Reopen #156097 Fixes #154073 Reference: NVIDIA/Fuser#4197 See PR #156097 and #154097 Pull Request resolved: #158295 Approved by: https://github.com/Skylion007, https://github.com/ngimel, https://github.com/eqy, https://github.com/huydhn Co-authored-by: Wei Wang <weiwan@nvidia.com>
Reopen #156097 Fixes #154073 Reference: NVIDIA/Fuser#4197 See PR #156097 and #154097 Pull Request resolved: #158295 Approved by: https://github.com/Skylion007, https://github.com/ngimel, https://github.com/eqy, https://github.com/huydhn Co-authored-by: Wei Wang <weiwan@nvidia.com> (cherry picked from commit a9f902a)
[CUDA] Use runtime driver API for cuStreamWriteValue32 (#158295) Reopen #156097 Fixes #154073 Reference: NVIDIA/Fuser#4197 See PR #156097 and #154097 Pull Request resolved: #158295 Approved by: https://github.com/Skylion007, https://github.com/ngimel, https://github.com/eqy, https://github.com/huydhn (cherry picked from commit a9f902a) Co-authored-by: Frank Lin <eee4017@gmail.com> Co-authored-by: Wei Wang <weiwan@nvidia.com>
[CUDA] Use runtime driver API for cuStreamWriteValue32 (pytorch#158295) Reopen pytorch#156097 Fixes pytorch#154073 Reference: NVIDIA/Fuser#4197 See PR pytorch#156097 and pytorch#154097 Pull Request resolved: pytorch#158295 Approved by: https://github.com/Skylion007, https://github.com/ngimel, https://github.com/eqy, https://github.com/huydhn (cherry picked from commit a9f902a) Co-authored-by: Frank Lin <eee4017@gmail.com> Co-authored-by: Wei Wang <weiwan@nvidia.com>
This appears to be more robust than #4196 because it doesn't hard-code the version.
Fixes #3907
cc @samnordmann
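For readers unfamiliar with the pattern, here is a minimal sketch of lazy loading a driver entry point: the function pointer is resolved on the first call and cached afterwards. It is not the code from this PR. So that it runs without a CUDA installation, `resolve_symbol`, `fake_write_value`, and `lazyWriteValue` are hypothetical stand-ins; in a real build the resolver step is where `cudaGetDriverEntryPoint` would be called to look up a driver function such as `cuStreamWriteValue32`.

```cpp
#include <cstdint>
#include <stdexcept>
#include <string>

// Hypothetical signature standing in for a driver API function.
using WriteValueFn = int (*)(std::uint32_t* dst, std::uint32_t value);

// Hypothetical "driver" function; returning 0 plays the role of a success code.
int fake_write_value(std::uint32_t* dst, std::uint32_t value) {
  *dst = value;
  return 0;
}

// Counts resolver invocations so the lazy, one-time lookup is observable.
int g_resolve_calls = 0;

// Hypothetical resolver. Assumption: a real implementation would ask the
// CUDA runtime for the entry point here instead of consulting a fixed name.
void* resolve_symbol(const std::string& name) {
  ++g_resolve_calls;
  if (name == "fakeWriteValue") {
    return reinterpret_cast<void*>(&fake_write_value);
  }
  return nullptr;
}

// Wrapper with the same shape as the wrapped function. The function-local
// static is initialized on the first call only, so nothing is resolved at
// program start-up and later calls reuse the cached pointer.
int lazyWriteValue(std::uint32_t* dst, std::uint32_t value) {
  static WriteValueFn fn = [] {
    void* ptr = resolve_symbol("fakeWriteValue");
    if (ptr == nullptr) {
      throw std::runtime_error("failed to resolve fakeWriteValue");
    }
    return reinterpret_cast<WriteValueFn>(ptr);
  }();
  return fn(dst, value);
}
```

Because the lookup happens inside the wrapper, a process that never calls the wrapped function never touches the resolver, and the function-local static makes the first-call initialization thread-safe in C++11 and later.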