use new cuda kernel launch code in nvprof parsing#35016
use new cuda kernel launch code in nvprof parsing#35016xwang233 wants to merge 1 commit intopytorch:masterfrom
Conversation
|
That's cool. I guess we can't easily run tests that require nvprof in our CI, but can you guys add a test in yours? Also, what's the plan for when nvprof is no longer supported, and there's only nsight systems? |
💊 CircleCI build failures summary and remediationsAs of commit 516687b (more details on the Dr. CI page): ✅ None of the build failures appear to be your fault 💚
🚧 4 upstream failures:These were probably caused by upstream breakages:
This comment was automatically generated by Dr. CI (expand for details).Follow this link to opt-out of these comments for your Pull Requests.Please report bugs/suggestions on the GitHub issue tracker. This comment has been revised 4 times. |
|
This seems pretty harmless to accept. |
facebook-github-bot
left a comment
There was a problem hiding this comment.
@ezyang is landing this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
|
@ngimel Xiao has written the test and we'll add it to our CI. |
Summary: This PR would fix pytorch#33986. The meaning of cbid 13 and 211 can be found at here https://github.com/ezyang/nvprof2json/blob/837c094852c9c5164344db7c19432da37d9a8b09/nvprof2json.py#L238 https://github.com/ezyang/nvprof2json/blob/837c094852c9c5164344db7c19432da37d9a8b09/nvprof2json.py#L436 or it can also be found in the header file at `/usr/local/cuda/extras/CUPTI/include/cupti_runtime_cbid.h`. Please also check [this at stackoverflow](https://stackoverflow.com/questions/48552390/whats-the-difference-between-launching-with-an-api-call-vs-the-triple-chevron-s). I also executed the profiling code (in the issue) on CUDA 9.2, and the cbid is already changed to 211. Just in case someone would build pytorch against older CUDA versions, I leave both 13 and 211 in the assertion. cc csarofeen ptrblck ezyang ngimel Pull Request resolved: pytorch#35016 Differential Revision: D20550879 Pulled By: ezyang fbshipit-source-id: 968efc5e1126f1dd31acc9f5f4463f351d8a4c4f
This PR would fix #33986.
The meaning of cbid 13 and 211 can be found at here
https://github.com/ezyang/nvprof2json/blob/837c094852c9c5164344db7c19432da37d9a8b09/nvprof2json.py#L238
https://github.com/ezyang/nvprof2json/blob/837c094852c9c5164344db7c19432da37d9a8b09/nvprof2json.py#L436
or it can also be found in the header file at
/usr/local/cuda/extras/CUPTI/include/cupti_runtime_cbid.h.Please also check this at stackoverflow. I also executed the profiling code (in the issue) on CUDA 9.2, and the cbid is already changed to 211. Just in case someone would build pytorch against older CUDA versions, I leave both 13 and 211 in the assertion.
cc @csarofeen @ptrblck @ezyang @ngimel