[nnc] Make our exceptions c10::Errors, get C++ stacktraces by bertmaher · Pull Request #64332 · pytorch/pytorch

bertmaher · 2021-09-01T02:04:33Z

Stack from ghstack:

-> [nnc] Make our exceptions c10::Errors, get C++ stacktraces #64332

With this diff, if a compiler bug occurs (unlikely, I know!) we'll be able to get a c++ stacktrace leading to the exception, rather than just a terse message. E.g.,

RuntimeError: UNSUPPORTED DTYPE
Exception raised from compilation_error at ../torch/csrc/jit/tensorexpr/exceptions.h:32 (most recent call first):
frame #0: c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >) + 0x6b (0x7f966659b2eb in /fsx/users/bertrand/c\
onda/envs/pytorch/lib/python3.8/site-packages/torch/lib/libc10.so)
frame #1: <unknown function> + 0x376f099 (0x7f966a195099 in /fsx/users/bertrand/conda/envs/pytorch/lib/python3.8/site-packages/torch/lib/libtorch_cuda.so)
frame #2: <unknown function> + 0x3763bf5 (0x7f966a189bf5 in /fsx/users/bertrand/conda/envs/pytorch/lib/python3.8/site-packages/torch/lib/libtorch_cuda.so)
frame #3: torch::jit::tensorexpr::CudaCodeGen::Initialize() + 0xdd8 (0x7f966a193368 in /fsx/users/bertrand/conda/envs/pytorch/lib/python3.8/site-packages/torch/lib/libtorch_cuda\
.so)

Differential Revision: D30745610

[ghstack-poisoned]

facebook-github-bot · 2021-09-01T02:04:38Z

🔗 Helpful links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/64332

💊 CI failures summary and remediations

As of commit 6bee54c (more details on the Dr. CI page):

1/1 failures introduced in this PR

1 failure not recognized by patterns:

Job	Step	Action
^{Lint / shellcheck}	^{Assert that regenerating the workflows didn't change them}	🔁 rerun

This comment was automatically generated by Dr. CI (expand for details).

Follow this link to opt-out of these comments for your Pull Requests.

Please report bugs/suggestions to the (internal) Dr. CI Users group.

Click here to manually regenerate this comment.

ghstack-source-id: 23982ef Pull Request resolved: #64332

ZolotukhinM

Awesome! You probably need to update this place in vectorizer to fix the failing test:

pytorch/torch/csrc/jit/tensorexpr/loopnest.cpp

Line 479 in 6bb4b5d

} catch (std::runtime_error& e) {

navahgar · 2021-09-01T04:27:16Z

This is awesome. Great work. I was planning to look into this. Thanks for figuring this out.

navahgar · 2021-09-01T04:28:25Z

I'm going to wait for this to land to add buildErrorMessage to them.

navahgar · 2021-09-01T04:31:32Z

torch/csrc/jit/tensorexpr/exceptions.h

+                __FILE__,
+                static_cast<uint32_t>(__LINE__),
+            },
+            err) {}


Actually, now that you have made this a single base class from which all the classes below derive, if you can replace err with buildErrorMessage(err), that will take care of adding the fuser turn off info as well.

With this diff, if a compiler bug occurs (unlikely, I know!) we'll be able to get a c++ stacktrace leading to the exception, rather than just a terse message. E.g., ``` RuntimeError: UNSUPPORTED DTYPE Exception raised from compilation_error at ../torch/csrc/jit/tensorexpr/exceptions.h:32 (most recent call first): frame #0: c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >) + 0x6b (0x7f966659b2eb in /fsx/users/bertrand/c\ onda/envs/pytorch/lib/python3.8/site-packages/torch/lib/libc10.so) frame #1: <unknown function> + 0x376f099 (0x7f966a195099 in /fsx/users/bertrand/conda/envs/pytorch/lib/python3.8/site-packages/torch/lib/libtorch_cuda.so) frame #2: <unknown function> + 0x3763bf5 (0x7f966a189bf5 in /fsx/users/bertrand/conda/envs/pytorch/lib/python3.8/site-packages/torch/lib/libtorch_cuda.so) frame #3: torch::jit::tensorexpr::CudaCodeGen::Initialize() + 0xdd8 (0x7f966a193368 in /fsx/users/bertrand/conda/envs/pytorch/lib/python3.8/site-packages/torch/lib/libtorch_cuda\ .so) ``` [ghstack-poisoned]

bertmaher · 2021-09-03T00:44:53Z

@bertmaher has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

With this diff, if a compiler bug occurs (unlikely, I know!) we'll be able to get a c++ stacktrace leading to the exception, rather than just a terse message. E.g., ``` RuntimeError: UNSUPPORTED DTYPE Exception raised from compilation_error at ../torch/csrc/jit/tensorexpr/exceptions.h:32 (most recent call first): frame #0: c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >) + 0x6b (0x7f966659b2eb in /fsx/users/bertrand/c\ onda/envs/pytorch/lib/python3.8/site-packages/torch/lib/libc10.so) frame #1: <unknown function> + 0x376f099 (0x7f966a195099 in /fsx/users/bertrand/conda/envs/pytorch/lib/python3.8/site-packages/torch/lib/libtorch_cuda.so) frame #2: <unknown function> + 0x3763bf5 (0x7f966a189bf5 in /fsx/users/bertrand/conda/envs/pytorch/lib/python3.8/site-packages/torch/lib/libtorch_cuda.so) frame #3: torch::jit::tensorexpr::CudaCodeGen::Initialize() + 0xdd8 (0x7f966a193368 in /fsx/users/bertrand/conda/envs/pytorch/lib/python3.8/site-packages/torch/lib/libtorch_cuda\ .so) ``` Differential Revision: [D30745610](https://our.internmc.facebook.com/intern/diff/D30745610) [ghstack-poisoned]

ghstack-source-id: c4a36dd Pull Request resolved: #64332

bertmaher · 2021-09-03T01:22:08Z

@bertmaher has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

facebook-github-bot · 2021-09-05T03:33:13Z

@bertmaher merged this pull request in 18b2751.

facebook-github-bot · 2021-09-05T23:09:27Z

This pull request has been reverted by bcc7e82.

This was landed as pytorch#64332 but got reverted due to bad compilation performance, which in turn was due to a bug in vectorized that has already been fixed. This reverts commit bcc7e82.

[nnc] Make our exceptions c10::Errors, get C++ stacktraces

fa07757

[ghstack-poisoned]

facebook-github-bot added oncall: jit Add this issue/PR to JIT oncall triage queue cla signed labels Sep 1, 2021

bertmaher added a commit that referenced this pull request Sep 1, 2021

[nnc] Make our exceptions c10::Errors, get C++ stacktraces

22f80a9

ghstack-source-id: 23982ef Pull Request resolved: #64332

bertmaher requested review from ZolotukhinM, huiguoo and navahgar September 1, 2021 02:04

ZolotukhinM approved these changes Sep 1, 2021

View reviewed changes

navahgar approved these changes Sep 1, 2021

View reviewed changes

bertmaher added a commit that referenced this pull request Sep 3, 2021

[nnc] Make our exceptions c10::Errors, get C++ stacktraces

f55faf5

ghstack-source-id: c4a36dd Pull Request resolved: #64332

facebook-github-bot closed this in 18b2751 Sep 5, 2021

facebook-github-bot added the Merged label Sep 5, 2021

facebook-github-bot added the Reverted label Sep 5, 2021

facebook-github-bot deleted the gh/bertmaher/173/head branch September 8, 2021 14:19

navahgar mentioned this pull request Feb 3, 2022

[nnc] Make our exceptions c10::Errors, get C++ stacktraces #72300

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[nnc] Make our exceptions c10::Errors, get C++ stacktraces#64332

[nnc] Make our exceptions c10::Errors, get C++ stacktraces#64332
bertmaher wants to merge 4 commits intogh/bertmaher/173/basefrom
gh/bertmaher/173/head

bertmaher commented Sep 1, 2021 •

edited

Loading

Uh oh!

facebook-github-bot commented Sep 1, 2021 •

edited

Loading

Uh oh!

ZolotukhinM left a comment

Uh oh!

navahgar commented Sep 1, 2021

Uh oh!

navahgar commented Sep 1, 2021

Uh oh!

navahgar Sep 1, 2021

Uh oh!

bertmaher commented Sep 3, 2021

Uh oh!

bertmaher commented Sep 3, 2021

Uh oh!

facebook-github-bot commented Sep 5, 2021

Uh oh!

facebook-github-bot commented Sep 5, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

bertmaher commented Sep 1, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

facebook-github-bot commented Sep 1, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful links

💊 CI failures summary and remediations

1 failure not recognized by patterns:

Uh oh!

ZolotukhinM left a comment

Choose a reason for hiding this comment

Uh oh!

navahgar commented Sep 1, 2021

Uh oh!

navahgar commented Sep 1, 2021

Uh oh!

navahgar Sep 1, 2021

Choose a reason for hiding this comment

Uh oh!

bertmaher commented Sep 3, 2021

Uh oh!

bertmaher commented Sep 3, 2021

Uh oh!

facebook-github-bot commented Sep 5, 2021

Uh oh!

facebook-github-bot commented Sep 5, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

bertmaher commented Sep 1, 2021 •

edited

Loading

facebook-github-bot commented Sep 1, 2021 •

edited

Loading