Suppress C++ stacktrace on `XLA_CHECK*()` calls. by ysiraichi · Pull Request #9448 · pytorch/xla

ysiraichi · 2025-07-07T14:54:45Z

This PR improves error messages in PyTorch/XLA by suppressing the display of C++ stack traces during XLA check failures, making them more user-friendly. Currently, when XLA_CHECK*() fails, the resulting error output includes a lengthy and verbose C++ stacktrace. While these can be useful for deep-dive debugging by developers, they often add noise for end-users.

Key Changes:

Suppression of C++ stacktraces for all XLA_CHECK*() failures
Add source location information if XLA_SHOW_CPP_ERROR_CONTEXT=1
- Following the changes in Error Handling: refactor XlaCoordinator to use status types. #9386

Before:

Traceback (most recent call last):
  File "dot.py", line 6, in <module>
    torch.dot(a, b)
RuntimeError: torch_xla/csrc/aten_xla_bridge.cpp:110 : Check failed: xtensor
*** Begin stack trace ***
        tsl::CurrentStackTrace[abi:cxx11]()
        torch_xla::bridge::GetXlaTensor(at::Tensor const&)
        torch_xla::XLANativeFunctions::dot(at::Tensor const&, at::Tensor const&)

        c10::BoxedKernel::callBoxed(c10::OperatorHandle const&, c10::DispatchKeySet, std::vector<c10::IValue, std::allocator<c10::IValue> >*) const
        c10::KernelFunction::callBoxed(c10::OperatorHandle const&, c10::DispatchKeySet, std::vector<c10::IValue, std::allocator<c10::IValue> >*) const
        c10::Dispatcher::callBoxed(c10::OperatorHandle const&, std::vector<c10::IValue, std::allocator<c10::IValue> >*) const



        c10::BoxedKernel::callBoxed(c10::OperatorHandle const&, c10::DispatchKeySet, std::vector<c10::IValue, std::allocator<c10::IValue> >*) const


        at::_ops::dot::redispatch(c10::DispatchKeySet, at::Tensor const&, at::Tensor const&)





        at::_ops::dot::call(at::Tensor const&, at::Tensor const&)
        at::Tensor::dot(at::Tensor const&) const



        _PyObject_MakeTpCall
        _PyEval_EvalFrameDefault

        PyEval_EvalCode



        _PyRun_SimpleFileObject
        _PyRun_AnyFileObject
        Py_RunMain
        Py_BytesMain
        __libc_start_main
        _start
*** End stack trace ***
Input tensor is not an XLA tensor: torch.FloatTensor

After:

Traceback (most recent call last):
  File "dot.py", line 6, in <module>
    torch.dot(a, b)
RuntimeError: Check failed: xtensor: Input tensor is not an XLA tensor: torch.FloatTensor

zhanyong-wan

Thanks!

zhanyong-wan

Thanks!

ysiraichi requested review from ghpvnist and zhanyong-wan July 7, 2025 14:54

zhanyong-wan requested changes Jul 7, 2025

View reviewed changes

Comment thread torch_xla/csrc/runtime/tf_logging.cpp

Comment thread torch_xla/csrc/status.h Outdated

Comment thread torch_xla/csrc/runtime/debug_macros.h

Comment thread torch_xla/csrc/runtime/tf_logging.cpp

ysiraichi requested a review from zhanyong-wan July 8, 2025 18:08

zhanyong-wan approved these changes Jul 9, 2025

View reviewed changes

ysiraichi force-pushed the ysiraichi/xla-check-dont-show-cpp-stacktrace branch from f96ad24 to 7d8baa0 Compare July 9, 2025 20:32

ysiraichi added 4 commits July 10, 2025 09:07

Improve TORCH_CHECK error message.

d631126

Fix lint + add collon.

70238c3

Address reviews.

4c8ab2a

Add test_debug_macros target to BUILD.

fa8ac92

ysiraichi force-pushed the ysiraichi/xla-check-dont-show-cpp-stacktrace branch from 7d8baa0 to fa8ac92 Compare July 10, 2025 12:07

ysiraichi merged commit 5496a36 into master Jul 10, 2025
23 of 24 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Suppress C++ stacktrace on `XLA_CHECK*()` calls.#9448

Suppress C++ stacktrace on `XLA_CHECK*()` calls.#9448
ysiraichi merged 4 commits intomasterfrom
ysiraichi/xla-check-dont-show-cpp-stacktrace

ysiraichi commented Jul 7, 2025

Uh oh!

zhanyong-wan left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

zhanyong-wan left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

ysiraichi commented Jul 7, 2025

Uh oh!

zhanyong-wan left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

zhanyong-wan left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants