GH-37164: [Python] Attach Python stacktrace to errors in `ConvertPyError`#39380

Merged

pitrou merged 6 commits intoapache:mainfrom

wjones127:37164-rbr-traceback

Jan 11, 2024

Member

wjones127 commented Dec 27, 2023 •

edited by github-actions bot

Loading

Rationale for this change

Users might define Python generators that are used in RecordBatchReaders and then exported through the C Data Interface. However, if an error occurs in their generator, the stacktrace and message are currently swallowed in the current ConvertPyError implementation, which only provides the type of error. This makes debugging code that passed RBRs difficult.

What changes are included in this PR?

Changes ConvertPyError to provide the fully formatted traceback in the error message.

Are these changes tested?

Yes, added one test to validate the errors messages are propagated.

Are there any user-facing changes?

This is a minor change in the error reporting behavior, which will provide more information.

Closes: [Python] Attach Python stacktrace to errors in ConvertPyError #37164


          feat: format exceptions in RBR export

6acd2f8

github-actions bot added Component: Python awaiting committer review labels

wjones127 added 2 commits

December 26, 2023 17:19


          handle errors

ed3eba3


          fix other test

17ba2bc

wjones127 marked this pull request as ready for review

December 27, 2023 02:34

wjones127 requested a review from AlenkaF

December 27, 2023 02:35

AlenkaF reviewed

View reviewed changes

python/pyarrow/src/arrow/python/common.cc Show resolved Hide resolved

github-actions bot added awaiting changes and removed awaiting committer review labels

pitrou requested changes

View reviewed changes

python/pyarrow/src/arrow/python/common.cc

+                                                           &fmt_exception));
+                  OwnedRef formatted;
+                  formatted.reset(PyObject_CallFunctionObjArgs(fmt_exception.obj(), exc_type_.obj(),

Member

pitrou Jan 9, 2024

You need to check for errors here.

Member Author

wjones127 Jan 11, 2024

Thanks. I've added error checks to each of the calls into the Python API.

python/pyarrow/src/arrow/python/common.cc Outdated

+                  std::stringstream ss;
+                  ss << "Python exception: ";
+                  Py_ssize_t num_lines = PyList_GET_SIZE(formatted.obj());

Member

pitrou Jan 9, 2024

You should probably check that we do have a list here. Or you can simply use PySequence_Length and PySequence_GetItem (this code is not performance-critical).

Member Author

wjones127 Jan 11, 2024

The API docs claim this should return a list, but it's perhaps possible that they might change to some other sequence. I changed to be generic sequences.

I assume the function calls will do the type checking internally, correct?

Member

pitrou Jan 11, 2024 •

edited

Loading

I assume the function calls will do the type checking internally, correct?

If you're using the concrete PyList APIs, they will generally assume that it is a list object (there is no hard rule unfortunately). Also, fast access macros such as PyList_GET_ITEM don't do any error checking.

Member Author

wjones127 Jan 11, 2024

Ah right. I see the distinction now between the error checking and non-error-checking variants. 👍

python/pyarrow/src/arrow/python/common.cc Outdated Show resolved Hide resolved

python/pyarrow/tests/test_cffi.py Show resolved Hide resolved

python/pyarrow/tests/test_cffi.py Show resolved Hide resolved


          handle errors

c539894

github-actions bot added awaiting change review awaiting changes and removed awaiting changes awaiting change review labels


          use sequence

f1d7f01

github-actions bot added awaiting change review awaiting changes and removed awaiting changes awaiting change review labels

pitrou reviewed

View reviewed changes

python/pyarrow/src/arrow/python/common.cc

+                  std::stringstream ss;
+                  ss << "Python exception: ";
+                  Py_ssize_t num_lines = PySequence_Length(formatted.obj());

Member

pitrou Jan 11, 2024

You should also check for failure here (-1 is returned on error).

Member Author

wjones127 Jan 11, 2024

Thanks, I should have seen that 🤦


          add another error check

a8a74f2

github-actions bot added awaiting change review awaiting changes and removed awaiting changes awaiting change review labels

pitrou approved these changes

View reviewed changes

pitrou merged commit 6fe7480 into apache:main

pitrou removed the awaiting changes label

github-actions bot added the awaiting committer review label

conbench-apache-arrow bot commented Jan 13, 2024

After merging your PR, Conbench analyzed the 6 benchmarking runs that have been run so far on merge-commit 6fe7480.

There were no benchmark performance regressions. 🎉

The full Conbench report has more details. It also includes information about 1 possible false positive for unstable benchmarks that are known to sometimes produce them.

dgreiss pushed a commit to dgreiss/arrow that referenced this pull request


          apacheGH-37164: [Python] Attach Python stacktrace to errors in `Conve…

a88a3a0

…rtPyError` (apache#39380)

### Rationale for this change

Users might define Python generators that are used in RecordBatchReaders and then exported through the C Data Interface. However, if an error occurs in their generator, the stacktrace and message are currently swallowed in the current `ConvertPyError` implementation, which only provides the type of error. This makes debugging code that passed RBRs difficult.

### What changes are included in this PR?

Changes `ConvertPyError` to provide the fully formatted traceback in the error message.

### Are these changes tested?

Yes, added one test to validate the errors messages are propagated.

### Are there any user-facing changes?

This is a minor change in the error reporting behavior, which will provide more information.
* Closes: apache#37164

Authored-by: Will Jones <willjones127@gmail.com>
Signed-off-by: Antoine Pitrou <antoine@python.org>

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

awaiting committer review Component: Python