Throw error on tensor creation when sequence shape cannot be determined by sethah · Pull Request #7583 · pytorch/pytorch

sethah · 2018-05-15T17:11:31Z

Currently, tensors can be created from Python sequences (determined by PySequence_Check). The shape of the tensor to be created is determined by iterating over the first element in each of the (potentially nested) sequences. This is done here.

There is an assumption that it is safe to index the PyObect at element zero if PySequence_Check(obj) is true and PySequence_Length(obj) > 0. Unfortunately, Python objects are still free to raise errors in their __getitem__ methods under these conditions, which is often the case when creating tensors from Pandas objects. In this case, PySequence_GetItem will return a null pointer, which in turn causes a segmentation fault when the next PySequence_Check call is made.

This patch adds a simple check for a null pointer and raises a ValueError when this happens. The error trace from the call to __getitem__ is not propagated since it is generally unhelpful and confusing. A unit test is added that verifies the appropriate error is raised in this situation.

Examples

seq = pd.Series([1.0, 2.0, 3.0])
torch.Tensor(seq)  # succeeds, since seq[0] is defined
torch.Tensor(seq[1:])  # segfault, since seq[0] generates a KeyError

df = pd.DataFrame(np.ones((2, 3)), columns=['a', 'b', 'c'])
torch.Tensor(df)  # segfault, since df[0] tries to access a column named 0

Notes

It would be better to be able to handle Pandas objects in general, or at least give a nicer error message (e.g. "did you mean torch.Tensor(df.values)?"), but that code would be specific to checking for Pandas objects.
I don't believe there's any surefire way to get the first element in the underlying sequence, which is what PySequence_GetItem(obj, 0) tries to do, but I could have missed it
I am new to the code here, so if there is a better way to handle the error, or if the unit test is not quite exhaustive, please let me know.

ezyang · 2018-05-15T19:40:52Z

@pytorchbot retest this please

yf225 · 2018-05-16T23:40:18Z

@pytorchbot retest this please

soumith · 2018-05-18T17:14:47Z

thank you @sethah

…n sequence shape cannot be determined (pytorch/pytorch#7583) pytorch/pytorch@32b23a4

…ed (pytorch#7583) * first commit * unit test * minor style edits

kigenchesire · 2023-06-18T05:15:57Z

This worked out for me. I was trying to convert a y_train and Y_val into a tensor.

train_labels = torch.tensor(y_train.to_numpy())
val_labels = torch.tensor(y_val.to_numpy())

sethah added 3 commits May 14, 2018 21:37

first commit

e9485b0

unit test

85d952f

minor style edits

291d988

sethah requested review from apaszke, colesbury, ezyang, gchanan, soumith and zdevito as code owners May 15, 2018 17:11

ezyang approved these changes May 15, 2018

View reviewed changes

soumith merged commit 32b23a4 into pytorch:master May 18, 2018

onnxbot added a commit to onnxbot/onnx-fb-universe that referenced this pull request May 18, 2018

[auto] Update pytorch to 32b23a4 - Throw error on tensor creation whe…

148a535

…n sequence shape cannot be determined (pytorch/pytorch#7583) pytorch/pytorch@32b23a4

weiyangfb pushed a commit to weiyangfb/pytorch that referenced this pull request Jun 11, 2018

Throw error on tensor creation when sequence shape cannot be determin…

217d653

…ed (pytorch#7583) * first commit * unit test * minor style edits

ezyang added the open source label Jun 24, 2019

NicoSlvd mentioned this pull request Apr 30, 2025

Graph classification on full nts fredshone/ntsx#8

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Throw error on tensor creation when sequence shape cannot be determined#7583

Throw error on tensor creation when sequence shape cannot be determined#7583
soumith merged 3 commits intopytorch:masterfrom
sethah:pandas_segfault

sethah commented May 15, 2018

Uh oh!

ezyang commented May 15, 2018

Uh oh!

yf225 commented May 16, 2018

Uh oh!

soumith commented May 18, 2018

Uh oh!

kigenchesire commented Jun 18, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

sethah commented May 15, 2018

Examples

Notes

Uh oh!

ezyang commented May 15, 2018

Uh oh!

yf225 commented May 16, 2018

Uh oh!

soumith commented May 18, 2018

Uh oh!

kigenchesire commented Jun 18, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants