Change THTensor::size into a std::vector<int64_t> by ezyang · Pull Request #9518 · pytorch/pytorch

ezyang · 2018-07-18T01:59:58Z

This patch was very carefully constructed to avoid having to modify
too many files; there are some obvious follow ups which I will
be hitting later.

I didn't do stride. But the change for stride should look
very similar. I did stride.
I did NOT rename the field in question, so that direct
accesses of the form foo->size[n] keep working. I intend
to do a codemod to fix all of these shortly.
Anywhere a "public" API function made use of a int64_t*
of sizes, I opted to just finagle it out of the tensor using
THTensor_getSizePtr rather than try to rewrite all of these
sites to use ArrayRef. They should use ArrayRef eventually,
but not yet.
_THSizeDesc got an overload that understands ArrayRef (which
a vector size is convertible to). Eventually we should get
rid of all of these functions (because ArrayRef is printable
via the AT_ERROR macros), but not today.
I ran into something very subtle in the implementation of sizes()
for TensorDerived: I MUST use the dim as per Tensor::dim() (which
correctly is zero for scalars), otherwise I'll give a nonsense
sizes(). We can fix this eventually once Scalar is turned on
internally.
I added two new functions THTensor_resizeSize and THTensor_setSize.
Maybe these are eventually worth deifying as methods in the Tensor class, but
for now I'm keeping them out-of-line just in case.

Signed-off-by: Edward Z. Yang ezyang@fb.com

This patch was very carefully constructed to avoid having to modify too many files; there are some obvious follow ups which I will be hitting later. - I didn't do stride. But the change for stride should look very similar. - I did NOT rename the field in question, so that direct accesses of the form foo->size[n] keep working. I intend to do a codemod to fix all of these shortly. - Anywhere a "public" API function made use of a int64_t* of sizes, I opted to just finagle it out of the tensor using THTensor_getSizePtr rather than try to rewrite all of these sites to use ArrayRef. They should use ArrayRef eventually, but not yet. - _THSizeDesc got an overload that understands ArrayRef (which a vector size is convertible to). Eventually we should get rid of all of these functions (because ArrayRef is printable via the AT_ERROR macros), but not today. - I ran into something very subtle in the implementation of sizes() for TensorDerived: I MUST use the dim as per Tensor::dim() (which correctly is zero for scalars), otherwise I'll give a nonsense sizes(). We can fix this eventually once Scalar is turned on internally. - I added two new functions THTensor_resizeSize and THTensor_setSize. Maybe these are eventually worth deifying as methods in the Tensor class, but for now I'm keeping them out-of-line just in case. Signed-off-by: Edward Z. Yang <ezyang@fb.com>

Signed-off-by: Edward Z. Yang <ezyang@fb.com>

facebook-github-bot

@ezyang has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

aten/src/TH/THGeneral.cpp

 }

-TH_API THDescBuff _THSizeDesc(const int64_t *size, const int64_t ndim) {
+THDescBuff _THSizeDesc(at::ArrayRef<int64_t> size, const int64_t ndim) {


aten/src/TH/THGeneral.h.in

+#ifdef __cplusplus
+// Mangled so we can have an overload
+AT_API THDescBuff _THSizeDesc(const int64_t *size, const int64_t ndim);
+AT_API THDescBuff _THSizeDesc(at::ArrayRef<int64_t> size, const int64_t ndim);


aten/src/THC/generic/THCTensor.cpp


-  newSize = (int64_t*)THAlloc(sizeof(int64_t)*(self->dim()+1));
-  newStride = (int64_t*)THAlloc(sizeof(int64_t)*(self->dim()+1));
+  std::vector<int64_t> newSize(/* size */ self->dim() + 1);


aten/src/THC/generic/THCTensor.cpp

  THLongStorage *inferred_size = THLongStorage_newInferSize(size, numel);
-  auto stride = THTensor_compute_stride(at::IntList(tensor->size, tensor->dim()),
-                                        at::IntList(tensor->stride, tensor->dim()),
+  auto stride = THTensor_compute_stride(at::IntList(THTensor_getSizePtr(tensor), tensor->dim()),


Signed-off-by: Edward Z. Yang <ezyang@fb.com>

ezyang · 2018-07-18T18:29:30Z

Indeed, the inconsistency was because of some code that was doing tensor->dim_--

facebook-github-bot

@ezyang has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

ezyang · 2018-07-18T20:42:21Z

Abandoning this PR in favor of #9561

…h std::vector (#9561) Summary: * THTensor now stores `sizes_` and `strides_` which is a `std::vector<int64_t>` * Anywhere a "public" API function made use of a int64_t* of sizes, I opted to just finagle it out of the tensor using THTensor_getSizePtr rather than try to rewrite all of these sites to use ArrayRef. They should use ArrayRef eventually, but not yet. * There are new utility functions for resizing sizes/strides in one go (THTensor_resizeDim), or replacing sizes and strides with completely new values (THTensor_setSizesAndStrides) * Anywhere you said `t->size[n] = 0`, we now say `THTensor_setSizeAt(t, n, 0)`, ditto for strides * Anywhere you said `t->size[n]`, we now say `t->size(n)` (coming soon: ditto for strides) Previous review of just the `std::vector` change in #9518, but I'm planning to merge this all in one go. Note for gchanan: review from commit "ci" and after Pull Request resolved: #9561 Reviewed By: cpuhrsch Differential Revision: D8901926 Pulled By: ezyang fbshipit-source-id: 483cf275060ab0a13845cba1ece39dd127142510

…h std::vector (pytorch#9561) Summary: * THTensor now stores `sizes_` and `strides_` which is a `std::vector<int64_t>` * Anywhere a "public" API function made use of a int64_t* of sizes, I opted to just finagle it out of the tensor using THTensor_getSizePtr rather than try to rewrite all of these sites to use ArrayRef. They should use ArrayRef eventually, but not yet. * There are new utility functions for resizing sizes/strides in one go (THTensor_resizeDim), or replacing sizes and strides with completely new values (THTensor_setSizesAndStrides) * Anywhere you said `t->size[n] = 0`, we now say `THTensor_setSizeAt(t, n, 0)`, ditto for strides * Anywhere you said `t->size[n]`, we now say `t->size(n)` (coming soon: ditto for strides) Previous review of just the `std::vector` change in pytorch#9518, but I'm planning to merge this all in one go. Note for gchanan: review from commit "ci" and after Pull Request resolved: pytorch#9561 Reviewed By: cpuhrsch Differential Revision: D8901926 Pulled By: ezyang fbshipit-source-id: 483cf275060ab0a13845cba1ece39dd127142510

ezyang requested review from apaszke, colesbury, gchanan, soumith and zdevito as code owners July 18, 2018 01:59

ezyang added 4 commits July 17, 2018 19:44

Apparently dim doesn't match size ugh

56329cc

Signed-off-by: Edward Z. Yang <ezyang@fb.com>

Convert stride to std::vector

a797588

Signed-off-by: Edward Z. Yang <ezyang@fb.com>

MAGMAAAA

798b7e8

Signed-off-by: Edward Z. Yang <ezyang@fb.com>

ezyang force-pushed the pr/thtensor-size-vector branch from cce0747 to 798b7e8 Compare July 18, 2018 02:44

facebook-github-bot reviewed Jul 18, 2018

View reviewed changes

gchanan reviewed Jul 18, 2018

View reviewed changes

ezyang added 4 commits July 18, 2018 08:03

Get rid of THSizeDesc overload.

275fa90

Signed-off-by: Edward Z. Yang <ezyang@fb.com>

bugfix

c7bcd65

Signed-off-by: Edward Z. Yang <ezyang@fb.com>

ci

f885f8a

Signed-off-by: Edward Z. Yang <ezyang@fb.com>

Grab some more dim_ increment/decrement

d8ca8a8

Signed-off-by: Edward Z. Yang <ezyang@fb.com>

facebook-github-bot reviewed Jul 18, 2018

View reviewed changes

ezyang mentioned this pull request Jul 18, 2018

Eliminate direct access to size/strides of THTensor; replace them with std::vector #9561

Closed

ezyang closed this Jul 18, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Change THTensor::size into a std::vector<int64_t>#9518

Change THTensor::size into a std::vector<int64_t>#9518
ezyang wants to merge 8 commits intopytorch:masterfrom
ezyang:pr/thtensor-size-vector

ezyang commented Jul 18, 2018 •

edited

Loading

Uh oh!

facebook-github-bot left a comment

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

ezyang commented Jul 18, 2018

Uh oh!

facebook-github-bot left a comment

Uh oh!

ezyang commented Jul 18, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

ezyang commented Jul 18, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

facebook-github-bot left a comment

Choose a reason for hiding this comment

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

This comment was marked as off-topic.

Uh oh!

ezyang commented Jul 18, 2018

Uh oh!

facebook-github-bot left a comment

Choose a reason for hiding this comment

Uh oh!

ezyang commented Jul 18, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ezyang commented Jul 18, 2018 •

edited

Loading