[jit] Remove torch.save-related logic from pickler #25502
zdevito left a comment:
The changes that move `torch::save` out are good. I do not think we should be removing tensor references at the same time. They are the most appropriate way for the RPC world to deal with tensor data. In particular, if the tensors are on GPUs there may be a more efficient pathway to copy them, but the logic inside the pickler is going to force a copy to the CPU and a copy back. It would be best to split the `torch::save` changes from the tensor-reference changes and revisit how the pickler should work for RPC.
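The tensor-reference pathway described above is conceptually similar to Python pickle's persistent-ID protocol: large payloads stay out of the pickle stream, and only a handle is inlined, so the data can travel by whatever transport is most efficient. A minimal stdlib sketch of that idea (the `RefPickler` class and the `bytearray` payloads standing in for tensor storage are illustrative, not PyTorch APIs):

```python
import io
import pickle

class RefPickler(pickle.Pickler):
    """Pickle bytearrays by reference: append them to an external
    table and inline only an integer index in the binary."""
    def __init__(self, file, table):
        super().__init__(file)
        self.table = table

    def persistent_id(self, obj):
        if isinstance(obj, bytearray):  # stand-in for tensor storage
            self.table.append(obj)
            return len(self.table) - 1  # a reference, not the data
        return None  # everything else is pickled inline

class RefUnpickler(pickle.Unpickler):
    """Resolve the integer references back through the same table."""
    def __init__(self, file, table):
        super().__init__(file)
        self.table = table

    def persistent_load(self, pid):
        return self.table[pid]

payload = bytearray(b"x" * 1024)  # "tensor" data we don't want copied
table = []
buf = io.BytesIO()
RefPickler(buf, table).dump({"weights": payload})

# The pickle binary stays small; the bulk data lives in the table
# and can be shipped separately (e.g. over an RPC-specific channel).
assert len(buf.getvalue()) < 100
restored = RefUnpickler(io.BytesIO(buf.getvalue()), table).load()
assert restored["weights"] is payload
```

The key property is that the pickler never touches the payload bytes at all, which is why forcing a CPU round-trip inside the pickler would defeat the purpose of the reference table.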
```cpp
archive.save_to(std::forward<SaveToArgs>(args)...);
}

TORCH_API std::vector<char> save(const torch::IValue& ivalue);
```
This is the C++ Module API. I am not sure this is the right place to put the save function. It might be more appropriately defined where torch::load for modules is defined.
+1, this API doesn't seem to belong here: it returns `std::vector<char>`, while all the other `torch::save` APIs return nothing and are expected to be used as `torch::save(value, filepath)`. I'd suggest naming this function `pickle_save` or similar, so that C++ API end users won't be confused.
By the way, do we have this function in the Python API? Or is it an implementation detail that would only be used by the JIT?
The function (renamed to `pickle_save`) should produce the same output as `torch.save()` in Python, so there should be a similar API here too. A rename is fine given the clash with the existing `torch::save`/`torch::load`. It's intended to be used as an API function, e.g. #25591.
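To illustrate the API-shape concern behind the rename: the existing `torch::save` overloads write to a destination and return nothing, while the proposed function hands the serialized buffer back to the caller. A rough stdlib sketch (plain `pickle` stands in for the real serialization; `pickle_save` here is just the proposed name, not an existing API):

```python
import os
import pickle
import tempfile

def pickle_save(value):
    """Return the serialized buffer, like the proposed torch::pickle_save."""
    return pickle.dumps(value)

def save(value, filepath):
    """Write to a destination and return nothing, like torch::save(value, filepath)."""
    with open(filepath, "wb") as f:
        f.write(pickle.dumps(value))

# pickle_save hands the caller the bytes directly ...
blob = pickle_save([1, 2, 3])
assert pickle.loads(blob) == [1, 2, 3]

# ... while save() takes a destination and has no return value.
with tempfile.TemporaryDirectory() as d:
    path = os.path.join(d, "value.pt")
    assert save([1, 2, 3], path) is None
    with open(path, "rb") as f:
        assert pickle.loads(f.read()) == [1, 2, 3]
```

Giving the bytes-returning variant a distinct name keeps the two call shapes from being confused under one overload set.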
Thanks for flagging, @zdevito. I agree it would be great if the copy to CPU is done as part of …
What I don't yet fully get here is the tradeoff between storing a tensor instead of a storage object in …
Also cc @lerks since we had a discussion about this.
This reverts commit 7a0dade.
@driazati Looking at the recent changes, do you now plan to indeed keep the tensor table around and always serialize their metadata (since …)?
If so, the comment in the summary should be updated.
Summary: The Pickler previously had a distinction between tensors that would be inlined in one pickle binary (matching the format of `torch.save()`) and tensors that are saved elsewhere with only a reference stored in the binary. This PR moves that distinction out to `torch::pickle_save` to match the eager Python interface. The change can be seen in `register_prim_ops.cpp`, where the call to `jit::pickle` is now `torch::pickle_save`.

Differential Revision: [D17175215](https://our.intern.facebook.com/intern/diff/17175215/)

Pull Request resolved: pytorch#25502

Pulled By: driazati

fbshipit-source-id: 8c9a21327cc79eaf6a0e488ea99e305be52f82b1