Concatenate directly into shared memory when constructing batches #1323

Merged: soumith merged 1 commit into pytorch:master from colesbury:cat on Apr 22, 2017

Conversation

@colesbury
Member

This saves an extra memory copy, which speeds up data loading a bit
(5-10% with accimage).

As part of this change:

  • torch.cat accepts a keyword argument out
  • specifying out=None is treated like not specifying out
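
As an illustration, the new keyword might be used like this (a minimal sketch with arbitrary shapes, assuming out behaves as described above):

import torch

a = torch.randn(2, 4)
b = torch.randn(2, 4)

# Concatenate along dim 1 directly into a preallocated tensor,
# skipping the extra copy mentioned in the description.
out = torch.Tensor(2, 8)
torch.cat([a, b], 1, out=out)

# Passing out=None is the same as omitting out entirely.
result = torch.cat([a, b], 1, out=None)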

Contributor

@apaszke left a comment


LGTM 👍

out = None
if _use_shared_memory:
    # If we're in a background process, concatenate directly into a
    # shared memory tensor to avoid an extra copy
    numel = sum([x.numel() for x in batch])
    storage = batch[0].storage()._new_shared(numel)
    out = batch[0].new(storage)
return torch.stack(batch, 0, out=out)
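
The allocation via _new_shared is the key detail: it places the destination storage in shared memory, so when this branch runs in a DataLoader worker process the stacked batch can be handed to the main process without another copy.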


@soumith merged commit 24d92b5 into pytorch:master on Apr 22, 2017
@chenyuntc
Contributor

This seems to cause an error:

import torch as t
a=t.Tensor(2,4)
b=t.Tensor(2,4)
c=t.Tensor(2,8)
t.cat(a,b,out=c)
TypeError                                 Traceback (most recent call last)
<ipython-input-1-ce3d61609aec> in <module>()
      3 b=t.Tensor(2,4)
      4 c=t.Tensor(2,8)
----> 5 t.cat(a,b,out=c)

TypeError: cat received an invalid combination of arguments - got (torch.FloatTensor, torch.FloatTensor, out=torch.FloatTensor), but expected one of:
 * (sequence[torch.FloatTensor] seq)
 * (sequence[torch.FloatTensor] seq, int dim)

Also see https://discuss.pytorch.org/t/cat-got-an-unexpected-keyword-argument-out/2151
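
For what it's worth, the expected signatures in the traceback take a sequence of tensors, so a call of the following form matches the advertised interface (an illustrative sketch, not a confirmed fix for the reported bug):

import torch as t

a = t.Tensor(2, 4)
b = t.Tensor(2, 4)
c = t.Tensor(2, 8)

# cat expects a sequence; concatenating two (2, 4) tensors
# along dim 1 yields the (2, 8) shape of c.
t.cat([a, b], 1, out=c)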

@colesbury
Member Author

colesbury commented Apr 22, 2017 via email

bunelr added a commit to bunelr/pytorch that referenced this pull request May 13, 2017
soumith pushed a commit that referenced this pull request May 14, 2017
Jiaming-Liu pushed a commit to Jiaming-Liu/pytorch that referenced this pull request May 18, 2017
@boeddeker
Contributor

@colesbury Is there a reason why numpy arrays in default_collate do not use shared memory?
I would suggest changing

return torch.stack([torch.from_numpy(b) for b in batch], 0)

to

return default_collate([torch.from_numpy(b) for b in batch])
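
In context, the suggested change recurses so that numpy arrays reach the tensor branch of default_collate, which (since this PR) stacks into shared memory inside worker processes. A minimal sketch of the relevant branches; the real collate logic (shared-memory allocation, the other element types) is elided:

import numpy as np
import torch

def default_collate(batch):
    # Tensor branch: in a worker process this is where the
    # shared-memory stacking from this PR happens (simplified here).
    if torch.is_tensor(batch[0]):
        return torch.stack(batch, 0)
    # Proposed change: recurse so numpy arrays take the tensor
    # branch above instead of being stacked directly.
    if isinstance(batch[0], np.ndarray):
        return default_collate([torch.from_numpy(b) for b in batch])
    raise TypeError("unsupported batch element: %s" % type(batch[0]))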

@colesbury
Member Author

@boeddeker that seems fine. Can you send a PR with the change?

@boeddeker
Contributor

ok, I opened a PR

facebook-github-bot pushed a commit that referenced this pull request Dec 30, 2018
… numpy (#14534)

Summary:
Since #1323, tensors are shared via shared memory, but this feature is not active for numpy arrays.
This PR fixes that.
Pull Request resolved: #14534

Differential Revision: D13561649

Pulled By: soumith

fbshipit-source-id: b6bc9e99fb91e8b675c2ef131fba9fa11c1647c0
eqy pushed a commit to eqy/pytorch that referenced this pull request Jan 20, 2022
scotts added a commit to scotts/pytorch that referenced this pull request Mar 31, 2026
8b42d4c..185fe9c includes the following commits:

185fe9c Expose occupany limiting factors (pytorch#1330)
0c8ede0 remove the rocprofiler early exit hack (pytorch#1329)
4826a43 Remove duplicate test ignore (pytorch#1328)
37fada9 Ensure that async doesn't loop while sync is active (pytorch#1327)
628e1d0 Add host_name to OSS Kineto trace metadata via gethostname() (pytorch#1323)
9d7373b Revert D97166802 (pytorch#1326)
3a61657 Fix Lingering INT32 Overflow (pytorch#1324)
50a0085 Re-enabled some hardcoded tests (pytorch#1321)
e19dd92 Expose occupany limiting factors (pytorch#1322)

Authored with Claude.
laurentdupin pushed a commit to laurentdupin/pytorch that referenced this pull request Apr 24, 2026