addmm: Reduce constant time overhead #41374
peterbell10 wants to merge 1 commit into pytorch:master
Conversation
💊 CI failures summary and remediations (Dr. CI): As of commit ef2e973 — 💚 Looks good so far! There are no failures yet. 💚 (Automated comment generated by Dr. CI.)
Force-pushed from f6ba74c to ef2e973.
ngimel left a comment:
Awesome, thanks for fixing!
```diff
- result.resize_({ self.size(0), mat2.size(1) });
+ TORCH_CHECK(self.dim() == 2, "self must be a matrix");
+ TORCH_CHECK(mat2.dim() == 2, "mat2 must be a matrix");
+ native::resize_(result, {self.sizes()[0], mat2.sizes()[1]});
```
nit: since you are resizing `result` in `addmm_cpu_impl`, do you need this resize here?
`result` is also the `self` tensor in `addmm`, which must at least broadcast to the correct dimensions.
Ugggh. Is this because `size(n)` is doing checked index access and `[n]` is doing unchecked indexing? Another reason we need to devirtualize `size`, so that the compiler can optimize away the bounds test if you check `dim` earlier...
In addition to checked access, `Tensor.size(n)` also maybe-wraps `n` to support negative indices, for additional overhead.
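To make the extra work concrete, here is a simplified sketch of the difference (illustrative only — `TensorSketch` is a hypothetical stand-in, not the actual ATen implementation):

```cpp
// Illustrative sketch only; TensorSketch is hypothetical and does not
// reproduce the real ATen code paths.
#include <cstdint>
#include <stdexcept>
#include <vector>

struct TensorSketch {
  std::vector<int64_t> sizes_;  // per-dimension sizes

  // Analogous to Tensor::sizes()[n]: raw, unchecked indexing.
  const std::vector<int64_t>& sizes() const { return sizes_; }

  // Analogous to Tensor::size(n): wrap negative dims (maybe_wrap_dim-style),
  // then bounds-check before indexing.
  int64_t size(int64_t dim) const {
    const auto ndim = static_cast<int64_t>(sizes_.size());
    if (dim < 0) {
      dim += ndim;  // allow size(-1) etc.
    }
    if (dim < 0 || dim >= ndim) {
      throw std::out_of_range("dimension out of range");
    }
    return sizes_[dim];
  }
};
```

Per call the difference is tiny, but addmm queries sizes and strides many times, so after an explicit `TORCH_CHECK(self.dim() == 2, ...)` the unchecked `sizes()[n]` form is safe and avoids paying for the wrap-and-check repeatedly.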
facebook-github-bot left a comment:
@ngimel has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
Summary: Fixes the overhead reported by ngimel in pytorch#40927 (comment). As it turns out, `Tensor.size(n)` has more overhead than `Tensor.sizes()[n]`. Since addmm does a lot of introspection of the input matrix sizes and strides, this added up to a noticeable (~1 us) constant time overhead. With this change, a 1x1 matmul takes 2.85 us on my machine, compared to 2.90 us on PyTorch 1.5.

Pull Request resolved: pytorch#41374
Reviewed By: ailzhang
Differential Revision: D22519924
Pulled By: ngimel
fbshipit-source-id: b29504bee7de79ce42e5e50f91523dde42b073b7
Fixes the overhead reported by @ngimel in #40927 (comment).

As it turns out, `Tensor.size(n)` has more overhead than `Tensor.sizes()[n]`. Since addmm does a lot of introspection of the input matrix sizes and strides, this added up to a noticeable (~1 us) constant time overhead. With this change, a 1x1 matmul takes 2.85 us on my machine, compared to 2.90 us on PyTorch 1.5.
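For context, a microbenchmark along these lines can reproduce this kind of measurement. The sketch below is hypothetical (not the harness actually used for the numbers above) and assumes a libtorch/ATen build:

```cpp
// Hypothetical micro-benchmark sketch for small-matmul constant overhead.
// Assumes a libtorch/ATen build; this is not the harness behind the
// 2.85 us / 2.90 us figures quoted above.
#include <ATen/ATen.h>
#include <chrono>
#include <iostream>

int main() {
  at::Tensor a = at::randn({1, 1});
  at::Tensor b = at::randn({1, 1});

  // Warm up so one-time allocation/dispatch costs don't skew the timing.
  for (int i = 0; i < 1000; ++i) {
    at::mm(a, b);
  }

  constexpr int kIters = 100000;
  const auto start = std::chrono::steady_clock::now();
  for (int i = 0; i < kIters; ++i) {
    at::mm(a, b);
  }
  const auto end = std::chrono::steady_clock::now();

  const double total_ns =
      std::chrono::duration_cast<std::chrono::nanoseconds>(end - start).count();
  std::cout << "1x1 mm: " << total_ns / kIters / 1000.0 << " us/iter\n";
  return 0;
}
```

At this matrix size the BLAS work is negligible, so almost all of the measured time is dispatch and shape/stride introspection — exactly the constant overhead this PR trims.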