[perf][OSS] tensor views for bucketing #300
Conversation
@msbaines I'm not sure how that works for torch 1.5 and 1.6; I don't think that we're actually testing those nowadays (since the pip default change)
I've just tested: this part (tensor views) is fine, but a previous PR (#297) was not in fact PyTorch 1.5-compatible (cc @pritamdamania87), and CI did not catch that.
min-xu-ai
left a comment
Really like the simplification and performance/memory gain!
torch.distributed group (default: group.WORLD)
broadcast_buffer_size (int):
-    the size of the buffer used to batch the small parameter tensors (default 128k).
+    the max size of the buffer used to batch the small parameter tensors, in number of elements (default 16M).
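Since the docstring change above switches the unit from bytes to number of elements, the actual byte footprint of the buffer depends on the parameter dtype. A quick hypothetical illustration (not fairscale code; the constant name is made up):

```python
# The default max bucket size is now expressed in elements, not bytes.
DEFAULT_BUFFER_ELEMS = 16 * 2**20  # 16M elements

# Byte footprint depends on the element dtype:
bytes_fp32 = DEFAULT_BUFFER_ELEMS * 4  # fp32: 4 bytes/element -> 64 MiB
bytes_fp16 = DEFAULT_BUFFER_ELEMS * 2  # fp16: 2 bytes/element -> 32 MiB

print(bytes_fp32 // 2**20, "MiB fp32,", bytes_fp16 // 2**20, "MiB fp16")
```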
Could you talk a bit about the impact on per-GPU memory usage here?
The beauty of this PR is that the impact on GPU memory usage is now zero :) The small parameters are just views of a bigger buffer, so the bucketing takes no extra space (on top of taking no extra CPU cycles).
I just realized that I should resize it to match the last bucketed parameter though; right now the remainder is unused and does increase the GPU memory a tiny bit.
Hmm, the test jobs do show slightly larger memory consumption, looking into that.
OK, that makes sense given the initial extra memory allocation, before the parameters have been folded into the buffer. I've updated the docstring to reflect that.
Before submitting
What does this PR do?
Speeds things up by a couple of percent, removes a lot of code, and saves some memory (buckets are free).
PR review
Anyone in the community is free to review the PR once the tests have passed.
If we didn't discuss your PR in GitHub issues, there's a high chance it will not be merged.
Did you have fun?
Make sure you had fun coding 🙃