
Add documentation page for pipeline parallelism. #50791

Closed
pritamdamania87 wants to merge 5 commits into gh/pritamdamania87/200/base from gh/pritamdamania87/200/head

Conversation

@pritamdamania87 (Contributor) commented Jan 20, 2021

Stack from ghstack:

Add a dedicated pipeline parallelism doc page explaining the APIs and
the overall value of the module.

Differential Revision: [D25967981](https://our.internmc.facebook.com/intern/diff/D25967981/)
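For readers new to the module, here is a minimal sketch of the API the new page documents, assuming two visible GPUs; the layer sizes, `chunks` value, and RPC port below are illustrative, not taken from the doc page itself:

```python
# Minimal sketch of torch.distributed.pipeline.sync.Pipe (assumes 2 GPUs).
# Layer sizes, chunks, and the RPC port are illustrative values.
import os
import torch
import torch.nn as nn
from torch.distributed import rpc
from torch.distributed.pipeline.sync import Pipe

# Pipe uses the RPC framework internally, so it must be initialized
# even for a single-process run.
os.environ["MASTER_ADDR"] = "localhost"
os.environ["MASTER_PORT"] = "29500"
rpc.init_rpc("worker", rank=0, world_size=1)

# Each child of the top-level nn.Sequential becomes one pipeline stage,
# placed on its own device.
stage0 = nn.Sequential(nn.Linear(16, 16), nn.ReLU()).cuda(0)
stage1 = nn.Sequential(nn.Linear(16, 4)).cuda(1)

# chunks=8 splits every mini-batch into 8 micro-batches so that both
# GPUs can work concurrently.
model = Pipe(nn.Sequential(stage0, stage1), chunks=8)

output_rref = model(torch.rand(64, 16).cuda(0))  # forward returns an RRef
output = output_rref.local_value()
```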
@facebook-github-bot added the `cla signed` and `oncall: distributed` labels Jan 20, 2021
pritamdamania87 pushed a commit that referenced this pull request Jan 20, 2021

pritamdamania87 pushed a commit that referenced this pull request Jan 20, 2021
@rohan-varma (Contributor) left a comment


LGTM. Are we planning to add additional documentation/tutorials that go into more detail on how to write an application with Pipe and combine it with DDP?

Comment thread on docs/source/pipeline.rst:
(vertical axis). The horizontal axis represents training this model through
time demonstrating that the GPUs are utilized much more efficiently.
However, there still exists a bubble (as demonstrated in the figure) where
certain GPUs are not utilized.
@rohan-varma (Contributor):
Would it be useful to give an approximation of the increase in utilization a user can expect when using Pipe? I guess this varies a lot across workloads, but maybe we could take an example workload?

@pritamdamania87 (Author):
I feel the two attached figures give the user a good idea of how GPUs are utilized more efficiently. It's probably better to illustrate the speedup of this technique in a benchmark/tutorial rather than in our docs, where we are mostly documenting the feature and its APIs.
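For rough intuition on the expected gain, a common back-of-the-envelope estimate from the GPipe paper (not from the doc page under review): with p pipeline stages and m micro-batches per mini-batch (the `chunks` argument), the fraction of time lost to the bubble is approximately

```latex
\text{bubble fraction} \approx \frac{p - 1}{m + p - 1}
```

For example, 4 stages with chunks=8 idle roughly 3/11 (about 27%) of the time, and raising `chunks` shrinks the bubble.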

@pritamdamania87 (Author) left a comment

Are we planning to add additional documentation/tutorials that go into more detail on how to write an application with Pipe and combine it with DDP?

Yes, I'm planning to write a tutorial for this. The idea I had was to take the Transformer example from https://pytorch.org/tutorials/beginner/transformer_tutorial.html, increase the model size (layers, hidden units, etc.) so that it doesn't fit on a single GPU, and show how the same model can be trained using pipeline parallelism.
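A rough sketch of that plan, assuming the same RPC/Pipe setup shown earlier; the layer counts and dimensions below are placeholders, not the tutorial's actual values:

```python
# Hypothetical sketch: an enlarged Transformer encoder split across two GPUs.
# d_model, nhead, and the layer counts are placeholders.
import torch
import torch.nn as nn
from torch.distributed.pipeline.sync import Pipe

def encoder_layers(n):
    return [nn.TransformerEncoderLayer(d_model=1024, nhead=16) for _ in range(n)]

# Put the first 12 encoder layers on cuda:0 and the remaining 12 on cuda:1,
# so a model too large for one GPU fits across two.
stage0 = nn.Sequential(*encoder_layers(12)).cuda(0)
stage1 = nn.Sequential(*encoder_layers(12)).cuda(1)

model = Pipe(nn.Sequential(stage0, stage1), chunks=8)

# Input shape is (seq_len, batch, d_model), the layer's default layout.
out = model(torch.rand(35, 20, 1024).cuda(0)).local_value()
```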


pritamdamania87 pushed a commit that referenced this pull request Jan 22, 2021

pritamdamania87 pushed a commit that referenced this pull request Jan 22, 2021

pritamdamania87 pushed a commit that referenced this pull request Jan 23, 2021
@mrshenli (Contributor) left a comment

Are we going to have tutorials and examples on Portal?

@pritamdamania87 (Author):

Are we going to have tutorials and examples on Portal?

What is Portal? I was planning to write tutorials and examples as we usually do; is there some new process around this?

@facebook-github-bot:

This pull request has been merged in 68c2185.

@mrshenli (Contributor):

What is Portal? I was planning to write tutorials and examples as we usually do; is there some new process around this?

This one; I guess this is a main feature in Pipe?
https://github.com/pytorch/pytorch/blob/master/torch/distributed/pipeline/sync/skip/portal.py
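For context, portal.py implements the cross-partition transport behind Pipe's skip-connection support. A minimal sketch of the user-facing API it backs, following the torchgpipe-derived names in torch.distributed.pipeline.sync.skip (the surrounding model is hypothetical):

```python
# Sketch of the skip-connection API whose cross-partition transport is
# handled by Portal objects. The model around it is hypothetical.
import torch.nn as nn
from torch.distributed.pipeline.sync.skip import pop, skippable, stash

@skippable(stash=["residual"])
class StashInput(nn.Module):
    def forward(self, x):
        yield stash("residual", x)  # save x for a later pipeline stage
        return x

@skippable(pop=["residual"])
class AddResidual(nn.Module):
    def forward(self, x):
        residual = yield pop("residual")  # retrieve the stashed tensor
        return x + residual

# Inside a Pipe'd nn.Sequential, StashInput and AddResidual can sit on
# different partitions; portals move the stashed tensor between devices.
blocks = nn.Sequential(StashInput(), nn.Linear(16, 16), AddResidual())
```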

@facebook-github-bot facebook-github-bot deleted the gh/pritamdamania87/200/head branch January 29, 2021 15:21
laurentdupin pushed a commit to laurentdupin/pytorch that referenced this pull request Apr 24, 2026
Summary:
Pull Request resolved: pytorch#50791

Add a dedicated pipeline parallelism doc page explaining the APIs and
the overall value of the module.
ghstack-source-id: 120257168

Test Plan:
1) View locally
2) waitforbuildbot

Reviewed By: rohan-varma

Differential Revision: D25967981

fbshipit-source-id: b607b788703173a5fa4e3526471140506171632b

Labels: cla signed, Merged, oncall: distributed

5 participants