Add aten::all_reduce with meta impl by wconstab · Pull Request #93109 · pytorch/pytorch

wconstab · 2023-01-26T23:57:54Z

Stack from ghstack (oldest at bottom):

[ghstack-poisoned]

pytorch-bot · 2023-01-26T23:57:57Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/93109


❌ 12 Failures

As of commit 1612fde:

NEW FAILURES - The following jobs have failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

wanchaol · 2023-01-27T00:14:25Z

aten/src/ATen/native/native_functions.yaml

+
+# Collectives
+# TODO: add reduce_op and add some form of ranks instead of processgroup obj
+- func: all_reduce(Tensor self, int group_id, str reduce_op) -> Tensor


Quick question: would this group_id be a single process group id shared across all ranks? For example, if we have two process groups and run all_reduce on both, would group_id differ between the two groups? If so, I feel this makes the op non-SPMD, and I'm wondering whether we should make it SPMD instead.

wconstab · 2023-01-28

Don't look at this part too closely. We've moved on to a new API proposal with list[rank], but I didn't bother to update this stack until we converged on the design.
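To make the schema under discussion concrete, here is a minimal pure-Python simulation of a functional all_reduce over an explicit list of ranks. It does not use PyTorch. The names `simulate_all_reduce` and `ranks`, and the set of accepted `reduce_op` strings, are assumptions made for illustration; they are not the API from this stack or from the later list[rank] proposal, whose details are not given in this thread.

```python
from functools import reduce
import operator

# Hypothetical reduce_op strings; the PR does not list the supported values.
_REDUCERS = {"sum": operator.add, "max": max, "min": min}


def simulate_all_reduce(per_rank, ranks, reduce_op="sum"):
    """Simulate a functional (non-mutating) all_reduce inside one process.

    per_rank maps rank -> list of numbers (that rank's local "tensor").
    ranks is the explicit group membership taking part in the collective.
    Returns a new dict rank -> reduced list; the inputs are left untouched.
    """
    combine = _REDUCERS[reduce_op]
    members = [per_rank[r] for r in ranks]
    if len({len(m) for m in members}) != 1:
        raise ValueError("all participating tensors must have the same length")
    # Reduce elementwise across the participating ranks.
    reduced = [reduce(combine, column) for column in zip(*members)]
    # Every member of the group receives its own copy of the same result.
    return {r: list(reduced) for r in ranks}


local = {0: [1, 2], 1: [10, 20], 2: [100, 200], 3: [1000, 2000]}
group_a = simulate_all_reduce(local, ranks=[0, 1])
group_b = simulate_all_reduce(local, ranks=[2, 3])
```

In this sketch the two groups reduce independently, and each call names its members directly instead of referring to an opaque group id.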


albanD · 2023-01-30T17:48:52Z

tools/autograd/derivatives.yaml

  result: auto_linear
+
+- name: all_reduce(Tensor self, int group_id, str reduce_op) -> Tensor
+  self: at::ones_like(self)


That sounds wrong!
This should be an all_scatter(), no?
If you don't want it to be differentiable right now, mark it as such with non_differentiable (see the doc at the top of this file for details).
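One way to see why a constant `ones_like` formula looks suspicious is that it ignores the incoming gradient. The sketch below is a pure-Python check under two stated assumptions: the reduction is a sum, and every rank's output feeds a single global loss. Under those assumptions the chain rule gives each rank's input gradient as the sum of the upstream gradients over all ranks. Which collective the backward should actually use is left open in this thread; all function names here are hypothetical.

```python
def all_reduce_sum(xs):
    """Single-process stand-in: every rank receives the sum over all ranks."""
    total = sum(xs)
    return [total for _ in xs]


def loss(xs, weights):
    """Global loss: rank r contributes weights[r] * y_r, with y = all_reduce_sum(x)."""
    ys = all_reduce_sum(xs)
    return sum(w * y for w, y in zip(weights, ys))


def numeric_grad(xs, weights, eps=1e-6):
    """Central finite differences of the loss with respect to each rank's input."""
    grads = []
    for k in range(len(xs)):
        up = list(xs)
        up[k] += eps
        down = list(xs)
        down[k] -= eps
        grads.append((loss(up, weights) - loss(down, weights)) / (2 * eps))
    return grads


xs = [1.0, 2.0, 3.0]            # one scalar input per rank
weights = [0.5, -1.0, 4.0]      # upstream gradient dL/dy_r on each rank

# Chain rule under the sum assumption: sum the upstream gradients across ranks.
analytic = all_reduce_sum(weights)
numeric = numeric_grad(xs, weights)
ones_guess = [1.0 for _ in xs]  # what a constant ones_like formula would yield
```

Here `analytic` and `numeric` agree, while `ones_guess` matches neither, because the true gradient depends on the upstream values.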


Add aten::all_reduce with meta impl

01ea86d

[ghstack-poisoned]

wconstab requested review from albanD and soulitzer as code owners January 26, 2023 23:57

This was referenced Jan 26, 2023

Eager impl for aten::all_reduce #93110

Closed

Inductor support for aten::all_reduce #93111

Closed

Test for aten::all_reduce #93112

Closed

allred.py demo script (not for land) #93113

Closed

wanchaol reviewed Jan 27, 2023


Update on "Add aten::all_reduce with meta impl"

cbbb7e6

[ghstack-poisoned]

wconstab mentioned this pull request Jan 28, 2023

Refactor dynamo distributed test helpers to be reusable #93187

Closed

wconstab requested review from a team, H-Huang, awgu, kwen2501, mrshenli, rohan-varma and zhaojuanmao as code owners January 28, 2023 00:15

Update on "Add aten::all_reduce with meta impl"

bb08755

[ghstack-poisoned]

albanD reviewed Jan 30, 2023


wconstab added 3 commits January 30, 2023 17:50

Update on "Add aten::all_reduce with meta impl"

6677e05

[ghstack-poisoned]

Update on "Add aten::all_reduce with meta impl"

dbc1a85

[ghstack-poisoned]

Update on "Add aten::all_reduce with meta impl"

02e39f7

[ghstack-poisoned]

This was referenced Feb 2, 2023

Refactor to allow reuse of SchedulerNode.allocate #93328

Closed

Mark buffers that reuse other buffers #93329

Closed

pytorch-bot bot added the release notes: releng release notes category label Feb 2, 2023

Update on "Add aten::all_reduce with meta impl"

5a253f2

[ghstack-poisoned]

wconstab mentioned this pull request Feb 2, 2023

Simplify scheduler allocate logic #93897

Closed

Update on "Add aten::all_reduce with meta impl"

1612fde

[ghstack-poisoned]

wconstab mentioned this pull request Feb 2, 2023

[abandoned] Meta kernel for allreduce_ #90024

Closed

wconstab closed this Feb 28, 2023

facebook-github-bot deleted the gh/wconstab/77/head branch June 8, 2023 19:18

wconstab wants to merge 8 commits into gh/wconstab/77/base from gh/wconstab/77/head

3 participants
