Complex Number support for distributed #45760
Open
Labels
module: bootcamp - We plan to do a full writeup on the issue, and then get someone to do it for onboarding
module: c10d - Issues/PRs related to collective communications and process groups
module: complex - Related to complex number support in PyTorch
oncall: distributed - Add this issue/PR to distributed oncall triage queue
pt_distributed_rampup - Ramp up tasks for new developers on PT distributed
triaged - This issue has been looked at a team member, and triaged and prioritized into an appropriate module
🚀 Feature
As per title, complex numbers should be supported in torch.distributed.
Motivation
Distributed computing support for complex numbers came up in conversations with people at Argonne National Laboratory and the Flatiron Institute. Currently, some of them use Uber's Horovod library for distributed computing. The operations they commonly use are all_reduce and broadcast.
Pitch
torch.view_as_complex can be used to convert the real tensor back to a complex tensor.
cc @ezyang @anjali411 @dylanbespalko @mruberry @pietern @mrshenli @pritamdamania87 @zhaojuanmao @satgera @rohan-varma @gqchen @aazzolini @xush6528 @osalpekar @jiayisuse @agolynski
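A minimal sketch of the round trip described in the pitch, under stated assumptions: the complex tensor is viewed as a real tensor with torch.view_as_real, a sum reduction runs on the real view, and torch.view_as_complex restores the complex dtype. The helper names all_reduce_complex, simulated_all_reduce_sum, and simulated_complex_all_reduce are hypothetical and not part of torch.distributed. The simulated helpers stand in for a real process group so the example runs in a single process. This only illustrates a sum reduction, where complex addition is componentwise; reductions such as product or max would not carry over to the real view this way.

```python
import torch
import torch.distributed as dist


def all_reduce_complex(tensor):
    """Hypothetical helper: sum-reduce a complex tensor across ranks, in place.

    Assumes a process group has already been initialized. torch.view_as_real
    returns a view sharing storage with `tensor`, so the in-place reduction
    on the real view also updates the complex tensor.
    """
    real_view = torch.view_as_real(tensor)  # shape (..., 2), real dtype
    dist.all_reduce(real_view)              # default reduction is SUM
    return tensor


def simulated_all_reduce_sum(per_rank):
    """Hypothetical single-process stand-in for a sum all_reduce:
    every 'rank' receives the elementwise sum of all inputs."""
    total = torch.stack(per_rank).sum(dim=0)
    return [total.clone() for _ in per_rank]


def simulated_complex_all_reduce(per_rank_complex):
    # 1. View each complex tensor as real: (..., 2) holding (real, imag).
    real_views = [torch.view_as_real(t) for t in per_rank_complex]
    # 2. Run the (simulated) collective on the real tensors.
    reduced = simulated_all_reduce_sum(real_views)
    # 3. Convert each real result back to a complex tensor.
    return [torch.view_as_complex(r) for r in reduced]


a = torch.tensor([1 + 2j, 3 - 1j])     # data on simulated rank 0
b = torch.tensor([0.5 - 1j, -3 + 4j])  # data on simulated rank 1
out = simulated_complex_all_reduce([a, b])
```

In this sketch every simulated rank ends up with the same complex tensor, equal to a + b. With an initialized process group, all_reduce_complex would be called on each rank's tensor instead of the simulated helpers.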