[SPMD][DTensor] introduce xla_distribute_module for DTensor integration#6683

Merged
yeounoh merged 1 commit into master from xla_distribute_module
Mar 7, 2024

Conversation

@yeounoh
Contributor

@yeounoh yeounoh commented Mar 7, 2024

This is to support pytorch/pytorch#92909
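
For context, `xla_distribute_module` mirrors DTensor's `distribute_module` API on XLA. A minimal usage sketch, assuming the `torch_xla.distributed.spmd` import path and a DTensor-style `partition_fn(name, module, device_mesh)` signature (the exact names and return value here may differ from the merged code):

```python
import torch.nn as nn
import torch_xla.core.xla_model as xm
import torch_xla.runtime as xr
from torch_xla.distributed.spmd import Mesh, mark_sharding, xla_distribute_module

xr.use_spmd()  # enable the XLA SPMD execution mode

# Build a 1-D device mesh over all addressable devices.
num_devices = xr.global_runtime_device_count()
mesh = Mesh(list(range(num_devices)), (num_devices,), ('data',))

def shard_params(name, module, device_mesh):
  # Illustrative partition_fn: shard each Linear weight along the
  # 'data' mesh axis and leave everything else replicated.
  if isinstance(module, nn.Linear):
    mark_sharding(module.weight, device_mesh, ('data', None))

model = nn.Linear(16, 16).to(xm.xla_device())
model = xla_distribute_module(model, mesh, partition_fn=shard_params)
```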

@yeounoh yeounoh added the distributed SPMD and other distributed things. label Mar 7, 2024
@yeounoh yeounoh self-assigned this Mar 7, 2024
@yeounoh yeounoh force-pushed the xla_distribute_module branch from c88e189 to a157894 on March 7, 2024 00:39
@yeounoh
Contributor Author

yeounoh commented Mar 7, 2024

This needs to land for the experimental release of the auto-sharding API (#6322).

@yeounoh yeounoh requested review from alanwaketan and wanchaol March 7, 2024 00:41
@yeounoh yeounoh force-pushed the xla_distribute_module branch 2 times, most recently from 26fc3a8 to 30850e1, on March 7, 2024 07:11
@yeounoh
Contributor Author

yeounoh commented Mar 7, 2024

cc @baoleai for visibility

@yeounoh
Contributor Author

yeounoh commented Mar 7, 2024

CI turned green, and it looks good locally on both TPU and CPU:

python test/spmd/test_dtensor_integration.py
...
----------------------------------------------------------------------
Ran 3 tests in 3.740s

OK

Collaborator

@alanwaketan alanwaketan left a comment

LGTM, but how does that work with auto_sharding? You still shard the inputs in the test case.

@yeounoh
Copy link
Copy Markdown
Contributor Author

yeounoh commented Mar 7, 2024

> LGTM, but how does that work with auto_sharding? You still shard the inputs in the test case.

Good question. We were thinking about introducing a pre-defined partition_fn for auto-sharding, e.g., torch_xla.distributed.auto_sharding_policy (subject to change). It would just be calling use_spmd(auto=True), though.
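
A minimal sketch of that idea, assuming the provisional `auto_sharding_policy` name above (explicitly subject to change) and the `use_spmd(auto=True)` switch proposed in #6322:

```python
import torch_xla.runtime as xr

def auto_sharding_policy(name, module, device_mesh):
  # Hypothetical pre-defined partition_fn: rather than annotating any
  # parameters itself, it enables XLA's auto-sharding pass and lets
  # the compiler decide how to partition the module.
  del name, module, device_mesh  # no manual sharding annotations
  xr.use_spmd(auto=True)
```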

@yeounoh yeounoh force-pushed the xla_distribute_module branch from 4caa123 to e659a76 on March 7, 2024 18:31
@yeounoh yeounoh merged commit b6b9c6d into master Mar 7, 2024

Labels

distributed SPMD and other distributed things.

2 participants