support dynamism on add, mul #6443

Merged
lsy323 merged 11 commits into master from lsiyuan/aten-dynamism-test on Feb 9, 2024

Conversation

@lsy323 (Collaborator) commented on Feb 1, 2024

  • Add unbounded dynamism tests for some aten ops; these ops are used in the ViT model. Let's add more as we work on other models.

Unsupported ops

add
conv
gelu
native_layer_norm
select
slice
softmax
  • Also add dynamism support for add.

Before this change, unbounded dynamism could not propagate through add because the constant scalar operand was broadcast to the same static shape as the input tensor. During implicit broadcasting we then had ? + concrete_dim => concrete_dim, losing the dynamic dimension (see the sketch after this list).

  • Add some missing tests to the CI scripts.
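
As a sketch of what these tests look like, here is a minimal, hedged example assuming the usual torch.export + torch_xla.stablehlo flow (the module M and the assertion are illustrative, not copied from this PR):

# Illustrative only: export x + 1.0 with an unbounded batch dimension and
# check that the dynamic dimension survives the StableHLO lowering.
# Exporting with unbounded dims may additionally require an experimental
# torch_xla flag, depending on the version (assumption).
import torch
from torch.export import Dim, export
from torch_xla.stablehlo import exported_program_to_stablehlo


class M(torch.nn.Module):
    def forward(self, x):
        # Scalar addend: before this PR, the scalar was broadcast to the
        # static input shape, dropping the dynamic dimension.
        return x + 1.0


ep = export(M(), (torch.rand(4, 128),), dynamic_shapes=({0: Dim("batch")},))
shlo = exported_program_to_stablehlo(ep)
text = shlo.get_stablehlo_text('forward')
# With this fix the dynamic dimension propagates, so the StableHLO should
# carry tensor<?x128xf32> rather than a fully static tensor<4x128xf32>.
assert "?x128" in text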

cc @sdasgup3

@lsy323 lsy323 requested a review from qihqi on February 1, 2024 06:26
@lsy323 lsy323 changed the title from "add unbounded dynamism test for some aten ops" to "add unbounded dynamism test for some aten ops, support add" on Feb 6, 2024
@lsy323 lsy323 changed the title from "add unbounded dynamism test for some aten ops, support add" to "add unbounded dynamism test for some aten ops, support dynamism on add" on Feb 6, 2024
Siyuan Liu added 6 commits February 6, 2024 21:42
(cherry picked from commit f55abc88ae361e89da675a1aa1e4a19e7a5c762a)
(cherry picked from commit 30abe2be43defc25db8954c525d34f7f3de35292)
@lsy323 lsy323 force-pushed the lsiyuan/aten-dynamism-test branch from 3e4db72 to 92a6e00 on February 6, 2024 21:42
Siyuan Liu added 4 commits February 6, 2024 21:43
(cherry picked from commit 8526b2091ffafccf6972ecba3c111d1b0869621e)
@lsy323 (Collaborator, Author) commented on Feb 7, 2024

The HLO changed for the SPMD basic sharding test test_mark_sharding_ir. The graph becomes more concise after the updated lowering for add/mul: the old lowering expanded the scalar constant through a reshape/broadcast/reshape/broadcast chain, while the new lowering broadcasts the scalar directly to the target shape. The semantics and sharding annotations remain the same; only the HLO op names changed.
cc @yeounoh @alanwaketan

From

ENTRY %IrToHlo.17 (p0.9: f32[1,128], p1.11: f32[1,128]) -> (f32[1,128]) {
  %p1.11 = f32[1,128]{1,0} parameter(1), sharding={replicated}
  %p0.9 = f32[1,128]{1,0} parameter(0), sharding={replicated}
  %constant.4 = f32[] constant(1)
  %reshape.5 = f32[1,1]{1,0} reshape(f32[] %constant.4)
  %broadcast.6 = f32[1,1]{1,0} broadcast(f32[1,1]{1,0} %reshape.5), dimensions={0,1}
  %reshape.7 = f32[1]{0} reshape(f32[1,1]{1,0} %broadcast.6)
  %broadcast.8 = f32[1,128]{1,0} broadcast(f32[1]{0} %reshape.7), dimensions={0}
  %multiply.10 = f32[1,128]{1,0} multiply(f32[1,128]{1,0} %p0.9, f32[1,128]{1,0} %broadcast.8)
  %add.12 = f32[1,128]{1,0} add(f32[1,128]{1,0} %p1.11, f32[1,128]{1,0} %multiply.10)
  %custom-call.13 = f32[1,128]{1,0} custom-call(f32[1,128]{1,0} %add.12), custom_call_target="Sharding", sharding={replicated}
  %constant.2 = f32[] constant(0)
  %constant.1 = f32[] constant(1)
  %multiply.3 = f32[] multiply(f32[] %constant.2, f32[] %constant.1)
  %broadcast.14 = f32[1,128]{1,0} broadcast(f32[] %multiply.3), dimensions={}
  %add.15 = f32[1,128]{1,0} add(f32[1,128]{1,0} %custom-call.13, f32[1,128]{1,0} %broadcast.14)
  ROOT %tuple.16 = (f32[1,128]{1,0}) tuple(f32[1,128]{1,0} %add.15)
}

To

ENTRY %IrToHlo.14 (p0.5: f32[1,128], p1.8: f32[1,128]) -> (f32[1,128]) {
  %p1.8 = f32[1,128]{1,0} parameter(1), sharding={replicated}
  %p0.5 = f32[1,128]{1,0} parameter(0), sharding={replicated}
  %constant.4 = f32[] constant(1)
  %broadcast.6 = f32[1,128]{1,0} broadcast(f32[] %constant.4), dimensions={}
  %multiply.7 = f32[1,128]{1,0} multiply(f32[1,128]{1,0} %p0.5, f32[1,128]{1,0} %broadcast.6)
  %add.9 = f32[1,128]{1,0} add(f32[1,128]{1,0} %p1.8, f32[1,128]{1,0} %multiply.7)
  %custom-call.10 = f32[1,128]{1,0} custom-call(f32[1,128]{1,0} %add.9), custom_call_target="Sharding", sharding={replicated}
  %constant.2 = f32[] constant(0)
  %constant.1 = f32[] constant(1)
  %multiply.3 = f32[] multiply(f32[] %constant.2, f32[] %constant.1)
  %broadcast.11 = f32[1,128]{1,0} broadcast(f32[] %multiply.3), dimensions={}
  %add.12 = f32[1,128]{1,0} add(f32[1,128]{1,0} %custom-call.10, f32[1,128]{1,0} %broadcast.11)
  ROOT %tuple.13 = (f32[1,128]{1,0}) tuple(f32[1,128]{1,0} %add.12)
}
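
For anyone who wants to reproduce such a dump, a hedged sketch (the computation mirrors the multiply-by-one-then-add pattern above; torch_xla._XLAC._get_xla_tensors_hlo is the debug hook commonly used in pytorch/xla tests to print HLO text, but the SPMD mesh setup of the real test is omitted here):

import torch
import torch_xla
import torch_xla.core.xla_model as xm

device = xm.xla_device()
p0 = torch.rand(1, 128, device=device)
p1 = torch.rand(1, 128, device=device)
# y + x * 1: the scalar constant 1 is what the old lowering reshaped and
# broadcast in four steps and the new lowering broadcasts in a single step.
out = p1 + p0 * 1.0
print(torch_xla._XLAC._get_xla_tensors_hlo([out]))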

@lsy323 lsy323 merged commit 8d91ff5 into master Feb 9, 2024
@alanwaketan (Collaborator) commented

> HLO changed for spmd basic sharding test test_mark_sharding_ir. The graph becomes more concise after the lowering for add/mul is updated. [...]

Thanks for the heads up, and it LGTM.

amithrm pushed a commit to amithrm/xla that referenced this pull request Mar 1, 2024
@lsy323 lsy323 deleted the lsiyuan/aten-dynamism-test branch March 4, 2024 19:13
bhavya01 pushed a commit that referenced this pull request Apr 22, 2024
@lsy323 lsy323 changed the title from "add unbounded dynamism test for some aten ops, support dynamism on add" to "support dynamism on add, mul" on Aug 30, 2024
@miladm miladm added the dynamism (Dynamic Shape Features) label on Sep 17, 2024