[Pallas] Make repeat_with_fixed_output_size not OOM on VMEM by alanwaketan · Pull Request #7145 · pytorch/xla

alanwaketan · 2024-05-29T21:47:30Z

Summary:
https://openxla.org/xla/operation_semantics#reducewindow doesn't support int64. Let's make sure input to cumsum is always int32.

Test Plan:
python test/test_gmm.py
python test/test_operations.py

alanwaketan · 2024-05-29T21:57:59Z

Thanks Jack for approving.

alanwaketan · 2024-05-30T01:02:29Z

Let me just remove the cumsum thing...

alanwaketan · 2024-05-30T02:29:12Z

Skip GPU tests to move fast.

alanwaketan added 2 commits May 29, 2024 21:39

initial commit

d428dea

fix linters

faa9f09

alanwaketan requested review from JackCaoG, miladm and wonjoo-wj May 29, 2024 21:47

JackCaoG approved these changes May 29, 2024

View reviewed changes

alanwaketan added the tpuci label May 29, 2024

Add a check

b6e9e8e

JackCaoG approved these changes May 29, 2024

View reviewed changes

Separate cumsum out

4e70838

alanwaketan merged commit ce1205e into master May 30, 2024

alanwaketan deleted the alanwaketan/tgmm2 branch May 30, 2024 02:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Pallas] Make repeat_with_fixed_output_size not OOM on VMEM#7145

[Pallas] Make repeat_with_fixed_output_size not OOM on VMEM#7145
alanwaketan merged 4 commits intomasterfrom
alanwaketan/tgmm2

alanwaketan commented May 29, 2024

Uh oh!

alanwaketan commented May 29, 2024

Uh oh!

alanwaketan commented May 30, 2024

Uh oh!

alanwaketan commented May 30, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

alanwaketan commented May 29, 2024

Uh oh!

alanwaketan commented May 29, 2024

Uh oh!

alanwaketan commented May 30, 2024

Uh oh!

alanwaketan commented May 30, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants