Do not cache input args in dynamo bridge by lsy323 · Pull Request #6553 · pytorch/xla

lsy323 · 2024-02-16T17:40:16Z

Do not cache FX graph args in GraphInputMatcher. Caching those tensors is unnecessary because the user inputs are supplied on every run.

Without this change, the cached input tensors consume additional device memory each time a new graph is cached.

xla_args holds the user inputs passed down from Dynamo. When each GraphInputMatcher is constructed, the cached tensors are released based on the tensor_id of the entries in xla_args.
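
To illustrate the idea described above, here is a minimal sketch in plain Python. It is not the torch_xla implementation: the class name `InputMatcherSketch`, its parameters, and the use of integers and strings in place of tensor ids and device tensors are all hypothetical, chosen only to show how slots that correspond to user inputs can be left uncached and filled from the call arguments on each run.

```python
from typing import Any, List, Optional, Sequence


class InputMatcherSketch:
    """Hypothetical, simplified stand-in for the matcher described in this PR."""

    def __init__(
        self,
        graph_input_ids: Sequence[int],
        graph_input_values: Sequence[Any],
        user_arg_ids: Sequence[int],
    ) -> None:
        # Map each user-argument id to its position in the call arguments.
        arg_pos = {tid: i for i, tid in enumerate(user_arg_ids)}
        # For every graph input, remember which user argument feeds it (if any).
        self.arg_index: List[Optional[int]] = [
            arg_pos.get(tid) for tid in graph_input_ids
        ]
        # Keep a cached value only for inputs that are NOT user arguments.
        # User-argument slots hold None, so the matcher does not keep them alive.
        self.cached: List[Any] = [
            None if idx is not None else value
            for idx, value in zip(self.arg_index, graph_input_values)
        ]

    def __call__(self, args: Sequence[Any]) -> List[Any]:
        # Fill user-argument slots from this run's args; reuse the cache otherwise.
        return [
            args[idx] if idx is not None else value
            for idx, value in zip(self.arg_index, self.cached)
        ]


# Assumed example: ids 10 and 12 stand for parameters, id 11 for a user input.
matcher = InputMatcherSketch(
    graph_input_ids=[10, 11, 12],
    graph_input_values=["weight", "x_traced", "bias"],
    user_arg_ids=[11],
)
graph_inputs = matcher(["x_new"])
```

In this sketch the value seen at trace time for the user input (`"x_traced"`) is dropped at construction, which mirrors the memory argument made in the description: only non-argument inputs stay referenced by the cached matcher.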

lsy323 · 2024-02-21T00:49:03Z

Hi @wonjoolee95 @JackCaoG, this PR is ready for review. The CI failed because of a stale GCP token. I ran the dynamo tests locally and they passed.

lsy323 added 5 commits February 16, 2024 17:39

do not cache input args

d31ab09

only get tensor id for tensor input

5a61615

remove analyze buffer script

83e776b

refactor

f798cd4

fix GraphInputMatcher usage in test, add assertion

8b7afd8

lsy323 requested review from JackCaoG and wonjoo-wj February 21, 2024 00:48

lsy323 marked this pull request as ready for review February 21, 2024 00:48

JackCaoG reviewed Feb 21, 2024

Comment thread test/dynamo/test_num_output.py

JackCaoG approved these changes Feb 21, 2024

wonjoo-wj approved these changes Feb 21, 2024

add graph input matcher test

3b4c7de

JackCaoG reviewed Feb 21, 2024

Comment thread test/dynamo/test_graph_input_matcher.py

JackCaoG approved these changes Feb 21, 2024

JackCaoG merged commit 6aeab30 into master Feb 21, 2024

miladm assigned lsy323 Feb 22, 2024

miladm added the dynamo label Feb 22, 2024

lsy323 deleted the lsiyuan/dynamo-cache-no-input branch February 22, 2024 19:30

amithrm pushed a commit to amithrm/xla that referenced this pull request Mar 1, 2024

Do not cache input args in dynamo bridge (pytorch#6553)

4d93a01

kmabeeTT mentioned this pull request Mar 2, 2026

Host memory leak between torch models due to experimental compile (250GB in gpt-oss-120B on galaxy) tenstorrent/tt-xla#3507

Open

JackCaoG merged 6 commits into master from lsiyuan/dynamo-cache-no-input

4 participants
