Add API to donate input buffer for dynamo execution by JackCaoG · Pull Request #6587 · pytorch/xla

JackCaoG · 2024-02-22T02:43:52Z

This pr adds an api torch_xla._XLAC._set_buffer_donation which can be used to mark the buffer associated with current tensor to be donated in the next execution. This api is currently only enabled for dynamo(torch.compile) use case, ti is a no-op in LazyTensor world(since LTC already has the auto aliasing based on inplace op and dynamo's functionization pass remove all inplace ops).

Example usage should be

  def dummy_inplace_add(self, input):
    input += 1
    return

  def test_manual_buffer_donation(self):
    device = xm.xla_device()
    input = torch.randn(5, 5).to(device)
    dummy_inplace_add_compiled = torch.compile(
        self.dummy_inplace_add, backend='openxla')

    met.clear_all()
    # input is a device_data, we should be able to set the buffer donation field.
    self.assertTrue(torch_xla._XLAC._set_buffer_donation(input, True))
    # make sure buffer donation setting is correctly updated
    self.assertTrue(torch_xla._XLAC._get_buffer_donation(input))
	
	for _ in range(100):
      # You don't need to keep calling this function if you function does not cause dynamo recompilation.
      # check below for the reason.
      self.assertTrue(torch_xla._XLAC._set_buffer_donation(input, True))
      dummy_inplace_add_compiled(input)

Please note a couple things

_set_buffer_donation is called on a tensor, but being applied to the buffer associated with this tensor. If the tensor you passed in is not a buffer(for example an intermediate tensor that has not been evulated) this api will be no-op and return false.
Buffer aliasing is being set up during compilation time. Torch Dynamo also does not track this field, so even if you change _set_buffer_donation after first execution(torch.compile compilation triggered at first execution of the compiled function), aliasing will not change.

JackCaoG · 2024-02-22T02:49:34Z

I chatted with @bdhirsh , ideally we want to figure out which input tensor can be aliased without user calling this api. There should be a way for us to retrive the aliasing information from functionization pass in aot-autograd.

The context is that If we need to alias an input buffer to output buffer, we need to make sure input buffer can not be accessed by original tensor. For example

a += 1

is fine because we know once the in place ops happens, a will point to a different buffer, so it is safe to reuse that buffer to something else.

if it is

b = a + 1

I can't alias because a's buffer is still needed. once aliasing happens, original buffer's value will be invalidated. if we alias in the

b = a + 1

case. a's value will be incorrect after this computation. Dynamo comes with functionization which will remove all inplace ops from the fx graph passed down, which is a problem in this case.

JackCaoG · 2024-02-24T00:41:58Z


+bool ShouldAliasBasedOnBufferDonor() {
+  // This env var will be updated during run time, do not use static bool here.
+  return runtime::sys_util::GetEnvBool("XLA_SHOULD_ALIAS_WITH_BUFFER_DONOR",


this is not ideal.. I want to add a new config to coll.config but that struct now lives on upstream...

…ll overwrite the buffer donation

…to hash conflict

lsy323

LGTM, left a few questions

JackCaoG · 2024-02-27T03:56:34Z

I will merge this change to unblock the user, fix comments in a follow up pr.

alanwaketan

Sorry for being late, but looks good to me.

JackCaoG marked this pull request as ready for review February 24, 2024 00:27

JackCaoG requested review from alanwaketan and lsy323 February 24, 2024 00:36

JackCaoG changed the title ~~[WIP]Add API to donate input buffer for dynamo execution~~ Add API to donate input buffer for dynamo execution Feb 24, 2024

JackCaoG commented Feb 24, 2024

View reviewed changes

JackCaoG added 12 commits February 24, 2024 00:44

Add api to buffer donation

507ebc1

add SetBufferDonors

ebbafda

Add get_buffer_donation, fix a bug where mark+step with devicedata wi…

ccbdc53

…ll overwrite the buffer donation

add python tests, currently they will fail if being run together due …

92480b4

…to hash conflict

add test to testing script

5671ee5

make sure compilation hash tracks buffer donor index

0b6c3a8

only enable buffer donor aliasing in dynamo

3561bdf

Fix a bug where warm up cache might accidentlly execute the graph

e1c3a44

Add test for non-dynamo buffer donation

0b60229

add more test

d12446b

remove debugging messages

93af291

add comment

fda1c41

JackCaoG force-pushed the JackCaoG/dynamo_aliasing_2 branch from 09cc0cb to fda1c41 Compare February 24, 2024 00:45

remove debug messages and fix tests

866b2bc

JackCaoG mentioned this pull request Feb 26, 2024

Add api to buffer donation #6616

Closed

JackCaoG requested a review from will-cromar February 26, 2024 22:41

lsy323 approved these changes Feb 27, 2024

View reviewed changes

Comment thread torch_xla/core/dynamo_bridge.py

Comment thread torch_xla/csrc/helpers.cpp

Comment thread torch_xla/csrc/xla_graph_executor.cpp

JackCaoG merged commit 3e2a23c into master Feb 27, 2024

alanwaketan reviewed Feb 27, 2024

View reviewed changes

amithrm pushed a commit to amithrm/xla that referenced this pull request Mar 1, 2024

Add API to donate input buffer for dynamo execution (pytorch#6587)

1e772e3

rpsilva-aws mentioned this pull request Feb 22, 2025

Extend buffer donation aliasing APIs #8721

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add API to donate input buffer for dynamo execution#6587

Add API to donate input buffer for dynamo execution#6587
JackCaoG merged 13 commits intomasterfrom
JackCaoG/dynamo_aliasing_2

JackCaoG commented Feb 22, 2024 •

edited

Loading

Uh oh!

JackCaoG commented Feb 22, 2024

Uh oh!

JackCaoG Feb 24, 2024

Uh oh!

lsy323 left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

JackCaoG commented Feb 27, 2024

Uh oh!

alanwaketan left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

JackCaoG commented Feb 22, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

JackCaoG commented Feb 22, 2024

Uh oh!

JackCaoG Feb 24, 2024

Choose a reason for hiding this comment

Uh oh!

lsy323 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

JackCaoG commented Feb 27, 2024

Uh oh!

alanwaketan left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

JackCaoG commented Feb 22, 2024 •

edited

Loading