[WIP] Only clear ObjectRef metadata for task arguments that have a non-default tensor transport #53630

Closed
stephanie-wang wants to merge 1 commit into ray-project:master from stephanie-wang:fix-52938

Conversation

@stephanie-wang
Contributor

Why are these changes needed?

When an object with a non-default tensor transport is passed as an argument to another task, the task needs to know the object ID so that it can fetch the remaining data through the out-of-band tensor transport. Previously, this metadata was cleared for inlined objects, but we need to retain it for these partially inlined objects. This PR adds a tensor transport field to ObjectRefs in C++ so that the dependency resolver can check whether to retain the ObjectRef metadata for downstream tasks.
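The check described above can be illustrated with a minimal Python sketch. It is not the C++ change in this PR: `TensorTransport`, `TaskArg`, and `inline_arg` are hypothetical names made up for illustration, and `OUT_OF_BAND` is a placeholder for any non-default transport. The only behavior it models is that the object ID is dropped on inlining unless the argument uses a non-default tensor transport.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class TensorTransport(Enum):
    # Hypothetical enum: OBJECT_STORE stands in for the default transport,
    # OUT_OF_BAND for any non-default (e.g. GPU-to-GPU) transport.
    OBJECT_STORE = 0
    OUT_OF_BAND = 1


@dataclass
class TaskArg:
    # Hypothetical stand-in for a task argument seen by the dependency resolver.
    object_id: Optional[str]
    inlined_data: Optional[bytes] = None
    tensor_transport: TensorTransport = TensorTransport.OBJECT_STORE


def inline_arg(arg: TaskArg, data: bytes) -> TaskArg:
    """Inline a resolved value, keeping the object ID only if it is still needed."""
    # A non-default transport means part of the value still lives out of band,
    # so the downstream task needs the object ID to fetch the rest.
    keep_ref = arg.tensor_transport is not TensorTransport.OBJECT_STORE
    return TaskArg(
        object_id=arg.object_id if keep_ref else None,
        inlined_data=data,
        tensor_transport=arg.tensor_transport,
    )
```

In this sketch, a fully inlined argument loses its object ID, while a partially inlined argument with `OUT_OF_BAND` transport keeps it.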

Related issue number

Closes #53623.

Checks

  • I've signed off every commit (by using the -s flag, i.e., git commit -s) in this PR.
  • I've run scripts/format.sh to lint the changes in this PR.
  • I've included any doc changes needed for https://docs.ray.io/en/master/.
    • I've added any new APIs to the API Reference. For example, if I added a
      method in Tune, I've added it in doc/source/tune/api/ under the
      corresponding .rst file.
  • I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
  • Testing Strategy
    • Unit tests
    • Release tests
    • This PR is not tested :(

Signed-off-by: Stephanie wang <smwang@cs.washington.edu>
@kevin85421
Member

Here is the follow-up PR based on this one: #53692

stephanie-wang added a commit that referenced this pull request Jun 10, 2025
…t_ref` for small and non-GPU objects (#53692)

This PR is based on #53630.

See #53623 for the issue. In this PR, we clear the object ref when the
arg's tensor transport is not OBJECT_STORE.

Closes #53623 
---------

Signed-off-by: Stephanie wang <smwang@cs.washington.edu>
Signed-off-by: Kai-Hsun Chen <kaihsun@anyscale.com>
Signed-off-by: Kai-Hsun Chen <kaihsun@apache.org>
Co-authored-by: Stephanie wang <smwang@cs.washington.edu>
Co-authored-by: Stephanie Wang <swang@cs.berkeley.edu>
elliot-barn pushed a commit that referenced this pull request Jun 18, 2025
…t_ref` for small and non-GPU objects (#53692)

elliot-barn pushed a commit that referenced this pull request Jul 2, 2025
…t_ref` for small and non-GPU objects (#53692)



Development

Successfully merging this pull request may close these issues.

[core][gpu-objects] Performance regression caused by transferring object references for small objects

3 participants