[Data] Return 'inf' for max object store memory usage#59751
[Data] Return 'inf' for max object store memory usage#59751bveeramani merged 1 commit intomasterfrom
Conversation
Signed-off-by: Balaji Veeramani <bveeramani@berkeley.edu>
There was a problem hiding this comment.
Code Review
This pull request correctly addresses the issue of capping object store memory usage by setting the maximum to infinity for ActorPoolMapOperator and TaskPoolMapOperator. The logic is sound, as the output size of tasks can be unpredictable. The changes are clear and the update to the test for ActorPoolMapOperator is appropriate.
| # Set the max `object_store_memory` requirement to 'inf', because we | ||
| # don't know how much data each task can output. | ||
| object_store_memory=float("inf"), | ||
| ) |
There was a problem hiding this comment.
This change to set object_store_memory to float("inf") is correct. However, the existing tests for min_max_resource_requirements in test_task_pool_map_operator.py do not cover the case where _max_concurrency is set. It would be beneficial to add a test case to ensure this code path is validated, similar to how it's tested for ActorPoolMapOperator.
) ray-project#59412 updated the reservation resource allocator to cap resource allocations based on the resources returned by `min_max_resource_usage`. The problem is that running tasks can produce any amounts of data, so it doesn't make sense to cap by `obj_store_mem_max_pending_output_per_task * concurrency`. This PR fixes that issue by setting the max object store memory usage to 'inf'. Signed-off-by: Balaji Veeramani <bveeramani@berkeley.edu> Signed-off-by: jasonwrwang <jasonwrwang@tencent.com>
) ray-project#59412 updated the reservation resource allocator to cap resource allocations based on the resources returned by `min_max_resource_usage`. The problem is that running tasks can produce any amounts of data, so it doesn't make sense to cap by `obj_store_mem_max_pending_output_per_task * concurrency`. This PR fixes that issue by setting the max object store memory usage to 'inf'. Signed-off-by: Balaji Veeramani <bveeramani@berkeley.edu>
) ray-project#59412 updated the reservation resource allocator to cap resource allocations based on the resources returned by `min_max_resource_usage`. The problem is that running tasks can produce any amounts of data, so it doesn't make sense to cap by `obj_store_mem_max_pending_output_per_task * concurrency`. This PR fixes that issue by setting the max object store memory usage to 'inf'. Signed-off-by: Balaji Veeramani <bveeramani@berkeley.edu> Signed-off-by: peterxcli <peterxcli@gmail.com>
#59412 updated the reservation resource allocator to cap resource allocations based on the resources returned by
min_max_resource_usage. The problem is that running tasks can produce any amounts of data, so it doesn't make sense to cap byobj_store_mem_max_pending_output_per_task * concurrency.This PR fixes that issue by setting the max object store memory usage to 'inf'.