[Data] Refactor `Planner` to avoid storing plan-specific state by bveeramani · Pull Request #53955 · ray-project/ray

bveeramani · 2025-06-19T17:34:31Z

Why are these changes needed?

The current Planner implementation stores plan-specific state (like the op_map) as a class attribute. This makes the planner technically stateful across calls, which could lead to incorrect results if .plan() is called multiple times on the same instance:

planner = Planner()
planner.plan(logical_plan1)
# This will lead to incorrect results because it reuses state form the first plan
planner.plan(logical_plan2)

While this doesn’t currently happen in practice, it’s fragile and could easily lead to bugs.

This PR refactors the planner to avoid storing any state tied to a specific plan, making it safe to reuse across multiple calls.

Related issue number

Checks

I've signed off every commit(by using the -s flag, i.e., git commit -s) in this PR.
I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
- I've added any new APIs to the API Reference. For example, if I added a
  method in Tune, I've added it in doc/source/tune/api/ under the
  corresponding .rst file.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

Signed-off-by: Balaji Veeramani <bveeramani@berkeley.edu> [Data] Insert checkpoint layers during planning instead of during physical optimization (#1860) Signed-off-by: Balaji Veeramani <bveeramani@berkeley.edu>

Copilot

Pull Request Overview

This PR refactors the Planner to remove plan-specific state by replacing direct instantiations of Planner with a factory function (create_planner), and updates the planning logic to use a functional approach with a mapping of logical operator types to planning functions.

Replace direct Planner instantiation with create_planner in tests and logical optimizers.
Change the PLAN_LOGICAL_OP_FNS collection from a list to a dictionary and update the planning recursion accordingly.

Reviewed Changes

Copilot reviewed 6 out of 6 changed files in this pull request and generated no comments.

Show a summary per file

File	Description
python/ray/data/tests/test_randomize_block_order.py	Updated Planner instantiation to use create_planner
python/ray/data/tests/test_operator_fusion.py	Updated multiple Planner instantiations with create_planner
python/ray/data/tests/test_execution_optimizer.py	Replaced Planner() calls with create_planner() in tests
python/ray/data/_internal/planner/planner.py	Refactored internal planning logic and mapping collection
python/ray/data/_internal/planner/init.py	Added create_planner factory method
python/ray/data/_internal/logical/optimizers.py	Updated get_execution_plan to use create_planner

Comments suppressed due to low confidence (1)

python/ray/data/_internal/planner/planner.py:182

The function plan_recursively's return type annotation and its docstring appear to be mismatched; the docstring suggests a mapping from physical to logical operators, while the annotation indicates Dict[LogicalOperator, PhysicalOperator]. Consider updating the annotation and docstring to accurately reflect that the mapping is from PhysicalOperator to LogicalOperator.

def plan_recursively(

Signed-off-by: Balaji Veeramani <bveeramani@berkeley.edu>

alexeykudinkin · 2025-06-19T17:44:23Z

python/ray/data/_internal/planner/planner.py

    """

-    def __init__(self):
-        self._physical_op_to_logical_op: Dict[PhysicalOperator, LogicalOperator] = {}


@bveeramani i don't think there's anything wrong with the planner being stateful component

I think issue you're running into could be addressed much more easily:

Make Planner.plan class/static method

Planner.plan initializes planner and makes sure that instances aren't shared b/w invocations

…roject#53955)   ## Why are these changes needed?  The current `Planner` implementation stores plan-specific state (like the `op_map`) as a class attribute. This makes the planner technically stateful across calls, which could lead to incorrect results if `.plan()` is called multiple times on the same instance: ``` planner = Planner() planner.plan(logical_plan1) # This will lead to incorrect results because it reuses state form the first plan planner.plan(logical_plan2) ``` While this doesn’t currently happen in practice, it’s fragile and could easily lead to bugs. This PR refactors the planner to avoid storing any state tied to a specific plan, making it safe to reuse across multiple calls. ## Related issue number  ## Checks - [ ] I've signed off every commit(by using the -s flag, i.e., `git commit -s`) in this PR. - [ ] I've run `scripts/format.sh` to lint the changes in this PR. - [ ] I've included any doc changes needed for https://docs.ray.io/en/master/. - [ ] I've added any new APIs to the API Reference. For example, if I added a method in Tune, I've added it in `doc/source/tune/api/` under the corresponding `.rst` file. - [ ] I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/ - Testing Strategy - [ ] Unit tests - [ ] Release tests - [ ] This PR is not tested :( --------- Signed-off-by: Balaji Veeramani <bveeramani@berkeley.edu>

## Why are these changes needed?  The current `Planner` implementation stores plan-specific state (like the `op_map`) as a class attribute. This makes the planner technically stateful across calls, which could lead to incorrect results if `.plan()` is called multiple times on the same instance: ``` planner = Planner() planner.plan(logical_plan1) # This will lead to incorrect results because it reuses state form the first plan planner.plan(logical_plan2) ``` While this doesn’t currently happen in practice, it’s fragile and could easily lead to bugs. This PR refactors the planner to avoid storing any state tied to a specific plan, making it safe to reuse across multiple calls. ## Related issue number  ## Checks - [ ] I've signed off every commit(by using the -s flag, i.e., `git commit -s`) in this PR. - [ ] I've run `scripts/format.sh` to lint the changes in this PR. - [ ] I've included any doc changes needed for https://docs.ray.io/en/master/. - [ ] I've added any new APIs to the API Reference. For example, if I added a method in Tune, I've added it in `doc/source/tune/api/` under the corresponding `.rst` file. - [ ] I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/ - Testing Strategy - [ ] Unit tests - [ ] Release tests - [ ] This PR is not tested :( --------- Signed-off-by: Balaji Veeramani <bveeramani@berkeley.edu> Signed-off-by: elliot-barn <elliot.barnwell@anyscale.com>

Original PR #53955 by bveeramani Original: ray-project/ray#53955

…n-specific state Merged from original PR #53955 Original: ray-project/ray#53955

Update stuff

455bc4e

Signed-off-by: Balaji Veeramani <bveeramani@berkeley.edu> [Data] Insert checkpoint layers during planning instead of during physical optimization (#1860) Signed-off-by: Balaji Veeramani <bveeramani@berkeley.edu>

Copilot AI review requested due to automatic review settings June 19, 2025 17:34

bveeramani requested a review from a team as a code owner June 19, 2025 17:34

Copilot AI reviewed Jun 19, 2025

View reviewed changes

bveeramani assigned raulchen Jun 19, 2025

Fix type hint

98b7db3

Signed-off-by: Balaji Veeramani <bveeramani@berkeley.edu>

alexeykudinkin reviewed Jun 19, 2025

View reviewed changes

raulchen approved these changes Jun 19, 2025

View reviewed changes

bveeramani enabled auto-merge (squash) June 19, 2025 18:14

github-actions bot added the go add ONLY when ready to merge, run all tests label Jun 19, 2025

bveeramani merged commit 1d54fc7 into master Jun 19, 2025
7 checks passed

bveeramani deleted the refactor-planner branch June 19, 2025 19:52

snorkelopstesting4-web mentioned this pull request Oct 22, 2025

[Data] Refactor Planner to avoid storing plan-specific state snorkel-marlin-repos/ray-project_ray_pr_53955_d6a41f93-e10a-44d7-b4c3-8d181e0cde7d#1

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Data] Refactor `Planner` to avoid storing plan-specific state#53955

[Data] Refactor `Planner` to avoid storing plan-specific state#53955
bveeramani merged 2 commits intomasterfrom
refactor-planner

bveeramani commented Jun 19, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

alexeykudinkin Jun 19, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

bveeramani commented Jun 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Why are these changes needed?

Related issue number

Checks

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

alexeykudinkin Jun 19, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

bveeramani commented Jun 19, 2025 •

edited

Loading