[Data] Don't run `train` and `ml` tests on Data-only PRs by bveeramani · Pull Request #59690 · ray-project/ray

bveeramani · 2025-12-26T20:23:18Z

Thank you for contributing to Ray! 🚀
Please review the Ray Contribution Guide before opening a pull request.

⚠️ Remove these instructions before submitting your PR.

💡 Tip: Mark as draft if you want early feedback, or ready for review when it's complete.

Description

Briefly describe what this PR accomplishes and why it's needed.

Related issues

Link related issues: "Fixes #1234", "Closes #1234", or "Related to #1234".

Additional information

Optional: Add implementation details, API changes, usage examples, screenshots, etc.

Signed-off-by: Balaji Veeramani <bveeramani@berkeley.edu>

gemini-code-assist

Code Review

This pull request aims to optimize the CI process by preventing train and ml tests from running on pull requests that only contain changes related to Ray Data. This is achieved by updating the test rules in ci/pipeline/test_rules.txt. To ensure that data-related tests within the train module are still executed, a new Buildkite step is introduced in .buildkite/ml.rayci.yml which specifically runs tests tagged for data. Correspondingly, several tests in python/ray/train/ and python/ray/train/v2/ have been tagged with team:data in their BUILD.bazel files. Additionally, a test has been refactored by moving it to a more appropriate file. The changes are logical and well-executed. I have one suggestion to improve the robustness of the new CI step.

gemini-code-assist · 2025-12-26T20:25:06Z

.buildkite/ml.rayci.yml

+    commands:
+      - bazel run //ci/ray_ci:test_in_docker -- //python/ray/train/... data
+        --build-name mlgpubuild-py3.10 --python-version 3.10
+    depends_on: [ "mlgpubuild-multipy", "forge" ]


Consider adding soft_fail: true to this new test step. This would prevent potential flakiness in this newly carved-out test suite from blocking data-only PRs. Other similar test steps in this file, like :train: ml: train v2 gpu tests, use soft_fail: true.

depends_on: [ "mlgpubuild-multipy", "forge" ] soft_fail: true

bad advice gemini.. bad advice..

cursor · 2025-12-26T20:30:26Z

.buildkite/ml.rayci.yml

+    instance_type: gpu-large
+    commands:
+      - bazel run //ci/ray_ci:test_in_docker -- //python/ray/train/... data
+        --build-name mlgpubuild-py3.10 --python-version 3.10


Missing tag filter causes all train tests to run

The new buildkite job command is missing --only-tags team:data to filter which tests to run. Without this filter, all tests in //python/ray/train/... will execute when data files change, rather than just the tests tagged with team:data. This defeats the PR's goal of avoiding running all train tests on data-only changes. Other similar commands in this file consistently use --only-tags to filter tests by tag.

Signed-off-by: Balaji Veeramani <bveeramani@berkeley.edu>

…t/ray into stop-running-ml-tests Signed-off-by: Balaji Veeramani <bveeramani@berkeley.edu>

aslonnie · 2025-12-28T04:39:50Z

python/ray/train/v2/BUILD.bazel

    env = {"RAY_TRAIN_V2_ENABLED": "1"},
    tags = [
        "exclusive",
+        "team:data",


a test is normally expected to only have one team tag. if it has multiple team tags, I am not sure if something downstream consuming these tests will break.

why we need to add more team tags for these tests?

aslonnie · 2025-12-28T04:59:14Z

.buildkite/ml.rayci.yml

    depends_on: [ "mlgpubuild-multipy", "forge" ]
    soft_fail: true
+
+  - label: ":train: ml: data tests"


I suppose the purpose of this is to have a subset of train tests to be triggered on data changes?

if that is the purpose, here is what needs to be done:

for the tests, instead of adding team:data tag, add datatests (or some other meaningful tag for these tests)

you can change the team tag if you think that is the right thing to do, and move them to the data team's test group. but since they live in train directory, probably not the right idea to change the team tag

add a test job here with --only-tags datatests, with job tags having both data and train in it, meaning these tests will be triggered by both data code changes and train code changes.

add datatests into --except-tags that are used in other jobs, so that they will only run once.

bveeramani · 2025-12-31T05:15:43Z

Closing this prototype PR. I've described the task here: #59780

Initial com mit

0585bed

Signed-off-by: Balaji Veeramani <bveeramani@berkeley.edu>

bveeramani requested a review from a team as a code owner December 26, 2025 20:23

bveeramani marked this pull request as draft December 26, 2025 20:23

bveeramani added the go add ONLY when ready to merge, run all tests label Dec 26, 2025

gemini-code-assist bot reviewed Dec 26, 2025

View reviewed changes

cursor bot reviewed Dec 26, 2025

View reviewed changes

bveeramani and others added 3 commits December 26, 2025 15:28

Merge branch 'master' into stop-running-ml-tests

542d787

Add data test

6b1fc74

Signed-off-by: Balaji Veeramani <bveeramani@berkeley.edu>

Merge branch 'stop-running-ml-tests' of https://github.com/ray-projec…

c421320

…t/ray into stop-running-ml-tests Signed-off-by: Balaji Veeramani <bveeramani@berkeley.edu>

aslonnie requested review from andrew-anyscale and aslonnie December 27, 2025 01:29

aslonnie reviewed Dec 28, 2025

View reviewed changes

bveeramani mentioned this pull request Dec 31, 2025

[Data][CI] Stop running all ML tests on Data premerge #59780

Closed

bveeramani closed this Dec 31, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Data] Don't run `train` and `ml` tests on Data-only PRs#59690

[Data] Don't run `train` and `ml` tests on Data-only PRs#59690
bveeramani wants to merge 4 commits intomasterfrom
stop-running-ml-tests

bveeramani commented Dec 26, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Dec 26, 2025

Uh oh!

aslonnie Dec 31, 2025

Uh oh!

cursor bot Dec 26, 2025

Uh oh!

aslonnie Dec 28, 2025

Uh oh!

aslonnie Dec 28, 2025

Uh oh!

bveeramani commented Dec 31, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

bveeramani commented Dec 26, 2025

Description

Related issues

Additional information

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Dec 26, 2025

Choose a reason for hiding this comment

Uh oh!

aslonnie Dec 31, 2025

Choose a reason for hiding this comment

Uh oh!

cursor bot Dec 26, 2025

Choose a reason for hiding this comment

Missing tag filter causes all train tests to run

Uh oh!

aslonnie Dec 28, 2025

Choose a reason for hiding this comment

Uh oh!

aslonnie Dec 28, 2025

Choose a reason for hiding this comment

Uh oh!

bveeramani commented Dec 31, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants