[Data] Add `average_num_inputs_per_task` and `num_output_blocks_per_task_s` metrics by bveeramani · Pull Request #56379 · ray-project/ray

bveeramani · 2025-09-09T17:19:17Z

Why are these changes needed?

This PR adds two task-level metrics for better visibility into operator performance:

average_num_inputs_per_task – average input blocks per task.
num_output_blocks_per_task_s – average output blocks per task per second.

Both return None if no tasks have finished or no output exists, avoiding misleading values. These metrics can be used to help making scheduling decisions.

Related issue number

Checks

I've signed off every commit(by using the -s flag, i.e., git commit -s) in this PR.
I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
- I've added any new APIs to the API Reference. For example, if I added a
  method in Tune, I've added it in doc/source/tune/api/ under the
  corresponding .rst file.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

Signed-off-by: Balaji Veeramani <bveeramani@berkeley.edu>

gemini-code-assist

Code Review

This pull request introduces two new metrics for operator performance monitoring. The implementation is mostly correct, but I've identified a couple of areas for improvement. Specifically, for the average_num_inputs_per_task metric, the description and metrics group seem to be incorrect, likely due to a copy-paste error. For num_output_blocks_per_task_s, I've suggested a rename and description update to better reflect its nature as a rate of block generation, which should improve clarity. The test updates are appropriate but will need to be adjusted if my suggestions are implemented.

python/ray/data/_internal/execution/interfaces/op_runtime_metrics.py

iamjustinhsu

gemini comment

Signed-off-by: Balaji Veeramani <bveeramani@berkeley.edu>

…ask_s` metrics (ray-project#56379)   ## Why are these changes needed?  This PR adds two task-level metrics for better visibility into operator performance: * `average_num_inputs_per_task` – average input blocks per task. * `num_output_blocks_per_task_s` – average output blocks per task per second. Both return `None` if no tasks have finished or no output exists, avoiding misleading values. These metrics can be used to help making scheduling decisions. ## Related issue number  ## Checks - [ ] I've signed off every commit(by using the -s flag, i.e., `git commit -s`) in this PR. - [ ] I've run `scripts/format.sh` to lint the changes in this PR. - [ ] I've included any doc changes needed for https://docs.ray.io/en/master/. - [ ] I've added any new APIs to the API Reference. For example, if I added a method in Tune, I've added it in `doc/source/tune/api/` under the corresponding `.rst` file. - [ ] I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/ - Testing Strategy - [ ] Unit tests - [ ] Release tests - [ ] This PR is not tested :( --------- Signed-off-by: Balaji Veeramani <bveeramani@berkeley.edu> Signed-off-by: zac <zac@anyscale.com>

…ask_s` metrics (#56379)   ## Why are these changes needed?  This PR adds two task-level metrics for better visibility into operator performance: * `average_num_inputs_per_task` – average input blocks per task. * `num_output_blocks_per_task_s` – average output blocks per task per second. Both return `None` if no tasks have finished or no output exists, avoiding misleading values. These metrics can be used to help making scheduling decisions. ## Related issue number  ## Checks - [ ] I've signed off every commit(by using the -s flag, i.e., `git commit -s`) in this PR. - [ ] I've run `scripts/format.sh` to lint the changes in this PR. - [ ] I've included any doc changes needed for https://docs.ray.io/en/master/. - [ ] I've added any new APIs to the API Reference. For example, if I added a method in Tune, I've added it in `doc/source/tune/api/` under the corresponding `.rst` file. - [ ] I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/ - Testing Strategy - [ ] Unit tests - [ ] Release tests - [ ] This PR is not tested :( --------- Signed-off-by: Balaji Veeramani <bveeramani@berkeley.edu> Signed-off-by: Douglas Strodtman <douglas@anyscale.com>

…ask_s` metrics (ray-project#56379)   ## Why are these changes needed?  This PR adds two task-level metrics for better visibility into operator performance: * `average_num_inputs_per_task` – average input blocks per task. * `num_output_blocks_per_task_s` – average output blocks per task per second. Both return `None` if no tasks have finished or no output exists, avoiding misleading values. These metrics can be used to help making scheduling decisions. ## Related issue number  ## Checks - [ ] I've signed off every commit(by using the -s flag, i.e., `git commit -s`) in this PR. - [ ] I've run `scripts/format.sh` to lint the changes in this PR. - [ ] I've included any doc changes needed for https://docs.ray.io/en/master/. - [ ] I've added any new APIs to the API Reference. For example, if I added a method in Tune, I've added it in `doc/source/tune/api/` under the corresponding `.rst` file. - [ ] I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/ - Testing Strategy - [ ] Unit tests - [ ] Release tests - [ ] This PR is not tested :( --------- Signed-off-by: Balaji Veeramani <bveeramani@berkeley.edu>

Initial commit

1b4a2f6

Signed-off-by: Balaji Veeramani <bveeramani@berkeley.edu>

bveeramani requested a review from a team as a code owner September 9, 2025 17:19

bveeramani assigned iamjustinhsu Sep 9, 2025

gemini-code-assist bot reviewed Sep 9, 2025

View reviewed changes

python/ray/data/_internal/execution/interfaces/op_runtime_metrics.py Show resolved Hide resolved

python/ray/data/_internal/execution/interfaces/op_runtime_metrics.py Show resolved Hide resolved

iamjustinhsu approved these changes Sep 9, 2025

View reviewed changes

ray-gardener bot added the data Ray Data-related issues label Sep 9, 2025

bveeramani enabled auto-merge (squash) September 9, 2025 19:14

github-actions bot added the go add ONLY when ready to merge, run all tests label Sep 9, 2025

Address review comment

35c7b07

Signed-off-by: Balaji Veeramani <bveeramani@berkeley.edu>

github-actions bot disabled auto-merge September 9, 2025 19:15

Merge branch 'master' into add-new-metrics

756db2e

bveeramani enabled auto-merge (squash) September 9, 2025 19:16

github-actions bot disabled auto-merge September 9, 2025 19:16

bveeramani enabled auto-merge (squash) September 9, 2025 20:51

Merge branch 'master' into add-new-metrics

2720eb3

github-actions bot disabled auto-merge September 10, 2025 18:10

bveeramani merged commit d28b3f4 into master Sep 10, 2025
5 checks passed

bveeramani deleted the add-new-metrics branch September 10, 2025 20:02

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Data] Add `average_num_inputs_per_task` and `num_output_blocks_per_task_s` metrics#56379

[Data] Add `average_num_inputs_per_task` and `num_output_blocks_per_task_s` metrics#56379
bveeramani merged 4 commits intomasterfrom
add-new-metrics

bveeramani commented Sep 9, 2025 •

edited

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

Uh oh!

iamjustinhsu left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

bveeramani commented Sep 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Why are these changes needed?

Related issue number

Checks

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

iamjustinhsu left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

bveeramani commented Sep 9, 2025 •

edited

Loading