[core] Remove perf test in `shutdown_coordinator_test` by codope · Pull Request #56033 · ray-project/ray

codope · 2025-08-28T04:50:30Z

Why are these changes needed?

The test microbenchmarks a method (ShutdownCoordinator::ShouldEarlyExit) that takes a lock. The method is checked at task boundaries and event loop posts, which is orders of magnitude less frequent than per-object/serialization loops, so the benchmark is not very useful. This PR removes the test.

Related issue number

Closes #55801

Signed-off-by: Sagar Sumit <sagarsumit09@gmail.com>

gemini-code-assist

Code Review

This pull request correctly disables a performance test under TSAN to prevent spurious failures. The change to use std::chrono::steady_clock instead of std::chrono::high_resolution_clock is an excellent improvement for ensuring monotonic time measurement in the benchmark. The code is clean and the changes are well-justified.

jjyao · 2025-08-28T05:00:35Z

src/ray/core_worker/tests/shutdown_coordinator_test.cc

  volatile bool result = false;

  for (int i = 0; i < iterations; ++i) {
    result = coordinator->ShouldEarlyExit();


why do we want to microbenchmark this? If we want a proper microbenchmark, we should use a library like https://github.com/google/benchmark, since writing a good and reliable benchmark is pretty hard.

agree especially not in our ci env

ShouldEarlyExit() sits on hot paths (e.g., CoreWorker::IsExiting() checks inside tight loops). In an earlier (unmerged) implementation of this method, I was using a more complex atomic flag with explicit memory ordering. It was simplified to just take a lock, but there was a perf concern, so I added this test to get a sense of how much time a call takes. I can remove it from the unit test. I don't see much utility in running it now.

ya i'd say kill it
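
For readers without the context of the earlier revision, the trade-off mentioned above can be sketched as two ways of reading a shutdown flag: under a mutex, or through std::atomic. The class names LockedShutdownFlag and AtomicShutdownFlag are hypothetical and neither is Ray's ShutdownCoordinator; the acquire/release ordering is only one plausible choice and is not taken from the unmerged implementation.

```cpp
#include <atomic>
#include <mutex>

// Hypothetical stand-in: the flag is read and written under a mutex.
// Simple to reason about; every read pays for a lock/unlock pair.
class LockedShutdownFlag {
 public:
  void RequestShutdown() {
    std::lock_guard<std::mutex> guard(mu_);
    shutting_down_ = true;
  }

  bool ShouldEarlyExit() const {
    std::lock_guard<std::mutex> guard(mu_);
    return shutting_down_;
  }

 private:
  mutable std::mutex mu_;  // mutable so the const reader can lock it
  bool shutting_down_ = false;
};

// Hypothetical stand-in: the flag is a single atomic<bool>.
// Reads are lock-free, but ordering must be chosen and justified by hand.
class AtomicShutdownFlag {
 public:
  void RequestShutdown() {
    shutting_down_.store(true, std::memory_order_release);
  }

  bool ShouldEarlyExit() const {
    return shutting_down_.load(std::memory_order_acquire);
  }

 private:
  std::atomic<bool> shutting_down_{false};
};
```

Both variants are free of data races on the flag itself, so neither should by itself trip a race detector; the difference is cost per read and how much reasoning the memory ordering needs.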

dayshah

i still don't get why this doesn't pass tsan. Is there something inherently unsafe with ShouldEarlyExit? I think I'm just missing context on this test and why we need to test ShouldEarlyExit perf

dayshah · 2025-08-28T05:00:24Z

src/ray/core_worker/tests/shutdown_coordinator_test.cc

-  auto start = std::chrono::high_resolution_clock::now();
+  auto start = std::chrono::steady_clock::now();
  constexpr int iterations = 1000000;
  volatile bool result = false;


why do we need volatile

To prevent the compiler from optimizing away the calls/loop. Without volatile (or another optimizer barrier), the compiler may elide repeated reads with no observable side effects.
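
A minimal sketch of the volatile-sink idea, assuming a hypothetical side-effect-free predicate CheapPredicate in place of ShouldEarlyExit. Stores to a volatile object count as observable behavior, so the compiler has to perform one store per iteration and therefore has to produce the value each time. This does not guarantee a real call instruction if the predicate is inlined; it only keeps the per-iteration work from being dropped.

```cpp
#include <cstdint>

// Hypothetical stand-in for a cheap predicate with no side effects.
inline bool CheapPredicate(std::uint64_t i) { return (i & 1u) == 0; }

// Evaluates the predicate `iterations` times and returns how many
// evaluations were performed. Each result is written to a volatile
// object, so the stores cannot be merged or removed by the optimizer.
inline std::uint64_t RunWithVolatileSink(std::uint64_t iterations) {
  volatile bool sink = false;
  std::uint64_t calls = 0;
  for (std::uint64_t i = 0; i < iterations; ++i) {
    sink = CheapPredicate(i);  // volatile store: must happen every iteration
    ++calls;
  }
  return calls;
}
```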

dayshah · 2025-08-28T05:00:53Z

src/ray/core_worker/tests/shutdown_coordinator_test.cc

  volatile bool result = false;

  for (int i = 0; i < iterations; ++i) {
    result = coordinator->ShouldEarlyExit();


should just wrap this in void( or RAY_UNUSED instead of having the unused result var

yeah, can do. However, void/RAY_UNUSED only silence "unused variable" warnings; they don't prevent call elision. So I can keep volatile to force evaluation and add (void)result or RAY_UNUSED(result); to silence the warning.
But I am considering removing the test altogether -- #56033 (comment)
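
The distinction drawn above can be sketched with an explicit optimizer barrier instead of a volatile variable. KeepAlive below is a hypothetical helper, not a Ray or standard-library function; on GCC and Clang it uses an empty inline-assembly statement that names the value as an input, similar in spirit to the DoNotOptimize helper in Google Benchmark. The fallback branch for other compilers is an assumption and uses a volatile copy. A plain (void) cast, by contrast, only discards the value and tells the optimizer nothing.

```cpp
// Hypothetical helper: tells the optimizer that `value` is used, without
// emitting any instructions of its own on GCC/Clang.
template <typename T>
inline void KeepAlive(const T &value) {
#if defined(__GNUC__) || defined(__clang__)
  asm volatile("" : : "r,m"(value) : "memory");
#else
  volatile T copy = value;  // assumed fallback: force a volatile store
  (void)copy;
#endif
}

// Hypothetical stand-in for the call being measured.
inline bool IsEven(int i) { return (i % 2) == 0; }

// Evaluates IsEven for 0..iterations-1, keeps every result alive through
// the barrier, and returns how many results were true.
inline int CountEven(int iterations) {
  int hits = 0;
  for (int i = 0; i < iterations; ++i) {
    const bool result = IsEven(i);
    KeepAlive(result);  // barrier: result must be materialized
    if (result) {
      ++hits;
    }
  }
  return hits;
}
```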

Signed-off-by: Sagar Sumit <sagarsumit09@gmail.com>

jjyao

Please update PR title and description to reflect the latest changes

edoakes · 2025-08-28T16:54:01Z

src/ray/core_worker/tests/shutdown_coordinator_test.cc

-  // Should be very fast (less than 100ns per call on modern hardware)
-  double ns_per_call = static_cast<double>(duration.count()) / iterations;
-  EXPECT_LT(ns_per_call, 100.0)
-      << "ShouldEarlyExit too slow: " << ns_per_call << "ns per call";


we should not write this type of test in CI as a general rule; it's destined to be flaky.

if we want to measure performance, we should do it as a release test where we actually track the metrics over time.

testing this specific call also doesn't make much sense to me.
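
One way to read the suggestion above is to separate measuring from asserting: a helper returns the measured cost so the caller can log or export it, and nothing compares it against a fixed threshold. MeasureNsPerCall is a hypothetical name, and how a release test would record the value over time is outside this sketch. It uses std::chrono::steady_clock so the elapsed time cannot go negative.

```cpp
#include <chrono>
#include <cstdint>

static_assert(std::chrono::steady_clock::is_steady,
              "steady_clock is required to be monotonic");

// Hypothetical helper: calls `fn` `iterations` times and returns the mean
// wall-clock cost per call in nanoseconds. The caller decides what to do
// with the number (log it, export it as a metric); nothing is asserted here.
// `fn` must return something convertible to bool; `iterations` must be > 0.
template <typename Fn>
double MeasureNsPerCall(Fn &&fn, std::int64_t iterations) {
  volatile bool sink = false;  // keeps each result from being discarded
  const auto start = std::chrono::steady_clock::now();
  for (std::int64_t i = 0; i < iterations; ++i) {
    sink = fn();
  }
  const auto elapsed = std::chrono::steady_clock::now() - start;
  const auto ns =
      std::chrono::duration_cast<std::chrono::nanoseconds>(elapsed).count();
  return static_cast<double>(ns) / static_cast<double>(iterations);
}
```

A caller would pass a small lambda wrapping the call of interest, for example `MeasureNsPerCall([] { return true; }, 1000000)`, and report the result rather than gate a unit test on it.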

[core] Disbale perf test in shutdown_coordinator_test under tsan

888f301

Signed-off-by: Sagar Sumit <sagarsumit09@gmail.com>

codope requested a review from a team as a code owner August 28, 2025 04:50

gemini-code-assist bot reviewed Aug 28, 2025

jjyao reviewed Aug 28, 2025

dayshah reviewed Aug 28, 2025

kill the benchmark

40a888e

Signed-off-by: Sagar Sumit <sagarsumit09@gmail.com>

jjyao approved these changes Aug 28, 2025

codope added the "go" label (add ONLY when ready to merge, run all tests) Aug 28, 2025

codope changed the title ~~[core] Disbale perf test in shutdown_coordinator_test under tsan~~ [core] Remove perf test in shutdown_coordinator_test Aug 28, 2025

dayshah enabled auto-merge (squash) August 28, 2025 06:09

dayshah merged commit 235b43f into ray-project:master Aug 28, 2025
6 of 7 checks passed

edoakes reviewed Aug 28, 2025

tohtana pushed a commit to tohtana/ray that referenced this pull request Aug 29, 2025

[core] Remove perf test in shutdown_coordinator_test (ray-project#5…

0585c14

…6033) Signed-off-by: Sagar Sumit <sagarsumit09@gmail.com> Signed-off-by: Masahiro Tanaka <mtanaka@anyscale.com>

tohtana pushed a commit to tohtana/ray that referenced this pull request Aug 29, 2025

[core] Remove perf test in shutdown_coordinator_test (ray-project#5…

1e11e23

…6033) Signed-off-by: Sagar Sumit <sagarsumit09@gmail.com> Signed-off-by: Masahiro Tanaka <mtanaka@anyscale.com>

gangsf pushed a commit to gangsf/ray that referenced this pull request Sep 2, 2025

[core] Remove perf test in shutdown_coordinator_test (ray-project#5…

44e8f47

…6033) Signed-off-by: Sagar Sumit <sagarsumit09@gmail.com> Signed-off-by: Gang Zhao <gang@gang-JQ62HD2C37.local>

sampan-s-nayak pushed a commit to sampan-s-nayak/ray that referenced this pull request Sep 8, 2025

[core] Remove perf test in shutdown_coordinator_test (ray-project#5…

f70f00c

…6033) Signed-off-by: Sagar Sumit <sagarsumit09@gmail.com> Signed-off-by: sampan <sampan@anyscale.com>

wyhong3103 pushed a commit to wyhong3103/ray that referenced this pull request Sep 12, 2025

[core] Remove perf test in shutdown_coordinator_test (ray-project#5…

e8d8d14

…6033) Signed-off-by: Sagar Sumit <sagarsumit09@gmail.com> Signed-off-by: yenhong.wong <yenhong.wong@grabtaxi.com>

dstrodtman pushed a commit to dstrodtman/ray that referenced this pull request Oct 6, 2025

[core] Remove perf test in shutdown_coordinator_test (ray-project#5…

2b11a6a

…6033) Signed-off-by: Sagar Sumit <sagarsumit09@gmail.com> Signed-off-by: Douglas Strodtman <douglas@anyscale.com>

landscapepainter pushed a commit to landscapepainter/ray that referenced this pull request Nov 17, 2025

[core] Remove perf test in shutdown_coordinator_test (ray-project#5…

f283c9c

…6033) Signed-off-by: Sagar Sumit <sagarsumit09@gmail.com>

dayshah merged 2 commits into ray-project:master from codope:fix-shutdown-coordinator-perf-test