Skip to content

[core] Deflake test_object_spilling.py#53803

Merged
edoakes merged 1 commit intoray-project:masterfrom
edoakes:eoakes/deflake-spilling
Jun 13, 2025
Merged

[core] Deflake test_object_spilling.py#53803
edoakes merged 1 commit intoray-project:masterfrom
edoakes:eoakes/deflake-spilling

Conversation

@edoakes
Copy link
Copy Markdown
Collaborator

@edoakes edoakes commented Jun 13, 2025

Flake: https://buildkite.com/ray-project/postmerge/builds/10825#019767ae-c856-44d2-a2ce-5115c1c7e828/177-1225

Removing the assertion on an internal implementation detail, instead just checking that the spilling workload can succeed.

Signed-off-by: Edward Oakes <ed.nmi.oakes@gmail.com>
@edoakes edoakes requested a review from a team June 13, 2025 14:46
@edoakes edoakes added the go add ONLY when ready to merge, run all tests label Jun 13, 2025
@edoakes edoakes enabled auto-merge (squash) June 13, 2025 14:46
@edoakes edoakes disabled auto-merge June 13, 2025 18:37
@edoakes edoakes merged commit e75d6d6 into ray-project:master Jun 13, 2025
4 of 6 checks passed
edoakes added a commit that referenced this pull request Jun 16, 2025
I made the workload more stressful in
#53803 by fetching all of the
results concurrently. That seems to have caused Windows to time out:
https://buildkite.com/ray-project/postmerge/builds/10876#01977755-755b-485e-bd13-f0ea3e33cc36/158-818

I won't pretend to fully understand why, but reverting to the old
pattern in an attempt to fix it.

Also added an explicit wait for the dir to drain because there was an
error during cleanup caused by it.

---------

Signed-off-by: Edward Oakes <ed.nmi.oakes@gmail.com>
elliot-barn pushed a commit that referenced this pull request Jun 18, 2025
Flake:
https://buildkite.com/ray-project/postmerge/builds/10825#019767ae-c856-44d2-a2ce-5115c1c7e828/177-1225

Removing the assertion on an internal implementation detail, instead
just checking that the spilling workload can succeed.

Signed-off-by: Edward Oakes <ed.nmi.oakes@gmail.com>
Signed-off-by: elliot-barn <elliot.barnwell@anyscale.com>
elliot-barn pushed a commit that referenced this pull request Jun 18, 2025
I made the workload more stressful in
#53803 by fetching all of the
results concurrently. That seems to have caused Windows to time out:
https://buildkite.com/ray-project/postmerge/builds/10876#01977755-755b-485e-bd13-f0ea3e33cc36/158-818

I won't pretend to fully understand why, but reverting to the old
pattern in an attempt to fix it.

Also added an explicit wait for the dir to drain because there was an
error during cleanup caused by it.

---------

Signed-off-by: Edward Oakes <ed.nmi.oakes@gmail.com>
Signed-off-by: elliot-barn <elliot.barnwell@anyscale.com>
minerharry pushed a commit to minerharry/ray that referenced this pull request Jun 27, 2025
I made the workload more stressful in
ray-project#53803 by fetching all of the
results concurrently. That seems to have caused Windows to time out:
https://buildkite.com/ray-project/postmerge/builds/10876#01977755-755b-485e-bd13-f0ea3e33cc36/158-818

I won't pretend to fully understand why, but reverting to the old
pattern in an attempt to fix it.

Also added an explicit wait for the dir to drain because there was an
error during cleanup caused by it.

---------

Signed-off-by: Edward Oakes <ed.nmi.oakes@gmail.com>
elliot-barn pushed a commit that referenced this pull request Jul 2, 2025
Flake:
https://buildkite.com/ray-project/postmerge/builds/10825#019767ae-c856-44d2-a2ce-5115c1c7e828/177-1225

Removing the assertion on an internal implementation detail, instead
just checking that the spilling workload can succeed.

Signed-off-by: Edward Oakes <ed.nmi.oakes@gmail.com>
Signed-off-by: elliot-barn <elliot.barnwell@anyscale.com>
elliot-barn pushed a commit that referenced this pull request Jul 2, 2025
I made the workload more stressful in
#53803 by fetching all of the
results concurrently. That seems to have caused Windows to time out:
https://buildkite.com/ray-project/postmerge/builds/10876#01977755-755b-485e-bd13-f0ea3e33cc36/158-818

I won't pretend to fully understand why, but reverting to the old
pattern in an attempt to fix it.

Also added an explicit wait for the dir to drain because there was an
error during cleanup caused by it.

---------

Signed-off-by: Edward Oakes <ed.nmi.oakes@gmail.com>
Signed-off-by: elliot-barn <elliot.barnwell@anyscale.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

go add ONLY when ready to merge, run all tests

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants