[core] Increase instance sizes for wheel / HA tests#53783
Merged
jjyao merged 1 commit intoray-project:masterfrom Jun 13, 2025
Merged
[core] Increase instance sizes for wheel / HA tests#53783jjyao merged 1 commit intoray-project:masterfrom
jjyao merged 1 commit intoray-project:masterfrom
Conversation
Signed-off-by: dayshah <dhyey2019@gmail.com>
Contributor
There was a problem hiding this comment.
Pull Request Overview
This PR increases the CI instance sizes from medium to large for wheel builds and HA integration tests to prevent Bazel OOM failures.
- Bumped
instance_typetolargefor Serve wheel and Python steps - Updated KubeRay operator test instance size
- Increased core wheel and HA test instance sizes
Reviewed Changes
Copilot reviewed 3 out of 3 changed files in this pull request and generated no comments.
| File | Description |
|---|---|
| .buildkite/serve.rayci.yml | Changed instance_type to large for Serve steps |
| .buildkite/kuberay.rayci.yml | Changed instance_type to large for operator test |
| .buildkite/core.rayci.yml | Changed instance_type to large for core steps |
Comments suppressed due to low confidence (1)
.buildkite/serve.rayci.yml:91
- [nitpick] Several steps across YAML files repeat the same
instance_type: largesetting. Consider using a YAML anchor or default agent configuration to DRY up the pipeline and centralize instance size changes.
instance_type: large
Collaborator
|
I have seen increased memory consumption when compiling locally lately as well. |
elliot-barn
pushed a commit
that referenced
this pull request
Jun 18, 2025
Signed-off-by: dayshah <dhyey2019@gmail.com> Signed-off-by: elliot-barn <elliot.barnwell@anyscale.com>
8 tasks
elliot-barn
pushed a commit
that referenced
this pull request
Jul 2, 2025
Signed-off-by: dayshah <dhyey2019@gmail.com> Signed-off-by: elliot-barn <elliot.barnwell@anyscale.com>
8 tasks
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Why are these changes needed?
Many of my pr's that change any gcs code fail on these because of bazel oom's on medium sized instances. It seems specific to ci steps that are building wheels or doing HA things.
Examples on 3 separate pr's:
https://buildkite.com/ray-project/premerge/builds/41945#_
https://buildkite.com/ray-project/premerge/builds/41876
https://buildkite.com/ray-project/premerge/builds/41774#01975b1b-3ca8-477a-a2a0-2efbc6099c4d
Related issue number
Checks
git commit -s) in this PR.scripts/format.shto lint the changes in this PR.method in Tune, I've added it in
doc/source/tune/api/under thecorresponding
.rstfile.