[Core] Add default Ray Node labels at Node init #53360
jjyao merged 39 commits into ray-project:master
cc: @MengjinYan
MengjinYan left a comment:
The logic change looks good to me! Just one comment about testing.
There are CI test failures, @ryanaoleary.
@edoakes I think I fixed the CI failure with bd51036.
The solution looks worrisome/hacky. What causes the accelerator manager calls to fail on startup? We might be breaking some kind of assumption there. @jjyao PTAL at the usage of the accelerator manager here.
It'd raise an exception here due to missing dependencies such as |
Signed-off-by: Ryan O'Leary <ryanaoleary@google.com>
Co-authored-by: Jiajun Yao <jeromeyjj@gmail.com> Signed-off-by: Ryan O'Leary <113500783+ryanaoleary@users.noreply.github.com>
Co-authored-by: Mengjin Yan <mengjinyan3@gmail.com> Signed-off-by: Ryan O'Leary <113500783+ryanaoleary@users.noreply.github.com>
…sourceAndLabelSpec Signed-off-by: Ryan O'Leary <ryanaoleary@google.com>
…nd move record hardware usage to node.py Signed-off-by: Ryan O'Leary <ryanaoleary@google.com>
8bb604b to 4148cc0
@MengjinYan rebased and re-pushed since there were some changes to |
Signed-off-by: Ryan O'Leary <ryanaoleary@google.com> Signed-off-by: Ryan O'Leary <113500783+ryanaoleary@users.noreply.github.com> Co-authored-by: Jiajun Yao <jeromeyjj@gmail.com> Co-authored-by: Mengjin Yan <mengjinyan3@gmail.com> Signed-off-by: Krishna Kalyan <krishnakalyan3@gmail.com>
Signed-off-by: Ryan O'Leary <ryanaoleary@google.com> Signed-off-by: Ryan O'Leary <113500783+ryanaoleary@users.noreply.github.com> Co-authored-by: Jiajun Yao <jeromeyjj@gmail.com> Co-authored-by: Mengjin Yan <mengjinyan3@gmail.com> Signed-off-by: Kamil Kaczmarek <kamil@anyscale.com>
Signed-off-by: Ryan O'Leary <ryanaoleary@google.com> Signed-off-by: Ryan O'Leary <113500783+ryanaoleary@users.noreply.github.com> Co-authored-by: Jiajun Yao <jeromeyjj@gmail.com> Co-authored-by: Mengjin Yan <mengjinyan3@gmail.com> Signed-off-by: jugalshah291 <shah.jugal291@gmail.com>
Signed-off-by: Ryan O'Leary <ryanaoleary@google.com> Signed-off-by: Ryan O'Leary <113500783+ryanaoleary@users.noreply.github.com> Co-authored-by: Jiajun Yao <jeromeyjj@gmail.com> Co-authored-by: Mengjin Yan <mengjinyan3@gmail.com> Signed-off-by: Douglas Strodtman <douglas@anyscale.com>
Why are these changes needed?
This PR adds support for populating several default Ray node labels (described here) in the Ray runtime environment when a node is initialized. This change will help support autoscaling with the Label Selector API. This PR is related to ray-project/kuberay#3699, which passes several environment variables from the K8s stack that are used to set ray.io/ labels. I'll leave a comment on this PR with manual tests showing the ray.io/accelerator-type and ray.io/availability-zone labels getting set.
Related issue number
#51564
Checks
… git commit -s) in this PR.
… scripts/format.sh to lint the changes in this PR.
… method in Tune, I've added it in doc/source/tune/api/ under the corresponding .rst file.
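For readers following the description above, here is a minimal sketch of how default node labels could be derived from environment variables at node start-up. It is not the code from this PR. The environment variable names (EXAMPLE_ACCELERATOR_TYPE, EXAMPLE_AVAILABILITY_ZONE), the helper name default_node_labels, and the rule that user-supplied labels override defaults are all assumptions made for illustration. Only the label keys ray.io/accelerator-type and ray.io/availability-zone come from the description.

```python
import os
from typing import Dict, Mapping, Optional

# Hypothetical mapping from environment variable to label key. The variable
# names are placeholders for illustration, not the ones Ray or KubeRay use.
ENV_TO_LABEL = {
    "EXAMPLE_ACCELERATOR_TYPE": "ray.io/accelerator-type",
    "EXAMPLE_AVAILABILITY_ZONE": "ray.io/availability-zone",
}


def default_node_labels(
    env: Optional[Mapping[str, str]] = None,
    user_labels: Optional[Mapping[str, str]] = None,
) -> Dict[str, str]:
    """Build a label dict from environment variables (illustrative only)."""
    env = os.environ if env is None else env
    labels: Dict[str, str] = {}
    for var, key in ENV_TO_LABEL.items():
        value = env.get(var, "").strip()
        if value:
            # Skip unset or blank variables rather than emit empty labels.
            labels[key] = value
    # Assumption: explicitly supplied labels take precedence over defaults.
    labels.update(user_labels or {})
    return labels


if __name__ == "__main__":
    fake_env = {
        "EXAMPLE_ACCELERATOR_TYPE": "A100",
        "EXAMPLE_AVAILABILITY_ZONE": "us-central1-a",
    }
    print(default_node_labels(fake_env))
```

Passing the environment as a parameter keeps the helper easy to test without mutating os.environ. Skipping blank values means a node with no accelerator simply has no ray.io/accelerator-type label.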