Added Ray-Serve Config For LLMs #3517
Conversation
kouroshHakha
left a comment
The config looks good to me (though I haven't run it myself).
Should I also add config for autoscaling?

Chatted with @Blaze-DSP offline.

What is the plan? @kevin85421

Add a doc in the Ray repo and make this example simpler (e.g. remove LoRA).

I have updated the Ray Serve LLM config and added a doc for it in the Ray repo. PR for the doc: ray-serve llm doc
Going to give this a shot on my setup in the next week-ish. |
Worked great on my setup. Thanks for the PR @Blaze-DSP!
@eicherseiji could you push to this branch directly to fix CI issues so that I can merge this PR? Thanks!
Signed-off-by: DPatel_7 <dpatel@gocommotion.com>
Signed-off-by: Seiji Eicher <seiji@anyscale.com>
Co-authored-by: DPatel_7 <dpatel@gocommotion.com>
Co-authored-by: Seiji Eicher <seiji@anyscale.com>
```yaml
limits:
  cpu: 32
  memory: 32Gi
  nvidia.com/gpu: "4"
```
I know it will depend on the GPU type, but does Qwen/Qwen2.5-7B-Instruct really need 4 GPUs? What GPUs did you test with?
Ah, I noticed that tensor parallelism is not set, so each replica must only be using 1 GPU. I suggest updating this example to request only 1 GPU per worker.
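To illustrate the point above: a replica's GPU request should match the engine's tensor parallelism, since `tensor_parallel_size` determines how many GPUs each replica actually uses. A minimal sketch of the Serve LLM application config (the `llm-app` name and served `model_id` are hypothetical; verify the exact field names against your installed Ray version):

```yaml
# Sketch only, assuming the ray.serve.llm OpenAI-app builder.
applications:
- name: llm-app                       # hypothetical application name
  route_prefix: /
  import_path: ray.serve.llm:build_openai_app
  args:
    llm_configs:
    - model_loading_config:
        model_id: qwen2.5-7b-instruct # hypothetical served model id
        model_source: Qwen/Qwen2.5-7B-Instruct
      engine_kwargs:
        # 1 GPU per replica; if you raise this, raise the container's
        # GPU request to match.
        tensor_parallel_size: 1
```

With `tensor_parallel_size: 1`, the matching Kubernetes container limit would be `nvidia.com/gpu: "1"`; requesting 4 GPUs would leave 3 of them idle.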
Added Example Config For Ray-Serve LLM