Skip to content

[Serve.llm][P/D] Support separate deployment config for PDProxy in Prefill disagg#53821

Merged
kouroshHakha merged 2 commits intoray-project:masterfrom
kouroshHakha:kh/pd-ctrl-deployment
Jun 16, 2025
Merged

[Serve.llm][P/D] Support separate deployment config for PDProxy in Prefill disagg#53821
kouroshHakha merged 2 commits intoray-project:masterfrom
kouroshHakha:kh/pd-ctrl-deployment

Conversation

@kouroshHakha
Copy link
Copy Markdown
Contributor

This PR is simple. It adds a separate configuration for controlling deployment config of PDProxy

Signed-off-by: kouroshhakha <kourosh@anyscale.com>
@kouroshHakha kouroshHakha added the go add ONLY when ready to merge, run all tests label Jun 14, 2025
@kouroshHakha kouroshHakha marked this pull request as ready for review June 14, 2025 00:42
Copilot AI review requested due to automatic review settings June 14, 2025 00:42
@kouroshHakha kouroshHakha requested a review from a team as a code owner June 14, 2025 00:42

This comment was marked as outdated.

Signed-off-by: kouroshhakha <kourosh@anyscale.com>
@kouroshHakha kouroshHakha requested a review from Copilot June 14, 2025 00:46
Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

This PR introduces a dedicated deployment configuration for the PDProxy component in the prefill-decode disaggregated flow, allowing users to pass custom Ray Serve options.

  • Added a new proxy_deployment_config field to PDServingArgs for arbitrary Ray Serve deployment parameters.
  • Forwarded proxy_deployment_config through parse_args.
  • Applied the custom options in build_app via .options(**pd_config.proxy_deployment_config) on the PDProxyServer deployment.
Comments suppressed due to low confidence (2)

python/ray/llm/_internal/serve/deployments/prefill_decode_disagg/prefill_decode_disagg.py:42

  • [nitpick] Consider expanding this description to list common keys (e.g., num_replicas, ray_actor_options) or link directly to the Ray Serve deployment options docs for easier discoverability.
The Ray @server.deployment options for the proxy server.

python/ray/llm/_internal/serve/deployments/prefill_decode_disagg/prefill_decode_disagg.py:205

  • Add unit or integration tests to verify that various proxy_deployment_config parameters (e.g., replicas, resource tags) are correctly applied and take effect on the PDProxyServer deployment.
.options(**pd_config.proxy_deployment_config)

@kouroshHakha kouroshHakha enabled auto-merge (squash) June 16, 2025 22:07
@kouroshHakha kouroshHakha merged commit 9fb4510 into ray-project:master Jun 16, 2025
6 checks passed
elliot-barn pushed a commit that referenced this pull request Jun 18, 2025
…efill disagg (#53821)

Signed-off-by: kouroshhakha <kourosh@anyscale.com>
Signed-off-by: elliot-barn <elliot.barnwell@anyscale.com>
minerharry pushed a commit to minerharry/ray that referenced this pull request Jun 27, 2025
…efill disagg (ray-project#53821)

Signed-off-by: kouroshhakha <kourosh@anyscale.com>
elliot-barn pushed a commit that referenced this pull request Jul 2, 2025
…efill disagg (#53821)

Signed-off-by: kouroshhakha <kourosh@anyscale.com>
Signed-off-by: elliot-barn <elliot.barnwell@anyscale.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

go add ONLY when ready to merge, run all tests

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants