Closed
Labels
P1 (Issue that should be fixed within a few weeks), community-backlog, enhancement (Request for new feature and/or capability), performance, serve (Ray Serve Related Issue)
Description
The current implementation of ray.serve.batch executes batches synchronously. This throttles throughput for asynchronous methods wrapped in ray.serve.batch.
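To make the throughput difference concrete, here is a minimal sketch in plain asyncio. It does not use Ray or reflect how ray.serve.batch is implemented internally; `handle_batch`, `run_sequential`, `run_concurrent`, and `timed` are hypothetical names chosen for illustration, and the sleep stands in for an asynchronous batched I/O call.

```python
import asyncio
import time


async def handle_batch(batch, delay):
    # Hypothetical stand-in for an async batched I/O call.
    await asyncio.sleep(delay)
    return [x * 2 for x in batch]


async def run_sequential(batches, delay):
    # One batch at a time: the next batch starts only after the
    # previous one has finished.
    results = []
    for batch in batches:
        results.append(await handle_batch(batch, delay))
    return results


async def run_concurrent(batches, delay):
    # All batches are in flight at once; gather preserves input order.
    return await asyncio.gather(*(handle_batch(b, delay) for b in batches))


def timed(runner, batches, delay):
    # Run one strategy to completion and report its wall-clock time.
    start = time.perf_counter()
    results = asyncio.run(runner(batches, delay))
    return results, time.perf_counter() - start


BATCHES = [[1, 2], [3, 4], [5, 6]]

if __name__ == "__main__":
    for runner in (run_sequential, run_concurrent):
        results, elapsed = timed(runner, BATCHES, 0.05)
        print(f"{runner.__name__}: {elapsed:.3f}s -> {results}")
```

In this toy setup the sequential runner's total time grows with the number of batches, while the concurrent runner's is bounded by the slowest batch. Whether the same gain carries over to a real deployment depends on the wrapped method actually yielding control while it waits.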
Use case
Executing batches concurrently could significantly improve usability when making I/O calls to an endpoint that expects batching. It would also increase throughput for router-style composed actors by avoiding the bubble that arises from waiting for the slowest sub-actor.