[Data]: Support making embeddings batch predictions with build_llm_processor #55384

@martinbomio

Description

Ray Data supports batch chat completion predictions with vLLM through vLLMEngineProcessorConfig and build_llm_processor. It would be great to have something similar for /v1/embeddings.

Use case

Make batch predictions with embedding models.
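
To make the request more concrete, here is a rough plain-Python sketch of the row-in/row-out shape I have in mind. It is not Ray or vLLM code: the helper names (build_embedding_requests, attach_embeddings), the "text" column, and the model name "my-embedding-model" are all placeholders I made up for illustration. The only thing it assumes about the server is the usual OpenAI-style /v1/embeddings body, where the request carries "model" and "input" and the response carries a "data" list of items with "index" and "embedding".

```python
from typing import Any, Dict, Iterator, List

Row = Dict[str, Any]


def build_embedding_requests(
    rows: List[Row],
    model: str,
    batch_size: int = 2,
    text_column: str = "text",  # hypothetical column name
) -> Iterator[Dict[str, Any]]:
    """Group rows into OpenAI-style /v1/embeddings request bodies."""
    for start in range(0, len(rows), batch_size):
        chunk = rows[start:start + batch_size]
        yield {"model": model, "input": [row[text_column] for row in chunk]}


def attach_embeddings(rows: List[Row], response: Dict[str, Any]) -> List[Row]:
    """Merge one /v1/embeddings response back onto the rows that produced it.

    Items are matched by their "index" field, so the order of
    response["data"] does not matter.
    """
    by_index = {item["index"]: item["embedding"] for item in response["data"]}
    return [{**row, "embedding": by_index[i]} for i, row in enumerate(rows)]


# Placeholder data; "my-embedding-model" is not a real model name.
rows = [{"text": "hello"}, {"text": "world"}, {"text": "ray"}]
requests = list(
    build_embedding_requests(rows, model="my-embedding-model", batch_size=2)
)

# A hand-written stand-in for what a server might return for the first batch.
fake_response = {
    "data": [
        {"index": 1, "embedding": [0.3, 0.4]},
        {"index": 0, "embedding": [0.1, 0.2]},
    ]
}
merged = attach_embeddings(rows[:2], fake_response)
```

Ideally the processor would take care of the batching and merging, so that the user only supplies preprocess and postprocess functions, as with chat completions today.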
