Does vLLM support streaming chat in offline inference? #2083

Closed

Opened on Dec 13, 2023

I found that vLLM supports streaming in the API server, but I want to know whether vLLM also supports streaming chat in offline inference. I didn't find it in the examples.