Hi, is it possible to do offline generation similar to vLLM batch inference, where the model is not served? Something like:

```python
llm = sglang("path/to/llm")
```
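
For context, this is roughly the vLLM offline batch-inference interface being referred to (a minimal sketch; the model path and prompts are placeholders):

```python
# Minimal vLLM offline batch-inference sketch: the model runs in-process,
# no server involved. Model path and prompts are illustrative placeholders.
from vllm import LLM, SamplingParams

llm = LLM(model="path/to/llm")
sampling_params = SamplingParams(temperature=0.8, max_tokens=128)

# Generate completions for a batch of prompts in a single call.
outputs = llm.generate(["Hello, my name is", "The capital of France is"], sampling_params)
for out in outputs:
    print(out.outputs[0].text)
```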