Skip to content

GentleCold/DaseR

Repository files navigation

DaseR

DaseR icon

RAG-native KV cache service for LLM inference. Integrates with vLLM via KVConnectorBase_V1; stores KV tensors directly to NVMe using NVIDIA cuFile (GDS) or io_uring as a fallback.

Architecture

DaseR architecture

Install

source <venv>/bin/activate
pip install -e .

Docs

License

DaseR is licensed under the Apache License 2.0.

About

RAG-native KV cache service for LLM inference.

Resources

License

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors