๐ DeepSeek-OCR โ the new frontier of OCR from @deepseek_ai , exploring optical context compression for LLMs, is running blazingly fast on vLLM โก (~2500 tokens/s on A100-40G) โ powered by vllm==0.8.5 for day-0 model support.
๐ง Compresses visual contexts up to 20ร while keeping
A high-throughput and memory-efficient inference and serving engine for LLMs. Join slack.vllm.ai to discuss together with the community!
Joined March 2024



















