Popular repositories

- vllm (Python, forked from vllm-project/vllm)
  A high-throughput and memory-efficient inference and serving engine for LLMs
- LMCache (Python, forked from LMCache/LMCache)
  Supercharge Your LLM with the Fastest KV Cache Layer
- mini-sglang (Python, forked from sgl-project/mini-sglang)
  A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.
- original_performance_takehome (Python, forked from anthropics/original_performance_takehome)
  Anthropic's original performance take-home, now open for you to try!