hlin99

Follow

🎯

Focusing

Tony Lin hlin99

🎯

Focusing

Follow

1 follower · 2 following

Intel
Shanghai
23:24 (UTC -12:00)

Achievements

Achievements

Organizations

Pinned Loading

xPyD-hub/xPyD-proxy xPyD-hub/xPyD-proxy Public

PD Proxy Server

Python 2
xPyD-hub/xPyD-sim xPyD-hub/xPyD-sim Public

OpenAI-compatible LLM inference simulator for xPyD — dummy prefill/decode nodes with realistic behavior

Python 1
xPyD-hub/xPyD-bench xPyD-hub/xPyD-bench Public

Benchmarking & PD ratio planning tool for xPyD proxy

Python 1
LMCache LMCache Public

Forked from LMCache/LMCache

Supercharge Your LLM with the Fastest KV Cache Layer

Python 1
vllm vllm Public

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python
xPyD-hub/xPyD-acc xPyD-hub/xPyD-acc Public

PD disaggregation accuracy diagnostic tool — pinpoint whether accuracy issues come from Prefill, KV transfer, or Decode

Python 1