LLM Infrastructure Engineer @ ByteDance Volcano Engine. Previously interned at Huawei. M.S. in Computer Science from Northwestern Polytechnical University.
Working on LLM infrastructure: inference, training, INT4 quantization, and system-level optimization for large-scale models.
Contributor to:
- sgl-project/sglang — Fast serving framework for LLMs and VLMs.
- ByteDance-Seed/VeOmni — Omni-modal training framework.
Python · CUDA · PyTorch · Triton
wangyuzhan@bytedance.com · wyz_yy@mail.nwpu.edu.cn · 1812107659@qq.com


