PatchouliTIS

💭

I may be slow to respond.

PatchyTIS PatchouliTIS

Nobody but us.

4 followers · 9 following

@Tencent
KamiTsubaki
@pyra_m

Pinned

vllm (Public)

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 1

vllm-project/speculators (Public)

A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM

Python · 516 stars · 104 forks

INT8BlockwiseGEMM (Public)

Two-step online INT8 quantization method for Ampere.

CUDA

DotTopkFusedKernel (Public)

C++

BlockSparseVLLM (Public)

Python

MultiModal-Eagle3 (Public)

Python