GitHub - lukealonso/b12x

b12x is an SM120/SM121 CuTe DSL kernel library for (primarily) NVFP4 LLM inference.

It is intentionally narrow. This is not a generic CUDA kernel collection or a full model-serving stack, and it is not intended to target any other GPU architecture, including SM100. It is a focused package containing a small number of high-performance kernels plus the runtime glue needed to launch them cleanly from SGLang or vLLM.
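Because the library is limited to SM120/SM121, it can be useful to check the GPU before installing or importing it. The following is a minimal sketch, not part of the b12x API: the helper names `is_supported_capability` and `current_device_supported` are hypothetical, and it assumes PyTorch is the available way to query the device. SM120 and SM121 correspond to CUDA compute capabilities 12.0 and 12.1.

```python
# Hypothetical pre-flight check; these helpers are NOT provided by b12x.
# SM120 -> compute capability (12, 0), SM121 -> compute capability (12, 1).
SUPPORTED_CAPABILITIES = {(12, 0), (12, 1)}


def is_supported_capability(major: int, minor: int) -> bool:
    """Return True if (major, minor) is one of the architectures b12x targets."""
    return (major, minor) in SUPPORTED_CAPABILITIES


def current_device_supported() -> bool:
    """Check the current CUDA device, assuming PyTorch is installed.

    Returns False if PyTorch is missing or no CUDA device is visible.
    """
    try:
        import torch
    except ImportError:
        return False
    if not torch.cuda.is_available():
        return False
    # torch.cuda.get_device_capability() returns a (major, minor) tuple.
    return is_supported_capability(*torch.cuda.get_device_capability())


if __name__ == "__main__":
    print("b12x-compatible GPU detected:", current_device_supported())
```

Keeping the capability test separate from the device query lets the check be reused with values obtained some other way, for example from `nvidia-smi` output.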

Currently supported kernels:

- NVFP4 fused MoE GEMM
- NVFP4 dense GEMM
- BF16/FP8 paged attention
- Sparse MLA attention (for DSA/NSA only)

pip install b12x

Ask your friendly neighborhood AI agent for further information on how to use this library.

Repository contents (713 commits):

b12x
benchmarks
docs
scripts
tests
.codex
.gitignore
MANIFEST.in
README.md
_rs_ncu_dr_256_1024.ncu-rep
_rs_ncu_mg_256_1024.ncu-rep
ml-primitives-glossary.md
num_verify.py
poison_align16.py
poison_nsweep.py
pyproject.toml
replay_dump.py
replay_scratch.py
repro_n384.py
test_mla_compile.py
verify_moe_split.py
