quant.cpp WASM

Run an LLM in your browser

No install. No API key. No server.

SmolLM2 135M (Fast)
~135 MB · Fast download

Llama 3.2 1B Instruct (Quality)
~770 MB · Llama family

or load your own GGUF · runs 100% client-side