minimal transformer inference in c. ~400 lines total.
make
./download.sh
./tinyllm models/stories15M.bin "Once upon a time"

usage: ./tinyllm <model.bin> [prompt] [-t temp] [-p topp] [-n steps] [-s seed] [-z tokenizer]
runs llama-style models (tinyllamas, llama2.c format). implements:
- rmsnorm
- rotary position embeddings (rope)
- grouped query attention (gqa)
- swiglu ffn
- kv cache
- top-p sampling
- bpe tokenizer
no dependencies except libc and libm.