Paste a URL or upload a PDF. Yapit renders the document and reads it aloud.
- Handles the documents other TTS tools can't: academic papers with math, citations, figures, tables, and messy formatting. Equations get spoken descriptions, citations become prose, and page noise is skipped. The original content displays faithfully.
- 170+ voices across 15 languages. Premium voices or free local synthesis that runs entirely in your browser, no account needed.
- Vim-style keyboard shortcuts, document outliner, media key support, adjustable speed, dark mode, share by link.
Powered by Gemini, Kokoro, Inworld TTS, DocLayout-YOLO, and defuddle.
git clone https://github.com/yapit-tts/yapit.git && cd yapit
cp .env.selfhost.example .env.selfhost
make self-host

Open http://localhost and create an account. Data persists across restarts.
.env.selfhost is self-documenting — see the comments for optional features (Gemini extraction, Inworld voices, RunPod overflow).
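To sanity-check the stack after startup, you can poll the frontend until it responds. A minimal sketch, assuming the frontend is served on port 80 as above:

```shell
# Wait up to ~60s for the frontend to start answering on http://localhost
for i in $(seq 1 30); do
  if curl -fsS -o /dev/null http://localhost; then
    echo "yapit is up"
    break
  fi
  sleep 2
done
```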
Multi-worker GPU setup:
Workers are pull-based — any machine with Redis access can run them. Connect from the local network or via Tailscale, for example. GPU and CPU workers run side-by-side; faster workers naturally pull more jobs. Scale by running more containers on any machine that can reach Redis.
Prereq: Docker 25+, nvidia-container-toolkit with CDI enabled, network access to the Redis instance.
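Before starting workers on a new machine, it can help to confirm that it can actually reach Redis. A quick check, not part of the official setup (replace `<host>` with your Redis host, a LAN IP or Tailscale name):

```shell
# With redis-cli installed: expect the reply "PONG"
redis-cli -h <host> -p 6379 ping

# Without redis-cli: a raw TCP reachability check
nc -zv <host> 6379
```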
# One-time GPU setup: generate CDI spec + enable CDI in Docker
sudo nvidia-ctk cdi generate --output=/etc/cdi/nvidia.yaml
# Add {"features": {"cdi": true}} to /etc/docker/daemon.json, then:
sudo systemctl restart docker
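To verify the CDI setup before starting workers (a quick smoke test, not part of the required steps):

```shell
# List the generated CDI devices; should print entries like nvidia.com/gpu=0
nvidia-ctk cdi list

# Confirm Docker can hand a GPU to a container via CDI
docker run --rm --device nvidia.com/gpu=all ubuntu nvidia-smi
```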
git clone --depth 1 https://github.com/yapit-tts/yapit.git && cd yapit
# Pull only the images you need
docker compose -f docker-compose.worker.yml pull kokoro-gpu yolo-gpu
# Start 2 Kokoro + 1 YOLO worker
REDIS_URL=redis://<host>:6379/0 docker compose -f docker-compose.worker.yml up -d \
  --scale kokoro-gpu=2 --scale yolo-gpu=1 kokoro-gpu yolo-gpu

Adjust --scale to your GPU. A 4GB card fits 2 Kokoro + 1 YOLO comfortably.
NVIDIA MPS (recommended for multiple workers per GPU)
MPS lets multiple workers share one GPU context — less VRAM overhead, no context switching. Without MPS, each worker gets its own CUDA context (~300MB each). The compose file mounts the MPS pipe automatically; just start the daemon.
sudo tee /etc/systemd/system/nvidia-mps.service > /dev/null <<'EOF'
[Unit]
Description=NVIDIA Multi-Process Service (MPS)
After=nvidia-persistenced.service
[Service]
Type=forking
ExecStart=/usr/bin/nvidia-cuda-mps-control -d
ExecStop=/bin/sh -c 'echo quit | /usr/bin/nvidia-cuda-mps-control'
Restart=on-failure
[Install]
WantedBy=multi-user.target
EOF
sudo systemctl daemon-reload
sudo systemctl enable --now nvidia-mps

To stop: make self-host-down.
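To confirm the MPS control daemon is running, you can query it directly (a sketch using the standard nvidia-cuda-mps-control tooling):

```shell
# Ask the MPS control daemon for active server PIDs
# (empty output is normal until a CUDA client attaches)
echo get_server_list | sudo nvidia-cuda-mps-control

# Once workers attach through MPS, their processes show type "M+C" here
nvidia-smi
```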
Now:
- Launch
Next:
- Support uploading images, EPUB.
- Support AI-transform for websites.
- Support exporting audio as MP3.
Later:
- Better support for self-hosting (more modular voice and extraction backends, improved documentation)
- Support thinking parameter for Gemini
- Support temperature parameter for Inworld
make dev-cpu # start backend services (Docker Compose)
cd frontend && npm run dev # start frontend
make test-local # run tests

See agent/knowledge/dev-setup.md for full setup instructions.
The agent/knowledge/ directory is the project's in-depth knowledge base, maintained jointly with Claude during development.