▶️ Running Locally

Remote inference backend for the Llamatik ecosystem.

Ktor · Kotlin/JVM · llama.cpp-compatible API · Drop-in remote inference

✨ What is Llamatik Server?

Llamatik Server is a lightweight HTTP backend that exposes the same API as the Llamatik Kotlin library, enabling seamless remote inference.

It allows you to:

🧠 Run LLM inference remotely
🌐 Switch from on-device to server inference with no API changes
🚀 Deploy scalable inference backends
🔁 Build hybrid offline-first applications with online fallback

If you're using Llamatik in your app, this server acts as a drop-in remote backend.

🧱 Architecture

Your App
│
▼
LlamaBridge (shared Kotlin API)
│
├─ llamatik-core     → On-device inference (llama.cpp, whisper.cpp, SD)
├─ llamatik-client   → Remote HTTP client
└─ llamatik-server   → This backend

Switching between local and remote inference requires no API changes ---
only configuration.

🚀 Features

✅ Implements the same API contract as Llamatik
✅ Compatible with llama.cpp-based inference
✅ Streaming & non-streaming generation
✅ JSON schema-constrained generation
✅ Embeddings support
✅ Production-ready Ktor server
✅ Docker-ready deployment

🛠 Requirements

JVM 21+
Docker (optional, for containerized deployment)

▶️ Running Locally

From the project root:

./gradlew run

The server will start on:

http://localhost:8080

🐳 Running with Docker

Build the image:

docker build -t llamatik .

Run the container:

docker run -p 8080:8080 llamatik

🖥 Running as a Linux Service (systemd)

Create:

/etc/systemd/system/docker.llamatik.service

[Unit]
Description=Llamatik
After=docker.service
Requires=docker.service

[Service]
TimeoutStartSec=0
Restart=always
ExecStartPre=-/usr/bin/docker exec %n stop
ExecStartPre=-/usr/bin/docker rm %n
ExecStart=/usr/bin/docker run -p 8080:8080 llamatik

[Install]
WantedBy=default.target

Enable on boot:

sudo systemctl enable docker.llamatik

Control manually:

sudo service docker.llamatik stop
sudo service docker.llamatik start

🔄 Hybrid Mode (Local + Remote)

Llamatik is designed for offline-first apps.

You can:

Run inference locally (llama.cpp via Kotlin/Native)
Fallback to this server when needed
Switch dynamically based on connectivity

🌍 Production Deployment

For production usage you should:

Add HTTPS (via reverse proxy like Nginx or Caddy)
Use container orchestration (Docker Compose / Kubernetes)
Configure resource limits
Add authentication if exposed publicly

Example architecture:

Internet
   │
Reverse Proxy (TLS)
   │
Llamatik Server (Docker)
   │
llama.cpp runtime

📦 Related Projects

🔗 Llamatik Library -- Kotlin Multiplatform AI SDK
https://github.com/ferranpons/llamatik

🤝 Contributing

Contributions are welcome: - Performance improvements - Deployment enhancements - Documentation updates

Open an issue or PR 🚀

📜 License

This project is licensed under the MIT License.
See LICENSE for details.

Built with ❤️ for the Kotlin & AI community.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.run		.run
docs		docs
gradle		gradle
src		src
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
build.gradle.kts		build.gradle.kts
gradle.properties		gradle.properties
gradlew		gradlew
gradlew.bat		gradlew.bat
settings.gradle.kts		settings.gradle.kts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

✨ What is Llamatik Server?

🧱 Architecture

🚀 Features

🛠 Requirements

▶️ Running Locally

🐳 Running with Docker

🖥 Running as a Linux Service (systemd)

🔄 Hybrid Mode (Local + Remote)

🌍 Production Deployment

📦 Related Projects

🤝 Contributing

📜 License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

✨ What is Llamatik Server?

🧱 Architecture

🚀 Features

🛠 Requirements

▶️ Running Locally

🐳 Running with Docker

🖥 Running as a Linux Service (systemd)

🔄 Hybrid Mode (Local + Remote)

🌍 Production Deployment

📦 Related Projects

🤝 Contributing

📜 License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages