EULLM Engine EuLLM-v0.6.2
Drop-in Ollama replacement with continuous batching and EU AI Act audit trail.
Quick install
# Linux x64 (CPU only)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.2/eullm-linux-x64 -o eullm
chmod +x eullm
# Linux x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000 series)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.2/eullm-linux-x64-cuda-12.8 -o eullm
chmod +x eullm
# macOS Apple Silicon
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.2/eullm-macos-arm64 -o eullm
chmod +x eullm
# Windows x64 (CPU only), PowerShell
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.2/eullm-windows-x64.exe -OutFile eullm.exe
# Windows x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000)
# This is a ZIP: eullm.exe + the CUDA runtime DLLs it needs.
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.2/eullm-windows-x64-cuda-12.8.zip -OutFile eullm-cuda.zip
Expand-Archive eullm-cuda.zip -DestinationPath eullm-cuda
# Run eullm.exe from inside the extracted folder (the DLLs must
# stay next to it).
.\eullm-cuda\eullm.exe run model.gguf
Which binary to download?
| Binary | GPU Support | Requirements |
|---|---|---|
| eullm-linux-x64 | CPU only | None |
| eullm-linux-x64-cuda-12.8 | NVIDIA GPU (RTX 3000/4000/5000) | NVIDIA driver 570+ |
| eullm-linux-arm64 | CPU only | ARM64 Linux |
| eullm-macos-x64 | CPU only | macOS Intel |
| eullm-macos-arm64 | Metal (Apple GPU) | macOS Apple Silicon |
| eullm-windows-x64.exe | CPU only | Windows 10/11 x64 |
| eullm-windows-x64-cuda-12.8.zip | NVIDIA GPU (RTX 3000/4000/5000) | NVIDIA driver 570+ for Windows. ZIP bundles eullm.exe + CUDA DLLs; run from the extracted folder |
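The choice of Linux or macOS binary can be scripted. The following is a minimal sketch, not part of eullm: `driver_ok` and `eullm_asset_name` are hypothetical helper names, and the mapping simply mirrors the table. It checks only the NVIDIA driver version (570+), not the GPU generation, so an older card with a recent driver would still need the CPU build.

```shell
#!/bin/sh
# Hypothetical helpers (not shipped with eullm): map a platform to an asset name.

# driver_ok VERSION: succeed when the driver's major version is at least 570,
# the minimum the table lists for the CUDA 12.8 builds.
driver_ok() {
    major=${1%%.*}                      # "570.86.10" -> "570"
    case $major in
        ''|*[!0-9]*) return 1 ;;        # empty or non-numeric: no usable driver
    esac
    [ "$major" -ge 570 ]
}

# eullm_asset_name OS ARCH [DRIVER_VERSION]: print the matching asset name.
# OS and ARCH are the strings printed by `uname -s` and `uname -m`.
eullm_asset_name() {
    os=$1
    arch=$2
    driver=${3:-}
    case "$os/$arch" in
        Linux/x86_64)
            if driver_ok "$driver"; then
                echo eullm-linux-x64-cuda-12.8
            else
                echo eullm-linux-x64
            fi ;;
        Linux/aarch64|Linux/arm64) echo eullm-linux-arm64 ;;
        Darwin/x86_64)             echo eullm-macos-x64 ;;
        Darwin/arm64)              echo eullm-macos-arm64 ;;
        *) echo "no asset for $os/$arch" >&2; return 1 ;;
    esac
}

# Example call (prints an asset name for the current machine):
#   eullm_asset_name "$(uname -s)" "$(uname -m)" \
#     "$(nvidia-smi --query-gpu=driver_version --format=csv,noheader 2>/dev/null | head -n 1)"
```

When `nvidia-smi` is missing, the third argument is empty and the sketch falls back to the CPU-only name.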
Pick a model interactively
# Run with no arguments to open the model picker (local models + catalog)
./eullm
Web browsing
# Enable transparent web fetch (URLs in messages are fetched and injected)
./eullm run model.gguf --web
Usage
./eullm run ./model.gguf # Run any GGUF model
./eullm run ./model.gguf --batch-size 16 # Continuous batching for RAG
./eullm run ./model.gguf --web # Enable web browsing
Verify checksums
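If checksums.txt lists every release asset, `sha256sum -c checksums.txt` reports each asset that was not downloaded as a failure. A minimal sketch for checking a single download follows. It assumes checksums.txt uses the standard `<sha256>  <filename>` line format and that filenames contain no spaces; `verify_one` is a hypothetical helper, not an eullm command.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with eullm): verify one downloaded file
# against a checksums file in the usual "<sha256>  <filename>" format.
verify_one() {
    file=$1
    sums=${2:-checksums.txt}
    name=$(basename "$file")
    # Keep only lines whose filename field matches exactly
    # (sha256sum prefixes the name with "*" in binary mode).
    line=$(awk -v n="$name" '$2 == n || $2 == ("*" n)' "$sums")
    if [ -z "$line" ]; then
        echo "$name: not listed in $sums" >&2
        return 1
    fi
    # Run the check from the file's directory so the bare filename resolves.
    (cd "$(dirname "$file")" && printf '%s\n' "$line" | sha256sum -c -)
}

# Example call: verify_one ./eullm checksums.txt
```

With GNU coreutils, `sha256sum --ignore-missing -c checksums.txt` has a similar effect without a helper. On macOS, where `sha256sum` may be absent, `shasum -a 256 -c` reads the same file format.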
sha256sum -c checksums.txt
What's Changed
Full Changelog: EuLLM-v0.6.1...EuLLM-v0.6.2