Releases · eullm/eullm

@primoco

EULLM Engine EuLLM-v0.6.2

Drop-in Ollama replacement with continuous batching and EU AI Act audit trail.

Quick install

# Linux x64 (CPU only)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.2/eullm-linux-x64 -o eullm
chmod +x eullm

# Linux x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000 series)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.2/eullm-linux-x64-cuda-12.8 -o eullm
chmod +x eullm

# macOS Apple Silicon
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.2/eullm-macos-arm64 -o eullm
chmod +x eullm

# Windows x64 (CPU only) — PowerShell
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.2/eullm-windows-x64.exe -OutFile eullm.exe

# Windows x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000)
# This is a ZIP: eullm.exe + the CUDA runtime DLLs it needs.
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.2/eullm-windows-x64-cuda-12.8.zip -OutFile eullm-cuda.zip
Expand-Archive eullm-cuda.zip -DestinationPath eullm-cuda
# Run eullm.exe from inside the extracted folder (the DLLs must
# stay next to it).
.\eullm-cuda\eullm.exe run model.gguf

Which binary to download?

Binary	GPU Support	Requirements
`eullm-linux-x64`	CPU only	None
`eullm-linux-x64-cuda-12.8`	NVIDIA GPU (RTX 3000/4000/5000)	NVIDIA driver 570+
`eullm-linux-arm64`	CPU only	ARM64 Linux
`eullm-macos-x64`	CPU only	macOS Intel
`eullm-macos-arm64`	Metal (Apple GPU)	macOS Apple Silicon
`eullm-windows-x64.exe`	CPU only	Windows 10/11 x64
`eullm-windows-x64-cuda-12.8.zip`	NVIDIA GPU (RTX 3000/4000/5000)	NVIDIA driver 570+ for Windows. ZIP bundles eullm.exe + CUDA DLLs — run from the extracted folder

Pick a model interactively

# Run with no arguments to open the model picker (local models + catalog)
./eullm

Web browsing

# Enable transparent web fetch (URLs in messages are fetched and injected)
./eullm run model.gguf --web

Usage

./eullm run ./model.gguf                    # Run any GGUF model
./eullm run ./model.gguf --batch-size 16    # Continuous batching for RAG
./eullm run ./model.gguf --web              # Enable web browsing

Verify checksums

sha256sum -c checksums.txt

What's Changed

Feat/web chat multimodal by @primoco in #185

Full Changelog: EuLLM-v0.6.1...EuLLM-v0.6.2

@primoco

EULLM Engine EuLLM-v0.6.1

Drop-in Ollama replacement with continuous batching and EU AI Act audit trail.

Quick install

# Linux x64 (CPU only)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1/eullm-linux-x64 -o eullm
chmod +x eullm

# Linux x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000 series)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1/eullm-linux-x64-cuda-12.8 -o eullm
chmod +x eullm

# macOS Apple Silicon
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1/eullm-macos-arm64 -o eullm
chmod +x eullm

# Windows x64 (CPU only) — PowerShell
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1/eullm-windows-x64.exe -OutFile eullm.exe

# Windows x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000)
# This is a ZIP: eullm.exe + the CUDA runtime DLLs it needs.
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1/eullm-windows-x64-cuda-12.8.zip -OutFile eullm-cuda.zip
Expand-Archive eullm-cuda.zip -DestinationPath eullm-cuda
# Run eullm.exe from inside the extracted folder (the DLLs must
# stay next to it).
.\eullm-cuda\eullm.exe run model.gguf

Which binary to download?

Binary	GPU Support	Requirements
`eullm-linux-x64`	CPU only	None
`eullm-linux-x64-cuda-12.8`	NVIDIA GPU (RTX 3000/4000/5000)	NVIDIA driver 570+
`eullm-linux-arm64`	CPU only	ARM64 Linux
`eullm-macos-x64`	CPU only	macOS Intel
`eullm-macos-arm64`	Metal (Apple GPU)	macOS Apple Silicon
`eullm-windows-x64.exe`	CPU only	Windows 10/11 x64
`eullm-windows-x64-cuda-12.8.zip`	NVIDIA GPU (RTX 3000/4000/5000)	NVIDIA driver 570+ for Windows. ZIP bundles eullm.exe + CUDA DLLs — run from the extracted folder

Pick a model interactively

# Run with no arguments to open the model picker (local models + catalog)
./eullm

Web browsing

# Enable transparent web fetch (URLs in messages are fetched and injected)
./eullm run model.gguf --web

Usage

./eullm run ./model.gguf                    # Run any GGUF model
./eullm run ./model.gguf --batch-size 16    # Continuous batching for RAG
./eullm run ./model.gguf --web              # Enable web browsing

Verify checksums

sha256sum -c checksums.txt

What's Changed

chore(engine): 0.6.1 — multimodal vision fix (BOS) promoted to stable by @primoco in #184

Full Changelog: EuLLM-v0.6.1-beta.4...EuLLM-v0.6.1

@primoco

EULLM Engine EuLLM-v0.6.1-beta.4

Drop-in Ollama replacement with continuous batching and EU AI Act audit trail.

Quick install

# Linux x64 (CPU only)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.4/eullm-linux-x64 -o eullm
chmod +x eullm

# Linux x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000 series)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.4/eullm-linux-x64-cuda-12.8 -o eullm
chmod +x eullm

# macOS Apple Silicon
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.4/eullm-macos-arm64 -o eullm
chmod +x eullm

# Windows x64 (CPU only) — PowerShell
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.4/eullm-windows-x64.exe -OutFile eullm.exe

# Windows x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000)
# This is a ZIP: eullm.exe + the CUDA runtime DLLs it needs.
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.4/eullm-windows-x64-cuda-12.8.zip -OutFile eullm-cuda.zip
Expand-Archive eullm-cuda.zip -DestinationPath eullm-cuda
# Run eullm.exe from inside the extracted folder (the DLLs must
# stay next to it).
.\eullm-cuda\eullm.exe run model.gguf

Which binary to download?

Binary	GPU Support	Requirements
`eullm-linux-x64`	CPU only	None
`eullm-linux-x64-cuda-12.8`	NVIDIA GPU (RTX 3000/4000/5000)	NVIDIA driver 570+
`eullm-linux-arm64`	CPU only	ARM64 Linux
`eullm-macos-x64`	CPU only	macOS Intel
`eullm-macos-arm64`	Metal (Apple GPU)	macOS Apple Silicon
`eullm-windows-x64.exe`	CPU only	Windows 10/11 x64
`eullm-windows-x64-cuda-12.8.zip`	NVIDIA GPU (RTX 3000/4000/5000)	NVIDIA driver 570+ for Windows. ZIP bundles eullm.exe + CUDA DLLs — run from the extracted folder

Pick a model interactively

# Run with no arguments to open the model picker (local models + catalog)
./eullm

Web browsing

# Enable transparent web fetch (URLs in messages are fetched and injected)
./eullm run model.gguf --web

Usage

./eullm run ./model.gguf                    # Run any GGUF model
./eullm run ./model.gguf --batch-size 16    # Continuous batching for RAG
./eullm run ./model.gguf --web              # Enable web browsing

Verify checksums

sha256sum -c checksums.txt

What's Changed

Feat/web chat multimodal by @primoco in #183

Full Changelog: EuLLM-v0.6.1-beta.3...EuLLM-v0.6.1-beta.4

@primoco

EULLM Engine EuLLM-v0.6.1-beta.3

Drop-in Ollama replacement with continuous batching and EU AI Act audit trail.

Quick install

# Linux x64 (CPU only)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.3/eullm-linux-x64 -o eullm
chmod +x eullm

# Linux x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000 series)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.3/eullm-linux-x64-cuda-12.8 -o eullm
chmod +x eullm

# macOS Apple Silicon
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.3/eullm-macos-arm64 -o eullm
chmod +x eullm

# Windows x64 (CPU only) — PowerShell
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.3/eullm-windows-x64.exe -OutFile eullm.exe

# Windows x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000)
# This is a ZIP: eullm.exe + the CUDA runtime DLLs it needs.
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.3/eullm-windows-x64-cuda-12.8.zip -OutFile eullm-cuda.zip
Expand-Archive eullm-cuda.zip -DestinationPath eullm-cuda
# Run eullm.exe from inside the extracted folder (the DLLs must
# stay next to it).
.\eullm-cuda\eullm.exe run model.gguf

Which binary to download?

Binary	GPU Support	Requirements
`eullm-linux-x64`	CPU only	None
`eullm-linux-x64-cuda-12.8`	NVIDIA GPU (RTX 3000/4000/5000)	NVIDIA driver 570+
`eullm-linux-arm64`	CPU only	ARM64 Linux
`eullm-macos-x64`	CPU only	macOS Intel
`eullm-macos-arm64`	Metal (Apple GPU)	macOS Apple Silicon
`eullm-windows-x64.exe`	CPU only	Windows 10/11 x64
`eullm-windows-x64-cuda-12.8.zip`	NVIDIA GPU (RTX 3000/4000/5000)	NVIDIA driver 570+ for Windows. ZIP bundles eullm.exe + CUDA DLLs — run from the extracted folder

Pick a model interactively

# Run with no arguments to open the model picker (local models + catalog)
./eullm

Web browsing

# Enable transparent web fetch (URLs in messages are fetched and injected)
./eullm run model.gguf --web

Usage

./eullm run ./model.gguf                    # Run any GGUF model
./eullm run ./model.gguf --batch-size 16    # Continuous batching for RAG
./eullm run ./model.gguf --web              # Enable web browsing

Verify checksums

sha256sum -c checksums.txt

What's Changed

Feat/web chat multimodal by @primoco in #182

Full Changelog: EuLLM-v0.6.1-beta.2...EuLLM-v0.6.1-beta.3

@primoco

EULLM Engine EuLLM-v0.6.1-beta.2

Drop-in Ollama replacement with continuous batching and EU AI Act audit trail.

Quick install

# Linux x64 (CPU only)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.2/eullm-linux-x64 -o eullm
chmod +x eullm

# Linux x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000 series)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.2/eullm-linux-x64-cuda-12.8 -o eullm
chmod +x eullm

# macOS Apple Silicon
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.2/eullm-macos-arm64 -o eullm
chmod +x eullm

# Windows x64 (CPU only) — PowerShell
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.2/eullm-windows-x64.exe -OutFile eullm.exe

# Windows x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000)
# This is a ZIP: eullm.exe + the CUDA runtime DLLs it needs.
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.2/eullm-windows-x64-cuda-12.8.zip -OutFile eullm-cuda.zip
Expand-Archive eullm-cuda.zip -DestinationPath eullm-cuda
# Run eullm.exe from inside the extracted folder (the DLLs must
# stay next to it).
.\eullm-cuda\eullm.exe run model.gguf

Which binary to download?

Binary	GPU Support	Requirements
`eullm-linux-x64`	CPU only	None
`eullm-linux-x64-cuda-12.8`	NVIDIA GPU (RTX 3000/4000/5000)	NVIDIA driver 570+
`eullm-linux-arm64`	CPU only	ARM64 Linux
`eullm-macos-x64`	CPU only	macOS Intel
`eullm-macos-arm64`	Metal (Apple GPU)	macOS Apple Silicon
`eullm-windows-x64.exe`	CPU only	Windows 10/11 x64
`eullm-windows-x64-cuda-12.8.zip`	NVIDIA GPU (RTX 3000/4000/5000)	NVIDIA driver 570+ for Windows. ZIP bundles eullm.exe + CUDA DLLs — run from the extracted folder

Pick a model interactively

# Run with no arguments to open the model picker (local models + catalog)
./eullm

Web browsing

# Enable transparent web fetch (URLs in messages are fetched and injected)
./eullm run model.gguf --web

Usage

./eullm run ./model.gguf                    # Run any GGUF model
./eullm run ./model.gguf --batch-size 16    # Continuous batching for RAG
./eullm run ./model.gguf --web              # Enable web browsing

Verify checksums

sha256sum -c checksums.txt

What's Changed

Feat/web chat multimodal by @primoco in #181

Full Changelog: EuLLM-v0.6.1-beta.1...EuLLM-v0.6.1-beta.2

@primoco

EULLM Engine EuLLM-v0.6.1-beta.1

Drop-in Ollama replacement with continuous batching and EU AI Act audit trail.

Quick install

# Linux x64 (CPU only)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.1/eullm-linux-x64 -o eullm
chmod +x eullm

# Linux x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000 series)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.1/eullm-linux-x64-cuda-12.8 -o eullm
chmod +x eullm

# macOS Apple Silicon
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.1/eullm-macos-arm64 -o eullm
chmod +x eullm

# Windows x64 (CPU only) — PowerShell
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.1/eullm-windows-x64.exe -OutFile eullm.exe

# Windows x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000)
# This is a ZIP: eullm.exe + the CUDA runtime DLLs it needs.
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.1/eullm-windows-x64-cuda-12.8.zip -OutFile eullm-cuda.zip
Expand-Archive eullm-cuda.zip -DestinationPath eullm-cuda
# Run eullm.exe from inside the extracted folder (the DLLs must
# stay next to it).
.\eullm-cuda\eullm.exe run model.gguf

Which binary to download?

Binary	GPU Support	Requirements
`eullm-linux-x64`	CPU only	None
`eullm-linux-x64-cuda-12.8`	NVIDIA GPU (RTX 3000/4000/5000)	NVIDIA driver 570+
`eullm-linux-arm64`	CPU only	ARM64 Linux
`eullm-macos-x64`	CPU only	macOS Intel
`eullm-macos-arm64`	Metal (Apple GPU)	macOS Apple Silicon
`eullm-windows-x64.exe`	CPU only	Windows 10/11 x64
`eullm-windows-x64-cuda-12.8.zip`	NVIDIA GPU (RTX 3000/4000/5000)	NVIDIA driver 570+ for Windows. ZIP bundles eullm.exe + CUDA DLLs — run from the extracted folder

Pick a model interactively

# Run with no arguments to open the model picker (local models + catalog)
./eullm

Web browsing

# Enable transparent web fetch (URLs in messages are fetched and injected)
./eullm run model.gguf --web

Usage

./eullm run ./model.gguf                    # Run any GGUF model
./eullm run ./model.gguf --batch-size 16    # Continuous batching for RAG
./eullm run ./model.gguf --web              # Enable web browsing

Verify checksums

sha256sum -c checksums.txt

What's Changed

Feat/web chat multimodal by @primoco in #179
Feat/web chat multimodal by @primoco in #180

Full Changelog: EuLLM-v0.6.0...EuLLM-v0.6.1-beta.1

@primoco

EULLM Engine EuLLM-v0.6.0

Drop-in Ollama replacement with continuous batching and EU AI Act audit trail.

Quick install

# Linux x64 (CPU only)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.0/eullm-linux-x64 -o eullm
chmod +x eullm

# Linux x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000 series)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.0/eullm-linux-x64-cuda-12.8 -o eullm
chmod +x eullm

# macOS Apple Silicon
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.0/eullm-macos-arm64 -o eullm
chmod +x eullm

# Windows x64 (CPU only) — PowerShell
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.0/eullm-windows-x64.exe -OutFile eullm.exe

# Windows x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000)
# This is a ZIP: eullm.exe + the CUDA runtime DLLs it needs.
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.0/eullm-windows-x64-cuda-12.8.zip -OutFile eullm-cuda.zip
Expand-Archive eullm-cuda.zip -DestinationPath eullm-cuda
# Run eullm.exe from inside the extracted folder (the DLLs must
# stay next to it).
.\eullm-cuda\eullm.exe run model.gguf

Which binary to download?

Binary	GPU Support	Requirements
`eullm-linux-x64`	CPU only	None
`eullm-linux-x64-cuda-12.8`	NVIDIA GPU (RTX 3000/4000/5000)	NVIDIA driver 570+
`eullm-linux-arm64`	CPU only	ARM64 Linux
`eullm-macos-x64`	CPU only	macOS Intel
`eullm-macos-arm64`	Metal (Apple GPU)	macOS Apple Silicon
`eullm-windows-x64.exe`	CPU only	Windows 10/11 x64
`eullm-windows-x64-cuda-12.8.zip`	NVIDIA GPU (RTX 3000/4000/5000)	NVIDIA driver 570+ for Windows. ZIP bundles eullm.exe + CUDA DLLs — run from the extracted folder

Pick a model interactively

# Run with no arguments to open the model picker (local models + catalog)
./eullm

Web browsing

# Enable transparent web fetch (URLs in messages are fetched and injected)
./eullm run model.gguf --web

Usage

./eullm run ./model.gguf                    # Run any GGUF model
./eullm run ./model.gguf --batch-size 16    # Continuous batching for RAG
./eullm run ./model.gguf --web              # Enable web browsing

Verify checksums

sha256sum -c checksums.txt

What's Changed

Feat/web chat multimodal by @primoco in #178

Full Changelog: EuLLM-v0.6.0-beta.8...EuLLM-v0.6.0

@primoco

EULLM Engine EuLLM-v0.5.20

Drop-in Ollama replacement with continuous batching and EU AI Act audit trail.

Quick install

# Linux x64 (CPU only)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.5.20/eullm-linux-x64 -o eullm
chmod +x eullm

# Linux x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000 series)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.5.20/eullm-linux-x64-cuda-12.8 -o eullm
chmod +x eullm

# macOS Apple Silicon
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.5.20/eullm-macos-arm64 -o eullm
chmod +x eullm

# Windows x64 (CPU only) — PowerShell
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.5.20/eullm-windows-x64.exe -OutFile eullm.exe

# Windows x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000)
# This is a ZIP: eullm.exe + the CUDA runtime DLLs it needs.
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.5.20/eullm-windows-x64-cuda-12.8.zip -OutFile eullm-cuda.zip
Expand-Archive eullm-cuda.zip -DestinationPath eullm-cuda
# Run eullm.exe from inside the extracted folder (the DLLs must
# stay next to it).
.\eullm-cuda\eullm.exe run model.gguf

Which binary to download?

Binary	GPU Support	Requirements
`eullm-linux-x64`	CPU only	None
`eullm-linux-x64-cuda-12.8`	NVIDIA GPU (RTX 3000/4000/5000)	NVIDIA driver 570+
`eullm-linux-arm64`	CPU only	ARM64 Linux
`eullm-macos-x64`	CPU only	macOS Intel
`eullm-macos-arm64`	Metal (Apple GPU)	macOS Apple Silicon
`eullm-windows-x64.exe`	CPU only	Windows 10/11 x64
`eullm-windows-x64-cuda-12.8.zip`	NVIDIA GPU (RTX 3000/4000/5000)	NVIDIA driver 570+ for Windows. ZIP bundles eullm.exe + CUDA DLLs — run from the extracted folder

Pick a model interactively

# Run with no arguments to open the model picker (local models + catalog)
./eullm

Web browsing

# Enable transparent web fetch (URLs in messages are fetched and injected)
./eullm run model.gguf --web

Usage

./eullm run ./model.gguf                    # Run any GGUF model
./eullm run ./model.gguf --batch-size 16    # Continuous batching for RAG
./eullm run ./model.gguf --web              # Enable web browsing

Verify checksums

sha256sum -c checksums.txt

What's Changed

Feat/UI math rendering by @primoco in #174

Full Changelog: EuLLM-v0.6.0-beta.6...EuLLM-v0.5.20

@primoco

EULLM Engine EuLLM-v0.6.0-beta.8

Drop-in Ollama replacement with continuous batching and EU AI Act audit trail.

Quick install

# Linux x64 (CPU only)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.0-beta.8/eullm-linux-x64 -o eullm
chmod +x eullm

# Linux x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000 series)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.0-beta.8/eullm-linux-x64-cuda-12.8 -o eullm
chmod +x eullm

# macOS Apple Silicon
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.0-beta.8/eullm-macos-arm64 -o eullm
chmod +x eullm

# Windows x64 (CPU only) — PowerShell
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.0-beta.8/eullm-windows-x64.exe -OutFile eullm.exe

# Windows x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000)
# This is a ZIP: eullm.exe + the CUDA runtime DLLs it needs.
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.0-beta.8/eullm-windows-x64-cuda-12.8.zip -OutFile eullm-cuda.zip
Expand-Archive eullm-cuda.zip -DestinationPath eullm-cuda
# Run eullm.exe from inside the extracted folder (the DLLs must
# stay next to it).
.\eullm-cuda\eullm.exe run model.gguf

Which binary to download?

Binary	GPU Support	Requirements
`eullm-linux-x64`	CPU only	None
`eullm-linux-x64-cuda-12.8`	NVIDIA GPU (RTX 3000/4000/5000)	NVIDIA driver 570+
`eullm-linux-arm64`	CPU only	ARM64 Linux
`eullm-macos-x64`	CPU only	macOS Intel
`eullm-macos-arm64`	Metal (Apple GPU)	macOS Apple Silicon
`eullm-windows-x64.exe`	CPU only	Windows 10/11 x64
`eullm-windows-x64-cuda-12.8.zip`	NVIDIA GPU (RTX 3000/4000/5000)	NVIDIA driver 570+ for Windows. ZIP bundles eullm.exe + CUDA DLLs — run from the extracted folder

Pick a model interactively

# Run with no arguments to open the model picker (local models + catalog)
./eullm

Web browsing

# Enable transparent web fetch (URLs in messages are fetched and injected)
./eullm run model.gguf --web

Usage

./eullm run ./model.gguf                    # Run any GGUF model
./eullm run ./model.gguf --batch-size 16    # Continuous batching for RAG
./eullm run ./model.gguf --web              # Enable web browsing

Verify checksums

sha256sum -c checksums.txt

What's Changed

Feat/web chat multimodal by @primoco in #177

Full Changelog: EuLLM-v0.6.0-beta.7...EuLLM-v0.6.0-beta.8

@primoco

EULLM Engine EuLLM-v0.6.0-beta.7

Drop-in Ollama replacement with continuous batching and EU AI Act audit trail.

Quick install

# Linux x64 (CPU only)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.0-beta.7/eullm-linux-x64 -o eullm
chmod +x eullm

# Linux x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000 series)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.0-beta.7/eullm-linux-x64-cuda-12.8 -o eullm
chmod +x eullm

# macOS Apple Silicon
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.0-beta.7/eullm-macos-arm64 -o eullm
chmod +x eullm

# Windows x64 (CPU only) — PowerShell
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.0-beta.7/eullm-windows-x64.exe -OutFile eullm.exe

# Windows x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000)
# This is a ZIP: eullm.exe + the CUDA runtime DLLs it needs.
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.0-beta.7/eullm-windows-x64-cuda-12.8.zip -OutFile eullm-cuda.zip
Expand-Archive eullm-cuda.zip -DestinationPath eullm-cuda
# Run eullm.exe from inside the extracted folder (the DLLs must
# stay next to it).
.\eullm-cuda\eullm.exe run model.gguf

Which binary to download?

Binary	GPU Support	Requirements
`eullm-linux-x64`	CPU only	None
`eullm-linux-x64-cuda-12.8`	NVIDIA GPU (RTX 3000/4000/5000)	NVIDIA driver 570+
`eullm-linux-arm64`	CPU only	ARM64 Linux
`eullm-macos-x64`	CPU only	macOS Intel
`eullm-macos-arm64`	Metal (Apple GPU)	macOS Apple Silicon
`eullm-windows-x64.exe`	CPU only	Windows 10/11 x64
`eullm-windows-x64-cuda-12.8.zip`	NVIDIA GPU (RTX 3000/4000/5000)	NVIDIA driver 570+ for Windows. ZIP bundles eullm.exe + CUDA DLLs — run from the extracted folder

Pick a model interactively

# Run with no arguments to open the model picker (local models + catalog)
./eullm

Web browsing

# Enable transparent web fetch (URLs in messages are fetched and injected)
./eullm run model.gguf --web

Usage

./eullm run ./model.gguf                    # Run any GGUF model
./eullm run ./model.gguf --batch-size 16    # Continuous batching for RAG
./eullm run ./model.gguf --web              # Enable web browsing

Verify checksums

sha256sum -c checksums.txt

What's Changed

Feat/UI math rendering by @primoco in #175
fix(engine,ci): keep clippy gate off vendored crates; fix CUDA submod… by @primoco in #176

Full Changelog: EuLLM-v0.5.20...EuLLM-v0.6.0-beta.7

Releases: eullm/eullm

v0.6.2

EULLM Engine EuLLM-v0.6.2

Quick install

Which binary to download?

Pick a model interactively

Web browsing

Usage

Verify checksums

What's Changed

Contributors

Uh oh!

v0.6.1

EULLM Engine EuLLM-v0.6.1

Quick install

Which binary to download?

Pick a model interactively

Web browsing

Usage

Verify checksums

What's Changed

Contributors

Uh oh!

v0.6.1-beta.4

EULLM Engine EuLLM-v0.6.1-beta.4

Quick install

Which binary to download?

Pick a model interactively

Web browsing

Usage

Verify checksums

What's Changed

Contributors

Uh oh!

v0.6.1-beta.3

EULLM Engine EuLLM-v0.6.1-beta.3

Quick install

Which binary to download?

Pick a model interactively

Web browsing

Usage

Verify checksums

What's Changed

Contributors

Uh oh!

v0.6.1-beta.2

EULLM Engine EuLLM-v0.6.1-beta.2

Quick install

Which binary to download?

Pick a model interactively

Web browsing

Usage

Verify checksums

What's Changed

Contributors

Uh oh!

v0.6.1-beta.1

EULLM Engine EuLLM-v0.6.1-beta.1

Quick install

Which binary to download?

Pick a model interactively

Web browsing

Usage

Verify checksums

What's Changed

Contributors

Uh oh!

v0.6.0

EULLM Engine EuLLM-v0.6.0

Quick install

Which binary to download?

Pick a model interactively

Web browsing

Usage

Verify checksums

What's Changed

Contributors

Uh oh!

v0.5.20

EULLM Engine EuLLM-v0.5.20