Skip to content

Releases: eullm/eullm

v0.6.2

09 Jun 16:26
09ff027

Choose a tag to compare

EULLM Engine EuLLM-v0.6.2

Drop-in Ollama replacement with continuous batching and EU AI Act audit trail.

Quick install

# Linux x64 (CPU only)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.2/eullm-linux-x64 -o eullm
chmod +x eullm

# Linux x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000 series)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.2/eullm-linux-x64-cuda-12.8 -o eullm
chmod +x eullm

# macOS Apple Silicon
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.2/eullm-macos-arm64 -o eullm
chmod +x eullm
# Windows x64 (CPU only) — PowerShell
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.2/eullm-windows-x64.exe -OutFile eullm.exe

# Windows x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000)
# This is a ZIP: eullm.exe + the CUDA runtime DLLs it needs.
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.2/eullm-windows-x64-cuda-12.8.zip -OutFile eullm-cuda.zip
Expand-Archive eullm-cuda.zip -DestinationPath eullm-cuda
# Run eullm.exe from inside the extracted folder (the DLLs must
# stay next to it).
.\eullm-cuda\eullm.exe run model.gguf

Which binary to download?

Binary GPU Support Requirements
eullm-linux-x64 CPU only None
eullm-linux-x64-cuda-12.8 NVIDIA GPU (RTX 3000/4000/5000) NVIDIA driver 570+
eullm-linux-arm64 CPU only ARM64 Linux
eullm-macos-x64 CPU only macOS Intel
eullm-macos-arm64 Metal (Apple GPU) macOS Apple Silicon
eullm-windows-x64.exe CPU only Windows 10/11 x64
eullm-windows-x64-cuda-12.8.zip NVIDIA GPU (RTX 3000/4000/5000) NVIDIA driver 570+ for Windows. ZIP bundles eullm.exe + CUDA DLLs — run from the extracted folder

Pick a model interactively

# Run with no arguments to open the model picker (local models + catalog)
./eullm

Web browsing

# Enable transparent web fetch (URLs in messages are fetched and injected)
./eullm run model.gguf --web

Usage

./eullm run ./model.gguf                    # Run any GGUF model
./eullm run ./model.gguf --batch-size 16    # Continuous batching for RAG
./eullm run ./model.gguf --web              # Enable web browsing

Verify checksums

sha256sum -c checksums.txt

What's Changed

Full Changelog: EuLLM-v0.6.1...EuLLM-v0.6.2

v0.6.1

09 Jun 10:44
738aa87

Choose a tag to compare

EULLM Engine EuLLM-v0.6.1

Drop-in Ollama replacement with continuous batching and EU AI Act audit trail.

Quick install

# Linux x64 (CPU only)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1/eullm-linux-x64 -o eullm
chmod +x eullm

# Linux x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000 series)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1/eullm-linux-x64-cuda-12.8 -o eullm
chmod +x eullm

# macOS Apple Silicon
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1/eullm-macos-arm64 -o eullm
chmod +x eullm
# Windows x64 (CPU only) — PowerShell
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1/eullm-windows-x64.exe -OutFile eullm.exe

# Windows x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000)
# This is a ZIP: eullm.exe + the CUDA runtime DLLs it needs.
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1/eullm-windows-x64-cuda-12.8.zip -OutFile eullm-cuda.zip
Expand-Archive eullm-cuda.zip -DestinationPath eullm-cuda
# Run eullm.exe from inside the extracted folder (the DLLs must
# stay next to it).
.\eullm-cuda\eullm.exe run model.gguf

Which binary to download?

Binary GPU Support Requirements
eullm-linux-x64 CPU only None
eullm-linux-x64-cuda-12.8 NVIDIA GPU (RTX 3000/4000/5000) NVIDIA driver 570+
eullm-linux-arm64 CPU only ARM64 Linux
eullm-macos-x64 CPU only macOS Intel
eullm-macos-arm64 Metal (Apple GPU) macOS Apple Silicon
eullm-windows-x64.exe CPU only Windows 10/11 x64
eullm-windows-x64-cuda-12.8.zip NVIDIA GPU (RTX 3000/4000/5000) NVIDIA driver 570+ for Windows. ZIP bundles eullm.exe + CUDA DLLs — run from the extracted folder

Pick a model interactively

# Run with no arguments to open the model picker (local models + catalog)
./eullm

Web browsing

# Enable transparent web fetch (URLs in messages are fetched and injected)
./eullm run model.gguf --web

Usage

./eullm run ./model.gguf                    # Run any GGUF model
./eullm run ./model.gguf --batch-size 16    # Continuous batching for RAG
./eullm run ./model.gguf --web              # Enable web browsing

Verify checksums

sha256sum -c checksums.txt

What's Changed

  • chore(engine): 0.6.1 — multimodal vision fix (BOS) promoted to stable by @primoco in #184

Full Changelog: EuLLM-v0.6.1-beta.4...EuLLM-v0.6.1

v0.6.1-beta.4

09 Jun 08:55
dc15e5a

Choose a tag to compare

v0.6.1-beta.4 Pre-release
Pre-release

EULLM Engine EuLLM-v0.6.1-beta.4

Drop-in Ollama replacement with continuous batching and EU AI Act audit trail.

Quick install

# Linux x64 (CPU only)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.4/eullm-linux-x64 -o eullm
chmod +x eullm

# Linux x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000 series)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.4/eullm-linux-x64-cuda-12.8 -o eullm
chmod +x eullm

# macOS Apple Silicon
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.4/eullm-macos-arm64 -o eullm
chmod +x eullm
# Windows x64 (CPU only) — PowerShell
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.4/eullm-windows-x64.exe -OutFile eullm.exe

# Windows x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000)
# This is a ZIP: eullm.exe + the CUDA runtime DLLs it needs.
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.4/eullm-windows-x64-cuda-12.8.zip -OutFile eullm-cuda.zip
Expand-Archive eullm-cuda.zip -DestinationPath eullm-cuda
# Run eullm.exe from inside the extracted folder (the DLLs must
# stay next to it).
.\eullm-cuda\eullm.exe run model.gguf

Which binary to download?

Binary GPU Support Requirements
eullm-linux-x64 CPU only None
eullm-linux-x64-cuda-12.8 NVIDIA GPU (RTX 3000/4000/5000) NVIDIA driver 570+
eullm-linux-arm64 CPU only ARM64 Linux
eullm-macos-x64 CPU only macOS Intel
eullm-macos-arm64 Metal (Apple GPU) macOS Apple Silicon
eullm-windows-x64.exe CPU only Windows 10/11 x64
eullm-windows-x64-cuda-12.8.zip NVIDIA GPU (RTX 3000/4000/5000) NVIDIA driver 570+ for Windows. ZIP bundles eullm.exe + CUDA DLLs — run from the extracted folder

Pick a model interactively

# Run with no arguments to open the model picker (local models + catalog)
./eullm

Web browsing

# Enable transparent web fetch (URLs in messages are fetched and injected)
./eullm run model.gguf --web

Usage

./eullm run ./model.gguf                    # Run any GGUF model
./eullm run ./model.gguf --batch-size 16    # Continuous batching for RAG
./eullm run ./model.gguf --web              # Enable web browsing

Verify checksums

sha256sum -c checksums.txt

What's Changed

Full Changelog: EuLLM-v0.6.1-beta.3...EuLLM-v0.6.1-beta.4

v0.6.1-beta.3

09 Jun 06:39
1c15568

Choose a tag to compare

v0.6.1-beta.3 Pre-release
Pre-release

EULLM Engine EuLLM-v0.6.1-beta.3

Drop-in Ollama replacement with continuous batching and EU AI Act audit trail.

Quick install

# Linux x64 (CPU only)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.3/eullm-linux-x64 -o eullm
chmod +x eullm

# Linux x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000 series)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.3/eullm-linux-x64-cuda-12.8 -o eullm
chmod +x eullm

# macOS Apple Silicon
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.3/eullm-macos-arm64 -o eullm
chmod +x eullm
# Windows x64 (CPU only) — PowerShell
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.3/eullm-windows-x64.exe -OutFile eullm.exe

# Windows x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000)
# This is a ZIP: eullm.exe + the CUDA runtime DLLs it needs.
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.3/eullm-windows-x64-cuda-12.8.zip -OutFile eullm-cuda.zip
Expand-Archive eullm-cuda.zip -DestinationPath eullm-cuda
# Run eullm.exe from inside the extracted folder (the DLLs must
# stay next to it).
.\eullm-cuda\eullm.exe run model.gguf

Which binary to download?

Binary GPU Support Requirements
eullm-linux-x64 CPU only None
eullm-linux-x64-cuda-12.8 NVIDIA GPU (RTX 3000/4000/5000) NVIDIA driver 570+
eullm-linux-arm64 CPU only ARM64 Linux
eullm-macos-x64 CPU only macOS Intel
eullm-macos-arm64 Metal (Apple GPU) macOS Apple Silicon
eullm-windows-x64.exe CPU only Windows 10/11 x64
eullm-windows-x64-cuda-12.8.zip NVIDIA GPU (RTX 3000/4000/5000) NVIDIA driver 570+ for Windows. ZIP bundles eullm.exe + CUDA DLLs — run from the extracted folder

Pick a model interactively

# Run with no arguments to open the model picker (local models + catalog)
./eullm

Web browsing

# Enable transparent web fetch (URLs in messages are fetched and injected)
./eullm run model.gguf --web

Usage

./eullm run ./model.gguf                    # Run any GGUF model
./eullm run ./model.gguf --batch-size 16    # Continuous batching for RAG
./eullm run ./model.gguf --web              # Enable web browsing

Verify checksums

sha256sum -c checksums.txt

What's Changed

Full Changelog: EuLLM-v0.6.1-beta.2...EuLLM-v0.6.1-beta.3

v0.6.1-beta.2

08 Jun 14:18
7574dcc

Choose a tag to compare

v0.6.1-beta.2 Pre-release
Pre-release

EULLM Engine EuLLM-v0.6.1-beta.2

Drop-in Ollama replacement with continuous batching and EU AI Act audit trail.

Quick install

# Linux x64 (CPU only)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.2/eullm-linux-x64 -o eullm
chmod +x eullm

# Linux x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000 series)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.2/eullm-linux-x64-cuda-12.8 -o eullm
chmod +x eullm

# macOS Apple Silicon
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.2/eullm-macos-arm64 -o eullm
chmod +x eullm
# Windows x64 (CPU only) — PowerShell
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.2/eullm-windows-x64.exe -OutFile eullm.exe

# Windows x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000)
# This is a ZIP: eullm.exe + the CUDA runtime DLLs it needs.
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.2/eullm-windows-x64-cuda-12.8.zip -OutFile eullm-cuda.zip
Expand-Archive eullm-cuda.zip -DestinationPath eullm-cuda
# Run eullm.exe from inside the extracted folder (the DLLs must
# stay next to it).
.\eullm-cuda\eullm.exe run model.gguf

Which binary to download?

Binary GPU Support Requirements
eullm-linux-x64 CPU only None
eullm-linux-x64-cuda-12.8 NVIDIA GPU (RTX 3000/4000/5000) NVIDIA driver 570+
eullm-linux-arm64 CPU only ARM64 Linux
eullm-macos-x64 CPU only macOS Intel
eullm-macos-arm64 Metal (Apple GPU) macOS Apple Silicon
eullm-windows-x64.exe CPU only Windows 10/11 x64
eullm-windows-x64-cuda-12.8.zip NVIDIA GPU (RTX 3000/4000/5000) NVIDIA driver 570+ for Windows. ZIP bundles eullm.exe + CUDA DLLs — run from the extracted folder

Pick a model interactively

# Run with no arguments to open the model picker (local models + catalog)
./eullm

Web browsing

# Enable transparent web fetch (URLs in messages are fetched and injected)
./eullm run model.gguf --web

Usage

./eullm run ./model.gguf                    # Run any GGUF model
./eullm run ./model.gguf --batch-size 16    # Continuous batching for RAG
./eullm run ./model.gguf --web              # Enable web browsing

Verify checksums

sha256sum -c checksums.txt

What's Changed

Full Changelog: EuLLM-v0.6.1-beta.1...EuLLM-v0.6.1-beta.2

v0.6.1-beta.1

08 Jun 11:28
677df9d

Choose a tag to compare

v0.6.1-beta.1 Pre-release
Pre-release

EULLM Engine EuLLM-v0.6.1-beta.1

Drop-in Ollama replacement with continuous batching and EU AI Act audit trail.

Quick install

# Linux x64 (CPU only)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.1/eullm-linux-x64 -o eullm
chmod +x eullm

# Linux x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000 series)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.1/eullm-linux-x64-cuda-12.8 -o eullm
chmod +x eullm

# macOS Apple Silicon
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.1/eullm-macos-arm64 -o eullm
chmod +x eullm
# Windows x64 (CPU only) — PowerShell
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.1/eullm-windows-x64.exe -OutFile eullm.exe

# Windows x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000)
# This is a ZIP: eullm.exe + the CUDA runtime DLLs it needs.
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.1-beta.1/eullm-windows-x64-cuda-12.8.zip -OutFile eullm-cuda.zip
Expand-Archive eullm-cuda.zip -DestinationPath eullm-cuda
# Run eullm.exe from inside the extracted folder (the DLLs must
# stay next to it).
.\eullm-cuda\eullm.exe run model.gguf

Which binary to download?

Binary GPU Support Requirements
eullm-linux-x64 CPU only None
eullm-linux-x64-cuda-12.8 NVIDIA GPU (RTX 3000/4000/5000) NVIDIA driver 570+
eullm-linux-arm64 CPU only ARM64 Linux
eullm-macos-x64 CPU only macOS Intel
eullm-macos-arm64 Metal (Apple GPU) macOS Apple Silicon
eullm-windows-x64.exe CPU only Windows 10/11 x64
eullm-windows-x64-cuda-12.8.zip NVIDIA GPU (RTX 3000/4000/5000) NVIDIA driver 570+ for Windows. ZIP bundles eullm.exe + CUDA DLLs — run from the extracted folder

Pick a model interactively

# Run with no arguments to open the model picker (local models + catalog)
./eullm

Web browsing

# Enable transparent web fetch (URLs in messages are fetched and injected)
./eullm run model.gguf --web

Usage

./eullm run ./model.gguf                    # Run any GGUF model
./eullm run ./model.gguf --batch-size 16    # Continuous batching for RAG
./eullm run ./model.gguf --web              # Enable web browsing

Verify checksums

sha256sum -c checksums.txt

What's Changed

Full Changelog: EuLLM-v0.6.0...EuLLM-v0.6.1-beta.1

v0.6.0

07 Jun 11:12
9ad521c

Choose a tag to compare

EULLM Engine EuLLM-v0.6.0

Drop-in Ollama replacement with continuous batching and EU AI Act audit trail.

Quick install

# Linux x64 (CPU only)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.0/eullm-linux-x64 -o eullm
chmod +x eullm

# Linux x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000 series)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.0/eullm-linux-x64-cuda-12.8 -o eullm
chmod +x eullm

# macOS Apple Silicon
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.0/eullm-macos-arm64 -o eullm
chmod +x eullm
# Windows x64 (CPU only) — PowerShell
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.0/eullm-windows-x64.exe -OutFile eullm.exe

# Windows x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000)
# This is a ZIP: eullm.exe + the CUDA runtime DLLs it needs.
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.0/eullm-windows-x64-cuda-12.8.zip -OutFile eullm-cuda.zip
Expand-Archive eullm-cuda.zip -DestinationPath eullm-cuda
# Run eullm.exe from inside the extracted folder (the DLLs must
# stay next to it).
.\eullm-cuda\eullm.exe run model.gguf

Which binary to download?

Binary GPU Support Requirements
eullm-linux-x64 CPU only None
eullm-linux-x64-cuda-12.8 NVIDIA GPU (RTX 3000/4000/5000) NVIDIA driver 570+
eullm-linux-arm64 CPU only ARM64 Linux
eullm-macos-x64 CPU only macOS Intel
eullm-macos-arm64 Metal (Apple GPU) macOS Apple Silicon
eullm-windows-x64.exe CPU only Windows 10/11 x64
eullm-windows-x64-cuda-12.8.zip NVIDIA GPU (RTX 3000/4000/5000) NVIDIA driver 570+ for Windows. ZIP bundles eullm.exe + CUDA DLLs — run from the extracted folder

Pick a model interactively

# Run with no arguments to open the model picker (local models + catalog)
./eullm

Web browsing

# Enable transparent web fetch (URLs in messages are fetched and injected)
./eullm run model.gguf --web

Usage

./eullm run ./model.gguf                    # Run any GGUF model
./eullm run ./model.gguf --batch-size 16    # Continuous batching for RAG
./eullm run ./model.gguf --web              # Enable web browsing

Verify checksums

sha256sum -c checksums.txt

What's Changed

Full Changelog: EuLLM-v0.6.0-beta.8...EuLLM-v0.6.0

v0.5.20

06 Jun 13:43
acf225f

Choose a tag to compare

EULLM Engine EuLLM-v0.5.20

Drop-in Ollama replacement with continuous batching and EU AI Act audit trail.

Quick install

# Linux x64 (CPU only)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.5.20/eullm-linux-x64 -o eullm
chmod +x eullm

# Linux x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000 series)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.5.20/eullm-linux-x64-cuda-12.8 -o eullm
chmod +x eullm

# macOS Apple Silicon
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.5.20/eullm-macos-arm64 -o eullm
chmod +x eullm
# Windows x64 (CPU only) — PowerShell
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.5.20/eullm-windows-x64.exe -OutFile eullm.exe

# Windows x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000)
# This is a ZIP: eullm.exe + the CUDA runtime DLLs it needs.
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.5.20/eullm-windows-x64-cuda-12.8.zip -OutFile eullm-cuda.zip
Expand-Archive eullm-cuda.zip -DestinationPath eullm-cuda
# Run eullm.exe from inside the extracted folder (the DLLs must
# stay next to it).
.\eullm-cuda\eullm.exe run model.gguf

Which binary to download?

Binary GPU Support Requirements
eullm-linux-x64 CPU only None
eullm-linux-x64-cuda-12.8 NVIDIA GPU (RTX 3000/4000/5000) NVIDIA driver 570+
eullm-linux-arm64 CPU only ARM64 Linux
eullm-macos-x64 CPU only macOS Intel
eullm-macos-arm64 Metal (Apple GPU) macOS Apple Silicon
eullm-windows-x64.exe CPU only Windows 10/11 x64
eullm-windows-x64-cuda-12.8.zip NVIDIA GPU (RTX 3000/4000/5000) NVIDIA driver 570+ for Windows. ZIP bundles eullm.exe + CUDA DLLs — run from the extracted folder

Pick a model interactively

# Run with no arguments to open the model picker (local models + catalog)
./eullm

Web browsing

# Enable transparent web fetch (URLs in messages are fetched and injected)
./eullm run model.gguf --web

Usage

./eullm run ./model.gguf                    # Run any GGUF model
./eullm run ./model.gguf --batch-size 16    # Continuous batching for RAG
./eullm run ./model.gguf --web              # Enable web browsing

Verify checksums

sha256sum -c checksums.txt

What's Changed

Full Changelog: EuLLM-v0.6.0-beta.6...EuLLM-v0.5.20

v0.6.0-beta.8

06 Jun 20:27
ba62695

Choose a tag to compare

v0.6.0-beta.8 Pre-release
Pre-release

EULLM Engine EuLLM-v0.6.0-beta.8

Drop-in Ollama replacement with continuous batching and EU AI Act audit trail.

Quick install

# Linux x64 (CPU only)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.0-beta.8/eullm-linux-x64 -o eullm
chmod +x eullm

# Linux x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000 series)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.0-beta.8/eullm-linux-x64-cuda-12.8 -o eullm
chmod +x eullm

# macOS Apple Silicon
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.0-beta.8/eullm-macos-arm64 -o eullm
chmod +x eullm
# Windows x64 (CPU only) — PowerShell
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.0-beta.8/eullm-windows-x64.exe -OutFile eullm.exe

# Windows x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000)
# This is a ZIP: eullm.exe + the CUDA runtime DLLs it needs.
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.0-beta.8/eullm-windows-x64-cuda-12.8.zip -OutFile eullm-cuda.zip
Expand-Archive eullm-cuda.zip -DestinationPath eullm-cuda
# Run eullm.exe from inside the extracted folder (the DLLs must
# stay next to it).
.\eullm-cuda\eullm.exe run model.gguf

Which binary to download?

Binary GPU Support Requirements
eullm-linux-x64 CPU only None
eullm-linux-x64-cuda-12.8 NVIDIA GPU (RTX 3000/4000/5000) NVIDIA driver 570+
eullm-linux-arm64 CPU only ARM64 Linux
eullm-macos-x64 CPU only macOS Intel
eullm-macos-arm64 Metal (Apple GPU) macOS Apple Silicon
eullm-windows-x64.exe CPU only Windows 10/11 x64
eullm-windows-x64-cuda-12.8.zip NVIDIA GPU (RTX 3000/4000/5000) NVIDIA driver 570+ for Windows. ZIP bundles eullm.exe + CUDA DLLs — run from the extracted folder

Pick a model interactively

# Run with no arguments to open the model picker (local models + catalog)
./eullm

Web browsing

# Enable transparent web fetch (URLs in messages are fetched and injected)
./eullm run model.gguf --web

Usage

./eullm run ./model.gguf                    # Run any GGUF model
./eullm run ./model.gguf --batch-size 16    # Continuous batching for RAG
./eullm run ./model.gguf --web              # Enable web browsing

Verify checksums

sha256sum -c checksums.txt

What's Changed

Full Changelog: EuLLM-v0.6.0-beta.7...EuLLM-v0.6.0-beta.8

v0.6.0-beta.7

06 Jun 17:19
f68a135

Choose a tag to compare

v0.6.0-beta.7 Pre-release
Pre-release

EULLM Engine EuLLM-v0.6.0-beta.7

Drop-in Ollama replacement with continuous batching and EU AI Act audit trail.

Quick install

# Linux x64 (CPU only)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.0-beta.7/eullm-linux-x64 -o eullm
chmod +x eullm

# Linux x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000 series)
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.0-beta.7/eullm-linux-x64-cuda-12.8 -o eullm
chmod +x eullm

# macOS Apple Silicon
curl -L https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.0-beta.7/eullm-macos-arm64 -o eullm
chmod +x eullm
# Windows x64 (CPU only) — PowerShell
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.0-beta.7/eullm-windows-x64.exe -OutFile eullm.exe

# Windows x64 with NVIDIA GPU (CUDA 12.8 — supports RTX 3000/4000/5000)
# This is a ZIP: eullm.exe + the CUDA runtime DLLs it needs.
Invoke-WebRequest -Uri https://github.com/eullm/eullm/releases/download/EuLLM-v0.6.0-beta.7/eullm-windows-x64-cuda-12.8.zip -OutFile eullm-cuda.zip
Expand-Archive eullm-cuda.zip -DestinationPath eullm-cuda
# Run eullm.exe from inside the extracted folder (the DLLs must
# stay next to it).
.\eullm-cuda\eullm.exe run model.gguf

Which binary to download?

Binary GPU Support Requirements
eullm-linux-x64 CPU only None
eullm-linux-x64-cuda-12.8 NVIDIA GPU (RTX 3000/4000/5000) NVIDIA driver 570+
eullm-linux-arm64 CPU only ARM64 Linux
eullm-macos-x64 CPU only macOS Intel
eullm-macos-arm64 Metal (Apple GPU) macOS Apple Silicon
eullm-windows-x64.exe CPU only Windows 10/11 x64
eullm-windows-x64-cuda-12.8.zip NVIDIA GPU (RTX 3000/4000/5000) NVIDIA driver 570+ for Windows. ZIP bundles eullm.exe + CUDA DLLs — run from the extracted folder

Pick a model interactively

# Run with no arguments to open the model picker (local models + catalog)
./eullm

Web browsing

# Enable transparent web fetch (URLs in messages are fetched and injected)
./eullm run model.gguf --web

Usage

./eullm run ./model.gguf                    # Run any GGUF model
./eullm run ./model.gguf --batch-size 16    # Continuous batching for RAG
./eullm run ./model.gguf --web              # Enable web browsing

Verify checksums

sha256sum -c checksums.txt

What's Changed

  • Feat/UI math rendering by @primoco in #175
  • fix(engine,ci): keep clippy gate off vendored crates; fix CUDA submod… by @primoco in #176

Full Changelog: EuLLM-v0.5.20...EuLLM-v0.6.0-beta.7