
ci: Setup self-hosted CI for Intel Linux Vulkan backend#20154

Merged
0cc4m merged 3 commits into ggml-org:master from rillomas:setup-ci-for-intel-linux
Mar 12, 2026

Conversation

@rillomas (Contributor) commented Mar 6, 2026

We would like to add a single self-hosted CI runner for the Intel Linux Vulkan backend, related to #19213.
It will be focused on the Vulkan backend only for now, but once the OpenVINO backend is merged we will use it for both backends.
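For context, a self-hosted runner is targeted from a workflow via `runs-on` labels. A minimal sketch of such a job follows; the job name and the `intel`/`vulkan` labels are illustrative assumptions, not the actual runner configuration used here:

```yaml
# Hypothetical job definition; the label names below are assumptions,
# not the labels configured on the actual ggml-org runner.
jobs:
  ggml-ci-intel-linux-vulkan:
    runs-on: [self-hosted, Linux, intel, vulkan]
    steps:
      - uses: actions/checkout@v4
      - name: Build with the Vulkan backend
        run: |
          cmake -B build -DGGML_VULKAN=ON
          cmake --build build --config Release
      - name: Run backend ops tests
        run: ./build/bin/test-backend-ops
```

A self-hosted runner only picks up jobs whose `runs-on` labels are all present on that runner, which is what makes per-backend routing possible later.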

An example workflow execution on the branch can be viewed at the following:

@github-actions github-actions bot added the devops improvements to build systems and github actions label Mar 6, 2026
@rillomas rillomas changed the title from "ci: Setup self-hosted ci for Intel linux Vulkan backend" to "ci: Setup self-hosted CI for Intel Linux Vulkan backend" Mar 6, 2026
@rillomas rillomas marked this pull request as ready for review March 10, 2026 03:07
@rillomas rillomas requested a review from CISC as a code owner March 10, 2026 03:07
@rillomas (Contributor, Author) commented Mar 10, 2026

Hi @ggerganov, we are ready to add an instance for the Intel Vulkan backend. Could you please send me a runner token? I am located in Japan, so hopefully your time zone is close to mine (runner tokens only last for an hour).

Update: I received a token and confirmed the CI instance was added.

@rillomas rillomas marked this pull request as draft March 10, 2026 04:41
@rillomas rillomas marked this pull request as ready for review March 10, 2026 07:08
@ggerganov (Member) commented

Thanks, the workflow appears to run successfully: https://github.com/ggml-org/llama.cpp/actions/runs/22884579987/job/66394392012?pr=20154

@0cc4m @jeffbolznv FYI, this is a coopmat runner.

@ggerganov ggerganov requested a review from 0cc4m March 10, 2026 07:21
@rillomas (Contributor, Author) commented

Thanks, the workflow appears to run successfully: https://github.com/ggml-org/llama.cpp/actions/runs/22884579987/job/66394392012?pr=20154

FYI, this specific workflow ran on another instance that was already added by the OpenVINO team (LNL). The Vulkan workload will run on both instances, but I haven't tested whether the OpenVINO workload works on the instance I added today (PTL).

@ggerganov (Member) commented

Thanks for pointing that out. If the OpenVINO workflows don't run on your runner, we will gate them with an extra tag.
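Gating by runner label, as suggested above, could look roughly like the sketch below; the `openvino` label and job name are hypothetical examples, not the tags actually chosen:

```yaml
# Hypothetical: an extra "openvino" label keeps this job off runners
# (such as the PTL instance) that only advertise Vulkan support.
jobs:
  ggml-ci-intel-linux-openvino:
    runs-on: [self-hosted, Linux, intel, openvino]
```

Because a job is only dispatched to runners carrying all of its `runs-on` labels, adding the extra label to the LNL machine alone would route OpenVINO jobs there while leaving Vulkan jobs free to run on either instance.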

@0cc4m 0cc4m merged commit 4cc6eb1 into ggml-org:master Mar 12, 2026
72 of 74 checks passed
tekintian added a commit to tekintian/llama.cpp that referenced this pull request Mar 12, 2026
* 'master' of github.com:ggml-org/llama.cpp: (33 commits)
  convert : better mtp check and fix return [no ci] (ggml-org#20419)
  vulkan: fix SSM_CONV PP scaling with large ubatch sizes (ggml-org#20379)
  New conversations now auto-select the first loaded model (ggml-org#20403)
  ggml-virtgpu: Fix some build commands (ggml-org#20341)
  metal : avoid divisions in bin kernel (ggml-org#20426)
  ci: Setup self-hosted CI for Intel Linux Vulkan backend (ggml-org#20154)
  vulkan: fix l2_norm epsilon handling (ggml-org#20350)
  vulkan: fix OOB check in flash_attn_mask_opt (ggml-org#20296)
  vulkan: Fix ErrorOutOfHostMemory on Intel GPU when loading large models with --no-mmap (ggml-org#20059)
  opencl: use larger workgroup size for get_rows (ggml-org#20316)
  opencl: add cumsum op (ggml-org#18981)
  hip: compile debug builds with -O2 on hip to avoid a compiler bug (ggml-org#20392)
  common/parser: add GigaChatV3/3.1 models support (ggml-org#19931)
  model : add support for Phi4ForCausalLMV (ggml-org#20168)
  graph : add optional scale parameter to build_lora_mm [no ci] (ggml-org#20427)
  common : fix --n-cpu-moe, --cpu-moe for models with fused gate + up (ggml-org#20416)
  ggml-webgpu: Add supports for `GGML_OP_REPEAT` (ggml-org#20230)
  llama : enable chunked fused GDN path (ggml-org#20340)
  llama : whitespace cleanup (ggml-org#20422)
  ggml : add NVFP4 quantization type support (ggml-org#19769)
  ...