ci : run ui publish on ubuntu-slim by CISC · Pull Request #23818 · ggml-org/llama.cpp

CISC · 2026-05-28T13:17:28Z

Overview

Let's try not to stall Release. :)

Requirements

I have read and agree with the contributing guidelines
AI usage disclosure: a'a

ggerganov · 2026-05-28T13:29:18Z

not sure which runners are under these tags?

Can you access this page: https://github.com/organizations/ggml-org/settings/actions/runners

CISC · 2026-05-28T13:35:08Z

not sure which runners are under these tags?

Can you access this page: https://github.com/organizations/ggml-org/settings/actions/runners

Nope.

ggerganov · 2026-05-28T13:43:53Z

I'll think about this change - the jobs that produce any user artifacts are security sensitive and likely should not run on self-hosted or 3rd-party-hosted runners without the proper disclaimers.

Plus I am not sure yet that the change will help overall. I was also watching how the release was stalled for half an hour because of this tiny last job, but the fact that it was not being picked up means that the runners are plenty busy with other stuff already. If the job was unblocked, it would just add more jobs to the pool which is already too big.

For the development process, I think the PR jobs are higher priority compared to the release jobs because the maintainers would wait less when they work on something. The release are a bit "secondary" with the new concurrency model of the CI.

ggerganov · 2026-05-28T13:56:22Z

However, this is problematic: https://github.com/ggml-org/llama.cpp/actions/runs/26560048666/job/78302518452#step:4:50

I think the cache for CUDA 13.3 got evicted. The vulkan ccaches seem excessive:

ggerganov · 2026-05-28T16:10:06Z

I think we are good now. 15 releases incoming in quick succession 😄

CISC · 2026-05-28T17:39:14Z

I think we are good now. 15 releases incoming in quick succession 😄

Alright!

A possible alternative in current PR is moving to slim.

ggerganov · 2026-05-28T17:41:42Z

Yes, just make sure the slim can handle it.

CISC · 2026-05-28T17:43:42Z

Yes, just make sure the slim can handle it.

Should do, the job just takes a few seconds, and nothing special in it.

CISC · 2026-05-28T17:58:40Z

Yes, just make sure the slim can handle it.

Should do, the job just takes a few seconds, and nothing special in it.

Successful test: https://github.com/CISC/llama.cpp/actions/runs/26592454747/job/78354538399

* origin/master: (32 commits) hexagon: basic/generic op fusion support and RMS_NORM+MUL fusion (ggml-org#23835) mtmd-debug: add color and rainbow mode (ggml-org#23829) mtmd: fix gemma 4 projector pre_norm (ggml-org#23822) opencl: move backend info printing into its own function (ggml-org#23702) ci : run ui publish on ubuntu-slim (ggml-org#23818) ui: fix audio and video modality detection (ggml-org#23756) ci : releases use Github-hosted builds for the UI (ggml-org#23823) app : improve help output (ggml-org#23805) mtmd: n_head_kv defaults to n_head (ggml-org#23782) mtmd: fix gemma 4 audio rms norm eps (ggml-org#23815) ci : change Vulkan builds to Release to reduce ccache (ggml-org#23820) arg: Add LLAMA_ARG_API_KEY_FILE environment variable for --api-key-file (ggml-org#23167) test-llama-archs: fix table format [no release] (ggml-org#23810) ggml: auto apply iGPU flag CUDA/HIP if integrated device (ggml-org#23007) mmvq Optim: add MMVQ_PARAMETERS_TURING(mmvq_parameter_table_id) for … (ggml-org#23729) CUDA: route batch>=4 quantized matmul to MMQ on AMD MFMA hardware (ggml-org#23227) server: minor tweaks to use more cpp features (ggml-org#23785) hexagon: minor refresh for HMX FA and MM (ggml-org#23796) vulkan: fast path for walsh-hadamard transform (ggml-org#23687) chat : add Granite 4.1 chat template (ggml-org#23518) ...

* run ui publish on self-hosted fast * run on ubuntu-slim

run ui publish on self-hosted fast

1a8c453

CISC requested a review from a team as a code owner May 28, 2026 13:17

github-actions Bot added the devops improvements to build systems and github actions label May 28, 2026

run on ubuntu-slim

ebb6ce1

CISC changed the title ~~ci : run ui publish on self-hosted fast~~ ci : run ui publish on ubuntu-slim May 28, 2026

ggerganov merged commit 3ef2369 into master May 28, 2026
3 checks passed

ggerganov deleted the cisc/ci-ui-publish-self-hosted-fast branch May 28, 2026 17:58

fewtarius pushed a commit to fewtarius/llama.cpp that referenced this pull request May 30, 2026

ci : run ui publish on ubuntu-slim (ggml-org#23818)

aee505a

* run ui publish on self-hosted fast * run on ubuntu-slim

turbo-tan pushed a commit to turbo-tan/llama.cpp-tq3 that referenced this pull request Jun 2, 2026

ci : run ui publish on ubuntu-slim (ggml-org#23818)

5e4a1c4

* run ui publish on self-hosted fast * run on ubuntu-slim

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ci : run ui publish on ubuntu-slim#23818

ci : run ui publish on ubuntu-slim#23818
ggerganov merged 2 commits into
masterfrom
cisc/ci-ui-publish-self-hosted-fast

CISC commented May 28, 2026 •

edited

Loading

Uh oh!

ggerganov commented May 28, 2026

Uh oh!

CISC commented May 28, 2026

Uh oh!

ggerganov commented May 28, 2026

Uh oh!

ggerganov commented May 28, 2026

Uh oh!

ggerganov commented May 28, 2026

Uh oh!

CISC commented May 28, 2026

Uh oh!

ggerganov commented May 28, 2026

Uh oh!

CISC commented May 28, 2026

Uh oh!

Uh oh!

CISC commented May 28, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

CISC commented May 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Overview

Requirements

Uh oh!

ggerganov commented May 28, 2026

Uh oh!

CISC commented May 28, 2026

Uh oh!

ggerganov commented May 28, 2026

Uh oh!

ggerganov commented May 28, 2026

Uh oh!

ggerganov commented May 28, 2026

Uh oh!

CISC commented May 28, 2026

Uh oh!

ggerganov commented May 28, 2026

Uh oh!

CISC commented May 28, 2026

Uh oh!

Uh oh!

CISC commented May 28, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

CISC commented May 28, 2026 •

edited

Loading