feat(parakeet-cpp): enable GGML_CUDA_GRAPHS in the cublas build by localai-bot · Pull Request #10273 · mudler/LocalAI

localai-bot · 2026-06-12T16:37:53Z

What

Pass -DGGML_CUDA_GRAPHS=ON alongside -DPARAKEET_GGML_CUDA=ON in the parakeet-cpp backend's cublas build.

Why

ggml leaves GGML_CUDA_GRAPHS off by default. With it on, the CUDA backend captures and replays the compute graph for a small but free speedup. Measured on a GB10 (interleaved, best-of, same 180s clip):

model	graphs ON	graphs OFF	gain
tdt-1.1b	~1477 ms	~1498 ms	+1.4%
tdt-0.6b-v3	~970 ms	~974 ms	+0.4%

The gain was never negative across runs. The flag is not gated by parakeet.cpp's CMake options, so it passes straight through to ggml and takes effect regardless of the pinned parakeet.cpp commit.
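The gain column in the table can be reproduced from the two timing columns. The PR does not state which formula was used, so the sketch below assumes gain = (OFF / ON - 1) × 100; the dictionary layout and the `gain_percent` helper are illustrative names, not part of the backend.

```python
# Timings copied from the table above (milliseconds, approximate).
timings_ms = {
    "tdt-1.1b": {"on": 1477, "off": 1498},
    "tdt-0.6b-v3": {"on": 970, "off": 974},
}


def gain_percent(on_ms: float, off_ms: float) -> float:
    """Speedup of graphs ON relative to graphs OFF, in percent (assumed formula)."""
    return (off_ms / on_ms - 1.0) * 100.0


gains = {model: gain_percent(t["on"], t["off"]) for model, t in timings_ms.items()}

for model, gain in gains.items():
    print(f"{model}: {gain:+.1f}%")
# prints:
# tdt-1.1b: +1.4%
# tdt-0.6b-v3: +0.4%
```

Dividing the difference by the OFF time instead gives the same values after rounding to one decimal place, so the table is consistent with either convention.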

Notes

The flag is set explicitly here because the backend builds a pinned parakeet.cpp commit; upstream also enables it in its CMake (build(cuda): enable GGML_CUDA_GRAPHS on CUDA builds parakeet.cpp#26), so the two stay consistent.
The runtime kill-switch GGML_CUDA_DISABLE_GRAPHS=1 still works for A/B testing.
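An A/B comparison with the kill-switch could be scripted roughly as follows. This is a minimal sketch, not the harness used for the numbers above: it alternates runs with and without GGML_CUDA_DISABLE_GRAPHS=1 and keeps the best time per variant, mirroring the "interleaved, best-of" wording. The function names are hypothetical, and the timing is whole-process wall-clock time, so it includes startup and model loading.

```python
import os
import subprocess
import sys
import time

KILL_SWITCH = "GGML_CUDA_DISABLE_GRAPHS"  # runtime variable named in the note above


def run_once(cmd, graphs_enabled):
    """Run cmd once and return its wall-clock duration in milliseconds."""
    env = dict(os.environ)
    env.pop(KILL_SWITCH, None)  # start from a clean state
    if not graphs_enabled:
        env[KILL_SWITCH] = "1"  # disable CUDA graphs for this run only
    start = time.perf_counter()
    subprocess.run(cmd, env=env, check=True, stdout=subprocess.DEVNULL)
    return (time.perf_counter() - start) * 1000.0


def best_of_interleaved(cmd, rounds=5):
    """Alternate ON/OFF runs and keep the fastest time seen for each variant."""
    best = {"graphs_on": float("inf"), "graphs_off": float("inf")}
    for _ in range(rounds):
        best["graphs_on"] = min(best["graphs_on"], run_once(cmd, True))
        best["graphs_off"] = min(best["graphs_off"], run_once(cmd, False))
    return best


# Stand-in command so the sketch runs anywhere; replace it with the real
# transcription command for a CUDA build of the backend.
demo_cmd = [sys.executable, "-c", "pass"]
print(best_of_interleaved(demo_cmd, rounds=2))
```

Interleaving the two variants spreads thermal and clock drift across both, and taking the minimum discards runs disturbed by other system activity.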

🤖 Generated with Claude Code

ggml leaves GGML_CUDA_GRAPHS off by default. Passing -DGGML_CUDA_GRAPHS=ON for cublas builds lets the CUDA backend capture and replay the compute graph for a small free speedup (about 1% measured on a GB10, never negative). It is not gated by parakeet.cpp's CMake options, so it passes straight through to ggml.

Assisted-by: Claude Opus 4.8 <noreply@anthropic.com>

mudler approved these changes Jun 12, 2026


mudler merged commit 8c8204d into master Jun 12, 2026
66 of 67 checks passed

mudler deleted the feat/parakeet-cuda-graphs branch June 12, 2026 16:47

BrewTestBot mentioned this pull request Jun 13, 2026

localai 4.4.3 Homebrew/homebrew-core#287865

Merged

mudler merged 1 commit into master from feat/parakeet-cuda-graphs
