build: enable parallel builds in msbuild using MTT by jeffbolznv · Pull Request #17708 · ggml-org/llama.cpp

jeffbolznv · 2025-12-03T02:20:31Z

msbuild performance has been poor recently, particularly since the model implementations were split into separate source files. This change enables MultiToolTask which enables parallelism within a project without the problems of /MP. See https://devblogs.microsoft.com/cppblog/improved-parallelism-in-msbuild/.

I tested each of the vulkan and cuda backends. My CPU is a 32-core threadripper.

vulkan before
========== Rebuild started at 2:50 PM and took 03:38.550 minutes ==========
vulkan after
========== Rebuild started at 3:36 PM and took 01:57.272 minutes ==========

cuda before
========== Rebuild started at 3:27 PM and took 07:59.340 minutes ==========
cuda after
========== Rebuild started at 3:38 PM and took 06:17.516 minutes ==========

This is supported in vs2019+. I think unknown options are ignored, so this shouldn't be harmful on older versions.

ggerganov

I don't have an environment to test this. Feel free to merge if it works on your end.

Acly · 2025-12-03T10:18:35Z

Maybe guard it with if(LLAMA_STANDALONE) ?

jeffbolznv · 2025-12-03T15:08:46Z

Maybe guard it with if(LLAMA_STANDALONE) ?

Sure, done.

* build: enable parallel builds in msbuild using MTT * check LLAMA_STANDALONE

* origin/master: server: strip content-length header on proxy (ggml-org#17734) server: move msg diffs tracking to HTTP thread (ggml-org#17740) examples : add missing code block end marker [no ci] (ggml-org#17756) common : skip model validation when --help is requested (ggml-org#17755) ggml-cpu : remove asserts always evaluating to false (ggml-org#17728) convert: use existing local chat_template if mistral-format model has one. (ggml-org#17749) cmake : simplify build info detection using standard variables (ggml-org#17423) ci : disable ggml-ci-x64-amd-* (ggml-org#17753) common: use native MultiByteToWideChar (ggml-org#17738) metal : use params per pipeline instance (ggml-org#17739) llama : fix sanity checks during quantization (ggml-org#17721) build : move _WIN32_WINNT definition to headers (ggml-org#17736) build: enable parallel builds in msbuild using MTT (ggml-org#17708) ggml-cpu: remove duplicate conditional check 'iid' (ggml-org#17650) Add a couple of file types to the text section (ggml-org#17670) convert : support latest mistral-common (fix conversion with --mistral-format) (ggml-org#17712) Use OpenAI-compatible `/v1/models` endpoint by default (ggml-org#17689) webui: Fix zero pasteLongTextToFileLen to disable conversion being overridden (ggml-org#17445)

* build: enable parallel builds in msbuild using MTT * check LLAMA_STANDALONE

build: enable parallel builds in msbuild using MTT

79342b7

jeffbolznv requested a review from ggerganov as a code owner December 3, 2025 02:20

jeffbolznv mentioned this pull request Dec 3, 2025

build: for GGML_BACKEND_DL, ggml need not depend on backend #17709

Open

loci-dev mentioned this pull request Dec 3, 2025

UPSTREAM PR #17708: build: enable parallel builds in msbuild using MTT auroralabs-loci/llama.cpp#408

Open

github-actions bot added the build Compilation issues label Dec 3, 2025

ggerganov approved these changes Dec 3, 2025

View reviewed changes

check LLAMA_STANDALONE

18a08f9

jeffbolznv merged commit d8b5cdc into ggml-org:master Dec 4, 2025
76 of 80 checks passed

khemchand-zetta pushed a commit to khemchand-zetta/llama.cpp that referenced this pull request Dec 4, 2025

build: enable parallel builds in msbuild using MTT (ggml-org#17708)

3822d65

* build: enable parallel builds in msbuild using MTT * check LLAMA_STANDALONE

gabe-l-hart mentioned this pull request Dec 10, 2025

feat: llama.cpp bump (17f7f4) for SSM performance improvements ollama/ollama#13408

Merged

0Marble pushed a commit to 0Marble/llama.cpp that referenced this pull request Dec 18, 2025

build: enable parallel builds in msbuild using MTT (ggml-org#17708)

201277d

* build: enable parallel builds in msbuild using MTT * check LLAMA_STANDALONE

Anico2 added a commit to Anico2/llama.cpp that referenced this pull request Jan 15, 2026

build: enable parallel builds in msbuild using MTT (ggml-org#17708)

1c37ec5

* build: enable parallel builds in msbuild using MTT * check LLAMA_STANDALONE

blime4 referenced this pull request in blime4/llama.cpp Feb 5, 2026

build: enable parallel builds in msbuild using MTT (#17708)

60482bc

* build: enable parallel builds in msbuild using MTT * check LLAMA_STANDALONE

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

build: enable parallel builds in msbuild using MTT#17708

build: enable parallel builds in msbuild using MTT#17708
jeffbolznv merged 2 commits intoggml-org:masterfrom
jeffbolznv:msbuild_par

jeffbolznv commented Dec 3, 2025

Uh oh!

ggerganov left a comment

Uh oh!

Acly commented Dec 3, 2025

Uh oh!

jeffbolznv commented Dec 3, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

jeffbolznv commented Dec 3, 2025

Uh oh!

ggerganov left a comment

Choose a reason for hiding this comment

Uh oh!

Acly commented Dec 3, 2025

Uh oh!

jeffbolznv commented Dec 3, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants