model: using single llm_build per arch by ngxson · Pull Request #21970 · ggml-org/llama.cpp

ngxson · 2026-04-15T22:22:18Z

Overview

Prepare for #21966

Using a single llm_build_* class per arch will make the migration a bit easier.

Example before:

        case LLM_ARCH_LLAMA:
            {
                llm = std::make_unique<llm_build_llama<false>>(*this, params);
            } break;
        case LLM_ARCH_LLAMA4:
            {
                if (hparams.swa_type == LLAMA_SWA_TYPE_NONE) {
                    llm = std::make_unique<llm_build_llama<false>>(*this, params);
                } else {
                    llm = std::make_unique<llm_build_llama_iswa>(*this, params);
                }
            } break;

Example after:

        case LLM_ARCH_LLAMA:
            {
                llm = std::make_unique<llm_build_llama<false>>(*this, params);
            } break;
        case LLM_ARCH_LLAMA4:
            {
                if (hparams.swa_type == LLAMA_SWA_TYPE_NONE) {
                    llm = std::make_unique<llm_build_llama4<false>>(*this, params);
                } else {
                    llm = std::make_unique<llm_build_llama4<true>>(*this, params);
                }
            } break;
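
For readers unfamiliar with the pattern, the "after" dispatch can be sketched in isolation. This is only a minimal illustration, not the code from this PR: `graph_builder`, `attention_kind()`, `make_llama4_builder()`, `check_llama4_dispatch()`, and the template parameter name `iswa` are hypothetical, and the enum is a simplified stand-in for the one in llama.cpp.

```cpp
#include <cassert>
#include <memory>
#include <string>

// Simplified stand-in for the SWA type enum (the real one has more values).
enum llama_swa_type { LLAMA_SWA_TYPE_NONE, LLAMA_SWA_TYPE_CHUNKED };

// Hypothetical common base so the factory can return either variant.
struct graph_builder {
    virtual ~graph_builder() = default;
    virtual std::string attention_kind() const = 0;
};

// One class template per arch; the bool selects the attention variant.
// The parameter name `iswa` is an assumption made for this sketch.
template <bool iswa>
struct llm_build_llama4 : graph_builder {
    std::string attention_kind() const override {
        return iswa ? "iswa" : "full";
    }
};

// Mirrors the shape of the switch case above: the runtime hparam picks
// which instantiation of the single per-arch template gets constructed.
std::unique_ptr<graph_builder> make_llama4_builder(llama_swa_type swa_type) {
    if (swa_type == LLAMA_SWA_TYPE_NONE) {
        return std::make_unique<llm_build_llama4<false>>();
    }
    return std::make_unique<llm_build_llama4<true>>();
}

// Small self-check of the dispatch.
void check_llama4_dispatch() {
    assert(make_llama4_builder(LLAMA_SWA_TYPE_NONE)->attention_kind() == "full");
    assert(make_llama4_builder(LLAMA_SWA_TYPE_CHUNKED)->attention_kind() == "iswa");
}
```

Both variants now share one class name, which is presumably what lets a migration script map each arch to exactly one builder.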

Requirements

I have read and agree with the contributing guidelines
AI usage disclosure: no

CISC

This will conflict with #21971, mind landing that one first?

ggerganov · 2026-04-16T07:57:05Z

+struct llm_build_t5encoder : public llm_build_t5<true> {
+    llm_build_t5encoder(const llama_model & model, const llm_graph_params & params);


Do we need to keep this, instead of using llm_build_t5<true>?

I think it's there to match the arch name (for the migration script)?

@ggerganov the t5encoder has its own tensor loader and hparams loader, so I think it makes sense to have a dedicated graph builder. The dedicated class will then be expanded into llama_model_t5encoder during the migration (via a script), and its hparams/tensors loaders will be moved there.
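
As a rough sketch of what such a dedicated class amounts to, a thin subclass can give a template instantiation its own arch-matching name. The stand-in types below are empty placeholders, and the `flag` member and `check_t5encoder()` helper are hypothetical. What the bool parameter of `llm_build_t5` actually controls is not stated in this thread.

```cpp
#include <cassert>
#include <type_traits>

// Empty placeholders for the real llama.cpp types.
struct llama_model {};
struct llm_graph_params {};

// Stand-in for the templated builder; `flag` only exposes the template
// argument for this demo and is not part of the real class.
template <bool B>
struct llm_build_t5 {
    static constexpr bool flag = B;
    llm_build_t5(const llama_model &, const llm_graph_params &) {}
    virtual ~llm_build_t5() = default;
};

// Thin named wrapper: same behavior as llm_build_t5<true>, but with a
// class name that matches the arch, so tooling can key on the name.
struct llm_build_t5encoder : public llm_build_t5<true> {
    llm_build_t5encoder(const llama_model & model, const llm_graph_params & params)
        : llm_build_t5<true>(model, params) {}
};

// Small self-check: the wrapper is-a llm_build_t5<true>.
void check_t5encoder() {
    static_assert(std::is_base_of<llm_build_t5<true>, llm_build_t5encoder>::value,
                  "wrapper must derive from the <true> instantiation");
    llama_model model;
    llm_graph_params params;
    llm_build_t5encoder enc(model, params);
    assert(enc.flag == true);
}
```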

ngxson · 2026-04-16T16:30:37Z

Should be ok now I guess? @CISC @ggerganov

* model: using single llm_build per arch * fix merge * nits

model: using single llm_build per arch

be46a50

ngxson requested a review from pwilkin April 15, 2026 22:22

ngxson requested review from CISC and ggerganov as code owners April 15, 2026 22:22

CISC approved these changes Apr 15, 2026

github-actions Bot added the model Model specific label Apr 15, 2026

ggerganov reviewed Apr 16, 2026

ngxson mentioned this pull request Apr 16, 2026

model: move load_hparams and load_tensors to per-model definition #22004

Merged

6 tasks

ngxson added 3 commits April 16, 2026 18:28

Merge branch 'master' into xsn/one_llm_build_per_arch

a2295b0

fix merge

8c39fc8

Merge branch 'master' into xsn/one_llm_build_per_arch

13a4d7a

nits

e6125f8

ggerganov approved these changes Apr 16, 2026

ggerganov added the merge ready A maintainer can use this label to indicate that they consider the changes final and ready to merge. label Apr 16, 2026

ngxson merged commit 4fbdabd into ggml-org:master Apr 16, 2026
46 of 50 checks passed

ngxson merged 5 commits into ggml-org:master from ngxson:xsn/one_llm_build_per_arch