arch: refactor LLM_TENSOR_NAMES by ngxson · Pull Request #18051 · ggml-org/llama.cpp

ngxson · 2025-12-15T09:09:51Z

Motivation

while working on #18042 , I want to add a new arch GLM4V which is mostly the same as GLM4, but use M-RoPE instead of normal RoPE.

However, I feel like sometimes it can be quite too redundant to duplicate the list of tensors and cgraph (for cgraph, we can reuse the same file, but switch rope type using template)

This PR propose a new way to organize model tensors naming, bring it more aligned with the convert_hf_to_gguf.py script, while also allow multiple models to reuse the same mapping.

TODO in a follow-up PR: also update the mapping in convert_hf_to_gguf.py to allow models to reuse the same mappings

This PR was mostly generated using this script: https://gist.github.com/ngxson/a20411de6bc66a84a4d354b254bfe4da

Before

static const std::map<llm_arch, std::map<llm_tensor, const char *>> LLM_TENSOR_NAMES = {
    {
        LLM_ARCH_LLAMA,
        {
            { LLM_TENSOR_TOKEN_EMBD,      "token_embd" },
            { LLM_TENSOR_OUTPUT_NORM,     "output_norm" },
            { LLM_TENSOR_OUTPUT,          "output" },
            { LLM_TENSOR_ROPE_FREQS,      "rope_freqs" },
            { LLM_TENSOR_ATTN_NORM,       "blk.%d.attn_norm" },
            { LLM_TENSOR_ATTN_Q,          "blk.%d.attn_q" },
            ...

After

static const std::map<llm_tensor, const char *> LLM_TENSOR_NAMES = {
    { LLM_TENSOR_TOKEN_EMBD,                             "token_embd" },
    { LLM_TENSOR_OUTPUT_NORM,                            "output_norm" },
    { LLM_TENSOR_OUTPUT_NORM_LFM2,                       "token_embd_norm" }, // fix for wrong tensor name
    { LLM_TENSOR_OUTPUT,                                 "output" },
...

static std::set<llm_tensor> llm_get_tensor_names(llm_arch arch) {
    switch (arch) {
        case LLM_ARCH_LLAMA:
        case LLM_ARCH_DECI:
        case LLM_ARCH_MISTRAL3:
            return {
                LLM_TENSOR_TOKEN_EMBD,
                LLM_TENSOR_OUTPUT_NORM,
                LLM_TENSOR_OUTPUT,
                LLM_TENSOR_ROPE_FREQS,
                LLM_TENSOR_ATTN_NORM,
                ...

* arch: refactor LLM_TENSOR_NAMES * update docs * typo * fix LLM_ARCH_NEMOTRON_H_MOE * show more meaningful error message on missing tensor * fix and tested LLM_ARCH_NEMOTRON_H_MOE

ngxson added 2 commits December 15, 2025 10:02

arch: refactor LLM_TENSOR_NAMES

d2bed05

update docs

6215579

ngxson requested a review from ggerganov December 15, 2025 09:09

ngxson requested a review from CISC as a code owner December 15, 2025 09:09

github-actions bot added the documentation Improvements or additions to documentation label Dec 15, 2025

ngxson added the refactoring Refactoring label Dec 15, 2025

typo

dc73ba9

ngxson mentioned this pull request Dec 15, 2025

model: support GLM4V vision encoder #18042

Merged

ggerganov approved these changes Dec 16, 2025

View reviewed changes

ngxson added 4 commits December 16, 2025 12:03

Merge branch 'master' into xsn/arch_refactor_llm_names

942ddbe

fix LLM_ARCH_NEMOTRON_H_MOE

f4b088c

show more meaningful error message on missing tensor

dffb032

fix and tested LLM_ARCH_NEMOTRON_H_MOE

4a78eba

ngxson merged commit 7f2b2f3 into ggml-org:master Dec 16, 2025
67 of 68 checks passed

ngxson mentioned this pull request Dec 16, 2025

model: fix LFM2 missing tensors #18105

Merged

loci-dev mentioned this pull request Dec 16, 2025

UPSTREAM PR #18105: model: fix LFM2 missing tensors auroralabs-loci/llama.cpp#594

Open

tdakhran mentioned this pull request Dec 17, 2025

model: fix LFM2_MOE missing tensors #18132

Merged

wallentri88 mentioned this pull request Feb 24, 2026

Eval bug: qwen35 and qwen35moe graph split issues (Severe PP impact, crashes) #19864

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

arch: refactor LLM_TENSOR_NAMES#18051

arch: refactor LLM_TENSOR_NAMES#18051
ngxson merged 7 commits intoggml-org:masterfrom
ngxson:xsn/arch_refactor_llm_names

ngxson commented Dec 15, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

ngxson commented Dec 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Before

After

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ngxson commented Dec 15, 2025 •

edited

Loading