
convert : store ffn_gate_inp_shexp as F32 #19606

Merged
CISC merged 1 commit into master from cisc/convert-gate-inp-shexp-f32
Feb 14, 2026
Conversation

@CISC
Member

@CISC CISC commented Feb 13, 2026

This tensor has been inadvertently stored as BF16 in several models, and since it's 1D it will never be quantized.
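The reasoning above can be sketched as a small dtype-selection helper (a hypothetical standalone function, not the actual convert_hf_to_gguf.py code): quantization skips 1D tensors, so writing them as BF16 only loses precision with no later size benefit.

```python
def choose_dtype(name: str, n_dims: int, default: str = "bf16") -> str:
    """Pick the storage type for a tensor during conversion.

    Hypothetical sketch: 1D tensors (norms, expert-router gates such as
    ffn_gate_inp_shexp) are never quantized later, so store them as F32
    instead of BF16.
    """
    if n_dims == 1:
        return "f32"
    return default

print(choose_dtype("blk.0.ffn_gate_inp_shexp.weight", 1))  # f32
print(choose_dtype("blk.0.ffn_down.weight", 2))            # bf16
```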

@CISC CISC requested a review from ggerganov February 13, 2026 21:44
@github-actions github-actions bot added the python python script changes label Feb 13, 2026
@CISC CISC merged commit 0d00ef6 into master Feb 14, 2026
9 checks passed
@ggerganov
Member

@CISC Hijacking this thread to report a minor regression that I'm observing. I need the following patch to be able to convert https://huggingface.co/Qwen/Qwen3-30B-A3B-Base:

diff --git a/convert_hf_to_gguf.py b/convert_hf_to_gguf.py
index 825080b58..1e5209cdc 100755
--- a/convert_hf_to_gguf.py
+++ b/convert_hf_to_gguf.py
@@ -4161,7 +4161,7 @@ class Qwen2MoeModel(TextModel):
             return
 
         if name.find("experts") != -1:
-            n_experts = self.hparams["num_experts"]
+            n_experts = self.find_hparam(["num_local_experts", "num_experts"])
             assert bid is not None
 
             if self._experts is None:

@CISC CISC deleted the cisc/convert-gate-inp-shexp-f32 branch February 14, 2026 07:17
@CISC
Member Author

CISC commented Feb 14, 2026

@CISC Hijacking this thread to report a minor regression that I'm observing. I need the following patch to be able to convert https://huggingface.co/Qwen/Qwen3-30B-A3B-Base:

Hmmm, ok, not sure why that would have regressed, I'll look into it.

@ggerganov
Member

Last I tested in mid-Jan, it was converting successfully with this commit: 2bbe4c2. Not sure if it's a problem in the convert script, or something in the Python dependencies has changed.

@CISC
Member Author

CISC commented Feb 14, 2026

Last I tested in mid-Jan, it was converting successfully with this commit: 2bbe4c2. Not sure if it's a problem in the convert script, or something in the Python dependencies has changed.

Are you sure? I see the non-base model uses num_experts, I think this just changed in transformers by the time they released the base model.

Edit: Ah, I see the problem: it's transformers itself that changed. The key is num_experts in config.json, but AutoConfig returns num_local_experts.
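The fix in the patch above boils down to a key-fallback lookup. A minimal sketch of a find_hparam-style helper (hypothetical standalone version; the real method lives on the model class in convert_hf_to_gguf.py):

```python
def find_hparam(hparams: dict, keys: list, optional: bool = False):
    # Return the value of the first key present in the config dict, so the
    # conversion works whether the config exposes "num_experts" (raw
    # config.json) or "num_local_experts" (transformers AutoConfig).
    for key in keys:
        if key in hparams:
            return hparams[key]
    if optional:
        return None
    raise KeyError(f"could not find any of: {keys}")

raw_cfg = {"num_experts": 128}         # key as written in config.json
auto_cfg = {"num_local_experts": 128}  # key as exposed by AutoConfig
n_experts = find_hparam(raw_cfg, ["num_local_experts", "num_experts"])
print(n_experts)  # 128
```

Probing both names makes the conversion robust to transformers renaming config attributes between releases.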

liparetejas pushed a commit to liparetejas/llama.cpp that referenced this pull request Feb 23, 2026
bartowski1182 pushed a commit to bartowski1182/llama.cpp that referenced this pull request Mar 2, 2026
ArberSephirotheca pushed a commit to ArberSephirotheca/llama.cpp that referenced this pull request Mar 3, 2026