Be able to use imatrix computed with merged ffn_gate_up_exps by ikawrakow · Pull Request #1419 · ikawrakow/ik_llama.cpp

ikawrakow · 2026-03-13T13:28:34Z

This PR is a sibling of #1418.

If one has an imatrix available that has been computed using a model with merged ffn_gate_up_exps tensors, but one has a model where ffn_up_exps and ffn_gate_exps are separate, the PR allow the imatrix to be still used to quantize this model. Basically, the ffn_up_exps, ffn_gate_exps, and ffn_gate_up_exps tensors "see" exactly the same activations, so one can use the imatrix data for ffn_gate_up_exps also for ffn_up_exps and ffn_gate_exps. Correspondingly, also the reverse case is now supported. I.e., one has an imatrix computed with separate ffn_up_exps and ffn_gate_exps tensors, but now one wants to use it to quantize a model with merged ffn_gate_up_exps tensors. This will also work with this PR.

ubergarm · 2026-03-13T16:05:32Z

Thanks for holding the world together a little longer!!

ikawrakow added 2 commits March 13, 2026 13:22

Be able to use imatrix computed with merged ffn_gate_up_exps

69afb43

Also the other way around

c12fdbe

ubergarm mentioned this pull request Mar 13, 2026

llama: Add option to merge gate and exp weights ggml-org/llama.cpp#19139

Merged

ikawrakow merged commit c2b8e95 into main Mar 13, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Be able to use imatrix computed with merged ffn_gate_up_exps#1419

Be able to use imatrix computed with merged ffn_gate_up_exps#1419
ikawrakow merged 2 commits intomainfrom
ik/quantize_fused_up_gate

ikawrakow commented Mar 13, 2026 •

edited

Loading

Uh oh!

ubergarm commented Mar 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

ikawrakow commented Mar 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ubergarm commented Mar 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ikawrakow commented Mar 13, 2026 •

edited

Loading