feat: Add conversion support in GraniteHybrid for non-hybrid (all attn) by gabe-l-hart · Pull Request #16177 · ggml-org/llama.cpp

gabe-l-hart · 2025-09-22T17:27:20Z

This PR adds conversion support for a configuration of the GraniteHybrid hparams in which every layer is attention, so the model devolves to the Granite (or GraniteMoe) architecture (i.e. Granite 3.x). Some models in the Granite 4 family may use this configuration, with the GraniteHybrid architecture acting as a superset arch. Rather than supporting it directly in the C++ graph, we simply coerce the architecture flag back to the correct "granite" or "granitemoe" architecture.
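As a rough illustration of the coercion described above, here is a minimal, self-contained sketch. It is not the code from `convert_hf_to_gguf.py`: the function name `resolve_architecture`, the `layer_types` list, the `num_experts` parameter, and the `"granitehybrid"` string are all assumptions made for the example. Only the `"granite"` and `"granitemoe"` names come from the description.

```python
from typing import Sequence


def resolve_architecture(layer_types: Sequence[str], num_experts: int = 0) -> str:
    """Pick the architecture name to write for a GraniteHybrid-style config.

    Hypothetical helper: `layer_types` is assumed to hold one entry per layer,
    with "attention" marking attention layers and anything else marking a
    recurrent (non-attention) layer.
    """
    # If any layer is not attention, the model is genuinely hybrid.
    if any(kind != "attention" for kind in layer_types):
        return "granitehybrid"  # assumed name for the hybrid architecture

    # All-attention: fall back to the plain architecture, choosing the MoE
    # variant when the config declares experts.
    return "granitemoe" if num_experts > 0 else "granite"


if __name__ == "__main__":
    print(resolve_architecture(["attention"] * 4))
    print(resolve_architecture(["attention"] * 4, num_experts=8))
    print(resolve_architecture(["mamba", "attention"]))
```

The point of the design is that the decision happens once, at conversion time, so the runtime graph code never needs an all-attention special case for the hybrid architecture.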

convert_hf_to_gguf.py

gabe-l-hart · 2025-09-22T18:13:37Z

@CISC thanks for catching my slop. Too much multi-tasking!

This is a configuration of the hparams in the GraniteHybrid architecture that devolves to the Granite (or GraniteMoe) architecture (ie Granite 3.x). It may be used for some models in the Granite 4 family with the GraniteHybrid architecture acting as a superset arch. Rather than support it directly in the c++ graph, we simply coerce the architecture flag back to the correct "granite" or "granitemoe" architecture.

Branch: gabe-l-hart/GraniteNonHybridConversion
Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>
Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>

gabe-l-hart · 2025-09-22T18:24:41Z

Rebased, squashed, and added you as co-author for your troubles. Much appreciated.

CISC · 2025-09-22T18:31:57Z

Will merge once you've verified conversion works as expected (unless you have access to merge yourself, not sure).

gabe-l-hart · 2025-09-22T18:34:43Z

Confirmed, conversion works. I seem to have lost the merge button some time over the weekend, so not sure the cause there.

CISC · 2025-09-22T18:40:49Z

> Confirmed, conversion works. I seem to have lost the merge button some time over the weekend, so not sure the cause there.

See #16113

gabe-l-hart · 2025-09-22T19:31:16Z

Ah, makes complete sense. Thanks for the pointer!

* origin/master: (39 commits)
  ci : disable AMD workflows + update NVIDIA workflows (ggml-org#16200)
  ci : enable Vulkan workflow on Mac (ggml-org#16194)
  ggml-cpu: Respect cpumask settings (ggml-org#16164)
  ggml : fix uninitialized is_on_grid in quantize_row_iq3_xxs_impl (ggml-org#15928)
  zdnn: refactor codebase + add docs (ggml-org#16178)
  codeowners : add @danbev to model-conversion example [no ci] (ggml-org#16190)
  devops: add s390x containers (ggml-org#15915)
  ggml-cpu : fix typo in gemm comments [no ci] (ggml-org#16189)
  feat: Add conversion support in GraniteHybrid for non-hybrid (all attn) (ggml-org#16177)
  clang-tidy : disable warning about performance enum size (ggml-org#16127)
  ggml : implement set_rows with i32 index (ggml-org#16159)
  codeowners : update + cleanup (ggml-org#16174)
  common : enable `--offline` mode without curl support (ggml-org#16137)
  webui : fix handling incomplete chunks (ggml-org#16107)
  embedding : fix typos in README (ggml-org#16171)
  common : remove unused local variables (ggml-org#16140)
  ggml : extend ggml_can_fuse to work with non-sequential nodes (ggml-org#16123)
  ggml : add ggml_op_is_empty (ggml-org#16122)
  codeowners : update ownership for @ngxson and @allozuar (ggml-org#16128)
  Vulkan: add conv_transpose_2d operation (ggml-org#16022)
  ...

gabe-l-hart requested a review from CISC as a code owner September 22, 2025 17:27

github-actions bot added the "python" label (python script changes) Sep 22, 2025

gabe-l-hart force-pushed the GraniteNonHybridConversion branch from 3584895 to bb98b31 Compare September 22, 2025 17:29

gabe-l-hart assigned CISC Sep 22, 2025

CISC reviewed Sep 22, 2025

CISC reviewed Sep 22, 2025

CISC reviewed Sep 22, 2025

gabe-l-hart force-pushed the GraniteNonHybridConversion branch from b7f9b8c to 2668925 Compare September 22, 2025 18:24

CISC approved these changes Sep 22, 2025

CISC merged commit 1d0125b into ggml-org:master Sep 22, 2025
7 checks passed

gabe-l-hart deleted the GraniteNonHybridConversion branch September 22, 2025 19:31
