Name and Version
❯ ./build/bin/llama-server --version
version: 9495 (166fe29)
built with GNU 14.2.0 for Linux x86_64
Operating systems
Linux
GGML backends
CPU
Hardware
AMD Ryzen 9 7950X 16-Core Processor
Models
ggml-org/gemma-4-12B-it-GGUF
Problem description & steps to reproduce
I know this is bleeding edge and this model may not be supported yet. If so, just close this issue and I'll try again later.
Trying to load this model gives an error:
Thread 1 "llama-server" received signal SIGFPE, Arithmetic exception.
llama.cpp built from source with:
cmake -B build -DCMAKE_BUILD_TYPE=Debug
cmake --build build --config Debug -j $(nproc)
And run with:
llama-server -hf ggml-org/gemma-4-12B-it-GGUF
First Bad Commit
No response
Relevant log output
Thread 1 "llama-server" received signal SIGFPE, Arithmetic exception.
0x00007ffff57748ac in clip_graph::clip_graph (this=0x55555575e580, ctx=0x5555556a4a30, img=...) at /nas/achin/devel/llama.cpp/tools/mtmd/clip.cpp:250
250 d_head(n_embd / n_head),
(gdb) bt
#0 0x00007ffff57748ac in clip_graph::clip_graph (this=0x55555575e580, ctx=0x5555556a4a30, img=...) at /nas/achin/devel/llama.cpp/tools/mtmd/clip.cpp:250
#1 0x00007ffff577fda3 in clip_graph_gemma4uv::clip_graph_gemma4uv (this=0x55555575e580, ctx=0x5555556a4a30, img=...) at /nas/achin/devel/llama.cpp/tools/mtmd/models/models.h:22
#2 0x00007ffff579c730 in std::make_unique<clip_graph_gemma4uv, clip_ctx*&, clip_image_f32 const&> () at /usr/include/c++/14/bits/unique_ptr.h:1077
#3 0x00007ffff5776cd2 in clip_image_build_graph (ctx=0x5555556a4a30, imgs=...) at /nas/achin/devel/llama.cpp/tools/mtmd/clip.cpp:871
#4 0x00007ffff579a77f in clip_model_loader::reserve_compute_meta (ctx_clip=..., batch=...) at /nas/achin/devel/llama.cpp/tools/mtmd/clip.cpp:2816
#5 0x00007ffff579a06c in clip_model_loader::warmup (ctx_clip=..., batch=...) at /nas/achin/devel/llama.cpp/tools/mtmd/clip.cpp:2751
#6 0x00007ffff5799ddb in clip_model_loader::warmup (ctx_clip=...) at /nas/achin/devel/llama.cpp/tools/mtmd/clip.cpp:2742
#7 0x00007ffff5777638 in clip_init (
fname=0x5555555b6ca0 "/home/achin/.cache/huggingface/hub/models--ggml-org--gemma-4-12B-it-GGUF/snapshots/0f3915622134b2b6279d02f482cb12adc3d9ca3d/mmproj-gemma-4-12B-it-Q8_0.gguf",
ctx_params=...) at /nas/achin/devel/llama.cpp/tools/mtmd/clip.cpp:2980
#8 0x00007ffff56905b3 in mtmd_context::mtmd_context (this=0x5555556bbec0,
mmproj_fname=0x5555555b6ca0 "/home/achin/.cache/huggingface/hub/models--ggml-org--gemma-4-12B-it-GGUF/snapshots/0f3915622134b2b6279d02f482cb12adc3d9ca3d/mmproj-gemma-4-12B-it-Q8_0.gguf", text_model=0x0, ctx_params=..., no_alloc=false) at /nas/achin/devel/llama.cpp/tools/mtmd/mtmd.cpp:228
#9 0x00007ffff568dd98 in mtmd_get_memory_usage (
mmproj_fname=0x5555555b6ca0 "/home/achin/.cache/huggingface/hub/models--ggml-org--gemma-4-12B-it-GGUF/snapshots/0f3915622134b2b6279d02f482cb12adc3d9ca3d/mmproj-gemma-4-12B-it-Q8_0.gguf", ctx_params=...) at /nas/achin/devel/llama.cpp/tools/mtmd/mtmd.cpp:1604
#10 0x00007ffff7444564 in server_context_impl::load_model (this=0x5555556cf9f0, params=...) at /nas/achin/devel/llama.cpp/tools/server/server-context.cpp:789
#11 0x00007ffff741c9c2 in server_context::load_model (this=0x7fffffffb168, params=...) at /nas/achin/devel/llama.cpp/tools/server/server-context.cpp:3542
#12 0x00007ffff734acb5 in llama_server (argc=3, argv=0x7fffffffde08) at /nas/achin/devel/llama.cpp/tools/server/server.cpp:296
#13 0x0000555555556239 in main (argc=3, argv=0x7fffffffde08) at /nas/achin/devel/llama.cpp/tools/server/main.cpp:4
(gdb) print(n_embd)
$1 = 3840
(gdb) print(n_head)
$2 = 0
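The two values printed above explain the signal: `n_head` is 0, so `n_embd / n_head` at clip.cpp:250 is an integer division by zero, which x86_64 Linux reports as SIGFPE. Why `n_head` ends up as 0 for this mmproj file is not something I've tracked down. As a rough sketch of the kind of guard that would turn the crash into a clean error, here is a standalone example. `checked_head_dim` is a hypothetical helper of my own, not an existing llama.cpp function, and it only assumes both values are plain 32-bit integers:

```cpp
#include <cstdint>
#include <optional>

// Hypothetical helper (not part of llama.cpp): computes the per-head
// dimension, returning std::nullopt instead of dividing by zero when the
// head count is missing or invalid.
std::optional<int32_t> checked_head_dim(int32_t n_embd, int32_t n_head) {
    if (n_head <= 0) {
        return std::nullopt; // e.g. n_head == 0, as in the gdb session above
    }
    return n_embd / n_head;
}
```

A caller could check `has_value()` and report an unsupported or incomplete model rather than crashing during warmup.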