Add --parse-special for enabling parsing of special tokens in imatrix calculation by bartowski1182 · Pull Request #13389 · ggml-org/llama.cpp

bartowski1182 · 2025-05-08T19:33:00Z

There's been talks lately of trying to parse special tokens with imatrix tokenization to try to yield better overall results

I'm not positive yet if this will actually provide a true benefit, but I think it's definitely worth fully investigating, and adding a flag for it will make it far easier for everyone who wants to try parsing special tokens to do so

… calculation

ngxson

~~I think we can repurpose the --special flag~~

No this looks good, thanks!

* origin/master: (39 commits) server : vision support via libmtmd (ggml-org#12898) sycl : implementation of reordered Q4_0 MMVQ for Intel GPUs (ggml-org#12858) metal : optimize MoE for large batches (ggml-org#13388) CUDA: FA support for Deepseek (Ampere or newer) (ggml-org#13306) llama : do not crash if there is no CPU backend (ggml-org#13395) CUDA: fix crash on large batch size for MoE models (ggml-org#13384) imatrix : Add --parse-special for enabling parsing of special tokens in imatrix calculation (ggml-org#13389) llama-run: add support for downloading models from ModelScope (ggml-org#13370) mtmd : fix batch_view for m-rope (ggml-org#13397) llama : one-off chat template fix for Mistral-Small-2503 (ggml-org#13398) rpc : add rpc_msg_set_tensor_hash_req (ggml-org#13353) vulkan: Allow up to 4096 elements for mul_mat_id row_ids (ggml-org#13326) server : (webui) rename has_multimodal --> modalities (ggml-org#13393) ci : limit write permission to only the release step + fixes (ggml-org#13392) mtmd : Expose helper_decode_image_chunk (ggml-org#13366) server : (webui) fix a very small misalignment (ggml-org#13387) server : (webui) revamp the input area, plus many small UI improvements (ggml-org#13365) convert : support rope_scaling type and rope_type (ggml-org#13349) mtmd : fix the calculation of n_tokens for smolvlm (ggml-org#13381) context : allow cache-less context for embeddings (ggml-org#13108) ...

Add --parse-special for enabling parsing of special tokens in imatrix…

cbf7e25

… calculation

github-actions bot added the examples label May 8, 2025

whitespace

e4bd553

ngxson approved these changes May 9, 2025

View reviewed changes

ngxson merged commit efb8b47 into ggml-org:master May 9, 2025
44 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add --parse-special for enabling parsing of special tokens in imatrix calculation#13389

Add --parse-special for enabling parsing of special tokens in imatrix calculation#13389
ngxson merged 2 commits intoggml-org:masterfrom
bartowski1182:special_imatrix

bartowski1182 commented May 8, 2025

Uh oh!

ngxson left a comment •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

bartowski1182 commented May 8, 2025

Uh oh!

ngxson left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ngxson left a comment •

edited

Loading