Add Q1_0 as new GGUF type by khosravipasha · Pull Request #2077 · huggingface/huggingface.js

khosravipasha · 2026-04-06T21:19:25Z

This is to add support to Q1_0 newly added GGUF type.
We just released 3 models in this format as 1-bit Bonsai (see more info here)

And this PR is to show correct naming on the hugging-face gguf website tab:

https://huggingface.co/prism-ml/Bonsai-8B-gguf

PR that merged Q1_0: ggml-org/llama.cpp#21273

Note

Low Risk
Low risk: adds a new GGUF quantization enum/value mapping and updates ordering/regex inputs; main risk is mis-numbering or ordering causing incorrect quant label parsing or size calculations.

Overview
Adds support for the newly introduced GGUF Q1_0 1-bit quantization.

This extends quant metadata to include a human-readable description/source link and a bits-per-weight size calculation, and updates the tasks-side GGUF quant enums and quant ordering lists so Q1_0 is recognized when parsing/labeling and when selecting the nearest available quant.

^{Reviewed by Cursor Bugbot for commit a4007ba. Bugbot is set up for automated code reviews on this repo. Configure here.}

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

^{❌ Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, enable autofix in the Cursor dashboard.}

^{Reviewed by Cursor Bugbot for commit 492aa15. Configure here.}

CISC · 2026-04-06T22:56:13Z

Remember to update your GGUFs filetype to the new enum (maybe possible to do directly in the HF GGUF Editor?).

khosravipasha · 2026-04-06T23:25:12Z

Fixed ordering.

@CISC good point, which enum is this based on the on, llama_ftype? https://github.com/ggml-org/llama.cpp/blob/d0a6dfeb28a09831d904fc4d910ddb740da82834/include/llama.h#L116

I can edit and re-upload models probably, don't see an option to edit in the UI.

For this one Q1_0 with group size 128 in our fork which I originally made the models from was 41 general.file_type | 41, in main llama ended up at 40 since we removed the extra type and only went with Q1_0.

CISC · 2026-04-07T07:38:28Z

@CISC good point, which enum is this based on the on, llama_ftype? https://github.com/ggml-org/llama.cpp/blob/d0a6dfeb28a09831d904fc4d910ddb740da82834/include/llama.h#L116

Yes.

I can edit and re-upload models probably, don't see an option to edit in the UI.

If you click the GGUF file there should be an edit link at the top.

mishig25 · 2026-04-07T08:55:08Z

If you click the GGUF file there should be an edit link at the top.

yep, when you go to https://huggingface.co/prism-ml/Bonsai-8B-gguf/blob/main/Bonsai-8B.gguf, you should see GGUF Editor

mishig25 · 2026-04-07T09:00:59Z

for the hf model page gguf section to work, could you also rename Bonsai-8B.gguf to Bonsai-8B-Q1_0.gguf

hf model page uses a valid gguf type name suffix to present the available gguf quants on the page. Example: unsloth/GLM-4.7-Flash-GGUF

https://huggingface.co/unsloth/GLM-4.7-Flash-GGUF	https://huggingface.co/unsloth/GLM-4.7-Flash-GGUF/tree/main

khosravipasha · 2026-04-07T17:16:22Z

Oh nice good feature for gguf editor, had to go the gguf itself was looking somewhere else. Editted to 40 for all 3 Q1_0 ggufs we have.

For adding Q1_0 suffix Bonsai-8B-Q1_0.gguf is that required? We only have one format at the moment.
Model has been downloaded a lot already and some apps using it so don't want break things. I can add the correct suffix with our next releases.

Oh so the website might use the suffix name to decide the type? In that case I can try uploading same copy as Bonsai-8B-Q1_0.gguf, something like that. Or rename it and notify people to update their model URL/name.

julien-c · 2026-04-07T17:19:30Z

Model has been downloaded a lot already and some apps using it so don't want break things.

if it's just a file rename most clients won't re-download an identical file (at least the HF clients shouldn't – https://huggingface.co/docs/hub/local-cache)

khosravipasha · 2026-04-07T19:51:21Z

Thanks, this is good. I will look into renaming the models then. Need to update our demo code and give heads up to few apps that are hosting the model to make sure the file name is not hardcoded. Otherwise should be okay.

There is a Test / Browser CI failing, not sure if issue caused by this PR.

mishig25 · 2026-04-08T08:48:04Z

one of the benefits of following this "gguf type suffx" naming standard is that: if your repository has multiple quants, it makes it possible to select from your llama-server/ollama/etc

example: https://huggingface.co/unsloth/GLM-4.7-Flash-GGUF?local-app=llama.cpp

mishig25 · 2026-04-08T08:48:11Z

There is a Test / Browser CI failing, not sure if issue caused by this PR.

is not related

khosravipasha · 2026-04-08T17:06:07Z

@mishig25 good idea thanks. We might just upload a duplicate file with -Q1_0 suffix to not break current apps that are using the old name. Will do more testing after this changes are deployed.

mishig25 · 2026-04-10T10:27:13Z

@khosravipasha everything is deployed to prod fomr hf side. Once you "upload a duplicate file with -Q1_0 suffix", you will see the image below on your model page 🙌

Add Q1_0

492aa15

khosravipasha requested review from SBrandeis, Wauplin, gary149, julien-c, mishig25, ngxson and pcuenca as code owners April 6, 2026 21:19

cursor Bot reviewed Apr 6, 2026

View reviewed changes

Comment thread packages/tasks/src/gguf.ts Outdated

fix order

a4007ba

mishig25 approved these changes Apr 8, 2026

View reviewed changes

mishig25 merged commit e24d628 into huggingface:main Apr 8, 2026
5 of 6 checks passed

Conversation

khosravipasha commented Apr 6, 2026 • edited by mishig25 Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

cursor Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

CISC commented Apr 6, 2026

Uh oh!

khosravipasha commented Apr 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

CISC commented Apr 7, 2026

Uh oh!

mishig25 commented Apr 7, 2026

Uh oh!

mishig25 commented Apr 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

khosravipasha commented Apr 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

julien-c commented Apr 7, 2026

Uh oh!

khosravipasha commented Apr 7, 2026

Uh oh!

mishig25 commented Apr 8, 2026

Uh oh!

mishig25 commented Apr 8, 2026

Uh oh!

Uh oh!

khosravipasha commented Apr 8, 2026

Uh oh!

mishig25 commented Apr 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

khosravipasha commented Apr 6, 2026 •

edited by mishig25

Loading

khosravipasha commented Apr 6, 2026 •

edited

Loading

mishig25 commented Apr 7, 2026 •

edited

Loading

khosravipasha commented Apr 7, 2026 •

edited

Loading