Error saving 4-bit quantized version of trained model

Setup: GeForce 4060, 16 GB RAM, Windows.
I have trained an 8B DeepSeek model and am trying to save it:

```
model.save_pretrained_gguf("model", tokenizer, quantization_method = "q4_k_m")
```

However, the save does not complete. Only a 7 MB `unsloth.BF16.gguf` file is written (progress has reached at most 6% before the script moves on to the next line), and no 4-bit quantized file is produced.

![Image](https://github.com/user-attachments/assets/153dd12c-be3c-43be-845e-12c76cddb468)

![Image](https://github.com/user-attachments/assets/599d13f9-2ecf-43d1-a335-628927447702)

I can, however, save the LoRA adapters properly. Is there any way I can combine them with the base model manually?
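
For reference, a manual merge might look like the sketch below. It is untested on this setup and assumes that `peft`, `transformers`, and `torch` are installed, that the adapters were saved in the standard PEFT format, and that a local llama.cpp checkout provides `convert_hf_to_gguf.py` and the `llama-quantize` binary. The function names, directory names, and output file names are hypothetical. Loading an 8B model in bf16 needs roughly 16 GB of memory, so this may be tight on the machine described above.

```python
def merge_lora(base_id: str, adapter_dir: str, merged_dir: str) -> None:
    """Fold LoRA weights into the base model and save a plain 16-bit checkpoint."""
    # Imported lazily so the command helper below works without the ML stack.
    import torch
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # Load the base model in bf16 rather than 4-bit, so the adapters are
    # merged into unquantized weights.
    base = AutoModelForCausalLM.from_pretrained(base_id, torch_dtype=torch.bfloat16)
    model = PeftModel.from_pretrained(base, adapter_dir)
    merged = model.merge_and_unload()  # base model with the adapters folded in
    merged.save_pretrained(merged_dir, safe_serialization=True)
    AutoTokenizer.from_pretrained(base_id).save_pretrained(merged_dir)


def gguf_commands(merged_dir: str, quant: str = "Q4_K_M") -> list[list[str]]:
    """Build llama.cpp commands: HF checkpoint -> bf16 GGUF -> quantized GGUF."""
    # Output file names are illustrative, not what Unsloth itself would write.
    bf16_path = f"{merged_dir}/model.BF16.gguf"
    quant_path = f"{merged_dir}/model.{quant}.gguf"
    return [
        ["python", "convert_hf_to_gguf.py", merged_dir,
         "--outfile", bf16_path, "--outtype", "bf16"],
        ["llama-quantize", bf16_path, quant_path, quant],
    ]
```

The idea would be to call `merge_lora(...)` with the base model ID and the saved adapter directory, then run the two commands returned by `gguf_commands(...)` from the llama.cpp directory (for example with `subprocess.run(cmd, check=True)`), which separates the merge step from the GGUF conversion that is failing here.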

Error saving 4-bit quantized version of trained model #1917