RuntimeError: Missing 'llama-quantize' File When Pushing to Hub in Unsloth #1905

Pushing a model to the Hugging Face Hub in GGUF format with `unsloth`'s `push_to_hub_gguf` raises a `RuntimeError` reporting that the required `llama-quantize` file is missing.

```python
model.push_to_hub_gguf(
    "aollm/qwen",
    tokenizer,
    quantization_method="q4_k_m",
    token=""
)
```

RuntimeError: Unsloth: The file ('llama.cpp/llama-quantize' or 'llama.cpp/llama-quantize.exe' if you are on Windows WSL) or 'llama.cpp/quantize' does not exist.
But we expect this file to exist! Maybe the llama.cpp developers changed the name or check extension of the llama-quantize file.
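
To narrow this down, it may help to check which of the paths named in the error actually exist relative to the current working directory. The snippet below is only a diagnostic sketch, not part of Unsloth: the helper `find_quantize_binary` and the `CANDIDATES` tuple are hypothetical names introduced here, and the three paths are copied from the error message above.

```python
from pathlib import Path

# Paths copied from the error message. They are relative, so the result
# depends on the directory the script or notebook is running in.
CANDIDATES = (
    "llama.cpp/llama-quantize",
    "llama.cpp/llama-quantize.exe",
    "llama.cpp/quantize",
)


def find_quantize_binary(root="."):
    """Return the first candidate path that exists under `root`, else None.

    Hypothetical helper for diagnosis only; it does not call Unsloth.
    """
    for rel in CANDIDATES:
        path = Path(root) / rel
        if path.is_file():
            return path
    return None


if __name__ == "__main__":
    found = find_quantize_binary()
    if found is None:
        print("No quantize binary found under", Path.cwd() / "llama.cpp")
    else:
        print("Found:", found)
```

If this reports that nothing was found, the `llama.cpp` directory is either absent from the working directory or was never built there, which would match the error.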

