Pushing a model to the Hugging Face Hub as GGUF with Unsloth's push_to_hub_gguf raises a RuntimeError saying that the required llama-quantize file is missing.
model.push_to_hub_gguf(
    "aollm/qwen",
    tokenizer,
    quantization_method="q4_k_m",
    token=""
)
RuntimeError: Unsloth: The file ('llama.cpp/llama-quantize' or 'llama.cpp/llama-quantize.exe' if you are on Windows WSL) or 'llama.cpp/quantize' does not exist.
But we expect this file to exist! Maybe the llama.cpp developers changed the name or check extension of the llama-quantize file.
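To narrow this down, it may help to check which of the three paths named in the error message actually exist. The sketch below is only a diagnostic: `find_quantize_binary` is a hypothetical helper, not part of Unsloth, and it assumes the paths are resolved relative to the current working directory, which the error message suggests but does not confirm.

```python
from pathlib import Path

# Candidate paths copied verbatim from the error message.
# Assumption: they are relative to the directory the script runs from.
CANDIDATES = (
    "llama.cpp/llama-quantize",
    "llama.cpp/llama-quantize.exe",
    "llama.cpp/quantize",
)


def find_quantize_binary(base_dir="."):
    """Return the first candidate that exists as a file under base_dir, else None."""
    for relative in CANDIDATES:
        candidate = Path(base_dir) / relative
        if candidate.is_file():
            return candidate
    return None


if __name__ == "__main__":
    found = find_quantize_binary()
    print(found if found else "No quantize binary found under ./llama.cpp")
```

If this reports that nothing was found, the llama.cpp directory was probably never built, or the build placed the binary somewhere other than these three locations.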