Findings from ggml-org/llama.cpp#7062 and Discord chats:
Notebook for repro: https://colab.research.google.com/drive/1djwQGbEJtUEZo_OuqzN_JF6xSOUKhm4q?usp=sharing
- Unsloth + float16 + QLoRA = WORKS
- Unsloth + bfloat16 + QLoRA = WORKS
- Unsloth + bfloat16 + LoRA = WORKS
- Unsloth + float16 + QLoRA + GGUF-f16 = FAILS
- Unsloth + bfloat16 + LoRA + GGUF-f16 = FAILS
Todo: