GGUF breaks - llama-3

Findings from https://github.com/ggerganov/llama.cpp/issues/7062 and Discord chats:
Notebook for repro: https://colab.research.google.com/drive/1djwQGbEJtUEZo_OuqzN_JF6xSOUKhm4q?usp=sharing

1. Unsloth +   float16 + QLoRA = WORKS
2. Unsloth + bfloat16 + QLoRA = WORKS
3. Unsloth + bfloat16 +    LoRA = WORKS
4. Unsloth +   float16 + QLoRA + GGUF-f16 = FAILS
5. Unsloth + bfloat16 +    LoRA + GGUF-f16 = FAILS

Todo:
- [ ] HF directly + float16 + QLoRA + GGUF-f16
- [x] HF directly + float16 +    LoRA + GGUF-f16


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

GGUF breaks - llama-3 #430

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Uh oh!

GGUF breaks - llama-3 #430

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions