
Add script to convert GGMLv3 LLaMA models to GGUF#2682

Merged
ggerganov merged 11 commits into ggml-org:gguf from KerfuffleV2:feat-convert-ggml-to-gguf on Aug 21, 2023

Conversation

KerfuffleV2 (Contributor) commented Aug 20, 2023

Currently in a pretty reasonable state. Testing/feedback would be appreciated.

The converted file was verified to tokenize these prompts to the same tokens as pre-GGUF llama.cpp:

  1. 你喜欢小狗吗? ("Do you like puppies?")
  2. Once upon a time, in a dark forest, there lived a little fox

I also tested these models with the second prompt:

  1. Random LLaMA1 7B
  2. openorca-platypus2-13b.ggmlv3.q5_K_M.bin
  3. gplatty-30b-superhot-8k.ggmlv3.q4_K_M.bin
  4. platypus2-70b-instruct.ggmlv3.q4_K_M.bin

With a fixed seed, generation was identical to loading the original GGML file with pre-GGUF llama.cpp.

Note: When testing, be sure to specify --eps and --gqa as appropriate for the model. You'll probably also want to specify --context-length (it defaults to 2048).
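These flags are needed because a GGMLv3 file does not store the RMS-norm epsilon or the grouped-query-attention factor, so the converter has to take them on the command line. A minimal sketch of such a flag parser (the flag names come from the note above; the --eps default and the n_head_kv derivation are assumptions for illustration, not the script's actual defaults):

```python
import argparse

def build_parser() -> argparse.ArgumentParser:
    """Sketch of the converter's CLI, not the real script's argument list."""
    p = argparse.ArgumentParser(
        description="Convert a GGMLv3 LLaMA model to GGUF (sketch)")
    p.add_argument("--eps", type=float, default=1e-6,  # assumed default
                   help="RMS-norm epsilon; not stored in GGMLv3, "
                        "so pass the value the model was trained with")
    p.add_argument("--gqa", type=int, default=1,
                   help="grouped-query-attention factor "
                        "(e.g. 8 for a LLaMA2 70B model)")
    p.add_argument("--context-length", type=int, default=2048,
                   help="training context length to record in GGUF metadata")
    return p

# Example: converting a 70B model, which uses GQA with 8 KV head groups.
args = build_parser().parse_args(["--gqa", "8"])
n_head = 64                       # attention heads in LLaMA2 70B
n_head_kv = n_head // args.gqa    # KV heads the GGUF metadata would record
```

Under this sketch, --gqa 8 on a 64-head model yields 8 KV heads, which is why the flag must match the model being converted.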

Edit: the converter can now take metadata, such as the vocabulary, from HF or "original" format models when converting. Some information about this and the current state of the pull: #2682 (comment)

Some perplexity results here: #2682 (comment)
