llama.cpp : add documentation about rope_freq_base and scale values by slaren · Pull Request #3401 · ggml-org/llama.cpp

slaren · 2023-09-29T15:02:22Z

Previously, setting rope_freq_base to 10000 and rope_freq_scale to 1 in llama_context_params caused llama.cpp to use the model's default values. Now, to use the model's default values, both parameters must be set to zero.

Setting rope_freq_base to 10000 now overrides the value stored in the model, which will cause problems with models trained on a different value, such as CodeLlama-7B. Downstream users who set these parameters explicitly should therefore update their defaults to zero.

When using llama_context_default_params() to initialize llama_context_params without changing these parameters, no further action is required. llama_context_default_params() will correctly set the value of these parameters to zero.
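The rule described above can be sketched in isolation. This is a minimal illustration and not the actual llama.cpp code: `rope_request`, `resolve_rope_value`, and `check_rope_resolution` are hypothetical names, and the model-side base of 1e6 is an assumed value chosen only to stand in for a model trained with a base other than 10000.

```cpp
#include <cassert>

// Hypothetical stand-in for the two RoPE fields of llama_context_params.
struct rope_request {
    float freq_base;   // 0 means "use the value stored in the model"
    float freq_scale;  // 0 means "use the value stored in the model"
};

// Hypothetical helper: zero selects the model's value,
// any other value overrides it.
inline float resolve_rope_value(float requested, float model_value) {
    return requested == 0.0f ? model_value : requested;
}

// Illustrative check of both cases.
inline void check_rope_resolution() {
    const float model_base = 1000000.0f;       // assumed value, for illustration only

    const rope_request defaults = {0.0f, 0.0f};     // new-style defaults
    const rope_request legacy   = {10000.0f, 1.0f}; // old-style defaults

    // Zero defers to the model.
    assert(resolve_rope_value(defaults.freq_base, model_base) == model_base);
    // 10000 is now taken literally and overrides the model's value.
    assert(resolve_rope_value(legacy.freq_base, model_base) == 10000.0f);
}
```

A downstream binding that still hard-codes 10000 and 1 as its defaults would fall into the second case, which is why those defaults need to become zero.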

slaren · 2023-09-29T15:04:20Z

I noticed that this is an issue in llama-cpp-python: abetlen/llama-cpp-python#765

Downstream users should be careful to update the default values.

ggerganov

Maybe also add a hot topics entry given the impact and importance of this change


llama.cpp : add documentation about rope_freq_base and scale values

2486725

ggerganov approved these changes Sep 29, 2023


slaren added 3 commits September 29, 2023 18:18

add notice to hot topics

1e3781c

Update README.md

6d80a03

Update README.md

777dae5

slaren merged commit 40e07a6 into master Sep 29, 2023

slaren deleted the cparams-doc branch September 29, 2023 16:42

This was referenced Sep 29, 2023

Change defaults for ROPE scaling Josh-XT/AGiXT#1015

Merged

The default values for rope_freq_base and rope_freq_scale override the model values abetlen/llama-cpp-python#765

Closed

Fix rope scaling defaults abetlen/llama-cpp-python#767

Merged

jhen0409 added a commit to mybigday/llama.rn that referenced this pull request Oct 3, 2023

fix(android): default rope_freq_base / rope_freq_scale to 0 during re…

43e036e

…cent breaking change ref: ggml-org/llama.cpp#3401

yusiwen pushed a commit to yusiwen/llama.cpp that referenced this pull request Oct 7, 2023

llama.cpp : add documentation about rope_freq_base and scale values (g…

0d04abb

…gml-org#3401) * llama.cpp : add documentation about rope_freq_base and scale values * add notice to hot topics

slaren merged 4 commits into master from cparams-doc
