Conversation

@Tomorrowdawn hi! I was wondering if it's a good idea to disable alphabet sampling (and other truncators) when nsigma is being used? We used to do this for mirostat as well, but it would disable all other samplers instead of just the truncators.

It's an honor to see this PR! nsigma and top-p/top-a/min-p definitely conflict, so combining them is not recommended. It doesn't conflict with top-k, though, since top-k only reduces the number of candidate tokens. I'm not sure what the practical requirements are, but technically this is how it works.

Correction: after more careful thought, the order matters. If you apply top-nsigma before any alphabet sampling, it won't cause errors; however, if you apply it after alphabet sampling, it will cause errors due to the negative infinities the truncators introduce (you will get a -inf mean and std).
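In numpy terms, the failure mode looks like this (a minimal sketch of the idea, not the PR's actual implementation):

```python
import numpy as np

def top_nsigma_mask(logits, n=1.0):
    # Keep tokens whose logit lies within n standard deviations of the max.
    return logits >= logits.max() - n * logits.std()

logits = np.array([5.0, 4.5, 1.0, 0.5])
mask = top_nsigma_mask(logits, n=1.0)  # [True, True, False, False]

# If a truncator runs first, discarded tokens are already -inf, and the
# statistics top-nsigma relies on become undefined:
truncated = np.array([5.0, 4.5, -np.inf, -np.inf])
with np.errstate(invalid="ignore"):
    bad_std = truncated.std()  # nan: the mean is -inf, so deviations are nan
```

So running top-nsigma first (on clean logits) is safe, while running it after a truncator poisons the mean/std it computes.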

The current order is penalties -> temperature -> top-n-sigma -> alphabet. We also allow users to apply temperature last if they wish. Do you recommend moving top-nsigma to before or after temperature? I had some users test this sampler offline, and they noted it performs better when temperature is applied before top-nsigma. I guess all that remains is whether to disable alphabet sampling when this one is applied -- it won't cause errors, but will it degrade generations?

It is the same before or after temperature scaling; see the temperature invariance result in Sec. 3.2. As for quality, I think the best choice is to let the user decide. All these methods are "denoising" the distribution, so the final result is bounded by the strictest sampler. At worst, it just tends toward greedy decoding.
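The invariance claim is easy to check numerically: dividing the logits by a positive temperature T scales both the max and the std by the same factor, so the set of surviving tokens is unchanged. A quick sketch, assuming the threshold form max − n·σ:

```python
import numpy as np

def nsigma_keep_set(logits, n=1.0):
    # Indices of tokens surviving the top-nsigma cut.
    return set(np.flatnonzero(logits >= logits.max() - n * logits.std()))

rng = np.random.default_rng(0)
logits = rng.normal(size=1000)
base = nsigma_keep_set(logits)

# Scaling by any positive temperature scales max and std identically,
# so the surviving set is the same -- unlike top-p, which is not invariant.
invariant = all(nsigma_keep_set(logits / t) == base for t in (0.25, 1.0, 4.0))
```

This is why the position relative to temperature only matters for the downstream softmax, not for which tokens get filtered.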

I suppose you're right. I'll add sampler ordering soon so users can control how this happens. Merging for now since it passes all tests.
Whoops, messed up the git command, but it's in 2242760 |
This PR adds the top-nσ sampling method from the paper "Top-nσ: Not All Logits Are You Need".
Top-nσ seems to be a novel sampling method that operates directly on pre-softmax logits by leveraging statistical properties of the distribution. Instead of using complex probability manipulations like top-p/top-k, it filters tokens based on their distance from the maximum logit in terms of standard deviations.
I haven't tested it much, but values should probably be between 0.0 (disabled) and 2.0.
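As a rough illustration of the method described above (a hedged sketch in numpy, not the code this PR adds), the filter-then-sample step might look like:

```python
import numpy as np

def top_nsigma_sample(logits, n=1.0, rng=None):
    # Sketch of the paper's idea: filter on raw pre-softmax logits, keeping
    # tokens within n standard deviations of the maximum logit, then softmax
    # over the survivors and sample.
    rng = rng or np.random.default_rng()
    logits = np.asarray(logits, dtype=np.float64)
    masked = np.where(logits >= logits.max() - n * logits.std(), logits, -np.inf)
    probs = np.exp(masked - masked.max())  # exp(-inf) == 0 drops filtered tokens
    probs /= probs.sum()
    return int(rng.choice(len(logits), p=probs))

# With n=1.0 on these logits only the first two tokens survive,
# so the sampled index is always 0 or 1.
token = top_nsigma_sample([5.0, 4.5, 1.0, 0.5], n=1.0,
                          rng=np.random.default_rng(1))
```

Smaller n is stricter (n = 0 collapses to greedy decoding), which matches the 0.0-to-2.0 range suggested above.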
TODO: