Improve active defrag in jemalloc 5.2 #9778

oranagra · 2021-11-14T16:12:14Z

Background:
Following the upgrade to jemalloc 5.2, there was a test that used to be flaky and started failing consistently (on 32bit), so we disabled it (see #9645).

This is a test that i introduced in #7289 when i attempted to solve a rare stagnation problem, and it later turned out i failed to solve it, ans what's more i added a test that caused it to be not so rare, and as i mentioned, now in jemalloc 5.2 it became consistent on 32bit.

Stagnation can happen when all the slabs of the bin are equally utilized, so the decision to move an allocation from a relatively empty slab to a relatively full one, will never happen, and in that test all the slabs are at 50% utilization, so the defragger could just keep scanning the keyspace and not move anything.

What this PR changes:

First, finally in jemalloc 5.2 we have the count of non-full slabs, so when we compare the utilization of the current slab, we can compare it to the average utilization of the non-full slabs in our bin, instead of the total average of our bin. this takes the full slabs out of the game, since they're not candidates for migration (neither source nor target).
Secondly, We add some 12% (100/8) to the decision to defrag an allocation, this is the part that aims to avoid stagnation, and it's especially important since the above mentioned change can get us closer to stagnation.
Thirdly, since jemalloc 5.2 adds sharded bins, we take into account all shards (something that's missing from the original PR that merged it), this isn't expected to make any difference since anyway there should be just one shard.

How this was benchmarked.
What i did was run the memefficiency test unit with --verbose and compare the defragger hits and misses the tests reported.
At first, when i took into consideration only the non-full slabs, it got a lot worse (i got into stagnation, or just got a lot of misses and a lot of hits), but when i added the 10% i got back to results that were slightly better than the ones of the jemalloc 5.1 branch. i.e. full defragmentation was achieved with fewer hits (relocations), and fewer misses (keyspace scans).

deps/jemalloc/include/jemalloc/internal/jemalloc_internal_inlines_c.h

…nes_c.h Co-authored-by: yoav-steinberg <yoav@monfort.co.il>

Background: Following the upgrade to jemalloc 5.2, there was a test that used to be flaky and started failing consistently (on 32bit), so we disabled it (see redis#9645). This is a test that i introduced in redis#7289 when i attempted to solve a rare stagnation problem, and it later turned out i failed to solve it, ans what's more i added a test that caused it to be not so rare, and as i mentioned, now in jemalloc 5.2 it became consistent on 32bit. Stagnation can happen when all the slabs of the bin are equally utilized, so the decision to move an allocation from a relatively empty slab to a relatively full one, will never happen, and in that test all the slabs are at 50% utilization, so the defragger could just keep scanning the keyspace and not move anything. What this PR changes: * First, finally in jemalloc 5.2 we have the count of non-full slabs, so when we compare the utilization of the current slab, we can compare it to the average utilization of the non-full slabs in our bin, instead of the total average of our bin. this takes the full slabs out of the game, since they're not candidates for migration (neither source nor target). * Secondly, We add some 12% (100/8) to the decision to defrag an allocation, this is the part that aims to avoid stagnation, and it's especially important since the above mentioned change can get us closer to stagnation. * Thirdly, since jemalloc 5.2 adds sharded bins, we take into account all shards (something that's missing from the original PR that merged it), this isn't expected to make any difference since anyway there should be just one shard. How this was benchmarked. What i did was run the memefficiency test unit with `--verbose` and compare the defragger hits and misses the tests reported. At first, when i took into consideration only the non-full slabs, it got a lot worse (i got into stagnation, or just got a lot of misses and a lot of hits), but when i added the 10% i got back to results that were slightly better than the ones of the jemalloc 5.1 branch. i.e. full defragmentation was achieved with fewer hits (relocations), and fewer misses (keyspace scans).

jemalloc 5.2 defrag fix

6df9034

oranagra requested a review from yoav-steinberg November 14, 2021 16:12

oranagra linked an issue Nov 14, 2021 that may be closed by this pull request

fix test "Active defrag edge case" in 32bit with new jemalloc #9686

Closed

yoav-steinberg reviewed Nov 15, 2021

View reviewed changes

deps/jemalloc/include/jemalloc/internal/jemalloc_internal_inlines_c.h Outdated Show resolved Hide resolved

Update deps/jemalloc/include/jemalloc/internal/jemalloc_internal_inli…

e8d8dab

…nes_c.h Co-authored-by: yoav-steinberg <yoav@monfort.co.il>

oranagra requested a review from yossigo November 15, 2021 07:53

yoav-steinberg approved these changes Nov 15, 2021

View reviewed changes

oranagra added the 7.0-must-have label Nov 17, 2021

yossigo approved these changes Nov 21, 2021

View reviewed changes

oranagra merged commit d4e7ffb into redis:unstable Nov 21, 2021

oranagra deleted the fix_defrag_for_jemalloc_5.2 branch November 21, 2021 11:35

yoav-steinberg mentioned this pull request Apr 21, 2022

Improved active-defrag #10586

Open

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Improve active defrag in jemalloc 5.2 #9778

Improve active defrag in jemalloc 5.2 #9778

Uh oh!

oranagra commented Nov 14, 2021

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Improve active defrag in jemalloc 5.2 #9778

Improve active defrag in jemalloc 5.2 #9778

Uh oh!

Conversation

oranagra commented Nov 14, 2021

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants