Add support to defrag ebuckets incrementally by sundb · Pull Request #13842 · redis/redis

sundb · 2025-03-04T13:44:56Z

In PR #13229, we introduced the ebucket for HFE.
Before this PR, when updating eitems stored in ebuckets, the lack of incremental fragmentation support for non-kvstore data structures (until PR #13814) meant that we had to reverse lookup the position of the eitem in the ebucket and then perform the update.
This approach was inefficient as it often required frequent traversals of the segment list to locate and update the item.

To address this issue, in this PR, This PR implements incremental fragmentation for hash dict ebuckets and server.hexpires.
By incrementally defrag the ebuckets, we also perform defragmentation for the associated items, eliminates the need for frequent traversals of the segment list for defragging the eitem.

src/defrag.c

src/ebuckets.c

src/ebuckets.h

Co-authored-by: Moti Cohen <moticless@gmail.com>

src/ebuckets.c

src/ebuckets.h

Co-authored-by: Moti Cohen <moticless@gmail.com>

moticless

LGTM

src/ebuckets.c

src/defrag.c

Co-authored-by: Moti Cohen <moticless@gmail.com>

snyk-io · 2025-05-14T13:37:25Z

🎉 Snyk checks have passed. No issues have been found so far.

✅ security/snyk check is complete. No issues have been found. (View Details)

✅ license/snyk check is complete. No issues have been found. (View Details)

Copilot

Pull Request Overview

This PR implements incremental defragmentation for hash dict ebuckets and server.hexpires to improve defragmentation efficiency and reduce costly segment traversals. Key changes include:

Updates to testing in tests/unit/memefficiency.tcl with revised assertions and new configurations.
Refactoring of defrag APIs in src/rax., src/module.c, and src/ebuckets. to include an additional privdata parameter.
Extensive modifications in src/defrag.c to add new defrag stages, including support for hexpires and hash fields with TTL.

Reviewed Changes

Copilot reviewed 7 out of 7 changed files in this pull request and generated 2 comments.

Show a summary per file

File	Description
tests/unit/memefficiency.tcl	Updated test procedures and expected results for defrag incremental support.
src/rax.h, src/rax.c	Updated callback signature to include privdata for defrag node callbacks.
src/module.c	Adjusted module defrag callbacks to account for the new signature.
src/ebuckets.h, src/ebuckets.c	Introduced new defrag APIs (e.g., ebScanDefrag) and refactored list/rax defrag.
src/defrag.c	Revised defrag strategies for hash objects, added hexpires defrag stages, and refined callbacks.

tests/unit/memefficiency.tcl

src/defrag.c

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

This PR fixes three defrag issues. 1. Fix the issue that forget to update cgroup_ref_node when the consume group was reallocated. This crash was introduced by #14130 In this PR, when performing defragmentation on `s->cgroups` using `defragRadixTree()`, we no longer rely on the automatic data defragmentation of `defragRadixTree()`. Instead, we manually defragment the consumer group and then update its reference in `s->cgroups`. 2. Fix a use-after-free issue caused by updating dictionary keys after HFE key is reallocated. This issue was introduced by #13842 3. Fix the issue that forgot to be updated NextSegHdr->firstSeg when the first segment was reallocated. This issue was introduced by #13842 --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

From the malloc-stats reports of both failures and successes, we can see that the additional fragments mainly come from bin24. By analyzing the fragments mainly from the entries of the dict, since `large_ebrax` test uses a dictionary with 1600 elements, it will move a large number of entries during the rehashing process, and we will not perform defragmentation on the dict entries. In #13842 we changed to use two dicts alternately to generate frag. Normally, the entries should also alternate, but rehashing disrupted this, which resulted in bin24 frag that can't be defragged. ## Solution In this PR, the length of a single dictionary was reduced from 1600 to 500 to avoid excessive rehashing, and the threshold was also lowered. --------- Co-authored-by: oranagra <oran@redislabs.com>

This PR fixes three defrag issues. 1. Fix the issue that forget to update cgroup_ref_node when the consume group was reallocated. This crash was introduced by redis#14130 In this PR, when performing defragmentation on `s->cgroups` using `defragRadixTree()`, we no longer rely on the automatic data defragmentation of `defragRadixTree()`. Instead, we manually defragment the consumer group and then update its reference in `s->cgroups`. 2. Fix a use-after-free issue caused by updating dictionary keys after HFE key is reallocated. This issue was introduced by redis#13842 3. Fix the issue that forgot to be updated NextSegHdr->firstSeg when the first segment was reallocated. This issue was introduced by redis#13842 --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

sundb added 11 commits March 4, 2025 21:43

Add support to defrag ebuckets incremental

2fc548d

Merge branch 'unstable' into incrementail_defrag_ebuckets

11fe3e6

Fix conflict

be60ace

Avoid compiler warning with old build chain

4f130a7

defrag ebucket incremental later

547f2fe

spell

98aeb1c

Fix setting a wrong hash defrag phase

419333f

Add comments

0820cde

Refine activeDefragHExpiresStringOB method name

41f83cd

Add comment and revert a change for test

6827b8f

Make test to cover hashtable hash

6a6ddaf

sundb marked this pull request as ready for review March 6, 2025 03:08

sundb added 7 commits March 6, 2025 16:05

Refine ebDefragRax()

3a15feb

Should have the hash field create the frags, not hash val

17d8cf0

Make the same change to listpack as above

b6b5f93

Rename var

916abef

Fix TODO

2755b2a

Save next node instead of current node for rax defrag

223c243

Refine comment

a644f54

moticless reviewed Mar 11, 2025

View reviewed changes

src/defrag.c Outdated Show resolved Hide resolved

moticless reviewed Mar 11, 2025

View reviewed changes

src/ebuckets.c Outdated Show resolved Hide resolved

moticless reviewed Mar 11, 2025

View reviewed changes

src/ebuckets.c Outdated Show resolved Hide resolved

src/ebuckets.c Outdated Show resolved Hide resolved

src/ebuckets.c Outdated Show resolved Hide resolved

moticless reviewed Mar 11, 2025

View reviewed changes

src/ebuckets.h Outdated Show resolved Hide resolved

sundb and others added 4 commits March 12, 2025 14:55

Update hfield ref directly when it doesn't has TTL

175115f

Co-authored-by: Moti Cohen <moticless@gmail.com>

Refine ebDefrag()

b1395b6

Co-authored-by: Moti Cohen <moticless@gmail.com>

Rename last to next

5f30cf9

Rename ebDefrag to ebScanDefrag

3280640

moticless reviewed Mar 12, 2025

View reviewed changes

sundb and others added 2 commits March 12, 2025 17:39

Fix CRs

417daa8

Co-authored-by: Moti Cohen <moticless@gmail.com>

Revert unnecessary change

870ed5d

sundb added 6 commits May 12, 2025 18:06

Merge branch 'unstable' into incrementail_defrag_ebuckets

75bdf68

Fix complain

5ab7de4

Fix merge issue

d1827e0

Revert change

f33bce9

Fix test

f15613d

relax threshold

119b282

sundb requested a review from moticless May 13, 2025 07:11

sundb added 5 commits May 13, 2025 16:01

Fix updating the hfield key

afed064

Refine activeDefragHfieldDictCallback

55ff26f

Defrag the rax structure of ebucket

0fc52bc

Revert changes

baf8d2e

Merge branch 'unstable' into incrementail_defrag_ebuckets

7d71723

moticless approved these changes May 14, 2025

View reviewed changes

src/ebuckets.c Outdated Show resolved Hide resolved

src/defrag.c Outdated Show resolved Hide resolved

sundb and others added 2 commits May 14, 2025 21:37

Update src/defrag.c

d25d86c

Co-authored-by: Moti Cohen <moticless@gmail.com>

Update src/ebuckets.c

07a4691

Co-authored-by: Moti Cohen <moticless@gmail.com>

sundb requested a review from Copilot May 16, 2025 01:45

Copilot AI reviewed May 16, 2025

View reviewed changes

tests/unit/memefficiency.tcl Outdated Show resolved Hide resolved

src/defrag.c Show resolved Hide resolved

Update tests/unit/memefficiency.tcl

0d1a689

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

sundb added the state:to-be-merged The PR should be merged soon, even if not yet ready, this is used so that it won't be forgotten label May 17, 2025

Indentation

472bcb5

sundb merged commit 5d0d64b into redis:unstable May 18, 2025
18 checks passed

github-project-automation bot moved this from Todo to Done in Redis 8.2 May 18, 2025

sundb removed the state:to-be-merged The PR should be merged soon, even if not yet ready, this is used so that it won't be forgotten label May 18, 2025

sundb deleted the incrementail_defrag_ebuckets branch August 7, 2025 02:08

sundb mentioned this pull request Sep 4, 2025

Fix defrag issues for stream defrag and HFE #14323

Merged

sundb mentioned this pull request Sep 9, 2025

Fix Active Defrag HFE with large_ebrax test #14344

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support to defrag ebuckets incrementally#13842

Add support to defrag ebuckets incrementally#13842
sundb merged 47 commits intoredis:unstablefrom
sundb:incrementail_defrag_ebuckets

sundb commented Mar 4, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

moticless left a comment

Uh oh!

Uh oh!

Uh oh!

snyk-io bot commented May 14, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

sundb commented Mar 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

moticless left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

snyk-io bot commented May 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🎉 Snyk checks have passed. No issues have been found so far.

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

sundb commented Mar 4, 2025 •

edited

Loading

snyk-io bot commented May 14, 2025 •

edited

Loading