Clone pending updates in buffered storages #7801
Conversation
timvisee left a comment

(review ongoing, here are my remarks thus far)
```rust
let mut delete_up_to = 0;
for (pending, persisted) in pending.iter().zip(changes) {
    if pending != persisted {
        // This should not happen, since flushers are supposed to be
        // requested->executed one at a time, but if it does, it should be fine.
        //
        // The only consequence is to persist some operations again.
        break;
    }
    delete_up_to += 1;
}

*pending = pending.split_off(delete_up_to);
```
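For reference, the trimming loop above can be expressed as a small standalone function. This is a hypothetical simplification (plain `Vec<u32>` instead of the real guard types), using `drain` rather than `split_off` for the same effect:

```rust
/// Drop the longest common prefix of `pending` and `persisted` from `pending`.
/// Hypothetical sketch of the reconciliation under review.
fn reconcile(pending: &mut Vec<u32>, persisted: &[u32]) {
    let delete_up_to = pending
        .iter()
        .zip(persisted)
        .take_while(|(p, c)| p == c)
        .count();
    pending.drain(..delete_up_to);
}

fn main() {
    // Two updates arrived after the flusher cloned [1, 2, 3].
    let mut pending = vec![1, 2, 3, 4, 5];
    reconcile(&mut pending, &[1, 2, 3]);
    assert_eq!(pending, vec![4, 5]);
}
```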
We can probably use `retain` here too. As I remember, it is very efficient on diffs like this, and it prevents us having to do this index magic.
Also, this indexed approach will break if we have concurrent flushers. We shouldn't have that, but you never know what happens in the future.
I am not sure about this one, since these are vecs representing a log.
If we have a sequence like:
changes: [1, 2, 3]
pending_updates: [1, 2, 3, 4, 5, 2]
and we do something like `.retain(|pending| !changes.contains(pending))`, we will end up with
pending_updates: [4, 5]
which will lose the latest mapping.
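To make this concrete, here is a minimal runnable repro of the example above (hypothetical values):

```rust
fn main() {
    let changes = vec![1, 2, 3];
    let mut pending_updates = vec![1, 2, 3, 4, 5, 2];

    // `retain` drops every occurrence of a persisted value,
    // including the trailing 2 that holds the latest mapping.
    pending_updates.retain(|pending| !changes.contains(pending));

    assert_eq!(pending_updates, vec![4, 5]); // the final 2 is gone
}
```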
You're right. It is a log. We should keep the same ordering, and `retain` breaks that.
The good thing is that these are idempotent. It means that if a flush didn't clear anything from the pending updates list, it's fine if we apply it again in the next flush iteration.
I do have something in mind that makes it more resilient against concurrent flushers though. I might implement that in the future.
My idea is that the beginnings of the lists don't have to match.
- pending_updates: [1, 2, 3]
- flusher 1 created: [1, 2, 3]
- update comes in
- pending_updates: [1, 2, 3, 4]
- flusher 2 created: [1, 2, 3, 4]
- flusher 1 flushes: [1, 2, 3]
  - pending_updates: [4]
- flusher 2 flushes: [1, 2, 3, 4]
  - pending_updates: [] (first three were missing)
Or a slightly more complicated example:
- pending_updates: [1, 2, 3]
- flusher 1 created: [1, 2, 3]
- update comes in
- pending_updates: [1, 2, 3, 4]
- flusher 2 created: [1, 2, 3, 4]
- update comes in
- pending_updates: [1, 2, 3, 4, 5]
- flusher 1 flushes: [1, 2, 3]
  - pending_updates: [4, 5]
- flusher 2 flushes: [1, 2, 3, 4]
  - pending_updates: [5] (first three were missing)
And maybe we have to enforce that flushers are also applied in the same order, because:
- pending_updates: [1, 2, 3]
- flusher 1 created: [1, 2, 3]
- update comes in
- pending_updates: [1, 2, 3, 4]
- flusher 2 created: [1, 2, 3, 4]
- flusher 2 flushes: [1, 2, 3, 4]
  - pending_updates: []
- flusher 1 flushes: [1, 2, 3] (⚠️ we now don't write 4 at the end!)
  - pending_updates: []
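One way to sketch the suffix-matching idea described here (hypothetical code, not the PR's implementation; it still assumes flushers complete in creation order to avoid the ⚠️ case above):

```rust
/// Drop from `pending` the longest suffix of `persisted` that appears
/// as a prefix of `pending`. Hypothetical sketch: with duplicate values
/// in the log this heuristic could still mis-match.
fn reconcile(pending: &mut Vec<u32>, persisted: &[u32]) {
    for start in 0..=persisted.len() {
        let suffix = &persisted[start..];
        if pending.len() >= suffix.len() && pending[..suffix.len()] == *suffix {
            pending.drain(..suffix.len());
            return;
        }
    }
}

fn main() {
    // Flusher 1 already flushed [1, 2, 3], so pending is [4, 5];
    // flusher 2 now flushes its older clone [1, 2, 3, 4].
    let mut pending = vec![4, 5];
    reconcile(&mut pending, &[1, 2, 3, 4]);
    assert_eq!(pending, vec![5]);
}
```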
timvisee left a comment
Pre-approval for when all review remarks are resolved.
Then we can merge and test over night.
bbe2952 to b62d3ed
I'm afraid this still gets missing points when running crasher locally. Let's see the effect on chaos... Merging.
* in MmapSliceBufferedUpdateWrapper
* in MmapBitsliceBufferedUpdateWrapper
* in MutableIdTracker's versions updates
* in MutableIdTracker's mapping updates
* clone updates only when non-empty
* only lock for reconciling pending changes
* simpler reconciling
* use Mutex as argument to ensure we only lock within reconciliation
Noticed there were two more components that needed this treatment. I've implemented that in #7805.
Audits and implements better delayed flushing.
This fixes the case where a flusher could lose its pending updates if the flusher closure fails or is never executed.
Instead of taking the current pending updates (leaving an empty, freshly initialized list behind), we should clone them, and reconcile them after persisting them (oftentimes asynchronously).
This PR applies this fix to
`MutableIdTracker`, `MmapSliceBufferedUpdateWrapper`, and `MmapBitsliceBufferedUpdateWrapper`, which were the only storages missing this behavior in my audit.
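The clone-then-reconcile pattern described above can be sketched as follows. All types and names here are hypothetical stand-ins, not the actual Qdrant code:

```rust
use std::sync::Mutex;

/// Hypothetical stand-in for a buffered update wrapper.
struct BufferedWrapper {
    pending_updates: Mutex<Vec<u32>>,
}

impl BufferedWrapper {
    /// Build a flush closure. The pending updates are cloned, not taken,
    /// so nothing is lost if the closure fails or is never executed.
    fn flusher(&self) -> impl FnOnce() + '_ {
        let changes = self.pending_updates.lock().unwrap().clone();
        move || {
            // ... persist `changes` to storage here ...

            // Reconcile afterwards: drop the persisted prefix from the log.
            let mut pending = self.pending_updates.lock().unwrap();
            let matching = pending
                .iter()
                .zip(&changes)
                .take_while(|(p, c)| p == c)
                .count();
            pending.drain(..matching);
        }
    }
}

fn main() {
    let wrapper = BufferedWrapper {
        pending_updates: Mutex::new(vec![1, 2, 3]),
    };
    let flush = wrapper.flusher();
    // An update arrives between creating the flusher and running it...
    wrapper.pending_updates.lock().unwrap().push(4);
    flush();
    // ...and survives the flush.
    assert_eq!(*wrapper.pending_updates.lock().unwrap(), vec![4]);
}
```

Note that the lock is only held briefly twice: once to clone, once to reconcile, so the flush itself can run without blocking incoming updates.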