Fix Gridstore flushing, correctly implement drain persist by timvisee · Pull Request #7741 · qdrant/qdrant

timvisee · 2025-12-10T16:41:35Z

Fix a critical Gridstore bug that breaks flushing with alternating puts and deletes.

Gridstore uses deferred flushing. It means that we copy the list of pending updates when the flusher is created. Once flushing is invoked and updates are persisted, we remove these from the current list of updates (which might already have new entries). This 'removing' we call 'drain persisted'. It is critical to persist every change only once.

This drain function was broken and failed on some edge cases. For the problematic edge case we found in testing, I've added a new test.

The problem only appears when deferring a flush (for some time). If flushing immediately, the pending and persisted set of updates are equal and everything works as expected.

Bear in mind that I use the concept of a 'set' and 'unset' list in this PR. I plan to open a separate PR to apply this concept everywhere as it's much more easy to understand. Since it's only partially applied in this PR it might look like boilerplate.

All Submissions:

Contributions should target the dev branch. Did you create your branch from dev?
Have you followed the guidelines in our Contributing document?
Have you checked to ensure there aren't other open Pull Requests for the same update/change?

Changes to Core Features:

Have you added an explanation of what your changes do and why you'd like us to include them?
Have you written new tests for your core changes, as applicable?
Have you successfully ran tests with your changes locally?

timvisee · 2025-12-10T16:46:17Z

lib/gridstore/src/tracker.rs

    }
+
+    #[test]
+    fn test_value_pointer_drain_set_unset_set() {


New test is here. This failed before fixing drain_persisted_and_drop.

timvisee · 2025-12-10T16:50:19Z

lib/gridstore/src/tracker.rs

+        // In current history, range of unset entries
+        let unset_range = if self.latest_is_set {
+            0..self.history.len().saturating_sub(1)
+        } else {
+            0..self.history.len()
+        };


I'll significantly improve this in a future PR where I'll use a separate set/unset list.

timvisee · 2025-12-10T17:14:42Z

lib/gridstore/src/tracker.rs

+        debug_assert_eq!(
+            self.history.iter().copied().collect::<AHashSet<_>>().len(),
+            self.history.len(),
+            "self must not have duplicate pointers in history",
+        );


We also do the same debug assertion in the caller of this function. But I vote to keep it for now, until I more properly refactor it later. It does not affect release builds.

coszio

I think the changes make sense. Though I made an alternative solution to (IMO) simplify draining logic in #7745. Let's discuss tomorrow 😄

coszio · 2025-12-11T03:00:10Z

lib/gridstore/src/tracker.rs

+            (Some(last), Some(set)) if last == set => {
+                self.history.pop();
+                self.latest_is_set = false;
+            }


Isn't this the same as checking if the entire PointerUpdates is the same? In such case we can return early here, since the rest of the code will remove the remaining history.

I also thought so.

But in this function I implemented the complete thing on purpose, because I don't want to make any assumptions anymore. Don't want to spend another week on it in the future. 🙃

* Fix Gridstore tracker drain and persist not working properly * Patch existing test * Add new test to assert buggy scenario we found * Mention PR in test

timvisee added 3 commits December 10, 2025 17:39

Fix Gridstore tracker drain and persist not working properly

7b57958

Patch existing test

0576d72

Add new test to assert buggy scenario we found

bdcae85

timvisee marked this pull request as ready for review December 10, 2025 16:45

timvisee commented Dec 10, 2025

View reviewed changes

timvisee requested review from agourlay, coszio and generall December 10, 2025 16:46

Mention PR in test

dbc9726

This comment was marked as resolved.

Sign in to view

qdrant deleted a comment from coderabbitai bot Dec 10, 2025

timvisee commented Dec 10, 2025

View reviewed changes

This comment was marked as resolved.

Sign in to view

qdrant deleted a comment from coderabbitai bot Dec 10, 2025

timvisee commented Dec 10, 2025

View reviewed changes

coszio mentioned this pull request Dec 11, 2025

[gridstore] Fix draining bug by using opnums #7745

Closed

coszio reviewed Dec 11, 2025

View reviewed changes

agourlay approved these changes Dec 11, 2025

View reviewed changes

timvisee merged commit 14ebe94 into dev Dec 11, 2025
15 checks passed

timvisee deleted the fix-gridstore-tracker-drain-persist branch December 11, 2025 09:27

agourlay mentioned this pull request Dec 11, 2025

Test Gridstore corruption on flush #7713

Merged

timvisee mentioned this pull request Dec 11, 2025

Gridstore: split pointer updates set/unset, simplify draining #7749

Merged

9 tasks

timvisee added the release:1.16.3 label Dec 11, 2025

timvisee mentioned this pull request Dec 19, 2025

Bump version to 1.16.3 #7806

Merged

claude bot mentioned this pull request Jan 9, 2026

chore(deps): update docker.io/qdrant/qdrant docker tag to v1.16.3 cbcoutinho/nextcloud-mcp-server#464

Merged

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix Gridstore flushing, correctly implement drain persist#7741

Fix Gridstore flushing, correctly implement drain persist#7741
timvisee merged 4 commits intodevfrom
fix-gridstore-tracker-drain-persist

timvisee commented Dec 10, 2025 •

edited

Loading

Uh oh!

timvisee Dec 10, 2025

Uh oh!

This comment was marked as resolved.

Uh oh!

timvisee Dec 10, 2025

Uh oh!

This comment was marked as resolved.

Uh oh!

timvisee Dec 10, 2025 •

edited

Loading

Uh oh!

coszio left a comment

Uh oh!

coszio Dec 11, 2025

Uh oh!

timvisee Dec 11, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

timvisee commented Dec 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

All Submissions:

Changes to Core Features:

Uh oh!

timvisee Dec 10, 2025

Choose a reason for hiding this comment

Uh oh!

This comment was marked as resolved.

Uh oh!

timvisee Dec 10, 2025

Choose a reason for hiding this comment

Uh oh!

This comment was marked as resolved.

Uh oh!

timvisee Dec 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

coszio left a comment

Choose a reason for hiding this comment

Uh oh!

coszio Dec 11, 2025

Choose a reason for hiding this comment

Uh oh!

timvisee Dec 11, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

timvisee commented Dec 10, 2025 •

edited

Loading

timvisee Dec 10, 2025 •

edited

Loading