Limit number of Clocks by generall · Pull Request #8105 · qdrant/qdrant

generall · 2026-02-11T14:14:06Z

Clocks are used to track partial ordering of updates across whole cluster. Number of clocks limit the total update parallelism of individual peer, but this limitation seems to acceptable in order to prevent clogging state with too many clocks

ffuugoo

Overall, LGTM. Maybe add explicit yield_now().await in between iterations.

lib/collection/src/shards/replica_set/update.rs

generall · 2026-02-11T16:40:40Z

async fn get_clock(&self) -> ClockGuard

there are about 100 usage in sync tests, it would be a huge refactoring

…rhead created by spikes

lib/collection/src/shards/replica_set/update.rs

Co-authored-by: Roman Titov <ffuugoo@users.noreply.github.com>

timvisee · 2026-02-12T09:24:37Z

lib/collection/src/wal_delta.rs

            // Release some kept clocks
-            kept_clocks.retain(|(keep_for, _)| *keep_for > 1);
+            kept_clocks.retain_mut(|(keep_for, _)| {
+                if *keep_for == 0 {
+                    return false;
+                }
+                *keep_for -= 1;
+                true
+            });


This artificial release mechanism in the test was broken, as it did not decrease the counter.

I've also swapped the number of rounds we keep locks for in the above two loops. Now the smaller loop keeps clocks for longer, while the bigger loop keeps locks for shorter. This way we stay below the specified maximum of 256 clocks and prevent the test panicking.

I believe it's a reasonable way to make the test happy.

* [manual] limit number of update clocks to 64 to prevent permanent overhead created by spikes * explicit yield + fix tests * Add explicit comment on why we yield async thread * In resolve_wal_delta_randomized test fix clock cleanup and satisfy test * Use tokio timeout Co-authored-by: Roman Titov <ffuugoo@users.noreply.github.com> --------- Co-authored-by: Tim Visée <tim+github@visee.me> Co-authored-by: timvisee <tim@visee.me> Co-authored-by: Roman Titov <ffuugoo@users.noreply.github.com>

monoid · 2026-02-16T10:35:09Z

lib/collection/src/shards/replica_set/update.rs

+            match self.clock_set.lock().await.get_clock() {
+                Some(clock) => return clock,
+                // Prevent blocking async runtime with spinlock
+                None => yield_now().await,


We may have a condvar that we can await on.

There is many clocks. We should not wait with a condvar on all of them.

Having a separate atomic or notification channel is possible, but it makes the structure more complicated.

We don't expect to hit this branch under normal circumstances. Only when an excessive amount of parallelism is used, which is a problem on its own.

generall requested review from agourlay and ffuugoo February 11, 2026 14:14

generall changed the title ~~Limit count of Clocks~~ Limit number of Clocks Feb 11, 2026

This comment was marked as resolved.

Sign in to view

ffuugoo approved these changes Feb 11, 2026

View reviewed changes

lib/collection/src/shards/replica_set/update.rs Outdated Show resolved Hide resolved

lib/collection/src/shards/replica_set/update.rs Outdated Show resolved Hide resolved

timvisee added the release:1.17.0 label Feb 11, 2026

qdrant deleted a comment from coderabbitai bot Feb 11, 2026

This comment was marked as resolved.

Sign in to view

generall added 2 commits February 11, 2026 18:01

[manual] limit number of update clocks to 64 to prevent permanent ove…

7782381

…rhead created by spikes

explicit yield + fix tests

d4da977

generall force-pushed the limit-clocks-count branch from 32257ea to d4da977 Compare February 11, 2026 17:02

This comment was marked as resolved.

Sign in to view

ffuugoo reviewed Feb 11, 2026

View reviewed changes

lib/collection/src/shards/replica_set/update.rs Outdated Show resolved Hide resolved

lib/collection/src/shards/replica_set/update.rs Outdated Show resolved Hide resolved

timvisee reviewed Feb 12, 2026

View reviewed changes

lib/collection/src/shards/replica_set/update.rs Outdated Show resolved Hide resolved

timvisee and others added 3 commits February 12, 2026 09:58

Add explicit comment on why we yield async thread

610b1ab

In resolve_wal_delta_randomized test fix clock cleanup and satisfy test

c1ed434

Use tokio timeout

e5893bc

Co-authored-by: Roman Titov <ffuugoo@users.noreply.github.com>

timvisee reviewed Feb 12, 2026

View reviewed changes

timvisee approved these changes Feb 12, 2026

View reviewed changes

github-actions bot mentioned this pull request Feb 12, 2026

Flaky test tests::hw_metrics::test_hw_metrics_cancellation #6070

Open

timvisee merged commit 538eb74 into dev Feb 12, 2026
15 checks passed

timvisee deleted the limit-clocks-count branch February 12, 2026 09:56

IvanPleshkov mentioned this pull request Feb 16, 2026

Lock free clock set #8140

Closed

monoid reviewed Feb 16, 2026

View reviewed changes

timvisee mentioned this pull request Feb 17, 2026

Bump version to 1.17.0 #8160

Merged

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Limit number of Clocks#8105

Limit number of Clocks#8105
timvisee merged 5 commits intodevfrom
limit-clocks-count

generall commented Feb 11, 2026 •

edited

Loading

Uh oh!

This comment was marked as resolved.

Uh oh!

ffuugoo left a comment

Uh oh!

Uh oh!

Uh oh!

generall commented Feb 11, 2026

Uh oh!

This comment was marked as resolved.

This comment was marked as resolved.

Uh oh!

Uh oh!

Uh oh!

timvisee Feb 12, 2026 •

edited

Loading

Uh oh!

Uh oh!

monoid Feb 16, 2026

Uh oh!

timvisee Feb 16, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

generall commented Feb 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

This comment was marked as resolved.

Uh oh!

ffuugoo left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

generall commented Feb 11, 2026

Uh oh!

This comment was marked as resolved.

This comment was marked as resolved.

Uh oh!

Uh oh!

Uh oh!

timvisee Feb 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

monoid Feb 16, 2026

Choose a reason for hiding this comment

Uh oh!

timvisee Feb 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

generall commented Feb 11, 2026 •

edited

Loading

timvisee Feb 12, 2026 •

edited

Loading

timvisee Feb 16, 2026 •

edited

Loading