Currently running optimizers in Metrics by JojiiOfficial · Pull Request #7316 · qdrant/qdrant

JojiiOfficial · 2025-09-26T08:07:04Z

Supersedes #7275

Implements the optimizer_running_processes metrics as discussed:

# HELP optimizer_running_processes number of optimization processes running in total
# TYPE optimizer_running_processes gauge
optimizer_running_processes 1

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (2)

src/common/metrics.rs (2)
178-189: Type clarity: make the accumulator explicitly usize

Avoid relying on numeric literal inference; make the intent unambiguous.
-        let mut total_optimizations_running = 0;
+        let mut total_optimizations_running: usize = 0;
191-196: Prometheus naming: avoid “total” in a Gauge name

“_total” is reserved for monotonically increasing counters. This metric is a gauge, so prefer a name like optimizer_processes_running and adjust the help accordingly.
-        metrics.push(metric_family(
-            "optimizer_total_processes_running",
-            "number of optimization processes running in total",
+        metrics.push(metric_family(
+            "optimizer_processes_running",
+            "current number of optimization processes running",
             MetricType::GAUGE,
             vec![gauge(total_optimizations_running as f64, &[])],
         ));

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 7d7f8a3 and 4c73996.

📒 Files selected for processing (6)

lib/collection/src/shards/local_shard/telemetry.rs (1 hunks)
lib/collection/src/telemetry.rs (2 hunks)
lib/common/common/src/types.rs (2 hunks)
src/actix/api/service_api.rs (2 hunks)
src/common/metrics.rs (1 hunks)
src/common/telemetry_reporting.rs (1 hunks)

🧰 Additional context used

📓 Path-based instructions (2)

**/*.rs

📄 CodeRabbit inference engine (.github/review-rules.md)

**/*.rs: Prefer explicit SomeType::from(x) over implicit x.into() in Rust code
Do not use transmute_from_u8, transmute_to_u8, transmute_from_u8_to_slice, transmute_from_u8_to_mut_slice, transmute_to_u8_slice in new code; use bytemuck or zerocopy instead

Files:

lib/collection/src/telemetry.rs
lib/common/common/src/types.rs
src/common/metrics.rs
src/actix/api/service_api.rs
lib/collection/src/shards/local_shard/telemetry.rs
src/common/telemetry_reporting.rs

**/src/**/*.rs

📄 CodeRabbit inference engine (.github/review-rules.md)

**/src/**/*.rs: Prefer exhaustive match arms over a catch-all _ arm to avoid missing new enum variants (except in tests/benchmarks or when provably safe)
Prefer explicit field ignoring with : _ over .. in struct patterns (except in tests/benchmarks or when provably safe)

Files:

lib/collection/src/telemetry.rs
lib/common/common/src/types.rs
src/common/metrics.rs
src/actix/api/service_api.rs
lib/collection/src/shards/local_shard/telemetry.rs
src/common/telemetry_reporting.rs

🧬 Code graph analysis (1)

src/common/metrics.rs (1)

src/actix/api/service_api.rs (1)

metrics (68-98)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (12)

GitHub Check: rust-tests (windows-latest)
GitHub Check: rust-tests (ubuntu-latest)
GitHub Check: rust-tests (macos-latest)
GitHub Check: test-shard-snapshot-api-s3-minio
GitHub Check: e2e-tests
GitHub Check: storage-compat-test
GitHub Check: rust-tests-no-rocksdb (ubuntu-latest)
GitHub Check: test-consistency
GitHub Check: integration-tests
GitHub Check: lint
GitHub Check: integration-tests-consensus
GitHub Check: test-consensus-compose

🔇 Additional comments (5)

lib/collection/src/shards/local_shard/telemetry.rs (1)

73-74: LGTM: gated optimizer logs via either Level4 or optimizer_logs flag

This enables metrics collection without bumping overall detail level. Nice.

src/actix/api/service_api.rs (2)

46-50: Telemetry endpoint: explicit optimizer_logs=false

Clear and consistent with privacy posture for standard telemetry.

82-86: Metrics endpoint: enable optimizer logs for aggregation

Correctly sets optimizer_logs=true at Level3 to support the new metric.

src/common/telemetry_reporting.rs (1)

12-16: Reporter DETAIL updated correctly

Keeping optimizer_logs=false here is appropriate.

lib/common/common/src/types.rs (1)

36-36: All TelemetryDetail initializers include optimizer_logs Verified constant DETAIL, Default impl, and all struct literals explicitly set optimizer_logs.

lib/collection/src/telemetry.rs

timvisee · 2025-09-29T14:31:14Z

lib/collection/src/telemetry.rs

+    /// Amount of optimizers currently running.
+    ///
+    /// Note: A `DetailsLevel` of 4 or setting `telemetry_detail.optimizer_logs` to true is required.
+    ///       Otherwise, this function will return 0, which may not be correct.
+    pub fn count_optimizers_running(&self) -> usize {
+        self.shards
+            .iter()
+            .flatten()
+            .filter_map(|replica_set| replica_set.local.as_ref())
+            .flat_map(|local_shard| local_shard.optimizations.log.iter().flatten())
+            .filter(|log| log.status == TrackerStatus::Optimizing)
+            .count()
+    }


When reading the PR description I thought this could be done with counting with some atomic number and a guard.

But I think your approach is better, where we use state that we already have. 👍

* Currently running optimizer count in metrics * Clearly state the prerequisites of count_optimizers_running() * Minor improvements * improve metric naming

JojiiOfficial changed the base branch from master to dev September 26, 2025 08:07

JojiiOfficial force-pushed the metrics-currently-running-optimizers branch from 098ce61 to 4c73996 Compare September 26, 2025 08:13

JojiiOfficial requested review from generall and timvisee September 26, 2025 08:14

qdrant deleted a comment from coderabbitai bot Sep 26, 2025

JojiiOfficial mentioned this pull request Sep 26, 2025

Add more Optimizer data in metrics #7275

Closed

coderabbitai bot reviewed Sep 26, 2025

View reviewed changes

lib/collection/src/telemetry.rs Show resolved Hide resolved

qdrant deleted a comment from coderabbitai bot Sep 26, 2025

JojiiOfficial force-pushed the metrics-currently-running-optimizers branch from 1ea9600 to 3977b28 Compare September 26, 2025 08:28

qdrant deleted a comment from coderabbitai bot Sep 26, 2025

JojiiOfficial mentioned this pull request Sep 26, 2025

Add num points/vectors to metrics API #7302

Merged

generall approved these changes Sep 28, 2025

View reviewed changes

timvisee reviewed Sep 29, 2025

View reviewed changes

timvisee approved these changes Sep 29, 2025

View reviewed changes

JojiiOfficial added 4 commits October 22, 2025 15:26

Currently running optimizer count in metrics

c02ec6a

Clearly state the prerequisites of count_optimizers_running()

27516d2

Minor improvements

9d085c2

improve metric naming

c4bfa71

JojiiOfficial force-pushed the metrics-currently-running-optimizers branch from df33643 to c4bfa71 Compare October 22, 2025 13:27

qdrant deleted a comment from coderabbitai bot Oct 22, 2025

JojiiOfficial merged commit f2c0127 into dev Oct 23, 2025
15 checks passed

JojiiOfficial deleted the metrics-currently-running-optimizers branch October 23, 2025 08:46

timvisee pushed a commit that referenced this pull request Nov 14, 2025

Currently running optimizers in Metrics (#7316)

3fd07eb

* Currently running optimizer count in metrics * Clearly state the prerequisites of count_optimizers_running() * Minor improvements * improve metric naming

timvisee mentioned this pull request Nov 14, 2025

Bump version to 1.16.0 #7535

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Currently running optimizers in Metrics#7316

Currently running optimizers in Metrics#7316
JojiiOfficial merged 4 commits intodevfrom
metrics-currently-running-optimizers

JojiiOfficial commented Sep 26, 2025 •

edited

Loading

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

timvisee Sep 29, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

JojiiOfficial commented Sep 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

timvisee Sep 29, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

JojiiOfficial commented Sep 26, 2025 •

edited

Loading