Skip to content

ThanosRuleHighRuleEvaluationWarnings firing due to sufix#1784

Merged
naveenpaul1 merged 1 commit intonoobaa:masterfrom
naveenpaul1:thanos-rule-warning-fix
Jan 26, 2026
Merged

ThanosRuleHighRuleEvaluationWarnings firing due to sufix#1784
naveenpaul1 merged 1 commit intonoobaa:masterfrom
naveenpaul1:thanos-rule-warning-fix

Conversation

@naveenpaul1
Copy link
Contributor

@naveenpaul1 naveenpaul1 commented Jan 22, 2026

Describe the Problem

ThanosRuleHighRuleEvaluationWarnings firing due to following metrics,

NooBaa_providers_ops_read_num
NooBaa_providers_ops_write_num
NooBaa_providers_bandwidth_write_size
NooBaa_providers_bandwidth_read_size 

Explain the Changes

  1. Metrics are updated with sufix accepted by Thanos-ruler
  2. Update the respetive ruless in operator
NooBaa_providers_ops_read_num ->  NooBaa_providers_ops_read_count
NooBaa_providers_ops_write_num -> NooBaa_providers_ops_write_count
NooBaa_providers_bandwidth_write_size -> NooBaa_providers_bandwidth_write_size_total
NooBaa_providers_bandwidth_read_size ->  NooBaa_providers_bandwidth_read_size_total

Issues: Fixed #xxx / Gap #xxx

  1. https://issues.redhat.com/browse/DFBUGS-3822

Testing Instructions:

Core PR : noobaa/noobaa-core#9388

  • Doc added/updated
  • Tests added

Summary by CodeRabbit

  • Chores
    • Updated monitoring rules to use corrected metrics for IOPS and throughput calculations, improving accuracy and reliability of storage performance telemetry.

✏️ Tip: You can customize this high-level summary in your review settings.

@coderabbitai
Copy link

coderabbitai bot commented Jan 22, 2026

📝 Walkthrough

Walkthrough

Prometheus rule expressions in the noobaa-odf.rules group were updated to reference different metric names: IOPS now uses NooBaa_providers_ops_read_count and NooBaa_providers_ops_write_count; throughput uses NooBaa_providers_bandwidth_read_size_total and NooBaa_providers_bandwidth_write_size_total.

Changes

Cohort / File(s) Change Summary
Prometheus Rules Configuration
deploy/internal/prometheus-rules.yaml
Replaced read/write IOPS metrics to use _count variants (NooBaa_providers_ops_read_count, NooBaa_providers_ops_write_count) and replaced throughput bandwidth metrics to use _total variants (NooBaa_providers_bandwidth_read_size_total, NooBaa_providers_bandwidth_write_size_total) (two lines changed: +2 / -2)
Bundle Hash Update
pkg/bundle/deploy.go
Updated Sha256_deploy_internal_prometheus_rules_yaml constant to new hash reflecting the updated prometheus-rules.yaml; metric names in embedded expressions updated to match file changes.

Sequence Diagram(s)

(omitted — changes are metric renames/config update only)

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~10 minutes

Possibly related PRs

Suggested labels

size/L

Suggested reviewers

  • jackyalbo
  • liranmauda
🚥 Pre-merge checks | ✅ 3
✅ Passed checks (3 passed)
Check name Status Explanation
Title check ✅ Passed The title references the problem (ThanosRuleHighRuleEvaluationWarnings firing) but contains a typo ('sufix' instead of 'suffix') and lacks clarity about the solution (metric name updates).
Description check ✅ Passed The description covers the problem, metric changes, and related issue/PR links, but the testing instructions section is empty and both checklists remain unchecked.
Docstring Coverage ✅ Passed No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.

✏️ Tip: You can configure your own custom pre-merge checks in the settings.

✨ Finishing touches
  • 📝 Generate docstrings

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

Comment @coderabbitai help to get the list of available commands and usage tips.

Copy link

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 1

🤖 Fix all issues with AI agents
In `@deploy/internal/prometheus-rules.yaml`:
- Line 49: The Prometheus recording rule's expression contains a typo: the
metric name "NooBaa_NooBaa_providers_ops_read_count" in the sum by (namespace,
managedBy, job, service) (...) expression is duplicated; replace it with the
correct metric name "NooBaa_providers_ops_read_count" so the expression becomes
sum by (namespace, managedBy, job, service)
(rate(NooBaa_providers_ops_read_count[5m]) +
rate(NooBaa_providers_ops_write_count[5m])) to restore correct IOPS recording.

@naveenpaul1 naveenpaul1 force-pushed the thanos-rule-warning-fix branch 2 times, most recently from 2d56853 to 8b6f3aa Compare January 26, 2026 07:08
Signed-off-by: Naveen Paul <napaul@redhat.com>
@naveenpaul1 naveenpaul1 force-pushed the thanos-rule-warning-fix branch from 8b6f3aa to 526361b Compare January 26, 2026 07:59
@naveenpaul1 naveenpaul1 merged commit a9d4be3 into noobaa:master Jan 26, 2026
15 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants