fix: report registry container working set memory instead of raw cgro…#838
Conversation
|
Skipping CI for Draft Pull Request. |
|
Warning Review limit reached
More reviews will be available in 23 minutes and 28 seconds. Learn how PR review limits work. Your organization has run out of usage credits. Purchase more in the billing tab. ⌛ How to resolve this issue?After more reviews become available, a review can be triggered using the We recommend that you space out your commits to avoid hitting the rate limit. 🚦 How do rate limits work?CodeRabbit enforces hourly rate limits for each developer per organization. Our paid plans include higher PR review limits than trial, open-source, and free plans. In all cases, reviews become available again over time. During sustained high-volume PR review activity, CodeRabbit may temporarily slow when the next review becomes available. Please see our Fair Usage Limits Policy for further information. ℹ️ Review info⚙️ Run configurationConfiguration used: Path: .coderabbit.yaml Review profile: CHILL Plan: Enterprise Run ID: 📒 Files selected for processing (3)
📝 WalkthroughWalkthroughThe PR adds a working-set memory metric to registry container monitoring. The metrics collector script now computes working-set memory by subtracting inactive file pages from container memory usage and emits it as a new Prometheus gauge. The Grafana dashboard panel is updated to display both working-set and total memory, and documentation is expanded to explain the distinction. ChangesRegistry Memory Metrics Expansion
Estimated code review effort🎯 2 (Simple) | ⏱️ ~10 minutes Possibly related PRs
Suggested labels
Suggested reviewers
Poem
🚥 Pre-merge checks | ✅ 5✅ Passed checks (5 passed)
✏️ Tip: You can configure your own custom pre-merge checks in the settings. ✨ Finishing Touches🧪 Generate unit tests (beta)
Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out. Comment |
38beb3f to
6a5ede9
Compare
There was a problem hiding this comment.
Actionable comments posted: 1
🤖 Prompt for all review comments with AI agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.
Inline comments:
In `@docs/hv-metrics.md`:
- Line 158: The citation for "working set" has mismatched issue numbers: the
link text shows containers/common#2455 while the URL points to .../issues/2454;
update the reference so both the link text and the URL use the same issue number
(choose either `#2454` or `#2455` consistently) by editing the markdown near the
"working set" line (the link text `containers/common#2455` and the URL
`.../issues/2454`) so they match.
🪄 Autofix (Beta)
Fix all unresolved CodeRabbit comments on this PR:
- Push a commit to this branch (recommended)
- Create a new PR with the fixes
ℹ️ Review info
⚙️ Run configuration
Configuration used: Path: .coderabbit.yaml
Review profile: CHILL
Plan: Enterprise
Run ID: b84edafe-abac-4535-bfbe-3121fe0e526b
📒 Files selected for processing (3)
ansible/roles/hv-metrics-server/files/dashboards/bastion-dashboard.jsonansible/roles/hv-metrics-server/templates/registry-traffic-collector.sh.j2docs/hv-metrics.md
…up usage The registry container memory panel was reporting cgroup memory.current which includes reclaimable page cache, inflating reported usage far beyond actual container memory consumption. Add a working_set metric computed as memory.current - inactive_file (the standard calculation used by podman stats and Docker stats per containers/common#2455) and display it as the primary series on the bastion dashboard. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
6a5ede9 to
87af9ae
Compare
|
/lgtm |
|
/approve |
|
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: agurenko, mcornea The full list of commands accepted by this bot can be found here. The pull request process is described here DetailsNeeds approval from an approver in each of these files:
Approvers can indicate their approval by writing |
fef075c
into
redhat-performance:main
…up usage
The registry container memory panel was showing cgroup memory.current which includes reclaimable page cache, inflating reported usage (~75 GiB) far beyond actual working set memory (~25 GiB). Add a working_set metric computed as memory.current minus inactive_file (matching kubectl top / cadvisor approach) and display it as the primary series on the dashboard.
Summary by CodeRabbit
New Features
Documentation