Expose PVC metrics via kubelet prometheus by wongma7 · Pull Request #51553 · kubernetes/kubernetes

wongma7 · 2017-08-29T19:16:13Z

This depends on #51448, opening early though. second commit is mine and mostly a copy/paste job.

implements metrics listed in here kubernetes/community#855 following method here kubernetes/community#930 (comment)

Which issue this PR fixes (optional, in fixes #<issue number>(, fixes #<issue_number>, ...) format, will close that issue when PR gets merged): kubernetes/enhancements#363

Special notes for your reviewer:

Release note:

PersistentVolumeClaim metrics like "volume_stats_inodes" and "volume_stats_capacity_bytes" are now reported via kubelet prometheus

wongma7 · 2017-08-29T19:16:31Z

/sig storage

jingxu97 · 2017-08-29T23:06:39Z

/lgtm

derekwaynecarr · 2017-08-30T18:17:50Z

/retest

eparis · 2017-08-30T18:19:19Z

Appears that do-not-merge was added because the release note section was not originally filled out. It appears to be filled out now, so removing the label.

derekwaynecarr · 2017-08-30T18:21:33Z

kubelet changes /lgtm

/approve

wongma7 · 2017-08-30T18:30:30Z

@jingxu97 I will ping you later to re-add lgtm, we've decided to wait for #51448 to merge first, thanks :]

wongma7 · 2017-08-31T16:04:55Z

/retest

wongma7 · 2017-08-31T20:53:32Z

@jingxu97 please re-add /lgtm, the queue is extremely slow (Estimated Merging 0 PRs per day.) so I'm now more scared of getting left behind than I am of* git, thank you!

jingxu97 · 2017-09-01T05:29:38Z

/lgtm

jingxu97 · 2017-09-01T05:30:17Z

@wongma7 Could you rebase? After that, you could put lgtm too.

k8s-github-robot · 2017-09-01T16:54:14Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: derekwaynecarr, gnufied, jingxu97, wongma7

Associated issue: 363

The full list of commands accepted by this bot can be found here.

Details

Needs approval from an approver in each of these OWNERS Files:

~~pkg/kubelet/OWNERS~~ [derekwaynecarr]

You can indicate your approval by writing /approve in a comment
You can cancel your approval by writing /approve cancel in a comment

wongma7 · 2017-09-01T19:04:20Z

/retest
#49761

wongma7 · 2017-09-01T19:17:21Z

/retest

gnufied · 2017-09-02T16:51:20Z

Failure here is related to #51856 and #49613 by the looks of it. Other PRs are failing for same reason.

gnufied · 2017-09-02T16:51:46Z

/test pull-kubernetes-kubemark-e2e-gce
/test pull-kubernetes-e2e-kops-aws

fejta-bot · 2017-09-03T02:51:59Z

/retest
This bot automatically retries jobs that failed/flaked on approved PRs (send feedback to @fejta).

Review the full test history for this PR.

k8s-github-robot · 2017-09-03T03:35:35Z

/test all [submit-queue is verifying that this PR is safe to merge]

k8s-github-robot · 2017-09-03T04:22:42Z

Automatic merge from submit-queue

k8s-ci-robot · 2017-09-03T04:54:21Z

@wongma7: The following test failed, say /retest to rerun them all:

Test name	Commit	Details	Rerun command
pull-kubernetes-e2e-kops-aws	`dac2068`	link	`/test pull-kubernetes-e2e-kops-aws`

Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR.

Details

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

tanner-bruce · 2017-09-06T18:13:40Z

Is it possible to get this cherry picked back to 1.6 / 1.7?

jingxu97 · 2017-09-06T18:33:39Z

@tanner-bruce because this is 1.8 feature so unfortunately we could not cherry pick this PR.

piosz · 2017-09-07T11:54:22Z

Prometheus endpoint in kubelet should only export metrics about kubelet state and not about the pods running on the node. Same applies to other components, for example apiserver exports metrics about number of handled requests, but doesn't export any metrics about pods running in Kubernetes cluster.

Resource usage metrics should be exported by:

Kubelet Summary API if they are critical for Kubernetes components to operate (e.g. for scheduler to place pods in optimal way)
or
3rd party node-level agent
See more details in Monitoring Architecture.

Due to historical reasons, as a side effect of linking cadvisor into Kubelet, there are some resource usage metrics exposed through Prometheus endpoint in Kubelet, but we plan to remove it at some point (of course in a graceful way). In particularly this means that this PR uses a deprecated approach.

cc @fgrzadkowski @kubernetes/sig-instrumentation-pr-reviews

brancz · 2017-09-13T09:09:47Z

I agree with @piosz in regards to the inconsistencies with the monitoring architecture, however, due to the nature of how deeply the kubelet is involved in setting up the volumes I can't currently see a better way. Maybe this would be yet another case of the exporter we have been discussing on sig-instrumentation lately? I'm concerned though that the responsibility of this exporter might explode before we even started implementing it.

eedugon · 2017-09-13T09:49:11Z

@piosz , @brancz ,

I agree with the view explained by @piosz , but then a quick question in case cadvisor is removed from the picture.

who should be responsible of translating information of kubelet summary API into prometheus metrics then?

Because PVs and PVCs are critical components for pods to operate (if the PV is full the pod/container using that pod will have problems for sure). Currently the info is available in summary API, but not in a prometheus metric format.

Thanks and regards!

brancz · 2017-09-13T09:53:46Z

The idea is that the kubelet is able to retrieve that very information that the scheduler requires for its purposes itself, which could be through a library, that both the kubelet and the cAdvisor replacement would use. If you're interested in the topic I'd suggest to join the bi-weekly sig-instrumentation call 🙂 .

See kubernetes#51553.

k8s-ci-robot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. labels Aug 29, 2017

k8s-ci-robot added the sig/storage Categorizes an issue or PR as relevant to SIG Storage. label Aug 29, 2017

k8s-github-robot assigned dashpole and pmorie Aug 29, 2017

k8s-github-robot added the release-note-label-needed label Aug 29, 2017

jingxu97 self-requested a review August 29, 2017 23:04

k8s-ci-robot assigned jingxu97 Aug 29, 2017

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Aug 29, 2017

k8s-github-robot added the do-not-merge DEPRECATED. Indicates that a PR should not merge. Label can only be manually applied/removed. label Aug 29, 2017

wongma7 changed the title ~~WIP: Expose PVC metrics via kubelet prometheus~~ Expose PVC metrics via kubelet prometheus Aug 29, 2017

k8s-github-robot added release-note Denotes a PR that will be considered when it comes time to generate release notes. and removed release-note-label-needed labels Aug 29, 2017

eparis removed the do-not-merge DEPRECATED. Indicates that a PR should not merge. Label can only be manually applied/removed. label Aug 30, 2017

k8s-github-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Aug 30, 2017

wongma7 force-pushed the pvc-prometheus branch from bbed4db to 7738204 Compare August 30, 2017 18:26

k8s-github-robot removed the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Aug 30, 2017

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Sep 1, 2017

k8s-github-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Sep 1, 2017

Expose PVC metrics via kubelet prometheus

dac2068

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Sep 1, 2017

abgworrall modified the milestone: v1.8 Sep 2, 2017

k8s-github-robot merged commit 5781958 into kubernetes:master Sep 3, 2017

k8s-ci-robot added the sig/instrumentation Categorizes an issue or PR as relevant to SIG Instrumentation. label Sep 7, 2017

brancz mentioned this pull request Sep 13, 2017

Monitoring Kubernetes PersistentVolumes prometheus-operator/prometheus-operator#485

Closed

satyamz mentioned this pull request Sep 22, 2017

OpenEBS volume should be complaint with PVC metrics openebs/openebs#366

Closed

cofyc pushed a commit to cofyc/kubernetes that referenced this pull request Sep 26, 2017

Expose PVC metrics via kubelet prometheus

db42f96

See kubernetes#51553.

dashpole mentioned this pull request Jan 22, 2018

Volume metrics exposed in /stats/summary not available in /metrics #34137

Closed

cofyc mentioned this pull request Feb 1, 2018

Fix kubelet PVC stale metrics #59170

Merged

dashpole mentioned this pull request Apr 13, 2018

Unable to see container's metrics about external volumes attached (k8s persistent volumes) google/cadvisor#1702

Closed

NickrenREN mentioned this pull request Oct 8, 2018

Exposing ephemeral storage metrics to prometheus #69507

Open

pacoxu mentioned this pull request Dec 9, 2020

k8s 1.19.4 kubelet stops presenting kubelet_volume_stats_used_bytes metrics #97138

Closed

bells17 mentioned this pull request Aug 31, 2021

Check inodes as well and increase volume topolvm/pvc-autoresizer#59

Closed

dwbrown2 mentioned this pull request Sep 29, 2022

whitelist PV usage metrics kubecost/kubecost#1713

Merged

Conversation

wongma7 commented Aug 29, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

wongma7 commented Aug 29, 2017

Uh oh!

jingxu97 commented Aug 29, 2017

Uh oh!

derekwaynecarr commented Aug 30, 2017

Uh oh!

eparis commented Aug 30, 2017

Uh oh!

derekwaynecarr commented Aug 30, 2017

Uh oh!

wongma7 commented Aug 30, 2017

Uh oh!

wongma7 commented Aug 31, 2017

Uh oh!

wongma7 commented Aug 31, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jingxu97 commented Sep 1, 2017

Uh oh!

jingxu97 commented Sep 1, 2017

Uh oh!

k8s-github-robot commented Sep 1, 2017

Uh oh!

wongma7 commented Sep 1, 2017

Uh oh!

wongma7 commented Sep 1, 2017

Uh oh!

gnufied commented Sep 2, 2017

Uh oh!

gnufied commented Sep 2, 2017

Uh oh!

fejta-bot commented Sep 3, 2017

Uh oh!

k8s-github-robot commented Sep 3, 2017

Uh oh!

k8s-github-robot commented Sep 3, 2017

Uh oh!

k8s-ci-robot commented Sep 3, 2017

Uh oh!

tanner-bruce commented Sep 6, 2017

Uh oh!

jingxu97 commented Sep 6, 2017

Uh oh!

piosz commented Sep 7, 2017

Uh oh!

brancz commented Sep 13, 2017

Uh oh!

eedugon commented Sep 13, 2017

Uh oh!

brancz commented Sep 13, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

15 participants

wongma7 commented Aug 29, 2017 •

edited

Loading

wongma7 commented Aug 31, 2017 •

edited

Loading