Multiple metrics hpa by gjtempleton · Pull Request #78503 · kubernetes/kubernetes

gjtempleton · 2019-05-29T22:33:34Z

What type of PR is this?

/kind cleanup

What this PR does / why we need it:
Currently the HPA controller will fail to scale up targets if they are scaling on multiple metrics and one or more of these metrics is invalid/unavailable, even if one of the metrics is available and indicates a scale up should happen.

This PR allows a scale up to happen in this case, whilst still ensuring that it fails safe and refuses to scale down if one or more metrics are unavailable.

Which issue(s) this PR fixes:

Fixes #61007

Special notes for your reviewer:
I wasn't 100% sure whether to go for the first or last invalid metric to be reported as the error in the case of refusing to scale up, however going with the last simplified the changes required massively, meaning no need for passing around conditions.

Credit to @bskiba as a large amount of the changes come from her previous PR #61423

Does this PR introduce a user-facing change?:

Horizontal Pod Autoscaling can now scale targets up even when one or more metrics are invalid/unavailable as long as one metric indicates a scale up should occur.

/sig autoscaling
/priority important-longterm

Add three tests for handling invalid metrics when scaling on multiple metrics - one for scaling up successfully (new behaviour) and two for ensuring we don't scale down (existing behaviour).

Handle a case in the Horizontal Pod Autoscaler Controller when scaling on multiple metrics and one or more is missing or invalid. If all metrics are missing - return an error and leave the isScalingActive condition as that for the last invalid metric. If some metrics are missing/invalid and some are valid and found - if a scale up would be triggered by the valid metrics ignore the missing metrics and scale up, if a scale down would be triggered, return an error and leave the isScalingActive condition as that for the last invalid metric.

k8s-ci-robot · 2019-05-29T22:33:42Z

Hi @gjtempleton. Thanks for your PR.

I'm waiting for a kubernetes member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Details

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

gjtempleton · 2019-05-29T22:36:33Z

cc: @josephburnett

josephburnett · 2019-05-30T13:10:33Z

/lgtm

I'm not yet part of the Kubernetes org, so I can't add /ok-to-test. @mwielgus can you please.

k8s-ci-robot · 2019-05-30T13:10:41Z

@josephburnett: changing LGTM is restricted to assignees, and only kubernetes/kubernetes repo collaborators may be assigned issues.

Details

In response to this:

/lgtm

I'm not yet part of the Kubernetes org, so I can't add /ok-to-test. @mwielgus can you please.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

bskiba · 2019-05-30T13:45:25Z

/ok-to-test

bskiba · 2019-05-31T07:33:17Z

Thanks!
/lgtm

bskiba · 2019-05-31T07:33:28Z

/test pull-kubernetes-bazel-build

bskiba · 2019-05-31T07:35:04Z

@MaciekPytel @mwielgus Can one of you approve?

gjtempleton · 2019-05-31T09:41:16Z

/test pull-kubernetes-bazel-build

bskiba · 2019-05-31T11:03:33Z

/assign @mwielgus
/assgin @MaciekPytel

bskiba · 2019-05-31T11:03:42Z

/assign @MaciekPytel

mwielgus

/lgtm
/approve

k8s-ci-robot · 2019-06-03T22:18:11Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: gjtempleton, mwielgus

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details

Needs approval from an approver in each of these files:

~~pkg/controller/podautoscaler/OWNERS~~ [mwielgus]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

aguckenber-chwy · 2023-05-17T15:56:39Z

Maybe this is worthy of a new issue but regarding this merged feature. Its great that the hpa can scale up with an unknown metric, but is there a flag that is set-able to allow the HPA to scale down for unknown metrics? We have issues where the cluster scales up then a custom metric becomes unknown and the cluster stays way scaled up costing us a fortune.

All because of this:

If multiple metrics are specified in a HorizontalPodAutoscaler, this calculation is done for each metric, and then the largest of the desired replica counts is chosen. If any of these metrics cannot be converted into a desired replica count (e.g. due to an error fetching the metrics from the metrics APIs) and a scale down is suggested by the metrics which can be fetched, scaling is skipped. This means that the HPA is still capable of scaling up if one or more metrics give a desiredReplicas greater than the current value.

gjtempleton · 2023-05-18T08:23:03Z

@aguckenber-chwy that's definitely best discussed as a new issue.

Can definitely understand it as a use case users may want, but there's probably a number of different ways we could expose it as a feature, so likely to be some back on forth on that if nothing else.

aguckenber-chwy · 2023-05-18T13:48:13Z

@aguckenber-chwy that's definitely best discussed as a new issue.

Can definitely understand it as a use case users may want, but there's probably a number of different ways we could expose it as a feature, so likely to be some back on forth on that if nothing else.

Quick question, where is the best spot to put feature requests? The issues in this repository don't have it. It this the correct spot https://github.com/kubernetes/kubernetes/issues ? Or should it go on the community forums under General Discussion?

gjtempleton added 2 commits May 29, 2019 23:11

Add tests for handling scaling on unavailable metrics

ee4dbbc

Add three tests for handling invalid metrics when scaling on multiple metrics - one for scaling up successfully (new behaviour) and two for ensuring we don't scale down (existing behaviour).

k8s-ci-robot added needs-ok-to-test Indicates a PR that requires an org member to verify it is safe to test. sig/apps Categorizes an issue or PR as relevant to SIG Apps. labels May 29, 2019

k8s-ci-robot requested review from mwielgus and piosz May 29, 2019 22:33

k8s-ci-robot added ok-to-test Indicates a non-member PR verified by an org member that is safe to test. and removed needs-ok-to-test Indicates a PR that requires an org member to verify it is safe to test. labels May 30, 2019

k8s-ci-robot assigned bskiba May 31, 2019

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label May 31, 2019

k8s-ci-robot assigned mwielgus May 31, 2019

k8s-ci-robot assigned MaciekPytel May 31, 2019

mwielgus approved these changes Jun 3, 2019

View reviewed changes

k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Jun 3, 2019

mwielgus added this to the v1.15 milestone Jun 3, 2019

k8s-ci-robot merged commit 9f85c5c into kubernetes:master Jun 4, 2019

This was referenced Jun 4, 2019

Update HPA Algorithm Docs for v1.15 kubernetes/website#14728

Merged

In Horizontal Pod Autoscaler Controller when scaling on multiple metrics, handle invalid metrics. #61423

Closed

gjtempleton deleted the Multiple-Metrics-HPA branch June 7, 2019 10:23

gjtempleton mentioned this pull request Jun 9, 2019

Correct Comment on HPA Logic #78827

Closed

vvicaretti mentioned this pull request Jun 25, 2019

HPA should work when configured multiple metrics but some of the metrics cannot be get #74052

Closed

josephburnett mentioned this pull request Jul 3, 2019

REQUEST: New membership for josephburnett kubernetes/org#983

Closed

6 tasks

gjtempleton mentioned this pull request Sep 3, 2019

REQUEST: New membership for gjtempleton kubernetes/org#1153

Closed

6 tasks

gjtempleton mentioned this pull request Apr 29, 2024

HPA - Add gjtempleton to reviewers #124607

Merged

Conversation

gjtempleton commented May 29, 2019

Uh oh!

k8s-ci-robot commented May 29, 2019

Uh oh!

gjtempleton commented May 29, 2019

Uh oh!

josephburnett commented May 30, 2019

Uh oh!

k8s-ci-robot commented May 30, 2019

Uh oh!

bskiba commented May 30, 2019

Uh oh!

bskiba commented May 31, 2019

Uh oh!

bskiba commented May 31, 2019

Uh oh!

bskiba commented May 31, 2019

Uh oh!

gjtempleton commented May 31, 2019

Uh oh!

bskiba commented May 31, 2019

Uh oh!

bskiba commented May 31, 2019

Uh oh!

mwielgus left a comment

Choose a reason for hiding this comment

Uh oh!

k8s-ci-robot commented Jun 3, 2019

Uh oh!

aguckenber-chwy commented May 17, 2023

Uh oh!

gjtempleton commented May 18, 2023

Uh oh!

aguckenber-chwy commented May 18, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants