feat: add fallback support for value metric type #6655
wozniakjan merged 3 commits into kedacore:main
Conversation
This issue has been automatically marked as stale because it has not had recent activity. It will be closed in 7 days if no further activity occurs. Thank you for your contributions.
@JorTurFer Is this something you guys would be interested in? If not, since it's stale now, I'll close it.

This is interesting indeed; the limitation of the metric type is something to fix. Sorry for not reviewing the PR, it was missed in my notifications. I'm going to review it.
/run-e2e fallback |
JorTurFer left a comment:

In general, the code looks nice, and it's quite an interesting solution! I'd need to check it myself to be 100% sure that it works as expected, so let's see how the e2e test goes. I've left one small comment inline.
@wozniakjan PTAL too.
@JorTurFer Can we expect this in the next release? The version update was causing many issues since the average value is enforced; this would let me update to the latest version without any schema change to all the existing ScaledObjects.

I see there are also merge conflicts in this PR.

@y-rabie Hey, can you please take a look and resolve the merge conflicts? This is a much-needed feature.
@SanthanCH @rickbrouwer Conflicts resolved.

/run-e2e fallback

@rickbrouwer Can you review now? I can see all the checks passed.

I think it's good to add this PR to the Release 2.18 list. That gives me some time to take a closer look at the PR, but overall, everything looks good.

Hey @rickbrouwer, is this feature going to be in the next release? And when is the next release, is there any ETA?
@JorTurFer Yep, I think the timeout was introduced because we now test rollouts and deployments. I've split them into two files.
Hey @JorTurFer, @zroubalik, @rickbrouwer

/run-e2e fallback*

@SanthanCH Unfortunately e2e tests still fail, PTAL.

Hey @y-rabie, could you PTAL?
/run-e2e fallback*

Hey @JorTurFer, @zroubalik, @rickbrouwer
* feat: add fallback support for value metric type
* Alternative implementation for getReadyReplicasCount in fallback
* fix: restructure fallback tests to avoid timeouts

Signed-off-by: y-rabie <youssef.rabie@procore.com>
Still getting this error in the KEDA admission webhook pod. This ScaledObject has only a CPU trigger and the metric type is Utilization.
Fallback doesn't work for the CPU and memory scalers because those metrics aren't managed by KEDA. KEDA generates the HPA for the k8s metrics server on your behalf, but nothing else. To use fallback after 2.18, you need at least one scaler other than memory or cpu.
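To illustrate the point above, here is a sketch of a ScaledObject where fallback is valid (all names, addresses, and values are made up): the cpu trigger is served by the k8s metrics server and is never subject to fallback, while the prometheus trigger is KEDA-managed, so the fallback section can apply to it.

```yaml
apiVersion: keda.sh/v1alpha1
kind: ScaledObject
metadata:
  name: example-scaledobject      # hypothetical name
spec:
  scaleTargetRef:
    name: example-deployment      # hypothetical target
  fallback:
    failureThreshold: 3
    replicas: 5
  triggers:
    - type: cpu                   # handled by the k8s metrics server; no fallback
      metricType: Utilization
      metadata:
        value: "60"
    - type: prometheus            # KEDA-managed scaler; fallback applies here
      metadata:
        serverAddress: http://prometheus.svc:9090   # hypothetical address
        query: sum(rate(http_requests_total[2m]))   # hypothetical query
        threshold: "100"
```

With only the cpu trigger present, the admission webhook rejects the fallback section, which matches the error reported above.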
We also ran into this, and did not see it in the documentation of the CPU scaler. Maybe it is stated somewhere else. Would you accept a PR for the docs about this?
It is not in the CPU and memory scaler docs, but it is here: https://keda.sh/docs/2.18/reference/scaledobject-spec/#fallback

@inrdkec 🙏 Thanks for pointing out the right place in the documentation.
Since the constraint of having fallback only for `AverageValue` seems to me kinda unwarranted, it's relaxed a bit here, and can be removed altogether if we opt for another implementation.

The constraint now becomes that the `scaleTargetRef` object has a field `.status.readyReplicas` that its controller updates with the number of ready replicas, so that we can use it directly. This is de facto the case with Deployments/StatefulSets/ReplicaSets/Argo Rollouts.

We can then generically fetch the object as `unstructured` and access the value of that field to divide by it. A brief math illustration, starting with the HPA's equation:

    desiredReplicas = ceil[currentReplicas * (currentMetricValue / desiredMetricValue)]

By passing

    currentMetricValue = desiredMetricValue * fallbackReplicas / currentReplicas

we end up with

    desiredReplicas = ceil[currentReplicas * ((desiredMetricValue * fallbackReplicas / currentReplicas) / desiredMetricValue)]
                    = ceil[currentReplicas * (fallbackReplicas / currentReplicas)]
                    = ceil[fallbackReplicas]
                    = fallbackReplicas

Emphasis: `currentReplicas` in the HPA's equation is the number of ready replicas.
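The derivation above can be checked numerically. A minimal Go sketch (helper names are hypothetical, not KEDA's actual code):

```go
package main

import (
	"fmt"
	"math"
)

// fallbackMetricValue is the metric value reported during fallback so that the
// HPA's formula resolves to exactly fallbackReplicas, per the derivation above.
func fallbackMetricValue(desiredMetricValue float64, fallbackReplicas, readyReplicas int64) float64 {
	return desiredMetricValue * float64(fallbackReplicas) / float64(readyReplicas)
}

// hpaDesiredReplicas mirrors the HPA scaling equation:
// desiredReplicas = ceil(currentReplicas * currentMetricValue / desiredMetricValue)
func hpaDesiredReplicas(currentReplicas int64, currentMetricValue, desiredMetricValue float64) int64 {
	return int64(math.Ceil(float64(currentReplicas) * currentMetricValue / desiredMetricValue))
}

func main() {
	desired := 100.0             // target for a Value-type metric
	fallbackReplicas := int64(5) // spec.fallback.replicas
	readyReplicas := int64(2)    // as read from .status.readyReplicas

	reported := fallbackMetricValue(desired, fallbackReplicas, readyReplicas) // 250
	fmt.Println(hpaDesiredReplicas(readyReplicas, reported, desired))         // 5
}
```

Note that the ceil in the HPA formula lands the result on exactly `fallbackReplicas` even when `fallbackReplicas / currentReplicas` is not a whole number.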
I preferred this approach to the alternative (which would remove the `.status.readyReplicas` field constraint) of manually counting the number of ready pods (similar to what the HPA itself does), since it'd be quite involved. If that seems like a better approach to you, I can implement it.

For full clarity, one problematic nit with this: we're dependent on the object's controller updating `.status.readyReplicas` in a timely manner. If there is a lag, then the `currentReplicas` the HPA multiplies by (which it gets by manually counting pods) would deviate from the `currentReplicas` value we divide by. If ours is less, we'd scale higher than `fallbackReplicas` for a brief while; if ours is more, we'd scale lower than `fallbackReplicas`. Either way, we should eventually stabilize at exactly `fallbackReplicas`.

A final, unrelated small change for correctness's sake (which I think should be fixed regardless of this PR): the line

    Value: *resource.NewMilliQuantity(normalisationValue*1000*replicas, resource.DecimalSI),

has been changed to

    Value: *resource.NewMilliQuantity(int64(normalisationValue*1000*replicas), resource.DecimalSI),

with `normalisationValue` now being a `float64` instead of an int. This prevents casting to int early, which would discard fractions in `normalisationValue` that the multiplication by 1000 would otherwise preserve.

Imagine `normalisationValue = 2.5` and `fallbackReplicas = 10`. Previously, `Value = 2*1000*10 = 20000`; now, `Value = int64(2.5*1000*10) = int64(25000.0) = 25000`. Obviously the former would cause the HPA not to scale to `fallbackReplicas` exactly.
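The truncation difference can be reproduced in a few lines. A standalone sketch, not the actual KEDA code:

```go
package main

import "fmt"

func main() {
	normalisationValue := 2.5 // fallback value per replica, as a float64
	replicas := int64(10)

	// Casting to int64 before multiplying discards the 0.5 fraction.
	early := int64(normalisationValue) * 1000 * replicas

	// Casting after multiplying by 1000 preserves it in milli-units.
	late := int64(normalisationValue * 1000 * float64(replicas))

	fmt.Println(early, late) // 20000 25000
}
```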
For the unchecked list item, if this looks good, I'll open a PR to the docs repo.
Checklist
Fixes #4205