vmcluster/statefulset: run rolling upgrades in batches of multiple pods by vpedosyuk · Pull Request #1458 · VictoriaMetrics/operator

vpedosyuk · 2025-07-07T11:49:23Z

This PR introduces a new VMCluster CRD property called maxUnavailable in:

spec.vmstorage.rollingUpdateStrategyBehavior.maxUnavailable
spec.vmselect.rollingUpdateStrategyBehavior.maxUnavailable

It's very similar to this (alpha) feature of Kubernetes:
https://kubernetes.io/docs/concepts/workloads/controllers/statefulset/#maximum-unavailable-pods

This PR makes it work with the OnDelete upgrade policy used in the operator by default.

Additionally, this PR makes the feature available on much older Kubernetes versions and, for example, on GKE clusters that don't support alpha features.

Some examples for the maxUnavailable field:

maxUnavailable: 1 - default, which mostly (see details on PDBs below) follows the current behavior of the operator.
maxUnavailable: 2 - for cases when -replicationFactor=3.
maxUnavailable: 100% - for cases when the "minimum downtime strategy" is preferable.
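
To make the examples above concrete, here is a minimal sketch of how a maxUnavailable value could be turned into upgrade batches. The names `resolveMaxUnavailable` and `batches` are hypothetical and are not taken from the operator's code; rounding percentages up and keeping the result between 1 and the replica count are assumptions made for illustration.

```go
package main

import (
	"fmt"
	"strconv"
	"strings"
)

// resolveMaxUnavailable turns a maxUnavailable value ("2", "100%", or "" for
// the default) into a pod count. Hypothetical helper, not the operator's code.
// Assumptions: percentages round up, and the result is at least 1 and at most
// the replica count.
func resolveMaxUnavailable(value string, replicas int) (int, error) {
	value = strings.TrimSpace(value)
	if value == "" {
		value = "1" // the default named in the PR description
	}
	var n int
	if strings.HasSuffix(value, "%") {
		pct, err := strconv.Atoi(strings.TrimSuffix(value, "%"))
		if err != nil || pct < 0 || pct > 100 {
			return 0, fmt.Errorf("invalid percentage %q", value)
		}
		n = (replicas*pct + 99) / 100 // integer ceiling
	} else {
		v, err := strconv.Atoi(value)
		if err != nil || v < 0 {
			return 0, fmt.Errorf("invalid pod count %q", value)
		}
		n = v
	}
	if n > replicas {
		n = replicas
	}
	if n < 1 {
		n = 1
	}
	return n, nil
}

// batches splits pod names into consecutive groups of at most size pods.
func batches(pods []string, size int) [][]string {
	if size < 1 {
		size = 1
	}
	var out [][]string
	for start := 0; start < len(pods); start += size {
		end := start + size
		if end > len(pods) {
			end = len(pods)
		}
		out = append(out, pods[start:end])
	}
	return out
}

func main() {
	pods := []string{"vmstorage-0", "vmstorage-1", "vmstorage-2", "vmstorage-3", "vmstorage-4"}
	size, err := resolveMaxUnavailable("2", len(pods))
	if err != nil {
		panic(err)
	}
	for i, b := range batches(pods, size) {
		fmt.Printf("batch %d: %v\n", i+1, b)
	}
}
```

With `maxUnavailable: 1` every batch holds a single pod, which corresponds to the one-pod-at-a-time behavior; with `100%` all pods land in one batch.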

Since more than one pod can now become unavailable during an upgrade (i.e. a voluntary disruption), it's important to respect the PDB configuration to some degree. So this PR introduces partial support for PDBs targeting the StatefulSets of a VMCluster, according to this:
https://kubernetes.io/docs/tasks/run-application/configure-pdb/#arbitrary-controllers-and-selectors

This is achieved with the Eviction API instead of directly deleting Pods, as described here:
https://kubernetes.io/docs/concepts/scheduling-eviction/api-eviction/
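
As an illustration of the difference from a plain Pod delete, the sketch below builds the request that the Eviction API expects and interprets the response codes listed in the Kubernetes documentation linked above. It uses only the Go standard library; `evictionRequest` and `classify` are hypothetical names, the pod and namespace are made-up values, and the operator itself may well go through a Kubernetes client library instead.

```go
package main

import (
	"encoding/json"
	"fmt"
	"net/http"
)

// eviction mirrors the minimal policy/v1 Eviction object sent to the API server.
type eviction struct {
	APIVersion string       `json:"apiVersion"`
	Kind       string       `json:"kind"`
	Metadata   evictionMeta `json:"metadata"`
}

type evictionMeta struct {
	Name      string `json:"name"`
	Namespace string `json:"namespace"`
}

// evictionRequest returns the path and JSON body for a POST that asks the
// API server to evict the given pod. Hypothetical helper for illustration.
func evictionRequest(namespace, pod string) (string, []byte, error) {
	path := fmt.Sprintf("/api/v1/namespaces/%s/pods/%s/eviction", namespace, pod)
	body, err := json.Marshal(eviction{
		APIVersion: "policy/v1",
		Kind:       "Eviction",
		Metadata:   evictionMeta{Name: pod, Namespace: namespace},
	})
	return path, body, err
}

// classify maps an HTTP status from the eviction subresource to an action.
// Simplification: 429 is treated as "blocked by a PDB", although it can also
// come from API rate limiting; any other non-success code is an error.
func classify(status int) string {
	switch status {
	case http.StatusOK, http.StatusCreated:
		return "evicted"
	case http.StatusTooManyRequests:
		return "blocked-by-pdb"
	default:
		return "error"
	}
}

func main() {
	path, body, err := evictionRequest("monitoring", "vmstorage-0")
	if err != nil {
		panic(err)
	}
	fmt.Println("POST", path)
	fmt.Println(string(body))
	fmt.Println(classify(http.StatusTooManyRequests))
}
```

A caller that receives the "blocked-by-pdb" outcome would typically wait and retry rather than delete the Pod directly, which is what keeps the disruption budget intact.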

And since it appears that "write" RBAC permission for Pods is no longer needed, I've moved it to the "read-only" section.

Please let me know if this makes sense!

internal/controller/operator/factory/reconcile/statefulset.go

f41gh7

Thanks for the pull request, it looks good to me!

Could you please mention the new API field in docs/CHANGELOG.md as a feature?

Also, it looks like this PR fixes a possible breach of the PodDisruptionBudget policy by using Eviction instead of Delete.

f41gh7

LGTM

f41gh7 · 2025-07-08T08:05:47Z

Thanks for the contribution!

vmcluster/statefulset: run rolling upgrades in batches of multiple pods

ffda073

vpedosyuk marked this pull request as ready for review July 7, 2025 11:52

vpedosyuk requested review from AndrewChubatiuk, Haleygo and f41gh7 as code owners July 7, 2025 11:52

f41gh7 reviewed Jul 7, 2025

internal/controller/operator/factory/reconcile/statefulset.go

f41gh7 reviewed Jul 7, 2025

add a feature reference into the changelog file

62d27d0

f41gh7 approved these changes Jul 8, 2025

f41gh7 merged commit b0afe6f into VictoriaMetrics:master Jul 8, 2025

f41gh7 added a commit that referenced this pull request Jul 9, 2025

docs: mention statefulset reconcile bugfix

59693a5

Previously the operator deleted pods without checking PodDisruptionBudget constraints. This was fixed in #1458.
Signed-off-by: f41gh7 <nik@victoriametrics.com>

thejuan mentioned this pull request Nov 26, 2025

vmcluster maxUnavailable clashes with PDB #1640

Open

f41gh7 merged 2 commits into VictoriaMetrics:master from vpedosyuk:feature/max-unavailable