fix:cri: Stable order for StatusResponse.RuntimeHandlers by EricMountain · Pull Request #12036 · containerd/containerd

EricMountain · 2025-06-30T08:44:04Z

The RuntimeHandlers list in the response to the CRI Status() method (crictl info) has unstable ordering
since commit 97eb1cd (underlying switch from list to map) that was shipped in v2.1.0.

On Kubernetes nodes this causes the kubelet to update node status subresources
every time the order of runtime handlers changes in the status response from
containerd. The likelihood increases with the number of runtime handlers present
on nodes. In some clusters this leads to every single node sending a status update
every few seconds, leading in turn to excessive Kube API server load.

This change enforces stable ordering on runtime handler names to restore the original behaviour.
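
As an illustration of the approach described above, here is a minimal sketch of how a map of runtime handlers can be turned into a deterministically ordered list. It is not the code from this PR: the `RuntimeHandler` type and the `stableHandlers` helper are hypothetical stand-ins, and the map's value type is an assumption. Go does not guarantee any map iteration order, so the sketch collects the keys and sorts them before building the list.

```go
package main

import (
	"fmt"
	"sort"
)

// RuntimeHandler is a hypothetical stand-in for a runtime handler entry
// in the CRI status response; only the name matters for this sketch.
type RuntimeHandler struct {
	Name string
}

// stableHandlers (hypothetical helper) builds the handler list from a map.
// Map iteration order is unspecified in Go, so the keys are collected and
// sorted first; repeated calls then return the handlers in the same order.
func stableHandlers(runtimes map[string]struct{}) []RuntimeHandler {
	names := make([]string, 0, len(runtimes))
	for name := range runtimes {
		names = append(names, name)
	}
	sort.Strings(names)

	handlers := make([]RuntimeHandler, 0, len(names))
	for _, name := range names {
		handlers = append(handlers, RuntimeHandler{Name: name})
	}
	return handlers
}

func main() {
	// Example handler names, chosen for illustration only.
	runtimes := map[string]struct{}{"runc": {}, "kata": {}, "gvisor": {}}
	for _, h := range stableHandlers(runtimes) {
		fmt.Println(h.Name)
	}
}
```

With a list built this way, a consumer that compares consecutive status responses sees identical handler lists as long as the set of handlers itself is unchanged.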

k8s-ci-robot · 2025-06-30T08:44:15Z

Hi @EricMountain. Thanks for your PR.

I'm waiting for a containerd member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.


Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

Signed-off-by: Eric Mountain <eric.mountain@datadoghq.com>

The runtimeHandlers list in the response to `crictl info` has had unstable ordering since commit 97eb1cd (underlying switch from list to map), which shipped in v2.1.0. On Kubernetes nodes this causes the kubelet to update node status subresources every time the order of runtime handlers changes in the status response from containerd. The likelihood increases with the number of runtime handlers present on nodes. In some clusters this leads to every single node sending a status update every few seconds, leading to excessive Kube API server load. This change enforces stable ordering on runtime handler names. Signed-off-by: Eric Mountain <eric.mountain@datadoghq.com>

Signed-off-by: Eric Mountain <eric.mountain@datadoghq.com>

AkihiroSuda · 2025-07-01T22:31:18Z

/ok-to-test

AkihiroSuda · 2025-07-02T00:52:19Z

/cherry-pick release/2.1

k8s-infra-cherrypick-robot · 2025-07-02T00:52:57Z

@AkihiroSuda: new pull request created: #12054


In response to this:

/cherry-pick release/2.1


github-project-automation bot added this to Pull Request Review Jun 30, 2025

github-project-automation bot moved this to Needs Triage in Pull Request Review Jun 30, 2025

k8s-ci-robot added needs-ok-to-test size/L labels Jun 30, 2025

dosubot bot added area/cri Container Runtime Interface (CRI) kind/bug labels Jun 30, 2025

EricMountain force-pushed the eric.mountain/stable-rh-order branch 3 times, most recently from dbdbdfa to 2cc6656 Compare June 30, 2025 13:40

EricMountain added 3 commits July 1, 2025 09:57

Test showing RuntimeHandlers in Status() are unordered

f51a2fb

Signed-off-by: Eric Mountain <eric.mountain@datadoghq.com>

Amend runtime handler test for stable order

eb63b5b

Signed-off-by: Eric Mountain <eric.mountain@datadoghq.com>

EricMountain force-pushed the eric.mountain/stable-rh-order branch from 2cc6656 to eb63b5b Compare July 1, 2025 07:58

AkihiroSuda approved these changes Jul 1, 2025


AkihiroSuda added the cherry-pick/2.1.x Change to be cherry picked to release/2.1 branch label Jul 1, 2025

k8s-ci-robot added ok-to-test and removed needs-ok-to-test labels Jul 1, 2025

djdongjin approved these changes Jul 1, 2025


github-project-automation bot moved this from Needs Triage to Review In Progress in Pull Request Review Jul 1, 2025

AkihiroSuda added this pull request to the merge queue Jul 2, 2025

Merged via the queue into containerd:main with commit 0107c53 Jul 2, 2025
53 checks passed

github-project-automation bot moved this from Review In Progress to Done in Pull Request Review Jul 2, 2025

k8s-infra-cherrypick-robot mentioned this pull request Jul 2, 2025

[release/2.1] Update status response to return stable order for runtime handlers #12054

Merged

austinvazquez added cherry-picked/2.1.x PR commits are cherry picked into the release/2.1 branch and removed cherry-pick/2.1.x Change to be cherry picked to release/2.1 branch labels Jul 21, 2025

jaredledvina mentioned this pull request Sep 12, 2025

[v2.1-dd] Cherry-picks DataDog/containerd#17

Merged

AkihiroSuda merged 3 commits into containerd:main from DataDog:eric.mountain/stable-rh-order


6 participants
