apiserver: add --shutdown-delay-duration to keep serving until LBs stop sending traffic by sttts · Pull Request #74416 · kubernetes/kubernetes

sttts · 2019-02-22T14:16:05Z

This is meant to delay the apiserver shutdown for a defined time duration in order to give the SDN a chance to update changed endpoints.

The reconciler is part of the "master controller", also called the "bootstrap controller". It has a pre-shutdown hook triggered by the stopCh. We delay closing the internalStopCh, which is what triggers the server to stop serving.

Add --shutdown-delay-duration to kube-apiserver in order to delay a graceful shutdown. `/healthz` will keep returning success during this time and requests are served normally, but `/readyz` will return failure immediately. This delay can be used to allow the SDN to update iptables on all nodes and stop sending traffic.

sttts · 2019-02-22T14:18:00Z

/assign @deads2k

k8s-ci-robot · 2019-02-22T14:21:36Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: sttts

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~staging/src/k8s.io/apiserver/OWNERS~~ [sttts]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

sttts · 2019-02-22T19:59:25Z

/assign @stewart-yu

stewart-yu · 2019-02-23T00:47:33Z

staging/src/k8s.io/apiserver/pkg/server/options/server_run_options.go

Can we remove "" at the end of line 184?

I am just following the style in this file.

deads2k · 2019-02-25T19:11:30Z

This lgtm. It helps limit an unnecessary race. It's not foolproof because we cannot be aware of all of our consumers, but it makes it possible to avoid unnecessary dead endpoints.

@kubernetes/sig-api-machinery-misc

lavalamp · 2019-02-25T19:33:12Z

/hold

not that I necessarily disagree, but I think this might be a big enough addition to the surface area that I want to think about it for a second.

sttts · 2019-07-05T12:05:33Z

Rebased.

@lavalamp @logicalhan please cancel the hold here.

sttts · 2019-07-09T08:41:20Z

/retest

logicalhan · 2019-07-09T20:10:45Z

/lgtm

logicalhan · 2019-07-11T20:20:02Z

/hold cancel

fejta-bot · 2019-07-11T22:42:52Z

/retest
This bot automatically retries jobs that failed/flaked on approved PRs (send feedback to fejta).

Review the full test history for this PR.

Silence the bot with an /lgtm cancel or /hold comment for consistent failures.

fejta-bot · 2019-07-12T01:09:51Z

/retest

k8s-ci-robot · 2019-07-12T01:47:47Z

@sttts: The following test failed, say /retest to rerun them all:

| Test name | Commit | Details | Rerun command |
| --- | --- | --- | --- |
| pull-kubernetes-e2e-gce-100-performance | `408f36b` | link | `/test pull-kubernetes-e2e-gce-100-performance` |

We set the shutdown delay to `20s`. During this delay kube-apiserver keeps serving requests normally and `/healthz` returns success, but `/readyz` immediately returns failure. Graceful termination starts after the delay has elapsed. We use this to give load balancers time to stop sending traffic to this server (the SDN has time to update iptables on all nodes). Previously, kube-apiserver stopped serving requests while traffic could still be sent to it. See kubernetes/kubernetes#74416

k8s-ci-robot assigned deads2k Feb 22, 2019

sttts added the kind/feature label Feb 22, 2019

k8s-ci-robot removed the needs-kind label Feb 22, 2019

sttts added the sig/api-machinery label Feb 22, 2019

k8s-ci-robot removed the needs-sig label Feb 22, 2019

sttts added the priority/important-soon and needs-sig labels Feb 22, 2019

k8s-ci-robot removed the needs-priority label Feb 22, 2019

sttts removed the needs-sig label Feb 22, 2019

k8s-ci-robot added the approved and area/apiserver labels Feb 22, 2019

k8s-ci-robot requested review from tallclair and xiangpengzhao February 22, 2019 14:22

k8s-ci-robot assigned stewart-yu Feb 22, 2019

stewart-yu reviewed Feb 23, 2019

sttts force-pushed the sttts-apiserver-minimum-shutdown-duration branch from 1ec3faf to 6418b0c February 25, 2019 14:51

k8s-ci-robot added the size/M label and removed the size/S label Feb 25, 2019

k8s-ci-robot added the do-not-merge/hold label Feb 25, 2019

sttts force-pushed the sttts-apiserver-minimum-shutdown-duration branch from 77bb860 to bd1f77a July 5, 2019 12:05

k8s-ci-robot removed the lgtm label Jul 5, 2019

sttts added the lgtm label Jul 5, 2019

logicalhan reviewed Jul 9, 2019

staging/src/k8s.io/apiserver/pkg/server/genericapiserver.go (outdated, resolved)

sttts force-pushed the sttts-apiserver-minimum-shutdown-duration branch from bd1f77a to e0d6b98 July 9, 2019 20:00

k8s-ci-robot removed the lgtm label Jul 9, 2019

apiserver: add --shutdown-delay-duration to keep serving until LBs st…

408f36b

…op serving traffic

sttts force-pushed the sttts-apiserver-minimum-shutdown-duration branch from e0d6b98 to 408f36b July 9, 2019 20:09

k8s-ci-robot added the lgtm label Jul 9, 2019

k8s-ci-robot removed the do-not-merge/hold label Jul 11, 2019

k8s-ci-robot merged commit 7e17aeb into kubernetes:master Jul 12, 2019

php-coder mentioned this pull request Sep 25, 2019

Fix typo in URI kubernetes/website#16522

Merged

linki mentioned this pull request Feb 11, 2020

Shutdown masters more gracefully zalando-incubator/kubernetes-on-aws#2972

Merged

liggitt mentioned this pull request Feb 19, 2020

--shutdown-delay-duration causes kube-apiserver to shut down shortly after startup if set to 20s or more #88293

Closed

Joseph-Goergen mentioned this pull request Feb 26, 2020

Add livez and readyz to liveness and readiness probes openshift/hypershift-toolkit#125

Merged

timuthy mentioned this pull request Aug 25, 2021

Re-discover K8s version during shoot reconciliation gardener/gardener#4554

Merged

7 participants
