Disable cgroups-per-qos pending Burstable/cpu.shares being set by derekwaynecarr · Pull Request #42052 · kubernetes/kubernetes

derekwaynecarr · 2017-02-24T15:18:57Z

Disable cgroups-per-qos to allow kubemark problems to still be resolved.

Re-enable it once the following merge:
#41753
#41644
#41621

Enabling it before cpu.shares is set on qos tiers can cause regressions since Burstable and BestEffort pods are given equal time.

k8s-reviewable · 2017-02-24T15:19:06Z

This change is

derekwaynecarr · 2017-02-24T15:20:11Z

FYI @vishh @dchen1107 @sjenning @ncdc @sjenning

vishh · 2017-02-24T15:48:42Z

what was the symptom? are pods being starved?

derekwaynecarr · 2017-02-24T15:50:28Z

@vishh -- that was one theory as pods in question were in burstable tier. @sjenning -- is running kubemark runs in the interim so we can gather more data, but this should unblock folks.

vishh · 2017-02-24T15:51:38Z

it's not obvious what the conclusion from #42000 was.

ncdc · 2017-02-24T15:53:03Z

@k8s-bot non-cri e2e test this #41893 #39821

vishh · 2017-02-24T15:53:46Z

Ah. are they being CPU starved? this reminds me of bugs in node allocatable level too

derekwaynecarr · 2017-02-24T15:54:06Z

@vishh -- my theory was they are starved since it will be 1024 shares, but it was just a theory.

ncdc · 2017-02-24T15:54:27Z

@k8s-bot kubemark e2e test this kubernetes/test-infra#2012

vishh · 2017-02-24T15:55:25Z

/LGTM
/approve

k8s-github-robot · 2017-02-24T15:55:36Z

[APPROVALNOTIFIER] This PR is APPROVED

The following people have approved this PR: derekwaynecarr, vishh

Needs approval from an approver in each of these OWNERS Files:

~~pkg/apis/componentconfig/OWNERS~~ [vishh]

You can indicate your approval by writing /approve in a comment
You can cancel your approval by writing /approve cancel in a comment

derekwaynecarr · 2017-02-24T16:02:26Z

I am bumping priority on this.

fejta · 2017-02-24T16:35:10Z

@k8s-bot kubemark e2e test this kubernetes/test-infra#2012

wojtek-t · 2017-02-24T16:49:57Z

LGTM - thanks!

derekwaynecarr · 2017-02-24T22:13:20Z

kubemark is hot looping, that needs to be fixed, this can merge in the interim, but we need to root cause why kubemark is hot-looping.

#42000 (comment)

ncdc · 2017-02-24T22:15:02Z

Cross-posting here - it's the real kubelet that is hot looping too

dchen1107 · 2017-02-24T23:00:46Z

Can someone please help me to fill the gap here? Why cgroup-per-qos might cause issue for Pod startup latency regression? Shouldn't we agreed at sig-node meeting for 1.6 release, by default, all cgroup-per-qos should be unlimited? Each Kubernetes vendor decide the limit later based on the performance benchmark and other monitoring stats?

Or we mistakenly set the limit for each top cgroup?

derekwaynecarr · 2017-02-24T23:04:36Z

see: #42000 (comment)

we are not yet setting cpu shares on qos tier (which is required otherwise there is a regression under contention).

dchen1107 · 2017-02-25T00:27:28Z

@derekwaynecarr This is exactly why I am confused. I thought I raised this concern at sig-node meeting, and finally we agreed on the following regarding to NodeAllocatable & QoS tree rollout in 1.6 release:

Step 1: Creating all top level QoS cgroup and per pod cgroup, but unlimit them (hence: set the limit to something equivalent to the node capacity / node allocatable)
Step 2: Introduce another flag for enforcement based on QoS design, but disable it by default.

But based on #42000 (comment), it looks like we messed up with step 1. Instead of unlimit those top-level cgroup, we unset them. At least for burstable cpu cgroup, it has 1024 which looks like an unset value to me.

EDITED: Forget this comment here. I realized there would be another set of issue. :-)

vishh · 2017-02-25T04:15:57Z

@dchen1107

The issue is that the default value for cpu shares is 1024. Even if we set the top level cgroup to node capacity, all its children QoS level and pod level cgroups will get 1024 as default cpu shares.
This kernel behavior leads to regression in CPU isolation.

k8s-github-robot · 2017-02-25T10:17:54Z

Automatic merge from submit-queue (batch tested with PRs 41714, 41510, 42052, 41918, 31515)

vishh · 2017-02-26T22:37:48Z

@derekwaynecarr when re-enabling --cgroups-per-qos, also set --enforce-node-allocatable to pods.

Disble cgroups-per-qos pending Burstable/cpu.shares being set

36f4256

k8s-ci-robot added the cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. label Feb 24, 2017

k8s-github-robot assigned jessfraz Feb 24, 2017

k8s-github-robot added size/XS Denotes a PR that changes 0-9 lines, ignoring generated files. release-note-label-needed labels Feb 24, 2017

derekwaynecarr mentioned this pull request Feb 24, 2017

Revert "Enable pod level cgroups by default" #42047

Closed

derekwaynecarr added release-note-none Denotes a PR that doesn't merit a release note. and removed release-note-label-needed labels Feb 24, 2017

derekwaynecarr assigned vishh and wojtek-t and unassigned jessfraz Feb 24, 2017

derekwaynecarr added this to the v1.6 milestone Feb 24, 2017

derekwaynecarr changed the title ~~Disble cgroups-per-qos pending Burstable/cpu.shares being set~~ Disable cgroups-per-qos pending Burstable/cpu.shares being set Feb 24, 2017

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Feb 24, 2017

k8s-github-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Feb 24, 2017

derekwaynecarr added the priority/P1 label Feb 24, 2017

k8s-github-robot merged commit a93904e into kubernetes:master Feb 25, 2017

derekwaynecarr mentioned this pull request Mar 2, 2017

enable cgroups tiers and node allocatable enforcement on pods by default. #42350

Merged

Conversation

derekwaynecarr commented Feb 24, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

k8s-reviewable commented Feb 24, 2017

Uh oh!

derekwaynecarr commented Feb 24, 2017

Uh oh!

vishh commented Feb 24, 2017

Uh oh!

derekwaynecarr commented Feb 24, 2017

Uh oh!

vishh commented Feb 24, 2017

Uh oh!

ncdc commented Feb 24, 2017

Uh oh!

vishh commented Feb 24, 2017

Uh oh!

derekwaynecarr commented Feb 24, 2017

Uh oh!

ncdc commented Feb 24, 2017

Uh oh!

vishh commented Feb 24, 2017

Uh oh!

k8s-github-robot commented Feb 24, 2017

Uh oh!

derekwaynecarr commented Feb 24, 2017

Uh oh!

fejta commented Feb 24, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

wojtek-t commented Feb 24, 2017

Uh oh!

derekwaynecarr commented Feb 24, 2017

Uh oh!

ncdc commented Feb 24, 2017

Uh oh!

dchen1107 commented Feb 24, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

derekwaynecarr commented Feb 24, 2017

Uh oh!

dchen1107 commented Feb 25, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

vishh commented Feb 25, 2017

Uh oh!

k8s-github-robot commented Feb 25, 2017

Uh oh!

vishh commented Feb 26, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

10 participants

derekwaynecarr commented Feb 24, 2017 •

edited

Loading

fejta commented Feb 24, 2017 •

edited

Loading

dchen1107 commented Feb 24, 2017 •

edited

Loading

dchen1107 commented Feb 25, 2017 •

edited

Loading