Reworking kube-proxy to only compute endpointChanges on apply by robscott · Pull Request #83206 · kubernetes/kubernetes

robscott · 2019-09-26T18:49:34Z

What type of PR is this?
/kind cleanup

What this PR does / why we need it:
Computing EndpointChanges is a relatively expensive operation for kube-proxy when Endpoint Slices are used. This had been computed on every EndpointSlice update which became quite inefficient at high levels of scale when multiple EndpointSlice update events would be triggered before a syncProxyRules call.

Profiling results showed that computing this on each update could consume ~80% of total kube-proxy CPU utilization at high levels of scale. This change reduced that to as little as 3% of total kube-proxy utilization at high levels of scale.

It's worth noting that the difference is minimal when there is a 1:1 relationship between EndpointSlice updates and proxier syncs. This is primarily beneficial when there are many EndpointSlice updates between proxier sync loops.

Does this PR introduce a user-facing change?:

Significant kube-proxy performance improvements when using Endpoint Slices at scale.

Additional documentation e.g., KEPs (Kubernetes Enhancement Proposals), usage docs, etc.:

Enhancement Issue: EndpointSlice API enhancements#752

/sig network
/cc @freehan
/priority important-longterm

robscott · 2019-09-26T21:48:55Z

/retest

freehan · 2019-10-02T23:44:05Z

pkg/proxy/endpoints.go

add a unit test for this?

may need a more significant name than getChangesAndMarkApplied. This function essentailly emits the changes in cache and refresh the cache. Plus, maybe other stuff

Naming Idea:
CheckoutChanges

Thanks, CheckoutChanges is a much better name. Added some unit tests for this as well.

freehan · 2019-10-02T23:45:44Z

pkg/proxy/endpointslicecache.go

It may helped the performance more by not comparing the difference and just trigger sync as always.

But it may not make any difference given the test result.

Unfortunately both the EndpointChangesPending metric and the last change trigger times rely on this being available on EndpointSlice addition (computing at proxier sync would not be helpful). The profiling I've done suggests that the full updatePending function, including this diffing, takes ~3% of total kube-proxy compute time (0.12s of 3.61s scaling to 10k endpoints). The actual esInfoChanged call within that is small enough to not be reported. Given the relative insignificance of that time I think it's worth maintaining the metrics as they exist.

pkg/proxy/endpoints.go

freehan · 2019-10-02T23:51:01Z

pkg/proxy/endpoints.go

consider moving all these into getChangesAndMarkApplied

I couldn't find a great way to integrate it there, but I did move this into a similarly named function.

pkg/proxy/endpoints.go

pkg/proxy/endpointslicecache.go

freehan · 2019-10-03T18:42:15Z

Consider putting endpointslicecache.go into a different package (e.g. endpointslice/util or apimachinery), and define the interface better for reuse. Other consumer will need similar logic for this.

robscott · 2019-10-07T20:07:50Z

/retest

robscott · 2019-10-08T00:29:25Z

/retest

freehan

/assign @freehan

The interface of EndpointSliceCache, EndpointTracker is not too isolated. Hence making the synchronization and locking more unclear. But it seems not changing the existing flow.

pkg/proxy/endpointslicecache.go

freehan · 2019-10-14T18:39:58Z

pkg/proxy/endpoints.go

nit: consider enclose the critical section into a built in func

func(){ ect.lock.Lock() defer ect.lock.Unlock() ... }()

you have lock in endpointSliceCache already. Do you still need lock here?

After discussing this a bit more I think it makes sense to hold off on any additional changes here until endpointslicecache can be moved into its own package.

Computing EndpointChanges is a relatively expensive operation for kube-proxy when Endpoint Slices are used. This had been computed on every EndpointSlice update which became quite inefficient at high levels of scale when multiple EndpointSlice update events would be triggered before a syncProxyRules call. Profiling results showed that computing this on each update could consume ~80% of total kube-proxy CPU utilization at high levels of scale. This change reduced that to as little as 3% of total kube-proxy utilization at high levels of scale. It's worth noting that the difference is minimal when there is a 1:1 relationship between EndpointSlice updates and proxier syncs. This is primarily beneficial when there are many EndpointSlice updates between proxier sync loops.

freehan

/lgtm
/approve

k8s-ci-robot · 2019-10-16T00:25:09Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: freehan, robscott

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details

Needs approval from an approver in each of these files:

~~pkg/proxy/OWNERS~~ [freehan]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

k8s-ci-robot requested a review from freehan September 26, 2019 18:49

robscott force-pushed the endpointslice-proxy-endpointchange-perf branch from 7442a93 to 24dc65f Compare September 26, 2019 20:12

freehan reviewed Oct 2, 2019

View reviewed changes

robscott force-pushed the endpointslice-proxy-endpointchange-perf branch 4 times, most recently from 98040fc to e59fdf7 Compare October 7, 2019 17:40

k8s-ci-robot added the sig/apps Categorizes an issue or PR as relevant to SIG Apps. label Oct 7, 2019

robscott force-pushed the endpointslice-proxy-endpointchange-perf branch from e59fdf7 to 8178d13 Compare October 7, 2019 20:23

freehan reviewed Oct 14, 2019

View reviewed changes

k8s-ci-robot assigned freehan Oct 14, 2019

robscott force-pushed the endpointslice-proxy-endpointchange-perf branch from 8178d13 to 8e7de45 Compare October 15, 2019 23:31

k8s-ci-robot added size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. and removed size/L Denotes a PR that changes 100-499 lines, ignoring generated files. labels Oct 15, 2019

freehan reviewed Oct 16, 2019

View reviewed changes

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Oct 16, 2019

k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Oct 16, 2019

k8s-ci-robot merged commit 30ba9f6 into kubernetes:master Oct 16, 2019

k8s-ci-robot added this to the v1.17 milestone Oct 16, 2019

robscott deleted the endpointslice-proxy-endpointchange-perf branch March 11, 2021 04:56

Conversation

robscott commented Sep 26, 2019

Uh oh!

robscott commented Sep 26, 2019

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

robscott Oct 4, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

freehan commented Oct 3, 2019

Uh oh!

robscott commented Oct 7, 2019

Uh oh!

robscott commented Oct 8, 2019

Uh oh!

freehan left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

freehan left a comment

Choose a reason for hiding this comment

Uh oh!

k8s-ci-robot commented Oct 16, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

robscott Oct 4, 2019 •

edited

Loading