Reuse events used for syncing watchers (#17563)
Do we expect to send events to compacted watchers (the slowest ones), though? I think we should not. WDYT?
I expect that the minRev and rev of victims are above compactedRev; if not, we can just add a protection.
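The protection discussed above can be sketched as a guard that refuses to reuse events for a watcher whose minRev falls at or below the compacted revision. This is a minimal illustration, not etcd's actual code; `syncEvents` and its signature are hypothetical names chosen for the sketch.

```go
package main

import "fmt"

// syncEvents is a hypothetical sketch of the guard discussed above: a watcher
// whose minRev is at or below compactedRev cannot be served from reused
// events and must instead receive a compaction error.
func syncEvents(minRev, compactedRev int64, evs []int64) ([]int64, error) {
	if minRev <= compactedRev {
		// Events at or below compactedRev are gone; the watcher needs
		// to be cancelled with a compaction error, not synced.
		return nil, fmt.Errorf("required revision %d has been compacted (compacted revision %d)", minRev, compactedRev)
	}
	// Reuse the shared event list, keeping only revisions this watcher needs.
	var out []int64
	for _, rev := range evs {
		if rev >= minRev {
			out = append(out, rev)
		}
	}
	return out, nil
}

func main() {
	evs, err := syncEvents(3, 2, []int64{2, 3, 4})
	fmt.Println(evs, err) // events below minRev are filtered out
	_, err = syncEvents(2, 2, []int64{2, 3, 4})
	fmt.Println(err != nil) // minRev <= compactedRev triggers the protection
}
```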
Codecov Report — Attention: Patch coverage is
... and 22 files with indirect coverage changes.

```
@@            Coverage Diff             @@
##             main   #17563      +/-   ##
==========================================
- Coverage   68.80%   68.71%   -0.09%
==========================================
  Files         420      420
  Lines       35599    35619      +20
==========================================
- Hits        24494    24476      -18
- Misses       9675     9712      +37
- Partials     1430     1431      +1
```
Ready to review

/retest
```go
{minRev: 2, maxRev: 3, expectEvents: expectEvents[0:1]},
{minRev: 2, maxRev: 4, expectEvents: expectEvents[0:2]},
```
These two cases look duplicated; see lines 238-239. Would it be better to sort the cases by minRev or maxRev?
Yes, it's intentional. We group multiple cases into scenarios; I added comments to make them easier to see. This better tests the reuse, since the function call has side effects that modify evs.
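The grouping described above can be sketched as a table-driven test where consecutive cases in a scenario operate on the same slice, so a repeated case exercises the reuse path rather than duplicating coverage. All names below are illustrative, not etcd's actual test code.

```go
package main

import "fmt"

type event struct{ rev int64 }

// filterEvents stands in for the function under test: it trims evs in place
// to the [minRev, maxRev) window, which is the side effect the grouped
// scenarios are designed to exercise.
func filterEvents(evs []event, minRev, maxRev int64) []event {
	out := evs[:0] // reuse the backing array: a deliberate side effect
	for _, ev := range evs {
		if ev.rev >= minRev && ev.rev < maxRev {
			out = append(out, ev)
		}
	}
	return out
}

func main() {
	// Each scenario is a sequence of cases applied to the SAME events slice,
	// so the same (minRev, maxRev) pair can appear in several scenarios and
	// still test different states of the mutated input.
	scenarios := [][]struct{ minRev, maxRev int64 }{
		{{2, 3}, {2, 4}}, // scenario 1: narrow first, then widen
		{{2, 4}, {2, 3}}, // scenario 2: same cases in reverse order
	}
	for i, scenario := range scenarios {
		evs := []event{{2}, {3}, {4}} // fresh events per scenario
		for _, c := range scenario {
			evs = filterEvents(evs, c.minRev, c.maxRev)
			fmt.Printf("scenario %d: %d events remain\n", i+1, len(evs))
		}
	}
}
```

The point of the sketch is that once `filterEvents` has mutated `evs`, a later case with identical parameters is no longer redundant, which is why sorting the cases would change what the test covers.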
Signed-off-by: Marek Siarkowicz <siarkowicz@google.com>
ahrtr left a comment:
LGTM
It would be wonderful to have more performance data (e.g. from a K8s scale test) to showcase the benefit of this improvement.
[APPROVALNOTIFIER] This PR is APPROVED. This pull-request has been approved by: ahrtr, serathius.
I don't think K8s scale tests come anywhere close to pushing etcd watch into congestion; if they did, K8s would not work. See kubernetes/kubernetes#123448. That's why I'm using benchmarks. They are pretty good, but without an easily accessible metric they are not used. I'm thinking about doing an MVP to run a benchmark and upload results to K8s perf-dash; however, the problem is that it has a very strict schema requirement, which doesn't match etcd.
Part of #16839
Improve etcd behavior during watch congestion by reducing the memory needed. Congestion causes watch requests to back up and the resync loop to allocate tons of memory. This patch has been shown to reduce the memory needed 5-10 times.
Tested using the command below, with parameters adjusted so the total object size is 20 MB (200 puts × 100 KB values):

```
go run ./tools/benchmark/main.go watch-latency --watch-per-stream 1000 --streams 1 --put-total 200 --val-size 100000
```