
Adjust worker goroutines to number of backend connections#3611

Merged
MichaelEischer merged 2 commits into restic:master from MichaelEischer:auto-concurrency on Jul 3, 2022

Conversation

@MichaelEischer (Member) commented Dec 29, 2021

What does this PR change? What problem does it solve?

Adapt worker count based on whether an operation is CPU or IO-bound.

Use runtime.GOMAXPROCS(0) as the worker count for CPU-bound tasks, repo.Connections() for IO-bound tasks, and a combination if a task can be both.

Typical IO-bound tasks are downloading / uploading / deleting files. Decoding / encoding / verifying are usually CPU-bound. Several tasks are a combination of both, e.g. combined download-and-decode functions. In the latter case, both limits are added together. As the backends have their own concurrency limits, restic still won't download more than repo.Connections() files in parallel, but the additional workers can decode already downloaded data in parallel.

This PR also includes the commits from #3489.
The last commit "Adapt concurrency to streaming check --read-data and prune" only makes sense together with #3484.

Alternatives:

The current design still requires users to adjust one knob, namely the connections limit. restic could instead use e.g. the CPU count as a proxy for system performance and scale the number of backend connections from it, with a different scaling factor per operation. The downside is that CPU count and network connection can be correlated (e.g. for servers in the cloud) but don't have to be, so that heuristic would only work well in some cases.

Finding the optimal number of goroutines automatically might also be possible with a self-tuning control loop, but that would complicate things quite a bit.

Was the change previously discussed in an issue or on the forum?

Fixes #2162
Fixes #1467


@greatroar (Contributor) commented:

This looks sensible. The main downside is that crash reports become much longer with more goroutines. However, shouldn't more channels be buffered to ensure smooth throughput?

@MichaelEischer (Member, Author) commented:

The main downside is that crash reports become much longer with more goroutines.

I guess that's a price worth paying if it helps with the throughput.

However, shouldn't more channels be buffered to ensure smooth throughput?

The idea was to compensate for that by adding as many goroutines as there are CPU cores in methods where substantial processing is necessary before further IO.
In most places the IO tasks are also generated by submitting a preexisting list on a channel. My expectation is that the IO is "slow" enough that we shouldn't run into too much congestion on the channel.

Use runtime.GOMAXPROCS(0) as worker count for CPU-bound tasks,
repo.Connections() for IO-bound task and a combination if a task can be
both. Streaming packs is treated as IO-bound, as adding more workers
cannot provide a speedup.

Typical IO-bound tasks are download / uploading / deleting files.
Decoding / Encoding / Verifying are usually CPU-bound. Several tasks are
a combination of both, e.g. for combined download and decode functions.
In the latter case add both limits together. As the backends have their
own concurrency limits restic still won't download more than
repo.Connections() files in parallel, but the additional workers can
decode already downloaded data in parallel.
@MichaelEischer (Member, Author) commented:

This PR completes the upload pipeline reworking started by #3489 and several other PRs. It (hopefully) addresses some of the scalability issues that have shown up with high-latency backends.



Development

Successfully merging this pull request may close these issues.

handle large prune much more efficent
Make listPackWorkers a configuration option