Make `restic mount` faster and consume less memory by aawsome · Pull Request #2587 · restic/restic

aawsome · 2020-02-18T16:29:21Z

What is the purpose of this change? What does it change?

In the FUSE implementation, restic data structures were read (that is, blobs were loaded from the repository) and internal data structures were created whenever a FUSE "node" was created. This happened, for instance, for every item within a dir when that dir was accessed. This led to considerable memory usage and made the FUSE filesystem "unresponsive".

This PR changes this behavior and reads restic data structures only when the information is actually needed. Moreover, the internal data structures have been optimized. Concurrent operations within FUSE should now also work correctly. (The FUSE documentation says that access should be synchronized, which it wasn't.)
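
A minimal sketch of the idea described above, not the code from this PR: a directory node keeps only its tree ID and loads the entries on first access, with a mutex guarding the cached state so that concurrent calls are safe. The names `lazyDir`, `treeLoader` and `open` are hypothetical and only serve this illustration.

```go
package main

import (
	"fmt"
	"sync"
)

// treeLoader stands in for reading a tree blob from the repository.
// It is a hypothetical type for this sketch, not restic's API.
type treeLoader func(id string) ([]string, error)

// lazyDir keeps only the tree ID until its entries are actually needed.
type lazyDir struct {
	treeID string
	load   treeLoader

	mu      sync.Mutex // guards entries and loaded; calls may arrive concurrently
	entries []string
	loaded  bool
}

// open loads the entries on first use and reuses them afterwards.
// A failed load is not cached, so a later call retries.
func (d *lazyDir) open() ([]string, error) {
	d.mu.Lock()
	defer d.mu.Unlock()
	if d.loaded {
		return d.entries, nil
	}
	entries, err := d.load(d.treeID)
	if err != nil {
		return nil, err
	}
	d.entries, d.loaded = entries, true
	return d.entries, nil
}

func main() {
	loads := 0
	d := &lazyDir{treeID: "tree-1", load: func(id string) ([]string, error) {
		loads++ // only ever runs while d.mu is held
		return []string{"a.txt", "b.txt"}, nil
	}}
	// Creating the node has not touched the (simulated) repository yet.
	fmt.Println("loads after create:", loads)

	var wg sync.WaitGroup
	for i := 0; i < 4; i++ {
		wg.Add(1)
		go func() {
			defer wg.Done()
			d.open()
		}()
	}
	wg.Wait()
	fmt.Println("loads after 4 concurrent opens:", loads)
}
```

Holding the lock during the load keeps the sketch simple, at the cost of serializing callers that access the same directory for the first time.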

Was the change discussed in an issue or in the forum before?

closes #1680

Thanks @greatroar for opening #2585, which made me think about whether the bottleneck really is only reading tree blobs.

Checklist

I have read the Contribution Guidelines
I have not added tests for all changes in this PR
I have not added documentation for the changes (in the manual)
There's a new file in changelog/unreleased/ that describes the changes for our users (template here)
I have run gofmt on the code in all commits
All commit messages are formatted in the same style as the other commits in the repo
I'm done, this Pull Request is ready for review

internal/fuse/file.go

MichaelEischer · 2020-02-23T10:50:34Z

It might be an option to implement NodeOpener, which is called when a file is opened. That way the chunk sizes would be collected immediately when a file is opened instead of being delayed until the first bytes are read.

The main difference is the point in time at which the user is notified if some blobs are missing. The current implementation in restic probably fails with an error before providing any details on the file node; with the change this is delayed until the first bytes are read. I think the new semantics are actually what a user would expect: reading a damaged file fails, but usually you are still able to see the metadata.

aawsome · 2020-02-23T19:43:26Z

@MichaelEischer Thanks for searching the manual and suggesting the NodeOpener option, which I'm using now.
I tried to run performance tests of this new version but did not really manage to see any differences in the comparison. In fact, there seems to be a lot of variation in performance when using FUSE (maybe some kernel caching?)...
Is there anyone who can help out with testing this change?

Memory footprint is however clearly reduced.

ProactiveServices · 2020-02-23T20:45:34Z

@aawsome Happy to help, do you have any specific tests you'd like me to perform? I have several hundred-gig repos I can test with.

aawsome · 2020-02-24T21:11:05Z

@ProactiveServices Thank you very much! I'm especially interested in response times and CPU usage, and in a comparison between "plain" restic and a version including this PR.
The code changes should affect listing and reading files, e.g. `ls file` (also `ls dir`) or `cat file` / `cp file somewhere`.

I used `/usr/bin/time restic mount` to get CPU timings and a simple `time` in front of the commands, e.g. `time ls file`, to get response times.

aawsome · 2020-02-24T21:45:18Z

With the last commit, dir operations should also be faster (and consume a bit less memory)...

internal/fuse/file.go

aawsome · 2020-06-14T11:51:41Z

Rebased to master and added the changes suggested by @greatroar.

aawsome · 2020-07-12T19:16:37Z

Rebased this PR after #2790 was merged.

aawsome · 2020-07-12T19:23:29Z

Noticed that the new blob cache by @greatroar already implements locking; hence I removed the locks from file.go and just left a comment there.
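
The reasoning can be sketched as follows, assuming a much simpler cache than the real one: when the cache serializes access to its own map, code that merely looks up and stores blobs needs no additional lock. The names `blobCache` and `readBlob` are hypothetical, and the sketch is unbounded, so it is not a model of the actual blob cache.

```go
package main

import (
	"fmt"
	"sync"
)

// blobCache is a hypothetical, unbounded cache that does its own locking.
type blobCache struct {
	mu    sync.Mutex
	blobs map[string][]byte
}

func newBlobCache() *blobCache {
	return &blobCache{blobs: make(map[string][]byte)}
}

// get returns the cached blob, if any.
func (c *blobCache) get(id string) ([]byte, bool) {
	c.mu.Lock()
	defer c.mu.Unlock()
	b, ok := c.blobs[id]
	return b, ok
}

// add stores a blob under its ID.
func (c *blobCache) add(id string, b []byte) {
	c.mu.Lock()
	defer c.mu.Unlock()
	c.blobs[id] = b
}

// readBlob is what a file handle might call; it takes no lock of its own.
// Two concurrent callers may both load the same blob, which is redundant
// work but not a data race.
func readBlob(c *blobCache, id string, load func(string) []byte) []byte {
	if b, ok := c.get(id); ok {
		return b
	}
	b := load(id)
	c.add(id, b)
	return b
}

func main() {
	c := newBlobCache()
	load := func(id string) []byte { return []byte("data of " + id) }

	var wg sync.WaitGroup
	for i := 0; i < 8; i++ {
		wg.Add(1)
		go func(i int) {
			defer wg.Done()
			readBlob(c, fmt.Sprintf("blob-%d", i%2), load)
		}(i)
	}
	wg.Wait()

	b, ok := c.get("blob-0")
	fmt.Println(string(b), ok)
}
```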

aawsome · 2020-07-25T18:57:21Z

Rebased after #2787 was merged.

changelog/unreleased/issue-1680

internal/fuse/dir.go

…emory
- Add Open() functionality to dir
- only access index for blobs when file is read
- Implement NodeOpener and put one-time file stuff there
- Add comment about locking as suggested by bazil.org/fuse

=> Thanks to Michael Eischer for suggesting the last two improvements

MichaelEischer

LGTM. I've force-pushed the branch to retrigger the CI. Seems like the flaky rclone test error has changed into a flaky rclone test crash, after #2855.

aawsome mentioned this pull request Feb 18, 2020

Smaller memory footprint for restic mount #2585

Closed

7 tasks

greatroar reviewed Feb 18, 2020

aawsome mentioned this pull request Feb 25, 2020

Horrible fuse mount performance on large repo with many small files #1680

Closed

greatroar reviewed Feb 25, 2020

greatroar reviewed Feb 25, 2020

aawsome mentioned this pull request Jun 14, 2020

Reduce index memory #2781

Merged

6 tasks

aawsome force-pushed the optimize-fuse branch from 8045917 to efe13b4 Compare June 14, 2020 11:50

This was referenced Jun 14, 2020

Remove blob size cache from restic mount #2787

Merged

Fix quadratic file reading in restic mount #2790

Merged

aawsome force-pushed the optimize-fuse branch from efe13b4 to a6e3fe3 Compare July 12, 2020 19:15

aawsome force-pushed the optimize-fuse branch from a6e3fe3 to 3754d19 Compare July 12, 2020 19:22

aawsome force-pushed the optimize-fuse branch from 3754d19 to a65e48b Compare July 25, 2020 18:56

MichaelEischer reviewed Jul 25, 2020

aawsome force-pushed the optimize-fuse branch from a65e48b to 938d0ad Compare July 27, 2020 17:08

MichaelEischer force-pushed the optimize-fuse branch from 938d0ad to f831694 Compare July 28, 2020 21:05

MichaelEischer approved these changes Jul 28, 2020

MichaelEischer merged commit 248c7c3 into restic:master Jul 28, 2020

aawsome deleted the optimize-fuse branch December 7, 2020 08:34

4 participants
