Conversation
cc @AkihiroSuda @coryb @ktock please take a look if you have time and let me know if you have any concerns with the new features proposed here.
coryb
left a comment
Looks great! I am not too familiar with the caching internals or with import/export but what you have written makes sense to me.
> * Additionally, to support (de)serialization of this data structure, we will simply use the tar format, but instead of including actual file data the tar archive will consist of zero-sized files and thus contain only tar headers, which hold all the metadata we need for each file.
> * When storing serialized data in the cache metadata, we will also include a format specifier, enabling us to support other formats in the future if ever needed.
> * It’s worth noting that there are degenerate cases where storing this metadata may noticeably increase disk+memory usage, namely where very large numbers of files are created in a layer.
>   * For example, if there are 128 bytes of file/dir metadata per entry, 100k files+dirs would be about 12 MB of metadata. As a reference, the base ubuntu image has just under 3k files, so it’s unlikely this should become meaningful for all but the most extreme cases.
I would expect that with compression we could reduce this size by >90%; the metadata should be highly compressible.
That makes sense to me, once we get to this stage of implementation I'll experiment and see what ratios can be achieved
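To get a rough feel for the numbers discussed above, here is a minimal sketch using only the Go standard library (`serializeMetadata` is a hypothetical helper for illustration, not BuildKit code): it serializes 1,000 paths as header-only tar entries, then gzips the result to see what ratio plain compression achieves on this kind of repetitive metadata.

```go
package main

import (
	"archive/tar"
	"bytes"
	"compress/gzip"
	"fmt"
)

// serializeMetadata writes one zero-sized tar entry per path, so the
// archive consists only of tar headers carrying file metadata.
func serializeMetadata(paths []string) ([]byte, error) {
	var buf bytes.Buffer
	tw := tar.NewWriter(&buf)
	for _, p := range paths {
		hdr := &tar.Header{
			Name: p,
			Mode: 0o644,
			Size: 0, // header only, no file contents
		}
		if err := tw.WriteHeader(hdr); err != nil {
			return nil, err
		}
	}
	if err := tw.Close(); err != nil {
		return nil, err
	}
	return buf.Bytes(), nil
}

func main() {
	paths := make([]string, 0, 1000)
	for i := 0; i < 1000; i++ {
		paths = append(paths, fmt.Sprintf("/dir/file-%04d", i))
	}
	raw, err := serializeMetadata(paths)
	if err != nil {
		panic(err)
	}
	var zbuf bytes.Buffer
	zw := gzip.NewWriter(&zbuf)
	if _, err := zw.Write(raw); err != nil {
		panic(err)
	}
	if err := zw.Close(); err != nil {
		panic(err)
	}
	fmt.Printf("raw: %d bytes, gzipped: %d bytes\n", len(raw), zbuf.Len())
}
```

Each tar header is a 512-byte block and the entries differ only in their names, so gzip should compress this archive dramatically; the exact ratio on real layer metadata would of course need measurement.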
Marking this as a draft as @tonistiigi doesn't want us to merge the docs for now; the docs are ready to review though, and we will still use this PR as a way of collecting comments.
Maybe we should put them in a proposals directory, or just have a header to clarify that the implementation of the proposal is not merged yet.
ktock
left a comment
Thank you for the great write-up!
> ## **Requirements**
It'd be great if we had a description of the high-level use cases/benefits of MergeOp (e.g. which Dockerfile instructions are expected to be optimized by this?).
Good call, I had only discussed it with people familiar with the use cases before this and forgot to include them here :-)
Added a Use Cases section.
```go
stateC := Scratch().
	File(Mkfile("/foo", 0777, []byte("C"))).
	File(Mkfile("/c", 0777, []byte("C")))
```
If here we do File(Rm("/a")), will Merge(stateB, stateC) contain /a?
IIUC stateC doesn't contain /a, so the whiteout won't be recorded by overlayfs and Merge(stateB, stateC) will still contain /a. Is this the expected result?
Yeah, good question, but that's the expected behavior. In general I think it will be very rare for a user to want to create a layer with a deletion of a non-existent file, but in those obscure cases they can create a base layer containing the file (i.e. Scratch().File(Mkfile("/a"))) and then do the deletion on top of that.
I don't love that, but it's a fairly simple workaround and can be made seamless by higher-level tools consuming LLB if needed.
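To make the whiteout behavior concrete, here's a toy Go model of the semantics being discussed (`layer`, `mkfile`, `rm`, and `merge` are hypothetical stand-ins for illustration, not LLB or overlayfs APIs): an `rm` of a path absent from the layer's own chain records no whiteout, so the merge still contains the path, while the workaround of creating the file first makes the deletion stick.

```go
package main

import "fmt"

// layer models a snapshot layer as its files plus recorded whiteouts.
type layer struct {
	files     map[string]string
	whiteouts map[string]bool
}

func newLayer() *layer {
	return &layer{files: map[string]string{}, whiteouts: map[string]bool{}}
}

func (l *layer) mkfile(path, data string) *layer {
	l.files[path] = data
	return l
}

// rm records a whiteout only if the path exists in this layer's own
// chain, mirroring how overlayfs only creates a whiteout when there is
// something underneath to delete.
func (l *layer) rm(path string) *layer {
	if _, ok := l.files[path]; ok {
		delete(l.files, path)
		l.whiteouts[path] = true
	}
	return l
}

// merge applies layers left to right: whiteouts delete, files overwrite.
func merge(layers ...*layer) map[string]string {
	out := map[string]string{}
	for _, l := range layers {
		for p := range l.whiteouts {
			delete(out, p)
		}
		for p, d := range l.files {
			out[p] = d
		}
	}
	return out
}

func main() {
	stateB := newLayer().mkfile("/a", "A").mkfile("/b", "B")
	stateC := newLayer().mkfile("/foo", "C").rm("/a") // /a not in C's chain: no whiteout
	_, hasA := merge(stateB, stateC)["/a"]
	fmt.Println("merged contains /a:", hasA) // true, as discussed above

	// Workaround: create /a in C's own chain first, then delete it.
	stateC2 := newLayer().mkfile("/a", "").rm("/a")
	_, hasA2 := merge(stateB, stateC2)["/a"]
	fmt.Println("with workaround, merged contains /a:", hasA2) // false
}
```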
> The only significant interface change will be extending cache.Accessor with this method:
> * `Merge(ctx context.Context, parents []ImmutableRef, opts ...RefOption) (ImmutableRef, error)`
>
> Merge, as you’d expect, creates an ImmutableRef that represents the merge of the given parents and returns it to the caller. That ImmutableRef can then be used just like any other ImmutableRef can be today.
If we want to use the "merged" ImmutableRef just like any other ImmutableRef (e.g. using it as a parent of other snapshots), the merged result needs to be recorded in the snapshotter as a snapshot, right? IIUC containerd's overlayfs snapshotter doesn't support importing a directory as a snapshot, so I guess we'd need to fork our own snapshotter implementation.
Surprisingly, we shouldn't need to fork snapshotters; I believe we can implement the whole design on top of any overlay-based snapshotter, including containerd's overlay and even stargz.
For the case you brought up, using a merged ref as the parent of a new snapshot, all of the special handling will take place in Buildkit's cache package. So, if you want to merge together A and B and then create a new snapshot C whose parent is that merged ref, the cache package will create a brand new empty snapshot for C and then merge its mounts on top of the merged mounts of A and B. Essentially, while the end user of LLB thinks of C as a snapshot on top of Merge(A, B), in the underlying snapshotter there are actually 3 separate snapshot chains for A, B and C.
I detailed exactly how this will work in the Efficient Snapshot Merge section of the 1_mergeop.md doc (plus the Efficient Snapshot Merge Details section in the appendix).
There are some significant corner cases surrounding all this described in the DiffOp and Import/Export docs, but they all have fairly straightforward workarounds thankfully. Let me know if you can think of any I missed though!
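The mount-level approach described above could be sketched roughly like this (a toy model with a hypothetical `mergeMounts` helper, not BuildKit's actual cache package): each ref keeps its own lowerdir chain in the underlying snapshotter, and the merged ref's overlay mount simply concatenates the chains, with later parents taking precedence (overlayfs lists lowerdirs highest-precedence first).

```go
package main

import (
	"fmt"
	"strings"
)

// ref models a snapshot chain: its overlay lowerdirs, lowest first.
type ref struct {
	name  string
	chain []string
}

// mergeMounts builds an overlay lowerdir option merging the parents'
// chains. Later parents win, so their dirs come first in the list.
func mergeMounts(parents ...ref) string {
	var dirs []string
	for i := len(parents) - 1; i >= 0; i-- {
		for j := len(parents[i].chain) - 1; j >= 0; j-- {
			dirs = append(dirs, parents[i].chain[j])
		}
	}
	return "lowerdir=" + strings.Join(dirs, ":")
}

func main() {
	a := ref{"A", []string{"/snapshots/a1", "/snapshots/a2"}}
	b := ref{"B", []string{"/snapshots/b1"}}
	// C is a brand new, initially empty snapshot layered on Merge(A, B);
	// in the snapshotter, A, B, and C remain three separate chains.
	c := ref{"C", []string{"/snapshots/c1"}}
	fmt.Println(mergeMounts(a, b, c))
	// lowerdir=/snapshots/c1:/snapshots/b1:/snapshots/a2:/snapshots/a1
}
```

This is why no snapshotter fork is needed in this design: the merge exists only in the mount options BuildKit assembles, never as an imported snapshot.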
> * Cons
>   * The format is not perfectly optimized for our use case of conflict detection (same as Tar)
>   * Not clear if the continuity package is maintained much these days besides the util functions under fs used by containerd’s differ (which the proto is separate from). We may become partially or fully responsible for maintenance of the proto and thus have to deal with any other consumers of it that exist out in the world.
> * Custom Proto or JSON
The TOC used by stargz/eStargz (doc) might be another good candidate for JSON-formatted filesystem metadata :)
(zstd:chunked by Podman also uses a similar format.)
Ah interesting, thanks for pointing me to that! I never thought to look, but it makes sense that stargz had to solve a similar problem too. I think for now I still want to go with the proposal in the doc of using tar, because it simplifies some of the details around import/export of this metadata and doesn't tie us to any changes you may need to make to the stargz format in the future.
However, I added a section mentioning it in case we ever want to reconsider this decision in the future and are looking back for other options.
docs/mergeop/2_conflict_detection.md
> ## Conflict Detection Alternatives
>
> There are a few existing userspace emulations of overlayfs that were considered for implementing our layer conflict detection features:
Why do we need a userspace filesystem for conflict calculation? Isn't it enough to just compare filesystem metadata between layers? (maybe I'm missing something)
EDIT: sorry, never mind. This section says they are alternatives, so the primary method of conflict detection is to compare filesystem metadata, right?
No worries, yeah these are the alternatives, I just wanted to mention them to give context on why we're proposing doing our own custom code instead of using a library.
Added an extra sentence clarifying this.
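As a deliberately simplified illustration of that primary approach (a hypothetical `conflicts` helper, not the proposed implementation), conflict detection by metadata comparison boils down to flagging paths touched by more than one of the layers being merged; real detection would also have to account for whiteouts, opaque directories, and attribute changes.

```go
package main

import "fmt"

// conflicts returns the paths modified by more than one layer, where
// each layer is modeled as the set of paths it touches.
func conflicts(layers ...map[string]bool) []string {
	seen := map[string]int{}
	for _, l := range layers {
		for p := range l {
			seen[p]++
		}
	}
	var out []string
	for p, n := range seen {
		if n > 1 {
			out = append(out, p)
		}
	}
	return out
}

func main() {
	layerA := map[string]bool{"/etc/conf": true, "/usr/bin/a": true}
	layerB := map[string]bool{"/etc/conf": true, "/usr/bin/b": true}
	fmt.Println(conflicts(layerA, layerB)) // [/etc/conf]
}
```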
Signed-off-by: Erik Sipsma <erik@sipsma.dev>
Good idea, moved it to proposals.
Closing this in favor of a better style of docs, as mentioned+tracked here.
These docs outline the proposed approach to implementing MergeOp and the interrelated DiffOp. They are split up into 4 docs to keep things more manageable and will probably make the most sense to read in the order specified by the file name's prefix. Feel free to just review one at a time as it's obviously a lot to read all at once.