Frozen Prototype File Chunk Writing/Reading Infrastructure by original-brownbear · Pull Request #68270 · elastic/elasticsearch

original-brownbear · 2021-02-01T11:08:25Z

This adds all the necessary infrastructure to use the reusable, single-file cache in practice:

Create the cache file in a data directory instead of a temp directory
Fully pre-allocate it (the existing solution would still do a sparse allocation, at least on Linux)
Manage the file channel resource by ref counting
Add a minimal abstraction in place of exposing FileChannel, so the concrete paging approach can be adjusted under the hood in a follow-up

I'll open a follow-up for the more complicated logical changes that introduce two page sizes based on this (I'm almost done with those, but I hope/think this gives us what we need for initial experiments in the meantime and is easier to review in isolation).
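
To illustrate the pre-allocation point above, here is a minimal sketch, not the code from this PR. It assumes that writing zeros over the whole range is enough to force the file system to back every block, which holds for common Linux file systems but may not hold on file systems that compress or deduplicate. The class and method names (`PreallocateSketch`, `preallocate`) are hypothetical, and the example writes to a temp file only to stay self-contained.

```java
import java.io.IOException;
import java.nio.ByteBuffer;
import java.nio.channels.FileChannel;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;

// Hypothetical helper for illustration; not the implementation from this PR.
final class PreallocateSketch {

    /** Writes zeros over [0, fileSize) so no part of the file is left as a sparse hole. */
    static void preallocate(Path path, long fileSize) throws IOException {
        try (FileChannel channel = FileChannel.open(path, StandardOpenOption.CREATE, StandardOpenOption.WRITE)) {
            final ByteBuffer zeros = ByteBuffer.allocate(64 * 1024);
            long position = 0;
            while (position < fileSize) {
                // Reset the buffer and cap it so the last chunk does not overshoot fileSize.
                zeros.clear();
                zeros.limit((int) Math.min(zeros.capacity(), fileSize - position));
                while (zeros.hasRemaining()) {
                    position += channel.write(zeros, position);
                }
            }
            // Flush data and metadata so an allocation failure surfaces here rather than later.
            channel.force(true);
        }
    }

    public static void main(String[] args) throws IOException {
        Path file = Files.createTempFile("shared-cache-sketch", ".bin");
        try {
            preallocate(file, 1_000_000L);
            System.out.println("allocated bytes: " + Files.size(file));
        } finally {
            Files.deleteIfExists(file);
        }
    }
}
```

By contrast, writing a single byte at offset `fileSize - 1` only sets the file length and can leave everything before it unallocated.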

elasticmachine · 2021-02-01T11:08:28Z

Pinging @elastic/es-distributed (Team:Distributed)

original-brownbear · 2021-02-01T11:14:14Z

+    @Override
+    protected void closeInternal() {
+        try {
+            IOUtils.close(fileChannel, path == null ? null : () -> Files.deleteIfExists(path));


I wasn't super sure here. As long as we don't persist the contents of this file in a way that allows for reusing it, I figured there was no point in keeping it around across restarts (except for making start-up faster). It also felt safer to delete here: otherwise, configuring a large file and then failing to allocate it (because of concurrent disk space usage changes or so) would leave the disk full even though the node fails to start up.

Yeah, we don't need to persist this file across restarts. We should also make sure it's cleaned up when fileSize == 0 after restart

++ I pushed 046f998
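
The clean-up rule agreed on in this thread (remove a leftover cache file when the cache is configured with size 0) could look roughly like the sketch below. This is an illustration under assumptions, not the change in 046f998: the file name `shared_cache_sketch` and the `deleteStaleCacheFile` helper are hypothetical.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical helper for illustration; names are not taken from the PR.
final class CacheFileCleanupSketch {

    // Assumed file name, chosen only for this example.
    static final String CACHE_FILE_NAME = "shared_cache_sketch";

    /**
     * Deletes a cache file left over from an earlier run when the cache is disabled.
     * Returns true only if a stale file existed and was removed.
     */
    static boolean deleteStaleCacheFile(Path dataDir, long configuredSize) throws IOException {
        if (configuredSize > 0) {
            // The cache is enabled, so the file (if any) is still in use.
            return false;
        }
        return Files.deleteIfExists(dataDir.resolve(CACHE_FILE_NAME));
    }
}
```

Running this once per data path at start-up would also cover the case where an earlier start-up failed part-way through allocating the file.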

original-brownbear · 2021-02-01T11:16:32Z

            final CacheKey cacheKey = createCacheKey(file.physicalName());
            cacheService.removeFromCache(cacheKey);
-            frozenCacheService.removeFromCache(cacheKey);
+            if (partial) {


I need to add this to make some tests pass with today's changes.

original-brownbear · 2021-02-01T11:48:42Z

Jenkins run elasticsearch-ci/bwc (known JDBC issue)

ywelsch · 2021-02-01T11:45:24Z

-            // write one byte at the end of the file to make sure all bytes are allocated
-            fileChannel.write(ByteBuffer.allocate(1), fileSize - 1);
+            for (Path path : environment.dataFiles()) {
+                // TODO: be resilient to this check failing and try next path?


Ideally we would split this up in some form across the available data paths?

I guess so (on the splitting up), but I wonder if we should even continue to start up if a given data path is broken?
I certainly see the potential performance benefit of using multiple disks, and it's probably nice to be a little more even in the disk use as well.
But I think we should do this in the follow-up, if we want it, once we're using multiple file channels, so as not to blow this PR up too much.

ywelsch · 2021-02-01T11:48:15Z

+            if (io == null || io.tryIncRef() == false) {
+                final IO newIO;
+                boolean success = false;
+                incRef();


tryIncRef already incremented by 1?

This is for SharedBytes, not IO. We have a ref count on the SharedBytes here that is incremented for each IO we create and one for the IO itself. The IO will decrement the SharedBytes ref count once it's fully released.

ok, we should have tests for these eventually as well.
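
The two-level ref counting described in this thread can be sketched as follows. This is a simplified stand-in, not the PR's classes: `RefCounted`, `SharedBytesSketch` and `newIO` are hypothetical names, and the exact accounting in the PR may differ. The idea shown is that each IO holds one reference on its parent and gives it back only when the IO's own count drops to zero.

```java
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical minimal ref-counting base class; starts with one reference held by the creator.
abstract class RefCounted {
    private final AtomicInteger refCount = new AtomicInteger(1);

    /** Tries to take a reference; fails once the count has already reached zero. */
    final boolean tryIncRef() {
        while (true) {
            int current = refCount.get();
            if (current <= 0) {
                return false;
            }
            if (refCount.compareAndSet(current, current + 1)) {
                return true;
            }
        }
    }

    final void incRef() {
        if (tryIncRef() == false) {
            throw new IllegalStateException("already closed");
        }
    }

    /** Releases a reference and runs closeInternal() when the last one is gone. */
    final void decRef() {
        int remaining = refCount.decrementAndGet();
        if (remaining == 0) {
            closeInternal();
        } else if (remaining < 0) {
            throw new IllegalStateException("released more often than acquired");
        }
    }

    final int refCount() {
        return refCount.get();
    }

    protected abstract void closeInternal();
}

// Stand-in for the shared file: stays open while it or any of its IOs is referenced.
final class SharedBytesSketch extends RefCounted {
    volatile boolean closed = false;

    final class IO extends RefCounted {
        @Override
        protected void closeInternal() {
            // The IO is fully released, so give back the reference it held on the parent.
            SharedBytesSketch.this.decRef();
        }
    }

    IO newIO() {
        incRef(); // the new IO pins the parent until it is fully released
        return new IO();
    }

    @Override
    protected void closeInternal() {
        closed = true; // a real implementation would close the FileChannel here
    }
}
```

A test along these lines would create an IO, release the parent's own reference, and check that the parent only closes after the IO is released too.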

ywelsch · 2021-02-01T11:54:15Z

+    @Override
+    protected void closeInternal() {
+        try {
+            IOUtils.close(fileChannel, path == null ? null : () -> Files.deleteIfExists(path));


Yeah, we don't need to persist this file across restarts. We should also make sure it's cleaned up when fileSize == 0 after restart

ywelsch

LGTM

original-brownbear · 2021-02-01T12:38:22Z

Jenkins run elasticsearch-ci/1 (unrelated but important https://gradle-enterprise.elastic.co/s/whnbotirrvizq) I'll look into it

tlrx

LGTM

original-brownbear · 2021-02-01T13:11:22Z

Thanks Yannick + Tanguy!

original-brownbear added 7 commits January 29, 2021 16:29

step 1

601bbd5

step 2

41922ca

Merge remote-tracking branch 'elastic/frozen-proto' into frozen-proto-armin

9685b90

fix compile

535aa3a

woerks

5f96d3e

woerks better

ae295ab

Merge remote-tracking branch 'elastic/frozen-proto' into frozen-proto-armin

a99883b

original-brownbear added the :Distributed/Snapshot/Restore label Feb 1, 2021

elasticmachine added the Team:Distributed label Feb 1, 2021

original-brownbear requested review from tlrx and ywelsch February 1, 2021 11:38

ywelsch reviewed Feb 1, 2021

CR: delete old cache if not configured

046f998

original-brownbear requested a review from ywelsch February 1, 2021 12:19

ywelsch approved these changes Feb 1, 2021

tlrx approved these changes Feb 1, 2021

original-brownbear merged commit fdfbe5d into elastic:frozen-proto Feb 1, 2021

original-brownbear deleted the frozen-proto-just-infra branch February 1, 2021 13:11

original-brownbear restored the frozen-proto-just-infra branch April 18, 2023 21:05

4 participants