Make static Store access shard lock aware by bleskes · Pull Request #19416 · elastic/elasticsearch

bleskes · 2016-07-13T12:33:12Z

We currently have concurrency issue between the static methods on the Store class and store changes that are done via a valid open store. An example of this is the async shard fetch which can reach out to a node while a local shard copy is shutting down (the fetch does check if we have an open shard and tries to use that first, but if the shard is shutting down, it will not be available from IndexService).

Specifically, async shard fetching tries to read metadata from store, concurrently the shard that shuts down commits to lucene, changing the segments_N file. this causes a file not find exception on the shard fetching side. That one in turns makes the master think the shard is unusable. In tests this can cause the shard assignment to be delayed (up to 1m) which fails tests. See https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+master+java9-periodic/570 for details.

This is one of the things #18938 caused to bubble up.

We currently have concurrency issue between the static methods on the Store class and store changes that are done via a valid open store. An example of this is the async shard fetch which can reach out to a node while a local shard copy is shutting down (the fetch does check if we have an open shard and tries to use that first, but if the shard is shutting down, it will not be available from IndexService). Specifically, async shard fetching tries to read metadata from store, concurrently the shard that shuts down commits to lucene, changing the segments_N file. this causes a file not find exception on the shard fetching side. That one in turns makes the master think the shard is unusable. In tests this can cause the shard assignment to be delayed (up to 1m) which fails tests. See https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+master+java9-periodic/570 for details.

…en we have an open shard

nik9000 · 2016-07-13T16:56:36Z

core/src/main/java/org/elasticsearch/indices/store/TransportNodesListShardStoreMetaData.java

            }
-            return new StoreFilesMetaData(shardId, Store.readMetadataSnapshot(shardPath.resolveIndex(), shardId, logger));
+            // note that this may fail if it can't get access to the shard lock. Since we check above there is an active shard, this means:
+            // 1) a shard is being constructed, which means the master will not use a copy of this replcia


Did you mean to add a 2) here?

hehe, yeah - ADD for the works :)
2) A shard is shutting down and has not cleared it's content within lock timeout. In this case the master may not reuse local resources.

pending the merge of #19416

s1monw · 2016-07-18T08:26:30Z

core/src/main/java/org/elasticsearch/index/store/Store.java

+     * A shard lock supplier that is used by the static methods on this class. Normal methods rely on
+     * the shard lock passed to the constructor.
+     */
+    @FunctionalInterface


can we move this into NodeEnv instead?

s1monw · 2016-07-18T08:27:01Z

left on nit - LGTM otherwise

ywelsch · 2016-07-18T08:30:14Z

LGTM

…ss_lock

In several places in our code we need to get a consistent list of files + metadata of the current index. We currently have a couple of ways to do in the `Store` class, which also does the right things and tries to verify the integrity of the smaller files. Sadly, those methods can run into trouble if anyone writes into the folder while they are busy. Most notably, the index shard's engine decides to commit half way and remove a `segment_N` file before the store got to checksum (but did already list it). This race condition typically doesn't happen as almost all of the places where we list files also happen to be places where the relevant shard doesn't yet have an engine. There is however an exception (of course :)) which is the API to list shard stores, used by the master when it is looking for shard copies to assign to. I already took one shot at fixing this in #19416 , but it turns out not to be enough - see for example https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+master+multijob-os-compatibility/os=sles/822. The first inclination to fix this was to add more locking to the different Store methods and acquire the `IndexWriter` lock, thus preventing any engine for accessing if if the a shard is offline and use the current index commit snapshotting logic already existing in `IndexShard` for when the engine is started. That turned out to be a bad idea as we create more subtleties where, for example, a store listing can prevent a shard from starting up (the writer lock doesn't wait if it can't get access, but fails immediately, which is good). Another example is running on a shared directory where some other engine may actually hold the lock. Instead I decided to take another approach: 1) Remove all the various methods on store and keep one, which accepts an index commit (which can be null) and also clearly communicates that the *caller* is responsible for concurrent access. This also tightens up the API which is a plus. 2) Add a `snapshotStore` method to IndexShard that takes care of all the concurrency aspects with the engine, which is now possible because it's all in the same place. It's still a bit ugly but at least it's all in one place and we can evaluate how to improve on this later on. I also renamed the `snapshotIndex` method to `acquireIndexCommit` to avoid confusion and I think it communicates better what it does.

…ked during shard state fetching (#21656) PR #19416 added a safety mechanism to shard state fetching to only access the store when the shard lock can be acquired. This can lead to the following situation however where a shard has not fully shut down yet while the shard fetching is going on, resulting in a ShardLockObtainFailedException. PrimaryShardAllocator that decides where to allocate primary shards sees this exception and treats the shard as unusable. If this is the only shard copy in the cluster, the cluster stays red and a new shard fetching cycle will not be triggered as shard state fetching treats exceptions while opening the store as permanent failures. This commit makes it so that PrimaryShardAllocator treats the locked shard as a possible allocation target (although with the least priority).

bleskes added 3 commits July 13, 2016 14:05

TransportNodesListGatewayStartedShards shouldn't try to get a lock wh…

37e0668

…en we have an open shard

go space

d68991a

bleskes added >bug :Distributed/Store Issues around managing unopened Lucene indices. If it touches Store.java, this is a likely label. v5.0.0-alpha5 labels Jul 13, 2016

nik9000 reviewed Jul 13, 2016
View reviewed changes

bleskes added a commit that referenced this pull request Jul 18, 2016

mute testAckedIndexing

798ee17

pending the merge of #19416

s1monw reviewed Jul 18, 2016
View reviewed changes

bleskes added 4 commits July 18, 2016 11:09

Merge remote-tracking branch 'upstream/master' into store_static_acce…

896455a

…ss_lock

missing comment

b174474

move ShardLocker to NodeEnvironment

d9bedf3

remove await fix

e7d2d7c

bleskes merged commit 9ededa4 into elastic:master Jul 18, 2016

bleskes deleted the store_static_access_lock branch July 18, 2016 09:23

bleskes mentioned this pull request Jul 29, 2016

Tighten up concurrent store metadata listing and engine writes #19684

Merged

ywelsch mentioned this pull request Nov 18, 2016

Allow master to assign primary shard to node that has shard store locked during shard state fetching #21656

Merged

clintongormley added :Distributed/Engine Anything around managing Lucene and the Translog in an open shard. and removed :Distributed/Store Issues around managing unopened Lucene indices. If it touches Store.java, this is a likely label. labels Feb 13, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make static Store access shard lock aware#19416

Make static Store access shard lock aware#19416
bleskes merged 7 commits intoelastic:masterfrom
bleskes:store_static_access_lock

bleskes commented Jul 13, 2016 •

edited

Loading

Uh oh!

nik9000 Jul 13, 2016

Uh oh!

bleskes Jul 14, 2016 •

edited

Loading

Uh oh!

s1monw Jul 18, 2016

Uh oh!

bleskes Jul 18, 2016

Uh oh!

s1monw commented Jul 18, 2016

Uh oh!

ywelsch commented Jul 18, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

bleskes commented Jul 13, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

nik9000 Jul 13, 2016

Choose a reason for hiding this comment

Uh oh!

bleskes Jul 14, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

s1monw Jul 18, 2016

Choose a reason for hiding this comment

Uh oh!

bleskes Jul 18, 2016

Choose a reason for hiding this comment

Uh oh!

s1monw commented Jul 18, 2016

Uh oh!

ywelsch commented Jul 18, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

bleskes commented Jul 13, 2016 •

edited

Loading

bleskes Jul 14, 2016 •

edited

Loading