Move all Snapshot Master Node Steps to SnapshotsService #56365
original-brownbear merged 4 commits into elastic:master from original-brownbear:cleaner-snapshot-shards-service
Conversation
This refactoring has two motivations:
1. Separate all master node steps during snapshot operations from all data node steps in code.
2. Set up next steps in concurrent repository operations and general improvements by centralizing tracking of each shard's state in the repository in `SnapshotsService` so that operations for each shard can be linearized efficiently.
Pinging @elastic/es-distributed (:Distributed/Snapshot/Restore)
@Override
public void clusterStateProcessed(String source, ClusterState oldState, ClusterState newState) {
    if (changed) {
This is somewhat lazy; a better solution would obviously be to track the snapshots that were completed in the update by adding them to a list or similar, to be processed here. I didn't want to do that in this PR since it introduces quite a bit of complexity. But adding the `changed` flag here and in the other tasks is enough to illustrate the motivation for this change, and is already a huge win in terms of not having to iterate all the shards on every CS application.
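To make the idea concrete, here is a hypothetical, heavily simplified sketch of the `changed` flag pattern described above. These are illustrative stand-in types, not the real Elasticsearch classes: the update task records whether executing it actually modified any snapshot entry, and `clusterStateProcessed` only pays for the completion scan when something changed.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch, not the real Elasticsearch classes: a cluster state
// update task that sets a "changed" flag during execution so that the
// post-apply completion scan can be skipped when nothing was modified.
class ChangedFlagTaskSketch {
    static final class Entry {
        final String snapshot;
        final boolean completed;
        Entry(String snapshot, boolean completed) {
            this.snapshot = snapshot;
            this.completed = completed;
        }
    }

    private boolean changed; // set during execute() if any entry was updated

    // Simulates the cluster state update: mark the given snapshot's entry
    // completed and remember that the state actually changed.
    List<Entry> execute(List<Entry> current, String completedSnapshot) {
        List<Entry> updated = new ArrayList<>();
        for (Entry e : current) {
            if (e.snapshot.equals(completedSnapshot) && e.completed == false) {
                changed = true;
                updated.add(new Entry(e.snapshot, true));
            } else {
                updated.add(e);
            }
        }
        return updated;
    }

    // Only iterate the entries if execute() changed anything, instead of
    // scanning all shards on every cluster state application.
    List<String> clusterStateProcessed(List<Entry> newState) {
        List<String> toFinalize = new ArrayList<>();
        if (changed) {
            for (Entry e : newState) {
                if (e.completed) {
                    toFinalize.add(e.snapshot);
                }
            }
        }
        return toFinalize;
    }
}
```

The comment above suggests the next step would be for `execute` to collect completed snapshots into a list directly, making even the conditional scan unnecessary.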
/**
 * Internal request that is used to send changes in snapshot status to master
 */
public static class UpdateIndexShardSnapshotStatusRequest extends MasterNodeRequest<UpdateIndexShardSnapshotStatusRequest> {
This stuff was all moved to `SnapshotsService` exactly as is, without changes.
Now that it is used in different services, it could maybe be located in its own file.
try {
    listener.onResponse(new UpdateIndexShardSnapshotStatusResponse());
} finally {
    endCompletedSnapshots(newState);
This is the only change relative to what this code did and looked like in SnapshotShardsService.
// 1. Completed snapshots
// 2. Snapshots in state INIT that a previous master of an older version failed to start
// 3. Snapshots in any other state that have all their shard tasks completed
snapshotsInProgress.entries().stream().filter(
Running this check on every CS update wasn't great and added a lot of cycles on the CS thread for larger snapshots. It would be even worse once we actually start having multiple snapshots-in-progress entries. We really only need to do a full check if anything changed about the entries' shards, or on master fail-over.
This change moves all the updating of the shard entries into this class so we can run this check selectively (though see my other comment below, we could be even more selective in a follow-up :)).
logger.info("snapshot [{}] started", snapshot);
listener.onResponse(snapshot);
} finally {
    if (newEntry.state().completed() || newEntry.shards().isEmpty()) {
No need to actually run the full check over all entries in newState here; the only two cases in which the snapshot can be ended right away are when it has no shards or was set to FAILED immediately.
tlrx left a comment
LGTM, thanks for the helpful comments. I left some very minor suggestions; feel free to follow them or not.
// Set of snapshots that are currently being ended by this node
private final Set<Snapshot> endingSnapshots = Collections.synchronizedSet(new HashSet<>());

private final SnapshotsService.SnapshotStateExecutor snapshotStateExecutor = new SnapshotsService.SnapshotStateExecutor();
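As a side note on the `endingSnapshots` set above: a synchronized set naturally dedupes concurrent attempts to finalize the same snapshot, because `Set#add` returns false when the element is already present. Here is a hypothetical sketch of that pattern (illustrative only, with made-up method names, not the real `SnapshotsService` logic):

```java
import java.util.Collections;
import java.util.HashSet;
import java.util.Set;

// Hypothetical sketch: dedupe concurrent finalization attempts for the same
// snapshot via a synchronized set. Method names are invented for illustration.
class EndingSnapshotsSketch {
    private final Set<String> endingSnapshots = Collections.synchronizedSet(new HashSet<>());

    // Returns true only for the first caller; later callers see false and
    // skip finalization because it is already running on this node.
    boolean tryBeginEnding(String snapshot) {
        return endingSnapshots.add(snapshot);
    }

    // Called once finalization has finished (or failed), so that a later
    // attempt for the same snapshot can run again.
    void endingCompleted(String snapshot) {
        endingSnapshots.remove(snapshot);
    }
}
```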
nit: I don't think it needs to be fully qualified?
if (DiscoveryNode.isMasterNode(settings)) {
    // addLowPriorityApplier to make sure that Repository will be created before snapshot
    clusterService.addLowPriorityApplier(this);
    // The constructor of UpdateSnapshotStatusAction will register itself to the TransportService.
nit: maybe move this comment right before `this.updateSnapshotStatusHandler = ...` instead of in this block
private void endCompletedSnapshots(ClusterState state) {
    SnapshotsInProgress snapshotsInProgress = state.custom(SnapshotsInProgress.TYPE);
    assert snapshotsInProgress != null;
    // Cleanup all snapshots that have no more work left:
|
Thanks Tanguy! All nits applied :)
Follow up to #56365. Instead of redundantly checking snapshots for completion over and over, track the completed snapshots in the CS updates that complete them, rather than looping over the same snapshot entries repeatedly. Also, in the batched snapshot shard status updates, only check for completion of a snapshot entry if it isn't already finalizing.
This refactoring has three motivations:
1. Separate all master node steps during snapshot operations from all data node steps in code.
2. Set up next steps in concurrent repository operations and general improvements by centralizing tracking of each shard's state in the repository in `SnapshotsService` so that operations for each shard can be linearized efficiently (i.e. without having to inspect the full snapshot state for all shards on every cluster state update, allowing us to track more in memory and only fall back to inspecting the full CS on master failover like we do in the snapshot shards service).
   * This PR already contains some best-effort examples of this, but obviously this could still be improved upon considerably (I just did not want to do it in this PR for complexity reasons).
3. Make the `SnapshotsService` less expensive on the CS thread for large snapshots.