Don't assign persistent tasks to nodes shutting down by dakrone · Pull Request #72260 · elastic/elasticsearch

dakrone · 2021-04-26T17:10:30Z

This commit changes the PersistentTasksClusterService to limit the nodes considered for a task to a subset of
nodes (candidates) that are not currently shutting down.

It does not yet cancel tasks that may already be running on nodes that are shutting down; that will
be added in a subsequent pull request.

Relates to #70338
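
The candidate restriction described above can be illustrated with a minimal sketch. This is not the code from the PR: the class name, the `candidateNodeIds` method, and the use of plain string ids in place of `DiscoveryNode` objects and shutdown metadata are all assumptions made for illustration.

```java
import java.util.List;
import java.util.Set;
import java.util.stream.Collectors;

// Hypothetical illustration only; the real logic lives in PersistentTasksClusterService.
class ShutdownAwareCandidates {

    /**
     * Returns the ids of nodes that may receive a persistent task:
     * every node whose id is not in the set of nodes marked as shutting down.
     */
    static List<String> candidateNodeIds(List<String> allNodeIds, Set<String> shuttingDownNodeIds) {
        return allNodeIds.stream()
            .filter(id -> shuttingDownNodeIds.contains(id) == false)
            .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        // n2 is marked as shutting down, so only n1 remains a candidate.
        List<String> candidates = candidateNodeIds(List.of("n1", "n2"), Set.of("n2"));
        System.out.println(candidates); // prints [n1]
    }
}
```

An executor that only ever sees the filtered list cannot pick a node that is shutting down, which is the behaviour the description asks for. Tasks already running on such a node are untouched by this filter.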

elasticmachine · 2021-04-26T17:10:33Z

Pinging @elastic/es-core-infra (Team:Core/Infra)

davidkyle · 2021-04-27T13:05:14Z

Pinging @elastic/ml-core for some persistent tasks love

henningandersen

LGTM, left a few minor comments to consider.

server/src/main/java/org/elasticsearch/cluster/metadata/NodesShutdownMetadata.java

server/src/main/java/org/elasticsearch/persistent/PersistentTasksClusterService.java

.../plugin/ml/src/main/java/org/elasticsearch/xpack/ml/action/TransportStartDatafeedAction.java

henningandersen · 2021-04-27T19:57:46Z

...tdown/src/internalClusterTest/java/org/elasticsearch/xpack/shutdown/NodeShutdownTasksIT.java

+        assertBusy(() -> assertNotNull("expected to have candidate nodes chosen for task", candidates.get()));
+        // Check that the node that is not shut down is the only candidate
+        assertThat(candidates.get().stream().map(DiscoveryNode::getId).collect(Collectors.toSet()), contains(candidateNode));
+        assertThat(candidates.get().stream().map(DiscoveryNode::getId).collect(Collectors.toSet()), not(contains(shutdownNode)));


Maybe also verify that the candidateNode was the chosen one by putting the nodeId from the task into taskCompleted in nodeOperation below?

We don't actually have access to the nodeId from nodeOperation below, only to the parent task id (which is just "cluster" in this case).

AthenaEryma

LGTM as well!

server/src/main/java/org/elasticsearch/cluster/metadata/NodesShutdownMetadata.java

benwtrent

ML + transform stuff looks good! Excited for this API :D

benwtrent · 2021-04-28T11:15:07Z

...gin/ml/src/main/java/org/elasticsearch/xpack/ml/job/task/OpenJobPersistentTasksExecutor.java

        // If we already know that we can't find an ml node because all ml nodes are running at capacity or
        // simply because there are no ml nodes in the cluster then we fail quickly here:
-        PersistentTasksCustomMetadata.Assignment assignment = getAssignment(params, clusterState);
+        PersistentTasksCustomMetadata.Assignment assignment = getAssignment(params, clusterState.nodes().getAllNodes(), clusterState);


This is tricky: this validation can pass even though all the nodes the job could be assigned to are shutting down, and we don't catch that.

The ML team might remove that last validation (awaiting_lazy_assignment) as now it is unreliable.

I don't think the way it is now is any worse than it was before:

Possible new scenario: validation passes because get assignment thinks an ML node is available when it's actually shutting down, hence the job opens but cannot be assigned

Old scenario this replaces: validation passes because get assignment finds an ML node is available that is about to be shut down (but nobody apart from the operator knows), job gets assigned, then shortly afterwards the node shuts down and the job cannot be assigned

In both scenarios we end up with a job that is open but cannot be assigned because the cluster doesn't have room for it. And in both scenarios the solution is to add another ML node to the cluster.
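
The gap discussed in this thread can be sketched as follows. This is a simplified model under stated assumptions, not the ML plugin's code: `pickNode`, `validationPasses`, and `assign` are hypothetical names, nodes are plain string ids, and "pick the first node" stands in for the real capacity-based selection.

```java
import java.util.List;
import java.util.Optional;
import java.util.Set;
import java.util.stream.Collectors;

// Hypothetical model of the "validation passes, assignment fails" scenario.
class ValidationVersusAssignment {

    /** Placeholder selection rule: take the first available node, if any. */
    static Optional<String> pickNode(List<String> nodeIds) {
        return nodeIds.stream().findFirst();
    }

    /** Up-front validation looks at every node, including ones that are shutting down. */
    static boolean validationPasses(List<String> allNodeIds) {
        return pickNode(allNodeIds).isPresent();
    }

    /** Actual assignment only considers nodes that are not shutting down. */
    static Optional<String> assign(List<String> allNodeIds, Set<String> shuttingDownNodeIds) {
        List<String> candidates = allNodeIds.stream()
            .filter(id -> shuttingDownNodeIds.contains(id) == false)
            .collect(Collectors.toList());
        return pickNode(candidates);
    }

    public static void main(String[] args) {
        List<String> nodes = List.of("ml-1");
        Set<String> shuttingDown = Set.of("ml-1");
        // Validation sees ml-1 and passes, but assignment finds no candidate.
        System.out.println(validationPasses(nodes));
        System.out.println(assign(nodes, shuttingDown).isPresent());
    }
}
```

With the only ML node shutting down, the job opens but stays unassigned until another ML node joins, which matches the outcome described for both the old and the new scenario.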

…72426) * Don't assign persistent tasks to nodes shutting down (#72260) This commit changes the `PersistentTasksClusterService` to limit nodes for a task to a subset of nodes (candidates) that are not currently shutting down. It does not yet cancel tasks that may already be running on the nodes that are shut down, that will be added in a subsequent request. Relates to #70338 * Fix transport client usage in test

dakrone added v8.0.0 :Core/Infra/Node Lifecycle Node startup, bootstrapping, and shutdown v7.14.0 labels Apr 26, 2021

dakrone requested review from AthenaEryma and henningandersen April 26, 2021 17:10

elasticmachine added the Team:Core/Infra Meta label for core/infra team label Apr 26, 2021

dakrone mentioned this pull request Apr 26, 2021

Add node shutdown API for shutting down nodes cleanly #70338

Closed

22 tasks

henningandersen approved these changes Apr 27, 2021

AthenaEryma approved these changes Apr 27, 2021

benwtrent approved these changes Apr 28, 2021

dakrone added 4 commits April 28, 2021 08:21

Merge remote-tracking branch 'origin/master' into persistent-task-shutdown (d9e9ddd)

Add assert that cluster state should never be null (8ec8efd)

Switch mastersFirstStream() -> getAllNodes().stream() (7e00446)

Add comment about why candidateNodes is not used in TransportStartDatafeedAction (fbec5d8)

dakrone merged commit 0f50800 into elastic:master Apr 28, 2021

dakrone deleted the persistent-task-shutdown branch April 28, 2021 20:00

dakrone mentioned this pull request Apr 28, 2021

[7.x] Don't assign persistent tasks to nodes shutting down (#72260) #72426

Merged

jakelandis added v8.0.0-alpha1 and removed v8.0.0 labels Jul 26, 2021

probakowski added the >enhancement label Jul 30, 2021

droberts195 mentioned this pull request Feb 21, 2022

Persistent task assignment explanations can be confusing when nodes are shutting down #84195

Open

droberts195 mentioned this pull request Apr 28, 2023

Add explanation for persistent task assignment failure due to node shutdown #86923

Open

dakrone merged 5 commits into elastic:master from dakrone:persistent-task-shutdown
