Disk indicator refactoring by gmarouli · Pull Request #90366 · elastic/elasticsearch

gmarouli · 2022-09-26T15:33:55Z

During this refactoring we introduce the DiskHealthAnalyzer which will determine the health state, the symptom, the impacts and the diagnoses.

The benefits of this class is that the constructor is using the input information to fill in data structures that answer the questions we need to determine the symptom, the impacts and the diagnosis. These features are quite complex to calculate so the simpler the code there the best it is.

We also restructured the tests to say in the test name the expected status and then the premises, blocked indices and/or status of the nodes. We introduce a comment that explains what the tests simulates and what's expected.

Finally, we change the retrieval of the indices per node to use the RoutingNodes instead of the RoutingTable directly for efficiency.

Closes #90212

elasticsearchmachine · 2022-09-26T15:40:17Z

Pinging @elastic/es-data-management (Team:Data Management)

andreidan

Thanks for refactoring this indicator Mary.
This is heading in the right direction 🚀

I left a few suggestions and questions

server/src/main/java/org/elasticsearch/health/node/DiskHealthIndicatorService.java

andreidan · 2022-09-27T10:20:52Z

server/src/main/java/org/elasticsearch/health/node/DiskHealthIndicatorService.java

-            .filter(routing -> indices.contains(routing.index().getName()))
-            .map(ShardRouting::currentNodeId)
-            .collect(Collectors.toSet());
+        DiskHealthAnalyzer diskHealthAnalyzer = new DiskHealthAnalyzer(diskHealthInfoMap, blockedIndices, clusterState);


Since the DiskHealthAnalyzer already has the cluster state, could it look and compute the blocked indices?

We now compute them in the indicator but not use them anywhere else except passing them to DiskHealthAnalyzer.

What do you think?

That is true, my reasoning was that the interface shows the information needed to calculate it, but we can document why we need the cluster state and leave it at that ;) .

in that case I will move the details too.

server/src/main/java/org/elasticsearch/health/node/DiskHealthIndicatorService.java

server/src/main/java/org/elasticsearch/health/node/HealthIndicatorDisplayValues.java

andreidan · 2022-09-27T11:20:16Z

server/src/main/java/org/elasticsearch/cluster/routing/RoutingNode.java

    }

+    public Index[] copyIndices() {
+        return shardsByIndex.keySet().toArray(Index.EMPTY_ARRAY);


@dakrone do you know of a different/more efficient way of getting all the indices on a node?

@dakrone I merged this PR to move forward but please if you know about a better way to get all the indices per node, I will fix it in a new PR.

Co-authored-by: Andrei Dan <andrei.dan@elastic.co>

andreidan

Thanks for iterating on this Mary. LGTM 🚀

Left a suggestion

server/src/main/java/org/elasticsearch/health/node/DiskHealthIndicatorService.java

Introduced the DiskHealthAnalyzer which determines the health state, the symptom, the impacts, the diagnoses and the details. We also changed the retrieval of the indices per node to use the RoutingNodes instead of the RoutingTable directly for efficiency. Co-authored-by: Andrei Dan <andrei.dan@elastic.co>

elasticsearchmachine · 2022-09-28T13:14:37Z

💚 Backport successful

Status	Branch	Result
✅	8.5

Introduced the DiskHealthAnalyzer which determines the health state, the symptom, the impacts, the diagnoses and the details. We also changed the retrieval of the indices per node to use the RoutingNodes instead of the RoutingTable directly for efficiency. Co-authored-by: Andrei Dan <andrei.dan@elastic.co>

gmarouli added 3 commits September 26, 2022 17:03

Small polishing

ca4b476

Polish

88a7b67

Polish

3b730bc

elasticsearchmachine added the v8.6.0 label Sep 26, 2022

gmarouli added :Distributed/Health Issues for the health report API v8.5.1 >refactoring labels Sep 26, 2022

gmarouli requested a review from andreidan September 26, 2022 15:39

gmarouli marked this pull request as ready for review September 26, 2022 15:39

elasticsearchmachine added the Team:Data Management (obsolete) DO NOT USE. This team no longer exists. label Sep 26, 2022

andreidan reviewed Sep 27, 2022

View reviewed changes

gmarouli and others added 4 commits September 28, 2022 09:52

Merge branch 'main' into disk-indicator-refactoring

2ebd51d

Several review comments

6f16f46

Fix typo in javadoc

bc1ad5c

Co-authored-by: Andrei Dan <andrei.dan@elastic.co>

Split the diagnosis of master and other nodes

70fb730

gmarouli requested a review from andreidan September 28, 2022 11:41

Format fix

e668999

andreidan approved these changes Sep 28, 2022

View reviewed changes

server/src/main/java/org/elasticsearch/health/node/DiskHealthIndicatorService.java Outdated Show resolved Hide resolved

server/src/main/java/org/elasticsearch/health/node/DiskHealthIndicatorService.java Outdated Show resolved Hide resolved

gmarouli added the auto-backport Automatically create backport pull requests when merged label Sep 28, 2022

Replace putIfAbsent with computeIfAbsent

137d0f3

gmarouli merged commit 30f24d4 into elastic:main Sep 28, 2022

gmarouli deleted the disk-indicator-refactoring branch September 28, 2022 13:13

gmarouli mentioned this pull request Sep 28, 2022

[8.5] Disk indicator refactoring (#90366) #90456

Merged

csoulios removed the v8.5.1 label Nov 1, 2022

csoulios added the v8.5.0 label Nov 1, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Disk indicator refactoring#90366

Disk indicator refactoring#90366
gmarouli merged 9 commits intoelastic:mainfrom
gmarouli:disk-indicator-refactoring

gmarouli commented Sep 26, 2022 •

edited

Loading

Uh oh!

elasticsearchmachine commented Sep 26, 2022

Uh oh!

andreidan left a comment

Uh oh!

Uh oh!

andreidan Sep 27, 2022

Uh oh!

gmarouli Sep 28, 2022

Uh oh!

gmarouli Sep 28, 2022

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

andreidan Sep 27, 2022

Uh oh!

gmarouli Sep 28, 2022

Uh oh!

andreidan left a comment

Uh oh!

Uh oh!

Uh oh!

elasticsearchmachine commented Sep 28, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

gmarouli commented Sep 26, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

elasticsearchmachine commented Sep 26, 2022

Uh oh!

andreidan left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

andreidan Sep 27, 2022

Choose a reason for hiding this comment

Uh oh!

gmarouli Sep 28, 2022

Choose a reason for hiding this comment

Uh oh!

gmarouli Sep 28, 2022

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

andreidan Sep 27, 2022

Choose a reason for hiding this comment

Uh oh!

gmarouli Sep 28, 2022

Choose a reason for hiding this comment

Uh oh!

andreidan left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

elasticsearchmachine commented Sep 28, 2022

💚 Backport successful

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

gmarouli commented Sep 26, 2022 •

edited

Loading