Add last failed node allocation in the Allocation Explain API #28018
Closed
Labels
:Distributed/Distributed, >enhancement
Description
Right now, if you use the allocation explain API you will see an output containing the following:
"unassigned_info": {
"reason": "ALLOCATION_FAILED",
"at": "xxxx-xx-xx:05:32.602Z",
"failed_allocation_attempts": 6,
"details": "failed to create shard, failure FileSystemException[/xxxxx/xxxxx/xxxx/xxxxxx: Input/output error]",
"last_allocation_status": "no_attempt"
},
The per-node decisions only say which nodes were tried; nothing in the output identifies the node on which the allocation failed. The only way to find it is to go to the master node and search its logs for this particular error. In a cluster with a large number of shards and nodes, I think it would be useful to know at least which node showed the problem.
The request here is to add the last node on which allocation failed to the details, alongside the current detail message.
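To illustrate the request, here is a minimal Python sketch of how a client might summarize `unassigned_info` if the failed node were exposed. The field name `last_failed_node` and the helper `summarize_unassigned` are hypothetical and exist only for this illustration; the sample response reuses the redacted values from the output above.

```python
from typing import Any, Dict

# Hypothetical field name for the requested enhancement. It is NOT part of
# the allocation explain output shown in this issue.
HYPOTHETICAL_NODE_KEY = "last_failed_node"


def summarize_unassigned(explain: Dict[str, Any],
                         node_key: str = HYPOTHETICAL_NODE_KEY) -> str:
    """Build a one-line summary of the unassigned_info section."""
    info = explain.get("unassigned_info", {})
    # Without the requested field, the node can only be found in the logs.
    node = info.get(node_key, "unknown (check the master node logs)")
    return (
        f"reason={info.get('reason')} "
        f"attempts={info.get('failed_allocation_attempts')} "
        f"node={node}"
    )


# Sample taken from the output quoted in this issue (values redacted there).
sample = {
    "unassigned_info": {
        "reason": "ALLOCATION_FAILED",
        "at": "xxxx-xx-xx:05:32.602Z",
        "failed_allocation_attempts": 6,
        "details": "failed to create shard, failure "
                   "FileSystemException[/xxxxx/xxxxx/xxxx/xxxxxx: Input/output error]",
        "last_allocation_status": "no_attempt",
    }
}

print(summarize_unassigned(sample))
```

With today's output the summary can only report the node as unknown; if a field like the hypothetical one were present, the same helper would report it directly.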
CC @ywelsch