Improve slow logging in MasterService by DaveCTurner · Pull Request #45086 · elastic/elasticsearch

DaveCTurner · 2019-08-01T12:41:13Z

Adds a tighter threshold for logging a warning about slowness in the
MasterService instead of relying on the cluster service's 30-second warning
threshold. This new threshold applies to the computation of the cluster state
update in isolation, so we get a warning if computing a new cluster state
update takes longer than 10 seconds even if it is subsequently applied quickly.
It also applies independently to the length of time it takes to notify the
cluster state tasks on completion of publication, in case any of these
notifications holds up the master thread for too long.

Relates #45007

Adds a tighter threshold for logging a warning about slowness in the `MasterService` instead of relying on the cluster service's 30-second warning threshold. This new threshold applies to the computation of the cluster state update in isolation, so we get a warning if computing a new cluster state update takes longer than 10 seconds even if it is subsequently applied quickly. It also applies independently to the length of time it takes to notify the cluster state tasks on completion of publication, in case any of these notifications holds up the master thread for too long. Relates elastic#45007

elasticmachine · 2019-08-01T12:41:16Z

Pinging @elastic/es-distributed

DaveCTurner · 2019-08-01T14:46:28Z

@elasticmachine please run elasticsearch-ci/2 (unrelated watcher failure it seems)

ywelsch

Two minor comments, looking good o.w.

server/src/main/java/org/elasticsearch/cluster/service/ClusterService.java

ywelsch · 2019-08-05T12:17:13Z

server/src/main/java/org/elasticsearch/cluster/service/MasterService.java

+                    taskInputs.summary,
+                    previousClusterState.nodes(),
+                    previousClusterState.routingTable(),
+                    previousClusterState.getRoutingNodes()),


this call is potentially expensive (as it lazily builds the routing nodes), so I wonder if we should still guard this logging call with logger.isTraceEnabled()

Ok, I think passing a message supplier is enough to avoid that, see 45c1919.

…er-service

DaveCTurner · 2019-08-06T08:52:21Z

Failures look infrastructural. You do you, Jenkins.

@elasticmachine please run elasticsearch-ci/bwc
@elasticmachine please run elasticsearch-ci/oss-distro-docs

ywelsch

LGTM

Adds a tighter threshold for logging a warning about slowness in the `MasterService` instead of relying on the cluster service's 30-second warning threshold. This new threshold applies to the computation of the cluster state update in isolation, so we get a warning if computing a new cluster state update takes longer than 10 seconds even if it is subsequently applied quickly. It also applies independently to the length of time it takes to notify the cluster state tasks on completion of publication, in case any of these notifications holds up the master thread for too long. Relates elastic#45007

Adds a tighter threshold for logging a warning about slowness in the `MasterService` instead of relying on the cluster service's 30-second warning threshold. This new threshold applies to the computation of the cluster state update in isolation, so we get a warning if computing a new cluster state update takes longer than 10 seconds even if it is subsequently applied quickly. It also applies independently to the length of time it takes to notify the cluster state tasks on completion of publication, in case any of these notifications holds up the master thread for too long. Relates #45007 Backport of #45086

DaveCTurner added >enhancement :Distributed/Cluster Coordination Cluster formation and cluster state publication, including cluster membership and fault detection. v8.0.0 v7.4.0 labels Aug 1, 2019

DaveCTurner requested review from original-brownbear and ywelsch August 1, 2019 12:41

DaveCTurner added 2 commits August 1, 2019 13:44

Revert

4372986

Try-with-resources

249ac9b

ywelsch reviewed Aug 5, 2019

View reviewed changes

DaveCTurner added 3 commits August 6, 2019 09:33

Merge branch 'master' into 2019-08-01-tighter-timeout-warning-in-mast…

b24e688

…er-service

Move setting

253ac0b

Lazy message

45c1919

DaveCTurner requested a review from ywelsch August 6, 2019 08:46

ywelsch approved these changes Aug 6, 2019

View reviewed changes

DaveCTurner merged commit 6143ebf into elastic:master Aug 6, 2019

DaveCTurner deleted the 2019-08-01-tighter-timeout-warning-in-master-service branch August 6, 2019 14:09

DaveCTurner mentioned this pull request Aug 6, 2019

Improve slow logging in MasterService #45241

Merged

DaveCTurner added the backport pending label Aug 6, 2019

DaveCTurner removed the backport pending label Aug 6, 2019

jakelandis added v8.0.0-alpha1 and removed v8.0.0 labels Jul 26, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve slow logging in MasterService#45086

Improve slow logging in MasterService#45086
DaveCTurner merged 6 commits intoelastic:masterfrom
DaveCTurner:2019-08-01-tighter-timeout-warning-in-master-service

DaveCTurner commented Aug 1, 2019

Uh oh!

elasticmachine commented Aug 1, 2019

Uh oh!

DaveCTurner commented Aug 1, 2019

Uh oh!

ywelsch left a comment

Uh oh!

Uh oh!

ywelsch Aug 5, 2019

Uh oh!

DaveCTurner Aug 6, 2019

Uh oh!

DaveCTurner commented Aug 6, 2019

Uh oh!

ywelsch left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

DaveCTurner commented Aug 1, 2019

Uh oh!

elasticmachine commented Aug 1, 2019

Uh oh!

DaveCTurner commented Aug 1, 2019

Uh oh!

ywelsch left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

ywelsch Aug 5, 2019

Choose a reason for hiding this comment

Uh oh!

DaveCTurner Aug 6, 2019

Choose a reason for hiding this comment

Uh oh!

DaveCTurner commented Aug 6, 2019

Uh oh!

ywelsch left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants