Skip to content

Proposer-Based Timestamps: Diagnostic 'runbook' #7756

@williambanfield

Description

@williambanfield

Summary

Proposer-based timestamps adds a new criteria for considering a block 'valid'. Therefore, validators may begin to prevote nil in cases that they would not previously have done so. It may not be immediately clear to operators and chains why a validator is prevoting nil. We should generate a runbook to help operators diagnose this condition. This runbook should rely on the newly added metric for determining timely-ness that was added as part of the implementation steps. See: #7709 for the relevant metric. It should consist of fine-grained and actionable steps that an operator may take to determine if the reason that they are not producing blocks is related to issues with synchronization and similarly actionable steps that they can take to correct the issue.

Open Questions

Is it reasonable to assume that operators are monitoring their nodes with prometheus? Prometheus is a very common tool used to capture metrics for production software. If we can assume it is being used, the runbook can rely on standard monitoring and alerting that users of prometheus are likely to have in place.

Metadata

Metadata

Labels

C:docsComponent: Documentation

Type

No type

Projects

Status

Done

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions