Skip to content

loss of quorum recovery tools: when performing replica recovery operations update rangelog with performed changes. #73679

@aliher1911

Description

@aliher1911

Replica recovery operations are destructive and could cause data loss. If cluster is kept as is after recovery and data is not migrated to a healthy cluster afterwards it could exhibit unexpected behaviours stemming from corrupted data. Subsequent investigations could be hard as normal consensus and group membership logic would not apply.

Save update info into the range log to allow subsequent investigations. Consider also increasing retention time for those particular events in rangelog from default 30 days.

Metadata

Metadata

Assignees

Labels

A-kv-replicationRelating to Raft, consensus, and coordination.C-enhancementSolution expected to add code/behavior + preserve backward-compat (pg compat issues are exception)

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions