Skip to content

kvserver: avoid lease transfers to restarted nodes that could become overloaded #96521

@kvoli

Description

@kvoli

We have seen that a node restarting in the cluster can have very negative effects on request latency.

A reproduction test was added in #95161

Allowing Lease Transfers
Extremely High P99 SQL Latency
image
image

Blocking Lease Transfers
No impact on P99 SQL Latency
image
image

This issue tracks the improvements to crdb in 23.1 to prevent a restarted node impacting end latency. The improvements are narrowly scoped but have been proven to have a significant effect in keeping request latency normal.

Jira issue: CRDB-24164

Metadata

Metadata

Assignees

Labels

A-kv-distributionRelating to rebalancing and leasing.C-enhancementSolution expected to add code/behavior + preserve backward-compat (pg compat issues are exception)GA-blockerT-kvKV Teambranch-release-23.1Used to mark GA and release blockers, technical advisories, and bugs for 23.1

Type

No type

Projects

No projects

Relationships

None yet

Development

No branches or pull requests

Issue actions