
server: don't drain when decommissioning #28707

Merged

craig[bot] merged 1 commit into cockroachdb:master from tbg:fix/decommission-quiesce on Aug 16, 2018

Conversation


@tbg tbg commented Aug 16, 2018

Prior to this commit, a server entering decommissioning would
automatically drain. This wasn't a great idea: pointing the
decommission command at a set of nodes that the cluster needs in
order to function would brick that cluster, and recovery was
difficult (since the draining state is picked up by the nodes
when they start).

Instead, decouple draining from decommissioning. Decommissioning simply
tells the allocator to move data off that node; when the node gets shut
down cleanly (the operator's responsibility), it will drain.

The resulting behavior when trying to decommission a too-large set of
nodes is now that the decommission command will simply not finish.
For example, starting a three node cluster and decommissioning all
nodes will simply hang (though the nodes will be marked as
decommissioning). Add three more nodes to the cluster and replicas
will move over to the newly added nodes, and the decommissioning
command will finish.

As a bonus, recommissioning a node now doesn't require the target
node to restart.

As a second bonus, the decommissioning acceptance test now takes
around 60% of the previous time (~45s, down from 70s+).

One remaining caveat is that users may forget that they attempted
to decommission nodes. We need to check that we prominently alert
in the UI when nodes are decommissioning. This isn't a new problem.

Fixes #27444.
Fixes #27025.

Release note (bug fix): decommissioning multiple nodes is now possible
without posing a risk to cluster health. Recommissioning a node no
longer requires a restart of the target node to take effect.

@tbg tbg requested a review from a team August 16, 2018 15:30
@cockroach-teamcity (Member)

This change is Reviewable

@tbg tbg force-pushed the fix/decommission-quiesce branch from 068f8f9 to 89a2852 Compare August 16, 2018 16:48
@tbg tbg requested a review from nvb August 16, 2018 16:49
@nvb nvb (Contributor) left a comment

:lgtm: but I think you mean "don't drain when decommissioning" in the commit/PR title.

Reviewed 3 of 3 files at r1.
Reviewable status: :shipit: complete! 0 of 0 LGTMs obtained (and 1 stale)


pkg/acceptance/decommission_test.go, line 284 at r1 (raw file):

		exp := [][]string{
			decommissionHeader,
			// Expect the same usual, except this time the node should be draining

s/same usual/same as usual/

@tbg tbg changed the title from "server: don't quiesce when decommissioning" to "server: don't drain when decommissioning" Aug 16, 2018
@tbg tbg force-pushed the fix/decommission-quiesce branch from 89a2852 to f7c66c1 Compare August 16, 2018 19:48
@tbg tbg (Member, Author) commented Aug 16, 2018

Changed both, TFTR!

bors r=nvanbenschoten

craig bot pushed a commit that referenced this pull request Aug 16, 2018
28684: storage: make subsumption atomic with merge commit application r=bdarnell,tschottdorf a=benesch

~Please review the first commit separately in #28661.~

During a merge, the subsumed range logically ceases to exist at the
moment that the merge transaction commit applies. It's important that we
remove the subsumed range from disk in the same batch as the merge
commit to protect against an ill-timed crash. Tease apart
Replica.destroyRaftMuLocked to make this possible.

Release note: None

28707: server: don't drain when decommissioning r=nvanbenschoten a=tschottdorf


Co-authored-by: Nikhil Benesch <nikhil.benesch@gmail.com>
Co-authored-by: Tobias Schottdorf <tobias.schottdorf@gmail.com>
@craig craig bot (Contributor) commented Aug 16, 2018

Build succeeded

@craig craig bot merged commit f7c66c1 into cockroachdb:master Aug 16, 2018
@tbg tbg deleted the fix/decommission-quiesce branch August 20, 2018 13:44
@tbg tbg added the docs-todo label Aug 22, 2018
tbg added a commit to tbg/cockroach that referenced this pull request Sep 18, 2018
This is somehow the salient part of cockroachdb#28707 that I managed to miss
during the initial backport of that PR: we don't want to set the
node to draining when it is being decommissioned.

Note that the /health?ready=1 endpoint will continue to return a
503 error once decommissioning starts due to a readiness check
(something I am preparing a PR against master for).

Release note: None
tbg added a commit to tbg/cockroach that referenced this pull request Sep 25, 2018


Development

Successfully merging this pull request may close these issues.

storage: Cluster can't recover if all nodes are decommissioned and restarted
server: decommissioning with replication factor one causes outage

3 participants