storage: make subsumption atomic with merge commit application #28684
craig[bot] merged 1 commit into cockroachdb:master
Conversation
tbg
left a comment
Reviewed 10 of 10 files at r1, 8 of 8 files at r2.
Reviewable status:complete! 0 of 0 LGTMs obtained
pkg/storage/replica.go, line 809 at r2 (raw file):
// Suggest the cleared range to the compactor queue.
r.store.compactor.Suggest(ctx, storagebase.SuggestedCompaction{
This should happen later, after you've actually committed it, in postDestroyRaftMuLocked (or any later point in time).
pkg/storage/replica.go, line 872 at r2 (raw file):
}
ms := r.GetMVCCStats()
Would feel better if you grabbed this before actually destroying the replica. It's unclear what answer you're getting right now.
bdarnell
left a comment
s/subsumation/subsumption/
Reviewed 2 of 10 files at r1, 8 of 8 files at r2.
Reviewable status:complete! 0 of 0 LGTMs obtained
pkg/storage/store.go, line 2529 at r2 (raw file):
// RemoveOptions bundles boolean parameters for Store.RemoveReplica.
type RemoveOptions struct {
	DestroyReplica bool
I prefer the old name. The Replica object is effectively destroyed either way; the main effect of destroyRaftMuLocked is to delete the data.
benesch
left a comment
D'oh, thanks. I knew "subsumation" sounded funny.
Reviewable status:complete! 0 of 0 LGTMs obtained
pkg/storage/replica.go, line 809 at r2 (raw file):
Previously, tschottdorf (Tobias Schottdorf) wrote…
This should happen later, after you've actually committed it, in postDestroyRaftMuLocked (or any later point in time).
Done.
pkg/storage/replica.go, line 872 at r2 (raw file):
Previously, tschottdorf (Tobias Schottdorf) wrote…
Would feel better if you grabbed this before actually destroying the replica. It's unclear what answer you're getting right now.
Done.
pkg/storage/store.go, line 2529 at r2 (raw file):
Previously, bdarnell (Ben Darnell) wrote…
I prefer the old name. The Replica object is effectively destroyed either way; the main effect of destroyRaftMuLocked is to delete the data.
Done.
During a merge, the subsumed range logically ceases to exist at the moment that the merge transaction commit applies. It's important that we remove the subsumed range from disk in the same batch as the merge commit to protect against an ill-timed crash. Tease apart Replica.destroyRaftMuLocked to make this possible.

Release note: None
bors r=bdarnell,tschottdorf
Build failed
Examples-ORMs flake. bors r=bdarnell,tschottdorf
28684: storage: make subsumption atomic with merge commit application r=bdarnell,tschottdorf a=benesch

~Please review the first commit separately in #28661.~

During a merge, the subsumed range logically ceases to exist at the moment that the merge transaction commit applies. It's important that we remove the subsumed range from disk in the same batch as the merge commit to protect against an ill-timed crash. Tease apart Replica.destroyRaftMuLocked to make this possible.

Release note: None

28707: server: don't drain when decommissioning r=nvanbenschoten a=tschottdorf

Prior to this commit, a server which entered decommissioning would automatically drain. This wasn't a great idea: pointing the decommissioning command at a set of nodes required for the cluster to work would brick that cluster, and recovery would be difficult (since the draining state is picked up by the nodes when they start).

Instead, decouple draining from decommissioning. Decommissioning simply tells the allocator to move data off that node; when the node is shut down cleanly (the operator's responsibility), it will drain. The resulting behavior when trying to decommission a too-large set of nodes is that the decommission command simply does not finish. For example, starting a three-node cluster and decommissioning all nodes will hang (though the nodes will be marked as decommissioning). Add three more nodes to the cluster and replicas will move over to the newly added nodes, and the decommissioning command will finish.

As a bonus, recommissioning a node no longer requires the target node to restart. As a second bonus, the decommissioning acceptance test now takes around 60% of the previous time (~45s, down from 70s+).

One remaining caveat is that users may forget that they attempted to decommission nodes. We need to check that we prominently alert in the UI when nodes are decommissioning. This isn't a new problem.

Fixes #27444.
Fixes #27025.

Release note (bug fix): decommissioning multiple nodes is now possible without posing a risk to cluster health. Recommissioning a node no longer requires a restart of the target nodes to take effect.

Co-authored-by: Nikhil Benesch <nikhil.benesch@gmail.com>
Co-authored-by: Tobias Schottdorf <tobias.schottdorf@gmail.com>
Build succeeded