Peer recovery should flush at the end by dnhatn · Pull Request #41660 · elastic/elasticsearch

dnhatn · 2019-04-30T00:55:55Z

Flushing at the end of a peer recovery (if needed) can bring these benefits:

Closing an index won't end up with the red state for a recovering replica should always be ready for closing whether it performs the verifying-before-close step or not.
Good opportunities to compact store (i.e., flushing and merging Lucene, and trimming translog)

Closes #40024
Closes #39588
Relates #33888

elasticmachine · 2019-04-30T00:55:57Z

Pinging @elastic/es-distributed

ywelsch

I've left a question.

ywelsch · 2019-04-30T07:35:31Z

server/src/main/java/org/elasticsearch/indices/recovery/RecoveryTarget.java

+        // if all those uncommitted operations have baked into the existing Lucene index commit already.
+        final SequenceNumbers.CommitInfo commitInfo = SequenceNumbers.loadSeqNoInfoFromLuceneCommit(
+            indexShard.commitStats().getUserData().entrySet());
+        return commitInfo.maxSeqNo != commitInfo.localCheckpoint


I wonder if the condition above about the translog is sufficient. What situation is the condition here addressing that's not addressed by the above one?

If a file-based occurs, the primary also sends its translog to replica. These operations are uncommitted on the replica even though they are baked into the commit already. We need this condition to avoid flushing in this case to keep the syncId. I pushed 07c3a7c to use another check.

dnhatn · 2019-04-30T21:18:00Z

@elasticmachine test this please

henningandersen

I think this could solve the issue and have other benefits as described.

But I am a bit worried about the implications, especially for future maintenance. If we ever add anything into VerifyShardBeforeClose, we need to also ensure the same holds at the end of a recovery. Also, I am not 100% sure recovery is the only place to ensure this (though I have no concrete cases).

I would find it more intuitive to (maybe in addition to this) add a check in MetaDataIndexStateService.closeRoutingTable to fail closing the index if the routing table contains unvalidated shard copies (meaning we would have to collect more info in the previous steps).

ywelsch · 2019-05-02T14:28:49Z

Good point @henningandersen, but failing the closing operation would also not be very user-friendly, as shards are free to move around based on rebalancing decisions. Let's consider more options here.

dnhatn · 2019-05-06T14:58:24Z

@henningandersen found that we can always validate max_seq_no equals to the global checkpoint in ReadOnlyEngine with this change. I pushed 6e952c5 to enable it.

henningandersen · 2019-05-08T10:51:32Z

@henningandersen found that we can always validate max_seq_no equals to the global checkpoint in ReadOnlyEngine with this change. I pushed 6e952c5 to enable it.

I tend to think I was wrong about this, since FrozenEngine extends ReadOnlyEngine. If something was frozen on 6.7 or 7.0, it might not obey the invariant if they have #41041 ?

This reverts commit 6e952c5.

ywelsch

LGTM

tlrx

LGTM

This reverts commit 91811b7.

dnhatn · 2019-05-22T02:34:51Z

Thanks everyone!

Flushing at the end of a peer recovery (if needed) can bring these benefits: 1. Closing an index won't end up with the red state for a recovering replica should always be ready for closing whether it performs the verifying-before-close step or not. 2. Good opportunities to compact store (i.e., flushing and merging Lucene, and trimming translog) Closes #40024 Closes #39588

Flushing at the end of a peer recovery (if needed) can bring these benefits: 1. Closing an index won't end up with the red state for a recovering replica should always be ready for closing whether it performs the verifying-before-close step or not. 2. Good opportunities to compact store (i.e., flushing and merging Lucene, and trimming translog) Closes elastic#40024 Closes elastic#39588

Peer recovery should flush at the end

34db798

dnhatn added >enhancement :Distributed/Recovery Anything around constructing a new shard, either from a local or a remote source. v8.0.0 v7.2.0 labels Apr 30, 2019

dnhatn requested review from henningandersen and ywelsch April 30, 2019 00:55

ywelsch suggested changes Apr 30, 2019

View reviewed changes

dnhatn added 2 commits April 30, 2019 09:59

update comment

baceaad

use count translog ops

07c3a7c

dnhatn requested a review from ywelsch April 30, 2019 14:22

Merge branch 'master' into peer-recovery-flush

a972df3

henningandersen reviewed May 2, 2019

View reviewed changes

dnhatn added 2 commits May 6, 2019 10:52

Merge branch 'master' into peer-recovery-flush

3959605

always check max_seq_no = gcp in readonly engine

6e952c5

tlrx mentioned this pull request May 7, 2019

Replicate closed indices #33888

Closed

50 tasks

dnhatn added 3 commits May 16, 2019 09:34

Revert "always check max_seq_no = gcp in readonly engine"

d3ee8fa

This reverts commit 6e952c5.

Merge branch 'master' into peer-recovery-flush

ad605a4

Fix compilation

a4aff09

ywelsch approved these changes May 17, 2019

View reviewed changes

tlrx approved these changes May 17, 2019

View reviewed changes

dnhatn added 4 commits May 21, 2019 15:31

Merge branch 'master' into peer-recovery-flush

e03da52

AwaitsFix testRefreshMetric

91811b7

Merge branch 'master' into peer-recovery-flush

9638eaf

Revert "AwaitsFix testRefreshMetric"

c8d08a3

This reverts commit 91811b7.

Merge branch 'master' into peer-recovery-flush

8218602

dnhatn merged commit 75be2a6 into elastic:master May 22, 2019

dnhatn deleted the peer-recovery-flush branch May 22, 2019 02:35

dnhatn added the backport pending label May 22, 2019

dnhatn removed the backport pending label May 22, 2019

jakelandis added v8.0.0-alpha1 and removed v8.0.0 labels Jul 26, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Peer recovery should flush at the end#41660

Peer recovery should flush at the end#41660
dnhatn merged 14 commits intoelastic:masterfrom
dnhatn:peer-recovery-flush

dnhatn commented Apr 30, 2019 •

edited

Loading

Uh oh!

elasticmachine commented Apr 30, 2019

Uh oh!

ywelsch left a comment

Uh oh!

ywelsch Apr 30, 2019

Uh oh!

dnhatn Apr 30, 2019

Uh oh!

dnhatn commented Apr 30, 2019

Uh oh!

henningandersen left a comment

Uh oh!

ywelsch commented May 2, 2019

Uh oh!

dnhatn commented May 6, 2019

Uh oh!

henningandersen commented May 8, 2019

Uh oh!

ywelsch left a comment

Uh oh!

tlrx left a comment

Uh oh!

dnhatn commented May 22, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Conversation

dnhatn commented Apr 30, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

elasticmachine commented Apr 30, 2019

Uh oh!

ywelsch left a comment

Choose a reason for hiding this comment

Uh oh!

ywelsch Apr 30, 2019

Choose a reason for hiding this comment

Uh oh!

dnhatn Apr 30, 2019

Choose a reason for hiding this comment

Uh oh!

dnhatn commented Apr 30, 2019

Uh oh!

henningandersen left a comment

Choose a reason for hiding this comment

Uh oh!

ywelsch commented May 2, 2019

Uh oh!

dnhatn commented May 6, 2019

Uh oh!

henningandersen commented May 8, 2019

Uh oh!

ywelsch left a comment

Choose a reason for hiding this comment

Uh oh!

tlrx left a comment

Choose a reason for hiding this comment

Uh oh!

dnhatn commented May 22, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

dnhatn commented Apr 30, 2019 •

edited

Loading