Skip to content

Always rebuild checkpoint tracker for old indices#46340

Merged
dnhatn merged 2 commits intoelastic:6.8from
dnhatn:rebuild_checkpoint_tracker
Sep 7, 2019
Merged

Always rebuild checkpoint tracker for old indices#46340
dnhatn merged 2 commits intoelastic:6.8from
dnhatn:rebuild_checkpoint_tracker

Conversation

@dnhatn
Copy link
Copy Markdown
Member

@dnhatn dnhatn commented Sep 4, 2019

The max_seq_no of Lucene commit of the old indices (before 6.6.2) can be smaller than seq_no of some documents in the commit (see #38879). Although we fixed this bug in 6.6.2 and 7.0.0, a problematic index commit can still affect the newer version after a rolling upgrade or full cluster restart. In particular, if a FollowingEngine (or InternalEngine with MSU enabled) restores from a problematic commit, then it can apply MSU optimization for existing documents. The symptom that we see here is the local checkpoint tracker assertion is violated.

Closes #46311
Relates #38879

@dnhatn dnhatn added >bug :Distributed/Engine Anything around managing Lucene and the Translog in an open shard. v6.8.4 labels Sep 4, 2019
@elasticmachine
Copy link
Copy Markdown
Collaborator

Pinging @elastic/es-distributed

Copy link
Copy Markdown
Contributor

@ywelsch ywelsch left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I've asked for two cosmetic changes, o.w. looking good.

Copy link
Copy Markdown
Contributor

@henningandersen henningandersen left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM.

@dnhatn
Copy link
Copy Markdown
Member Author

dnhatn commented Sep 5, 2019

run elasticsearch-ci/2

@dnhatn
Copy link
Copy Markdown
Member Author

dnhatn commented Sep 7, 2019

@ywelsch and @henningandersen Thanks for reviewing.

@dnhatn dnhatn merged commit eae6361 into elastic:6.8 Sep 7, 2019
@dnhatn dnhatn deleted the rebuild_checkpoint_tracker branch September 7, 2019 03:11
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

>bug :Distributed/Engine Anything around managing Lucene and the Translog in an open shard. v6.8.4

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants