Generating and committing a history_uuid on existing old indices destroys translog recovery info by bleskes · Pull Request #26734 · elastic/elasticsearch

bleskes · 2017-09-21T12:13:32Z

This is a bug introduced in #26694 . The issue comes from the attempt to share code that commits the new history uuid and/or a new translog uuid. This goes wrong an existing 5.6 index that is recovered from store:

A new history uuid is generated as it doesn't exist in the index
The translog is opened and it's uuid doesn't change.
The committing the new history uuid used the standard commitIndexWriter method.
The latter asks the translog for the oldest file generation that is required to recover from the local checkpoint + 1
The local checkpoint on old indices is -1 (until something is indexed) and the translog is asked to recover from position 0. That excludes any operations the translog that do not have a seq no assigned, causing the FullClusterRestart bwc tests to fail.

To bypass this PR moves away from the attempt to share the committing code between a new translog uuid and a new history uuid. Instead we do what we did before and open the translog with a potential commit. Afterwards we commit the history uuid if needed. This comes with the expense of opening up the method to commit an index writer in the engine.

This PR is opened against 6.x . We have the option to leave the code on master as is. Let me know which you prefer for the long term. I can go either way.

…hat expect a commit to have translog info

ywelsch

LGTM

This PR is opened against 6.x . We have the option to leave the code on master as is. Let me know which you prefer for the long term. I can go either way.

I have a weak preference to keep master as is and have this only go into 6.x. I'll leave it up to you.

ywelsch · 2017-09-21T14:10:44Z

core/src/main/java/org/elasticsearch/index/engine/InternalEngine.java

+            // assert we don't loose key entries
+            assert commitDataAsMap(writer).containsKey(Translog.TRANSLOG_UUID_KEY) : "commit misses translog uuid";
+            assert commitDataAsMap(writer).containsKey(Translog.TRANSLOG_GENERATION_KEY) : "commit misses translog generation";
+            assert commitDataAsMap(writer).containsKey(MAX_UNSAFE_AUTO_ID_TIMESTAMP_COMMIT_ID) : "commit misses max unsafe times stamps";


bleskes · 2017-09-21T14:28:49Z

Thx @ywelsch . I will leave master as is. Having two commitIndexWriter methods doesn't have a good smell (even with the assertions I added to protect against abuse).

…roys translog recovery info (#26734) This is a bug introduced in #26694 . The issue comes from the attempt to share code that commits the new history uuid and/or a new translog uuid. This goes wrong an existing 5.6 index that is recovered from store: 1) A new history uuid is generated as it doesn't exist in the index 2) The translog is opened and it's uuid doesn't change. 3) The committing the new history uuid used the standard commitIndexWriter method. 4) The latter asks the translog for the oldest file generation that is required to recover from the local checkpoint + 1 5) The local checkpoint on old indices is -1 (until something is indexed) and the translog is asked to recover from position 0. That excludes any operations the translog that do not have a seq no assigned, causing the FullClusterRestart bwc tests to fail. To bypass this commit moves away from the attempt to share the committing code between a new translog uuid and a new history uuid. Instead we do what we did before and open the translog with a potential commit. Afterwards we commit the history uuid if needed. This comes with the expense of opening up the method to commit an index writer in the engine.

bleskes added 4 commits September 21, 2017 12:18

roll back combined history/translog commit

7316467

move back to a follow history uuid commit, so not to confuse people t…

52d6287

…hat expect a commit to have translog info

relax assertions

8dea75c

name

ecac747

bleskes added :Engine >non-issue v6.0.0 v6.1.0 labels Sep 21, 2017

bleskes requested a review from ywelsch September 21, 2017 12:13

ywelsch approved these changes Sep 21, 2017

View reviewed changes

typo

2f4e7ea

bleskes merged commit 07b0c26 into elastic:6.x Sep 21, 2017

bleskes deleted the history_uuid_commit_6x branch September 21, 2017 14:27

colings86 added v6.0.0-rc1 and removed v6.0.0 labels Sep 22, 2017

lcawl removed the v6.1.0 label Dec 12, 2017

dnhatn mentioned this pull request Dec 18, 2017

Backport for using lastSyncedGlobalCheckpoint in deletion policy #27866

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Generating and committing a history_uuid on existing old indices destroys translog recovery info#26734

Generating and committing a history_uuid on existing old indices destroys translog recovery info#26734
bleskes merged 5 commits intoelastic:6.xfrom
bleskes:history_uuid_commit_6x

bleskes commented Sep 21, 2017

Uh oh!

ywelsch left a comment

Uh oh!

ywelsch Sep 21, 2017

Uh oh!

bleskes commented Sep 21, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

bleskes commented Sep 21, 2017

Uh oh!

ywelsch left a comment

Choose a reason for hiding this comment

Uh oh!

ywelsch Sep 21, 2017

Choose a reason for hiding this comment

Uh oh!

bleskes commented Sep 21, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants