Repository Cleanup Endpoint by original-brownbear · Pull Request #43900 · elastic/elasticsearch

original-brownbear · 2019-07-03T06:40:31Z

Snapshot cleanup functionality via transport/REST endpoint.
Added all the infrastructure for this with the HLRC and node client
Made use of it in tests and resolved relevant TODO
Added new Custom CS element that tracks the cleanup logic. Kept it similar to the delete and in progress classes and gave it some (for now) redundant way of handling multiple cleanups but only allow one
Use the exact same mechanism used by deletes to have the combination of CS entry and increment in repository state ID provide some concurrency safety (the initial approach of just an entry in the CS was not enough, we must increment the repository state ID to be safe against concurrent modifications, otherwise we run the risk of "cleaning up" blobs that just got created without noticing)
Isolated the logic to the transport action class as much as I could. It's not ideal, but we don't need to keep any state and do the same for other repository operations (like getting the detailed snapshot shard status)

tlrx

Thanks for the changes, I liked it better without the LongConsumer. Can we decide on the response output? That's the last item that prevents me to LGTM ;)

Also, did you consider to have a dry-run option?

tlrx · 2019-08-09T15:51:07Z

server/src/main/java/org/elasticsearch/common/blobstore/DeleteResult.java

+
+    public static final DeleteResult ZERO = new DeleteResult(0, 0);
+
+    private final long blobsDeleted;


nit: this could be just blobs & bytes

Hmm, I'd rather not, then I have to use this in add :D

...r/src/main/java/org/elasticsearch/rest/action/admin/cluster/RestCleanupRepositoryAction.java

original-brownbear · 2019-08-09T15:59:59Z

Thanks @tlrx

Thanks for the changes, I liked it better without the LongConsumer.

Np, so do I :D

Can we decide on the response output? That's the last item that prevents me to LGTM ;)

I like the one level of nesting better but have no strong arguments for it other than maybe consistency with other APIs and extensibility.

Also, did you consider to have a dry-run option?

Yea but in the end I'm not sure I see the point. If you think about it, all you can present to the user is a bunch of uuids for indices and stale root blobs). What can they really get out of that?
On the other hand, the response could be problematically huge in some cases and if we add more logic here to do some deep cleaning of shards it gets questionable how we would even present the results of that (IMO) -> not sure it's worth it unless someone sees a good use case?

original-brownbear · 2019-08-09T20:02:22Z

Jenkins run elasticsearch-ci/bwc

original-brownbear · 2019-08-10T10:01:37Z

Jenkins run elasticsearch-ci/bwc

original-brownbear · 2019-08-10T18:11:19Z

Jenkins run elasticsearch-ci/bwc

andrershov

LGTM

original-brownbear · 2019-08-13T16:38:55Z

Jenkins run elasticsearch-ci/packaging-sample

original-brownbear · 2019-08-21T09:19:46Z

@tlrx ping :) (no rush I know you just returned)

Can we decide on the response output? That's the last item that prevents me to LGTM ;)

I still like the current format :) ... mainly for its easier extensibility. If that's the last thing blocking this maybe we can go with the current version and merge this? :)

tlrx

LGTM

Concerning the response output I expressed my opinion but if both you and @andrershov prefer the proposed format then I'm fine :)

Thanks for the response on the dry-run option, I wanted to know if it was considered at some point. Since this is something that can be added later, we'll see if someone requests it.

original-brownbear · 2019-08-21T10:00:47Z

Thanks so much @tlrx and @andrershov for reviewing this big one!

Disabling BwC tests so #45780 can be merged

* Repository Cleanup Endpoint (#43900) * Snapshot cleanup functionality via transport/REST endpoint. * Added all the infrastructure for this with the HLRC and node client * Made use of it in tests and resolved relevant TODO * Added new `Custom` CS element that tracks the cleanup logic. Kept it similar to the delete and in progress classes and gave it some (for now) redundant way of handling multiple cleanups but only allow one * Use the exact same mechanism used by deletes to have the combination of CS entry and increment in repository state ID provide some concurrency safety (the initial approach of just an entry in the CS was not enough, we must increment the repository state ID to be safe against concurrent modifications, otherwise we run the risk of "cleaning up" blobs that just got created without noticing) * Isolated the logic to the transport action class as much as I could. It's not ideal, but we don't need to keep any state and do the same for other repository operations (like getting the detailed snapshot shard status)

original-brownbear added 30 commits June 25, 2019 20:02

Repo cleanup endpoint start

add8cbb

Merge remote-tracking branch 'elastic/master' into cleanup-repo-ep

b850f48

50%

7abc873

Merge remote-tracking branch 'elastic/master' into cleanup-repo-ep

7b698f4

just ack for now

80624cd

just ack for now

8d22cd5

just ack for now

6e14c27

bck

059eca9

Merge remote-tracking branch 'elastic/master' into cleanup-repo-ep

fd6f8df

bck

bfd7e63

Merge remote-tracking branch 'elastic/master' into cleanup-repo-ep

4d1ed1f

Merge remote-tracking branch 'elastic/master' into cleanup-repo-ep

0b5b0c0

Merge remote-tracking branch 'elastic/master' into cleanup-repo-ep

a828306

bck

d2952d4

Merge remote-tracking branch 'elastic/master' into cleanup-repo-ep

d913ad4

Fix functionality

c54a075

add rest action

71446c5

fix compilation

48da5d0

fix compilation

6f2e702

nicer formatting

5a0c826

Merge remote-tracking branch 'elastic/master' into cleanup-repo-ep

79bcc6b

Merge remote-tracking branch 'elastic/master' into cleanup-repo-ep

87785e1

bck

2bbd0f0

Merge remote-tracking branch 'elastic/master' into cleanup-repo-ep

29b419d

Merge remote-tracking branch 'elastic/master' into cleanup-repo-ep

2340375

merge fixes

b2ebb52

add some documentation

ba1ad03

nicer cleanup logic

266471f

bck

28a4b69

Merge remote-tracking branch 'elastic/master' into cleanup-repo-ep

81ba190

original-brownbear requested a review from andrershov August 9, 2019 13:02

tlrx reviewed Aug 9, 2019

View reviewed changes

original-brownbear requested a review from tlrx August 9, 2019 17:02

add missing empty line

3af846c

Merge remote-tracking branch 'elastic/master' into cleanup-repo-ep

f8238cf

andrershov approved these changes Aug 13, 2019

View reviewed changes

Merge remote-tracking branch 'elastic/master' into cleanup-repo-ep

6678156

original-brownbear mentioned this pull request Aug 18, 2019

Crashing during snapshot deletion might result in unreferenced data left in repository #13159

Closed

tlrx approved these changes Aug 21, 2019

View reviewed changes

original-brownbear merged commit df01766 into elastic:master Aug 21, 2019

original-brownbear deleted the cleanup-repo-ep branch August 21, 2019 10:02

original-brownbear added the backport pending label Aug 21, 2019

original-brownbear mentioned this pull request Aug 21, 2019

Repository Cleanup Endpoint (#43900) #45780

Merged

original-brownbear added a commit that referenced this pull request Aug 21, 2019

Disable BwC Tests for #43900 (#45781)

31e3e71

Disabling BwC tests so #45780 can be merged

original-brownbear removed the backport pending label Aug 23, 2019

jen-huang mentioned this pull request Aug 23, 2019

[SR] Allow users to perform cleanup action for a repository elastic/kibana#43904

Closed

This was referenced Oct 14, 2019

7.4 meta ticket elastic/elasticsearch-net#4133

Closed

Implement snapshot repository cleanup elastic/elasticsearch-net#4145

Merged

mkleen mentioned this pull request Jun 29, 2020

Track Shard-Snapshot Index Generation at Repository Root crate/crate#10128

Merged

5 tasks

original-brownbear restored the cleanup-repo-ep branch August 6, 2020 18:38

jakelandis added v8.0.0-alpha1 and removed v8.0.0 labels Jul 26, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository Cleanup Endpoint#43900

Repository Cleanup Endpoint#43900
original-brownbear merged 105 commits intoelastic:masterfrom
original-brownbear:cleanup-repo-ep

original-brownbear commented Jul 3, 2019 •

edited

Loading

Uh oh!

tlrx left a comment

Uh oh!

tlrx Aug 9, 2019

Uh oh!

original-brownbear Aug 9, 2019

Uh oh!

Uh oh!

original-brownbear commented Aug 9, 2019

Uh oh!

original-brownbear commented Aug 9, 2019

Uh oh!

original-brownbear commented Aug 10, 2019

Uh oh!

original-brownbear commented Aug 10, 2019

Uh oh!

andrershov left a comment

Uh oh!

original-brownbear commented Aug 13, 2019

Uh oh!

original-brownbear commented Aug 21, 2019

Uh oh!

tlrx left a comment

Uh oh!

original-brownbear commented Aug 21, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants


		public static final DeleteResult ZERO = new DeleteResult(0, 0);

		private final long blobsDeleted;

Conversation

original-brownbear commented Jul 3, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tlrx left a comment

Choose a reason for hiding this comment

Uh oh!

tlrx Aug 9, 2019

Choose a reason for hiding this comment

Uh oh!

original-brownbear Aug 9, 2019

Choose a reason for hiding this comment

Uh oh!

Uh oh!

original-brownbear commented Aug 9, 2019

Uh oh!

original-brownbear commented Aug 9, 2019

Uh oh!

original-brownbear commented Aug 10, 2019

Uh oh!

original-brownbear commented Aug 10, 2019

Uh oh!

andrershov left a comment

Choose a reason for hiding this comment

Uh oh!

original-brownbear commented Aug 13, 2019

Uh oh!

original-brownbear commented Aug 21, 2019

Uh oh!

tlrx left a comment

Choose a reason for hiding this comment

Uh oh!

original-brownbear commented Aug 21, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

original-brownbear commented Jul 3, 2019 •

edited

Loading