[DSL Global Retention] Use data stream global retention metadata by gmarouli · Pull Request #106221 · elastic/elasticsearch

gmarouli · 2024-03-12T08:04:35Z

In this PR we make use in the code the global retention metadata that we introduced in #106170. We use it in responses of GET requests when data stream lifecycle was included.

{
  ...
  "lifecycle": {
    "data_retention": "365d",
    "effective_retention": "90d",
    "retention_determined_by": "max_global_retention"
  }

Currently, there is no way to set the global retention for the user, so this PR is also marked as a >non-issue, it adds two new fields but the global retention functionality cannot influence them yet.

elasticsearchmachine · 2024-03-19T13:35:17Z

Pinging @elastic/es-data-management (Team:Data Management)

gmarouli · 2024-03-19T13:45:50Z

Known failure: #106457

docs/reference/data-streams/lifecycle/tutorial-manage-new-data-stream.asciidoc

masseyke · 2024-03-19T16:43:55Z

server/src/main/java/org/elasticsearch/cluster/metadata/DataStreamLifecycle.java

    public static final ConstructingObjectParser<DataStreamLifecycle, Void> PARSER = new ConstructingObjectParser<>(
        "lifecycle",
-        false,
+        true,


I'm a little confused by this change. I think it's because we don't expect (or allow) the user to pass in effective_retention or retention_determined_by, but we use this parser for round-trip serialization tests, which need to use that. Is that right? It might be worth a short comment in the code since the ConstructingObjectParser constructor's javadocs recommend against setting it to true (This should generally be set to true only when parsing responses from external systems, never when parsing requests from users).

Oh it's more than just the round-trip serialization test -- I tried setting this to false to see what else was impacted, and a yaml rest test also failed when the xcontent was being deserialized by DataStreamLifecycle.fromXContent().

Thank you for raising this topic @masseyke . I spent some more time looking around and I decided to change the approach for the following reasons:

Well, javadoc is very specific for the use of ignoreUnknownFields:

@param ignoreUnknownFields Should this parser ignore unknown fields? This should generally be set to true only when parsing responses from external systems, never when parsing requests from users.

DataStreamLifecycle is used for user input and using ignoreUnknownFields means that a user could add different fields including effective_retention which will be ignored but the user won't know.

The problem here I think is that DataStreamLifecycle is not only used for user input but it's also deserialised in the cluster state as XContent as part of the data stream.

As you see we do not have this problem with the rollover information, because they are only used when creating a response to the user. So, I decided to follow this approach. I added a flag, that is set only when we are creating a response to the user and it's false otherwise. This will ensure that the new and unexpected fields will only be present in the user responses and not during other serialisations.

I requested another review because the approach changed and I would like your input. Thanks again for bringing this up! I think the original approach was not thought through properly.

And uncovered a mistake on a payload... that's why the yaml was failing, or at least one of the reasons

What you did makes sense to me. I like that we no longer ignore unknown fields.

masseyke · 2024-03-19T17:48:34Z

It looks good to me. It would probably be good to have some yaml rest tests -- maybe you're waiting to add those until you've added a way to set global retention?

masseyke

I had a few minor questions/comments, but it looks good to me.

gmarouli · 2024-03-20T09:41:21Z

It looks good to me. It would probably be good to have some yaml rest tests -- maybe you're waiting to add those until you've added a way to set global retention?

Yes, indeed that would be in the final PR.

masseyke · 2024-03-20T18:01:38Z

Docs preview here: https://elasticsearch_bk_106221.docs-preview.app.elstc.co/diff

gmarouli added 5 commits March 11, 2024 16:39

Use the global retention in data stream lifecycle retention calculation

109bf28

Update data stream APIs

b5d4378

Update explain APIs

78a0efd

Update template related APIs

fab2817

Update telemetry to use the configured retention

73f0225

elasticsearchmachine added the v8.14.0 label Mar 12, 2024

gmarouli added 13 commits March 12, 2024 10:13

Merge with main

c74ff7d

Merge branch 'main' into use-data-stream-global-retention-metadata

76f058a

Fix DataStreamTests

91ccdb0

Fix DataStreamTests

b58d10b

Update some doc tests with the new fields

57f39a2

Add missing commas in json payload

9eff27b

Merge branch 'main' into use-data-stream-global-retention-metadata

bed854d

Checkpont

e36d47a

Merge with main

758cbbf

format

83c4b93

rounding up tests & polishing

c696e52

Fix tests

39aeb28

Remove seeds

f95180d

gmarouli mentioned this pull request Mar 19, 2024

[Meta] Data stream lifecycle global retention #106169

Closed

4 tasks

gmarouli requested review from masseyke and parkertimmins March 19, 2024 13:34

gmarouli added >non-issue :StorageEngine/Data streams Data streams and their lifecycles labels Mar 19, 2024

gmarouli marked this pull request as ready for review March 19, 2024 13:34

elasticsearchmachine added the Team:Data Management (obsolete) DO NOT USE. This team no longer exists. label Mar 19, 2024

masseyke reviewed Mar 19, 2024

View reviewed changes

docs/reference/data-streams/lifecycle/tutorial-manage-new-data-stream.asciidoc Outdated Show resolved Hide resolved

masseyke reviewed Mar 19, 2024

View reviewed changes

masseyke approved these changes Mar 19, 2024

View reviewed changes

gmarouli added 4 commits March 20, 2024 09:38

Merge branch 'main' into use-data-stream-global-retention-metadata

c310f52

Change the approach of adding the effective retention

3e8b2d5

Polishing

2da8616

typo

fca0063

gmarouli requested a review from masseyke March 20, 2024 09:42

gmarouli added 4 commits March 20, 2024 15:09

Fix test

c5f8ad7

Fix yaml test

ce33ba9

Fix test

1150a64

Remove seed

79f6809

masseyke approved these changes Mar 20, 2024

View reviewed changes

gmarouli merged commit 2988799 into elastic:main Mar 20, 2024

gmarouli deleted the use-data-stream-global-retention-metadata branch March 20, 2024 18:34

This was referenced Mar 21, 2024

Add DownsampleMetrics #106637

Closed

Set index mode earlier for new downsample index #106728

Merged

Added initial metrics for synthetic source #106732

Merged

masseyke mentioned this pull request May 1, 2024

Removing global retention functionality for templates #108170

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[DSL Global Retention] Use data stream global retention metadata#106221

[DSL Global Retention] Use data stream global retention metadata#106221
gmarouli merged 26 commits intoelastic:mainfrom
gmarouli:use-data-stream-global-retention-metadata

gmarouli commented Mar 12, 2024 •

edited

Loading

Uh oh!

elasticsearchmachine commented Mar 19, 2024

Uh oh!

gmarouli commented Mar 19, 2024

Uh oh!

Uh oh!

masseyke Mar 19, 2024

Uh oh!

masseyke Mar 19, 2024

Uh oh!

gmarouli Mar 20, 2024

Uh oh!

gmarouli Mar 20, 2024

Uh oh!

gmarouli Mar 20, 2024

Uh oh!

masseyke Mar 20, 2024

Uh oh!

masseyke commented Mar 19, 2024

Uh oh!

masseyke left a comment

Uh oh!

gmarouli commented Mar 20, 2024

Uh oh!

masseyke commented Mar 20, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

gmarouli commented Mar 12, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

elasticsearchmachine commented Mar 19, 2024

Uh oh!

gmarouli commented Mar 19, 2024

Uh oh!

Uh oh!

masseyke Mar 19, 2024

Choose a reason for hiding this comment

Uh oh!

masseyke Mar 19, 2024

Choose a reason for hiding this comment

Uh oh!

gmarouli Mar 20, 2024

Choose a reason for hiding this comment

Uh oh!

gmarouli Mar 20, 2024

Choose a reason for hiding this comment

Uh oh!

gmarouli Mar 20, 2024

Choose a reason for hiding this comment

Uh oh!

masseyke Mar 20, 2024

Choose a reason for hiding this comment

Uh oh!

masseyke commented Mar 19, 2024

Uh oh!

masseyke left a comment

Choose a reason for hiding this comment

Uh oh!

gmarouli commented Mar 20, 2024

Uh oh!

masseyke commented Mar 20, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

gmarouli commented Mar 12, 2024 •

edited

Loading