Redesign parser interface to return metrics and not lines#15115
Conversation
…eries: if we turn on NHCB, we need to parse classic values even if they are dropped as classic series, to be able to do NHCB. Signed-off-by: György Krajcsovits <gyorgy.krajcsovits@grafana.com>
The idea is to load metrics into curr, but if we see a change in metric name or labels we load them into next, return curr as detected, and continue from next. Signed-off-by: György Krajcsovits <gyorgy.krajcsovits@grafana.com>
```go
	Next() (Entry, error)

	// Parse returns the next metric with all information collected about it, such as
	// metric type, help text, unit, created timestamps, samples, etc.
	Next(d DropperCache, keepClassicHistogramSeries bool) (interface{}, error)
```
Quite a redesign! As we discussed on Slack, it's getting similar to expfmt, should we consider merging those?
I wonder how to make that decision: a total redesign to return interfaces/types vs. doubling down on a method for each complex metric type (similar to proto getters, kind of)...
My understanding of this parser was that we were asking callers to provide methods because we wanted to avoid type-to-type conversions. This is because we want to append low-level values directly. For protobuf you already unmarshal into a "type" (unless we write our own parser or something), so that's hard to avoid. With this work we seem to create new types to convert from/into. Isn't that a regression? Wouldn't this be simpler if we just used the protobuf (dto) types (which is essentially what expfmt is doing)? Then the question is.. should it be OM or Prom proto (:
To sum up, what's the best way to make decision here?
Also, if we do such a redesign, I would highly suggest textparsev2 or expparse or some other package. This will give us a way to slowly add implementations/formats, slowly migrate stuff, and compare efficiencies.
Also, what's the TL;DR blocker for doing line by line?
For NHCB - we already “abstract” line by line with the CreatedTimestamp logic. Can we assume a group of classic histogram series that represents an NHCB is our new "one" line?
For proto - it's weird indeed, but is it really slower/less efficient because of that interface?
Ok, I understand the small confusion. I think we focused here on the idea that we can have a wrapper parser for all 3 implementations 🤔 That is one perspective, but I would argue that for low-level parsing, with our learnings from CT parsing in OM Text, it's much faster/leaner to NOT use abstracted Metrics/Series etc. methods, but to use things from the lexer directly.
This is because we want to append directly low-level values.
From what I could gather this, in practice, means:
- parser takes a byte slice
- that byte slice is a metrics response body, so the whole body needs to be first read into memory, often also uncompressed fully
- this means that scrape code needs a lot of byte slices, so it ends up with a lot of GC pressure and so there's a sync.Pool to help mitigate this
- these low level values need to be parsed later to get the actual metrics
- this also means that scrape code uses raw response metric strings instead of parsed ones, which isn't always correct - see #14712 ("Scrape cache should use labels hash after relabeling")
IMHO it would be great if the parser were given an io.Reader instead of a byte slice, so that there's no need to fully read the response before starting to parse metrics; then there would be no need to use sync.Pool and, hopefully, memory pressure from scrapes would go down.
But I do understand that the current implementation avoids parsing most metrics on every scrape if they're already in the scrape cache (but that can cause issues like #14712). Maybe a cache in the parser itself could mitigate that.
I gave up this approach because it turned out to be way more complicated than I thought. There's a lot of baggage due to the text formats and how scrape works now, so it just didn't fit our timeline to try and implement this.
This rewrite would be an XL size project, so might as well include io.Reader change as @prymitive suggests.
Anyway, since I'm unlikely to have time for this, I'm closing the PR, but anyone should feel free to take it on. See PR #14978, which was merged instead, and its many follow-ups.
Alternate solution used in the end: #14978
Related to #13529, to simplify the implementation there.
The current exposition parser design is returning individual lines from the text formats and emulates this behavior over Protobuf. This PR rewrites the interface to return metrics, that is individual series with all information about the series in one interface: all samples of a histogram/summary, type, unit, help, exemplars, created timestamp. This should make implementing NHCB simpler.
The code will be fairly similar to the text-to-DTO parser code in prometheus/common; however, it should also support OpenMetrics and be more efficient.
To be tested with #15083 .