
Add parser for the unofficial prometheus dump format#8320

Closed
ArthurSens wants to merge 2 commits into prometheus:main from ArthurSens:promdump_textparser

Conversation

@ArthurSens
Member

As briefly discussed at the Prometheus office hours and at the last Storage Working Group meeting, it might be useful to have a way to backfill blocks from a file produced by promtool tsdb dump.
This is not intended as an efficient strategy for moving huge amounts of data between Prometheus instances, but as an easy way to "dump and undump" specific filtered samples. A more efficient dump-undump strategy for large amounts of data is still being discussed.

It is still necessary to add more logic to do the backfilling via promtool, but I believe it will be easier to review if split into two PRs.

This PR only adds a text parser for the dump format.


PS: I've never written a text parser before. I wrote this one by reading the OpenMetrics and Prometheus exposition format parsers, trying to understand them, and replicating the approach. I couldn't find any documentation or tutorials for golex, and the code generated by following these instructions doesn't even compile without some manual changes.

If anyone knows a good place where I could learn more about golex, that would be great 😛

Signed-off-by: arthursens <arthursens2005@gmail.com>
@roidelapluie
Member

I think the tsdb dump format is already covered; it is handled by the standard Prometheus text parser.

@roidelapluie
Member

It really looks like it is the prometheus exposition format.

@roidelapluie
Member

roidelapluie commented Dec 27, 2020

Maybe just the handling of __name__ differs

@ArthurSens
Member Author

ArthurSens commented Dec 27, 2020

Maybe just the handling of __name__ differs

Yeah, that's what I understood as well. I've just tried to parse {__name__="metric", foo="bar"} 10 1608996530647 with the standard prometheus text parser and it failed with:

"INVALID" is not a valid start token

The metric name handling is indeed different. Also, the dump format always includes a timestamp, which is optional in the standard format.
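For illustration, here is a minimal sketch (in Python, with invented helper names, not the code from this PR) of pulling apart one dump line of the shape discussed above: a label set that carries the metric name as `__name__`, then a value, then a mandatory millisecond timestamp. The regexes are deliberately naive and would not handle escaped quotes or a literal `}` inside label values.

```python
import re

# One line of `promtool tsdb dump` output, as discussed above, looks like:
#   {__name__="metric", foo="bar"} 10 1608996530647
# i.e. {labels} value timestamp_ms, with the timestamp always present.
DUMP_LINE = re.compile(r'^\{(?P<labels>[^}]*)\}\s+(?P<value>\S+)\s+(?P<ts>\d+)$')
LABEL = re.compile(r'(?P<name>[a-zA-Z_][a-zA-Z0-9_]*)="(?P<value>[^"]*)"')

def parse_dump_line(line):
    """Return (labels_dict, float_value, timestamp_ms) for one dump line."""
    m = DUMP_LINE.match(line.strip())
    if m is None:
        raise ValueError(f"not a dump line: {line!r}")
    labels = {lm.group("name"): lm.group("value")
              for lm in LABEL.finditer(m.group("labels"))}
    return labels, float(m.group("value")), int(m.group("ts"))
```

Parsing the sample line quoted above would yield `{"__name__": "metric", "foo": "bar"}`, `10.0`, `1608996530647`.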

@ArthurSens
Member Author

cc @beorn7, you might want to take a look at this one :)

Signed-off-by: arthursens <arthursens2005@gmail.com>
@brian-brazil
Contributor

Why is a whole new metrics format being used here? We shouldn't be proliferating additional formats.

The textparse directory is only for the two parsers used for scraping, and is tightly bound to the tsdb. No other parsers should go in here as that'll only cause confusion.

@ArthurSens
Member Author

While reading the codebase and figuring out how to backfill from the dump format, I saw that the current promtool tsdb create-blocks-from openmetrics command uses openmetricsparse.go from this same package.

That's why I thought that this directory should be the home for a new textparser used in a to-be-developed promtool tsdb create-blocks-from dump subcommand.

If this new command is going to be accepted and the textparse directory is indeed not the ideal place, where should this code live? 🤔

@brian-brazil
Contributor

My point is that there shouldn't be a new parser at all, as we shouldn't be using any metrics formats that aren't the Prometheus or OpenMetrics format.

@ArthurSens
Member Author

Hmm ok, got your point

Do you see this idea moving forward if the dump created a file in the Prometheus format instead of the current one?

@beorn7
Member

beorn7 commented Dec 28, 2020

To provide more context: The premise here is what @ArthurSens described: You can already run promtool tsdb dump, but the result so far is only used by humans to inspect, and probably many people hacked some ad-hoc tools to process it. It would be nice to have an "undump" feature so that you can dump a specific piece of the TSDB data of one Prometheus instance and backfill it into another Prometheus instance. An issue is that the current dump command dumps an ad-hoc format. We discussed various approaches here, including:

  1. Create a promtool tsdb create-blocks-from dump command that can read the dumps created by a promtool tsdb dump command from the same version of promtool. Pro: Minimal effort to add, gives us all the freedom to change the dump format so that it is easy and efficient to dump and to convert to blocks again. Con: Dumps are not necessarily portable between versions. Dumps don't use a standardized format.
  2. Modify promtool tsdb dump to (generally or optionally) dump OpenMetrics. Pro: Standardized and portable format. Con: OpenMetrics takes more effort to create and read back because its ordering requirements don't match how we retrieve and write TSDB data. It's generally not very efficient if you want to dump just a few time series with a lot of samples.
  3. Modify promtool tsdb dump to (generally or optionally) dump a less ad-hoc format, for example in the form of JSON, which is structurally aligned with TSDB so that it can be efficiently dumped and read back. Pro: Allows efficient dumping and backfilling. JSON is easy for arbitrary other tools to read in an ad-hoc fashion (and as such might be able to fulfill the dev-summit consensus: "For data science applications, we encourage the creation of tooling to export data from TSDB to a format that is readable by Python & Java.") Con: Yet another format. Most work to implement (need to specify an at least semi-stable format, need to write new dump code and new backfill code).
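Purely to illustrate what option (3) could look like (this schema is invented here for illustration; nothing like it is specified anywhere): one JSON object per series, carrying the label set once and the samples as [timestamp_ms, value] pairs, which matches the per-series layout the TSDB already returns.

```json
{"labels": {"__name__": "metric", "foo": "bar"},
 "samples": [[1608996530647, 10], [1608996545647, 12]]}
```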

@beorn7
Member

beorn7 commented Dec 28, 2020

Personally, I would just go for (1) for now as it is low effort, unlocks the feature quickly, and doesn't commit us to anything beyond what we already do and what's written on the tin (it's just undump of whatever is dumped). If we later went with (2) or (3), we would still fulfill the contract of (1) implicitly.

While (2) is certainly nice-to-have, I see it as the least necessary of all options. It's needlessly expensive if you just want to transfer some data between Prometheus instances and don't care about the format. And it doesn't unlock anything for non-Prometheus-related tools (which will be happy with JSON using some ad-hoc schema but those tools are unlikely to ever get native OpenMetrics import). It's only really useful once other tools exist that process files in OpenMetrics format.

(3) kind of gives you the best of both worlds. We can design the format to be efficient for what we want here (including the "few times series with many samples" use case), and we open up import of the data by arbitrary tools. But it needs more upfront work to get it going.

So I suggest implementing (1) with a migration towards (3) in mind.

An OpenMetrics dump can be implemented independent of this effort at any time.

@brian-brazil
Contributor

OpenMetrics takes more effort to create and read back because its ordering requirements don't match how we retrieve and write TSDB data.

That's a problem for reading when we're trying to produce 2h blocks. It's not a problem for writing out, as the tsdb interfaces are already as we want them. The same applies for the Prometheus text format (ignoring that multiple timestamps there is technically defined behaviour).

Allows efficient dumping and backfilling.

I'm not seeing how JSON would help or be efficient here compared to the existing metrics formats. We'd have all the exact same problems when it comes to reading it back in when trying to write out blocks efficiently. Plus we'd still be defining yet another new metrics format.

I think we should be looking to eliminate the extra metrics format that was inadvertently created, and do (2).

@brian-brazil
Contributor

Though thinking on it, OM format doesn't work as it doesn't support stale NaNs.

If the goal is to be able to dump and undump, I suspect that the only reasonable format is the tsdb block.

@beorn7
Member

beorn7 commented Dec 28, 2020

Yeah, just dumping the pieces of data we want as blocks to copy into another Prometheus instance would be an elegant and direct way to solve the "let me transfer this piece of data from Prometheus A to Prometheus B". Definitely another option.

WRT stale markers: That's a good point. I still think a text dump including the stale markers would be worthwhile. Originally, the dump command was introduced to let humans inspect what's actually in the TSDB. Stale markers are of potential interest in this use case, too. And they are interesting "for data science applications" etc., too.

The revelation about stale markers proves the viability of option (1): Such a feature can just be added without breaking any contract.

@beorn7
Member

beorn7 commented Dec 28, 2020

WRT efficiency: The strict ordering requirement of OpenMetrics could get in the way of efficient retrieval. Perhaps it won't, but we cannot know. OpenMetrics is not designed to be ordered in a way that fits the use case of dumping data from the Prometheus TSDB or reading it back. A format with more flexible ordering would be, well, more flexible. We would be free to pick the right trade-off in the ordering between dumping and un-dumping. Another aspect is that OpenMetrics requires us to repeat the whole metric identifier (name and labels) for each sample.

@brian-brazil
Contributor

The strict ordering requirement of OpenMetrics could get in the way of efficient retrieval. Perhaps it won't, but we cannot know.

It won't, the tsdb APIs always return samples for a given series in order. So dumping should be fine, reading it back is trickier due to what you likely want to then do with it.

I still think a text dump including the stale markers would be worthwhile.

Yeah, some way to view those that's not the remote read API would be nice. On the other hand, I've never had a need to view raw stale markers since implementing them so it would seem that that's not important for debugging in practice.

I note that the current code will dump them as NaN.

Another aspect is that OpenMetrics requires us to repeat the whole metric identifier (name and labels) for each sample.

Anything that's not effectively the raw tsdb block is going to be less efficient than a tsdb block.

without breaking any contract.

I don't believe we've any stability guarantees around promtool.

In relation to this issue, I think we should not do anything here right now. This PR is speculative in terms of use cases, and per the above it seems it is not the best way to go about that class of use case. I think the dump subcommand should be left to ad-hoc debugging/extraction, and we not try to tie it into use cases for which it is not suited merely because it seems like it might make sense at a surface level.

@beorn7
Member

beorn7 commented Dec 28, 2020

It won't, the tsdb APIs always return samples for a given series in order.

But the ordering requirements of OpenMetrics are not per (Prometheus) series. Series belonging to the same metric (_count, _sum, _created, …) have to be stitched together. See https://github.com/OpenObservability/OpenMetrics/blob/master/specification/OpenMetrics.md#metricpoint-1
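As a concrete (hypothetical) example of that stitching requirement: the TSDB returns request_duration_seconds_bucket, _count, _sum, and _created as independent series, but a valid OpenMetrics exposition has to regroup them under a single metric family block:

```
# TYPE request_duration_seconds histogram
request_duration_seconds_bucket{le="1.0"} 4
request_duration_seconds_bucket{le="+Inf"} 6
request_duration_seconds_count 6
request_duration_seconds_sum 8.3
request_duration_seconds_created 1608996000
# EOF
```

Reassembling that grouping from per-series TSDB reads is the extra work being discussed.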

I've never had a need to view raw stale markers since implementing them so it would seem that that's not important for debugging in practice.

I would have liked to.

I note that the current code will dump them as NaN.

That might even help in debugging cases, but it's also ambiguous and therefore dangerous.

Anything that's not effectively the raw tsdb block is going to be less efficient than a tsdb block.

That's true, but it also doesn't add anything of value to the discussion. The TSDB block only solves one use case.

@brian-brazil
Contributor

But the ordering requirements of OpenMetrics are not per (Prometheus) series.

We have none of that information here; everything would be an unknown type, as things stand anyway.

That might even help in debugging cases, but it's also ambiguous and therefore dangerous.

Indeed, that should be fixed.

@ArthurSens
Member Author

ArthurSens commented Dec 28, 2020

Oh, nice to see that the PR at least brought a good discussion around the idea :)

Thank you Beorn for also bringing up the other ideas that were already discussed. Yeah, the approach I was aiming for in this PR was number (1). Despite the complexity of building a text parser, it was fairly easy to understand promparse and make some minor changes to fit the current dump format. Adding a backfiller on top of that parser shouldn't be too complicated either, since I can just copy the OpenMetrics one and apply the necessary changes.

That being said, I agree that a newly formalized format, and even worse one with no stability contract between Prometheus versions, might lead the community to create tooling based on several different and incompatible formats. I think we need to choose an already formalized format, or, if a new format is going to be created, it should at least be formalized with a contract.

In relation to this issue, I think we should not do anything here right now. This PR is speculative in terms of use cases, and per the above it seems it is not the best way to go about that class of use case.

Yep, after the above discussion, I also agree that there is no need to review the parser for now.

There is no concrete use case yet, but this sounds like something that might be explored in the future, right? As I was advised before, I think it's time to write a design doc around the idea and all the possible approaches 😅. I wasn't totally confident about what other approaches existed, but this PR alone has enough reading material to start the design document. Thanks again to you both!

Base automatically changed from master to main February 23, 2021 19:36
@stale stale bot added the stale label Apr 24, 2021
@ArthurSens ArthurSens closed this Feb 1, 2022
@ArthurSens ArthurSens deleted the promdump_textparser branch February 1, 2022 17:43

4 participants