Logs performance benchmarks by pohly · Pull Request #115358 · kubernetes/kubernetes

pohly · 2023-01-27T10:37:58Z

What type of PR is this?

/kind cleanup

What this PR does / why we need it:

Improves usability and realism of the performance tests for logging.

Does this PR introduce a user-facing change?

NONE

/assign @serathius

pohly · 2023-01-27T10:45:16Z

@serathius wrote in #115277 (comment):

Overall, the benchmark changes make sense to me; however, I'm not sure why we need the symlink magic. Could you explain more about why you added it?

With the directories set up like this, running benchmarks against some log files from the CI needs only a single command as setup (the gsutil mentioned in the updated README). I could keep the symlinks out of the commit, but creating them or moving files is complicated enough that I would then have to add a script that users need to invoke instead. I don't think that this will be simpler.

Without any symlinks, the same file would have to be replicated in different places (once for each verbosity level).

pohly · 2023-01-27T10:46:51Z

Hmm, I had forgotten about the additional mangling of the files. Perhaps a script is better after all.

serathius · 2023-01-27T14:45:02Z

Why ln -s and not mv? What's the benefit of using symlinks here?

The same file gets referenced in two different directories and then serves as input for different test cases (default log level and v 3). I could copy it, but that seems wasteful.

@serathius: I've added some comments to the script about this and also completed the support for running the script multiple times (would have failed when symlinks already exist).

pohly · 2023-01-28T16:45:47Z

/retest

serathius · 2023-01-30T10:58:04Z

Not super convinced on necessity of symlinks in repo. Would love some second opinion. cc @dims

dims · 2023-01-30T11:48:24Z

looks like this is a developer oriented script right? let's leave it simple (and have them duplicated)

pohly · 2023-01-30T11:53:45Z

Not super convinced on necessity of symlinks in repo.

I already removed them?

logicalhan · 2023-02-09T17:40:28Z

/triage accepted

pohly · 2023-02-10T14:21:41Z

@dims, @serathius: what exactly do you want me to change?

pohly · 2023-03-07T14:11:13Z

Not super convinced on necessity of symlinks in repo.

I replaced them with plain copies.

There was a merge conflict because v1.Container was modified by some different PR in the meantime. To avoid having to update the load_test.go each time that happens, I added one commit where I picked some different struct to test with.

@serathius: please take another look.

serathius · 2023-03-07T14:19:25Z

Is this still needed?

No, simplified.

serathius · 2023-03-07T14:21:47Z

Is this still needed as we removed symlinks?

No, removed.

serathius · 2023-03-07T14:22:00Z

Is this still needed?

Hmm, now I regret that we are not symlinking anymore. Each individual copy of a file gets printed again, which can lead to substantial redundant output.

Let me replace with hashing the file content.

Done. While at it, I switched from "path" to "path/filepath" because it works better on Windows.
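The idea of identifying files by a hash of their content, so that plain copies are only reported once, can be sketched as follows. This is a minimal illustration and not the code from this PR: `contentSet`, `uniqueByContent`, and the file names in `main` are hypothetical, and SHA-256 is an assumed choice of hash.

```go
package main

import (
	"crypto/sha256"
	"fmt"
	"io/fs"
	"os"
	"path/filepath"
)

// contentSet remembers which file contents were already seen.
// Hypothetical helper type, not taken from the PR.
type contentSet map[[sha256.Size]byte]struct{}

// add reports whether data has not been seen before and records it.
func (s contentSet) add(data []byte) bool {
	sum := sha256.Sum256(data)
	if _, ok := s[sum]; ok {
		return false
	}
	s[sum] = struct{}{}
	return true
}

// uniqueByContent walks root and returns one path per distinct file content.
// It uses path/filepath so that OS-specific separators are handled.
func uniqueByContent(root string) ([]string, error) {
	seen := contentSet{}
	var unique []string
	err := filepath.WalkDir(root, func(path string, d fs.DirEntry, err error) error {
		if err != nil {
			return err
		}
		if d.IsDir() {
			return nil
		}
		data, err := os.ReadFile(path)
		if err != nil {
			return err
		}
		if seen.add(data) {
			unique = append(unique, path)
		}
		return nil
	})
	return unique, err
}

func main() {
	// Build a throwaway directory with two identical copies and one other file.
	dir, err := os.MkdirTemp("", "logdata")
	if err != nil {
		panic(err)
	}
	defer os.RemoveAll(dir)
	files := map[string]string{
		"default.log": "same content\n",
		"v3.log":      "same content\n",
		"other.log":   "different content\n",
	}
	for name, content := range files {
		if err := os.WriteFile(filepath.Join(dir, name), []byte(content), 0o644); err != nil {
			panic(err)
		}
	}
	unique, err := uniqueByContent(dir)
	if err != nil {
		panic(err)
	}
	fmt.Printf("%d files, %d with distinct content\n", len(files), len(unique))
}
```

Hashing costs one full read per file, which seems acceptable for a developer-oriented benchmark setup where the alternative is printing the same statistics several times.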

serathius · 2023-03-07T14:29:13Z

One problem I expect with this script is that the log format changes and we forget how the original logs looked. Without knowing the original log lines, we will lose the understanding of what the sed script did.

Can you provide an example of what you expect this sed to do? An example of a line before, and an example of the line after. A note in the commit message is ok, but git history might get obfuscated.

Added as comments.

serathius · 2023-03-07T14:29:19Z

When trying again with recent log files from the CI job, it was found that some JSON messages get split across multiple lines, both in container logs and in the systemd journal:

    2022-12-21T07:09:47.914739996Z stderr F {"ts":1671606587914.691,"caller":"rest/request.go:1169","msg":"Response ...
    2022-12-21T07:09:47.914984628Z stderr F 70 72 6f 78 79 10 01 1a 13 53 ... \".|\n","v":8}

Note the different time stamp on the second line. That first line is long (17384 bytes). This seems to happen because the data must pass through a stream-oriented pipe and thus may get split up by the Linux kernel.

The implication is that lines must get merged whenever the JSON decoder encounters an incomplete line. The benchmark loader now supports that. To simplify this, stripping the non-JSON line prefixes must be done before using a log as test data.

The updated README explains how to do that when downloading a CI job result. The amount of manual work gets reduced by committing symlinks under data to the expected location under ci-kubernetes-kind-e2e-json-logging and ignoring them when the data is not there.

Support for symlinks gets removed and path/filepath is used instead of path because it has better Windows support.
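Merging lines until the JSON decoder accepts them could look roughly like the sketch below. It is not the loader from this PR: `mergeSplitJSON` is a hypothetical name, the input is assumed to have its non-JSON prefixes stripped already, and the fragments are assumed to join without any separator.

```go
package main

import (
	"bufio"
	"encoding/json"
	"fmt"
	"strings"
)

// mergeSplitJSON reads prefix-free log lines and returns complete JSON
// messages. A line that is not valid JSON on its own is kept and the
// following lines are appended until the result parses.
// Hypothetical helper, not the code from the PR.
func mergeSplitJSON(input string) ([]string, error) {
	var messages []string
	var pending strings.Builder

	scanner := bufio.NewScanner(strings.NewReader(input))
	// Log lines can be long, so allow tokens of up to 1 MiB.
	scanner.Buffer(make([]byte, 0, 64*1024), 1024*1024)

	for scanner.Scan() {
		pending.WriteString(scanner.Text())
		if json.Valid([]byte(pending.String())) {
			messages = append(messages, pending.String())
			pending.Reset()
		}
	}
	if err := scanner.Err(); err != nil {
		return nil, err
	}
	if pending.Len() > 0 {
		return messages, fmt.Errorf("incomplete JSON at end of input (%d bytes)", pending.Len())
	}
	return messages, nil
}

func main() {
	// The second message is split across two lines, as it can happen
	// when the data passes through a pipe.
	input := "{\"msg\":\"short\",\"v\":0}\n" +
		"{\"msg\":\"Response body: 70 72 6f\n" +
		" 78 79 10\",\"v\":8}\n"

	messages, err := mergeSplitJSON(input)
	if err != nil {
		panic(err)
	}
	for i, msg := range messages {
		fmt.Printf("message %d: %s\n", i, msg)
	}
}
```

Re-validating the accumulated buffer after each line is quadratic in the worst case. That seems tolerable when splits are rare, but a streaming decoder may be preferable for large inputs.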

The same effect can be achieved with `-bench=BenchmarkEncoding/none`.

For long strings the output of assert.Contains is not very readable.

The benchmarks and unit tests were written so that they used custom APIs for each log format. This made them less realistic because there were subtle differences between the benchmark and a real Kubernetes component. Now all logging configuration is done with the official k8s.io/component-base/logs/api/v1. To make the different test cases more comparable, "messages/s" is now reported instead of the generic "ns/op".
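Reporting a custom "messages/s" metric is possible with `testing.B.ReportMetric`. The following is a minimal sketch of that technique, not the benchmark from this PR: the messages, the `benchmarkMessages` function, and the use of `fmt.Fprintf` as a stand-in for a real logger are all assumptions made for illustration.

```go
package main

import (
	"fmt"
	"io"
	"testing"
	"time"
)

// benchmarkMessages writes a fixed set of messages b.N times and reports
// the throughput as "messages/s".
// The messages and the output format are placeholders, not real log data.
func benchmarkMessages(b *testing.B) {
	messages := []string{"pod started", "pod stopped", "volume mounted"}

	b.ResetTimer()
	start := time.Now()
	for i := 0; i < b.N; i++ {
		for _, msg := range messages {
			fmt.Fprintf(io.Discard, "msg=%q\n", msg)
		}
	}
	duration := time.Since(start)

	total := float64(b.N * len(messages))
	b.ReportMetric(total/duration.Seconds(), "messages/s")
}

// measureMessagesPerSecond runs the benchmark without "go test" and
// returns the custom metric from the result.
func measureMessagesPerSecond() float64 {
	result := testing.Benchmark(benchmarkMessages)
	return result.Extra["messages/s"]
}

func main() {
	fmt.Printf("%.0f messages/s\n", measureMessagesPerSecond())
}
```

A throughput metric stays comparable when test cases contain different numbers of messages per iteration, which "ns/op" does not.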

v1.Container is still changing a lot, which caused the test to fail each time a new field was added. To test loading, it is better to use something that is unlikely to change. The runtimev1.VersionResponse gets logged by kubelet and seems to be stable.

serathius · 2023-03-07T15:12:45Z

/lgtm
/approve

k8s-ci-robot · 2023-03-07T15:12:51Z

LGTM label has been added.

Git tree hash: b6baedab2557474afb9edeb68dc5cdc47d73e7c9

k8s-ci-robot · 2023-03-07T15:13:09Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: pohly, serathius

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~test/integration/logs/OWNERS~~ [pohly,serathius]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

pohly · 2023-03-07T16:36:42Z

/retest

k8s-ci-robot assigned serathius Jan 27, 2023

k8s-ci-robot requested review from coffeepac and logicalhan January 27, 2023 10:39

k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Jan 27, 2023

pohly mentioned this pull request Jan 27, 2023

klog update #115277

Merged

pohly force-pushed the logs-performance-benchmarks branch from a2caa47 to 037aba8 Compare January 27, 2023 14:38

k8s-ci-robot added size/XL Denotes a PR that changes 500-999 lines, ignoring generated files. and removed size/L Denotes a PR that changes 100-499 lines, ignoring generated files. labels Jan 27, 2023

serathius reviewed Jan 27, 2023

pohly force-pushed the logs-performance-benchmarks branch from 037aba8 to 6d69ee9 Compare January 28, 2023 13:05

k8s-ci-robot added the triage/accepted Indicates an issue or PR is ready to be actively worked on. label Feb 9, 2023

k8s-ci-robot removed the needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. label Feb 9, 2023

k8s-ci-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Mar 1, 2023

pohly force-pushed the logs-performance-benchmarks branch from 6d69ee9 to 1ac27c6 Compare March 7, 2023 14:09

k8s-ci-robot removed the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Mar 7, 2023

serathius reviewed Mar 7, 2023

pohly added 5 commits March 7, 2023 16:03

test/integration/logs: remove useless stats case

a862a26

The same effect can be achieved with `-bench=BenchmarkEncoding/none`.

test/integration/logs: replace assert.Contains

10c15d7

For long strings the output of assert.Contains is not very readable.

pohly force-pushed the logs-performance-benchmarks branch from 1ac27c6 to 5ee679b Compare March 7, 2023 15:06

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Mar 7, 2023

k8s-ci-robot merged commit 7ec3c27 into kubernetes:master Mar 7, 2023

k8s-ci-robot added this to the v1.27 milestone Mar 7, 2023
