Skip to content

upgrade/pacific-x/stress-split: Add prometheus metrics test#47024

Closed
pdvian wants to merge 1 commit intoceph:mainfrom
pdvian:wip-mgr-upgrade-test
Closed

upgrade/pacific-x/stress-split: Add prometheus metrics test#47024
pdvian wants to merge 1 commit intoceph:mainfrom
pdvian:wip-mgr-upgrade-test

Conversation

@pdvian
Copy link
Contributor

@pdvian pdvian commented Jul 8, 2022

On partial/full cluster upgrade, the prometheus metrics
was not reflecting the correct ceph version for the
upgraded MONs. This upgrade testcase will verify
that metrics reflecting correct ceph version for MONs.

Signed-off-by: Prashant D pdhange@redhat.com

Contribution Guidelines

Checklist

  • Tracker (select at least one)
    • References tracker ticket
    • Very recent bug; references commit where it was introduced
    • New feature (ticket optional)
    • Doc update (no ticket needed)
    • Code cleanup (no ticket needed)
  • Component impact
    • Affects Dashboard, opened tracker ticket
    • Affects Orchestrator, opened tracker ticket
    • No impact that needs to be tracked
  • Documentation (select at least one)
    • Updates relevant documentation
    • No doc update is appropriate
  • Tests (select at least one)
Show available Jenkins commands
  • jenkins retest this please
  • jenkins test classic perf
  • jenkins test crimson perf
  • jenkins test signed
  • jenkins test make check
  • jenkins test make check arm64
  • jenkins test submodules
  • jenkins test dashboard
  • jenkins test dashboard cephadm
  • jenkins test api
  • jenkins test docs
  • jenkins render docs
  • jenkins test ceph-volume all
  • jenkins test ceph-volume tox
  • jenkins test windows

@pdvian pdvian requested a review from a team as a code owner July 8, 2022 15:31
@pdvian
Copy link
Contributor Author

pdvian commented Jul 8, 2022

This upgrade testcase is intended for quincy release. Adding DNM till PR #45382 gets merged to main.

@pdvian pdvian added the DNM label Jul 8, 2022
@pdvian pdvian force-pushed the wip-mgr-upgrade-test branch 2 times, most recently from 70d28bc to ac5ec6e Compare July 12, 2022 01:14
@pdvian pdvian force-pushed the wip-mgr-upgrade-test branch from ac5ec6e to d551201 Compare July 14, 2022 03:11
@ljflores
Copy link
Member

jenkins test make check

Copy link
Member

@ljflores ljflores left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Did you mean upgrade/pacifix-x/stress-split in your PR title, or did you mean to push parallel tests?

On partial/full cluster upgrade, the prometheus metrics
was not reflecting the correct ceph version for the
upgraded MONs. This upgrade testcase will verify
that metrics reflecting correct ceph version for MONs.

Signed-off-by: Prashant D <pdhange@redhat.com>
@pdvian pdvian force-pushed the wip-mgr-upgrade-test branch from d551201 to 1912063 Compare July 26, 2022 06:12
@pdvian pdvian changed the title upgrade/pacific-x/parallel: Add prometheus metrics test upgrade/pacific-x/stress-split: Add prometheus metrics test Jul 26, 2022
@pdvian
Copy link
Contributor Author

pdvian commented Jul 26, 2022

Did you mean upgrade/pacifix-x/stress-split in your PR title, or did you mean to push parallel tests?

It should be upgrade/pacifix-x/stress-split. I have corrected it. Thanks!

@pdvian
Copy link
Contributor Author

pdvian commented Jul 26, 2022

jenkins test make check

@@ -0,0 +1,43 @@
#!/bin/sh -ex
Copy link
Contributor

@ktdreyer ktdreyer Apr 26, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There is a lot of piping in this script (to jq, grep, etc). Lots of things could fail in these pipes. Bash has set -o pipefail - do we have to use /bin/sh here or can this be a full bash script?

Copy link
Member

@epuertat epuertat left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If we want this test (or similar one) to stop regressions like the 17.2.6's, then we could also add these 2 gates:

  • An OpenMetrics format linter: to ensure that metrics, comments, and overall syntax meets the OpenMetrics format. This should run against both mgr/prometheus and ceph-exporter /metrics endpoint.
  • Metric names snapshot (similar to this): to detect regressions on specific metrics names, we should compare metrics names from mgr/prometheus and ceph-exporter /metrics endpoints with a saved snapshot. Since some metrics names could be missing depending on the type of daemons available in the testbed, we could try to ensure that at least the metrics used in Grafana PromQL queries are validated.

None of the above actually check the values of any metric, as for example this PR does.

What do you think @nizamial09 , @pereman2 , @avanthakkar ?

@pereman2
Copy link
Contributor

pereman2 commented Apr 26, 2023

@epuertat Yup, I think integration testing on the prometheus endpoints should be done one time and let the endpoint be configured as we want to test the exporter with the same testing infra.

About OpenMetrics, I think it is a good idea but let's hope we don't have lots of metrics not conforming to it :P.

Btw @pdvian thanks for the patch.

@github-actions
Copy link

This pull request has been automatically marked as stale because it has not had any activity for 60 days. It will be closed if no further activity occurs for another 30 days.
If you are a maintainer or core committer, please follow-up on this pull request to identify what steps should be taken by the author to move this proposed change forward.
If you are the author of this pull request, thank you for your proposed contribution. If you believe this change is still appropriate, please ensure that any feedback has been addressed and ask for a code review.

@github-actions github-actions bot added the stale label Jun 25, 2023
@github-actions
Copy link

This pull request has been automatically closed because there has been no activity for 90 days. Please feel free to reopen this pull request (or open a new one) if the proposed change is still appropriate. Thank you for your contribution!

@github-actions github-actions bot closed this Jul 25, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants