qa/cephfs: ignore when specific OSD is reported down during upgrade#58486
qa/cephfs: ignore when specific OSD is reported down during upgrade#58486rishabh-d-dave merged 1 commit intoceph:mainfrom
Conversation
45ecf64 to
acdbe87
Compare
|
Same - ceph API failed due to error unrelated to this PR. https://jenkins.ceph.com/job/ceph-api/77616/ |
|
jenkins test api |
| ceph: | ||
| log-ignorelist: | ||
| - OSD_DOWN | ||
| - cluster *\[WRN\] *osd.*is down |
There was a problem hiding this comment.
oh wow - regex's are allowed. That's because these strings are used in grep, yes?
There was a problem hiding this comment.
Yes, grep allows regex so it should be fine. There's on other way to catch that error message otherwise.
There was a problem hiding this comment.
what's wrong with osd.*is down?
There was a problem hiding this comment.
Current pattern ensures that regex matches no messages other than cluster warning message.
There was a problem hiding this comment.
But why do we need to be that specific? We've often had to put the same type of message in the ignorelist twice because it appears as a health warning and a cluster message.
There was a problem hiding this comment.
Okay, I didn't want to ignore unnecessary message. Will change this.
There was a problem hiding this comment.
|
jenkins test api |
1 similar comment
|
jenkins test api |
We already ignore health warning regarding OSD being down during upgrade but health warning regarding specific OSD being down is not added to the ignorelist which causes upgrade jobs to be marked as failed even though they were successful. Fixes: https://tracker.ceph.com/issues/66877 Signed-off-by: Rishabh Dave <ridave@redhat.com>
acdbe87 to
9ae2c89
Compare
|
This pull request has been automatically marked as stale because it has not had any activity for 60 days. It will be closed if no further activity occurs for another 30 days. |
|
jenkins test make check arm64 |
|
This PR is under test in https://tracker.ceph.com/issues/68354. |
|
jenkins test make check arm64 |
rishabh-d-dave
left a comment
There was a problem hiding this comment.
QA run was successful - https://tracker.ceph.com/projects/cephfs/wiki/Main#2024-Oct-18
We already ignore health warning regarding OSD being down during upgrade
but health warning regarding specific OSD being down is not added to the
ignorelist which causes upgrade jobs to be marked as failed even though
they were successful.
Fixes: https://tracker.ceph.com/issues/66877
Contribution Guidelines
To sign and title your commits, please refer to Submitting Patches to Ceph.
If you are submitting a fix for a stable branch (e.g. "quincy"), please refer to Submitting Patches to Ceph - Backports for the proper workflow.
When filling out the below checklist, you may click boxes directly in the GitHub web UI. When entering or editing the entire PR message in the GitHub web UI editor, you may also select a checklist item by adding an
xbetween the brackets:[x]. Spaces and capitalization matter when checking off items this way.Checklist
Show available Jenkins commands
jenkins retest this pleasejenkins test classic perfjenkins test crimson perfjenkins test signedjenkins test make checkjenkins test make check arm64jenkins test submodulesjenkins test dashboardjenkins test dashboard cephadmjenkins test apijenkins test docsjenkins render docsjenkins test ceph-volume alljenkins test ceph-volume toxjenkins test windowsjenkins test rook e2e