mds: don't add counters in warning for standby-replay MDS #56065
rishabh-d-dave merged 2 commits into ceph:main
Note to reviewers: the test is 90% ready. Leaving it here for now until we're sure that the modifications to src/ are 100% correct.
@vshankar Based on your reviews, I assume I am moving in the right direction, so I am proceeding to add the finishing touches to this PR.
@vshankar PTAL
Don't include inode and stray counters in the health warnings printed for standby-replay MDSs. These counters appear in the health warnings only because of replay and can confuse users, so leave them out.

Fixes: https://tracker.ceph.com/issues/63514
Signed-off-by: Rishabh Dave <ridave@redhat.com>
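The behaviour described in this commit message can be sketched in Python as follows. This is only an illustration: the actual change lives in the C++ code under src/, and the function name, parameter names, and message wording below are assumptions made for the example, not the real implementation.

```python
def cache_oversized_message(state, cache_size, cache_limit,
                            inodes_with_caps, num_strays):
    """Build an illustrative cache-size warning for one MDS.

    All names and the message text are hypothetical; they only mirror
    the idea in the commit message above.
    """
    msg = f"MDS cache is too large ({cache_size}/{cache_limit})"
    # A standby-replay MDS holds these counters only because it replays
    # the journal of the active MDS, so they are left out of its warning.
    if state != "up:standby-replay":
        msg += (f"; {inodes_with_caps} inodes in use by clients, "
                f"{num_strays} stray files")
    return msg


if __name__ == "__main__":
    print(cache_oversized_message("up:active", "2G", "1G", 10, 3))
    print(cache_oversized_message("up:standby-replay", "2G", "1G", 10, 3))
```

With this shape, the active MDS still reports its inode and stray counters, while the standby-replay MDS reports only the cache size against the limit.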
Unrelated error on
jenkins make check
jenkins test make check
make check passed - https://jenkins.ceph.com/job/ceph-pull-requests/133611/
Minor, non-functional change that'll make the traceback more readable.
- Add tests to verify that inode and stray counters are not replayed/included in the health warnings printed for the standby-replay MDS.
- Add the "MDS_CACHE_OVERSIZED" health warning to the ignorelist in failover.yaml.
- Add a helper method to qa.tasks.cephfs.filesystem.Filesystem to get the MDS name of the standby-replay MDS.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
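A helper like the one mentioned in this commit message could look roughly like the sketch below. It is not the method added to qa.tasks.cephfs.filesystem.Filesystem: the function name and the list-of-dicts input (with "name" and "state" keys) are assumptions chosen to keep the example self-contained.

```python
def get_standby_replay_name(mds_infos):
    """Return the name of the first standby-replay MDS, or None.

    `mds_infos` is a hypothetical list of dicts, one per MDS daemon,
    each carrying at least a "name" and a "state" key.
    """
    for info in mds_infos:
        if info.get("state") == "up:standby-replay":
            return info["name"]
    return None


if __name__ == "__main__":
    sample = [
        {"name": "a", "state": "up:active"},
        {"name": "b", "state": "up:standby-replay"},
    ]
    print(get_standby_replay_name(sample))
```

A test can then use the returned name to pick out the warning printed for that daemon and check that it carries no inode or stray counters.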
Added
I've run the tests several times, and the last few runs have passed - https://pulpito.ceph.com/rishabh-2024-04-24_07:28:07-fs:multifs-rishabh-mds-health-testing-default-smithi
This PR is ready for QA.
Could you please explain a bit more about this? Why was the traceback less readable before this update?
The command arguments were not on the same line as the function call, which made it impossible to tell which command failed by looking at the traceback. It looked like the following -
With that change it will look like the following -
It might look minor, but while debugging I don't need to remember the line number from the traceback and then check the QA code to find out which command failed. The failed command is apparent from the code snippet in the traceback itself, which makes debugging much easier and quicker, since a test method may have multiple calls to such methods.
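The effect described in this comment can be reproduced with a small, generic Python sketch. It is not the snippet from the PR: `run_cmd` is a hypothetical stand-in for a QA helper, and the command string is only an example. The sketch assumes Python 3.8 or newer, where the traceback for a multi-line call reports the line on which the call starts.

```python
import linecache
import traceback


def run_cmd(args):
    """Hypothetical stand-in for a QA helper whose command fails."""
    raise RuntimeError("command failed")


# Arguments on the line after the call: the call line has no command.
SPLIT_CALL = (
    "def test():\n"
    "    run_cmd(\n"
    "        'fs set cephfs allow_standby_replay true')\n"
)

# Arguments on the same line as the call.
SAME_LINE_CALL = (
    "def test():\n"
    "    run_cmd('fs set cephfs allow_standby_replay true')\n"
)


def traceback_line(source, filename):
    """Return the source line a traceback shows for the failing call."""
    # Register the snippet so the traceback module can look up its lines.
    linecache.cache[filename] = (
        len(source), None, source.splitlines(True), filename)
    namespace = {"run_cmd": run_cmd}
    exec(compile(source, filename, "exec"), namespace)
    try:
        namespace["test"]()
    except RuntimeError as exc:
        for frame in traceback.extract_tb(exc.__traceback__):
            if frame.filename == filename:
                return frame.line
    return None


if __name__ == "__main__":
    print(traceback_line(SPLIT_CALL, "<split-call-demo>"))
    print(traceback_line(SAME_LINE_CALL, "<same-line-demo>"))
```

In the split form the reported line is only the opening of the call, so the reader has to open the test file to see the command; in the same-line form the command is part of the reported line.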
As discussed, please share what the old and new backtraces look like. @rishabh-d-dave
@vshankar Snippet from Traceback 2 -
* refs/pull/56065/head:
	qa/cephfs: test that counters are not printed for SR MDS
	mds: add no counters in warning for standby-replay MDS

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Dhairya Parmar <dparmar@redhat.com>
QA run was successful - https://tracker.ceph.com/projects/cephfs/wiki/main#3-May-2024.
Testing took more time than expected because there were 25-30 new failures. Most of them were caused by a PR in the testing branch, and they were resolved by removing that PR.
Looks fine.
Linking to QA run summary once more - #56065 (review)
Don't print cache size warnings for a standby-replay MDS since it is
simply replaying. The repeated warnings mislead users into thinking that
the cache of 2 MDSs is crossing the limit.
Fixes: https://tracker.ceph.com/issues/63514