mds: prevent scrubbing for standby-replay MDS#53301
Conversation
631bb8b to
c181dd8
Compare
c181dd8 to
8982199
Compare
|
@neesingh-rh please ping when this ready for review again. |
I have updated the PR, ready for re-review. |
* refs/pull/53301/head: mds: prevent scrub start for standby-replay MDS Reviewed-by: Venky Shankar <vshankar@redhat.com>
vshankar
left a comment
There was a problem hiding this comment.
@neesingh-rh LGTM -- please add a test.
Added the test. PTAL |
807985f to
5f504d4
Compare
| # start the scrub and verify | ||
| with self.assertRaises(CommandFailedError) as ce: | ||
| self.fs.run_scrub(["start", abs_test_path, "recursive"]) | ||
| self.assertEqual(ce.exception.exitstatus, errno.EINVAL) |
There was a problem hiding this comment.
| self.assertEqual(ce.exception.exitstatus, errno.EINVAL) | |
| self.assertEqual(ce.exception.exitstatus, errno.EINVAL) |
There was a problem hiding this comment.
We should be checking for the error status only after getting the command fail exit status. I guess it should be this way only. https://github.com/ceph/ceph/blob/quincy/qa/tasks/cephfs/test_scrub_checks.py#L354
There was a problem hiding this comment.
ce would be available inside with .., isn't it?
There was a problem hiding this comment.
with.. is an alternative for try-catch, how can the statement be executed after the exception is encountered at the run_scrub only. Pls correct me if you mean something else.
qa/tasks/cephfs/test_scrub_checks.py
Outdated
|
|
||
| # start the scrub and verify | ||
| with self.assertRaises(CommandFailedError) as ce: | ||
| self.fs.run_scrub(["start", abs_test_path, "recursive"]) |
There was a problem hiding this comment.
I'm puzzled. Is this test not racing with the MDS becoming active?
I also expected to see a scrub command directed at the standby-replay daemon.
There was a problem hiding this comment.
Spoke to @neesingh-rh - this needs fixing. Fetch the s-r mds daemons id and do a ceph tell mds.<> scrub start to the s-r daemon.
There was a problem hiding this comment.
Updated the PR with the proposed changes.
|
@neesingh-rh ping? |
|
This pull request has been automatically marked as stale because it has not had any activity for 60 days. It will be closed if no further activity occurs for another 30 days. |
|
jenkins test make check |
|
@neesingh-rh please relook the make check failure. |
|
@neesingh-rh I see you added a change to fix a failing dashboard test and then removed it?? |
Nope, I pushed a commit by mistake that wasn't part of this PR. I added it locally just for testing. |
|
vstart_runner results: |
|
jenkins test dashboard |
|
jenkins test docs |
Fixes: https://tracker.ceph.com/issues/62537 Signed-off-by: Neeraj Pratap Singh <neesingh@redhat.com>
|
jenkins test make check |
* refs/pull/53301/head: qa: adding test for preventing scrub when mds is inactive mds: prevent scrub start for standby-replay MDS Reviewed-by: Dhairya Parmar <dparmar@redhat.com> Reviewed-by: Venky Shankar <vshankar@redhat.com> Reviewed-by: Milind Changire <mchangir@redhat.com>
Fixes: https://tracker.ceph.com/issues/62537
Signed-off-by: Neeraj Pratap Singh neesingh@redhat.com
Contribution Guidelines
To sign and title your commits, please refer to Submitting Patches to Ceph.
If you are submitting a fix for a stable branch (e.g. "pacific"), please refer to Submitting Patches to Ceph - Backports for the proper workflow.
Checklist
Show available Jenkins commands
jenkins retest this pleasejenkins test classic perfjenkins test crimson perfjenkins test signedjenkins test make checkjenkins test make check arm64jenkins test submodulesjenkins test dashboardjenkins test dashboard cephadmjenkins test apijenkins test docsjenkins render docsjenkins test ceph-volume alljenkins test ceph-volume toxjenkins test windows