mgr/progress: Global Recovery Event in ceph -s#37327
Conversation
|
This is a new PR continuing from #29467 |
jdurgin
left a comment
There was a problem hiding this comment.
This looks good! I think we'll want to follow up with further PRs to improve the efficiency here. For large clusters, looking through all the pg states for this event (and others) by serializing to json with every update is pretty expensive.
The PGMap keeps track of num_pg_by_state internally, so if we expose python accessors to that we don't need to iterate over all pgs to get the number of active+clean pgs.
0e808da to
341ab1e
Compare
|
@kamoltat looks great! could you squash these changes into a single commit? |
bd36140 to
3e8b69d
Compare
Done! |
tchaikov
left a comment
There was a problem hiding this comment.
2020-10-15T09:06:17.364 INFO:tasks.cephfs_test_runner:test_osd_cannot_recover (tasks.mgr.test_progress.TestProgress) ... FAIL
2020-10-15T09:06:17.365 INFO:tasks.cephfs_test_runner:
2020-10-15T09:06:17.365 INFO:tasks.cephfs_test_runner:======================================================================
2020-10-15T09:06:17.366 INFO:tasks.cephfs_test_runner:FAIL: test_osd_cannot_recover (tasks.mgr.test_progress.TestProgress)
2020-10-15T09:06:17.366 INFO:tasks.cephfs_test_runner:----------------------------------------------------------------------
2020-10-15T09:06:17.366 INFO:tasks.cephfs_test_runner:Traceback (most recent call last):
2020-10-15T09:06:17.366 INFO:tasks.cephfs_test_runner: File "/home/teuthworker/src/git.ceph.com_ceph-c_wip-kefu-testing-2020-10-14-1541/qa/tasks/mgr/test_pro
gress.py", line 264, in test_osd_cannot_recover
2020-10-15T09:06:17.366 INFO:tasks.cephfs_test_runner: self.assertEqual(len(self._all_events()), osd_count - pool_size)
2020-10-15T09:06:17.367 INFO:tasks.cephfs_test_runner:AssertionError: 2 != 1
@tchaikov Thanks, just updated the test suite |
d8d079b to
f2d50cd
Compare
|
jenkins retest this please |
f2d50cd to
908fb3a
Compare
|
@kamoltat no need to run 400+ tests for exercising this change. would suffice. |
c5d51f3 to
1c61434
Compare
1c61434 to
7275611
Compare
|
https://pulpito.ceph.com/ksirivad-2020-10-22_10:35:40-rados:mgr-wip-mgr-progress-global-event-distro-basic-smithi/ |
7275611 to
0118ef5
Compare
|
jenkins retest this please |
0118ef5 to
7396ce2
Compare
Modified the progress module and BaseMgrModule to support Global Recovert Event. Adding more arguments to update_progress_event, ceph_update_progress_event. To only show global recovery event progress with `ceph -s`. All sub events have been move to `ceph progress` Signed-off-by: Kamoltat <ksirivad@redhat.com>
7396ce2 to
2af2afa
Compare
Only show Global Progress event in ceph -s
Right now if you want to see all the sub events
You have to do ceph progress.
Signed-off-by: Kamoltat (Junior) Sirivadhna ksirivad@redhat.com
Checklist
Show available Jenkins commands
jenkins retest this pleasejenkins test classic perfjenkins test crimson perfjenkins test signedjenkins test make checkjenkins test make check arm64jenkins test submodulesjenkins test dashboardjenkins test apijenkins test docsjenkins render docsjenkins test ceph-volume alljenkins test ceph-volume tox