Skip to content

qa/cephfs: add test_session_evict_non_blocklisted test case#57458

Merged
vshankar merged 2 commits intoceph:mainfrom
lxbsz:wip-session-evict
Aug 7, 2024
Merged

qa/cephfs: add test_session_evict_non_blocklisted test case#57458
vshankar merged 2 commits intoceph:mainfrom
lxbsz:wip-session-evict

Conversation

@lxbsz
Copy link
Member

@lxbsz lxbsz commented May 14, 2024

When evicting the clients or sessions during the
mds_session_blocklist_on_evict option is disabled the clients should reconnect to MDS successfully later after new IOs being sent.

URL: https://tracker.ceph.com/issues/65647

Contribution Guidelines

  • To sign and title your commits, please refer to Submitting Patches to Ceph.

  • If you are submitting a fix for a stable branch (e.g. "quincy"), please refer to Submitting Patches to Ceph - Backports for the proper workflow.

  • When filling out the below checklist, you may click boxes directly in the GitHub web UI. When entering or editing the entire PR message in the GitHub web UI editor, you may also select a checklist item by adding an x between the brackets: [x]. Spaces and capitalization matter when checking off items this way.

Checklist

  • Tracker (select at least one)
    • References tracker ticket
    • Very recent bug; references commit where it was introduced
    • New feature (ticket optional)
    • Doc update (no ticket needed)
    • Code cleanup (no ticket needed)
  • Component impact
    • Affects Dashboard, opened tracker ticket
    • Affects Orchestrator, opened tracker ticket
    • No impact that needs to be tracked
  • Documentation (select at least one)
    • Updates relevant documentation
    • No doc update is appropriate
  • Tests (select at least one)
Show available Jenkins commands
  • jenkins retest this please
  • jenkins test classic perf
  • jenkins test crimson perf
  • jenkins test signed
  • jenkins test make check
  • jenkins test make check arm64
  • jenkins test submodules
  • jenkins test dashboard
  • jenkins test dashboard cephadm
  • jenkins test api
  • jenkins test docs
  • jenkins render docs
  • jenkins test ceph-volume all
  • jenkins test ceph-volume tox
  • jenkins test windows
  • jenkins test rook e2e

@lxbsz lxbsz requested a review from a team May 14, 2024 04:58
@github-actions github-actions bot added cephfs Ceph File System tests labels May 14, 2024
@lxbsz lxbsz force-pushed the wip-session-evict branch from db0a004 to 81f6ff6 Compare May 14, 2024 04:59
@lxbsz
Copy link
Member Author

lxbsz commented May 15, 2024

The qa run

@lxbsz lxbsz force-pushed the wip-session-evict branch 5 times, most recently from baac7f4 to fb3885f Compare May 15, 2024 07:44
@lxbsz
Copy link
Member Author

lxbsz commented May 15, 2024

The new qa runs, which are all successful.

self.fs.set_max_mds(2)
status = self.fs.wait_for_daemons()

self.fs.set_ceph_conf('client', 'client reconnect stale', True)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

mds_defer_session_stale defaults to true, i hope this doesn't hamper the test case

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No, the default should be false:

 88 - name: client_reconnect_stale                                                                                                                                                                                                                            
 89   type: bool
 90   level: advanced
 91   desc: reconnect when the session becomes stale
 92   default: false
 93   services:
 94   - mds_client

Isn't it ?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

you referred to client_reconnect_stale, i was pointing to mds_defer_session_stale, both are a bit different

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Okay, my mistake. No I won't.

Check that mds evicts without blocklisting client
"""

self.fs.set_ceph_conf('mds', 'mds session blocklist on evict', False)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If you use self.config_set(...), the MDS restart done below isn't required.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah, this will be better. I will switch to it.


mount_a_client_id = self.mount_a.get_global_id()
self.fs.mds_asok(['session', 'evict', "%s" % mount_a_client_id],
mds_id=self.fs.get_rank(rank=0, status=status)['name'])
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why fetch mds_id for rank0 again?

Copy link
Member Author

@lxbsz lxbsz Jun 7, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@vshankar Do you mean reusing the mds_id from previous one ?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

yes

lxbsz added 2 commits June 7, 2024 10:03
When evicting the clients or sessions during the
mds_session_blocklist_on_evict option is disabled the clients should
reconnect to MDS successfully later after new IOs being sent.

URL: https://tracker.ceph.com/issues/65647
Signed-off-by: Xiubo Li <xiubli@redhat.com>
@lxbsz lxbsz force-pushed the wip-session-evict branch from f6dd69f to d2645fd Compare June 7, 2024 02:20
@vshankar
Copy link
Contributor

@lxbsz Note that this changes needs to be tested with the respective kernel driver changes. I'll be doing that.

@lxbsz
Copy link
Member Author

lxbsz commented Jun 17, 2024

@lxbsz Note that this changes needs to be tested with the respective kernel driver changes. I'll be doing that.

Get it. thanks!

@vshankar
Copy link
Contributor

jenkins test api

@vshankar
Copy link
Contributor

@lxbsz dropping this from the test branch since the kclient fix isn't ready yet.

@vshankar
Copy link
Contributor

Please tag me when its ready.

@lxbsz
Copy link
Member Author

lxbsz commented Jul 29, 2024

@lxbsz dropping this from the test branch since the kclient fix isn't ready yet.

@vshankar This is just a test case to help reproduce the issue, not related to the kclient fix. We can just go on to test and merge this one.

@vshankar
Copy link
Contributor

@lxbsz dropping this from the test branch since the kclient fix isn't ready yet.

@vshankar This is just a test case to help reproduce the issue, not related to the kclient fix. We can just go on to test and merge this one.

@lxbsz My understanding was this change tests a fix in the kclient and the kclient change isn't fully ready yet. Is that not the case?

@lxbsz
Copy link
Member Author

lxbsz commented Jul 29, 2024

@lxbsz dropping this from the test branch since the kclient fix isn't ready yet.

@vshankar This is just a test case to help reproduce the issue, not related to the kclient fix. We can just go on to test and merge this one.

@lxbsz My understanding was this change tests a fix in the kclient and the kclient change isn't fully ready yet. Is that not the case?

No. This is just a test missed to test this issue.

@vshankar
Copy link
Contributor

This PR is under test in https://tracker.ceph.com/issues/67252.

vshankar added a commit to vshankar/ceph that referenced this pull request Jul 31, 2024
* refs/pull/57458/head:
	qa/cephfs: add test_session_evict_non_blocklisted test case
	mds: cleanup the debug logs to make it to be more readable
vshankar added a commit to vshankar/ceph that referenced this pull request Jul 31, 2024
* refs/pull/57458/head:
	qa/cephfs: add test_session_evict_non_blocklisted test case
	mds: cleanup the debug logs to make it to be more readable
Copy link
Contributor

@vshankar vshankar left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@vshankar vshankar merged commit 92ad0c8 into ceph:main Aug 7, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

cephfs Ceph File System tests

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants