mds: find a new head for the batch ops when the head is dead by lxbsz · Pull Request #56941 · ceph/ceph

lxbsz · 2024-04-17T08:00:15Z

If the batch head request is already dead and then we need to choose a new batch head anyways and release the reference for the current batch head request. Else it will be reported as slow request.

Fixes: https://tracker.ceph.com/issues/65536

Contribution Guidelines

To sign and title your commits, please refer to Submitting Patches to Ceph.
If you are submitting a fix for a stable branch (e.g. "quincy"), please refer to Submitting Patches to Ceph - Backports for the proper workflow.
When filling out the below checklist, you may click boxes directly in the GitHub web UI. When entering or editing the entire PR message in the GitHub web UI editor, you may also select a checklist item by adding an x between the brackets: [x]. Spaces and capitalization matter when checking off items this way.

Checklist

Tracker (select at least one)
- References tracker ticket
- Very recent bug; references commit where it was introduced
- New feature (ticket optional)
- Doc update (no ticket needed)
- Code cleanup (no ticket needed)
Component impact
- Affects Dashboard, opened tracker ticket
- Affects Orchestrator, opened tracker ticket
- No impact that needs to be tracked
Documentation (select at least one)
- Updates relevant documentation
- No doc update is appropriate
Tests (select at least one)
- Includes unit test(s)
- Includes integration test(s)
- Includes bug reproducer
- No tests

Show available Jenkins commands

jenkins retest this please
jenkins test classic perf
jenkins test crimson perf
jenkins test signed
jenkins test make check
jenkins test make check arm64
jenkins test submodules
jenkins test dashboard
jenkins test dashboard cephadm
jenkins test api
jenkins test docs
jenkins render docs
jenkins test ceph-volume all
jenkins test ceph-volume tox
jenkins test windows
jenkins test rook e2e

vshankar

Otherwise LGTM.

src/mds/MDCache.cc

src/mds/Server.cc

src/mds/MDCache.cc

src/mds/Server.cc

If the batch head request is already dead and then we need to choose a new batch head anyways and release the reference for the current batch head request. Else it will be reported as slow request. Fixes: https://tracker.ceph.com/issues/65536 Signed-off-by: Xiubo Li <xiubli@redhat.com>

dparmar18

LGTM

vshankar · 2024-04-30T09:31:10Z

I'll run this through tests this week (was on PTO last week)

lxbsz · 2024-04-30T09:33:09Z

I'll run this through tests this week (was on PTO last week)

Sure, thanks @vshankar

* refs/pull/56941/head: mds: find a new head for the batch ops when the head is dead Reviewed-by: Venky Shankar <vshankar@redhat.com> Reviewed-by: Dhairya Parmar <dparmar@redhat.com> Reviewed-by: Kotresh Hiremath Ravishankar <khiremat@redhat.com>

vshankar · 2024-05-09T11:46:45Z

https://pulpito.ceph.com/vshankar-2024-05-07_03:44:24-fs-wip-vshankar-testing-20240506.153513-testing-default-smithi/

(unfortunately, failed are infra issues related to cephadm - would need a rebuild)

vshankar · 2024-05-09T11:49:31Z

This PR is under test in https://tracker.ceph.com/issues/65882.

batrick · 2024-05-16T01:09:23Z

jenkins test make check

batrick · 2024-05-16T01:09:30Z

jenkins test make check arm64

batrick · 2024-05-16T01:14:39Z

src/mds/MDCache.cc

+    dout(10) << __func__ << ": dead " << *mdr << dendl;
+    //if the mdr is a "batch_op" and it has followers, pick a follower as
+    //the new "head of the batch ops" and go on processing the new one.
+    if (mdr->client_request && mdr->is_batch_head()) {


I understand why this works but it is weird not to select a new batch head in ::request_cleanup. I would have preferred that but I think this PR needs to be merged urgently. Maybe in a follow-up cleanup?

Does it really need ?

2079 void Server::respond_to_request(const MDRequestRef& mdr, int r) 2080 { 2081 mdr->result = r; 2082 if (mdr->client_request) { 2083 if (mdr->is_batch_head()) { 2084 dout(20) << __func__ << ": batch head " << *mdr << dendl; 2085 mdr->release_batch_op()->respond(r); 2086 } else { 2087 reply_client_request(mdr, make_message<MClientReply>(*mdr->client_request, r)); 2088 } 2089 } else if (mdr->internal_op > -1) { 2090 dout(10) << __func__ << ": completing with result " << cpp_strerror(r) << " on internal " << *mdr << dendl; 2091 auto c = mdr->internal_op_finish; 2092 if (!c) 2093 ceph_abort_msg("trying to respond to internal op without finisher"); 2094 mdcache->request_finish(mdr); 2095 c->complete(r); 2096 } 2097 }

We can see in Line#2085 it will cleanup all the batch requests when the batch head finishes.

Let me have a check carefully and try to improve the code.

@batrick please confirm that the if (dead) check only applies to internal requests, and can be moved after the client and peer conditional dispatches

We can see in Line#2085 it will cleanup all the batch requests when the batch head finishes.

In this case, Server::respond_to_request is not called because the request was killed (that's why dead == true). This logic for batch_head handling in ::respond_to_request and now MDCache::dispatch_request should be uniformly handled in a single location: MDCache::request_cleanup.

right, this is what @lxbsz had also figured out. So my question then is, shouldn't we just move that new if(dead) check into the else?

No, I think we need to make a better effort unify handling of requests in the MDS. A dead request should not be dispatched no matter its type.

@batrick But in the case when the sessions are killed and then all the corresponding requests will be marked killed too. Is that fine if we just remove the queued requests from the finisher queue directly ? If not then we cannot avoid this IMO, because the killed requests maybe still in the finisher queue and not dispatched yet.

But we can just choose a batch head directly when the a batch head request is being killed instead of doing this when the request is being dispatched.

@batrick But in the case when the sessions are killed and then all the corresponding requests will be marked killed too. Is that fine if we just remove the queued requests from the finisher queue directly ? If not then we cannot avoid this IMO, because the killed requests maybe still in the finisher queue and not dispatched yet.

I understand. I am probably being confusing. It's okay that a dead request is re-dispatched through MDCache::dispatch_request but if it's marked dead, do nothing else and return;.

But we can just choose a batch head directly when the a batch head request is being killed instead of doing this when the request is being dispatched.

Yes!

Let me try to improve it. Thanks @batrick @leonid-s-usov

vshankar

https://pulpito.ceph.com/?branch=wip-vshankar-testing-20240509.114851-debug

batrick · 2024-05-16T13:08:00Z

https://pulpito.ceph.com/?branch=wip-vshankar-testing-20240509.114851-debug

https://tracker.ceph.com/issues/65882

(Note @vshankar I like to link to the qa run ticket as it's a SSOT.)

vshankar · 2024-05-17T05:04:13Z

https://pulpito.ceph.com/?branch=wip-vshankar-testing-20240509.114851-debug

https://tracker.ceph.com/issues/65882

(Note @vshankar I like to link to the qa run ticket as it's a SSOT.)

Isn't that already done by ptl-tool when including a PR in a branch? Why relink it again? (If there are more than one QA trackers due to rebuild, etc., the last QA tracker linked is the SSOT).

batrick · 2024-05-17T15:53:03Z

https://pulpito.ceph.com/?branch=wip-vshankar-testing-20240509.114851-debug

https://tracker.ceph.com/issues/65882
(Note @vshankar I like to link to the qa run ticket as it's a SSOT.)

Isn't that already done by ptl-tool when including a PR in a branch? Why relink it again? (If there are more than one QA trackers due to rebuild, etc., the last QA tracker linked is the SSOT).

Well, actually, I like to link to the comment in the ticket where I approved the run which usually includes a link to the wiki for the full breakdown of failures. Sometimes a PR may also not have a comment linking to the ticket because it was added to a run after the QA ticket was already created; --update-qa does not add comments to PRs (to avoid spam).

(Ultimately this is just a nit but I thought I'd share my thinking on it.)

lxbsz requested a review from a team April 17, 2024 08:00

github-actions bot added the cephfs Ceph File System label Apr 17, 2024

lxbsz force-pushed the wip-65536 branch 3 times, most recently from ca8437a to e4f14a2 Compare April 17, 2024 09:37

vshankar approved these changes Apr 17, 2024

View reviewed changes

src/mds/MDCache.cc Show resolved Hide resolved

dparmar18 reviewed Apr 17, 2024

View reviewed changes

src/mds/MDCache.cc Show resolved Hide resolved

lxbsz force-pushed the wip-65536 branch from e4f14a2 to 8879e8c Compare April 18, 2024 01:36

dparmar18 reviewed Apr 18, 2024

View reviewed changes

src/mds/Server.cc Show resolved Hide resolved

kotreshhr reviewed Apr 18, 2024

View reviewed changes

src/mds/MDCache.cc Show resolved Hide resolved

dparmar18 reviewed Apr 18, 2024

View reviewed changes

src/mds/Server.cc Show resolved Hide resolved

lxbsz force-pushed the wip-65536 branch from 8879e8c to 793ea12 Compare April 18, 2024 14:29

dparmar18 approved these changes Apr 18, 2024

View reviewed changes

kotreshhr approved these changes Apr 18, 2024

View reviewed changes

vshankar added the wip-vshankar-testing1 label Apr 30, 2024

batrick approved these changes May 16, 2024

View reviewed changes

vshankar approved these changes May 16, 2024

View reviewed changes

vshankar merged commit e0efed7 into ceph:main May 16, 2024

vshankar removed the wip-vshankar-testing1 label May 16, 2024

lxbsz mentioned this pull request May 16, 2024

squid: mds: find a new head for the batch ops when the head is dead #57494

Merged

Conversation

lxbsz commented Apr 17, 2024

Contribution Guidelines

Checklist

Uh oh!

vshankar left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

dparmar18 left a comment

Choose a reason for hiding this comment

Uh oh!

vshankar commented Apr 30, 2024

Uh oh!

lxbsz commented Apr 30, 2024

Uh oh!

vshankar commented May 9, 2024

Uh oh!

vshankar commented May 9, 2024

Uh oh!

batrick commented May 16, 2024

Uh oh!

batrick commented May 16, 2024

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

vshankar left a comment

Choose a reason for hiding this comment

Uh oh!

batrick commented May 16, 2024

Uh oh!

vshankar commented May 17, 2024

Uh oh!

batrick commented May 17, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants