msg: Code cleanup and optimizations by MaxKellermann · Pull Request #60220 · ceph/ceph

MaxKellermann · 2024-10-09T13:23:28Z


batrick

The locking changes are the ones I'm least certain about. I'd request those go in a separate PR too since they will require more rigorous review and broad QA. The other changes are uncontroversial and could be merged quickly.

batrick · 2024-10-09T13:34:16Z

src/msg/async/AsyncConnection.cc

@@ -305,12 +305,12 @@ ssize_t AsyncConnection::read_bulk(char *buf, unsigned len)
 ssize_t AsyncConnection::write(ceph::buffer::list &bl,
                               std::function<void(ssize_t)> callback,


Suggested change

-                              std::function<void(ssize_t)> callback,
+                              std::function<void(ssize_t)>&& callback,

also appropriate?

Would it be better?
Without &&, the std::function is created on the caller's stack which gets passed to the method.
With the &&, the std::function is created on the caller's stack, and additionally, a pointer to this stack address gets passed to the method.
This is only useful if you already have a reference which you forward, so you don't allocate any stack space.

Just conventional, I suppose. I don't think it matters much.
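To make the trade-off in this thread concrete, here is a minimal sketch of the two signatures side by side. The `Connection` class, its method names, and the use of `long` in place of `ssize_t` are assumptions made for illustration; this is not the actual `AsyncConnection` code.

```cpp
#include <cassert>
#include <functional>
#include <utility>

// Hypothetical stand-in for a connection that stores a completion callback.
// `long` is used here in place of ssize_t to keep the sketch portable.
class Connection {
public:
  // By-value sink: the std::function object is constructed by the caller
  // and the callee moves it into the member.
  void write_by_value(std::function<void(long)> callback) {
    callback_ = std::move(callback);
  }

  // Rvalue-reference sink: binds only to rvalues; the callee still has to
  // std::move() it, because a named rvalue reference is an lvalue.
  void write_by_rvalue(std::function<void(long)>&& callback) {
    callback_ = std::move(callback);
  }

  // Invoke the stored callback once and leave the member empty.
  void complete(long result) {
    assert(callback_);            // a write must be pending
    auto cb = std::move(callback_);
    callback_ = nullptr;          // moved-from state is unspecified; reset it
    cb(result);
  }

  bool has_callback() const { return static_cast<bool>(callback_); }

private:
  std::function<void(long)> callback_;
};
```

Both variants end with a single move into the member. The difference is only in how the argument reaches the callee, which matches the conclusion above that the choice is mostly a matter of convention.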

batrick · 2024-10-09T13:50:28Z

src/msg/async/ProtocolV1.cc

    do {
      if (connection->is_queued()) {
-	if (r = connection->_try_send(); r!= 0) {
+	connection->write_lock.unlock();


`msg/async/ProtocolV[12]: unlock write_lock before calling _try_send()`: please expand the commit message to explain why this is appropriate/safe.

That's going to be difficult, because I have to explain the absence of something - try_send() does not access anything that needs write_lock protection. If somebody thinks otherwise, I'd like to hear about it.

But anyway, the code doesn't make much sense - it calls dispatch_event_external() which is a function designed to be called from outside the I/O thread, but by definition, we must be inside the I/O thread (or else we wouldn't be allowed to access outgoing_bl). This is confusing, but I guess I'm here to clean up the mess.

> That's going to be difficult, because I have to explain the absence of something - try_send() does not access anything that needs write_lock protection. If somebody thinks otherwise, I'd like to hear about it.

I think that's what makes this challenging to review too. We would want to be certain your supposition is correct.

You won't hear much argument from here that this code can be messy. I'm glad you're here!

The problem is that while locks are extremely important, documentation on what they protect is nowhere to be found. My theory is that write_lock really protects nothing at all. At least I couldn't figure out anything. So I'm trying to remove it, one by one, from calls that look obviously OK without that lock; most importantly from calls that do I/O, because you should never do I/O while holding locks.

My profiler says that the MDS spends 20% of the CPU time only in sendmsg() - and that's actual CPU time, without the off-cpu time that must be added on top. That means write_lock is kept locked at least 20% of the time, for no reason at all.

I forgot: AsyncConnection::write_lock does protect something - just not in class AsyncConnection. It protects ProtocolV[12]::out_queue (and others in these two classes). This mutex should probably be moved there, out of AsyncConnection, to avoid confusion.
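The principle argued here (the mutex guards the outgoing queue, not the I/O itself) can be sketched as follows. `Sender`, `enqueue()`, `flush()` and the `sink` callable are hypothetical names for illustration only; the sink stands in for a slow call such as `sendmsg()`. This is a sketch under those assumptions, not the code from the dropped patches.

```cpp
#include <cassert>
#include <cstddef>
#include <mutex>
#include <string>
#include <utility>
#include <vector>

// Hypothetical sender: `lock_` guards only `out_queue_`, never the I/O.
class Sender {
public:
  void enqueue(std::string msg) {
    assert(!msg.empty());  // empty messages are not queued in this sketch
    const std::lock_guard<std::mutex> guard(lock_);
    out_queue_.push_back(std::move(msg));
  }

  // Take all pending messages while holding the lock, then hand them to
  // `sink` after the lock has been released.
  // Returns the number of bytes passed to the sink.
  template <typename Sink>
  std::size_t flush(Sink&& sink) {
    std::vector<std::string> batch;
    {
      const std::lock_guard<std::mutex> guard(lock_);
      batch.swap(out_queue_);
    }  // lock released here, before any I/O happens

    std::size_t bytes = 0;
    for (const auto& msg : batch) {
      sink(msg);  // potentially slow; other threads can enqueue() meanwhile
      bytes += msg.size();
    }
    return bytes;
  }

  std::size_t pending() {
    const std::lock_guard<std::mutex> guard(lock_);
    return out_queue_.size();
  }

private:
  std::mutex lock_;
  std::vector<std::string> out_queue_;
};
```

With this shape, the time spent inside the sink no longer counts as time during which the lock is held, which is the contention concern raised by the profiler numbers above.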

@MaxKellermann your profile of sendmsg() is correct, at least from the profiling I've done. It's a major issue and one of the reasons I was looking at reducing the number of async messenger threads talking to the kernel.

I have several patches which reduce the number of sendmsg() system calls, but I can submit them only after some of the other PRs are merged, because they build on top of them.
Anyway, the number of I/O threads isn't a performance problem; but a real performance problem is all the bouncing between workers, dispatcher, finisher, and submit thread. One single request engages all of them, and that causes a lot of inter-thread communication overhead which is very visible in my profiler.
Reducing lock contention is really important, and several of my PRs do that; but it would be much better to stay in one thread for handling a request.

MaxKellermann · 2024-10-09T14:24:16Z

> The locking changes are the ones I'm least certain about. I'd request those go in a separate PR too since they will require more rigorous review and broad QA.

Dropped for now. (These two patches are part of the patch set that makes the MDS faster. Reducing lock contention is one important piece.)

batrick · 2024-10-09T20:39:09Z

/home/jenkins-build/build/workspace/ceph-pull-requests/src/msg/async/ProtocolV1.cc:1257:58: error: no member named 'second' in 'ProtocolV1::out_q_entry_t'
      ldout(cct, 20) << __func__ << " discard " << entry.second << dendl;
                                                   ~~~~~ ^
/home/jenkins-build/build/workspace/ceph-pull-requests/src/msg/async/ProtocolV1.cc:1258:13: error: no member named 'second' in 'ProtocolV1::out_q_entry_t'
      entry.second->put();
      ~~~~~ ^
2 errors generated.


batrick · 2024-10-10T00:30:49Z

jenkins test make check

batrick · 2024-10-10T00:30:53Z

jenkins test make check arm64

batrick · 2024-10-19T00:57:25Z

This PR is under test in https://tracker.ceph.com/issues/68629.

* refs/pull/60220/head:
    msg/async/AsyncConnection: move the writeCallback instead of copying it
    msg/async/AsyncConnection: do not wrap writeCallback in `std::optional`
    msg/async/frames_v2: use zero-initialization instead of memset()
    msg/async/Event: use zero-initialization instead of memset()
    msg/Message: use zero-initialization instead of memset()
    msg/async/ProtocolV2: eliminate redundant std::map lookups
    msg/async/ProtocolV[12]: reverse the std::map sort order
    msg/async/ProtocolV[12]: use `auto`
    msg/async/ProtocolV[12]: use range-based `for`
    msg/async/ProtocolV1: use zero-initialization instead of memset()

batrick · 2024-10-22T00:50:53Z

https://tracker.ceph.com/issues/68629#note-1

MaxKellermann requested a review from a team as a code owner October 9, 2024 13:23

github-actions bot added the core label Oct 9, 2024

batrick requested changes Oct 9, 2024


MaxKellermann force-pushed the msg_optimizations branch from a2d32e2 to a1c70b8 Compare October 9, 2024 14:23

batrick approved these changes Oct 9, 2024


batrick added needs-qa wip-pdonnell-testing2 labels Oct 9, 2024

MaxKellermann added 10 commits October 9, 2024 23:13

msg/async/ProtocolV1: use zero-initialization instead of memset()

cae1af3

Signed-off-by: Max Kellermann <max.kellermann@ionos.com>
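For readers unfamiliar with the pattern named in the memset() commits, the following sketch contrasts the two styles. The `Header` struct and both function names are hypothetical; the structs touched by the actual commits are not shown on this page.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// Hypothetical plain-data header, standing in for the real structs.
struct Header {
  std::uint32_t type;
  std::uint32_t length;
  std::uint64_t seq;
};

// Old style: declare first, then clear the bytes by hand.
Header make_with_memset() {
  Header h;
  std::memset(&h, 0, sizeof(h));
  return h;
}

// New style: empty braces value-initialize the object, which zeroes
// every member of this aggregate without a separate call.
Header make_zero_initialized() {
  Header h{};
  assert(h.type == 0 && h.length == 0 && h.seq == 0);
  return h;
}
```

The brace form keeps declaration and initialization in one place and leaves no window in which the object is uninitialized.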

msg/async/ProtocolV[12]: use range-based for

a143844

Signed-off-by: Max Kellermann <max.kellermann@ionos.com>

msg/async/ProtocolV[12]: use auto

988705a

Signed-off-by: Max Kellermann <max.kellermann@ionos.com>

msg/async/ProtocolV[12]: reverse the std::map sort order

342a25b

This allows eliminating one lookup in `_get_next_outgoing()` because we can pass the iterator instead of the key to `erase()`.

Signed-off-by: Max Kellermann <max.kellermann@ionos.com>
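The idea in this commit message can be sketched as below, assuming a queue keyed by integer priority where the highest priority is served first. `OutQueue` and `pop_next()` are hypothetical names, and the bucket type is an assumption; the real `_get_next_outgoing()` is not reproduced here.

```cpp
#include <cassert>
#include <deque>
#include <functional>
#include <map>
#include <string>
#include <utility>

// Hypothetical outgoing queue: std::greater sorts the highest priority
// first, so begin() is the bucket to serve next.
using OutQueue = std::map<int, std::deque<std::string>, std::greater<int>>;

// Pop the next message from the highest-priority bucket.
// Invariant assumed by this sketch: buckets stored in the map are never empty.
std::string pop_next(OutQueue& q) {
  assert(!q.empty());
  const auto it = q.begin();  // forward iterator to the highest priority
  auto& bucket = it->second;
  std::string msg = std::move(bucket.front());
  bucket.pop_front();
  if (bucket.empty())
    q.erase(it);              // erase by iterator: no second lookup by key
  return msg;
}
```

With the default ascending order, the highest priority sits at `rbegin()`, and a reverse iterator cannot be passed to `erase()` directly, which is why erasing by key (a second lookup) is the tempting alternative there.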

msg/async/ProtocolV2: eliminate redundant std::map lookups

6597d77

Signed-off-by: Max Kellermann <max.kellermann@ionos.com>

msg/Message: use zero-initialization instead of memset()

62ebf16

Signed-off-by: Max Kellermann <max.kellermann@ionos.com>

msg/async/Event: use zero-initialization instead of memset()

7fcb8a8

Signed-off-by: Max Kellermann <max.kellermann@ionos.com>

msg/async/frames_v2: use zero-initialization instead of memset()

10a9914

Signed-off-by: Max Kellermann <max.kellermann@ionos.com>

msg/async/AsyncConnection: do not wrap writeCallback in std::optional

c72dae9

Since `std::function` is nullable and has an `operator bool()`, we can easily eliminate the `std::optional` overhead.

Signed-off-by: Max Kellermann <max.kellermann@ionos.com>
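A minimal sketch of the point made in this commit message: an empty `std::function` already encodes "no callback", so no `std::optional` wrapper is needed. The `Writer` class and its methods are hypothetical and only illustrate the idiom.

```cpp
#include <cassert>
#include <functional>
#include <utility>

// Hypothetical holder of a one-shot write callback.
class Writer {
public:
  void set_callback(std::function<void(int)> cb) {
    assert(cb);  // callers pass a non-empty callback in this sketch
    callback_ = std::move(cb);
  }

  // Run the callback if one is set; returns whether it ran.
  bool fire(int result) {
    if (!callback_)  // operator bool(): false when the function is empty
      return false;
    // Take the callback out and leave the member empty before invoking it.
    auto cb = std::exchange(callback_, nullptr);
    cb(result);
    return true;
  }

private:
  std::function<void(int)> callback_;  // empty state means "nothing pending"
};
```

Calling `fire()` twice after one `set_callback()` runs the callback only once, because the member is reset to the empty state before the call.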

msg/async/AsyncConnection: move the writeCallback instead of copying it

425fc4d

Signed-off-by: Max Kellermann <max.kellermann@ionos.com>

MaxKellermann force-pushed the msg_optimizations branch from a1c70b8 to 425fc4d Compare October 9, 2024 21:18

markhpc changed the title ~~Code cleanup and optimizations in "msg"~~ msg: Code cleanup and optimizations Oct 10, 2024

markhpc added the performance label Oct 10, 2024

batrick merged commit bb13534 into ceph:main Oct 22, 2024

MaxKellermann deleted the msg_optimizations branch October 22, 2024 05:09
