[improve][broker] PIP-379: Key_Shared Draining Hashes for Improved Message Ordering #23352

lhotari · 2024-09-25T16:34:56Z

Fixes #23307
Fixes #21199
Fixes #15705
Fixes #21656
Fixes #20899
Fixes #20885

Implementation for PIP-379: Key_Shared Draining Hashes for Improved Message Ordering

4.0.x docs for the implementation: https://pulsar.apache.org/docs/4.0.x/concepts-messaging/#preserving-order-of-message-delivery-by-key

Motivation

See PIP-379: Key_Shared Draining Hashes for Improved Message Ordering

PIP-379 aims to address several issues with the current implementation and introduce a more efficient mechanism for managing message ordering.

Problem:
The current Key_Shared implementation faces challenges including:

- Complex management of "recently joined consumers"
- Incomplete fulfillment of ordering guarantees
- Unnecessary message blocking
- Poor observability

PIP-379 introduces a "draining hashes" concept to efficiently manage
message ordering by tracking affected hashes when consumer assignments
change.
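
To make the idea concrete, here is a minimal sketch of how such tracking could look. It is an illustration only, not the broker code from this PR: the class `DrainingHashesSketch`, its method names, and the use of plain consumer-name strings are all hypothetical, and it assumes a hash stays "draining" until the previous owner has acknowledged every message that was in flight for that hash when the assignment changed.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical, simplified illustration of the "draining hashes" idea.
// All names here are invented for this sketch.
class DrainingHashesSketch {
    private static final class Entry {
        final String previousOwner; // consumer that still holds in-flight messages
        int pendingCount;           // in-flight messages not yet acknowledged

        Entry(String previousOwner) {
            this.previousOwner = previousOwner;
        }
    }

    // sticky key hash -> draining state for that hash
    private final Map<Integer, Entry> drainingHashes = new HashMap<>();

    // Record one in-flight message for a hash whose assignment moved away
    // from previousOwner.
    void addPending(String previousOwner, int stickyKeyHash) {
        drainingHashes
                .computeIfAbsent(stickyKeyHash, h -> new Entry(previousOwner))
                .pendingCount++;
    }

    // The previous owner acknowledged one in-flight message for the hash.
    // When the count reaches zero the hash is no longer draining.
    void onAcknowledged(String consumer, int stickyKeyHash) {
        Entry entry = drainingHashes.get(stickyKeyHash);
        if (entry != null
                && entry.previousOwner.equals(consumer)
                && --entry.pendingCount == 0) {
            drainingHashes.remove(stickyKeyHash);
        }
    }

    // A hash is blocked for every consumer except the one it is draining from.
    boolean shouldBlock(String consumer, int stickyKeyHash) {
        Entry entry = drainingHashes.get(stickyKeyHash);
        return entry != null && !entry.previousOwner.equals(consumer);
    }
}
```

In this sketch only the affected hashes are blocked, and only for as long as the previous owner still has unacknowledged messages for them; every other hash remains dispatchable.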

Benefits:

- Improved message ordering guarantees
- Reduced unnecessary message blocking
- Better scalability and performance
- Enhanced observability

This proposal would replace the existing "recently joined consumers"
mechanism, addressing its limitations while providing a more robust
solution.

Modifications

See PIP-379: Key_Shared Draining Hashes for Improved Message Ordering for the high-level design with simplified code examples.

Documentation

doc
doc-required
doc-not-needed
doc-complete


codelipenghui

The change looks good.
Just left a few minor comments.

managed-ledger/src/main/java/org/apache/bookkeeper/mledger/ManagedCursor.java

managed-ledger/src/main/java/org/apache/bookkeeper/mledger/impl/ManagedCursorImpl.java

...c/main/java/org/apache/pulsar/broker/service/ConsistentHashingStickyKeyConsumerSelector.java


nicoloboschi

LGTM, nice work

.../apache/pulsar/broker/service/persistent/PersistentStickyKeyDispatcherMultipleConsumers.java


3pacccccc · 2025-08-15T02:50:39Z

.../java/org/apache/pulsar/broker/service/persistent/PersistentDispatcherMultipleConsumers.java

+        // deduplication for readMoreEntriesAsync calls
+        if (readMoreEntriesAsyncRequested.compareAndSet(false, true)) {
+            topic.getBrokerService().executor().execute(() -> {
+                readMoreEntriesAsyncRequested.set(false);


Hi Lari, sorry to bother you, but I'm confused by this part of the source code:

- It uses an atomic flag, readMoreEntriesAsyncRequested, to "block" duplicate requests.
- But it resets the flag as soon as the task starts, not after the task completes.
- Since topic.getBrokerService().executor() is a multi-threaded pool (it has multiple core threads), new requests can jump in immediately.
- So multiple readMoreEntries() calls can run concurrently.

Could you explain how this works? I would appreciate it very much!

@3pacccccc The intention isn't to prevent concurrent execution. The short comment "deduplication for readMoreEntriesAsync calls" explains the intention. In Pulsar code, readMoreEntries and readMoreEntriesAsync are called in multiple locations to ensure that all entries that were made available in one way or another get dispatched. That's why the deduplication makes sense: it continues to provide that guarantee while reducing the overhead of unnecessary concurrent or subsequent calls to readMoreEntries.
Since in most (or all) dispatcher implementations the method "readMoreEntries" itself is a synchronized method, execution will eventually be serialized. I agree that this isn't an optimal way to handle this in dispatchers, but changing it wasn't the intention when the deduplication was added in this PR.

@lhotari OK, I get it, thank you so much!
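
The pattern discussed in this thread can be sketched in isolation as follows. This is a minimal sketch and not the dispatcher code: `ReadDedupSketch`, `ManualExecutor`, and the `reads` counter are hypothetical names introduced for illustration, and the sketch assumes the read itself is serialized by a `synchronized` method. The flag only collapses requests that arrive while a task is already queued; it is cleared before the read starts so that a request arriving during the read schedules another pass.

```java
import java.util.ArrayDeque;
import java.util.Queue;
import java.util.concurrent.Executor;
import java.util.concurrent.atomic.AtomicBoolean;

// Hypothetical stand-in for a dispatcher; only the deduplication flag is modeled.
class ReadDedupSketch {
    // Executor that queues tasks until runAll() is called, so that the
    // behavior can be observed deterministically (an assumption of this sketch).
    static final class ManualExecutor implements Executor {
        private final Queue<Runnable> tasks = new ArrayDeque<>();

        @Override
        public void execute(Runnable task) {
            tasks.add(task);
        }

        int pending() {
            return tasks.size();
        }

        void runAll() {
            Runnable task;
            while ((task = tasks.poll()) != null) {
                task.run();
            }
        }
    }

    private final AtomicBoolean readRequested = new AtomicBoolean(false);
    private final Executor executor;
    private int reads;

    ReadDedupSketch(Executor executor) {
        this.executor = executor;
    }

    // Schedules a read unless one is already queued and has not started yet.
    void readMoreEntriesAsync() {
        if (readRequested.compareAndSet(false, true)) {
            executor.execute(() -> {
                // Clear the flag first: a request that arrives from now on
                // must schedule a new pass, otherwise entries could be missed.
                readRequested.set(false);
                readMoreEntries();
            });
        }
    }

    // Serialized by the monitor lock, so overlapping passes run one by one.
    synchronized void readMoreEntries() {
        reads++;
    }

    synchronized int reads() {
        return reads;
    }

    public static void main(String[] args) {
        ManualExecutor executor = new ManualExecutor();
        ReadDedupSketch dispatcher = new ReadDedupSketch(executor);
        for (int i = 0; i < 5; i++) {
            dispatcher.readMoreEntriesAsync();
        }
        System.out.println("queued tasks: " + executor.pending());
        executor.runAll();
        System.out.println("reads performed: " + dispatcher.reads());
    }
}
```

Five back-to-back requests result in a single queued task and a single read, while a request made after that task has started is scheduled again rather than dropped.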

lhotari added the category/reliability label Sep 25, 2024

lhotari added this to the 4.0.0 milestone Sep 25, 2024

lhotari requested review from codelipenghui, dao-jun, dlg99, eolivelli, equanz, hrsakai, merlimat, poorbarcode and shibd September 25, 2024 16:34

lhotari self-assigned this Sep 25, 2024

github-actions bot added the doc-required label Sep 25, 2024

lhotari mentioned this pull request Sep 27, 2024

[improve][pip] PIP-379: Key_Shared Draining Hashes for Improved Message Ordering #23309

Merged

4 tasks

lhotari force-pushed the lh-pip-379-implementation branch 2 times, most recently from 923f379 to 48b2022 October 1, 2024 11:40

lhotari marked this pull request as ready for review October 1, 2024 11:41

lhotari added the ready-to-test label Oct 1, 2024

lhotari force-pushed the lh-pip-379-implementation branch 2 times, most recently from 1855f33 to a6d9bf3 October 1, 2024 23:51

lhotari added 10 commits October 2, 2024 11:14

Remove PIP-282 solution

1fef930

Implement PIP-379 draining hashes solution

989fa25

Handle TTL case for replay queue where the mark delete position moves forward

aa66c2d

Revisit TTL test

a391306

Join ranges with single consumer

23a8126

Fix getting impacted hashes

5ee3da2

Improve

3d19279

Add more logging

9e4991d

merge overlapping ranges

f024921

Fix condition for overlapping ranges

38ebadf

lhotari added 8 commits October 7, 2024 23:44

Remove unused fields after removing logic

29df175

Revert "Add readOpCount to ManagedCursor to prevent Key_Shared subscription from getting stuck when unblocking a hash"

2d339a1

This reverts commit e5470a3.

Revert "Add hasPendingReadRequest to ManagedCursor"

f4402e8

This reverts commit 7c02595.

Revisit RescheduleReadHandler used for the Key_Shared implementation

ead7bce

Remove unused field

bf1e89c

Revisit removeConsumer method

186dc30

Rename parameter

fbd7695

Polish

22e1709

codelipenghui reviewed Oct 7, 2024


Add some tests for RescheduleReadHandlerTest

58bddf7

codelipenghui approved these changes Oct 8, 2024


lhotari added 5 commits October 8, 2024 05:11

Add test for adding 1000 consumers

fb7a8ea

Optimize snapshotting by assuming that entries are sorted

92f9a29

Optimize snapshotting step by assuming sorted order as input

640b48a

Disable perf test by default since there's no validation to be performed

a81393e

- it's used for quick profiling

Remove invalid test case that makes assumptions about the implementation

34f925d

nicoloboschi approved these changes Oct 8, 2024


lhotari merged commit 3d0625b into apache:master Oct 8, 2024

poorbarcode reviewed Oct 8, 2024

.../apache/pulsar/broker/service/persistent/PersistentStickyKeyDispatcherMultipleConsumers.java

.../apache/pulsar/broker/service/persistent/PersistentStickyKeyDispatcherMultipleConsumers.java

equanz reviewed Oct 11, 2024

.../apache/pulsar/broker/service/persistent/PersistentStickyKeyDispatcherMultipleConsumers.java

This was referenced Oct 11, 2024

[Bug][PIP-379] Hash ranges should not move from one consumer to another when a consumer joins #23439

Closed

key_shared subscription will get suck sometimes when add consumers #14902

Closed

hanmz pushed a commit to hanmz/pulsar that referenced this pull request Feb 12, 2025

[improve][broker] PIP-379: Key_Shared Draining Hashes for Improved Message Ordering (apache#23352)

8d12c5b

izumo27 mentioned this pull request Apr 4, 2025

[fix][doc] Update Topic stats for PIP-379 apache/pulsar-site#998

Merged

2 tasks

3pacccccc reviewed Aug 15, 2025


poorbarcode mentioned this pull request Sep 17, 2025

[improve][broker] Part-1 of PIP-434: Expose Netty channel configuration WRITE_BUFFER_WATER_MARK to pulsar conf and pause receive requests when channel is unwritable #24423

Merged

4 tasks


10 participants
