[improve][pip] PIP-425: Support connecting with next available endpoint for multi-endpoint serviceUrls #24394

AuroraTwinkle · 2025-06-07T09:55:11Z

Implementation: #24387

Motivation

As #22934 and #22933 mentioned, when most of the nodes in serviceurl are down (but there is at least one available node), creating consumers and producers through PulsarClient will most likely fail. I think this is not as expected. If the code is robust enough, as long as there is one available node, it should be accessible normally. Therefore, this pip is going to optimize the logic, remove unavailable nodes through the feedback mechanism, and improve the success rate of PulsarClient requests.

By the way, #22935 removes faulty nodes through a regular health check mechanism, but this brings new problems (frequent creation of connections and increased system load), so this solution is abandoned. See #22934 (comment) for more details!

Modifications

Verifying this change

Make sure that the change passes the CI checks.

(Please pick either of the following options)

This change is a trivial rework / code cleanup without any test coverage.

(or)

This change is already covered by existing tests, such as (please describe tests).

(or)

This change added tests and can be verified as follows:

(example:)

Added integration tests for end-to-end deployment with large payloads (10MB)
Extended integration test for recovery after broker failure

Does this pull request potentially affect one of the following parts:

If the box was checked, please highlight the changes

Documentation

doc
doc-required
doc-not-needed
doc-complete

Matching PR in forked repository

PR in forked repository:

…failed when many nodes in PulsarClient serviceUrl become unavailable

github-actions · 2025-06-07T09:55:43Z

@AuroraTwinkle Please add the following content to your PR description and select a checkbox:

- [ ] `doc` <!-- Your PR contains doc changes -->
- [ ] `doc-required` <!-- Your PR changes impact docs and you will update later -->
- [ ] `doc-not-needed` <!-- Your PR changes do not impact docs -->
- [ ] `doc-complete` <!-- Docs have been already added -->

lhotari

Some initial feedback

pip/pip-425.md

AuroraTwinkle · 2025-06-09T10:11:26Z

Some initial feedback

@lhotari All good suggestions, I will fix them and update later. Thanks for your help!

AuroraTwinkle · 2025-06-09T11:08:58Z

Some initial feedback

Updated!

lhotari

good progress! added some follow up comments

pip/pip-425.md

AuroraTwinkle · 2025-06-09T14:57:55Z

good progress! added some follow up comments

@lhotari Modified and updated as suggested. Thanks!

pip/pip-425.md

codelipenghui

Hi @AuroraTwinkle

The proposal looks good to me.
I’ve added a few comments to help clarify the problem and the proposed solution, making it easier to understand.

pip/pip-425.md

AuroraTwinkle · 2025-06-17T03:04:04Z

Hi @AuroraTwinkle

The proposal looks good to me. I’ve added a few comments to help clarify the problem and the proposed solution, making it easier to understand.

Ok, I will fix them later, Thanks!

Co-authored-by: Penghui Li <penghui@apache.org>

AuroraTwinkle · 2025-06-17T03:51:05Z

Hi @AuroraTwinkle

The proposal looks good to me. I’ve added a few comments to help clarify the problem and the proposed solution, making it easier to understand.

@codelipenghui Very interesting and detailed suggestions, I have fixed them, thank you again!

pip/pip-425.md

lhotari

LGTM, good work @AuroraTwinkle

liangyepianzhou · 2025-06-25T06:22:41Z

@lhotari @codelipenghui @315157973 We need more votes on the mailing list to close this PIP. Could you please help vote when you have a moment?
https://lists.apache.org/thread/c2zvjwf7bqp8nc2rpzbxd4kdtztk23xp

…nt for multi-endpoint serviceUrls (apache#24394) Fixes apache#22934 (comment) Main Issue: apache#22934 (comment) Implementation: apache#24387 ### Motivation As apache#22934 and apache#22933 mentioned, when most of the nodes in serviceurl are down (but there is at least one available node), creating consumers and producers through PulsarClient will most likely fail. I think this is not as expected. If the code is robust enough, as long as there is one available node, it should be accessible normally. Therefore, this pip is going to optimize the logic, remove unavailable nodes through the feedback mechanism, and improve the success rate of PulsarClient requests. By the way, apache#22935 removes faulty nodes through a regular health check mechanism, but this brings new problems (frequent creation of connections and increased system load), so this solution is abandoned. See apache#22934 (comment) for more details!

[improve][pip] PIP-425: fix problem that consumer or producer create …

d7dd357

…failed when many nodes in PulsarClient serviceUrl become unavailable

github-actions bot added PIP doc-label-missing labels Jun 7, 2025

github-actions bot added doc-not-needed Your PR changes do not impact docs and removed doc-label-missing labels Jun 7, 2025

AuroraTwinkle mentioned this pull request Jun 7, 2025

[improve][client]PIP-425:Support connecting with next available endpoint for multi-endpoint serviceUrls #24387

Merged

15 tasks

AuroraTwinkle marked this pull request as ready for review June 7, 2025 10:02

AuroraTwinkle mentioned this pull request Jun 7, 2025

[BUG] consumer or producer will create failed frequently when build PulsarClient with many unavailable broker nodes #22934

Closed

2 tasks

lhotari reviewed Jun 9, 2025

View reviewed changes

AuroraTwinkle changed the title ~~[improve][pip] PIP-425: fix problem that consumer or producer create failed when many nodes in PulsarClient serviceUrl become unavailable~~ [improve][pip] PIP-425: Support connecting with next available endpoint for multi-endpoint serviceUrls Jun 9, 2025

fix review comments

edeed64

AuroraTwinkle force-pushed the PIP-425 branch from ba65ddf to edeed64 Compare June 9, 2025 11:07

lhotari reviewed Jun 9, 2025

View reviewed changes

pip/pip-425.md Outdated Show resolved Hide resolved

pip/pip-425.md Outdated Show resolved Hide resolved

pip/pip-425.md Outdated Show resolved Hide resolved

pip/pip-425.md Outdated Show resolved Hide resolved

AuroraTwinkle force-pushed the PIP-425 branch 2 times, most recently from 7f314b9 to 817a231 Compare June 9, 2025 14:53

[part2]fix review comments

6d3d056

AuroraTwinkle force-pushed the PIP-425 branch from 817a231 to 6d3d056 Compare June 9, 2025 14:56

lhotari reviewed Jun 12, 2025

View reviewed changes

pip/pip-425.md Outdated Show resolved Hide resolved

fix review comments

8b0905d

codelipenghui assigned AuroraTwinkle Jun 16, 2025

codelipenghui reviewed Jun 16, 2025

View reviewed changes

codelipenghui added this to the 4.1.0 milestone Jun 16, 2025

AuroraTwinkle and others added 3 commits June 17, 2025 11:08

Update pip/pip-425.md

7dc259a

Co-authored-by: Penghui Li <penghui@apache.org>

Update pip/pip-425.md

fa4e64e

Co-authored-by: Penghui Li <penghui@apache.org>

Update pip/pip-425.md

ece2b2a

Co-authored-by: Penghui Li <penghui@apache.org>

AuroraTwinkle and others added 5 commits June 17, 2025 11:10

Update pip/pip-425.md

ec024ac

Co-authored-by: Penghui Li <penghui@apache.org>

Update pip/pip-425.md

6cbe13e

Co-authored-by: Penghui Li <penghui@apache.org>

Update pip/pip-425.md

f24a5f1

Co-authored-by: Penghui Li <penghui@apache.org>

fix review comments

b4cad0c

fix review comments

0e376b7

codelipenghui approved these changes Jun 17, 2025

View reviewed changes

add discuss link

0c8e808

315157973 reviewed Jun 21, 2025

View reviewed changes

pip/pip-425.md Outdated Show resolved Hide resolved

315157973 reviewed Jun 21, 2025

View reviewed changes

pip/pip-425.md Outdated Show resolved Hide resolved

fix review comments

9592001

liangyepianzhou approved these changes Jun 24, 2025

View reviewed changes

lhotari approved these changes Jun 24, 2025

View reviewed changes

AuroraTwinkle requested a review from 315157973 June 24, 2025 09:26

315157973 approved these changes Jun 25, 2025

View reviewed changes

Technoboy- approved these changes Jun 25, 2025

View reviewed changes

liangyepianzhou merged commit be385c4 into apache:master Jun 25, 2025
20 checks passed

[improve][pip] PIP-425: Support connecting with next available endpoint for multi-endpoint serviceUrls #24394

[improve][pip] PIP-425: Support connecting with next available endpoint for multi-endpoint serviceUrls #24394

Uh oh!

Conversation

AuroraTwinkle commented Jun 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Modifications

Verifying this change

Does this pull request potentially affect one of the following parts:

Documentation

Matching PR in forked repository

Uh oh!

github-actions bot commented Jun 7, 2025

Uh oh!

lhotari left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

AuroraTwinkle commented Jun 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

AuroraTwinkle commented Jun 9, 2025

Uh oh!

lhotari left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

AuroraTwinkle commented Jun 9, 2025

Uh oh!

Uh oh!

codelipenghui left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

AuroraTwinkle commented Jun 17, 2025

Uh oh!

AuroraTwinkle commented Jun 17, 2025

Uh oh!

Uh oh!

Uh oh!

lhotari left a comment

Choose a reason for hiding this comment

Uh oh!

liangyepianzhou commented Jun 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

AuroraTwinkle commented Jun 7, 2025 •

edited

Loading

AuroraTwinkle commented Jun 9, 2025 •

edited

Loading

liangyepianzhou commented Jun 25, 2025 •

edited

Loading