Add MultiRead support in Keeper and internal ZK client by antonio2368 · Pull Request #41410 · ClickHouse/ClickHouse

antonio2368 · 2022-09-16T12:02:42Z

Changelog category (leave one):

Improvement

Changelog entry (a user-readable short description of the changes that goes to CHANGELOG.md):

Add support for MultiRead in Keeper and internal ZooKeeper client.

Continuation of #36725

Information about CI checks: https://clickhouse.com/docs/en/development/continuous-integration/

utils/zookeeper-dump-tree/main.cpp

antonio2368 · 2022-09-16T12:06:44Z

The MultiRead implemented by ZooKeeper supports only 2 operations:

- Get
- The old version of list, which we call SimpleList. This is the only place where the old version of list (without stat) is used.

I don't see why we wouldn't be able to send multiple regular list operations, and other read requests such as exists, in a MultiRead.

Because of that, and because it's hard for us to tell whether we are connected to ZK or to Keeper, I propose we support MultiRead operations only when we are connected to a Keeper that supports it.

cc @tavplubix @alesapin

tavplubix · 2022-09-28T15:40:53Z

src/Storages/MergeTree/ReplicatedMergeTreeQueue.cpp

+};
+
+template <bool with_multiread>
+std::vector<BlockInfoInZooKeeper> getBlockInfos(const auto & partitions, const auto & zookeeper, const auto & zookeeper_path)


There are many more places where we send multiple get requests asynchronously. Consider implementing a method in the ZooKeeper client that takes a parent path and an array (or iterator range) of node names, checks the API version, and retrieves the data using either async get requests or MultiRead.

antonio2368 · 2022-09-28

I've been thinking about it, but didn't do it as part of this task because it could end up a bit complex.
That is mostly because of how we process async gets: as soon as the result of the next get request arrives we process it, without waiting for all of them to finish.

One way to implement it would be to pass a callback that is called for each get result, but I can do that in a separate PR.
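
To make the suggestion above concrete, here is a minimal sketch of such a helper. Everything in it is an assumption for illustration: `MockClient`, `supports_multi_read`, `round_trips`, and `getChildrenData` are hypothetical names, not the ClickHouse client API, and the mock is synchronous where the real client would issue async gets.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <map>
#include <optional>
#include <string>
#include <vector>

// Hypothetical in-memory mock; not the real ZooKeeper/Keeper client.
struct MockClient
{
    std::map<std::string, std::string> nodes;
    bool supports_multi_read = false; // stands in for an API-version check
    int round_trips = 0;              // number of requests sent to the server

    std::optional<std::string> lookup(const std::string & path) const
    {
        auto it = nodes.find(path);
        if (it == nodes.end())
            return std::nullopt;
        return it->second;
    }

    // One request per node.
    std::optional<std::string> get(const std::string & path)
    {
        ++round_trips;
        return lookup(path);
    }

    // One request for the whole batch.
    std::vector<std::optional<std::string>> multiRead(const std::vector<std::string> & paths)
    {
        ++round_trips;
        std::vector<std::optional<std::string>> result;
        for (const auto & path : paths)
            result.push_back(lookup(path));
        return result;
    }
};

using OnResult = std::function<void(const std::string & name, const std::optional<std::string> & data)>;

// Reads `parent/name` for every name and invokes the callback once per node.
// The caller does not need to know which transport was used.
inline void getChildrenData(
    MockClient & client,
    const std::string & parent,
    const std::vector<std::string> & names,
    const OnResult & on_result)
{
    if (client.supports_multi_read)
    {
        std::vector<std::string> paths;
        for (const auto & name : names)
            paths.push_back(parent + "/" + name);

        const auto results = client.multiRead(paths);
        assert(results.size() == names.size());
        for (std::size_t i = 0; i < names.size(); ++i)
            on_result(names[i], results[i]);
    }
    else
    {
        // Fallback: one get per node, each result handed to the callback
        // as soon as it is available (synchronously in this mock).
        for (const auto & name : names)
            on_result(name, client.get(parent + "/" + name));
    }
}
```

With three node names, the fallback path in this mock costs three round trips and the MultiRead path costs one, while the callback sees the same results in the same order either way.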

tavplubix · 2022-09-28T15:44:19Z

src/Common/ZooKeeper/ZooKeeperCommon.cpp

+    if (operation_type.has_value() && *operation_type != type)
+        throw Exception("Illegal mixing of read and write operations in multi request", Error::ZBADARGUMENTS);


It means that our code tried to send an ill-formed request. Consider throwing LOGICAL_ERROR (but then it will be a DB::Exception, not a Coordination::Exception) or adding a chassert here.
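
The check under review can be sketched in isolation as follows. This is a simplified illustration, not the code from the PR: `OpKind`, `OperationType`, `classify`, and `checkOperationType` are hypothetical names, and `std::logic_error` stands in for whichever error mechanism (LOGICAL_ERROR or an assertion) is chosen.

```cpp
#include <cassert>
#include <optional>
#include <stdexcept>
#include <vector>

// Hypothetical stand-ins; the real request classes are richer.
enum class OpKind { Get, List, Exists, Create, Remove, Set };
enum class OperationType { Read, Write };

// Classify a single sub-request as a read or a write.
inline OperationType classify(OpKind kind)
{
    switch (kind)
    {
        case OpKind::Get:
        case OpKind::List:
        case OpKind::Exists:
            return OperationType::Read;
        case OpKind::Create:
        case OpKind::Remove:
        case OpKind::Set:
            return OperationType::Write;
    }
    throw std::logic_error("unknown operation kind");
}

// Returns the common type of all sub-requests, or std::nullopt for an empty batch.
// Mixing reads and writes is a bug in the caller rather than a server-side
// condition, so it is reported as a logic error.
inline std::optional<OperationType> checkOperationType(const std::vector<OpKind> & ops)
{
    std::optional<OperationType> operation_type;
    for (OpKind op : ops)
    {
        const OperationType type = classify(op);
        if (operation_type.has_value() && *operation_type != type)
            throw std::logic_error("Illegal mixing of read and write operations in multi request");
        operation_type = type;
    }
    // Postcondition: the result is empty only when the batch is empty.
    assert(ops.empty() == !operation_type.has_value());
    return operation_type;
}
```

A batch of only reads or only writes passes and reports its type; a batch such as `{Get, Create}` throws before anything would be sent.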

antonio2368 · 2022-09-28T16:55:26Z

@tavplubix I would like to hear your thoughts on something.
When I checked the async read calls, some of them are okay with ZNONODE.
Currently, MultiRead behaves the same as Multi: if any request returns something other than ZOK, the remaining requests are not processed and return ZRUNTIME. That makes sense for write requests, because we can end up with invalid data if we don't abort on a faulty request, but for read requests it doesn't matter.
Keeping it like that is more consistent, but since we support MultiRead on Keeper only, we can change that behavior and process all requests regardless of the returned value.

tavplubix · 2022-09-28T17:21:01Z

I think it would be more convenient to return responses for all read requests even if some nodes do not exist. Sometimes we need to get many nodes that may be removed concurrently (and it's not an error if some node does not exist). But multi should throw a ZNONODE exception in this case anyway (and tryMulti should fill all responses with either data or an error code).

antonio2368 · 2022-09-28T17:27:47Z

So on the Keeper's side, it should set the error code to that of the first response that is not ZOK but continue processing, and the client will handle it differently depending on whether it's `multi` or `tryMulti`.
This should allow us to cover many more cases and add support for exists, for example.
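
The agreed semantics can be sketched as follows. This is an illustration under assumptions, not Keeper's implementation: the types and the names `processMultiRead`, `multiRead`, and `tryMultiRead` are hypothetical, and only get requests with ZOK and ZNONODE are modelled.

```cpp
#include <cassert>
#include <map>
#include <stdexcept>
#include <string>
#include <utility>
#include <vector>

// Hypothetical, simplified types; not the actual Keeper or client classes.
enum class Error { ZOK, ZNONODE };

struct GetResponse
{
    Error error = Error::ZOK;
    std::string data;
};

struct MultiReadResponse
{
    Error error = Error::ZOK;           // first non-ZOK code seen, if any
    std::vector<GetResponse> responses; // one entry per request, always complete
};

using Storage = std::map<std::string, std::string>;

// Server side: process every read and remember only the first failure.
inline MultiReadResponse processMultiRead(const Storage & storage, const std::vector<std::string> & paths)
{
    MultiReadResponse result;
    for (const auto & path : paths)
    {
        GetResponse response;
        if (auto it = storage.find(path); it != storage.end())
            response.data = it->second;
        else
            response.error = Error::ZNONODE;

        if (response.error != Error::ZOK && result.error == Error::ZOK)
            result.error = response.error;
        result.responses.push_back(std::move(response));
    }
    assert(result.responses.size() == paths.size());
    return result;
}

// Client side, strict flavour: any failed sub-request becomes an exception.
inline std::vector<GetResponse> multiRead(const Storage & storage, const std::vector<std::string> & paths)
{
    auto result = processMultiRead(storage, paths);
    if (result.error != Error::ZOK)
        throw std::runtime_error("ZNONODE");
    return std::move(result.responses);
}

// Client side, lenient flavour: return every response and let the caller
// inspect the per-request error codes.
inline std::vector<GetResponse> tryMultiRead(const Storage & storage, const std::vector<std::string> & paths)
{
    return processMultiRead(storage, paths).responses;
}
```

Reading `/a`, `/b`, `/c` when `/b` is missing gives three responses from the lenient call, with ZNONODE only in the second slot, while the strict call throws.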

antonio2368 · 2022-09-29T11:30:07Z

I added the discussed behavior of always processing every read request instead of stopping at the first failure.
I also created an API for running multiple read requests of the same type on a set of paths (it picks which method to use based on what the server supports).
It may look complicated, but I tried to hide the knowledge of which method was used as much as possible, and to me it looks easy to use.
Get, exists and getChildren are all supported.
The 3 changes I made are mostly there to showcase the API. In a separate PR, I'll continue searching for places where it can replace async read requests.

tavplubix · 2022-10-03T11:17:40Z

Integration tests - test_disks_app_func was broken in master
Stateless tests (aarch64) - #41934
Stress test (msan) - #41548
Stress test (tsan) - The specified key does not exist in BC check

antonio2368 added 4 commits September 16, 2022 09:55

Define client for multi read

5662e00

Use multiread for dump

97f4cbe

Add support for MultiRead in Keeper

81f7cf3

Add get to multi request

0665298


antonio2368 mentioned this pull request Sep 16, 2022

Added zookeeper multi-read support #36725

Closed

4 tasks

antonio2368 changed the title ~~Keeper multiread~~ Add MultiRead support in Keeper and internal ZK client Sep 16, 2022

robot-clickhouse added the pr-improvement Pull request with some product improvements label Sep 16, 2022

antonio2368 added 8 commits September 26, 2022 07:16

Merge branch 'master' into keeper-multiread

f833366

Add support for simple list

937d534

Revert dump tree

99665f1

Use multiread

cc3719e

Better TransactionLog with multiread

56cc3c7

Format

97385ca

Fix zookeeper_log

7a9afc4

Merge branch 'master' into keeper-multiread

265c8b3

antonio2368 marked this pull request as ready for review September 27, 2022 12:39

tavplubix self-assigned this Sep 27, 2022

Support filtered list

d0457ad

tavplubix approved these changes Sep 28, 2022

antonio2368 marked this pull request as draft September 29, 2022 06:12

antonio2368 added 5 commits September 29, 2022 06:50

Dont fail MultiRead on first failed op

349bd7f

Merge branch 'master' into keeper-multiread

3109ce5

Use chassert

bcefa6e

Define methods for multi read requests

fcc5410

Cleanup

0056eeb

Add support for exists in multiread

94f1fe3

antonio2368 marked this pull request as ready for review September 29, 2022 11:27

antonio2368 requested a review from tavplubix September 29, 2022 11:27

tavplubix approved these changes Sep 29, 2022

tavplubix merged commit 00914a1 into master Oct 3, 2022

tavplubix deleted the keeper-multiread branch October 3, 2022 11:17

alexey-milovidov mentioned this pull request Oct 8, 2022

Intern Tasks 2021/2022 #29601

Closed

tavplubix merged 19 commits into master from keeper-multiread
