Make TaskBatcher Less Lock-Heavy #82227

original-brownbear merged 7 commits into elastic:master from original-brownbear:less-locking-task-batcher
Conversation
In many-shards benchmarks we see a lot of contention when submitting tasks. This is obvious when working with lots of large task batches and doing long iterations. We don't need to hold the lock in `runIfNotProcessed` beyond removing the task set for a key, and we can be a little more efficient when creating the new task set in `submitTasks` as well. Also, we don't need a fully locking map: operations for different batching keys are often interleaved, so we move to a `ConcurrentHashMap` (CHM) as well. This change is particularly relevant for stability because we often submit tasks directly from network threads, where grinding through e.g. a bunch of shard state updates and having to lock the map over and over, while e.g. a huge batch of index-creation tasks was being iterated in `runIfNotProcessed`, caused very visible latency.
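A rough sketch of the pattern described above, with simplified types (`String` tasks instead of `BatchedTask`; all names here are illustrative, not the actual TaskBatcher code). `compute` merges submissions per batching key without a global lock, and `remove` atomically detaches the whole set so processing happens outside any map lock:

```java
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical sketch: a ConcurrentHashMap keyed by batching key lets submissions
// for different keys proceed without contending on a single shared lock.
class BatcherSketch {
    private final ConcurrentHashMap<String, Set<String>> tasksPerBatchingKey = new ConcurrentHashMap<>();

    // Merge newly submitted tasks into the per-key set; compute runs atomically per key.
    void submitTasks(String batchingKey, List<String> tasks) {
        tasksPerBatchingKey.compute(batchingKey, (key, existing) -> {
            if (existing == null) {
                return new LinkedHashSet<>(tasks); // no existing set: just build a fresh one
            }
            existing.addAll(tasks);
            return existing;
        });
    }

    // Atomically detach the whole set for a key; iteration then needs no map lock.
    Set<String> takeTasks(String batchingKey) {
        return tasksPerBatchingKey.remove(batchingKey);
    }

    public static void main(String[] args) {
        BatcherSketch b = new BatcherSketch();
        b.submitTasks("create-index", List.of("t1", "t2"));
        b.submitTasks("create-index", List.of("t3"));
        System.out.println(b.takeTasks("create-index")); // [t1, t2, t3]
        System.out.println(b.takeTasks("create-index")); // null
    }
}
```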
Pinging @elastic/es-distributed (Team:Distributed)

Also relates #81626. I'm unlikely to get to this today, bear with me.
```java
        return new LinkedHashSet<>(tasks);
    }
    for (BatchedTask existing : existingTasks) {
        // check that there won't be two tasks with the same identity for the same batching key
```
FWIW I don't think I've ever seen this check fail, maybe we could make it an assertion?
I'm all for it. I think we can only get here via a bug and I've also never seen this happen outside of bugs during experimenting with stuff. I'll make it an assertion :)
No rush :)

Made the dup. check an assertion now.
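The change discussed here, sketched with simplified types (`String` in place of `BatchedTask`; names are illustrative): since a duplicate task identity for the same batching key can only arise from a caller bug, the check becomes an `assert` (enabled with `-ea` in tests) instead of a runtime check paid for in production:

```java
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

// Illustrative sketch of turning the duplicate-task check into an assertion.
class DupCheckSketch {
    static Set<String> merge(Set<String> existingTasks, List<String> tasks) {
        for (String task : tasks) {
            // only reachable via a bug, so an assertion suffices
            assert existingTasks.contains(task) == false
                : "task [" + task + "] already queued for this batching key";
        }
        existingTasks.addAll(tasks);
        return existingTasks;
    }

    public static void main(String[] args) {
        Set<String> existing = new LinkedHashSet<>(List.of("a"));
        System.out.println(merge(existing, List.of("b", "c"))); // [a, b, c]
    }
}
```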
henningandersen
left a comment
I wonder if we can use an immutable (or alternatively concurrent) structure in the map?
```java
    if (existingTasks == null) {
        return new LinkedHashSet<>(tasks);
    }
    existingTasks.addAll(tasks);
```
I am not too fond of this pattern. I think it works due to how CHM synchronizes on the head of the bucket in both compute and remove. But I think there is no guarantee from CHM to do either. It seems unlikely that compute could run the remapping function on the same key in parallel (since that would break the call only once guarantee), but I am less certain that a future evolution of CHM could not remove the synchronized in remove (do not see a way to do it though).
In short, I think we rely too much on the internals of CHM here in a very central place. Could we perhaps go immutable here and return a list of the two lists - and flatten it when extracted/removed?
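The immutable alternative floated here could look roughly like the following sketch (simplified types, illustrative names; this is the reviewer's suggestion, not what was merged): `compute` never mutates the stored value but returns a fresh immutable list of batches, and the batches are flattened only once, when the key is taken for processing, so correctness does not depend on `remove` synchronizing with a concurrently running remapping function:

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ConcurrentHashMap;

// Rough sketch of the suggested immutable list-of-lists approach.
class ImmutableAlternative {
    private final ConcurrentHashMap<String, List<List<String>>> tasksPerKey = new ConcurrentHashMap<>();

    void submit(String key, List<String> tasks) {
        // Append without mutating: the stored value is always a fresh immutable list of batches.
        tasksPerKey.compute(key, (k, existing) -> {
            if (existing == null) {
                return List.of(List.copyOf(tasks));
            }
            List<List<String>> appended = new ArrayList<>(existing);
            appended.add(List.copyOf(tasks));
            return List.copyOf(appended);
        });
    }

    // Flatten the batches only once, when the whole key is extracted/removed.
    List<String> take(String key) {
        List<List<String>> pending = tasksPerKey.remove(key);
        if (pending == null) {
            return List.of();
        }
        List<String> flat = new ArrayList<>();
        pending.forEach(flat::addAll);
        return flat;
    }

    public static void main(String[] args) {
        ImmutableAlternative alt = new ImmutableAlternative();
        alt.submit("k", List.of("a", "b"));
        alt.submit("k", List.of("c"));
        System.out.println(alt.take("k")); // [a, b, c]
    }
}
```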
I think immutable either gets quite tricky (if we keep a list of lists, the timeout task removal gets awkward) or slow if we copy over and over. I'd rather go synchronized I think, it shouldn't be too much overhead here given how we will never have contention on the collection.
Though tbh. I'm not sure about the need for this. CHM docs state:

> This class obeys the same functional specification as {@link java.util.Hashtable}

which means that remove will always be synchronized with the compute operation I'd assume?
It says right after that:

> This class is fully interoperable with {@code Hashtable} in programs that rely on its thread safety but not on its synchronization details.

It is also just nice to have this code "obviously correct", since it is so important here.
Made the collection synchronized now 56ba16c, let me know what you think :)
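The approach taken here, as a rough sketch with simplified types (illustrative names, not the actual commit): wrapping the per-key set in `Collections.synchronizedSet` makes its mutation safe in its own right, so correctness no longer hinges on whether CHM's `compute` and `remove` happen to synchronize on the same internal bin:

```java
import java.util.Collections;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;

// Sketch: guard the per-key set itself rather than relying on CHM's internal locking.
class SynchronizedSetSketch {
    private final ConcurrentHashMap<String, Set<String>> tasksPerBatchingKey = new ConcurrentHashMap<>();

    void submitTasks(String batchingKey, List<String> tasks) {
        tasksPerBatchingKey.compute(batchingKey, (key, existing) -> {
            if (existing == null) {
                // the stored set is synchronized, so later addAll calls are safe on their own
                return Collections.synchronizedSet(new LinkedHashSet<>(tasks));
            }
            existing.addAll(tasks);
            return existing;
        });
    }

    Set<String> takeTasks(String batchingKey) {
        return tasksPerBatchingKey.remove(batchingKey);
    }

    public static void main(String[] args) {
        SynchronizedSetSketch s = new SynchronizedSetSketch();
        s.submitTasks("k", List.of("a"));
        s.submitTasks("k", List.of("b"));
        System.out.println(s.takeTasks("k")); // [a, b]
    }
}
```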
henningandersen
left a comment
Thanks, I think synchronized is fine, though we should explicitly synchronize instead.
```java
    }
    final Set<BatchedTask> pending = tasksPerBatchingKey.remove(updateTask.batchingKey);
    if (pending != null) {
        for (BatchedTask task : pending) {
```
I think we would want to synchronize explicitly on the pending set instead of relying on Collections.synchronizedSet, iterating a synchronized set like this is not really safe.
Right fair point :) Added locking to this loop.
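The point being addressed, sketched with simplified types (`String` in place of `BatchedTask`; illustrative names): `Collections.synchronizedSet` only guards individual method calls, so iterating the wrapped set requires explicitly holding its monitor, as the `synchronizedSet` javadoc mandates:

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;

// Sketch: iterating a synchronizedSet safely requires taking the set's own lock.
class IterateSynchronizedSketch {
    static List<String> drain(ConcurrentHashMap<String, Set<String>> map, String batchingKey) {
        List<String> processed = new ArrayList<>();
        Set<String> pending = map.remove(batchingKey);
        if (pending != null) {
            synchronized (pending) { // explicit lock: for-each iteration is not atomic otherwise
                for (String task : pending) {
                    processed.add(task);
                }
            }
        }
        return processed;
    }

    public static void main(String[] args) {
        ConcurrentHashMap<String, Set<String>> map = new ConcurrentHashMap<>();
        map.put("k", Collections.synchronizedSet(new LinkedHashSet<>(List.of("t1", "t2"))));
        System.out.println(drain(map, "k")); // [t1, t2]
    }
}
```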
Jenkins run elasticsearch-ci/bwc

Thanks Henning + David!
💚 Backport successful
relates #77466