the transport/http worker thread will not be released for a long time #57284

Closed
hanbj wants to merge 1 commit into elastic:master from hanbj:same

Conversation

@hanbj
Contributor

@hanbj hanbj commented May 28, 2020

When the cluster is very large, calls to interfaces such as _cat/shards, _cat/indices, or _all/_mapping get no response for a long time, and sometimes even a gateway timeout, because the response is too large.
For example, when a REST request hits _all/_mapping, ES ultimately calls the toXContent() method to build the JSON response. This is a very time-consuming operation, and it executes on the http_server_worker thread.
The SAME thread pool behaves no differently: a task submitted to the SAME pool still runs on the transport_worker / http_server_worker thread. If a large task is submitted to the SAME pool, the worker thread is not released for a long time, reducing overall system throughput.
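The SAME-pool behavior described above can be illustrated with a minimal sketch. This is not Elasticsearch's actual code (the class and field names here are hypothetical); it only demonstrates the semantics of a "same" executor, which runs tasks directly on the submitting thread:

```java
import java.util.concurrent.Executor;

public class SameExecutorDemo {
    // A "same" executor in the spirit of ThreadPool.Names.SAME:
    // it runs the task directly on the calling thread instead of
    // handing it off to a separate pool.
    public static final Executor SAME = Runnable::run;

    public static void main(String[] args) {
        Thread caller = Thread.currentThread();
        final Thread[] ranOn = new Thread[1];
        SAME.execute(() -> ranOn[0] = Thread.currentThread());
        // The task ran on the submitting thread. If that thread is a
        // transport_worker / http_server_worker, it stays blocked for
        // the task's full duration.
        System.out.println(ranOn[0] == caller); // prints "true"
    }
}
```

So a long-running serialization task dispatched to the SAME pool from a network worker keeps that worker occupied until the task finishes.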

@cla-checker-service

cla-checker-service bot commented May 28, 2020

💚 CLA has been signed

@dliappis
Contributor

@hanbj Thank you for the PR. Can you please sign the Contributor Agreement?

@dliappis dliappis added the :Distributed/Network Http and internode communication implementations label May 28, 2020
@elasticmachine
Collaborator

Pinging @elastic/es-distributed (:Distributed/Network)

@elasticmachine elasticmachine added the Team:Distributed Meta label for distributed team. label May 28, 2020
@Tim-Brooks
Contributor

Can you provide the context behind this PR? My understanding is that you are concerned that the XContent response is serialized on the transport thread for _cat/shards, _cat/indices, or _all/_mapping calls. How large are the responses we are discussing, and what kind of latency does that introduce?

@hanbj
Contributor Author

hanbj commented May 29, 2020

@tbrooks8 Thank you. The background is as follows:
cluster info:
{
  "cluster_name" : "ecm",
  "status" : "green",
  "timed_out" : false,
  "number_of_nodes" : 159,
  "number_of_data_nodes" : 136,
  "active_primary_shards" : 31283,
  "active_shards" : 57878,
  "relocating_shards" : 0,
  "initializing_shards" : 0,
  "unassigned_shards" : 0,
  "delayed_unassigned_shards" : 0,
  "number_of_pending_tasks" : 0,
  "number_of_in_flight_fetch" : 0,
  "task_max_waiting_in_queue_millis" : 0,
  "active_shards_percent_as_number" : 100.0
}

I added timing output before and after the messageReceived method of the InboundHandler class. In the logs I found that messageReceived sometimes takes a few seconds, or even tens of seconds. When the cluster metadata is very large (many indices and shards), there is almost no response when I call the _cat/shards interface.

curl -XGET http://127.0.0.1:9200/_mappings > mappings
  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                 Dload  Upload   Total   Spent    Left  Speed
100 93.4M  100 93.4M    0     0  14.3M      0  0:00:06  0:00:06 --:--:-- 20.8M

At this time, the messageReceived method will also take a lot of time.
[2020-05-29T11:38:54,097][WARN ][o.e.m.r.RequestTracker ] [ecm-0] tcp cost too long,outRequestId=null, requestId=10804437, action=indices:admin/mappings/get, cost=6111ms

So I read the relevant code and found that some interfaces execute on the SAME thread pool. As a result, the transport_worker / http_server_worker thread may not be released for a long time and cannot process new requests, which hurts the overall throughput of the system.

@hanbj
Contributor Author

hanbj commented Jun 1, 2020

@hanbj Thank you for the PR. Can you please sign the Contributor Agreement?
@dliappis Yes, thank you. I have signed the Contributor Agreement. Why do all checks still fail?

@dliappis
Contributor

dliappis commented Jun 1, 2020

@dliappis Yes, thank you. I have signed the Contributor Agreement. Why do all checks still fail?

@hanbj the problem is that your first commit used a different email (hanbj <hanbaojun@didi<REDACTING>.com>) from your GitHub email (@163.com).

One easy option is to re-sign with the email you committed with.

@hanbj hanbj closed this Jun 1, 2020
@hanbj hanbj reopened this Jun 1, 2020
@hanbj
Contributor Author

hanbj commented Jun 5, 2020

@tbrooks8 @dliappis Do you have any other ideas about this PR? I see the _cat/indices interface is submitted to the MANAGEMENT thread pool for execution; alternatively, we could add a thread pool to the ThreadPool class to handle large responses.

@original-brownbear
Contributor

original-brownbear commented Jun 5, 2020

@hanbj how did you measure that 6s and interpret it as serialization time (mainly, I'm wondering whether your measurement includes non-blocking IO time as well)? Even though there are some expensive operations going on when serializing the mapping response, I find it somewhat unlikely that it would take 6s, even with almost 100MB of mappings.

I think if we decide to fix this it might be fine to simply move the serialization to the generic pool and be done with it (since this shouldn't be called at a high frequency, I don't think there's much point in optimizing the logic itself), but I'm having a hard time reproducing a multi-second serialization time.
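The offloading idea above can be sketched as follows. This is a simplified, hypothetical illustration (the class, method, and pool names are not Elasticsearch's): the expensive serialization is performed on a background pool and the result handed back via a future, so the network worker is never blocked on it:

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

public class OffloadSerialization {
    // Stand-in for a "generic" background pool; the real system would
    // size and manage this pool itself.
    public static final ExecutorService GENERIC = Executors.newFixedThreadPool(4);

    // Build the (potentially huge) response body off the network thread.
    public static CompletableFuture<String> serializeOffThread(Object response) {
        return CompletableFuture.supplyAsync(() -> {
            // Expensive toXContent()-style serialization would happen here,
            // on a GENERIC pool thread rather than a transport_worker.
            return response.toString();
        }, GENERIC);
    }

    public static void main(String[] args) throws Exception {
        String body = serializeOffThread(java.util.Map.of("acknowledged", true)).get();
        System.out.println(body);
        GENERIC.shutdown();
    }
}
```

The caller attaches a completion callback (or blocks on a non-network thread) and only the cheap byte-flushing step returns to the event loop.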

@hanbj
Contributor Author

hanbj commented Jun 10, 2020

@original-brownbear
I used the arthas tool: https://alibaba.github.io/arthas/trace.html

Using the trace command, we can search the method call path matching a class pattern / method pattern, render and count the performance overhead across the whole call link, and trace the call chain.

[arthas@1669]$ trace org.elasticsearch.transport.TcpTransport handleResponse '#cost > 5000'
Press Q or Ctrl+C to abort.
Affect(class count: 5 , method count: 1) cost in 513 ms, listenerId: 3
---ts=2020-06-10 22:42:05;thread_name=elasticsearch[ecm-clientnode-19][transport_worker][T#19];id=3c;is_daemon=true;priority=5;TCCL=sun.misc.Launcher$AppClassLoader@70dea4e ---[6939.108529ms] org.elasticsearch.transport.TcpTransport:handleResponse()
+---[26.661807ms] org.elasticsearch.transport.TransportResponseHandler:read() # 982
+---[0.011376ms] org.elasticsearch.common.transport.TransportAddress:() # 983
+---[0.010936ms] org.elasticsearch.transport.TransportResponse:remoteAddress() # 983
+---[0.03432ms] org.elasticsearch.transport.TransportResponseHandler:executor() # 989
+---[0.010652ms] org.elasticsearch.threadpool.ThreadPool:executor() # 989
`---[0.011082ms] org.elasticsearch.transport.TcpTransport$1:() # 989

The handleResponse() method took 6939ms.

@original-brownbear
Contributor

Thanks @hanbj that makes sense. I don't think the suggested fix here is viable (we can't simply put all kinds of response handling on the management pool just because of a large response) but the problem seems valid to me. I opened #57937 with a suggested fix.

@original-brownbear
Contributor

I hope you don't mind that I'll close this one for now since the change in this PR isn't something we want to go with as explained above.
There is a possible (though still debatable and debated) mitigation open in #57937 and a performance improvement for compression in #60953 that should also help here.

Thanks so much for bringing this to our attention @hanbj

@hanbj hanbj deleted the same branch August 11, 2020 14:54
original-brownbear added a commit that referenced this pull request Aug 11, 2020
Use thread-local buffers and deflater and inflater instances to speed up
compressing and decompressing from in-memory bytes.
Not manually invoking `end()` on these should be safe since their off-heap memory
will eventually be reclaimed by the finalizer thread which should not be an issue for thread-locals
that are not instantiated at a high frequency.
This significantly reduces the amount of byte copying and object creation relative to the previous approach
which had to create a fresh temporary buffer (that was then resized multiple times during operations), copied
bytes out of that buffer to a freshly allocated `byte[]`, used 4k stream buffers needlessly when working with
bytes that are already in arrays (`writeTo` handles efficient writing to the compression logic now) etc.

Relates #57284 which should be helped by this change to some degree.
Also, I expect this change to speed up mapping/template updates a little as those make heavy use of these
code paths.
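The thread-local Deflater/Inflater reuse described in the commit message above can be sketched roughly as follows. This is a simplified illustration, not the actual Elasticsearch implementation (class and method names are hypothetical, and real code would loop until deflate/inflate report completion for arbitrary sizes):

```java
import java.util.zip.Deflater;
import java.util.zip.Inflater;

public class ThreadLocalCompression {
    // Each thread keeps one Deflater/Inflater and resets it between uses,
    // avoiding the cost of allocating (and later finalizing) fresh
    // instances for every message.
    static final ThreadLocal<Deflater> DEFLATER =
            ThreadLocal.withInitial(() -> new Deflater(Deflater.DEFAULT_COMPRESSION));
    static final ThreadLocal<Inflater> INFLATER = ThreadLocal.withInitial(Inflater::new);

    public static byte[] compress(byte[] input) {
        Deflater deflater = DEFLATER.get();
        deflater.reset();                 // reuse, don't reallocate
        deflater.setInput(input);
        deflater.finish();
        // Generous buffer for this sketch; fine for small, compressible input.
        byte[] buffer = new byte[input.length + 64];
        int len = deflater.deflate(buffer);
        byte[] out = new byte[len];
        System.arraycopy(buffer, 0, out, 0, len);
        return out;
    }

    public static byte[] decompress(byte[] compressed, int originalLength) throws Exception {
        Inflater inflater = INFLATER.get();
        inflater.reset();
        inflater.setInput(compressed);
        byte[] out = new byte[originalLength];
        int len = inflater.inflate(out);
        if (len != originalLength) throw new IllegalStateException("short read");
        return out;
    }
}
```

Since the instances live in thread-locals on long-lived worker threads, they are created once per thread rather than once per request, which is where the allocation and byte-copying savings come from.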
original-brownbear added a commit that referenced this pull request Aug 12, 2020
