Add http request content stream support by mhl-b · Pull Request #111438 · elastic/elasticsearch

mhl-b · 2024-07-30T02:19:50Z

This PR adds support of streamed content to RestRequest. Major changes:

Change type of content in RestRequest and HttpRequest from BytesReference to new interface HttpBody
HttpBody has 2 sub interfaces Full for fully aggregated requests and Stream for lazy readers
Added netty based implementation for the streamed content Netty4HttpRequestContentStream
Added aggregation decider to HttpObjectAggregator, wrapped into new class Netty4HttpAggregator

elasticsearchmachine · 2024-07-30T02:21:36Z

Pinging @elastic/es-distributed (Team:Distributed)

modules/transport-netty4/src/main/java/org/elasticsearch/http/netty4/Netty4HttpAggregator.java

DaveCTurner

I need to look at the details some more, but I left comments from an initial pass.

DaveCTurner · 2024-07-30T15:26:47Z

...y4/src/internalClusterTest/java/org/elasticsearch/http/netty4/Netty4HttpRequestStreamIT.java

+                            }
+                        };
+
+                        contentStream.setHandler(contentConsumer); // must setup handler before first request


I wonder, rather than another layer of indirection here could we add a chunk-consuming method to RestChannelConsumer? Or possibly return something from prepareRequest which implements both RestChannelConsumer and a new RequestBodyChunkConsumer method to indicate we want to process a streaming body?

I added RequestBodyChunkConsumer

public interface RequestBodyChunkConsumer extends RestChannelConsumer { void handleChunk(RestChannel channel, ReleasableBytesReference chunk, boolean isLast); }

and wiring for handler into body stream in handleRequest

if (request.isStreamedContent()) { assert action instanceof RequestBodyChunkConsumer; var chunkConsumer = (RequestBodyChunkConsumer) action; request.contentStream().setHandler((chunk, isLast) -> chunkConsumer.handleChunk(channel, chunk, isLast)); }

implementation looks like this

@Override protected RestChannelConsumer prepareRequest(RestRequest request, NodeClient client) { return new RequestBodyChunkConsumer() { int totalBytes = 0; @Override public void handleChunk(RestChannel channel, ReleasableBytesReference chunk, boolean isLast) { try (chunk) { totalBytes += chunk.length(); if (isLast == false) { request.contentStream().requestBytes(1024); } else { channel.sendResponse(new RestResponse(RestStatus.OK, Integer.toString(totalBytes))); } } } @Override public void accept(RestChannel channel) throws Exception { request.contentStream().requestBytes(1024); // ask for first chunk } }; }

...y4/src/internalClusterTest/java/org/elasticsearch/http/netty4/Netty4HttpRequestStreamIT.java

server/src/main/java/org/elasticsearch/http/HttpContent.java

...y4/src/internalClusterTest/java/org/elasticsearch/http/netty4/Netty4HttpRequestStreamIT.java

server/src/main/java/org/elasticsearch/http/HttpContent.java

server/src/main/java/org/elasticsearch/http/HttpClientStatsTracker.java

ywangd · 2024-07-31T07:33:20Z

Sorry for being slow on the review. I started to read it and find that I need more time to even understand what is going on in this newer version. If @DaveCTurner and you can move fast, definitely no need to block on me. I still plan to review it which is also a learning opportunity for me. Thanks!

server/src/main/java/org/elasticsearch/http/HttpContent.java

nicktindall · 2024-08-01T05:20:45Z

...ransport-netty4/src/main/java/org/elasticsearch/http/netty4/Netty4HttpPipeliningHandler.java

+                                Unpooled.EMPTY_BUFFER,
+                                request.headers(),
+                                EmptyHttpHeaders.INSTANCE
+                            ),


Are we just using a FullHttpRequest because this is a POC, and to minimise the change? HttpRequest would seem more appropriate here.

to minimise the change?

Yes, it's one of the non-essential small things that touches many lines of code and bloat PR. Still need to address this, no reason to pass FullHttpRequest here.

ywangd

I had another look and understand it better now. Left a few comment including a semi-important one for potential breaking change for audit logs. Thanks!

modules/transport-netty4/src/main/java/org/elasticsearch/http/netty4/Netty4HttpRequest.java

...ransport-netty4/src/main/java/org/elasticsearch/http/netty4/Netty4HttpPipeliningHandler.java

ywangd · 2024-08-05T08:17:10Z

...ransport-netty4/src/main/java/org/elasticsearch/http/netty4/Netty4HttpPipeliningHandler.java

+                        netty4HttpRequest = new Netty4HttpRequest(readSequence++, fullHttpRequest);
+                    } else {
+                        var contentStream = new Netty4HttpRequestBodyStream(ctx.channel());
+                        currentRequestStream = contentStream;


Similarly, can we assert currentRequestStream is null here or it can be non-null?

It can be non-null if previous request was stream too. When we see HttpRequest that means we received all parts of previous request - all HttpContent's and single LastHttpContent. At this point previous parts should be either in previous stream queue or processed by rest handler.

It can be non-null if previous request was stream too

We set currentRequestStream back to null on receiving LastHttpContent. Or do you mean that may not necessary happen for every streaming request?

Right, sorry I lost track of my own changes :) In normal circumstances should be null. If request is not properly terminated (no last content) and there is new request then currentRequestStream might be not null, but it will end up with decoding failure and connection shutdown. I will add test.

++ to a test, but could we also assert currentRequestStream == null here? It would be useful documentation if nothing else.

ywangd · 2024-08-05T08:20:11Z

...ransport-netty4/src/main/java/org/elasticsearch/http/netty4/Netty4HttpPipeliningHandler.java

-                netty4HttpRequest = new Netty4HttpRequest(readSequence++, fullHttpRequest);
+                handlePipelinedRequest(ctx, netty4HttpRequest);
+            } finally {
+                activityTracker.stopActivity();


I think the activityTracker needs to include the else branch for stream process?

Still figuring out activity tracker change. When there is enough data in channel and we turned off auto-read then next channel.read() will be executed right a way in same thread and stack. That means we hit activity tracker multiple times due to recursive calls in netty. It trips our assertions that thread is already active.
This recursion should not be an issue, since read-buffer is 64kb, and chunks would be about 8kb, so up to 8 recursive calls might happen.

We need to make ActivityTracker reentrant for tracking write-path activity anyway, may as well do that here too:

public boolean maybeStartActivity() { assert trackedThread == Thread.currentThread() : trackedThread.getName() + " vs " + Thread.currentThread().getName(); if (isIdle(get())) { startActivity(); return true; } else { return false; } }

After my changes in Netty4HttpRequestBodyStream that incorporated flow control and queue its no longer an issue. Moved start/end activity to it's original place. * scratching my head *

...ransport-netty4/src/main/java/org/elasticsearch/http/netty4/Netty4HttpRequestBodyStream.java

server/src/main/java/org/elasticsearch/rest/RestRequest.java

ywangd · 2024-08-05T08:43:54Z

server/src/main/java/org/elasticsearch/rest/RestRequest.java

+        if (httpRequest.body().isFull()) {
+            return httpRequest.body().asFull().bytes();
+        } else {
+            return BytesArray.EMPTY;
+        }


Hmm this now has a problem with audit logs which can be configured to log requests body. It is not really advisable to do that for bulk requests. But we do see that from time to time. The streaming here now makes it impossible to log the request body. Unless we find a workaround, it would be a breaking change.

It's possible to log with stream. You can wrap stream handlers one into another and do metering, logging, sniffing, even filtering. Might be not too friendly, pseudo-code:

currentHandler = httpRequest.body().contentStream().handler(); loggingBuffer = new SomeByteBuffer(); httpRequest.body().contentStream().setHandler((chunk, isLast) -> { copyChunk = chunk.copy(); loggingBuffer.add(copyChunk); if (isLast) { log.info("content={}", loggingBuffer); } currentHandler.onNext(chunk, isLast); });

Not sure how viable this would be with how auditing works today. The request payload is logged when the Rest handler is invoked. At this point, we simply don't have the full body.

We can add a bit of logic into HttpAggregator, when logging is enabled do full aggregation. But it's confusing side effect of enabling logging. Yet not breaking.

...ransport-netty4/src/main/java/org/elasticsearch/http/netty4/Netty4HttpRequestBodyStream.java

...ternalClusterTest/java/org/elasticsearch/http/netty4/Netty4IncrementalRequestHandlingIT.java

Rename `xContent.streamSeparator()` and `RestHandler.supportsStreamContent()` to `xContent.bulkSeparator()` and `RestHandler.supportsBulkContent()`. I want to reserve use of "supportsStreamContent" for current work in HTTP layer to [support incremental content handling](#111438) besides fully aggregated byte buffers. `supportsStreamContent` would indicate that handler can parse chunks of http content as they arrive.

mhl-b · 2024-08-10T00:17:54Z

@DaveCTurner @ywangd @nicktindall @Tim-Brooks @henningandersen
A resume of current state. Below list of tasks and conclusions that have been identified and made so far.
If there no major concerns with current code I propose to merge it into feature branch.

Remaining work from comments and discussions outside of PR:

100-continue and 413/417 for oversized content. Since I bypass HttpObjectAggregator I need to handle them explicitly
request body logging for audit trail. Might be a painful one
request body metrics in HttpClientStatsTracker
content length estimation for circuit breaker
connect rest controller and http aggregator to decide which routes can bypass aggregation (or remove aggregator)
redundancy in Netty4HttpRequest that requires FullHttpRequest
control flow before/after decompression?
other places that expects full content?

There might be a cleaner path forward with HttpObjectAggregator removal. Use aggregation in rest handler with default behaviour - "aggregate all into single buffer", so all existing handlers wont notice the difference. There are other benefits of HttpObjectAggregator removal, such as dealing with 100/413/417 in correct pipelining order. Right now these responses might go back out-of-order, since aggregator is located before pipelining handler and not aware of it.

We are getting very close or already there with rest handler interface. There is no slicing or aggregation in netty, ByteBuf chunks will flow to the rest handler. There is only queueing in netty to protect downstream from receiving not-requested chunks, in practice should be few chunks.

The minimal set is stream.next() to request next network buffer and void handleChunk(RestChannel channel, ReleasableBytesReference chunk, boolean isLast) to process chunk. A more complex logic can be composed on top of it, like bulk content slicing based on separator.

new RequestBodyChunkConsumer(){
    public void handleChunk(RestChannel channel, ReleasableBytesReference chunk, boolean isLast) {
        processChunk(chunk);
        if (isLast == false) {
            channel.request().contentStream().next();
        }
    }
}

DaveCTurner

I think this is a good basis for the rest of the work in this area. I left a few superficial comments for now but they can be addressed in follow-ups.

DaveCTurner · 2024-08-10T14:17:54Z

...ransport-netty4/src/main/java/org/elasticsearch/http/netty4/Netty4HttpPipeliningHandler.java

+                        netty4HttpRequest = new Netty4HttpRequest(readSequence++, fullHttpRequest);
+                    } else {
+                        var contentStream = new Netty4HttpRequestBodyStream(ctx.channel());
+                        currentRequestStream = contentStream;


++ to a test, but could we also assert currentRequestStream == null here? It would be useful documentation if nothing else.

DaveCTurner · 2024-08-10T14:55:34Z

...ternalClusterTest/java/org/elasticsearch/http/netty4/Netty4IncrementalRequestHandlingIT.java

+            var t = queue.poll(SAFE_AWAIT_TIMEOUT.seconds(), TimeUnit.SECONDS);
+            assertNotNull("queue is empty", t);
+            return t;
+        } catch (InterruptedException e) {


nit: please restore interrupt status flag:

Suggested change

} catch (InterruptedException e) {

} catch (InterruptedException e) {

Thread.currentThread().interrupt();

DaveCTurner · 2024-08-10T14:59:16Z

...ternalClusterTest/java/org/elasticsearch/http/netty4/Netty4IncrementalRequestHandlingIT.java

+            for (int mb = 0; mb <= 50; mb += 10) {
+                var minBufSize = payloadSize - MBytes(10 + mb);
+                var maxBufSize = payloadSize - MBytes(mb);
+                assertBusy(() -> {


Do we need to assertBusy() here? If so, could you comment explaining why? I would expect that once the client's event loop is idle this assertion should pass straight away. Maybe there's no easy way to wait for the event loop to become idle tho?

DaveCTurner · 2024-08-10T15:00:40Z

...ternalClusterTest/java/org/elasticsearch/http/netty4/Netty4IncrementalRequestHandlingIT.java

+            for (int i = 0; i < 5; i++) {
+                ctx.clientChannel.writeAndFlush(randomContent(MBytes(10), false));
+            }
+            ctx.clientChannel.writeAndFlush(LastHttpContent.EMPTY_LAST_CONTENT);


Can we assert that this future's isDone() returns false because of the backpressure?

I will add assertion, but I dont think it's a reliable indicator of backpressure. Even without backpressure, flushing 50MB might return false immediately, flush is not a blocking operation and there might be a few flushes to kernel buffer before netty's channel buffer will be drained.

DaveCTurner · 2024-08-10T15:01:51Z

...ternalClusterTest/java/org/elasticsearch/http/netty4/Netty4IncrementalRequestHandlingIT.java

+
+    static HttpRequest httpRequest(String uri, String opaqueId, int contentLength) {
+        var req = new DefaultHttpRequest(HTTP_1_1, POST, uri);
+        req.headers().add(CONTENT_LENGTH, contentLength);


Could we also have some chunked requests which don't specify the content-length up front?

DaveCTurner · 2024-08-10T15:06:57Z

...ternalClusterTest/java/org/elasticsearch/http/netty4/Netty4IncrementalRequestHandlingIT.java

+
+        static final String ROUTE = "/_test/request-stream";
+
+        static final ConcurrentHashMap<String, ServerRequestHandler> handlers = new ConcurrentHashMap<>();


I expect there'll only be one active handler at once, so I think we could use that to simplify things along the lines of org.elasticsearch.rest.ChunkedZipResponseIT.RandomZipResponsePlugin#responseRef.

Suppose to be used by pipelining tests where multiple requests in fly. I havent added test yet.

Rename `xContent.streamSeparator()` and `RestHandler.supportsStreamContent()` to `xContent.bulkSeparator()` and `RestHandler.supportsBulkContent()`. I want to reserve use of "supportsStreamContent" for current work in HTTP layer to [support incremental content handling](elastic#111438) besides fully aggregated byte buffers. `supportsStreamContent` would indicate that handler can parse chunks of http content as they arrive.

This commit back ports all of the work introduced in: #113044 * #111438 - 5e1f655 * #111865 - 478baf1 * #112179 - 1b77421 * #112227 - cbcbc34 * #112267 - c00768a * #112154 - a03fb12 * #112479 - 95b42a7 * #112608 - ce2d648 * #112629 - 0d55dc6 * #112767 - 2dbbd7d * #112724 - 58e3a39 * dce8a0b * #112974 - 92daeeb * 529d349 * #113161 - e3424bd

mhl-b added 2 commits July 29, 2024 17:57

add http request content stream support

2ff3afa

add comments and fix last chunk aggregation

07f97cc

elasticsearchmachine added the needs:triage Requires assignment of a team area label label Jul 30, 2024

mhl-b added >enhancement :Distributed/Network Http and internode communication implementations Team:Distributed Meta label for distributed team. v8.16.0 and removed needs:triage Requires assignment of a team area label labels Jul 30, 2024

mhl-b requested review from DaveCTurner and ywangd July 30, 2024 02:21

mhl-b mentioned this pull request Jul 30, 2024

Add support for partial http requests handling with pipelining #111258

Closed

mhl-b requested a review from Tim-Brooks July 30, 2024 02:23

change composite buffer type

5d44698

pxsalehi reviewed Jul 30, 2024

View reviewed changes

modules/transport-netty4/src/main/java/org/elasticsearch/http/netty4/Netty4HttpAggregator.java Show resolved Hide resolved

DaveCTurner reviewed Jul 30, 2024

View reviewed changes

ywangd reviewed Jul 31, 2024

View reviewed changes

server/src/main/java/org/elasticsearch/http/HttpContent.java Outdated Show resolved Hide resolved

ywangd reviewed Jul 31, 2024

View reviewed changes

server/src/main/java/org/elasticsearch/http/HttpClientStatsTracker.java Outdated Show resolved Hide resolved

DaveCTurner reviewed Jul 31, 2024

View reviewed changes

server/src/main/java/org/elasticsearch/http/HttpContent.java Outdated Show resolved Hide resolved

update stream interface

839e575

nicktindall reviewed Aug 1, 2024

View reviewed changes

mhl-b added 6 commits August 2, 2024 19:18

add backpressure integ test

83bec15

poke ci

14a06ce

remove flow control handler

fbf5d28

cleanup

116e802

update documentation

d34d47f

simplify conditions around last content

90c8b5d

ywangd reviewed Aug 5, 2024

View reviewed changes

release queued buffers on channel close evt

cd53cd7

mhl-b added 2 commits August 7, 2024 18:40

change requestBytes(bytes) to next()

7b66986

Merge remote-tracking branch 'upstream/main' into partial-rest-content

e0899b5

mhl-b requested review from a team as code owners August 8, 2024 01:42

mhl-b removed request for a team August 8, 2024 01:44

mhl-b mentioned this pull request Aug 8, 2024

Rename streamContent/Separator to bulkContent/Separator #111716

Merged

mhl-b added 2 commits August 9, 2024 10:29

Merge remote-tracking branch 'upstream/main' into partial-rest-content

6dc8e46

address comments and test changes

f140b7d

DaveCTurner reviewed Aug 10, 2024

View reviewed changes

mhl-b added 2 commits August 12, 2024 13:38

add assertions and comments

660e4ca

Merge remote-tracking branch 'upstream/main' into partial-rest-content

aada8ca

mhl-b merged commit d3591ef into elastic:partial-rest-requests Aug 13, 2024

Tim-Brooks pushed a commit that referenced this pull request Sep 17, 2024

Add http request content stream support (#111438)

6e6b2af

Tim-Brooks pushed a commit that referenced this pull request Sep 18, 2024

Add http request content stream support (#111438)

7274053

Tim-Brooks pushed a commit that referenced this pull request Sep 18, 2024

Add http request content stream support (#111438)

5e1f655

Tim-Brooks pushed a commit to Tim-Brooks/elasticsearch that referenced this pull request Sep 19, 2024

Add http request content stream support (elastic#111438)

35dd405

Tim-Brooks mentioned this pull request Sep 19, 2024

Backport incremental bulk execution #113215

Merged

	} catch (InterruptedException e) {
	} catch (InterruptedException e) {
	Thread.currentThread().interrupt();


		static final String ROUTE = "/_test/request-stream";

		static final ConcurrentHashMap<String, ServerRequestHandler> handlers = new ConcurrentHashMap<>();

Conversation

mhl-b commented Jul 30, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

elasticsearchmachine commented Jul 30, 2024

Uh oh!

Uh oh!

DaveCTurner left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ywangd commented Jul 31, 2024

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ywangd left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

mhl-b Aug 5, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

mhl-b Aug 5, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

mhl-b Aug 5, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

mhl-b Aug 9, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

mhl-b commented Aug 10, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

DaveCTurner left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

mhl-b commented Jul 30, 2024 •

edited

Loading

mhl-b Aug 5, 2024 •

edited

Loading

mhl-b Aug 5, 2024 •

edited

Loading

mhl-b Aug 5, 2024 •

edited

Loading

mhl-b Aug 9, 2024 •

edited

Loading

mhl-b commented Aug 10, 2024 •

edited

Loading