[http] Move HTTP1 request flood checks from response encode to request decode. by antoniovicente · Pull Request #10475 · envoyproxy/envoy

antoniovicente · 2020-03-21T02:26:36Z

Description: Move HTTP1 request flood checks from response encode to request decode.
Decode of a new request is guaranteed to happen on a shallow call stack under HCM, which makes it a better choice for flood checks since it reduces the number of unique call stacks under which FrameFloodException may be thrown.
Fixes several cases where FrameFloodException was thrown outside a try/catch block, resulting in an Envoy crash.
Risk Level: low
Testing: unit and integration
Docs Changes: n/a
Release Notes: n/a
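To illustrate the idea in the description, here is a minimal sketch of a flood check that runs when a new request starts decoding. It is a toy model under stated assumptions, not Envoy's codec: `ToyServerConnection`, `dispatchNewRequest`, `onMessageBegin`, `encodeResponse`, `onResponseFlushed`, and the exception message are hypothetical names chosen for the example. Only the `FrameFloodException` name comes from the PR.

```cpp
#include <cassert>
#include <cstddef>
#include <stdexcept>

// Stand-in for the exception type named in the PR (hypothetical definition).
class FrameFloodException : public std::runtime_error {
public:
  using std::runtime_error::runtime_error;
};

// Toy model of an HTTP/1 server connection. All names are illustrative.
class ToyServerConnection {
public:
  explicit ToyServerConnection(std::size_t max_outbound_responses)
      : max_outbound_responses_(max_outbound_responses) {
    assert(max_outbound_responses > 0);
  }

  // Single, shallow entry point for decoding a new request. The flood check
  // can only throw from underneath this try/catch.
  bool dispatchNewRequest() {
    try {
      onMessageBegin();
      return true;
    } catch (const FrameFloodException&) {
      closed_ = true;  // Close the connection instead of crashing.
      return false;
    }
  }

  // Encoding a response only counts it. It never throws, so callers on deep
  // or unusual call stacks do not need their own try/catch.
  void encodeResponse() noexcept { ++outbound_responses_; }

  // Called when a queued response has been written out to the socket.
  void onResponseFlushed() noexcept {
    if (outbound_responses_ > 0) {
      --outbound_responses_;
    }
  }

  bool closed() const noexcept { return closed_; }

private:
  // Request-decode hook: reject the new request if too many responses are
  // still queued.
  void onMessageBegin() {
    if (outbound_responses_ >= max_outbound_responses_) {
      throw FrameFloodException("too many queued responses");
    }
  }

  std::size_t max_outbound_responses_;
  std::size_t outbound_responses_{0};
  bool closed_{false};
};
```

In this sketch the only place the exception can originate is `onMessageBegin`, which is reachable solely through `dispatchNewRequest`, so a single try/catch covers every throw site.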

Signed-off-by: Antonio Vicente <avd@google.com>

alyssawilk

Cool, I think moving this is much simpler than the try / catch PR, but I think you need to update some comments and probably docs. The logs, thrown error message and stats all talk about too many responses, where now we're reacting to an incoming request when there are too many responses.

mattklein123 · 2020-03-23T18:02:47Z

+1 I prefer this solution. I think we will need to make similar changes to HTTP/2 but I'm not sure? cc @yanavlasov.

Increase the number of requests processed by IntegrationTest.TestManyBadRequests Signed-off-by: Antonio Vicente <avd@google.com>

antoniovicente · 2020-03-24T00:36:34Z

> Cool, I think moving this is much simpler than the try / catch PR, but I think you need to update some comments and probably docs. The logs, thrown error message and stats all talk about too many responses, where now we're reacting to an incoming request when there are too many responses.

I went through "git show f5b0294" and updated relevant comments and exception strings.

antoniovicente · 2020-03-24T03:11:41Z

> +1 I prefer this solution. I think we will need to make similar changes to HTTP/2 but I'm not sure? cc @yanavlasov.

The "throw FrameFloodException" sites in the H2 codec are protected by either dispatching_downstream_data_ or dispatching_, so I think uncaught exceptions shouldn't be a problem. I need to look into this further, but I think we have the problem of incrementOutboundFrameCount only throwing when dispatching_downstream_data_ is true. Calls to ConnectionImpl::addOutboundFrameFragment under an upstream read callback will buffer without limit (up to flow control limits).
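The concern raised in this comment can be sketched with a toy model. This is an illustration of the described behavior only, not the H2 codec's code: `ToyH2Connection`, `dispatchDownstream`, `onUpstreamData`, and `addOutboundFrame` are hypothetical names, and the guard simply mirrors the comment's description of a limit that is enforced only while `dispatching_downstream_data_` is true.

```cpp
#include <cassert>
#include <cstddef>
#include <stdexcept>

// Stand-in for the exception type named in the PR (hypothetical definition).
class FrameFloodException : public std::runtime_error {
public:
  using std::runtime_error::runtime_error;
};

// Toy model of a connection whose outbound frame limit is only enforced
// while downstream data is being dispatched. All names are illustrative.
class ToyH2Connection {
public:
  explicit ToyH2Connection(std::size_t max_outbound_frames)
      : max_outbound_frames_(max_outbound_frames) {
    assert(max_outbound_frames > 0);
  }

  // Downstream read path: the flag is set, so the limit applies and any
  // flood exception is caught here. Returns false if a flood was detected.
  bool dispatchDownstream(std::size_t frames_to_queue) {
    dispatching_downstream_data_ = true;
    bool ok = true;
    try {
      for (std::size_t i = 0; i < frames_to_queue; ++i) {
        addOutboundFrame();
      }
    } catch (const FrameFloodException&) {
      ok = false;
    }
    dispatching_downstream_data_ = false;
    return ok;
  }

  // Upstream read callback path: the flag is clear, so frames are queued
  // without the limit being applied.
  void onUpstreamData(std::size_t frames_to_queue) {
    for (std::size_t i = 0; i < frames_to_queue; ++i) {
      addOutboundFrame();
    }
  }

  std::size_t outboundFrames() const noexcept { return outbound_frames_; }

private:
  void addOutboundFrame() {
    ++outbound_frames_;
    // The check is skipped entirely outside downstream dispatch.
    if (dispatching_downstream_data_ && outbound_frames_ > max_outbound_frames_) {
      throw FrameFloodException("too many outbound frames");
    }
  }

  std::size_t max_outbound_frames_;
  std::size_t outbound_frames_{0};
  bool dispatching_downstream_data_{false};
};
```

With a limit of 3, calling `onUpstreamData(10)` on this model queues all ten frames without complaint, and the flood is only noticed on the next `dispatchDownstream` call.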

mattklein123

Awesome, thanks. Will defer to @alyssawilk for further review.

alyssawilk

hm, I thought it was worth updating more docs but honestly I think this is clear enough. One fix to (prior) comments and you're good to go!

source/common/http/http1/codec_impl.cc

Signed-off-by: Antonio Vicente <avd@google.com>

mattklein123 · 2020-03-25T16:34:30Z

@antoniovicente it looks like on TSAN IpVersions/IntegrationTest.TestFloodUpstreamErrors/IPv6 hung. Can you maybe try locally with --runs_per_test just to see if we now have a flake? Thank you.

/wait

alyssawilk

also LGTM modulo CI

… connection backs up and the response flood can be detected in IntegrationTest.TestFloodUpstreamErrors Signed-off-by: Antonio Vicente <avd@google.com>

antoniovicente · 2020-03-25T21:54:04Z

> @antoniovicente it looks like on TSAN IpVersions/IntegrationTest.TestFloodUpstreamErrors/IPv6 hung. Can you maybe try locally with --runs_per_test just to see if we now have a flake? Thank you.
>
> /wait

It is a consistent failure. The test case I added requires processing over a thousand tiny responses before it fills the TCP connection buffer. Setting SO_RCVBUF to 1024 on the client connection reduces the time until the socket backs up. In opt mode the test case runs in about 2 seconds, but under TSAN it takes 20 seconds for each of IPv4/IPv6 (down from 120 seconds each before the SO_RCVBUF change).
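For readers unfamiliar with the socket option mentioned above, the following is a minimal sketch of shrinking a TCP socket's receive buffer with plain POSIX calls. It does not show how the integration test itself applies the option; `makeSmallRecvBufSocket` and `effectiveRecvBuf` are hypothetical helper names for this example.

```cpp
#include <cassert>
#include <netinet/in.h>
#include <sys/socket.h>
#include <unistd.h>

// Creates a TCP socket and requests a small receive buffer so that the peer's
// writes back up sooner. Returns the file descriptor, or -1 on failure.
int makeSmallRecvBufSocket(int requested_bytes) {
  assert(requested_bytes > 0);
  const int fd = ::socket(AF_INET, SOCK_STREAM, 0);
  if (fd < 0) {
    return -1;
  }
  if (::setsockopt(fd, SOL_SOCKET, SO_RCVBUF, &requested_bytes,
                   sizeof(requested_bytes)) != 0) {
    ::close(fd);
    return -1;
  }
  return fd;
}

// Reads back the buffer size the kernel actually applied, or -1 on failure.
// The kernel may round or adjust the requested value, so the result is not
// guaranteed to equal the request.
int effectiveRecvBuf(int fd) {
  int value = 0;
  socklen_t len = sizeof(value);
  if (::getsockopt(fd, SOL_SOCKET, SO_RCVBUF, &value, &len) != 0) {
    return -1;
  }
  return value;
}
```

A caller would set the option before connecting, for example `int fd = makeSmallRecvBufSocket(1024);`, and close the descriptor with `::close(fd)` when done.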

Signed-off-by: Antonio Vicente <avd@google.com>

antoniovicente added 2 commits March 20, 2020 22:17

fix comment

26b74b3

Signed-off-by: Antonio Vicente <avd@google.com>

mattklein123 assigned mattklein123, alyssawilk and yanavlasov Mar 22, 2020

alyssawilk reviewed Mar 23, 2020

mattklein123 added the waiting label Mar 23, 2020

Update comments and exception doc string.

2ead1f5

Increase the number of requests processed by IntegrationTest.TestManyBadRequests Signed-off-by: Antonio Vicente <avd@google.com>

repokitteh-read-only bot removed the waiting label Mar 24, 2020

mattklein123 previously approved these changes Mar 24, 2020

alyssawilk reviewed Mar 24, 2020

source/common/http/http1/codec_impl.cc (outdated, resolved)

fix caps

1d62511

Signed-off-by: Antonio Vicente <avd@google.com>

antoniovicente dismissed mattklein123’s stale review via 1d62511 March 24, 2020 23:39

antoniovicente mentioned this pull request Mar 24, 2020

[http] Gracefully handle HTTP request flood exceptions thrown from ConnectionManagerImpl::ActiveStream::sendLocalReply #10438

Closed

mattklein123 previously approved these changes Mar 25, 2020

repokitteh-read-only bot added the waiting label Mar 25, 2020

alyssawilk previously approved these changes Mar 25, 2020

Reduce TCP receive buffer on client connection to speed up time until…

ce1cb23

… connection backs up and the response flood can be detected in IntegrationTest.TestFloodUpstreamErrors Signed-off-by: Antonio Vicente <avd@google.com>

antoniovicente dismissed stale reviews from alyssawilk and mattklein123 via ce1cb23 March 25, 2020 21:53

repokitteh-read-only bot removed the waiting label Mar 25, 2020

antoniovicente changed the title ~~Move HTTP1 request flood checks from response encode to request decode.~~ [http] Move HTTP1 request flood checks from response encode to request decode. Mar 26, 2020

fix build

0845a67

Signed-off-by: Antonio Vicente <avd@google.com>

alyssawilk approved these changes Mar 26, 2020

mattklein123 merged commit bdd849c into envoyproxy:master Mar 26, 2020

asraa mentioned this pull request Apr 21, 2020

[HTTP] High level design: Remove exceptions in codecs #10878

Closed

9 tasks

mattklein123 merged 6 commits into envoyproxy:master from antoniovicente:flood_protection_on_decode
