config: add fallback timeout to tls_inspector by lambdai · Pull Request #7853 · envoyproxy/envoy

lambdai · 2019-08-07T17:30:03Z

Description:

Istio is seeing MySQL connections rejected when deprecating bind_to_port=false listeners by migrating to a listener with a large number of filter chains.

tls_inspector always expects some bytes from the client. Until they arrive, it blocks the filter chain match.
Some application protocols require the server to send the handshake packet first, e.g. MySQL.
That means two filter chains, one requiring the tls_inspector listener filter and the other expecting MySQL traffic,
can never be listed in the same listener.
If a MySQL client connects, the connection is handled by the Envoy listener and blocked at tls_inspector::onAccept()
before any filter chain match is attempted.

One solution is to let tls_inspector fall back to treating the connection as plain text after a certain timeout (e.g. 10ms). (Not implemented yet; prototype: https://github.com/silentdai/envoy/tree/toimpl)

The Envoy listener may then wrongly classify a TLS connection as plain TCP, and it is the matched filter chain's responsibility to either reject the connection or cope with it.
Connections matching the MySQL filter chain would see the fallback timeout above (roughly 10ms) as a delay.

Backward compatibility: if the tls_inspector config proto does not set the fallback timeout field, the timeout defaults to infinity.
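
To make the proposed behavior concrete, here is a minimal Python sketch. It is not Envoy code and not part of this PR. The names `sniff_transport` and `fallback_timeout` are hypothetical, and the sketch assumes a connection counts as TLS when its first byte is 0x16 (the TLS handshake record type), which is far cruder than what tls_inspector actually parses. The labels "tls" and "raw_buffer" are borrowed from Envoy's transport protocol names. A timeout of `None` waits forever, mirroring the proposed default.

```python
import select
import socket

TLS_HANDSHAKE_RECORD = 0x16  # first byte of a TLS record carrying a ClientHello


def sniff_transport(conn, fallback_timeout=None):
    """Classify a connection by peeking at its first byte (hypothetical helper).

    fallback_timeout=None blocks until the client sends something; a number of
    seconds falls back to "raw_buffer" if the client stays silent that long.
    """
    readable, _, _ = select.select([conn], [], [], fallback_timeout)
    if not readable:
        # The client sent nothing in time: assume a server-first protocol.
        return "raw_buffer"
    # Peek so that the matched filter chain still sees the byte later.
    first = conn.recv(1, socket.MSG_PEEK)
    if first and first[0] == TLS_HANDSHAKE_RECORD:
        return "tls"
    return "raw_buffer"


# A silent client (MySQL style: it waits for the server greeting first).
silent_client, silent_server = socket.socketpair()
silent_result = sniff_transport(silent_server, fallback_timeout=0.01)

# A client that opens with the start of a TLS handshake record.
tls_client, tls_server = socket.socketpair()
tls_client.sendall(b"\x16\x03\x01")
tls_result = sniff_transport(tls_server, fallback_timeout=0.01)

for s in (silent_client, silent_server, tls_client, tls_server):
    s.close()
```

With the default of `None`, the first call would never return for a silent client, which is the blocking behavior described above.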

Fix #7195

Risk Level: LOW
Testing:
Docs Changes: TODO
Release Notes: TODO

Signed-off-by: Yuchen Dai <silentdai@gmail.com>

repokitteh-read-only · 2019-08-07T17:32:15Z

CC @envoyproxy/api-shepherds: Your approval is needed for changes made to api/.

🐱

Caused by: #7853 was synchronize by silentdai.

see: more, trace.

lambdai · 2019-08-07T17:33:51Z

The http_inspector listener filter has the exact same issue. I will apply the same change to http_inspector once this is approved.

lambdai · 2019-08-07T17:34:37Z

Request for comment: @htuch @rshriram

Signed-off-by: Yuchen Dai <silentdai@gmail.com>

repokitteh-read-only · 2019-08-07T19:12:53Z

CC @envoyproxy/api-shepherds: Your approval is needed for changes made to api/.

🐱

Caused by: #7853 was synchronize by silentdai.

see: more, trace.

lizan

I don't like this approach on a per-listener-filter basis; the fix for #7195 should apply to all listener filters at the connection handler level.

lambdai · 2019-08-07T23:05:20Z

@lizan
I think it's hard to say whether Envoy could bypass the opinion of each listener filter. That is quite different from the decision to reject the connection.
If any listener filter cannot tolerate the bypass, we should not pass. Where do you give those listener filters the chance to say no?

rshriram · 2019-08-07T23:11:17Z

Do you really need that level of granularity though? Even in Istio, we are most likely to only ever set the same timeout value for every filter chain. There is actually no scenario where we would set the timeout for one chain but not for the other, because we do not know the protocol a priori.

IMO it is easier to go from a global option to a more fine-grained option (local overrides global) than it is to go from a local-only option to a global option. I don't mind either implementation; whichever is quicker works!

api/envoy/config/filter/listener/tls_inspector/v2alpha1/tls_inspector.proto

lambdai · 2019-08-07T23:17:24Z

@lizan Sorry, I may have misinterpreted your comment. I totally agree with having a single timeout for the listener.

What I am proposing is that if any listener filter cannot follow the fallback timeout, that listener filter can block in onAccept until Envoy decides to close the connection.

lizan · 2019-08-07T23:17:32Z

Let me pull up my WIP branch to a Draft PR soon.

> I think it's hard to say whether Envoy could bypass the opinion of each listener filter. That is quite different from the decision to reject the connection.
> If any listener filter cannot tolerate the bypass, we should not pass. Where do you give those listener filters the chance to say no?

We don't do this today. A StopIteration from a listener filter just means it wants more data; it is not an intent to decline the connection. We can always provide a closing function via ListenerFilterCallbacks that actively closes the connection, if there is any desire for that.

lizan · 2019-08-08T00:14:45Z

> We can always provide a closing function via ListenerFilterCallbacks that actively closes the connection, if there is any desire for that.

Actually, this is what continueFilterChain(false) does.
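
The semantics discussed in this exchange can be modeled roughly as follows. This is a hypothetical Python sketch, not Envoy's C++ API: `PendingConnection`, `continue_on_timeout`, and the two filter functions are invented names. The only behavior assumed is what the thread states: a StopIteration means "waiting for more data", continueFilterChain(false) closes the connection, and a single listener-level timeout either closes or continues depending on one flag.

```python
from enum import Enum


class FilterStatus(Enum):
    CONTINUE = "continue"
    STOP_ITERATION = "stop_iteration"  # the filter wants more data, not a rejection


class PendingConnection:
    """Hypothetical stand-in for a socket still passing through listener filters."""

    def __init__(self, filters, continue_on_timeout):
        self.filters = filters
        self.continue_on_timeout = continue_on_timeout
        self.outcome = None  # becomes "accepted" or "closed" once decided

    def start(self):
        for listener_filter in self.filters:
            if listener_filter(self) is FilterStatus.STOP_ITERATION:
                return  # parked until the filter decides or the timer fires
        self.continue_filter_chain(True)

    def continue_filter_chain(self, success):
        # The first decision wins; later calls are ignored.
        if self.outcome is None:
            self.outcome = "accepted" if success else "closed"

    def on_timeout(self):
        # One listener-level timer covers every listener filter.
        self.continue_filter_chain(self.continue_on_timeout)


def waits_for_client_bytes(conn):
    return FilterStatus.STOP_ITERATION


def rejects_connection(conn):
    conn.continue_filter_chain(False)  # the filter's way to "say no"
    return FilterStatus.STOP_ITERATION


legacy = PendingConnection([waits_for_client_bytes], continue_on_timeout=False)
legacy.start()
legacy.on_timeout()

lenient = PendingConnection([waits_for_client_bytes], continue_on_timeout=True)
lenient.start()
lenient.on_timeout()

vetoed = PendingConnection([rejects_connection], continue_on_timeout=True)
vetoed.start()
vetoed.on_timeout()
```

In this model a filter that actively declines still wins over the timeout, while a filter that is merely waiting gets bypassed when the flag is set.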

lambdai · 2019-08-08T00:31:32Z

I read through #7859. It pushes the flag continueOnListenerFiltersTimeout to the listener, but we do have something.
I have small concerns:

1. Ideally the timeout to close the connection and the timeout to continue could be two separate things... But Istio survives without a timeout to close, so I am fine with a single timeout value :)
2. Previously Envoy would either close the connection or go through each listener filter, so each listener filter had the opportunity to clean up in onAccept or on a file event. E.g. tls_inspector may have created a file event that is waiting to be triggered; with your current PR that file event is leaked or somehow triggered later. IMHO continueOnListenerFiltersTimeout should do something before calling newConnection().
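
A rough illustration of the cleanup concern, using Python's `selectors` module as a stand-in for Envoy's file events. That substitution is an assumption made for illustration only; Envoy's dispatcher API is different, and the variable names here are hypothetical.

```python
import selectors
import socket

sel = selectors.DefaultSelector()
client, server = socket.socketpair()

# The inspecting filter registers interest in the first client bytes.
sel.register(server, selectors.EVENT_READ, data="inspector waiting for bytes")

# The client stays silent, so the wait ends with no ready events.
timed_out = not sel.select(timeout=0.01)

if timed_out:
    # Drop the filter's registration before handing the socket on. Otherwise
    # the stale event could fire later on data meant for the filter chain.
    sel.unregister(server)

still_registered = server in sel.get_map()

sel.close()
client.close()
server.close()
```

Skipping the `unregister` call here corresponds to the leaked or late-firing file event described in the second concern.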

lambdai · 2019-08-08T00:32:34Z

I believe #7859 will eventually address my concern. Closing this PR.

lizan · 2019-08-08T00:33:21Z

> with your current PR that file event is leaked or somehow triggered later. IMHO continueOnListenerFiltersTimeout should do something before calling newConnection().

I spotted this while working on tests; it should be fixed.

lambdai · 2019-08-08T00:33:51Z

@lizan Thanks!

api: add timeout for tls_inspector

7249e24

Signed-off-by: Yuchen Dai <silentdai@gmail.com>

lambdai force-pushed the snifftoapi branch from 6d41f4a to 7249e24 · August 7, 2019 17:32

fix tests

eef9454

Signed-off-by: Yuchen Dai <silentdai@gmail.com>

lizan suggested changes Aug 7, 2019

rshriram reviewed Aug 7, 2019

api/envoy/config/filter/listener/tls_inspector/v2alpha1/tls_inspector.proto

lambdai closed this Aug 8, 2019

lizan mentioned this pull request Aug 9, 2019

listener: add an option to continue on listener filters timeout #7859

Merged
