
router: add x-envoy-overloaded header#1793

Merged
mattklein123 merged 2 commits into master from overloaded_header on Oct 3, 2017

Conversation

@mattklein123
Member

This change reduces retry explosion:

  1. A retry is never performed if local circuit breaking occurred.
  2. If traffic was dropped (circuit breaking or maintenance mode),
     Envoy will set an x-envoy-overloaded header. (Right now it is
     set to "true", which has no real meaning. We might define more
     values in the future.)
  3. Envoy will not retry if it sees the x-envoy-overloaded header
     set by the upstream. Of course, this header can be propagated
     downstream by the caller if desired, and applications other than
     Envoy can set this header. However, even in the Envoy <-> Envoy
     single-hop case this can reduce retry explosion when under
     duress.

One downside of this change is that a single bad upstream might
create a situation in which retries that might otherwise succeed do
not happen. In general, it seems better to err on the side of fewer
retries; single bad hosts should be ejected via proper outlier
detection policies.

Signed-off-by: Matt Klein <mklein@lyft.com>

return RetryStatus::NoOverflow;
}

if (!runtime_.snapshot().featureEnabled("upstream.use_retry", 100)) {
Member Author


This was moved for perf reasons only. No point in doing lookup unless we are actually going to retry.

danielhochman previously approved these changes Oct 2, 2017
htuch previously approved these changes Oct 3, 2017
Member

@htuch left a comment


LGTM, some nits, feel free to ship whenevs.

bool RetryStateImpl::wouldRetry(const Http::HeaderMap* response_headers,
const Optional<Http::StreamResetReason>& reset_reason) {
// We never retry if the overloaded header is set.
if (response_headers && response_headers->EnvoyOverloaded() != nullptr) {
Member


Tiny nit: the first clause is a pointer and relies on implicit null checking, the second is also presumably a pointer and does an explicit nullptr comparison. So, might be nice to be consistent there.

// This is a customized version of send local reply that allows us to set the overloaded
// header.
Http::Utility::sendLocalReply(
[&](Http::HeaderMapPtr&& headers, bool end_stream) -> void {
Member


Nit: prefer explicit capture.

upstream_host->stats().rq_error_.inc();
}
Http::Utility::sendLocalReply(*callbacks_, stream_destroyed_, code, body);
sendLocalReply(code, body, dropped);
Member


I think there are some other sites for sendLocalReply in this file, but maybe not all apply.

Signed-off-by: Matt Klein <mklein@lyft.com>
@mattklein123 mattklein123 dismissed stale reviews from htuch and danielhochman via 2cfc90e October 3, 2017 15:23
@mattklein123
Member Author

@htuch updated

@mattklein123 mattklein123 merged commit 659ce34 into master Oct 3, 2017
@mattklein123 mattklein123 deleted the overloaded_header branch October 3, 2017 16:22
rshriram pushed a commit to rshriram/envoy that referenced this pull request Oct 30, 2018
mathetake pushed a commit that referenced this pull request Mar 3, 2026
**Description**

Extend the time-to-first-token histogram bucket boundaries from a 10s
max to a 60s max to capture slow responses from upstream providers.
Adds buckets: 15.0, 20.0, 30.0, 45.0, 60.0.

Signed-off-by: Vein Kong <vk@modular.com>