Skip to content

DNS timeouts cause no healthy upstream #9927

@chrisgoffinet

Description

@chrisgoffinet

Title: DNS timeouts cause no healthy upstream

[optional Relevant Links:]
I have a test case outlined here on how to reproduce this. I have verified this happens on master branch too.

Bug Template

Description:
In the event that DNS resolver has transient errors (i.e timeout) envoy currently doesn't check the c-ares status of SUCCESS before overriding the address_list with an empty array. We have observed in production cases where we have healthy hosts, and the next time a DNS query happens if it were to timeout, our service ends up going down.

Repro steps:
Test case to reproduce. a simple iptables block on the DNS resolver will do.

https://github.com/chrisgoffinet/envoy-dns
Patch: chrisgoffinet@950c734

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions