-
Notifications
You must be signed in to change notification settings - Fork 3.9k
Description
What is the problem you're trying to solve
We launch some of our hosts on Azure where we sometimes see failed deployments with containerd having in the logs only:
Mar 28 03:37:39 worker-4-393118301-cf37 containerd[2872]: time="2020-03-28T03:37:39.648934594Z" level=info msg="PullImage "quay.io/thanos/thanos:v0.8.1""
and this is it. We suspect that the Azure networking is having issues, but we do not have any proof for it as containerd does not give any warnings whatsoever that the download is still in progress e.g. every 3-5 minutes. In my test lab silently downloading container image for 18 minutes straight was not a problem for containerd - it will happily keep downloading images no matter how long it takes without any warning.
Describe the solution you'd like
Would it be possible to add some more logging around the process of downloading the image and e.g. have periodical messages with the information how much data was downloaded and with which speed? This would give us a "smoking gun" kind of a proof that we can act upon and would prove that containerd is not at fault here.
Additional context
NONE