Decouple removing the fileWatcher from reading by tombooth · Pull Request #27782 · moby/moby

tombooth · 2016-10-26T13:09:17Z

Description

When a container exits the state change will bubble up through libcontainerd into the daemon causing the container to be reset. As part of this it will close the LogDriver associated with the container and the driver will close all of its children. We have observed a race condition between closing a reader and receiving the final data from the log.

followLogs will receive a IN_MODIFY event while it is trying to close the fileWatcher. This causes a deadlock as https://github.com/docker/docker/blob/master/vendor/src/gopkg.in/fsnotify.v1/inotify.go#L266 is trying to put the modify event on the Events channel, but because https://github.com/docker/docker/blob/master/daemon/logger/jsonfilelog/read.go#L235 is trying to process the close it cannot take the event off the channel. Remove is blocked because it is waiting on a condition to be broadcast https://github.com/docker/docker/blob/master/vendor/src/gopkg.in/fsnotify.v1/inotify.go#L157, this would be triggered by receiving an IN_IGNORE event https://github.com/docker/docker/blob/master/vendor/src/gopkg.in/fsnotify.v1/inotify.go#L289, but due to the code stuck trying to push the IN_MODIFY onto the Events channel this is never processed.

We initially saw this behaviour with v1.12.1, but have replicated it building the daemon from master. Additionally, we first observed this using the HTTP API to follow the logs, but the same behaviour can been seen with docker logs -f

Here is a gist that should show you the order of events that is causing this deadlock: https://gist.github.com/tombooth/e38175f07233c683704528cf406c3120

Steps to reproduce the issue:

Run the attached script, eventually the docker logs -f command will hang

#!/bin/bash

while true
do
        container_id="$(docker run -d ubuntu:latest /bin/bash -e -c "echo -n \"foo\"")"
        docker logs -f "${container_id}"
done

Describe the results you received:
The logs function hung

Describe the results you expected:
It should have exited

Additional information you deem important (e.g. issue happens only occasionally):
This doesn't occur on every execution of the the loop block. On my local machine, running Linux in a container, it seems to hang one in two-three times.

Output of docker version:

Client:
Version: 1.12.2
API version: 1.24
Go version: go1.6.3
Git commit: bb80604
Built: Tue Oct 11 18:29:41 2016
OS/Arch: linux/amd64

Server:
Version: 1.12.2
API version: 1.24
Go version: go1.6.3
Git commit: bb80604
Built: Tue Oct 11 18:29:41 2016
OS/Arch: linux/amd64

Output of docker info:

Containers: 209
Running: 1
Paused: 0
Stopped: 208
Images: 109
Server Version: 1.12.2
Storage Driver: aufs
Root Dir: /var/lib/docker/aufs
Backing Filesystem: extfs
Dirs: 505
Dirperm1 Supported: true
Logging Driver: json-file
Cgroup Driver: cgroupfs
Plugins:
Volume: local
Network: bridge host null overlay
Swarm: inactive
Runtimes: runc
Default Runtime: runc
Security Options: apparmor seccomp
Kernel Version: 4.4.0-45-generic
Operating System: Ubuntu 16.04.1 LTS
OSType: linux
Architecture: x86_64
CPUs: 4
Total Memory: 3.859 GiB
Name: vagrant
ID: QXD3:X4JO:E52F:CWCS:IY3J:RN36:EV36:I2KK:7JH4:WOJ3:MSL5:7O7P
Docker Root Dir: /var/lib/docker
Debug Mode (client): false
Debug Mode (server): false
Registry: https://index.docker.io/v1/
WARNING: No swap limit support
Insecure Registries:
127.0.0.0/8

Additional environment details (AWS, VirtualBox, physical, etc.):

We've seen this on AWS, in VMWare Fusion and on a physical machine.

tombooth · 2016-10-26T13:52:09Z

I added in the initial commit because golint failed by build because it claimed we didn't need the cast. Now the build fails because the cast isn't there, is there a way of ignoring that lint fail?

cpuguy83 · 2016-10-26T14:04:33Z

Nice, I've observed this deadlock as well.
Good catch!

cpuguy83 · 2016-10-26T14:05:00Z

13:35:42 daemon/logger/jsonfilelog/read.go:123: cannot use bytes.NewBuffer(bytes.Join(ls, ([]byte)("\n"))) (type *bytes.Buffer) as type io.ReadSeeker in assignment:
13:35:42    *bytes.Buffer does not implement io.ReadSeeker (missing Seek method)

tombooth · 2016-10-26T14:34:18Z

@cpuguy83 see my comment above, I changed the signature of a reader because golint on your build servers didn't think the io.Reader from io.ReadSeeker cast was needed. It seems it was though as it fails the build. I'm not 100% sure what to do.

FYI without the removal of the cast the tests all happily passed locally.

tombooth · 2016-10-26T14:48:52Z

I could do something like:

diff --git a/daemon/logger/jsonfilelog/read.go b/daemon/logger/jsonfilelog/read.go
index df67fb0..d628dc3 100644
--- a/daemon/logger/jsonfilelog/read.go
+++ b/daemon/logger/jsonfilelog/read.go
@@ -113,7 +113,7 @@ func (l *JSONFileLogger) readLogs(logWatcher *logger.LogWatcher, config logger.R
 }

 func tailFile(f io.ReadSeeker, logWatcher *logger.LogWatcher, tail int, since time.Time) {
-   var rdr io.Reader = f
+   var rdr io.Reader
    if tail > 0 {
        ls, err := tailfile.TailFile(f, tail)
        if err != nil {
@@ -121,6 +121,8 @@ func tailFile(f io.ReadSeeker, logWatcher *logger.LogWatcher, tail int, since ti
            return
        }
        rdr = bytes.NewBuffer(bytes.Join(ls, []byte("\n")))
+   } else {
+       rdr = f.(io.Reader)
    }
    dec := json.NewDecoder(rdr)
    l := &jsonlog.JSONLog{}

That should keep both lint and the build happy.

tombooth · 2016-10-26T14:54:15Z

I've pushed that, so lets see what happens 🍀

cpuguy83 · 2016-10-26T15:56:49Z

daemon/logger/jsonfilelog/read.go

There should be no need to cast here.
Maybe just for simplicity (removing the else) we can do:

var rdr io.Reader rdr = f if tail > 0 { ... }

And the linter may be happy?

I'll let this build run to see if everything is happy and then make that change.

I've pushed the change. Last time it passed all test suites, so as this update has got passed lint I am hopeful 😄

`golint` had the following issue when linting this file: ``` daemon/logger/jsonfilelog/read.go:116:10: should omit type io.Reader from declaration of var rdr; it will be inferred from the right-hand side ``` In order to keep it happy changing it to an indirect assignment will still maintain the same functionality. Signed-off-by: Tom Booth <tombooth@gmail.com>

cpuguy83 · 2016-10-26T18:57:57Z

daemon/logger/jsonfilelog/read.go

Looking at the watch loop below, I think we need some way to cancel this goroutine since the loop may exit without logwatcher closing. May be time to whip out a context.

Good shout! I've just pushed an update to the code, does it look a little better? If it's not quite how you imagined let me know.

What do you reckon the chances of this making it into v1.13.0 are @cpuguy83 ?

cpuguy83 · 2016-10-26T22:53:09Z

daemon/logger/jsonfilelog/read.go

Maybe instead of creating this channel, we can do context.WithCancel(ctx) to create a nested context.

Agreed and done.

cpuguy83 · 2016-10-26T23:25:09Z

daemon/logger/jsonfilelog/read.go

Sorry, one more thing, I think this need to run in either case

It's no problem, I had assumed the deferred fileWatcher.Close() would have taken care of it but looking at the source it doesn't clear up any open inotify watchers so we would be leaking descriptors. Fix incoming.

:) defer filewatcher.Remove above the select, but no worries.

Sorry, it is a little late over here. I've just been having a think about the order of execution. If a fileWatcher.Remove happens after fileWatcher.Close it would likely hang as Remove would be waiting for an IN_IGNORE event but readEvents (in https://github.com/docker/docker/blob/362eb4cfbb09b5dc0fe648360a2e6d0546bf6613/vendor/src/gopkg.in/fsnotify.v1/inotify.go) would have exited because the watcher was closed so it would never receive it. This makes me a little concerned about the Remove after ctx.Done() as the fileWatcher.Close() in the second defer could have run by the time Remove is called.

I'm fairly new to Go so if that doesn't sounds right (could also be because I'm tried) let me know, otherwise I might switch to a fileWatcher.Remove before the Close in the defer above the go block. If it has already been removed it should have no issue, it will just return an error that we ignore anyway: https://github.com/docker/docker/blob/362eb4cfbb09b5dc0fe648360a2e6d0546bf6613/vendor/src/gopkg.in/fsnotify.v1/inotify.go#L135

cpuguy83 · 2016-10-26T23:44:24Z

LGTM if janky is happy

LK4D4 · 2016-10-27T22:30:50Z

daemon/logger/jsonfilelog/read.go

this is not compatible with go1.6 I think, you should use golang.org/x/net/context I think.

I've switched this, thanks for the pointer.

LK4D4 · 2016-10-27T22:49:12Z

daemon/logger/jsonfilelog/read.go

I think top context is unnecessary(maybe we'll pass it from api call later).
So, I think this line should look like:

ctx, cancel := context.WithCancel(ctx) defer cancel()

We need one to notify that the watcher is closed, and the other to notify that the goroutine needs to exit.

I don't really understand why we can't use one for both?

Other than multiple owners of cancel()? It's probably ok. I prefer the separation, but assuming nothing panics it seems ok.

I've moved it to a single context.

LK4D4 · 2016-10-31T15:47:29Z

LGTM

cpuguy83

LGTM

tombooth · 2016-11-01T11:51:17Z

Thanks @cpuguy83 @LK4D4 😄

michael-holzheu · 2016-11-09T19:29:08Z

Does this fix #16462 as well? /cc @michael-holzheu

Perhaps.

Unfortunately in the last months I never saw the TestLogsFollowSlowStdoutConsumer testcase failing again. I would be fine with closing #16462.

thaJeztah · 2016-11-09T20:03:38Z

thanks @michael-holzheu, let me close it for now, we can reopen if it occurs again

`golint` had the following issue when linting this file: ``` daemon/logger/jsonfilelog/read.go:116:10: should omit type io.Reader from declaration of var rdr; it will be inferred from the right-hand side ``` In order to keep it happy changing it to an indirect assignment will still maintain the same functionality. fix DTS2017062809604 backport from moby#27782 without conflicts. Signed-off-by: Tom Booth <tombooth@gmail.com> Signed-off-by: Yuanhong Peng <pengyuanhong@huawei.com>

Fixes moby#27779 Currently `followLogs` can get into a deadlock if we receive an inotify IN_MODIFY event while we are trying to close the `fileWatcher`. This is due to the fact that closing the `fileWatcher` happens in the same block as consumes events from the `fileWatcher`. We are trying to run `fileWatcher.Close`, which is waiting for an IN_IGNORE event to come in over inotify to confirm the watch was been removed. But, because an IN_MODIFY event has appeared after `Close` was entered but before the IN_IGNORE, the broadcast never comes. The IN_MODIFY cannot be consumed as the events channel is unbuffered and the only `select` that reads from it is busy waiting for the IN_IGNORE event. In order to try and fix this race condition I've moved the removal of the `fileWatcher` out to a separate go block that waits for a signal to close, removes the watcher and then signals to the previous selects on the close signal. This has introduced a `fileWatcher.Remove` in the final case, but if we try and remove a watcher that does not exist it will just return an error saying so. We are not doing any checking on the return of `Remove` so this shouldn't cause any side-effects. fix DTS2017062809604 backport from moby#27782 without conflicts. Signed-off-by: Tom Booth <tombooth@gmail.com> Signed-off-by: Yuanhong Peng <pengyuanhong@huawei.com>

Fix docker logs hang Steps to reproduce the issue: 1. Run the attached script, eventually the `docker logs -f` command will hang: ``` #!/bin/bash while true do container_id="$(docker run -d ubuntu:latest /bin/bash -e -c "echo -n \"foo\"")" docker logs -f "${container_id}" done ``` fix DTS2017062809604 backport from moby#31856 backport from moby#27782 backport from moby#22088 conflicts: daemon/attach.go daemon/logs.go daemon/logger/journald/read.go daemon/logger/jsonfilelog/read.go daemon/logs.go Signed-off-by: Jim Minter <jminter@redhat.com> Signed-off-by: Tom Booth <tombooth@gmail.com> Signed-off-by: Amit Krishnan <krish.amit@gmail.com> Signed-off-by: Yuanhong Peng <pengyuanhong@huawei.com> See merge request docker/docker!625

kolyshkin · 2018-08-29T01:39:38Z

For the sake of people reading this: the bug causing the issue is a bug in fsnotify package (fsnotify/fsnotify#195, fsnotify/fsnotify#123, fsnotify/fsnotify#115), which has since been fixed by commit fsnotify/fsnotify@4da3e2c "Fix deadlock in Remove (linux/inotify)". The fix has appeared in fsnotify v.1.4.7, which was vendored to moby/moby in commit 9f016c0.

The fix in this PR was just a workaround for the above fsnotify bug.

All this means the workarond can be now removed.

When daemon.ContainerLogs() is called with options.follow=true (as in "docker logs --follow"), the "loggerutils.followLogs()" function never returns (even then the logs consumer is gone). As a result, all the resources associated with it (including an opened file descriptor for the log file being read, two FDs for a pipe, and two FDs for inotify watch) are never released. If this is repeated (such as by running "docker logs --follow" and pressing Ctrl-C a few times), this results in DoS caused by either hitting the limit of inotify watches, or the limit of opened files. The only cure is daemon restart. Apparently, what happens is: 1. logs producer (a container) is gone, calling (*LogWatcher).Close() for all its readers (daemon/logger/jsonfilelog/jsonfilelog.go:175). 2. WatchClose() is properly handled by a dedicated goroutine in followLogs(), cancelling the context. 3. Upon receiving the ctx.Done(), the code in followLogs() (daemon/logger/loggerutils/logfile.go#L626-L638) keeps to send messages _synchronously_ (which is OK for now). 4. Logs consumer is gone (Ctrl-C is pressed on a terminal running "docker logs --follow"). Method (*LogWatcher).Close() is properly called (see daemon/logs.go:114). Since it was called before and due to to once.Do(), nothing happens (which is kinda good, as otherwise it will panic on closing a closed channel). 5. A goroutine (see item 3 above) keeps sending log messages synchronously to the logWatcher.Msg channel. Since the channel reader is gone, the channel send operation blocks forever, and resource cleanup set up in defer statements at the beginning of followLogs() never happens. Alas, the fix is somewhat complicated: 1. Distinguish between close from logs producer and logs consumer. To that effect, - yet another channel is added to LogWatcher(); - {Watch,}Close() are renamed to {Watch,}ProducerGone(); - {Watch,}ConsumerGone() are added; *NOTE* that ProducerGone()/WatchProducerGone() pair is ONLY needed in order to stop ConsumerLogs(follow=true) when a container is stopped; otherwise we're not interested in it. In other words, we're only using it in followLogs(). 2. Code that was doing (logWatcher*).Close() is modified to either call ProducerGone() or ConsumerGone(), depending on the context. 3. Code that was waiting for WatchClose() is modified to wait for either ConsumerGone() or ProducerGone(), or both, depending on the context. 4. followLogs() are modified accordingly: - context cancellation is happening on WatchProducerGone(), and once it's received the FileWatcher is closed and waitRead() returns errDone on EOF (i.e. log rotation handling logic is disabled); - due to this, code that was writing synchronously to logWatcher.Msg can be and is removed as the code above it handles this case; - function returns once ConsumerGone is received, freeing all the resources -- this is the bugfix itself. While at it, 1. Let's also remove the ctx usage to simplify the code a bit. It was introduced by commit a69a59f ("Decouple removing the fileWatcher from reading") in order to fix a bug. The bug was actually a deadlock in fsnotify, and the fix was just a workaround. Since then the fsnofify bug has been fixed, and a new fsnotify was vendored in. For more details, please see moby#27782 (comment) 2. Since `(*filePoller).Close()` is fixed to remove all the files being watched, there is no need to explicitly call fileWatcher.Remove(name) anymore, so get rid of the extra code. Should fix moby#37391 Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>

When daemon.ContainerLogs() is called with options.follow=true (as in "docker logs --follow"), the "loggerutils.followLogs()" function never returns (even then the logs consumer is gone). As a result, all the resources associated with it (including an opened file descriptor for the log file being read, two FDs for a pipe, and two FDs for inotify watch) are never released. If this is repeated (such as by running "docker logs --follow" and pressing Ctrl-C a few times), this results in DoS caused by either hitting the limit of inotify watches, or the limit of opened files. The only cure is daemon restart. Apparently, what happens is: 1. logs producer (a container) is gone, calling (*LogWatcher).Close() for all its readers (daemon/logger/jsonfilelog/jsonfilelog.go:175). 2. WatchClose() is properly handled by a dedicated goroutine in followLogs(), cancelling the context. 3. Upon receiving the ctx.Done(), the code in followLogs() (daemon/logger/loggerutils/logfile.go#L626-L638) keeps to send messages _synchronously_ (which is OK for now). 4. Logs consumer is gone (Ctrl-C is pressed on a terminal running "docker logs --follow"). Method (*LogWatcher).Close() is properly called (see daemon/logs.go:114). Since it was called before and due to to once.Do(), nothing happens (which is kinda good, as otherwise it will panic on closing a closed channel). 5. A goroutine (see item 3 above) keeps sending log messages synchronously to the logWatcher.Msg channel. Since the channel reader is gone, the channel send operation blocks forever, and resource cleanup set up in defer statements at the beginning of followLogs() never happens. Alas, the fix is somewhat complicated: 1. Distinguish between close from logs producer and logs consumer. To that effect, - yet another channel is added to LogWatcher(); - {Watch,}Close() are renamed to {Watch,}ProducerGone(); - {Watch,}ConsumerGone() are added; *NOTE* that ProducerGone()/WatchProducerGone() pair is ONLY needed in order to stop ConsumerLogs(follow=true) when a container is stopped; otherwise we're not interested in it. In other words, we're only using it in followLogs(). 2. Code that was doing (logWatcher*).Close() is modified to either call ProducerGone() or ConsumerGone(), depending on the context. 3. Code that was waiting for WatchClose() is modified to wait for either ConsumerGone() or ProducerGone(), or both, depending on the context. 4. followLogs() are modified accordingly: - context cancellation is happening on WatchProducerGone(), and once it's received the FileWatcher is closed and waitRead() returns errDone on EOF (i.e. log rotation handling logic is disabled); - due to this, code that was writing synchronously to logWatcher.Msg can be and is removed as the code above it handles this case; - function returns once ConsumerGone is received, freeing all the resources -- this is the bugfix itself. While at it, 1. Let's also remove the ctx usage to simplify the code a bit. It was introduced by commit a69a59f ("Decouple removing the fileWatcher from reading") in order to fix a bug. The bug was actually a deadlock in fsnotify, and the fix was just a workaround. Since then the fsnofify bug has been fixed, and a new fsnotify was vendored in. For more details, please see moby#27782 (comment) 2. Since `(*filePoller).Close()` is fixed to remove all the files being watched, there is no need to explicitly call fileWatcher.Remove(name) anymore, so get rid of the extra code. Should fix moby#37391 Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com> (cherry picked from commit 916eabd)

When daemon.ContainerLogs() is called with options.follow=true (as in "docker logs --follow"), the "loggerutils.followLogs()" function never returns (even then the logs consumer is gone). As a result, all the resources associated with it (including an opened file descriptor for the log file being read, two FDs for a pipe, and two FDs for inotify watch) are never released. If this is repeated (such as by running "docker logs --follow" and pressing Ctrl-C a few times), this results in DoS caused by either hitting the limit of inotify watches, or the limit of opened files. The only cure is daemon restart. Apparently, what happens is: 1. logs producer (a container) is gone, calling (*LogWatcher).Close() for all its readers (daemon/logger/jsonfilelog/jsonfilelog.go:175). 2. WatchClose() is properly handled by a dedicated goroutine in followLogs(), cancelling the context. 3. Upon receiving the ctx.Done(), the code in followLogs() (daemon/logger/loggerutils/logfile.go#L626-L638) keeps to send messages _synchronously_ (which is OK for now). 4. Logs consumer is gone (Ctrl-C is pressed on a terminal running "docker logs --follow"). Method (*LogWatcher).Close() is properly called (see daemon/logs.go:114). Since it was called before and due to to once.Do(), nothing happens (which is kinda good, as otherwise it will panic on closing a closed channel). 5. A goroutine (see item 3 above) keeps sending log messages synchronously to the logWatcher.Msg channel. Since the channel reader is gone, the channel send operation blocks forever, and resource cleanup set up in defer statements at the beginning of followLogs() never happens. Alas, the fix is somewhat complicated: 1. Distinguish between close from logs producer and logs consumer. To that effect, - yet another channel is added to LogWatcher(); - {Watch,}Close() are renamed to {Watch,}ProducerGone(); - {Watch,}ConsumerGone() are added; *NOTE* that ProducerGone()/WatchProducerGone() pair is ONLY needed in order to stop ConsumerLogs(follow=true) when a container is stopped; otherwise we're not interested in it. In other words, we're only using it in followLogs(). 2. Code that was doing (logWatcher*).Close() is modified to either call ProducerGone() or ConsumerGone(), depending on the context. 3. Code that was waiting for WatchClose() is modified to wait for either ConsumerGone() or ProducerGone(), or both, depending on the context. 4. followLogs() are modified accordingly: - context cancellation is happening on WatchProducerGone(), and once it's received the FileWatcher is closed and waitRead() returns errDone on EOF (i.e. log rotation handling logic is disabled); - due to this, code that was writing synchronously to logWatcher.Msg can be and is removed as the code above it handles this case; - function returns once ConsumerGone is received, freeing all the resources -- this is the bugfix itself. While at it, 1. Let's also remove the ctx usage to simplify the code a bit. It was introduced by commit a69a59f ("Decouple removing the fileWatcher from reading") in order to fix a bug. The bug was actually a deadlock in fsnotify, and the fix was just a workaround. Since then the fsnofify bug has been fixed, and a new fsnotify was vendored in. For more details, please see moby#27782 (comment) 2. Since `(*filePoller).Close()` is fixed to remove all the files being watched, there is no need to explicitly call fileWatcher.Remove(name) anymore, so get rid of the extra code. Should fix moby#37391 Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com> (cherry picked from commit 916eabd) [Conflict while cherry-picking due to missing commit 94a1015] Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>

When daemon.ContainerLogs() is called with options.follow=true (as in "docker logs --follow"), the "loggerutils.followLogs()" function never returns (even then the logs consumer is gone). As a result, all the resources associated with it (including an opened file descriptor for the log file being read, two FDs for a pipe, and two FDs for inotify watch) are never released. If this is repeated (such as by running "docker logs --follow" and pressing Ctrl-C a few times), this results in DoS caused by either hitting the limit of inotify watches, or the limit of opened files. The only cure is daemon restart. Apparently, what happens is: 1. logs producer (a container) is gone, calling (*LogWatcher).Close() for all its readers (daemon/logger/jsonfilelog/jsonfilelog.go:175). 2. WatchClose() is properly handled by a dedicated goroutine in followLogs(), cancelling the context. 3. Upon receiving the ctx.Done(), the code in followLogs() (daemon/logger/loggerutils/logfile.go#L626-L638) keeps to send messages _synchronously_ (which is OK for now). 4. Logs consumer is gone (Ctrl-C is pressed on a terminal running "docker logs --follow"). Method (*LogWatcher).Close() is properly called (see daemon/logs.go:114). Since it was called before and due to to once.Do(), nothing happens (which is kinda good, as otherwise it will panic on closing a closed channel). 5. A goroutine (see item 3 above) keeps sending log messages synchronously to the logWatcher.Msg channel. Since the channel reader is gone, the channel send operation blocks forever, and resource cleanup set up in defer statements at the beginning of followLogs() never happens. Alas, the fix is somewhat complicated: 1. Distinguish between close from logs producer and logs consumer. To that effect, - yet another channel is added to LogWatcher(); - {Watch,}Close() are renamed to {Watch,}ProducerGone(); - {Watch,}ConsumerGone() are added; *NOTE* that ProducerGone()/WatchProducerGone() pair is ONLY needed in order to stop ConsumerLogs(follow=true) when a container is stopped; otherwise we're not interested in it. In other words, we're only using it in followLogs(). 2. Code that was doing (logWatcher*).Close() is modified to either call ProducerGone() or ConsumerGone(), depending on the context. 3. Code that was waiting for WatchClose() is modified to wait for either ConsumerGone() or ProducerGone(), or both, depending on the context. 4. followLogs() are modified accordingly: - context cancellation is happening on WatchProducerGone(), and once it's received the FileWatcher is closed and waitRead() returns errDone on EOF (i.e. log rotation handling logic is disabled); - due to this, code that was writing synchronously to logWatcher.Msg can be and is removed as the code above it handles this case; - function returns once ConsumerGone is received, freeing all the resources -- this is the bugfix itself. While at it, 1. Let's also remove the ctx usage to simplify the code a bit. It was introduced by commit a69a59f ("Decouple removing the fileWatcher from reading") in order to fix a bug. The bug was actually a deadlock in fsnotify, and the fix was just a workaround. Since then the fsnofify bug has been fixed, and a new fsnotify was vendored in. For more details, please see moby/moby#27782 (comment) 2. Since `(*filePoller).Close()` is fixed to remove all the files being watched, there is no need to explicitly call fileWatcher.Remove(name) anymore, so get rid of the extra code. Should fix moby/moby#37391 Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com> Upstream-commit: 916eabd459fe707b5c4a86377d12e2ad1871b353 Component: engine

When daemon.ContainerLogs() is called with options.follow=true (as in "docker logs --follow"), the "loggerutils.followLogs()" function never returns (even then the logs consumer is gone). As a result, all the resources associated with it (including an opened file descriptor for the log file being read, two FDs for a pipe, and two FDs for inotify watch) are never released. If this is repeated (such as by running "docker logs --follow" and pressing Ctrl-C a few times), this results in DoS caused by either hitting the limit of inotify watches, or the limit of opened files. The only cure is daemon restart. Apparently, what happens is: 1. logs producer (a container) is gone, calling (*LogWatcher).Close() for all its readers (daemon/logger/jsonfilelog/jsonfilelog.go:175). 2. WatchClose() is properly handled by a dedicated goroutine in followLogs(), cancelling the context. 3. Upon receiving the ctx.Done(), the code in followLogs() (daemon/logger/loggerutils/logfile.go#L626-L638) keeps to send messages _synchronously_ (which is OK for now). 4. Logs consumer is gone (Ctrl-C is pressed on a terminal running "docker logs --follow"). Method (*LogWatcher).Close() is properly called (see daemon/logs.go:114). Since it was called before and due to to once.Do(), nothing happens (which is kinda good, as otherwise it will panic on closing a closed channel). 5. A goroutine (see item 3 above) keeps sending log messages synchronously to the logWatcher.Msg channel. Since the channel reader is gone, the channel send operation blocks forever, and resource cleanup set up in defer statements at the beginning of followLogs() never happens. Alas, the fix is somewhat complicated: 1. Distinguish between close from logs producer and logs consumer. To that effect, - yet another channel is added to LogWatcher(); - {Watch,}Close() are renamed to {Watch,}ProducerGone(); - {Watch,}ConsumerGone() are added; *NOTE* that ProducerGone()/WatchProducerGone() pair is ONLY needed in order to stop ConsumerLogs(follow=true) when a container is stopped; otherwise we're not interested in it. In other words, we're only using it in followLogs(). 2. Code that was doing (logWatcher*).Close() is modified to either call ProducerGone() or ConsumerGone(), depending on the context. 3. Code that was waiting for WatchClose() is modified to wait for either ConsumerGone() or ProducerGone(), or both, depending on the context. 4. followLogs() are modified accordingly: - context cancellation is happening on WatchProducerGone(), and once it's received the FileWatcher is closed and waitRead() returns errDone on EOF (i.e. log rotation handling logic is disabled); - due to this, code that was writing synchronously to logWatcher.Msg can be and is removed as the code above it handles this case; - function returns once ConsumerGone is received, freeing all the resources -- this is the bugfix itself. While at it, 1. Let's also remove the ctx usage to simplify the code a bit. It was introduced by commit a69a59f ("Decouple removing the fileWatcher from reading") in order to fix a bug. The bug was actually a deadlock in fsnotify, and the fix was just a workaround. Since then the fsnofify bug has been fixed, and a new fsnotify was vendored in. For more details, please see moby/moby#27782 (comment) 2. Since `(*filePoller).Close()` is fixed to remove all the files being watched, there is no need to explicitly call fileWatcher.Remove(name) anymore, so get rid of the extra code. Should fix moby/moby#37391 Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com> (cherry picked from commit 916eabd459fe707b5c4a86377d12e2ad1871b353) Upstream-commit: 84a5b528aede5579861201e869870d10fc98c07c Component: engine

When daemon.ContainerLogs() is called with options.follow=true (as in "docker logs --follow"), the "loggerutils.followLogs()" function never returns (even then the logs consumer is gone). As a result, all the resources associated with it (including an opened file descriptor for the log file being read, two FDs for a pipe, and two FDs for inotify watch) are never released. If this is repeated (such as by running "docker logs --follow" and pressing Ctrl-C a few times), this results in DoS caused by either hitting the limit of inotify watches, or the limit of opened files. The only cure is daemon restart. Apparently, what happens is: 1. logs producer (a container) is gone, calling (*LogWatcher).Close() for all its readers (daemon/logger/jsonfilelog/jsonfilelog.go:175). 2. WatchClose() is properly handled by a dedicated goroutine in followLogs(), cancelling the context. 3. Upon receiving the ctx.Done(), the code in followLogs() (daemon/logger/loggerutils/logfile.go#L626-L638) keeps to send messages _synchronously_ (which is OK for now). 4. Logs consumer is gone (Ctrl-C is pressed on a terminal running "docker logs --follow"). Method (*LogWatcher).Close() is properly called (see daemon/logs.go:114). Since it was called before and due to to once.Do(), nothing happens (which is kinda good, as otherwise it will panic on closing a closed channel). 5. A goroutine (see item 3 above) keeps sending log messages synchronously to the logWatcher.Msg channel. Since the channel reader is gone, the channel send operation blocks forever, and resource cleanup set up in defer statements at the beginning of followLogs() never happens. Alas, the fix is somewhat complicated: 1. Distinguish between close from logs producer and logs consumer. To that effect, - yet another channel is added to LogWatcher(); - {Watch,}Close() are renamed to {Watch,}ProducerGone(); - {Watch,}ConsumerGone() are added; *NOTE* that ProducerGone()/WatchProducerGone() pair is ONLY needed in order to stop ConsumerLogs(follow=true) when a container is stopped; otherwise we're not interested in it. In other words, we're only using it in followLogs(). 2. Code that was doing (logWatcher*).Close() is modified to either call ProducerGone() or ConsumerGone(), depending on the context. 3. Code that was waiting for WatchClose() is modified to wait for either ConsumerGone() or ProducerGone(), or both, depending on the context. 4. followLogs() are modified accordingly: - context cancellation is happening on WatchProducerGone(), and once it's received the FileWatcher is closed and waitRead() returns errDone on EOF (i.e. log rotation handling logic is disabled); - due to this, code that was writing synchronously to logWatcher.Msg can be and is removed as the code above it handles this case; - function returns once ConsumerGone is received, freeing all the resources -- this is the bugfix itself. While at it, 1. Let's also remove the ctx usage to simplify the code a bit. It was introduced by commit 9de9efe ("Decouple removing the fileWatcher from reading") in order to fix a bug. The bug was actually a deadlock in fsnotify, and the fix was just a workaround. Since then the fsnofify bug has been fixed, and a new fsnotify was vendored in. For more details, please see moby/moby#27782 (comment) 2. Since `(*filePoller).Close()` is fixed to remove all the files being watched, there is no need to explicitly call fileWatcher.Remove(name) anymore, so get rid of the extra code. Should fix moby/moby#37391 Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>

GordonTheTurtle added the status/0-triage label Oct 26, 2016

tombooth force-pushed the 27779-decouple-watcher-close branch from dc0b73e to 2763344 Compare October 26, 2016 13:15

GordonTheTurtle added the dco/no Automatically set by a bot when one of the commits lacks proper signature label Oct 26, 2016

tombooth force-pushed the 27779-decouple-watcher-close branch from 2763344 to 5d06603 Compare October 26, 2016 13:16

GordonTheTurtle added the dco/no Automatically set by a bot when one of the commits lacks proper signature label Oct 26, 2016

tombooth force-pushed the 27779-decouple-watcher-close branch from 5d06603 to 5a446f7 Compare October 26, 2016 13:17

GordonTheTurtle removed the dco/no Automatically set by a bot when one of the commits lacks proper signature label Oct 26, 2016

cpuguy83 added status/2-code-review and removed status/0-triage labels Oct 26, 2016

tombooth force-pushed the 27779-decouple-watcher-close branch from 5a446f7 to f78e713 Compare October 26, 2016 14:53

cpuguy83 reviewed Oct 26, 2016

View reviewed changes

tombooth force-pushed the 27779-decouple-watcher-close branch from f78e713 to 814f5ed Compare October 26, 2016 16:38

cpuguy83 reviewed Oct 26, 2016

View reviewed changes

tombooth force-pushed the 27779-decouple-watcher-close branch 2 times, most recently from 9c69552 to bb1500c Compare October 26, 2016 20:47

cpuguy83 added this to the 1.13.0 milestone Oct 26, 2016

cpuguy83 reviewed Oct 26, 2016

View reviewed changes

tombooth force-pushed the 27779-decouple-watcher-close branch from bb1500c to 8d9380c Compare October 26, 2016 23:07

cpuguy83 reviewed Oct 26, 2016

View reviewed changes

tombooth force-pushed the 27779-decouple-watcher-close branch from 8d9380c to b96f139 Compare October 26, 2016 23:33

tombooth force-pushed the 27779-decouple-watcher-close branch from b96f139 to 68c3c21 Compare October 27, 2016 01:47

LK4D4 suggested changes Oct 27, 2016

View reviewed changes

LK4D4 approved these changes Oct 31, 2016

View reviewed changes

cpuguy83 approved these changes Oct 31, 2016

View reviewed changes

cpuguy83 merged commit 0cdcb9e into moby:master Oct 31, 2016

tombooth deleted the 27779-decouple-watcher-close branch November 1, 2016 11:52

thaJeztah mentioned this pull request Nov 9, 2016

Race in "docker logs --follow" leads to hang in TestLogsFollowSlowStdoutConsumer #16462

Closed

thaJeztah mentioned this pull request Dec 13, 2016

FAIL: docker_cli_logs_test.go:250: DockerSuite.TestLogsFollowSlowStdoutConsumer #29341

Closed

kolyshkin mentioned this pull request Jul 31, 2018

Using follow log output causes json-log file descriptors to leak #37391

Closed

kolyshkin mentioned this pull request Aug 29, 2018

daemon.ContainerLogs(): fix resource leak on follow #37576

Merged

thaJeztah mentioned this pull request Jul 28, 2020

Flaky test: TestLogsFollowSlowStdoutConsumer #41276

Open

Conversation

tombooth commented Oct 26, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tombooth commented Oct 26, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

cpuguy83 commented Oct 26, 2016

Uh oh!

cpuguy83 commented Oct 26, 2016

Uh oh!

tombooth commented Oct 26, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tombooth commented Oct 26, 2016

Uh oh!

tombooth commented Oct 26, 2016

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

cpuguy83 commented Oct 26, 2016

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

LK4D4 commented Oct 31, 2016

Uh oh!

cpuguy83 left a comment

Choose a reason for hiding this comment

Uh oh!

tombooth commented Nov 1, 2016

Uh oh!

michael-holzheu commented Nov 9, 2016

Uh oh!

thaJeztah commented Nov 9, 2016

Uh oh!

kolyshkin commented Aug 29, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

tombooth commented Oct 26, 2016 •

edited

Loading

tombooth commented Oct 26, 2016 •

edited

Loading

tombooth commented Oct 26, 2016 •

edited

Loading

kolyshkin commented Aug 29, 2018 •

edited

Loading