Add a docker plugin - dockerlogbeat by fearful-symmetry · Pull Request #13761 · elastic/beats

fearful-symmetry · 2019-09-22T21:10:26Z

This is the result of discussions between myself and @henrikno about the needs of cloud monitoring. This PR adds a newbeat, dockerlogbeat, that's a docker logging plugin.

Right now, this is a functional PoC. When configured it'll send logs to elasticsearch, and because it uses libbeat, you can (mostly) pass it the same beat-cli style of config options you would any other beat: --log-opt output.elasticsearch.hosts="172.18.0.2:9200"

This plugin uses the libbeat publisher/pipeline library. When a new container starts that uses our plugin, the plugin checks the user-supplied config against a hashtable of libbeat pipelines. If the pipeline exists, it starts a new client for that pipeline and hands it to the logger. If not, it starts up a new pipeline with that config. When a container stops, it closes the client, and if there are no open clients associated with that pipeline, it closes the pipeline as well.

There's a detailed readme with instructions on building, running and debugging.

This is a draft that needs a lot of work. Here's a tentative TODO.

For an MVP:

GA release:

How do we want to integrate this with ingest pipelines?
Can we get this to send its own logs / health data to ES?
We want this to support file spooling, which is currently in beta. We have a few issues with file spooling as-is. Namely, the spools have no 'garbage collection' to clean up after files that are no longer used. Also, the system isn't very good at managing the multiple spool files that would accumulate with multiple libbeat pipelines.
some kind of integration test suite.
Update documentation in config.json

dockerlogbeat/pipelineManager/pipelineManager.go

dockerlogbeat/pipelineManager/clientLogReader.go

x-pack/dockerlogbeat/pipelineManager/pipelineManager.go

x-pack/dockerlogbeat/pipelineManager/clientLogReader.go

fearful-symmetry · 2019-09-26T18:02:31Z

I'm still struggling to get the build system working. Docker plugins require a bunch of docker build operations, and trying to cram in all the dependencies to get it to build inside the container seems to be a problem.

fearful-symmetry · 2019-09-27T16:10:47Z

Okay, the build tooling stuff at least works now. Still fighting with govendor.

x-pack/dockerlogbeat/magefile.go

fearful-symmetry · 2019-10-04T15:26:34Z

jenkins, test this

exekias · 2019-10-07T08:04:29Z

Great to see this progressing! let's try to open a PR against a feature branch as soon as we have something mergeable, you can solve the rest of TODOs in more PRs

urso · 2019-10-07T11:20:56Z

x-pack/dockerlogbeat/Dockerfile

See .go-version file for current go version in the repository.

Is there some way to access that from mage?

dev-tools/mage/settings.go exports GoVersion

x-pack/dockerlogbeat/main.go

x-pack/dockerlogbeat/pipelineManager/clientLogReader.go

urso · 2019-10-07T11:30:00Z

x-pack/dockerlogbeat/pipelineManager/clientLogReader.go

Where did you get this line from?

Docker's example logging plugin does this, so every other plugin copied it and does the same thing. I'm guessing it's to 'reset' the reader in case of some undefined error, but I can't really make sense of it.

Implementation: https://github.com/gogo/protobuf/blob/master/io/uint32.go#L114

It does not reset the reader, but returns an ErrShortBuffer. The generated reader reads the complete event into memory and then unmarshals it. The 2e6 is the max read buffer size.

Once you get a message that is potentially bigger, you are stuck. The length field has already been read (read pointer has advanced), and you MUST NOT continue reading from the reader once you get ErrShortBuffer. This potentially leaves you with another problem.

I think it is fair to rule out old big messages (keep memory usage in check). Just creating a new reader (as done here) without consuming any more bytes will put you in the middle of a protobuf message, giving you an invalid read. If we want a reader that can drop large messages, then we need another implementation. But the current loop will happily produce errors and reinitialize the reader over and over again after until it did find a valid length field. The way protobuf is encoded, there are lengths fields for nested structures and strings all over the place. Unmarshaling will likely fail over and over again itself.

Ah, thanks. I'll keep reading into this.

urso · 2019-10-07T11:31:31Z

x-pack/dockerlogbeat/pipelineManager/clientLogReader.go

Have you considered an alternative non-blocking mode?

...There's a non-blocking mode?

See: https://github.com/elastic/beats/blob/master/libbeat/beat/pipeline.go#L176

x-pack/dockerlogbeat/pipelineManager/pipelineManager.go

urso · 2019-10-07T11:36:44Z

x-pack/dockerlogbeat/pipelineManager/pipelineManager.go

We run CloseClientWithFile in a separate go-routine (building an unbuffered 'channel'), because we don't want the API to block if CloseClientWithFile blocks. Due to the mutex here we do allow CreateClientWithConfig to block indirectly, though. Is this ok?

Yes. If CreateClientWithConfig bombs out while the HTTP handler is waiting for it, the container startup will fail and an error will be returned to the user via the CLI or whatever they're using to start. Otherwise a container might fail after a user a started it, or we get into some weird scenario where a container is running but the user doesn't know it's not logging. I'm less worried about the container stop passing its errors to the user.

To clarify: We are ok with CreateClientWithConfig and startLoggingHandler blocking?

The WaitClose setting (currently 0 and not exposed) could force startLoggingHandler to block for as many seconds as are configured on this setting (no matter which queue).

x-pack/dockerlogbeat/pipelineManager/pipelineManager.go

urso · 2019-10-07T11:40:28Z

x-pack/dockerlogbeat/pipelineManager/pipelineManager.go

If you want similar behavior to other Beats, consider common.NewConfigFrom(...)

common.NewConfigFrom(...) is called after parseCfgKeys. I had to hack this in because NewConfigFrom doesn't like the map[string]string structures we get from docker. I had to coerce the string value into an an interface wrapper so it'll handle config keys like ["192.168.1.2","192.168.1.3"]

Hm, sounds like a bug in go-ucfg. It should happily accept map[string]string.

Oh, it'll accept map[string]string, but if you pass it something like

map[string]string{ "output.elasticsearch.hosts" : `["192.168.1.2","192.168.1.3"]` }

You get Elasticsearch url: http://['172.18.0.3:9200,'172.18.0.2']:9200

oh, I see. The -E flag is implemented in libbeat/cfgfile/cfgfile.go. It uses SettingFlag from libbeat/common/flags.go. The final flag parsing is implemented here: https://github.com/elastic/go-ucfg/blob/master/flag/value.go#L48

The value parser is an internal package, but maybe we can export it for use-cases like this: CLI settings getting passed indirectly.

fearful-symmetry · 2019-10-07T15:22:51Z

Removed the panic()s from main and just printed to stderr, since I didn't want to import another log handler just to deal with the errors we get outside of the http handler.

x-pack/dockerlogbeat/config.json

x-pack/dockerlogbeat/main.go

exekias

Left some comments, this looks good to me!

x-pack/dockerlogbeat/config.json

x-pack/dockerlogbeat/handlers.go

x-pack/dockerlogbeat/pipelinemanager/clientLogReader.go

exekias · 2019-10-09T15:28:46Z

x-pack/dockerlogbeat/pipelinemanager/clientLogReader.go

to restart the reader after an error?

Yah, Steffen and I were discussing this. This was from docker's example, but it's almost certainly a terrible idea.

exekias · 2019-10-09T15:36:17Z

x-pack/dockerlogbeat/readme.md

can we move this to an issue?

What exactly? How do debug?

Oh, the Issues? Yah, that's getting ported.

x-pack/dockerlogbeat/readme.md

fearful-symmetry · 2019-10-09T19:24:15Z

Iteratively removing the vendored deps, since I discovered that if you break something with govendor, it's not easy to fix with govendor.

* init commit of dockerlogbeat

* Add a docker plugin - Elastic Log Driver (#13761)

* Add a docker plugin - Elastic Log Driver (elastic#13761) (cherry picked from commit 4a7a8c3)

* Add a docker plugin - Elastic Log Driver (#13761) (cherry picked from commit 4a7a8c3)

* init commit of dockerlogbeat

fearful-symmetry added new beat Team:Integrations Label for the Integrations team labels Sep 22, 2019

fearful-symmetry requested review from a team September 22, 2019 21:10

fearful-symmetry self-assigned this Sep 22, 2019

houndci-bot reviewed Sep 22, 2019

View reviewed changes

dockerlogbeat/pipelineManager/pipelineManager.go Outdated Show resolved Hide resolved

dockerlogbeat/pipelineManager/clientLogReader.go Outdated Show resolved Hide resolved

houndci-bot reviewed Sep 23, 2019

View reviewed changes

x-pack/dockerlogbeat/pipelineManager/pipelineManager.go Outdated Show resolved Hide resolved

x-pack/dockerlogbeat/pipelineManager/clientLogReader.go Outdated Show resolved Hide resolved

andresrc added the [zube]: In Progress label Sep 24, 2019

houndci-bot reviewed Oct 1, 2019

View reviewed changes

x-pack/dockerlogbeat/magefile.go Outdated Show resolved Hide resolved

houndci-bot reviewed Oct 1, 2019

View reviewed changes

x-pack/dockerlogbeat/magefile.go Outdated Show resolved Hide resolved