Proposal: Metrics for log driver log loss, logs sent, throughput, in Docker Stats or prometheus metrics

### Description

Recently, I worked on a project to benchmark log loss with the AWSLogs driver in `non-blocking` mode with different values of `max-buffer-size` and log output throughput. These results will be published soon and will be linked here once published. 

One of the issues I see with users using the benchmark results to guide their own configurations is that there’s limited visibility into logging with log drivers. I can’t easily find the output rate of my container. When the [buffer](https://github.com/moby/moby/blob/master/daemon/logger/ring.go#L160) drops logs for lack of buffer space, there’s no log message or metric emitted. 


### Problem Statement

Log drivers have limited visibility. Ideally, users should be able to track metrics or logs on the rate of log emission from their containers and the rate of log loss vs log upload success from the driver. 

With `non-blocking` mode, the main concern is log loss in the [buffer](https://github.com/moby/moby/blob/master/daemon/logger/ring.go#L160). When logs are dropped, it is completely silent, there’s no way to track it. 

The highest priority problem is lack of visibility into log loss. Tracking log throughput is secondary. 


### Proposal: Metrics for log driver log loss, logs sent, throughput, in Docker Stats

While the log loss visibility problem could be solved with logs emitted by the Docker daemon, this approach doesn’t scale. Users want to quantify log loss per container, and counting messages emitted by the entire daemon is inconvenient. Also, in cases of repeated high rate of log loss, the docker daemon logs would be spammed with many error statements.

The ideal place for this sort of data is to add new counters in the `docker stats` interface OR the prometheus metrics interface. 

![AWSLogs buffering (3) (2) (1)](https://github.com/moby/moby/assets/29443996/13ef687e-d972-4d05-8805-2beb59c9bf86)

As shown in this diagram, log loss by the non-blocking buffer could be tracked solely by Docker, without the need for changes in any log drivers. This could make implementation simpler. 




Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Proposal: Metrics for log driver log loss, logs sent, throughput, in Docker Stats or prometheus metrics #45953

Description

Problem Statement