I installed Telegraf v1.4.4 via RPM and configured a kafka_consumer input as follows:
## Read metrics from Kafka topic(s)
[[inputs.kafka_consumer]]
brokers = ["localhost:9092"]
topics = ["telegraf"]
It works well for gathering Kafka metrics. However, when the Kafka broker goes down unexpectedly, Telegraf fails to restart. The service status and the Telegraf log show the following:
$ service telegraf status
Redirecting to /bin/systemctl status telegraf.service
● telegraf.service - The plugin-driven server agent for reporting metrics into InfluxDB
Loaded: loaded (/usr/lib/systemd/system/telegraf.service; enabled; vendor preset: disabled)
Active: inactive (dead) since Fri 2018-01-26 20:18:16 CST; 45min ago
Docs: https://github.com/influxdata/telegraf
Process: 8954 ExecStart=/usr/bin/telegraf -config /etc/telegraf/telegraf.conf -config-directory /etc/telegraf/telegraf.d $TELEGRAF_OPTS (code=exited, status=0/SUCCESS)
Main PID: 8954 (code=exited, status=0/SUCCESS)
/var/log/telegraf/telegraf.log
2018-01-26T10:33:56Z I! Starting Telegraf v1.4.4
2018-01-26T10:33:56Z I! Loaded outputs: influxdb opentsdb prometheus_client
2018-01-26T10:33:56Z I! Loaded inputs: inputs.kernel inputs.system inputs.filestat inputs.mongodb inputs.kafka_consumer inputs.diskio inputs.swap inputs.disk inputs.docker inputs.internal inputs.kernel_vmstat inputs.cpu inputs.mem inputs.processes inputs.net inputs.zookeeper inputs.logparser
2018-01-26T10:33:56Z I! Tags enabled: host=127.0.0.1 user=telegraf
2018-01-26T10:33:56Z I! Agent Config: Interval:10s, Quiet:false, Flush Interval:10s
2018-01-26T10:33:57Z E! Error when creating Kafka Consumer, brokers: [localhost:9092], topics: [telegraf]
2018-01-26T10:33:57Z E! Service for input inputs.kafka_consumer failed to start, exiting
kafka: client has run out of available brokers to talk to (Is your cluster reachable?)
Expected behavior:
Telegraf restarts successfully even when some inputs hit an internal error.
Actual behavior:
Telegraf fails to restart because of an internal error in the kafka_consumer input plugin.