Telegraf agent not able to monitor zookeeper

We use VMWare Wavefront for monitoring and visualization. We use telegraf agent as metrics collector. As part of Kafka monitoring, we have created alerts for Zookeeper availability. It was working fine initially but stopped working after a Kafka upgrade. We checked telegraf agent and zookeeper logs but could not find anything suspicious.

We checked telegraf zookeeper github repository. It mentioned that it uses zookeeper mntr command to get monitoring data. We hadn’t whitelisted any zookeeper 4lw commands in earlier versions of Kafka too. It turned out that from Zookeeper version 3.5.3 onwards, we need to explicitly whilelist commands. As mntr is disabled by default, telegraf was not able to collect metrics. Issue got resolved after whitelisting mntr command in zookeeper.properties file and restart of zookeeper.

Related Post