...
The "TOPOLOGY HEALTH CHECK" application monitors services that have a master-slave structured topology and provides metrics at host level. The collected status is exposed as an Eagle stream, so alert policies can be defined on it to detect anomalies in real time.
Type | TOPOLOGY_HEALTH_CHECK |
---|---|
Version | 0.5.0-version |
Description | Collect MR,HBASE,HDFS node status and cluster ratio |
Streams | TOPOLOGY_HEALTH_CHECK_STREAM |
Configuration | |
...
- Make sure a site is already set up (this guide uses a demo site named "sandbox")
- Install the "Topology Health Check" app in the Eagle server
- Configure Application settings
- Ensure a Kafka topic named topology_health_check exists (in the current guide, it should be topology_health_check)
- Set up the metric collector for the monitored Hadoop/HBase services using hadoop_jmx_collector and modify the configuration
  - Collector scripts: https://github.com/apache/incubator-eagle/tree/master/eagle-external/hadoop_jmx_collector
  - Rename config-sample.json to config.json: https://github.com/apache/incubator-eagle/blob/master/eagle-external/hadoop_jmx_collector/config-sample.json
config.json:

    {
      "env": {
        "site": "sandbox",
        "name_node": {
          "hosts": ["sandbox.hortonworks.com"],
          "port": 50070,
          "https": false
        },
        "resource_manager": {
          "hosts": ["sandbox.hortonworks.com"],
          "port": 50030,
          "https": false
        }
      },
      "inputs": [
        {
          "component": "namenode",
          "host": "server.eagle.apache.org",
          "port": "50070",
          "https": false,
          "kafka_topic": "nn_jmx_metric_sandbox"
        },
        {
          "component": "resourcemanager",
          "host": "server.eagle.apache.org",
          "port": "8088",
          "https": false,
          "kafka_topic": "rm_jmx_metric_sandbox"
        },
        {
          "component": "datanode",
          "host": "server.eagle.apache.org",
          "port": "50075",
          "https": false,
          "kafka_topic": "dn_jmx_metric_sandbox"
        }
      ],
      "filter": {
        "monitoring.group.selected": ["hadoop", "java.lang"]
      },
      "output": {
        "kafka": {
          "brokerList": ["localhost:9092"]
        }
      }
    }
- Click the "Install" button; you will then see TOPOLOGY_HEALTH_CHECK_STREAM_{SITE_ID} in Streams
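Before starting the collector, it can help to load the configuration and confirm which Kafka topic each input publishes to, and which stream name to expect after installation. The following is a minimal sketch under stated assumptions, not part of Eagle itself: it assumes config.json is strict JSON with quoted keys, the helper names `kafka_topics` and `expected_stream_id` are hypothetical, and the upper-casing of the site ID is only inferred from the TOPOLOGY_HEALTH_CHECK_STREAM_SANDBOX name used in the policy example of this guide.

```python
import json

# Copy of the inputs from the sample config in this guide, written as strict JSON.
CONFIG_TEXT = """
{
  "env": {"site": "sandbox"},
  "inputs": [
    {"component": "namenode", "host": "server.eagle.apache.org",
     "port": "50070", "https": false, "kafka_topic": "nn_jmx_metric_sandbox"},
    {"component": "resourcemanager", "host": "server.eagle.apache.org",
     "port": "8088", "https": false, "kafka_topic": "rm_jmx_metric_sandbox"},
    {"component": "datanode", "host": "server.eagle.apache.org",
     "port": "50075", "https": false, "kafka_topic": "dn_jmx_metric_sandbox"}
  ],
  "output": {"kafka": {"brokerList": ["localhost:9092"]}}
}
"""


def kafka_topics(config):
    """Map each input component to its Kafka topic (hypothetical helper)."""
    return {item["component"]: item["kafka_topic"] for item in config["inputs"]}


def expected_stream_id(site_id):
    """Build TOPOLOGY_HEALTH_CHECK_STREAM_{SITE_ID}.

    Upper-casing the site ID is an assumption based on the policy example.
    """
    return "TOPOLOGY_HEALTH_CHECK_STREAM_" + site_id.upper()


config = json.loads(CONFIG_TEXT)
topics = kafka_topics(config)
stream_id = expected_stream_id(config["env"]["site"])

for component, topic in sorted(topics.items()):
    print(component, "->", topic)
print("expected stream:", stream_id)
```

In practice the text would be read from the real config.json rather than an embedded string; a `json.JSONDecodeError` at load time is a quick hint that a key was left unquoted.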
Usage
...
Define Health Check Alert Policy
- Go to "Define Policy"
- Select TOPOLOGY_HEALTH_CHECK related streams
- Define a SQL-like policy, for example:

    from TOPOLOGY_HEALTH_CHECK_STREAM_SANDBOX[status=='dead']
    select *
    insert into topology_health_check_stream_out;
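To make the effect of the `[status=='dead']` filter concrete, the sketch below mimics it in plain Python. This is only an illustration, not how Eagle evaluates policies: the event fields `host` and `role` and all sample values are hypothetical, and only the `status` field and the `dead` value come from the policy above.

```python
def select_dead(events):
    """Keep only events whose status equals 'dead', like the policy's filter clause."""
    return [event for event in events if event.get("status") == "dead"]


# Hypothetical sample events; real stream fields may differ.
events = [
    {"host": "dn1.example.com", "role": "datanode", "status": "live"},
    {"host": "dn2.example.com", "role": "datanode", "status": "dead"},
    {"host": "rs1.example.com", "role": "regionserver", "status": "dead"},
]

# Equivalent of "select * insert into topology_health_check_stream_out".
alerts = select_dead(events)
for event in alerts:
    print("ALERT:", event["host"], event["role"])
```

Because the policy uses `select *`, every field of a matching event is forwarded to the output stream unchanged, which is why the sketch returns whole events rather than a projection.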