Monitoring Hadoop

Get to grips with the intricacies of Hadoop monitoring using the power of Ganglia and Nagios

Book Description:

With the exponential growth of data and many enterprises crunching more and more data, Hadoop as a data platform has gained a lot of popularity. The Hadoop platform needs to be monitored with respect to how it works and functions. There is an ever-increasing need to keep the Hadoop platform clean and healthy.

This book will help you to integrate Hadoop and Nagios in a seamless and easy way. At the start, the book covers the basics of operating system logging and monitoring. Getting to grips with the characteristics of Hadoop monitoring, metrics, and log collection will help Hadoop users, especially Hadoop administrators, diagnose and troubleshoot clusters better. In essence, the book teaches you how to set up an all-inclusive and robust monitoring system for the Hadoop platform. The book also serves as a quick reference to the various metrics available in Hadoop.

Concluding with the visualization of Hadoop metrics, you will get acquainted with the workings of Hadoop in a short span of time with the help of step-by-step instructions in each chapter.

What You Will Learn

  • Install Nagios and Ganglia and understand logging at the operating system level
  • Create and configure Nagios nodes for monitoring with custom checks
  • Monitor Hadoop daemons such as NameNode, DataNode, JobTracker, and so on
  • Configure logs for various daemons and set up audits for the options done on the cluster
  • Track important parameters for the File System, MapReduce, and other counters
  • Set up Nagios master and client nodes with checks for the system and applications running on it
  • Configure the Hadoop metrics collection and visualize it for nontechnical users
  • Understand the communication between different daemons and protocols and the ports they use