Once we have all 5 nodes up and running we’ll want to monitor them, for this I intend to use Nagios, which is a enterprise monitoring tool based on a client/server architecture. We install the server on pimaster and then the client on all nodes. Nagios will then provide a web based interface to view various performance characteristics of our running super computer.
To install use the following command
sudo apt-get install nagios3
During the install a number of config screens appear, the first asks you for the password for ‘nagiosadmin’ user.
Enter a password and the again at the re-confirmation screen which appears next, then wait because the next bit takes a while but it eventually ends. You can check its running with by either running rcconf or
sudo /etc/init.d/nagios3 status [ ok ] checking /usr/sbin/nagios3...done (running).
To access Nagios, enter the URL http://pimaster/nagios3/ into a browser and when prompted for userid and password enter, nagiosadmin for the userid and the password we entered above. You will then be presented with a screen similar to this
Congratulations you have installed Nagios and confirmed its running, its time to start configuring it to monitor our nodes
To configure each node as a client we need add the Nagios plugs. Most of this next bill is pulled from Platonic
sudo apt-get install nagios-plugins nagios-nrpe-plugin nagios-nrpe-server
You can then view the nagios server via a web browser at http://