Nagios 简明教程
Nagios - NRPE
Nagios 服务程序在 NRPE(Nagios 远程插件执行器)中对远程机器执行检查。允许你在其他机器上远程运行 Nagios 插件。你可以监测远程机器的指标,例如磁盘使用情况、CPU 负载等。它还可以通过一些 Windows 代理插件检查远程 Windows 机器指标。
The Nagios daemon which run checks on remote machines in NRPE (Nagios Remote Plugin Executor). It allows you to run Nagios plugins on other machines remotely. You can monitor remote machine metrics such as disk usage, CPU load etc. It can also check metrics of remote windows machines through some windows agent addons.
data:image/s3,"s3://crabby-images/80922/80922e4b7892696686d2894e391c6d4b923ee396" alt="plugin executor"
让我们分步了解如何在需要监测的客户端机器上安装并配置 NRPE。
Let us see how to install and configure NRPE step by step on client machine which needs to be monitored.
Step 1 − 运行以下命令在需要监测的远程 Linux 机器上安装 NRPE。
Step 1 − Run below command to install NRPE on the remote linux machine to be monitored.
sudo apt-get install nagios-nrpe-server nagios-plugins
Step 2 − 现在,在服务器目录中创建一个主机文件,并放入该主机的所有必要定义。
Step 2 − Now, create a host file inside the server directory, and put all the necessary definitions for the host.
sudo gedit /usr/local/nagios/etc/servers/ubuntu_host.cfg
# Ubuntu Host configuration file
define host {
use linux-server
host_name ubuntu_host
alias Ubuntu Host
address 192.168.1.10
register 1
}
define service {
host_name ubuntu_host
service_description PING
check_command check_ping!100.0,20%!500.0,60%
max_check_attempts 2
check_interval 2
retry_interval 2
check_period 24x7
check_freshness 1
contact_groups admins
notification_interval 2
notification_period 24x7
notifications_enabled 1
register 1
}
define service {
host_name ubuntu_host
service_description Check Users
check_command check_local_users!20!50
max_check_attempts 2
check_interval 2
retry_interval 2
check_period 24x7
check_freshness 1
contact_groups admins
notification_interval 2
notification_period 24x7
notifications_enabled 1
register 1
}
define service {
host_name ubuntu_host
service_description Local Disk
check_command check_local_disk!20%!10%!/
max_check_attempts 2
check_interval 2
retry_interval 2
check_period 24x7
check_freshness 1
groups admins
notification_interval 2
notification_period 24x7
notifications_enabled 1
register 1
}
define service {
host_name ubuntu_host
service_description Check SSH
check_command check_ssh
max_check_attempts 2
check_interval 2
retry_interval 2
check_period 24x7
check_freshness 1
contact_groups admins
notification_interval 2
notification_period 24x7
notifications_enabled 1
register 1
}
define service {
host_name ubuntu_host
service_description Total Process
check_command check_local_procs!250!400!RSZDT
max_check_attempts 2
check_interval 2
retry_interval 2
check_period 24x7
check_freshness 1
contact_groups admins
notification_interval 2
notification_period 24x7
notifications_enabled 1
register 1
}
Step 3 − 运行以下命令显示的命令以验证配置文件。
Step 3 − Run the command shown below for the verification of configuration file.
sudo /usr/local/nagios/bin/nagios -v /usr/local/nagios/etc/nagios.cfg
data:image/s3,"s3://crabby-images/3d206/3d206f816b11f9f6e97a196fa5d8a2e68c1476cf" alt="verification configuration"
Step 4 − 如果没有错误,请重启 NRPE、Apache 和 Nagios。
Step 4 − Restart NRPE, Apache and Nagios if there are no errors.
service nagios-nrpe-server restart
service apache2 restart
service nagios restart
Step 5 − 打开浏览器并访问 Nagios Web 界面。你可以看到需要监测的主机已被添加到 Nagios 核心服务中。同样,你可以添加更多的主机供 Nagios 监测。
Step 5 − Open your browser and go to Nagios web interface. You can see the host which needs to be monitored has been added to Nagios core service. Similarly, you can add more hosts to be monitored by Nagios.
data:image/s3,"s3://crabby-images/9e2ad/9e2adb31b352adfd95883ee892229e0c689d8e49" alt="nagios web interface"