作為L(zhǎng)inux系統(tǒng)管理員,系統(tǒng)的狀態(tài)不可能隨時(shí)查看,流量異常,負(fù)載突然增高,可能引起事故的發(fā)生。通過(guò)監(jiān)控軟件可以自動(dòng)監(jiān)控系統(tǒng)狀態(tài),發(fā)現(xiàn)異常就會(huì)報(bào)警,也可通過(guò)腳本監(jiān)控。
創(chuàng)新互聯(lián)建站是一家集網(wǎng)站建設(shè),郴州企業(yè)網(wǎng)站建設(shè),郴州品牌網(wǎng)站建設(shè),網(wǎng)站定制,郴州網(wǎng)站建設(shè)報(bào)價(jià),網(wǎng)絡(luò)營(yíng)銷,網(wǎng)絡(luò)優(yōu)化,郴州網(wǎng)站推廣為一體的創(chuàng)新建站企業(yè),幫助傳統(tǒng)企業(yè)提升企業(yè)形象加強(qiáng)企業(yè)競(jìng)爭(zhēng)力??沙浞譂M足這一群體相比中小企業(yè)更為豐富、高端、多元的互聯(lián)網(wǎng)需求。同時(shí)我們時(shí)刻保持專業(yè)、時(shí)尚、前沿,時(shí)刻以成就客戶成長(zhǎng)自我,堅(jiān)持不斷學(xué)習(xí)、思考、沉淀、凈化自己,讓我們?yōu)楦嗟钠髽I(yè)打造出實(shí)用型網(wǎng)站。1、Nagios
是開源軟件,免費(fèi)使用,可以監(jiān)控主機(jī)狀態(tài),windows、linux、打印機(jī)都可以使用。
基于web界面,登錄網(wǎng)站查看各項(xiàng)指標(biāo)。
支持短信或郵件通知。
可以自定義腳本實(shí)現(xiàn)自定義化監(jiān)控。
官網(wǎng)www.nagios.org
2、安裝nagios服務(wù)端
分為服務(wù)端(centos6.4 192.168.0.105)和客戶端(centos6.4 192.168.0.104)??蛻舳吮O(jiān)控主機(jī)的狀態(tài),數(shù)據(jù)上報(bào)給服務(wù)端,服務(wù)端去處理數(shù)據(jù)。
安裝擴(kuò)展源
[root@client ~]# rpm -ivh http://www.lishiming.net/data/p_w_upload/forum/month_1211/epel-release-6-7.noarch.rpm //有些系統(tǒng)默認(rèn)已經(jīng)安裝
安裝nagios
[root@client ~]# yum install -y httpd nagios nagios-plugins nagios-plugins-all nrpe nagios-plugins-nrpe
設(shè)置http登錄密碼
[root@client ~]# htpasswd -c /etc/nagios/passwd nagiosadmin New password: Re-type new password: Adding password for user nagiosadmin
查看配置文件
[root@client ~]# vim /etc/nagios/nagios.cfg
檢測(cè)配置文件是否出錯(cuò)
[root@client ~]# nagios -v /etc/nagios/nagios.cfg
啟動(dòng)服務(wù)
[root@client ~]# service httpd start Starting httpd: httpd: apr_sockaddr_info_get() failed for client httpd: Could not reliably determine the server's fully qualified domain name, using 127.0.0.1 for ServerName [ OK ] [root@client ~]# service nagios start Starting nagios: done.IEshang
瀏覽器上訪問(wèn):http://192.168.0.105/nagios
3、安裝nagios客戶端
安裝擴(kuò)展源
[root@localhost ~]# rpm -ivh http://www.lishiming.net/data/p_w_upload/forum/month_1211/epel-release-6-7.noarch.rpm
安裝nagios
[root@localhost ~]# yum install -y nagios-plugins nagios-plugins-all nrpe nagios-plugins-nrpe
修改配置文件
[root@localhost ~]# vim /etc/nagios/nrpe.cfg allowed_hosts=127.0.0.1,192.168.0.105 ont_blame_nrpe=1
啟動(dòng)客戶端
[root@localhost ~]# /etc/init.d/nrpe start Starting nrpe: [ OK ]
4、監(jiān)控中心添加被監(jiān)控主機(jī)(服務(wù)端)
[root@client ~]# cd /etc/nagios/conf.d/ [root@client conf.d]# vim 192.168.0.104.cfg //客戶端IP命令 define host{ use linux-server ; Name of host template to use ; This host definition will inherit all variables that are defined ; in (or inherited by) the linux-server host template definition. host_name 192.168.0.104 alias 0.12 address 192.168.0.104 } define service{ use generic-service host_name 192.168.0.104 service_description check_ping check_command check_ping!100.0,20%!200.0,50% max_check_attempts 5 normal_check_interval 1 } //監(jiān)控ping服務(wù) define service{ use generic-service host_name 192.168.0.104 service_description check_ssh check_command check_ssh max_check_attempts 5 normal_check_interval 1 } //監(jiān)控ssh服務(wù) define service{ use generic-service host_name 192.168.0.104 service_description check_http check_command check_http max_check_attempts 5 normal_check_interval 1 } //監(jiān)控http服務(wù)
[root@client ~]# cd /etc/nagios/conf.d/ [root@client conf.d]# vim 192.168.0.104.cfg define host{ use linux-server //Name of host template to use //This host definition will inherit all variables that are defined //in (or inherited by) the linux-server host template definition. host_name 192.168.0.12 alias 0.12 address 192.168.0.12 } define service{ use generic-service host_name 192.168.0.12 service_description check_ping check_command check_ping!100.0,20%!200.0,50% max_check_attempts 5 //遇到問(wèn)題,檢測(cè)5次在報(bào)警 normal_check_interval 1 //重新檢測(cè)時(shí)間間隔,1分鐘 notification_interval 60 //服務(wù)出現(xiàn)異常后,故障一直沒解決,對(duì)使用者隔60分鐘發(fā)出通知 } //監(jiān)控ping服務(wù) define service{ use generic-service host_name 192.168.0.12 service_description check_ssh check_command check_ssh max_check_attempts 5 normal_check_interval 1 } //監(jiān)控ssh服務(wù) define service{ use generic-service host_name 192.168.0.12 service_description check_http check_command check_http max_check_attempts 5 normal_check_interval 1 } //監(jiān)控http服務(wù)
5、實(shí)現(xiàn)監(jiān)控遠(yuǎn)程的服務(wù)(服務(wù)端)
[root@client conf.d]# vim /etc/nagios/objects/commands.cfg //添加下面的語(yǔ)句 define command{ command_name check_nrpe command_line $USER1$/check_nrpe -H $HOSTADDRESS$ -c $ARG1$ } [root@client conf.d]# vim /etc/nagios/conf.d/192.168.0.104.cfg define service{ use generic-service host_name 192.168.0.105 service_description check_load check_command check_nrpe!check_load //check_load是遠(yuǎn)程主機(jī)上的檢測(cè)腳本 max_check_attempts 5 normal_check_interval 1 } define service{ use generic-service host_name 192.168.0.105 service_description check_disk_hda1 check_command check_nrpe!check_hda1 max_check_attempts 5 normal_check_interval 1 } define service{ use generic-service host_name 192.168.0.105 service_description check_disk_hda2 check_command check_nrpe!check_hda2 max_check_attempts 5 normal_check_interval 1 }
查看check_load(客戶端)
[root@localhost ~]# vim /etc/nagios/nrpe.cfg command[check_users]=/usr/lib/nagios/plugins/check_users -w 5 -c 10 command[check_load]=/usr/lib/nagios/plugins/check_load -w 15,10,5 -c 30,25,20 command[check_hda1]=/usr/lib/nagios/plugins/check_disk -w 20% -c 10% -p /dev/hda1 //hda1修改成sda1 //剩余20%就報(bào)警,剩余10%報(bào)紅色警 command[check_zombie_procs]=/usr/lib/nagios/plugins/check_procs -w 5 -c 10 -s Z command[check_total_procs]=/usr/lib/nagios/plugins/check_procs -w 150 -c 200
[root@localhost ~]# /usr/lib/nagios/plugins/check_disk -w 20% -c 10% -p /dev/sda1 DISK OK - free space: /boot 429 MB (93% inode=99%);| /boot=29MB;387;435;0;484 //可以自定義寫監(jiān)控腳本,只要產(chǎn)生的結(jié)果格式是一致的
定義一個(gè)check_hda2
[root@localhost ~]# vim /etc/nagios/nrpe.cfg //客戶端 //添加下面一句 command[check_hda2]=/usr/lib/nagios/plugins/check_disk -w 20% -c 10% -p /dev/sda2
客戶端、服務(wù)端重啟服務(wù),刷新頁(yè)面測(cè)試
查看監(jiān)控服務(wù)
[root@client conf.d]# ls /usr/lib/nagios/plugins/ check_breeze check_icmp check_nrpe check_smtp check_by_ssh check_ide_smart check_nt check_snmp check_clamd check_imap check_ntp check_spop check_cluster check_ircd check_ntp_peer check_ssh check_dhcp check_jabber check_ntp.pl check_ssmtp check_dig check_ldap check_ntp_time check_swap check_disk check_ldaps check_nwstat check_tcp check_disk_smb check_load check_oracle check_time check_dns check_log check_overcr check_udp check_dummy check_mailq check_pgsql check_ups check_file_age check_mrtg check_ping check_users check_flexlm check_mrtgtraf check_pop check_wave check_fping check_mysql check_procs eventhandlers check_ftp check_mysql_query check_real negate check_game check_nagios check_rpc urlize check_hpjd check_nntp check_sensors utils.pm check_http check_nntps check_simap utils.sh //監(jiān)控命令,大多是二進(jìn)制文件
6、配置郵件告警
[root@client conf.d]# vim /etc/nagios/objects/contacts.cfg define contact{ contact_name 123 //自定義名字 use generic-contact //模板 alias aming email zhouyan_wy@163.com //郵件 }
自定義告警策略
notifications_enabled //是否開啟提醒功能。1為開啟,0為禁用。一般,這個(gè)選項(xiàng)會(huì)在主配置文件(nagios.cfg)中定義,效果相同。 notification_interval //之前剛介紹過(guò),表示重復(fù)發(fā)送提醒信息的最短間隔時(shí)間。默認(rèn)間隔時(shí)間是60分鐘。如果這個(gè)值設(shè)置為0,將不會(huì)發(fā)送重復(fù)提醒。 notification_period //發(fā)送提醒的時(shí)間段。非常重要的主機(jī)(服務(wù))我定義為7×24,一般的主機(jī)(服務(wù))就定義為上班時(shí)間。如果不在定義的時(shí)間段內(nèi),無(wú)論什么問(wèn)題發(fā)生,都不會(huì)發(fā)送提醒。 notification_options //這個(gè)參數(shù)定義了發(fā)送提醒包括的情況:d = 狀態(tài)為DOWN, u = 狀態(tài)為UNREACHABLE , r = 狀態(tài)恢復(fù)為OK , f = flapping,n=不發(fā)送提醒。
另外有需要云服務(wù)器可以了解下創(chuàng)新互聯(lián)scvps.cn,海內(nèi)外云服務(wù)器15元起步,三天無(wú)理由+7*72小時(shí)售后在線,公司持有idc許可證,提供“云服務(wù)器、裸金屬服務(wù)器、高防服務(wù)器、香港服務(wù)器、美國(guó)服務(wù)器、虛擬主機(jī)、免備案服務(wù)器”等云主機(jī)租用服務(wù)以及企業(yè)上云的綜合解決方案,具有“安全穩(wěn)定、簡(jiǎn)單易用、服務(wù)可用性高、性價(jià)比高”等特點(diǎn)與優(yōu)勢(shì),專為企業(yè)上云打造定制,能夠滿足用戶豐富、多元化的應(yīng)用場(chǎng)景需求。