Multipme nagios process and lost "nagios.cmd"

Gili Lapid GiliL at sodaclub.co.il
Wed Dec 29 08:20:05 CET 2004


Hi All
 
If I do "ps -efw | grep nadios" I can see one or more nagios process.
<snip>
[root at nagios etc]# ps -efw | grep "nagios"
nagios   11458     1  0 16:54 ?        00:00:01 /usr/local/nagios/bin/nagios
-d /usr/local/nagios/etc/nagios.cfg
nagios   20148     1  0 18:03 ?        00:00:00 /usr/local/nagios/bin/nagios
-d /usr/local/nagios/etc/nagios.cfg
nagios   20149 20148  0 18:03 ?        00:00:00
/usr/local/nagios/libexec/check_ping -H 10.1.1.1 -w 100.0,20% -c 500.0,60%
-p
nagios   20150 20149  0 18:03 ?        00:00:00 /bin/ping -n -U -c 5
10.1.1.1
<snap>
They all have 1 in the PPID (The parent ID). some time I can see only one,
but when there are more then one I can see the plugins are running too...
 
Also I do not have the nagios.cmd in the var/rw folder. I read the manual in
this link and did as I told (restart the apache & nagios :-) at the end...),
and nothing... 
http://nagios.sourceforge.net/docs/1_0/commandfile.html
<http://nagios.sourceforge.net/docs/1_0/commandfile.html> 
 
Also, if a plugin have a "time out" status the "Current Attempt" stay at 1
(if let say I have 1 out of 5) and not sending allerts...
The system is up and running for a month or so whiteout any problems,
suddenly this started...
<snip>
[root at nagios etc]# ll /usr/local/nagios/var/
total 23572
drwxrwxr-x    2 nagios   apache       4096 Dec 28 00:00 archives
-rw-rw-r--    1 nagios   nagios          0 Dec 28 16:54 comment.log
-rw-rw-r--    1 nagios   nagios          0 Dec 28 16:54 downtime.log
-rw-rw-r--    1 nagios   apache     417288 Dec 12 14:41 hostperf.log
-rw-r--r--    1 root     root            6 Dec 28 16:54 nagios.lock
-rw-r--r--    1 nagios   nagios     130742 Dec 28 17:54 nagios.log
-rw-r--r--    1 root     apache        122 Nov 29 17:20
perfparse.log.20041129.log
-rw-r--r--    1 root     root          248 Dec 12 16:58
perfparse.log.20041212.log
drwxrws---    2 nagios   nagiocmd     4096 Dec 28 16:06 rw
drwxr-xr-x    5 nagios   apache       4096 Dec 27 16:10 sat
-rw-rw-r--    1 nagios   apache   23478843 Dec 12 14:44 serviceperf.log
-rw-rw-r--    1 nagios   nagios      23583 Dec 28 17:57 status.log
-rw-rw-r--    1 nagios   nagios      17437 Dec 28 17:54 status.sav
[root at nagios etc]# ll /usr/local/nagios/var/rw/
total 0
[root at nagios etc]# cat /etc/group | grep nagiocmd
nagiocmd:x:501:nagios,nobody,apache
[root at nagios etc]# df -h
Filesystem            Size  Used Avail Use% Mounted on
/dev/hdb1             3.8G  1.6G  2.0G  44% /
none                   61M     0   61M   0% /dev/shm
/dev/hdb6             193M  5.0M  178M   3% /tmp
/dev/hdb5             6.7G  4.0G  2.3G  63% /usr
/dev/hdb3             1.9G  868M 1002M  47% /var
<snap>
 
TIA,
Gili
 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://www.monitoring-lists.org/archive/users/attachments/20041229/fa5a3a02/attachment.html>


More information about the Users mailing list