Multiple Nagios processes

Brian Sudis SudisB at crlcorp.com
Thu Sep 15 15:34:33 CEST 2005


I've had Nagios 1.2 running for over a year now and have never seen this
problem before.

Currently running on RH ES 3 (2.4.21-32.0.1ELsmp).

Sometime in during the night the web interface started reports the
warning that monitoring processes may not be running.

Process info page reports the process status as warning and check
command output states
"Nagios problem: located 4 processes, status log updated 1126790135
seconds ago"

I have reviewed F0021, and F0123 and neither seem to apply. 

I've validated the config (nagios -v nagios.cfg) and it passes
correctly.

The number of located processes changes.

Here is a quick look at nagios owned processes.  Obviously there is more
than one nagios daemon running, which is a bad thing.  I've stopped,
check that they stopped, and started nagios and it reverts back to this
each time.  It looks like a new nagios daemon is being spawned for each
attempted check.

Here is thee successive looks at process status.

nagios   10720     1  0 08:02 ?        00:00:03 /usr/bin/nagios -d
/etc/nagios/nagios.cfg
nagios   13898     1  0 08:20 ?        00:00:00 /usr/bin/nagios -d
/etc/nagios/nagios.cfg
nagios   13899 13898  0 08:20 ?        00:00:00
/usr/libexec/nagios/check_ping -H sassvr1.crlcorp.com -w 100.0,20% -c
500.0,60% -p 5
nagios   13900 13899  0 08:20 ?        00:00:00 /bin/ping -n -U -c 5
sassvr1.crlcorp.com

Next process check.

nagios   10720     1  0 08:02 ?        00:00:03 /usr/bin/nagios -d
/etc/nagios/nagios.cfg
nagios   14101     1  0 08:21 ?        00:00:00 /usr/bin/nagios -d
/etc/nagios/nagios.cfg
nagios   14102 14101  0 08:21 ?        00:00:00
/usr/libexec/nagios/check_ping -H devsys.crlcorp.com -w 100.0,20% -c
500.0,60% -p 5
nagios   14103 14102  0 08:21 ?        00:00:00 /bin/ping -n -U -c 5
devsys.crlcorp.com

Next process check.

nagios   10720     1  0 08:02 ?        00:00:03 /usr/bin/nagios -d
/etc/nagios/nagios.cfg
nagios   14148     1  0 08:21 ?        00:00:00 /usr/bin/nagios -d
/etc/nagios/nagios.cfg
nagios   14149 14148  0 08:21 ?        00:00:00
/usr/libexec/nagios/check_ping -H ghostsvr.crlcorp.com -w 100.0,20% -c
500.0,60% -p 5
nagios   14150 14149  0 08:21 ?        00:00:00 /bin/ping -n -U -c 5
ghostsvr.crlcorp.com


The nagios.log file appears to be updating correctly. (At least it is
updating.)


One item of intrigue seems to be that on the program info page it shows
program start time as 12-31-1969 18:00:00!

System date and time are correct and ntp is running.


Any suggestions?

Thanks,

Brian


-------------------------------------------------------
SF.Net email is sponsored by:
Tame your development challenges with Apache's Geronimo App Server. 
Download it for free - -and be entered to win a 42" plasma tv or your very
own Sony(tm)PSP.  Click here to play: http://sourceforge.net/geronimo.php
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null





More information about the Users mailing list