Hibrid configuration, active and passive(syslog-ng) and forkedprocesses not terminating

Marcel Mitsuto Fucatu Sugano msugano at uolinc.com
Fri Jun 23 18:36:12 CEST 2006


Sorry about this, but...

On Fri, 2006-06-23 at 13:01 -0300, Marcel Mitsuto Fucatu Sugano wrote:
> Is there someone having problems with Nagios 2.4 forking processes that
> simply does not terminate in a hibrid monitoring configuration of some
> active checks and a lot of passive checks?
> 
> It seems that when Nagios forks to execute an active check, the forked
> process did not terminate itself. Therefore it begins to double
> submitting check results to our central Nagios, and messing up with
> services statuses becoming staled.
> 
> Someone have any advice on how to detect forked Nagios processes that
> will mess with central Nagios freshness service checking? 
> 
> Just grepping -v `head -n 1 nagiosl.lock` file isn't enough, as some
> forked process might be actively checking something. But, if that
> process keep itself running, it should be killed. Still don't know why
> the forked process did not terminate itself, but i think it might have
> something to do with syslog-ng, but can't tell for sure.

This is what i'm talking about:

nagios    2223     1  2 13:01 ?        00:00:37 /opt/nagios/bin/nagios -d /opt/nagios/etc/nagios.cfg
nagios   28074     1  0 13:21 ?        00:00:00 /opt/nagios/bin/nagios -d /opt/nagios/etc/nagios.cfg
nagios   30447     1  0 13:22 ?        00:00:00 /opt/nagios/bin/nagios -d /opt/nagios/etc/nagios.cfg
nagios   30592     1  0 13:22 ?        00:00:00 /opt/nagios/bin/nagios -d /opt/nagios/etc/nagios.cfg
nagios   30865  2223  0 13:22 ?        00:00:00 /opt/nagios/bin/nagios -d /opt/nagios/etc/nagios.cfg
nagios   30867 30865  0 13:22 ?        00:00:00 /bin/sh /opt/nagios-2.4/libexec/submit_check_result sensei4-b Cron Daemon OK proc=cron n_procs=1 threads=1 mem_size=240 mem_vsize=1736 mem_r
nagios   30868 30867  0 13:22 ?        00:00:00 /bin/sh /opt/nagios-2.4/libexec/submit_check_result sensei4-b Cron Daemon OK proc=cron n_procs=1 threads=1 mem_size=240 mem_vsize=1736 mem_r
nagios   30869 30867  0 13:22 ?        00:00:00 /bin/sh /opt/nagios-2.4/libexec/submit_check_result sensei4-b Cron Daemon OK proc=cron n_procs=1 threads=1 mem_size=240 mem_vsize=1736 mem_r
[some other time]
nagios    2223     1  3 13:01 ?        00:00:42 /opt/nagios/bin/nagios -d /opt/nagios/etc/nagios.cfg
nagios   15338     1  0 13:25 ?        00:00:00 /opt/nagios/bin/nagios -d /opt/nagios/etc/nagios.cfg
nagios   15661  2223  0 13:25 ?        00:00:00 /opt/nagios/bin/nagios -d /opt/nagios/etc/nagios.cfg
nagios   15663 15661  0 13:25 ?        00:00:00 /bin/sh /opt/nagios-2.4/libexec/submit_check_result sensei18-b Disk Space OK (/ 14 90)(/boot 36 90)(/export 7 90)(/opt 46 90)(/usr 25 90)(/v
nagios   15664 15663  0 13:25 ?        00:00:00 /bin/sh /opt/nagios-2.4/libexec/submit_check_result sensei18-b Disk Space OK (/ 14 90)(/boot 36 90)(/export 7 90)(/opt 46 90)(/usr 25 90)(/v
nagios   15665 15663  0 13:25 ?        00:00:00 /bin/sh /opt/nagios-2.4/libexec/submit_check_result sensei18-b Disk Space OK (/ 14 90)(/boot 36 90)(/export 7 90)(/opt 46 90)(/usr 25 90)(/v
[one minute after a restart]
nagios     373     1  5 13:27 ?        00:00:01 /opt/nagios/bin/nagios -d /opt/nagios/etc/nagios.cfg
nagios    2675   373  0 13:28 ?        00:00:00 /opt/nagios/bin/nagios -d /opt/nagios/etc/nagios.cfg
nagios    2676  2675  0 13:28 ?        00:00:00 /bin/sh /opt/nagios-2.4/libexec/submit_check_result shogun96-c PING OK OK - 172.27.248.195: rta 1.849ms, lost 0%
nagios    2679  2676  0 13:28 ?        00:00:00 /bin/sh /opt/nagios-2.4/libexec/submit_check_result shogun96-c PING OK OK - 172.27.248.195: rta 1.849ms, lost 0%


As you can see, there is something very strange.

The two first, are messages coming from syslog-ng, and the third is
active checking. Stracing pids did not lead to anything conclusive, as
each process is acting within their processing loop.

Will try to recompile Nagios, from the latest cvs.

TIA.


AVISO: A informação contida neste e-mail, bem como em qualquer de seus anexos, é CONFIDENCIAL e destinada ao uso exclusivo do(s) destinatário(s) acima referido(s), podendo conter informações sigilosas e/ou legalmente protegidas. Caso você não seja o destinatário desta mensagem, informamos que qualquer divulgação, distribuição ou cópia deste e-mail e/ou de qualquer de seus anexos é absolutamente proibida. Solicitamos que o remetente seja comunicado imediatamente, respondendo esta mensagem, e que o original desta mensagem e de seus anexos, bem como toda e qualquer cópia e/ou impressão realizada a partir destes, sejam permanentemente apagados e/ou destruídos. Informações adicionais sobre nossa empresa podem ser obtidas no site http://sobre.uol.com.br/.

NOTICE: The information contained in this e-mail and any attachments thereto is CONFIDENTIAL and is intended only for use by the recipient named herein and may contain legally privileged and/or secret information. If you are not the e-mail´s intended recipient, you are hereby notified that any dissemination, distribution or copy of this e-mail, and/or any attachments thereto, is strictly prohibited. Please immediately notify the sender replying to the above mentioned e-mail address, and permanently delete and/or destroy the original and any copy of this e-mail and/or its attachments, as well as any printout thereof. Additional information about our company may be obtained through the site http://www.uol.com.br/ir/.

Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null





More information about the Users mailing list