nagios 3 host checks logic problem on some kernels/distros

Thomas Stolle it0a60 at retail-sc.com
Fri Sep 21 14:03:48 CEST 2007


Additional infos 
from Matthias (mess) from german nagios forum. (He has mail problem at the 
moment and can not post by himself) 

*******************************************************************************
running the last cvs changes on our test server I noticed that 
$HOSTADDRESS$
macro intermittently is not resolved correctly. I get numerous messages in 
nagios.log

[1190372305] HOST ALERT: host1;DOWN;SOFT;1;check_icmp: Failed to resolve 
-w
[1190372345] HOST ALERT: host1;UP;SOFT;2;OK - NNN.NNN.NNN.NNN: rta 
0.285ms,
lost 0%

check_icmp is defined like this: $USER1$/check_icmp -H $HOSTADDRESS$ -w 
$ARG1$
-c $ARG2$
********************************************************************************

Best regards
Thomas


P Please consider the environmental impact of needlessly printing this 
e-mail. 




Thomas Stolle <it0a60 at retail-sc.com> 
Sent by: nagios-devel-bounces at lists.sourceforge.net
21.09.2007 12:52
Please respond to
Nagios Developers List <nagios-devel at lists.sourceforge.net>


To
nagios-devel at lists.sourceforge.net
cc
nagios-devel-bounces at lists.sourceforge.net
Subject
Re: [Nagios-devel] nagios 3 host checks logic problem   on      some 
kernels/distros







Dear List 

today I installed the new CVS to get rid of the host check logic problem 
and the high cpu load. 
I can confirm that load is in a normal range now but after installing CVS 
someting with host- and servicechecks went terribly wrong. 
Many checks return a critical result even if the checked system or service 
is up. (I compared with a second nagios server running 2.9. Everything was 
ok there.) 
I executed the checkcommands manual from the commandline and received 
correct values and an OK state, while Nagios said it is critical. 

Caused by this I switched back to nagios 3.0b3. All services and hosts 
returned to a normal state but of course CPU load is high again now. 

Best regards 
Thomas 


P Please consider the environmental impact of needlessly printing this 
e-mail. 



Ethan Galstad <nagios at nagios.org> 
Sent by: nagios-devel-bounces at lists.sourceforge.net 
20.09.2007 23:20 

Please respond to
nagios at nagios.org; Please respond to
Nagios Developers List <nagios-devel at lists.sourceforge.net>


To
Nagios Developers List <nagios-devel at lists.sourceforge.net> 
cc

Subject
Re: [Nagios-devel] nagios 3 host checks logic problem on        some  
kernels/distros








Thanks all - I found the cause of the problem and fixed it.  A patch 
will be in CVS shortly.

Thomas Stolle wrote:
> 
> From: SCHAER Frederic <frederic.schaer <at> cea.fr>
> Subject: *nagios 3 host checks logic problem on some kernels/distros* 
> <
http://news.gmane.org/find-root.php?message_id=%3cEA04FF699CD5274E9EC52CB5EC0508707667A0%40DIODON.extra.cea.fr%3e
>*
> Newsgroups: <http://news.gmane.org/gmane.network.nagios.devel>* 
> <http://news.gmane.org/gmane.network.nagios.devel>*MailScanner has 
> detected a possible fraud attempt from "news.gmane.org" claiming to be* 
> *gmane.network.nagios.devel* 
> <http://news.gmane.org/gmane.network.nagios.devel>*
> Date: 2007-09-10 16:17:30 GMT (1 week, 15 hours and 23 minutes ago) *
> 
> *Hi, *
> 
> *  *
> 
> *I think I identified a problem (but not and the solution) on the nagios 

> 3 source tree? *
> 
> *I tried with both the 3.0b3 and cvs HEAD source files and could not get 

> rid of the problem. *
> 
> *I?m running a 2.4.21 kernel on a RHEL3 box. *
> 
> *  *
> 
> *What happens is that as soon as I start nagios 3, it starts eating all 
> of the *CPU*. *
> 
[snip]
> 
> *I have 53 hosts defined, I don?t understand why nagios is checking ever 

> and ever the same host? and why this is not happening on all systems. *
> 
> *  *
> 
> *De-activating host checks magically ?solves? the problem. *
> 
> *  *
> 
> *I just found out that commenting hosts ?check_command? caused this 
> behaviour (with host_checks_enabled=true), and that defining a correct 
> check_command prevented nagios from being so *CPU* hungry? *
> 
> *  *
> 
> *Hope I helped? *
> 
> *  *
> 
> *Cheers *
> 
> 
> 
> Dear List,
> 
> I can confirm the problem Frederic reported.
> I am using Nagios 3.0b3 on CentOS 4.4
> After starting nagios, the process catches nearly 100 % CPU (See 
> top-output  below)
> Disableing hostchecks let the process return to normal values.
> As far as I can remember, the problem did not occour with nagios3.0a 
> (but I can not verify at the moment)
> 
> Tasks:  89 total,   3 running,  86 sleeping,   0 stopped,   0 zombie
> Cpu(s): 26.0% us,  1.3% sy,  0.0% ni, 72.6% id,  0.0% wa,  0.1% hi, 
>  0.0% si
> Mem:   4041580k total,  1373844k used,  2667736k free,    60200k buffers
> Swap:  4192956k total,        0k used,  4192956k free,  1137348k cached
> 
>   PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND
> 28617 nagios    25   0 29756  10m 1056 R   96  0.3  17:12.48 nagios
>     1 root      16   0  4752  552  460 S    0  0.0   0:02.75 init
>     2 root      RT   0     0    0    0 S    0  0.0   0:00.04 migration/0
> 
> 
> Thomas
> 
> 
> P *Please consider the environmental impact of needlessly printing this 
> e-mail.*
> 


Ethan Galstad,
Nagios Developer
---
Email: nagios at nagios.org
Website: http://www.nagios.org

-------------------------------------------------------------------------
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2005.
http://clk.atdmt.com/MRT/go/vse0120000070mrt/direct/01/
_______________________________________________
Nagios-devel mailing list
Nagios-devel at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-devel

RSC Commercial Services OHG Wanheimer Straße 70, D-40468 Düsseldorf 
Registergericht: Düsseldorf, HRA 12655 
-------------------------------------------------------------------------
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2005.
http://clk.atdmt.com/MRT/go/vse0120000070mrt/direct/01/
_______________________________________________
Nagios-devel mailing list
Nagios-devel at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-devel


--
RSC Commercial Services OHG
Wanheimer Strasse 70, D-40468 Duesseldorf
Registergericht: Duesseldorf, HRA 12655

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://www.monitoring-lists.org/archive/developers/attachments/20070921/db6d95eb/attachment.html>
-------------- next part --------------
-------------------------------------------------------------------------
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2005.
http://clk.atdmt.com/MRT/go/vse0120000070mrt/direct/01/
-------------- next part --------------
_______________________________________________
Nagios-devel mailing list
Nagios-devel at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-devel


More information about the Developers mailing list