Ndo2db hanging after 2-3 weeks

Hendrik Bäcker andurin at process-zero.de
Wed Nov 14 19:10:31 CET 2007


Hi together,

if I remember correctly, Ethan told me on the conference that the hole
NDO stuff has a blocking character.
So, every action from the core over the ndomod speaking to ndo2db which
es talking to the database is blocking.

It might be (I'm just guessing) that the first step

core --> ndomod is not blocking cause of the buffered writing to the module.

Nevertheless, IIRC Ethan told me that this behavior should be fixed
sometimes after the release of Nagios 3.

-
Hendrik

Michael Lübben schrieb:
> Hi Alex,
> 
> we have same Problem. In the MySQL-Database, we can see that a delete-Statemant in the database more as 10-20 sec. used. I think that the problem! The ndo is 2-3 Weeks ok and when the NDO dropped data that older then 2 weeks (configured in the config-file) than nagios hang. In that time nagios doesn't make checks. When nagios hang, we have more then 600 forked childprocesse from Nagios.
> 
> We have the same workaround ;-)
> 
> Bye
> Michael
> 
> P.S.: One user in the german nagios-portal post the same problem.
> 
> -----Ursprüngliche Nachricht-----
> Von: Nagios Developers List <nagios-devel at lists.sourceforge.net>
> Gesendet: 14.11.07 16:51:37
> An: Nagios Developers List <nagios-devel at lists.sourceforge.net>
> Betreff: [Nagios-devel] Ndo2db hanging after 2-3 weeks
> 
> 
> 
> I am using Nagios 2.10 and ndoutils-1.4b6 with Nagvis.  Every 2-3 weeks 
> Nagvis reports:
> 
> 'NDO claims that Nagios did no status Update for more than "180" 
> seconds.  Make sure that Nagios and ndo2db are running.'
> 
> If I attempt to stop Nagios, it will not stop properly:
> 
> # service nagios stop
> Stopping nagios: ..........
> Warning - nagios did not exit in a timely manner
> # ps ax | grep nagios
> 18178 ?        Ss     0:00 /usr/local/nagios/bin/ndo2db -c 
> /usr/local/nagios/etc/ndo2db.cfg
> 28978 ?        Ssl   40:55 /usr/local/nagios/bin/nagios -d 
> /usr/local/nagios/etc/nagios.cfg
> 27550 ?        S      0:58 /usr/local/nagios/bin/ndo2db -c 
> /usr/local/nagios/etc/ndo2db.cfg
> 
> If I kill the processes and restart nagios and ndo, Nagvis still does 
> not work.  It doesn't appear to be a problem with Nagvis as ndo2db 
> appears to hang.
> 
> What I have done to get around the problem is to drop the nagios 
> database and re-create it using the ndo installdb script.  After that, 
> everything works again for 2-3 weeks.  I have done this three times so 
> far.  I have also seen this with Nagios 2.8 / ndoutils-1.4b6.  I am 
> using RHEL4 64bit.
> 
> The nagios database was 287MB, and the previous time it was 273MB.
> 
> I have copies of the old databases so I can do some checks if needed.  I 
> have included the output of mysqlshow --status nagios.
> 
> Would any of the fixes in ndoutils-1.4b7 help?  I will give it a try, 
> but it will be 2-4 weeks before I can report back the results.
> 
> Alex
> 
> 
> 
> <hr>
> -------------------------------------------------------------------------
> This SF.net email is sponsored by: Splunk Inc.
> Still grepping through log files to find problems?  Stop.
> Now Search log events and configuration files using AJAX and a browser.
> Download your FREE copy of Splunk now >> http://get.splunk.com/
> 
> <hr>
> _______________________________________________
> Nagios-devel mailing list
> Nagios-devel at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-devel
> 
> 
> 
> 
> _____________________________________________________________________
> Der WEB.DE SmartSurfer hilft bis zu 70% Ihrer Onlinekosten zu sparen!
> http://smartsurfer.web.de/?mc=100071&distributionid=000000000066
> 
> 
> -------------------------------------------------------------------------
> This SF.net email is sponsored by: Splunk Inc.
> Still grepping through log files to find problems?  Stop.
> Now Search log events and configuration files using AJAX and a browser.
> Download your FREE copy of Splunk now >> http://get.splunk.com/
> _______________________________________________
> Nagios-devel mailing list
> Nagios-devel at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-devel
> 

-------------------------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems?  Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now >> http://get.splunk.com/




More information about the Developers mailing list