bizarre Nagios 2.12 memory leak

Rick Mangus rick.mangus+nagios at gmail.com
Fri Apr 16 15:43:47 CEST 2010


Have you checked where your memory is being used?  I had a similar
problem, and found I had 30k+ processes of nsca eating swap and PIDs.
The system would die one of two ways:  OOM or unable to spawn new
processes due to lack of PIDs.

In my case, it turned out that processing perfdata could block due to
database problems, causing nagios to bog down and nsca processes to
back up in a major way.  Finding and fixing that was ... special.

Anyway, running out of swap is good to know, but is nagios using 16GB
of RAM?  Or is it disappearing elsewhere?

Good Luck

--Rick

On Thu, Apr 15, 2010 at 4:13 PM, Andreas Ericsson <ae at op5.se> wrote:
> On 04/15/2010 05:24 PM, Jeremy wrote:
>>
>> I know I really should get around to upgrading to Nagios 3.x but no time for
>> that yet and it's going to be a pain to upgrade them all at once without
>> being blind for a little bit, so pretend Nagios 3.x isn't an option just
>> yet.
>>
>
> The truth of the matter though is that noone really cares about fixing a
> problem in 2.12 unless it's also a problem in 3.2.1, and especially if
> it's a bug as hard to debug as this one. Insofar as I know, configuration
> files are compatible between those two revisions, so you could just use
> 3.2.1 as a drop-in replacement for 2.12.
>
> Some minor things have to be changed in nagios.cfg (and possibly cgi.cfg),
> but the bulk of the configuration should be ok the way it is.
>
> Since this is an otherwise intermittent error which may well depend on other
> variables (such as pthread library version, glibc library version or any
> other system library), it's well nigh impossible to debug without having
> you run Nagios through valgrind until it crashes due to lack of memory.
>
> Rest assured that that will keep your Nagios running crippled longer than
> an upgrade would
>
> If the problem persists with Nagios 3.2.1, you should look into upgrading
> the rest of your system. If that doesn't help either, it's time to report
> it as a bug.
>
> --
> Andreas Ericsson                   andreas.ericsson at op5.se
> OP5 AB                             www.op5.se
> Tel: +46 8-230225                  Fax: +46 8-230231
>
> Considering the successes of the wars on alcohol, poverty, drugs and
> terror, I think we should give some serious thought to declaring war
> on peace.
>
> ------------------------------------------------------------------------------
> Download Intel® Parallel Studio Eval
> Try the new software tools for yourself. Speed compiling, find bugs
> proactively, and fine-tune applications for parallel performance.
> See why Intel Parallel Studio got high marks during beta.
> http://p.sf.net/sfu/intel-sw-dev
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
> ::: Messages without supporting info will risk being sent to /dev/null
>

------------------------------------------------------------------------------
Download Intel® Parallel Studio Eval
Try the new software tools for yourself. Speed compiling, find bugs
proactively, and fine-tune applications for parallel performance.
See why Intel Parallel Studio got high marks during beta.
http://p.sf.net/sfu/intel-sw-dev
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null





More information about the Users mailing list