Nagios process segfaulting, wedging

Dustin J. Mitchell dustin at v.igoro.us
Thu Oct 19 17:12:43 CEST 2006


I'm afraid I don't have a lot of detail on this problem yet.  On two occasions
about a week apart, my nagios process has wedged itself.  I'm using nagios-2.5
on amd64, with a Gentoo install.  Its log contains (sanitized; these were all
different hosts/services):

Oct  6 00:00:00 [nagios] CURRENT SERVICE STATE:
some-host;some-service;OK;HARD;1;HTTP OK HTTP/1.1 200 OK - 0.568 second
response time_
Oct  6 00:00:00 [nagios] Caught SIGSEGV, shutting down..._

in one case (this was at midnight, directly after the logfile rotation), and

[1160714430] EXTERNAL COMMAND:
PROCESS_SERVICE_CHECK_RESULT;some-host;some-service;0;Blah blah
[1160714480] EXTERNAL COMMAND:
PROCESS_SERVICE_CHECK_RESULT;some-host;some-service;0;Blah blah
[1160714481] Caught SIGSEGV, shutting down...

(this was *not* at midnight, so no fair blaming the logfile rotation)

This seemed to start after I implemented a number of service checks for
cronjobs; these are implemented as "OK" reports delivered via NSCA on
successful completion of the cronjob, and a freshness check set to some small
multiple of the cronjob frequency.  Some of these cronjobs run every fifteen
minutes on a half-dozen hosts, so it's a small but non-trivial amount of traffic.

Based on mailing list archives, I tried bombarding nagios with nsca requests
and watching its memory consumption -- it hovered at a reasonable number.  I
put nagios in debugging mode, which unfortunately makes it fairly unusable for
actual monitoring, so I can't leave it in that mode for a week.  I could not
replicate the crash.

My questions, then, are:
 * is there a known bug that could be causing this?
 * is there anything I can do to help track down what might be causing this
next time it happens?

Thanks!

Dustin

-------------------------------------------------------------------------
Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null





More information about the Users mailing list