3.0b6 crashes with SIGSEGV

Boeckman, Matthew mboeckman at servicemagic.com
Tue Nov 6 18:26:40 CET 2007


 

> -----Original Message-----
> From: nagios-devel-bounces at lists.sourceforge.net 
> [mailto:nagios-devel-bounces at lists.sourceforge.net] On Behalf 
> Of Ethan Galstad
> Sent: Tuesday, November 06, 2007 10:19 AM
> To: Nagios Developers List
> Subject: Re: [Nagios-devel] 3.0b6 crashes with SIGSEGV
> 
> Boeckman, Matthew wrote:
> > Just upgraded to 3.0b6 from what was a stable 3.0b3 install. Check of
> > the config files comes back fine, but when starting the daemon I get
> > after a few moments:
> > 
> > Caught SIGSEGV, shutting down...
> > 
> > I have debug set pretty high but do not see anything jumping out at me
> > from the logs as a cause (brief snippet of nagios.debug final few lines
> > below) - is there a known issue, or anything more I can provide? I
> > will probably roll back to 3.0b3 shortly.
> > 
> > -Matthew
> > 
> > Nagios.debug:
> > 
> > [1194366930.175483] [2048.0] [pid=28012] UNHANDLED MACRO #149! THIS IS A BUG!
> > [1194366930.175499] [2048.0] [pid=28012] UNHANDLED MACRO #150! THIS IS A BUG!
> > [1194366930.175512] [2048.0] [pid=28012] UNHANDLED MACRO #151! THIS IS A BUG!
> > [1194366930.175526] [2048.0] [pid=28012] UNHANDLED MACRO #152! THIS IS A BUG!
> 
> 
> Well, this is one bug.  :-)  I'll get this patched in CVS in a moment.
> 
> 
> > [1194366930.184171] [016.2] [pid=28012] Moving temp check result file '/usr/local/nagios/var/spool/checkresults/checkZWY3uz' to queue file '/usr/local/nagios/var/spool/checkresults/cRAAfPr'...
> > [1194366930.192109] [016.2] [pid=28011] Moving temp check result file '/usr/local/nagios/var/spool/checkresults/checkbx9gcH' to queue file '/usr/local/nagios/var/spool/checkresults/cNf2sxz'...
> > [1194366930.424821] [008.1] [pid=27982] ** Event Check Loop
> > [1194366930.424845] [008.1] [pid=27982] Next High Priority Event Time: Tue Nov  6 09:35:31 2007
> > [1194366930.424858] [008.1] [pid=27982] Next Low Priority Event Time: Tue Nov  6 09:35:31 2007
> > [1194366930.424867] [008.1] [pid=27982] Current/Max Service Checks: 7/0
> > [1194366930.424888] [008.2] [pid=27982] No events to execute at the moment.  Idling for a bit...
> > [1194366930.424897] [001.0] [pid=27982] check_for_external_commands()
> > [1194366930.424909] [064.1] [pid=27982] Making callbacks (type 8)...
> > [1194366930.677126] [008.1] [pid=27982] ** Event Check Loop
> > [1194366930.677142] [008.1] [pid=27982] Next High Priority Event Time: Tue Nov  6 09:35:31 2007
> > [1194366930.677154] [008.1] [pid=27982] Next Low Priority Event Time: Tue Nov  6 09:35:31 2007
> > [1194366930.677162] [008.1] [pid=27982] Current/Max Service Checks: 7/0
> > [1194366930.677172] [008.2] [pid=27982] No events to execute at the moment.  Idling for a bit...
> > [1194366930.677181] [001.0] [pid=27982] check_for_external_commands()
> > [1194366930.677191] [064.1] [pid=27982] Making callbacks (type 8)...
> > [1194366930.928492] [008.1] [pid=27982] ** Event Check Loop
> > [1194366930.928509] [008.1] [pid=27982] Next High Priority Event Time: Tue Nov  6 09:35:31 2007
> > [1194366930.928521] [008.1] [pid=27982] Next Low Priority Event Time: Tue Nov  6 09:35:31 2007
> > [1194366930.928530] [008.1] [pid=27982] Current/Max Service Checks: 7/0
> > [1194366930.928540] [008.2] [pid=27982] No events to execute at the moment.  Idling for a bit...
> > [1194366930.928548] [001.0] [pid=27982] check_for_external_commands()
> > [1194366930.928559] [064.1] [pid=27982] Making callbacks (type 8)...
> > [1194366931.179900] [008.1] [pid=27982] ** Event Check Loop
> > [1194366931.179941] [008.1] [pid=27982] Next High Priority Event Time: Tue Nov  6 09:35:31 2007
> > [1194366931.179954] [008.1] [pid=27982] Next Low Priority Event Time: Tue Nov  6 09:35:31 2007
> > [1194366931.179963] [008.1] [pid=27982] Current/Max Service Checks: 7/0
> > [1194366931.179975] [001.0] [pid=27982] handle_timed_event() start
> > [1194366931.179985] [064.1] [pid=27982] Making callbacks (type 8)...
> > [1194366931.179996] [008.0] [pid=27982] ** Timed Event ** Type: 5, Run Time: Tue Nov  6 09:35:31 2007
> > [1194366931.180005] [008.0] [pid=27982] ** Check Result Reaper
> > [1194366931.180013] [001.0] [pid=27982] reap_check_results() start
> > [1194366931.180021] [016.0] [pid=27982] Starting to reap check results.
> > [1194366931.180197] [064.1] [pid=27982] Making callbacks (type 9)...
> > 
> 
> Can you run the unstripped nagios binary under gdb and do a 
> backtrace when it segfaults?  That will help track down the 
> source of the segfault.
> 
> 
> cd <nagios-src>/base
> gdb --args ./nagios /usr/local/nagios/etc/nagios.cfg
> r
> 
> When it segfaults, do a backtrace...
> 
> bt
> 
> and send the output.
> 

Sure can!

Warning: Host 'xxxxx' has no services associated with it!
[New Thread -1208861808 (LWP 3432)]

Program received signal SIGSEGV, Segmentation fault.
[Switching to Thread -1208858416 (LWP 3429)]
0x08082ddb in process_check_result_queue (dirname=0x9994090 "/usr/local/nagios/var/spool/checkresults")
    at utils.c:2195
2195            while((dirfile=readdir(dirp))!=NULL){
(gdb) bt
#0  0x08082ddb in process_check_result_queue (
    dirname=0x9994090 "/usr/local/nagios/var/spool/checkresults") at utils.c:2195
#1  0x08060670 in reap_check_results () at checks.c:145
#2  0x0806e470 in handle_timed_event (event=0x99b95a0) at events.c:1238
#3  0x0806eaf8 in event_execution_loop () at events.c:944
#4  0x0805629c in main (argc=Cannot access memory at address 0x1d8
) at nagios.c:779
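
For reference, frame #0 faults on the readdir() loop at utils.c:2195. The backtrace does not show the value of dirp, so the cause is not established here. One common way for that line to crash is calling readdir() on a DIR pointer that is NULL or no longer valid. The sketch below is not the Nagios source; count_queue_files() is a hypothetical helper that only illustrates the same loop shape with the opendir() result checked first.

```c
#include <dirent.h>
#include <errno.h>
#include <stdio.h>
#include <string.h>

/* Hypothetical helper (not Nagios code): count the entries in dirname,
 * skipping "." and "..". Returns -1 if the directory cannot be opened. */
int count_queue_files(const char *dirname)
{
    DIR *dirp;
    struct dirent *dirfile;
    int count = 0;

    if (dirname == NULL)
        return -1;

    dirp = opendir(dirname);
    if (dirp == NULL) {
        /* Passing a NULL DIR* to readdir() is undefined behaviour and
         * usually ends in a segfault, so bail out here instead. */
        fprintf(stderr, "opendir(%s) failed: %s\n", dirname, strerror(errno));
        return -1;
    }

    /* Same loop shape as the faulting line in the backtrace. */
    while ((dirfile = readdir(dirp)) != NULL) {
        if (strcmp(dirfile->d_name, ".") == 0 ||
            strcmp(dirfile->d_name, "..") == 0)
            continue;
        count++;
    }

    closedir(dirp);
    return count;
}
```

Calling count_queue_files("/usr/local/nagios/var/spool/checkresults") would return the number of entries in the spool directory, or -1 with a message on stderr if the directory cannot be opened.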

> 
> 
> Ethan Galstad
> Nagios Developer
> ___
> Email: nagios at nagios.org
> Web:   www.nagios.org
> 
> --------------------------------------------------------------
> -----------
> This SF.net email is sponsored by: Splunk Inc.
> Still grepping through log files to find problems?  Stop.
> Now Search log events and configuration files using AJAX and 
> a browser.
> Download your FREE copy of Splunk now >> http://get.splunk.com/
> _______________________________________________
> Nagios-devel mailing list
> Nagios-devel at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-devel
> 
