Nagios 3.0 RC2: Segmentation faults after starting

Christian Schneemann cschneemann at suse.de
Thu Feb 14 11:02:59 CET 2008


On Thursday February 14 2008 10:53:26 am Hendrik Bäcker wrote:
> Hi Christian,
>
> Christian Schneemann schrieb:
> > Hallo,
> > our test Nagios dies after every restart with a segmentation fault since
> > last week.
> > I have tried to find the error with gdb. I hope someone can point me to
> > my error.
> >
> >
> > d139:/etc/nagios/objects/hosts #
> > gdb --args /usr/sbin/nagios /etc/nagios/nagios.cfg
> > GNU gdb 6.6
> > Copyright (C) 2006 Free Software Foundation, Inc.
> > GDB is free software, covered by the GNU General Public License, and you
> > are welcome to change it and/or distribute copies of it under certain
> > conditions. Type "show copying" to see the conditions.
> > There is absolutely no warranty for GDB.  Type "show warranty" for
> > details. This GDB was configured as "x86_64-suse-linux"...
> > (no debugging symbols found)
>
> Please try again with a modified makefile.
I will try it in a few.

>
> You need to 'enable' the CFLAGS used for valgrind in base/Makefile and
> re-compile the binary.
> After that try to run nagios against the binary in base/nagios or do a
>
> make install-unstripped
>
> That should include debug information to the binary and the gdb should
> be more verbose.
>
> > Using host libthread_db library "/lib64/libthread_db.so.1".
> > (gdb) run
>
> May be the gdb command: "set follow-fork-mode child" would be nice here
> but I don't know exactly.
It hasn't changed anything in the output.

>
> > Starting program: /usr/sbin/nagios /etc/nagios/nagios.cfg
> > (no debugging symbols found)
>
> <<<< SNIP >>>>>
>
> > Program received signal SIGSEGV, Segmentation fault.
> > [Switching to Thread 47105982427648 (LWP 15658)]
> > 0x000000000043b90a in process_check_result_queue ()
> > (gdb) bt
> > #0  0x000000000043b90a in process_check_result_queue ()
> > #1  0x000000000041c536 in reap_check_results ()
> > #2  0x0000000000429119 in handle_timed_event ()
> > #3  0x0000000000429701 in event_execution_loop ()
> > #4  0x00000000004120fe in main ()
>
> hm... check_result_queue...
>
> Do you have any old checkresult files in the spool directory which could
> cause the problem?
No, it is the same segfault if I delete all files from /var/spool/nagios/

>
> Kind Regards
> Hendrik


With kind regards;
-- 
Christian Schneemann

-------------------------------------
SUSE LINUX Products GmbH,
Maxfeldstr. 5, D - 90409 Nürnberg

Phone:  +49 (0)911 - 740 53 0
e-mail: cschneemann at suse.de
-------------------------------------
SUSE LINUX Products GmbH, GF: Markus Rex
HRB 16746 (AG Nürnberg)

-------------------------------------------------------------------------
This SF.net email is sponsored by: Microsoft
Defy all challenges. Microsoft(R) Visual Studio 2008.
http://clk.atdmt.com/MRT/go/vse0120000070mrt/direct/01/




More information about the Developers mailing list