Failing HUP signal on nagios 2.0

Javier Palacios javiplx at gmail.com
Tue Feb 14 23:18:27 CET 2006


Hello,

I'm experiencing a very strange problem while reloading nagios 2.0 config.
The best way I can explain is that it stops to work, actually killing
the process instead of reloading the configuration. It happens using
-HUP and also with the external commands file.
The really strange point is that during some periods it worked.

Initially it worked fine, then stopping to work. After many trials,
wondering If it did really worked anytime, I completelly remove nagios
and perform a new install. Many of them actually, until at one point
it begins to work again. Then, it has been working properly for many
hours.
I use some plugins that constructs/modify nagios cfg files and reload
the server, and this time I'm pretty sure that it had worked, so I did
stop, cleared the generated stuff and restarting the whole process,
but now configuration reload fails again.

Any idea as well as any wayt to trace the problem is quite wellcome. I
run Scientific Linux 3 (a RHEL3 recompilation) with the rpm from DAG
repository.

I have not performed extensive strace works, but the final lines are
open("/usr/lib/perl5/5.8.0/i386-linux-thread-multi/CORE/libgcc_s.so.1",
O_RDONLY) = -1 ENOENT (No such file or directory)
open("/opt/glite/externals/lib/libgcc_s.so.1", O_RDONLY) = -1 ENOENT
(No such file or directory)
open("/opt/glite/lib/libgcc_s.so.1", O_RDONLY) = -1 ENOENT (No such
file or directory)
open("/opt/globus/lib/libgcc_s.so.1", O_RDONLY) = -1 ENOENT (No such
file or directory)
open("/lib/ssa/libgcc_s.so.1", O_RDONLY) = 5
read(5, "\177ELF\1\1\1\0\0\0\0\0\0\0\0\0\3\0\3\0\1\0\0\0\214\25"..., 512) = 512
fstat64(5, {st_mode=S_IFREG|0644, st_size=150982, ...}) = 0
old_mmap(NULL, 31932, PROT_READ|PROT_EXEC, MAP_PRIVATE, 5, 0) = 0xc2d000
old_mmap(0xc34000, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_FIXED,
5, 0x6000) = 0xc34000
close(5)                                = 0
mprotect(0xbfffd000, 4096, PROT_READ|PROT_WRITE|PROT_EXEC|PROT_GROWSDOWN) = 0
mprotect(0xb69c5000, 10485760, PROT_READ|PROT_WRITE|PROT_EXEC) = 0
write(2, "libgcc_s.so.1 must be installed "..., 59) = 59
exit_group(127)                         = ?
Process 24190 detached

but libgcc_s.so.1 is installed and is found in LD_LIBRARY_PATH. Adding
the directory to /etc/ld.so.conf makes no difference.
And, in any case, nothing appart from nagios has been modified in the meantime.


-------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems?  Stop!  Download the new AJAX search engine that makes
searching your log files as easy as surfing the  web.  DOWNLOAD SPLUNK!
http://sel.as-us.falkag.net/sel?cmd=lnk&kid3432&bid#0486&dat1642




More information about the Developers mailing list