Yet another NRPE issue...

Tom Welsh twelsh at square-box.com
Tue Dec 3 08:52:03 CET 2002


I had this happen to me once on a similar system. It all worked well and
when I returned from holiday it was not working. 

What I found out was it was a combination of the networking guys playing
with the firewall guys and shutting ports down

If you know your box configurations have not changed, and you know
nobody has removed the nrpe configs on your remote boxes then id be
looking in the network infrastructure change logs to see whats changed
whilst you were away

But then again I may be barking up the completely wrong tree

Cheers

Tom Welsh
squareBOX technologies
twelsh at square-box.com

-----Original Message-----
From: nagios-users-admin at lists.sourceforge.net
[mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of
andrew.slater at netstrategy.net
Sent: 03 December 2002 06:07
To: Nagios-users at lists.sourceforge.net
Subject: [Nagios-users] Yet another NRPE issue...

Hi all,

The story goes like this;

I had my Nagios setup and working beautifully with NRPE working fine on
20
odd hosts. I went away for a couple of weeks and as the system is
somewhat
developmental at the moment, was happy enough to have it shutdown for
the
time I was away. The problem is that now I am back, the system is
plugged
back in I cant get any response from any server with NRPE on it.

The details;

- check_nrpe run on the target machine works fine
- Nagios called check_nrpe command fails with "CHECK_NRPE: Received 0
bytes. Are we allowed to connect to the host?"
- telnet to port 5666 on target machine from Nagios machine fails


- telnet to localhost:5666 on target machine fails;
[root at cache01 xinetd.d]# telnet
telnet> open xxx.xxx.xxx.xxx 5666
Trying xxx.xxx.xxx.xxx...
Connected to xxx.xxx.xxx.xxx.
Escape character is '^]'.
Connection closed by foreign host. (Which happens instantaneously)


- xinetd _is_ listening on port 5666 on the target machines;
[root at cache01 xinetd.d]# netstat -pan | grep
tcp        0      0 0.0.0.0:5666            0.0.0.0:*
LISTEN
5320/xinetd


- strace printout of a commandline "./check_nrpe xxx.xxx.xxx.xxx -p 5666
-c
check_users" to a machine on the same subnet
...
...
send(3, "\0\0\0\1\0\0\0\1\0\0\0\0\0\0\0\vcheck_users\0\0\0\0\0"...,
1040,
0) = 1040
time([1038894731])                      = 1038894731
recv(3, 0xbffff2e0, 1040, 0)            = -1 ECONNRESET (Connection
reset
by peer)
fstat64(1, {st_mode=S_IFCHR|0620, st_rdev=makedev(136, 1), ...}) = 0
mmap2(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1,
0)
= 0x40014000
write(1, "CHECK_NRPE: Error receiving data"..., 44CHECK_NRPE: Error
receiving data from host.
) = 44
close(3)                                = 0
alarm(0)                                = 10
munmap(0x40014000, 4096)                = 0
_exit(3)                                = ?



- strace printout of a commandline "./check_nrpe xxx.xxx.xxx.xxx -p 5666
-c
check_users" to a machine on a different subnet
...
...
send(3, "\0\0\0\1\0\0\0\1\0\0\0\0\0\0\0\vcheck_users\0\0\0\0\0"...,
1040,
0) = 1040
time([1038894876])                      = 1038894876
recv(3, "", 1040, 0)                    = 0
fstat64(1, {st_mode=S_IFCHR|0620, st_rdev=makedev(136, 1), ...}) = 0
mmap2(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1,
0)
= 0x40014000
write(1, "CHECK_NRPE: Received 0 bytes.  A"..., 70CHECK_NRPE: Received 0
bytes.
 Are we allowed to connect to the host?
) = 70
close(3)                                = 0
alarm(0)                                = 10
munmap(0x40014000, 4096)                = 0
_exit(3)                                = ?


I have to admit that i'm still pretty much in holiday mood so there
could
be something ridiculously simple I am missing (here's hoping) but I am
getting to the end of my useful troubleshooting ideas, so any light from
any listmembers would be greatly appreciated.

Thanks to all,

Andrew Slater




-------------------------------------------------------
This SF.net email is sponsored by: Get the new Palm Tungsten T 
handheld. Power & Color in a compact size! 
http://ads.sourceforge.net/cgi-bin/redirect.pl?palm0002en
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users




-------------------------------------------------------
This SF.net email is sponsored by: Get the new Palm Tungsten T 
handheld. Power & Color in a compact size! 
http://ads.sourceforge.net/cgi-bin/redirect.pl?palm0002en




More information about the Users mailing list