NRPE mystery

farhan ahmed farhhanahmed at hotmail.com
Thu Nov 20 04:26:48 CET 2008



Hello,

I am having really NRPE weird issue, never faced this type of problem before, configured NRPE on hundreds Hosts without any problem but this one is killing me

Environment:

OS: Solaris 10 Zone

NRPE Version: 2.12

Nagios-plugins Version: 1.4.13

OpenSSL: 0.9.8i

Nagios Server: Ubuntu 8.04(hardy)

Nagios Version: 3.0.5

Problem:

I compiled , installed and started NRPE successfully on Solaris 10 zone. But somehow Nagios doesn't show the status critical or warning for this host even services are down. Nagios shows status 'OK' all the time though services are critical

For example 

On Solaris 10 zone, nrpe.cfg file has

command[check_users]=/usr/local/sys/nrpe/libexec/check_users -w 2 -c 3
command[check_remote]=/usr/local/sys/nrpe/libexec/check_tcp -p 1250

root at asqstage#  /usr/local/sys/nrpe/libexec/check_users -w 2 -c 3
USERS CRITICAL - 11 users currently logged in |users=11;2;3;0

root at asqstage#  /usr/local/sys/nrpe/libexec/check_tcp -p 1250
Connection refused

On Nagios Server:
root at cuba:~# /usr/local/nagios/libexec/check_nrpe -H asqstage -c check_users
USERS CRITICAL - 10 users currently logged in |users=10;2;3;0

root at cuba:~# /usr/local/nagios/libexec/check_nrpe -H asqstage -c check_remote
Connection refused

So, it means NRPE is passing write information to the check_nrpe on nagios, but somehow Nagios is showing them as OK status


check users     OK      11-20-2008 14:20:41     0d 0h 20m 28s   1/3     USERS CRITICAL - 10 users currently logged in

check Remote Trace      OK      11-20-2008 14:16:57     0d 0h 19m 12s   1/3     Connection refused


Here is my nagios configuration

define host{
        use                     generic-host            ; Name of host template to use
        host_name               asqstage
        alias                   asqstage
        parents                 sheila
        address                 163.189.222.51
        check_command           check-host-alive
        max_check_attempts      10
        contact_groups          CMESMonitoringGroup
        notification_interval   1440
        notification_period     24x7
        notification_options    d,u,r
        }



define service{
        use                             generic-service         ; Name of service template to use
        host_name                       asqstage
        service_description             check users
        check_command                   check_nrpe!check_users
        }

define service{
        use                             generic-service         ; Name of service template to use
        host_name                       asqstage
        service_description             check Remote Trace
        check_command                   check_nrpe!check_remote
        }

Please anyone who can help me in this. I am really stuck into it and can't figure out why it is happening. 


Thanks,


-- 
Regards,
Farhan Ahmed
 


_________________________________________________________________
Your dream beach house escape for summer! Sign up for the Hotmail Road Trip today.
http://www.ninemsn.com.au/hotmailroadtrip
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://www.monitoring-lists.org/archive/users/attachments/20081120/4acd8321/attachment.html>
-------------- next part --------------
-------------------------------------------------------------------------
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK & win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100&url=/
-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null


More information about the Users mailing list