check_ping stuck on 'PENDING'

Robert Yeo r.yeo at utah.edu
Mon Jan 12 16:33:21 CET 2009


The ! excludes servers that the hostgroup_name line above has already pulled
in. That way I do not need to spell every host out individually, or create a
separate group whenever one server doesn't fit with the rest.  Basically, I do
not want to monitor Check CPU Load on every server in the Nmw_Servers group,
so I exclude the exceptions on the next line.  I thought this was OK to do?
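
To illustrate the exclusion idiom described above, here is a minimal sketch.
It reuses names from this thread and assumes (not confirmed by the posted
config) that the two excluded hosts are members of Nmw_Servers.

```cfg
# Sketch only: apply the check to a hostgroup, minus two of its members.
define service{
    hostgroup_name         Nmw_Servers                                ; every member of the group...
    host_name              !Nmw_104_020_Nagios,!Nmw_104_092_Nagios3   ; ...except these two (assumed members)
    service_description    Check CPU Load
    use                    medium-high-priority-service
    check_command          check_nrpe!check_load
    }
```

The Nagios 3 documentation on time-saving object tricks describes this !
prefix for both host_name and hostgroup_name.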

I will remove the host_name line, in case the ! is making things weird, and my
service dependencies as well, and let you know what happens.
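
The stripped-down test could look roughly like this. It is a sketch that
follows the suggestion quoted below (drop host_name, add Dgh_Servers to
hostgroup_name) and leaves out the servicedependency block entirely. Two
assumptions: every host in Dgh_Servers should get the check, and
Art_104_014_FilmServer is left out because its hostgroup is not shown in
this thread.

```cfg
# Sketch only: hostgroups alone, no host_name line, no exclusions.
define service{
    hostgroup_name         Nmw_Servers,Wi_Servers,Dgh_Servers   ; Dgh_Servers added per the suggestion
    service_description    Check CPU Load
    use                    medium-high-priority-service
    check_command          check_nrpe!check_load
    }
```

Running the Nagios pre-flight check (nagios -v followed by the path to the
main config file) before restarting should show whether the simplified
definition parses cleanly.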

Thanks for the help!

Rob Yeo



On 1/12/09 8:26 AM, "jmoseley at corp.xanadoo.com" <jmoseley at corp.xanadoo.com>
wrote:

> First, what's with the '!' preceding many of your host names?  Second, in
> your service and service dependency configs, why are you defining both
> hosts and hostgroups?  For example:
> 
> define service{
>     hostgroup_name            Nmw_Servers,Wi_Servers
>     host_name                 !Nmw_104_010_Finearts_Dataserver,!Nmw_104_020_Nagios,Dgh_000_012_FairUse,Art_104_014_FilmServer,!Nmw_104_092_Nagios3
> 
>     service_description        Check CPU Load
>     use                        medium-high-priority-service
>     check_command              check_nrpe!check_load
>     }
> 
> I'd start with removing the host_name portion of the config and including
> 'Dgh_Servers' in the hostgroup_name stanza.  Otherwise, you've got a lot of
> overkill...
> 
> Lastly, get rid of the service dependencies until you have this sorted out
> and working.
> 
> 
> 
> James Moseley
> 
> On 01/12/2009 08:57 AM, Robert Yeo <r.yeo at utah.edu> wrote to
> <Nagios-users at lists.sourceforge.net>:
> Subject: Re: [Nagios-users] check_ping stuck on 'PENDING'
> 
> The operation I am employing is Check CPU Load but the symptoms were the
> same as Michiel's check ping.
> 
> Hopefully all the information is there now.
> 
> Thanks!
> 
> Rob Yeo
> 
> # Finearts Nagios 2 Server
> define host{
> 
>     host_name        Dgh_000_012_FairUse
>     alias            Finearts Nagios 2 Server
>     address          155.xxx.xxx.xxx
> 
>     use              server-high-priority
>     hostgroups       Dgh_Servers
>     parents          Dgh_Switch_02
>     icon_image       mac40.png
>     statusmap_image  mac40.gd2
>     }
> 
> # '003 DGH Servers' server group definition
> define hostgroup{
>     hostgroup_name       Dgh_Servers
>     alias                DGH_SERVERS
>     }
> 
> # 'check_nrpe' command definition
> define command {
>     command_name     check_nrpe
>     command_line    $USER1$/check_nrpe -H $HOSTADDRESS$ -c $ARG1$ -u
> }'
> 
> define service{
>     hostgroup_name            Nmw_Servers,Wi_Servers
>     host_name                 !Nmw_104_010_Finearts_Dataserver,!Nmw_104_020_Nagios,Dgh_000_012_FairUse,Art_104_014_FilmServer,!Nmw_104_092_Nagios3
> 
>     service_description        Check CPU Load
>     use                        medium-high-priority-service
>     check_command              check_nrpe!check_load
>     }
> 
> define servicedependency{
>     hostgroup_name                      Nmw_Servers,Wi_Servers
>     host_name                           !Nmw_104_010_Finearts_Dataserver,!Nmw_104_020_Nagios,!Nmw_104_011_Finearts_Fileserver,Dgh_000_012_FairUse,Art_104_014_FilmServer,!Nmw_104_092_Nagios3
> 
>     service_description                 Check NRPE
>     dependent_hostgroup_name            Nmw_Servers,Wi_Servers
>     dependent_host_name                 !Nmw_104_010_Finearts_Dataserver,!Nmw_104_020_Nagios,!Nmw_104_011_Finearts_Fileserver,Dgh_000_012_FairUse,Art_104_014_FilmServer,!Nmw_104_092_Nagios3
>     dependent_service_description       Check CPU Load,Check Free Space,Current Users,Total Processes,Zombie Processes
> 
>     execution_failure_criteria          w,u,c,p
>     notification_failure_criteria       w,u,c,p
>     }
> 
> define host{
>         name                            generic-host
>         notifications_enabled           1
>         event_handler_enabled           1
>         flap_detection_enabled          1
>         failure_prediction_enabled      1
>         process_perf_data               1
>         retain_status_information       1
>         retain_nonstatus_information    1
>         register                        0
>         }
> 
> define host{
>     name                                server-high-priority
>     use                                 generic-host
>     check_period                        24x7
>     check_interval                      2
>     max_check_attempts                  2
>     check_command                       check-host-alive
>     notification_period                 24x7
>     notification_interval               120
>     notification_options                d,r,f
>     contact_groups                      nagiosadmin
>     register                            0
>     }
> 
> 
> define service{
>         name                            medium-high-priority-service
>         use                             generic-service
>         check_period                    24x7
>         max_check_attempts              5
>         normal_check_interval           5
>         retry_check_interval            2
>         contact_groups                  nagiosadmin
>         notification_options            c,r,f
>         notification_interval           180
>         notification_period             extendedworkhours
>         register                        0
>         }
> 
> define service{
>         name                            generic-service
>         active_checks_enabled           1
>         passive_checks_enabled          1
>         parallelize_check               1
>         obsess_over_service             0
>         check_freshness                 0
>         notifications_enabled           1
>         event_handler_enabled           1
>         flap_detection_enabled          1
>         failure_prediction_enabled      1
>         process_perf_data               1
>         retain_status_information       1
>         retain_nonstatus_information    1
>         is_volatile                     0
>         register                        0
>         }
> 
> # 'extendedworkhours' timeperiod definition
> define timeperiod{
>     timeperiod_name    extendedworkhours
>     alias       Extended Work Hours
>     sunday      10:00-22:00
>     monday      07:00-22:00
>     tuesday     07:00-22:00
>     wednesday   07:00-22:00
>     thursday    07:00-22:00
>     friday      07:00-22:00
>     saturday    10:00-22:00
>     }
> 
> # '24x7' timeperiod definition
> define timeperiod{
>     timeperiod_name    24x7
>     alias       24 Hours A Day, 7 Days A Week
>     sunday      00:00-24:00
>     monday      00:00-24:00
>     tuesday     00:00-24:00
>     wednesday   00:00-24:00
>     thursday    00:00-24:00
>     friday      00:00-24:00
>     saturday    00:00-24:00
>     }
> 
> Nagios 3.0.6 on 10.4.11 OSX.
> 
> The service check is stuck in pending for the DGH server, and is returning
> fine for 2 of the NMW servers, but not all of them.
> 
> Thanks,
> 
> Rob Yeo
> 
> 
> On 1/9/09 4:34 PM, "jmoseley at corp.xanadoo.com" <jmoseley at corp.xanadoo.com>
> wrote:
> 
>> I don't see a service definition for 'server-high-priority', nor do I see
>> where you are employing a check_ping operation, nor have you included the
>> check_period config for 24x7.
>> 
>> 
>> James Moseley
>> 
>> On 01/09/2009 05:13 PM, Robert Yeo <r.yeo at utah.edu> wrote to
>> nagios-users Mailinglist <Nagios-users at lists.sourceforge.net>:
>> Subject: Re: [Nagios-users] check_ping stuck on 'PENDING'
>> 
>> This is an example config for one of the services.  Please let me know if I
>> am missing any info.
>> 
>> # Finearts Nagios 2 Server
>> define host{
>> 
>>     host_name        Dgh_000_012_FairUse
>>     alias            Finearts Nagios 2 Server (fairuse.music.utah.edu)
>>     address          155.97.55.12
>> 
>>     use              server-high-priority
>>     hostgroups       Dgh_Servers
>>     parents          Dgh_Switch_02
>>     icon_image       mac40.png
>>     statusmap_image  mac40.gd2
>>     }
>> 
>> # '003 DGH Servers' server group definition
>> define hostgroup{
>>     hostgroup_name       Dgh_Servers
>>     alias                DGH_SERVERS
>>     }
>> 
>> # 'check_nrpe' command definition
>> define command {
>>     command_name     check_nrpe
>>     command_line    $USER1$/check_nrpe -H $HOSTADDRESS$ -c $ARG1$ -u
>> }'
>> 
>> define service{
>>     hostgroup_name            Nmw_Servers,Wi_Servers
>>     host_name                 !Nmw_104_010_Finearts_Dataserver,!Nmw_104_020_Nagios,Dgh_000_012_FairUse,Art_104_014_FilmServer,!Nmw_104_092_Nagios3
>> 
>>     service_description        Check CPU Load
>>     use                        medium-high-priority-service
>>     check_command              check_nrpe!check_load
>>     }
>> 
>> define service{
>>         name                            medium-high-priority-service
>>         use                             generic-service
>>         check_period                    24x7
>>         max_check_attempts              5
>>         normal_check_interval           5
>>         retry_check_interval            2
>>         contact_groups                  nagiosadmin
>>         notification_options            c,r,f
>>         notification_interval           180
>>         notification_period             extendedworkhours
>>         register                        0
>>         }
>> 
>> define service{
>>         name                            generic-service
>>         active_checks_enabled           1
>>         passive_checks_enabled          1
>>         parallelize_check               1
>>         obsess_over_service             0
>>         check_freshness                 0
>>         notifications_enabled           1
>>         event_handler_enabled           1
>>         flap_detection_enabled          1
>>         failure_prediction_enabled      1
>>         process_perf_data               1
>>         retain_status_information       1
>>         retain_nonstatus_information    1
>>         is_volatile                     0
>>         register                        0
>>         }
>> 
>> # 'extendedworkhours' timeperiod definition
>> define timeperiod{
>>     timeperiod_name    extendedworkhours
>>     alias       Extended Work Hours
>>     sunday      10:00-22:00
>>     monday      07:00-22:00
>>     tuesday     07:00-22:00
>>     wednesday   07:00-22:00
>>     thursday    07:00-22:00
>>     friday      07:00-22:00
>>     saturday    10:00-22:00
>>     }
>> 
>> Nagios 3.0.6 on 10.4.11 OSX.
>> 
>> The service check is stuck in pending for the DGH server, and is returning
>> fine for 2 of the NMW servers, but not all of them.
>> 
>> Thanks,
>> 
>> Rob Yeo
>> 
>> On 1/9/09 4:04 PM, "Marc Powell" <marc at ena.com> wrote:
>> 
>>> 
>>> On Jan 9, 2009, at 4:48 PM, <ibloodyhatespam at gmail.com> wrote:
>>> 
>>>> As per the subject... Nagios 3.0.6 and it happens both to lan and
>>>> external servers.
>>> 
>>>> As ping's a very basic check and as far as I can see everything
>>>> should work normally, right now I'm stumped. Any idea what's going
>>>> on here?
>>> 
>>> Actual config information is necessary; otherwise we're just guessing
>>> in the dark. Nagios doesn't think it's supposed to perform a check. Is
>>> the timeperiod right?
>>> 
>>> Host definition, service definition (and templates for both, if used),
>>> and timeperiod definition would be a good start.
>>> 
>>> --
>>> Marc
>>> 
>>> 
>>> ------------------------------------------------------------------------------
>>> Check out the new SourceForge.net Marketplace.
>>> It is the best place to buy or sell services for
>>> just about anything Open Source.
>>> http://p.sf.net/sfu/Xq1LFB
>>> _______________________________________________
>>> Nagios-users mailing list
>>> Nagios-users at lists.sourceforge.net
>>> https://lists.sourceforge.net/lists/listinfo/nagios-users
>>> ::: Please include Nagios version, plugin version (-v) and OS when
>>> reporting any issue.
>>> ::: Messages without supporting info will risk being sent to /dev/null

