Service Check Timed Out

Alexandru Ionica gremlin at networked.ro
Mon Jan 9 15:51:46 CET 2006


Hello,
this is the error i see in the cgi web interface.
in the error log i see stuff like:


SERVICE NOTIFICATION:
noc;host-aaa;FPING;CRITICAL;notify-by-email;(Service Check Timed Out)

SERVICE ALERT: host-bbb;FPING;CRITICAL;HARD;3;(Service Check Timed Out)

SERVICE ALERT: host-ccc;FPING;CRITICAL;SOFT;1;(Service Check Timed Out)


The FPING service is a perl script i wrote wich does a fping, and a
snmpwalk. In the worst case scenario this should take are 30 seconds to run.

I've set up service_check_timeout=90 in the main (nagios.cfg) config file.
The machine is under medium to heavy load, but i can't understand why
it's getting a timeout, as 90 seconds seem more than enough for any of
the scripts to run.
Is there any possibility that this is generated by the check-host-alive
plugin and it escalates ? or something like that ... ?

The nagios configuration is pretty simple, some of the more important
things are:
====================================
inter_check_delay_method=s
service_interleave_factor=s
max_concurrent_checks=0
service_reaper_frequency=10
service_check_timeout=90
host_check_timeout=90
event_handler_timeout=30
notification_timeout=30
ocsp_timeout=15
perfdata_timeout=15
check_for_orphaned_services=0
check_service_freshness=1
freshness_check_interval=60
==================================

The service template is:

==================================
define service{
        name                            generic-ping2
        active_checks_enabled           1
        passive_checks_enabled          1
        parallelize_check               1
        obsess_over_service             1
        check_freshness                 0
        notifications_enabled           1
        event_handler_enabled           1
        flap_detection_enabled          1
        process_perf_data               1
        retain_status_information       1
        retain_nonstatus_information    1
        service_description             FPING
        is_volatile                     0
        check_period                    24x7
        max_check_attempts              3
        normal_check_interval           5
        retry_check_interval            1
        contact_groups                  noc-acm
        notification_interval           1800
        notification_period             24x7
        notification_options            c,r
        register                        0
        }
====================================

any ideeas ?


-------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems?  Stop!  Download the new AJAX search engine that makes
searching your log files as easy as surfing the  web.  DOWNLOAD SPLUNK!
http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null





More information about the Users mailing list