Missed checks - stale data?

Matthew Wilson matthewwilson at dsl.pipex.com
Thu Mar 4 21:03:32 CET 2004


Dear List,
I am having some problems with check scheduling.  Specifically, nagios
seems to be missing quite a few checks.  I am monitoring about 60 hosts,
each with a ping service (5 min frequency) and several with another
custom service (which just requests a webpage from our satelite
terminals' management interface - https - every 15mins).  We use apan
for all our services and there are clearly times when nagios is missing
it's checks. This is mostly occuring with the ping services.

I have examined the nagios log and I am seeing a lot of: 
Warning: The results of service 'PING' on host 'Kilchoan-627' are stale
by 46 seconds

I assume this means that there has been a delay of 46 seconds from when
the check results were received to when they are processed by a service
reaper event.  I guess this is causing the missed checks we are seeing -
if the check is not reported within 5 mins then rrdtool will log a NaN. 

In nagios.cfg I have:
max_concurrent_checks=0
service_reaper_frequency=3
inter_check_delay_method=s

The server is a PIII 500 and I am seeing loads of about 2-4.  It has
384Mb of RAM and is swapping a bit.   Any suggestions for sorting this
out would be appreciate.  I realise if we increase our monitoring by
much I will need a higher spec server... but the load doesn't seem to
indicate I'm purely running out of CPU time.

cheers
Matthew Wilson
DC-Sat.net



-------------------------------------------------------
This SF.Net email is sponsored by: IBM Linux Tutorials
Free Linux tutorial presented by Daniel Robbins, President and CEO of
GenToo technologies. Learn everything from fundamentals to system
administration.http://ads.osdn.com/?ad_id=1470&alloc_id=3638&op=click
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null





More information about the Users mailing list