Distributed monitoring Freshness checking failing then recovering

Sean McAvoy smcavoy at ca.afilias.info
Fri Oct 12 18:40:27 CEST 2007


Hello,
I have 1 central nagios system with 5 distributed servers. I have  
enabled freshness checking on both central and remote systems. I am  
constantly seeing services go to unknown status for 1-3 minutes and  
then recover.
on the remotes I have:
check_service_freshness=1
service_freshness_check_interval=10
check_host_freshness=1
host_freshness_check_interval=60
service_inter_check_delay_method=s
max_service_check_spread=10
service_interleave_factor=1
host_inter_check_delay_method=s
max_host_check_spread=30
max_concurrent_checks=0

It does appear as though checks are being run in parallel. I'm wonder  
how I can best determine where the problem is, with the execution of  
checks, submittal to the central system or other.
Thanks.


_sean

-------------------------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems?  Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now >> http://get.splunk.com/
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null





More information about the Users mailing list