Execution Queue and latency

Bishop, Dean dean.bishop at tcdsb.org
Tue Oct 29 17:03:52 CET 2002


yeah, i think that you are correct in stating that it provides a
recommendation for max_concurrent_checks.  i originally saw the time that it
estimated for a single run and thought that this was a guess/calculation of
how long my checks would take.

it doesn't.
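
for what it's worth, the window in the sample output quoted further down this thread can be checked by hand. a minimal Python sketch, using only the two epoch timestamps and the 300 second average check interval printed in that output (the function name is made up for illustration):

```python
# Values copied from the "Initial service check scheduling info" block
# of the nagios -s output quoted later in this thread.
FIRST_SCHEDULED_CHECK = 1035905875
LAST_SCHEDULED_CHECK = 1035906175
AVERAGE_CHECK_INTERVAL = 300.0  # seconds, from the same output


def scheduling_window(first: int, last: int) -> int:
    """Seconds between the first and last initially scheduled checks."""
    return last - first


window = scheduling_window(FIRST_SCHEDULED_CHECK, LAST_SCHEDULED_CHECK)
print(f"window: {window} sec ({window / 60:.0f} min)")
print("equals average check interval:", window == AVERAGE_CHECK_INTERVAL)
```

in that output, at least, the window is just the average check interval, so it looks like the span over which the first round of checks is spread out rather than a measurement of how long the checks take to run.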



-----Original Message-----
From: Marc Powell [mailto:mpowell at ena.com]
Sent: Tuesday, October 29, 2002 10:56 AM
To: Bishop, Dean; nagios-users at lists.sourceforge.net
Subject: RE: RE : [Nagios-users] Execution Queue and latency


That's interesting, my understanding of the -s switch is that it only
provides a recommendation for max_concurrent_checks. Everything else is
taken directly from the config files (avg check interval, inter-check delay
method, inter-check delay (calculated from total hosts, services and avg
check interval), etc).
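
The figures in the -s output quoted below appear to follow from simple arithmetic on the config values. This is a Python sketch, not code taken from Nagios: the three formulas (delay = average check interval / total services, minimum = reaper interval / delay rounded up, recommended = 3 x minimum) are assumptions that happen to reproduce the quoted numbers, and the function names are invented for illustration.

```python
import math
from typing import Tuple


def inter_check_delay(avg_check_interval: float, total_services: int) -> float:
    """Assumed SMART delay: spread all services evenly over one interval."""
    return avg_check_interval / total_services


def concurrency_guidelines(reaper_interval: float, delay: float) -> Tuple[int, int]:
    """Assumed guideline: checks started per reaper interval, then tripled."""
    minimum = math.ceil(reaper_interval / delay)
    return minimum, minimum * 3


# Inputs taken from the quoted output: 300 sec average check interval,
# 2030 services, 10 sec check reaper interval.
delay = inter_check_delay(300.0, 2030)
minimum, recommended = concurrency_guidelines(10.0, delay)

print(f"Inter-check delay:      {delay:.3f} sec")
print(f"Absolute minimum value: {minimum}")
print(f"Recommended value:      {recommended}")
```

With those inputs the sketch gives 0.148 sec, 68 and 204, matching the quoted output. None of the inputs is a measured execution time, which fits the note in the output that the values do not reflect current performance.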

--
Marc

> -----Original Message-----
> From: Bishop, Dean [mailto:dean.bishop at tcdsb.org]
> Sent: Tuesday, October 29, 2002 9:48 AM
> To: Marc Powell; louafi; nagios-users at lists.sourceforge.net
> Subject: RE: RE : [Nagios-users] Execution Queue and latency
> 
> Good morning,
> 
> Marc:
> 
> Hmmm, yes, you are quite right.  i am not sure how to account for that and i
> also have experience with nagios -s that tells me that it cannot be used for
> reliable capacity planning.  Mine tells me 17 minutes when 5 can do the job
> quite nicely.
> 
> lesser of two evils??
> 
> later,
> dean
> 
> 
> -----Original Message-----
> From: Marc Powell [mailto:mpowell at ena.com]
> Sent: Tuesday, October 29, 2002 10:42 AM
> To: Bishop, Dean; louafi; nagios-users at lists.sourceforge.net
> Subject: RE: RE : [Nagios-users] Execution Queue and latency
> 
> 
> Your estimation presumes that service checks are not being executed in
> parallel, which is not the case for a default installation.
> 
> Sebastien:
> 
> 	You may want to look at your max_concurrent_checks value to make
> sure it is sized properly for your installation. You can run 'nagios -s
> /path/to/nagios.cfg' to get a recommendation ala
> 
> [root@noctools nagios]# bin/nagios -s etc/nagios.cfg
> 
> Nagios 1.0b3
> Copyright (c) 1999-2002 Ethan Galstad (nagios at nagios.org)
> Last Modified: 06-09-2002
> License: GPL
> 
> 
>         SERVICE SCHEDULING INFORMATION
>         -------------------------------
>         Total services:             2030
>         Total hosts:                1677
> 
>         Check reaper interval:      10 sec
> 
>         Inter-check delay method:   SMART
>         Average check interval:     300.000 sec
>         Inter-check delay:          0.148 sec
> 
>         Interleave factor method:   SMART
>         Average services per host:  1.210
>         Service interleave factor:  2
> 
>         Initial service check scheduling info:
>         --------------------------------------
>         First scheduled check:      1035905875 -> Tue Oct 29 09:37:55 2002
>         Last scheduled check:       1035906175 -> Tue Oct 29 09:42:55 2002
> 
>         Rough guidelines for max_concurrent_checks value:
>         -------------------------------------------------
>         Absolute minimum value:     68
>         Recommend value:            204
> 
>         Notes:
>         The recommendations for the max_concurrent_checks value
>         assume that the average execution time for service
>         checks is less than the service check reaper interval.
>         The minimum value also reflects best case scenarios
>         where there are no problems on your network.  You will
>         have to tweak this value as necessary after testing.
>         High latency values for checks are often indicative of
>         the max_concurrent_checks value being set too low and/or
>         the service_reaper_frequency being set too high.
>         It is important to note that the values displayed above
>         do not reflect current performance information for any
>         Nagios process that may currently be running.  They are
>         provided solely to project expected and recommended
>         values based on the current data in the config files.
> 
> --
> Marc
> 
> > -----Original Message-----
> > From: Bishop, Dean [mailto:dean.bishop at tcdsb.org]
> > Sent: Tuesday, October 29, 2002 9:28 AM
> > To: 'louafi'; nagios-users at lists.sourceforge.net
> > Cc: Bishop, Dean
> > Subject: RE: RE : [Nagios-users] Execution Queue and latency
> >
> > well, assuming that you haven't modified "check_ping" it uses 4 pings to
> > test the OK'ness.
> >
> > On my system it takes just over 4 seconds to check_ping a host on the LAN.
> >
> > 4 seconds * 70 hosts = 280 seconds or just under 5 minutes.
> >
> > so it will take about 5 minutes just to do the check_pings if things are OK.
> > it just gets worse if things are not OK because then each of the four pings
> > would have to time out (defaults to 10 seconds).
> >
> > 2 suggestions
> >
> > 1) use:
> >
> >         command_line   /opt/nagios/libexec/check_ping -H $HOSTADDRESS$ -w $ARG1$ -c $ARG2$ -p 1 -t 5
> >
> > 2) use:
> >
> >         command_line   /opt/nagios/libexec/check_tcp -H $HOSTADDRESS$ -p 23 -w $ARG1$ -c $ARG2$
> >
> > substitute the port 23 for something useful.  i use 23 for my switches and
> > 135 for my windows boxes.
> >
> > btw
> >
> > check_tcp -H host -p 135 takes about .01s
> > check_ping -H host -p 1 takes about .01s
> >
> > take your pick.
> >
> >
> >
> > later,
> > dean
> >
> >
> >
> > -----Original Message-----
> > From: louafi [mailto:sebastien.louafi at e-qual.fr]
> > Sent: Tuesday, October 29, 2002 10:19 AM
> > To: nagios-users at lists.sourceforge.net
> > Cc: 'Bishop, Dean'
> > Subject: RE : [Nagios-users] Execution Queue and latency
> >
> >
> > Ok my check_ping definition is :
> >
> > define command{
> >         command_name    check_ping
> >         command_line   /opt/nagios/libexec/check_ping -H $HOSTADDRESS$ -w $ARG1$ -c $ARG2$
> >         }
> >
> > I have around 70 hosts and 800 services...
> >
> > I'm using perfdata to dump check results into a MySQL database.
> >
> >
> > Thanks
> >
> >
> > -----Original Message-----
> > From: Bishop, Dean [mailto:dean.bishop at tcdsb.org]
> > Sent: Tuesday, October 29, 2002 15:59
> > To: 'louafi'; nagios-users at lists.sourceforge.net
> > Cc: Bishop, Dean
> > Subject: RE: RE : [Nagios-users] Execution Queue and latency
> >
> >
> > please send the snippet from checkcommands.cfg where "check_ping" is
> > defined.
> >
> > also, how many hosts/services are you checking?
> >
> > -----Original Message-----
> > From: louafi [mailto:sebastien.louafi at e-qual.fr]
> > Sent: Tuesday, October 29, 2002 9:37 AM
> > To: nagios-users at lists.sourceforge.net
> > Cc: 'Bishop, Dean'
> > Subject: RE : [Nagios-users] Execution Queue and latency
> >
> >
> > OK, here are my config files.
> >
> > My services.cfg contains a lot of services like :
> >
> > define service{
> >         host_name 10.10.10.10
> >         service_description PING
> >         check_command check_ping!3000,20%!5000,60%
> >         max_check_attempts 5
> >         normal_check_interval 5
> >         retry_check_interval 3
> >         active_checks_enabled 1
> >         passive_checks_enabled 0
> >         check_period 24x7
> >         parallelize_check 1
> >         check_freshness 0
> >         flap_detection_enabled 0
> >         process_perf_data 1
> >         retain_status_information 1
> >         retain_nonstatus_information 1
> >         notification_interval 60
> >         notification_period 24x7
> >         notification_options w,c,r
> >         contact_groups louafi
> >         stalking_options o,w,u,c
> > }
> >
> > My hosts.cfg is :
> >
> > define host{
> >         host_name 10.10.10.10
> >         alias 10.10.10.10
> >         address 10.10.10.10
> >         check_command check_alive
> >         max_check_attempts 5
> >         checks_enabled 1
> >         flap_detection_enabled 0
> >         process_perf_data 1
> >         retain_status_information 1
> >         retain_nonstatus_information 1
> >         notification_interval 60
> >         notification_period 24x7
> >         notification_options d,u,r
> > }
> >
> > Thanks for your help
> >
> >
> > -----Original Message-----
> > From: Bishop, Dean [mailto:dean.bishop at tcdsb.org]
> > Sent: Tuesday, October 29, 2002 14:24
> > To: 'louafi'; nagios-users at lists.sourceforge.net
> > Subject: RE: [Nagios-users] Execution Queue and latency
> >
> >
> > It would help if you sent a snippet from your services.cfg and hosts.cfg
> >
> > -----Original Message-----
> > From: louafi [mailto:sebastien.louafi at e-qual.fr]
> > Sent: Tuesday, October 29, 2002 5:18 AM
> > To: nagios-users at lists.sourceforge.net
> > Subject: [Nagios-users] Execution Queue and latency
> >
> >
> > Hi,
> >
> > I posted this problem a few weeks ago without getting any answers...
> >
> > I have a high latency value for my check executions.
> >
> > How can I solve this problem?
> >
> > I have the following options set :
> >
> > inter_check_delay_method=s
> >
> > service_interleave_factor=s
> >
> > max_concurrent_checks=0
> >
> > service_reaper_frequency=10
> >
> > Thanks a lot.
> >
> >
> >
> > -------------------------------------------------------
> > This sf.net email is sponsored by:ThinkGeek
> > Welcome to geek heaven.
> > http://thinkgeek.com/sf
> > _______________________________________________
> > Nagios-users mailing list
> > Nagios-users at lists.sourceforge.net
> > https://lists.sourceforge.net/lists/listinfo/nagios-users
> >
> >
> >
> >

