Nagios 3.0.4 performance issue

Alloo, Vincent v-alloo at ti.com
Wed Nov 19 13:56:46 CET 2008


Andreas,
Here is an extract of my setup:

define servicegroup{
	servicegroup_name	nrpe_services
	alias			NRPE Services
}

define servicedependency{
	host_name			svxnagios02
	service_description		check_uname
	dependent_servicegroup_name	nrpe_services
	notification_failure_criteria	w,u,c
}

define service {
	use				unix_24_7
	host_name			svxnagios02
	service_description		check_uname
	check_command			check_nrpe_ssl!uname!0
	notification_options		c,r
	process_perf_data		0
}

And a bunch of:
define service {
	use				unix_24_7
	hostgroup_name			sol-servers,linux-servers,sol-zone-servers,sol-servers-with_hotspare
	service_description		CPU load
	check_command			check_nrpe_ssl!check_load!5,4,3!6,5,4
	servicegroups			nrpe_services
}
.....(3600 services within the nrpe_services service group)

Vincent Alloo
TI France Design Systems Operations Manager
Europe and Middle East IT Services
Texas Instruments France

E-Mail: v-alloo at ti.com
Phone: +33 4 93 22 26 97
Mobile: +33 6 82 13 00 80

-----Original Message-----
From: Andreas Ericsson [mailto:ae at op5.se] 
Sent: Wednesday, November 19, 2008 12:56 PM
To: Alloo, Vincent
Cc: nagios-users at lists.sourceforge.net
Subject: Re: [Nagios-users] Nagios 3.0.4 performance issue

Alloo, Vincent wrote:
> By removing all "servicegroups", my load is back to normal.
> I had all my NRPE services within the same service group in
> order to put in place a service dependency. It means 3600
> services within the same service group.
> It was causing the huge load seen on my machine.
> Is it a normal behavior, or is it a bug?
> 

It's a bug, but in the configuration. A servicegroup with 3600
services cannot be used to create a sane servicedependency setup.

My guess is that you somehow managed to set a *lot* of service
dependencies for a *lot* of services, which in turn meant that
all of those dependencies had to be checked each time one of
the services had to be run. In short, before each service check,
a list with 3600 items was traversed. That would add quite a
substantial load on the CPU, so I'm not surprised.
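
To put a rough number on that guess, here is a back-of-the-envelope sketch in Python. It is not Nagios code. It assumes that a servicedependency using dependent_servicegroup_name expands into one dependency entry per member of the group, and that the whole dependency list is scanned before every service check, as described above. The function names and the generated service names are made up for illustration.

```python
# Rough cost model only; none of this is taken from the Nagios source.
# Assumption 1: one dependency entry is created per servicegroup member.
# Assumption 2: the full dependency list is walked before each service check.

def expand_dependencies(master, dependents):
    """Return one (master, dependent) pair per dependent service."""
    return [(master, dep) for dep in dependents]

def comparisons_per_cycle(num_services, dependencies):
    """Count list items visited if every service check scans the full list."""
    return num_services * len(dependencies)

# Hypothetical stand-ins for the 3600 members of nrpe_services.
services = [f"nrpe_service_{i}" for i in range(3600)]
deps = expand_dependencies(("svxnagios02", "check_uname"), services)
total = comparisons_per_cycle(len(services), deps)

print(len(deps), total)  # 3600 12960000
```

Under those assumptions, one pass over all 3600 services visits close to 13 million list items, which would fit the load being reported.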

Can you please copy-paste the servicedependency you had created
using this supersized servicegroup?

Thanks

-- 
Andreas Ericsson                   andreas.ericsson at op5.se
OP5 AB                             www.op5.se
Tel: +46 8-230225                  Fax: +46 8-230231

