A lot of process using nagios distributed monitor

Marc Powell marc at ena.com
Mon Jul 17 17:24:53 CEST 2006



> -----Original Message-----
> From: nagios-users-bounces at lists.sourceforge.net [mailto:nagios-users-
> bounces at lists.sourceforge.net] On Behalf Of Tiago Cruz
> Sent: Monday, July 17, 2006 7:52 AM
> To: nagios-users at lists.sourceforge.net
> Subject: [Nagios-users] A lot of process using nagios distributed monitor
> 
> Hello all,
> 
> I'm using 3 servers running Nagios 2.4 in a distributed architecture:
> 
> +-------------+    +--------------+    +-------------+
> | 10.0.0.170  |    | 172.16.24.16 |    | 192.168.0.5 |
> | Nagios #1   | => | Nagios #2    | <= | Nagios #3   |
> | 200 clients |    | Display      |    | 100 clients |
> +-------------+    +--------------+    +-------------+
> 
> All the clients have nrpe running. My problem is with the Display
> Server (which runs apache and nsca): it starts with 1 "nagios"
> process, and after 2 or 3 hours there are a lot of "nagios"
> processes, around 600 and still increasing, until the machine runs
> out of swap and crashes :-/

This shouldn't be an issue. I have a very similar setup: 5 data
collectors reporting 3646 total services to 2 central hosts.

Verify that you have command_check_interval in nagios.cfg set to a
reasonable number (I use -1, which makes Nagios check for external
commands as often as possible).
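
As a rough sketch of what that looks like in nagios.cfg on the central
server: only the -1 value comes from the message above. The
check_external_commands line is an assumption, on the grounds that NSCA
hands passive results to Nagios through the external command file.

```
# nagios.cfg fragment (central/display server)

# Assumption: external commands are enabled so that NSCA can submit
# passive check results via the command file.
check_external_commands=1

# -1 = check the external command file as often as possible.
command_check_interval=-1
```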

Verify that your host check command is reasonable and exits quickly
(e.g. a single ping is usually sufficient). I personally don't use host
checks at all due to the volatility of the devices I'm monitoring.
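
For illustration, a single-ping host check could be defined roughly as
follows. This is a sketch along the lines of the sample check-host-alive
command distributed with Nagios, not the definition used on the setup
described here; the thresholds are illustrative and $USER1$ is assumed
to point at your plugin directory.

```
# Host check that sends exactly one ICMP packet (-p 1) so it exits quickly.
define command{
        command_name    check-host-alive
        command_line    $USER1$/check_ping -H $HOSTADDRESS$ -w 3000.0,80% -c 5000.0,100% -p 1
        }
```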

Setting aggregate_status_updates to 1 may help; status_update_interval
is 15 here.
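
In nagios.cfg those two settings would look like this (values taken from
the paragraph above; the comments are a summary of how Nagios 2.x treats
them):

```
# nagios.cfg fragment

# Write status data in one aggregated pass instead of on every
# host/service change.
aggregate_status_updates=1

# Seconds between aggregated status updates (only used when
# aggregate_status_updates=1).
status_update_interval=15
```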

Verify that your hardware (specifically the drives) is operating
correctly and that the machine isn't starved for resources (cpu, memory,
etc). I had a similar problem once, caused by bad writes to a drive in a
hardware mirror. The drive didn't fail out, but the performance was
miserable.

--
Marc

