Nagios and passive checks

pshemko pshemko at ihug.co.nz
Mon Sep 20 00:37:39 CEST 2004


Hi,

I have a Nagios configuration with over 1000 checks (services). All the
status information is sent directly to Nagios through its command file.
There are up to 2000 results a minute, and up to 20 processes send data to
Nagios.
The problem is that within 10 minutes as many as 3000 messages can be lost:
the processes that write to the command file (a pipe) simply can't write
and time out. If a message can't be sent to Nagios for 10 minutes, it is
discarded. On the other hand, sometimes there are no problems at all for
over an hour.
It's not a load-related issue, as the load never gets higher than 1 - 1.5
(the box is a dual Xeon 2.8GHz with 2GB RAM). Nagios doesn't execute any
active checks.
The writing processes are written in Perl and use the following syntax to
send data:

open NAGIOSPIPE, ">>$commandFile";
syswrite NAGIOSPIPE, $messageList;
close NAGIOSPIPE;
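
The snippet above does not check any return values, so it cannot show
whether the open or the write is the step that stalls. A minimal sketch of
a variant that reports the failure reason follows. It is only an
illustration under stated assumptions: send_to_nagios is a hypothetical
helper name (not part of Nagios), the command file is assumed to be a FIFO,
and opening it with O_NONBLOCK is a change from the blocking ">>" open used
above.

```perl
use strict;
use warnings;
use Fcntl qw(O_WRONLY O_NONBLOCK);

# Hypothetical helper, not part of Nagios: write one batch of external
# commands to the command pipe and report why it failed, if it did.
# Returns (1, '') on success or (0, reason) on failure.
sub send_to_nagios {
    my ($command_file, $message_list) = @_;

    # Without this, writing to a FIFO whose reader has gone away would
    # kill the process with SIGPIPE instead of returning an error.
    local $SIG{PIPE} = 'IGNORE';

    # Assumption: the command file is a FIFO. With O_NONBLOCK, the open
    # fails immediately when no process has the FIFO open for reading,
    # instead of blocking until a reader appears.
    sysopen(my $pipe, $command_file, O_WRONLY | O_NONBLOCK)
        or return (0, "open failed: $!");

    # A single syswrite; undef means the write itself failed (for
    # example because the pipe is full), so save $! before close.
    my $written = syswrite($pipe, $message_list);
    my $err = $!;
    close $pipe;

    return (0, "write failed: $err") unless defined $written;
    return (0, "short write: $written of " . length($message_list) . " bytes")
        if $written < length($message_list);
    return (1, '');
}
```

Called as my ($ok, $why) = send_to_nagios($commandFile, $messageList); the
returned reason separates "nobody is reading the pipe" (open fails) from
"the pipe is full" (write fails or is short), which the original snippet
cannot tell apart.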

Does anyone have a clue why the processes time out so often?

Regards
Pshemko


