Services get stuck in warning/critical state

Thomas Guyot-Sionnest dermoth at aei.ca
Sat Oct 4 05:10:27 CEST 2008


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

On 03/10/08 04:54 AM, Andreas Ericsson wrote:
> Bartlomiej Korupczynski wrote:
>> Hello,
>>
>> I probably should also mention, that nagios host
>> sometimes has quite high load (up to 2.0 on uniprocessor machine), as a
>> result of monitoring scripts. Next thing is that monitoring host has big
>> constant clock skew that I can't get rid of (time runs faster, ca. 5s for
>> every 2 minutes, this gets corrected by ntpdate every 2 minutes).
> 
> And this is almost certainly the problem. Are you running Nagios in a
> vmware system? If yes, what happens when you move it out to its own hardware?
> Nagios relies on a reasonably accurate system clock. One that jumps backwards
> and forwards will cause problems.

Hey Andreas...

FYI I noticed the similar load thing (have no problem with the clock
though)... Upon upgrading from 2.7 to 3.0.1-cvs (IIRC pretty much
between the two releases) I had to adjust the load thresholds for that
box as I had very high spikes. Everything keeps running smoothly, the
number of process, cpu usage, check latency, check execution time and
checks performed per minute graphs are nearly flat (1-minute average
graphs), I/O wait and number of zombies stays very low. There's nothing
else running besides Nagios and a minutely Cacti poller that was there
years ago

The server is a dual CPU (+ HT = 4-way), 2GB ram and fast SCSI RAID,
running at ~ 43% cpu load with ~ 1280 active checks per minute.
temp_path and check_result_path are on tmpfs filesystems.

I attached a graph of the load average...

- --
Thomas
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.6 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iD8DBQFI5t6j6dZ+Kt5BchYRAlavAKDNA1KBgHuHxW9cZ4JsNM0uLgbAAACfdOG6
cTZAhfA7Fv+cKqElq1CFWOg=
=6Ggx
-----END PGP SIGNATURE-----
-------------- next part --------------
A non-text attachment was scrubbed...
Name: nagios_load.png
Type: image/png
Size: 55679 bytes
Desc: not available
URL: <https://www.monitoring-lists.org/archive/users/attachments/20081003/8e24824b/attachment.png>
-------------- next part --------------
-------------------------------------------------------------------------
This SF.Net email is sponsored by the Moblin Your Move Developer's challenge
Build the coolest Linux based applications with Moblin SDK & win great prizes
Grand prize is a trip for two to an Open Source event anywhere in the world
http://moblin-contest.org/redirect.php?banner_id=100&url=/
-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null


More information about the Users mailing list