Building a reliable uptime monitoring model

Matthew Jurgens nagiosusers at edcint.co.nz
Wed Mar 21 00:03:23 CET 2012


>> So I was wondering how is everyone reliably checking and notifying the
>> intended audience of server reboots with high rate of success.
>>

Get a plugin that checks the actual device uptime value.

You can then alert on it when values of uptime are less than whatever 
suits your check frequency and max check attempts values eg 30 minutes. 
How you actually check is going to be dependent on the OS of the device

For Windows servers I use
check_wmi_plus.pl -m checkuptime
This particular plugin also provides performance data and I get a nice 
graph like this -


I also do that same thing (using other plugins) for linux, wireless 
access points etc

I imagine that SNMP probably can get at the uptime values for various 
devices also.

> Smartmon System Monitoring <http://www.smartmon.com.au>
> www.smartmon.com.au <http://www.smartmon.com.au>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://www.monitoring-lists.org/archive/users/attachments/20120321/ddbe72e6/attachment.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: dfehgfbc.png
Type: image/png
Size: 13399 bytes
Desc: not available
URL: <https://www.monitoring-lists.org/archive/users/attachments/20120321/ddbe72e6/attachment.png>
-------------- next part --------------
------------------------------------------------------------------------------
This SF email is sponsosred by:
Try Windows Azure free for 90 days Click Here 
http://p.sf.net/sfu/sfd2d-msazure
-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null


More information about the Users mailing list