Monitoring shared filesystems

Kumar, Ashish xml.devel at gmail.com
Tue May 22 11:10:33 CEST 2012


Dear fellow Nagios users,

We have a bunch of servers on which we have shared filesystems.  The
filesystem actually resides on the SAN storage and are connected to
physical servers.  Let's say /home is shared among server1 and server2.
When /home breaches the WARNING or CRITICAL threshold it raises two
tickets, one from each server.

I tried utilising the check_cluster and check_cluster2 plugins to raise
only one ticket when there is a problem instead of two.

I have the following definitions...

"Disk Monitor /home" is being monitored on server1 and server2 using NRPE
with notifications disabled.

hostgroups
==========

define hostgroup {
        hostgroup_name                 clustered_servers
        alias                          Clustered servers
        members                        server1,server2
        }



servicecluster.cfg
==================
define service {
        service_description             Clustered disk /home
        hostgroup_name                  clustered_servers
        check_command
check_cluster_service2!$SERVICESTATEID:server1:Disk Monitor
/home$,$SERVICESTATEID:server2:Disk Monitor /home$
        max_check_attempts              3
        check_interval                  2
        retry_interval                  1
        notification_interval           15
        contacts                        sysadms
        register                        1
        }


It seems when the hostgroup_name expands internally in Nagios it creates
two service checks in both the servers so it is understood that there will
be two alerts.  What changes can I make to raise only one ticket?  In
certain cases we have around 10 servers with shared filesystems, so you can
understand the havoc it creates.

Please let me know if you have any questions.

Thanks,
Ashish
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://www.monitoring-lists.org/archive/users/attachments/20120522/ed364e1d/attachment.html>
-------------- next part --------------
------------------------------------------------------------------------------
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null


More information about the Users mailing list