Service Configuration Question

Paul L. Allen pla at softflare.com
Wed Mar 24 19:46:49 CET 2004


joerg.helmert at aracomp.de writes: 

> This is true, but you need additional service checks for this.

Yes and no.  If everything is OK you know the transport is working.  If
everything is critical you don't know if the transport has died or if
the machine itself has died.  If you have a mix and at least one OK
is more recent than the criticals, you know the transport is working. 

> Maybe ssh is not the best example, cause it might be needed for some
> other production work, not only for monitoring.

All the machines we monitor are remotely located.  We DO need SSH.
However, I've found that anything that kills SSH connections usually
kills the whole box.  The usual reason for this happening is somebody
forgetting to correct the broken logrotate configuration for mailman
on Red Hat 7.3 and eventually it fork-bombs the machine trying to
rotate zillions of misnamed mailman logs so sshd can't fork a shell
for you. 

> Of course, if I run 20 or so checks by snmp against one host, it might be
> wise to implement a check if snmp is up itself.

If you're running 20 or so checks you'll know if snmp is OK if one of
the checks returns an OK.  If none of them do then the machine could
be down or snmp could be down and there's no way of telling which is
the problem. 

> That way making one able to make all the "checks by snmp" dependent on 
> that "check of snmp". Would give 1 notification instead of 20 if snmpd 
> dies...

But it could really be that the machine itself is dead and so you're
suppressing notifications that might actually be true. 

> The conclusion is, that it is enough to send an unknown state and that
> what _I_ wanted to achieve can be done with that.

So everyone is happy with the status quo now. 

-- 
Paul Allen
Softflare Support 



-------------------------------------------------------
This SF.Net email is sponsored by: IBM Linux Tutorials
Free Linux tutorial presented by Daniel Robbins, President and CEO of
GenToo technologies. Learn everything from fundamentals to system
administration.http://ads.osdn.com/?ad_id=1470&alloc_id=3638&op=click
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null





More information about the Users mailing list