Service notifications when host is down

Quanah Gibson-Mount quanah at stanford.edu
Thu Apr 22 17:53:32 CEST 2004



--On Thursday, April 22, 2004 9:04 AM -0400 Sean Dilda 
<agrajag at dragaera.net> wrote:

> On Wed, 2004-04-21 at 19:20, Quanah Gibson-Mount wrote:
>> Quoting Ben Whaley <Benjamin.Whaley at colorado.edu>:
>>
>> >  > I though of setting a 'ping' service check on the host, and making
>> > all other
>> >  > checks dependent on it, but that seems to me to be more of a
>> > workaround than
>> >  > a solution, and it doesn't fully solve the scheduled downtime
>> > problem.
>> >
>> > Yes, we had the same idea. I am currently using some other, similar
>> > work
>> >
>> > arounds to solve problems that Nagios doesn't have a solution for but
>> > they have introduced more problems than they've fixed.
>> >
>> > What's strange about this particular case, however, is that Nagios
>> > *usually* catches it. For example, in the following sequence, the host
>> > down alert was generated before the service checks, thus avoiding the
>> > unnecessary notification:
>>
>> The consistent problem we have seen with Nagios is that once a host goes
>> down, it only ever emits a single host down alert, and does not keep
>> paging  that the host is down (it correctly does not keep paging that
>> the services  for the host are down).  Despite querying the list, an
>> answer to this has  never appeared.  Perhaps it is time to file a bug on
>> this in sourceforge.
>
> The only time I ever had this problem was because my notification
> commands were broken.  Nagios uses different notification commands for
> hosts and services, so your service notification command could be
> working while your host one isn't.  Did you check the logs to see if
> nagios attempted a notification?

Please read what I said... We do get >one< notification.  If the 
notification script was broken, we wouldn't get any.  The problem is that 
Nagios never sends out any more notifications after the initial one. 
Viewing the logs shows that Nagios does continue to query the host via the 
host check command, and continues to see that it is down -- It just never 
sends out further notifications that the host is down.  We even verified 
that Nagios was correctly checking the host by sniffing the TCP traffic 
between Nagios and the host.

--Quanah

--
Quanah Gibson-Mount
Principal Software Developer
ITSS/TSS/Computing Systems
ITSS/TSS/Infrastructure Operations
Stanford University
GnuPG Public Key: http://www.stanford.edu/~quanah/pgp.html


-------------------------------------------------------
This SF.Net email is sponsored by: IBM Linux Tutorials
Free Linux tutorial presented by Daniel Robbins, President and CEO of
GenToo technologies. Learn everything from fundamentals to system
administration.http://ads.osdn.com/?ad_id=1470&alloc_id=3638&op=click
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null





More information about the Users mailing list