notifications for recovery, but not on error

Jon Lyons jlyons30 at yahoo.com
Sat Nov 2 19:29:18 CET 2002


Is it the same host or hosts having the problem, or random hosts? Dependencies? Maybe attach your hosts/services/depends for a second pair of eyes..
 Greg Haygood <ghaygood at brightlane.com> wrote:I wish it were that easy, but all contacts are defined like that.   When I view the config cgi, the "good" and "bad" hosts & services seem to have the same parameters across the board.  I even setup a wildcard hostgroup that included every host, adding me as the sole contact.  Again, when I faked the outages, I encountered the same inconsistent notifications as before.   Anything else I should be looking for?-----Original Message-----
From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net]On Behalf Of Jon Lyons
Sent: Friday, November 01, 2002 11:42 PM
To: Greg Haygood; nagios-users at lists.sourceforge.net
Subject: Re: [Nagios-users] notifications for recovery, but not on error


What do you have defined for the host & services? Maybe a typo?

service_notification_options    w,u,c,r
host_notification_options       d,u,r



 Greg Haygood <ghaygood at brightlane.com> wrote: 
I'm having a frustrating problem with the notifications. Don't remember
anything coming across the list in the last couple of months, and there's
nothing in the archives that I can find. Some of my hosts/services react
the way I expect them to. A lot of them don't, and I can't figure out why.

I'm simulating outages with ipf on the nagios host, by cutting off access to
various hosts and services. When the outage occurs, nagios always notices
and logs the problem. Sometimes it will log the H/S NOTIFICATIONs, other
times it won't. When it does, notifications go out fine; when they don't,
nothing (obviously). However, when I re-enable the access (by turning off
ipf), I get recovery notices for everything, including the H & S that I
didn't receive outage notices for. The notifications info for any host or
service reflect the same: no problems were rep! orted, but all recoveries
were. Argh.

My timeperiods are all 24x7. The ones that work are in two hostgroups, but
all the other groups look the same (and some are configured the same, except
for name & label). My contacts and contact groups look fine. I've
forcefully re-enabled all checks and notifications via a * hostgroup. I've
zero'd the retention variables and killed the state retention file. Nada.

Anyone have any ideas, or seen something similar to this? Does this make
sense? I've been pounding on it most of the week, so my head's a little
foggy by now.

My configs are split among a lot of files, but I'll post if necessary.

Thanks in advance.

-g

--
Greg Haygood
ghaygood at brightlane.com
v) 678.385.2837
Director of Operations
BrightLane, a TeamStaff, Inc. Company




-------------------------------------------------------
This sf.net email is sponsored by: See the NEW Palm 
T! ungsten T handheld. Power & Color in a compact size!
http://ads.sourceforge.net/cgi-bin/redirect.pl?palm0001en
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users


---------------------------------
Do you Yahoo!?
HotJobs - Search new jobs daily now


---------------------------------
Do you Yahoo!?
HotJobs - Search new jobs daily now
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://www.monitoring-lists.org/archive/users/attachments/20021102/d1f1ac93/attachment.html>


More information about the Users mailing list