Escalate after X warnings or criticals

Andrew Li andrew1.li at citi.com
Wed Jun 16 03:31:28 CEST 2010


Does anyone know if the notification count problem got fixed in 3.2.1?

I had a read of the ChangeLog but it doesn't mention anything related to
this problem since 3.0.6.

Cheers,
Andrew


On Mon, 2009-11-09 at 08:55, Neil Ramsay wrote:
> Hi Martin,
> 
> The escalation_options don't take the state into consideration during
> the notification count. So if you have an escalate rule on the 4th
> notification and only escalate on Critical in the escalation_options
> then following scenario is can occur: 
> You have 3 warning notifications and the 4th is Critical then it will
> escalate as there have been 4 notifications and a Critical. I posted a
> help request on this issue a week or two ago and would really like
> this to be patched or built into the next update.
> http://article.gmane.org/gmane.network.nagios.user/64997/match=escalation+state
> 
> Cheers,
> 
> Neil
> 
> On Sat, Nov 7, 2009 at 12:56 AM, Martin Melin <mmelin at gmail.com>
> wrote:
>         The existing escalation_options directive in escalation
>         definitions will likely get you this behavior without the need
>         for a patch.
>         
>         http://nagios.sourceforge.net/docs/3_0/escalations.html - see
>         the very bottom of this page as well as the object definition
>         documentation for escalation_options.
>         
>         Regards,
>         Martin Melin
>         
>         On Fri, Nov 6, 2009 at 3:49 AM, Mark Gius
>         <mgius at createspace.com> wrote:
>                 Currently, service notifications contain
>                 "first/last_notification"
>                 directives, that specify the range of notifications
>                 that the escalation
>                 should apply to.  This method of escalation has a
>                 weakness however.
>                 
>                 At my work, we let warnings go to the default contact
>                 (which happens to
>                 be email), and escalate to a pager chain on critical.
>                 However, if a
>                 service sits in WARNING for a length of time (which is
>                 likely to happen
>                 in the middle of the night), by the time the service
>                 enters a CRITICAL
>                 state the notification count exceeds our highest
>                 escalation, and our
>                 entire team gets paged immediately.
>                 
>                 What I'd like to see is the ability to distinguish
>                 between a WARNING
>                 notification and a CRITICAL notification in the
>                 escalation, and set up
>                 escalation chains that work based on the number of
>                 CRITICAL's that have
>                 been sent, as opposed to the total number of
>                 notifications.
>                 
>                 I am planning on patching nagios to support this
>                 behavior if there isn't
>                 a way to achieve this behavior with the current
>                 implementation.  My plan
>                 is to add a warning/critical count to service, add a
>                 first/last
>                 warning/critical state to service escalations, and add
>                 the directives
>                 "(first|last)_(warning|critical)_notification" to the
>                 service escalation
>                 configs.  The idea is also to keep the current
>                 behavior
>                 (notification_count and first/last_notification would
>                 still be present),
>                 but allow finer grained control over when escalations
>                 are sent out.
>                 This way if somebody didn't want to use the finer
>                 grained control their
>                 behavior would stay the same.  My current plan is to
>                 match the
>                 escalation if _any_ of the 3 notification ranges match
>                 (all/warning/critical).
>                 
>                 Any advice on making this behavior happen with Nagios
>                 as-is, or
>                 suggestions/advice on the implementation are welcome.
>                 
>                 -Gius
>                 


------------------------------------------------------------------------------
ThinkGeek and WIRED's GeekDad team up for the Ultimate 
GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the 
lucky parental unit.  See the prize list and enter to win: 
http://p.sf.net/sfu/thinkgeek-promo
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null





More information about the Users mailing list