Setting up Escalations.

dOE doepain at gmail.com
Mon Apr 26 20:08:02 CEST 2010


Thank you your guide helped

On Tue, Apr 6, 2010 at 4:31 PM, Giorgio Zarrelli <zarrelli at linux.it> wrote:

> Hi,
>
> the workflow for check/notification/escalation is the following:
>
>
> 1. The service/host is checked in OK state with the check_interval
> timing;
>
> 2. As the service/host goes into a NON OK state, but didn't reach the
> max_check_attempts, the service enters a SOFT NON OK state and the next
> check is scheduled with the retry_interval timing;
>
> 3. As the service/host in NON OK state reaches the
> max_check_attempts_value, the service enters an HARD NON OK STATE and
> the next service/host check is scheduled with the check_interval timing;
>
> 4. Now, if you set first_notification_delay, this can alter the timing
> for the first notification to be sent (0 means notifications to be sent
> immediately);
>
> 5. If you didn't set the first_notification_delay, the first
> notification is immediately sent and the following will be scheduled
> with the notification_interval timing (0 means only the first
> notification will be sent, the other will not be sent);
>
> 6. In your escalation, at the third notification (with the
> notification_interval taking place), the notification interval changes
> to 45 minutes, so the first notification will be sent after the
> max_check_attempts value will be reached (assuming you didn't put any
> delay), then the second after 10 time units, usually 10 minutes, the
> third 10 mins after the second, the fourth 45 mins after the third, the
> fifth 45 mins after the fourth, the sixth 45 mins after the fifth.
>
>
> 7. From the sixth notification, the new escalation comes into play. The
> seventh notification will be sent after 60 minutes, and all the other
> notifications will be sent 60 mins after each other. Keep in mind that
> having used 0 as the last_notification value, you escalation will never
> end till your check will return an OK status.
>
>
> I do not know if I answered to your questions, I hope to have explained
> the notification/escalation timings in a correct and clear manner.
>
> On the Nagios wiki you will find a flowchart I wrote to clarify the
> logic dealing with the notification framework.
>
> Ciao,
>
> Giorgio
>
>
>
> Il giorno lun, 05/04/2010 alle 12.48 -0400, dOE ha scritto:
> > Thanks Giorgia,
> >
> > Then for notification_interval for  production environment set to "10"
> > would process the escalation after 10 minutes of the alert NIT being
> > "ok"?, and subsequent notification_interval should be set to more than
> > "10" so that they would then be notified too?
> >
> > Am I understanding this correctly?
> >
> > On Mon, Apr 5, 2010 at 6:34 AM, Giorgio Zarrelli <zarrelli at linux.it>
> > wrote:
> >         Hi,
> >
> >         First, local definitions win over those written in templates,
> >         so if in the template you have a notification_interval value
> >         and in the escalation you have another, escalation wins and
> >         its value is adopted.
> >
> >         Second, notification_interval il the interval between two
> >         consecutive notifications for a host or a service, after it
> >         enters a non ok status and has exceeded max_check_attempts
> >         value.
> >
> >         Ciao,
> >
> >         Giorgio
> >
> >         Il giorno 05/apr/2010, alle ore 04.50, dOE <doepain at gmail.com>
> >         ha scritto:
> >
> >
> >                 I am having a difficulty getting escalations to work
> >                 on Nagios 3.0.3
> >
> >                 The following is pulled from the documentation:
> >
> >                 define serviceescalation{
> >                        host_name               webserver
> >                        service_description     HTTP
> >                        first_notification      3
> >                        last_notification       5
> >                        notification_interval   45
> >                        contact_groups          ITOps_Oncall,managers
> >                        }
> >
> >                 define serviceescalation{
> >                        host_name               webserver
> >                        service_description     HTTP
> >                        first_notification      6
> >                        last_notification       0
> >                        notification_interval   60
> >                        contact_groups
> >                 ITOps_Oncall,managers,everyone
> >                        }
> >
> >                 I have read the documentation, but I don't understand
> >                 what the "notification_interval" are based on, and
> >                 sine we have hosts inheriting from a "core" template
> >                 it is very difficult to test escalations.
> >                 We use OpCfg to do our Nagios configuration, but it
> >                 does not stop me from occasionally going into the
> >                 actual configuration files to make changes either.
> >
> >                 If anyone has this working, and could shed some light
> >                 on how I can get this to work, or clarify the
> >                 documentation explanation of it.  Also, since I am
> >                 inheriting from a template I feel as though the
> >                 changes I make to a particular host (to test) is being
> >                 ignored or it maybe me not understanding what the
> >                 "notification_interval" are exactly.
> >
> >                 Any advice is very much appreciated.
> >
> >
> ------------------------------------------------------------------------------
> >                 Download Intel® Parallel Studio Eval
> >                 Try the new software tools for yourself. Speed
> >                 compiling, find bugs
> >                 proactively, and fine-tune applications for parallel
> >                 performance.
> >                 See why Intel Parallel Studio got high marks during
> >                 beta.
> >                 http://p.sf.net/sfu/intel-sw-dev
> >                 _______________________________________________
> >                 Nagios-users mailing list
> >                 Nagios-users at lists.sourceforge.net
> >
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> >                 ::: Please include Nagios version, plugin version (-v)
> >                 and OS when reporting any issue.
> >                 ::: Messages without supporting info will risk being
> >                 sent to /dev/null
> >
> >
>
>
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://www.monitoring-lists.org/archive/users/attachments/20100426/a8331ddf/attachment.html>
-------------- next part --------------
------------------------------------------------------------------------------
-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null


More information about the Users mailing list