Unexpected service alerts based on plugin timed out - Bug, WAD, or Administrator

Nolan Martin Nolan.Martin at co.travis.tx.us
Wed Aug 21 20:36:58 CEST 2002


The attempt column will show that it is 1 of 5, etc.  In other words, it
is not 5 of 5.  

I am certain that it is a service alert.  I am seeing these alerts on the
"Service Status Details For All Host Groups" screen, which you reach by
clicking on Status Overview and then on the "All Problem" link near the
top of the page.  Likewise, these alerts do not show under the respective
Host Status totals page.

As part of troubleshooting this issue, I have clicked on the service
and viewed that page as well as the Alert History.  The history shows
that the alerts are in a soft state and for an attempt that is less than
the maximum check attempt.
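
To make sure we are talking about the same behavior, here is a minimal
Python sketch of the soft/hard state logic as I understand it.  It is
not Nagios code: the ServiceCheck class and its process() method are
made-up names for illustration, and re-notification intervals are
ignored.  The point is that a non-OK result below max_check_attempts
should stay in a soft state and not notify.

```python
from dataclasses import dataclass

# Standard plugin return codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3


@dataclass
class ServiceCheck:
    """Hypothetical model of one service's check state (illustration only)."""
    max_check_attempts: int = 5
    attempt: int = 1
    state: int = OK
    state_type: str = "HARD"

    def process(self, result: int) -> bool:
        """Apply one check result; return True if a notification would go out."""
        if result == OK:
            # A recovery is only announced if a hard problem was reached.
            recovered = self.state != OK and self.state_type == "HARD"
            self.state, self.state_type, self.attempt = OK, "HARD", 1
            return recovered
        if self.state == OK:
            # First failure after an OK state starts the retry count.
            self.attempt = 1
        elif self.state_type == "SOFT":
            # Still retrying: count this failure.
            self.attempt += 1
        else:
            # Already in a hard problem state; nothing new to announce here.
            self.state = result
            return False
        self.state = result
        if self.attempt >= self.max_check_attempts:
            self.state_type = "HARD"
            return True
        self.state_type = "SOFT"
        return False
```

With max_check_attempts set to 5, a single Critical result after an OK
leaves this model at attempt 1 of 5 in a soft state with no
notification, which is what I expected to see on my system.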

Thoughts or suggestions?  I would be happy to post any pertinent cfg
files (or portions of), if it sounds as if my Nagios system behavior is
not what others are seeing...

>>> Darren Gamble <Darren.Gamble at sjrb.ca> 08/21/02 01:25PM >>>
Good day,

> > I receive on-going (unexpected and undesired) service alerts due to
> > timed out plugins.  This occurs for pretty much any service, including
> > the check_ping, the check_http and check_nwstat (for abends, etc.).
> > 
> > The problem is that the plugin timed out alerts do not seem to follow
> > the max_check_attempts for the service.  So, even if the previous ping
> > was okay, if the next ping check generates a time out, it immediately
> > generates a service alert (despite the fact that max_check_attempts is
> > set to 5).

This is not working as designed.  I often see plugins time out (for
whatever reason), but the plugin just returns a Critical status and is
treated properly as such.  Nagios cannot tell why the result was
generated, only that the plugin returned "Critical".
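
As a rough illustration, here is a Python sketch of how a timeout ends
up as just another Critical result.  This is not Nagios source:
run_plugin is a hypothetical helper, and mapping a timeout to Critical
is an assumption modeled on the behavior described above.

```python
import subprocess
import sys

# Standard plugin return codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3


def run_plugin(argv, timeout_seconds):
    """Run a plugin command; return (status, first line of its output).

    Hypothetical helper: the caller only ever sees a status code and a
    line of text, so a timeout looks like any other Critical result.
    """
    try:
        proc = subprocess.run(
            argv, capture_output=True, text=True, timeout=timeout_seconds
        )
    except subprocess.TimeoutExpired:
        # Assumption: a timed out plugin is reported as Critical.
        return CRITICAL, "CRITICAL - plugin timed out after %s seconds" % timeout_seconds
    status = proc.returncode
    if status not in (OK, WARNING, CRITICAL, UNKNOWN):
        # Any exit code outside the standard four is treated as Unknown.
        status = UNKNOWN
    lines = proc.stdout.strip().splitlines()
    return status, (lines[0] if lines else "")


if __name__ == "__main__":
    # A stand-in "plugin" that sleeps longer than the timeout allows.
    slow = [sys.executable, "-c", "import time; time.sleep(5)"]
    print(run_plugin(slow, 0.5))
```

Whatever consumes the result only sees the status code, so a timeout
should still be subject to max_check_attempts like any other failure.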

When this happens, what does the "Attempt" column say (what out of what)?

Also, are you sure you're not confusing a host alert with a service
alert?  If the host check fails (the host is checked as a result of even
one failure on a service), you will get a HOST alert for the device, NOT
a service alert.  The host alerts use a different set of configuration
parameters.

============================
Darren Gamble
Planner, Regional Services
Shaw Cablesystems GP
630 - 3rd Avenue SW
Calgary, Alberta, Canada
T2P 4L4
(403) 781-4948

