Nagios 2.7/Solaris 10 - Excessive Host Notifications

Martin de Koning martin.dekoning at ardec.com.au
Thu Feb 15 06:30:19 CET 2007


Hi everyone,

I'm wondering if someone can help me, I'm getting bombarded with host UP
notifications and no warnings for down hosts or services.

In my inbox I'm getting a constant stream of these warnings:


Subject: Host UP alert for proddse!

***** Nagios  *****

Notification Type: PROBLEM
Host: proddse
State: UP
Address: 10.0.2.1
Info: PING OK - Packet loss = 0%, RTA = 0.31 ms

Date/Time: Thu Feb 15 15:34:26 CST 2007


Subject: Host UP alert for stagingdse!

***** Nagios  *****

Notification Type: PROBLEM
Host: stagingdse
State: UP
Address: 127.0.0.1
Info: PING OK - Packet loss = 0%, RTA = 0.08 ms

Date/Time: Thu Feb 15 15:36:41 CST 2007


Now you can see it reports it as a PROBLEM, yet the output from
check_ping is OK. It also states in the subject that the host is up.

What's going on?? If I look at the Notifications page in the Nagios
website interface, I see all my normal notifications for down services
etc. I just don't see them come into my Inbox and instead I get these
strange warnings.


Here is some important parts from my configuration:

####################################
# COMMANDS
####################################


define command{
         command_name    notify-by-email
         command_line    /usr/bin/printf "%b" "***** Nagios
*****\n\nNotification Type: $NOTIFICATIONTYPE$\nHost: $HOSTNAME$\nState:
$HOSTSTATE$\nAddress: $H
OSTADDRESS$\nInfo: $HOSTOUTPUT$\n\nDate/Time: $LONGDATETIME$\n" |
/usr/bin/mailx -s "Host $HOSTSTATE$ alert for $HOSTNAME$!" $CONTACTEMAIL$
         }

define command{
         command_name    host-notify-by-email
         command_line    /usr/bin/printf "%b" "***** Nagios
*****\n\nNotification Type: $NOTIFICATIONTYPE$\nHost: $HOSTNAME$\nState:
$HOSTSTATE$\nAddress: $H
OSTADDRESS$\nInfo: $HOSTOUTPUT$\n\nDate/Time: $LONGDATETIME$\n" |
/usr/bin/mailx -s "Host $HOSTSTATE$ alert for $HOSTNAME$!" $CONTACTEMAIL$
         }

define command{
         command_name    check-host-alive
         command_line    $USER1$/check_ping -H $HOSTADDRESS$ -w 99,99%
-c 100,100% -p 1
         }

define command{
         command_name    check_ping
         command_line    $USER1$/check_ping -H $HOSTADDRESS$ -w $ARG1$
-c $ARG2$ -p 5
         }


####################################
# HOSTS
####################################

define host{
         use                     generic-host            ; Name of host
template to use
         host_name               stagingdse
         alias                   localhost - Staging Server
         address                 127.0.0.1
         check_command           check-host-alive
         max_check_attempts      10
         check_period            24x7
         notification_interval   30
         notification_period     24x7
         notification_options    d,r
         contact_groups          Ardec
         }

define host{
         use                     generic-host
         host_name               proddse
         alias                   proddse-mon - Production Server
         address                 10.0.2.1
         check_command           check-host-alive
         max_check_attempts      10
         check_period            24x7
         notification_interval   30
         notification_period     24x7
         notification_options    d,r
         contact_groups          Ardec
         }

####################################
# SERVICES
####################################

define service{
         use                             generic-service         ; Name
of service template to use
         host_name                       stagingdse
         service_description             PING
         is_volatile                     0
         check_period                    24x7
         max_check_attempts              4
         normal_check_interval           5
         retry_check_interval            1
         contact_groups                  Ardec
         notification_options            w,u,c,r
         notification_interval           960
         notification_period             24x7
         check_command                   check_ping!100.0,20%!500.0,60%
         }

define service{
         use                             generic-service         ; Name
of service template to use
         host_name                       proddse
         service_description             PING
         is_volatile                     0
         check_period                    24x7
         max_check_attempts              4
         normal_check_interval           5
         retry_check_interval            1
         contact_groups                  Ardec
         notification_options            w,u,c,r
         notification_interval           960
         notification_period             24x7
         check_command                   check_ping!100.0,20%!500.0,60%
         }


Everything is working perfectly (even graphing) except for
notifications, any help would be greatly apreciated.

Thanks
Martin


-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys-and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null





More information about the Users mailing list