service notification logged but not done

MAYER Hans mayer at iiasa.ac.at
Thu Mar 10 09:05:07 CET 2011


Dear Chad


> Have you recently upgraded Nagios?
Yes, I have been running Core 3.2.3 since Feb 24th.


> When did you start noticing that it was missing execution runs?
I noticed the problem a month ago, even with version 3.2.1. That is why I upgraded to the latest version, to see whether this would fix the problem.


> Do you have enough disk space free?
As I said: 91% free, only 9% used.


> What are the permissions of the script set to?
-rwxr-xr-x   1 nagios   nagios      1035 Feb 18 10:17 rshsendsms
As I said, it happens only sometimes. Wrong permissions would mean the script never works at all.


> Were they recently changed?
No.


> Have you done any type of software changes with any type of supporting packages (i.e. Perl) that could have brought up this issue?
No, this server has been running unchanged since Jun 2010.

What happens within Nagios between writing the log entry and executing the script? Something prevents the script from being executed, but only sometimes.
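
One way to narrow this down would be to put a small wrapper between Nagios and the script, so that every start, all output and the exit status get recorded. The following is only a sketch, assuming a Bourne-compatible shell; the function name run_logged, the variable WRAPLOG and its /tmp default are invented for illustration and do not exist in the current setup:

```shell
#!/bin/sh
# Hypothetical wrapper (not part of the existing setup): run the command
# given as arguments and record its start, its output and its exit status.
WRAPLOG=${WRAPLOG:-/tmp/notify_wrapper.log}   # assumed log location

run_logged() {
    # record that the wrapper was reached, with the full command line
    echo "`date '+%y%m%d %H:%M:%S'` start: $*" >> "$WRAPLOG"
    status=0
    # run the real command; stdout and stderr both go to the log
    "$@" >> "$WRAPLOG" 2>&1 || status=$?
    echo "`date '+%y%m%d %H:%M:%S'` exit=$status" >> "$WRAPLOG"
    return $status
}

# only act when a command was actually passed in
if [ $# -gt 0 ]; then
    run_logged "$@"
fi
```

If this were saved under a hypothetical name such as notify_wrapper in the plugin directory, the command_line could call it with $USER1$/rshsendsms and its arguments appended. A missing "start:" line would then suggest that the command was never started at all, while a "start:" line without a matching SMS would point at the script itself.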


Kind regards
Hans


From: Chad Rhyner [mailto:crhyner at box.net]
Sent: Wednesday, March 09, 2011 6:32 PM
To: Nagios Users List
Cc: MAYER Hans
Subject: Re: [Nagios-users] service notification logged but not done

Have you recently upgraded Nagios?  When did you start noticing that it was missing execution runs?  Do you have enough disk space free?  What are the permissions of the script set to?  Were they recently changed?  Have you done any type of software changes with any type of supporting packages (i.e. Perl) that could have brought up this issue?

Here are some thoughts on where I would start looking.  Anything that you can dig up we can look at more closely to identify a potential cause for this issue.

~Chad
On Wed, Mar 9, 2011 at 1:29 AM, MAYER Hans <mayer at iiasa.ac.at> wrote:


Dear all

I have been using Nagios for many years, starting with one of the first versions of "netsaint", and I have more than 25 years of experience with UNIX, but I now have a strange problem I have never had before.
I am running Nagios Core 3.2.3 on Solaris 10. The hardware is an M3000 with SPARC V9 architecture.

My problem is that sometimes, not always, I see a service notification in the log, but the notification command is not actually run.

Here is an example entry from the log:

[03-09-2011 09:13:25] SERVICE NOTIFICATION: sms_mayer;amazon;DISK/p14amazon;OK;notify-service-by-sms;DISK OK - free space: /p14amazon 4531 MB (6% inode=99%):

Here is the definition for notify-service-by-sms

# 'notify-service-by-sms' command definition
define command{
        command_name    notify-service-by-sms
        command_line    $USER1$/rshsendsms $CONTACTPAGER$ \"Info: $HOSTALIAS$/$SERVICEDESC$ $SERVICEOUTPUT$ \"
        }


As you can see, I execute a command named "rshsendsms". These are the first lines of the shell script:

:

# Wed Jan 19 10:12:15 MET 2011 - mayer initial
# Wed Feb 16 10:11:54 MET 2011 - mayer logging the UID

# usage:
# rshsendsms 0043664xxxxxxx  '"hello world - how are you "'
# info: both types of apostrophes are important

export PATH LOG NUMBER TEXT ID UID NOTSENT RUNLOG

PATH=/usr/bin:$PATH

LOG=/var/adm/rshsendsms.log
RUNLOG=/var/adm/rshsendsms_run.log

date '+%y%m%d %H:%M' >> $RUNLOG

The first thing the script does is write a log entry (91% of the disk is free). But in this case I cannot find the entry. The last one is dated 110309 06:39, when I really did receive an SMS. I also switched on process accounting weeks ago, but there is no record that the shell script was executed.
I also switched on the debug facility of "syslog". There I can find an entry equivalent to the one in the Nagios log, but no other messages indicating that something went wrong.
On the other hand, I was notified at 06:39 and nothing was changed in the meantime. This is not the first time this problem has happened. Most of the time notification works fine, but sometimes it does not. This is of course a pain, as notification is a central function of Nagios.
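
To make individual runs easier to match against the Nagios log, the first log line could carry more than the date and minute. The following is only a sketch, assuming a Bourne-compatible shell; the function name log_invocation and the /tmp default for RUNLOG are made up for illustration and differ from the real script:

```shell
#!/bin/sh
# Hypothetical variant of the first logging step: one line per run with
# seconds, process ID, the output of id(1), and the arguments received.
RUNLOG=${RUNLOG:-/tmp/rshsendsms_run.log}   # assumed path for illustration

log_invocation() {
    stamp=`date '+%y%m%d %H:%M:%S'`
    who=`id`
    # $# and $* are the arguments passed to this function
    echo "$stamp pid=$$ id=[$who] argc=$# args=[$*]" >> "$RUNLOG"
}

# log this run before doing anything else
log_invocation "$@"
```

With seconds and the PID in each line, a run log entry can be compared directly with the timestamp of the SERVICE NOTIFICATION entry and with the process accounting records.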

Any idea where I can start searching for the error?

Kind regards
Hans




_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
