problem starting nagios

Anirudh Srinivasan srianirudh at gmail.com
Wed Mar 18 18:13:26 CET 2009


First of all please dont bother because the reply is too big. I need to get
this stuff done , been fighting with this for a while and finally came here
for help .
Sorry i was out for a conference , and thats why could not reply to you in
time.

1) What does nagios.log have to say?

 Error: Could not create external command file
'/usr/local/nagios/var/rw/nagios.cmd' as named pipe: (2) -> No such file or
directory.  If this fi
le already exists and you are sure that another copy of Nagios is not
running, you should delete this file.
[1237394094] Bailing out due to errors encountered while trying to
initialize the external command file... (PID=31557)

2) Do you see the process running in the ps list?

no , i dont see any proces running

3) Can you run nagios in the foreground as root (/usr/local/nagios/bin/
nagios /usr/local/nagios/etc/nagios.cfg)?

This is what i see when i run this .and to tell you i dint create time
period which i think is no way related to this problem:

Nagios 3.0.6
Copyright (c) 1999-2008 Ethan Galstad (http://www.nagios.org)
Last Modified: 12-01-2008
License: GPL

Nagios 3.0.6 starting... (PID=31661)
Local time is Wed Mar 18 12:59:33 EDT 2009
Error: Template 'generic-host' specified in host definition could not be not
found (config file '/usr/local/nagios/etc/objects/localhost.cfg', starting
on line 20)
Error: Template 'generic-service' specified in service definition could not
be not found (config file '/usr/local/nagios/etc/objects/localhost.cfg',
starting on line 64)
Warning: Service 'Current Load' on host 'localhost' has no check time period
defined!
Warning: Service 'Current Load' on host 'localhost' has no notification time
period defined!
Warning: Service 'Current Users' on host 'localhost' has no check time
period defined!
Warning: Service 'Current Users' on host 'localhost' has no notification
time period defined!
Warning: Service 'HTTP' on host 'localhost' has no check time period
defined!
Warning: Service 'HTTP' on host 'localhost' has no notification time period
defined!
Warning: Service 'PING' on host 'localhost' has no check time period
defined!
Warning: Service 'PING' on host 'localhost' has no notification time period
defined!
Warning: Service 'Root Partition' on host 'localhost' has no check time
period defined!
Warning: Service 'Root Partition' on host 'localhost' has no notification
time period defined!
Warning: Service 'SSH' on host 'localhost' has no check time period defined!
Warning: Service 'SSH' on host 'localhost' has no notification time period
defined!
Warning: Service 'Swap Usage' on host 'localhost' has no check time period
defined!
Warning: Service 'Swap Usage' on host 'localhost' has no notification time
period defined!
Warning: Service 'Total Processes' on host 'localhost' has no check time
period defined!
Warning: Service 'Total Processes' on host 'localhost' has no notification
time period defined!
Warning: Contact 'nagiosadmin' has no service notification time period
defined!
Warning: Contact 'nagiosadmin' has no host notification time period defined!
Error: Could not create external command file
'/usr/local/nagios/var/rw/nagios.cmd' as named pipe: (2) -> No such file or
directory.  If this file already exists and you are sure that another copy
of Nagios is not running, you should delete this file.
Bailing out due to errors encountered while trying to initialize the
external command file... (PID=31661)


When run as nagios user i see same thing as above

In daemon mode as the nagios user (/usr/local/nagios/bin/nagios -d /
usr/local/nagios/etc/nagios.cfg)?

when i run this , and press enter it takes it but doesnt show anything ,
meaning like this:

[nagios at DELL8B81Y11 ~]$ /usr/local/nagios/bin/nagios -d
/usr/local/nagios/etc/nagios.cfg
[nagios at DELL8B81Y11 ~]$

Is there anything special about the mount point that nagios lives under?

no there is no such special mount point



Thanks in adv


On Mon, Mar 16, 2009 at 3:06 PM, <nagios-users-request at lists.sourceforge.net
> wrote:

> Send Nagios-users mailing list submissions to
>        nagios-users at lists.sourceforge.net
>
> To subscribe or unsubscribe via the World Wide Web, visit
>        https://lists.sourceforge.net/lists/listinfo/nagios-users
> or, via email, send a message with subject or body 'help' to
>        nagios-users-request at lists.sourceforge.net
>
> You can reach the person managing the list at
>        nagios-users-owner at lists.sourceforge.net
>
> When replying, please edit your Subject line so it is more specific
> than "Re: Contents of Nagios-users digest..."
>
>
> Today's Topics:
>
>   1. Re: Host checks instead of service checks (Richard Quintin)
>   2. Re: Host checks instead of service checks (Deborah Martin)
>   3. Re: Host checks instead of service checks (Deborah Martin)
>   4. send_mail.pl not work Nagios 2.9 (Carlos Herrera Polo)
>   5. Monitoring IBM server-hardware (x3650) running    Windows
>      (Klaus Umbach)
>   6. Re: Monitoring IBM server-hardware (x3650) running        Windows
>      (Kevin Keane)
>   7. Re: Host checks instead of service checks (Jim Avery)
>   8. Re: Host checks instead of service checks (Deborah Martin)
>   9. problem starting nagios (Anirudh Srinivasan)
>  10. Re: problem starting nagios (Marc Powell)
>
>
> ----------------------------------------------------------------------
>
> Message: 1
> Date: Mon, 16 Mar 2009 11:26:12 -0400
> From: Richard Quintin <rich+nagios at quintinz.com<rich%2Bnagios at quintinz.com>
> >
> Subject: Re: [Nagios-users] Host checks instead of service checks
> To: Deborah Martin <Deborah.Martin at kognitio.com>
> Cc: "nagios-users at lists.sourceforge.net"
>        <nagios-users at lists.sourceforge.net>
> Message-ID:
>        <822f14060903160826r5f6c711ds7e97a9670d647559 at mail.gmail.com>
> Content-Type: text/plain; charset=ISO-8859-1
>
> You could use check_dummy for the host check and just have it always
> return OK.
> Or perhaps the opposite you could add a check_dummy service check for
> all hosts.
>
> Which you choose will depend upon your personal preference and how you
> want dependencies to be handled.
>
> On Mon, Mar 16, 2009 at 11:06 AM, Deborah Martin
> <Deborah.Martin at kognitio.com> wrote:
> > Hi Folks,
> >
> > Currently, the main nagios box (running old version of SuSE) and Nagios
> > 2.0b4 is running just with service checks, one of which is an ssh check
> (we
> > don't allow ping)
> >
> > So in the web interface, I see all hosts as up.
> >
> > However, I've built a new box (which hopefully will replace the above)
> with
> > SLES 10SP1 and Nagios 3.0.6. I've put the same config files (services.cfg
> > and hosts.cfg)
> >
> > on the this new system. But now the hosts all show as "Pending". So I
> then
> > moved the ssh check from a service check to a host check and can see that
> > the more hosts I move the? less hosts are pending. That's good so far...
> >
> > But, when I run the pre-flight check (/usr/local/nagios/bin/nagios -v
> > nagios.cfg), I now get warnings to say some hosts don't have any service
> > checks associated with it!
> >
> > This is true as some nodes will only have ssh checks against them whilst
> > others will have other checks against them. I'd rather it didn't warn me
> as
> > I have hundreds
> >
> > of hosts appear in the pre-flight check warnings and it now looks
> incredibly
> > untidy to see all this.
> >
> > How can I get rid of these warnings ?
> >
> > I'm probably missing something here so any help would be appreciated. I'm
> > wondering for example, if I should force the initial state to be UP for
> > hosts rather than
> >
> > moving the ssh service check to a host check. But then what would happen
> if
> > the service check found a node was down - would it reflect that in the
> "host
> > problems" ?
> >
> > regards,
> > deborah
> >
> >
> ***************************************************************************
> > This email and any files transmitted with it are confidential and
> intended
> > solely for the use of the individual or entity to whom they are
> addressed.
> >
> > Any unauthorised distribution or copying is strictly prohibited.
> > Whilst Kognitio Limited takes steps to prevent the transmission of
> viruses
> > via e-mail, we can not guarantee that any email or attachment is free
> from
> > computer viruses and you are strongly advised to undertake your own
> > anti-virus precautions.
> >
> > Kognitio grants no warranties regarding performance, use or quality of
> any
> > e-mail or attachment and undertakes no liability for loss or damage,
> > howsoever caused.
> >
> > Kognitio Limited, a company registered in England and Wales. Registered
> > number 0212 7833. Registered Office: 3a Waterside Park, Cookham Road,
> > Bracknell, Berks, RG12 1RB. VAT number 864 4378 92.
> >
> > Kognitio Inc, a company incorporated in Delaware, principal office 180
> North
> > Stetson, Suite 3500, Chicago, IL 60601, USA
> >
> ***************************************************************************
> >
> >
> ------------------------------------------------------------------------------
> > Apps built with the Adobe(R) Flex(R) framework and Flex Builder(TM) are
> > powering Web 2.0 with engaging, cross-platform capabilities. Quickly and
> > easily build your RIAs with Flex Builder, the Eclipse(TM)based
> development
> > software that enables intelligent coding and step-through debugging.
> > Download the free 60 day trial. http://p.sf.net/sfu/www-adobe-com
> > _______________________________________________
> > Nagios-users mailing list
> > Nagios-users at lists.sourceforge.net
> > https://lists.sourceforge.net/lists/listinfo/nagios-users
> > ::: Please include Nagios version, plugin version (-v) and OS when
> reporting
> > any issue.
> > ::: Messages without supporting info will risk being sent to /dev/null
> >
>
>
>
> --
> Richard Quintin, DBA
> Database & Application Administration
> Virginia Tech
>
>
>
> ------------------------------
>
> Message: 2
> Date: Mon, 16 Mar 2009 15:34:25 -0000
> From: Deborah Martin <Deborah.Martin at Kognitio.com>
> Subject: Re: [Nagios-users] Host checks instead of service checks
> To: 'Richard Quintin' <rich+nagios at quintinz.com<rich%2Bnagios at quintinz.com>
> >
> Cc: nagios-users at lists.sourceforge.net
> Message-ID:
>        <84836290D5AD43418C40DCF0C4A54ED38E3B40 at kogex02.pmpl.co.uk>
> Content-Type: text/plain; charset="iso-8859-1"
>
> Thanks Richard. I'll give that a try.
>
> regards,
> deborah
>
>
> -----Original Message-----
> From: Richard Quintin [mailto:rich+nagios at quintinz.com<rich%2Bnagios at quintinz.com>
> ]
> Sent: 16 March 2009 15:26
> To: Deborah Martin
> Cc: nagios-users at lists.sourceforge.net
> Subject: Re: [Nagios-users] Host checks instead of service checks
>
> You could use check_dummy for the host check and just have it always return
> OK.
> Or perhaps the opposite you could add a check_dummy service check for all
> hosts.
>
> Which you choose will depend upon your personal preference and how you want
> dependencies to be handled.
>
> On Mon, Mar 16, 2009 at 11:06 AM, Deborah Martin
> <Deborah.Martin at kognitio.com> wrote:
> > Hi Folks,
> >
> > Currently, the main nagios box (running old version of SuSE) and
> > Nagios
> > 2.0b4 is running just with service checks, one of which is an ssh
> > check (we don't allow ping)
> >
> > So in the web interface, I see all hosts as up.
> >
> > However, I've built a new box (which hopefully will replace the above)
> > with SLES 10SP1 and Nagios 3.0.6. I've put the same config files
> > (services.cfg and hosts.cfg)
> >
> > on the this new system. But now the hosts all show as "Pending". So I
> > then moved the ssh check from a service check to a host check and can
> > see that the more hosts I move the? less hosts are pending. That's good
> so
> far...
> >
> > But, when I run the pre-flight check (/usr/local/nagios/bin/nagios -v
> > nagios.cfg), I now get warnings to say some hosts don't have any
> > service checks associated with it!
> >
> > This is true as some nodes will only have ssh checks against them
> > whilst others will have other checks against them. I'd rather it
> > didn't warn me as I have hundreds
> >
> > of hosts appear in the pre-flight check warnings and it now looks
> > incredibly untidy to see all this.
> >
> > How can I get rid of these warnings ?
> >
> > I'm probably missing something here so any help would be appreciated.
> > I'm wondering for example, if I should force the initial state to be
> > UP for hosts rather than
> >
> > moving the ssh service check to a host check. But then what would
> > happen if the service check found a node was down - would it reflect
> > that in the "host problems" ?
> >
> > regards,
> > deborah
> >
> > **********************************************************************
> > ***** This email and any files transmitted with it are confidential
> > and intended solely for the use of the individual or entity to whom
> > they are addressed.
> >
> > Any unauthorised distribution or copying is strictly prohibited.
> > Whilst Kognitio Limited takes steps to prevent the transmission of
> > viruses via e-mail, we can not guarantee that any email or attachment
> > is free from computer viruses and you are strongly advised to
> > undertake your own anti-virus precautions.
> >
> > Kognitio grants no warranties regarding performance, use or quality of
> > any e-mail or attachment and undertakes no liability for loss or
> > damage, howsoever caused.
> >
> > Kognitio Limited, a company registered in England and Wales.
> > Registered number 0212 7833. Registered Office: 3a Waterside Park,
> > Cookham Road, Bracknell, Berks, RG12 1RB. VAT number 864 4378 92.
> >
> > Kognitio Inc, a company incorporated in Delaware, principal office 180
> > North Stetson, Suite 3500, Chicago, IL 60601, USA
> > **********************************************************************
> > *****
> >
> > ----------------------------------------------------------------------
> > -------- Apps built with the Adobe(R) Flex(R) framework and Flex
> > Builder(TM) are powering Web 2.0 with engaging, cross-platform
> > capabilities. Quickly and easily build your RIAs with Flex Builder,
> > the Eclipse(TM)based development software that enables intelligent
> > coding and step-through debugging.
> > Download the free 60 day trial. http://p.sf.net/sfu/www-adobe-com
> > _______________________________________________
> > Nagios-users mailing list
> > Nagios-users at lists.sourceforge.net
> > https://lists.sourceforge.net/lists/listinfo/nagios-users
> > ::: Please include Nagios version, plugin version (-v) and OS when
> > reporting any issue.
> > ::: Messages without supporting info will risk being sent to /dev/null
> >
>
>
>
> --
> Richard Quintin, DBA
> Database & Application Administration
> Virginia Tech
>
> ***************************************************************************
> This email and any files transmitted with it are confidential and intended
> solely for the use of the individual or entity to whom they are addressed.
>
> Any unauthorised distribution or copying is strictly prohibited.
> Whilst Kognitio Limited takes steps to prevent the transmission of viruses
> via e-mail, we can not guarantee that any email or attachment is free from
> computer viruses and you are strongly advised to undertake your own
> anti-virus precautions.
>
> Kognitio grants no warranties regarding performance, use or quality of any
> e-mail or attachment and undertakes no liability for loss or damage,
> howsoever caused.
>
> Kognitio Limited, a company registered in England and Wales. Registered
> number 0212 7833. Registered Office:  3a Waterside Park, Cookham Road,
> Bracknell, Berks, RG12 1RB. VAT number 864 4378 92.
>
> Kognitio Inc, a company incorporated in Delaware, principal office 180
> North Stetson, Suite 3500, Chicago, IL 60601, USA
> ***************************************************************************
> -------------- next part --------------
> An HTML attachment was scrubbed...
>
> ------------------------------
>
> Message: 3
> Date: Mon, 16 Mar 2009 15:15:45 -0000
> From: Deborah Martin <Deborah.Martin at Kognitio.com>
> Subject: Re: [Nagios-users] Host checks instead of service checks
> To: "'nagios-users at lists.sourceforge.net'"
>        <nagios-users at lists.sourceforge.net>
> Message-ID:
>        <84836290D5AD43418C40DCF0C4A54ED38E3B3E at kogex02.pmpl.co.uk>
> Content-Type: text/plain; charset="us-ascii"
>
> Folks,
>
> I should have also mentioned that the reason for doing this is to filter
> out
> host problems separately to service problems.
> If I leave ssh checks as a service and 44 nodes are switched off, I see 44
> ssh critical alerts under Service Problems. I'd rather see 44 ssh critical
> alerts under
> Host Problems. (I thought my method below would a good way to filter these
> out). When we resize database systems, we only want to know about critical
> alerts
> for the database as a service problem (we may reduce a DB by 50 nodes but
> DB
> is still valid for monitoring)
>
> regards,
> deborah
>
> > _____________________________________________
> > From:         Deborah Martin
> > Sent: 16 March 2009 15:06
> > To:   'nagios-users at lists.sourceforge.net'
> > Subject:      Host checks instead of service checks
> > Importance:   High
> >
> > Hi Folks,
> >
> > Currently, the main nagios box (running old version of SuSE) and Nagios
> > 2.0b4 is running just with service checks, one of which is an ssh check
> > (we don't allow ping)
> >
> > So in the web interface, I see all hosts as up.
> >
> > However, I've built a new box (which hopefully will replace the above)
> > with SLES 10SP1 and Nagios 3.0.6. I've put the same config files
> > (services.cfg and hosts.cfg)
> > on the this new system. But now the hosts all show as "Pending". So I
> then
> > moved the ssh check from a service check to a host check and can see that
> > the more hosts I move the  less hosts are pending. That's good so far...
> >
> > But, when I run the pre-flight check (/usr/local/nagios/bin/nagios -v
> > nagios.cfg), I now get warnings to say some hosts don't have any service
> > checks associated with it!
> > This is true as some nodes will only have ssh checks against them whilst
> > others will have other checks against them. I'd rather it didn't warn me
> > as I have hundreds
> > of hosts appear in the pre-flight check warnings and it now looks
> > incredibly untidy to see all this.
> >
> > How can I get rid of these warnings ?
> >
> > I'm probably missing something here so any help would be appreciated. I'm
> > wondering for example, if I should force the initial state to be UP for
> > hosts rather than
> > moving the ssh service check to a host check. But then what would happen
> > if the service check found a node was down - would it reflect that in the
> > "host problems" ?
> >
> > regards,
> > deborah
> >
> >
>
>
> ***************************************************************************
> This email and any files transmitted with it are confidential and intended
> solely for the use of the individual or entity to whom they are addressed.
>
> Any unauthorised distribution or copying is strictly prohibited.
> Whilst Kognitio Limited takes steps to prevent the transmission of viruses
> via e-mail, we can not guarantee that any email or attachment is free from
> computer viruses and you are strongly advised to undertake your own
> anti-virus precautions.
>
> Kognitio grants no warranties regarding performance, use or quality of any
> e-mail or attachment and undertakes no liability for loss or damage,
> howsoever caused.
>
> Kognitio Limited, a company registered in England and Wales. Registered
> number 0212 7833. Registered Office:  3a Waterside Park, Cookham Road,
> Bracknell, Berks, RG12 1RB. VAT number 864 4378 92.
>
> Kognitio Inc, a company incorporated in Delaware, principal office 180
> North Stetson, Suite 3500, Chicago, IL 60601, USA
> ***************************************************************************
> -------------- next part --------------
> An HTML attachment was scrubbed...
>
> ------------------------------
>
> Message: 4
> Date: Mon, 16 Mar 2009 11:02:51 -0500
> From: Carlos Herrera Polo <carlos.herrerapolo at gmail.com>
> Subject: [Nagios-users] send_mail.pl not work Nagios 2.9
> To: nagios-users at lists.sourceforge.net
> Message-ID:
>        <e9cb31b80903160902q2ba07186k808c75f2f64907ac at mail.gmail.com>
> Content-Type: text/plain; charset="iso-8859-1"
>
> Do not get it to work "send_mail.pl" then making the settings:
>
> *commands.cfg*
> -----------------------
>
> command_line    /usr/lib/nagios/plugins/send_mail.pl -n \"HOST
> $NOTIFICATIONTYPE$\" -h \"$HOSTNAME$\"-s \"$HOSTSTATE$\" -a
> \"$HOSTADDRESS$\" -i \"$HOSTOUTPUT$\" -d \"$LONGDATETIME$\" -e
> \"$CONTACTEMAIL$\"
>
>
> *send_mai.pl*
> -------------------
> my $mailhost    =    '192.168.1.1';
> my $maildomain    =    'mydomain.com';
> my $mailfrom    =    'chp at mydomain.com';
> my $mailto    =    'chp at mydomain.com';
> my $timeout    =    30;
> my $mailsubject    =    '';                            #    Leave blank
> my $mailbody    =    '';                            #    Leave blank
> my $logfile        =    '/tmp/mail.log';            #    Put somewhere
> better
> my $debug        =    1;                            #    To enable SMTP
> session debugging to logfile
>
>
> ---------------------
> In the log file (mail.log) I see no effort when a warning event, but when I
> use the default configuration of Nagios works all right, only send_mail.pl
> not working.....
> -------------- next part --------------
> An HTML attachment was scrubbed...
>
> ------------------------------
>
> Message: 5
> Date: Mon, 16 Mar 2009 17:20:12 +0100
> From: Klaus Umbach <treibholz at sozial-inkompetent.de>
> Subject: [Nagios-users] Monitoring IBM server-hardware (x3650) running
>        Windows
> To: nagios-users at lists.sourceforge.net
> Message-ID: <20090316162012.GB4715 at umbach-racing.de>
> Content-Type: text/plain; charset=us-ascii
>
> Hi,
> How can I monitor the hardware in IBM servers running Windows, especially
> the physical harddrives?
>
> I can't see anything via SNMP that makes sense and the IBM Director Agents
> sends no traps if a drive fails.
>
> I played around with check_serveraid, changed the ssh-stuff to winexe,
> but I don't like that solution because it needs an administrative account
> to execute ipssend.exe AND it doesn't work an all servers (sometimes it
> says "Found 0 IBM ServeRAID controller(s).", which is definitly a lie!)
> Before that inconsistency I even thought about compiling it with pp and
> run it via NRPE.
>
> My current workaround is checking the application-eventlog and filter for
> the source substr:ServeRAID.
>
> Has anybody found a better solution?
>
> Cheers
>        Klaus
>
> --
> BOFH excuse #92:
>
> Stale file handle (next time use Tupperware(tm)!)
>
>
>
> ------------------------------
>
> Message: 6
> Date: Mon, 16 Mar 2009 10:07:42 -0700
> From: Kevin Keane <subscription at kkeane.com>
> Subject: Re: [Nagios-users] Monitoring IBM server-hardware (x3650)
>        running Windows
> Cc: nagios-users at lists.sourceforge.net
> Message-ID: <49BE875E.4090307 at kkeane.com>
> Content-Type: text/plain;       charset="US-ASCII";     format="flowed"
>
> https://sourceforge.net/projects/tntnagiosplugin/Windows SNMP is really
> not all that useful in my experience. I have a feeling that Microsoft
> would like it to go away in favor of their proprietary MOM. What you may
> be able to use to get to hard disk failure information is WMI. In fact,
> I wrote a plugin that checks the SMART information (not RAID, though) on
> either one host, or all workstations in a domain (it will need
> appropriate permissions, which translates to either administrator access
> or the Local System account). You can find that plugin in my
> tnt_nagios_plugins project on SourceForge. I'll be happy to create a
> plugin to check the IBM raid controller as long as you can provide me
> with the needed documentation and either do the testing for me or give
> me access to a development server for testing.
>
> Klaus Umbach wrote:
> > Hi,
> > How can I monitor the hardware in IBM servers running Windows, especially
> > the physical harddrives?
> >
> > I can't see anything via SNMP that makes sense and the IBM Director
> Agents
> > sends no traps if a drive fails.
> >
> > I played around with check_serveraid, changed the ssh-stuff to winexe,
> > but I don't like that solution because it needs an administrative account
> > to execute ipssend.exe AND it doesn't work an all servers (sometimes it
> > says "Found 0 IBM ServeRAID controller(s).", which is definitly a lie!)
> > Before that inconsistency I even thought about compiling it with pp and
> > run it via NRPE.
> >
> > My current workaround is checking the application-eventlog and filter for
> > the source substr:ServeRAID.
> >
> > Has anybody found a better solution?
> >
> > Cheers
> >       Klaus
> >
> >
>
>
> --
> Kevin Keane
> Owner
> The NetTech
> Find the Uncommon: Expert Solutions for a Network You Never Have to Think
> About
>
> Office: 866-642-7116
> http://www.4nettech.com
>
> This e-mail and attachments, if any, may contain confidential and/or
> proprietary information. Please be advised that the unauthorized use or
> disclosure of the information is strictly prohibited. The information herein
> is intended only for use by the intended recipient(s) named above. If you
> have received this transmission in error, please notify the sender
> immediately and permanently delete the e-mail and any copies, printouts or
> attachments thereof.
>
>
>
>
> ------------------------------
>
> Message: 7
> Date: Mon, 16 Mar 2009 17:28:44 +0000
> From: Jim Avery <jim at jimavery.me.uk>
> Subject: Re: [Nagios-users] Host checks instead of service checks
> To: Deborah Martin <Deborah.Martin at kognitio.com>
> Cc: nagios-users at lists.sourceforge.net
> Message-ID:
>        <765d77c80903161028v31f1d8f3v6f6571b4f967c647 at mail.gmail.com>
> Content-Type: text/plain; charset=ISO-8859-1
>
> 2009/3/16 Deborah Martin <Deborah.Martin at kognitio.com>:
> > Hi Folks,
> >
> > Currently, the main nagios box (running old version of SuSE) and Nagios
> > 2.0b4 is running just with service checks, one of which is an ssh check
> (we
> > don't allow ping)
> >
> > So in the web interface, I see all hosts as up.
> >
> > However, I've built a new box (which hopefully will replace the above)
> with
> > SLES 10SP1 and Nagios 3.0.6. I've put the same config files (services.cfg
> > and hosts.cfg)
> >
> > on the this new system. But now the hosts all show as "Pending". So I
> then
> > moved the ssh check from a service check to a host check and can see that
> > the more hosts I move the? less hosts are pending. That's good so far...
> >
> > But, when I run the pre-flight check (/usr/local/nagios/bin/nagios -v
> > nagios.cfg), I now get warnings to say some hosts don't have any service
> > checks associated with it!
> >
> > This is true as some nodes will only have ssh checks against them whilst
> > others will have other checks against them. I'd rather it didn't warn me
> as
> > I have hundreds
> >
> > of hosts appear in the pre-flight check warnings and it now looks
> incredibly
> > untidy to see all this.
> >
> > How can I get rid of these warnings ?
> >
> > I'm probably missing something here so any help would be appreciated. I'm
> > wondering for example, if I should force the initial state to be UP for
> > hosts rather than moving the ssh service check to a host check. But then
> what would happen if
> > the service check found a node was down - would it reflect that in the
> "host
> > problems" ?
>
> You're right, it wouldn't.  For the host check, you need some method
> of checking if the host is up.  If ssh is the only method at your
> disposal to check if a host is up, and there are no other services you
> can monitor as service checks then my humble opinion is you're best
> off specifying ssh for both your host and service checks on those
> hosts.  I am allowed to use ping, so I do.  For those nodes on which I
> monitor no services, I also use ping as the service check - to me
> that's functionally equivalent to using ssh for both, it's just a
> different service.
>
> If you're monitoring other services and you're only using the ssh
> checks to see if the host is up or not, then I'd recommend just using
> the ssh check as a host check, leaving the others as service checks.
>
> I wonder, though, if it might be possible for you to check the host
> using a passive check?  For example you could have cron send a check
> using NSCA from your server every minute and use freshness checking to
> see if the host is down or not.  I can't say I've ever tried it but
> guess it's another option worth thinking about.
>
>
>
> Cheers,
>
> Jim
>
>
>
> ------------------------------
>
> Message: 8
> Date: Mon, 16 Mar 2009 17:34:57 -0000
> From: Deborah Martin <Deborah.Martin at Kognitio.com>
> Subject: Re: [Nagios-users] Host checks instead of service checks
> To: nagios-users at lists.sourceforge.net
> Message-ID:
>        <84836290D5AD43418C40DCF0C4A54ED38E3B44 at kogex02.pmpl.co.uk>
> Content-Type: text/plain; charset="us-ascii"
>
> Folks,
>
> Whilst using check_dummy as a service check does resolve the pre-flight
> warnings, it's not the solution I think I'm looking for.
>
> The total number of service checks now includes the dummy checks which i've
> chosen to always exit as UP so that it doesn't show up as a service problem
> - which to me
> doesn't sound like the right way to do this. I want all service checks to
> be
> useful service checks rather than have checks which would need to be
> ignored
> as they don't return
> anything useful as with check_dummy - I wouldn't ever use that check in any
> Nagios SLA reporting.
>
> What I really want to do is the following :
>
> Hosts -->> ssh check -->> When "CRITICAL" alert under "Host problems" only.
>
> The docs show a distinct difference for behaviour between Nagios 2.0b4 and
> Nagios 3.0.6.
>
> >From Nagios 2.0b4, under Host Definitions:-
> check_command:   This directive is used to specify the short name of the
> <http://193.35.206.171/nagios/docs/xodtemplate.html#command> command that
> should be used to check if the host is up or down. Typically, this command
> would try and ping the host to see if it is "alive". The command must
> return
> a status of OK (0) or Nagios will assume the host is down. If you leave
> this
> argument blank, the host will not be checked - Nagios will always assume
> the
> host is up. This is useful if you are monitoring printers or other devices
> that are frequently turned off. The maximum amount of time that the
> notification command can run is controlled by the
> <http://193.35.206.171/nagios/docs/configmain.html#host_check_timeout>
> host_check_timeout option.
>
>
> >From Nagios 3.0.6
> check_command:   This directive is used to specify the short name of the
> <http://193.35.206.117/nagios/docs/objectdefinitions.html#command> command
> that should be used to check if the host is up or down. Typically, this
> command would try and ping the host to see if it is "alive". The command
> must return a status of OK (0) or Nagios will assume the host is down. If
> you leave this argument blank, the host will not be actively checked. Thus,
> Nagios will likely always assume the host is up (it may show up as being in
> a "PENDING" state in the web interface). This is useful if you are
> monitoring printers or other devices that are frequently turned off. The
> maximum amount of time that the notification command can run is controlled
> by the
> <http://193.35.206.117/nagios/docs/configmain.html#host_check_timeout>
> host_check_timeout option.
>
> The difference is highlighted in RED. So on changing the "ssh check"  from
> a
> service to a host definition (to prevent PENDING on the hosts), the
> pre-flight warnings now complain there are no services associated with the
> hosts. On big systems here (100 nodes+) this makes the pre-flight output
> really unreadable and not really accurate as surely we should be able to
> choose whether something is a host check or service check but not have to
> define BOTH!
>
> Ultimately, I just want to filter SSH check problems to appear under "Host
> problems" rather than under "Service problems" so users can easily spot
> other service problem issues and not have to trawl through 100's of lines
> of
> output in the web interface. Believe me, Nagios web interface users can be
> a
> fickle bunch!
>
> A switch to tell Nagios to not complain about this would be really useful
> unless anyone thinks of a good reason why this would be a bad idea.
>
> any help / pointers would be appreciated, even if it's to tell me politely
> how stupid i'm being - I can take it!
>
> regards,
> deborah
>
>  _____
>
> From: Deborah Martin [mailto:Deborah.Martin at Kognitio.com]
> Sent: 16 March 2009 15:34
> To: 'Richard Quintin'
> Cc: nagios-users at lists.sourceforge.net
> Subject: Re: [Nagios-users] Host checks instead of service checks
>
>
>
> Thanks Richard. I'll give that a try.
>
> regards,
> deborah
>
>
> -----Original Message-----
> From: Richard Quintin [mailto:rich+nagios at quintinz.com<rich%2Bnagios at quintinz.com>
> <mailto:rich+nagios at quintinz.com <rich%2Bnagios at quintinz.com>> ]
> Sent: 16 March 2009 15:26
> To: Deborah Martin
> Cc: nagios-users at lists.sourceforge.net
> Subject: Re: [Nagios-users] Host checks instead of service checks
>
> You could use check_dummy for the host check and just have it always return
> OK.
> Or perhaps the opposite you could add a check_dummy service check for all
> hosts.
>
> Which you choose will depend upon your personal preference and how you want
> dependencies to be handled.
>
> On Mon, Mar 16, 2009 at 11:06 AM, Deborah Martin
> <Deborah.Martin at kognitio.com> wrote:
> > Hi Folks,
> >
> > Currently, the main nagios box (running old version of SuSE) and
> > Nagios
> > 2.0b4 is running just with service checks, one of which is an ssh
> > check (we don't allow ping)
> >
> > So in the web interface, I see all hosts as up.
> >
> > However, I've built a new box (which hopefully will replace the above)
> > with SLES 10SP1 and Nagios 3.0.6. I've put the same config files
> > (services.cfg and hosts.cfg)
> >
> > on the this new system. But now the hosts all show as "Pending". So I
> > then moved the ssh check from a service check to a host check and can
> > see that the more hosts I move the  less hosts are pending. That's good
> so
> far...
> >
> > But, when I run the pre-flight check (/usr/local/nagios/bin/nagios -v
> > nagios.cfg), I now get warnings to say some hosts don't have any
> > service checks associated with it!
> >
> > This is true as some nodes will only have ssh checks against them
> > whilst others will have other checks against them. I'd rather it
> > didn't warn me as I have hundreds
> >
> > of hosts appear in the pre-flight check warnings and it now looks
> > incredibly untidy to see all this.
> >
> > How can I get rid of these warnings ?
> >
> > I'm probably missing something here so any help would be appreciated.
> > I'm wondering for example, if I should force the initial state to be
> > UP for hosts rather than
> >
> > moving the ssh service check to a host check. But then what would
> > happen if the service check found a node was down - would it reflect
> > that in the "host problems" ?
> >
> > regards,
> > deborah
> >
> > **********************************************************************
> > ***** This email and any files transmitted with it are confidential
> > and intended solely for the use of the individual or entity to whom
> > they are addressed.
> >
> > Any unauthorised distribution or copying is strictly prohibited.
> > Whilst Kognitio Limited takes steps to prevent the transmission of
> > viruses via e-mail, we can not guarantee that any email or attachment
> > is free from computer viruses and you are strongly advised to
> > undertake your own anti-virus precautions.
> >
> > Kognitio grants no warranties regarding performance, use or quality of
> > any e-mail or attachment and undertakes no liability for loss or
> > damage, howsoever caused.
> >
> > Kognitio Limited, a company registered in England and Wales.
> > Registered number 0212 7833. Registered Office: 3a Waterside Park,
> > Cookham Road, Bracknell, Berks, RG12 1RB. VAT number 864 4378 92.
> >
> > Kognitio Inc, a company incorporated in Delaware, principal office 180
> > North Stetson, Suite 3500, Chicago, IL 60601, USA
> > **********************************************************************
> > *****
> >
> > ----------------------------------------------------------------------
> > -------- Apps built with the Adobe(R) Flex(R) framework and Flex
> > Builder(TM) are powering Web 2.0 with engaging, cross-platform
> > capabilities. Quickly and easily build your RIAs with Flex Builder,
> > the Eclipse(TM)based development software that enables intelligent
> > coding and step-through debugging.
> > Download the free 60 day trial. http://p.sf.net/sfu/www-adobe-com
> <http://p.sf.net/sfu/www-adobe-com>
> > _______________________________________________
> > Nagios-users mailing list
> > Nagios-users at lists.sourceforge.net
> > https://lists.sourceforge.net/lists/listinfo/nagios-users
> <https://lists.sourceforge.net/lists/listinfo/nagios-users>
> > ::: Please include Nagios version, plugin version (-v) and OS when
> > reporting any issue.
> > ::: Messages without supporting info will risk being sent to /dev/null
> >
>
>
>
> --
> Richard Quintin, DBA
> Database & Application Administration
> Virginia Tech
>
>
> ***************************************************************************
> This email and any files transmitted with it are confidential and intended
> solely for the use of the individual or entity to whom they are addressed.
>
> Any unauthorised distribution or copying is strictly prohibited.
> Whilst Kognitio Limited takes steps to prevent the transmission of viruses
> via e-mail, we can not guarantee that any email or attachment is free from
> computer viruses and you are strongly advised to undertake your own
> anti-virus precautions.
>
> Kognitio grants no warranties regarding performance, use or quality of any
> e-mail or attachment and undertakes no liability for loss or damage,
> howsoever caused.
>
> Kognitio Limited, a company registered in England and Wales. Registered
> number 0212 7833. Registered Office: 3a Waterside Park, Cookham Road,
> Bracknell, Berks, RG12 1RB. VAT number 864 4378 92.
>
> Kognitio Inc, a company incorporated in Delaware, principal office 180
> North
> Stetson, Suite 3500, Chicago, IL 60601, USA
> ***************************************************************************
>
>
>
> ***************************************************************************
> This email and any files transmitted with it are confidential and intended
> solely for the use of the individual or entity to whom they are addressed.
>
> Any unauthorised distribution or copying is strictly prohibited.
> Whilst Kognitio Limited takes steps to prevent the transmission of viruses
> via e-mail, we can not guarantee that any email or attachment is free from
> computer viruses and you are strongly advised to undertake your own
> anti-virus precautions.
>
> Kognitio grants no warranties regarding performance, use or quality of any
> e-mail or attachment and undertakes no liability for loss or damage,
> howsoever caused.
>
> Kognitio Limited, a company registered in England and Wales. Registered
> number 0212 7833. Registered Office:  3a Waterside Park, Cookham Road,
> Bracknell, Berks, RG12 1RB. VAT number 864 4378 92.
>
> Kognitio Inc, a company incorporated in Delaware, principal office 180
> North Stetson, Suite 3500, Chicago, IL 60601, USA
> ***************************************************************************
> -------------- next part --------------
> An HTML attachment was scrubbed...
>
> ------------------------------
>
> Message: 9
> Date: Mon, 16 Mar 2009 14:24:47 -0400
> From: Anirudh Srinivasan <srianirudh at gmail.com>
> Subject: [Nagios-users] problem starting nagios
> To: nagios-users at lists.sourceforge.net
> Message-ID:
>        <699b436a0903161124ydd362eds4be8a3fa931cfded at mail.gmail.com>
> Content-Type: text/plain; charset="iso-8859-1"
>
> Folks,
>
> I did a fresh nagios 3.0.6 installation on a linux server.
>
> Things look okay - No serious problems were detected during the pre-flight
> check
> [root at DELL8B81Y11 sbin]# service nagios restart
> Running configuration check...done.
> Stopping nagios: /etc/init.d/nagios: line 67: kill: (24483) - No such
> process
> done.
> Starting nagios: done.
> [root at DELL8B81Y11 sbin]# service nagios status
> nagios is not running
> [root at DELL8B81Y11 sbin]#
>
> Why am i getting like this. I have a nagios.conf file in
> /etc/httpd/conf/nagios.conf and the content is
>
> ScriptAlias /nagios/cgi-bin/ /usr/local/nagios/sbin/
> <Directory ?/usr/local/nagios/sbin/?>
> AllowOverride AuthConfig
> Options ExecCGI
> Allow from all
> Order allow,deny
> </Directory>
>
> Alias /nagios/ /usr/local/nagios/share/
> <Directory ?/usr/local/nagios/share?>
> Options None
> AllowOverride AuthConfig
> Order allow,deny
> Allow from all
> </Directory>
>
> At the webinterface when i type http://localhost/nagios/ i get to see the
> webinterface but when i click host detail i see this :
>
> <!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML 2.0//EN">
> <html><head>
> <title>500 Internal Server Error</title>
> </head><body>
> <h1>Internal Server Error</h1>
> <p>The server encountered an internal error or
> misconfiguration and was unable to complete
> your request.</p>
> <p>Please contact the server administrator,
>  root at localhost and inform them of the time the error occurred,
> and anything you might have done that may have
> caused the error.</p>
> <p>More information about this error may be available
> in the server error log.</p>
> <hr>
> <address>Apache/2.2.3 (Red Hat) Server at 10.21.14.212 Port 80</address>
> </body></html>
>
>
> Any help would be appreciated.
>
> Thanks--
>
> Anirudh Srinivasan
> -------------- next part --------------
> An HTML attachment was scrubbed...
>
> ------------------------------
>
> Message: 10
> Date: Mon, 16 Mar 2009 14:05:46 -0500
> From: Marc Powell <marc at ena.com>
> Subject: Re: [Nagios-users] problem starting nagios
> To: nagios-users ML <nagios-users at lists.sourceforge.net>
> Message-ID: <40705790-FC5D-4CEF-B03D-1244B8985E92 at ena.com>
> Content-Type: text/plain; charset=US-ASCII; format=flowed; delsp=yes
>
>
> On Mar 16, 2009, at 1:24 PM, Anirudh Srinivasan wrote:
>
> > Folks,
> >
> > I did a fresh nagios 3.0.6 installation on a linux server.
> >
> > Things look okay - No serious problems were detected during the pre-
> > flight check
> > [root at DELL8B81Y11 sbin]# service nagios restart
> > Running configuration check...done.
> > Stopping nagios: /etc/init.d/nagios: line 67: kill: (24483) - No
> > such process
> > done.
> > Starting nagios: done.
> > [root at DELL8B81Y11 sbin]# service nagios status
> > nagios is not running
> > [root at DELL8B81Y11 sbin]#
> >
> > Why am i getting like this.
>
> What does nagios.log have to say?
> Do you see the process running in the ps list?
> Can you run nagios in the foreground as root (/usr/local/nagios/bin/
> nagios /usr/local/nagios/etc/nagios.cfg)?
> As the nagios user?
> In daemon mode as the nagios user (/usr/local/nagios/bin/nagios -d /
> usr/local/nagios/etc/nagios.cfg)?
> Is there anything special about the mount point that nagios lives under?
>
> > At the webinterface when i type http://localhost/nagios/ i get to
> > see the webinterface but when i click host detail i see this :
> > <!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML 2.0//EN">
> >
> > <html><head>
> > <title>500 Internal Server Error</title>
> What do your web server error logs show?
>
> --
> Marc
>
>
>
>
> ------------------------------
>
>
> ------------------------------------------------------------------------------
> Apps built with the Adobe(R) Flex(R) framework and Flex Builder(TM) are
> powering Web 2.0 with engaging, cross-platform capabilities. Quickly and
> easily build your RIAs with Flex Builder, the Eclipse(TM)based development
> software that enables intelligent coding and step-through debugging.
> Download the free 60 day trial. http://p.sf.net/sfu/www-adobe-com
>
> ------------------------------
>
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
>
>
> End of Nagios-users Digest, Vol 34, Issue 26
> ********************************************
>



-- 
Anirudh Srinivasan
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://www.monitoring-lists.org/archive/users/attachments/20090318/3e13b724/attachment.html>
-------------- next part --------------
------------------------------------------------------------------------------
Apps built with the Adobe(R) Flex(R) framework and Flex Builder(TM) are
powering Web 2.0 with engaging, cross-platform capabilities. Quickly and
easily build your RIAs with Flex Builder, the Eclipse(TM)based development
software that enables intelligent coding and step-through debugging.
Download the free 60 day trial. http://p.sf.net/sfu/www-adobe-com
-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null


More information about the Users mailing list