From alexd at cal.net Sat May 1 00:06:28 2010 From: alexd at cal.net (Alex) Date: Fri, 30 Apr 2010 15:06:28 -0700 Subject: check_radius issue after 3.2.1 update References: <010401cae889$d86260d0$89272270$@net> <4BDB457D.9010402@flatto.net> Message-ID: <013001cae8b1$62422d40$26c687c0$@net> I was able to fix it by changing ownership of the files in /etc/radiusclient/ from 644 to 666 [alexd at owl radiusclient]$ sudo chmod 666 * [alexd at owl radiusclient]$ ll -rw-rw-rw- 1 root root 6385 Mar 5 2007 dictionary -rw-rw-rw- 1 root root 12388 Mar 5 2007 dictionary.ascend -rw-rw-rw- 1 root root 1517 Mar 5 2007 dictionary.compat -rw-rw-rw- 1 root root 599 Mar 5 2007 dictionary.merit -rw-rw-rw- 1 root root 135 Mar 5 2007 issue -rw-rw-rw- 1 root root 410 Mar 5 2007 port-id-map -rw-rw-rw- 1 root root 2522 Oct 23 2009 radiusclient.conf -rw-rw-rw- 1 root root 440 Apr 22 15:46 servers [alexd at owl radiusclient]$ /usr/lib/nagios/plugins/check_radius -H zeus2 -F /etc/radiusclient/radiusclient.conf -u nagios -p nagioscheck Auth OK Thank you for pointing me in the right direction. Alex -----Original Message----- From: Alex [mailto:alexd at cal.net] Sent: Friday, April 30, 2010 2:57 PM To: 'Nagios Users List' Subject: RE: [Nagios-users] check_radius issue after 3.2.1 update >> Try executing the script as the nagios use not as root and see what the output is . >> root and nagios have differing env setting in the shell , and there >> might be the problem in your response. >> Assaf [root at owl plugins]# su - nagios -sh-3.2$ /usr/lib/nagios/plugins/check_radius -H zeus2 -F /etc/radiusclient/radiusclient.conf -u nagios -p nagioscheck Auth Error You're right, that seems to be the issue. Could you by chance recommend what I need to do in order to get this fixed? Thank you, Alex ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at smetj.net Sun May 2 15:19:35 2010 From: nagios at smetj.net (Jelle Smet) Date: Sun, 2 May 2010 13:19:35 +0000 (UTC) Subject: RSyslog monitoring? References: <724C3B2F1C5EB44D9108E471EC5E9933B267CDF094@akechi-denki.ad.nctechcenter.com> Message-ID: Hi Kevin, > I would like to find a way to monitor syslog entries using > rsyslog 4.4 rules before they are ever written to a log file. The log parsing plugins > I found on MonitoringExchange aren?t useful for my purpose. Have a look at http://www.smetj.net/wiki/Molog it might be what you're looking for. ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From service at shdawson.com Sun May 2 20:54:05 2010 From: service at shdawson.com (Stephen H. Dawson) Date: Sun, 2 May 2010 14:54:05 -0400 Subject: Monitoring SMTP Authentication Message-ID: <3D37FBE56B25415E866AAF5BD7E679D2@Because> Hi, We have a Nagios server setup to send mail through our ISP's email server. The ISP uses SMTP authentication. The ISP has been pretty flakey lately in keeping their email server up and running, to send emails for their customers, in this geographic region. We would like to setup a Nagios check to watch the SMTP authentication. Sometimes they change their ports...among other problems. If we can get a check running, we can see the alert in the Nagios GUI that there is a problem, though we (of course) would not receive the email that there is a problem. Not sure where to start with this check. Looking for best practices and, hopefully, a code example. Thank You, SHD -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Sun May 2 21:04:19 2010 From: nagios at flatto.net (Assaf Flatto) Date: Sun, 02 May 2010 20:04:19 +0100 Subject: Monitoring SMTP Authentication In-Reply-To: <3D37FBE56B25415E866AAF5BD7E679D2@Because> References: <3D37FBE56B25415E866AAF5BD7E679D2@Because> Message-ID: <4BDDCCB3.3030004@flatto.net> Stephen H. Dawson wrote: > Hi, > > > We have a Nagios server setup to send mail through our ISP's email > server. The ISP uses SMTP authentication. The ISP has been pretty > flakey lately in keeping their email server up and running, to send > emails for their customers, in this geographic region. > > We would like to setup a Nagios check to watch the > SMTP authentication. Sometimes they change their ports...among other > problems. If we can get a check running, we can see the alert in the > Nagios GUI that there is a problem, though we (of course) would not > receive the email that there is a problem. > > Not sure where to start with this check. Looking for best practices > and, hopefully, a code example. > > > Thank You, > SHD Try starting here: http://nagiosplugins.org/man/check_smtp This my give you all the details and ways to do what you need -- Never , Ever Cut a Deal With a Dragon ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From service at shdawson.com Sun May 2 21:09:38 2010 From: service at shdawson.com (Stephen H. Dawson) Date: Sun, 2 May 2010 15:09:38 -0400 Subject: Monitoring SMTP Authentication In-Reply-To: <4BDDCCB3.3030004@flatto.net> References: <3D37FBE56B25415E866AAF5BD7E679D2@Because> <4BDDCCB3.3030004@flatto.net> Message-ID: <59B71B3B5A77465387B77090F11B190F@Because> Thank you very much. Kindly, SHD _____ From: Assaf Flatto [mailto:nagios at flatto.net] Sent: Sunday, May 02, 2010 15:04 E/T To: nagios-users at lists.sourceforge.net Subject: Re: [Nagios-users] Monitoring SMTP Authentication Stephen H. Dawson wrote: > Hi, > > > We have a Nagios server setup to send mail through our ISP's email > server. The ISP uses SMTP authentication. The ISP has been pretty > flakey lately in keeping their email server up and running, to send > emails for their customers, in this geographic region. > > We would like to setup a Nagios check to watch the > SMTP authentication. Sometimes they change their ports...among other > problems. If we can get a check running, we can see the alert in the > Nagios GUI that there is a problem, though we (of course) would not > receive the email that there is a problem. > > Not sure where to start with this check. Looking for best practices > and, hopefully, a code example. > > > Thank You, > SHD Try starting here: http://nagiosplugins.org/man/check_smtp This my give you all the details and ways to do what you need -- Never , Ever Cut a Deal With a Dragon ---------------------------------------------------------------------------- -- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null _____ avast! Antivirus: Inbound message clean. Virus Database (VPS): 100502-0, 05/02/2010 Tested on: 05/2/2010 15:08 E/T avast! - copyright (c) 1988-2010 ALWIL Software. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at smetj.net Mon May 3 11:28:24 2010 From: nagios at smetj.net (Jelle Smet) Date: Mon, 03 May 2010 11:28:24 +0200 Subject: Rsyslog monitoring Message-ID: <7a520a4a6d2938f92e65e1e2686a2b8a@mail.smetj.net> Hi list, I just released MoLog which enables you to monitor a rsyslog database for syslog entries which should end up as warnings or criticals in a Nagios/Core based setup. Have a look at http://www.smetj.net/wiki/Molog It has following features: * A stand alone daemon with builtin web server. * Works together with a centralized rsyslog/mysql setup using the Monitorware db schema. * Submits results directly to the command pipe or through NSCAweb * Only submits results passively when matches occur. * Assign a syslog priority to the monitoring warning or critical alert category. * Exclude syslog entries from the monitoring warning or critical status using regular expressions. * Apply regular expressions on a global or per host basis. * Remove open alerts from the web interface and update monitoring automatically through passive check results. * No DB records are altered when removing alerts. * No additional database required for Molog itself. * Builtin tweakable queue which buffers passive check results when they can't be delivered. * MoLog is written in Python and has its own built in web server (Cherrypy) and queue manager. It might be helpful to one of you. * my apologies if this is a double post but the previous message didn't came through* Jelle Smet http://www.smetj.net ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jjscampos at gmail.com Mon May 3 18:26:42 2010 From: jjscampos at gmail.com (=?iso-8859-1?Q?Jos=E9_Campos?=) Date: Mon, 3 May 2010 17:26:42 +0100 Subject: how to restart httpd service on remote server Message-ID: <014501caeadd$6cc9d8c0$465d8a40$@com> Hello everyone. I have a Web server witch is been monitorized with nagios and nrpe, but I don?t know how recover httpd service or others if they faild. I only found some how to recover on some machine. Atentamente, Jos? Campos -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rwolfe at i-evolve.com Mon May 3 18:38:55 2010 From: rwolfe at i-evolve.com (Robert Wolfe) Date: Mon, 3 May 2010 12:38:55 -0400 Subject: how to restart httpd service on remote server In-Reply-To: <014501caeadd$6cc9d8c0$465d8a40$@com> References: <014501caeadd$6cc9d8c0$465d8a40$@com> Message-ID: <33C7BE4CE80D504E8907FDC31F8A08304E7CB0C0FB@iexchange.I-EVOLVE.com> If the HTTPD service stops running completely, you need to log into the server that runs your httpd server and restart the httpd process yourself. The same thing goes for any other service on remote machines. From: Jos? Campos [mailto:jjscampos at gmail.com] Sent: Monday, May 03, 2010 12:27 PM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] how to restart httpd service on remote server Hello everyone. I have a Web server witch is been monitorized with nagios and nrpe, but I don't know how recover httpd service or others if they faild. I only found some how to recover on some machine. Atentamente, Jos? Campos -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From zarrelli at linux.it Mon May 3 18:53:36 2010 From: zarrelli at linux.it (Giorgio Zarrelli) Date: Mon, 3 May 2010 18:53:36 +0200 Subject: how to restart httpd service on remote server In-Reply-To: <33C7BE4CE80D504E8907FDC31F8A08304E7CB0C0FB@iexchange.I-EVOLVE.com> References: <014501caeadd$6cc9d8c0$465d8a40$@com> <33C7BE4CE80D504E8907FDC31F8A08304E7CB0C0FB@iexchange.I-EVOLVE.com> Message-ID: Non completely correctl. Event handlers can do the job for you. Ciao, Giorgio Il giorno 03/mag/2010, alle ore 18.38, Robert Wolfe ha scritto: > If the HTTPD service stops running completely, you need to log into > the server that runs your httpd server and restart the httpd process > yourself. The same thing goes for any other service on remote > machines. > > > > From: Jos? Campos [mailto:jjscampos at gmail.com] > Sent: Monday, May 03, 2010 12:27 PM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] how to restart httpd service on remote server > > > > > > Hello everyone. > > > > I have a Web server witch is been monitorized with > nagios and nrpe, but I don?t know how recover httpd service or other > s if they faild. > > I only found some how to recover on some machine. > > > > Atentamente, > > > > Jos? Campos > > > > --- > --- > --- > --------------------------------------------------------------------- > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From schneemann at b1-systems.de Mon May 3 19:07:15 2010 From: schneemann at b1-systems.de (Christian Schneemann) Date: Mon, 3 May 2010 19:07:15 +0200 Subject: how to restart httpd service on remote server In-Reply-To: <014501caeadd$6cc9d8c0$465d8a40$@com> References: <014501caeadd$6cc9d8c0$465d8a40$@com> Message-ID: <201005031907.15593.schneemann@b1-systems.de> Hi, On Monday 03 May 2010 06:26:42 pm Jos? Campos wrote: > Hello everyone. > > > > I have a Web server witch is been monitorized with nagios > and nrpe, but I don?t know how recover httpd service or others if they > faild. > > I only found some how to recover on some machine. you can use ssh to execute the service restart or even nrpe if you configure a command. But have a look that the user thats running nrpe is allowed to do the restart.. Regards, Christian > > > > Atentamente, > > > > Jos? Campos > -- Christian Schneemann Geschaeftsfuehrer: Ralph Dehner IT Consultant & Trainer Unternehmenssitz: Vohburg B1 Systems GmbH Amtsgericht: Ingolstadt Mobil: +49-(0)-1757250665 Handelsregister: HRB 3537 EMail: schneemann at b1-systems.de http://www.b1-systems.de Adresse: B1 Systems GmbH, Osterfeldstra?e 7, 85088 Vohburg GPG: http://pgpkeys.pca.dfn.de/pks/lookup?op=get&search=0x2FA8643A41BDAB81 ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From perldork at webwizarddesign.com Mon May 3 19:06:30 2010 From: perldork at webwizarddesign.com (Max) Date: Mon, 3 May 2010 13:06:30 -0400 Subject: how to restart httpd service on remote server In-Reply-To: References: <014501caeadd$6cc9d8c0$465d8a40$@com> <33C7BE4CE80D504E8907FDC31F8A08304E7CB0C0FB@iexchange.I-EVOLVE.com> Message-ID: As can puppet, Chef, Monit, Net-SNMP, and many others :) .. always a balance with deciding where process restarts should happen - the larger the network, the more benefit there is to having autonomous agents handle restarts and have Nagios focus purely on polling / asynchronous event handling. ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mitsuto at gmail.com Mon May 3 20:36:31 2010 From: mitsuto at gmail.com (Marcel) Date: Mon, 3 May 2010 15:36:31 -0300 Subject: how to restart httpd service on remote server In-Reply-To: References: <014501caeadd$6cc9d8c0$465d8a40$@com> <33C7BE4CE80D504E8907FDC31F8A08304E7CB0C0FB@iexchange.I-EVOLVE.com> Message-ID: cfengine, mon, rmon, cron, watchdogs in general, daemontools, and many many many many others... On Mon, May 3, 2010 at 2:06 PM, Max wrote: > As can puppet, Chef, Monit, Net-SNMP, and many others :) .. always a > balance with deciding where process restarts should happen - the > larger the network, the more benefit there is to having autonomous > agents handle restarts and have Nagios focus purely on polling / > asynchronous event handling. > > > ------------------------------------------------------------------------------ > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From service at shdawson.com Tue May 4 11:57:37 2010 From: service at shdawson.com (Stephen H. Dawson) Date: Tue, 4 May 2010 05:57:37 -0400 Subject: Parsing Logs Message-ID: <3E7D287A4787439EB82B6097BC206CC9@Because> Hi, We have looked at the check_smtp option to meet our need, but do not believe this option will work. Looking for perhaps another option. Failure of an SMTP connection to our ISP's mail server with Nagios will only tell us that the Nagios attempt to connect has failed, at that particular point in time. We are running our Nagios box on CentOS. The CentOS attempt itself could well connect and send a message a few seconds/minutes before or after that. We also run the risk of being accused of 'excessive' number of emails with our ISP's email server. So the only reliable way of checking if ISP SMTP relay connection works is to parse the CentOS logs and look for failures to authenticate/send. Has anyone parsed a log for review with Nagios? If so, please advise. Thanks, SHD -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lscarneiro at veltrac.com.br Tue May 4 14:57:30 2010 From: lscarneiro at veltrac.com.br (Leonardo Carneiro - Veltrac) Date: Tue, 04 May 2010 09:57:30 -0300 Subject: managing a large number of server Message-ID: <4BE019BA.3070809@veltrac.com.br> Hi list, I have the following situation: I'm monitoring a bunch of services on a bunch of servers (ok, nothing new so far). All machines run the same services, but not all services is being monitored by now (since they are custom services, i'm still writing plugins for all of them). New machines are being added every day, and also, i'm writing new plugins to check the services. When a new machine is added, is relatively easy: i copy the config file from another machine and change the name of the host and other unique attributes, but when a new service is added (a.k.a. i have finish another plugin), i need to edit all the dozens of config files for each server to add this new service. There is a smarter way to do this? Sorry for my poor english. Tks in advance. -- ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Tue May 4 15:14:28 2010 From: nagios at flatto.net (Assaf Flatto) Date: Tue, 04 May 2010 14:14:28 +0100 Subject: managing a large number of server In-Reply-To: <4BE019BA.3070809@veltrac.com.br> References: <4BE019BA.3070809@veltrac.com.br> Message-ID: <4BE01DB4.8070309@flatto.net> Leonardo Carneiro - Veltrac wrote: > Hi list, > > I have the following situation: I'm monitoring a bunch of services on a > bunch of servers (ok, nothing new so far). > All machines run the same services, but not all services is being > monitored by now (since they are custom services, i'm still writing > plugins for all of them). New machines are being added every day, and > also, i'm writing new plugins to check the services. > > When a new machine is added, is relatively easy: i copy the config file > from another machine and change the name of the host and other unique > attributes, but when a new service is added (a.k.a. i have finish > another plugin), i need to edit all the dozens of config files for each > server to add this new service. > > There is a smarter way to do this? > Sorry for my poor english. Tks in advance. > Try using templates and hostgroups Templates will allow you to use a central base definition to the service or host . http://nagios.sourceforge.net/docs/3_0/objectinheritance.html you can assign a service to a hostgroup and thus all host in the group will have the service checked on them. http://nagios.sourceforge.net/docs/3_0/objectdefinitions.html#service -- Never,Ever Cut A Deal With a Dragon I am doing a Charity Bike ride On the 27 of June for the Capital to Coast Charity. Please help by Donating http://www.justgiving.com/Lovefilm-capital-to-coast ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From pangrazi at gmail.com Tue May 4 15:27:39 2010 From: pangrazi at gmail.com (Greg Pangrazio) Date: Tue, 4 May 2010 08:27:39 -0500 Subject: managing a large number of server In-Reply-To: <4BE01DB4.8070309@flatto.net> References: <4BE019BA.3070809@veltrac.com.br> <4BE01DB4.8070309@flatto.net> Message-ID: If you are looking for an "old school" method of doing this, check out sed. it is a very powerful tool to edit multiple files at once. Greg Pangrazio pangrazi at gmail.com On Tue, May 4, 2010 at 8:14 AM, Assaf Flatto wrote: > Leonardo Carneiro - Veltrac wrote: >> Hi list, >> >> I have the following situation: I'm monitoring a bunch of services on a >> bunch of servers (ok, nothing new so far). >> All machines run the same services, but not all services is being >> monitored by now (since they are custom services, i'm still writing >> plugins for all of them). New machines are being added every day, and >> also, i'm writing new plugins to check the services. >> >> When a new machine is added, is relatively easy: i copy the config file >> from another machine and change the name of the host and other unique >> attributes, but when a new service is added (a.k.a. i have finish >> another plugin), i need to edit all the dozens of config files for each >> server to add this new service. >> >> There is a smarter way to do this? >> Sorry for my poor english. Tks in advance. >> > Try using templates and hostgroups > Templates will allow you to use a central base definition to the service > or host . > http://nagios.sourceforge.net/docs/3_0/objectinheritance.html > > you can assign a service to a hostgroup and thus all host in the group > will have the service checked on them. > > http://nagios.sourceforge.net/docs/3_0/objectdefinitions.html#service > > -- > Never,Ever Cut A Deal With a Dragon > > > I am doing a Charity Bike ride On the 27 of June for the > Capital to Coast Charity. Please help by Donating > http://www.justgiving.com/Lovefilm-capital-to-coast > > > > ------------------------------------------------------------------------------ > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lscarneiro at veltrac.com.br Tue May 4 15:29:41 2010 From: lscarneiro at veltrac.com.br (Leonardo Carneiro - Veltrac) Date: Tue, 04 May 2010 10:29:41 -0300 Subject: managing a large number of server In-Reply-To: <4BE01DB4.8070309@flatto.net> References: <4BE019BA.3070809@veltrac.com.br> <4BE01DB4.8070309@flatto.net> Message-ID: <4BE02145.4060302@veltrac.com.br> Assaf Flatto wrote: > Leonardo Carneiro - Veltrac wrote: > >> Hi list, >> >> I have the following situation: I'm monitoring a bunch of services on a >> bunch of servers (ok, nothing new so far). >> All machines run the same services, but not all services is being >> monitored by now (since they are custom services, i'm still writing >> plugins for all of them). New machines are being added every day, and >> also, i'm writing new plugins to check the services. >> >> When a new machine is added, is relatively easy: i copy the config file >> from another machine and change the name of the host and other unique >> attributes, but when a new service is added (a.k.a. i have finish >> another plugin), i need to edit all the dozens of config files for each >> server to add this new service. >> >> There is a smarter way to do this? >> Sorry for my poor english. Tks in advance. >> >> > Try using templates and hostgroups > Templates will allow you to use a central base definition to the service > or host . > http://nagios.sourceforge.net/docs/3_0/objectinheritance.html > > you can assign a service to a hostgroup and thus all host in the group > will have the service checked on them. > > http://nagios.sourceforge.net/docs/3_0/objectdefinitions.html#service Tks for showing the way. I'll check it out. ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From guidosh at gmail.com Tue May 4 15:31:38 2010 From: guidosh at gmail.com (Guy Waugh) Date: Tue, 4 May 2010 14:31:38 +0100 Subject: managing a large number of server In-Reply-To: <4BE01DB4.8070309@flatto.net> References: <4BE019BA.3070809@veltrac.com.br> <4BE01DB4.8070309@flatto.net> Message-ID: On Tue, May 4, 2010 at 2:14 PM, Assaf Flatto wrote: > Leonardo Carneiro - Veltrac wrote: > > Hi list, > > > > I have the following situation: I'm monitoring a bunch of services on a > > bunch of servers (ok, nothing new so far). > > All machines run the same services, but not all services is being > > monitored by now (since they are custom services, i'm still writing > > plugins for all of them). New machines are being added every day, and > > also, i'm writing new plugins to check the services. > > > > When a new machine is added, is relatively easy: i copy the config file > > from another machine and change the name of the host and other unique > > attributes, but when a new service is added (a.k.a. i have finish > > another plugin), i need to edit all the dozens of config files for each > > server to add this new service. > > > > There is a smarter way to do this? > > Sorry for my poor english. Tks in advance. > > > Try using templates and hostgroups > Templates will allow you to use a central base definition to the service > or host . > http://nagios.sourceforge.net/docs/3_0/objectinheritance.html > > you can assign a service to a hostgroup and thus all host in the group > will have the service checked on them. > > http://nagios.sourceforge.net/docs/3_0/objectdefinitions.html#service > Also, using a web-based configuration tool such as NagiosQL ( http://www.nagiosql.org), in conjunction with things like templates, hostgroups, servicegroups etc. can make life a whole lot easier. Doubtless there are other web-based configuration front-ends for Nagios around as well... > > -- > Never,Ever Cut A Deal With a Dragon > > > I am doing a Charity Bike ride On the 27 of June for the > Capital to Coast Charity. Please help by Donating > http://www.justgiving.com/Lovefilm-capital-to-coast > > > > > ------------------------------------------------------------------------------ > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From spglegle at yahoo.fr Tue May 4 15:49:52 2010 From: spglegle at yahoo.fr (SP GLE) Date: Tue, 4 May 2010 13:49:52 +0000 (GMT) Subject: Handling alarms from unknown/undefined services/categories Message-ID: <586549.18565.qm@web27905.mail.ukl.yahoo.com> Hi, I would like to use nagios to be notified of alarms using passive checks, something like snmp traps but not equal to. Here is the context. The alarms are received by an SMTP server, a perl script is used to parse the received SMTP message and send information using nsca. The SMTP message contains these information : - Alarm level : SUCCESS|WARNING|CRITICAL - Source Ip Addr : the host that sent the alarm - Date/Time : the datetime of the alarm - a Message, the message is composed of two parts ":
" for ex. "CPU Temperature: 25" "Ip Link 12: link down" I would like to forward these alarms to nagios using nsca, but I've got the following major problems : - I don't have the full list of possible categories/alarms....I just have the previously described format ":
" - I know some sets of categories are quite large, for ex. the "Ip Link \d+" category can be related to over 250 ip links. - Is it possible to handle such alarms with nagios ? If so, how can I configure Nagios to monitor such 'unknown' services ? (Is there a specific plugin ?) Thank you. Regards ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From a.huth at tmr.net Tue May 4 15:05:44 2010 From: a.huth at tmr.net (Alex Huth) Date: Tue, 04 May 2010 15:05:44 +0200 Subject: MySQL 5 Replikation Message-ID: <1272978344.1483.14.camel@borusse.ewmr.base> Hello! I have to set up a Master/Slave Replication in mySQL 5. Before i do this i want to have tools watching this replication. Unfortunately i haven?t found a Nagios plugin, which is checking more than something like the time the slave is behind. Is there something to check all data between the Nodes or do you trust the replication without checking it? Greetings Alex Huth ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From benny at bennyvision.com Tue May 4 16:22:52 2010 From: benny at bennyvision.com (C. Bensend) Date: Tue, 4 May 2010 09:22:52 -0500 Subject: MySQL 5 Replikation In-Reply-To: <1272978344.1483.14.camel@borusse.ewmr.base> References: <1272978344.1483.14.camel@borusse.ewmr.base> Message-ID: <19eacf729feb9c83e222d8719d3f7256.squirrel@webmail.stinkweasel.net> > I have to set up a Master/Slave Replication in mySQL 5. Before i do this > i want to have tools watching this replication. Unfortunately i haven?t > found a Nagios plugin, which is checking more than something like the > time the slave is behind. > Is there something to check all data between the Nodes or do you trust > the replication without checking it? In a previous life, I monitored some 60 or so remote sites, each one with a replicating master-slave pair of MySQL 4.x servers. The base check_mysql (with the "-S" switch) monitored the replication just fine for us, including the amount of time a slave was behind, if the slave thread was running properly, and a number of other details. It never missed any replication failures, and trust me, MySQL replication failed all the freakin' time. Benny -- Me: 'How big a monster can you take out with one of those? Would you win a fight with Godzilla?' Jim: 'You could disassemble Godzilla at a range of seven miles.' -- Blog entry about the 76mm Melera, a gun on a US Navy Perry-class frigate that Somali pirates tried to seize ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From andrew.ford at wachovia.com Tue May 4 20:53:24 2010 From: andrew.ford at wachovia.com (Ford, Andy) Date: Tue, 4 May 2010 14:53:24 -0400 Subject: nsca daemon performance Message-ID: I'm having a problem with my nsca daemon (v. 2.7.2) in a distributed monitoring setup (nagios 3.2.1 on SLES 10.2) and I was wondering if anyone else has seen something like this. nsca seems to be dropping incoming passives in an inconsistent way, sometimes keeping up, other times seeming to skip 1,2, or even sometimes 3 incoming passive checks. I've checked the collector, and it seems to be sending the nsca packets very solidly on the 5 minute normal_check_interval. These are the times that nagios logged a "PASSIVE SERVICE CHECK" for a typical service on a single host since last midnight: 00:03:55 00:08:56 00:13:50 00:18:51 00:28:53 00:38:55 00:43:56 00:48:50 00:53:51 00:58:52 01:08:54 01:13:55 01:18:49 01:23:50 01:38:53 01:43:54 01:48:55 01:53:49 01:58:50 02:18:54 02:23:55 02:28:49 02:33:50 02:53:47 02:58:55 03:03:56 03:08:50 03:23:46 03:28:54 03:33:55 03:38:56 03:43:50 03:48:51 03:58:46 04:03:54 04:08:55 04:13:56 04:18:50 04:38:54 04:43:55 04:48:56 04:53:50 04:58:51 05:03:52 05:13:54 05:18:55 05:23:56 05:28:50 05:48:54 05:53:55 05:58:49 06:03:50 06:23:54 06:28:48 06:33:49 06:38:50 07:03:55 07:08:49 07:13:50 07:33:54 07:38:55 07:43:56 07:48:50 08:03:46 08:08:54 08:13:55 08:18:49 08:23:50 08:33:52 08:43:54 08:48:55 08:53:56 08:58:50 09:08:45 09:18:54 09:23:55 09:28:49 09:33:50 09:38:44 09:53:54 09:58:55 10:03:49 10:08:50 10:28:54 10:33:55 10:38:49 10:43:50 11:08:55 11:13:56 11:18:50 11:33:46 11:38:54 11:43:55 11:48:56 11:53:50 12:13:54 12:18:55 Andy Ford Network Security Compliance & Automation, Wells Fargo & Co. 314-600-7025 andrew.ford at wachovia.com -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From neufpas at gmail.com Tue May 4 21:50:41 2010 From: neufpas at gmail.com (Noel Platzke) Date: Tue, 4 May 2010 15:50:41 -0400 Subject: nsca daemon performance In-Reply-To: References: Message-ID: If you're running the nsca daemon in xinetd you can up the number of threads and connections. I had a similar issue as I started adding services and eventually fixed it by adding these lines to /etc/xinetd.d/nsca instances = UNLIMITED per_source = UNLIMITED cps = 5000 0 On Tue, May 4, 2010 at 2:53 PM, Ford, Andy wrote: > I?m having a problem with my nsca daemon (v. 2.7.2) in a distributed > monitoring setup (nagios 3.2.1 on SLES 10.2) and I was wondering if anyone > else has seen something like this. > > nsca seems to be dropping incoming passives in an inconsistent way, > sometimes keeping up, other times seeming to skip 1,2, or even sometimes 3 > incoming passive checks. > > I?ve checked the collector, and it seems to be sending the nsca packets > very solidly on the 5 minute normal_check_interval. > > > > These are the times that nagios logged a ?PASSIVE SERVICE CHECK? for a > typical service on a single host since last midnight: > > > > 00:03:55 > > 00:08:56 > > 00:13:50 > > 00:18:51 > > 00:28:53 > > 00:38:55 > > 00:43:56 > > 00:48:50 > > 00:53:51 > > 00:58:52 > > 01:08:54 > > 01:13:55 > > 01:18:49 > > 01:23:50 > > 01:38:53 > > 01:43:54 > > 01:48:55 > > 01:53:49 > > 01:58:50 > > 02:18:54 > > 02:23:55 > > 02:28:49 > > 02:33:50 > > 02:53:47 > > 02:58:55 > > 03:03:56 > > 03:08:50 > > 03:23:46 > > 03:28:54 > > 03:33:55 > > 03:38:56 > > 03:43:50 > > 03:48:51 > > 03:58:46 > > 04:03:54 > > 04:08:55 > > 04:13:56 > > 04:18:50 > > 04:38:54 > > 04:43:55 > > 04:48:56 > > 04:53:50 > > 04:58:51 > > 05:03:52 > > 05:13:54 > > 05:18:55 > > 05:23:56 > > 05:28:50 > > 05:48:54 > > 05:53:55 > > 05:58:49 > > 06:03:50 > > 06:23:54 > > 06:28:48 > > 06:33:49 > > 06:38:50 > > 07:03:55 > > 07:08:49 > > 07:13:50 > > 07:33:54 > > 07:38:55 > > 07:43:56 > > 07:48:50 > > 08:03:46 > > 08:08:54 > > 08:13:55 > > 08:18:49 > > 08:23:50 > > 08:33:52 > > 08:43:54 > > 08:48:55 > > 08:53:56 > > 08:58:50 > > 09:08:45 > > 09:18:54 > > 09:23:55 > > 09:28:49 > > 09:33:50 > > 09:38:44 > > 09:53:54 > > 09:58:55 > > 10:03:49 > > 10:08:50 > > 10:28:54 > > 10:33:55 > > 10:38:49 > > 10:43:50 > > 11:08:55 > > 11:13:56 > > 11:18:50 > > 11:33:46 > > 11:38:54 > > 11:43:55 > > 11:48:56 > > 11:53:50 > > 12:13:54 > > 12:18:55 > > > > Andy Ford > > Network Security Compliance & Automation, Wells Fargo & Co. > > 314-600-7025 > > andrew.ford at wachovia.com > > > > > ------------------------------------------------------------------------------ > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From andrew.ford at wachovia.com Tue May 4 21:59:29 2010 From: andrew.ford at wachovia.com (Ford, Andy) Date: Tue, 4 May 2010 15:59:29 -0400 Subject: nsca daemon performance In-Reply-To: References: Message-ID: Thanks, Noel, that's very good advice for an xinetd-propelled nsca daemon, but this setup is using a standalone nsca. I used the xinetd approach for many years, maybe I should switch back... From: Noel Platzke [mailto:neufpas at gmail.com] Sent: Tuesday, May 04, 2010 2:51 PM To: Nagios Users List Subject: Re: [Nagios-users] nsca daemon performance If you're running the nsca daemon in xinetd you can up the number of threads and connections. I had a similar issue as I started adding services and eventually fixed it by adding these lines to /etc/xinetd.d/nsca instances = UNLIMITED per_source = UNLIMITED cps = 5000 0 On Tue, May 4, 2010 at 2:53 PM, Ford, Andy > wrote: I'm having a problem with my nsca daemon (v. 2.7.2) in a distributed monitoring setup (nagios 3.2.1 on SLES 10.2) and I was wondering if anyone else has seen something like this. nsca seems to be dropping incoming passives in an inconsistent way, sometimes keeping up, other times seeming to skip 1,2, or even sometimes 3 incoming passive checks. I've checked the collector, and it seems to be sending the nsca packets very solidly on the 5 minute normal_check_interval. These are the times that nagios logged a "PASSIVE SERVICE CHECK" for a typical service on a single host since last midnight: 00:03:55 00:08:56 00:13:50 00:18:51 00:28:53 00:38:55 00:43:56 00:48:50 00:53:51 00:58:52 01:08:54 01:13:55 01:18:49 01:23:50 01:38:53 01:43:54 01:48:55 01:53:49 01:58:50 02:18:54 02:23:55 02:28:49 02:33:50 02:53:47 02:58:55 03:03:56 03:08:50 03:23:46 03:28:54 03:33:55 03:38:56 03:43:50 03:48:51 03:58:46 04:03:54 04:08:55 04:13:56 04:18:50 04:38:54 04:43:55 04:48:56 04:53:50 04:58:51 05:03:52 05:13:54 05:18:55 05:23:56 05:28:50 05:48:54 05:53:55 05:58:49 06:03:50 06:23:54 06:28:48 06:33:49 06:38:50 07:03:55 07:08:49 07:13:50 07:33:54 07:38:55 07:43:56 07:48:50 08:03:46 08:08:54 08:13:55 08:18:49 08:23:50 08:33:52 08:43:54 08:48:55 08:53:56 08:58:50 09:08:45 09:18:54 09:23:55 09:28:49 09:33:50 09:38:44 09:53:54 09:58:55 10:03:49 10:08:50 10:28:54 10:33:55 10:38:49 10:43:50 11:08:55 11:13:56 11:18:50 11:33:46 11:38:54 11:43:55 11:48:56 11:53:50 12:13:54 12:18:55 Andy Ford Network Security Compliance & Automation, Wells Fargo & Co. 314-600-7025 andrew.ford at wachovia.com ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lscarneiro at veltrac.com.br Tue May 4 22:06:32 2010 From: lscarneiro at veltrac.com.br (Leonardo Carneiro - Veltrac) Date: Tue, 04 May 2010 17:06:32 -0300 Subject: managing a large number of server In-Reply-To: <4BE02145.4060302@veltrac.com.br> References: <4BE019BA.3070809@veltrac.com.br> <4BE01DB4.8070309@flatto.net> <4BE02145.4060302@veltrac.com.br> Message-ID: <4BE07E48.1080208@veltrac.com.br> Leonardo Carneiro - Veltrac wrote: > Assaf Flatto wrote: > >> Leonardo Carneiro - Veltrac wrote: >> >> >>> Hi list, >>> >>> I have the following situation: I'm monitoring a bunch of services on a >>> bunch of servers (ok, nothing new so far). >>> All machines run the same services, but not all services is being >>> monitored by now (since they are custom services, i'm still writing >>> plugins for all of them). New machines are being added every day, and >>> also, i'm writing new plugins to check the services. >>> >>> When a new machine is added, is relatively easy: i copy the config file >>> from another machine and change the name of the host and other unique >>> attributes, but when a new service is added (a.k.a. i have finish >>> another plugin), i need to edit all the dozens of config files for each >>> server to add this new service. >>> >>> There is a smarter way to do this? >>> Sorry for my poor english. Tks in advance. >>> >>> >>> >> Try using templates and hostgroups >> Templates will allow you to use a central base definition to the service >> or host . >> http://nagios.sourceforge.net/docs/3_0/objectinheritance.html >> >> you can assign a service to a hostgroup and thus all host in the group >> will have the service checked on them. >> >> http://nagios.sourceforge.net/docs/3_0/objectdefinitions.html#service >> > Tks for showing the way. I'll check it out. Worked like a charm. It's MUCH EASIER now to manage my hosts. Though i'm not a programmer, i do understand the OOP, and this is very much the same =) Tks again guys. ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From trisha at rockyou.com Wed May 5 02:13:45 2010 From: trisha at rockyou.com (Trisha Hoang) Date: Tue, 4 May 2010 17:13:45 -0700 Subject: high host latency on nagios master Message-ID: Hi, The nagios *master *got really high host latency and I'm not sure how to tweak it. I ran the check_ping plugin on a handful of hosts and the rta averaged at 0.2 second so it's not the network. *Environment:* - 565 hosts - 6790 passive checks from the slaves - not using event broker - master server *actively* executes the hosts checks every 5 minutes and *passively *processes checks every 1 minute - not doing performance data *Nagiostats* Nagios Stats 3.2.1 Copyright (c) 2003-2008 Ethan Galstad (www.nagios.org) Last Modified: 03-09-2010 License: GPL CURRENT STATUS DATA ------------------------------------------------------ Status File: /var/log/nagios/status.dat Status File Age: 0d 0h 0m 23s Status File Version: 3.2.1 Program Running Time: 0d 1h 32m 19s Nagios PID: 28282 Used/High/Total Command Buffers: 1316 / 3066 / 4096 Total Services: 7745 Services Checked: 7745 Services Scheduled: 1381 Services Actively Checked: 955 Services Passively Checked: 6790 Total Service State Change: 0.000 / 9.740 / 0.007 % Active Service Latency: 18.948 / 205.144 / 165.751 sec Active Service Execution Time: 0.007 / 9.051 / 0.055 sec Active Service State Change: 0.000 / 5.460 / 0.006 % Active Services Last 1/5/15/60 min: 0 / 0 / 0 / 0 Passive Service Latency: 34.359 / 190.247 / 76.739 sec Passive Service State Change: 0.000 / 9.740 / 0.008 % Passive Services Last 1/5/15/60 min: 0 / 3054 / 6774 / 6784 Services Ok/Warn/Unk/Crit: 7720 / 1 / 0 / 24 Services Flapping: 27 Services In Downtime: 0 Total Hosts: 566 Hosts Checked: 566 Hosts Scheduled: 566 Hosts Actively Checked: 566 Host Passively Checked: 0 Total Host State Change: 0.000 / 0.000 / 0.000 % Active Host Latency: 0.000 / 3410.087 / 2413.051 sec Active Host Execution Time: 0.007 / 10.010 / 0.063 sec Active Host State Change: 0.000 / 0.000 / 0.000 % Active Hosts Last 1/5/15/60 min: 0 / 8 / 10 / 565 Passive Host Latency: 0.000 / 0.000 / 0.000 sec Passive Host State Change: 0.000 / 0.000 / 0.000 % Passive Hosts Last 1/5/15/60 min: 0 / 0 / 0 / 0 Hosts Up/Down/Unreach: 563 / 3 / 0 Hosts Flapping: 1 Hosts In Downtime: 0 Active Host Checks Last 1/5/15 min: 5 / 32 / 75 Scheduled: 0 / 0 / 0 On-demand: 5 / 32 / 75 Parallel: 1 / 11 / 23 Serial: 0 / 0 / 0 Cached: 4 / 21 / 52 Passive Host Checks Last 1/5/15 min: 0 / 0 / 0 Active Service Checks Last 1/5/15 min: 0 / 0 / 0 Scheduled: 0 / 0 / 0 On-demand: 0 / 0 / 0 Cached: 0 / 0 / 0 Passive Service Checks Last 1/5/15 min: 2 / 1455 / 1455 External Commands Last 1/5/15 min: 1302 / 6063 / 20253 *Nagios.cfg* # EXTERNAL COMMAND CHECK INTERVAL # This is the interval at which Nagios should check for external commands. # This value works of the interval_length you specify later. If you leave # that at its default value of 60 (seconds), a value of 1 here will cause # Nagios to check for external commands every minute. If you specify a # number followed by an "s" (i.e. 15s), this will be interpreted to mean # actual seconds rather than a multiple of the interval_length variable. # Note: In addition to reading the external command file at regularly # scheduled intervals, Nagios will also check for external commands after # event handlers are executed. # NOTE: Setting this value to -1 causes Nagios to check the external # command file as often as possible. #command_check_interval=15s command_check_interval=-1 # SERVICE INTER-CHECK DELAY METHOD # This is the method that Nagios should use when initially # "spreading out" service checks when it starts monitoring. The # default is to use smart delay calculation, which will try to # space all service checks out evenly to minimize CPU load. # Using the dumb setting will cause all checks to be scheduled # at the same time (with no delay between them)! This is not a # good thing for production, but is useful when testing the # parallelization functionality. # n = None - don't use any delay between checks # d = Use a "dumb" delay of 1 second between checks # s = Use "smart" inter-check delay calculation # x.xx = Use an inter-check delay of x.xx seconds service_inter_check_delay_method=s # MAXIMUM SERVICE CHECK SPREAD # This variable determines the timeframe (in minutes) from the # program start time that an initial check of all services should # be completed. Default is 30 minutes. max_service_check_spread=30 # SERVICE CHECK INTERLEAVE FACTOR # This variable determines how service checks are interleaved. # Interleaving the service checks allows for a more even # distribution of service checks and reduced load on remote # hosts. Setting this value to 1 is equivalent to how versions # of Nagios previous to 0.0.5 did service checks. Set this # value to s (smart) for automatic calculation of the interleave # factor unless you have a specific reason to change it. # s = Use "smart" interleave factor calculation # x = Use an interleave factor of x, where x is a # number greater than or equal to 1. service_interleave_factor=s # HOST INTER-CHECK DELAY METHOD # This is the method that Nagios should use when initially # "spreading out" host checks when it starts monitoring. The # default is to use smart delay calculation, which will try to # space all host checks out evenly to minimize CPU load. # Using the dumb setting will cause all checks to be scheduled # at the same time (with no delay between them)! # n = None - don't use any delay between checks # d = Use a "dumb" delay of 1 second between checks # s = Use "smart" inter-check delay calculation # x.xx = Use an inter-check delay of x.xx seconds host_inter_check_delay_method=s # MAXIMUM HOST CHECK SPREAD # This variable determines the timeframe (in minutes) from the # program start time that an initial check of all hosts should # be completed. Default is 30 minutes. max_host_check_spread=30 # MAXIMUM CONCURRENT SERVICE CHECKS # This option allows you to specify the maximum number of # service checks that can be run in parallel at any given time. # Specifying a value of 1 for this variable essentially prevents # any service checks from being parallelized. A value of 0 # will not restrict the number of concurrent checks that are # being executed. max_concurrent_checks=0 # HOST AND SERVICE CHECK REAPER FREQUENCY # This is the frequency (in seconds!) that Nagios will process # the results of host and service checks. check_result_reaper_frequency=10 # MAX CHECK RESULT REAPER TIME # This is the max amount of time (in seconds) that a single # check result reaper event will be allowed to run before # returning control back to Nagios so it can perform other # duties. max_check_result_reaper_time=30 # CHECK RESULT PATH # This is directory where Nagios stores the results of host and # service checks that have not yet been processed. # # Note: Make sure that only one instance of Nagios has access # to this directory! check_result_path=/var/log/nagios/spool/checkresults # MAX CHECK RESULT FILE AGE # This option determines the maximum age (in seconds) which check # result files are considered to be valid. Files older than this # threshold will be mercilessly deleted without further processing. max_check_result_file_age=3600 # CACHED HOST CHECK HORIZON # This option determines the maximum amount of time (in seconds) # that the state of a previous host check is considered current. # Cached host states (from host checks that were performed more # recently that the timeframe specified by this value) can immensely # improve performance in regards to the host check logic. # Too high of a value for this option may result in inaccurate host # states being used by Nagios, while a lower value may result in a # performance hit for host checks. Use a value of 0 to disable host # check caching. #cached_host_check_horizon=15 cached_host_check_horizon=60 # CACHED SERVICE CHECK HORIZON # This option determines the maximum amount of time (in seconds) # that the state of a previous service check is considered current. # Cached service states (from service checks that were performed more # recently that the timeframe specified by this value) can immensely # improve performance in regards to predictive dependency checks. # Use a value of 0 to disable service check caching. cached_service_check_horizon=15 # ENABLE PREDICTIVE HOST DEPENDENCY CHECKS # This option determines whether or not Nagios will attempt to execute # checks of hosts when it predicts that future dependency logic test # may be needed. These predictive checks can help ensure that your # host dependency logic works well. # Values: # 0 = Disable predictive checks # 1 = Enable predictive checks (default) enable_predictive_host_dependency_checks=1 # ENABLE PREDICTIVE SERVICE DEPENDENCY CHECKS # This option determines whether or not Nagios will attempt to execute # checks of service when it predicts that future dependency logic test # may be needed. These predictive checks can help ensure that your # service dependency logic works well. # Values: # 0 = Disable predictive checks # 1 = Enable predictive checks (default) enable_predictive_service_dependency_checks=1 # AUTO-RESCHEDULING OPTION # This option determines whether or not Nagios will attempt to # automatically reschedule active host and service checks to # "smooth" them out over time. This can help balance the load on # the monitoring server. # WARNING: THIS IS AN EXPERIMENTAL FEATURE - IT CAN DEGRADE # PERFORMANCE, RATHER THAN INCREASE IT, IF USED IMPROPERLY auto_reschedule_checks=0 # AUTO-RESCHEDULING INTERVAL # This option determines how often (in seconds) Nagios will # attempt to automatically reschedule checks. This option only # has an effect if the auto_reschedule_checks option is enabled. # Default is 30 seconds. # WARNING: THIS IS AN EXPERIMENTAL FEATURE - IT CAN DEGRADE # PERFORMANCE, RATHER THAN INCREASE IT, IF USED IMPROPERLY auto_rescheduling_interval=30 # AUTO-RESCHEDULING WINDOW # This option determines the "window" of time (in seconds) that # Nagios will look at when automatically rescheduling checks. # Only host and service checks that occur in the next X seconds # (determined by this variable) will be rescheduled. This option # only has an effect if the auto_reschedule_checks option is # enabled. Default is 180 seconds (3 minutes). # WARNING: THIS IS AN EXPERIMENTAL FEATURE - IT CAN DEGRADE # PERFORMANCE, RATHER THAN INCREASE IT, IF USED IMPROPERLY auto_rescheduling_window=180 # SLEEP TIME # This is the number of seconds to sleep between checking for system # events and service checks that need to be run. sleep_time=0.25 # TIMEOUT VALUES # These options control how much time Nagios will allow various # types of commands to execute before killing them off. Options # are available for controlling maximum time allotted for # service checks, host checks, event handlers, notifications, the # ocsp command, and performance data commands. All values are in # seconds. service_check_timeout=60 host_check_timeout=30 event_handler_timeout=30 notification_timeout=30 ocsp_timeout=5 perfdata_timeout=5 # AGGRESSIVE HOST CHECKING OPTION # If you don't want to turn on aggressive host checking features, set # this value to 0 (the default). Otherwise set this value to 1 to # enable the aggressive check option. Read the docs for more info # on what aggressive host check is or check out the source code in # base/checks.c use_aggressive_host_checking=0 # SERVICE CHECK EXECUTION OPTION # This determines whether or not Nagios will actively execute # service checks when it initially starts. If this option is # disabled, checks are not actively made, but Nagios can still # receive and process passive check results that come in. Unless # you're implementing redundant hosts or have a special need for # disabling the execution of service checks, leave this enabled! # Values: 1 = enable checks, 0 = disable checks execute_service_checks=0 # PASSIVE SERVICE CHECK ACCEPTANCE OPTION # This determines whether or not Nagios will accept passive # service checks results when it initially (re)starts. # Values: 1 = accept passive checks, 0 = reject passive checks accept_passive_service_checks=1 # HOST CHECK EXECUTION OPTION # This determines whether or not Nagios will actively execute # host checks when it initially starts. If this option is # disabled, checks are not actively made, but Nagios can still # receive and process passive check results that come in. Unless # you're implementing redundant hosts or have a special need for # disabling the execution of host checks, leave this enabled! # Values: 1 = enable checks, 0 = disable checks execute_host_checks=1 # PASSIVE HOST CHECK ACCEPTANCE OPTION # This determines whether or not Nagios will accept passive # host checks results when it initially (re)starts. # Values: 1 = accept passive checks, 0 = reject passive checks accept_passive_host_checks=0 # OBSESS OVER SERVICE CHECKS OPTION # This determines whether or not Nagios will obsess over service # checks and run the ocsp_command defined below. Unless you're # planning on implementing distributed monitoring, do not enable # this option. Read the HTML docs for more information on # implementing distributed monitoring. # Values: 1 = obsess over services, 0 = do not obsess (default) obsess_over_services=0 # OBSESSIVE COMPULSIVE SERVICE PROCESSOR COMMAND # This is the command that is run for every service check that is # processed by Nagios. This command is executed only if the # obsess_over_services option (above) is set to 1. The command # argument is the short name of a command definition that you # define in your host configuration file. Read the HTML docs for # more information on implementing distributed monitoring. #ocsp_command=somecommand # OBSESS OVER HOST CHECKS OPTION # This determines whether or not Nagios will obsess over host # checks and run the ochp_command defined below. Unless you're # planning on implementing distributed monitoring, do not enable # this option. Read the HTML docs for more information on # implementing distributed monitoring. # Values: 1 = obsess over hosts, 0 = do not obsess (default) obsess_over_hosts=0 # OBSESSIVE COMPULSIVE HOST PROCESSOR COMMAND # This is the command that is run for every host check that is # processed by Nagios. This command is executed only if the # obsess_over_hosts option (above) is set to 1. The command # argument is the short name of a command definition that you # define in your host configuration file. Read the HTML docs for # more information on implementing distributed monitoring. #ochp_command=somecommand # SERVICE FRESHNESS CHECK OPTION # This option determines whether or not Nagios will periodically # check the "freshness" of service results. Enabling this option # is useful for ensuring passive checks are received in a timely # manner. # Values: 1 = enabled freshness checking, 0 = disable freshness checking check_service_freshness=1 # SERVICE FRESHNESS CHECK INTERVAL # This setting determines how often (in seconds) Nagios will # check the "freshness" of service check results. If you have # disabled service freshness checking, this option has no effect. #service_freshness_check_interval=60 service_freshness_check_interval=420 # HOST FRESHNESS CHECK OPTION # This option determines whether or not Nagios will periodically # check the "freshness" of host results. Enabling this option # is useful for ensuring passive checks are received in a timely # manner. # Values: 1 = enabled freshness checking, 0 = disable freshness checking check_host_freshness=0 #check_host_freshness=1 # HOST FRESHNESS CHECK INTERVAL # This setting determines how often (in seconds) Nagios will # check the "freshness" of host check results. If you have # disabled host freshness checking, this option has no effect. #host_freshness_check_interval=60 host_freshness_check_interval=420 # ADDITIONAL FRESHNESS THRESHOLD LATENCY # This setting determines the number of seconds that Nagios # will add to any host and service freshness thresholds that # it calculates (those not explicitly specified by the user). #additional_freshness_latency=15 additional_freshness_latency=180 # LARGE INSTALLATION TWEAKS OPTION # This option determines whether or not Nagios will take some shortcuts # which can save on memory and CPU usage in large Nagios installations. # Read the documentation for more information on the benefits/tradeoffs # of enabling this option. # Values: 1 - Enabled tweaks # 0 - Disable tweaks (default) use_large_installation_tweaks=1 # CHILD PROCESS MEMORY OPTION # This option determines whether or not Nagios will free memory in # child processes (processed used to execute system commands and host/ # service checks). If you specify a value here, it will override # program defaults. # Value: 1 - Free memory in child processes # 0 - Do not free memory in child processes #free_child_process_memory=1 # CHILD PROCESS FORKING BEHAVIOR # This option determines how Nagios will fork child processes # (used to execute system commands and host/service checks). Normally # child processes are fork()ed twice, which provides a very high level # of isolation from problems. Fork()ing once is probably enough and will # save a great deal on CPU usage (in large installs), so you might # want to consider using this. If you specify a value here, it will # program defaults. # Value: 1 - Child processes fork() twice # 0 - Child processes fork() just once #child_processes_fork_twice=1 child_processes_fork_twice=0 # DEBUG LEVEL # This option determines how much (if any) debugging information will # be written to the debug file. OR values together to log multiple # types of information. # Values: # -1 = Everything # 0 = Nothing # 1 = Functions # 2 = Configuration # 4 = Process information # 8 = Scheduled events # 16 = Host/service checks # 32 = Notifications # 64 = Event broker # 128 = External commands # 256 = Commands # 512 = Scheduled downtime # 1024 = Comments # 2048 = Macros debug_level=16 # DEBUG VERBOSITY # This option determines how verbose the debug log out will be. # Values: 0 = Brief output # 1 = More detailed # 2 = Very detailed debug_verbosity=1 Thanks in advance for your help. Trisha -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From paul.willis at kent.pnn.police.uk Wed May 5 10:44:01 2010 From: paul.willis at kent.pnn.police.uk (Paul WILLIS PSE 55499) Date: Wed, 05 May 2010 09:44:01 +0100 Subject: Parsing Logs In-Reply-To: <3E7D287A4787439EB82B6097BC206CC9@Because> References: <3E7D287A4787439EB82B6097BC206CC9@Because> Message-ID: <4BE13DE1020000C400017B4A@kent.pnn.police.uk> Stephen Most email systems move messages to a retry queue if the first attempt at sending fails. If your email does something like that you can just write a simple script to check whether it is empty or not and return a status to Nagios. There might be a genuine reason why the first send fails, but at least you get alerted. You could even get the script to send a test email at this point and check whether the queue grows by one. Regards Paul Willis >>> "Stephen H. Dawson" 04/05/10 10:57 >>> Hi, We have looked at the check_smtp option to meet our need, but do not believe this option will work. Looking for perhaps another option. Failure of an SMTP connection to our ISP's mail server with Nagios will only tell us that the Nagios attempt to connect has failed, at that particular point in time. We are running our Nagios box on CentOS. The CentOS attempt itself could well connect and send a message a few seconds/minutes before or after that. We also run the risk of being accused of 'excessive' number of emails with our ISP's email server. So the only reliable way of checking if ISP SMTP relay connection works is to parse the CentOS logs and look for failures to authenticate/send. Has anyone parsed a log for review with Nagios? If so, please advise. Thanks, SHD -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From s.gosenheimer at proio.com Wed May 5 10:56:38 2010 From: s.gosenheimer at proio.com (Sebastian Gosenheimer) Date: Wed, 5 May 2010 10:56:38 +0200 Subject: Different contacts for services on same host Message-ID: <82610571-34BB-4C93-9880-4DA072AD71B6@proio.com> Hi everybody, i'm just having some problems with my contacts and contactgroups. Let's say, i have a contactgroup xy and a contact ab. I set up a host with the contactgroup xy and some services on this host with the contactgroup xy. But now i have a "special" service on this host, where i only want notifications send to the contact ab. Nagios is also sending the notification e-mails correctly to the contact ab, but it's also sending the notification e-mails to the contactgroup xy. Can someone tell what i'm missing? Thank you for your help! Kind regards, --sg Diese E-Mail enth?lt vertrauliche und/oder rechtlich gesch?tzte Informationen. Wenn Sie nicht der richtige Adressat sind oder diese E-Mail irrt?mlich erhalten haben, informieren Sie bitte sofort den Absender und vernichten Sie diese Mail. Das unerlaubte Kopieren sowie die unbefugte Weitergabe dieser Mail sind nicht gestattet. This e-mail may contain confidential and/or privileged information. If you are not the intended recipient (or have received this e-mail in error) please notify the sender immediately and destroy this e-mail. Any unauthorized copying, disclosure or distribution of the material in this e-mail is strictly forbidden. ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From s.gosenheimer at proio.com Wed May 5 10:43:08 2010 From: s.gosenheimer at proio.com (Sebastian Gosenheimer) Date: Wed, 5 May 2010 10:43:08 +0200 Subject: Different contacts for services on same host Message-ID: Hi everybody, i'm just having some problems with my contacts and contactgroups. Let's say, i have a contactgroup xy and a contact ab. I set up a host with the contactgroup xy and some services on this host with the contactgroup xy. But now i have a "special" service on this host, where i only want notifications send to the contact ab. Nagios is also sending the notification e-mails correctly to the contact ab, but it's also sending the notification e-mails to the contactgroup xy. Can someone tell what i'm missing? Thank you for your help! Kind regards, --sg Diese E-Mail enth?lt vertrauliche und/oder rechtlich gesch?tzte Informationen. Wenn Sie nicht der richtige Adressat sind oder diese E-Mail irrt?mlich erhalten haben, informieren Sie bitte sofort den Absender und vernichten Sie diese Mail. Das unerlaubte Kopieren sowie die unbefugte Weitergabe dieser Mail sind nicht gestattet. This e-mail may contain confidential and/or privileged information. If you are not the intended recipient (or have received this e-mail in error) please notify the sender immediately and destroy this e-mail. Any unauthorized copying, disclosure or distribution of the material in this e-mail is strictly forbidden. ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at smetj.net Wed May 5 11:30:41 2010 From: nagios at smetj.net (Jelle Smet) Date: Wed, 5 May 2010 09:30:41 +0000 (UTC) Subject: Parsing Logs References: <3E7D287A4787439EB82B6097BC206CC9@Because> Message-ID: Stephen H. Dawson shdawson.com> writes: Hi Stephen, > Has anyone parsed a log for > review with Nagios?? If so, please advise. This is a shameless self promotion but http://www.smetj.net/wiki/Molog might be helpfull for you. Jelle Smet. ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From service at shdawson.com Wed May 5 11:58:06 2010 From: service at shdawson.com (Stephen H. Dawson) Date: Wed, 5 May 2010 05:58:06 -0400 Subject: Parsing Logs In-Reply-To: References: <3E7D287A4787439EB82B6097BC206CC9@Because> Message-ID: <9BE1F486E3E64A149C580EC6A555113C@Because> Thank You, SHD _____ From: Jelle Smet [mailto:nagios at smetj.net] Sent: Wednesday, May 05, 2010 05:31 E/T To: nagios-users at lists.sourceforge.net Subject: Re: [Nagios-users] Parsing Logs Stephen H. Dawson shdawson.com> writes: Hi Stephen, > Has anyone parsed a log for > review with Nagios? If so, please advise. This is a shameless self promotion but http://www.smetj.net/wiki/Molog might be helpfull for you. Jelle Smet. ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null _____ avast! Antivirus: Inbound message clean. Virus Database (VPS): 100504-2, 05/04/2010 Tested on: 05/5/2010 05:48 E/T avast! - copyright (c) 1988-2010 ALWIL Software. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From davide.blasi at infracom.it Wed May 5 12:49:43 2010 From: davide.blasi at infracom.it (Davide Blasi) Date: Wed, 5 May 2010 12:49:43 +0200 Subject: check_disk plugin Message-ID: <20100505124943.131b4624@infracom.it> Hi list, I have a question about check_disk plugin. Running check_disk -h I read : [...] -I, --ignore-eregi-path=PATH, --ignore-eregi-partition=PARTITION Regular expression to ignore selected path/partition (case insensitive) (may be repeated) [...] Good it working fine :) But now I have to add another path to ignore. This help say "may be repeated" but if I try to use my check like this " -I /my/fist/.* -I /second/.* " only first occurrence works. I tried with one -I using coma, colon, semicolon or space to separate paths but nothing works :( How can I concatenate more than one path to ignore ? Thank you in advance and sorry for my bad English ;) Dave ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From dit.dash at gmail.com Wed May 5 14:33:59 2010 From: dit.dash at gmail.com (dave stern - e-mail.pluribus.unum) Date: Wed, 5 May 2010 08:33:59 -0400 Subject: monitoring radius on windows Message-ID: We're interested in monitoring radius with nagios 3.2.1. We have IAS installed on two windows boxes providing separate radius aaa authentication. I tried the check_radius.pl plugin as well as another compilable one but they don't appear to work despite providing them with user, password and secret. I suspect the way windows handles radius thru IAS may be different. In the case of one of the radius boxes, I just installed NSClient++ and monitored for the service. But the otehr box is a windows 2008 server and doesn't use service for IAS but roles. Anyone have any idea how to monitor? TIA ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From richard.lynch at rasmussen.edu Wed May 5 16:04:01 2010 From: richard.lynch at rasmussen.edu (Richard Lynch) Date: Wed, 05 May 2010 09:04:01 -0500 Subject: Parsing Logs In-Reply-To: <3E7D287A4787439EB82B6097BC206CC9@Because> References: <3E7D287A4787439EB82B6097BC206CC9@Because> Message-ID: I?m just a Nagios-Newbie, but it seems like with all the tools in Nagios; escalation, flapping, etc you ought to be able to use check_smtp to provide a pretty good idea of whether the SMTP connection is really up, or really down, over a period of time, without resorting to grepping log files... On 5/4/10 4:57 AM, "Stephen H. Dawson" wrote: > Hi, > > > We have looked at the check_smtp option to meet our need, but do not believe > this option will work. Looking for perhaps another option. > > Failure of an SMTP connection to our ISP's mail server with Nagios will only > tell us that the Nagios attempt to connect has failed, at that particular > point in time. We are running our Nagios box on CentOS. The CentOS attempt > itself could well connect and send a message a few seconds/minutes before or > after that. We also run the risk of being accused of 'excessive' number of > emails with our ISP's email server. > So the only reliable way of checking if ISP SMTP relay connection works is to > parse the CentOS logs and look for failures to authenticate/send. > > Has anyone parsed a log for review with Nagios? If so, please advise. > > > > Thanks, > SHD > > > ------------------------------------------------------------------------------ > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From richard.lynch at rasmussen.edu Wed May 5 16:05:13 2010 From: richard.lynch at rasmussen.edu (Richard Lynch) Date: Wed, 05 May 2010 09:05:13 -0500 Subject: managing a large number of server In-Reply-To: <4BE019BA.3070809@veltrac.com.br> References: <4BE019BA.3070809@veltrac.com.br> Message-ID: Perhaps put your config files into version control (svn, git, even cvs) and do an "svn -r STABLE up" on all the boxes, perhaps even automating that. On 5/4/10 7:57 AM, "Leonardo Carneiro - Veltrac" wrote: > Hi list, > > I have the following situation: I'm monitoring a bunch of services on a > bunch of servers (ok, nothing new so far). > All machines run the same services, but not all services is being > monitored by now (since they are custom services, i'm still writing > plugins for all of them). New machines are being added every day, and > also, i'm writing new plugins to check the services. > > When a new machine is added, is relatively easy: i copy the config file > from another machine and change the name of the host and other unique > attributes, but when a new service is added (a.k.a. i have finish > another plugin), i need to edit all the dozens of config files for each > server to add this new service. > > There is a smarter way to do this? > Sorry for my poor english. Tks in advance. ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From richard.lynch at rasmussen.edu Wed May 5 16:12:49 2010 From: richard.lynch at rasmussen.edu (Richard Lynch) Date: Wed, 05 May 2010 09:12:49 -0500 Subject: check_disk plugin In-Reply-To: <20100505124943.131b4624@infracom.it> References: <20100505124943.131b4624@infracom.it> Message-ID: I would have wrapped he wlidcard paths in quotes, but I have no idea if that's right or not... -I "/my/fist/.*" -I "/second/.*" On 5/5/10 5:49 AM, "Davide Blasi" wrote: > > Hi list, > > I have a question about check_disk plugin. > Running check_disk -h I read : > > [...] > -I, --ignore-eregi-path=PATH, --ignore-eregi-partition=PARTITION > Regular expression to ignore selected path/partition (case insensitive) > (may be repeated) > [...] > > Good it working fine :) > > But now I have to add another path to ignore. > This help say "may be repeated" but if I try to use my check like this " -I > /my/fist/.* -I /second/.* " only first occurrence works. > I tried with one -I using coma, colon, semicolon or space to separate paths > but nothing works :( > > How can I concatenate more than one path to ignore ? > > Thank you in advance and sorry for my bad English ;) > > Dave > > > ------------------------------------------------------------------------------ > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Thu May 6 03:24:28 2010 From: patrick.morris at hp.com (patrick.morris at hp.com) Date: Wed, 5 May 2010 18:24:28 -0700 Subject: Parsing Logs In-Reply-To: References: <3E7D287A4787439EB82B6097BC206CC9@Because> Message-ID: <20100506012428.GG17129@bakgwai.americas.hpqcorp.net> On Wed, 05 May 2010, Richard Lynch wrote: > I?m just a Nagios-Newbie, but it seems like with all the tools in Nagios; escalation, flapping, etc you ought to be able to use check_smtp to provide a pretty good idea of whether the SMTP connection is really up, or really down, over a period of time, without resorting to grepping log files... check_smtp does what it's designed to do really well: It will tell you whether an SMTP listener is up and responding to connections. In this case, I think the OP was trying to check SMTP authentication. That's a little trickier, since an some cases the SMTP server may be up and running just fine, but the auth piece is *never* required to make a connection to an SMTP server; it's only used when determining whether the person who is already connected is allowed to relay through the server, and comes after all the things check_smtp is designed to look at. That said, it's not a hard thing to check. A simple expect script could do it, and you might even be able to pull it off with check_tcp. However, it what you really want to know is if the entire mail flow is working, I'm a fan of having procmail look for trigger messages and send a passive check result to Nagios when it sees one. A cron job can be set up to send mail on a regular basis, and the check can then be configured to alert if a passive check hasn't been triggered within a certain amount of time. Sure, it's several pieces to set up, but it works. ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From benny at bennyvision.com Thu May 6 03:36:12 2010 From: benny at bennyvision.com (C. Bensend) Date: Wed, 5 May 2010 20:36:12 -0500 Subject: Parsing Logs In-Reply-To: <20100506012428.GG17129@bakgwai.americas.hpqcorp.net> References: <3E7D287A4787439EB82B6097BC206CC9@Because> <20100506012428.GG17129@bakgwai.americas.hpqcorp.net> Message-ID: <6dfff4591cc5be0f41c69e3e49cf3aed.squirrel@webmail.stinkweasel.net> > However, it what you really want to know is if the entire mail flow is > working, I'm a fan of having procmail look for trigger messages and send > a passive check result to Nagios when it sees one. A cron job can be > set up to send mail on a regular basis, and the check can then > be configured to alert if a passive check hasn't been triggered within > a certain amount of time. Sure, it's several pieces to set up, but it > works. Possibly OT, but if this *is* the intent of the OP, this particular package works very well for me in our Exchange environment: http://exchange.nagios.org/directory/Plugins/Email-and-Groupware/check_email_delivery/details It will alert on failure of any part of the email path (submission, routing, delivery), as well as the latency from submission to delivery. HTH, Benny -- Me: 'How big a monster can you take out with one of those? Would you win a fight with Godzilla?' Jim: 'You could disassemble Godzilla at a range of seven miles.' -- Blog entry about the 76mm Melera, a gun on a US Navy Perry-class frigate that Somali pirates tried to seize ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From zarrelli at linux.it Thu May 6 10:01:27 2010 From: zarrelli at linux.it (Giorgio Zarrelli) Date: Thu, 6 May 2010 10:01:27 +0200 (CEST) Subject: Check for the IBM ds4000 series totalstorage cabinett - FIX Message-ID: <715a47138d33b4c92813c74bd3744414.squirrel@picard.linux.it> Hi, after the latest firmware upgrade to a IBM DS 4700, the "Check for the IBM ds4000 series totalstorage cabinet" stopped working. I made a small fix to the perl plugin, just 2 lines. To have the plugin working with the latest cli, make this substitutions: Line 164 Change if($line=~/Array status:\s*([^\s]*)/i) { with if(($line=~/Array status:\s*([^\s]*)/i) || ($line=~/Status:\s*([^\s]*)/i)) { Line 167 Change if(!defined($array_status) || !($array_status=~/online/i)) { with if(!defined($array_status) || !(($array_status=~/online/i) || ($array_status=~/optimal/i))) { Done, now it should work. The latest upgrade changes the output of cli messages, that's why the script doesn't work anymore. The fix traps the old and new status messages. Hope it helps Cheers, Giorgio Zarrelli ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From davide.blasi at infracom.it Thu May 6 10:12:33 2010 From: davide.blasi at infracom.it (Davide Blasi) Date: Thu, 6 May 2010 10:12:33 +0200 Subject: check_disk plugin In-Reply-To: References: <20100505124943.131b4624@infracom.it> Message-ID: <20100506101233.1e2cf0db@infracom.it> with or without quotes give me the same result :( Dave Date: Wed, 5 May 2010 16:12:49 +0200 From: Richard Lynch To: Nagios Users List Subject: Re: [Nagios-users] check_disk plugin I would have wrapped he wlidcard paths in quotes, but I have no idea if that's right or not... -I "/my/fist/.*" -I "/second/.*" On 5/5/10 5:49 AM, "Davide Blasi" wrote: > > Hi list, > > I have a question about check_disk plugin. > Running check_disk -h I read : > > [...] > -I, --ignore-eregi-path=PATH, --ignore-eregi-partition=PARTITION > Regular expression to ignore selected path/partition (case insensitive) > (may be repeated) > [...] > > Good it working fine :) > > But now I have to add another path to ignore. > This help say "may be repeated" but if I try to use my check like this " -I > /my/fist/.* -I /second/.* " only first occurrence works. > I tried with one -I using coma, colon, semicolon or space to separate paths > but nothing works :( > > How can I concatenate more than one path to ignore ? > > Thank you in advance and sorry for my bad English ;) > > Dave > > > ------------------------------------------------------------------------------ > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lomiz.mail at gmail.com Thu May 6 11:55:13 2010 From: lomiz.mail at gmail.com (Enrico Zimol) Date: Thu, 6 May 2010 11:55:13 +0200 Subject: Check for the IBM ds4000 series totalstorage cabinett - FIX In-Reply-To: <715a47138d33b4c92813c74bd3744414.squirrel@picard.linux.it> References: <715a47138d33b4c92813c74bd3744414.squirrel@picard.linux.it> Message-ID: Hi at all, I'm newbie on nagios and I'm writing here to ask you for suggestions abut how to structure my monitoring situation. I've to monitor linux servers for about 15/20 customers, from 1 to 5 server for each customer. We aren't on vpn with customers, so this servers are all behind NAT. That isn't a problem because we are the administrator of the firewall (other linux server) so we can manage any kind of DNAT and filter rule. I read on official documentation that suggest to use NCSA addon for distributed monitoring, but we choose to use NRPE addon for different motivations like: -customer force us to do that -the number of monitored servers for each customer will never grow up -the services to monitor for each server are the same (raid hw/sw, disk usage etc) -we need a completly centralized monitoring structure For last sentence I thought to use the arguments option on NRPE (yes, I read the SECURITY document). Besides, to solve the problem of NAT with NRPE I'll do DNAT on firewall and the port parameter on check_nrpe plugin (is there problems to do that? I did little tests but I prefear a confirm) To manage this structure I need to organized a well-formed config file structure on nagios server. I thinked to structure it like this obj--| |-->templatelinuxserversgeneral.cfg | |-->customer_1_directory|->templateserver.cfg | |->server1.cfg | |->server2.cfg | |->servern.cfg | |-->customer_2_directory|->templateserver.cfg |->server1.cfg |->servern.cfg Where: -templatelinuxserversgeneral.cfg is a very basic template for server -customer_1_directory in wich there is 1 file for each customer's server -templateserver.cfg will use templatelinuxserversgeneral and will add more specific common variabiles for that customer's server like the public IPAddress that will be the same for each customer's server. -servern.cfg in wich there will be some very specific server variables like nrpe port (read up). What do you think? How can I organize that service-server combination? Thank's so much P.S. sorry for my bad english -- Enrico Zimol ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lomiz.mail at gmail.com Thu May 6 11:57:23 2010 From: lomiz.mail at gmail.com (Enrico Zimol) Date: Thu, 6 May 2010 11:57:23 +0200 Subject: Distributed monitoring Message-ID: wrong 3d before, sorry On 6 May 2010 11:55, Enrico Zimol wrote: > Hi at all, > I'm newbie on nagios and I'm writing here to ask you for suggestions > abut how to structure my monitoring situation. > I've to monitor linux servers for about 15/20 customers, from 1 to 5 > server for each customer. > We aren't on vpn with customers, so this servers are all behind NAT. > That isn't a problem because we are the administrator of the firewall > (other linux server) so we can manage any kind of DNAT and filter > rule. > > I read on official documentation that suggest to use NCSA addon for > distributed monitoring, but we choose to use NRPE addon for different > motivations like: > -customer force us to do that > -the number of monitored servers for each customer will never grow up > -the services to monitor for each server are the same (raid hw/sw, > disk usage etc) > -we need a completly centralized monitoring structure > > For last sentence I thought to use the arguments option on NRPE (yes, > I read the SECURITY document). > Besides, to solve the problem of NAT with NRPE I'll do DNAT on > firewall and the port parameter on check_nrpe plugin (is there > problems to do that? I did little tests but I prefear a confirm) > > > To manage this structure I need to organized a well-formed config file > structure on nagios server. > > I thinked to structure it like this > > obj--| > ? ? ? ?|-->templatelinuxserversgeneral.cfg > ? ? ? ?| > ? ? ? ?|-->customer_1_directory|->templateserver.cfg > ? ? ? ?| ? ? ? ? ? ? ? ? ? ? ? |->server1.cfg > ? ? ? ?| ? ? ? ? ? ? ? ? ? ? ? |->server2.cfg > ? ? ? ?| ? ? ? ? ? ? ? ? ? ? ? |->servern.cfg > ? ? ? ?| > ? ? ? ?|-->customer_2_directory|->templateserver.cfg > ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?|->server1.cfg > ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ?|->servern.cfg > > > Where: > -templatelinuxserversgeneral.cfg is a very basic template for server > -customer_1_directory in wich there is 1 file for each customer's server > -templateserver.cfg will use templatelinuxserversgeneral and will add > more specific common variabiles for that customer's server like the > public IPAddress that will be the same for each customer's server. > -servern.cfg in wich there will be some very specific server variables > like nrpe port (read up). > > What do you think? > How can I organize that service-server combination? > > > Thank's so much > > P.S. sorry for my bad english > > -- > Enrico Zimol > ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at vitro.co.uk Thu May 6 12:10:36 2010 From: nagios at vitro.co.uk (kristian) Date: Thu, 6 May 2010 11:10:36 +0100 Subject: High latency when 15% hosts offline Message-ID: <005701caed04$61270640$237512c0$@co.uk> Hi I'm running Nagios Core 3.2.1 Currently we have a network switch down, meaning all hosts beneath that switch are unreachable, 42 in number (from a total of 336) . In Nagios I have the switch set up as the parent. The switch I have set to be in scheduled downtime until we get a replacement, to prevent notifications being sent out. I am finding that the service check latency is enormous and the scheduling queue is slipping behind in time. For example, it is now 11:04am and the next check at the top of the scheduling queue should have run at 9:52am. Here are the service metrics from the Perf. Info page; Check Execution Time: 0.00 sec 30.19 sec 2.170 sec Check Latency: 0.00 sec 13612.54 sec 7025.395 sec Percent State Change: 0.00% 17.37% 0.50% Are there any ways I can reduce this latency, other than disabling active checks on all the unreachable hosts? Or any 'parallel' check tweaks I may have mis-configured? I'm happy to provide any other info Thanks for any help Kristian -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From zarrelli at linux.it Thu May 6 12:35:52 2010 From: zarrelli at linux.it (Giorgio Zarrelli) Date: Thu, 6 May 2010 12:35:52 +0200 (CEST) Subject: Check for the IBM ds4000 series totalstorage cabinett - FIX In-Reply-To: References: <715a47138d33b4c92813c74bd3744414.squirrel@picard.linux.it> Message-ID: <28fff805bf30bd12ba54197be07d2c7f.squirrel@picard.linux.it> Hi, yes you can. I splitted all the major files in smaller files and nested directories, so it's easier for me to manage all the services and hosts. And I assure you, I have an heavy "split" configuration. > Hi at all, > I'm newbie on nagios and I'm writing here to ask you for suggestions > abut how to structure my monitoring situation. > I've to monitor linux servers for about 15/20 customers, from 1 to 5 > server for each customer. > We aren't on vpn with customers, so this servers are all behind NAT. > That isn't a problem because we are the administrator of the firewall > (other linux server) so we can manage any kind of DNAT and filter > rule. > > I read on official documentation that suggest to use NCSA addon for > distributed monitoring, but we choose to use NRPE addon for different > motivations like: > -customer force us to do that > -the number of monitored servers for each customer will never grow up > -the services to monitor for each server are the same (raid hw/sw, > disk usage etc) > -we need a completly centralized monitoring structure > > For last sentence I thought to use the arguments option on NRPE (yes, > I read the SECURITY document). > Besides, to solve the problem of NAT with NRPE I'll do DNAT on > firewall and the port parameter on check_nrpe plugin (is there > problems to do that? I did little tests but I prefear a confirm) > > > To manage this structure I need to organized a well-formed config file > structure on nagios server. > > I thinked to structure it like this > > obj--| > |-->templatelinuxserversgeneral.cfg > | > |-->customer_1_directory|->templateserver.cfg > | |->server1.cfg > | |->server2.cfg > | |->servern.cfg > | > |-->customer_2_directory|->templateserver.cfg > |->server1.cfg > |->servern.cfg > > > Where: > -templatelinuxserversgeneral.cfg is a very basic template for server > -customer_1_directory in wich there is 1 file for each customer's server > -templateserver.cfg will use templatelinuxserversgeneral and will add > more specific common variabiles for that customer's server like the > public IPAddress that will be the same for each customer's server. > -servern.cfg in wich there will be some very specific server variables > like nrpe port (read up). > > What do you think? > How can I organize that service-server combination? > > > Thank's so much > > P.S. sorry for my bad english > > -- > Enrico Zimol > > ------------------------------------------------------------------------------ > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Thu May 6 13:56:12 2010 From: nagios at flatto.net (Assaf Flatto) Date: Thu, 06 May 2010 12:56:12 +0100 Subject: Different contacts for services on same host In-Reply-To: References: Message-ID: <4BE2AE5C.6020107@flatto.net> Sebastian Gosenheimer wrote: > Hi everybody, > > i'm just having some problems with my contacts and contactgroups. > > Let's say, i have a contactgroup xy and a contact ab. I set up a host with the contactgroup xy and some services on this host with the contactgroup xy. But now i have a "special" service on this host, where i only want notifications send to the contact ab. Nagios is also sending the notification e-mails correctly to the contact ab, but it's also sending the notification e-mails to the contactgroup xy. > > Can someone tell what i'm missing? Thank you for your help! > > Kind regards, > --sg > > Is the service using any templates ? Does the template have a contactgroup directive in it ? you need to override any template or wide definitions that cover the service and put in the service specific definitions the directive : "contacts ab" this will tell the service to just send to the specific contact . Assaf -- Never,Ever Cut A Deal With a Dragon I am doing a Charity Bike ride On the 27 of June for the Capital to Coast Charity. Please help by Donating http://www.justgiving.com/Lovefilm-capital-to-coast ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From cedric.jeanneret at camptocamp.com Thu May 6 16:38:45 2010 From: cedric.jeanneret at camptocamp.com (Cedric Jeanneret) Date: Thu, 6 May 2010 16:38:45 +0200 Subject: nagios.cmd does not seem to work properly Message-ID: <20100506163845.2d89891a@saya.wrk.lsn.camptocamp.com> Hello, We'd like to use nagios.cmd pipe to send some signals. According to the examples in documentation, we can do a simple thing like that : /bin/printf "[%lu] ACKNOWLEDGE_HOST_PROBLEM;host1;1;1;1;Some One;Some Acknowledgement Comment\n" $now > $commandfile The problem is, it seems it's not taken in account - no log in nagios.log, no message is printed in shell, and more over nagios doesn't seem to do anything. I tested with an error inside the command, and it seems to be parsed: [1273156151] Warning: Unrecognized external command -> SCHEDULE_HOST_SVC_CHECKs;fqdn;1273156151 but nothing else happens. Is there a way to be sure that nagios do what we ask? The main thing is: - we're using NSCA plugin so that we have about 80 servers with their own nagios, sending results and status to a single host - we tried to run this command only on remote hosts - maybe we have to use it on the nsca server ? Any help is welcome. Thank you in advance ! Best regards, C. -- C?dric Jeanneret | System Administrator 021 619 10 32 | Camptocamp SA cedric.jeanneret at camptocamp.com | PSE-A / EPFL ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Thu May 6 17:08:06 2010 From: nagios at flatto.net (Assaf Flatto) Date: Thu, 06 May 2010 16:08:06 +0100 Subject: nagios.cmd does not seem to work properly In-Reply-To: <20100506163845.2d89891a@saya.wrk.lsn.camptocamp.com> References: <20100506163845.2d89891a@saya.wrk.lsn.camptocamp.com> Message-ID: <4BE2DB56.2040906@flatto.net> Cedric Jeanneret wrote: > Hello, > > We'd like to use nagios.cmd pipe to send some signals. According to the examples in documentation, we can do a simple thing like that : > > /bin/printf "[%lu] ACKNOWLEDGE_HOST_PROBLEM;host1;1;1;1;Some One;Some Acknowledgement Comment\n" $now > $commandfile > > The problem is, it seems it's not taken in account - no log in nagios.log, no message is printed in shell, and more over nagios doesn't seem to do anything. > > I tested with an error inside the command, and it seems to be parsed: > [1273156151] Warning: Unrecognized external command -> SCHEDULE_HOST_SVC_CHECKs;fqdn;1273156151 > > but nothing else happens. > > Is there a way to be sure that nagios do what we ask? > > The main thing is: > - we're using NSCA plugin so that we have about 80 servers with their own nagios, sending results and status to a single host > - we tried to run this command only on remote hosts - maybe we have to use it on the nsca server ? > > Any help is welcome. > > Thank you in advance ! > > Best regards, > > C. > > Enable the nagios debug ( see bottom of nagios.cfg file ) and test again what you insert . Although the nagios debug file fills up and grows quite fast so you need to make sure to have sufficient space to have the input you want . from what I know - external commands are logged in the nagios log - again check the option in the nagios.cfg Assaf -- Never,Ever Cut A Deal With a Dragon I am doing a Charity Bike ride On the 27 of June for the Capital to Coast Charity. Please help by Donating http://www.justgiving.com/Lovefilm-capital-to-coast ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From shadhin71 at gmail.com Thu May 6 17:22:40 2010 From: shadhin71 at gmail.com (shadih rahman) Date: Thu, 6 May 2010 11:22:40 -0400 Subject: high host latency on nagios master In-Reply-To: References: Message-ID: try lowering max_check_result_reaper value.... I had good luck playing with that value. Thanks On Tue, May 4, 2010 at 8:13 PM, Trisha Hoang wrote: > Hi, > The nagios *master *got really high host latency and I'm not sure how to > tweak it. I ran the check_ping plugin on a handful of hosts and the rta > averaged at 0.2 second so it's not the network. > > *Environment:* > - 565 hosts > - 6790 passive checks from the slaves > - not using event broker > - master server *actively* executes the hosts checks every 5 minutes and *passively > *processes checks every 1 minute > - not doing performance data > > *Nagiostats* > > Nagios Stats 3.2.1 > Copyright (c) 2003-2008 Ethan Galstad (www.nagios.org) > Last Modified: 03-09-2010 > License: GPL > > CURRENT STATUS DATA > ------------------------------------------------------ > Status File: /var/log/nagios/status.dat > Status File Age: 0d 0h 0m 23s > Status File Version: 3.2.1 > > Program Running Time: 0d 1h 32m 19s > Nagios PID: 28282 > Used/High/Total Command Buffers: 1316 / 3066 / 4096 > > Total Services: 7745 > Services Checked: 7745 > Services Scheduled: 1381 > Services Actively Checked: 955 > Services Passively Checked: 6790 > Total Service State Change: 0.000 / 9.740 / 0.007 % > Active Service Latency: 18.948 / 205.144 / 165.751 sec > Active Service Execution Time: 0.007 / 9.051 / 0.055 sec > Active Service State Change: 0.000 / 5.460 / 0.006 % > Active Services Last 1/5/15/60 min: 0 / 0 / 0 / 0 > Passive Service Latency: 34.359 / 190.247 / 76.739 sec > Passive Service State Change: 0.000 / 9.740 / 0.008 % > Passive Services Last 1/5/15/60 min: 0 / 3054 / 6774 / 6784 > Services Ok/Warn/Unk/Crit: 7720 / 1 / 0 / 24 > Services Flapping: 27 > Services In Downtime: 0 > > Total Hosts: 566 > Hosts Checked: 566 > Hosts Scheduled: 566 > Hosts Actively Checked: 566 > Host Passively Checked: 0 > Total Host State Change: 0.000 / 0.000 / 0.000 % > Active Host Latency: 0.000 / 3410.087 / 2413.051 sec > Active Host Execution Time: 0.007 / 10.010 / 0.063 sec > Active Host State Change: 0.000 / 0.000 / 0.000 % > Active Hosts Last 1/5/15/60 min: 0 / 8 / 10 / 565 > Passive Host Latency: 0.000 / 0.000 / 0.000 sec > Passive Host State Change: 0.000 / 0.000 / 0.000 % > Passive Hosts Last 1/5/15/60 min: 0 / 0 / 0 / 0 > Hosts Up/Down/Unreach: 563 / 3 / 0 > Hosts Flapping: 1 > Hosts In Downtime: 0 > > Active Host Checks Last 1/5/15 min: 5 / 32 / 75 > Scheduled: 0 / 0 / 0 > On-demand: 5 / 32 / 75 > Parallel: 1 / 11 / 23 > Serial: 0 / 0 / 0 > Cached: 4 / 21 / 52 > Passive Host Checks Last 1/5/15 min: 0 / 0 / 0 > Active Service Checks Last 1/5/15 min: 0 / 0 / 0 > Scheduled: 0 / 0 / 0 > On-demand: 0 / 0 / 0 > Cached: 0 / 0 / 0 > Passive Service Checks Last 1/5/15 min: 2 / 1455 / 1455 > > External Commands Last 1/5/15 min: 1302 / 6063 / 20253 > > > *Nagios.cfg* > > # EXTERNAL COMMAND CHECK INTERVAL > # This is the interval at which Nagios should check for external commands. > # This value works of the interval_length you specify later. If you leave > # that at its default value of 60 (seconds), a value of 1 here will cause > # Nagios to check for external commands every minute. If you specify a > # number followed by an "s" (i.e. 15s), this will be interpreted to mean > # actual seconds rather than a multiple of the interval_length variable. > # Note: In addition to reading the external command file at regularly > # scheduled intervals, Nagios will also check for external commands after > # event handlers are executed. > # NOTE: Setting this value to -1 causes Nagios to check the external > # command file as often as possible. > > #command_check_interval=15s > command_check_interval=-1 > > # SERVICE INTER-CHECK DELAY METHOD > # This is the method that Nagios should use when initially > # "spreading out" service checks when it starts monitoring. The > # default is to use smart delay calculation, which will try to > # space all service checks out evenly to minimize CPU load. > # Using the dumb setting will cause all checks to be scheduled > # at the same time (with no delay between them)! This is not a > # good thing for production, but is useful when testing the > # parallelization functionality. > # n = None - don't use any delay between checks > # d = Use a "dumb" delay of 1 second between checks > # s = Use "smart" inter-check delay calculation > # x.xx = Use an inter-check delay of x.xx seconds > > service_inter_check_delay_method=s > > # MAXIMUM SERVICE CHECK SPREAD > # This variable determines the timeframe (in minutes) from the > # program start time that an initial check of all services should > # be completed. Default is 30 minutes. > > max_service_check_spread=30 > > # SERVICE CHECK INTERLEAVE FACTOR > # This variable determines how service checks are interleaved. > # Interleaving the service checks allows for a more even > # distribution of service checks and reduced load on remote > # hosts. Setting this value to 1 is equivalent to how versions > # of Nagios previous to 0.0.5 did service checks. Set this > # value to s (smart) for automatic calculation of the interleave > # factor unless you have a specific reason to change it. > # s = Use "smart" interleave factor calculation > # x = Use an interleave factor of x, where x is a > # number greater than or equal to 1. > > service_interleave_factor=s > > # HOST INTER-CHECK DELAY METHOD > # This is the method that Nagios should use when initially > # "spreading out" host checks when it starts monitoring. The > # default is to use smart delay calculation, which will try to > # space all host checks out evenly to minimize CPU load. > # Using the dumb setting will cause all checks to be scheduled > # at the same time (with no delay between them)! > # n = None - don't use any delay between checks > # d = Use a "dumb" delay of 1 second between checks > # s = Use "smart" inter-check delay calculation > # x.xx = Use an inter-check delay of x.xx seconds > > host_inter_check_delay_method=s > > > # MAXIMUM HOST CHECK SPREAD > # This variable determines the timeframe (in minutes) from the > # program start time that an initial check of all hosts should > # be completed. Default is 30 minutes. > > max_host_check_spread=30 > > > # MAXIMUM CONCURRENT SERVICE CHECKS > # This option allows you to specify the maximum number of > # service checks that can be run in parallel at any given time. > # Specifying a value of 1 for this variable essentially prevents > # any service checks from being parallelized. A value of 0 > # will not restrict the number of concurrent checks that are > # being executed. > > max_concurrent_checks=0 > > > # HOST AND SERVICE CHECK REAPER FREQUENCY > # This is the frequency (in seconds!) that Nagios will process > # the results of host and service checks. > > check_result_reaper_frequency=10 > > # MAX CHECK RESULT REAPER TIME > # This is the max amount of time (in seconds) that a single > # check result reaper event will be allowed to run before > # returning control back to Nagios so it can perform other > # duties. > > max_check_result_reaper_time=30 > > > # CHECK RESULT PATH > # This is directory where Nagios stores the results of host and > # service checks that have not yet been processed. > # > # Note: Make sure that only one instance of Nagios has access > # to this directory! > > check_result_path=/var/log/nagios/spool/checkresults > > > # MAX CHECK RESULT FILE AGE > # This option determines the maximum age (in seconds) which check > # result files are considered to be valid. Files older than this > # threshold will be mercilessly deleted without further processing. > > max_check_result_file_age=3600 > > > # CACHED HOST CHECK HORIZON > # This option determines the maximum amount of time (in seconds) > # that the state of a previous host check is considered current. > # Cached host states (from host checks that were performed more > # recently that the timeframe specified by this value) can immensely > # improve performance in regards to the host check logic. > # Too high of a value for this option may result in inaccurate host > # states being used by Nagios, while a lower value may result in a > # performance hit for host checks. Use a value of 0 to disable host > # check caching. > > #cached_host_check_horizon=15 > cached_host_check_horizon=60 > > # CACHED SERVICE CHECK HORIZON > # This option determines the maximum amount of time (in seconds) > # that the state of a previous service check is considered current. > # Cached service states (from service checks that were performed more > # recently that the timeframe specified by this value) can immensely > # improve performance in regards to predictive dependency checks. > # Use a value of 0 to disable service check caching. > > cached_service_check_horizon=15 > > > > # ENABLE PREDICTIVE HOST DEPENDENCY CHECKS > # This option determines whether or not Nagios will attempt to execute > # checks of hosts when it predicts that future dependency logic test > # may be needed. These predictive checks can help ensure that your > # host dependency logic works well. > # Values: > # 0 = Disable predictive checks > # 1 = Enable predictive checks (default) > > enable_predictive_host_dependency_checks=1 > > > > # ENABLE PREDICTIVE SERVICE DEPENDENCY CHECKS > # This option determines whether or not Nagios will attempt to execute > # checks of service when it predicts that future dependency logic test > # may be needed. These predictive checks can help ensure that your > # service dependency logic works well. > # Values: > # 0 = Disable predictive checks > # 1 = Enable predictive checks (default) > > enable_predictive_service_dependency_checks=1 > > # AUTO-RESCHEDULING OPTION > # This option determines whether or not Nagios will attempt to > # automatically reschedule active host and service checks to > # "smooth" them out over time. This can help balance the load on > # the monitoring server. > # WARNING: THIS IS AN EXPERIMENTAL FEATURE - IT CAN DEGRADE > # PERFORMANCE, RATHER THAN INCREASE IT, IF USED IMPROPERLY > > auto_reschedule_checks=0 > > > > # AUTO-RESCHEDULING INTERVAL > # This option determines how often (in seconds) Nagios will > # attempt to automatically reschedule checks. This option only > # has an effect if the auto_reschedule_checks option is enabled. > # Default is 30 seconds. > # WARNING: THIS IS AN EXPERIMENTAL FEATURE - IT CAN DEGRADE > # PERFORMANCE, RATHER THAN INCREASE IT, IF USED IMPROPERLY > > auto_rescheduling_interval=30 > > > > # AUTO-RESCHEDULING WINDOW > # This option determines the "window" of time (in seconds) that > # Nagios will look at when automatically rescheduling checks. > # Only host and service checks that occur in the next X seconds > # (determined by this variable) will be rescheduled. This option > # only has an effect if the auto_reschedule_checks option is > # enabled. Default is 180 seconds (3 minutes). > # WARNING: THIS IS AN EXPERIMENTAL FEATURE - IT CAN DEGRADE > # PERFORMANCE, RATHER THAN INCREASE IT, IF USED IMPROPERLY > > auto_rescheduling_window=180 > > > > # SLEEP TIME > # This is the number of seconds to sleep between checking for system > # events and service checks that need to be run. > > sleep_time=0.25 > > # TIMEOUT VALUES > # These options control how much time Nagios will allow various > # types of commands to execute before killing them off. Options > # are available for controlling maximum time allotted for > # service checks, host checks, event handlers, notifications, the > # ocsp command, and performance data commands. All values are in > # seconds. > > service_check_timeout=60 > host_check_timeout=30 > event_handler_timeout=30 > notification_timeout=30 > ocsp_timeout=5 > perfdata_timeout=5 > > # AGGRESSIVE HOST CHECKING OPTION > # If you don't want to turn on aggressive host checking features, set > # this value to 0 (the default). Otherwise set this value to 1 to > # enable the aggressive check option. Read the docs for more info > # on what aggressive host check is or check out the source code in > # base/checks.c > > use_aggressive_host_checking=0 > > > > # SERVICE CHECK EXECUTION OPTION > # This determines whether or not Nagios will actively execute > # service checks when it initially starts. If this option is > # disabled, checks are not actively made, but Nagios can still > # receive and process passive check results that come in. Unless > # you're implementing redundant hosts or have a special need for > # disabling the execution of service checks, leave this enabled! > # Values: 1 = enable checks, 0 = disable checks > > execute_service_checks=0 > > > > # PASSIVE SERVICE CHECK ACCEPTANCE OPTION > # This determines whether or not Nagios will accept passive > # service checks results when it initially (re)starts. > # Values: 1 = accept passive checks, 0 = reject passive checks > > accept_passive_service_checks=1 > > > > # HOST CHECK EXECUTION OPTION > # This determines whether or not Nagios will actively execute > # host checks when it initially starts. If this option is > # disabled, checks are not actively made, but Nagios can still > # receive and process passive check results that come in. Unless > # you're implementing redundant hosts or have a special need for > # disabling the execution of host checks, leave this enabled! > # Values: 1 = enable checks, 0 = disable checks > > execute_host_checks=1 > > # PASSIVE HOST CHECK ACCEPTANCE OPTION > # This determines whether or not Nagios will accept passive > # host checks results when it initially (re)starts. > # Values: 1 = accept passive checks, 0 = reject passive checks > > accept_passive_host_checks=0 > > # OBSESS OVER SERVICE CHECKS OPTION > # This determines whether or not Nagios will obsess over service > # checks and run the ocsp_command defined below. Unless you're > # planning on implementing distributed monitoring, do not enable > # this option. Read the HTML docs for more information on > # implementing distributed monitoring. > # Values: 1 = obsess over services, 0 = do not obsess (default) > > obsess_over_services=0 > > > > # OBSESSIVE COMPULSIVE SERVICE PROCESSOR COMMAND > # This is the command that is run for every service check that is > # processed by Nagios. This command is executed only if the > # obsess_over_services option (above) is set to 1. The command > # argument is the short name of a command definition that you > # define in your host configuration file. Read the HTML docs for > # more information on implementing distributed monitoring. > > #ocsp_command=somecommand > > > > # OBSESS OVER HOST CHECKS OPTION > # This determines whether or not Nagios will obsess over host > # checks and run the ochp_command defined below. Unless you're > # planning on implementing distributed monitoring, do not enable > # this option. Read the HTML docs for more information on > # implementing distributed monitoring. > # Values: 1 = obsess over hosts, 0 = do not obsess (default) > > obsess_over_hosts=0 > > > > # OBSESSIVE COMPULSIVE HOST PROCESSOR COMMAND > # This is the command that is run for every host check that is > # processed by Nagios. This command is executed only if the > # obsess_over_hosts option (above) is set to 1. The command > # argument is the short name of a command definition that you > # define in your host configuration file. Read the HTML docs for > # more information on implementing distributed monitoring. > > #ochp_command=somecommand > > # SERVICE FRESHNESS CHECK OPTION > # This option determines whether or not Nagios will periodically > # check the "freshness" of service results. Enabling this option > # is useful for ensuring passive checks are received in a timely > # manner. > # Values: 1 = enabled freshness checking, 0 = disable freshness checking > > check_service_freshness=1 > > > > # SERVICE FRESHNESS CHECK INTERVAL > # This setting determines how often (in seconds) Nagios will > # check the "freshness" of service check results. If you have > # disabled service freshness checking, this option has no effect. > > #service_freshness_check_interval=60 > service_freshness_check_interval=420 > > > > # HOST FRESHNESS CHECK OPTION > # This option determines whether or not Nagios will periodically > # check the "freshness" of host results. Enabling this option > # is useful for ensuring passive checks are received in a timely > # manner. > # Values: 1 = enabled freshness checking, 0 = disable freshness checking > > check_host_freshness=0 > #check_host_freshness=1 > > > > # HOST FRESHNESS CHECK INTERVAL > # This setting determines how often (in seconds) Nagios will > # check the "freshness" of host check results. If you have > # disabled host freshness checking, this option has no effect. > > #host_freshness_check_interval=60 > host_freshness_check_interval=420 > > # ADDITIONAL FRESHNESS THRESHOLD LATENCY > # This setting determines the number of seconds that Nagios > # will add to any host and service freshness thresholds that > # it calculates (those not explicitly specified by the user). > > #additional_freshness_latency=15 > additional_freshness_latency=180 > > > # LARGE INSTALLATION TWEAKS OPTION > # This option determines whether or not Nagios will take some shortcuts > # which can save on memory and CPU usage in large Nagios installations. > # Read the documentation for more information on the benefits/tradeoffs > # of enabling this option. > # Values: 1 - Enabled tweaks > # 0 - Disable tweaks (default) > > use_large_installation_tweaks=1 > > > # CHILD PROCESS MEMORY OPTION > # This option determines whether or not Nagios will free memory in > # child processes (processed used to execute system commands and host/ > # service checks). If you specify a value here, it will override > # program defaults. > # Value: 1 - Free memory in child processes > # 0 - Do not free memory in child processes > > #free_child_process_memory=1 > > # CHILD PROCESS FORKING BEHAVIOR > # This option determines how Nagios will fork child processes > # (used to execute system commands and host/service checks). Normally > # child processes are fork()ed twice, which provides a very high level > # of isolation from problems. Fork()ing once is probably enough and will > # save a great deal on CPU usage (in large installs), so you might > # want to consider using this. If you specify a value here, it will > # program defaults. > # Value: 1 - Child processes fork() twice > # 0 - Child processes fork() just once > > #child_processes_fork_twice=1 > child_processes_fork_twice=0 > > > # DEBUG LEVEL > # This option determines how much (if any) debugging information will > # be written to the debug file. OR values together to log multiple > # types of information. > # Values: > # -1 = Everything > # 0 = Nothing > # 1 = Functions > # 2 = Configuration > # 4 = Process information > # 8 = Scheduled events > # 16 = Host/service checks > # 32 = Notifications > # 64 = Event broker > # 128 = External commands > # 256 = Commands > # 512 = Scheduled downtime > # 1024 = Comments > # 2048 = Macros > > debug_level=16 > > > # DEBUG VERBOSITY > # This option determines how verbose the debug log out will be. > # Values: 0 = Brief output > # 1 = More detailed > # 2 = Very detailed > > debug_verbosity=1 > > Thanks in advance for your help. > Trisha > > > ------------------------------------------------------------------------------ > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Cordially, Shadhin Rahman -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Thu May 6 17:45:39 2010 From: patrick.morris at hp.com (Morris, Patrick) Date: Thu, 06 May 2010 08:45:39 -0700 Subject: nagios.cmd does not seem to work properly In-Reply-To: <20100506163845.2d89891a@saya.wrk.lsn.camptocamp.com> References: <20100506163845.2d89891a@saya.wrk.lsn.camptocamp.com> Message-ID: <4BE2E423.7020601@hp.com> Cedric Jeanneret wrote: > Hello, > > We'd like to use nagios.cmd pipe to send some signals. According to the examples in documentation, we can do a simple thing like that : > > /bin/printf "[%lu] ACKNOWLEDGE_HOST_PROBLEM;host1;1;1;1;Some One;Some Acknowledgement Comment\n" $now > $commandfile > > The problem is, it seems it's not taken in account - no log in nagios.log, no message is printed in shell, and more over nagios doesn't seem to do anything. > > I tested with an error inside the command, and it seems to be parsed: > [1273156151] Warning: Unrecognized external command -> SCHEDULE_HOST_SVC_CHECKs;fqdn;1273156151 > > but nothing else happens. > > Is there a way to be sure that nagios do what we ask? > > The main thing is: > - we're using NSCA plugin so that we have about 80 servers with their own nagios, sending results and status to a single host > - we tried to run this command only on remote hosts - maybe we have to use it on the nsca server ? You do, of course, have to run this on the Nagios server, since that's where the pipe is. The message in your log seems to indicate you typo'd the SCHEDULE_HOST_SVC_CHECK command. If you're using NSCA, your pipe works: it submits commands to Nagios the same way. ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mail at aidananderson.co.uk Thu May 6 18:30:57 2010 From: mail at aidananderson.co.uk (Aidan Anderson) Date: Thu, 06 May 2010 17:30:57 +0100 Subject: check_disk plugin In-Reply-To: <20100506101233.1e2cf0db@infracom.it> References: <20100505124943.131b4624@infracom.it> <20100506101233.1e2cf0db@infracom.it> Message-ID: <4BE2EEC1.6020507@aidananderson.co.uk> Davide Blasi wrote: > with or without quotes give me the same result :( > > Try using single quotes, e.g. -I '/my/fist/.*' -I '/second/.*' ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From davide.blasi at infracom.it Thu May 6 19:01:24 2010 From: davide.blasi at infracom.it (Davide Blasi) Date: Thu, 6 May 2010 19:01:24 +0200 Subject: check_disk plugin In-Reply-To: <4BE2EEC1.6020507@aidananderson.co.uk> References: <20100505124943.131b4624@infracom.it> <20100506101233.1e2cf0db@infracom.it> <4BE2EEC1.6020507@aidananderson.co.uk> Message-ID: <20100506190124.7e165af6@infracom.it> Aidan Anderson wrote : >Davide Blasi wrote: >> with or without quotes give me the same result :( >> >> > > Try using single quotes, e.g. > > -I '/my/fist/.*' -I '/second/.*' No, It doesn't works :( But I don't think that is a quote problem. If I invert path order, the check correctly accepts first argument but ignore the others. Dave ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From xml.devel at gmail.com Fri May 7 07:40:40 2010 From: xml.devel at gmail.com (Kumar, Ashish) Date: Fri, 7 May 2010 11:10:40 +0530 Subject: Meaningful subject lines in e-mail alert Message-ID: Greetings, We are sending e-mail alerts on host/service state change. I was just wondering what do you guys use as subject lines; just looking around for ideas. Thank you in advance. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From xml.devel at gmail.com Fri May 7 08:08:06 2010 From: xml.devel at gmail.com (Kumar, Ashish) Date: Fri, 7 May 2010 11:38:06 +0530 Subject: UTF-8 characters in Nagios UI Message-ID: Hello, We are currently using Nagios 3.0.6 on RHEL 5. I have checked Nagios archives already but there is something I found strange, so I am posting it here. I was testing UTF-8 support in Nagios with following are the results: 1. Created 3 filesystems /?????, /????, /??? (thank you Google translate) Results: All of them appear as it is in "Service section" in Nagios UI 2. Defined a NSCA service and sent messages in different languages with different severity level Result: E-mails are sent with UTF-8 characters perfectly fine The message appears with junk characters in "Status Information" section of Nagios UI It's pretty interesting that UTF-8 characters appear perfectly fine in "Service" section but not in "Status Information" that too in same UI. Is there a way to fix this? Thank you. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From paul.haldane at newcastle.ac.uk Fri May 7 10:31:39 2010 From: paul.haldane at newcastle.ac.uk (Paul Haldane) Date: Fri, 7 May 2010 09:31:39 +0100 Subject: Monitoring Condor servers with Nagios Message-ID: I've asked this question on the condor-users list and haven't had any response as yet so trying here. We're in early stages of updating and moving our Condor system (and putting it on a full production footing). As part of this we need to add monitoring of the system to our existing central monitoring system (using Nagios). I was surprised that I didn't find any plugins/recipes for this in either the Condor or Nagios communities (but I may not have found the right place to look yet). I've spotted some passing references to monitoring central nodes using Nagios but nothing substantial. Obviously we would knock something together using the standard Nagios plugins to check for processes etc but I can't help thinking that several people must have been through this already. Can anyone point me at references/useful sites? To be clear, at the moment I'm interesting in monitoring the system for availability and alerting system staff about problems. Recording usage is a separate project (and I've seen more leads for that sort of stuff). Thanks Paul ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From brian.omahony at curamsoftware.com Fri May 7 11:58:45 2010 From: brian.omahony at curamsoftware.com (Brian O'Mahony) Date: Fri, 7 May 2010 10:58:45 +0100 Subject: Monitor network traffic Message-ID: <86E8DA9E18BC2344BD0218BF23C88DF30143016EB1D9@MAIL06.curamsoftware.com> I want to monitor the NIC usage on a number of servers and graph them. I currently have nagiosgraph installed and running, but I am unsure of what plugin to use to perform the checks. I want to monitor the bandwidth usage of the NICs. Regards B The information in this email is confidential and may be legally privileged. It is intended solely for the addressee. Access to this email by anyone else is unauthorized. If you are not the intended recipient, any disclosure, copying, distribution or any action taken or omitted to be taken in reliance on it, is prohibited and may be unlawful. If you are not the intended addressee please contact the sender and dispose of this e-mail. Thank you. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From bparish at cognex.com Fri May 7 15:41:01 2010 From: bparish at cognex.com (Parish, Brent) Date: Fri, 7 May 2010 09:41:01 -0400 Subject: Meaningful subject lines in e-mail alert In-Reply-To: References: Message-ID: Hi. I personally like using the hostname first, then the service. Something like this for example: adams : RAM is in a CRITICAL state! It makes it easier for me to sort alerts in Outlook that way (I have a rule sending all alerts to a separate folder), so I can see everything going on with a particular server, for example. Or to view how long a certain alert has been going on. I also changed the default output to HTML for easier reading (because I send alerts both to Instant Messenger and also email). Not sure if this will come out nicely (still in HTML format) through the mailing list, but here is a sample: Alert Type PROBLEM Hostname adams (Database Server) Address 912.267.822.2 Service RAM State CRITICAL Time Thu May 6 11:17:27 EDT 2010 Info RAM CRITICAL - 3% (7 of 196 MB) free Inventory View Host RT Tickets It is hyperlinked to Nagios (host view, then service view), as well as to this host in the inventory system and to the ticketing system (see all touble tickets related to this host). That's just what works for us. - Brent From: Kumar, Ashish [mailto:xml.devel at gmail.com] Sent: Friday, May 07, 2010 1:41 AM To: nagios-users ML Subject: [Nagios-users] Meaningful subject lines in e-mail alert Greetings, We are sending e-mail alerts on host/service state change. I was just wondering what do you guys use as subject lines; just looking around for ideas. Thank you in advance. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Fri May 7 17:22:12 2010 From: patrick.morris at hp.com (Morris, Patrick) Date: Fri, 07 May 2010 08:22:12 -0700 Subject: Monitor network traffic In-Reply-To: <86E8DA9E18BC2344BD0218BF23C88DF30143016EB1D9@MAIL06.curamsoftware.com> References: <86E8DA9E18BC2344BD0218BF23C88DF30143016EB1D9@MAIL06.curamsoftware.com> Message-ID: <4BE43024.4070203@hp.com> Brian O'Mahony wrote: > > I want to monitor the NIC usage on a number of servers and graph > them. I currently have nagiosgraph installed and running, but I am > unsure of what plugin to use to perform the checks. > > > > I want to monitor the bandwidth usage of the NICs. > MRTG does a pretty good job of graphing network bandwidth usage. You can then use one of the check_mrtg plugins if you want to alert through Nagios based on thresholds. ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From richard.lynch at rasmussen.edu Fri May 7 18:49:21 2010 From: richard.lynch at rasmussen.edu (Richard Lynch) Date: Fri, 07 May 2010 11:49:21 -0500 Subject: UTF-8 characters in Nagios UI In-Reply-To: References: Message-ID: I think the status info is Ajax-ed in... If so, it?s a separate HTTP request and process to display it, so being ?different? should come as no surprise. So it may not be output by the server as UTF-8, or the Ajax routines to parse/display the output may be maniuplating it as if it?s ASCII... You could probably do some digging in LiveHeaders in FireFox to narrow down the issue a bit... You may even be able to solve it :-) On 5/7/10 1:08 AM, "Kumar, Ashish" wrote: > Hello, > > We are currently using Nagios 3.0.6 on RHEL 5. > > I have checked Nagios archives already but there is something I found strange, > so I am posting it here. > > I was testing UTF-8 support in Nagios with following are the results: > > 1.? Created 3 filesystems /?????, /????, /??? (thank you Google translate) > Results:? > All of them appear as it is in "Service section" in Nagios UI > > 2.? Defined a NSCA service and sent messages in different languages with > different severity level > Result:? > E-mails are sent with UTF-8 characters perfectly fine > The message appears with junk characters in "Status Information" section of > Nagios UI > > It's pretty interesting that UTF-8 characters appear perfectly fine in > "Service" section but not in "Status Information" that too in same UI.? Is > there a way to fix this? > > Thank you. > > > ------------------------------------------------------------------------------ > > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From xml.devel at gmail.com Mon May 10 06:47:38 2010 From: xml.devel at gmail.com (Kumar, Ashish) Date: Mon, 10 May 2010 10:17:38 +0530 Subject: Meaningful subject lines in e-mail alert In-Reply-To: References: Message-ID: > I personally like using the hostname first, then the service. Something > like this for example: > > > > adams : RAM is in a CRITICAL state! > > > > It makes it easier for me to sort alerts in Outlook that way (I have a rule > sending all alerts to a separate folder), so I can see everything going on > with a particular server, for example. Or to view how long a certain alert > has been going on. > > > > I also changed the default output to HTML for easier reading (because I > send alerts both to Instant Messenger and also email). > > Not sure if this will come out nicely (still in HTML format) through the > mailing list, but here is a sample: > > > > *Alert Type * > > PROBLEM > > *Hostname * > > adams (Database Server) > > *Address * > > 912.267.822.2 > > *Service * > > RAM > > *State * > > CRITICAL > > *Time * > > Thu May 6 11:17:27 EDT 2010 > > *Info * > > RAM CRITICAL - 3% (7 of 196 MB) free > > *Inventory * > > View Host > > *RT * > > Tickets > > > > It is hyperlinked to Nagios (host view, then service view), as well as to > this host in the inventory system and to the ticketing system (see all > touble tickets related to this host). > > > > That?s just what works for us. > > > > - Brent > > Thanks for the response Brent. The subject line we use suck a bit, it's like Nagios Alert: Host = example | System = Linux | Service Status = OK | Notification Type = RECOVERY Though we are sending alerts at nagios at example.com so it's easy to create a rule. Any more ideas? Thanks in advance. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ian at acces.co.jp Mon May 10 09:41:41 2010 From: ian at acces.co.jp (Ian Masters) Date: Mon, 10 May 2010 16:41:41 +0900 Subject: UTF-8 characters in Nagios UI In-Reply-To: References: Message-ID: <4BE7B8B5.2030006@acces.co.jp> Is this relevant? http://www.mail-archive.com/pld-cvs-commit at lists.pld-linux.org/msg195151.html Looks like it might be but I don't really understand it. I'm using Nagios Version 3.0.6 on CentOS. I wonder if this is fixed in a later version. Ian > I think the status info is Ajax-ed in... If so, it?s a separate HTTP > request and process to display it, so being ?different? should come as > no surprise. > > So it may not be output by the server as UTF-8, or the Ajax routines to > parse/display the output may be maniuplating it as if it?s ASCII... > > You could probably do some digging in LiveHeaders in FireFox to narrow > down the issue a bit... > > You may even be able to solve it :-) > > On 5/7/10 1:08 AM, "Kumar, Ashish" wrote: > > Hello, > > We are currently using Nagios 3.0.6 on RHEL 5. > > I have checked Nagios archives already but there is something I > found strange, so I am posting it here. > > I was testing UTF-8 support in Nagios with following are the results: > > 1.?? Created 3 filesystems /??????????, /????????????, /????????? > (thank you Google translate) > Results:?? > All of them appear as it is in "Service section" in Nagios UI > > 2.?? Defined a NSCA service and sent messages in different languages > with different severity level > Result:?? > E-mails are sent with UTF-8 characters perfectly fine > The message appears with junk characters in "Status Information" > section of Nagios UI > > It's pretty interesting that UTF-8 characters appear perfectly fine > in "Service" section but not in "Status Information" that too in > same UI.?? Is there a way to fix this? > > Thank you. > > ------------------------------------------------------------------------ > ------------------------------------------------------------------------------ > > ------------------------------------------------------------------------ > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > > > ------------------------------------------------------------------------------ > > > > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From brian.omahony at curamsoftware.com Mon May 10 11:44:02 2010 From: brian.omahony at curamsoftware.com (Brian O'Mahony) Date: Mon, 10 May 2010 10:44:02 +0100 Subject: Monitor network traffic In-Reply-To: <4BE43024.4070203@hp.com> References: <86E8DA9E18BC2344BD0218BF23C88DF30143016EB1D9@MAIL06.curamsoftware.com> <4BE43024.4070203@hp.com> Message-ID: <86E8DA9E18BC2344BD0218BF23C88DF30143016EB7A4@MAIL06.curamsoftware.com> Thanks Patrick What I am looking for is something like this, but I would prefer something more basic - I have only a handful of servers I want to monitor the network bandwith on, and to be honest MRTG looks like it has way more functionality than I need. I really just want something basic. Regards B -----Original Message----- From: Morris, Patrick [mailto:patrick.morris at hp.com] Sent: Friday, May 07, 2010 4:22 PM To: Nagios Users List Subject: Re: [Nagios-users] Monitor network traffic Brian O'Mahony wrote: > > I want to monitor the NIC usage on a number of servers and graph > them. I currently have nagiosgraph installed and running, but I am > unsure of what plugin to use to perform the checks. > > > > I want to monitor the bandwidth usage of the NICs. > MRTG does a pretty good job of graphing network bandwidth usage. You can then use one of the check_mrtg plugins if you want to alert through Nagios based on thresholds. ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null The information in this email is confidential and may be legally privileged. It is intended solely for the addressee. Access to this email by anyone else is unauthorized. If you are not the intended recipient, any disclosure, copying, distribution or any action taken or omitted to be taken in reliance on it, is prohibited and may be unlawful. If you are not the intended addressee please contact the sender and dispose of this e-mail. Thank you. ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From zarrelli at linux.it Mon May 10 13:22:16 2010 From: zarrelli at linux.it (Giorgio Zarrelli) Date: Mon, 10 May 2010 13:22:16 +0200 Subject: Monitor network traffic In-Reply-To: <86E8DA9E18BC2344BD0218BF23C88DF30143016EB7A4@MAIL06.curamsoftware.com> References: <86E8DA9E18BC2344BD0218BF23C88DF30143016EB1D9@MAIL06.curamsoftware.com> <4BE43024.4070203@hp.com> <86E8DA9E18BC2344BD0218BF23C88DF30143016EB7A4@MAIL06.curamsoftware.com> Message-ID: <878ED3FC-2966-415C-806C-9B82A32BAB80@linux.it> I do not see the problem. Enable snmp and use a plugin like http://www.monitoringexchange.org/inventory/Check-Plugins/Software/SNMP/check_bandwidth3 Ciao, Giorgio Il giorno 10/mag/2010, alle ore 11.44, Brian O'Mahony ha scritto: > Thanks Patrick > > What I am looking for is something like this, but I would prefer > something more basic - I have only a handful of servers I want to > monitor the network bandwith on, and to be honest MRTG looks like it > has way more functionality than I need. I really just want something > basic. > > Regards > > B > > -----Original Message----- > From: Morris, Patrick [mailto:patrick.morris at hp.com] > Sent: Friday, May 07, 2010 4:22 PM > To: Nagios Users List > Subject: Re: [Nagios-users] Monitor network traffic > > Brian O'Mahony wrote: >> >> I want to monitor the NIC usage on a number of servers and graph >> them. I currently have nagiosgraph installed and running, but I am >> unsure of what plugin to use to perform the checks. >> >> >> >> I want to monitor the bandwidth usage of the NICs. >> > > MRTG does a pretty good job of graphing network bandwidth usage. You > can then use one of the check_mrtg plugins if you want to alert > through > Nagios based on thresholds. > > --- > --- > --- > --------------------------------------------------------------------- > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > > The information in this email is confidential and may be legally > privileged. > It is intended solely for the addressee. Access to this email by > anyone else > is unauthorized. If you are not the intended recipient, any > disclosure, > copying, distribution or any action taken or omitted to be taken in > reliance > on it, is prohibited and may be unlawful. If you are not the intended > addressee please contact the sender and dispose of this e-mail. > Thank you. > > > --- > --- > --- > --------------------------------------------------------------------- > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rmp.dmd1229 at gmail.com Mon May 10 22:52:52 2010 From: rmp.dmd1229 at gmail.com (rmp dmd) Date: Mon, 10 May 2010 16:52:52 -0400 Subject: Nagios Monitor for MSA500 Message-ID: Hi, We have HP ProLiant DL380 G4 Packaged Cluster with MSA500 G2 and I need to monitor the MSA500 for not only HDD failure but also fan or PSU. This ( http://forums11.itrc.hp.com/service/forums/questionanswer.do?admit=109447626+1273520258039+28353475&threadId=1262924) indicates check_hpasm plugin will help. But I found out that it only monitor the server but not the SAN. I checked with HP support, hpaducli will generate HW report for the MSA500. So two questions: - any nagios plug-in which will check MSA500 hardware STATUS - how can I incorporate hpaducli with nrpe? Thanks! -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ari at masterwriter.com Tue May 11 00:29:29 2010 From: ari at masterwriter.com (Ari Echt) Date: Mon, 10 May 2010 15:29:29 -0700 Subject: nagios newbie looking for guidance Message-ID: <38AB0014-44A8-40DA-A949-0D7A036A9F9D@masterwriter.com> Hello, Nagios newbie here looking to setup monitoring for 4 servers. Any helpful guides out there that anyone recommends would be greatly appreciated. Thanks! ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From pradhanparas at gmail.com Tue May 11 00:40:56 2010 From: pradhanparas at gmail.com (Paras pradhan) Date: Mon, 10 May 2010 17:40:56 -0500 Subject: nagios newbie looking for guidance In-Reply-To: <38AB0014-44A8-40DA-A949-0D7A036A9F9D@masterwriter.com> References: <38AB0014-44A8-40DA-A949-0D7A036A9F9D@masterwriter.com> Message-ID: To get started easily: http://www.thegeekstuff.com/2008/05/nagios-30-jumpstart-guide-for-red-hat-overview-installation-and-configuration/ http://www.thegeekstuff.com/2008/06/how-to-monitor-remote-linux-host-using-nagios-30/ Paras. On Mon, May 10, 2010 at 5:29 PM, Ari Echt wrote: > Hello, > > Nagios newbie here looking to setup monitoring for 4 servers. Any > helpful guides out there that anyone recommends would be greatly > appreciated. > > Thanks! > > > > > > > > ------------------------------------------------------------------------------ > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mitsuto at gmail.com Tue May 11 00:43:17 2010 From: mitsuto at gmail.com (Marcel) Date: Mon, 10 May 2010 19:43:17 -0300 Subject: nagios newbie looking for guidance In-Reply-To: <38AB0014-44A8-40DA-A949-0D7A036A9F9D@masterwriter.com> References: <38AB0014-44A8-40DA-A949-0D7A036A9F9D@masterwriter.com> Message-ID: On Mon, May 10, 2010 at 7:29 PM, Ari Echt wrote: > Nagios newbie here looking to setup monitoring for 4 servers. Any > helpful guides out there that anyone recommends would be greatly > appreciated. > http://nagios.sourceforge.net/docs/3_0/ -- Assuming you're using the 3.X branch. have fun! -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From zarrelli at linux.it Tue May 11 00:48:06 2010 From: zarrelli at linux.it (Giorgio Zarrelli) Date: Tue, 11 May 2010 00:48:06 +0200 Subject: nagios newbie looking for guidance In-Reply-To: <38AB0014-44A8-40DA-A949-0D7A036A9F9D@masterwriter.com> References: <38AB0014-44A8-40DA-A949-0D7A036A9F9D@masterwriter.com> Message-ID: <0F7C0043-61FA-4241-A52F-E9BC0454FA1E@linux.it> Start with the online docs, they are really helpful Ciao, Giorgio Il giorno 11/mag/2010, alle ore 00.29, Ari Echt ha scritto: > Hello, > > Nagios newbie here looking to setup monitoring for 4 servers. Any > helpful guides out there that anyone recommends would be greatly > appreciated. > > Thanks! > > > > > > > --- > --- > --- > --------------------------------------------------------------------- > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From zarrelli at linux.it Tue May 11 00:47:22 2010 From: zarrelli at linux.it (Giorgio Zarrelli) Date: Tue, 11 May 2010 00:47:22 +0200 Subject: Nagios Monitor for MSA500 In-Reply-To: References: Message-ID: <937A0516-0A03-41CD-936D-DCD540D141E0@linux.it> Use snmp Ciao, Giorgio Il giorno 10/mag/2010, alle ore 22.52, rmp dmd ha scritto: > Hi, > > We have HP ProLiant DL380 G4 Packaged Cluster with MSA500 G2 and I > need to monitor the MSA500 for not only HDD failure but also fan or > PSU. > > This (http://forums11.itrc.hp.com/service/forums/questionanswer.do?admit=109447626+1273520258039+28353475&threadId=1262924 > ) indicates check_hpasm plugin will help. But I found out that it > only monitor the server but not the SAN. > > I checked with HP support, hpaducli will generate HW report for the > MSA500. > > So two questions: > - any nagios plug-in which will check MSA500 hardware STATUS > - how can I incorporate hpaducli with nrpe? > > Thanks! > --- > --- > --- > --------------------------------------------------------------------- > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Ward.P.Fontenot at wellsfargo.com Tue May 11 01:45:13 2010 From: Ward.P.Fontenot at wellsfargo.com (Ward.P.Fontenot at wellsfargo.com) Date: Mon, 10 May 2010 18:45:13 -0500 Subject: Odd long URL Message-ID: Hello, After scratching my head until my head fell I have come to ask this question "How can I monitor the following URL with Nagios?" http://server.domain.com/crltest/http?server=server-m3.domain.com&path=/CRL/&crl=test.crl Paul Fontenot Enterprise Key Management & Public Key Infrastructure Team Cryptographic Services|IST|TGS|TOG|Wells Fargo Voice 480.437.7795 This message may contain confidential and/or privileged information. If you are not the addressee or authorized to receive this for the addressee, you must not use, copy, disclose, or take any action based on this message or any information herein. If you have received this message in error, please advise the sender immediately by reply e-mail and delete this message. Thank you for your cooperation. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From stuart.browne at ausregistry.com.au Tue May 11 02:13:11 2010 From: stuart.browne at ausregistry.com.au (Stuart Browne) Date: Tue, 11 May 2010 10:13:11 +1000 Subject: Odd long URL In-Reply-To: References: Message-ID: <8CEF048B9EC83748B1517DC64EA130FB3E3DA065DB@off-win2003-01.ausregistrygroup.local> Does 'check_http' not work for you? From: Ward.P.Fontenot at wellsfargo.com [mailto:Ward.P.Fontenot at wellsfargo.com] Sent: Tuesday, 11 May 2010 09:45 To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] Odd long URL Hello, After scratching my head until my head fell I have come to ask this question "How can I monitor the following URL with Nagios?" http://server.domain.com/crltest/http?server=server-m3.domain.com&path=/CRL/&crl=test.crl Paul Fontenot Enterprise Key Management & Public Key Infrastructure Team Cryptographic Services|IST|TGS|TOG|Wells Fargo Voice 480.437.7795 This message may contain confidential and/or privileged information. If you are not the addressee or authorized to receive this for the addressee, you must not use, copy, disclose, or take any action based on this message or any information herein. If you have received this message in error, please advise the sender immediately by reply e-mail and delete this message. Thank you for your cooperation. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lomiz.mail at gmail.com Tue May 11 09:28:05 2010 From: lomiz.mail at gmail.com (Enrico Zimol) Date: Tue, 11 May 2010 09:28:05 +0200 Subject: nagios newbie looking for guidance In-Reply-To: <0F7C0043-61FA-4241-A52F-E9BC0454FA1E@linux.it> References: <38AB0014-44A8-40DA-A949-0D7A036A9F9D@masterwriter.com> <0F7C0043-61FA-4241-A52F-E9BC0454FA1E@linux.it> Message-ID: On 11 May 2010 00:48, Giorgio Zarrelli wrote: > Start with the online docs, they are really helpful > I agree. I'm newbie too and I'm finding too helpful -- Enrico Zimol ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lomiz.mail at gmail.com Tue May 11 09:34:04 2010 From: lomiz.mail at gmail.com (Enrico Zimol) Date: Tue, 11 May 2010 09:34:04 +0200 Subject: Odd long URL In-Reply-To: References: Message-ID: On 11 May 2010 01:45, wrote: > Hello, > ?How can I monitor the following URL with Nagios?? > http://server.domain.com/crltest/http?server=server-m3.domain.com&path=/CRL/&crl=test.crl > What do you mean with "check"? Check the result of your http request (202, 403 etc) or check the page to find an expression? -- Enrico Zimol ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From xml.devel at gmail.com Tue May 11 10:02:04 2010 From: xml.devel at gmail.com (Kumar, Ashish) Date: Tue, 11 May 2010 13:32:04 +0530 Subject: Monitor network traffic In-Reply-To: <86E8DA9E18BC2344BD0218BF23C88DF30143016EB7A4@MAIL06.curamsoftware.com> References: <86E8DA9E18BC2344BD0218BF23C88DF30143016EB1D9@MAIL06.curamsoftware.com> <4BE43024.4070203@hp.com> <86E8DA9E18BC2344BD0218BF23C88DF30143016EB7A4@MAIL06.curamsoftware.com> Message-ID: > What I am looking for is something like this, but I would prefer something > more basic - I have only a handful of servers I want to monitor the network > bandwith on, and to be honest MRTG looks like it has way more functionality > than I need. I really just want something basic. > What platforms exactly? -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From xml.devel at gmail.com Tue May 11 10:07:22 2010 From: xml.devel at gmail.com (Kumar, Ashish) Date: Tue, 11 May 2010 13:37:22 +0530 Subject: UTF-8 characters in Nagios UI In-Reply-To: <4BE7B8B5.2030006@acces.co.jp> References: <4BE7B8B5.2030006@acces.co.jp> Message-ID: > Is this relevant? > > > http://www.mail-archive.com/pld-cvs-commit at lists.pld-linux.org/msg195151.html > > Looks like it might be but I don't really understand it. > I'm using Nagios Version 3.0.6 on CentOS. I wonder if this is fixed in a > later > version. > > Ian > > > I think the status info is Ajax-ed in... If so, it?s a separate HTTP > > request and process to display it, so being ?different? should come as > > no surprise. > > > Thanks guys for head ups. Richard: There is no Ajax in whole CGI UI. I tried modifying few things in status.c but things didn't seem to work out. I will try again sometime :) Warm regards. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From math43 at gmail.com Tue May 11 14:15:57 2010 From: math43 at gmail.com (math@43) Date: Tue, 11 May 2010 14:15:57 +0200 Subject: check_snmp_netint, stats pb. Message-ID: Hello, I use to check the speed and the stae of my interfaces the plugin check_snmp_netint.pl ( http://exchange.nagios.org/directory/Plugins/Uncategorized/Software/SNMP/Advanced-Network-Interface-Check/details) with this command: /check_snmp_netint.pl -H $HOSTADDRESS$ -C $USER2$ --2 -n $ARG1$ -y -k -B -M -w$ARG2$ -c$ARG3$ So on my interface where my nagios server is directly connected (FA0/0 BI1800), the stats are show and calculated, but on the others interfaces i've this message: "no usable data - 0 rows", I don't no where the problem could come. [URL= http://photomaniak.com/upload/out.php/i964250_Debits.PNG][IMG]http://photomaniak.com/upload/out.php/t964250_Debits.PNG[/IMG][/URL ] Thanks per advance. (Srry for my bad english i'm a french user.) -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From bparish at cognex.com Tue May 11 15:43:00 2010 From: bparish at cognex.com (Parish, Brent) Date: Tue, 11 May 2010 09:43:00 -0400 Subject: check_disk plugin In-Reply-To: <20100506190124.7e165af6@infracom.it> References: <20100505124943.131b4624@infracom.it><20100506101233.1e2cf0db@infracom.it><4BE2EEC1.6020507@aidananderson.co.uk> <20100506190124.7e165af6@infracom.it> Message-ID: Hi Davide. Sorry it took so long for me to respond, I had to wait for my plugin to be approved before it would show up on Nagios Exchange. I don't know if you have already solved you disk regex problem or not, but I uploaded the one that I wrote and use if you want to give it a whirl. http://exchange.nagios.org/directory/Plugins/System-Metrics/File-System/ check_disk_snmp/details If you decide to try it out, please do read the help! (-h and for more details: -H) I have been able to use it to do something like you mentioned, but mine is not multiple "-i" arguments. Instead, it is a single "-i" followed by multiple expressions, each comma separated. e.g. your example would look like this with my script: -i '/my/fist/,/second/' HTH, Brent -----Original Message----- From: Davide Blasi [mailto:davide.blasi at infracom.it] Sent: Thursday, May 06, 2010 1:01 PM To: Nagios Users List Subject: Re: [Nagios-users] check_disk plugin Aidan Anderson wrote : >Davide Blasi wrote: >> with or without quotes give me the same result :( >> >> > > Try using single quotes, e.g. > > -I '/my/fist/.*' -I '/second/.*' No, It doesn't works :( But I don't think that is a quote problem. If I invert path order, the check correctly accepts first argument but ignore the others. Dave ------------------------------------------------------------------------ ------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lflacayo at cps.k12.il.us Tue May 11 15:49:37 2010 From: lflacayo at cps.k12.il.us (Lacayo, Luis F) Date: Tue, 11 May 2010 08:49:37 -0500 Subject: (no subject) In-Reply-To: <38AB0014-44A8-40DA-A949-0D7A036A9F9D@masterwriter.com> References: <38AB0014-44A8-40DA-A949-0D7A036A9F9D@masterwriter.com> Message-ID: Good Day every one, I am looking for a way to present the "Tactical Monitoring Overview" page as part of a dashboard (without the left hand column) can any one point me in the direction to achieve this. Thanks, Luis ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Tue May 11 16:00:20 2010 From: nagios at flatto.net (Assaf Flatto) Date: Tue, 11 May 2010 15:00:20 +0100 Subject: (no subject) In-Reply-To: References: <38AB0014-44A8-40DA-A949-0D7A036A9F9D@masterwriter.com> Message-ID: <4BE962F4.10603@flatto.net> Lacayo, Luis F wrote: > Good Day every one, > > I am looking for a way to present the "Tactical Monitoring Overview" > page as part of a dashboard (without the left hand column) can any one > point me in the direction to achieve this. > > Thanks, > > Luis > > use the direct url to call the tac.cgi i.e : http://nagios.server/nagios/tac.cgi -- Never,Ever Cut A Deal With a Dragon I am doing a Charity Bike ride On the 27 of June for the Capital to Coast Charity. Please help by Donating http://www.justgiving.com/Lovefilm-capital-to-coast ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From a31modela at hotmail.com Tue May 11 15:59:08 2010 From: a31modela at hotmail.com (steve f) Date: Tue, 11 May 2010 09:59:08 -0400 Subject: (no subject) In-Reply-To: References: <38AB0014-44A8-40DA-A949-0D7A036A9F9D@masterwriter.com>, Message-ID: If you open it as a new page, it will only show the tac screen. you can use the url as your path to the screen. Steve > Date: Tue, 11 May 2010 08:49:37 -0500 > From: lflacayo at cps.k12.il.us > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] (no subject) > > Good Day every one, > > I am looking for a way to present the "Tactical Monitoring Overview" > page as part of a dashboard (without the left hand column) can any one > point me in the direction to achieve this. > > Thanks, > > Luis > > ------------------------------------------------------------------------------ > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null _________________________________________________________________ The New Busy think 9 to 5 is a cute idea. Combine multiple calendars with Hotmail. http://www.windowslive.com/campaign/thenewbusy?tile=multicalendar&ocid=PID28326::T:WLMTAGL:ON:WL:en-US:WM_HMP:042010_5 -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From raymond.setchfield at seewhy.com Tue May 11 16:26:06 2010 From: raymond.setchfield at seewhy.com (Raymond Setchfield) Date: Tue, 11 May 2010 15:26:06 +0100 Subject: Nagios Appender - NagiosIncludeExcludeFilters.properties Message-ID: <17600.9525465362$1273588912@news.gmane.org> Hi Folks I am having some difficulty with the Filters.properties file which is used with the nagios appender for the log4j. Now the appender is working fine which is great as it speaking to nagios etc. However I am looking to use the filter file which is causing me a little bit of a grief. Now when turning this on and applying some filters to appears to be blocking everything. All the documentation which I can find the matter is pretty hazy. I am looking for someone who has used this filter and how to apply it to JBoss and block all those annoying errors. If anybody could shed some light on the matter that would be fantastic! Kindest Regards Raymond -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From a31modela at hotmail.com Tue May 11 17:21:05 2010 From: a31modela at hotmail.com (steve f) Date: Tue, 11 May 2010 11:21:05 -0400 Subject: A question on check_http to over 100 sites Message-ID: I have a need to verify that vendor urls for my company do not change. Since we only allow our locations to see approved urls, this is important. I was hoping there was some way I could use a macro to do the check_http and have all of the urls for the sites listed as the macro contents . Something like this - define url{ url_name www.test.com,www.test2.com,www.test3.com..... check_command check_http ... } define command{ command_name check_http command_line /usr/local/nagios/plugins/check_http -L $URL$ -c $ARG1$ -w $ARG2$ } Can I create a $URL$ macro for use here? Would doing something like this work? Is there an easier way for me to have a single host ( localhost) to verify the list of 100 + urls ? The plan is to do it 1 time a day & send a notification of it fails to get there. They can then update the approved list of the new url. Is it not " advisable" to use check_http in this manner? I am using it to watch a corporate now & it has been reliable. Thanks, Steve _________________________________________________________________ The New Busy is not the too busy. Combine all your e-mail accounts with Hotmail. http://www.windowslive.com/campaign/thenewbusy?tile=multiaccount&ocid=PID28326::T:WLMTAGL:ON:WL:en-US:WM_HMP:042010_4 -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Tue May 11 17:37:53 2010 From: nagios at flatto.net (Assaf Flatto) Date: Tue, 11 May 2010 16:37:53 +0100 Subject: A question on check_http to over 100 sites In-Reply-To: References: Message-ID: <4BE979D1.6060800@flatto.net> steve f wrote: > I have a need to verify that vendor urls for my company do not change. > Since we only allow our locations to see approved urls, this is important. > > I was hoping there was some way I could use a macro to do the > check_http and have all of the urls for the sites listed as the macro > contents . Something like this - > > define url{ > url_name www.test.com,www.test2.com,www.test3.com..... > check_command check_http > ... > } > > > > define command{ > command_name check_http > command_line /usr/local/nagios/plugins/check_http -L $URL$ -c > $ARG1$ -w $ARG2$ > } > > Can I create a $URL$ macro for use here? Would doing something like this work? > > Is there an easier way for me to have a single host ( localhost) to verify the list of 100 + urls ? > The plan is to do it 1 time a day & send a notification of it fails to get there. > They can then update the approved list of the new url. > > Is it not " advisable" to use check_http in this manner? I am using it to watch a corporate now > & it has been reliable. > > Thanks, > Steve > you can make it with a hostgroup and -I in the check_http request . that way you do not need to define a macro , but use existing configuration and capabilities with in nagios . use a template for all the base definitions and just add the "url" in the host name example: define host{ name url-host hostgroups url-hosts check_period 24x7 check_interval 5 retry_interval 1 max_check_attempts 3 passive_checks_enabled 1 active_checks_enabled 0 notification_period 24x7 notification_interval 60 notification_options d,r contact_groups admins check_command check-url register 0 } define command{ command_name check-url command_line $USER1$/check_http -I $HOSTADDRESS$ $ARG1$ $ARG2$ $ARG3$ } define host{ use url-host host_name foo.bar alias just some site address www.foo.bar } -- Never,Ever Cut A Deal With a Dragon I am doing a Charity Bike ride On the 27 of June for the Capital to Coast Charity. Please help by Donating http://www.justgiving.com/Lovefilm-capital-to-coast ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From flyinvap at orange.fr Tue May 11 18:02:49 2010 From: flyinvap at orange.fr (Flyinvap) Date: Tue, 11 May 2010 18:02:49 +0200 Subject: A question on check_http to over 100 sites In-Reply-To: References: Message-ID: <20100511180249.786805df@orange.fr> Le Tue, 11 May 2010 11:21:05 -0400, steve f a ?crit : > Can I create a $URL$ macro for use here? Would doing something like > this work? You can define custom object variables [1] on host and/or service. http://nagios.sourceforge.net/docs/3_0/customobjectvars.html -- Flyinvap ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Ward.P.Fontenot at wellsfargo.com Tue May 11 18:13:44 2010 From: Ward.P.Fontenot at wellsfargo.com (Ward.P.Fontenot at wellsfargo.com) Date: Tue, 11 May 2010 11:13:44 -0500 Subject: Odd long URL In-Reply-To: References: Message-ID: Check_http returns an incorrect response code, check_http returns the following [root at pkinagios plugins]# HTTP WARNING: HTTP/1.1 404 Not Found - 1099 bytes in 0.177 second response time |time=0.176506s;;;0.000000 size=1099B;;;0 While our BB server returns a 200 - which is correct. Of course I could very well be using check_http incorrectly, I have tried both of the following and neither return the correct value ./check_http http://server.domain.com/crltest/http?server=server-m3.domain.com&path=/CRL/&crl=ent.crl ./check_http -H server.domain.com -u /crltest/http?server=server-m3.domain.com&path=/CRL/&crl=ent.crl -----Original Message----- From: Enrico Zimol [mailto:lomiz.mail at gmail.com] Sent: Tuesday, May 11, 2010 12:34 AM To: Nagios Users List Subject: Re: [Nagios-users] Odd long URL On 11 May 2010 01:45, wrote: > Hello, > "How can I monitor the following URL with Nagios?" > http://server.domain.com/crltest/http?server=server-m3.domain.com&path=/CRL/&crl=test.crl > What do you mean with "check"? Check the result of your http request (202, 403 etc) or check the page to find an expression? -- Enrico Zimol ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From allanc at chickenandporn.com Tue May 11 18:18:39 2010 From: allanc at chickenandporn.com (Allan Clark) Date: Tue, 11 May 2010 12:18:39 -0400 Subject: Odd long URL In-Reply-To: References: Message-ID: On Tue, May 11, 2010 at 12:13, wrote: > Check_http returns an incorrect response code, check_http returns the > following > > [root at pkinagios plugins]# HTTP WARNING: HTTP/1.1 404 Not Found - 1099 > bytes in 0.177 second response time |time=0.176506s;;;0.000000 > size=1099B;;;0 > > While our BB server returns a 200 - which is correct. > > Of course I could very well be using check_http incorrectly, I have tried > both of the following and neither return the correct value > > ./check_http > http://server.domain.com/crltest/http?server=server-m3.domain.com&path=/CRL/&crl=ent.crl > ./check_http -H server.domain.com -u /crltest/http?server= > server-m3.domain.com&path=/CRL/&crl=ent.crl > > Can you confirm you used quotes or shell-escapes to get around the bare "&" ? -- allanc at chickenandporn.com "??" http://linkedin.com/in/goldfish please, no proprietary attachments (http://tinyurl.com/cbgq) -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From bparish at cognex.com Tue May 11 18:24:27 2010 From: bparish at cognex.com (Parish, Brent) Date: Tue, 11 May 2010 12:24:27 -0400 Subject: Odd long URL In-Reply-To: References: Message-ID: Could it be from the metacharacters on the command line? What do you get if you enclose the entire URL in single quotes? -----Original Message----- From: Ward.P.Fontenot at wellsfargo.com [mailto:Ward.P.Fontenot at wellsfargo.com] Sent: Tuesday, May 11, 2010 12:14 PM To: nagios-users at lists.sourceforge.net Subject: Re: [Nagios-users] Odd long URL Check_http returns an incorrect response code, check_http returns the following [root at pkinagios plugins]# HTTP WARNING: HTTP/1.1 404 Not Found - 1099 bytes in 0.177 second response time |time=0.176506s;;;0.000000 size=1099B;;;0 While our BB server returns a 200 - which is correct. Of course I could very well be using check_http incorrectly, I have tried both of the following and neither return the correct value ./check_http http://server.domain.com/crltest/http?server=server-m3.domain.com&path=/ CRL/&crl=ent.crl ./check_http -H server.domain.com -u /crltest/http?server=server-m3.domain.com&path=/CRL/&crl=ent.crl -----Original Message----- From: Enrico Zimol [mailto:lomiz.mail at gmail.com] Sent: Tuesday, May 11, 2010 12:34 AM To: Nagios Users List Subject: Re: [Nagios-users] Odd long URL On 11 May 2010 01:45, wrote: > Hello, > "How can I monitor the following URL with Nagios?" > http://server.domain.com/crltest/http?server=server-m3.domain.com&path=/ CRL/&crl=test.crl > What do you mean with "check"? Check the result of your http request (202, 403 etc) or check the page to find an expression? -- Enrico Zimol ------------------------------------------------------------------------ ------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------ ------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From zarrelli at linux.it Tue May 11 18:31:33 2010 From: zarrelli at linux.it (Giorgio Zarrelli) Date: Tue, 11 May 2010 18:31:33 +0200 Subject: Odd long URL In-Reply-To: References: Message-ID: Try double quoting the URL. Ciao, Giorgio Il giorno 11/mag/2010, alle ore 18.13, ha scritto: > Check_http returns an incorrect response code, check_http returns > the following > > [root at pkinagios plugins]# HTTP WARNING: HTTP/1.1 404 Not Found - > 1099 bytes in 0.177 second response time |time=0.176506s;;;0.000000 > size=1099B;;;0 > > While our BB server returns a 200 - which is correct. > > Of course I could very well be using check_http incorrectly, I have > tried both of the following and neither return the correct value > > ./check_http http://server.domain.com/crltest/http?server=server-m3.domain.com&path=/CRL/&crl=ent.crl > ./check_http -H server.domain.com -u /crltest/http?server=server- > m3.domain.com&path=/CRL/&crl=ent.crl > > -----Original Message----- > From: Enrico Zimol [mailto:lomiz.mail at gmail.com] > Sent: Tuesday, May 11, 2010 12:34 AM > To: Nagios Users List > Subject: Re: [Nagios-users] Odd long URL > > On 11 May 2010 01:45, wrote: >> Hello, >> "How can I monitor the following URL with Nagios?" >> http://server.domain.com/crltest/http?server=server-m3.domain.com&path=/CRL/&crl=test.crl >> > What do you mean with "check"? > Check the result of your http request (202, 403 etc) or check the page > to find an expression? > > -- > Enrico Zimol > > --- > --- > --- > --------------------------------------------------------------------- > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > --- > --- > --- > --------------------------------------------------------------------- > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From antonio.evangelista at dasa.com.br Tue May 11 18:16:14 2010 From: antonio.evangelista at dasa.com.br (Antonio Fernando Evangelista) Date: Tue, 11 May 2010 13:16:14 -0300 Subject: Nagios plugin problem using check_nrpe Message-ID: <1A9DE6C4B5CE36478FB34187441360FA043333EF8F@SVAPHEXC01> Hi. I'm writing a new Nagios plugin that will be responsible to monitor the server internal components temperature. I have a Proliant DL380 G5 with the HP Management CLI for Linux installed on it, and I wrote a perl program to check the Temperature. When I execute the perl script directly from the command prompt, I receive a message indicating the behavior of the Temperature. But, when I call this script perl using check_nrpe command, I received the same message, but the information of the temperature stays in blank. Following are my nrpe.cfg configuration file and the perl script. Does anybody know how can I fix this? NRPE.CFG pid_file=/var/run/nrpe.pid server_port=5666 nrpe_user=nagios nrpe_group=nagios allowed_hosts=172.16.4.56,localhost dont_blame_nrpe=0 debug=1 command_timeout=60 connection_timeout=300 command[check_load]=/usr/local/nagios/libexec/check_load -w 3,3,3 -c 4,4,4 command[check_sda1]=/usr/local/nagios/libexec/check_disk -w 15% -c 10% -p /dev/mapper/vg01-lvol1 command[check_sda2]=/usr/local/nagios/libexec/check_disk -w 15% -c 10% -p /dev/cciss/c0d0p1 command[check_sda3]=/usr/local/nagios/libexec/check_disk -w 15% -c 10% -p /dev/mapper/vg01-lvol6 command[check_sda4]=/usr/local/nagios/libexec/check_disk -w 15% -c 10% -p /dev/mapper/vg01-lvol4 command[check_sda5]=/usr/local/nagios/libexec/check_disk -w 15% -c 10% -p /dev/mapper/vg01-lvol3 command[check_sda6]=/usr/local/nagios/libexec/check_disk -w 15% -c 10% -p /dev/mapper/vg01-lvol2 command[check_sda7]=/usr/local/nagios/libexec/check_disk -w 15% -c 10% -p /dev/sda7 command[check_sda8]=/usr/local/nagios/libexec/check_disk -w 15% -c 10% -p /dev/sda8 command[check_sda9]=/usr/local/nagios/libexec/check_disk -w 15% -c 10% -p /dev/sda9 command[check_sda10]=/usr/local/nagios/libexec/check_disk -w 15% -c 10% -p /dev/sda10 command[check_drbd0]=/usr/local/nagios/libexec/check_disk -w 15% -c 10% -p /dev/drbd0 command[check_drbd1]=/usr/local/nagios/libexec/check_disk -w 15% -c 10% -p /dev/drbd1 command[check_zombie_procs]=/usr/local/nagios/libexec/check_procs -w 2 -c 4 -s Z command[check_informix]=/usr/local/nagios/libexec/check_procs -c 1:50 -C oninit command[check_crond]=/usr/local/nagios/libexec/check_procs -c 1:50 -C crond command[check_cupsd]=/usr/local/nagios/libexec/check_procs -c 1:50 -C cupsd command[check_freemem]=/usr/local/nagios/libexec/check_mem.pl -w 10 -c 5 -f command[check_swap]=/usr/local/nagios/libexec/check_swap -w 90% -c 80% command[check_bonding]=/usr/local/nagios/libexec/check_linux_bonding command[check_iozone]=/usr/local/nagios/libexec/check_temperature_hp.pl -s1 -w80 -c85 command[check_ambient]=/usr/local/nagios/libexec/check_temperature_hp.pl -s2 -w80 -c85 command[check_sensor1_cpu1]=/usr/local/nagios/libexec/check_temperature_hp.pl -s3 -w80 -c85 command[check_sensor2_cpu1]=/usr/local/nagios/libexec/check_temperature_hp.pl -s4 -w80 -c85 command[check_teste]=/usr/local/nagios/libexec/check_teste.pl -s 4 -w 80 -c 85 The Perl program: #! /usr/bin/perl -w # $Id: check_teste.pl use lib "/usr/local/nagios/libexec" ; # Tell Perl what we need to use use strict; use Getopt::Std; # Predefined exit codes for Nagios use vars qw($opt_s $opt_c $opt_f $opt_u $opt_w $opt_C $opt_v %exit_codes); #use vars qw($numero $descricao $tempatual $templimite $sensor); %exit_codes = ('UNKNOWN' ,-1, 'OK' , 0, 'WARNING' , 1, 'CRITICAL', 2, ); # Get our variables, do our checking: init(); get_temperature(); # Show usage sub usage() { print "\ncheck_mem.pl v1.0 - Nagios Plugin\n\n"; print "usage:\n"; print " check_mem.pl - -w -c \n\n"; print "options:\n"; print " -f Check FREE memory\n"; print " -u Check USED memory\n"; print " -C Count OS caches as FREE memory\n"; print " -w PERCENT Percent free/used when to warn\n"; print " -c PERCENT Percent free/used when critical\n"; print "\nCopyright (C) 2000 Dan Larsson \n"; print "check_mem.pl comes with absolutely NO WARRANTY either implied or explicit\n"; print "This program is licensed under the terms of the\n"; print "GNU General Public License (check source code for details)\n"; exit $exit_codes{'UNKNOWN'}; } #----------------------------------------------------------------------------- # Verifica parametros passados #----------------------------------------------------------------------------- sub init { # Get the options if ($#ARGV le 0) { &usage; } else { getopts('c:s:w:vh'); } # Shortcircuit the switches if (!$opt_w or $opt_w == 0 or !$opt_c or $opt_c == 0) { print "*** You must define WARN and CRITICAL levels!\n"; &usage; } elsif (!$opt_s) { print "*** You must select the SENSOR number!\n"; &usage; } # Check if levels are sane if ($opt_w >= $opt_c) { print "*** WARN level must not be greather than CRITICAL!\n"; &usage; } } #----------------------------------------------------------------------------- # Termina o programa enviando mensagem e postando RC #----------------------------------------------------------------------------- sub finish { my ($msg,$state) = @_; print "$msg\n"; exit $state; } #----------------------------------------------------------------------------- # Checa a temperatura do sensor solicitado #----------------------------------------------------------------------------- sub get_temperature { my $numero=0; my $descricao=0; my $tempatual=0; my $templimite=0; my $sensor=0; my $command=0; $sensor = $opt_s; $command=sprintf("/usr/local/nagios/libexec/check_temp_atual.sh %s",$sensor); $tempatual = `$command`; chomp($tempatual); $command=sprintf("/usr/local/nagios/libexec/check_temp_thre.sh %s",$sensor); $templimite = `$command`; chomp($templimite); $numero = $opt_s; my $twarning = $templimite - ($templimite-($templimite * ($opt_w / 100))); my $tcritical =$templimite - ($templimite-($templimite * ($opt_c / 100))); if($tempatual < $twarning) { print STDOUT "OK: TEMPERATURA ATUAL: $tempatual C - THRESHOLD: $templimite C\n"; exit(0); } elsif($tempatual < $templimite) { print STDOUT "WARNING: TEMPERATURA ATUAL: $tempatual C - THRESHOLD: $templimite C\n"; exit(1); } else { print STDOUT "CRITICAL: TEMPERATURA ATUAL: $tempatual C - THRESHOLD: $templimite C\n"; exit(2); } } If I execute the script directly from the command prompt: [root at lnxaphmoa04 libexec]# /usr/local/nagios/libexec/check_teste.pl -s4 -w80 -c85 OK: TEMPERATURA ATUAL: 30 C - THRESHOLD: 127 C If I execute the script calling through the check_nrpe command: [root at lnxaphmoa04 libexec]# /usr/local/nagios/libexec/check_nrpe -H localhost -c check_teste CRITICAL: TEMPERATURA ATUAL: C - THRESHOLD: C Thanks in advance. Regards, ________________________________ Antonio Evangelista Especialista de Sistemas - Infraestrutura - TI DASA - www.dasa.com.br e-mail: antonio.evangelista at dasa.com.br Tel.: (11) 4197-5450 Nextel: (11) 7719-4643 - 55*6013*626 Esta mensagem, incluindo seus anexos, pode conter informa??es privilegiadas e/ou de car?ter confidencial, n?o podendo ser retransmitida sem autoriza??o do remetente. Se voc? n?o ? o destinat?rio ou pessoa autorizada a receb?-la, informamos que o seu uso, divulga??o, c?pia ou arquivamento s?o proibidos. Portanto, se voc? recebeu esta mensagem por engano, por favor, nos informe respondendo imediatamente a este e-mail e em seguida apague-a. This message may contain confidential and/or privileged information. If you are not the addressee or authorized to receive this for the addressee, you must not use, copy, disclose or take any action based on this message or any information herein. If you have received this message in error, please advise the sender immediately by reply e-mail and delete this message. Thank you for your cooperation. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Ward.P.Fontenot at wellsfargo.com Tue May 11 18:37:26 2010 From: Ward.P.Fontenot at wellsfargo.com (Ward.P.Fontenot at wellsfargo.com) Date: Tue, 11 May 2010 11:37:26 -0500 Subject: Odd long URL In-Reply-To: References: Message-ID: Thanks everyone, for future reference it worked like this for me ./check_http -H server.domain.com -u "/crltest/http?server=server-m3.domain.com&path=/CRL/&crl=ent.crl" -----Original Message----- From: Giorgio Zarrelli [mailto:zarrelli at linux.it] Sent: Tuesday, May 11, 2010 9:32 AM To: Nagios Users List Cc: Subject: Re: [Nagios-users] Odd long URL Try double quoting the URL. Ciao, Giorgio Il giorno 11/mag/2010, alle ore 18.13, ha scritto: > Check_http returns an incorrect response code, check_http returns > the following > > [root at pkinagios plugins]# HTTP WARNING: HTTP/1.1 404 Not Found - > 1099 bytes in 0.177 second response time |time=0.176506s;;;0.000000 > size=1099B;;;0 > > While our BB server returns a 200 - which is correct. > > Of course I could very well be using check_http incorrectly, I have > tried both of the following and neither return the correct value > > ./check_http http://server.domain.com/crltest/http?server=server-m3.domain.com&path=/CRL/&crl=ent.crl > ./check_http -H server.domain.com -u /crltest/http?server=server- > m3.domain.com&path=/CRL/&crl=ent.crl > > -----Original Message----- > From: Enrico Zimol [mailto:lomiz.mail at gmail.com] > Sent: Tuesday, May 11, 2010 12:34 AM > To: Nagios Users List > Subject: Re: [Nagios-users] Odd long URL > > On 11 May 2010 01:45, wrote: >> Hello, >> "How can I monitor the following URL with Nagios?" >> http://server.domain.com/crltest/http?server=server-m3.domain.com&path=/CRL/&crl=test.crl >> > What do you mean with "check"? > Check the result of your http request (202, 403 etc) or check the page > to find an expression? > > -- > Enrico Zimol > > --- > --- > --- > --------------------------------------------------------------------- > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > --- > --- > --- > --------------------------------------------------------------------- > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Tue May 11 18:46:37 2010 From: nagios at flatto.net (Assaf Flatto) Date: Tue, 11 May 2010 17:46:37 +0100 Subject: Nagios plugin problem using check_nrpe In-Reply-To: <1A9DE6C4B5CE36478FB34187441360FA043333EF8F@SVAPHEXC01> References: <1A9DE6C4B5CE36478FB34187441360FA043333EF8F@SVAPHEXC01> Message-ID: <4BE989ED.5040109@flatto.net> Antonio Fernando Evangelista wrote: > > Hi. > > I?m writing a new Nagios plugin that will be responsible to monitor > the server internal components temperature. > > I have a Proliant DL380 G5 with the HP Management CLI for Linux > installed on it, and I wrote a perl program to check the > > Temperature. > > When I execute the perl script directly from the command prompt, I > receive a message indicating the behavior of the > > Temperature. But, when I call this script perl using check_nrpe > command, I received the same message, but the information > > of the temperature stays in blank. > > Following are my nrpe.cfg configuration file and the perl script. > > Does anybody know how can I fix this? > > *_NRPE.CFG_* > > pid_file=/var/run/nrpe.pid > > server_port=5666 > > nrpe_user=nagios > > nrpe_group=nagios > > allowed_hosts=172.16.4.56,localhost > > dont_blame_nrpe=0 > > debug=1 > > command_timeout=60 > > connection_timeout=300 > > command[check_load]=/usr/local/nagios/libexec/check_load -w 3,3,3 -c 4,4,4 > > command[check_sda1]=/usr/local/nagios/libexec/check_disk -w 15% -c 10% > -p /dev/mapper/vg01-lvol1 > > command[check_sda2]=/usr/local/nagios/libexec/check_disk -w 15% -c 10% > -p /dev/cciss/c0d0p1 > > command[check_sda3]=/usr/local/nagios/libexec/check_disk -w 15% -c 10% > -p /dev/mapper/vg01-lvol6 > > command[check_sda4]=/usr/local/nagios/libexec/check_disk -w 15% -c 10% > -p /dev/mapper/vg01-lvol4 > > command[check_sda5]=/usr/local/nagios/libexec/check_disk -w 15% -c 10% > -p /dev/mapper/vg01-lvol3 > > command[check_sda6]=/usr/local/nagios/libexec/check_disk -w 15% -c 10% > -p /dev/mapper/vg01-lvol2 > > command[check_sda7]=/usr/local/nagios/libexec/check_disk -w 15% -c 10% > -p /dev/sda7 > > command[check_sda8]=/usr/local/nagios/libexec/check_disk -w 15% -c 10% > -p /dev/sda8 > > command[check_sda9]=/usr/local/nagios/libexec/check_disk -w 15% -c 10% > -p /dev/sda9 > > command[check_sda10]=/usr/local/nagios/libexec/check_disk -w 15% -c > 10% -p /dev/sda10 > > command[check_drbd0]=/usr/local/nagios/libexec/check_disk -w 15% -c > 10% -p /dev/drbd0 > > command[check_drbd1]=/usr/local/nagios/libexec/check_disk -w 15% -c > 10% -p /dev/drbd1 > > command[check_zombie_procs]=/usr/local/nagios/libexec/check_procs -w 2 > -c 4 -s Z > > command[check_informix]=/usr/local/nagios/libexec/check_procs -c 1:50 > -C oninit > > command[check_crond]=/usr/local/nagios/libexec/check_procs -c 1:50 -C > crond > > command[check_cupsd]=/usr/local/nagios/libexec/check_procs -c 1:50 -C > cupsd > > command[check_freemem]=/usr/local/nagios/libexec/check_mem.pl -w 10 -c > 5 -f > > command[check_swap]=/usr/local/nagios/libexec/check_swap -w 90% -c 80% > > command[check_bonding]=/usr/local/nagios/libexec/check_linux_bonding > > command[check_iozone]=/usr/local/nagios/libexec/check_temperature_hp.pl > -s1 -w80 -c85 > > command[check_ambient]=/usr/local/nagios/libexec/check_temperature_hp.pl > -s2 -w80 -c85 > > command[check_sensor1_cpu1]=/usr/local/nagios/libexec/check_temperature_hp.pl > -s3 -w80 -c85 > > command[check_sensor2_cpu1]=/usr/local/nagios/libexec/check_temperature_hp.pl > -s4 -w80 -c85 > > command[check_teste]=/usr/local/nagios/libexec/check_teste.pl -s 4 -w > 80 -c 85 > > *_The Perl program:_* > > #! /usr/bin/perl -w > > # $Id: check_teste.pl > > use lib "/usr/local/nagios/libexec" ; > > # Tell Perl what we need to use > > use strict; > > use Getopt::Std; > > # Predefined exit codes for Nagios > > use vars qw($opt_s $opt_c $opt_f $opt_u $opt_w $opt_C $opt_v %exit_codes); > > #use vars qw($numero $descricao $tempatual $templimite $sensor); > > %exit_codes = ('UNKNOWN' ,-1, > > 'OK' , 0, > > 'WARNING' , 1, > > 'CRITICAL', 2, > > ); > > # Get our variables, do our checking: > > init(); > > get_temperature(); > > # Show usage > > sub usage() { > > print "\ncheck_mem.pl v1.0 - Nagios Plugin\n\n"; > > print "usage:\n"; > > print " check_mem.pl - -w -c \n\n"; > > print "options:\n"; > > print " -f Check FREE memory\n"; > > print " -u Check USED memory\n"; > > print " -C Count OS caches as FREE memory\n"; > > print " -w PERCENT Percent free/used when to warn\n"; > > print " -c PERCENT Percent free/used when critical\n"; > > print "\nCopyright (C) 2000 Dan Larsson \n"; > > print "check_mem.pl comes with absolutely NO WARRANTY either implied > or explicit\n"; > > print "This program is licensed under the terms of the\n"; > > print "GNU General Public License (check source code for details)\n"; > > exit $exit_codes{'UNKNOWN'}; > > } > > #----------------------------------------------------------------------------- > > # Verifica parametros passados > > #----------------------------------------------------------------------------- > > sub init { > > # Get the options > > if ($#ARGV le 0) { > > &usage; > > } > > else { > > getopts('c:s:w:vh'); > > } > > # Shortcircuit the switches > > if (!$opt_w or $opt_w == 0 or !$opt_c or $opt_c == 0) { > > print "*** You must define WARN and CRITICAL levels!\n"; > > &usage; > > } > > elsif (!$opt_s) { > > print "*** You must select the SENSOR number!\n"; > > &usage; > > } > > # Check if levels are sane > > if ($opt_w >= $opt_c) { > > print "*** WARN level must not be greather than CRITICAL!\n"; > > &usage; > > } > > } > > #----------------------------------------------------------------------------- > > # Termina o programa enviando mensagem e postando RC > > #----------------------------------------------------------------------------- > > sub finish { > > my ($msg,$state) = @_; > > print "$msg\n"; > > exit $state; > > } > > #----------------------------------------------------------------------------- > > # Checa a temperatura do sensor solicitado > > #----------------------------------------------------------------------------- > > sub get_temperature { > > my $numero=0; > > my $descricao=0; > > my $tempatual=0; > > my $templimite=0; > > my $sensor=0; > > my $command=0; > > $sensor = $opt_s; > > $command=sprintf("/usr/local/nagios/libexec/check_temp_atual.sh > %s",$sensor); > > $tempatual = `$command`; > > chomp($tempatual); > > $command=sprintf("/usr/local/nagios/libexec/check_temp_thre.sh > %s",$sensor); > > $templimite = `$command`; > > chomp($templimite); > > $numero = $opt_s; > > my $twarning = $templimite - ($templimite-($templimite * ($opt_w / 100))); > > my $tcritical =$templimite - ($templimite-($templimite * ($opt_c / 100))); > > if($tempatual < $twarning) > > { > > print STDOUT "OK: TEMPERATURA ATUAL: $tempatual C - THRESHOLD: > $templimite C\n"; > > exit(0); > > } > > elsif($tempatual < $templimite) > > { > > print STDOUT "WARNING: TEMPERATURA ATUAL: $tempatual C - THRESHOLD: > $templimite C\n"; > > exit(1); > > } > > else > > { > > print STDOUT "CRITICAL: TEMPERATURA ATUAL: $tempatual C - THRESHOLD: > $templimite C\n"; > > exit(2); > > } > > } > > *If I execute the script directly from the command prompt:* > > [root at lnxaphmoa04 libexec]# /usr/local/nagios/libexec/check_teste.pl > -s4 -w80 -c85 > > OK: TEMPERATURA ATUAL: 30 C - THRESHOLD: 127 C > > *If I execute the script calling through the check_nrpe command:* > > [root at lnxaphmoa04 libexec]# /usr/local/nagios/libexec/check_nrpe -H > localhost -c check_teste > > CRITICAL: TEMPERATURA ATUAL: C - THRESHOLD: C > > Thanks in advance. > > Regards, > Make sure your script have sudo permissions . most system calls to hardware require root level access , so when the nrpe is running the script as nagios user , it is not executed properly and hence the output is not generated and rendered. Assaf -- Never,Ever Cut A Deal With a Dragon I am doing a Charity Bike ride On the 27 of June for the Capital to Coast Charity. Please help by Donating http://www.justgiving.com/Lovefilm-capital-to-coast ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From smcafee at collaborativefusion.com Tue May 11 18:51:51 2010 From: smcafee at collaborativefusion.com (Sean McAfee) Date: Tue, 11 May 2010 12:51:51 -0400 Subject: Nagios plugin problem using check_nrpe In-Reply-To: <1A9DE6C4B5CE36478FB34187441360FA043333EF8F@SVAPHEXC01> References: <1A9DE6C4B5CE36478FB34187441360FA043333EF8F@SVAPHEXC01> Message-ID: <4BE98B27.8020700@collaborativefusion.com> Antonio Fernando Evangelista wrote: > [root at lnxaphmoa04 libexec]# /usr/local/nagios/libexec/check_teste.pl -s4 > -w80 -c85 > > OK: TEMPERATURA ATUAL: 30 C - THRESHOLD: 127 C > > *If I execute the script calling through the check_nrpe command:* > > [root at lnxaphmoa04 libexec]# /usr/local/nagios/libexec/check_nrpe -H > localhost -c check_teste > > CRITICAL: TEMPERATURA ATUAL: C - THRESHOLD: C You're running the check as root, but NRPE executes checks as the nagios user. You'll probably be able to replicate the empty variables f you run: [root at lnxaphmoa04 libexec]# sudo -H -u nagios /usr/local/nagios/libexec/check_teste.pl -s4 -w80 -c85 The solution is: 1. Add permissions in sudoers for the nagios user to exec that script as root without a password. The following line would allow the nagios user to do passwordless sudo on any plugins: nagios ALL=(root) NOPASSWD:/usr/lcoal/nagios/libexec/ 2. Change your nrpe.cfg to be: command[check_teste]=/path/to/sudobinary /usr/local/nagios/libexec/check_teste.pl -s 4 -w 80 -c 85 3. Restart npre -- Sean McAfee Senior Systems Engineer ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From richard.lynch at rasmussen.edu Tue May 11 19:27:49 2010 From: richard.lynch at rasmussen.edu (Richard Lynch) Date: Tue, 11 May 2010 12:27:49 -0500 Subject: UTF-8 characters in Nagios UI In-Reply-To: References: Message-ID: As I look at the ?Technical Monitoring Overview? frame on my laptop proof of concept install, there is no DOCTYPE, and the encoding is ISO-8859-1... #fail On 5/11/10 3:07 AM, "Kumar, Ashish" wrote: > >> Is this relevant? >> >> http://www.mail-archive.com/pld-cvs-commit at lists.pld-linux.org/msg195151.html >> >> Looks like it might be but I don't really understand it. >> I'm using Nagios Version 3.0.6 on CentOS. I wonder if this is fixed in a >> later >> version. >> >> Ian >> >>> > I think the status info is Ajax-ed in... If so, it?s a separate HTTP >>> > request and process to display it, so being ?different? should come as >>> > no surprise. >>> > > > > Thanks guys for head ups.? > > Richard: There is no Ajax in whole CGI UI.? I tried modifying few things in > status.c but things didn't seem to work out.? I will try again sometime :) > > Warm regards. > > > > > > > > > ------------------------------------------------------------------------------ > > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From antonio.evangelista at dasa.com.br Tue May 11 19:52:41 2010 From: antonio.evangelista at dasa.com.br (Antonio Fernando Evangelista) Date: Tue, 11 May 2010 14:52:41 -0300 Subject: RES: Nagios plugin problem using check_nrpe In-Reply-To: <4BE98B27.8020700@collaborativefusion.com> References: <1A9DE6C4B5CE36478FB34187441360FA043333EF8F@SVAPHEXC01> <4BE98B27.8020700@collaborativefusion.com> Message-ID: <1A9DE6C4B5CE36478FB34187441360FA043333EFC8@SVAPHEXC01> Hi All, I solved my problem including enabling the user nagios to execute the command /sbin/hpasmcli as root user. Thank all of you. Regards, ________________________________ Antonio Evangelista Especialista de Sistemas - Infraestrutura - TI DASA - www.dasa.com.br e-mail: antonio.evangelista at dasa.com.br Tel.: (11) 4197-5450 Nextel: (11) 7719-4643 - 55*6013*626 -----Mensagem original----- De: Sean McAfee [mailto:smcafee at collaborativefusion.com] Enviada em: ter?a-feira, 11 de maio de 2010 13:52 Para: Nagios Users List Assunto: Re: [Nagios-users] Nagios plugin problem using check_nrpe Antonio Fernando Evangelista wrote: > [root at lnxaphmoa04 libexec]# /usr/local/nagios/libexec/check_teste.pl -s4 > -w80 -c85 > > OK: TEMPERATURA ATUAL: 30 C - THRESHOLD: 127 C > > *If I execute the script calling through the check_nrpe command:* > > [root at lnxaphmoa04 libexec]# /usr/local/nagios/libexec/check_nrpe -H > localhost -c check_teste > > CRITICAL: TEMPERATURA ATUAL: C - THRESHOLD: C You're running the check as root, but NRPE executes checks as the nagios user. You'll probably be able to replicate the empty variables f you run: [root at lnxaphmoa04 libexec]# sudo -H -u nagios /usr/local/nagios/libexec/check_teste.pl -s4 -w80 -c85 The solution is: 1. Add permissions in sudoers for the nagios user to exec that script as root without a password. The following line would allow the nagios user to do passwordless sudo on any plugins: nagios ALL=(root) NOPASSWD:/usr/lcoal/nagios/libexec/ 2. Change your nrpe.cfg to be: command[check_teste]=/path/to/sudobinary /usr/local/nagios/libexec/check_teste.pl -s 4 -w 80 -c 85 3. Restart npre -- Sean McAfee Senior Systems Engineer ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null Esta mensagem, incluindo seus anexos, pode conter informa??es privilegiadas e/ou de car?ter confidencial, n?o podendo ser retransmitida sem autoriza??o do remetente. Se voc? n?o ? o destinat?rio ou pessoa autorizada a receb?-la, informamos que o seu uso, divulga??o, c?pia ou arquivamento s?o proibidos. Portanto, se voc? recebeu esta mensagem por engano, por favor, nos informe respondendo imediatamente a este e-mail e em seguida apague-a. This message may contain confidential and/or privileged information. If you are not the addressee or authorized to receive this for the addressee, you must not use, copy, disclose or take any action based on this message or any information herein. If you have received this message in error, please advise the sender immediately by reply e-mail and delete this message. Thank you for your cooperation. ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lscarneiro at veltrac.com.br Tue May 11 20:01:35 2010 From: lscarneiro at veltrac.com.br (Leonardo Carneiro - Veltrac) Date: Tue, 11 May 2010 15:01:35 -0300 Subject: 2 nagios Message-ID: <4BE99B7F.2050003@veltrac.com.br> Hi everyone, I have two datacenters. One of then is already being monitored by nagios, and i'll start the install in the second one this week. Is it possible to get then both talking to each other so i can access all data through a single web interface? Tks in advance. -- Leonardo Carneiro ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mark.frost1 at pepsico.com Tue May 11 20:25:45 2010 From: mark.frost1 at pepsico.com (Frost, Mark {PBC}) Date: Tue, 11 May 2010 14:25:45 -0400 Subject: turning off service inheritance of host settings? Message-ID: I don't suppose there's any way (short of changing the source and recompiling) to turn off the "feature" of inheriting host settings to services? This is one thing I've found *really* annoying about 3.2.0 and would like to have a way to turn it off. I didn't see anything in the docs or in the nagios.cfg file that let me turn this behavior on or off or something I could put in a host or service setting that would let me disable it. Thanks Mark ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mitsuto at gmail.com Tue May 11 21:51:43 2010 From: mitsuto at gmail.com (Marcel) Date: Tue, 11 May 2010 16:51:43 -0300 Subject: 2 nagios In-Reply-To: <4BE99B7F.2050003@veltrac.com.br> References: <4BE99B7F.2050003@veltrac.com.br> Message-ID: Yeah, read about Distributed/Redundant setup and the OCSP/OCHP command definitions. There you'll find your answer! Happy monitoring! On Tue, May 11, 2010 at 3:01 PM, Leonardo Carneiro - Veltrac < lscarneiro at veltrac.com.br> wrote: > Hi everyone, > > I have two datacenters. One of then is already being monitored by > nagios, and i'll start the install in the second one this week. Is it > possible to get then both talking to each other so i can access all data > through a single web interface? Tks in advance. > -- > Leonardo Carneiro > > > ------------------------------------------------------------------------------ > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From cyruspy at gmail.com Tue May 11 23:03:07 2010 From: cyruspy at gmail.com (Ciro Iriarte) Date: Tue, 11 May 2010 17:03:07 -0400 Subject: NRPE compilation on HPUX In-Reply-To: References: Message-ID: 2010/2/11 Frost, Mark {PBG} : > > > It's kind of strange, but I know I had the problem you're having at one > point, but I no longer seem to have it on HPUX 11.31 (IA64).? I wonder if > there was some OS patch that was applied that took care of that.? The OS > level on our servers is shown as "HP-UX B.11.31 U ia64 1200835863" if that > helps.? swlist does show that we have an installed tcp wrapper package, > however: > > > > ? HPUX-TCPWRAP????????????????????????? B.11.31??????? HPUX TCPWrapper > daemon > > > > 'nm' shows that there is a fromhost() entity in /usr/lib/libwrap.sl > (libwrap.1) > > > > Unless I'm mistaken, your issue is with the libwrap library which I think > exists on HPUX, but isn't complete as far as NRPE is concerned.? I never > spent a lot of time trying to find a generalized solution to this, but > instead kind of hacked something into nrpe.c where I think I just commented > out some of the #ifdef HAVE_LIBWRAP code segments.? Note that the fromhost() > call you're missing is in one of the HAVE_LIBRWRAP sections.? I believe that > NRPE expects that .? We don?t use TCP wrappers anyway, so it's not a loss. > > > > I first encountered this issue with PA-RISC HPUX and I believe the way I got > around was to download the latest tcp wrapper code from the internet, then > pulled out the function that I was missing (I don't recall if it was from > host > > > > My current issue with NRPE on HPUX is twofold -- the lack of some syslog > capabilities, and that the configure script doesn't correctly find our SSL > libraries.? I have made the following 2 hacky changes to make ours work: > > > > > > *** configure.old?????? Mon Mar 31 13:49:11 2008 > > --- configure?? Thu Feb 11 11:07:10 2010 > > *************** > > *** 6676,6686 **** > > ??????????????????????? fi > > ??????????????? done > > > > !?????????????? if test x_$found_ssl != x_yes; then > > !?????????????????????? { { echo "$as_me:$LINENO: error: Cannot find ssl > libraries" >&5 > > ! echo "$as_me: error: Cannot find ssl libraries" >&2;} > > !??? { (exit 1); exit 1; }; } > > !?????????????? else > > ??????????????????????? printf "SSL libraries found in $ssllibdir\n"; > > > > ??????????????????????? LDFLAGS="$LDFLAGS -L$ssllibdir"; > > --- 6676,6686 ---- > > ??????????????????????? fi > > ??????????????? done > > > > ! #???????????? if test x_$found_ssl != x_yes; then > > ! #???????????????????? { { echo "$as_me:$LINENO: error: Cannot find ssl > libraries" >&5 > > ! #echo "$as_me: error: Cannot find ssl libraries" >&2;} > > ! #?? { (exit 1); exit 1; }; } > > ! #?? ??????????else > > ??????????????????????? printf "SSL libraries found in $ssllibdir\n"; > > > > ??????????????????????? LDFLAGS="$LDFLAGS -L$ssllibdir"; > > *************** > > *** 6701,6707 **** > > ??????????????????????? fi > > ??????????????????????? # awk to strip off meta data at bottom of dhparam > output > > ??????????????????????? $sslbin dhparam -C 512 | awk '/^-----/ {exit} > {print}' > include/dh.h > > !?????????????? fi > > ??????? fi > > > > ??????????????? echo "$as_me:$LINENO: checking for Kerberos include files" >>&5 > > --- 6701,6707 ---- > > ?? ?????????????????????fi > > ??????????????????????? # awk to strip off meta data at bottom of dhparam > output > > ??????????????????????? $sslbin dhparam -C 512 | awk '/^-----/ {exit} > {print}' > include/dh.h > > ! #???????????? fi > > ??????? fi > > > > ??????????????? echo "$as_me:$LINENO: checking for Kerberos include files" >>&5 > > > > > > > > > > > > *** src/nrpe.c.old????? Mon Mar 31 14:02:59 2008 > > --- src/nrpe.c? Thu Feb 11 11:18:34 2010 > > *************** > > *** 613,622 **** > > ??????????????? log_facility=LOG_UUCP; > > ??????? else if(!strcmp(varvalue,"cron")) > > ??????????????? log_facility=LOG_CRON; > > !?????? else if(!strcmp(varvalue,"authpriv")) > > ??????????????? log_facility=LOG_AUTHPRIV; > > ??????? else if(!strcmp(varvalue,"ftp")) > > !?????????????? log_facility=LOG_FTP; > > ??????? else if(!strcmp(varvalue,"local0")) > > ???? ???????????log_facility=LOG_LOCAL0; > > ??????? else if(!strcmp(varvalue,"local1")) > > --- 613,622 ---- > > ??????????????? log_facility=LOG_UUCP; > > ??????? else if(!strcmp(varvalue,"cron")) > > ??????????????? log_facility=LOG_CRON; > > ! /*??? else if(!strcmp(varvalue,"authpriv")) > > ??????????????? log_facility=LOG_AUTHPRIV; > > ??????? else if(!strcmp(varvalue,"ftp")) > > !?????????????? log_facility=LOG_FTP; */ > > ??????? else if(!strcmp(varvalue,"local0")) > > ??????????????? log_facility=LOG_LOCAL0; > > ??????? else if(!strcmp(varvalue,"local1")) > > > Hope that's helpful. > > > > Mark Thanks a lot for your feedback, I finally had some time to finish this pending task and took some notes about the procedure. Ref: http://cyruspy.wordpress.com/2010/05/11/compiling-nrpe-in-hpux-11-31-2/ Regards, -- Ciro Iriarte http://cyruspy.wordpress.com -- ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Wed May 12 12:29:05 2010 From: ae at op5.se (Andreas Ericsson) Date: Wed, 12 May 2010 12:29:05 +0200 Subject: turning off service inheritance of host settings? In-Reply-To: References: Message-ID: <4BEA82F1.1050702@op5.se> On 05/11/2010 08:25 PM, Frost, Mark {PBC} wrote: > > I don't suppose there's any way (short of changing the source and > recompiling) to turn off the "feature" of inheriting host settings to > services? Nope. But a patch to make it optional with a setting in nagios.cfg, preferrably overridable on a per-service basis, would be accepted if it was well-written. > This is one thing I've found *really* annoying about 3.2.0 > and would like to have a way to turn it off. I didn't see anything > in the docs or in the nagios.cfg file that let me turn this behavior > on or off or something I could put in a host or service setting that > would let me disable it. > A lot of other people find it really convenient, which is why the change was introduced in the first place. I guess you can't please everyone. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace. ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lscarneiro at veltrac.com.br Wed May 12 13:14:13 2010 From: lscarneiro at veltrac.com.br (Leonardo Carneiro - Veltrac) Date: Wed, 12 May 2010 08:14:13 -0300 Subject: 2 nagios In-Reply-To: References: <4BE99B7F.2050003@veltrac.com.br> Message-ID: <4BEA8D85.3040101@veltrac.com.br> Marcel wrote: > Yeah, read about Distributed/Redundant setup and the OCSP/OCHP command > definitions. There you'll find your answer! > > Happy monitoring! > > On Tue, May 11, 2010 at 3:01 PM, Leonardo Carneiro - Veltrac > > wrote: > > Hi everyone, > > I have two datacenters. One of then is already being monitored by > nagios, and i'll start the install in the second one this week. Is it > possible to get then both talking to each other so i can access > all data > through a single web interface? Tks in advance. > -- > Leonardo Carneiro > Hi Marcel and others in the list, tks for pointing me in the right direction. I'll try that. ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From brian.omahony at curamsoftware.com Thu May 13 15:28:44 2010 From: brian.omahony at curamsoftware.com (Brian O'Mahony) Date: Thu, 13 May 2010 14:28:44 +0100 Subject: Monitor network traffic In-Reply-To: References: <86E8DA9E18BC2344BD0218BF23C88DF30143016EB1D9@MAIL06.curamsoftware.com> <4BE43024.4070203@hp.com> <86E8DA9E18BC2344BD0218BF23C88DF30143016EB7A4@MAIL06.curamsoftware.com> Message-ID: <86E8DA9E18BC2344BD0218BF23C88DF3014301B14171@MAIL06.curamsoftware.com> Apologies was out sick the last few days. Currently RHEL5.4, but it would be good to have the option for Debian (3/4), AIX, HPUX (RISC & Itanium, both 11.23/11.31), Solaris, and zLinux (Suse11). However none of these are a requirement - more of nice option.... B From: Kumar, Ashish [mailto:xml.devel at gmail.com] Sent: Tuesday, May 11, 2010 9:02 AM To: Nagios Users List; Brian O'Mahony Subject: Re: [Nagios-users] Monitor network traffic What I am looking for is something like this, but I would prefer something more basic - I have only a handful of servers I want to monitor the network bandwith on, and to be honest MRTG looks like it has way more functionality than I need. I really just want something basic. What platforms exactly? The information in this email is confidential and may be legally privileged. It is intended solely for the addressee. Access to this email by anyone else is unauthorized. If you are not the intended recipient, any disclosure, copying, distribution or any action taken or omitted to be taken in reliance on it, is prohibited and may be unlawful. If you are not the intended addressee please contact the sender and dispose of this e-mail. Thank you. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From brian.omahony at curamsoftware.com Thu May 13 17:41:41 2010 From: brian.omahony at curamsoftware.com (Brian O'Mahony) Date: Thu, 13 May 2010 16:41:41 +0100 Subject: Monitor network traffic In-Reply-To: References: <86E8DA9E18BC2344BD0218BF23C88DF30143016EB1D9@MAIL06.curamsoftware.com> <4BE43024.4070203@hp.com> <86E8DA9E18BC2344BD0218BF23C88DF30143016EB7A4@MAIL06.curamsoftware.com> <86E8DA9E18BC2344BD0218BF23C88DF3014301B14171@MAIL06.curamsoftware.com> Message-ID: <86E8DA9E18BC2344BD0218BF23C88DF3014301B14272@MAIL06.curamsoftware.com> Wow That would be great, thanks. B From: Kumar, Ashish [mailto:xml.devel at gmail.com] Sent: Thursday, May 13, 2010 4:39 PM To: Brian O'Mahony Cc: Nagios Users List Subject: Re: [Nagios-users] Monitor network traffic Currently RHEL5.4, but it would be good to have the option for Debian (3/4), AIX, HPUX (RISC & Itanium, both 11.23/11.31), Solaris, and zLinux (Suse11). However none of these are a requirement - more of nice option.... B I have written plugins for AIX and Sun Solaris. I will try tomorrow morning for HP-UX. For Red Hat I am facing a little problem. Apparently there is no way to figure out the NIC's speed (e.g. 100Mbps, 1Gbps) as a non-privileged user (mii-tool, ethtool). I will share the plugins once I finish with it. The information in this email is confidential and may be legally privileged. It is intended solely for the addressee. Access to this email by anyone else is unauthorized. If you are not the intended recipient, any disclosure, copying, distribution or any action taken or omitted to be taken in reliance on it, is prohibited and may be unlawful. If you are not the intended addressee please contact the sender and dispose of this e-mail. Thank you. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From xml.devel at gmail.com Thu May 13 17:39:07 2010 From: xml.devel at gmail.com (Kumar, Ashish) Date: Thu, 13 May 2010 21:09:07 +0530 Subject: Monitor network traffic In-Reply-To: <86E8DA9E18BC2344BD0218BF23C88DF3014301B14171@MAIL06.curamsoftware.com> References: <86E8DA9E18BC2344BD0218BF23C88DF30143016EB1D9@MAIL06.curamsoftware.com> <4BE43024.4070203@hp.com> <86E8DA9E18BC2344BD0218BF23C88DF30143016EB7A4@MAIL06.curamsoftware.com> <86E8DA9E18BC2344BD0218BF23C88DF3014301B14171@MAIL06.curamsoftware.com> Message-ID: Currently RHEL5.4, but it would be good to have the option for Debian (3/4), > AIX, HPUX (RISC & Itanium, both 11.23/11.31), Solaris, and zLinux (Suse11). > > > > However none of these are a requirement ? more of nice option?. > > > > B > I have written plugins for AIX and Sun Solaris. I will try tomorrow morning for HP-UX. For Red Hat I am facing a little problem. Apparently there is no way to figure out the NIC's speed (e.g. 100Mbps, 1Gbps) as a non-privileged user (mii-tool, ethtool). I will share the plugins once I finish with it. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From akpgeek at gmail.com Thu May 13 18:36:31 2010 From: akpgeek at gmail.com (akp geek) Date: Thu, 13 May 2010 12:36:31 -0400 Subject: notification when a file is found Message-ID: Dear all - We are using nagios. I would like to know if there is a way notification can be sent if there is a new file created in a directory? Can you please help? Regards -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From vcolombo at gmail.com Thu May 13 18:46:34 2010 From: vcolombo at gmail.com (Vincent Colombo) Date: Thu, 13 May 2010 11:46:34 -0500 Subject: notification when a file is found In-Reply-To: References: Message-ID: http://exchange.nagios.org/directory/Plugins/System-Metrics/File-System/check_newfiles-2Esh/details On Thu, May 13, 2010 at 11:36 AM, akp geek wrote: > Dear all - > > We are using nagios. I would like to know if there is a way > notification can be sent if there is a new file created in a directory? Can > you please help? > > Regards > > > ------------------------------------------------------------------------------ > > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Thu May 13 18:57:54 2010 From: patrick.morris at hp.com (Morris, Patrick) Date: Thu, 13 May 2010 09:57:54 -0700 Subject: notification when a file is found In-Reply-To: References: Message-ID: <4BEC2F92.3010601@hp.com> akp geek wrote: > Dear all - > > We are using nagios. I would like to know if there is a > way notification can be sent if there is a new file created in a > directory? Can you please help? It's incredibly easy to write a Nagios plugin. The only major requirement is that it return a valid error code so that Nagios can interpret the results. Something like this would work: -- start check_file.sh -- #!/bin/sh # $1 = Full path of filename to watch for $ERRORCODE=0 if [ -e $1 ]; then $ERRORCODE=1 fi exit $ERRORCODE -- end check_file.sh -- ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From akpgeek at gmail.com Thu May 13 19:58:11 2010 From: akpgeek at gmail.com (akp geek) Date: Thu, 13 May 2010 13:58:11 -0400 Subject: notification when a file is found In-Reply-To: References: Message-ID: Thanks a lot. I am working on it Regards On Thu, May 13, 2010 at 12:46 PM, Vincent Colombo wrote: > > http://exchange.nagios.org/directory/Plugins/System-Metrics/File-System/check_newfiles-2Esh/details > > On Thu, May 13, 2010 at 11:36 AM, akp geek wrote: > >> Dear all - >> >> We are using nagios. I would like to know if there is a way >> notification can be sent if there is a new file created in a directory? Can >> you please help? >> >> Regards >> >> >> ------------------------------------------------------------------------------ >> >> >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when >> reporting any issue. >> ::: Messages without supporting info will risk being sent to /dev/null >> > > > > ------------------------------------------------------------------------------ > > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From AHKAPLAN at PARTNERS.ORG Thu May 13 19:51:07 2010 From: AHKAPLAN at PARTNERS.ORG (Kaplan, Andrew H.) Date: Thu, 13 May 2010 13:51:07 -0400 Subject: Host down due to plugins timing out Message-ID: Hi there -- I recently set up a new client to be monitored by our Nagios 3.1.2 server. The client is running the CentOS 5.4 operating system, and it has version 2.6 of NRPE as well as version 2.12 of the plugins installed on it. I made sure to deactivate the firewall on the system, disable the SELinux option, and also modified the /etc/services file to include port 5666 with udp and tcp. The problem that I am having is that even though everything appears to be running properly, the requests that are sent from the Nagios server to the client are timing out, with the result being the client appearing as down. The client has the nrpe program running with the following configuration: /usr/local/nrpe/bin/nrpe -n -c /usr/local/nrpe/etc/nrpe.cfg -d I verified the plugins are working on the client, and also confirmed there are no other clients being reported as down. The configuration that I described is not unique to this client. This same setup is working for other clients that are monitored by the server. Does anyone have any idea as to why this is occurring, and what I can do to correct it? Thanks. The information in this e-mail is intended only for the person to whom it is addressed. If you believe this e-mail was sent to you in error and the e-mail contains patient information, please contact the Partners Compliance HelpLine at http://www.partners.org/complianceline . If the e-mail was sent to you in error but does not contain patient information, please contact the sender and properly dispose of the e-mail. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Matthew.Stier at us.fujitsu.com Thu May 13 21:26:50 2010 From: Matthew.Stier at us.fujitsu.com (Stier, Matthew) Date: Thu, 13 May 2010 14:26:50 -0500 Subject: Host down due to plugins timing out In-Reply-To: References: Message-ID: <63F73C973E3E4547979026ECC295EF5C02678E05@rchemxp01.fnc.net.local> The default timeout for nrpe is 10 seconds. How long does your plugin take to run? From: Kaplan, Andrew H. [mailto:AHKAPLAN at PARTNERS.ORG] Sent: Thursday, May 13, 2010 1:51 PM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] Host down due to plugins timing out Hi there -- I recently set up a new client to be monitored by our Nagios 3.1.2 server. The client is running the CentOS 5.4 operating system, and it has version 2.6 of NRPE as well as version 2.12 of the plugins installed on it. I made sure to deactivate the firewall on the system, disable the SELinux option, and also modified the /etc/services file to include port 5666 with udp and tcp. The problem that I am having is that even though everything appears to be running properly, the requests that are sent from the Nagios server to the client are timing out, with the result being the client appearing as down. The client has the nrpe program running with the following configuration: /usr/local/nrpe/bin/nrpe -n -c /usr/local/nrpe/etc/nrpe.cfg -d I verified the plugins are working on the client, and also confirmed there are no other clients being reported as down. The configuration that I described is not unique to this client. This same setup is working for other clients that are monitored by the server. Does anyone have any idea as to why this is occurring, and what I can do to correct it? Thanks. The information in this e-mail is intended only for the person to whom it is addressed. If you believe this e-mail was sent to you in error and the e-mail contains patient information, please contact the Partners Compliance HelpLine at http://www.partners.org/complianceline . If the e-mail was sent to you in error but does not contain patient information, please contact the sender and properly dispose of the e-mail. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From AHKAPLAN at PARTNERS.ORG Thu May 13 21:44:00 2010 From: AHKAPLAN at PARTNERS.ORG (Kaplan, Andrew H.) Date: Thu, 13 May 2010 15:44:00 -0400 Subject: Host down due to plugins timing out In-Reply-To: <63F73C973E3E4547979026ECC295EF5C02678E05@rchemxp01.fnc.net.local> References: <63F73C973E3E4547979026ECC295EF5C02678E05@rchemxp01.fnc.net.local> Message-ID: Hi there -- The command_timeout option in the client's nrpe.cfg file is set to 60. The syntax of the line is shown below: command_timeout=60 ________________________________ From: Stier, Matthew [mailto:Matthew.Stier at us.fujitsu.com] Sent: Thursday, May 13, 2010 3:27 PM To: Nagios Users List Subject: Re: [Nagios-users] Host down due to plugins timing out The default timeout for nrpe is 10 seconds. How long does your plugin take to run? From: Kaplan, Andrew H. [mailto:AHKAPLAN at PARTNERS.ORG] Sent: Thursday, May 13, 2010 1:51 PM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] Host down due to plugins timing out Hi there -- I recently set up a new client to be monitored by our Nagios 3.1.2 server. The client is running the CentOS 5.4 operating system, and it has version 2.6 of NRPE as well as version 2.12 of the plugins installed on it. I made sure to deactivate the firewall on the system, disable the SELinux option, and also modified the /etc/services file to include port 5666 with udp and tcp. The problem that I am having is that even though everything appears to be running properly, the requests that are sent from the Nagios server to the client are timing out, with the result being the client appearing as down. The client has the nrpe program running with the following configuration: /usr/local/nrpe/bin/nrpe -n -c /usr/local/nrpe/etc/nrpe.cfg -d I verified the plugins are working on the client, and also confirmed there are no other clients being reported as down. The configuration that I described is not unique to this client. This same setup is working for other clients that are monitored by the server. Does anyone have any idea as to why this is occurring, and what I can do to correct it? Thanks. The information in this e-mail is intended only for the person to whom it is addressed. If you believe this e-mail was sent to you in error and the e-mail contains patient information, please contact the Partners Compliance HelpLine at http://www.partners.org/complianceline . If the e-mail was sent to you in error but does not contain patient information, please contact the sender and properly dispose of the e-mail. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From AHKAPLAN at PARTNERS.ORG Thu May 13 21:53:47 2010 From: AHKAPLAN at PARTNERS.ORG (Kaplan, Andrew H.) Date: Thu, 13 May 2010 15:53:47 -0400 Subject: Host down due to plugins timing out In-Reply-To: <63F73C973E3E4547979026ECC295EF5C02678E05@rchemxp01.fnc.net.local> References: <63F73C973E3E4547979026ECC295EF5C02678E05@rchemxp01.fnc.net.local> Message-ID: I was able to solve the problem. I checked the ip address of the client system in the hosts.cfg file on the Nagios server, and one of the digits in the address was incorrect. Once I corrected the mistake, everything worked fine. Thanks for the help. ________________________________ From: Stier, Matthew [mailto:Matthew.Stier at us.fujitsu.com] Sent: Thursday, May 13, 2010 3:27 PM To: Nagios Users List Subject: Re: [Nagios-users] Host down due to plugins timing out The default timeout for nrpe is 10 seconds. How long does your plugin take to run? From: Kaplan, Andrew H. [mailto:AHKAPLAN at PARTNERS.ORG] Sent: Thursday, May 13, 2010 1:51 PM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] Host down due to plugins timing out Hi there -- I recently set up a new client to be monitored by our Nagios 3.1.2 server. The client is running the CentOS 5.4 operating system, and it has version 2.6 of NRPE as well as version 2.12 of the plugins installed on it. I made sure to deactivate the firewall on the system, disable the SELinux option, and also modified the /etc/services file to include port 5666 with udp and tcp. The problem that I am having is that even though everything appears to be running properly, the requests that are sent from the Nagios server to the client are timing out, with the result being the client appearing as down. The client has the nrpe program running with the following configuration: /usr/local/nrpe/bin/nrpe -n -c /usr/local/nrpe/etc/nrpe.cfg -d I verified the plugins are working on the client, and also confirmed there are no other clients being reported as down. The configuration that I described is not unique to this client. This same setup is working for other clients that are monitored by the server. Does anyone have any idea as to why this is occurring, and what I can do to correct it? Thanks. The information in this e-mail is intended only for the person to whom it is addressed. If you believe this e-mail was sent to you in error and the e-mail contains patient information, please contact the Partners Compliance HelpLine at http://www.partners.org/complianceline . If the e-mail was sent to you in error but does not contain patient information, please contact the sender and properly dispose of the e-mail. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From scot at wilcoxon.org Thu May 13 21:50:02 2010 From: scot at wilcoxon.org (Scot Wilcoxon) Date: Thu, 13 May 2010 14:50:02 -0500 Subject: How to override a command definition? Message-ID: <1273780202.3087.10.camel@sewilco-laptop> I want to make all check_http commands use a different timeout. How can I replace the check_http command without altering the original definition file? I tried redefining the command in the directory where I have my local configuration changes, but I get the following error: Warning: Duplicate definition found for command 'check_http' (config file '/etc/nagios3/conf.d/commands_local.cfg', starting on line 13) I'm trying to override the existing definition, because I'd rather not alter the configuration which is part of the installation package. ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From stuart.browne at ausregistry.com.au Fri May 14 01:11:32 2010 From: stuart.browne at ausregistry.com.au (Stuart Browne) Date: Fri, 14 May 2010 09:11:32 +1000 Subject: How to override a command definition? In-Reply-To: <1273780202.3087.10.camel@sewilco-laptop> References: <1273780202.3087.10.camel@sewilco-laptop> Message-ID: <8CEF048B9EC83748B1517DC64EA130FB3E3DA065E8@off-win2003-01.ausregistrygroup.local> > -----Original Message----- > From: Scot Wilcoxon [mailto:scot at wilcoxon.org] > > I want to make all check_http commands use a different timeout. How > can > I replace the check_http command without altering the original > definition file? > > I tried redefining the command in the directory where I have my local > configuration changes, but I get the following error: > Warning: Duplicate definition found for command 'check_http' > (config > file '/etc/nagios3/conf.d/commands_local.cfg', starting on line 13) > > I'm trying to override the existing definition, because I'd rather not > alter the configuration which is part of the installation package. You can't. You should just call it something new. ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lists at xodus.org Fri May 14 04:25:20 2010 From: lists at xodus.org (Marc Powell) Date: Thu, 13 May 2010 21:25:20 -0500 Subject: How to override a command definition? In-Reply-To: <1273780202.3087.10.camel@sewilco-laptop> References: <1273780202.3087.10.camel@sewilco-laptop> Message-ID: <91B7E3EE-BE74-4DE0-9B7B-DC3CE779F126@xodus.org> On May 13, 2010, at 2:50 PM, Scot Wilcoxon wrote: > I'm trying to override the existing definition, because I'd rather not > alter the configuration which is part of the installation package. Just comment out the original if you want to keep it there as a reference. -- Marc ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From srdan.dukic at gmail.com Fri May 14 04:34:28 2010 From: srdan.dukic at gmail.com (Srdan Dukic) Date: Fri, 14 May 2010 14:34:28 +1200 Subject: Using NSClient++ NRPE to check a local URI Message-ID: Hi, I'm trying to get Nagios to check that a website/URI returns a document when it is requested from a client machine. It should do this using NRPE, and I should mention that the client machine is a Windows box running NSClient++ e.g. Check that getting http://local02/Service.svc?wsdl returns a XML document instead of a 404 Error Now, I know that I could implement this using an 'External Script' but I was wondering if there was already a plugin in any of the existing modules which would allow this kind of setup. I've read the documentation and can't find anything, but wanted to confirm. Thank you -- Sr?an ?uki? -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From scot at wilcoxon.org Fri May 14 05:06:18 2010 From: scot at wilcoxon.org (Scot Wilcoxon) Date: Thu, 13 May 2010 22:06:18 -0500 Subject: How to override a command definition? In-Reply-To: <91B7E3EE-BE74-4DE0-9B7B-DC3CE779F126@xodus.org> References: <1273780202.3087.10.camel@sewilco-laptop> <91B7E3EE-BE74-4DE0-9B7B-DC3CE779F126@xodus.org> Message-ID: <1273806378.3087.55.camel@sewilco-laptop> On Thu, 2010-05-13 at 21:25 -0500, Marc Powell wrote: > On May 13, 2010, at 2:50 PM, Scot Wilcoxon wrote: > > > I'm trying to override the existing definition, because I'd rather not > > alter the configuration which is part of the installation package. > > Just comment out the original if you want to keep it there as a reference. I know how to alter the files, but I was hoping to leave the original config file alone so the system updates could replace it if necessary. Apparently Nagios is missing the ability to replace previous definitions (also demonstrated in the problem of "undefined" causing specific behavior but not being able to undefine, such as for a no-ping host). I also haven't noticed a defined priority for configuration info, something which would be needed for an override. ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Ward.P.Fontenot at wellsfargo.com Fri May 14 08:13:39 2010 From: Ward.P.Fontenot at wellsfargo.com (Ward.P.Fontenot at wellsfargo.com) Date: Fri, 14 May 2010 01:13:39 -0500 Subject: Using NSClient++ NRPE to check a local URI In-Reply-To: References: Message-ID: NRPE for that? Is there a requirement that prevents you from using check_http from the Nagios server? From: Srdan Dukic [mailto:srdan.dukic at gmail.com] Sent: Thursday, May 13, 2010 7:34 PM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] Using NSClient++ NRPE to check a local URI Hi, I'm trying to get Nagios to check that a website/URI returns a document when it is requested from a client machine. It should do this using NRPE, and I should mention that the client machine is a Windows box running NSClient++ e.g. Check that getting http://local02/Service.svc?wsdl returns a XML document instead of a 404 Error Now, I know that I could implement this using an 'External Script' but I was wondering if there was already a plugin in any of the existing modules which would allow this kind of setup. I've read the documentation and can't find anything, but wanted to confirm. Thank you -- Sr?an ?uki? -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Fri May 14 09:13:43 2010 From: patrick.morris at hp.com (Morris, Patrick) Date: Fri, 14 May 2010 00:13:43 -0700 Subject: How to override a command definition? In-Reply-To: <1273806378.3087.55.camel@sewilco-laptop> References: <1273780202.3087.10.camel@sewilco-laptop> <91B7E3EE-BE74-4DE0-9B7B-DC3CE779F126@xodus.org> <1273806378.3087.55.camel@sewilco-laptop> Message-ID: <4BECF827.2000908@hp.com> Scot Wilcoxon wrote: > On Thu, 2010-05-13 at 21:25 -0500, Marc Powell wrote: > >> On May 13, 2010, at 2:50 PM, Scot Wilcoxon wrote: >> >> >>> I'm trying to override the existing definition, because I'd rather not >>> alter the configuration which is part of the installation package. >>> >> Just comment out the original if you want to keep it there as a reference. >> > > I know how to alter the files, but I was hoping to leave the original > config file alone so the system updates could replace it if necessary. > > Apparently Nagios is missing the ability to replace previous definitions > (also demonstrated in the problem of "undefined" causing specific > behavior but not being able to undefine, such as for a no-ping host). I > also haven't noticed a defined priority for configuration info, > something which would be needed for an override. Read the section of the docs on templates, which *do* allow overrides. Your problem seems to be that you want to override a (sample?) configuration partially supplied by someone else (your OS distribution, I'd guess) that wasn't designed to be overriden. That's not a shortcoming of Nagios, though; it's a result of how the configs you're using were defined, or the way you're interpreting them. ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From prasanth.katakam at gmail.com Fri May 14 11:52:12 2010 From: prasanth.katakam at gmail.com (prasanth katakam) Date: Fri, 14 May 2010 15:22:12 +0530 Subject: 403 forbidden error Message-ID: Dear team, i was new to nagios, after installing when i try to open through browser i got an error like 403 forbidden you dont have permission to access /nagios/ on this server httpd error log -------------------- [root at localhost conf]# cat /var/log/httpd/error_log [Fri May 14 14:15:12 2010] [error] [client 127.0.0.1] Directory index forbidden by Options directive: /var/www/html/ [Fri May 14 14:18:20 2010] [error] [client 127.0.0.1] Directory index forbidden by Options directive: /usr/local/nagios/share/ [Fri May 14 14:18:41 2010] [error] [client 127.0.0.1] Directory index forbidden by Options directive: /usr/local/nagios/share/ httpd.conf ---------------- [root at localhost conf]# vi /etc/httpd/conf/httpd.conf Options ExecCGI AllowOverride None Order allow,deny Allow from all AuthName "Nagios Access" AuthType Basic AuthUserFile /usr/local/nagios/etc/htpasswd.users Require valid-user Alias /nagios /usr/local/nagios/share Options None AllowOverride None Order allow,deny Allow from all AuthName "Nagios Access" AuthType Basic AuthUserFile /usr/local/nagios/etc/htpasswd.users Require valid-user kindly do the needfull Prasanth.Katakam 9966655246 -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Kristan.Webb at bbc.co.uk Fri May 14 12:38:05 2010 From: Kristan.Webb at bbc.co.uk (Kristan Webb) Date: Fri, 14 May 2010 11:38:05 +0100 Subject: 403 forbidden error In-Reply-To: References: Message-ID: <469133DFA60D7C4DB131A09BCB270A5602B2109F@bbcxue501.national.core.bbc.co.uk> Try Options +ExecCGI ie: Options +ExecCGI AllowOverride None Order allow,deny Allow from all AuthName "Nagios Access" AuthType Basic AuthUserFile /usr/local/nagios/etc/htpasswd.users Require valid-user Alias /nagios /usr/local/nagios/share Options +ExecCGI AllowOverride None Order allow,deny Allow from all AuthName "Nagios Access" AuthType Basic AuthUserFile /usr/local/nagios/etc/htpasswd.users Require valid-user Kris ________________________________ From: prasanth katakam [mailto:prasanth.katakam at gmail.com] Sent: 14 May 2010 10:52 To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] 403 forbidden error Dear team, i was new to nagios, after installing when i try to open through browser i got an error like 403 forbidden you dont have permission to access /nagios/ on this server httpd error log -------------------- [root at localhost conf]# cat /var/log/httpd/error_log [Fri May 14 14:15:12 2010] [error] [client 127.0.0.1] Directory index forbidden by Options directive: /var/www/html/ [Fri May 14 14:18:20 2010] [error] [client 127.0.0.1] Directory index forbidden by Options directive: /usr/local/nagios/share/ [Fri May 14 14:18:41 2010] [error] [client 127.0.0.1] Directory index forbidden by Options directive: /usr/local/nagios/share/ httpd.conf ---------------- [root at localhost conf]# vi /etc/httpd/conf/httpd.conf Options ExecCGI AllowOverride None Order allow,deny Allow from all AuthName "Nagios Access" AuthType Basic AuthUserFile /usr/local/nagios/etc/htpasswd.users Require valid-user Alias /nagios /usr/local/nagios/share Options None AllowOverride None Order allow,deny Allow from all AuthName "Nagios Access" AuthType Basic AuthUserFile /usr/local/nagios/etc/htpasswd.users Require valid-user kindly do the needfull Prasanth.Katakam 9966655246 http://www.bbc.co.uk/ This e-mail (and any attachments) is confidential and may contain personal views which are not the views of the BBC unless specifically stated. If you have received it in error, please delete it from your system. Do not use, copy or disclose the information in any way nor act in reliance on it and notify the sender immediately. Please note that the BBC monitors e-mails sent or received. Further communication will signify your consent to this. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From prasanth.katakam at gmail.com Fri May 14 13:35:03 2010 From: prasanth.katakam at gmail.com (prasanth katakam) Date: Fri, 14 May 2010 17:05:03 +0530 Subject: 403 forbidden error In-Reply-To: <469133DFA60D7C4DB131A09BCB270A5602B2109F@bbcxue501.national.core.bbc.co.uk> References: <469133DFA60D7C4DB131A09BCB270A5602B2109F@bbcxue501.national.core.bbc.co.uk> Message-ID: Dear kris, i tried using --Options +ExecCGI-- but same error repeated 403 forbidden you dont have permission to access /nagios/ on this server httpd error log -------------------- [root at localhost conf]# cat /var/log/httpd/error_log [Fri May 14 16:08:17 2010] [notice] caught SIGTERM, shutting down [Fri May 14 16:08:17 2010] [notice] suEXEC mechanism enabled (wrapper: /usr/sbin/suexec) [Fri May 14 16:08:17 2010] [notice] Digest: generating secret for digest authentication ... [Fri May 14 16:08:17 2010] [notice] Digest: done [Fri May 14 16:08:17 2010] [notice] Apache/2.2.3 (Red Hat) configured -- resuming normal operations [Fri May 14 16:08:40 2010] [error] [client 127.0.0.1] Directory index forbidden by Options directive: /usr/local/nagios/share/ [Fri May 14 16:09:00 2010] [error] [client 127.0.0.1] Directory index forbidden by Options directive: /usr/local/nagios/share/ [Fri May 14 16:09:46 2010] [error] [client 127.0.0.1] Directory index forbidden by Options directive: /usr/local/nagios/share/ [Fri May 14 16:09:46 2010] [error] [client 127.0.0.1] File does not exist: /var/www/html/favicon.ico [Fri May 14 16:10:12 2010] [error] [client 127.0.0.1] Directory index forbidden by Options directive: /var/www/html/ kindly do the needfull Thanks and regards Prasanth Katakam hyderabad 9966655246 -------------------------------------------old message------------------------------------- On Fri, May 14, 2010 at 4:08 PM, Kristan Webb wrote: > Try > > Options +ExecCGI > > ie: > > Options +ExecCGI > AllowOverride None > Order allow,deny > Allow from all > AuthName "Nagios Access" > AuthType Basic > AuthUserFile /usr/local/nagios/etc/htpasswd.users > Require valid-user > > > Alias /nagios /usr/local/nagios/share > > > Options +ExecCGI > AllowOverride None > Order allow,deny > Allow from all > AuthName "Nagios Access" > AuthType Basic > AuthUserFile /usr/local/nagios/etc/htpasswd.users > Require valid-user > > > Kris > > > ------------------------------ > *From:* prasanth katakam [mailto:prasanth.katakam at gmail.com] > *Sent:* 14 May 2010 10:52 > *To:* nagios-users at lists.sourceforge.net > *Subject:* [Nagios-users] 403 forbidden error > > Dear team, > > i was new to nagios, after installing when i try to open through browser > > i got an error like 403 forbidden > you dont have permission to access /nagios/ on this server > > httpd error log > -------------------- > > [root at localhost conf]# cat /var/log/httpd/error_log > > [Fri May 14 14:15:12 2010] [error] [client 127.0.0.1] Directory index > forbidden by Options directive: /var/www/html/ > [Fri May 14 14:18:20 2010] [error] [client 127.0.0.1] Directory index > forbidden by Options directive: /usr/local/nagios/share/ > [Fri May 14 14:18:41 2010] [error] [client 127.0.0.1] Directory index > forbidden by Options directive: /usr/local/nagios/share/ > > > httpd.conf > ---------------- > > [root at localhost conf]# vi /etc/httpd/conf/httpd.conf > > > Options ExecCGI > AllowOverride None > Order allow,deny > Allow from all > AuthName "Nagios Access" > AuthType Basic > AuthUserFile /usr/local/nagios/etc/htpasswd.users > Require valid-user > > > Alias /nagios /usr/local/nagios/share > > > Options None > AllowOverride None > Order allow,deny > Allow from all > AuthName "Nagios Access" > AuthType Basic > AuthUserFile /usr/local/nagios/etc/htpasswd.users > Require valid-user > > > kindly do the needfull > > Prasanth.Katakam > 9966655246 > > > http://www.bbc.co.uk > This e-mail (and any attachments) is confidential and may contain personal > views which are not the views of the BBC unless specifically stated. > If you have received it in error, please delete it from your system. > Do not use, copy or disclose the information in any way nor act in reliance > on it and notify the sender immediately. > Please note that the BBC monitors e-mails sent or received. > Further communication will signify your consent to this. > > > ------------------------------------------------------------------------------ > > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Prasanth.Katakam 9966655246 -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lists at xodus.org Fri May 14 13:39:25 2010 From: lists at xodus.org (Marc Powell) Date: Fri, 14 May 2010 06:39:25 -0500 Subject: 403 forbidden error In-Reply-To: References: Message-ID: <0BBB9D59-4688-423B-B5FD-149CE6D23246@xodus.org> On May 14, 2010, at 4:52 AM, prasanth katakam wrote: > Dear team, > > i was new to nagios, after installing when i try to open through browser > > i got an error like 403 forbidden > you dont have permission to access /nagios/ on this server > > httpd error log > -------------------- > > [root at localhost conf]# cat /var/log/httpd/error_log > > [Fri May 14 14:15:12 2010] [error] [client 127.0.0.1] Directory index forbidden by Options directive: /var/www/html/ > [Fri May 14 14:18:20 2010] [error] [client 127.0.0.1] Directory index forbidden by Options directive: /usr/local/nagios/share/ Do you have PHP installed? If you needed to install it, did you restart Apache? -- Marc ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From davide.blasi at infracom.it Fri May 14 14:36:48 2010 From: davide.blasi at infracom.it (Davide Blasi) Date: Fri, 14 May 2010 14:36:48 +0200 Subject: 403 forbidden error In-Reply-To: References: <469133DFA60D7C4DB131A09BCB270A5602B2109F@bbcxue501.national.core.bbc.co.uk> Message-ID: <20100514143648.4eac6e31@infracom.it> Hi Prasanth, Probably apache don't have permission on cgi-dir can you paste here output of : `ls -l /usr/local/nagios/sbin /usr/local/nagios/share' ? Apache needs read and execution permission on directories, maybe it can be resolved by `chmod -R o+rx /usr/local/nagios/sbin /usr/local/nagios/share' or give ownership to apache group ( in some system www-data ) to them. Dave Date: Fri, 14 May 2010 13:35:03 +0200 From: prasanth katakam To: Nagios Users List Subject: Re: [Nagios-users] 403 forbidden error Dear kris, i tried using --Options +ExecCGI-- but same error repeated 403 forbidden you dont have permission to access /nagios/ on this server httpd error log -------------------- [root at localhost conf]# cat /var/log/httpd/error_log [Fri May 14 16:08:17 2010] [notice] caught SIGTERM, shutting down [Fri May 14 16:08:17 2010] [notice] suEXEC mechanism enabled (wrapper: /usr/sbin/suexec) [Fri May 14 16:08:17 2010] [notice] Digest: generating secret for digest authentication ... [Fri May 14 16:08:17 2010] [notice] Digest: done [Fri May 14 16:08:17 2010] [notice] Apache/2.2.3 (Red Hat) configured -- resuming normal operations [Fri May 14 16:08:40 2010] [error] [client 127.0.0.1] Directory index forbidden by Options directive: /usr/local/nagios/share/ [Fri May 14 16:09:00 2010] [error] [client 127.0.0.1] Directory index forbidden by Options directive: /usr/local/nagios/share/ [Fri May 14 16:09:46 2010] [error] [client 127.0.0.1] Directory index forbidden by Options directive: /usr/local/nagios/share/ [Fri May 14 16:09:46 2010] [error] [client 127.0.0.1] File does not exist: /var/www/html/favicon.ico [Fri May 14 16:10:12 2010] [error] [client 127.0.0.1] Directory index forbidden by Options directive: /var/www/html/ kindly do the needfull Thanks and regards Prasanth Katakam hyderabad 9966655246 -------------------------------------------old message------------------------------------- On Fri, May 14, 2010 at 4:08 PM, Kristan Webb > wrote: Try Options +ExecCGI ie: Options +ExecCGI AllowOverride None Order allow,deny Allow from all AuthName "Nagios Access" AuthType Basic AuthUserFile /usr/local/nagios/etc/htpasswd.users Require valid-user Alias /nagios /usr/local/nagios/share Options +ExecCGI AllowOverride None Order allow,deny Allow from all AuthName "Nagios Access" AuthType Basic AuthUserFile /usr/local/nagios/etc/htpasswd.users Require valid-user Kris ________________________________ From: prasanth katakam [mailto:prasanth.katakam at gmail.com] Sent: 14 May 2010 10:52 To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] 403 forbidden error Dear team, i was new to nagios, after installing when i try to open through browser i got an error like 403 forbidden you dont have permission to access /nagios/ on this server httpd error log -------------------- [root at localhost conf]# cat /var/log/httpd/error_log [Fri May 14 14:15:12 2010] [error] [client 127.0.0.1] Directory index forbidden by Options directive: /var/www/html/ [Fri May 14 14:18:20 2010] [error] [client 127.0.0.1] Directory index forbidden by Options directive: /usr/local/nagios/share/ [Fri May 14 14:18:41 2010] [error] [client 127.0.0.1] Directory index forbidden by Options directive: /usr/local/nagios/share/ httpd.conf ---------------- [root at localhost conf]# vi /etc/httpd/conf/httpd.conf Options ExecCGI AllowOverride None Order allow,deny Allow from all AuthName "Nagios Access" AuthType Basic AuthUserFile /usr/local/nagios/etc/htpasswd.users Require valid-user Alias /nagios /usr/local/nagios/share Options None AllowOverride None Order allow,deny Allow from all AuthName "Nagios Access" AuthType Basic AuthUserFile /usr/local/nagios/etc/htpasswd.users Require valid-user kindly do the needfull Prasanth.Katakam 9966655246 http://www.bbc.co.uk This e-mail (and any attachments) is confidential and may contain personal views which are not the views of the BBC unless specifically stated. If you have received it in error, please delete it from your system. Do not use, copy or disclose the information in any way nor act in reliance on it and notify the sender immediately. Please note that the BBC monitors e-mails sent or received. Further communication will signify your consent to this. ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Prasanth.Katakam 9966655246 ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sergioafonsojr at gmail.com Fri May 14 17:49:46 2010 From: sergioafonsojr at gmail.com (=?ISO-8859-1?Q?S=E9rgio_Afonso?=) Date: Fri, 14 May 2010 11:49:46 -0400 Subject: Problems with distributed monitoring Message-ID: Hello, I'm having same problems with my distributed monitoring instalation. I just 27 hosts in my master nagios. With around just 150 services. All these service checks are being performed as passive one. But I'm having a growing delay between master and slaves nagios. When I start it, I don't have this delay, but wth in the course of time it increases some much until one point that nagios master stops. Could yoiu help me? If is there some additional information necessary to inform, I can pass it. Rgs, S?rgio Afonso ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From trisha at rockyou.com Fri May 14 19:28:32 2010 From: trisha at rockyou.com (Trisha Hoang) Date: Fri, 14 May 2010 10:28:32 -0700 Subject: Problems with distributed monitoring In-Reply-To: References: Message-ID: Hi Sergio, Some of the directives I found helpful for our MASTER server are listed below. Since status.dat and nagios.cmd are disk bound, put them on ramdisk will be faster. status_file=/mnt/ramdisk/status.dat command_file=/mnt/ramdisk/nagios.cmd I don't think aggressive_host_checking is needed as nagios checks for host when a service is in error anyway. use_aggressive_host_checking=0 check_host_freshness=0 Service freshness is important as the MASTER tends to process passive checks much slower so the services may go stale. However, since our checks are 5 min interval, having the MASTER wait for the next round of check is fine. check_service_freshness=1 service_freshness_check_interval=420 We use nagios-3.2.1 and I think these directives are still experimental but they seem to help. You will see defunct nagios processes that come and go. I think it's caused by child forked once instead of twice so one gets killed (my theory), but again, it seems to be running ok. use_large_installation_tweaks=0 child_processes_fork_twice=0 Our MASTER receives ~7000 passive checks from the SLAVE but it could only process max ~5000 passive checks per 5 min. The latency is about <10 secs. For the rest, the MASTER actively checks them. If you or someone knows a way to improve passive check processing, that will be great. Also, in our setup, we don't use NSCA. The slaves have ocsp_command=send_service_check where this command inserts the checks into a file that gets sent every 5 sec to the master. On the master, there's a script that opens this file and inserts the lines directly into the nagios.cmd pipe every 5 sec. Trisha -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mitsuto at gmail.com Fri May 14 19:50:00 2010 From: mitsuto at gmail.com (Marcel) Date: Fri, 14 May 2010 14:50:00 -0300 Subject: Problems with distributed monitoring In-Reply-To: References: Message-ID: With only 150 services, it should not delay that much nor stops execution of the main process. Please check you main nagios.cfg file and look for *command_check_interval*, if the value attributed to that variable isn't *"-1"* then there is your problem. Also, which nagios version are you running? On Fri, May 14, 2010 at 2:28 PM, Trisha Hoang wrote: > Hi Sergio, > Some of the directives I found helpful for our MASTER server are listed > below. > > Since status.dat and nagios.cmd are disk bound, put them on ramdisk will be > faster. > status_file=/mnt/ramdisk/status.dat > command_file=/mnt/ramdisk/nagios.cmd > > I don't think aggressive_host_checking is needed as nagios checks for host > when a service is in error anyway. > use_aggressive_host_checking=0 > check_host_freshness=0 > > Service freshness is important as the MASTER tends to process passive > checks much slower so the services may go stale. However, since our checks > are 5 min interval, having the MASTER wait for the next round of check is > fine. > check_service_freshness=1 > service_freshness_check_interval=420 > > We use nagios-3.2.1 and I think these directives are still experimental but > they seem to help. You will see defunct nagios processes that come and go. I > think it's caused by child forked once instead of twice so one gets killed > (my theory), but again, it seems to be running ok. > use_large_installation_tweaks=0 > child_processes_fork_twice=0 > > Our MASTER receives ~7000 passive checks from the SLAVE but it could only > process max ~5000 passive checks per 5 min. The latency is about <10 secs. > For the rest, the MASTER actively checks them. If you or someone knows a way > to improve passive check processing, that will be great. > > Also, in our setup, we don't use NSCA. The slaves have > ocsp_command=send_service_check where this command inserts the checks into a > file that gets sent every 5 sec to the master. On the master, there's a > script that opens this file and inserts the lines directly into the > nagios.cmd pipe every 5 sec. > > Trisha > > > > ------------------------------------------------------------------------------ > > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sergioafonsojr at gmail.com Fri May 14 20:20:29 2010 From: sergioafonsojr at gmail.com (=?ISO-8859-1?Q?S=E9rgio_Afonso?=) Date: Fri, 14 May 2010 14:20:29 -0400 Subject: Problems with distributed monitoring In-Reply-To: References: Message-ID: Hello Marcel, My nagios version is 3.2.0. About my command_check_interval I couldn't understand very well what you mean. My command_check_interval is set to -1 Rgs, S?rgio. On Fri, May 14, 2010 at 1:50 PM, Marcel wrote: > With only 150 services, it should not delay that much nor stops execution of > the main process. > Please check you main nagios.cfg file and look for command_check_interval, > if the value attributed to that variable isn't "-1" then there is your > problem. > > Also, which nagios version are you running? > > > On Fri, May 14, 2010 at 2:28 PM, Trisha Hoang wrote: >> >> Hi Sergio, >> Some of the directives I found helpful for our MASTER server are listed >> below. >> >> Since status.dat and nagios.cmd are disk bound, put them on ramdisk will >> be faster. >> status_file=/mnt/ramdisk/status.dat >> command_file=/mnt/ramdisk/nagios.cmd >> >> I don't think aggressive_host_checking is needed as nagios checks for host >> when a service is in error anyway. >> use_aggressive_host_checking=0 >> check_host_freshness=0 >> >> Service freshness is important as the MASTER tends to process passive >> checks much slower so the services may go stale. However, since our checks >> are 5 min interval, having the MASTER wait for the next round of check is >> fine. >> check_service_freshness=1 >> service_freshness_check_interval=420 >> >> We use nagios-3.2.1 and I think these directives are still experimental >> but they seem to help. You will see defunct nagios processes that come and >> go. I think it's caused by child forked once instead of twice so one gets >> killed (my theory), but again, it seems to be running ok. >> use_large_installation_tweaks=0 >> child_processes_fork_twice=0 >> >> Our MASTER receives ~7000 passive checks from the SLAVE but it could only >> process max ~5000 passive checks per 5 min. The latency is about <10 secs. >> For the rest, the MASTER actively checks them. If you or someone knows a way >> to improve passive check processing, that will be great. >> >> Also, in our setup, we don't use NSCA. The slaves have >> ocsp_command=send_service_check where this command inserts the checks into a >> file that gets sent every 5 sec to the master. On the master, there's a >> script that opens this file and inserts the lines directly into the >> nagios.cmd pipe every 5 sec. >> >> Trisha >> >> >> >> ------------------------------------------------------------------------------ >> >> >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when >> reporting any issue. >> ::: Messages without supporting info will risk being sent to /dev/null > > > ------------------------------------------------------------------------------ > > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mitsuto at gmail.com Fri May 14 20:39:06 2010 From: mitsuto at gmail.com (Marcel) Date: Fri, 14 May 2010 15:39:06 -0300 Subject: Problems with distributed monitoring In-Reply-To: References: Message-ID: Make sure you understand the underlying design of distributed nagios setup. Obssessive-Compulsive (ocsp) command need to be correctly configurated on nagios slave. 2010/5/14 S?rgio Afonso > Hello Marcel, > > My nagios version is 3.2.0. About my command_check_interval I > couldn't understand very well what you mean. My command_check_interval > is set to -1 > > Rgs, > > S?rgio. > > On Fri, May 14, 2010 at 1:50 PM, Marcel wrote: > > With only 150 services, it should not delay that much nor stops execution > of > > the main process. > > Please check you main nagios.cfg file and look for > command_check_interval, > > if the value attributed to that variable isn't "-1" then there is your > > problem. > > > > Also, which nagios version are you running? > > > > > > On Fri, May 14, 2010 at 2:28 PM, Trisha Hoang > wrote: > >> > >> Hi Sergio, > >> Some of the directives I found helpful for our MASTER server are listed > >> below. > >> > >> Since status.dat and nagios.cmd are disk bound, put them on ramdisk will > >> be faster. > >> status_file=/mnt/ramdisk/status.dat > >> command_file=/mnt/ramdisk/nagios.cmd > >> > >> I don't think aggressive_host_checking is needed as nagios checks for > host > >> when a service is in error anyway. > >> use_aggressive_host_checking=0 > >> check_host_freshness=0 > >> > >> Service freshness is important as the MASTER tends to process passive > >> checks much slower so the services may go stale. However, since our > checks > >> are 5 min interval, having the MASTER wait for the next round of check > is > >> fine. > >> check_service_freshness=1 > >> service_freshness_check_interval=420 > >> > >> We use nagios-3.2.1 and I think these directives are still experimental > >> but they seem to help. You will see defunct nagios processes that come > and > >> go. I think it's caused by child forked once instead of twice so one > gets > >> killed (my theory), but again, it seems to be running ok. > >> use_large_installation_tweaks=0 > >> child_processes_fork_twice=0 > >> > >> Our MASTER receives ~7000 passive checks from the SLAVE but it could > only > >> process max ~5000 passive checks per 5 min. The latency is about <10 > secs. > >> For the rest, the MASTER actively checks them. If you or someone knows a > way > >> to improve passive check processing, that will be great. > >> > >> Also, in our setup, we don't use NSCA. The slaves have > >> ocsp_command=send_service_check where this command inserts the checks > into a > >> file that gets sent every 5 sec to the master. On the master, there's a > >> script that opens this file and inserts the lines directly into the > >> nagios.cmd pipe every 5 sec. > >> > >> Trisha > >> > >> > >> > >> > ------------------------------------------------------------------------------ > >> > >> > >> _______________________________________________ > >> Nagios-users mailing list > >> Nagios-users at lists.sourceforge.net > >> https://lists.sourceforge.net/lists/listinfo/nagios-users > >> ::: Please include Nagios version, plugin version (-v) and OS when > >> reporting any issue. > >> ::: Messages without supporting info will risk being sent to /dev/null > > > > > > > ------------------------------------------------------------------------------ > > > > > > _______________________________________________ > > Nagios-users mailing list > > Nagios-users at lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when > reporting > > any issue. > > ::: Messages without supporting info will risk being sent to /dev/null > > > > > ------------------------------------------------------------------------------ > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From srdan.dukic at gmail.com Sat May 15 08:13:58 2010 From: srdan.dukic at gmail.com (Srdan Dukic) Date: Sat, 15 May 2010 06:13:58 +0000 Subject: Using NSClient++ NRPE to check a local URI In-Reply-To: References: Message-ID: On 14 May 2010 06:13, wrote: > NRPE for that? Is there a requirement that prevents you from using > check_http from the Nagios server? > Yes. The requirement is that the URL being checked is on a local windows network. i.e. http://somecomputer.localdomain/, which the Nagios server can't access. Cheers -- Sr?an ?uki? -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mmavrudiev at gmail.com Sat May 15 15:29:09 2010 From: mmavrudiev at gmail.com (Maxim Mavrudiev) Date: Sat, 15 May 2010 16:29:09 +0300 Subject: host/service schedule checks problem Message-ID: <4BEEA1A5.5000607@gmail.com> Hi all, I upgraded to the last stable versions of nagios-core, nagios-plugins, rrdtool, nagios grapher, and the things go bad... Now the versions are: - Nagios v3.2.1 Stable - Nagios Grapher 1.7.1 - rrdtool 1.4.3 - NagiosQL 3.0.3 - MySQL 5.0.37 The problem: When I stop and start the nagios by the rc script, all 813 hosts are checked by the initial check, only the hosts, not the services... After that the schedule checks neither for the hosts, neither for the services won't work...the check_interval for example for the service, which is checking hosts with ping is 4 minutes...nothing happen after 4 or more minutes...only the initial check works... For now I restarted nagios in crontab every 6 minutes to work something...but I need to figure it out what happens... nagiostat: Nagios Stats 3.2.1 Copyright (c) 2003-2008 Ethan Galstad (www.nagios.org) Last Modified: 03-09-2010 License: GPL CURRENT STATUS DATA ------------------------------------------------------ Status File: /var/nagios/status.dat Status File Age: 0d 0h 0m 7s Status File Version: 3.2.1 Program Running Time: 0d 0h 0m 58s Nagios PID: 21290 Used/High/Total Command Buffers: 0 / 0 / 4096 Total Services: 1848 Services Checked: 1846 Services Scheduled: 1843 Services Actively Checked: 1847 Services Passively Checked: 1 Total Service State Change: 0.000 / 37.890 / 0.338 % Active Service Latency: 0.000 / 295.571 / 38.203 sec Active Service Execution Time: 0.000 / 100.005 / 11.427 sec Active Service State Change: 0.000 / 37.890 / 0.338 % Active Services Last 1/5/15/60 min: 0 / 34 / 34 / 168 Passive Service Latency: 0.256 / 0.256 / 0.256 sec Passive Service State Change: 0.000 / 0.000 / 0.000 % Passive Services Last 1/5/15/60 min: 0 / 0 / 0 / 0 Services Ok/Warn/Unk/Crit: 1662 / 32 / 17 / 137 Services Flapping: 2 Services In Downtime: 0 Total Hosts: 813 Hosts Checked: 813 Hosts Scheduled: 808 Hosts Actively Checked: 812 Host Passively Checked: 1 Total Host State Change: 0.000 / 38.550 / 0.244 % Active Host Latency: 0.000 / 56.533 / 8.307 sec Active Host Execution Time: 0.000 / 20.005 / 3.699 sec Active Host State Change: 0.000 / 38.550 / 0.245 % Active Hosts Last 1/5/15/60 min: 581 / 581 / 592 / 810 Passive Host Latency: 0.256 / 0.256 / 0.256 sec Passive Host State Change: 0.000 / 0.000 / 0.000 % Passive Hosts Last 1/5/15/60 min: 0 / 0 / 0 / 0 Hosts Up/Down/Unreach: 743 / 67 / 3 Hosts Flapping: 1 Hosts In Downtime: 0 Active Host Checks Last 1/5/15 min: 807 / 807 / 807 Scheduled: 807 / 807 / 807 On-demand: 0 / 0 / 0 Parallel: 807 / 807 / 807 Serial: 0 / 0 / 0 Cached: 0 / 0 / 0 Passive Host Checks Last 1/5/15 min: 0 / 0 / 0 Active Service Checks Last 1/5/15 min: 0 / 0 / 0 Scheduled: 0 / 0 / 0 On-demand: 0 / 0 / 0 Cached: 0 / 0 / 0 Passive Service Checks Last 1/5/15 min: 0 / 0 / 0 External Commands Last 1/5/15 min: 0 / 0 / 0 /usr/local/nagios/bin/nagios -s /etc/nagios Any ideas? -------------- next part -------------- An embedded and charset-unspecified text was scrubbed... Name: nagios.cfg URL: -------------- next part -------------- An embedded and charset-unspecified text was scrubbed... Name: clients.cfg URL: -------------- next part -------------- An embedded and charset-unspecified text was scrubbed... Name: nagios -s nagios.cfg URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mikue4 at gmail.com Sat May 15 15:38:29 2010 From: mikue4 at gmail.com (Miku, E. Cornelius) Date: Sat, 15 May 2010 16:38:29 +0300 Subject: having perl problem after upgrading Message-ID: I upgraded from Ubuntu 8.04 to Ubuntu 9.10, now when i run the "-v" command to verify my configuration i get the below error... /usr/sbin/nagios3: error while loading shared libraries: libperl.so.5.8: cannot open shared object file: No such file or directory Please assist. i'm running Nagios 3.2 If i try to locate libperl.so i get the below results; /usr/lib/libperl.so /usr/lib/libperl.so.5.10 /usr/lib/libperl.so.5.10.0 Thanks in advance, -- Miku -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lists at xodus.org Sat May 15 16:00:51 2010 From: lists at xodus.org (Marc Powell) Date: Sat, 15 May 2010 09:00:51 -0500 Subject: having perl problem after upgrading In-Reply-To: References: Message-ID: <28551EC4-107F-4462-9534-F7552065AE43@xodus.org> On May 15, 2010, at 8:38 AM, Miku, E. Cornelius wrote: > I upgraded from Ubuntu 8.04 to Ubuntu 9.10, now when i run the "-v" command to verify my configuration i get the below error... > > /usr/sbin/nagios3: error while loading shared libraries: libperl.so.5.8: cannot open shared object file: No such file or directory > > Please assist. It looks like you installed nagios from source. You need to recompile nagios with the new library and reinstall the binary. ./configure --whatever your previous options were make all /etc/init.d/nagios stop make install-base /etc/init.d/nagios start -- Marc ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From zarrelli at linux.it Sun May 16 09:18:04 2010 From: zarrelli at linux.it (Giorgio Zarrelli) Date: Sun, 16 May 2010 09:18:04 +0200 Subject: having perl problem after upgrading In-Reply-To: References: Message-ID: <8BD1E6C9-D329-4F15-BCCE-7B2EE8A4823C@linux.it> Try linking: ln - s /usr/lib/libperl.so.5.8 /usr/lib/libperl.so.5.10 Ciao, Giorgio Il giorno 15/mag/2010, alle ore 15.38, "Miku, E. Cornelius" ha scritto: > I upgraded from Ubuntu 8.04 to Ubuntu 9.10, now when i run the "-v" > command to verify my configuration i get the below error... > > /usr/sbin/nagios3: error while loading shared libraries: libperl.so. > 5.8: cannot open shared object file: No such file or directory > > Please assist. > > i'm running Nagios 3.2 > > If i try to locate libperl.so i get the below results; > > /usr/lib/libperl.so > /usr/lib/libperl.so.5.10 > /usr/lib/libperl.so.5.10.0 > > Thanks in advance, > > -- > Miku > --- > --- > --- > --------------------------------------------------------------------- > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mmavrudiev at gmail.com Sun May 16 10:54:09 2010 From: mmavrudiev at gmail.com (Maxim Mavrudiev) Date: Sun, 16 May 2010 11:54:09 +0300 Subject: host/service schedule checks problem In-Reply-To: <4BEEA1A5.5000607@gmail.com> References: <4BEEA1A5.5000607@gmail.com> Message-ID: <4BEFB2B1.5050101@gmail.com> Any ideas? Everything seems to be corrected, but it doesn't work... Maxim Mavrudiev wrote: > Hi all, > > I upgraded to the last stable versions of nagios-core, nagios-plugins, > rrdtool, nagios grapher, and the things go bad... > Now the versions are: > - Nagios v3.2.1 Stable > - Nagios Grapher 1.7.1 > - rrdtool 1.4.3 > - NagiosQL 3.0.3 > - MySQL 5.0.37 > > The problem: > When I stop and start the nagios by the rc script, all 813 hosts are > checked by the initial check, only the hosts, not the services... > After that the schedule checks neither for the hosts, neither for the > services won't work...the check_interval for example for the service, > which is checking hosts with ping is 4 minutes...nothing happen after 4 > or more minutes...only the initial check works... > For now I restarted nagios in crontab every 6 minutes to work > something...but I need to figure it out what happens... > > nagiostat: > > Nagios Stats 3.2.1 > Copyright (c) 2003-2008 Ethan Galstad (www.nagios.org) > Last Modified: 03-09-2010 > License: GPL > > CURRENT STATUS DATA > ------------------------------------------------------ > Status File: /var/nagios/status.dat > Status File Age: 0d 0h 0m 7s > Status File Version: 3.2.1 > > Program Running Time: 0d 0h 0m 58s > Nagios PID: 21290 > Used/High/Total Command Buffers: 0 / 0 / 4096 > > Total Services: 1848 > Services Checked: 1846 > Services Scheduled: 1843 > Services Actively Checked: 1847 > Services Passively Checked: 1 > Total Service State Change: 0.000 / 37.890 / 0.338 % > Active Service Latency: 0.000 / 295.571 / 38.203 sec > Active Service Execution Time: 0.000 / 100.005 / 11.427 sec > Active Service State Change: 0.000 / 37.890 / 0.338 % > Active Services Last 1/5/15/60 min: 0 / 34 / 34 / 168 > Passive Service Latency: 0.256 / 0.256 / 0.256 sec > Passive Service State Change: 0.000 / 0.000 / 0.000 % > Passive Services Last 1/5/15/60 min: 0 / 0 / 0 / 0 > Services Ok/Warn/Unk/Crit: 1662 / 32 / 17 / 137 > Services Flapping: 2 > Services In Downtime: 0 > > Total Hosts: 813 > Hosts Checked: 813 > Hosts Scheduled: 808 > Hosts Actively Checked: 812 > Host Passively Checked: 1 > Total Host State Change: 0.000 / 38.550 / 0.244 % > Active Host Latency: 0.000 / 56.533 / 8.307 sec > Active Host Execution Time: 0.000 / 20.005 / 3.699 sec > Active Host State Change: 0.000 / 38.550 / 0.245 % > Active Hosts Last 1/5/15/60 min: 581 / 581 / 592 / 810 > Passive Host Latency: 0.256 / 0.256 / 0.256 sec > Passive Host State Change: 0.000 / 0.000 / 0.000 % > Passive Hosts Last 1/5/15/60 min: 0 / 0 / 0 / 0 > Hosts Up/Down/Unreach: 743 / 67 / 3 > Hosts Flapping: 1 > Hosts In Downtime: 0 > > Active Host Checks Last 1/5/15 min: 807 / 807 / 807 > Scheduled: 807 / 807 / 807 > On-demand: 0 / 0 / 0 > Parallel: 807 / 807 / 807 > Serial: 0 / 0 / 0 > Cached: 0 / 0 / 0 > Passive Host Checks Last 1/5/15 min: 0 / 0 / 0 > Active Service Checks Last 1/5/15 min: 0 / 0 / 0 > Scheduled: 0 / 0 / 0 > On-demand: 0 / 0 / 0 > Cached: 0 / 0 / 0 > Passive Service Checks Last 1/5/15 min: 0 / 0 / 0 > > External Commands Last 1/5/15 min: 0 / 0 / 0 > > /usr/local/nagios/bin/nagios -s /etc/nagios > > > Any ideas? > -- ????????, ?????? ????????? ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jashokda at cisco.com Mon May 17 08:51:28 2010 From: jashokda at cisco.com (Jatin Davey) Date: Mon, 17 May 2010 12:21:28 +0530 Subject: Warning while starting Nagios (Changed for Sync to Bulk(NPCD) mode of PNP operation) Message-ID: <4BF0E770.8000403@cisco.com> Hi All I am using Nagios with PNP4nagios as a graphing addon. I was using the synchronous mode of PNP operation where in for every data collected the process_perfdata is run and the data gets stored in the database. My polling was such that for few services like CPU and bandwidth usage the frequency of polling was done every 20 secs. This added a lot of load on my nagios server box. Its CPU was constantly working at 70-75% and hence i decided to switch over to Bulk with NPCD mode and followed all the instructions given in the PNP4 site. Everything went fine and nagios also verified the configurations correctly. But while starting the nagios service i got the following error message as shown below. Can someone tell me what was the reason for it and is there anything that i should be concerned. Please let me know. Note: After starting the nagios server again i am seeing the data being collected and shown in the graphs. No issues in this regard. Now the load on my nagios server is significantly reduced. I could see that the CPU usage has come down from 70-75% to 40%. Running pre-flight check on configuration data... Checking services... Checked 199 services. Checking hosts... Checked 24 hosts. Checking host groups... Checked 2 host groups. Checking service groups... Checked 0 service groups. Checking contacts... Checked 16 contacts. Checking contact groups... Checked 1 contact groups. Checking service escalations... Checked 0 service escalations. Checking service dependencies... Checked 0 service dependencies. Checking host escalations... Checked 0 host escalations. Checking host dependencies... Checked 0 host dependencies. Checking commands... Checked 41 commands. Checking time periods... Checked 5 time periods. Checking for circular paths between hosts... Checking for circular host and service dependencies... Checking global event handlers... Checking obsessive compulsive processor commands... Checking misc settings... Total Warnings: 0 Total Errors: 0 Things look okay - No serious problems were detected during the pre-flight check [root at pc-fedora-1 objects]# /etc/rc.d/init.d/nagios start Starting nagios:su: warning: cannot change directory to /var/spool/nagios: No such file or directory done. ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From liulantao at gmail.com Mon May 17 10:18:20 2010 From: liulantao at gmail.com (Liu Lantao) Date: Mon, 17 May 2010 16:18:20 +0800 Subject: How to override a command definition? In-Reply-To: <1273806378.3087.55.camel@sewilco-laptop> References: <1273780202.3087.10.camel@sewilco-laptop> <91B7E3EE-BE74-4DE0-9B7B-DC3CE779F126@xodus.org> <1273806378.3087.55.camel@sewilco-laptop> Message-ID: <4BF0FBCC.1050506@gmail.com> On 05/14/2010 11:06 AM, Scot Wilcoxon wrote: > On Thu, 2010-05-13 at 21:25 -0500, Marc Powell wrote: > >> On May 13, 2010, at 2:50 PM, Scot Wilcoxon wrote: >> >> >>> I'm trying to override the existing definition, because I'd rather not >>> alter the configuration which is part of the installation package. >>> >> Just comment out the original if you want to keep it there as a reference. >> > I know how to alter the files, but I was hoping to leave the original > config file alone so the system updates could replace it if necessary. > > Apparently Nagios is missing the ability to replace previous definitions > (also demonstrated in the problem of "undefined" causing specific > behavior but not being able to undefine, such as for a no-ping host). I > also haven't noticed a defined priority for configuration info, > something which would be needed for an override. > it's not a good idea to overide the 'previous' definition, as a result of the config style as 'cfg_dir', nagios cannot decide which is the 'previous' config file. cfg_dir=/usr/local/nagios/etc/hosts ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From miku at habari.co.tz Mon May 17 12:26:19 2010 From: miku at habari.co.tz (Miku, E. Cornelius) Date: Mon, 17 May 2010 13:26:19 +0300 Subject: having perl problem after upgrading In-Reply-To: <28551EC4-107F-4462-9534-F7552065AE43@xodus.org> References: <28551EC4-107F-4462-9534-F7552065AE43@xodus.org> Message-ID: Thanks Marc, You were correct, i had installed from source, and doing what you suggested fixed the issue. Again, thanks for your assistance. Regards, -- Miku On Sat, May 15, 2010 at 5:00 PM, Marc Powell wrote: > > On May 15, 2010, at 8:38 AM, Miku, E. Cornelius wrote: > > > I upgraded from Ubuntu 8.04 to Ubuntu 9.10, now when i run the "-v" > command to verify my configuration i get the below error... > > > > /usr/sbin/nagios3: error while loading shared libraries: libperl.so.5.8: > cannot open shared object file: No such file or directory > > > > Please assist. > > It looks like you installed nagios from source. You need to recompile > nagios with the new library and reinstall the binary. > > ./configure --whatever your previous options were > make all > /etc/init.d/nagios stop > make install-base > /etc/init.d/nagios start > > -- > Marc > > ------------------------------------------------------------------------------ > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Mon May 17 13:33:40 2010 From: nagios at flatto.net (Assaf Flatto) Date: Mon, 17 May 2010 12:33:40 +0100 Subject: Warning while starting Nagios (Changed for Sync to Bulk(NPCD) mode of PNP operation) In-Reply-To: <4BF0E770.8000403@cisco.com> References: <4BF0E770.8000403@cisco.com> Message-ID: <4BF12994.9070509@flatto.net> Jatin Davey wrote: > Hi All > > I am using Nagios with PNP4nagios as a graphing addon. I was using the > synchronous mode of PNP operation where in for every data collected the > process_perfdata is run and the data gets stored in the database. My > polling was such that for few services like CPU and bandwidth usage the > frequency of polling was done every 20 secs. This added a lot of load on > my nagios server box. Its CPU was constantly working at 70-75% and hence > i decided to switch over to Bulk with NPCD mode and followed all the > instructions given in the PNP4 site. Everything went fine and nagios > also verified the configurations correctly. But while starting the > nagios service i got the following error message as shown below. > > Can someone tell me what was the reason for it and is there anything > that i should be concerned. Please let me know. > > Note: After starting the nagios server again i am seeing the data being > collected and shown in the graphs. No issues in this regard. Now the > load on my nagios server is significantly reduced. I could see that the > CPU usage has come down from 70-75% to 40%. > > Running pre-flight check on configuration data... > > Checking services... > Checked 199 services. > Checking hosts... > Checked 24 hosts. > Checking host groups... > Checked 2 host groups. > Checking service groups... > Checked 0 service groups. > Checking contacts... > Checked 16 contacts. > Checking contact groups... > Checked 1 contact groups. > Checking service escalations... > Checked 0 service escalations. > Checking service dependencies... > Checked 0 service dependencies. > Checking host escalations... > Checked 0 host escalations. > Checking host dependencies... > Checked 0 host dependencies. > Checking commands... > Checked 41 commands. > Checking time periods... > Checked 5 time periods. > Checking for circular paths between hosts... > Checking for circular host and service dependencies... > Checking global event handlers... > Checking obsessive compulsive processor commands... > Checking misc settings... > > Total Warnings: 0 > Total Errors: 0 > > Things look okay - No serious problems were detected during the > pre-flight check > [root at pc-fedora-1 objects]# /etc/rc.d/init.d/nagios start > Starting nagios:su: warning: cannot change directory to > /var/spool/nagios: No such file or directory > done. > > > As it says in the bottom of the error list - you are missing a directory for nagios to work in . check that the directory exists and that it has the permissions for the nagios user to read/write to it. This might be needed as the cache location (spool area) and may cause an increase in use of CPU and RAM resources , which might also impact the performance of nagios and the internal availability and trending nagios keeps. Assaf -- Never,Ever Cut A Deal With a Dragon I am doing a Charity Bike ride On the 27 of June for the Capital to Coast Charity. Please help by Donating http://www.justgiving.com/Lovefilm-capital-to-coast ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lomiz.mail at gmail.com Mon May 17 17:17:47 2010 From: lomiz.mail at gmail.com (Enrico Zimol) Date: Mon, 17 May 2010 17:17:47 +0200 Subject: NRPE with command arguments Message-ID: Hello at all, I'm looking to pass argument to nrpe, so I read that I've to set dont_blame_nrpe to 1 and recompile with --enable-command-args. My question is about if this recompiling will do effect on NRPE or on check_nrpe plugin. I think first one, but I want confirm on that. Is there some specifics documentation about it? because on officials there aren't informations. (for example I don't know what "make install-*" I've to do after recompiling) Thank's so much sorry for my bad english ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Mon May 17 17:28:29 2010 From: nagios at flatto.net (Assaf Flatto) Date: Mon, 17 May 2010 16:28:29 +0100 Subject: NRPE with command arguments In-Reply-To: References: Message-ID: <4BF1609D.7010908@flatto.net> Enrico Zimol wrote: > Hello at all, > I'm looking to pass argument to nrpe, so I read that I've to set > dont_blame_nrpe to 1 and recompile with --enable-command-args. > My question is about if this recompiling will do effect on NRPE or on > check_nrpe plugin. I think first one, but I want confirm on that. > Is there some specifics documentation about it? because on officials > there aren't informations. > (for example I don't know what "make install-*" I've to do after recompiling) > > > Thank's so much > > sorry for my bad english > > The recompile will not have any effect on the check_nrpe plugin , but will change the way the nrpe daemon works and listens to the commands sent from the nagios server . about the "make" ,all you need to do is as follows : ./configure --enable-command-args make make install then check the configuration , verify the init script and start nrpe . after that - change the testing to work the way you want it . ( always good to have the debug of nrpe enabled while testing ). Assaf -- Never,Ever Cut A Deal With a Dragon I am doing a Charity Bike ride On the 27 of June for the Capital to Coast Charity. Please help by Donating http://www.justgiving.com/Lovefilm-capital-to-coast ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From richard.lynch at rasmussen.edu Mon May 17 18:17:50 2010 From: richard.lynch at rasmussen.edu (Richard Lynch) Date: Mon, 17 May 2010 11:17:50 -0500 Subject: 403 forbidden error In-Reply-To: References: Message-ID: Did you put a username/password in /usr/local/nagios/etc/htpasswd.users that you know is correct? You?re supposed to get a 403 until you supply the username and unhashed-password from that file. On 5/14/10 4:52 AM, "prasanth katakam" wrote: > Dear team, > > i was new to nagios, after installing when i try to open through browser > > i got an error like 403 forbidden > you dont have permission to access /nagios/ on this server > > httpd error log > -------------------- > > [root at localhost conf]# ?cat /var/log/httpd/error_log > > [Fri May 14 14:15:12 2010] [error] [client 127.0.0.1] Directory index > forbidden by Options directive: /var/www/html/ > [Fri May 14 14:18:20 2010] [error] [client 127.0.0.1] Directory index > forbidden by Options directive: /usr/local/nagios/share/ > [Fri May 14 14:18:41 2010] [error] [client 127.0.0.1] Directory index > forbidden by Options directive: /usr/local/nagios/share/ > > > httpd.conf > ---------------- > > [root at localhost conf]# vi /etc/httpd/conf/httpd.conf? > > > ?? ?Options ExecCGI > ?? ?AllowOverride None > ?? ?Order allow,deny > ?? ?Allow from all > ?? ?AuthName "Nagios Access" > ?? ?AuthType Basic > ?? ?AuthUserFile /usr/local/nagios/etc/htpasswd.users > ?? ?Require valid-user > > > Alias /nagios /usr/local/nagios/share > > > ?? ?Options None > ?? ?AllowOverride None > ?? ?Order allow,deny > ?? ?Allow from all > ?? ?AuthName "Nagios Access" > ?? ?AuthType Basic > ?? ?AuthUserFile /usr/local/nagios/etc/htpasswd.users > ?? ?Require valid-user > > > kindly do the needfull > > Prasanth.Katakam > 9966655246 > > > ------------------------------------------------------------------------------ > > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From john at stilen.com Mon May 17 19:57:33 2010 From: john at stilen.com (John Stile) Date: Mon, 17 May 2010 10:57:33 -0700 Subject: set warning and cricitcal for ups capacity Message-ID: <1274119053.22852.27.camel@genx.localdomain> I would like to add a warning & critical value to the following check_snmp line in /etc/nagios/nrpe.cfg, which monitors my ups. command[snmp_ups_capa]=/usr/lib/nagios/plugins/check_snmp -H 10.0.10.2 -C public -o .1.3.6.1.4.1.318.1.1.1.2.2.1.0 -l '% Capacity' -u '%' For a warning at 80% capacity and critical at 10%, it doesn't work to simply append: -w 80 -c 10 ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rkferreira at gmail.com Mon May 17 20:48:38 2010 From: rkferreira at gmail.com (Rodrigo K. Ferreira) Date: Mon, 17 May 2010 15:48:38 -0300 Subject: Nagios-2.12 Message-ID: Hi, I'm using nagios-2.12 for monitoring my systems and I need to disable "sticky acknoledge". I have read about that at Nagios docs/manuals and at google, I looked the source code too, and the only way is patching the code ? My hope was Nagios 3 came with that option at config files, but the docs I read dont mentions that. Thanks a lot. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From kyle.bader at gmail.com Mon May 17 22:32:24 2010 From: kyle.bader at gmail.com (Kyle Bader) Date: Mon, 17 May 2010 13:32:24 -0700 Subject: set warning and cricitcal for ups capacity In-Reply-To: <1274119053.22852.27.camel@genx.localdomain> References: <1274119053.22852.27.camel@genx.localdomain> Message-ID: Write a wrapper to capture output and return error codes/perfdata based on thresholds passed as args? On 5/17/10, John Stile wrote: > I would like to add a warning & critical value to the following > check_snmp line in /etc/nagios/nrpe.cfg, which monitors my ups. > > command[snmp_ups_capa]=/usr/lib/nagios/plugins/check_snmp -H 10.0.10.2 > -C public -o .1.3.6.1.4.1.318.1.1.1.2.2.1.0 -l '% Capacity' -u '%' > > For a warning at 80% capacity and critical at 10%, it doesn't work to > simply append: -w 80 -c 10 > > > ------------------------------------------------------------------------------ > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Sent from my mobile device Kyle ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From kyle.bader at gmail.com Tue May 18 01:09:40 2010 From: kyle.bader at gmail.com (Kyle Bader) Date: Mon, 17 May 2010 16:09:40 -0700 Subject: set warning and cricitcal for ups capacity In-Reply-To: References: <1274119053.22852.27.camel@genx.localdomain> Message-ID: If you can provide sample output ill perlify it :) On 5/17/10, Kyle Bader wrote: > Write a wrapper to capture output and return error codes/perfdata > based on thresholds passed as args? > > On 5/17/10, John Stile wrote: >> I would like to add a warning & critical value to the following >> check_snmp line in /etc/nagios/nrpe.cfg, which monitors my ups. >> >> command[snmp_ups_capa]=/usr/lib/nagios/plugins/check_snmp -H 10.0.10.2 >> -C public -o .1.3.6.1.4.1.318.1.1.1.2.2.1.0 -l '% Capacity' -u '%' >> >> For a warning at 80% capacity and critical at 10%, it doesn't work to >> simply append: -w 80 -c 10 >> >> >> ------------------------------------------------------------------------------ >> >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when >> reporting >> any issue. >> ::: Messages without supporting info will risk being sent to /dev/null >> > > -- > Sent from my mobile device > > > Kyle > -- Sent from my mobile device Kyle ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Tue May 18 08:17:55 2010 From: patrick.morris at hp.com (Morris, Patrick) Date: Mon, 17 May 2010 23:17:55 -0700 Subject: set warning and cricitcal for ups capacity In-Reply-To: <1274119053.22852.27.camel@genx.localdomain> References: <1274119053.22852.27.camel@genx.localdomain> Message-ID: <4BF23113.5020700@hp.com> Use "check_snmp -h" to get a description of valid parameters. The warning and critical parameters are ranges, not single integers. The notes describe the syntax and options. John Stile wrote: > I would like to add a warning & critical value to the following > check_snmp line in /etc/nagios/nrpe.cfg, which monitors my ups. > > command[snmp_ups_capa]=/usr/lib/nagios/plugins/check_snmp -H 10.0.10.2 > -C public -o .1.3.6.1.4.1.318.1.1.1.2.2.1.0 -l '% Capacity' -u '%' > > For a warning at 80% capacity and critical at 10%, it doesn't work to > simply append: -w 80 -c 10 > > > ------------------------------------------------------------------------------ > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Tue May 18 10:49:36 2010 From: nagios at flatto.net (Assaf Flatto) Date: Tue, 18 May 2010 09:49:36 +0100 Subject: Nagios-2.12 In-Reply-To: References: Message-ID: <4BF254A0.6010103@flatto.net> Rodrigo K. Ferreira wrote: > Hi, > > I'm using nagios-2.12 for monitoring my systems and I need to disable > "sticky acknoledge". > I have read about that at Nagios docs/manuals and at google, I looked > the source code too, and the only way is patching the code ? > My hope was Nagios 3 came with that option at config files, but the > docs I read dont mentions that. > > Thanks a lot. The 'Sticky Acknowledgement' has the same behaviour in nagios 3 as in nagios 2 . If you want that disabled - you will need to modify the cmd.cgi source and recompile it . Assaf -- Never,Ever Cut A Deal With a Dragon I am doing a Charity Bike ride On the 27 of June for the Capital to Coast Charity. Please help by Donating http://www.justgiving.com/Lovefilm-capital-to-coast ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From prasanth.katakam at gmail.com Tue May 18 12:37:36 2010 From: prasanth.katakam at gmail.com (prasanth katakam) Date: Tue, 18 May 2010 16:07:36 +0530 Subject: how 2 add hosts to nagios Message-ID: Dear team, I was new to nagios and i have completed the installation of nagios successfully and please let me know how to add host machines(both linux and windows) in nagios server i tried to add windows machine to nagios with installing NSclient++ in windows machine but it is not getting added in nagios. kindly let me know the host adding process for both windows and linux Thanks and regards Prasanth.Katakam 9966655246 hyderabad -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Tue May 18 12:44:59 2010 From: nagios at flatto.net (Assaf Flatto) Date: Tue, 18 May 2010 11:44:59 +0100 Subject: how 2 add hosts to nagios In-Reply-To: References: Message-ID: <4BF26FAB.1090602@flatto.net> prasanth katakam wrote: > Dear team, > > I was new to nagios and i have completed the installation of nagios > successfully and please let me know how to add host machines(both > linux and windows) in nagios server > > i tried to add windows machine to nagios with installing NSclient++ > in windows machine but it is not getting added in nagios. > > kindly let me know the host adding process for both windows and linux > > Thanks and regards > Prasanth.Katakam > 9966655246 > hyderabad you may want to start here : http://nagios.sourceforge.net/docs/3_0/beginners.html After you read the documentation if you still can't figure it out , post again. -- Never,Ever Cut A Deal With a Dragon I am doing a Charity Bike ride On the 27 of June for the Capital to Coast Charity. Please help by Donating http://www.justgiving.com/Lovefilm-capital-to-coast ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rkferreira at gmail.com Tue May 18 13:02:38 2010 From: rkferreira at gmail.com (Rodrigo K. Ferreira) Date: Tue, 18 May 2010 08:02:38 -0300 Subject: Nagios-2.12 In-Reply-To: <4BF254A0.6010103@flatto.net> References: <4BF254A0.6010103@flatto.net> Message-ID: Thanks a lot. On Tue, May 18, 2010 at 5:49 AM, Assaf Flatto wrote: > Rodrigo K. Ferreira wrote: > > Hi, > > > > I'm using nagios-2.12 for monitoring my systems and I need to disable > > "sticky acknoledge". > > I have read about that at Nagios docs/manuals and at google, I looked > > the source code too, and the only way is patching the code ? > > My hope was Nagios 3 came with that option at config files, but the > > docs I read dont mentions that. > > > > Thanks a lot. > The 'Sticky Acknowledgement' has the same behaviour in nagios 3 as in > nagios 2 . > If you want that disabled - you will need to modify the cmd.cgi source > and recompile it . > > Assaf > > -- > Never,Ever Cut A Deal With a Dragon > > > I am doing a Charity Bike ride On the 27 of June for the > Capital to Coast Charity. Please help by Donating > http://www.justgiving.com/Lovefilm-capital-to-coast > > > > > ------------------------------------------------------------------------------ > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From prasanth.katakam at gmail.com Tue May 18 14:51:18 2010 From: prasanth.katakam at gmail.com (prasanth katakam) Date: Tue, 18 May 2010 18:21:18 +0530 Subject: how 2 add hosts to nagios In-Reply-To: <4BF26FAB.1090602@flatto.net> References: <4BF26FAB.1090602@flatto.net> Message-ID: dear team, when i start nagios it is giving an error like [root at localhost ~]# service nagios restart Running configuration check... CONFIG ERROR! Restart aborted. Check your Nagios configuration. [root at localhost ~]# /usr/local/nagios/bin/nagios -v /usr/local/nagios/etc/nagios.cfg Nagios Core 3.2.1 Copyright (c) 2009-2010 Nagios Core Development Team and Community Contributors Copyright (c) 1999-2009 Ethan Galstad Last Modified: 03-09-2010 License: GPL Website: http://www.nagios.org Reading configuration data... Read main config file okay... Processing object config file '/usr/local/nagios/etc/objects/commands.cfg'... Processing object config file '/usr/local/nagios/etc/objects/contacts.cfg'... Processing object config file '/usr/local/nagios/etc/objects/timeperiods.cfg'... Processing object config file '/usr/local/nagios/etc/objects/templates.cfg'... Processing object config file '/usr/local/nagios/etc/objects/localhost.cfg'... Processing object config file '/usr/local/nagios/etc/objects/windows.cfg'... Error: Could not find any host matching 'winserver' (config file '/usr/local/nagios/etc/objects/windows.cfg', starting on line 139) Error: Could not expand hostgroups and/or hosts specified in service (config file '/usr/local/nagios/etc/objects/windows.cfg', starting on line 139) Error processing object config files! ***> One or more problems was encountered while processing the config files... Check your configuration file(s) to ensure that they contain valid directives and data defintions. If you are upgrading from a previous version of Nagios, you should be aware that some variables/definitions may have been removed or modified in this version. Make sure to read the HTML documentation regarding the config files, as well as the 'Whats New' section to find out what has changed. [root at localhost ~]# kindly provide me the solution thanks and regards Prasanth Katakam hyderabad 9966655246 ------------------------------------old message--------------------------------------- On Tue, May 18, 2010 at 4:14 PM, Assaf Flatto wrote: > prasanth katakam wrote: > > Dear team, > > > > I was new to nagios and i have completed the installation of nagios > > successfully and please let me know how to add host machines(both > > linux and windows) in nagios server > > > > i tried to add windows machine to nagios with installing NSclient++ > > in windows machine but it is not getting added in nagios. > > > > kindly let me know the host adding process for both windows and linux > > > > Thanks and regards > > Prasanth.Katakam > > 9966655246 > > hyderabad > you may want to start here : > > http://nagios.sourceforge.net/docs/3_0/beginners.html > > After you read the documentation if you still can't figure it out , > post again. > > > > -- > Never,Ever Cut A Deal With a Dragon > > > I am doing a Charity Bike ride On the 27 of June for the > Capital to Coast Charity. Please help by Donating > http://www.justgiving.com/Lovefilm-capital-to-coast > > > > > ------------------------------------------------------------------------------ > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Prasanth.Katakam 9966655246 -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From shankumajumdar at gmail.com Tue May 18 15:19:04 2010 From: shankumajumdar at gmail.com (shanku majumdar) Date: Tue, 18 May 2010 18:49:04 +0530 Subject: how 2 add hosts to nagios In-Reply-To: References: <4BF26FAB.1090602@flatto.net> Message-ID: I would suggest you to go to the following file and comment out the lines which define the services of *winserver* this is a default server which is added as an example or provide a valid hostaddress for it. /usr/local/nagios/etc/objects/windows.cfg On Tue, May 18, 2010 at 6:21 PM, prasanth katakam < prasanth.katakam at gmail.com> wrote: > dear team, > > when i start nagios it is giving an error like > > [root at localhost ~]# service nagios restart > Running configuration check... CONFIG ERROR! Restart aborted. Check your > Nagios configuration. > > [root at localhost ~]# /usr/local/nagios/bin/nagios -v > /usr/local/nagios/etc/nagios.cfg > > Nagios Core 3.2.1 > Copyright (c) 2009-2010 Nagios Core Development Team and Community > Contributors > Copyright (c) 1999-2009 Ethan Galstad > Last Modified: 03-09-2010 > License: GPL > > Website: http://www.nagios.org > Reading configuration data... > Read main config file okay... > Processing object config file > '/usr/local/nagios/etc/objects/commands.cfg'... > Processing object config file > '/usr/local/nagios/etc/objects/contacts.cfg'... > Processing object config file > '/usr/local/nagios/etc/objects/timeperiods.cfg'... > Processing object config file > '/usr/local/nagios/etc/objects/templates.cfg'... > Processing object config file > '/usr/local/nagios/etc/objects/localhost.cfg'... > Processing object config file > '/usr/local/nagios/etc/objects/windows.cfg'... > Error: Could not find any host matching 'winserver' (config file > '/usr/local/nagios/etc/objects/windows.cfg', starting on line 139) > Error: Could not expand hostgroups and/or hosts specified in service > (config file '/usr/local/nagios/etc/objects/windows.cfg', starting on line > 139) > Error processing object config files! > > > ***> One or more problems was encountered while processing the config > files... > > Check your configuration file(s) to ensure that they contain valid > directives and data defintions. If you are upgrading from a previous > version of Nagios, you should be aware that some variables/definitions > may have been removed or modified in this version. Make sure to read > the HTML documentation regarding the config files, as well as the > 'Whats New' section to find out what has changed. > > [root at localhost ~]# > > > kindly provide me the solution > > thanks and regards > Prasanth Katakam > hyderabad > 9966655246 > > ------------------------------------old > message--------------------------------------- > > On Tue, May 18, 2010 at 4:14 PM, Assaf Flatto wrote: > >> prasanth katakam wrote: >> > Dear team, >> > >> > I was new to nagios and i have completed the installation of nagios >> > successfully and please let me know how to add host machines(both >> > linux and windows) in nagios server >> > >> > i tried to add windows machine to nagios with installing NSclient++ >> > in windows machine but it is not getting added in nagios. >> > >> > kindly let me know the host adding process for both windows and linux >> > >> > Thanks and regards >> > Prasanth.Katakam >> > 9966655246 >> > hyderabad >> you may want to start here : >> >> http://nagios.sourceforge.net/docs/3_0/beginners.html >> >> After you read the documentation if you still can't figure it out , >> post again. >> >> >> >> -- >> Never,Ever Cut A Deal With a Dragon >> >> >> I am doing a Charity Bike ride On the 27 of June for the >> Capital to Coast Charity. Please help by Donating >> http://www.justgiving.com/Lovefilm-capital-to-coast >> >> >> >> >> ------------------------------------------------------------------------------ >> >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when >> reporting any issue. >> ::: Messages without supporting info will risk being sent to /dev/null >> > > > > -- > Prasanth.Katakam > 9966655246 > > > ------------------------------------------------------------------------------ > > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- ShankuMajumdar -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Tue May 18 15:23:53 2010 From: ae at op5.se (Andreas Ericsson) Date: Tue, 18 May 2010 15:23:53 +0200 Subject: how 2 add hosts to nagios In-Reply-To: References: <4BF26FAB.1090602@flatto.net> Message-ID: <4BF294E9.2040909@op5.se> On 05/18/2010 02:51 PM, prasanth katakam wrote: > dear team, > > when i start nagios it is giving an error like > > [root at localhost ~]# service nagios restart > Running configuration check... CONFIG ERROR! Restart aborted. Check your > Nagios configuration. > > [root at localhost ~]# /usr/local/nagios/bin/nagios -v > /usr/local/nagios/etc/nagios.cfg > > Nagios Core 3.2.1 > Copyright (c) 2009-2010 Nagios Core Development Team and Community > Contributors > Copyright (c) 1999-2009 Ethan Galstad > Last Modified: 03-09-2010 > License: GPL > > Website: http://www.nagios.org > Reading configuration data... > Read main config file okay... > Processing object config file > '/usr/local/nagios/etc/objects/commands.cfg'... > Processing object config file > '/usr/local/nagios/etc/objects/contacts.cfg'... > Processing object config file > '/usr/local/nagios/etc/objects/timeperiods.cfg'... > Processing object config file > '/usr/local/nagios/etc/objects/templates.cfg'... > Processing object config file > '/usr/local/nagios/etc/objects/localhost.cfg'... > Processing object config file '/usr/local/nagios/etc/objects/windows.cfg'... > Error: Could not find any host matching 'winserver' (config file > '/usr/local/nagios/etc/objects/windows.cfg', starting on line 139) > Error: Could not expand hostgroups and/or hosts specified in service (config > file '/usr/local/nagios/etc/objects/windows.cfg', starting on line 139) > Error processing object config files! > > > ***> One or more problems was encountered while processing the config > files... > > Check your configuration file(s) to ensure that they contain valid > directives and data defintions. If you are upgrading from a previous > version of Nagios, you should be aware that some variables/definitions > may have been removed or modified in this version. Make sure to read > the HTML documentation regarding the config files, as well as the > 'Whats New' section to find out what has changed. > > [root at localhost ~]# > > > kindly provide me the solution > I think you've misunderstood how opensource assistance works. We do this in our spare time. We do not do it as a full-time job, babysitting people who won't read the documentation or are unable to use google to find a suitable answer before asking us. If you ask a proper question about a specific problem and show us that you've spent a bit of effort trying to understand and solve the problem yourself, we'd be more than willing to help. As it is now, the answer wouldn't mean anything to you, and you'd need assistance again in a very short while when, for example, you're having trouble getting notifications to where they should go. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace. ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lists at xodus.org Tue May 18 15:26:55 2010 From: lists at xodus.org (Marc Powell) Date: Tue, 18 May 2010 08:26:55 -0500 Subject: how 2 add hosts to nagios In-Reply-To: References: <4BF26FAB.1090602@flatto.net> Message-ID: On May 18, 2010, at 7:51 AM, prasanth katakam wrote: > Error: Could not find any host matching 'winserver' (config file '/usr/local/nagios/etc/objects/windows.cfg', starting on line 139) > Error: Could not expand hostgroups and/or hosts specified in service (config file '/usr/local/nagios/etc/objects/windows.cfg', starting on line 139) > Error processing object config files! The service{} definition starting on line 139 of windows.cfg references a host_name or hostgroup that does not exist. Verify that you've typed them correctly. If this is for a new host or hostgroup, create a host{} or hostgroup{} definition for it. -- Marc ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Matsushita.Nobuo at tchden.org Tue May 18 15:11:10 2010 From: Matsushita.Nobuo at tchden.org (Matsushita, Nobuo) Date: Tue, 18 May 2010 07:11:10 -0600 Subject: Command Access Message-ID: HI, I am using 3.2.1 version of Nagios. Is it possible to give HOST/COMMAND access to a user who Only can access to a specific host group? Thanks in advance. Nobuo ---------------------------------------------------------------------------------------------------------------------
CONFIDENTIALITY NOTICE: This e-mail is confidential, may be legally privileged,
and for the intended recipient only. Access, disclosure, copying, forwarding and
distribution by any means is strictly prohibited. If received in error,
do not read but delete and e-mail confirmation to the sender.
==========================================================
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From ck at iphh.net  Tue May 18 15:21:56 2010
From: ck at iphh.net (Christoph Kluenter)
Date: Tue, 18 May 2010 15:21:56 +0200
Subject: Distributed Monitoring best practices
Message-ID: <20100518132156.GB12924@iphh.net>

Hi everybody,

we are running multiple nagios Instances which all provide 
Test results to one node which runs the cgi's.
The problem with this setup is, that re-scheduling checks 
has to be done on the checking nodes and not on the node
with the cgi's. 
Is there any good solution to this problem ?

I am thinking about testing DNX ( dnx.sf.net )
But since one can't  define which check will run on which
node, we would have to reconfigure a lot of firewalls.
Would dnx be worth this hassle ? Any experiences ?

Cheers,
	Christoph


------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From lomiz.mail at gmail.com  Tue May 18 16:30:54 2010
From: lomiz.mail at gmail.com (Enrico Zimol)
Date: Tue, 18 May 2010 16:30:54 +0200
Subject: Distributed Monitoring best practices
In-Reply-To: <20100518132156.GB12924@iphh.net>
References: <20100518132156.GB12924@iphh.net>
Message-ID: 

On 18 May 2010 15:21, Christoph Kluenter  wrote:
>
> I am thinking about testing DNX ( dnx.sf.net )
> But since one can't ?define which check will run on which
> node, we would have to reconfigure a lot of firewalls.
> Would dnx be worth this hassle ? Any experiences ?

I'm interested too about it.
Any suggest to completly centrilized monitoring?

Thank's

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From patrick.morris at hp.com  Tue May 18 17:32:18 2010
From: patrick.morris at hp.com (Morris, Patrick)
Date: Tue, 18 May 2010 08:32:18 -0700
Subject: Command Access
In-Reply-To: 
References: 
Message-ID: <4BF2B302.40401@hp.com>

Matsushita, Nobuo wrote:
>
> HI,
>
>  
>
> I am using 3.2.1 version of Nagios. Is it possible to give 
> HOST/COMMAND access to a user who Only can access to a specific host 
> group?
>

Yes. If you've configured Nagios to use authentication, that's just how 
it works.

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From john at stilen.com  Tue May 18 17:44:22 2010
From: john at stilen.com (John Stile)
Date: Tue, 18 May 2010 08:44:22 -0700
Subject: set warning and cricitcal for ups capacity
In-Reply-To: <4BF23113.5020700@hp.com>
References: <1274119053.22852.27.camel@genx.localdomain>
	<4BF23113.5020700@hp.com>
Message-ID: <1274197462.32084.822.camel@genx.localdomain>

I didn't even think to check for a -h.  
This did it:
check_snmp -H 10.0.10.2 -C public -o .1.3.6.1.4.1.318.1.1.1.2.2.1.0 -l '% Capacity' -u '%' -w 90:20 -c 20:0
Thank you.
On Mon, 2010-05-17 at 23:17 -0700, Morris, Patrick wrote:
> Use "check_snmp -h" to get a description of valid parameters.  The 
> warning and critical parameters are ranges, not single integers.  The 
> notes describe the syntax and options.
> 
> John Stile wrote:
> > I would like to add a warning & critical  value to the following
> > check_snmp line in  /etc/nagios/nrpe.cfg, which monitors my ups.
> >
> > command[snmp_ups_capa]=/usr/lib/nagios/plugins/check_snmp -H 10.0.10.2
> > -C public -o .1.3.6.1.4.1.318.1.1.1.2.2.1.0 -l '% Capacity' -u '%'
> >
> > For a warning at 80% capacity and critical at 10%, it doesn't work to
> > simply append:  -w 80 -c 10
> >
> >
> > ------------------------------------------------------------------------------
> >
> > _______________________________________________
> > Nagios-users mailing list
> > Nagios-users at lists.sourceforge.net
> > https://lists.sourceforge.net/lists/listinfo/nagios-users
> > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
> > ::: Messages without supporting info will risk being sent to /dev/null
> >   
> 
> 
> ------------------------------------------------------------------------------
> 
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
> ::: Messages without supporting info will risk being sent to /dev/null


------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From nagios at chadmail.com  Tue May 18 17:42:15 2010
From: nagios at chadmail.com (nagios)
Date: Wed, 19 May 2010 01:42:15 +1000
Subject: SIGSEGV when trying to use eventhandler
Message-ID: 

Hi guys,
I am new to nagios but so far it's working well for me and is monitoring a 
number of real and virtual hosts. Nagios 3.0.6 is installed on an 
OpenSolaris 2009.06 host and monitoring routers other devices and VM's in 
VirtualBox.

My issue is when I try to add an event handler, I get a SIGSEGV and nagios 
restarts.


I have posted the details of the code I am using and the error 
here...http://pastebin.com/vBb7xTND and also below (but it reads better @ 
pastebin).

I have tried several different scripts and code combinations (even empty 
scripts and commands like ls) and all give the same error.

Can anyone help me work out why it's happening?

Thanks.

hosts.cfg

define host{
        use             windows-server  ; Inherit default values from a 
template
        host_name       Server6         ; The name we're giving to this host
        max_check_attempts              4
        event_handler   vboxmanage-restart ; Restart the vm
        alias           Server 6 - Win2008 Server       ; A longer name 
associated with the host
        address         192.168.0.6     ; IP address of the host
        }

 
commands.cfg - note I have tried various scripts here incl. ones from the 
nagios guides/books and all give the same error.

# 'vboxmanage_restart' command definition
define command{
        command_name vboxmanage-restart
#        command_line ls
        command_line sudo -u nas 
$USER1$/eventhandler/event_vboxmanage_restart -S $SERVICESTATE$ -T 
$SERVICESTATETYPE$ -A $SERVICEATTEMPT$ -H Server6
        }

 
nagios.log
[1274193005] HOST ALERT: Server6;DOWN;SOFT;1;PING CRITICAL - Packet loss = 
100%
[1274193005] Caught SIGSEGV, shutting down...
[1274193005] Nagios 3.0.6 starting... (PID=5231)
[1274193005] Local time is Wed May 19 00:30:05 EST 2010
[1274193005] LOG VERSION: 2.0
[1274193005] Finished daemonizing... (New PID=5232)
 
the scripts... (yes I know it should not be 777's but just to show it's not 
a permissions thing)
-rwxrwxrwx 1 nagios nagios 1580 2010-05-18 00:52 event_vboxmanage_restart
-rwxrwxrwx 1 nagios nagios 3815 2010-05-18 23:07 filename.out
-rwxrwxrwx 1 nagios nagios 2211 2010-05-19 00:23 restart-httpd
nas at nas:/usr/nagios/libexec/eventhandler# 
 
The script work fine from the user nagios using sudo (added nagios to 
/etc/sudoers)
nas at nas:?sr/nagios/libexec/eventhandler$ whoami                            
                                                               
nagios
nas at nas:?sr/nagios/libexec/eventhandler$ sudo -u nas 
./event_vboxmanage_restart -S CRITICAL -T HARD -A 1 -H Server6               
        
CRITICAL(C) 2005-2010 Sun Microsystems, Inc.
 
The event_vboxmanage_restart script...no that this is likely to be at fault 
(I do not think anyway as I get the error with other very simple scripts 
too).
#!/usr/bin/perl
 
use Getopt::Long;
use Net::Telnet ();
use Switch;
my ($state,$type,$attempt,$cmd,$hostname);
open(MYOUTFILE, ">>/usr/nagios/libexec/eventhandler/filename.out");
 
&processargs;
print "$state"; 
switch ($state) {
    case "OK"          { &state_OK }
    case "WARNING"     { &state_WARNING }
    case "UNKNOWN"     { &state_UNKNOWN }
    case "CRITICAL"    { &state_CRITICAL }
    else               { print "unrecognised state>$state" }
}
print MYOUTFILE">$state<";
print MYOUTFILE">$hostname<";
close(MYOUTFILE);
exit 0;
 
sub processargs {
 
GetOptions (
    "S|state=s" => \$state,
    "T|type=s" => \$type,
    "A|attempt=i" => \$attempt,
    "H|hostname=s" => \$hostname,
    "C|command=s" => \$cmd,
);
}
 
### FUNC: print $state
sub print_state {
}
### FUNC: print $state
sub state_OK {
}
### FUNC: print $state
sub state_WARNING {
}
### FUNC: print $state
sub state_UNKNOWN {
}
### FUNC: print $state
sub state_CRITICAL {
if ("$type" eq "HARD" or ("$type" eq "SOFT" and $attempt == 3)) 
{@result=`VBoxManage controlvm $hostname acpipowerbutton`; foreach (@result) 
{
  print MYOUTFILE"$_\n";
};sleep(60);@result=`VBoxManage controlvm $hostname poweroff`;foreach 
(@result) {
  print MYOUTFILE"$_\n";
}; @result=`VBoxManage startvm $hostname`; print "$result[1]";
}
    else            { }
}

 As you can see from the below, it all works fine (ie. no SIGSEGV's) if I 
comment out the eventhandler line from the hosts.cfg file.
[05-19-2010 01:33:50] SERVICE ALERT: 
Server6;Explorer;OK;HARD;1;Explorer.EXE: Running
[05-19-2010 01:32:50] SERVICE ALERT: Server6;Uptime;OK;HARD;1;System Uptime 
- 0 day(s) 0 hour(s) 9 minute(s)
[05-19-2010 01:32:40] SERVICE ALERT: Server6;C:\ Drive Space;OK;HARD;1;c:\ - 
total: 39.90 Gb - used: 9.19 Gb (23%) - free 30.71 Gb (77%)
[05-19-2010 01:32:10] SERVICE ALERT: Server6;CPU Load;OK;HARD;1;CPU Load 3% 
(5 min average)
[05-19-2010 01:25:00] HOST ALERT: Server6;UP;SOFT;4;PING OK - Packet loss = 
0%, RTA = 0.44 ms
[05-19-2010 01:23:50] SERVICE ALERT: 
Server6;Explorer;CRITICAL;HARD;1;Connection refused
[05-19-2010 01:23:50] HOST ALERT: Server6;DOWN;SOFT;3;PING CRITICAL - Packet 
loss = 100%
[05-19-2010 01:23:00] SERVICE ALERT: Server6;Uptime;CRITICAL;HARD;1;CRITICAL 
- Socket timeout after 10 seconds
[05-19-2010 01:22:50] SERVICE ALERT: Server6;C:\ Drive 
Space;CRITICAL;HARD;1;CRITICAL - Socket timeout after 10 seconds
[05-19-2010 01:22:30] HOST ALERT: Server6;DOWN;SOFT;2;PING CRITICAL - Packet 
loss = 100%
[05-19-2010 01:22:20] SERVICE ALERT: Server6;CPU 
Load;CRITICAL;HARD;1;CRITICAL - Socket timeout after 10 seconds
[05-19-2010 01:21:10] HOST ALERT: Server6;DOWN;SOFT;1;PING CRITICAL - Packet 
loss = 100%
[05-19-2010 01:21:00] SERVICE ALERT: Server6;Uptime;CRITICAL;SOFT;1;CRITICAL 
- Socket timeout after 10 seconds
[05-19-2010 01:20:50] SERVICE ALERT: Server6;C:\ Drive 
Space;CRITICAL;SOFT;1;CRITICAL - Socket timeout after 10 seconds
[05-19-2010 01:02:10] SERVICE ALERT: Server6;CPU Load;OK;SOFT;1;CPU Load 0% 
(5 min average)
[05-19-2010 01:00:50] SERVICE ALERT: Server6;Uptime;OK;SOFT;1;System Uptime 
- 0 day(s) 0 hour(s) 57 minute(s)
[05-19-2010 01:00:40] SERVICE ALERT: Server6;C:\ Drive Space;OK;SOFT;1;c:\ - 
total: 39.90 Gb - used: 9.19 Gb (23%) - free 30.71 Gb (77%)
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From lscarneiro at veltrac.com.br  Tue May 18 20:25:44 2010
From: lscarneiro at veltrac.com.br (Leonardo Carneiro - Veltrac)
Date: Tue, 18 May 2010 15:25:44 -0300
Subject: nagios without web interface
Message-ID: <4BF2DBA8.4000502@veltrac.com.br>

Hi. I want to compile nagios without the web interface. I think that i 
should include these parameters in the configure:

--disable-statusmap
--disable-statuswrl
--without-httpd-conf

Is this right? There is anything else that i should include (or exclude)?
-- 

*Leonardo de Souza Carneiro*
*Veltrac - Tecnologia em Log?stica.*
lscarneiro at veltrac.com.br 
http://www.veltrac.com.br 
/Fone Com.: (43)2105-5011/
/R. Par? 162 - CENTRO/
/Londrina- PR/
/Cep: 86010-450/


------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From mitsuto at gmail.com  Tue May 18 20:31:41 2010
From: mitsuto at gmail.com (Marcel)
Date: Tue, 18 May 2010 15:31:41 -0300
Subject: Full Throttle Nagios
Message-ID: 

Hi list,

Several times I'm asked about Nagios latency when the number of services go
over a certain number.

Is there any tweak to have nagios scale checkings more than it would after
tweaking every configuration option for boosting performance?

It seems that nagios don't do the best job at taking the hardware to its
real limits.

When I have more than, say, 10k checks, I start seen check latency rises and
there just isn't anything that could be done, even distributed monitoring
have the nagios.cmd write-lock bottleneck.

I'm asking if there is by design that nagios don't thrases the server when
the number of checks reaches certain limits.

Any thoughts would be appreciated.

Thank you
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From scot at wilcoxon.org  Tue May 18 22:02:47 2010
From: scot at wilcoxon.org (Scot Wilcoxon)
Date: Tue, 18 May 2010 15:02:47 -0500
Subject: Full Throttle Nagios
In-Reply-To: 
References: 
Message-ID: <1274212967.2375.26.camel@sewilco-laptop>

On Tue, 2010-05-18 at 15:31 -0300, Marcel wrote:
...
> It seems that nagios don't do the best job at taking the hardware to
> its real limits. 
> 
> When I have more than, say, 10k checks, I start seen check latency
> rises and there just isn't anything that could be done, even
> distributed monitoring have the nagios.cmd write-lock bottleneck. 

How busy was the CPU and disk?

> I'm asking if there is by design that nagios don't thrases the server
> when the number of checks reaches certain limits. 

I don't know enough about the design yet to know about throttling, but
the Nagios FAQ says that multiple copies of Nagios can run in one
machine if certain files are separate.  So if one copy isn't saturating
the machine, you might be able to run multiple "distributed" copies in
one machine.

Next we'll hear from someone who knows better why that is not a good
idea.


------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From perldork at webwizarddesign.com  Tue May 18 22:34:23 2010
From: perldork at webwizarddesign.com (Max)
Date: Tue, 18 May 2010 16:34:23 -0400
Subject: Full Throttle Nagios
In-Reply-To: 
References: 
Message-ID: 

On Tue, May 18, 2010 at 2:31 PM, Marcel  wrote:
> Hi list,
>
> Several times I'm asked about Nagios latency when the number of services go
> over a certain number.
>
> Is there any tweak to have nagios scale checkings more than it would after
> tweaking every configuration option for boosting performance?
>
> It seems that nagios don't do the best job at taking the hardware to its
> real limits.
>
> When I have more than, say, 10k checks, I start seen check latency rises and
> there just isn't anything that could be done, even distributed monitoring
> have the nagios.cmd write-lock bottleneck.

Can you post your nagios.cfg with comments and blank lines stripped
out?  please do strip out comments and blank lines so that it is not a
huge post :).

We did a small patch to remove a sleep time that nagios does between
non-runnable events, i did post it to the nagios developer's list, it
boosted our performance at higher polling density noticably.  Right
now we get about 2500 active host checks / 12500 active service checks
per nagios node with all checks completed every 5 minutes for services
and host checks running every 10 minutes.


- Max

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From mike-nagios at 5dninja.net  Tue May 18 22:18:15 2010
From: mike-nagios at 5dninja.net (Mike Lindsey)
Date: Tue, 18 May 2010 13:18:15 -0700
Subject: nagios without web interface
In-Reply-To: <4BF2DBA8.4000502@veltrac.com.br>
References: <4BF2DBA8.4000502@veltrac.com.br>
Message-ID: <4BF2F607.9090005@5dninja.net>

Leonardo Carneiro - Veltrac wrote:
> Hi. I want to compile nagios without the web interface. I think that i 
> should include these parameters in the configure:
> 
> --disable-statusmap
> --disable-statuswrl
> --without-httpd-conf
> 
> Is this right? There is anything else that i should include (or exclude)?

When you run make, just do:

make nagios
make install-base

You could also build everything, and just skip the cgi install, and that 
would probably take less time than getting your answer took.

-- 
Mike Lindsey

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From mike-nagios at 5dninja.net  Tue May 18 22:49:25 2010
From: mike-nagios at 5dninja.net (Mike Lindsey)
Date: Tue, 18 May 2010 13:49:25 -0700
Subject: Full Throttle Nagios
In-Reply-To: 
References: 
Message-ID: <4BF2FD55.8030904@5dninja.net>

Marcel wrote:
> When I have more than, say, 10k checks, I start seen check latency rises 
> and there just isn't anything that could be done, even distributed 
> monitoring have the nagios.cmd write-lock bottleneck.

So, I've just gone through this, and the single greatest bottleneck I 
had to deal with is notifications.  But, I have a lot of people in the 
notification tree, and pull in a lot of meta-data to make ticket 
tracking and issue resolution easier and faster.  Since Nagios needs to 
know the exit status of notification commands, it doesn't fork before 
notifications.. it just plods along waiting for the notification command 
to exit.

I switched all our non-pager notification commands to drop a spool file 
in a directory, letting another process read the spool files, generate 
email contents, query ticket databases, pull in documentation or 
extended testing information (full mysql processlist output, for dbas.. 
etc) and caching it for subsequent notifications for that event.

That showed a HUGE improvement to my master server's performance.

If notifications aren't your bottleneck, you can move all your temporary 
files to ramdisk.

You can also increase your FIFO pipe size, but that only delays the 
issue and doesn't really solve the problem if you're always running hot. 
  It also probably involves recompiling your kernel.

If you're using nsca, you can cache your updates for a second or two, so 
that multiple updates happen in the same socket connection.

Alternately (or additionally) you can have nsca update the checkresults 
directory, directly, skipping the steps where nagios reads the command 
pipe, and then just writes it back out to the checkresults directory.

I can package up a patch (against 2.7.2) of those last couple changes (I 
need to submit them, anyway).  If you're manlier than I might be, you 
could also consider modifying the core nagios to allow submissions from 
distributed nagios servers, directly to a socket, but doing that right 
might require serious threaded c foo, and depending on your OS and 
threading library, you might be locked to a single core.

So, you have options.  They're not all equal, and aren't all easy.  But 
you wouldn't be working with monitoring if you didn't like challenges...  :)

-- 
Mike Lindsey

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From trisha at rockyou.com  Tue May 18 23:37:55 2010
From: trisha at rockyou.com (Trisha Hoang)
Date: Tue, 18 May 2010 14:37:55 -0700
Subject: Full Throttle Nagios
In-Reply-To: <4BF2FD55.8030904@5dninja.net>
References: 
	<4BF2FD55.8030904@5dninja.net>
Message-ID: 

Marcel, are you using nagios2.x or 3.x? I'm testing out 3.2.0 and find that
nagios3.x performs pretty well, being multi-threaded and all.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From mitsuto at gmail.com  Tue May 18 23:41:21 2010
From: mitsuto at gmail.com (Marcel)
Date: Tue, 18 May 2010 18:41:21 -0300
Subject: Full Throttle Nagios
In-Reply-To: <4BF2FD55.8030904@5dninja.net>
References: 
	<4BF2FD55.8030904@5dninja.net>
Message-ID: 

Again, thank you for all the quick answers. This list/community is
awesome!!!

I'm already using tmpfs, increased named pipe buffer size, did everything
that one is supposed to do in order to increase performance.

I think I'd go with removing sleep calls in the code. I'm at version 3.2.1
and would love to have a look at Max's patch!

Notification is not my bottleneck, and this is not for my own nagios
install, it's for someone else, so I cannot post nagios.cfg here. Sorry.

But again, thanks for all the answers!!!

On Tue, May 18, 2010 at 5:49 PM, Mike Lindsey wrote:

> Marcel wrote:
> > When I have more than, say, 10k checks, I start seen check latency rises
> > and there just isn't anything that could be done, even distributed
> > monitoring have the nagios.cmd write-lock bottleneck.
>
> So, I've just gone through this, and the single greatest bottleneck I
> had to deal with is notifications.  But, I have a lot of people in the
> notification tree, and pull in a lot of meta-data to make ticket
> tracking and issue resolution easier and faster.  Since Nagios needs to
> know the exit status of notification commands, it doesn't fork before
> notifications.. it just plods along waiting for the notification command
> to exit.
>
> I switched all our non-pager notification commands to drop a spool file
> in a directory, letting another process read the spool files, generate
> email contents, query ticket databases, pull in documentation or
> extended testing information (full mysql processlist output, for dbas..
> etc) and caching it for subsequent notifications for that event.
>
> That showed a HUGE improvement to my master server's performance.
>
> If notifications aren't your bottleneck, you can move all your temporary
> files to ramdisk.
>
> You can also increase your FIFO pipe size, but that only delays the
> issue and doesn't really solve the problem if you're always running hot.
>  It also probably involves recompiling your kernel.
>
> If you're using nsca, you can cache your updates for a second or two, so
> that multiple updates happen in the same socket connection.
>
> Alternately (or additionally) you can have nsca update the checkresults
> directory, directly, skipping the steps where nagios reads the command
> pipe, and then just writes it back out to the checkresults directory.
>
> I can package up a patch (against 2.7.2) of those last couple changes (I
> need to submit them, anyway).  If you're manlier than I might be, you
> could also consider modifying the core nagios to allow submissions from
> distributed nagios servers, directly to a socket, but doing that right
> might require serious threaded c foo, and depending on your OS and
> threading library, you might be locked to a single core.
>
> So, you have options.  They're not all equal, and aren't all easy.  But
> you wouldn't be working with monitoring if you didn't like challenges...
>  :)
>
> --
> Mike Lindsey
>
>
> ------------------------------------------------------------------------------
>
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when
> reporting any issue.
> ::: Messages without supporting info will risk being sent to /dev/null
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From perldork at webwizarddesign.com  Wed May 19 00:02:36 2010
From: perldork at webwizarddesign.com (Max)
Date: Tue, 18 May 2010 18:02:36 -0400
Subject: Full Throttle Nagios
In-Reply-To: 
References:  
	<4BF2FD55.8030904@5dninja.net>
	
Message-ID: 

Patch can be found here:

http://github.com/netoisstools/Nagios-3-Patches/blob/master/events.c.patch

Download it to your nagios source directory then run

patch -p0 < events.c.patch

if you use the patch, make sure to set use_large_installation_tweaks
to 1 as it will only skip the sleep time if large installation tweaks
is enabled.

Regards,
Max

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From trisha at rockyou.com  Wed May 19 00:06:43 2010
From: trisha at rockyou.com (Trisha Hoang)
Date: Tue, 18 May 2010 15:06:43 -0700
Subject: Full Throttle Nagios
In-Reply-To: 
References: 
	<4BF2FD55.8030904@5dninja.net>
	
Message-ID: 

I spent couple weeks playing with 3.2.1 and found that it performs very well
with active checks (6500+ in 5 min at 1-2 sec latency max)  but could not
pass 5000 passive checks on the master server.

When switched to 3.2.0, it processes 7300-7500 passive checks out of 8055 at
0.2 sec latency  using directives use_large_installation_tweaks=1 and
child_processes_fork_twice=0.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From mmaxwell at blackarrow.tv  Wed May 19 00:39:30 2010
From: mmaxwell at blackarrow.tv (Michael Maxwell)
Date: Tue, 18 May 2010 15:39:30 -0700
Subject: localhost DOWN messages,
	return code 127 is out of bounds
Message-ID: 

Hello,

Just installed nagios 3.2.1 following the docs at
http://nagios.sourceforge.net/docs/3_0/quickstart.html

Everything starts up normally, web interface is fine, can login,
enable/disable checks, etc...  However, every 2 hours exactly,
I'm getting the following alert in email:

From: nagios
Subject: ** PROBLEM Host Alert: localhost is DOWN **

***** Nagios *****

Notification Type: PROBLEM
Host: localhost
State: DOWN
Address: 127.0.0.1
Info: (Return code of 127 is out of bounds - plugin may be missing)

Date/Time: Tue May 18 15:00:15 PDT 2010

If I go to check the services, everything is green.

This is coming from a fresh install with nothing special configured.

I've googled until my eyes glazed over and no one seems to report
this happening with "localhost".  I've checked for the plugins,
checked permissions, etc... The best I can tell, it's all as it
should be.  I?m sure I?ve missed some simple detail somewhere, but
it?s eluding me at the moment.

Any help on solving this problem would be greatly appreciated.

Below, I've included logfile contents as well as every detail
I could think of on how the system is setup.

Thanks

[1274215437] Nagios 3.2.1 starting... (PID=5185)[1274215437] Local time is Tue May 18 13:43:57 PDT 2010
[1274215437] LOG VERSION: 2.0
[1274215437] Finished daemonizing... (New PID=5186)[1274215447] Warning: Return code of 127 for check of host 'localhost' was out of bounds. Make sure the plugin you're trying to run actually e
xists.
[1274215447] HOST ALERT: localhost;DOWN;SOFT;1;(Return code of 127 is out of bounds - plugin may be missing)[1274215477] Warning: Return code of 127 for check of service 'Current Load' on host 'localhost' was out of bounds. Make sure the plugin you'r
e trying to run actually exists.
[1274215477] SERVICE ALERT: localhost;Current Load;CRITICAL;HARD;1;(Return code of 127 is out of bounds - plugin may be missing)[1274215517] Warning: Return code of 127 for check of host 'localhost' was out of bounds. Make sure the plugin you're trying to run actually e
xists.
[1274215517] HOST ALERT: localhost;DOWN;SOFT;2;(Return code of 127 is out of bounds - plugin may be missing)[1274215517] Warning: Return code of 127 for check of service 'Current Users' on host 'localhost' was out of bounds. Make sure the plugin you'
re trying to run actually exists.
[1274215517] SERVICE ALERT: localhost;Current Users;CRITICAL;HARD;1;(Return code of 127 is out of bounds - plugin may be missing)[1274215557] Warning: Return code of 127 for check of service 'HTTP' on host 'localhost' was out of bounds. Make sure the plugin you're trying
 to run actually exists.
[1274215557] SERVICE ALERT: localhost;HTTP;CRITICAL;HARD;1;(Return code of 127 is out of bounds - plugin may be missing)[1274215587] Warning: Return code of 127 for check of host 'localhost' was out of bounds. Make sure the plugin you're trying to run actually e
xists.
[1274215557] SERVICE ALERT: localhost;HTTP;CRITICAL;HARD;1;(Return code of 127 is out of bounds - plugin may be missing)[1274215587] Warning: Return code of 127 for check of host 'localhost' was out of bounds. Make sure the plugin you're trying to run actually e
xists.
[1274215587] HOST ALERT: localhost;DOWN;SOFT;3;(Return code of 127 is out of bounds - plugin may be missing)
[1274215597] Warning: Return code of 127 for check of service 'PING' on host 'localhost' was out of bounds. Make sure the plugin you're trying to run actually exists.
[1274215597] SERVICE ALERT: localhost;PING;CRITICAL;HARD;1;(Return code of 127 is out of bounds - plugin may be missing)
[1274215627] Warning: Return code of 127 for check of service 'Root Partition' on host 'localhost' was out of bounds. Make sure the plugin you're trying to run actually exists.
[1274215627] SERVICE ALERT: localhost;Root Partition;CRITICAL;HARD;1;(Return code of 127 is out of bounds - plugin may be missing)


Detailed information on the system, what's installed, etc...:

CentOS 5.4
Linux [deleted] 2.6.18-194.el5 #1 SMP Fri Apr 2 14:58:35 EDT 2010 i686 i686 i386 GNU/Linux

Nagios 3.2.1
Nagios Plugins 1.4.14
Apache httpd 2.2.15
PHP 5.3.2

$ grep nagios /etc/passwd
nagios:x:9000:9001::/home/nagios:/bin/bash

$ grep nag /etc/group
nagcmd:x:9000:apache
nagios:x:9001:nagios

Options used to build nagios:
./configure     --with-command-group=nagcmd \
                --prefix=/opt/nagios-3.2.1 \
                --with-nagios-user=nagios \
                --with-httpd-conf=/opt/httpd/conf

Options used to build plugins:
./configure     --with-nagios-user=nagios \
                --with-nagios-group=nagios \
                --prefix=/opt/nagios-3.2.1

Permissions of files in /opt/nagios-3.2.1/libexec:
$ ls -l /opt/nagios-3.2.1/libexec/
total 1360K
-rwxr-xr-x 1 nagios nagios 31052 May 18 13:48 check_apt*
-rwxr-xr-x 1 nagios nagios  2240 May 18 13:48 check_breeze*
-rwxr-xr-x 1 nagios nagios 38078 May 18 13:48 check_by_ssh*
lrwxrwxrwx 1 root   root       9 May 18 13:48 check_clamd -> check_tcp*
-rwxr-xr-x 1 nagios nagios 21516 May 18 13:48 check_cluster*
-r-sr-xr-x 1 root   nagios 34952 May 18 13:48 check_dhcp*
-rwxr-xr-x 1 nagios nagios 32640 May 18 13:48 check_dig*
-rwxr-xr-x 1 nagios nagios 48280 May 18 13:48 check_disk*
-rwxr-xr-x 1 nagios nagios  8075 May 18 13:48 check_disk_smb*
-rwxr-xr-x 1 nagios nagios 36777 May 18 13:48 check_dns*
-rwxr-xr-x 1 nagios nagios 18705 May 18 13:48 check_dummy*
-rwxr-xr-x 1 nagios nagios  3051 May 18 13:48 check_file_age*
-rwxr-xr-x 1 nagios nagios  6313 May 18 13:48 check_flexlm*
lrwxrwxrwx 1 root   root       9 May 18 13:48 check_ftp -> check_tcp*
-rwxr-xr-x 1 nagios nagios 85634 May 18 13:48 check_http*
-r-sr-xr-x 1 root   nagios 39620 May 18 13:48 check_icmp*
-rwxr-xr-x 1 nagios nagios 23743 May 18 13:48 check_ide_smart*
-rwxr-xr-x 1 nagios nagios 15132 May 18 13:48 check_ifoperstatus*
-rwxr-xr-x 1 nagios nagios 12518 May 18 13:48 check_ifstatus*
lrwxrwxrwx 1 root   root       9 May 18 13:48 check_imap -> check_tcp*
-rwxr-xr-x 1 nagios nagios  7350 May 18 13:48 check_ircd*
lrwxrwxrwx 1 root   root       9 May 18 13:48 check_jabber -> check_tcp*
-rwxr-xr-x 1 nagios nagios 30014 May 18 13:48 check_ldap*
lrwxrwxrwx 1 root   root      10 May 18 13:48 check_ldaps -> check_ldap*
-rwxr-xr-x 1 nagios nagios 28714 May 18 13:48 check_load*
-rwxr-xr-x 1 nagios nagios  6020 May 18 13:48 check_log*
-rwxr-xr-x 1 nagios nagios 20282 May 18 13:48 check_mailq*
-rwxr-xr-x 1 nagios nagios 25812 May 18 13:48 check_mrtg*
-rwxr-xr-x 1 nagios nagios 24295 May 18 13:48 check_mrtgtraf*
-rwxr-xr-x 1 nagios nagios 28548 May 18 13:48 check_nagios*
lrwxrwxrwx 1 root   root       9 May 18 13:48 check_nntp -> check_tcp*
lrwxrwxrwx 1 root   root       9 May 18 13:48 check_nntps -> check_tcp*
-rwxr-xr-x 1 nagios nagios 38266 May 18 13:48 check_nt*
-rwxr-xr-x 1 nagios nagios 35917 May 18 13:48 check_ntp*
-rwxr-xr-x 1 nagios nagios 33889 May 18 13:48 check_ntp_peer*
-rwxr-xr-x 1 nagios nagios 31725 May 18 13:48 check_ntp_time*
-rwxr-xr-x 1 nagios nagios 54192 May 18 13:48 check_nwstat*
-rwxr-xr-x 1 nagios nagios  8324 May 18 13:48 check_oracle*
-rwxr-xr-x 1 nagios nagios 30356 May 18 13:48 check_overcr*
-rwxr-xr-x 1 nagios nagios 41387 May 18 13:48 check_ping*
lrwxrwxrwx 1 root   root       9 May 18 13:48 check_pop -> check_tcp*
-rwxr-xr-x 1 nagios nagios 39647 May 18 13:48 check_procs*
-rwxr-xr-x 1 nagios nagios 29078 May 18 13:48 check_real*
-rwxr-xr-x 1 nagios nagios  9579 May 18 13:48 check_rpc*
-rwxr-xr-x 1 nagios nagios  1135 May 18 13:48 check_sensors*
lrwxrwxrwx 1 root   root       9 May 18 13:48 check_simap -> check_tcp*
-rwxr-xr-x 1 nagios nagios 51895 May 18 13:48 check_smtp*
lrwxrwxrwx 1 root   root       9 May 18 13:48 check_spop -> check_tcp*
-rwxr-xr-x 1 nagios nagios 26231 May 18 13:48 check_ssh*
lrwxrwxrwx 1 root   root       9 May 18 13:48 check_ssmtp -> check_tcp*
-rwxr-xr-x 1 nagios nagios 30682 May 18 13:48 check_swap*
-rwxr-xr-x 1 nagios nagios 38940 May 18 13:48 check_tcp*
-rwxr-xr-x 1 nagios nagios 28909 May 18 13:48 check_time*
lrwxrwxrwx 1 root   root       9 May 18 13:48 check_udp -> check_tcp*
-rwxr-xr-x 1 nagios nagios 34400 May 18 13:48 check_ups*
-rwxr-xr-x 1 nagios nagios 27485 May 18 13:48 check_users*
-rwxr-xr-x 1 nagios nagios  2934 May 18 13:48 check_wave*
-rwxr-xr-x 1 nagios nagios 28787 May 18 13:48 negate*
-rwxr-xr-x 1 nagios nagios 27484 May 18 13:48 urlize*
-rwxr-xr-x 1 nagios nagios  1939 May 18 13:48 utils.pm*
-rwxr-xr-x 1 nagios nagios   862 May 18 13:48 utils.sh*

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From scot at wilcoxon.org  Wed May 19 00:40:22 2010
From: scot at wilcoxon.org (Scot Wilcoxon)
Date: Tue, 18 May 2010 17:40:22 -0500
Subject: Full Throttle Nagios
In-Reply-To: 
References: 
	<4BF2FD55.8030904@5dninja.net>
	
Message-ID: <1274222422.2375.38.camel@sewilco-laptop>

On Tue, 2010-05-18 at 14:37 -0700, Trisha Hoang wrote:
> Marcel, are you using nagios2.x or 3.x? I'm testing out 3.2.0 and find
> that nagios3.x performs pretty well, being multi-threaded and all.

>From other comments, there are presently delays inherent in various
tasks.  So does the 3.2.0 multitasking work around all the bottlenecks
or is there still a known bottleneck for which a hierarchy of
distributed Nagios can be used to work around the problem?


------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From kyle.bader at gmail.com  Wed May 19 00:46:42 2010
From: kyle.bader at gmail.com (Kyle Bader)
Date: Tue, 18 May 2010 15:46:42 -0700
Subject: Full Throttle Nagios
In-Reply-To: 
References: 
	
Message-ID: 

I'm trying to get 80k service checks w/ about 1500 hosts and would be
very interested in testing your patch, what was the name of the post
you made so I can give it a whirl?

On 5/18/10, Max  wrote:
> On Tue, May 18, 2010 at 2:31 PM, Marcel  wrote:
>> Hi list,
>>
>> Several times I'm asked about Nagios latency when the number of services
>> go
>> over a certain number.
>>
>> Is there any tweak to have nagios scale checkings more than it would after
>> tweaking every configuration option for boosting performance?
>>
>> It seems that nagios don't do the best job at taking the hardware to its
>> real limits.
>>
>> When I have more than, say, 10k checks, I start seen check latency rises
>> and
>> there just isn't anything that could be done, even distributed monitoring
>> have the nagios.cmd write-lock bottleneck.
>
> Can you post your nagios.cfg with comments and blank lines stripped
> out?  please do strip out comments and blank lines so that it is not a
> huge post :).
>
> We did a small patch to remove a sleep time that nagios does between
> non-runnable events, i did post it to the nagios developer's list, it
> boosted our performance at higher polling density noticably.  Right
> now we get about 2500 active host checks / 12500 active service checks
> per nagios node with all checks completed every 5 minutes for services
> and host checks running every 10 minutes.
>
>
> - Max
>
> ------------------------------------------------------------------------------
>
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when reporting
> any issue.
> ::: Messages without supporting info will risk being sent to /dev/null
>

-- 
Sent from my mobile device


Kyle

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From kyle.bader at gmail.com  Wed May 19 00:58:44 2010
From: kyle.bader at gmail.com (Kyle Bader)
Date: Tue, 18 May 2010 15:58:44 -0700
Subject: Full Throttle Nagios
In-Reply-To: 
References: 
	
	
Message-ID: 

Thanks max, ill try that!

On 5/18/10, Kyle Bader  wrote:
> I'm trying to get 80k service checks w/ about 1500 hosts and would be
> very interested in testing your patch, what was the name of the post
> you made so I can give it a whirl?
>
> On 5/18/10, Max  wrote:
>> On Tue, May 18, 2010 at 2:31 PM, Marcel  wrote:
>>> Hi list,
>>>
>>> Several times I'm asked about Nagios latency when the number of services
>>> go
>>> over a certain number.
>>>
>>> Is there any tweak to have nagios scale checkings more than it would
>>> after
>>> tweaking every configuration option for boosting performance?
>>>
>>> It seems that nagios don't do the best job at taking the hardware to its
>>> real limits.
>>>
>>> When I have more than, say, 10k checks, I start seen check latency rises
>>> and
>>> there just isn't anything that could be done, even distributed
>>> monitoring
>>> have the nagios.cmd write-lock bottleneck.
>>
>> Can you post your nagios.cfg with comments and blank lines stripped
>> out?  please do strip out comments and blank lines so that it is not a
>> huge post :).
>>
>> We did a small patch to remove a sleep time that nagios does between
>> non-runnable events, i did post it to the nagios developer's list, it
>> boosted our performance at higher polling density noticably.  Right
>> now we get about 2500 active host checks / 12500 active service checks
>> per nagios node with all checks completed every 5 minutes for services
>> and host checks running every 10 minutes.
>>
>>
>> - Max
>>
>> ------------------------------------------------------------------------------
>>
>> _______________________________________________
>> Nagios-users mailing list
>> Nagios-users at lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/nagios-users
>> ::: Please include Nagios version, plugin version (-v) and OS when
>> reporting
>> any issue.
>> ::: Messages without supporting info will risk being sent to /dev/null
>>
>
> --
> Sent from my mobile device
>
>
> Kyle
>

-- 
Sent from my mobile device


Kyle

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From bugfood-ml at fatooh.org  Wed May 19 03:29:56 2010
From: bugfood-ml at fatooh.org (Corey Hickey)
Date: Tue, 18 May 2010 18:29:56 -0700
Subject: trying to fix problem with excessive latency
Message-ID: <4BF33F14.9070605@fatooh.org>

Hello,

I have inherited maintenance of a medium-sized Nagios installation. We 
currently have 649 hosts and 5415 services. Our setup works nicely, with 
one exception: Nagios falls behind on host/service checks. Our usual 
latency once Nagios has been running for a while is about 190-200 
seconds. Our Nagios host is reasonably powerful and isn't struggling; it 
seems that Nagios itself is limited somehow.

I've searched google and read every relevant document I could find, 
including the tuning page:

http://nagios.sourceforge.net/docs/3_0/tuning.html

So far I haven't been able to find anything wrong with our 
configuration, and my experimental tuning hasn't resulted in any 
improvement. As far as I can tell, Nagios is scheduling the host/service 
checks properly, but not processing the queue aggressively enough.

Some notes:

1. The Nagios host has 8 2GHz cores and is usually 75-85% idle. Out of 4 
GB of memory, 1.2 GB is free, with no swap usage. We don't seem to be 
running into any physical limitations.

2. Raising max_concurrent_checks doesn't help; 'nagios -s' recommends a 
value of at least 599, so we're using 1200. I've tried absurdly high 
values like 6000, with no improvement.

3. Lowering service_reaper_frequency to 2 doesn't seem to help; in any 
case, our latency of 190 is way higher than the service_reaper_frequency.

4. I tried setting max_check_result_reaper_time to 30; no change. I 
don't know what I should set this to.

5. I tried disabling all host check scheduling (setting check_interval 
to 0 in our host template); that may have helped (I'm seeing 173 second 
latency instead of 190) but didn't really solve the problem.

I'm attaching our main nagios.cfg file and including the output of 
nagiostats below.

The host is running 64-bit CentOS 5.4 with a 2.6.18 kernel.

-----------------------------------------------------------------------
Nagios Stats 3.2.1
Copyright (c) 2003-2008 Ethan Galstad (www.nagios.org)
Last Modified: 03-09-2010
License: GPL

CURRENT STATUS DATA
------------------------------------------------------
Status File:                            /var/log/nagios/status.log
Status File Age:                        0d 0h 0m 6s
Status File Version:                    3.2.1

Program Running Time:                   0d 0h 18m 22s
Nagios PID:                             1556
Used/High/Total Command Buffers:        0 / 0 / 4096

Total Services:                         5415
Services Checked:                       5415
Services Scheduled:                     5415
Services Actively Checked:              5415
Services Passively Checked:             0
Total Service State Change:             0.000 / 30.390 / 0.024 %
Active Service Latency:                 5.878 / 197.462 / 194.633 sec
Active Service Execution Time:          0.020 / 120.007 / 0.847 sec
Active Service State Change:            0.000 / 30.390 / 0.024 %
Active Services Last 1/5/15/60 min:     767 / 4236 / 5412 / 5415
Passive Service Latency:                0.000 / 0.000 / 0.000 sec
Passive Service State Change:           0.000 / 0.000 / 0.000 %
Passive Services Last 1/5/15/60 min:    0 / 0 / 0 / 0
Services Ok/Warn/Unk/Crit:              5358 / 6 / 0 / 51
Services Flapping:                      1
Services In Downtime:                   22

Total Hosts:                            649
Hosts Checked:                          649
Hosts Scheduled:                        649
Hosts Actively Checked:                 649
Host Passively Checked:                 0
Total Host State Change:                0.000 / 0.000 / 0.000 %
Active Host Latency:                    0.000 / 196.614 / 194.274 sec
Active Host Execution Time:             0.020 / 11.019 / 0.069 sec
Active Host State Change:               0.000 / 0.000 / 0.000 %
Active Hosts Last 1/5/15/60 min:        91 / 506 / 649 / 649
Passive Host Latency:                   0.000 / 0.000 / 0.000 sec
Passive Host State Change:              0.000 / 0.000 / 0.000 %
Passive Hosts Last 1/5/15/60 min:       0 / 0 / 0 / 0
Hosts Up/Down/Unreach:                  646 / 3 / 0
Hosts Flapping:                         0
Hosts In Downtime:                      0

Active Host Checks Last 1/5/15 min:     101 / 536 / 1609
    Scheduled:                           98 / 520 / 1562
    On-demand:                           3 / 16 / 47
    Parallel:                            99 / 522 / 1566
    Serial:                              0 / 0 / 0
    Cached:                              3 / 15 / 44
Passive Host Checks Last 1/5/15 min:    0 / 0 / 0
Active Service Checks Last 1/5/15 min:  872 / 4360 / 13101
    Scheduled:                           872 / 4360 / 13101
    On-demand:                           0 / 0 / 0
    Cached:                              0 / 0 / 0
Passive Service Checks Last 1/5/15 min: 0 / 0 / 0

External Commands Last 1/5/15 min:      0 / 0 / 0
-----------------------------------------------------------------------

I have a feeling I'm missing something.... I would appreciate any 
suggestions.

Thanks,
Corey
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: nagios.cfg
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From patrick.morris at hp.com  Wed May 19 04:34:30 2010
From: patrick.morris at hp.com (patrick.morris at hp.com)
Date: Tue, 18 May 2010 19:34:30 -0700
Subject: Full Throttle Nagios
In-Reply-To: 
References: 
Message-ID: <20100519023430.GH17129@bakgwai.americas.hpqcorp.net>

Hi Marcel!

On Tue, 18 May 2010, Marcel wrote:

> Hi list,
> 
> Several times I'm asked about Nagios latency when the number of services go over a certain number.
> 
> Is there any tweak to have nagios scale checkings more than it would after tweaking every configuration option for boosting performance?
> 
> It seems that nagios don't do the best job at taking the hardware to its real limits.
> 
> When I have more than, say, 10k checks, I start seen check latency rises and there just isn't anything that could be done, even distributed monitoring have the nagios.cmd write-lock bottleneck.
> 
> I'm asking if there is by design that nagios don't thrases the server when the number of checks reaches certain limits.

In my experience, the single biggest bottleneck in the number of checks
Nagios can perform without going belly-up is the OS pipe size.

In particular, this has been an issue for me on Linux, where it's not
hard to have check results overflow a 4K pipe (the Linux kernel default)
faster than Nagios can be configured to empty it out.

The good news for me has been that it's relatively easy to make that
bottleneck go away by recompiling a kernel that supports larger pipes.

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From mgagne at iweb.com  Wed May 19 03:39:21 2010
From: mgagne at iweb.com (=?ISO-8859-1?Q?Mathieu_Gagn=E9?=)
Date: Tue, 18 May 2010 21:39:21 -0400
Subject: trying to fix problem with excessive latency
In-Reply-To: <4BF33F14.9070605@fatooh.org>
References: <4BF33F14.9070605@fatooh.org>
Message-ID: <4BF34149.3060006@iweb.com>

On 2010-05-18 21:29, Corey Hickey wrote:
> Hello,
>
> I have inherited maintenance of a medium-sized Nagios installation. We
> currently have 649 hosts and 5415 services. Our setup works nicely, with
> one exception: Nagios falls behind on host/service checks. Our usual
> latency once Nagios has been running for a while is about 190-200
> seconds. Our Nagios host is reasonably powerful and isn't struggling; it
> seems that Nagios itself is limited somehow.

Try disabling "enable_environment_macros". It helped use greatly when we 
disabled this option. Nagios was spending most of its time allocating 
memory for environment variables it never used. (in our case)

Host/Service checks latency dropped from ~20 minutes to 10 seconds I 
would say.

-- 
Mathieu

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From lomiz.mail at gmail.com  Wed May 19 10:42:12 2010
From: lomiz.mail at gmail.com (Enrico Zimol)
Date: Wed, 19 May 2010 10:42:12 +0200
Subject: NRPE with command arguments
In-Reply-To: <4BF1609D.7010908@flatto.net>
References: 
	<4BF1609D.7010908@flatto.net>
Message-ID: 

>after that - change the testing to work the way you want it

Hello, i'm testing that but I can't understand what is wrong on my
configurations.
(blame_nrpe e ricompiling it's ok)
On nrpe.cfg I defined this command:
command[check_disk]=/usr/local/nagios/libexec/check_disk -w $ARG1$ -c $ARG2$


And on nagios server I've this definitions:
http://pastebin.com/7uZRcMk9

Also enabling debug on nrpe deamon I see nothing during the command on messages:
May 19 08:21:54 blabla xinetd[24320]: START: nrpe pid=24336 from=192.168.234.234
May 19 08:21:54 blabla nrpe[24336]: INFO: SSL/TLS initialized. All
network traffic will be encrypted.
May 19 08:21:54 blabla xinetd[24320]: EXIT: nrpe status=0 pid=24336
duration=0(sec)


Any suggest or documentation?

Thank's!

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From ludse.verhoeven at vopak.com  Wed May 19 11:13:37 2010
From: ludse.verhoeven at vopak.com (Ludse Verhoeven)
Date: Wed, 19 May 2010 11:13:37 +0200
Subject: NRPE with command arguments
In-Reply-To: 
References: 
Message-ID: 

Hi there,

Your final command to invoke check_disk is 
"check_nrpe!check_disk!$_HOSTDISKWARNING$!$_HOSTDISKCRITICAL$". Passing 
this to check_nrpe will only get check_disk as an argument passed through 
to nrpe (since your check_nrpe-command is "$USER1$/check_nrpe -H 
$HOSTADDRESS$ -p $_HOSTNRPE_PORT$ -c $ARG1$"). This in turn starts the 
check_disk-command via NRPE without any arguments (because 
'_HOSTDISKWARNING' and '_HOSTDISKCRITICAL' will get passed as ARG2 and 
ARG3 to check_nrpe) and in the end will cause check_disk only see a -w and 
-c option without thresholds.

I think what needs to be done is pass the whole command with escaped 
bang-characters to check_nrpe like this:
check_nrpe!check_disk\!$_HOSTDISKWARNING$\!$_HOSTDISKCRITICAL$
or maybe even enclose it in single or double quotes so it gets passed 
through as 1 argument to check_nrpe.....

Not sure if this works like it is supposed to (haven't tested it myself), 
but qua syntax this seems more correct.

HTH

Met vriendelijke groet/with kind regards,
          Ludse Verhoeven
 
AIX System Engineer
T: +31 10 400 2440
F: +31 10 400 2891
E: ludse.verhoeven at vopak.com


Visit / Postal Address: 

http://www.vopak.com






Enrico Zimol  
19/05/2010 10:43
Please respond to
Nagios Users List 


To
Nagios Users List 
cc

Subject
Re: [Nagios-users] NRPE with command arguments






>after that - change the testing to work the way you want it

Hello, i'm testing that but I can't understand what is wrong on my
configurations.
(blame_nrpe e ricompiling it's ok)
On nrpe.cfg I defined this command:
command[check_disk]=/usr/local/nagios/libexec/check_disk -w $ARG1$ -c 
$ARG2$


And on nagios server I've this definitions:
http://pastebin.com/7uZRcMk9

Also enabling debug on nrpe deamon I see nothing during the command on 
messages:
May 19 08:21:54 blabla xinetd[24320]: START: nrpe pid=24336 
from=192.168.234.234
May 19 08:21:54 blabla nrpe[24336]: INFO: SSL/TLS initialized. All
network traffic will be encrypted.
May 19 08:21:54 blabla xinetd[24320]: EXIT: nrpe status=0 pid=24336
duration=0(sec)


Any suggest or documentation?

Thank's!

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when 
reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From nagios at chadmail.com  Wed May 19 11:49:52 2010
From: nagios at chadmail.com (nagios)
Date: Wed, 19 May 2010 19:49:52 +1000
Subject: Anyone? :  SIGSEGV when trying to use eventhandler
In-Reply-To: 
References: 
Message-ID: 

Anybody?

If you need extra information, just let me know what you need to see and 
I'll upload it.

Thanks.

-----Original Message-----
From: "nagios" 
To: nagios-users at lists.sourceforge.net
Date: Wed, 19 May 2010 01:42:15 +1000
Subject: [Nagios-users] SIGSEGV when trying to use eventhandler

Hi guys,
    I am new to nagios but so far it's working well for me and is monitoring 
a number of real and virtual hosts. Nagios 3.0.6 is installed on an 
OpenSolaris 2009.06 host and monitoring routers other devices and VM's in 
VirtualBox.
 
My issue is when I try to add an event handler, I get a SIGSEGV and nagios 
restarts.
 
 
I have posted the details of the code I am using and the error 
here...http://pastebin.com/vBb7xTND and also below (but it reads better @ 
pastebin).
 
I have tried several different scripts and code combinations (even empty 
scripts and commands like ls) and all give the same error.
 
Can anyone help me work out why it's happening?
 
Thanks.
 
hosts.cfg

define host{
use windows-server ; Inherit default values from a template
host_name Server6 ; The name we're giving to this host
max_check_attempts 4
event_handler vboxmanage-restart ; Restart the vm
alias Server 6 - Win2008 Server ; A longer name associated with the host
address 192.168.0.6 ; IP address of the host
}


commands.cfg - note I have tried various scripts here incl. ones from the 
nagios guides/books and all give the same error.

# 'vboxmanage_restart' command definition
define command{
command_name vboxmanage-restart
# command_line ls
command_line sudo -u nas $USER1$/eventhandler/event_vboxmanage_restart -S 
$SERVICESTATE$ -T $SERVICESTATETYPE$ -A $SERVICEATTEMPT$ -H Server6
}


nagios.log
[1274193005] HOST ALERT: Server6;DOWN;SOFT;1;PING CRITICAL - Packet loss = 
100%
[1274193005] Caught SIGSEGV, shutting down...
[1274193005] Nagios 3.0.6 starting... (PID=5231)
[1274193005] Local time is Wed May 19 00:30:05 EST 2010
[1274193005] LOG VERSION: 2.0
[1274193005] Finished daemonizing... (New PID=5232)

the scripts... (yes I know it should not be 777's but just to show it's not 
a permissions thing)
-rwxrwxrwx 1 nagios nagios 1580 2010-05-18 00:52 event_vboxmanage_restart
-rwxrwxrwx 1 nagios nagios 3815 2010-05-18 23:07 filename.out
-rwxrwxrwx 1 nagios nagios 2211 2010-05-19 00:23 restart-httpd
nas at nas:/usr/nagios/libexec/eventhandler# 

The script work fine from the user nagios using sudo (added nagios to 
/etc/sudoers)
nas at nas:?sr/nagios/libexec/eventhandler$ whoami 
nagios
nas at nas:?sr/nagios/libexec/eventhandler$ sudo -u nas 
./event_vboxmanage_restart -S CRITICAL -T HARD -A 1 -H Server6 
CRITICAL(C) 2005-2010 Sun Microsystems, Inc.

The event_vboxmanage_restart script...no that this is likely to be at fault 
(I do not think anyway as I get the error with other very simple scripts 
too).
#!/usr/bin/perl

use Getopt::Long;
use Net::Telnet ();
use Switch;
my ($state,$type,$attempt,$cmd,$hostname);
open(MYOUTFILE, ">>/usr/nagios/libexec/eventhandler/filename.out");

&processargs;
print "$state"; 
switch ($state) {
case "OK" { &state_OK }
case "WARNING" { &state_WARNING }
case "UNKNOWN" { &state_UNKNOWN }
case "CRITICAL" { &state_CRITICAL }
else { print "unrecognised state>$state" }
}
print MYOUTFILE">$state<";
print MYOUTFILE">$hostname<";
close(MYOUTFILE);
exit 0;

sub processargs {

GetOptions (
"S|state=s" => \$state,
"T|type=s" => \$type,
"A|attempt=i" => \$attempt,
"H|hostname=s" => \$hostname,
"C|command=s" => \$cmd,
);
}

### FUNC: print $state
sub print_state {
}
### FUNC: print $state
sub state_OK {
}
### FUNC: print $state
sub state_WARNING {
}
### FUNC: print $state
sub state_UNKNOWN {
}
### FUNC: print $state
sub state_CRITICAL {
if ("$type" eq "HARD" or ("$type" eq "SOFT" and $attempt == 3)) 
{@result=`VBoxManage controlvm $hostname acpipowerbutton`; foreach (@result) 
{
print MYOUTFILE"$_\n";
};sleep(60);@result=`VBoxManage controlvm $hostname poweroff`;foreach 
(@result) {
print MYOUTFILE"$_\n";
}; @result=`VBoxManage startvm $hostname`; print "$result[1]";
}
else { }
}

 As you can see from the below, it all works fine (ie. no SIGSEGV's) if I 
comment out the eventhandler line from the hosts.cfg file.
[05-19-2010 01:33:50] SERVICE ALERT: 
Server6;Explorer;OK;HARD;1;Explorer.EXE: Running
[05-19-2010 01:32:50] SERVICE ALERT: Server6;Uptime;OK;HARD;1;System Uptime 
- 0 day(s) 0 hour(s) 9 minute(s)
[05-19-2010 01:32:40] SERVICE ALERT: Server6;C:\ Drive Space;OK;HARD;1;c:\ - 
total: 39.90 Gb - used: 9.19 Gb (23%) - free 30.71 Gb (77%)
[05-19-2010 01:32:10] SERVICE ALERT: Server6;CPU Load;OK;HARD;1;CPU Load 3% 
(5 min average)
[05-19-2010 01:25:00] HOST ALERT: Server6;UP;SOFT;4;PING OK - Packet loss = 
0%, RTA = 0.44 ms
[05-19-2010 01:23:50] SERVICE ALERT: 
Server6;Explorer;CRITICAL;HARD;1;Connection refused
[05-19-2010 01:23:50] HOST ALERT: Server6;DOWN;SOFT;3;PING CRITICAL - Packet 
loss = 100%
[05-19-2010 01:23:00] SERVICE ALERT: Server6;Uptime;CRITICAL;HARD;1;CRITICAL 
- Socket timeout after 10 seconds
[05-19-2010 01:22:50] SERVICE ALERT: Server6;C:\ Drive 
Space;CRITICAL;HARD;1;CRITICAL - Socket timeout after 10 seconds
[05-19-2010 01:22:30] HOST ALERT: Server6;DOWN;SOFT;2;PING CRITICAL - Packet 
loss = 100%
[05-19-2010 01:22:20] SERVICE ALERT: Server6;CPU 
Load;CRITICAL;HARD;1;CRITICAL - Socket timeout after 10 seconds
[05-19-2010 01:21:10] HOST ALERT: Server6;DOWN;SOFT;1;PING CRITICAL - Packet 
loss = 100%
[05-19-2010 01:21:00] SERVICE ALERT: Server6;Uptime;CRITICAL;SOFT;1;CRITICAL 
- Socket timeout after 10 seconds
[05-19-2010 01:20:50] SERVICE ALERT: Server6;C:\ Drive 
Space;CRITICAL;SOFT;1;CRITICAL - Socket timeout after 10 seconds
[05-19-2010 01:02:10] SERVICE ALERT: Server6;CPU Load;OK;SOFT;1;CPU Load 0% 
(5 min average)
[05-19-2010 01:00:50] SERVICE ALERT: Server6;Uptime;OK;SOFT;1;System Uptime 
- 0 day(s) 0 hour(s) 57 minute(s)
[05-19-2010 01:00:40] SERVICE ALERT: Server6;C:\ Drive Space;OK;SOFT;1;c:\ - 
total: 39.90 Gb - used: 9.19 Gb (23%) - free 30.71 Gb (77%)
 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From guidosh at gmail.com  Wed May 19 12:07:52 2010
From: guidosh at gmail.com (Guy Waugh)
Date: Wed, 19 May 2010 11:07:52 +0100
Subject: Anyone? : SIGSEGV when trying to use eventhandler
In-Reply-To: 
References: 
	
Message-ID: 

I'm definitely no expert but...
* What does it say when you 'ldd' the nagios binary? Are all the libraries
the binary is linked against able to be found? Are those libraries
up-to-date?
* Where did you get nagios from? Did you compile it or is it pre-built? If
pre-built, are there any updates?
* I don't know Solaris well enough to know how to trace your running nagios
with a very simple configuration, but that might be the next step. strace?

On 19 May 2010 10:49, nagios  wrote:

>  Anybody?
>
> If you need extra information, just let me know what you need to see and
> I'll upload it.
>
> Thanks.
>
> -----Original Message-----
> From: "nagios" 
> To: nagios-users at lists.sourceforge.net
> Date: Wed, 19 May 2010 01:42:15 +1000
> Subject: [Nagios-users] SIGSEGV when trying to use eventhandler
>
>  Hi guys,
>     I am new to nagios but so far it's working well for me and is
> monitoring a number of real and virtual hosts. Nagios 3.0.6 is installed on
> an OpenSolaris 2009.06 host and monitoring routers other devices and VM's in
> VirtualBox.
>
> My issue is when I try to add an event handler, I get a SIGSEGV and nagios
> restarts.
>
>
> I have posted the details of the code I am using and the error here...
> http://pastebin.com/vBb7xTND and also below (but it reads better @
> pastebin).
>
> I have tried several different scripts and code combinations (even empty
> scripts and commands like ls) and all give the same error.
>
> Can anyone help me work out why it's happening?
>
> Thanks.
>
> hosts.cfg
> 
> define host{
> use windows-server ; Inherit default values from a template
> host_name Server6 ; The name we're giving to this host
> max_check_attempts 4
> event_handler vboxmanage-restart ; Restart the vm
> alias Server 6 - Win2008 Server ; A longer name associated with the host
> address 192.168.0.6 ; IP address of the host
> }
> 
>
> commands.cfg - note I have tried various scripts here incl. ones from the
> nagios guides/books and all give the same error.
> 
> # 'vboxmanage_restart' command definition
> define command{
> command_name vboxmanage-restart
> # command_line ls
> command_line sudo -u nas $USER1$/eventhandler/event_vboxmanage_restart -S
> $SERVICESTATE$ -T $SERVICESTATETYPE$ -A $SERVICEATTEMPT$ -H Server6
> }
> 
>
> nagios.log
> [1274193005] HOST ALERT: Server6;DOWN;SOFT;1;PING CRITICAL - Packet loss =
> 100%
> [1274193005] Caught SIGSEGV, shutting down...
> [1274193005] Nagios 3.0.6 starting... (PID=5231)
> [1274193005] Local time is Wed May 19 00:30:05 EST 2010
> [1274193005] LOG VERSION: 2.0
> [1274193005] Finished daemonizing... (New PID=5232)
>
> the scripts... (yes I know it should not be 777's but just to show it's not
> a permissions thing)
> -rwxrwxrwx 1 nagios nagios 1580 2010-05-18 00:52 event_vboxmanage_restart
> -rwxrwxrwx 1 nagios nagios 3815 2010-05-18 23:07 filename.out
> -rwxrwxrwx 1 nagios nagios 2211 2010-05-19 00:23 restart-httpd
> nas at nas:/usr/nagios/libexec/eventhandler#
>
> The script work fine from the user nagios using sudo (added nagios to
> /etc/sudoers)
> nas at nas:?sr/nagios/libexec/eventhandler$ whoami
> nagios
> nas at nas:?sr/nagios/libexec/eventhandler$ sudo -u nas
> ./event_vboxmanage_restart -S CRITICAL -T HARD -A 1 -H Server6
> CRITICAL(C) 2005-2010 Sun Microsystems, Inc.
>
> The event_vboxmanage_restart script...no that this is likely to be at fault
> (I do not think anyway as I get the error with other very simple scripts
> too).
> #!/usr/bin/perl
>
> use Getopt::Long;
> use Net::Telnet ();
> use Switch;
> my ($state,$type,$attempt,$cmd,$hostname);
> open(MYOUTFILE, ">>/usr/nagios/libexec/eventhandler/filename.out");
>
> &processargs;
> print "$state";
> switch ($state) {
> case "OK" { &state_OK }
> case "WARNING" { &state_WARNING }
> case "UNKNOWN" { &state_UNKNOWN }
> case "CRITICAL" { &state_CRITICAL }
> else { print "unrecognised state>$state" }
> }
> print MYOUTFILE">$state<";
> print MYOUTFILE">$hostname<";
> close(MYOUTFILE);
> exit 0;
>
> sub processargs {
>
> GetOptions (
> "S|state=s" => \$state,
> "T|type=s" => \$type,
> "A|attempt=i" => \$attempt,
> "H|hostname=s" => \$hostname,
> "C|command=s" => \$cmd,
> );
> }
>
> ### FUNC: print $state
> sub print_state {
> }
> ### FUNC: print $state
> sub state_OK {
> }
> ### FUNC: print $state
> sub state_WARNING {
> }
> ### FUNC: print $state
> sub state_UNKNOWN {
> }
> ### FUNC: print $state
> sub state_CRITICAL {
> if ("$type" eq "HARD" or ("$type" eq "SOFT" and $attempt == 3))
> {@result=`VBoxManage controlvm $hostname acpipowerbutton`; foreach (@result)
> {
> print MYOUTFILE"$_\n";
> };sleep(60);@result=`VBoxManage controlvm $hostname poweroff`;foreach
> (@result) {
> print MYOUTFILE"$_\n";
> }; @result=`VBoxManage startvm $hostname`; print "$result[1]";
> }
> else { }
> }
>
>  As you can see from the below, it all works fine (ie. no SIGSEGV's) if I
> comment out the eventhandler line from the hosts.cfg file.
> [05-19-2010 01:33:50] SERVICE ALERT:
> Server6;Explorer;OK;HARD;1;Explorer.EXE: Running
> [05-19-2010 01:32:50] SERVICE ALERT: Server6;Uptime;OK;HARD;1;System Uptime
> - 0 day(s) 0 hour(s) 9 minute(s)
> [05-19-2010 01:32:40] SERVICE ALERT: Server6;C:\ Drive Space;OK;HARD;1;c:\
> - total: 39.90 Gb - used: 9.19 Gb (23%) - free 30.71 Gb (77%)
> [05-19-2010 01:32:10] SERVICE ALERT: Server6;CPU Load;OK;HARD;1;CPU Load 3%
> (5 min average)
> [05-19-2010 01:25:00] HOST ALERT: Server6;UP;SOFT;4;PING OK - Packet loss =
> 0%, RTA = 0.44 ms
> [05-19-2010 01:23:50] SERVICE ALERT:
> Server6;Explorer;CRITICAL;HARD;1;Connection refused
> [05-19-2010 01:23:50] HOST ALERT: Server6;DOWN;SOFT;3;PING CRITICAL -
> Packet loss = 100%
> [05-19-2010 01:23:00] SERVICE ALERT:
> Server6;Uptime;CRITICAL;HARD;1;CRITICAL - Socket timeout after 10 seconds
> [05-19-2010 01:22:50] SERVICE ALERT: Server6;C:\ Drive
> Space;CRITICAL;HARD;1;CRITICAL - Socket timeout after 10 seconds
> [05-19-2010 01:22:30] HOST ALERT: Server6;DOWN;SOFT;2;PING CRITICAL -
> Packet loss = 100%
> [05-19-2010 01:22:20] SERVICE ALERT: Server6;CPU
> Load;CRITICAL;HARD;1;CRITICAL - Socket timeout after 10 seconds
> [05-19-2010 01:21:10] HOST ALERT: Server6;DOWN;SOFT;1;PING CRITICAL -
> Packet loss = 100%
> [05-19-2010 01:21:00] SERVICE ALERT:
> Server6;Uptime;CRITICAL;SOFT;1;CRITICAL - Socket timeout after 10 seconds
> [05-19-2010 01:20:50] SERVICE ALERT: Server6;C:\ Drive
> Space;CRITICAL;SOFT;1;CRITICAL - Socket timeout after 10 seconds
> [05-19-2010 01:02:10] SERVICE ALERT: Server6;CPU Load;OK;SOFT;1;CPU Load 0%
> (5 min average)
> [05-19-2010 01:00:50] SERVICE ALERT: Server6;Uptime;OK;SOFT;1;System Uptime
> - 0 day(s) 0 hour(s) 57 minute(s)
> [05-19-2010 01:00:40] SERVICE ALERT: Server6;C:\ Drive Space;OK;SOFT;1;c:\
> - total: 39.90 Gb - used: 9.19 Gb (23%) - free 30.71 Gb (77%)
>
>
>
>
> ------------------------------------------------------------------------------
>
>
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when
> reporting any issue.
> ::: Messages without supporting info will risk being sent to /dev/null
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From ae at op5.se  Wed May 19 12:58:34 2010
From: ae at op5.se (Andreas Ericsson)
Date: Wed, 19 May 2010 12:58:34 +0200
Subject: Full Throttle Nagios
In-Reply-To: 
References: 
Message-ID: <4BF3C45A.7070001@op5.se>

There are several tweaks one can do to make Nagios more or less saturate the
CPU, but none of them are enough if the number of checks rise above a certain
level. We've noticed that we can run about 60K service-checks so long as we
don't have many state-changes in the network. That's not nearly good enough,
so it will be improved.

Work is under way to revamp the Nagios check handling though. It will get
better, but it's a series of large patches that need to be thoroughly tested
before they get released.

I'll holler when I have them in a testable state, since I guess a lot of
people would be quite interested in trying out the new checking engine I'm
working on. If it works as good as it seems to, it should alleviate a lot
of performance issues.

-- 
Andreas Ericsson                   andreas.ericsson at op5.se
OP5 AB                             www.op5.se
Tel: +46 8-230225                  Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From perldork at webwizarddesign.com  Wed May 19 13:29:15 2010
From: perldork at webwizarddesign.com (Max)
Date: Wed, 19 May 2010 07:29:15 -0400
Subject: Full Throttle Nagios
In-Reply-To: <4BF3C45A.7070001@op5.se>
References: 
	<4BF3C45A.7070001@op5.se>
Message-ID: 

Really look forward to seeing this code - this is the process pool code, yes?

On 5/19/10, Andreas Ericsson  wrote:
> There are several tweaks one can do to make Nagios more or less saturate the
> CPU, but none of them are enough if the number of checks rise above a
> certain
> level. We've noticed that we can run about 60K service-checks so long as we
> don't have many state-changes in the network. That's not nearly good enough,
> so it will be improved.
>
> Work is under way to revamp the Nagios check handling though. It will get
> better, but it's a series of large patches that need to be thoroughly tested
> before they get released.
>
> I'll holler when I have them in a testable state, since I guess a lot of
> people would be quite interested in trying out the new checking engine I'm
> working on. If it works as good as it seems to, it should alleviate a lot
> of performance issues.
>
> --
> Andreas Ericsson                   andreas.ericsson at op5.se
> OP5 AB                             www.op5.se
> Tel: +46 8-230225                  Fax: +46 8-230231
>
> Considering the successes of the wars on alcohol, poverty, drugs and
> terror, I think we should give some serious thought to declaring war
> on peace.
>
> ------------------------------------------------------------------------------
>
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when reporting
> any issue.
> ::: Messages without supporting info will risk being sent to /dev/null
>

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From benny at bennyvision.com  Wed May 19 14:00:27 2010
From: benny at bennyvision.com (C. Bensend)
Date: Wed, 19 May 2010 07:00:27 -0500
Subject: trying to fix problem with excessive latency
In-Reply-To: <4BF34149.3060006@iweb.com>
References: <4BF33F14.9070605@fatooh.org> <4BF34149.3060006@iweb.com>
Message-ID: <83baa4da84b7d52e29deb88aaff6e983.squirrel@webmail.stinkweasel.net>


> Try disabling "enable_environment_macros". It helped use greatly when we
> disabled this option. Nagios was spending most of its time allocating
> memory for environment variables it never used. (in our case)
>
> Host/Service checks latency dropped from ~20 minutes to 10 seconds I
> would say.

Doesn't this kill the use of said macros in notifications, etc?  I've
not played with this setting before, so I have nothing to go on, but
I use the hell out of macros in notifications (how could you do
notifications *without* them?)...

I can't really see a way to use notifications or event handlers
effectively without the macros...  Honest question.  :)

Benny


-- 
"I can do for you is - what can not no girl!"
            -- Spam email subject, 2010-01-15



------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From ae at op5.se  Wed May 19 14:31:20 2010
From: ae at op5.se (Andreas Ericsson)
Date: Wed, 19 May 2010 14:31:20 +0200
Subject: trying to fix problem with excessive latency
In-Reply-To: <83baa4da84b7d52e29deb88aaff6e983.squirrel@webmail.stinkweasel.net>
References: <4BF33F14.9070605@fatooh.org> <4BF34149.3060006@iweb.com>
	<83baa4da84b7d52e29deb88aaff6e983.squirrel@webmail.stinkweasel.net>
Message-ID: <4BF3DA18.3010305@op5.se>

On 05/19/2010 02:00 PM, C. Bensend wrote:
> 
>> Try disabling "enable_environment_macros". It helped use greatly when we
>> disabled this option. Nagios was spending most of its time allocating
>> memory for environment variables it never used. (in our case)
>>
>> Host/Service checks latency dropped from ~20 minutes to 10 seconds I
>> would say.
> 
> Doesn't this kill the use of said macros in notifications, etc?

No. It just means you can't access the various macros as environment
variables, like so:

$NAGIOS_(macroname)

They still get swapped in at the command line, so they're still
totally useful for that particular purpose.

-- 
Andreas Ericsson                   andreas.ericsson at op5.se
OP5 AB                             www.op5.se
Tel: +46 8-230225                  Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From lists at xodus.org  Wed May 19 14:37:15 2010
From: lists at xodus.org (Marc Powell)
Date: Wed, 19 May 2010 07:37:15 -0500
Subject: trying to fix problem with excessive latency
In-Reply-To: <83baa4da84b7d52e29deb88aaff6e983.squirrel@webmail.stinkweasel.net>
References: <4BF33F14.9070605@fatooh.org> <4BF34149.3060006@iweb.com>
	<83baa4da84b7d52e29deb88aaff6e983.squirrel@webmail.stinkweasel.net>
Message-ID: 


On May 19, 2010, at 7:00 AM, C. Bensend wrote:

> Doesn't this kill the use of said macros in notifications, etc?  

Not if you're using them in the 'standard' way by passing them as $MACRONAME$ in the command_line of the notification command. This is how 99% of installations use them I would expect.

If you've got custom commands that expect them to be available as $NAGIOS_MACRONAME, then yes, those would break.

--
Marc


------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From benny at bennyvision.com  Wed May 19 14:40:33 2010
From: benny at bennyvision.com (C. Bensend)
Date: Wed, 19 May 2010 07:40:33 -0500
Subject: trying to fix problem with excessive latency
In-Reply-To: <4BF3DA18.3010305@op5.se>
References: <4BF33F14.9070605@fatooh.org> <4BF34149.3060006@iweb.com>
	<83baa4da84b7d52e29deb88aaff6e983.squirrel@webmail.stinkweasel.net>
	<4BF3DA18.3010305@op5.se>
Message-ID: <5c31d94cdbbc5cee3b0ea164ee9b10e1.squirrel@webmail.stinkweasel.net>


>> Doesn't this kill the use of said macros in notifications, etc?
>
> No. It just means you can't access the various macros as environment
> variables, like so:
>
> $NAGIOS_(macroname)
>
> They still get swapped in at the command line, so they're still
> totally useful for that particular purpose.

Ah!  Thank you for the clarification, Andreas.  As I mentioned, I
have no experience with this setting and I'm fighting some
performance demons of my own.  This may very well help me as well.

It might be useful to be a little more verbose about this setting
in the default nagios.cfg - it makes no reference to NAGIOS_ and
doesn't make the distinction.  Maybe something like:


# ENABLE ENVIRONMENT MACROS
# This option determines whether or not Nagios will make all standard
# macros available as NAGIOS_ environment variables when host/service
# checks and system commands (event handlers, notifications, etc.) are
# executed.  Enabling this option can cause performance issues in


Thanks again!

Benny


-- 
"I can do for you is - what can not no girl!"
            -- Spam email subject, 2010-01-15



------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From vladimir.indik at gmail.com  Wed May 19 15:20:01 2010
From: vladimir.indik at gmail.com (Vladimir N. Indik)
Date: Wed, 19 May 2010 17:20:01 +0400
Subject: Escalation question
Message-ID: <201005191720.01805.vladimir.indik@gmail.com>

I want to send notification to three contacts in order.
one at example.com
wait 10 minutes
two at example.com
wait 10 minutes
three at example.com

My config:
define service {
        host_name                       hostname
        service_description             servicename
        notification_interval           600              #!
        max_check_attempts              2               #!
        flap_detection_enabled          0
        contacts                        one at example.com
}

define serviceescalation{
        host_name                       hostname
        service_description             servicename
	first_notification              2
        last_notification               2
        notification_interval           600
        contacts                        two at example.com
        }

define serviceescalation{
        host_name                       facts
        service_description             Facts test
        first_notification              3
        last_notification               3
	notification_interval           0
	contacts                        three at example.com
        }

Is it all rihtg?

I have a problem with this configuration.
When the status changes to OK, the message about it only receives the last 
contact (e.g. three at example.com). This is a bug?

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From Deborah.Martin at Kognitio.com  Wed May 19 15:01:49 2010
From: Deborah.Martin at Kognitio.com (Deborah Martin)
Date: Wed, 19 May 2010 14:01:49 +0100
Subject: Best use of check_ntp_time plugin
Message-ID: <84836290D5AD43418C40DCF0C4A54ED3015DBEF7@kogex02.pmpl.co.uk>

Folks, 

We've had a problem recently where the timestamp has been drifting hours
ahead on the VM box running nagios. (v3.2.0 with SLES 10 SP2) 

As a consequence I'm setting up the plugin check_ntp_time to ensure that the
clock on the nagios monitoring box doesn't drift too far ahead (or behind.)
without 
telling us so that we can do something about it. 

For the threshold for warning and critical, I was thinking of -w 30 and -c
60 - would this seem sensible ? 

And also, if the alert did become critical because the drift was greater
than 60 seconds what would be a sensible way of fixing this without human
intervention. 
I thought that an event handler might be the way to go here that would
restart the ntpd daemon. 

I'd be interested to know what your thoughts are / whether there is a better
way ? 

thanks,
deborah


***************************************************************************
This email and any files transmitted with it are confidential and intended solely for the use of the individual or entity to whom they are addressed. 

Any unauthorised distribution or copying is strictly prohibited. 
Whilst Kognitio Limited takes steps to prevent the transmission of viruses via e-mail, we can not guarantee that any email or attachment is free from computer viruses and you are strongly advised to undertake your own anti-virus precautions. 

Kognitio grants no warranties regarding performance, use or quality of any e-mail or attachment and undertakes no liability for loss or damage, howsoever caused. 

Kognitio Limited, a company registered in England and Wales. Registered number 0212 7833. Registered Office:  3a Waterside Park, Cookham Road, Bracknell, Berks, RG12 1RB. VAT number 864 4378 92.

Kognitio Inc, a company incorporated in Delaware, principal office 180 North Stetson, Suite 3500, Chicago, IL 60601, USA
***************************************************************************
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From ae at op5.se  Wed May 19 15:43:37 2010
From: ae at op5.se (Andreas Ericsson)
Date: Wed, 19 May 2010 15:43:37 +0200
Subject: Best use of check_ntp_time plugin
In-Reply-To: <84836290D5AD43418C40DCF0C4A54ED3015DBEF7@kogex02.pmpl.co.uk>
References: <84836290D5AD43418C40DCF0C4A54ED3015DBEF7@kogex02.pmpl.co.uk>
Message-ID: <4BF3EB09.90000@op5.se>

On 05/19/2010 03:01 PM, Deborah Martin wrote:
> Folks,
> 
> We've had a problem recently where the timestamp has been drifting
> hours ahead on the VM box running nagios. (v3.2.0 with SLES 10 SP2)
> 

Don't run any kind of scheduling engine inside a VM. It's like
begging for Mr Murphy to come kick you in the nuts.

> As a consequence I'm setting up the plugin check_ntp_time to ensure
> that the clock on the nagios monitoring box doesn't drift too far
> ahead (or behind.) without telling us so that we can do something
> about it.
> 
> For the threshold for warning and critical, I was thinking of -w 30
> and -c 60 - would this seem sensible ?
> 

5 and 10 seem far more sensible to me.

> And also, if the alert did become critical because the drift was
> greater than 60 seconds what would be a sensible way of fixing this
> without human intervention. I thought that an event handler might be
> the way to go here that would restart the ntpd daemon.
> 

You'd be better off with a cron'ed script setting the clock with
ntpdate every 5 minutes or so. That's usually not long enough time
for there to be a major clockskew. The problem with ntpd on vmware
is that the host system largely ignores the clock-tick speedup and
slowdowns that ntpd issues, so ntpd has no effect what so ever.

-- 
Andreas Ericsson                   andreas.ericsson at op5.se
OP5 AB                             www.op5.se
Tel: +46 8-230225                  Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From lomiz.mail at gmail.com  Wed May 19 16:44:08 2010
From: lomiz.mail at gmail.com (Enrico Zimol)
Date: Wed, 19 May 2010 16:44:08 +0200
Subject: NRPE with command arguments
In-Reply-To: 
References: 
	
Message-ID: 

On 19 May 2010 11:13, Ludse Verhoeven  wrote:

> Not sure if this works like it is supposed to (haven't tested it myself),
> but qua syntax this seems more correct.
>

That worked fine :)
Other question, where exactly I've to insert quotes on nrpe.cfg commands to
prevent code injection?
I think like this:
[nrpe.cfg]
[...]
command[check_disk]=/usr/local/nagios/libexec/check_disk -w *"*$ARG1$*"* -c
*"*$ARG2$*"*
[...]

Is it right?

Thank's!

-- 
Enrico "lomiz" Zimol
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From ludse.verhoeven at vopak.com  Wed May 19 17:06:57 2010
From: ludse.verhoeven at vopak.com (Ludse Verhoeven)
Date: Wed, 19 May 2010 17:06:57 +0200
Subject: NRPE with command arguments
In-Reply-To: 
References: 
Message-ID: 

I think that's fine. It all depends on what you want to pass to the 
command itself (in this case the 
/usr/locall.nagios/libexec/check_disk-plugin). If f.i. -w "$ARG1$" can use 
space-seperated values (which it can't, but just for arguments sake), then 
you have to specify (in nrpe.cfg) the (single- or) double-quotes.

Personally I tend not to use quoting in nrpe.cfg since in most cases it is 
not necessary, but there might be reasons to explicitly enclose such 
arguments in quotes.....

Met vriendelijke groet/with kind regards,
          Ludse Verhoeven
 
AIX System Engineer
T: +31 10 400 2440
F: +31 10 400 2891
E: ludse.verhoeven at vopak.com


Visit / Postal Address: 

http://www.vopak.com






Enrico Zimol  
19/05/2010 16:46
Please respond to
Nagios Users List 


To
Nagios Users List 
cc

Subject
Re: [Nagios-users] NRPE with command arguments






On 19 May 2010 11:13, Ludse Verhoeven  wrote:
Not sure if this works like it is supposed to (haven't tested it myself), 
but qua syntax this seems more correct. 
 
That worked fine :)
Other question, where exactly I've to insert quotes on nrpe.cfg commands 
to prevent code injection?
I think like this:
[nrpe.cfg]
[...]
command[check_disk]=/usr/local/nagios/libexec/check_disk -w "$ARG1$" -c "
$ARG2$"
[...]

Is it right?

Thank's!

-- 
Enrico "lomiz" Zimol
------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when 
reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From mark.frost1 at pepsico.com  Wed May 19 17:43:47 2010
From: mark.frost1 at pepsico.com (Frost, Mark {PBC})
Date: Wed, 19 May 2010 11:43:47 -0400
Subject: trying to fix problem with excessive latency
In-Reply-To: <4BF33F14.9070605@fatooh.org>
References: <4BF33F14.9070605@fatooh.org>
Message-ID: 


> -----Original Message-----
> From: Corey Hickey [mailto:bugfood-ml at fatooh.org] 
> Sent: Tuesday, May 18, 2010 9:30 PM
> To: nagios-users at lists.sourceforge.net
> Subject: [Nagios-users] trying to fix problem with excessive latency
> 
> Hello,
> 
> I have inherited maintenance of a medium-sized Nagios installation. We 
> currently have 649 hosts and 5415 services. Our setup works nicely, with 
> one exception: Nagios falls behind on host/service checks. Our usual 
> latency once Nagios has been running for a while is about 190-200 
> seconds. Our Nagios host is reasonably powerful and isn't struggling; it 
> seems that Nagios itself is limited somehow.
> 



> Active Service Execution Time:          0.020 / 120.007 / 0.847 sec
> Active Host Execution Time:             0.020 / 11.019 / 0.069 sec
> 



> I have a feeling I'm missing something.... I would appreciate any 
> suggestions.
> 
> Thanks,
> Corey

Corey,

I'm not an expert, but I'll relay some of my own experiences here.  I did
find that switching on large_installation_tweaks did indeed make a big difference
with our latencies.

We also were doing the pre-Nagios 3.2 practice of not doing active host checks.  As
the tuning guide recommends, it's actually more efficient to do active checks and then
enable the cached check results.   When we did that, we found that the host that we
were seeing latency issues on leveled out on latencies.  (It's good to graph those values,
by the way).  They were still high-ish, but the active host checks caused them
to stop increasing over time.

But additionally, we found that long running checks were also messing up latencies.
As I understand it, if Nagios schedules a check and then it takes a lot longer than Nagios
expects it to to return, that can mess up scheduling the other checks.  I see you've got
some check(s) that ran at a max of 120 seconds.  When I started seeing some latency
problems I also saw that I had a service check or two that was running for several minutes.
I tracked that down and changed the check so that it completed (or timed out, really)
more quickly returning status back to Nagios in a matter of seconds rather than minutes.
The latency plummeted after that.  In general, our policy is that most checks should
complete in under 30 seconds, preferably under 10.

In the same vein, I'm not quite sure how you could have any host checks that would take
11 seconds to execute.  Are you doing multiple pings/fpings to check that a host is up?  Typically you can get away with just a single fping rather than a series of 10 to tell
you that a host is not reachable.

Hope that helps.

Mark

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From nagios at flatto.net  Wed May 19 17:58:57 2010
From: nagios at flatto.net (Assaf Flatto)
Date: Wed, 19 May 2010 16:58:57 +0100
Subject: localhost DOWN messages,
 return code 127 is out of bounds
In-Reply-To: 
References: 
Message-ID: <4BF40AC1.7030006@flatto.net>

Michael Maxwell wrote:
> Hello,
>
> Just installed nagios 3.2.1 following the docs at
> http://nagios.sourceforge.net/docs/3_0/quickstart.html
>
> Everything starts up normally, web interface is fine, can login,
> enable/disable checks, etc...  However, every 2 hours exactly,
> I'm getting the following alert in email:
>
> From: nagios
> Subject: ** PROBLEM Host Alert: localhost is DOWN **
>
> ***** Nagios *****
>
> Notification Type: PROBLEM
> Host: localhost
> State: DOWN
> Address: 127.0.0.1
> Info: (Return code of 127 is out of bounds - plugin may be missing)
>
> Date/Time: Tue May 18 15:00:15 PDT 2010
>
> If I go to check the services, everything is green.
>
> This is coming from a fresh install with nothing special configured.
>
> I've googled until my eyes glazed over and no one seems to report
> this happening with "localhost".  I've checked for the plugins,
> checked permissions, etc... The best I can tell, it's all as it
> should be.  I?m sure I?ve missed some simple detail somewhere, but
> it?s eluding me at the moment.
>
> Any help on solving this problem would be greatly appreciated.
>
> Below, I've included logfile contents as well as every detail
> I could think of on how the system is setup.
>
> Thanks
>
> [1274215437] Nagios 3.2.1 starting... (PID=5185)[1274215437] Local 
> time is Tue May 18 13:43:57 PDT 2010
> [1274215437] LOG VERSION: 2.0
> [1274215437] Finished daemonizing... (New PID=5186)[1274215447] 
> Warning: Return code of 127 for check of host 'localhost' was out of 
> bounds. Make sure the plugin you're trying to run actually e
> xists.
> [1274215447] HOST ALERT: localhost;DOWN;SOFT;1;(Return code of 127 is 
> out of bounds - plugin may be missing)[1274215477] Warning: Return 
> code of 127 for check of service 'Current Load' on host 'localhost' 
> was out of bounds. Make sure the plugin you'r
> e trying to run actually exists.
> [1274215477] SERVICE ALERT: localhost;Current 
> Load;CRITICAL;HARD;1;(Return code of 127 is out of bounds - plugin may 
> be missing)[1274215517] Warning: Return code of 127 for check of host 
> 'localhost' was out of bounds. Make sure the plugin you're trying to 
> run actually e
> xists.
> [1274215517] HOST ALERT: localhost;DOWN;SOFT;2;(Return code of 127 is 
> out of bounds - plugin may be missing)[1274215517] Warning: Return 
> code of 127 for check of service 'Current Users' on host 'localhost' 
> was out of bounds. Make sure the plugin you'
> re trying to run actually exists.
> [1274215517] SERVICE ALERT: localhost;Current 
> Users;CRITICAL;HARD;1;(Return code of 127 is out of bounds - plugin 
> may be missing)[1274215557] Warning: Return code of 127 for check of 
> service 'HTTP' on host 'localhost' was out of bounds. Make sure the 
> plugin you're trying
>  to run actually exists.
> [1274215557] SERVICE ALERT: localhost;HTTP;CRITICAL;HARD;1;(Return 
> code of 127 is out of bounds - plugin may be missing)[1274215587] 
> Warning: Return code of 127 for check of host 'localhost' was out of 
> bounds. Make sure the plugin you're trying to run actually e
> xists.
> [1274215557] SERVICE ALERT: localhost;HTTP;CRITICAL;HARD;1;(Return 
> code of 127 is out of bounds - plugin may be missing)[1274215587] 
> Warning: Return code of 127 for check of host 'localhost' was out of 
> bounds. Make sure the plugin you're trying to run actually e
> xists.
> [1274215587] HOST ALERT: localhost;DOWN;SOFT;3;(Return code of 127 is 
> out of bounds - plugin may be missing)
> [1274215597] Warning: Return code of 127 for check of service 'PING' 
> on host 'localhost' was out of bounds. Make sure the plugin you're 
> trying to run actually exists.
> [1274215597] SERVICE ALERT: localhost;PING;CRITICAL;HARD;1;(Return 
> code of 127 is out of bounds - plugin may be missing)
> [1274215627] Warning: Return code of 127 for check of service 'Root 
> Partition' on host 'localhost' was out of bounds. Make sure the plugin 
> you're trying to run actually exists.
> [1274215627] SERVICE ALERT: localhost;Root 
> Partition;CRITICAL;HARD;1;(Return code of 127 is out of bounds - 
> plugin may be missing)
>
>
> Detailed information on the system, what's installed, etc...:
>
> CentOS 5.4
> Linux [deleted] 2.6.18-194.el5 #1 SMP Fri Apr 2 14:58:35 EDT 2010 i686 
> i686 i386 GNU/Linux
>
> Nagios 3.2.1
> Nagios Plugins 1.4.14
> Apache httpd 2.2.15
> PHP 5.3.2
>
> $ grep nagios /etc/passwd
> nagios:x:9000:9001::/home/nagios:/bin/bash
>
> $ grep nag /etc/group
> nagcmd:x:9000:apache
> nagios:x:9001:nagios
>
> Options used to build nagios:
> ./configure     --with-command-group=nagcmd \
>                 --prefix=/opt/nagios-3.2.1 \
>                 --with-nagios-user=nagios \
>                 --with-httpd-conf=/opt/httpd/conf
>
> Options used to build plugins:
> ./configure     --with-nagios-user=nagios \
>                 --with-nagios-group=nagios \
>                 --prefix=/opt/nagios-3.2.1
>
> Permissions of files in /opt/nagios-3.2.1/libexec:
> $ ls -l /opt/nagios-3.2.1/libexec/
> total 1360K
> -rwxr-xr-x 1 nagios nagios 31052 May 18 13:48 check_apt*
> -rwxr-xr-x 1 nagios nagios  2240 May 18 13:48 check_breeze*
> -rwxr-xr-x 1 nagios nagios 38078 May 18 13:48 check_by_ssh*
> lrwxrwxrwx 1 root   root       9 May 18 13:48 check_clamd -> check_tcp*
> -rwxr-xr-x 1 nagios nagios 21516 May 18 13:48 check_cluster*
> -r-sr-xr-x 1 root   nagios 34952 May 18 13:48 check_dhcp*
> -rwxr-xr-x 1 nagios nagios 32640 May 18 13:48 check_dig*
> -rwxr-xr-x 1 nagios nagios 48280 May 18 13:48 check_disk*
> -rwxr-xr-x 1 nagios nagios  8075 May 18 13:48 check_disk_smb*
> -rwxr-xr-x 1 nagios nagios 36777 May 18 13:48 check_dns*
> -rwxr-xr-x 1 nagios nagios 18705 May 18 13:48 check_dummy*
> -rwxr-xr-x 1 nagios nagios  3051 May 18 13:48 check_file_age*
> -rwxr-xr-x 1 nagios nagios  6313 May 18 13:48 check_flexlm*
> lrwxrwxrwx 1 root   root       9 May 18 13:48 check_ftp -> check_tcp*
> -rwxr-xr-x 1 nagios nagios 85634 May 18 13:48 check_http*
> -r-sr-xr-x 1 root   nagios 39620 May 18 13:48 check_icmp*
> -rwxr-xr-x 1 nagios nagios 23743 May 18 13:48 check_ide_smart*
> -rwxr-xr-x 1 nagios nagios 15132 May 18 13:48 check_ifoperstatus*
> -rwxr-xr-x 1 nagios nagios 12518 May 18 13:48 check_ifstatus*
> lrwxrwxrwx 1 root   root       9 May 18 13:48 check_imap -> check_tcp*
> -rwxr-xr-x 1 nagios nagios  7350 May 18 13:48 check_ircd*
> lrwxrwxrwx 1 root   root       9 May 18 13:48 check_jabber -> check_tcp*
> -rwxr-xr-x 1 nagios nagios 30014 May 18 13:48 check_ldap*
> lrwxrwxrwx 1 root   root      10 May 18 13:48 check_ldaps -> check_ldap*
> -rwxr-xr-x 1 nagios nagios 28714 May 18 13:48 check_load*
> -rwxr-xr-x 1 nagios nagios  6020 May 18 13:48 check_log*
> -rwxr-xr-x 1 nagios nagios 20282 May 18 13:48 check_mailq*
> -rwxr-xr-x 1 nagios nagios 25812 May 18 13:48 check_mrtg*
> -rwxr-xr-x 1 nagios nagios 24295 May 18 13:48 check_mrtgtraf*
> -rwxr-xr-x 1 nagios nagios 28548 May 18 13:48 check_nagios*
> lrwxrwxrwx 1 root   root       9 May 18 13:48 check_nntp -> check_tcp*
> lrwxrwxrwx 1 root   root       9 May 18 13:48 check_nntps -> check_tcp*
> -rwxr-xr-x 1 nagios nagios 38266 May 18 13:48 check_nt*
> -rwxr-xr-x 1 nagios nagios 35917 May 18 13:48 check_ntp*
> -rwxr-xr-x 1 nagios nagios 33889 May 18 13:48 check_ntp_peer*
> -rwxr-xr-x 1 nagios nagios 31725 May 18 13:48 check_ntp_time*
> -rwxr-xr-x 1 nagios nagios 54192 May 18 13:48 check_nwstat*
> -rwxr-xr-x 1 nagios nagios  8324 May 18 13:48 check_oracle*
> -rwxr-xr-x 1 nagios nagios 30356 May 18 13:48 check_overcr*
> -rwxr-xr-x 1 nagios nagios 41387 May 18 13:48 check_ping*
> lrwxrwxrwx 1 root   root       9 May 18 13:48 check_pop -> check_tcp*
> -rwxr-xr-x 1 nagios nagios 39647 May 18 13:48 check_procs*
> -rwxr-xr-x 1 nagios nagios 29078 May 18 13:48 check_real*
> -rwxr-xr-x 1 nagios nagios  9579 May 18 13:48 check_rpc*
> -rwxr-xr-x 1 nagios nagios  1135 May 18 13:48 check_sensors*
> lrwxrwxrwx 1 root   root       9 May 18 13:48 check_simap -> check_tcp*
> -rwxr-xr-x 1 nagios nagios 51895 May 18 13:48 check_smtp*
> lrwxrwxrwx 1 root   root       9 May 18 13:48 check_spop -> check_tcp*
> -rwxr-xr-x 1 nagios nagios 26231 May 18 13:48 check_ssh*
> lrwxrwxrwx 1 root   root       9 May 18 13:48 check_ssmtp -> check_tcp*
> -rwxr-xr-x 1 nagios nagios 30682 May 18 13:48 check_swap*
> -rwxr-xr-x 1 nagios nagios 38940 May 18 13:48 check_tcp*
> -rwxr-xr-x 1 nagios nagios 28909 May 18 13:48 check_time*
> lrwxrwxrwx 1 root   root       9 May 18 13:48 check_udp -> check_tcp*
> -rwxr-xr-x 1 nagios nagios 34400 May 18 13:48 check_ups*
> -rwxr-xr-x 1 nagios nagios 27485 May 18 13:48 check_users*
> -rwxr-xr-x 1 nagios nagios  2934 May 18 13:48 check_wave*
> -rwxr-xr-x 1 nagios nagios 28787 May 18 13:48 negate*
> -rwxr-xr-x 1 nagios nagios 27484 May 18 13:48 urlize*
> -rwxr-xr-x 1 nagios nagios  1939 May 18 13:48 utils.pm*
> -rwxr-xr-x 1 nagios nagios   862 May 18 13:48 utils.sh*
>
AFAIK centos has a security mechanism called msec that can some time 
change the permissions of files and cause this issue .

check if you have the msec is working ( usually in the cron.d or 
cron.hourly or cron.daily directory ) and if so - you may want to set 
itup so it will not modify any of the nagios plugins or directories 
permissions.

Assaf


-- 
Never,Ever Cut A Deal With a Dragon 


I am doing a Charity Bike ride On the 27 of June for the
Capital to Coast Charity. Please help by Donating
http://www.justgiving.com/Lovefilm-capital-to-coast



------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From yungwei at resolvity.com  Wed May 19 17:48:23 2010
From: yungwei at resolvity.com (Yungwei Chen)
Date: Wed, 19 May 2010 11:48:23 -0400
Subject: NRPE: Unable to read output (after upgrading sudo to
 1.7.2p1-6.el5_5)
Message-ID: <33095823FD21DF429B481B5163264B794299072AFA@VMBX102.ihostexchange.net>

Hi,

	I have been using a check_ldap_fd command on a nagios client without any problem. That command requires superuser permission.
	After upgrading sudo to 1.7.2p1-6.el5_5, the command stops working.
	
	Running the following command on my nagios server always shows the following:
	[TEST:root at serv1 ~]# /usr/lib/nagios/plugins/check_nrpe -H 192.168.12.2 -c check_ldap_fd
	NRPE: Unable to read output

	This is the command defined in /etc/nagios/nrpe.cfg on the nagios client:
	command[check_ldap_fd]=/usr/bin/sudo /usr/lib/nagios/plugins/check_ldap_fd -w 250 -c 500

	I also made the following changes to /etc/sudoers on the nagios client in order to make the command work:
	#Defaults    requiretty
	nagios ds2 = NOPASSWD:/usr/lib/nagios/plugins/check_ldap_fd

	Any ideas? Thanks.



------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From lflacayo at cps.k12.il.us  Wed May 19 18:33:44 2010
From: lflacayo at cps.k12.il.us (Lacayo, Luis F)
Date: Wed, 19 May 2010 11:33:44 -0500
Subject: localhost DOWN messages,
	return code 127 is out of bounds
In-Reply-To: <4BF40AC1.7030006@flatto.net>
References: 
	<4BF40AC1.7030006@flatto.net>
Message-ID: 

What happens when you run it from the command line as the Nagios user?
I had a similar issue that required the check to run as root. 

Luis

-----Original Message-----
From: Assaf Flatto [mailto:nagios at flatto.net] 
Sent: Wednesday, May 19, 2010 10:59 AM
To: nagios-users at lists.sourceforge.net
Subject: Re: [Nagios-users] localhost DOWN messages, return code 127 is
out of bounds

Michael Maxwell wrote:
> Hello,
>
> Just installed nagios 3.2.1 following the docs at
> http://nagios.sourceforge.net/docs/3_0/quickstart.html
>
> Everything starts up normally, web interface is fine, can login,
> enable/disable checks, etc...  However, every 2 hours exactly,
> I'm getting the following alert in email:
>
> From: nagios
> Subject: ** PROBLEM Host Alert: localhost is DOWN **
>
> ***** Nagios *****
>
> Notification Type: PROBLEM
> Host: localhost
> State: DOWN
> Address: 127.0.0.1
> Info: (Return code of 127 is out of bounds - plugin may be missing)
>
> Date/Time: Tue May 18 15:00:15 PDT 2010
>
> If I go to check the services, everything is green.
>
> This is coming from a fresh install with nothing special configured.
>
> I've googled until my eyes glazed over and no one seems to report
> this happening with "localhost".  I've checked for the plugins,
> checked permissions, etc... The best I can tell, it's all as it
> should be.  I'm sure I've missed some simple detail somewhere, but
> it's eluding me at the moment.
>
> Any help on solving this problem would be greatly appreciated.
>
> Below, I've included logfile contents as well as every detail
> I could think of on how the system is setup.
>
> Thanks
>
> [1274215437] Nagios 3.2.1 starting... (PID=5185)[1274215437] Local 
> time is Tue May 18 13:43:57 PDT 2010
> [1274215437] LOG VERSION: 2.0
> [1274215437] Finished daemonizing... (New PID=5186)[1274215447] 
> Warning: Return code of 127 for check of host 'localhost' was out of 
> bounds. Make sure the plugin you're trying to run actually e
> xists.
> [1274215447] HOST ALERT: localhost;DOWN;SOFT;1;(Return code of 127 is 
> out of bounds - plugin may be missing)[1274215477] Warning: Return 
> code of 127 for check of service 'Current Load' on host 'localhost' 
> was out of bounds. Make sure the plugin you'r
> e trying to run actually exists.
> [1274215477] SERVICE ALERT: localhost;Current 
> Load;CRITICAL;HARD;1;(Return code of 127 is out of bounds - plugin may

> be missing)[1274215517] Warning: Return code of 127 for check of host 
> 'localhost' was out of bounds. Make sure the plugin you're trying to 
> run actually e
> xists.
> [1274215517] HOST ALERT: localhost;DOWN;SOFT;2;(Return code of 127 is 
> out of bounds - plugin may be missing)[1274215517] Warning: Return 
> code of 127 for check of service 'Current Users' on host 'localhost' 
> was out of bounds. Make sure the plugin you'
> re trying to run actually exists.
> [1274215517] SERVICE ALERT: localhost;Current 
> Users;CRITICAL;HARD;1;(Return code of 127 is out of bounds - plugin 
> may be missing)[1274215557] Warning: Return code of 127 for check of 
> service 'HTTP' on host 'localhost' was out of bounds. Make sure the 
> plugin you're trying
>  to run actually exists.
> [1274215557] SERVICE ALERT: localhost;HTTP;CRITICAL;HARD;1;(Return 
> code of 127 is out of bounds - plugin may be missing)[1274215587] 
> Warning: Return code of 127 for check of host 'localhost' was out of 
> bounds. Make sure the plugin you're trying to run actually e
> xists.
> [1274215557] SERVICE ALERT: localhost;HTTP;CRITICAL;HARD;1;(Return 
> code of 127 is out of bounds - plugin may be missing)[1274215587] 
> Warning: Return code of 127 for check of host 'localhost' was out of 
> bounds. Make sure the plugin you're trying to run actually e
> xists.
> [1274215587] HOST ALERT: localhost;DOWN;SOFT;3;(Return code of 127 is 
> out of bounds - plugin may be missing)
> [1274215597] Warning: Return code of 127 for check of service 'PING' 
> on host 'localhost' was out of bounds. Make sure the plugin you're 
> trying to run actually exists.
> [1274215597] SERVICE ALERT: localhost;PING;CRITICAL;HARD;1;(Return 
> code of 127 is out of bounds - plugin may be missing)
> [1274215627] Warning: Return code of 127 for check of service 'Root 
> Partition' on host 'localhost' was out of bounds. Make sure the plugin

> you're trying to run actually exists.
> [1274215627] SERVICE ALERT: localhost;Root 
> Partition;CRITICAL;HARD;1;(Return code of 127 is out of bounds - 
> plugin may be missing)
>
>
> Detailed information on the system, what's installed, etc...:
>
> CentOS 5.4
> Linux [deleted] 2.6.18-194.el5 #1 SMP Fri Apr 2 14:58:35 EDT 2010 i686

> i686 i386 GNU/Linux
>
> Nagios 3.2.1
> Nagios Plugins 1.4.14
> Apache httpd 2.2.15
> PHP 5.3.2
>
> $ grep nagios /etc/passwd
> nagios:x:9000:9001::/home/nagios:/bin/bash
>
> $ grep nag /etc/group
> nagcmd:x:9000:apache
> nagios:x:9001:nagios
>
> Options used to build nagios:
> ./configure     --with-command-group=nagcmd \
>                 --prefix=/opt/nagios-3.2.1 \
>                 --with-nagios-user=nagios \
>                 --with-httpd-conf=/opt/httpd/conf
>
> Options used to build plugins:
> ./configure     --with-nagios-user=nagios \
>                 --with-nagios-group=nagios \
>                 --prefix=/opt/nagios-3.2.1
>
> Permissions of files in /opt/nagios-3.2.1/libexec:
> $ ls -l /opt/nagios-3.2.1/libexec/
> total 1360K
> -rwxr-xr-x 1 nagios nagios 31052 May 18 13:48 check_apt*
> -rwxr-xr-x 1 nagios nagios  2240 May 18 13:48 check_breeze*
> -rwxr-xr-x 1 nagios nagios 38078 May 18 13:48 check_by_ssh*
> lrwxrwxrwx 1 root   root       9 May 18 13:48 check_clamd ->
check_tcp*
> -rwxr-xr-x 1 nagios nagios 21516 May 18 13:48 check_cluster*
> -r-sr-xr-x 1 root   nagios 34952 May 18 13:48 check_dhcp*
> -rwxr-xr-x 1 nagios nagios 32640 May 18 13:48 check_dig*
> -rwxr-xr-x 1 nagios nagios 48280 May 18 13:48 check_disk*
> -rwxr-xr-x 1 nagios nagios  8075 May 18 13:48 check_disk_smb*
> -rwxr-xr-x 1 nagios nagios 36777 May 18 13:48 check_dns*
> -rwxr-xr-x 1 nagios nagios 18705 May 18 13:48 check_dummy*
> -rwxr-xr-x 1 nagios nagios  3051 May 18 13:48 check_file_age*
> -rwxr-xr-x 1 nagios nagios  6313 May 18 13:48 check_flexlm*
> lrwxrwxrwx 1 root   root       9 May 18 13:48 check_ftp -> check_tcp*
> -rwxr-xr-x 1 nagios nagios 85634 May 18 13:48 check_http*
> -r-sr-xr-x 1 root   nagios 39620 May 18 13:48 check_icmp*
> -rwxr-xr-x 1 nagios nagios 23743 May 18 13:48 check_ide_smart*
> -rwxr-xr-x 1 nagios nagios 15132 May 18 13:48 check_ifoperstatus*
> -rwxr-xr-x 1 nagios nagios 12518 May 18 13:48 check_ifstatus*
> lrwxrwxrwx 1 root   root       9 May 18 13:48 check_imap -> check_tcp*
> -rwxr-xr-x 1 nagios nagios  7350 May 18 13:48 check_ircd*
> lrwxrwxrwx 1 root   root       9 May 18 13:48 check_jabber ->
check_tcp*
> -rwxr-xr-x 1 nagios nagios 30014 May 18 13:48 check_ldap*
> lrwxrwxrwx 1 root   root      10 May 18 13:48 check_ldaps ->
check_ldap*
> -rwxr-xr-x 1 nagios nagios 28714 May 18 13:48 check_load*
> -rwxr-xr-x 1 nagios nagios  6020 May 18 13:48 check_log*
> -rwxr-xr-x 1 nagios nagios 20282 May 18 13:48 check_mailq*
> -rwxr-xr-x 1 nagios nagios 25812 May 18 13:48 check_mrtg*
> -rwxr-xr-x 1 nagios nagios 24295 May 18 13:48 check_mrtgtraf*
> -rwxr-xr-x 1 nagios nagios 28548 May 18 13:48 check_nagios*
> lrwxrwxrwx 1 root   root       9 May 18 13:48 check_nntp -> check_tcp*
> lrwxrwxrwx 1 root   root       9 May 18 13:48 check_nntps ->
check_tcp*
> -rwxr-xr-x 1 nagios nagios 38266 May 18 13:48 check_nt*
> -rwxr-xr-x 1 nagios nagios 35917 May 18 13:48 check_ntp*
> -rwxr-xr-x 1 nagios nagios 33889 May 18 13:48 check_ntp_peer*
> -rwxr-xr-x 1 nagios nagios 31725 May 18 13:48 check_ntp_time*
> -rwxr-xr-x 1 nagios nagios 54192 May 18 13:48 check_nwstat*
> -rwxr-xr-x 1 nagios nagios  8324 May 18 13:48 check_oracle*
> -rwxr-xr-x 1 nagios nagios 30356 May 18 13:48 check_overcr*
> -rwxr-xr-x 1 nagios nagios 41387 May 18 13:48 check_ping*
> lrwxrwxrwx 1 root   root       9 May 18 13:48 check_pop -> check_tcp*
> -rwxr-xr-x 1 nagios nagios 39647 May 18 13:48 check_procs*
> -rwxr-xr-x 1 nagios nagios 29078 May 18 13:48 check_real*
> -rwxr-xr-x 1 nagios nagios  9579 May 18 13:48 check_rpc*
> -rwxr-xr-x 1 nagios nagios  1135 May 18 13:48 check_sensors*
> lrwxrwxrwx 1 root   root       9 May 18 13:48 check_simap ->
check_tcp*
> -rwxr-xr-x 1 nagios nagios 51895 May 18 13:48 check_smtp*
> lrwxrwxrwx 1 root   root       9 May 18 13:48 check_spop -> check_tcp*
> -rwxr-xr-x 1 nagios nagios 26231 May 18 13:48 check_ssh*
> lrwxrwxrwx 1 root   root       9 May 18 13:48 check_ssmtp ->
check_tcp*
> -rwxr-xr-x 1 nagios nagios 30682 May 18 13:48 check_swap*
> -rwxr-xr-x 1 nagios nagios 38940 May 18 13:48 check_tcp*
> -rwxr-xr-x 1 nagios nagios 28909 May 18 13:48 check_time*
> lrwxrwxrwx 1 root   root       9 May 18 13:48 check_udp -> check_tcp*
> -rwxr-xr-x 1 nagios nagios 34400 May 18 13:48 check_ups*
> -rwxr-xr-x 1 nagios nagios 27485 May 18 13:48 check_users*
> -rwxr-xr-x 1 nagios nagios  2934 May 18 13:48 check_wave*
> -rwxr-xr-x 1 nagios nagios 28787 May 18 13:48 negate*
> -rwxr-xr-x 1 nagios nagios 27484 May 18 13:48 urlize*
> -rwxr-xr-x 1 nagios nagios  1939 May 18 13:48 utils.pm*
> -rwxr-xr-x 1 nagios nagios   862 May 18 13:48 utils.sh*
>
AFAIK centos has a security mechanism called msec that can some time 
change the permissions of files and cause this issue .

check if you have the msec is working ( usually in the cron.d or 
cron.hourly or cron.daily directory ) and if so - you may want to set 
itup so it will not modify any of the nagios plugins or directories 
permissions.

Assaf


-- 
Never,Ever Cut A Deal With a Dragon 


I am doing a Charity Bike ride On the 27 of June for the
Capital to Coast Charity. Please help by Donating
http://www.justgiving.com/Lovefilm-capital-to-coast



------------------------------------------------------------------------
------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when
reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From patrick.morris at hp.com  Wed May 19 18:42:38 2010
From: patrick.morris at hp.com (Morris, Patrick)
Date: Wed, 19 May 2010 09:42:38 -0700
Subject: localhost DOWN messages,
 return code 127 is out of bounds
In-Reply-To: 
References: 
Message-ID: <4BF414FE.50807@hp.com>

Michael Maxwell wrote:
> Hello,
>
> Just installed nagios 3.2.1 following the docs at
> http://nagios.sourceforge.net/docs/3_0/quickstart.html
>
> Everything starts up normally, web interface is fine, can login,
> enable/disable checks, etc...  However, every 2 hours exactly,
> I'm getting the following alert in email:
>
> From: nagios
> Subject: ** PROBLEM Host Alert: localhost is DOWN **
>
> ***** Nagios *****
>
> Notification Type: PROBLEM
> Host: localhost
> State: DOWN
> Address: 127.0.0.1
> Info: (Return code of 127 is out of bounds - plugin may be missing)

You've left out what is probably the most important information: how the 
check that is failing is defined.

Take a look at your host check definition. Chances are it's either got a 
typo in the configuration, is pointing at a plugin that's not installed, 
or is running into issues with something like SELinux that is denying 
access to the plugin.

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From mmaxwell at blackarrow.tv  Wed May 19 19:01:56 2010
From: mmaxwell at blackarrow.tv (Michael Maxwell)
Date: Wed, 19 May 2010 10:01:56 -0700
Subject: localhost DOWN messages,
 return code 127 is out of bounds
In-Reply-To: <4BF40AC1.7030006@flatto.net>
References: <4BF40AC1.7030006@flatto.net>
Message-ID: 

On 5/19/10 8:58 AM, "Assaf Flatto"  wrote:

> AFAIK centos has a security mechanism called msec that can some time
> change the permissions of files and cause this issue .
> 
> check if you have the msec is working ( usually in the cron.d or
> cron.hourly or cron.daily directory ) and if so - you may want to set
> itup so it will not modify any of the nagios plugins or directories
> permissions.
> 
> Assaf

You might be thinking of SELinux - this is disabled.  I've looked through
the crons and see nothing in there that would be modifying permissions,
especially not on anything in /opt.



------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From nagios at chadmail.com  Wed May 19 14:15:26 2010
From: nagios at chadmail.com (nagios)
Date: Wed, 19 May 2010 22:15:26 +1000
Subject: Anyone? : SIGSEGV when trying to use	eventhandler
In-Reply-To: 
References: 
	
	
Message-ID: 

Thanks for the reply Guy.
 
I installed nagios from the "contrib" repo. as per the blog here...and all 
looks like it's working fine. This is the most recent version in the repo's. 
I guess I could compile it from source, but would like to be sure that's the 
issue before I go down this path.
http://blogs.sun.com/baiken/entry/nagios_installation_guide_for_opensolaris

And this additional information....
nas at nas:/usr/nagios/libexec/eventhandler# ldd /usr/nagios/bin/nagios
        libm.so.2 =>     /lib/libm.so.2
        libpthread.so.1 =>       /lib/libpthread.so.1
        libdl.so.1 =>    /lib/libdl.so.1
        libc.so.1 =>     /lib/libc.so.1
 
Installed Nagios from contrib. repo.
Summary:                 Host/service/network monitoring program
  Size:                  12.19 MB
  Category:              None
  Installed Version:     3.0.6,5.11-0.111
  Latest Version:        3.0.6,5.11-0.111
  Packaging Date:        Tue Oct 27 16:14:19 2009
  FMRI:                  pkg:/nagios at 3.0.6,5.11-0.111:20091027T161419Z
  Repository:            contrib
re: trace...Solaris does have dtrace...and it's supposed to be pretty good, 
but I'd need to read up a heap to understand how to use it.

Any more ideas folks?
-----Original Message-----
From: Guy Waugh 
To: Nagios Users List 
Date: Wed, 19 May 2010 11:07:52 +0100
Subject: Re: [Nagios-users] Anyone? : SIGSEGV when trying to use 
eventhandler

I'm definitely no expert but...
* What does it say when you 'ldd' the nagios binary? Are all the libraries 
the binary is linked against able to be found? Are those libraries 
up-to-date?
* Where did you get nagios from? Did you compile it or is it pre-built? If 
pre-built, are there any updates?
* I don't know Solaris well enough to know how to trace your running nagios 
with a very simple configuration, but that might be the next step. strace?

On 19 May 2010 10:49, nagios  wrote:
Anybody?
 
If you need extra information, just let me know what you need to see and 
I'll upload it.
 
Thanks.
-----Original Message-----
From: "nagios" 
To: nagios-users at lists.sourceforge.net
Date: Wed, 19 May 2010 01:42:15 +1000
Subject: [Nagios-users] SIGSEGV when trying to use eventhandler

Hi guys,
    I am new to nagios but so far it's working well for me and is monitoring 
a number of real and virtual hosts. Nagios 3.0.6 is installed on an 
OpenSolaris 2009.06 host and monitoring routers other devices and VM's in 
VirtualBox.
 
My issue is when I try to add an event handler, I get a SIGSEGV and nagios 
restarts.
 
 
I have posted the details of the code I am using and the error 
here...http://pastebin.com/vBb7xTND and also below (but it reads better @ 
pastebin).
 
I have tried several different scripts and code combinations (even empty 
scripts and commands like ls) and all give the same error.
 
Can anyone help me work out why it's happening?
 
Thanks.
 
hosts.cfg

define host{
use windows-server ; Inherit default values from a template
host_name Server6 ; The name we're giving to this host
max_check_attempts 4
event_handler vboxmanage-restart ; Restart the vm
alias Server 6 - Win2008 Server ; A longer name associated with the host
address 192.168.0.6 ; IP address of the host
}


commands.cfg - note I have tried various scripts here incl. ones from the 
nagios guides/books and all give the same error.

# 'vboxmanage_restart' command definition
define command{
command_name vboxmanage-restart
# command_line ls
command_line sudo -u nas $USER1$/eventhandler/event_vboxmanage_restart -S 
$SERVICESTATE$ -T $SERVICESTATETYPE$ -A $SERVICEATTEMPT$ -H Server6
}


nagios.log
[1274193005] HOST ALERT: Server6;DOWN;SOFT;1;PING CRITICAL - Packet loss = 
100%
[1274193005] Caught SIGSEGV, shutting down...
[1274193005] Nagios 3.0.6 starting... (PID=5231)
[1274193005] Local time is Wed May 19 00:30:05 EST 2010
[1274193005] LOG VERSION: 2.0
[1274193005] Finished daemonizing... (New PID=5232)

the scripts... (yes I know it should not be 777's but just to show it's not 
a permissions thing)
-rwxrwxrwx 1 nagios nagios 1580 2010-05-18 00:52 event_vboxmanage_restart
-rwxrwxrwx 1 nagios nagios 3815 2010-05-18 23:07 filename.out
-rwxrwxrwx 1 nagios nagios 2211 2010-05-19 00:23 restart-httpd
nas at nas:/usr/nagios/libexec/eventhandler# 

The script work fine from the user nagios using sudo (added nagios to 
/etc/sudoers)
nas at nas:?sr/nagios/libexec/eventhandler$ whoami 
nagios
nas at nas:?sr/nagios/libexec/eventhandler$ sudo -u nas 
./event_vboxmanage_restart -S CRITICAL -T HARD -A 1 -H Server6 
CRITICAL(C) 2005-2010 Sun Microsystems, Inc.

The event_vboxmanage_restart script...no that this is likely to be at fault 
(I do not think anyway as I get the error with other very simple scripts 
too).
#!/usr/bin/perl

use Getopt::Long;
use Net::Telnet ();
use Switch;
my ($state,$type,$attempt,$cmd,$hostname);
open(MYOUTFILE, ">>/usr/nagios/libexec/eventhandler/filename.out");

&processargs;
print "$state"; 
switch ($state) {
case "OK" { &state_OK }
case "WARNING" { &state_WARNING }
case "UNKNOWN" { &state_UNKNOWN }
case "CRITICAL" { &state_CRITICAL }
else { print "unrecognised state>$state" }
}
print MYOUTFILE">$state<";
print MYOUTFILE">$hostname<";
close(MYOUTFILE);
exit 0;

sub processargs {

GetOptions (
"S|state=s" => \$state,
"T|type=s" => \$type,
"A|attempt=i" => \$attempt,
"H|hostname=s" => \$hostname,
"C|command=s" => \$cmd,
);
}

### FUNC: print $state
sub print_state {
}
### FUNC: print $state
sub state_OK {
}
### FUNC: print $state
sub state_WARNING {
}
### FUNC: print $state
sub state_UNKNOWN {
}
### FUNC: print $state
sub state_CRITICAL {
if ("$type" eq "HARD" or ("$type" eq "SOFT" and $attempt == 3)) 
{@result=`VBoxManage controlvm $hostname acpipowerbutton`; foreach (@result) 
{
print MYOUTFILE"$_\n";
};sleep(60);@result=`VBoxManage controlvm $hostname poweroff`;foreach 
(@result) {
print MYOUTFILE"$_\n";
}; @result=`VBoxManage startvm $hostname`; print "$result[1]";
}
else { }
}

 As you can see from the below, it all works fine (ie. no SIGSEGV's) if I 
comment out the eventhandler line from the hosts.cfg file.
[05-19-2010 01:33:50] SERVICE ALERT: 
Server6;Explorer;OK;HARD;1;Explorer.EXE: Running
[05-19-2010 01:32:50] SERVICE ALERT: Server6;Uptime;OK;HARD;1;System Uptime 
- 0 day(s) 0 hour(s) 9 minute(s)
[05-19-2010 01:32:40] SERVICE ALERT: Server6;C:\ Drive Space;OK;HARD;1;c:\ - 
total: 39.90 Gb - used: 9.19 Gb (23%) - free 30.71 Gb (77%)
[05-19-2010 01:32:10] SERVICE ALERT: Server6;CPU Load;OK;HARD;1;CPU Load 3% 
(5 min average)
[05-19-2010 01:25:00] HOST ALERT: Server6;UP;SOFT;4;PING OK - Packet loss = 
0%, RTA = 0.44 ms
[05-19-2010 01:23:50] SERVICE ALERT: 
Server6;Explorer;CRITICAL;HARD;1;Connection refused
[05-19-2010 01:23:50] HOST ALERT: Server6;DOWN;SOFT;3;PING CRITICAL - Packet 
loss = 100%
[05-19-2010 01:23:00] SERVICE ALERT: Server6;Uptime;CRITICAL;HARD;1;CRITICAL 
- Socket timeout after 10 seconds
[05-19-2010 01:22:50] SERVICE ALERT: Server6;C:\ Drive 
Space;CRITICAL;HARD;1;CRITICAL - Socket timeout after 10 seconds
[05-19-2010 01:22:30] HOST ALERT: Server6;DOWN;SOFT;2;PING CRITICAL - Packet 
loss = 100%
[05-19-2010 01:22:20] SERVICE ALERT: Server6;CPU 
Load;CRITICAL;HARD;1;CRITICAL - Socket timeout after 10 seconds
[05-19-2010 01:21:10] HOST ALERT: Server6;DOWN;SOFT;1;PING CRITICAL - Packet 
loss = 100%
[05-19-2010 01:21:00] SERVICE ALERT: Server6;Uptime;CRITICAL;SOFT;1;CRITICAL 
- Socket timeout after 10 seconds
[05-19-2010 01:20:50] SERVICE ALERT: Server6;C:\ Drive 
Space;CRITICAL;SOFT;1;CRITICAL - Socket timeout after 10 seconds
[05-19-2010 01:02:10] SERVICE ALERT: Server6;CPU Load;OK;SOFT;1;CPU Load 0% 
(5 min average)
[05-19-2010 01:00:50] SERVICE ALERT: Server6;Uptime;OK;SOFT;1;System Uptime 
- 0 day(s) 0 hour(s) 57 minute(s)
[05-19-2010 01:00:40] SERVICE ALERT: Server6;C:\ Drive Space;OK;SOFT;1;c:\ - 
total: 39.90 Gb - used: 9.19 Gb (23%) - free 30.71 Gb (77%)
 

------------------------------------------------------------------------------


_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting 
any issue.
::: Messages without supporting info will risk being sent to /dev/null

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From mmaxwell at blackarrow.tv  Wed May 19 19:08:21 2010
From: mmaxwell at blackarrow.tv (Michael Maxwell)
Date: Wed, 19 May 2010 10:08:21 -0700
Subject: localhost DOWN messages,
 return code 127 is out of bounds
In-Reply-To: <4BF414FE.50807@hp.com>
References: <4BF414FE.50807@hp.com>
Message-ID: 


On 5/19/10 9:42 AM, "Morris, Patrick"  wrote:

> You've left out what is probably the most important information: how the
> check that is failing is defined.
> 
> Take a look at your host check definition. Chances are it's either got a
> typo in the configuration, is pointing at a plugin that's not installed,
> or is running into issues with something like SELinux that is denying
> access to the plugin.

Well, I didn't exactly leave this out.  Remember I mentioned that it's a
clean install from the Quickstart document.  I've changed nothing in it
other than the admin contact in contacts.cfg.  So whatever's there is there
in the default localhost checks out of the box.

This is actually the 4th clean install of Nagios on this system.  This time,
I made it a point to change nothing anywhere other than the email address in
contacts.cfg.  The first 2, I'd already started configuring hosts for
monitoring and each time, I ran into the same problem with the "localhost
DOWN" messages each time.  I've never had this problem with earlier versions
of Nagios.

Also, SELinux is disabled, as is the local firewall.

# getenforce
Disabled
# service iptables status
Firewall is stopped.
# service ip6tables status
Firewall is stopped.

Thanks



------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From mmelin at gmail.com  Wed May 19 19:15:00 2010
From: mmelin at gmail.com (Martin Melin)
Date: Wed, 19 May 2010 19:15:00 +0200
Subject: Escalation question
In-Reply-To: <201005191720.01805.vladimir.indik@gmail.com>
References: <201005191720.01805.vladimir.indik@gmail.com>
Message-ID: 

On Wed, May 19, 2010 at 3:20 PM, Vladimir N. Indik
 wrote:
> I want to send notification to three contacts in order.
> one at example.com
> wait 10 minutes
> two at example.com
> wait 10 minutes
> three at example.com
>
> My config:
> define service {
> ? ? ? ?host_name ? ? ? ? ? ? ? ? ? ? ? hostname
> ? ? ? ?service_description ? ? ? ? ? ? servicename
> ? ? ? ?notification_interval ? ? ? ? ? 600 ? ? ? ? ? ? ?#!
> ? ? ? ?max_check_attempts ? ? ? ? ? ? ?2 ? ? ? ? ? ? ? #!
> ? ? ? ?flap_detection_enabled ? ? ? ? ?0
> ? ? ? ?contacts ? ? ? ? ? ? ? ? ? ? ? ?one at example.com
> }
>
> define serviceescalation{
> ? ? ? ?host_name ? ? ? ? ? ? ? ? ? ? ? hostname
> ? ? ? ?service_description ? ? ? ? ? ? servicename
> ? ? ? ?first_notification ? ? ? ? ? ? ?2
> ? ? ? ?last_notification ? ? ? ? ? ? ? 2
> ? ? ? ?notification_interval ? ? ? ? ? 600
> ? ? ? ?contacts ? ? ? ? ? ? ? ? ? ? ? ?two at example.com
> ? ? ? ?}
>
> define serviceescalation{
> ? ? ? ?host_name ? ? ? ? ? ? ? ? ? ? ? facts
> ? ? ? ?service_description ? ? ? ? ? ? Facts test
> ? ? ? ?first_notification ? ? ? ? ? ? ?3
> ? ? ? ?last_notification ? ? ? ? ? ? ? 3
> ? ? ? ?notification_interval ? ? ? ? ? 0
> ? ? ? ?contacts ? ? ? ? ? ? ? ? ? ? ? ?three at example.com
> ? ? ? ?}
>
> Is it all rihtg?
>
> I have a problem with this configuration.
> When the status changes to OK, the message about it only receives the last
> contact (e.g. three at example.com). This is a bug?
>

No, it is working as designed. Read up on how escalations work here:
http://nagios.sourceforge.net/docs/3_0/escalations.html

It is considered best practice to include contacts from lower
escalation ranges in higher escalations, because of this exact issue.

Quote the above doc:

"When defining notification escalations, it is important to keep in
mind that any contact groups that were members of "lower" escalations
(i.e. those with lower notification number ranges) should also be
included in "higher" escalation definitions. This should be done to
ensure that anyone who gets notified of a problem continues to get
notified as the problem is escalated. "

Best regards,
Martin Melin

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From mmaxwell at blackarrow.tv  Wed May 19 19:17:52 2010
From: mmaxwell at blackarrow.tv (Michael Maxwell)
Date: Wed, 19 May 2010 10:17:52 -0700
Subject: localhost DOWN messages,
 return code 127 is out of bounds
In-Reply-To: 
References: 
Message-ID: 

On 5/19/10 9:33 AM, "Lacayo, Luis F"  wrote:

> What happens when you run it from the command line as the Nagios user?
> I had a similar issue that required the check to run as root.

Everything works fine when running from commandline as nagios user or as
root.  It even works fine within Nagios *except* something happens every 2
hours (on odd-numbered hours at that) that causes these messages.  Any other
time, everything is fine.

Here's the Service Status details for localhost (edited slightly to fit in
here somewhat neatly).  This is from the web interface:

Current Load OK 1/4 OK - load average: 0.00, 0.00, 0.00
Current Users OK 1/4 USERS OK - 1 users currently logged in
HTTP OK 1/4 HTTP OK: HTTP/1.1 200 OK - 311 bytes in 0.001 second response
time
PING OK 1/4 PING OK - Packet loss = 0%, RTA = 0.05 ms
Root Partition OK 1/4 DISK OK - free space: / 4025 MB (62% inode=94%):
SSH OK 1/4 SSH OK - OpenSSH_4.3 (protocol 2.0)
Swap Usage OK 1/4 SWAP OK - 100% free (1023 MB out of 1023 MB)
Total Processes OK 1/4 PROCS OK: 21 processes with STATE = RSZDT



------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From kyle.bader at gmail.com  Wed May 19 19:33:17 2010
From: kyle.bader at gmail.com (Kyle Bader)
Date: Wed, 19 May 2010 10:33:17 -0700
Subject: Full Throttle Nagios
In-Reply-To: 
References: 
	<4BF3C45A.7070001@op5.se>
	
Message-ID: 

>> There are several tweaks one can do to make Nagios more or less saturate the
>> CPU, but none of them are enough if the number of checks rise above a
>> certain
>> level. We've noticed that we can run about 60K service-checks so long as we
>> don't have many state-changes in the network. That's not nearly good enough,
>> so it will be improved.
>>
>> Work is under way to revamp the Nagios check handling though. It will get
>> better, but it's a series of large patches that need to be thoroughly tested
>> before they get released.
>>
>> I'll holler when I have them in a testable state, since I guess a lot of
>> people would be quite interested in trying out the new checking engine I'm
>> working on. If it works as good as it seems to, it should alleviate a lot
>> of performance issues.

Sounds very promising, can't wait to see your work :D

-- 

Kyle

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From rick.mangus+nagios at gmail.com  Wed May 19 19:51:28 2010
From: rick.mangus+nagios at gmail.com (Rick Mangus)
Date: Wed, 19 May 2010 12:51:28 -0500
Subject: localhost DOWN messages,
	return code 127 is out of 	bounds
In-Reply-To: 
References: 
	
Message-ID: 

Michael:

What check are you using for the host check?

The line we're looking for is under your host definition and looks like so:

check_command              

Then, copy/paste the command definition for that command.

Patrick asked the vital question, and you blew him off.  The
Quickstart page doesn't say what is in the default config, and most of
us aren't going to go dig up the sample config from a source tarball
to answer your question.

Either provide the requested information, or don't expect much help.

On Wed, May 19, 2010 at 12:17 PM, Michael Maxwell
 wrote:
> On 5/19/10 9:33 AM, "Lacayo, Luis F"  wrote:
>
>> What happens when you run it from the command line as the Nagios user?
>> I had a similar issue that required the check to run as root.
>
> Everything works fine when running from commandline as nagios user or as
> root. ?It even works fine within Nagios *except* something happens every 2
> hours (on odd-numbered hours at that) that causes these messages. ?Any other
> time, everything is fine.

Yeah, let me guess.... is your check_interval 120?  At that point, it
does a host check... what're you running from the commandline?  Is it
the same as the host check command?

>
> Here's the Service Status details for localhost (edited slightly to fit in
> here somewhat neatly). ?This is from the web interface:


These services are inconsequential.  The host is down, not its services.

--Rick

P.S.  My apologies if I seem snippy.  It's a bad day.  If you provide
the info, I promise a speedy, polite response.  :)

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From trisha at rockyou.com  Wed May 19 19:37:09 2010
From: trisha at rockyou.com (Trisha Hoang)
Date: Wed, 19 May 2010 10:37:09 -0700
Subject: localhost DOWN messages,
	return code 127 is out of 	bounds
In-Reply-To: 
References: 
	
Message-ID: 

We used nagios-3.2.1 and Centos5.2 and experienced somewhat the same problem
as Michael described, but I don't see the same problem after moving to
nagios-3.2.0.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From mmaxwell at blackarrow.tv  Wed May 19 20:09:24 2010
From: mmaxwell at blackarrow.tv (Michael Maxwell)
Date: Wed, 19 May 2010 11:09:24 -0700
Subject: localhost DOWN messages,
 return code 127 is out of bounds
In-Reply-To: 
References: 
Message-ID: 


> P.S.  My apologies if I seem snippy.  It's a bad day.  If you provide
> the info, I promise a speedy, polite response.  :)

No thanks.  I'll use an earlier version.  I'm not exactly having a great day
either.  A little civility would get a better response from me as I've been
nothing but civil and have gone out of my way to provide copious amounts of
information.  And no, I did NOT "blow anyone off" - at least until now.
Sorry if it sounded that way.





------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From rick.mangus+nagios at gmail.com  Wed May 19 20:27:42 2010
From: rick.mangus+nagios at gmail.com (Rick Mangus)
Date: Wed, 19 May 2010 13:27:42 -0500
Subject: localhost DOWN messages,
	return code 127 is out of 	bounds
In-Reply-To: 
References:  
	
Message-ID: 

On Wed, May 19, 2010 at 1:09 PM, Michael Maxwell  wrote:
>
>> P.S. ?My apologies if I seem snippy. ?It's a bad day. ?If you provide
>> the info, I promise a speedy, polite response. ?:)
>
> No thanks. ?I'll use an earlier version. ?I'm not exactly having a great day
> either. ?A little civility would get a better response from me as I've been
> nothing but civil and have gone out of my way to provide copious amounts of
> information. ?And no, I did NOT "blow anyone off" - at least until now.
> Sorry if it sounded that way.
>

Eh.  The version shouldn't matter, though an older version might come
with a different config.  As far as it goes, Patrick did ask for
information and got this in response:

>Well, I didn't exactly leave this out.  Remember I mentioned that it's a
>clean install from the Quickstart document.  I've changed nothing in it
>other than the admin contact in contacts.cfg.  So whatever's there is there
>in the default localhost checks out of the box.

I will apologize for this line:

> Either provide the requested information, or don't expect much help.

It's tone was unnecessary, but the content was not.  Your problem
could be identified much more simply if you would provide the
information requested.  I didn't intend to offend; such is the risk
with written anonymous communication.

Have a nice day!

--Rick

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From mike-nagios at 5dninja.net  Wed May 19 20:29:09 2010
From: mike-nagios at 5dninja.net (Mike Lindsey)
Date: Wed, 19 May 2010 11:29:09 -0700
Subject: Escalations - Warning to Critical, without skipping?
In-Reply-To: 
References: 	
	
Message-ID: <4BF42DF5.2030804@5dninja.net>

So, here's my situation.  I've got around 10k checks, Warnings do not 
notify, because we have historically had issues with Warning 
notifications (from the contact group setting) going out, then a service 
turning critical and the pager escalations (which only include critical) 
skipping directly to "Page everyone, and a couple managers" because we'd 
already had 3 warning-level notifications.

So, now all contacts have warning notifications disabled.  Which leads 
to missed events.

Is there any way to notify on warnings, without incrementing the 
notification count, and affecting escalations?

What I want is:  Warnings notify, and when a service turns Critical, it 
always starts at step 1 of the escalations.
That way, ops and dev can get notifications about service issues, before 
we get to the point where we need to page about it.  And when it does 
get to be paging time, nagios isn't waking up management at 4am.

I'd love to avoid having duplicate service checks, with a "warning" 
check that has warning notifications enabled, and a "critical" check 
with warning notifications disabled.

Ideal would be some manner of having split escalations, where it tracks 
the number of notifications of a specific state, and escalates based on 
that, but it looks like that requires some serious refactoring of the code.

(Running 3.0.6)
-- 
Mike Lindsey

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From mitsuto at gmail.com  Wed May 19 20:37:29 2010
From: mitsuto at gmail.com (Marcel)
Date: Wed, 19 May 2010 15:37:29 -0300
Subject: localhost DOWN messages,
	return code 127 is out of 	bounds
In-Reply-To: 
References: 
	
Message-ID: 

Perhaps if you're upgrading and haven't touched the resources.cfg, then you
could be having issues with $USER1$ defined for your previous installation.

On Wed, May 19, 2010 at 3:09 PM, Michael Maxwell wrote:

>
> > P.S.  My apologies if I seem snippy.  It's a bad day.  If you provide
> > the info, I promise a speedy, polite response.  :)
>
> No thanks.  I'll use an earlier version.  I'm not exactly having a great
> day
> either.  A little civility would get a better response from me as I've been
> nothing but civil and have gone out of my way to provide copious amounts of
> information.  And no, I did NOT "blow anyone off" - at least until now.
> Sorry if it sounded that way.
>
> 
>
>
>
>
> ------------------------------------------------------------------------------
>
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when
> reporting any issue.
> ::: Messages without supporting info will risk being sent to /dev/null
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From bugfood-ml at fatooh.org  Wed May 19 20:47:57 2010
From: bugfood-ml at fatooh.org (Corey Hickey)
Date: Wed, 19 May 2010 11:47:57 -0700
Subject: trying to fix problem with excessive latency
In-Reply-To: <4BF34149.3060006@iweb.com>
References: <4BF33F14.9070605@fatooh.org> <4BF34149.3060006@iweb.com>
Message-ID: <4BF4325D.108@fatooh.org>

On 2010-05-18 18:39, Mathieu Gagn? wrote:
> On 2010-05-18 21:29, Corey Hickey wrote:
>> Hello,
>>
>> I have inherited maintenance of a medium-sized Nagios installation. We
>> currently have 649 hosts and 5415 services. Our setup works nicely, with
>> one exception: Nagios falls behind on host/service checks. Our usual
>> latency once Nagios has been running for a while is about 190-200
>> seconds. Our Nagios host is reasonably powerful and isn't struggling; it
>> seems that Nagios itself is limited somehow.
> 
> Try disabling "enable_environment_macros". It helped use greatly when we
> disabled this option. Nagios was spending most of its time allocating
> memory for environment variables it never used. (in our case)
> 
> Host/Service checks latency dropped from ~20 minutes to 10 seconds I
> would say.

I just tried this and latency dropped from 171 seconds to 149 seconds.
Thanks, that definitely helped. I'm actually surprised that it did,
since our nagios process doesn't seem to be CPU bound (75-85% idle,
rarely with any individual core below 40% idle). It might be that I'm
drawing the wrong conclusion from top, though, since the nagios process
moves around from core to core.

Anyway, thanks again, and thanks to everyone for the advice on- and
off-list. I'm going to try more of your suggestions, and I'll let you
know how it goes.

-Corey

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From shadhin71 at gmail.com  Wed May 19 21:12:27 2010
From: shadhin71 at gmail.com (shadih rahman)
Date: Wed, 19 May 2010 15:12:27 -0400
Subject: NRPE with command arguments
In-Reply-To: 
References: 
	
Message-ID: 

try nrpe call "-n" option

On Wed, May 19, 2010 at 11:06 AM, Ludse Verhoeven  wrote:

>
> I think that's fine. It all depends on what you want to pass to the command
> itself (in this case the /usr/locall.nagios/libexec/check_disk-plugin). If
> f.i. -w "$ARG1$" can use space-seperated values (which it can't, but just
> for arguments sake), then you have to specify (in nrpe.cfg) the (single- or)
> double-quotes.
>
> Personally I tend not to use quoting in nrpe.cfg since in most cases it is
> not necessary, but there might be reasons to explicitly enclose such
> arguments in quotes.....
>
> Met vriendelijke groet/with kind regards,
>          Ludse Verhoeven
>
> AIX System Engineer
> T: +31 10 400 2440
> F: +31 10 400 2891
> E: ludse.verhoeven at vopak.com
>
>
> Visit / Postal Address:
>
> http://www.vopak.com
>
>
>
>
>
>  *Enrico Zimol *
>
> 19/05/2010 16:46
>  Please respond to
> Nagios Users List 
>
>   To
> Nagios Users List 
> cc
>   Subject
> Re: [Nagios-users] NRPE with command arguments
>
>
>
>
> On 19 May 2010 11:13, Ludse Verhoeven <*ludse.verhoeven at vopak.com*>
> wrote:
> Not sure if this works like it is supposed to (haven't tested it myself),
> but qua syntax this seems more correct.
>
> That worked fine :)
> Other question, where exactly I've to insert quotes on nrpe.cfg commands to
> prevent code injection?
> I think like this:
> [nrpe.cfg]
> [...]
> command[check_disk]=/usr/local/nagios/libexec/check_disk -w *"*$ARG1$*"*-c
> *"*$ARG2$*"*
> [...]
>
> Is it right?
>
> Thank's!
>
> --
> Enrico "lomiz" Zimol
> ------------------------------------------------------------------------------
>
>
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when
> reporting any issue.
> ::: Messages without supporting info will risk being sent to /dev/null
>
>
> ------------------------------------------------------------------------------
>
>
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when
> reporting any issue.
> ::: Messages without supporting info will risk being sent to /dev/null
>



-- 
Cordially,
Shadhin Rahman
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From bohara at gmail.com  Wed May 19 22:35:56 2010
From: bohara at gmail.com (Ben O'Hara)
Date: Wed, 19 May 2010 22:35:56 +0200
Subject: NRPE: Unable to read output (after upgrading
	sudo to 1.7.2p1-6.el5_5)
In-Reply-To: <33095823FD21DF429B481B5163264B794299072AFA@VMBX102.ihostexchange.net>
References: <33095823FD21DF429B481B5163264B794299072AFA@VMBX102.ihostexchange.net>
Message-ID: 

I found the same problem with sudo after an update.

Seems the comment line in the sudoers file causes a synax with
#comment, you need a space, so # comment works ok.

Ben

On Wed, May 19, 2010 at 5:48 PM, Yungwei Chen  wrote:
> Hi,
>
> ? ? ? ?I have been using a check_ldap_fd command on a nagios client without any problem. That command requires superuser permission.
> ? ? ? ?After upgrading sudo to 1.7.2p1-6.el5_5, the command stops working.
>
> ? ? ? ?Running the following command on my nagios server always shows the following:
> ? ? ? ?[TEST:root at serv1 ~]# /usr/lib/nagios/plugins/check_nrpe -H 192.168.12.2 -c check_ldap_fd
> ? ? ? ?NRPE: Unable to read output
>
> ? ? ? ?This is the command defined in /etc/nagios/nrpe.cfg on the nagios client:
> ? ? ? ?command[check_ldap_fd]=/usr/bin/sudo /usr/lib/nagios/plugins/check_ldap_fd -w 250 -c 500
>
> ? ? ? ?I also made the following changes to /etc/sudoers on the nagios client in order to make the command work:
> ? ? ? ?#Defaults ? ?requiretty
> ? ? ? ?nagios ds2 = NOPASSWD:/usr/lib/nagios/plugins/check_ldap_fd
>
> ? ? ? ?Any ideas? Thanks.
>
>
>
> ------------------------------------------------------------------------------
>
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
> ::: Messages without supporting info will risk being sent to /dev/null
>



-- 
"Scientology ... is not a religion."

- L. Ron Hubbard (Creator of Scientology)

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From bugfood-ml at fatooh.org  Wed May 19 22:43:47 2010
From: bugfood-ml at fatooh.org (Corey Hickey)
Date: Wed, 19 May 2010 13:43:47 -0700
Subject: trying to fix problem with excessive latency
In-Reply-To: <4BF4325D.108@fatooh.org>
References: <4BF33F14.9070605@fatooh.org> <4BF34149.3060006@iweb.com>
	<4BF4325D.108@fatooh.org>
Message-ID: <4BF44D83.6050704@fatooh.org>

On 2010-05-19 11:47, Corey Hickey wrote:
> On 2010-05-18 18:39, Mathieu Gagn? wrote:
>> On 2010-05-18 21:29, Corey Hickey wrote:
>>> Hello,
>>>
>>> I have inherited maintenance of a medium-sized Nagios installation. We
>>> currently have 649 hosts and 5415 services. Our setup works nicely, with
>>> one exception: Nagios falls behind on host/service checks. Our usual
>>> latency once Nagios has been running for a while is about 190-200
>>> seconds. Our Nagios host is reasonably powerful and isn't struggling; it
>>> seems that Nagios itself is limited somehow.
>>
>> Try disabling "enable_environment_macros". It helped use greatly when we
>> disabled this option. Nagios was spending most of its time allocating
>> memory for environment variables it never used. (in our case)
>>
>> Host/Service checks latency dropped from ~20 minutes to 10 seconds I
>> would say.
> 
> I just tried this and latency dropped from 171 seconds to 149 seconds.
> Thanks, that definitely helped. I'm actually surprised that it did,
> since our nagios process doesn't seem to be CPU bound (75-85% idle,
> rarely with any individual core below 40% idle). It might be that I'm
> drawing the wrong conclusion from top, though, since the nagios process
> moves around from core to core.
> 
> Anyway, thanks again, and thanks to everyone for the advice on- and
> off-list. I'm going to try more of your suggestions, and I'll let you
> know how it goes.

It looks like use_large_installation_tweaks did the trick. I could have
sworn I tried that before; likely I set it to 0 instead of 1 or
something foolish like that.

Anyway, I wanted to test the individual components of
large_installation_tweaks to see what helped.
* child_processes_fork_twice=0: latency drops from 149 to 18
* free_child_process_memory=0:  latency drops from 149 to 0.6
* disabling summary macros in environment variables doesn't seem to be
  individually configurable, but I expect that's disabled already by
  enable_environment_macros=0.

So, thanks again to everybody who helped. Latency is now 0.35% of what
it used to be.

-Corey

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From getling at stern.nyu.edu  Wed May 19 21:50:17 2010
From: getling at stern.nyu.edu (Greg Etling)
Date: Wed, 19 May 2010 15:50:17 -0400
Subject: check_openmanage weirdness
Message-ID: <4BF440F9.2040205@stern.nyu.edu>

I have just started implementing some check_openmanage checks on my 
servers, and have run into some odd behavior with the combination of 
Windows 2003, OM 6.2 and the SNMP check. It appears that this 
combination is having issues with the drive/controller reporting. 
Initially things worked fine under OM 5.4, until the SNMP service would 
die (other than that, Mrs. Lincoln...) - so i upgraded to OM 6.2, when I 
observed the following behaviour.

When the check is run without any blacklisting, the plugin reports that 
there is a global status WARNING, but all components are OK - the 
WARNING is coming from out of date Firmware/Driver versions as listed below:

------
Firmware/Driver Information for Controller PERC 6/i Integrated
Firmware Version    6.0.3-0002
Minimum Required Firmware Version    6.2.0-0012
Driver Version    2.14.00.32
Minimum Required Driver Version    2.23.00.32
Storport Driver Version    5.2.3790.3959
Minimum Required Storport Driver Version    5.2.3790.4173
------

Now when run in debug mode, I noticed that it had no information about 
the drives at all (note the beta version - same output as plugin v3.5.7):
------
[root at sys-mgt-1 stern]# ./check_openmanage -H testserver -C *****
    System:      PowerEdge 2950
    ServiceTag:  XXXXXXX                  OMSA version:    6.2.0
    BIOS/date:   2.3.1 04/29/2008         Plugin version:  3.5.8-beta7
-----------------------------------------------------------------------------
    Chassis Components
=============================================================================
   STATE  |  ID  |  MESSAGE TEXT
---------+------+------------------------------------------------------------
       OK |    1 | Memory module 1 [DIMM1, 2048 MB] is Ok
       OK |    2 | Memory module 2 [DIMM2, 2048 MB] is Ok
       OK |    3 | Memory module 3 [DIMM3, 2048 MB] is Ok
       OK |    4 | Memory module 4 [DIMM4, 2048 MB] is Ok
       OK |    1 | Chassis fan 1 [System Board FAN 1 RPM]: 7050
       OK |    2 | Chassis fan 2 [System Board FAN 2 RPM]: 7125
       OK |    3 | Chassis fan 3 [System Board FAN 3 RPM]: 7125
       OK |    4 | Chassis fan 4 [System Board FAN 4 RPM]: 7050
       OK |    0 | Power Supply 0 [AC]: Presence detected
       OK |    1 | Power Supply 1 [AC]: Presence detected
       OK |    0 | Temperature Probe 0 [System Board Ambient Temp] reads 
22 C (min=8/3, max=42/47)
       OK |    0 | Processor 0 [Intel Xeon E5440 2.83GHz] is Present
       OK |    1 | Processor 1 [Intel Xeon E5440 2.83GHz] is Present
       OK |    0 | Voltage sensor 0 [CPU1 VCORE] is Good
       OK |    1 | Voltage sensor 1 [CPU2 VCORE] is Good
       OK |    2 | Voltage sensor 2 [System Board CPU VTT] is Good
       OK |    3 | Voltage sensor 3 [System Board 1.5V PG] is Good
       OK |    4 | Voltage sensor 4 [System Board 1.8V PG] is Good
       OK |    5 | Voltage sensor 5 [System Board 3.3V PG] is Good
       OK |    6 | Voltage sensor 6 [System Board 5V PG] is Good
       OK |    7 | Voltage sensor 7 [Riser 1.5V PXH PG] is Good
       OK |    8 | Voltage sensor 8 [Riser 5V Riser PG] is Good
       OK |    9 | Voltage sensor 9 [System Board Backplane PG] is Good
       OK |   10 | Voltage sensor 10 [System Board Linear PG] is Good
       OK |   11 | Voltage sensor 11 [System Board 0.9V PG] is Good
       OK |   12 | Voltage sensor 12 [System Board 0.9V Over Volt] is Good
       OK |   13 | Voltage sensor 13 [System Board CPU Power Fault] is Good
       OK |    0 | Battery probe 0 [System Board CMOS Battery] is 
Presence Detected
       OK |    0 | Chassis intrusion 0 detection: Ok (Not Breached)
-----------------------------------------------------------------------------
    Other messages
=============================================================================
   STATE  |  MESSAGE TEXT
---------+-------------------------------------------------------------------
       OK | ESM log health is Ok (less than 80% full)
OOPS! Something is wrong with this server, but I don't know what. The 
global system health status is WARNING, but every component check is OK. 
This may be a bug in the Nagios plugin, please file a bug report.
------

And the Status as reported to Nagios believes that there are no disks 
whatsoever on the server:
------
OK - System: 'PowerEdge 2950', SN: 'XXXXXXX', hardware working fine, 0 
logical drives, 0 physical drives
------

This has been replicated on several identical systems.

I'm a bit stumped as to where the problem lies. Please let me know if 
you need further information from me.

Thanks in advance,
Greg
---
Greg Etling
getling at stern.nyu.edu
Systems Administrator
Stern IT Enterprise Operations
NYU Stern School of Business


------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From luc at suryo.com  Wed May 19 22:52:00 2010
From: luc at suryo.com (Luc I. Suryo)
Date: Wed, 19 May 2010 13:52:00 -0700
Subject: NRPE: Unable to read output (after upgrading
 sudo to 1.7.2p1-6.el5_5)
In-Reply-To: 
References: <33095823FD21DF429B481B5163264B794299072AFA@VMBX102.ihostexchange.net>
	
Message-ID: <20100519205200.GA5326@carrera.local>


1) do : uname -n
the output must match the host defined, in your case ds2
JUST in case put a space between NOPASSWD: and /usr/....

2) make sure nagios has a correct/auth shell (/etc/shells)

3) add the following lines in the sudo file
Cmnd_Alias      LIST            =       /usr/bin/sudo -l
ALL             ALL     =       NOPASSWD: LIST
then as root : su - nagios 
once you are nagios do a : sudo -l 
and maybe a : sudo /usr/lib/nagios/plugins/check_ldap_fd -w 250 -c 500


not much... 

-ls

Ben O'Hara 
   wrote at Wed, May 19, 2010 at 10:35:56PM +0200:

> I found the same problem with sudo after an update.
> 
> Seems the comment line in the sudoers file causes a synax with
> #comment, you need a space, so # comment works ok.
> 
> Ben
> 
> On Wed, May 19, 2010 at 5:48 PM, Yungwei Chen  wrote:
> > Hi,
> >
> > ? ? ? ?I have been using a check_ldap_fd command on a nagios client without any problem. That command requires superuser permission.
> > ? ? ? ?After upgrading sudo to 1.7.2p1-6.el5_5, the command stops working.
> >
> > ? ? ? ?Running the following command on my nagios server always shows the following:
> > ? ? ? ?[TEST:root at serv1 ~]# /usr/lib/nagios/plugins/check_nrpe -H 192.168.12.2 -c check_ldap_fd
> > ? ? ? ?NRPE: Unable to read output
> >
> > ? ? ? ?This is the command defined in /etc/nagios/nrpe.cfg on the nagios client:
> > ? ? ? ?command[check_ldap_fd]=/usr/bin/sudo /usr/lib/nagios/plugins/check_ldap_fd -w 250 -c 500
> >
> > ? ? ? ?I also made the following changes to /etc/sudoers on the nagios client in order to make the command work:
> > ? ? ? ?#Defaults ? ?requiretty
> > ? ? ? ?nagios ds2 = NOPASSWD:/usr/lib/nagios/plugins/check_ldap_fd
> >
> > ? ? ? ?Any ideas? Thanks.
> >

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From t.h.amundsen at usit.uio.no  Wed May 19 23:40:39 2010
From: t.h.amundsen at usit.uio.no (Trond Hasle Amundsen)
Date: Wed, 19 May 2010 23:40:39 +0200
Subject: check_openmanage weirdness
In-Reply-To: <4BF440F9.2040205@stern.nyu.edu> (Greg Etling's message of "Wed, 
	19 May 2010 15:50:17 -0400")
References: <4BF440F9.2040205@stern.nyu.edu>
Message-ID: <15teih77ep4.fsf@tux.uio.no>

Greg Etling  writes:

> I have just started implementing some check_openmanage checks on my 
> servers, and have run into some odd behavior with the combination of 
> Windows 2003, OM 6.2 and the SNMP check. It appears that this 
> combination is having issues with the drive/controller reporting. 
> Initially things worked fine under OM 5.4, until the SNMP service would 
> die (other than that, Mrs. Lincoln...) - so i upgraded to OM 6.2, when I 
> observed the following behaviour.
>
> When the check is run without any blacklisting, the plugin reports that 
> there is a global status WARNING, but all components are OK - the 
> WARNING is coming from out of date Firmware/Driver versions as listed below:
>
> ------
> Firmware/Driver Information for Controller PERC 6/i Integrated
> Firmware Version    6.0.3-0002
> Minimum Required Firmware Version    6.2.0-0012
> Driver Version    2.14.00.32
> Minimum Required Driver Version    2.23.00.32
> Storport Driver Version    5.2.3790.3959
> Minimum Required Storport Driver Version    5.2.3790.4173
> ------
>
> Now when run in debug mode, I noticed that it had no information about 
> the drives at all (note the beta version - same output as plugin v3.5.7):

[snip]

This is the key to this problem. There are warnings associated with the
storage subsystem, but that information is not available via SNMP for
some reason. The global status of the server inherits these warnings,
however, so the plugin reports this as some unknown error.

Does omreport report anything on storage? Try:

  omreport storage controller

If that works, try getting the same information via SNMP:

  snmpwalk -v2c -c   1.3.6.1.4.1.674.10893.1.20.130.1

Usually the problem is that the storage components of OMSA is not
installed, in which case neither command will work.

> And the Status as reported to Nagios believes that there are no disks 
> whatsoever on the server:
> ------
> OK - System: 'PowerEdge 2950', SN: 'XXXXXXX', hardware working fine, 0 
> logical drives, 0 physical drives
> ------

Yes, that is the normal behaviour when the plugin doesn't find any
storage components. The plugin can't report this as a problem, since
it's OK for a server not to have storage reported by OMSA (which only
reports on supported storage), or any storage at all for that matter
(diskless servers).

> This has been replicated on several identical systems.
>
> I'm a bit stumped as to where the problem lies. Please let me know if 
> you need further information from me.

You should check your OMSA install. The storage parts of it was probably
not installed. It may also be that there is something wrong with the
OMSA+SNMP integration, which prevents storage information from being
presented. That would be trickier to debug.

Cheers,
-- 
Trond H. Amundsen 
Center for Information Technology Services, University of Oslo

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From lists at xodus.org  Thu May 20 01:00:26 2010
From: lists at xodus.org (Marc Powell)
Date: Wed, 19 May 2010 18:00:26 -0500
Subject: localhost DOWN messages,
	return code 127 is out of bounds
In-Reply-To: 
References: 
Message-ID: <5DB41FF5-8D3B-4000-89BB-C1456DD1DAE1@xodus.org>


On May 19, 2010, at 12:08 PM, Michael Maxwell wrote:

> Well, I didn't exactly leave this out.  Remember I mentioned that it's a
> clean install from the Quickstart document.  I've changed nothing in it
> other than the admin contact in contacts.cfg.  So whatever's there is there
> in the default localhost checks out of the box.

It's not *exactly* standard because it's not installed in /usr/local/nagios. ;)  Does $USER1$ in resource.cfg point to '/opt/nagios-3.2.1/libexec'?

--
Marc


------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From ae at op5.se  Thu May 20 10:52:05 2010
From: ae at op5.se (Andreas Ericsson)
Date: Thu, 20 May 2010 10:52:05 +0200
Subject: Full Throttle Nagios
In-Reply-To: 
References: 	<4BF3C45A.7070001@op5.se>
	
Message-ID: <4BF4F835.2040606@op5.se>

On 05/19/2010 01:29 PM, Max wrote:
> Really look forward to seeing this code - this is the process pool code, yes?
> 

Yes, this is the process pool code, although additional changes has to be
made in order to solve the bottleneck problem with the named pipe for
submitting commands and passive checkresults.

Should be glorious once it's ready.

-- 
Andreas Ericsson                   andreas.ericsson at op5.se
OP5 AB                             www.op5.se
Tel: +46 8-230225                  Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From s.tesch at science-computing.de  Thu May 20 10:33:33 2010
From: s.tesch at science-computing.de (Stephan Tesch)
Date: Thu, 20 May 2010 10:33:33 +0200
Subject: Reporting for redundant hosts
Message-ID: <4BF4F3DD.8020803@science-computing.de>

Hello everyone,

I'm facing a problem for a long time now, having done some research on
this, but never been able to find a solution:

We're monitoring a lot of redundant network hosts (routers, firewalls,
etc.) where we have to report the uptime of the whole setup, not only
the uptime of a single component.

An example:

We have routerA and routerB, that form a HA setup. If one router is
taken down, for example to update its firmware, or simply crashes, the
whole setup still should have a reported uptime of 100%.

What I've already tried is to configure a cluster service. The problem
with this approach is, that I can't distinguish in the reporting whether
the host was down or unreachable, which is another crucial number to
report, also the downtimes don't get reflected very well.

I'm running Nagios 3.2.1 with NDOutils if that matters for the question.
Is there any plugin or addon that provides this functionality?

Many thanks,
Stephan

-- 
Stephan Tesch                Phone + 49 (0)7071-9457-608
science + computing ag       FAX   + 49 (0)7071-9457-511
Hagellocher Weg 73
D-72070 Tuebingen            Email: s.tesch at science-computing.de

-- 
Vorstand/Board of Management:
Dr. Bernd Finkbeiner, Dr. Roland Niemeier, 
Dr. Arno Steitz, Dr. Ingrid Zech
Vorsitzender des Aufsichtsrats/
Chairman of the Supervisory Board:
Michel Lepert
Sitz/Registered Office: Tuebingen
Registergericht/Registration Court: Stuttgart
Registernummer/Commercial Register No.: HRB 382196 



------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From paul.willis at kent.pnn.police.uk  Thu May 20 11:13:34 2010
From: paul.willis at kent.pnn.police.uk (Paul WILLIS PSE 55499)
Date: Thu, 20 May 2010 10:13:34 +0100
Subject: check_openmanage weirdness
In-Reply-To: <4BF440F9.2040205@stern.nyu.edu>
References: <4BF440F9.2040205@stern.nyu.edu>
Message-ID: <4BF50B4E020000C4000183C4@kent.pnn.police.uk>

I use check_snmp to get certain traps from Openmanage but I
suspect check_openmanage works the same way. There is a 
separate mib (dcstorag(10893).mib) for the disk storage side as
opposed to the rest of the chassis (10892.mib) so my guess
would be the problem lies there.
What does the following give you?
define command{
  command_name    dell_returnglobaldiskstatus
  command_line    $USER1$/check_snmp -H $HOSTADDRESS$ -C $ARG1$  -w 0:3 -c 7:5  -o .1.3.6.1.4.1.674.10893.1.20.110.13.0 -l 'Disk Status' -P 2c
  }
where ARG1 is your community string
 
Regards
Paul

>>> Greg Etling  19/05/10 20:50 >>>
I have just started implementing some check_openmanage checks on my 
servers, and have run into some odd behavior with the combination of 
Windows 2003, OM 6.2 and the SNMP check. It appears that this 
combination is having issues with the drive/controller reporting. 
Initially things worked fine under OM 5.4, until the SNMP service would 
die (other than that, Mrs. Lincoln...) - so i upgraded to OM 6.2, when I 
observed the following behaviour.

When the check is run without any blacklisting, the plugin reports that 
there is a global status WARNING, but all components are OK - the 
WARNING is coming from out of date Firmware/Driver versions as listed below:

------
Firmware/Driver Information for Controller PERC 6/i Integrated
Firmware Version    6.0.3-0002
Minimum Required Firmware Version    6.2.0-0012
Driver Version    2.14.00.32
Minimum Required Driver Version    2.23.00.32
Storport Driver Version    5.2.3790.3959
Minimum Required Storport Driver Version    5.2.3790.4173
------

Now when run in debug mode, I noticed that it had no information about 
the drives at all (note the beta version - same output as plugin v3.5.7):
------
[root at sys-mgt-1 stern]# ./check_openmanage -H testserver -C *****
    System:      PowerEdge 2950
    ServiceTag:  XXXXXXX                  OMSA version:    6.2.0
    BIOS/date:   2.3.1 04/29/2008         Plugin version:  3.5.8-beta7
-----------------------------------------------------------------------------
    Chassis Components
=============================================================================
   STATE  |  ID  |  MESSAGE TEXT
---------+------+------------------------------------------------------------
       OK |    1 | Memory module 1 [DIMM1, 2048 MB] is Ok
       OK |    2 | Memory module 2 [DIMM2, 2048 MB] is Ok
       OK |    3 | Memory module 3 [DIMM3, 2048 MB] is Ok
       OK |    4 | Memory module 4 [DIMM4, 2048 MB] is Ok
       OK |    1 | Chassis fan 1 [System Board FAN 1 RPM]: 7050
       OK |    2 | Chassis fan 2 [System Board FAN 2 RPM]: 7125
       OK |    3 | Chassis fan 3 [System Board FAN 3 RPM]: 7125
       OK |    4 | Chassis fan 4 [System Board FAN 4 RPM]: 7050
       OK |    0 | Power Supply 0 [AC]: Presence detected
       OK |    1 | Power Supply 1 [AC]: Presence detected
       OK |    0 | Temperature Probe 0 [System Board Ambient Temp] reads 
22 C (min=8/3, max=42/47)
       OK |    0 | Processor 0 [Intel Xeon E5440 2.83GHz] is Present
       OK |    1 | Processor 1 [Intel Xeon E5440 2.83GHz] is Present
       OK |    0 | Voltage sensor 0 [CPU1 VCORE] is Good
       OK |    1 | Voltage sensor 1 [CPU2 VCORE] is Good
       OK |    2 | Voltage sensor 2 [System Board CPU VTT] is Good
       OK |    3 | Voltage sensor 3 [System Board 1.5V PG] is Good
       OK |    4 | Voltage sensor 4 [System Board 1.8V PG] is Good
       OK |    5 | Voltage sensor 5 [System Board 3.3V PG] is Good
       OK |    6 | Voltage sensor 6 [System Board 5V PG] is Good
       OK |    7 | Voltage sensor 7 [Riser 1.5V PXH PG] is Good
       OK |    8 | Voltage sensor 8 [Riser 5V Riser PG] is Good
       OK |    9 | Voltage sensor 9 [System Board Backplane PG] is Good
       OK |   10 | Voltage sensor 10 [System Board Linear PG] is Good
       OK |   11 | Voltage sensor 11 [System Board 0.9V PG] is Good
       OK |   12 | Voltage sensor 12 [System Board 0.9V Over Volt] is Good
       OK |   13 | Voltage sensor 13 [System Board CPU Power Fault] is Good
       OK |    0 | Battery probe 0 [System Board CMOS Battery] is 
Presence Detected
       OK |    0 | Chassis intrusion 0 detection: Ok (Not Breached)
-----------------------------------------------------------------------------
    Other messages
=============================================================================
   STATE  |  MESSAGE TEXT
---------+-------------------------------------------------------------------
       OK | ESM log health is Ok (less than 80% full)
OOPS! Something is wrong with this server, but I don't know what. The 
global system health status is WARNING, but every component check is OK. 
This may be a bug in the Nagios plugin, please file a bug report.
------

And the Status as reported to Nagios believes that there are no disks 
whatsoever on the server:
------
OK - System: 'PowerEdge 2950', SN: 'XXXXXXX', hardware working fine, 0 
logical drives, 0 physical drives
------

This has been replicated on several identical systems.

I'm a bit stumped as to where the problem lies. Please let me know if 
you need further information from me.

Thanks in advance,
Greg
---
Greg Etling
getling at stern.nyu.edu
Systems Administrator
Stern IT Enterprise Operations
NYU Stern School of Business


------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From nagios at flatto.net  Thu May 20 11:32:21 2010
From: nagios at flatto.net (Assaf Flatto)
Date: Thu, 20 May 2010 10:32:21 +0100
Subject: Reporting for redundant hosts
In-Reply-To: <4BF4F3DD.8020803@science-computing.de>
References: <4BF4F3DD.8020803@science-computing.de>
Message-ID: <4BF501A5.1070107@flatto.net>

Stephan Tesch wrote:
> Hello everyone,
>
> I'm facing a problem for a long time now, having done some research on
> this, but never been able to find a solution:
>
> We're monitoring a lot of redundant network hosts (routers, firewalls,
> etc.) where we have to report the uptime of the whole setup, not only
> the uptime of a single component.
>
> An example:
>
> We have routerA and routerB, that form a HA setup. If one router is
> taken down, for example to update its firmware, or simply crashes, the
> whole setup still should have a reported uptime of 100%.
>
> What I've already tried is to configure a cluster service. The problem
> with this approach is, that I can't distinguish in the reporting whether
> the host was down or unreachable, which is another crucial number to
> report, also the downtimes don't get reflected very well.
>
> I'm running Nagios 3.2.1 with NDOutils if that matters for the question.
> Is there any plugin or addon that provides this functionality?
>
> Many thanks,
> Stephan
>
>   
One approach is to monitor a VIP for the service (i.e. the "clustered" 
ip of the external facing IP address of the routers) for  finding if the 
service is up , and the individual devices by their direct IP  for each 
device  activity .

This will give you both the business view ( if the VIP is down then the 
business is impacted) , and the operational view ( one device impacted ).


-- 
Never,Ever Cut A Deal With a Dragon 


I am doing a Charity Bike ride On the 27 of June for the
Capital to Coast Charity. Please help by Donating
http://www.justgiving.com/Lovefilm-capital-to-coast



------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From getling at stern.nyu.edu  Thu May 20 16:53:11 2010
From: getling at stern.nyu.edu (Greg Etling)
Date: Thu, 20 May 2010 10:53:11 -0400
Subject: check_openmanage weirdness
In-Reply-To: 
References: 
Message-ID: <4BF54CD7.8030905@stern.nyu.edu>

Trond, thanks for your quick reply. Unfortunately it does appear we have 
a disconnect between OMSA and SNMP:

 >Does omreport report anything on storage? Try:
 >
 >  omreport storage controller
 >
C:\Documents and Settings\Administrator>omreport storage controller
  Controller  PERC 6/i Integrated (Embedded)

Controllers
ID                                            : 0
Status                                        : Non-Critical
Name                                          : PERC 6/i Integrated
Slot ID                                       : Embedded
State                                         : Degraded
Firmware Version                              : 6.0.3-0002
Minimum Required Firmware Version             : 6.2.0-0012
Driver Version                                : 2.14.00.32
Minimum Required Driver Version               : 2.23.00.32
Storport Driver Version                       : 5.2.3790.3959
Minimum Required Storport Driver Version      : 5.2.3790.4173
Number of Connectors                          : 2
Rebuild Rate                                  : 30%
BGI Rate                                      : 30%
Check Consistency Rate                        : 30%
Reconstruct Rate                              : 30%
Alarm State                                   : Not Applicable
Cluster Mode                                  : Not Applicable
SCSI Initiator ID                             : Not Applicable
Cache Memory Size                             : 256 MB
Patrol Read Mode                              : Auto
Patrol Read State                             : Stopped
Patrol Read Rate                              : 30%
Patrol Read Iterations                        : 82
Abort check consistency on error              : Not Applicable
Allow Revertible Hot Spare and Replace Member : Not Applicable
Auto replace member on predictive failure     : Not Applicable
Load balance                                  : Not Applicable
Security Capable                              : Not Applicable
Security Key Present                          : Not Applicable
Redundant Path view                           : Not Applicable

 >If that works, try getting the same information via SNMP:
 >
 >  snmpwalk -v2c -c   
 >1.3.6.1.4.1.674.10893.1.20.130.1
 >

[root at nagios ~]# snmpwalk -v2c -c ***** testserver 
1.3.6.1.4.1.674.10893.1.20.130.1
SNMPv2-SMI::enterprises.674.10893.1.20.130.1 = No Such Object available 
on this agent at this OID

It appears to only have data under the 1.3.6.1.4.1.674.10892 and 
1.3.6.1.4.1.674.10899 trees. Thoughts?

Greg

Greg Etling  writes:

 > > I have just started implementing some check_openmanage checks on my
 > > servers, and have run into some odd behavior with the combination of
 > > Windows 2003, OM 6.2 and the SNMP check. It appears that this
 > > combination is having issues with the drive/controller reporting.
 > > Initially things worked fine under OM 5.4, until the SNMP service 
would
 > > die (other than that, Mrs. Lincoln...) - so i upgraded to OM 6.2, 
when I
 > > observed the following behaviour.
 > >
 > > When the check is run without any blacklisting, the plugin reports 
that
 > > there is a global status WARNING, but all components are OK - the
 > > WARNING is coming from out of date Firmware/Driver versions as 
listed below:
 > >
 > > ------
 > > Firmware/Driver Information for Controller PERC 6/i Integrated
 > > Firmware Version    6.0.3-0002
 > > Minimum Required Firmware Version    6.2.0-0012
 > > Driver Version    2.14.00.32
 > > Minimum Required Driver Version    2.23.00.32
 > > Storport Driver Version    5.2.3790.3959
 > > Minimum Required Storport Driver Version    5.2.3790.4173
 > > ------
 > >
 > > Now when run in debug mode, I noticed that it had no information about
 > > the drives at all (note the beta version - same output as plugin 
v3.5.7):

[snip]

This is the key to this problem. There are warnings associated with the
storage subsystem, but that information is not available via SNMP for
some reason. The global status of the server inherits these warnings,
however, so the plugin reports this as some unknown error.

Does omreport report anything on storage? Try:

   omreport storage controller

If that works, try getting the same information via SNMP:

   snmpwalk -v2c -c   1.3.6.1.4.1.674.10893.1.20.130.1

Usually the problem is that the storage components of OMSA is not
installed, in which case neither command will work.

 > > And the Status as reported to Nagios believes that there are no disks
 > > whatsoever on the server:
 > > ------
 > > OK - System: 'PowerEdge 2950', SN: 'XXXXXXX', hardware working fine, 0
 > > logical drives, 0 physical drives
 > > ------

Yes, that is the normal behaviour when the plugin doesn't find any
storage components. The plugin can't report this as a problem, since
it's OK for a server not to have storage reported by OMSA (which only
reports on supported storage), or any storage at all for that matter
(diskless servers).

 > > This has been replicated on several identical systems.
 > >
 > > I'm a bit stumped as to where the problem lies. Please let me know if
 > > you need further information from me.

You should check your OMSA install. The storage parts of it was probably
not installed. It may also be that there is something wrong with the
OMSA+SNMP integration, which prevents storage information from being
presented. That would be trickier to debug.

Cheers,
-- Trond H. Amundsen  Center for Information 
Technology Services, University of Oslo ------------------------------

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From mirde at oppy.com  Thu May 20 17:13:23 2010
From: mirde at oppy.com (Mirza Dedic)
Date: Thu, 20 May 2010 08:13:23 -0700
Subject: aix 5.3 plugin to monitor for long running processes
	?
Message-ID: 

Is there such a plugin to monitor aix topas utility (topas -P) and report back processes that have a long running "TIME" value? Better yet, TIME combined with high CPU value?

The output of topas -P on a AIX box looks like this:

USER        PID    PPID PRI NI   RES   RES SPACE    TIME CPU%  I/O  OTH COMMAND
mirde   4079860 3244406 120 20  1668  1865  1668  206:26 25.6    0    0 _progres
dougr   4079860 3244406 120 20  1668  1865  1668    6:26 20.6    0    0 _progres
prosvra 3412254 4071460 114 20  1453  1561  2033   16:09 18.7    0    0 _proapsv
braro   1654968       1  66 20  1388  1865  1793   57:25  1.5    0    0 _progres
batch   2613370       1  69 22  1689  1865  1879    7:05  1.2    2 1220 _progres
soual   1826826  286914  61 20  1806  1865  1806    0:16  0.8    0 3376 _progres
braro   1601808       1  61 20  1957  1865  2410   82:17  0.7    0    6 _progres
allbr   1274340 3399852  61 20  2061  1865  2061    0:01  0.5    0 2312 _progres
kelca   3407948 1413528  62 20  2207  1865  2207    0:08  0.5   42  936 _progres
allbr   2543698 1274340  62 20   208    61   208    0:00  0.4    0 3320 ksh
mikko   1032604 2073082  60 20  2522  1865  2522    0:15  0.4    2 1255 _progres
larha   3809412 2060626  61 20  2159  1865  2159    0:02  0.4   17  115 _progres
kirpa   1605690 3838314  60 20  2558  1865  2558    0:32  0.4    2   13 _progres
braro   2732514       1  60 20  1632  1865  2149   13:05  0.3    0    0 _progres
braro   1909064       1  60 20  1134  1865  1760    6:20  0.3    0    0 _progres
root    2719758  745934  58 41  2431   128  4619   24:25  0.3    0    1 topas
judre    549372 1503500  61 20  1798  1865  1798    0:00  0.2    1  198 _progres
prosvr  1814718       1  82 20  6374    26 15995    7:21  0.2    1    1 java
dorod   2904242 2187728  60 20  2495  1865  2495    6:53  0.2    1  930 _progres
linso   3760208 4395326  60 20  1509  1865  1509    0:00  0.1    0    2 _progres
katpa   3494134 4280722  60 20  2307  1865  2307    0:03  0.1    0    0 _progre

What I want to capture is, something with 120+ (for TIME) and 20%+ (CPU).

Have Nagios issue an alert of such an occurance.

My Nagios box already monitors our AIX box remotely, using check_by_ssh to scan for errors in certain log files (using Consol's plugins); I would like to expand on this with the above.

Any help is appreciated.

Thanks.


The Oppenheimer Group ---- CONFIDENTIAL

This message is for the designated recipient only and may contain privileged, proprietary, or otherwise private information. If you have received it in error, please notify the sender immediately and delete the original. Any other use of the email by you is prohibited.

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From patrick.morris at hp.com  Thu May 20 18:15:24 2010
From: patrick.morris at hp.com (Morris, Patrick)
Date: Thu, 20 May 2010 09:15:24 -0700
Subject: aix 5.3 plugin to monitor for long running
 processes ?
In-Reply-To: 
References: 
Message-ID: <4BF5601C.8090503@hp.com>

Mirza Dedic wrote:
> Is there such a plugin to monitor aix topas utility (topas -P) and 
> report back processes that have a long running "TIME" value? Better 
> yet, TIME combined with high CPU value?
>  
> The output of topas -P on a AIX box looks like this:
>  
> _USER        PID    PPID PRI NI   RES   RES SPACE    TIME CPU%  I/O  
> OTH COMMAND_
> *mirde**   **4079860** **3244406** 120 20  1668  1865  1668  
> **20**6:26 2**5**.6**    0    0 **_progres*
> dougr   4079860 3244406 120 20  1668  1865  1668    6:26 20.6    0    
> 0 _progres
> prosvra 3412254 4071460 114 20  1453  1561  2033   16:09 18.7    0    
> 0 _proapsv
> braro   1654968       1  66 20  1388  1865  1793   57:25  1.5    0    
> 0 _progres
> batch   2613370       1  69 22  1689  1865  1879    7:05  1.2    2 
> 1220 _progres
> soual   1826826  286914  61 20  1806  1865  1806    0:16  0.8    0 
> 3376 _progres
> braro   1601808       1  61 20  1957  1865  2410   82:17  0.7    0    
> 6 _progres
> allbr   1274340 3399852  61 20  2061  1865  2061    0:01  0.5    0 
> 2312 _progres
> kelca   3407948 1413528  62 20  2207  1865  2207    0:08  0.5   42  
> 936 _progres
> allbr   2543698 1274340  62 20   208    61   208    0:00  0.4    0 
> 3320 ksh
> mikko   1032604 2073082  60 20  2522  1865  2522    0:15  0.4    2 
> 1255 _progres
> larha   3809412 2060626  61 20  2159  1865  2159    0:02  0.4   17  
> 115 _progres
> kirpa   1605690 3838314  60 20  2558  1865  2558    0:32  0.4    2   
> 13 _progres
> braro   2732514       1  60 20  1632  1865  2149   13:05  0.3    0    
> 0 _progres
> braro   1909064       1  60 20  1134  1865  1760    6:20  0.3    0    
> 0 _progres
> root    2719758  745934  58 41  2431   128  4619   24:25  0.3    0    
> 1 topas
> judre    549372 1503500  61 20  1798  1865  1798    0:00  0.2    1  
> 198 _progres
> prosvr  1814718       1  82 20  6374    26 15995    7:21  0.2    1    
> 1 java
> dorod   2904242 2187728  60 20  2495  1865  2495    6:53  0.2    1  
> 930 _progres
> linso   3760208 4395326  60 20  1509  1865  1509    0:00  0.1    0    
> 2 _progres
> katpa   3494134 4280722  60 20  2307  1865  2307    0:03  0.1    0    
> 0 _progre
>  
> What I want to capture is, something with 120+ (for TIME) and 20%+ (CPU).
>  
If you've got even basic scripting skills, Nagios plugins are incredibly 
easy to write.  Documentation is here:

http://nagiosplug.sourceforge.net/developer-guidelines.html

However, the most basic plugins only need to worry about a couple 
things: They should have one line of output, and return an exit code of 
0 for OK, 1 for warning, 2 for critical, and 3 for unknown.

Given that, all you'd need to do is write a simple script that runs that 
command and parses it looking for times over a certain value, and 
returns a 1 or a 2 if something crosses your thresholds.

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From s.tesch at science-computing.de  Thu May 20 18:28:47 2010
From: s.tesch at science-computing.de (Stephan Tesch)
Date: Thu, 20 May 2010 18:28:47 +0200
Subject: Reporting for redundant hosts
In-Reply-To: <4BF501A5.1070107@flatto.net>
References: <4BF4F3DD.8020803@science-computing.de>
	<4BF501A5.1070107@flatto.net>
Message-ID: <4BF5633F.9090209@science-computing.de>

Am 20.05.2010 11:32, schrieb Assaf Flatto:

Hello Assaf,

> One approach is to monitor a VIP for the service (i.e. the "clustered"
> ip of the external facing IP address of the routers) for  finding if the 
> service is up , and the individual devices by their direct IP  for each 
> device  activity .
>
> This will give you both the business view ( if the VIP is down then the 
> business is impacted) , and the operational view ( one device impacted ).
>   
This is for sure a good idea. On the other hand this would mean that we
have to monitor another "host" and also plan for downtimes of this host,
too. An approach where I just define the redundant hosts and get the
report that I need would in my opinion be the better one. The data is
all there, it just has to be evaluated. I really can't believe, that I'm
the only one with this problem?

Best regards,
Stephan

-- 
Stephan Tesch                Phone + 49 (0)7071-9457-608
science + computing ag       FAX   + 49 (0)7071-9457-511
Hagellocher Weg 73
D-72070 Tuebingen            Email: s.tesch at science-computing.de

-- 
Vorstand/Board of Management:
Dr. Bernd Finkbeiner, Dr. Roland Niemeier, 
Dr. Arno Steitz, Dr. Ingrid Zech
Vorsitzender des Aufsichtsrats/
Chairman of the Supervisory Board:
Michel Lepert
Sitz/Registered Office: Tuebingen
Registergericht/Registration Court: Stuttgart
Registernummer/Commercial Register No.: HRB 382196 



------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From patrick.morris at hp.com  Thu May 20 18:41:28 2010
From: patrick.morris at hp.com (Morris, Patrick)
Date: Thu, 20 May 2010 09:41:28 -0700
Subject: Reporting for redundant hosts
In-Reply-To: <4BF5633F.9090209@science-computing.de>
References: <4BF4F3DD.8020803@science-computing.de>	<4BF501A5.1070107@flatto.net>
	<4BF5633F.9090209@science-computing.de>
Message-ID: <4BF56638.5060207@hp.com>

Stephan Tesch wrote:
> Am 20.05.2010 11:32, schrieb Assaf Flatto:
>
> Hello Assaf,
>
>   
>> One approach is to monitor a VIP for the service (i.e. the "clustered"
>> ip of the external facing IP address of the routers) for  finding if the 
>> service is up , and the individual devices by their direct IP  for each 
>> device  activity .
>>
>> This will give you both the business view ( if the VIP is down then the 
>> business is impacted) , and the operational view ( one device impacted ).
>>   
>>     
> This is for sure a good idea. On the other hand this would mean that we
> have to monitor another "host" and also plan for downtimes of this host,
> too. An approach where I just define the redundant hosts and get the
> report that I need would in my opinion be the better one. The data is
> all there, it just has to be evaluated. I really can't believe, that I'm
> the only one with this problem?

I don't know about your case, but what I do here if there are multiple 
devices which support a single address is put all of them in Nagios as 
Assaf mentioned. I'll then set the "real" hosts as the parents of the 
virtual one, and if things are set up that way you *don't* need to worry 
about setting downtimes for the virtual address, because it will 
automatically be marked unreachable if all of the real addresses are down.

In my case, though, I primarily want to know if the VIP is working, and 
am not quite as concerned with the individual hosts serving it (though 
those are important as well, just not as important as if the service 
they provide is available). Based on your statement that you'd "have to 
monitor another host," it sounds like knowing the state of this service 
isn't really as important to you as knowing if the individual hosts that 
serve it are working.  Frankly, that seems a bit backwards to me, but I 
don't really know your environment.

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From mmelin at gmail.com  Thu May 20 23:25:49 2010
From: mmelin at gmail.com (Martin Melin)
Date: Thu, 20 May 2010 23:25:49 +0200
Subject: Reporting for redundant hosts
In-Reply-To: <4BF5633F.9090209@science-computing.de>
References: <4BF4F3DD.8020803@science-computing.de>
	<4BF501A5.1070107@flatto.net> <4BF5633F.9090209@science-computing.de>
Message-ID: 

On Thu, May 20, 2010 at 6:28 PM, Stephan Tesch
 wrote:
> Am 20.05.2010 11:32, schrieb Assaf Flatto:
>
> Hello Assaf,
>
>> One approach is to monitor a VIP for the service (i.e. the "clustered"
>> ip of the external facing IP address of the routers) for ?finding if the
>> service is up , and the individual devices by their direct IP ?for each
>> device ?activity .
>>
>> This will give you both the business view ( if the VIP is down then the
>> business is impacted) , and the operational view ( one device impacted ).
>>
> This is for sure a good idea. On the other hand this would mean that we
> have to monitor another "host" and also plan for downtimes of this host,
> too. An approach where I just define the redundant hosts and get the
> report that I need would in my opinion be the better one. The data is
> all there, it just has to be evaluated. I really can't believe, that I'm
> the only one with this problem?

I'm not sure if there is an easy way to do just this for vanilla
Nagios, but op5:s Ninja project includes SLA reports with this exact
functionality: http://www.op5.org/community/projects/ninja

Best regards,
Martin Melin

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From ae at op5.se  Fri May 21 10:26:55 2010
From: ae at op5.se (Andreas Ericsson)
Date: Fri, 21 May 2010 10:26:55 +0200
Subject: Reporting for redundant hosts
In-Reply-To: <4BF5633F.9090209@science-computing.de>
References: <4BF4F3DD.8020803@science-computing.de>	<4BF501A5.1070107@flatto.net>
	<4BF5633F.9090209@science-computing.de>
Message-ID: <4BF643CF.7010004@op5.se>

On 05/20/2010 06:28 PM, Stephan Tesch wrote:
> Am 20.05.2010 11:32, schrieb Assaf Flatto:
> 
> Hello Assaf,
> 
>> One approach is to monitor a VIP for the service (i.e. the "clustered"
>> ip of the external facing IP address of the routers) for  finding if the
>> service is up , and the individual devices by their direct IP  for each
>> device  activity .
>>
>> This will give you both the business view ( if the VIP is down then the
>> business is impacted) , and the operational view ( one device impacted ).
>>
> This is for sure a good idea. On the other hand this would mean that we
> have to monitor another "host" and also plan for downtimes of this host,
> too. An approach where I just define the redundant hosts and get the
> report that I need would in my opinion be the better one. The data is
> all there, it just has to be evaluated. I really can't believe, that I'm
> the only one with this problem?
> 

You're not. Ninja has cluster-mode capabilities in its reporting tool.
Check it out if you're interested. Adding such a feature to the cgi's
would be more hassle than it's worth, and I sure as hell won't get the
time for it from work since we already have it in our own reporting
solution.

-- 
Andreas Ericsson                   andreas.ericsson at op5.se
OP5 AB                             www.op5.se
Tel: +46 8-230225                  Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From t.h.amundsen at usit.uio.no  Fri May 21 11:17:07 2010
From: t.h.amundsen at usit.uio.no (Trond Hasle Amundsen)
Date: Fri, 21 May 2010 11:17:07 +0200
Subject: check_openmanage weirdness
In-Reply-To: <4BF54CD7.8030905@stern.nyu.edu> (Greg Etling's message of "Thu, 
	20 May 2010 10:53:11 -0400")
References: 
	<4BF54CD7.8030905@stern.nyu.edu>
Message-ID: <15ty6fd4nsc.fsf@tux.uio.no>

Greg Etling  writes:

> Trond, thanks for your quick reply. Unfortunately it does appear we have 
> a disconnect between OMSA and SNMP:

[snip]

> [root at nagios ~]# snmpwalk -v2c -c ***** testserver 
> 1.3.6.1.4.1.674.10893.1.20.130.1
> SNMPv2-SMI::enterprises.674.10893.1.20.130.1 = No Such Object available 
> on this agent at this OID

Hmm.. you should see output like:

$ snmpwalk -v2c -c community hostname 1.3.6.1.4.1.674.10893.1.20.130.1
SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.1.1 = INTEGER: 1
SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.2.1 = STRING: "PERC 6/i Integrated"
SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.3.1 = STRING: "DELL"
SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.4.1 = INTEGER: 6
SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.5.1 = INTEGER: 1
SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.7.1 = INTEGER: 30
SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.8.1 = STRING: "6.2.0-0013"
[...]

> It appears to only have data under the 1.3.6.1.4.1.674.10892 and 
> 1.3.6.1.4.1.674.10899 trees. Thoughts?

Unfortunately my Windows knowledge is rather limited. I have never
installed OMSA on Windows, but I suspect that there are options to
choose from during the install. The first thing I would do is to
re-install OMSA step by step and try to figure out what I might have
missed. On Linux, the install procedure and packaging of the OMSA
components changed with version 6.2.0. This may very well be the case
with the Windows version as well.

Cheers,
-- 
Trond H. Amundsen 
Center for Information Technology Services, University of Oslo

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From Steve.Rennie at Hess.com  Fri May 21 10:58:20 2010
From: Steve.Rennie at Hess.com (Rennie, Steve)
Date: Fri, 21 May 2010 09:58:20 +0100
Subject: A nrpe (check_oracle) command returns garbage
Message-ID: <1335C03DA8D9EF4B8BA9ADCA15A47F93019673ED@ABZSSEX002.ihess.com>

Hi List

I have a problem where a check_nrpe returns garbage on nagios server. 

The command is a "check_oracle" defined in nrpe.cfg 

- command[check_oracle]=/usr/local/nagios/libexec/check_oracle --db l919

on the local Solaris host it works normally but run from nagios server
it returns garbage. Other check_nrpe commands (ie check_users etc) run
normally on server and return correct output. 

The server is running RHEL5.3 and client is Solaris 9. I am using nrpe
1.8 on client and server. Tried later vers but this seemed to
incompatible with the Solaris client. I have swapped the
libexec/check_oracle script but this looks correct and it does work when
run locally.

I have enabled debug on nrpe.cfg but nothing useful produced in log.

Any ideas much appreciated.

Thanks

Steve Rennie 
Hess Limited
The Adelphi Building
1-11 John Adam Street
London WC2N 6AG
Tel: +44 (0)20 7331 3280
Mob: +44 (0)7887 958415


This e-mail and any attachments are for the sole use of the intended recipient(s) and may contain information that is confidential.  If you are not the intended recipient(s) and have received this e-mail in error, please immediately notify the sender by return e-mail and delete this e-mail from your computer. Any distribution, disclosure or the taking of any other action by anyone other than the intended recipient(s) is strictly prohibited.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From Matsushita.Nobuo at tchden.org  Fri May 21 15:27:53 2010
From: Matsushita.Nobuo at tchden.org (Matsushita, Nobuo)
Date: Fri, 21 May 2010 07:27:53 -0600
Subject: Only 1 hostgroup access?
Message-ID: 

Hi,

Is it possible to give a user access to only one hostgroup?

Someone told me that  " if authentication is enabled is to only show hosts and services to people who are contacts on those hosts and services".     Would  somebody please  give me an example of this setting?

When I check the cgi.cfg file, I cannot figure out how this can be done.

Thanks in Advance.

Nobuo

---------------------------------------------------------------------------------------------------------------------
CONFIDENTIALITY NOTICE: This e-mail is confidential, may be legally privileged,
and for the intended recipient only. Access, disclosure, copying, forwarding and
distribution by any means is strictly prohibited. If received in error,
do not read but delete and e-mail confirmation to the sender.
==========================================================
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From benny at bennyvision.com  Fri May 21 15:53:20 2010
From: benny at bennyvision.com (C. Bensend)
Date: Fri, 21 May 2010 08:53:20 -0500
Subject: Only 1 hostgroup access?
In-Reply-To: 
References: 
Message-ID: <1be3b6391991018009ff92c6492a462a.squirrel@webmail.stinkweasel.net>


> Is it possible to give a user access to only one hostgroup?
>
> Someone told me that  " if authentication is enabled is to only show hosts
> and services to people who are contacts on those hosts and services".
> Would  somebody please  give me an example of this setting?
>
> When I check the cgi.cfg file, I cannot figure out how this can be done.
>
> Thanks in Advance.

Nobuo,

   If you enable authentication in cgi.cfg (use_authentication=1),
Nagios will automatically limit access to hosts and services that
the user is a contact for.

   So, first you enable authentication as above.

   Then, you either create a new contact group for this user, or
add them to an existing one.  Once they are in a contact group,
you add that contact group to the host or service (the contact_groups
parameter).

   Nagios will take care of all the rest for you.

Benny


-- 
"I can do for you is - what can not no girl!"
            -- Spam email subject, 2010-01-15



------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From Matsushita.Nobuo at tchden.org  Fri May 21 19:16:24 2010
From: Matsushita.Nobuo at tchden.org (Matsushita, Nobuo)
Date: Fri, 21 May 2010 11:16:24 -0600
Subject: Only 1 hostgroup access?
Message-ID: 

Thanks Benny,

I forgot about "contact_name".
If I don't specify any user in the cgi.cfg, Nagios only allows access if the user is in "contact" group or "contacts".

Thank You !

Nobuo

>>Is it possible to give a user access to only one hostgroup?
>>
>> Someone told me that  " if authentication is enabled is to only show hosts
>> and services to people who are contacts on those hosts and services".
>> Would  somebody please  give me an example of this setting?
>>
>> When I check the cgi.cfg file, I cannot figure out how this can be done.
>>
>> Thanks in Advance.

>Nobuo,

  > If you enable authentication in cgi.cfg (use_authentication=1),
>Nagios will automatically limit access to hosts and services that
>the user is a contact for.

  > So, first you enable authentication as above.

   >Then, you either create a new contact group for this user, or
>add them to an existing one.  Once they are in a contact group,
>you add that contact group to the host or service (the contact_groups
>parameter).

  > Nagios will take care of all the rest for you.

>Benny


Nobuo Matsushita
The Children's Hospital
720-777-8000

---------------------------------------------------------------------------------------------------------------------
CONFIDENTIALITY NOTICE: This e-mail is confidential, may be legally privileged,
and for the intended recipient only. Access, disclosure, copying, forwarding and
distribution by any means is strictly prohibited. If received in error,
do not read but delete and e-mail confirmation to the sender.
==========================================================
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From low at modog.com  Fri May 21 22:03:45 2010
From: low at modog.com (Seth P. Low)
Date: Fri, 21 May 2010 16:03:45 -0400
Subject: Checking FTPS or SFTP with username and password
Message-ID: <15DE1A7F91855E49A18A487C43846C8B9AD9A35330@modog2.modog.com>

Hello,
        Checked nagiosexhange and some of the forums but I am unable to locate a plugin which allows me to login with a username and password to check either an SFTP for FTPS server. Has anyone found a way to do this easily?  -Seth


------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From doepain at gmail.com  Fri May 21 22:41:56 2010
From: doepain at gmail.com (dOE)
Date: Fri, 21 May 2010 16:41:56 -0400
Subject: Nagios Escalations sending host "OK" alerts
Message-ID: 

Has anyone else seen an issue with creating escalations in Nagios that SOME
of the contacts assigned to the escalation will randomly reciece an
alert/notification that reads: Subject: ** PROBLEM Host Alert: SEVERNAME is
UP **?

I have not specified to notify on "OK"
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From patrick.morris at hp.com  Sat May 22 03:36:03 2010
From: patrick.morris at hp.com (Morris, Patrick)
Date: Fri, 21 May 2010 18:36:03 -0700
Subject: Nagios Escalations sending host "OK" alerts
In-Reply-To: 
References: 
Message-ID: <4BF73503.6000006@hp.com>

dOE wrote:
> Has anyone else seen an issue with creating escalations in Nagios that 
> SOME of the contacts assigned to the escalation will randomly reciece 
> an alert/notification that reads: Subject: ** PROBLEM Host Alert: 
> SEVERNAME is UP **? 
>
> I have not specified to notify on "OK" 
No, I've never seen a random notification from Nagios ever.  It's always 
notified as configured.

Chances are you've got one of a few things going on:

a) You *have* told Nagios to notify on recovery states, either in the 
host definition or in an escalation definition (which would be easy to 
verify if you showed how you've configured those), or

b) You've got a couple instances of Nagios running at the same time, and 
one of them is running with an old configuration that did specify to 
notify on recovery, or

c) You've got mail backed up on some box somewhere (which should be 
obvious from looking at timestamps and headers on the messages), or

d) Some combination of the above.

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From mh+nagios-users at zugschlus.de  Mon May 24 11:37:57 2010
From: mh+nagios-users at zugschlus.de (Marc Haber)
Date: Mon, 24 May 2010 11:37:57 +0200
Subject: check_multi advice to check "Internet available"
Message-ID: <20100524093757.GN23462@torres.zugschlus.de>

Hi,

I am checking a number of servers which are "on the Internet", and are
thus likely to become unreachable for my Nagios installations in case
of an outage out there on the Internet. I would like to have a "host"
which I could use as parent host for all those servers so that I don't
get buried under an avalanche of HOST DOWN alarms should my Internet
go partly or fully down.

I have investigated using check_multi do help me with that, and have
come up with a solution which can help me with this task. I would like
to solicit your comments to that solution.

I have defined a number of hosts which are checked by Nagios proper,
just as normal hosts are. Their individual notifications may be
switched off, but I haven't done so since I like to see what's going
on at least in the start phase. Their hostnames all start with
mh-internet, so I can use a single check_multi config line to pull
them all into a single check_multi check. check_multi is then used as
service and host check on a "virtual host" called internet, which goes
CRITICAL when more than one mh-internet-foo host is CRITICAL.

Here is my detailed configuration:

/etc/nagios3/conf.d/internet.cmd:
statusdat [ internet ] = /^mh-internet-.*$/:my-defaultping

state[WARNING] = COUNT(WARNING) > 1
state[CRITICAL] = COUNT(CRITICAL) > 1
state[UNKNOWN] = COUNT(UNKNOWN) > 1


Nagios configuration:
define host {
        use                     mh-host
        host_name               mh-internet
        alias                   internet
        address                 127.0.0.1
        parents                 localhost
        check_command           check_multi!-f /etc/nagios3/conf.d/internet.cmd
}

define service {
        name                            my-internet
        use                             my-service
        service_description             my-internet
        check_command                   check_multi!-f /etc/nagios3/conf.d/internet.cmd
        host_name                       mh-internet
}

define command {
        command_name                    check_multi
        command_line                    /usr/lib/nagios/plugins/check_multi $ARG1$ $ARG2$ $ARG3$ $ARG4$
}

define host {
        use                     mh-host
        host_name               mh-internet-torres
        hostgroups              mh, ping
        alias                   torres.zugschlus.de
        address                 85.214.68.41
        parents                 mh-internet
}

define host {
        use                     mh-host
        host_name               mh-internet-heise
        hostgroups              mh, ping
        alias                   www.heise.de
        address                 www.heise.de
        parents                 mh-internet
}

define host {
        use                     mh-host
        host_name               mh-internet-google-de
        hostgroups              mh, ping
        alias                   www.google.de
        address                 www.google.de
        parents                 mh-internet
}

I am wondering whether this is a valid approach, or whether there
are better solutions for this challenge. If somebody familiar with
check_multi configuration, I would like to know whether there are any
possible issues hiding in my use of check_multi. Any comments will be
appreciated.

Greetings
Marc

-- 
-----------------------------------------------------------------------------
Marc Haber         | "I don't trust Computers. They | Mailadresse im Header
Mannheim, Germany  |  lose things."    Winona Ryder | Fon: *49 621 72739834
Nordisch by Nature |  How to make an American Quilt | Fax: *49 3221 2323190

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From standalone.sysadmin at gmail.com  Mon May 24 21:37:13 2010
From: standalone.sysadmin at gmail.com (Matt Simmons)
Date: Mon, 24 May 2010 15:37:13 -0400
Subject: check_multi advice to check "Internet available"
In-Reply-To: <20100524093757.GN23462@torres.zugschlus.de>
References: <20100524093757.GN23462@torres.zugschlus.de>
Message-ID: 

Have you considered adding your upstream router as a host, and making
it the parent of your remote hosts?

Also, remember to take off the 'u' notification flag of the uplink's
children, otherwise you'll still be buried under a pile of
notifications.

--Matt

On Mon, May 24, 2010 at 5:37 AM, Marc Haber
 wrote:
> Hi,
>
> I am checking a number of servers which are "on the Internet", and are
> thus likely to become unreachable for my Nagios installations in case
> of an outage out there on the Internet. I would like to have a "host"
> which I could use as parent host for all those servers so that I don't
> get buried under an avalanche of HOST DOWN alarms should my Internet
> go partly or fully down.
>
> I have investigated using check_multi do help me with that, and have
> come up with a solution which can help me with this task. I would like
> to solicit your comments to that solution.
>
> I have defined a number of hosts which are checked by Nagios proper,
> just as normal hosts are. Their individual notifications may be
> switched off, but I haven't done so since I like to see what's going
> on at least in the start phase. Their hostnames all start with
> mh-internet, so I can use a single check_multi config line to pull
> them all into a single check_multi check. check_multi is then used as
> service and host check on a "virtual host" called internet, which goes
> CRITICAL when more than one mh-internet-foo host is CRITICAL.
>
> Here is my detailed configuration:
>
> /etc/nagios3/conf.d/internet.cmd:
> statusdat [ internet ] = /^mh-internet-.*$/:my-defaultping
>
> state[WARNING] = COUNT(WARNING) > 1
> state[CRITICAL] = COUNT(CRITICAL) > 1
> state[UNKNOWN] = COUNT(UNKNOWN) > 1
>
>
> Nagios configuration:
> define host {
> ? ? ? ?use ? ? ? ? ? ? ? ? ? ? mh-host
> ? ? ? ?host_name ? ? ? ? ? ? ? mh-internet
> ? ? ? ?alias ? ? ? ? ? ? ? ? ? internet
> ? ? ? ?address ? ? ? ? ? ? ? ? 127.0.0.1
> ? ? ? ?parents ? ? ? ? ? ? ? ? localhost
> ? ? ? ?check_command ? ? ? ? ? check_multi!-f /etc/nagios3/conf.d/internet.cmd
> }
>
> define service {
> ? ? ? ?name ? ? ? ? ? ? ? ? ? ? ? ? ? ?my-internet
> ? ? ? ?use ? ? ? ? ? ? ? ? ? ? ? ? ? ? my-service
> ? ? ? ?service_description ? ? ? ? ? ? my-internet
> ? ? ? ?check_command ? ? ? ? ? ? ? ? ? check_multi!-f /etc/nagios3/conf.d/internet.cmd
> ? ? ? ?host_name ? ? ? ? ? ? ? ? ? ? ? mh-internet
> }
>
> define command {
> ? ? ? ?command_name ? ? ? ? ? ? ? ? ? ?check_multi
> ? ? ? ?command_line ? ? ? ? ? ? ? ? ? ?/usr/lib/nagios/plugins/check_multi $ARG1$ $ARG2$ $ARG3$ $ARG4$
> }
>
> define host {
> ? ? ? ?use ? ? ? ? ? ? ? ? ? ? mh-host
> ? ? ? ?host_name ? ? ? ? ? ? ? mh-internet-torres
> ? ? ? ?hostgroups ? ? ? ? ? ? ?mh, ping
> ? ? ? ?alias ? ? ? ? ? ? ? ? ? torres.zugschlus.de
> ? ? ? ?address ? ? ? ? ? ? ? ? 85.214.68.41
> ? ? ? ?parents ? ? ? ? ? ? ? ? mh-internet
> }
>
> define host {
> ? ? ? ?use ? ? ? ? ? ? ? ? ? ? mh-host
> ? ? ? ?host_name ? ? ? ? ? ? ? mh-internet-heise
> ? ? ? ?hostgroups ? ? ? ? ? ? ?mh, ping
> ? ? ? ?alias ? ? ? ? ? ? ? ? ? www.heise.de
> ? ? ? ?address ? ? ? ? ? ? ? ? www.heise.de
> ? ? ? ?parents ? ? ? ? ? ? ? ? mh-internet
> }
>
> define host {
> ? ? ? ?use ? ? ? ? ? ? ? ? ? ? mh-host
> ? ? ? ?host_name ? ? ? ? ? ? ? mh-internet-google-de
> ? ? ? ?hostgroups ? ? ? ? ? ? ?mh, ping
> ? ? ? ?alias ? ? ? ? ? ? ? ? ? www.google.de
> ? ? ? ?address ? ? ? ? ? ? ? ? www.google.de
> ? ? ? ?parents ? ? ? ? ? ? ? ? mh-internet
> }
>
> I am wondering whether this is a valid approach, or whether there
> are better solutions for this challenge. If somebody familiar with
> check_multi configuration, I would like to know whether there are any
> possible issues hiding in my use of check_multi. Any comments will be
> appreciated.
>
> Greetings
> Marc
>
> --
> -----------------------------------------------------------------------------
> Marc Haber ? ? ? ? | "I don't trust Computers. They | Mailadresse im Header
> Mannheim, Germany ?| ?lose things." ? ?Winona Ryder | Fon: *49 621 72739834
> Nordisch by Nature | ?How to make an American Quilt | Fax: *49 3221 2323190
>
> ------------------------------------------------------------------------------
>
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
> ::: Messages without supporting info will risk being sent to /dev/null
>



-- 
LITTLE GIRL: But which cookie will you eat FIRST?
COOKIE MONSTER: Me think you have misconception of cookie-eating process.

COOKIE MONSTER: Boy, I wish I were a sysadmin so I could go to the
NJ-PICC Sysadmin Conference! http://www.picconf.org

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From mh+nagios-users at zugschlus.de  Mon May 24 23:51:37 2010
From: mh+nagios-users at zugschlus.de (Marc Haber)
Date: Mon, 24 May 2010 23:51:37 +0200
Subject: check_multi advice to check "Internet available"
In-Reply-To: 
References: <20100524093757.GN23462@torres.zugschlus.de>
	
Message-ID: <20100524215137.GB26813@torres.zugschlus.de>

On Mon, May 24, 2010 at 03:37:13PM -0400, Matt Simmons wrote:
> Have you considered adding your upstream router as a host, and making
> it the parent of your remote hosts?

Yes, but that's a workaround which will misfire in the case where it's
the own ISP that's faulty at its uplik.

> Also, remember to take off the 'u' notification flag of the uplink's
> children, otherwise you'll still be buried under a pile of
> notifications.

Sure.

Greetings
Marc

-- 
-----------------------------------------------------------------------------
Marc Haber         | "I don't trust Computers. They | Mailadresse im Header
Mannheim, Germany  |  lose things."    Winona Ryder | Fon: *49 621 72739834
Nordisch by Nature |  How to make an American Quilt | Fax: *49 3221 2323190

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From yinyang at eburg.com  Tue May 25 01:58:21 2010
From: yinyang at eburg.com (Gordon Messmer)
Date: Mon, 24 May 2010 16:58:21 -0700
Subject: check_multi advice to check "Internet available"
In-Reply-To: <20100524215137.GB26813@torres.zugschlus.de>
References: <20100524093757.GN23462@torres.zugschlus.de>	
	<20100524215137.GB26813@torres.zugschlus.de>
Message-ID: <4BFB129D.8080505@eburg.com>

On 05/24/2010 02:51 PM, Marc Haber wrote:
> On Mon, May 24, 2010 at 03:37:13PM -0400, Matt Simmons wrote:
>> Have you considered adding your upstream router as a host, and making
>> it the parent of your remote hosts?
>
> Yes, but that's a workaround which will misfire in the case where it's
> the own ISP that's faulty at its uplik.

If that's something you're concerned about, your check_multi will also 
"misfire" when the problem is at the uplink for the machines that you're 
monitoring.

Monitoring a gateway and making it the parent is in no way a "workaround".

An ideal installation for monitoring WAN-connected hosts would monitor 
your default gateway and make that the parent of all hosts.  You'd also 
monitor the router at each remote site and make that the parent of all 
hosts at that site.

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From mh+nagios-users at zugschlus.de  Tue May 25 08:02:13 2010
From: mh+nagios-users at zugschlus.de (Marc Haber)
Date: Tue, 25 May 2010 08:02:13 +0200
Subject: check_multi advice to check "Internet available"
In-Reply-To: <4BFB129D.8080505@eburg.com>
References: <20100524093757.GN23462@torres.zugschlus.de>
	
	<20100524215137.GB26813@torres.zugschlus.de>
	<4BFB129D.8080505@eburg.com>
Message-ID: <20100525060213.GC26813@torres.zugschlus.de>

Hi,

On Mon, May 24, 2010 at 04:58:21PM -0700, Gordon Messmer wrote:
> On 05/24/2010 02:51 PM, Marc Haber wrote:
> > On Mon, May 24, 2010 at 03:37:13PM -0400, Matt Simmons wrote:
> >> Have you considered adding your upstream router as a host, and making
> >> it the parent of your remote hosts?
> >
> > Yes, but that's a workaround which will misfire in the case where it's
> > the own ISP that's faulty at its uplik.
> 
> If that's something you're concerned about, your check_multi will also 
> "misfire" when the problem is at the uplink for the machines that you're 
> monitoring.
> 
> Monitoring a gateway and making it the parent is in no way a "workaround".
> 
> An ideal installation for monitoring WAN-connected hosts would monitor 
> your default gateway and make that the parent of all hosts.  You'd also 
> monitor the router at each remote site and make that the parent of all 
> hosts at that site.

I know all this. My first Nagios experiences date back to when it was
still called Netsaint. Of course, the default gateway, the CPE and the
COE of the "local" ISP are monitored. But that's all that can be
comfortably monitored since the Internet uses dynamic routing and the
paths are subject to change without further notice. Furthermore, a lot
of housing providers configure their backbone and access routers to
not respond to pings, so it's a challenge to monitor these sites.

http://blog.zugschlus.de/archives/770-Nagios,-Parent-Hosts,-and-traceroute-on-the-Internet.html
has my thoughts on that matter (thankfully written in English). Be
aware that this article is more than a year old, so things have
changed since then.

Greetings
Marc

-- 
-----------------------------------------------------------------------------
Marc Haber         | "I don't trust Computers. They | Mailadresse im Header
Mannheim, Germany  |  lose things."    Winona Ryder | Fon: *49 621 72739834
Nordisch by Nature |  How to make an American Quilt | Fax: *49 3221 2323190

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From matgand at gmail.com  Tue May 25 15:08:13 2010
From: matgand at gmail.com (Mattia Gandolfi)
Date: Tue, 25 May 2010 15:08:13 +0200
Subject: LDAP authentication and CGI authorization problem
Message-ID: 

Hi all,

I'm facing problems while trying to enable LDAP authentication on a Nagios
3.2.1 install (using htpasswd.users everything works fine).
This is how I've configured Apache:


    AuthType Basic
    AuthName "Nagios - Ldap"
    AuthBasicProvider ldap
    AuthLDAPUrl
ldaps://unixautmi-ese01.sky.local:636,unixautca-ese01.sky.local:636/ou=people,dc=sky,dc=local?uid
    AuthLDAPBindDN "cn=authuser,dc=sky,dc=local"
    AuthLDAPBindPassword oaj5Phum
    Require ldap-dn uid=gandolfim,ou=people,dc=sky,dc=local
    Require ldap-user gandolfim
    AuthLDAPGroupAttributeIsDN off
    Require ldap-group cn=systemadminmi,ou=groups,dc=sky,dc=local
    Require ldap-group cn=infosec,ou=groups,dc=sky,dc=local
    AuthLDAPGroupAttribute memberUid


    AuthType Basic
    AuthName "Nagios - Ldap - CGI"
    AuthBasicProvider ldap
    AuthLDAPUrl
ldaps://unixautmi-ese01.sky.local:636,unixautca-ese01.sky.local:636/ou=people,dc=sky,dc=local?uid
    AuthLDAPBindDN "cn=authuser,dc=sky,dc=local"
    AuthLDAPBindPassword oaj5Phum
    Require ldap-dn uid=gandolfim,ou=people,dc=sky,dc=local
    Require ldap-user gandolfim
    AuthLDAPGroupAttributeIsDN off
    Require ldap-group cn=systemadminmi,ou=groups,dc=sky,dc=local
    Require ldap-group cn=infosec,ou=groups,dc=sky,dc=local
    AuthLDAPGroupAttribute memberUid


I've defined my username as a contact

define contact {
        use             email-contact
        contact_name    gandolfim
        alias           Mattia Gandolfi
        email           mattia.gandolfi at xxxxxxx.com
        pager           none
}

and I've set the following options in cgi.cfg

use_authentication=1
use_ssl_authentication=0
authorized_for_system_information=gandolfim
authorized_for_configuration_information=gandolfim
authorized_for_system_commands=gandolfim
authorized_for_all_services=gandolfim
authorized_for_all_hosts=gandolfim
authorized_for_all_service_commands=gandolfim

Authentication works fine, and I see "Logged in as *gandolfim"* on top of
the Tactical Monitoring Overview page.
However, as soon as I try to access the cgi, for example to disable
notifications for a service, I get "Sorry, but you are not authorized to
commit the specified command."

What am I missing?

Thanks

Mattia
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From getling at stern.nyu.edu  Tue May 25 15:50:48 2010
From: getling at stern.nyu.edu (Greg Etling)
Date: Tue, 25 May 2010 09:50:48 -0400
Subject: check_openmanage weirdness
In-Reply-To: <15ty6fd4nsc.fsf@tux.uio.no>
References: 	<4BF54CD7.8030905@stern.nyu.edu>
	<15ty6fd4nsc.fsf@tux.uio.no>
Message-ID: <4BFBD5B8.7020602@stern.nyu.edu>

OK, this looks like a bug in the upgrade process. I ran a repair on the 
installation of the clients, and the OID is now reporting correctly.

Thanks for your help,

Greg

On 05/21/2010 05:17 AM, Trond Hasle Amundsen wrote:
> Greg Etling  writes:
>
>    
>> Trond, thanks for your quick reply. Unfortunately it does appear we have
>> a disconnect between OMSA and SNMP:
>>      
> [snip]
>
>    
>> [root at nagios ~]# snmpwalk -v2c -c ***** testserver
>> 1.3.6.1.4.1.674.10893.1.20.130.1
>> SNMPv2-SMI::enterprises.674.10893.1.20.130.1 = No Such Object available
>> on this agent at this OID
>>      
> Hmm.. you should see output like:
>
> $ snmpwalk -v2c -c community hostname 1.3.6.1.4.1.674.10893.1.20.130.1
> SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.1.1 = INTEGER: 1
> SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.2.1 = STRING: "PERC 6/i Integrated"
> SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.3.1 = STRING: "DELL"
> SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.4.1 = INTEGER: 6
> SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.5.1 = INTEGER: 1
> SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.7.1 = INTEGER: 30
> SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.8.1 = STRING: "6.2.0-0013"
> [...]
>
>    
>> It appears to only have data under the 1.3.6.1.4.1.674.10892 and
>> 1.3.6.1.4.1.674.10899 trees. Thoughts?
>>      
> Unfortunately my Windows knowledge is rather limited. I have never
> installed OMSA on Windows, but I suspect that there are options to
> choose from during the install. The first thing I would do is to
> re-install OMSA step by step and try to figure out what I might have
> missed. On Linux, the install procedure and packaging of the OMSA
> components changed with version 6.2.0. This may very well be the case
> with the Windows version as well.
>
> Cheers,
>    
-------------- next part --------------
A non-text attachment was scrubbed...
Name: getling.vcf
Type: text/x-vcard
Size: 215 bytes
Desc: not available
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From shadhin71 at gmail.com  Tue May 25 17:35:09 2010
From: shadhin71 at gmail.com (shadih rahman)
Date: Tue, 25 May 2010 11:35:09 -0400
Subject: nrpe and critical state
Message-ID: 

I am doing  a check which counts the number of process running on the remote
host.  Lets say for whatever reason nrpe could not establish connection
during the test.  In this situation, should it not go to UNKNOWN state.
However I see its going to critical state.  Please advise on this.  Thanks

[user]>./check_nrpe -H xxxxx -c check_process_x64 -a 0: 10: httpd
Connection refused by host
[user]>echo $?
2

-- 
Cordially,
Shadhin Rahman
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From Matthew.Stier at us.fujitsu.com  Tue May 25 17:42:00 2010
From: Matthew.Stier at us.fujitsu.com (Stier, Matthew)
Date: Tue, 25 May 2010 10:42:00 -0500
Subject: nrpe and critical state
In-Reply-To: 
References: 
Message-ID: <63F73C973E3E4547979026ECC295EF5C027F17F5@rchemxp01.fnc.net.local>

>From the check_nrpe usage message:  

 

Options:

 -n         = Do no use SSL

 -u         = Make socket timeouts return an UNKNOWN state instead of
CRITICAL

 

From: shadih rahman [mailto:shadhin71 at gmail.com] 
Sent: Tuesday, May 25, 2010 11:35 AM
To: nagios-users at lists.sourceforge.net
Subject: [Nagios-users] nrpe and critical state

 

I am doing  a check which counts the number of process running on the
remote host.  Lets say for whatever reason nrpe could not establish
connection during the test.  In this situation, should it not go to
UNKNOWN state.  However I see its going to critical state.  Please
advise on this.  Thanks

[user]>./check_nrpe -H xxxxx -c check_process_x64 -a 0: 10: httpd
Connection refused by host
[user]>echo $?
2

-- 
Cordially,
Shadhin Rahman

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From shadhin71 at gmail.com  Tue May 25 17:52:15 2010
From: shadhin71 at gmail.com (shadih rahman)
Date: Tue, 25 May 2010 11:52:15 -0400
Subject: nrpe and critical state
In-Reply-To: <63F73C973E3E4547979026ECC295EF5C027F17F5@rchemxp01.fnc.net.local>
References: 
	<63F73C973E3E4547979026ECC295EF5C027F17F5@rchemxp01.fnc.net.local>
Message-ID: 

 nrpe -u option does not work if nrpe is shutdown in the remote host.

On Tue, May 25, 2010 at 11:42 AM, Stier, Matthew <
Matthew.Stier at us.fujitsu.com> wrote:

>  From the check_nrpe usage message:
>
>
>
> Options:
>
>  -n         = Do no use SSL
>
>  -u         = Make socket timeouts return an UNKNOWN state instead of
> CRITICAL
>
>
>
> *From:* shadih rahman [mailto:shadhin71 at gmail.com]
> *Sent:* Tuesday, May 25, 2010 11:35 AM
> *To:* nagios-users at lists.sourceforge.net
> *Subject:* [Nagios-users] nrpe and critical state
>
>
>
> I am doing  a check which counts the number of process running on the
> remote host.  Lets say for whatever reason nrpe could not establish
> connection during the test.  In this situation, should it not go to UNKNOWN
> state.  However I see its going to critical state.  Please advise on this.
> Thanks
>
> [user]>./check_nrpe -H xxxxx -c check_process_x64 -a 0: 10: httpd
> Connection refused by host
> [user]>echo $?
> 2
>
> --
> Cordially,
> Shadhin Rahman
>
>
> ------------------------------------------------------------------------------
>
>
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when
> reporting any issue.
> ::: Messages without supporting info will risk being sent to /dev/null
>



-- 
Cordially,
Shadhin Rahman
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From shadhin71 at gmail.com  Tue May 25 18:35:43 2010
From: shadhin71 at gmail.com (shadih rahman)
Date: Tue, 25 May 2010 12:35:43 -0400
Subject: nrpe and critical state
In-Reply-To: 
References: 
	<63F73C973E3E4547979026ECC295EF5C027F17F5@rchemxp01.fnc.net.local>
	
Message-ID: 

I found this link.   I am running nagios version 3.2.0, can someone please
verify if this path has been integrated with 3.2.0 or not?  Please advise on
this.  Thanks

https://secure.opsera.com/jira/browse/OPS-313

On Tue, May 25, 2010 at 11:52 AM, shadih rahman  wrote:

>  nrpe -u option does not work if nrpe is shutdown in the remote host.
>
> On Tue, May 25, 2010 at 11:42 AM, Stier, Matthew <
> Matthew.Stier at us.fujitsu.com> wrote:
>
>>  From the check_nrpe usage message:
>>
>>
>>
>> Options:
>>
>>  -n         = Do no use SSL
>>
>>  -u         = Make socket timeouts return an UNKNOWN state instead of
>> CRITICAL
>>
>>
>>
>> *From:* shadih rahman [mailto:shadhin71 at gmail.com]
>> *Sent:* Tuesday, May 25, 2010 11:35 AM
>> *To:* nagios-users at lists.sourceforge.net
>> *Subject:* [Nagios-users] nrpe and critical state
>>
>>
>>
>> I am doing  a check which counts the number of process running on the
>> remote host.  Lets say for whatever reason nrpe could not establish
>> connection during the test.  In this situation, should it not go to UNKNOWN
>> state.  However I see its going to critical state.  Please advise on this.
>> Thanks
>>
>> [user]>./check_nrpe -H xxxxx -c check_process_x64 -a 0: 10: httpd
>> Connection refused by host
>> [user]>echo $?
>> 2
>>
>> --
>> Cordially,
>> Shadhin Rahman
>>
>>
>> ------------------------------------------------------------------------------
>>
>>
>> _______________________________________________
>> Nagios-users mailing list
>> Nagios-users at lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/nagios-users
>> ::: Please include Nagios version, plugin version (-v) and OS when
>> reporting any issue.
>> ::: Messages without supporting info will risk being sent to /dev/null
>>
>
>
>
> --
> Cordially,
> Shadhin Rahman
>



-- 
Cordially,
Shadhin Rahman
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From darren at brucetelecom.com  Tue May 25 19:16:52 2010
From: darren at brucetelecom.com (Darren Hill)
Date: Tue, 25 May 2010 13:16:52 -0400
Subject: nagios/opsview with qpage
Message-ID: <4BFC0604.8020506@brucetelecom.com>

Hi,

I've seen a few examples of qpage working with nagios (I'm using opsview 
as a frontend to nagios).
Qpage is working ok manually, but when I try and have nagios send the 
page nothing comes out.
I'm using ubuntu 8.04 LTS server running opsview 3.7

Here's my misccommands.cfg, contacts.cfg

define command{
         command_name notify-by-pagerscript
         command_line /usr/local/nagios/libexec/notifications/qpage -s 
localhost -p $PAGER$ -f $HOSTNAME$ $SERVICEDESC$ `SERVICEOUTPUT$' $HOSTNAME$
}

define contact {
         contact_name    admin/01default
         alias           Administrator
         use             global-contact
         service_notification_period     24x7
         host_notification_period        24x7
         notification_level              1
        email hidden at whatever
         pager   INETPager
         host_notification_commands      
notify-by-email,notify-by-pagerscript
         host_notification_options       u,d,r,f
         service_notification_commands   
notify-by-email,notify-by-pagerscript
         service_notification_options    w,c,r,u,f
}

Any ideas?

qpage is running with /usr/local/bin/qpage -q 5 in the background waiting.

Thanks!

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From richard.lynch at rasmussen.edu  Tue May 25 20:09:13 2010
From: richard.lynch at rasmussen.edu (Richard Lynch)
Date: Tue, 25 May 2010 13:09:13 -0500
Subject: localhost DOWN messages,
 return code 127 is out of bounds
In-Reply-To: 
References: 
Message-ID: 

I?m a newbie, but...

Does your /etc/hosts/ file have this line in it:
127.0.0.1      localhost

It should have that.

On 5/18/10 5:39 PM, "Michael Maxwell"  wrote:

> Hello,
> 
> Just installed nagios 3.2.1 following the docs at
> http://nagios.sourceforge.net/docs/3_0/quickstart.html
> 
> Everything starts up normally, web interface is fine, can login,
> enable/disable checks, etc...  However, every 2 hours exactly,
> I'm getting the following alert in email:
> 
> From: nagios
> Subject: ** PROBLEM Host Alert: localhost is DOWN **
> 
> ***** Nagios *****
> 
> Notification Type: PROBLEM
> Host: localhost
> State: DOWN
> Address: 127.0.0.1
> Info: (Return code of 127 is out of bounds - plugin may be missing)
> 
> Date/Time: Tue May 18 15:00:15 PDT 2010
> 
> If I go to check the services, everything is green.
> 
> This is coming from a fresh install with nothing special configured.
> 
> I've googled until my eyes glazed over and no one seems to report
> this happening with "localhost".  I've checked for the plugins,
> checked permissions, etc... The best I can tell, it's all as it
> should be.  I?m sure I?ve missed some simple detail somewhere, but
> it?s eluding me at the moment.
> 
> Any help on solving this problem would be greatly appreciated.
> 
> Below, I've included logfile contents as well as every detail
> I could think of on how the system is setup.
> 
> Thanks
> 
> [1274215437] Nagios 3.2.1 starting... (PID=5185)[1274215437] Local time is Tue
> May 18 13:43:57 PDT 2010
> [1274215437] LOG VERSION: 2.0
> [1274215437] Finished daemonizing... (New PID=5186)[1274215447] Warning:
> Return code of 127 for check of host 'localhost' was out of bounds. Make sure
> the plugin you're trying to run actually e
> xists.
> [1274215447] HOST ALERT: localhost;DOWN;SOFT;1;(Return code of 127 is out of
> bounds - plugin may be missing)[1274215477] Warning: Return code of 127 for
> check of service 'Current Load' on host 'localhost' was out of bounds. Make
> sure the plugin you'r
> e trying to run actually exists.
> [1274215477] SERVICE ALERT: localhost;Current Load;CRITICAL;HARD;1;(Return
> code of 127 is out of bounds - plugin may be missing)[1274215517] Warning:
> Return code of 127 for check of host 'localhost' was out of bounds. Make sure
> the plugin you're trying to run actually e
> xists.
> [1274215517] HOST ALERT: localhost;DOWN;SOFT;2;(Return code of 127 is out of
> bounds - plugin may be missing)[1274215517] Warning: Return code of 127 for
> check of service 'Current Users' on host 'localhost' was out of bounds. Make
> sure the plugin you'
> re trying to run actually exists.
> [1274215517] SERVICE ALERT: localhost;Current Users;CRITICAL;HARD;1;(Return
> code of 127 is out of bounds - plugin may be missing)[1274215557] Warning:
> Return code of 127 for check of service 'HTTP' on host 'localhost' was out of
> bounds. Make sure the plugin you're trying
>  to run actually exists.
> [1274215557] SERVICE ALERT: localhost;HTTP;CRITICAL;HARD;1;(Return code of 127
> is out of bounds - plugin may be missing)[1274215587] Warning: Return code of
> 127 for check of host 'localhost' was out of bounds. Make sure the plugin
> you're trying to run actually e
> xists.
> [1274215557] SERVICE ALERT: localhost;HTTP;CRITICAL;HARD;1;(Return code of 127
> is out of bounds - plugin may be missing)[1274215587] Warning: Return code of
> 127 for check of host 'localhost' was out of bounds. Make sure the plugin
> you're trying to run actually e
> xists.
> [1274215587] HOST ALERT: localhost;DOWN;SOFT;3;(Return code of 127 is out of
> bounds - plugin may be missing)
> [1274215597] Warning: Return code of 127 for check of service 'PING' on host
> 'localhost' was out of bounds. Make sure the plugin you're trying to run
> actually exists.
> [1274215597] SERVICE ALERT: localhost;PING;CRITICAL;HARD;1;(Return code of 127
> is out of bounds - plugin may be missing)
> [1274215627] Warning: Return code of 127 for check of service 'Root Partition'
> on host 'localhost' was out of bounds. Make sure the plugin you're trying to
> run actually exists.
> [1274215627] SERVICE ALERT: localhost;Root Partition;CRITICAL;HARD;1;(Return
> code of 127 is out of bounds - plugin may be missing)
> 
> 
> Detailed information on the system, what's installed, etc...:
> 
> CentOS 5.4
> Linux [deleted] 2.6.18-194.el5 #1 SMP Fri Apr 2 14:58:35 EDT 2010 i686 i686
> i386 GNU/Linux
> 
> Nagios 3.2.1
> Nagios Plugins 1.4.14
> Apache httpd 2.2.15
> PHP 5.3.2
> 
> $ grep nagios /etc/passwd
> nagios:x:9000:9001::/home/nagios:/bin/bash
> 
> $ grep nag /etc/group
> nagcmd:x:9000:apache
> nagios:x:9001:nagios
> 
> Options used to build nagios:
> ./configure     --with-command-group=nagcmd \
>                 --prefix=/opt/nagios-3.2.1 \
>                 --with-nagios-user=nagios \
>                 --with-httpd-conf=/opt/httpd/conf
> 
> Options used to build plugins:
> ./configure     --with-nagios-user=nagios \
>                 --with-nagios-group=nagios \
>                 --prefix=/opt/nagios-3.2.1
> 
> Permissions of files in /opt/nagios-3.2.1/libexec:
> $ ls -l /opt/nagios-3.2.1/libexec/
> total 1360K
> -rwxr-xr-x 1 nagios nagios 31052 May 18 13:48 check_apt*
> -rwxr-xr-x 1 nagios nagios  2240 May 18 13:48 check_breeze*
> -rwxr-xr-x 1 nagios nagios 38078 May 18 13:48 check_by_ssh*
> lrwxrwxrwx 1 root   root       9 May 18 13:48 check_clamd -> check_tcp*
> -rwxr-xr-x 1 nagios nagios 21516 May 18 13:48 check_cluster*
> -r-sr-xr-x 1 root   nagios 34952 May 18 13:48 check_dhcp*
> -rwxr-xr-x 1 nagios nagios 32640 May 18 13:48 check_dig*
> -rwxr-xr-x 1 nagios nagios 48280 May 18 13:48 check_disk*
> -rwxr-xr-x 1 nagios nagios  8075 May 18 13:48 check_disk_smb*
> -rwxr-xr-x 1 nagios nagios 36777 May 18 13:48 check_dns*
> -rwxr-xr-x 1 nagios nagios 18705 May 18 13:48 check_dummy*
> -rwxr-xr-x 1 nagios nagios  3051 May 18 13:48 check_file_age*
> -rwxr-xr-x 1 nagios nagios  6313 May 18 13:48 check_flexlm*
> lrwxrwxrwx 1 root   root       9 May 18 13:48 check_ftp -> check_tcp*
> -rwxr-xr-x 1 nagios nagios 85634 May 18 13:48 check_http*
> -r-sr-xr-x 1 root   nagios 39620 May 18 13:48 check_icmp*
> -rwxr-xr-x 1 nagios nagios 23743 May 18 13:48 check_ide_smart*
> -rwxr-xr-x 1 nagios nagios 15132 May 18 13:48 check_ifoperstatus*
> -rwxr-xr-x 1 nagios nagios 12518 May 18 13:48 check_ifstatus*
> lrwxrwxrwx 1 root   root       9 May 18 13:48 check_imap -> check_tcp*
> -rwxr-xr-x 1 nagios nagios  7350 May 18 13:48 check_ircd*
> lrwxrwxrwx 1 root   root       9 May 18 13:48 check_jabber -> check_tcp*
> -rwxr-xr-x 1 nagios nagios 30014 May 18 13:48 check_ldap*
> lrwxrwxrwx 1 root   root      10 May 18 13:48 check_ldaps -> check_ldap*
> -rwxr-xr-x 1 nagios nagios 28714 May 18 13:48 check_load*
> -rwxr-xr-x 1 nagios nagios  6020 May 18 13:48 check_log*
> -rwxr-xr-x 1 nagios nagios 20282 May 18 13:48 check_mailq*
> -rwxr-xr-x 1 nagios nagios 25812 May 18 13:48 check_mrtg*
> -rwxr-xr-x 1 nagios nagios 24295 May 18 13:48 check_mrtgtraf*
> -rwxr-xr-x 1 nagios nagios 28548 May 18 13:48 check_nagios*
> lrwxrwxrwx 1 root   root       9 May 18 13:48 check_nntp -> check_tcp*
> lrwxrwxrwx 1 root   root       9 May 18 13:48 check_nntps -> check_tcp*
> -rwxr-xr-x 1 nagios nagios 38266 May 18 13:48 check_nt*
> -rwxr-xr-x 1 nagios nagios 35917 May 18 13:48 check_ntp*
> -rwxr-xr-x 1 nagios nagios 33889 May 18 13:48 check_ntp_peer*
> -rwxr-xr-x 1 nagios nagios 31725 May 18 13:48 check_ntp_time*
> -rwxr-xr-x 1 nagios nagios 54192 May 18 13:48 check_nwstat*
> -rwxr-xr-x 1 nagios nagios  8324 May 18 13:48 check_oracle*
> -rwxr-xr-x 1 nagios nagios 30356 May 18 13:48 check_overcr*
> -rwxr-xr-x 1 nagios nagios 41387 May 18 13:48 check_ping*
> lrwxrwxrwx 1 root   root       9 May 18 13:48 check_pop -> check_tcp*
> -rwxr-xr-x 1 nagios nagios 39647 May 18 13:48 check_procs*
> -rwxr-xr-x 1 nagios nagios 29078 May 18 13:48 check_real*
> -rwxr-xr-x 1 nagios nagios  9579 May 18 13:48 check_rpc*
> -rwxr-xr-x 1 nagios nagios  1135 May 18 13:48 check_sensors*
> lrwxrwxrwx 1 root   root       9 May 18 13:48 check_simap -> check_tcp*
> -rwxr-xr-x 1 nagios nagios 51895 May 18 13:48 check_smtp*
> lrwxrwxrwx 1 root   root       9 May 18 13:48 check_spop -> check_tcp*
> -rwxr-xr-x 1 nagios nagios 26231 May 18 13:48 check_ssh*
> lrwxrwxrwx 1 root   root       9 May 18 13:48 check_ssmtp -> check_tcp*
> -rwxr-xr-x 1 nagios nagios 30682 May 18 13:48 check_swap*
> -rwxr-xr-x 1 nagios nagios 38940 May 18 13:48 check_tcp*
> -rwxr-xr-x 1 nagios nagios 28909 May 18 13:48 check_time*
> lrwxrwxrwx 1 root   root       9 May 18 13:48 check_udp -> check_tcp*
> -rwxr-xr-x 1 nagios nagios 34400 May 18 13:48 check_ups*
> -rwxr-xr-x 1 nagios nagios 27485 May 18 13:48 check_users*
> -rwxr-xr-x 1 nagios nagios  2934 May 18 13:48 check_wave*
> -rwxr-xr-x 1 nagios nagios 28787 May 18 13:48 negate*
> -rwxr-xr-x 1 nagios nagios 27484 May 18 13:48 urlize*
> -rwxr-xr-x 1 nagios nagios  1939 May 18 13:48 utils.pm*
> -rwxr-xr-x 1 nagios nagios   862 May 18 13:48 utils.sh*
> 
> 
> 
> ------------------------------------------------------------------------------
> 
> 
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when reporting
> any issue. 
> ::: Messages without supporting info will risk being sent to /dev/null

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From rmp.dmd1229 at gmail.com  Tue May 25 21:27:17 2010
From: rmp.dmd1229 at gmail.com (rmp dmd)
Date: Tue, 25 May 2010 15:27:17 -0400
Subject: Reading Nagios Alerts
Message-ID: 

The nagios log files have different time stamps. How can this be converted
to the normal time stamp? Can nagios config be changed to output the normal
time stamp on the log files?

Thanks!
Roehl

[1274760000] LOG ROTATION: DAILY
[1274760000] LOG VERSION: 2.0
[1274760000] CURRENT HOST STATE: anchor;DOWN;HARD;1;CRITICAL - Host
Unreachable
[1274760000] CURRENT HOST STATE: axis;UP;HARD;1;PING OK - Packet loss = 0%,
RTA = 0.67 ms
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From rwolfe at i-evolve.com  Tue May 25 21:36:28 2010
From: rwolfe at i-evolve.com (Robert Wolfe)
Date: Tue, 25 May 2010 15:36:28 -0400
Subject: Reading Nagios Alerts
In-Reply-To: 
References: 
Message-ID: <33C7BE4CE80D504E8907FDC31F8A0830505EAA7CD3@iexchange.I-EVOLVE.com>

The following link has the answer you need:

http://support.nagios.com/knowledgebase/faqs/index.php?option=com_content&view=article&id=52&catid=35&faq_id=70&expand=false&showdesc=true

But, I'll post it here as well:

To quote:

-----8<-----
#!/usr/bin/perl
# nagios-log-print
# taken from the faqs section FAQID: F0070 on the www.nagios.org website
#

if (!@ARGV[0]) {
    print "Usage: {jumi [knowledge-based/faq/viewfaq.php]} \n";
    print "\n";
    print "prints the specified Nagios-logfile with timestamps\n";
    print "converted to human readable time\n";
    exit 1;
}

open(LOG, at ARGV[0]) || die "no Nagios log file named >>>>>>@ARGV[0]<<<<<<";
while () {
    ($part1, $part2) = split(/\s+/, $_, 2);
    $part1 =~ s/\[//;
    $part1 =~ s/\]//;
    print "[", scalar localtime($part1), "]\;$part2\n";
};
close(LOG);


Alternatively, if you want to display the log from the command line use;

perl -pe 's/(\d+)/localtime(jumi)/e' nagios.log



From: rmp dmd [mailto:rmp.dmd1229 at gmail.com]
Sent: Tuesday, May 25, 2010 3:27 PM
To: nagios-users at lists.sourceforge.net
Subject: [Nagios-users] Reading Nagios Alerts

The nagios log files have different time stamps. How can this be converted to the normal time stamp? Can nagios config be changed to output the normal time stamp on the log files?

Thanks!
Roehl

[1274760000] LOG ROTATION: DAILY
[1274760000] LOG VERSION: 2.0
[1274760000] CURRENT HOST STATE: anchor;DOWN;HARD;1;CRITICAL - Host Unreachable
[1274760000] CURRENT HOST STATE: axis;UP;HARD;1;PING OK - Packet loss = 0%, RTA = 0.67 ms
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From jbroughton at truecos.com  Tue May 25 22:00:54 2010
From: jbroughton at truecos.com (Jayson Broughton)
Date: Tue, 25 May 2010 14:00:54 -0600
Subject: nagios/opsview with qpage
In-Reply-To: <4BFC0604.8020506@brucetelecom.com>
References: <4BFC0604.8020506@brucetelecom.com>
Message-ID: <1274817654.2513.19.camel@localhost.localdomain>

Darren,
I don't know if this helps..but I use qpage for nagios and other *nix
alerts.  I call qpage directly from /usr/local/bin..here is my info

####command.cfg
# 'host-notify-by-sms' command definition
define command{
        command_name    host-notify-by-sms
        command_line    /usr/local/bin/qpage -s localhost -p
name1,name2,name3 -f $NOTIFICATIONTYPE$ - Host $HOSTALIAS$ is $HOSTSTATE
$
        }

# 'service-notify-by-sms' command definition
define command{
        command_name    service-notify-by-sms
        command_line    /usr/local/bin/qpage -s localhost -p
name1,name2,name3 -f $NOTIFICATIONTYPE$ - $HOSTALIAS$/$SERVICEDESC$ is
$SERVICESTATE$
        }

####contacts.cfg
define contact{
        contact_name                    ITpager
        alias                           Operations On-Call
        service_notification_period     24x7
        host_notification_period        24x7
        service_notification_options    w,u,c,r,f
        host_notification_options       d,u,r,f
        service_notification_commands   service-notify-by-sms
        host_notification_commands      host-notify-by-sms
        email                           administrator at whatever.huh
        }

qpage's running process:
qpage -C /etc/qpage.cf -q 15



On Tue, 2010-05-25 at 13:16 -0400, Darren Hill wrote:
> Hi,
> 
> I've seen a few examples of qpage working with nagios (I'm using opsview 
> as a frontend to nagios).
> Qpage is working ok manually, but when I try and have nagios send the 
> page nothing comes out.
> I'm using ubuntu 8.04 LTS server running opsview 3.7
> 
> Here's my misccommands.cfg, contacts.cfg
> 
> define command{
>          command_name notify-by-pagerscript
>          command_line /usr/local/nagios/libexec/notifications/qpage -s 
> localhost -p $PAGER$ -f $HOSTNAME$ $SERVICEDESC$ `SERVICEOUTPUT$' $HOSTNAME$
> }
> 
> define contact {
>          contact_name    admin/01default
>          alias           Administrator
>          use             global-contact
>          service_notification_period     24x7
>          host_notification_period        24x7
>          notification_level              1
>         email hidden at whatever
>          pager   INETPager
>          host_notification_commands      
> notify-by-email,notify-by-pagerscript
>          host_notification_options       u,d,r,f
>          service_notification_commands   
> notify-by-email,notify-by-pagerscript
>          service_notification_options    w,c,r,u,f
> }
> 
> Any ideas?
> 
> qpage is running with /usr/local/bin/qpage -q 5 in the background waiting.
> 
> Thanks!
> 
> ------------------------------------------------------------------------------
> 
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
> ::: Messages without supporting info will risk being sent to /dev/null



The information in this electronic mail message and any attached files is confidential and may be legally privileged.  If you are not the intended recipient, delete this message and contact the sender immediately.  Access to this message by anyone other than its intended recipient is unauthorized.  You must not use or disseminate this information as it is proprietary property of the True companies.  Communications on or through the True companies' computer systems may be monitored or recorded to secure effective system operation and for other lawful purposes.  Thank you.


------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From mwiese at beit.de  Tue May 25 22:02:44 2010
From: mwiese at beit.de (mwiese at beit.de)
Date: Tue, 25 May 2010 22:02:44 +0200
Subject: =?iso-8859-1?q?Marco_Wiese/BEIT_GmbH_ist_au=DFer_H?=
	=?iso-8859-1?q?aus=2E?=
Message-ID: 



Ich werde vom 25.05.2010 bis einschlie?lich  04.06.2010 nicht im B?ro sein.

Ich werde Ihre Nachricht nach meiner R?ckkehr schnellstm?glich beantworten.
------------------------------------------------------
I will be out of the office starting 05/25/2010 and will not return until
06/04/2010. I will answer your message after my return.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From michael at medin.name  Wed May 26 07:23:40 2010
From: michael at medin.name (Michael Medin)
Date: Wed, 26 May 2010 07:23:40 +0200
Subject: NSClient++ 0.3.8 released!
Message-ID: <4BFCB05C.3090608@medin.name>

Hello,

Actually this was released last week but I did not have time to get it 
up on sourceforge and what not but here we go...
New version of NSClient++ (Agent for monitoring amongst other things) 
Windows from (amongst other systems) Nagios.

You can download it from either the project page: http://nsclient.org or 
sourceforge: https://sourceforge.net/projects/nscplus/files/ I shall see 
what happen to nagiosforge seems it is not there any more.

Whats new:
A lot of new features and, I am especially, proud of the new event log 
filters.
For those who has not seen the eventlog filter I recommend reading up on 
the quick guide i wrote on the
http://nsclient.org/nscp/wiki/CheckEventLog/CheckEventLog page on the wiki.
It has a fairly good getting started guide I hope. But a quick note is 
that the new default filter is:
   generated > -2d AND severity NOT IN ('success', 'informational')
Which I believe is a lot more readable the before.

Other news are many minor changes and fixes in the file7disk related 
checks as well as index in CheckCounter to allow handling multiple 
languages and locales.

The changelog grouped by module can be found here:
New commands
  - CheckSingleFile - to check spects of a single file use like so:
     CheckSingleFile file=d:\nrpe_512.pem warn=>100 check=line-count 
warn=>100 crit=>170 check=size
  - CheckSingleRegEntry
     CheckSingleRegEntry 
path=HKEY_LOCAL_MACHINE\SYSTEM\CurrentControlSet\services\eventlog\Application\MaxSize 
"syntax=%path%: %int%"  warn==20971520 crit==20971520 check=int ShowAll=long

Scripts
  - Cleaned up scripts folder
  - Added new "NagiosPlugin library" from op5
  - Added check_no_rdp.vbs (Checks that no RDP connection is online)
  - Added check_battery.vbs which checks batterys via WMI
  - Added check_printer.vbs to check printers via WMI

CheckExternalScript
  - Added new "script templating" thing to simplify adding scripts:
     [Wrapped Scripts]
     test=check_test.ps1 ; (will use correct template for running ps1 
commands)
  - Fixed erroneous error message "Failed to peek buffer"

NSCA
  - Added obfuscated_password to NSCA section
  - Added so "global" ([Settings] password=...) passwords are read from 
the NSCA module

CheckEventLog
  - Brand new filter syntax based on SQL where clauses
     Avalible operators are: =, !=, >, <, >=, <=, eq, ne, gt, lt, ge, 
le, OR, AND, like, IN, NOT IN

CheckCounter
  - Added new index option to CheckCounter to allow looking up index and 
thus you can use the same checks on multiple locales and also use 
characters not present in "NRPE charset)
     CheckCounter index "Counter=\1450(_Total)\1458" ShowAll MaxWarn=500 
MaxCrit=1000
     nsclient++ -noboot CheckSystem pdhlookup Utskrifter
  - Optional new "safe" PDH subsystem (slower, but possibly safer)
     pdh_subsystem=thread-safe
  - Added checks for missing counters to CheckCounter

CheckDriveSize
  - Added volume support for CheckDriveSize (CHeckAll) like so:
     CheckDriveSize MinWarn=50% MinCrit=25% CheckAll=volumes 
FilterType=FIXED FilterType=REMOTE
  - Changed "missing" disks are now a critical error and not unknown
  - Improved CheckDriveSize bad FilterType error message
  - Added option to return error messages to the client [CheckDisk] 
show_errors=1 (defauilt is off 0)

CheckFile2
  - Dates are signed (means you can use neagitve dates to check the future)
     CheckFile2 debug path=D:\tmp\dates filter+creation=<-30m MaxWarn=1 
MaxCrit=1 "syntax=%filename%: %creation%"
     CRITICAL:future.txt: C: Thursday, December 31, 2009 12:47:11, found 
files: 1 > critical|'found files'=1;1;1;
  - Added checks for missing path and missing filter on CheckFile2 thus
  - Fixed so files locked for reading can be checked (basic checks)
  - Improved speed of file chyecking (does not check file data twice)
  - Changed so missing files and such generate an error
  - Fixed major issue with date matching in CheckFile* which was not 
working at all.

- Exe file version checks:
     CheckFile2 path=D:\tmp pattern=*.exe filter+version=!=6.0.2900.5512 
"syntax=%filename%: %version%" MaxWarn=1
  - Line count check:
     CheckFile2 path=D:\tmp pattern=*.txt filter+line-count=ne:3 
"syntax=%filename%: %line-count%" MaxWarn=1
  - Added ignore-errors to "ignore" any filesystem related errors 
(NOTICE this is probably not what you want)
  - Added master-syntax to CheckFile2 to change the overall message like 
so: %list%, %files%, %matches%
     CheckFile2 MinWarn=10 MinCrit=10 path=D:\WINDOWS\system32 
filter+size=gt:0 truncate=10 ignore-errors "master-syntax=%matches%/%files%"
  - Added %user% to syntax to print user who generated message
     CheckEventLog file=application file=system filter=new filter=out 
MaxWarn=1 MaxCrit=1 filter-generated=>2w filter-severity==success 
filter-severity==informational truncate=1023 unique descriptions 
"syntax=%user% (%count%)"
     CRITICAL: (1),  (2), NT INSTANS\SYSTEM (3), NT INSTANS\SYSTEM (3), 
NT INSTANS\SYSTEM (3), missing (3), missing (5),  (4), missing (2), 
missing (2), missing (2), missing (2),  (1), eventlog: 33 > 
critical|'eventlog'=33;1;1;

Generic:
  - Added != to all string comparisons
  - Changes syntax of performance counters: Alias is ' %' and it 
also has the "full" non % data as ''
     CheckDriveSize CheckAll MaxWarnUsed=80% MaxCritUsed=90%
     CRITICAL:CRITICAL: C:\: Total: 146G - Used: 140G (95%) - Free: 
6.31G (5%) > critical, D:\: Total: 152G - Used: 148G (97%) - Free: 3.59G 
(3%) > critical|'C:\ %'=95%;80;90; 'C:\'=140.17G;117.18;131.83;0;146.48; 
'D:\ %'=97%;80;90; 'D:\'=147.93G;121.21;136.3;0;151.52;
  - Added warning message ewhen numerical filters evaluate to zero (and 
are not 0)
  -  added a new "option" in conjunction with -c you can now do -m to 
specify the module to load.
     nsclient++ -m CheckDisk.dll -c CheckDriveSize MaxWarn=100 CheckAll

// Michael Medin

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From ballarati at interfree.it  Wed May 26 10:44:02 2010
From: ballarati at interfree.it (Andrea Ballarati)
Date: Wed, 26 May 2010 10:44:02 +0200
Subject: check_openmanage plugin error
In-Reply-To: <4BFCB05C.3090608@medin.name>
References: <4BFCB05C.3090608@medin.name>
Message-ID: <4BFCDF52.6000600@interfree.it>

Nagios reports error from the plugin in subject, we have another Dell
PowerEdge 1950 for which no errors are reported.
This is the output of check_openmanage -d

   System:      PowerEdge 1800
   ServiceTag:  XXXXXXXX                  OMSA version:    4.5.0
   BIOS/date:   A05 09/21/2005           Plugin version:  3.5.7
-----------------------------------------------------------------------------
   Storage Components

=============================================================================
  STATE  |    ID    |  MESSAGE TEXT

---------+----------+--------------------------------------------------------
 WARNING |        0 | Controller 0 [CERC SATA 1.5/2s] needs attention:
Degraded
      OK |    0:0:0 | Array Disk 0:0 [1.0TB] on ctrl 0 is Online
      OK |    0:0:1 | Array Disk 0:1 [1.0TB] on ctrl 0 is Online
      OK |      0:0 | Logical drive 0 'Windows Disk 0' [RAID-1, 931.48
GB] on ctrl 0 is Ready
      OK |      0:0 | Channel 0 [] on controller 0 is Ready
-----------------------------------------------------------------------------
   Chassis Components

=============================================================================
  STATE  |  ID  |  MESSAGE TEXT

---------+------+------------------------------------------------------------
      OK |    1 | Memory module 1 [DIMM1_A, 512 MB] is Ok
      OK |    2 | Memory module 2 [DIMM1_B, 512 MB] is Ok
      OK |    1 | Chassis fan 1 [BMC Fan 1]: 1500
      OK |    2 | Chassis fan 2 [BMC Fan 2]: 1500
      OK |    0 | Power Supply 0 [VRM]: Presence detected
      OK |    1 | Power Supply 1 [VRM]: Presence detected
      OK |    0 | Temperature Probe 0 [PROC_1 Temp] reads 38 C (max=120/125)
      OK |    1 | Temperature Probe 1 [BMC Ambient Temp] reads 22 C
(min=8/3, max=40/45)
      OK |    2 | Temperature Probe 2 [BMC Planar Temp] reads 33 C
(min=8/3, max=62/67)
      OK |    3 | Temperature Probe 3 [BMC VRD 0 Temp] reads 31 C
(min=8/3, max=70/75)
      OK |    4 | Temperature Probe 4 [BMC VRD 1 Temp] reads 27 C
(min=8/3, max=70/75)
      OK |    0 | Processor 0 [Intel Xeon 3.00GHz] is Present
      OK |    0 | Voltage sensor 0 [BMC CMOS Battery] is 3.070 V
      OK |    1 | Voltage sensor 1 [PROC_1 VCORE] is Good
      OK |    2 | Voltage sensor 2 [BMC PROC VTT] is Good
      OK |    3 | Voltage sensor 3 [BMC 1.5V PG] is Good
      OK |    4 | Voltage sensor 4 [BMC 1.8V PG] is Good
      OK |    5 | Voltage sensor 5 [BMC 3.3V PG] is Good
      OK |    6 | Voltage sensor 6 [BMC 5V PG] is Good
      OK |    0 | Chassis intrusion 0 detection: Ok (Not Breached)
-----------------------------------------------------------------------------
   Other messages

=============================================================================
  STATE  |  MESSAGE TEXT

---------+-------------------------------------------------------------------
      OK | ESM log health is Ok (less than 80% full)

INTERNAL ERROR: Use of uninitialized value in numeric eq (==) at
/usr/lib/nagios/plugins/check_openmanage line 1380.
INTERNAL ERROR: Use of uninitialized value in numeric eq (==) at
/usr/lib/nagios/plugins/check_openmanage line 1380.
INTERNAL ERROR: Use of uninitialized value in sprintf at
/usr/lib/nagios/plugins

Any ideas?
Thank you.

-- 
Andrea Ballarati

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From t.h.amundsen at usit.uio.no  Wed May 26 12:00:16 2010
From: t.h.amundsen at usit.uio.no (Trond Hasle Amundsen)
Date: Wed, 26 May 2010 12:00:16 +0200
Subject: check_openmanage plugin error
In-Reply-To: <4BFCDF52.6000600@interfree.it> (Andrea Ballarati's message of
	"Wed, 26 May 2010 10:44:02 +0200")
References: <4BFCB05C.3090608@medin.name> <4BFCDF52.6000600@interfree.it>
Message-ID: <15tpr0j2dan.fsf@tux.uio.no>

Andrea Ballarati  writes:

> Nagios reports error from the plugin in subject, we have another Dell
> PowerEdge 1950 for which no errors are reported.
> This is the output of check_openmanage -d
>
>    System:      PowerEdge 1800
>    ServiceTag:  XXXXXXXX                  OMSA version:    4.5.0
>    BIOS/date:   A05 09/21/2005           Plugin version:  3.5.7
> -----------------------------------------------------------------------------
>    Storage Components
>
> =============================================================================
>   STATE  |    ID    |  MESSAGE TEXT
>
> ---------+----------+--------------------------------------------------------
>  WARNING |        0 | Controller 0 [CERC SATA 1.5/2s] needs attention:
> Degraded
>       OK |    0:0:0 | Array Disk 0:0 [1.0TB] on ctrl 0 is Online
>       OK |    0:0:1 | Array Disk 0:1 [1.0TB] on ctrl 0 is Online
>       OK |      0:0 | Logical drive 0 'Windows Disk 0' [RAID-1, 931.48
> GB] on ctrl 0 is Ready
>       OK |      0:0 | Channel 0 [] on controller 0 is Ready
> -----------------------------------------------------------------------------
>    Chassis Components
>
> =============================================================================
>   STATE  |  ID  |  MESSAGE TEXT
>
> ---------+------+------------------------------------------------------------
>       OK |    1 | Memory module 1 [DIMM1_A, 512 MB] is Ok
>       OK |    2 | Memory module 2 [DIMM1_B, 512 MB] is Ok
>       OK |    1 | Chassis fan 1 [BMC Fan 1]: 1500
>       OK |    2 | Chassis fan 2 [BMC Fan 2]: 1500
>       OK |    0 | Power Supply 0 [VRM]: Presence detected
>       OK |    1 | Power Supply 1 [VRM]: Presence detected
>       OK |    0 | Temperature Probe 0 [PROC_1 Temp] reads 38 C (max=120/125)
>       OK |    1 | Temperature Probe 1 [BMC Ambient Temp] reads 22 C
> (min=8/3, max=40/45)
>       OK |    2 | Temperature Probe 2 [BMC Planar Temp] reads 33 C
> (min=8/3, max=62/67)
>       OK |    3 | Temperature Probe 3 [BMC VRD 0 Temp] reads 31 C
> (min=8/3, max=70/75)
>       OK |    4 | Temperature Probe 4 [BMC VRD 1 Temp] reads 27 C
> (min=8/3, max=70/75)
>       OK |    0 | Processor 0 [Intel Xeon 3.00GHz] is Present
>       OK |    0 | Voltage sensor 0 [BMC CMOS Battery] is 3.070 V
>       OK |    1 | Voltage sensor 1 [PROC_1 VCORE] is Good
>       OK |    2 | Voltage sensor 2 [BMC PROC VTT] is Good
>       OK |    3 | Voltage sensor 3 [BMC 1.5V PG] is Good
>       OK |    4 | Voltage sensor 4 [BMC 1.8V PG] is Good
>       OK |    5 | Voltage sensor 5 [BMC 3.3V PG] is Good
>       OK |    6 | Voltage sensor 6 [BMC 5V PG] is Good
>       OK |    0 | Chassis intrusion 0 detection: Ok (Not Breached)
> -----------------------------------------------------------------------------
>    Other messages
>
> =============================================================================
>   STATE  |  MESSAGE TEXT
>
> ---------+-------------------------------------------------------------------
>       OK | ESM log health is Ok (less than 80% full)
>
> INTERNAL ERROR: Use of uninitialized value in numeric eq (==) at
> /usr/lib/nagios/plugins/check_openmanage line 1380.
> INTERNAL ERROR: Use of uninitialized value in numeric eq (==) at
> /usr/lib/nagios/plugins/check_openmanage line 1380.
> INTERNAL ERROR: Use of uninitialized value in sprintf at
> /usr/lib/nagios/plugins

Hi Andrea,

check_openmanage is designed to work with relatively recent OMSA
versions. You are using OMSA version 4.5.0, which is very old. The
server in question (poweredge 1800) is supported by newer OMSA, so the
solution is an OMSA upgrade to the latest version (6.2.0).

OMSA versions 5.3.0 and later is OK to use with check_openmanage, and
I've had reports that 5.1.0 and 5.2.0 works as well (but no
guarantee). Anything older will yield strange results or will simply not
work.

Cheers,
-- 
Trond H. Amundsen 
Center for Information Technology Services, University of Oslo

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From ballarati at interfree.it  Wed May 26 12:21:03 2010
From: ballarati at interfree.it (Andrea Ballarati)
Date: Wed, 26 May 2010 12:21:03 +0200
Subject: check_openmanage plugin error
In-Reply-To: <15tpr0j2dan.fsf@tux.uio.no>
References: <4BFCB05C.3090608@medin.name> <4BFCDF52.6000600@interfree.it>
	<15tpr0j2dan.fsf@tux.uio.no>
Message-ID: <4BFCF60F.50800@interfree.it>

Trond Hasle Amundsen wrote:
> check_openmanage is designed to work with relatively recent OMSA
> versions. You are using OMSA version 4.5.0, which is very old. The
> server in question (poweredge 1800) is supported by newer OMSA, so the
> solution is an OMSA upgrade to the latest version (6.2.0).
> 
> OMSA versions 5.3.0 and later is OK to use with check_openmanage, and
> I've had reports that 5.1.0 and 5.2.0 works as well (but no
> guarantee). Anything older will yield strange results or will simply not
> work.

Thank you for the answer, I'm going to upgrade both servers to OMSA ver.
6. This shoud solve the problem.

OMSA version 5.2 that is currently running on the other server
(PowerEdge 1950) seems to work well with the plugin.


-- 
Andrea Ballarati

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From matgand at gmail.com  Wed May 26 14:13:51 2010
From: matgand at gmail.com (Mattia Gandolfi)
Date: Wed, 26 May 2010 14:13:51 +0200
Subject: LDAP authentication and CGI authorization problem
In-Reply-To: 
References: 
Message-ID: 

Self-replying: I've just discovered the root cause: can_submit_commands was
set to 0 in contacts template definition.
Works as expected now

Mattia

On Tue, May 25, 2010 at 3:08 PM, Mattia Gandolfi  wrote:

> Hi all,
>
> I'm facing problems while trying to enable LDAP authentication on a Nagios
> 3.2.1 install (using htpasswd.users everything works fine).
> This is how I've configured Apache:
>
> 
>     AuthType Basic
>     AuthName "Nagios - Ldap"
>     AuthBasicProvider ldap
>     AuthLDAPUrl
> ldaps://unixautmi-ese01.sky.local:636,unixautca-ese01.sky.local:636/ou=people,dc=sky,dc=local?uid
>     AuthLDAPBindDN "cn=authuser,dc=sky,dc=local"
>     AuthLDAPBindPassword oaj5Phum
>     Require ldap-dn uid=gandolfim,ou=people,dc=sky,dc=local
>     Require ldap-user gandolfim
>     AuthLDAPGroupAttributeIsDN off
>     Require ldap-group cn=systemadminmi,ou=groups,dc=sky,dc=local
>     Require ldap-group cn=infosec,ou=groups,dc=sky,dc=local
>     AuthLDAPGroupAttribute memberUid
> 
> 
>     AuthType Basic
>     AuthName "Nagios - Ldap - CGI"
>     AuthBasicProvider ldap
>     AuthLDAPUrl
> ldaps://unixautmi-ese01.sky.local:636,unixautca-ese01.sky.local:636/ou=people,dc=sky,dc=local?uid
>     AuthLDAPBindDN "cn=authuser,dc=sky,dc=local"
>     AuthLDAPBindPassword oaj5Phum
>     Require ldap-dn uid=gandolfim,ou=people,dc=sky,dc=local
>     Require ldap-user gandolfim
>     AuthLDAPGroupAttributeIsDN off
>     Require ldap-group cn=systemadminmi,ou=groups,dc=sky,dc=local
>     Require ldap-group cn=infosec,ou=groups,dc=sky,dc=local
>     AuthLDAPGroupAttribute memberUid
> 
>
> I've defined my username as a contact
>
> define contact {
>         use             email-contact
>         contact_name    gandolfim
>         alias           Mattia Gandolfi
>         email           mattia.gandolfi at xxxxxxx.com
>         pager           none
> }
>
> and I've set the following options in cgi.cfg
>
> use_authentication=1
> use_ssl_authentication=0
> authorized_for_system_information=gandolfim
> authorized_for_configuration_information=gandolfim
> authorized_for_system_commands=gandolfim
> authorized_for_all_services=gandolfim
> authorized_for_all_hosts=gandolfim
> authorized_for_all_service_commands=gandolfim
>
> Authentication works fine, and I see "Logged in as *gandolfim"* on top of
> the Tactical Monitoring Overview page.
> However, as soon as I try to access the cgi, for example to disable
> notifications for a service, I get "Sorry, but you are not authorized to
> commit the specified command."
>
> What am I missing?
>
> Thanks
>
> Mattia
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From darren at brucetelecom.com  Wed May 26 15:43:39 2010
From: darren at brucetelecom.com (Darren Hill)
Date: Wed, 26 May 2010 09:43:39 -0400
Subject: nagios/opsview with qpage
In-Reply-To: <1274817654.2513.19.camel@localhost.localdomain>
References: <4BFC0604.8020506@brucetelecom.com>
	<1274817654.2513.19.camel@localhost.localdomain>
Message-ID: <4BFD258B.5060604@brucetelecom.com>

I have it working now.
I think I'll shorten down the message to fit onto the SMS capacity 
limit, but it does work.
I was using $PAGER$ when doing -p $PAGER$ in my command and for some 
reason it didn't like that, even if I defined it under my contacts..



On 5/25/2010 4:00 PM, Jayson Broughton wrote:
> Darren,
> I don't know if this helps..but I use qpage for nagios and other *nix
> alerts.  I call qpage directly from /usr/local/bin..here is my info
>
> ####command.cfg
> # 'host-notify-by-sms' command definition
> define command{
>          command_name    host-notify-by-sms
>          command_line    /usr/local/bin/qpage -s localhost -p
> name1,name2,name3 -f $NOTIFICATIONTYPE$ - Host $HOSTALIAS$ is $HOSTSTATE
> $
>          }
>
> # 'service-notify-by-sms' command definition
> define command{
>          command_name    service-notify-by-sms
>          command_line    /usr/local/bin/qpage -s localhost -p
> name1,name2,name3 -f $NOTIFICATIONTYPE$ - $HOSTALIAS$/$SERVICEDESC$ is
> $SERVICESTATE$
>          }
>
> ####contacts.cfg
> define contact{
>          contact_name                    ITpager
>          alias                           Operations On-Call
>          service_notification_period     24x7
>          host_notification_period        24x7
>          service_notification_options    w,u,c,r,f
>          host_notification_options       d,u,r,f
>          service_notification_commands   service-notify-by-sms
>          host_notification_commands      host-notify-by-sms
>          email                           administrator at whatever.huh
>          }
>
> qpage's running process:
> qpage -C /etc/qpage.cf -q 15
>
>
>
> On Tue, 2010-05-25 at 13:16 -0400, Darren Hill wrote:
>    
>> Hi,
>>
>> I've seen a few examples of qpage working with nagios (I'm using opsview
>> as a frontend to nagios).
>> Qpage is working ok manually, but when I try and have nagios send the
>> page nothing comes out.
>> I'm using ubuntu 8.04 LTS server running opsview 3.7
>>
>> Here's my misccommands.cfg, contacts.cfg
>>
>> define command{
>>           command_name notify-by-pagerscript
>>           command_line /usr/local/nagios/libexec/notifications/qpage -s
>> localhost -p $PAGER$ -f $HOSTNAME$ $SERVICEDESC$ `SERVICEOUTPUT$' $HOSTNAME$
>> }
>>
>> define contact {
>>           contact_name    admin/01default
>>           alias           Administrator
>>           use             global-contact
>>           service_notification_period     24x7
>>           host_notification_period        24x7
>>           notification_level              1
>>          email hidden at whatever
>>           pager   INETPager
>>           host_notification_commands
>> notify-by-email,notify-by-pagerscript
>>           host_notification_options       u,d,r,f
>>           service_notification_commands
>> notify-by-email,notify-by-pagerscript
>>           service_notification_options    w,c,r,u,f
>> }
>>
>> Any ideas?
>>
>> qpage is running with /usr/local/bin/qpage -q 5 in the background waiting.
>>
>> Thanks!
>>
>> ------------------------------------------------------------------------------
>>
>> _______________________________________________
>> Nagios-users mailing list
>> Nagios-users at lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/nagios-users
>> ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
>> ::: Messages without supporting info will risk being sent to /dev/null
>>      
>
>
> The information in this electronic mail message and any attached files is confidential and may be legally privileged.  If you are not the intended recipient, delete this message and contact the sender immediately.  Access to this message by anyone other than its intended recipient is unauthorized.  You must not use or disseminate this information as it is proprietary property of the True companies.  Communications on or through the True companies' computer systems may be monitored or recorded to secure effective system operation and for other lawful purposes.  Thank you.
>
>
> ------------------------------------------------------------------------------
>
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
> ::: Messages without supporting info will risk being sent to /dev/null
>
>    

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From mlists at 1984.is  Wed May 26 20:49:34 2010
From: mlists at 1984.is (mlists at 1984.is)
Date: Wed, 26 May 2010 18:49:34 +0000
Subject: Overly persistent contact group
Message-ID: 

Dear list,

I have nagios3 on Debian Lenny.  I created a service template and host
template for a customer as well as a couple of contacts and a contact
group.  I specified the contact group in the host and service template
and created some host and service defininitions based on the
aforementioned templates.

So I hoped that notifications would be sent to these new contacts as per
the setup descibed above. This hope failed, and notifications were only
sent to a 'admins' contactgroup, which is not specified anywhere in the
setup of those hosts, services, contacts, group or template.

When I remove the 'admins' contact group from the config files and run a
test of the config, I get this:

Error: Contact group 'admins' specified in service 'SYSTEM STATUS' for
host 'host.domain.tld' is not defined anywhere!

Even though this contact group is mentioned nowhere in connection with
these hosts or services.

It seems that all contact groups except the one named 'admins' fail to
register with the Nagios system and that the 'admins' contact group is
somehow automatically associated with all host definitions, regardless of
which contact group is actually specified in configuration.

Mordur Ingolfsson
 

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From mark.frost1 at pepsico.com  Wed May 26 21:00:11 2010
From: mark.frost1 at pepsico.com (Frost, Mark {PBC})
Date: Wed, 26 May 2010 15:00:11 -0400
Subject: Overly persistent contact group
In-Reply-To: 
References: 
Message-ID: 

Mordur,

Two thoughts on this.  First, I find that I've been burned many times by
contact/contactgroup inheritance.  That is, where you define a contactgroup
for a host and that gets inherited by the service (when I don't want it to).

Second, I rely a lot on looking at the "Configuration" link at the bottom of the
Nagios web interface.  That lets you look and see what's really defined for
all the objects (hosts, services, contacts, contactgroups, timeperiods, etc).
Essentially it allows me to go in and compare what I intended to say in the
configuration with what Nagios really is using.

Mark

-----Original Message-----
From: mlists at 1984.is [mailto:mlists at 1984.is] 
Sent: Wednesday, May 26, 2010 2:50 PM
To: nagios-users at lists.sourceforge.net
Subject: [Nagios-users] Overly persistent contact group

Dear list,

I have nagios3 on Debian Lenny.  I created a service template and host
template for a customer as well as a couple of contacts and a contact
group.  I specified the contact group in the host and service template
and created some host and service defininitions based on the
aforementioned templates.

So I hoped that notifications would be sent to these new contacts as per
the setup descibed above. This hope failed, and notifications were only
sent to a 'admins' contactgroup, which is not specified anywhere in the
setup of those hosts, services, contacts, group or template.

When I remove the 'admins' contact group from the config files and run a
test of the config, I get this:

Error: Contact group 'admins' specified in service 'SYSTEM STATUS' for
host 'host.domain.tld' is not defined anywhere!

Even though this contact group is mentioned nowhere in connection with
these hosts or services.

It seems that all contact groups except the one named 'admins' fail to
register with the Nagios system and that the 'admins' contact group is
somehow automatically associated with all host definitions, regardless of
which contact group is actually specified in configuration.

Mordur Ingolfsson
 

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From mirde at oppy.com  Thu May 27 01:46:03 2010
From: mirde at oppy.com (Mirza Dedic)
Date: Wed, 26 May 2010 16:46:03 -0700
Subject: check_by_ssh plugin;
	how to ignore pulgin timeout being critical?
Message-ID: 

Hi List,



I have Nagios currently using check_by_ssh (check_by_ssh v2002 (nagios-plugins 1.4.13)) to check some of my remote servers, it calls a log script on the remote server and the log script returns a warning if the log files matches the criteria.



Every now and then, the check_by_ssh script gets timed out, and returns the message below.



Is there a way to suppress this at the Nagios box, so that Plugin Timed Out messages do not get emailed as "Critical"?



Anyone know? Below is my command definition..



define command{

                command_name             van-oppy-pro_data450

                command_line $USER1$/check_by_ssh -H someserver -p 22 -i /home/nagios/.ssh/id_dsa -t 10 -C "/home/nagios/plugins/check_logfiles -t 30 -f /home/nagios/log_conf/usr1_data450.lg"

}



Nagios alert returned when the box being queried is very busy (this does not happen often, but creates a unwanted false positive warning):



CRITICAL - Plugin timed out while executing system call

The Oppenheimer Group ---- CONFIDENTIAL

This message is for the designated recipient only and may contain privileged, proprietary, or otherwise private information. If you have received it in error, please notify the sender immediately and delete the original. Any other use of the email by you is prohibited.

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From root at spyou.org  Thu May 27 06:24:06 2010
From: root at spyou.org (Spyou)
Date: Thu, 27 May 2010 06:24:06 +0200
Subject: check_by_ssh plugin;
 how to ignore pulgin timeout being critical?
In-Reply-To: 
References: 
Message-ID: <4BFDF3E6.1090000@spyou.org>

Le 27/05/2010 01:46, Mirza Dedic a ?crit :
> *                command_line $USER1$/check_by_ssh -H someserver -p 22
> -i /home/nagios/.ssh/id_dsa -t 10 -C
> "/home/nagios/plugins/check_logfiles -t 30 -f
> /home/nagios/log_conf/usr1_data450.lg"*


Increase both timeouts for check_ssh and check_logfiles ?

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From smainklh at free.fr  Thu May 27 07:45:36 2010
From: smainklh at free.fr (=?ISO-8859-1?Q?Sma=EFne?= Kahlouch)
Date: Thu, 27 May 2010 07:45:36 +0200
Subject: User privileges separation
Message-ID: <1274939136.10274.0.camel@mazhost>

Hi every one,

As you expect, i would like to know if it's possible to configure
different access for users.
Users are stored in a LDAP backend.

For example, users from ou=org1,dc=domain,dc=tld could only see the
hostgroup 1
users from ou=org2,dc=domain,dc=tld could see everything.

I already know how to authenticate my users by configuring apache but
don't know how to set acl's/rights ...

I googled a bit and didn't find anything about it.
Just a wishlist :(

Is there a tool or is it possible with apache ?

Thanks for your help.

Regards,
Grifith 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From james at linux-source.org  Thu May 27 08:46:42 2010
From: james at linux-source.org (James Corteciano)
Date: Thu, 27 May 2010 14:46:42 +0800
Subject: Nagios - To capture apache process list before
	restarting
Message-ID: 

Hi All,

How do you guy's setup your nagios client for remote apache server to get
its httpd process list before doing soft/hard restarting apache service if
it is detected unreachable from nagios server? The following are the summary
of my setup for nagios clients for remote apache server.

[nagios-server]
       cat /usr/lib/nagios/plugins/eventhandlers/restart-httpd-remote
                (SOFT/HARD): /usr/lib/nagios/plugins/check_nrpe -H
nagios-client -c restart_httpd

[nagios-client]
       cat /etc/nagios/nrpe.conf

  command[restart_httpd]=/usr/lib/nagios/plugins/eventhandlers/restart-httpd
       cat /usr/lib/nagios/plugins/eventhandlers/restart-httpd
               /usr/bin/sudo /sbin/service httpd restart

I just wanted to know how do you get the apache process list before
triggering to restart which will helpful during diagnostic operation. Do you
used any special plugins or personal script that could insert to
"restart-httpd" file?

Thank you.

Regards,
James
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From patrick.morris at hp.com  Thu May 27 09:00:30 2010
From: patrick.morris at hp.com (Morris, Patrick)
Date: Thu, 27 May 2010 00:00:30 -0700
Subject: User privileges separation
In-Reply-To: <1274939136.10274.0.camel@mazhost>
References: <1274939136.10274.0.camel@mazhost>
Message-ID: <4BFE188E.9090408@hp.com>

Sma?ne Kahlouch wrote:
> Hi every one,
>
> As you expect, i would like to know if it's possible to configure 
> different access for users.
> Users are stored in a LDAP backend.
>
> For example, users from ou=org1,dc=domain,dc=tld could only see the 
> hostgroup 1
> users from ou=org2,dc=domain,dc=tld could see everything.
>
> I already know how to authenticate my users by configuring apache but 
> don't know how to set acl's/rights ...
>
> I googled a bit and didn't find anything about it.
> Just a wishlist 
> :(
>
> Is there a tool or is it possible with apache ?

Nagios will, by default, only show hosts and services to those who are 
listed as contacts on those hosts and services. Who is authorized to see 
what is not handled at the Apache level.

It won't talk to LDAP directly; it will only use what is in it's own 
configuration files. You could, however, probably write a script that 
generated contact groups based on who is where in LDAP and added the 
users to those contact groups. Then you'd need to configure those 
contact groups on the appropriate host and service definitions.

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From ae at op5.se  Thu May 27 10:26:32 2010
From: ae at op5.se (Andreas Ericsson)
Date: Thu, 27 May 2010 10:26:32 +0200
Subject: nagios/opsview with qpage
In-Reply-To: <4BFD258B.5060604@brucetelecom.com>
References: <4BFC0604.8020506@brucetelecom.com>	<1274817654.2513.19.camel@localhost.localdomain>
	<4BFD258B.5060604@brucetelecom.com>
Message-ID: <4BFE2CB8.7090209@op5.se>

On 05/26/2010 03:43 PM, Darren Hill wrote:
> I have it working now.
> I think I'll shorten down the message to fit onto the SMS capacity
> limit, but it does work.
> I was using $PAGER$ when doing -p $PAGER$ in my command and for some
> reason it didn't like that, even if I defined it under my contacts..
> 

Off the top of my head, I think the macro to use is $CONTACTPAGER$,
but I could be mistaken. Check the docs for the exact answer.

-- 
Andreas Ericsson                   andreas.ericsson at op5.se
OP5 AB                             www.op5.se
Tel: +46 8-230225                  Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From smainklh at free.fr  Wed May 26 21:38:45 2010
From: smainklh at free.fr (=?ISO-8859-1?Q?Sma=EFne?= Kahlouch)
Date: Wed, 26 May 2010 21:38:45 +0200
Subject: User privileges separation
Message-ID: <1274902725.13601.6.camel@mazhost>

Hi every one,

As you expect, i would like to know if it's possible to configure
different access for users.
Users are stored in a LDAP backend.

For example, users from ou=org1,dc=domain,dc=tld could only see the
hostgroup 1
users from ou=org2,dc=domain,dc=tld could see everything.

I already know how to authenticate my users by configuring apache but
don't know how to set acl's/rights ...

I googled a bit and didn't find anything about it.
Just a wishlist :(

Is there a tool or is it possible with apache ?

Thanks for your help.

Regards,
Grifith
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From Gabriel at impactteachers.com  Thu May 27 11:58:11 2010
From: Gabriel at impactteachers.com (Gabriel - IP Guys)
Date: Thu, 27 May 2010 10:58:11 +0100
Subject: Monitoring Performance: Not sure what is good or bad
Message-ID: 

Below is a snapshot of monitoring performance as shown on the Tactical
Monitoring Overview Page

Monitoring Performance

Service Check Execution Time:		0.04 / 14.14 / 3.036 sec
Service Check Latency:			0.00 / 85.15 / 9.604 sec
Host Check Execution Time:		4.30 / 16.05 / 8.263 sec
Host Check Latency:			0.02 / 112.41 / 30.940 sec
# Active Host / Service Checks:	43 / 244
# Passive Host / Service Checks:	0  / 0

Now, I know that I'm not monitoring that many hosts, but I'm not sure
what 'good' performance is. My nagios server had an extremely high load
this morning, and the only service on that box, is nagios.

What type of performance should I reasonably expect from my monitoring
server?


The Puppet Apprentice   :- http://puppetnewbie.blogspot.com/
Follow me on twitter    :- http://twitter.com/mritguru
Puppet #tag on twitter  :- #puppet
IRC                     :- itguru ON irc.freenode.org (feel free to say
hi!)



------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From felix at itim-cj.ro  Thu May 27 12:01:21 2010
From: felix at itim-cj.ro (felix farcas)
Date: Thu, 27 May 2010 13:01:21 +0300
Subject: apache22 problem installing from ports
Message-ID: <4BFE42F1.3050001@itim-cj.ro>

Hello

 have a problem installing apache 22 from ports. The installation stops 
with the following message:

You need APR random support to use mod_auth_digest.
Look at APR configure options --with-egd and --with-devrandom.
checking whether to enable mod_auth_digest... configure: error: 
mod_auth_digest failures
===>  Script "configure" failed unexpectedly.

How may I overcome this error?
Thank you
Felix



-- 
Ing. drd. Farcas Felix
National Institute of Research and Development 
of Isotopic and Molecular Technology,
IT - Department - Cluj-Napoca, Romania
yahoo id: felixfarcas
skype id: felix.farcas
mobile: +40-742-195323 


-------------- next part --------------
A non-text attachment was scrubbed...
Name: smime.p7s
Type: application/x-pkcs7-signature
Size: 3092 bytes
Desc: S/MIME Cryptographic Signature
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From lists at xodus.org  Thu May 27 14:43:47 2010
From: lists at xodus.org (Marc Powell)
Date: Thu, 27 May 2010 07:43:47 -0500
Subject: Nagios - To capture apache process list before
	restarting
In-Reply-To: 
References: 
Message-ID: <8855C022-249C-476E-AF5D-043A37869933@xodus.org>


On May 27, 2010, at 1:46 AM, James Corteciano wrote:

> I just wanted to know how do you get the apache process list before triggering to restart which will helpful during diagnostic operation. Do you used any special plugins or personal script that could insert to "restart-httpd" file?


Couldn't it be as straightforward as modifying the restart script on the client to capture it to a file? --

/usr/lib/nagios/plugins/eventhandlers/restart-httpd --
		/bin/ps -efwww | grep https > /path/to/apachelist.log
               /usr/bin/sudo /sbin/service httpd restart

--
Marc
------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From lists at xodus.org  Thu May 27 14:46:14 2010
From: lists at xodus.org (Marc Powell)
Date: Thu, 27 May 2010 07:46:14 -0500
Subject: apache22 problem installing from ports
In-Reply-To: <4BFE42F1.3050001@itim-cj.ro>
References: <4BFE42F1.3050001@itim-cj.ro>
Message-ID: <0B56D564-B344-499A-8E77-45F0C5D4F9D8@xodus.org>


On May 27, 2010, at 5:01 AM, felix farcas wrote:

> Hello
> 
> have a problem installing apache 22 from ports. The installation stops with the following message:

Sent to the wrong list?

--
Marc


------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From michael.friedrich at univie.ac.at  Thu May 27 15:33:44 2010
From: michael.friedrich at univie.ac.at (Michael Friedrich)
Date: Thu, 27 May 2010 15:33:44 +0200
Subject: [Nagios-devel] Nagios development moving forward
In-Reply-To: <4BD9D408.808@op5.se>
References: <4BD9D408.808@op5.se>
Message-ID: <4BFE74B8.1040200@univie.ac.at>

Hi,

just some feedback from my side on your patches.

-------- Original Message  --------
Subject: [Nagios-devel] Nagios development moving forward
From: Andreas Ericsson 
To: nagios-devel , 
Nagios-users at lists.sourceforge.net 
Date: 2010-04-29 20:46

> So for those of you who wish to try out a revamped Nagios with a
> bunch of bugfixes and code improvements, clone your repo from me
> or Ton today and build Nagios from source.

I've taken all your commits and applied them manually onto the current 
Icinga Core (we changed things, Nagios did not, and next to that, I'd 
like to understand what you and Toon hacked there ;-)

Your hackings are very very nice, and it's a real good start in fixing 
things and getting rid of old grown code and workarounds.

This commit is just WOW :-)

http://git.op5.org/git/?p=nagios.git;a=commit;h=555b4b72aae54e65ba923093ef469c3aa5a0edb9

Regarding taking over commits, kudos have been kept both within commit 
subjects and changelog for our next release including your patches next 
to ours.

 From the testing arena, everything runs fine. Although the sync 
retention feature is only one half of the good, the other would be 
opsview or some nifty hacking on distributed monitoring :) Everything 
else resolves as it should be.


By chance, get these changes into a revamped tar.gz and hand that to 
packagers, if it does not get into current CVS head. Or at least the 
patch set :)

Kind regards,
Michael

>
> To clone a repository with git, use
>    git clone
>
> This will get you all latest sources. For further info about git,
> see http://www.git-scm.org
>
> My repo:
>    clone url:
>      git://git.op5.org/nagios.git
>    gitweb url's:
>      http://git.op5.org/git/?p=nagios.git
>      http://git.op5.org/git/?p=nagios.git;a=shortlog;h=refs/heads/dev
>    branches:
>      master            Same as the still-official Nagios CVS repo
>      dev               My changes on top of CVS import
>      ae-ton            The merged state between me and Ton
>
> Ton's repo:
>    clone url:
>      git://github.com/tonvoon/opsview-nagios.git
>    gitweb url:
>      http://github.com/tonvoon/opsview-nagios.git
>    branches:
>      master            Ton's and my changes on top of CVS import
>
>
> Those of you who have reported bugs to tracker.nagios.org should
> log into the tracker again and see if your bug has been closed or
> resolved, and if so try out my dev branch, which should contain
> the proper fix.
>
> Those of you who are git-savvy since before can try
>
>    git log -p --grep="fix.*#$bug_id_without_leading_zeroes" dev
>
> in a repo cloned from mine and it'll show you the commits that
> are responsible for fixing your problem.
>


-- 
DI (FH) Michael Friedrich
michael.friedrich at univie.ac.at
Tel: +43 1 4277 14359

Vienna University Computer Center
Universitaetsstrasse 7 A-1010 Vienna, Austria

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From ae at op5.se  Thu May 27 15:51:39 2010
From: ae at op5.se (Andreas Ericsson)
Date: Thu, 27 May 2010 15:51:39 +0200
Subject: Nagios development moving forward
In-Reply-To: <4BFE74B8.1040200@univie.ac.at>
References: <4BD9D408.808@op5.se> <4BFE74B8.1040200@univie.ac.at>
Message-ID: <4BFE78EB.6050609@op5.se>

On 05/27/2010 03:33 PM, Michael Friedrich wrote:
> Hi,
> 
> just some feedback from my side on your patches.
> 
> -------- Original Message --------
> Subject: [Nagios-devel] Nagios development moving forward
> From: Andreas Ericsson 
> To: nagios-devel , 
> Nagios-users at lists.sourceforge.net 
> Date: 2010-04-29 20:46
> 
>> So for those of you who wish to try out a revamped Nagios with a
>> bunch of bugfixes and code improvements, clone your repo from me
>> or Ton today and build Nagios from source.
> 
> I've taken all your commits and applied them manually onto the current 
> Icinga Core (we changed things, Nagios did not, and next to that, I'd 
> like to understand what you and Toon hacked there ;-)
> 

Primarily code reduction and simplification. The diffstat between my
dev branch and the upstream master looks like this:
47 files changed, 1563 insertions(+), 4272 deletions(-)

Quite a few bugfixes too ofcourse.

> Your hackings are very very nice, and it's a real good start in fixing 
> things and getting rid of old grown code and workarounds.
> 

Yup. More can be done so object code only has to be compiled once for
both cgi's and core for quite a small tradeoff in memory usage. This
would mean simpler code and quite a lot simpler build-system, along
with the possibility of installing nagios-devel header files for us
eventmodule hackers.

> This commit is just WOW :-)
> 
> http://git.op5.org/git/?p=nagios.git;a=commit;h=555b4b72aae54e65ba923093ef469c3aa5a0edb9 
> 

Glad you like it. Macro voodoo is something of a fetish for me.

> 
> Regarding taking over commits, kudos have been kept both within commit 
> subjects and changelog for our next release including your patches next 
> to ours.
> 

Thanks. Always nice to be appreciated.

>  From the testing arena, everything runs fine. Although the sync 
> retention feature is only one half of the good, the other would be 
> opsview or some nifty hacking on distributed monitoring :) Everything 
> else resolves as it should be.
> 
> 
> By chance, get these changes into a revamped tar.gz and hand that to 
> packagers, if it does not get into current CVS head. Or at least the 
> patch set :)
> 

I'll poke Ethan about releasing 3.2.2 from git so we can get more
people onto the latest and greatest.

-- 
Andreas Ericsson                   andreas.ericsson at op5.se
OP5 AB                             www.op5.se
Tel: +46 8-230225                  Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.

------------------------------------------------------------------------------



From m.borsani at it.net  Thu May 27 15:50:07 2010
From: m.borsani at it.net (Marco Borsani)
Date: Thu, 27 May 2010 15:50:07 +0200
Subject: Scom collector
Message-ID: <24858.5679488771$1274969614@news.gmane.org>

Did  anybody ever use the collector between Nagios and Scom ??

 

Regards

Marco

-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From subscription at kkeane.com  Thu May 27 14:18:11 2010
From: subscription at kkeane.com (Kevin Keane)
Date: Thu, 27 May 2010 05:18:11 -0700
Subject: Monitoring Performance: Not sure what is good or
	bad
In-Reply-To: 
References: 
Message-ID: 

You have some checks that take a long time - more than 14 seconds for a service check and more than 16 seconds for a host check. You may want to identify which checks take such a long time. It may or may not be the culprit (the reason could be either a lot of processing, or waiting for some kind of timeout) but is probably a good starting point. Also run "top" to see which process is responsible for the load.

> -----Original Message-----
> From: Gabriel - IP Guys [mailto:Gabriel at impactteachers.com]
> Sent: Thursday, May 27, 2010 2:58 AM
> To: nagios-users at lists.sourceforge.net
> Subject: [Nagios-users] Monitoring Performance: Not sure what is good or
> bad
> 
> Below is a snapshot of monitoring performance as shown on the Tactical
> Monitoring Overview Page
> 
> Monitoring Performance
> 
> Service Check Execution Time:		0.04 / 14.14 / 3.036 sec
> Service Check Latency:			0.00 / 85.15 / 9.604 sec
> Host Check Execution Time:		4.30 / 16.05 / 8.263 sec
> Host Check Latency:			0.02 / 112.41 / 30.940 sec
> # Active Host / Service Checks:	43 / 244
> # Passive Host / Service Checks:	0  / 0
> 
> Now, I know that I'm not monitoring that many hosts, but I'm not sure what
> 'good' performance is. My nagios server had an extremely high load this
> morning, and the only service on that box, is nagios.
> 
> What type of performance should I reasonably expect from my monitoring
> server?
> 
> 
> The Puppet Apprentice   :- http://puppetnewbie.blogspot.com/
> Follow me on twitter    :- http://twitter.com/mritguru
> Puppet #tag on twitter  :- #puppet
> IRC                     :- itguru ON irc.freenode.org (feel free to say
> hi!)
> 
> 
> 
> ------------------------------------------------------------------------------
> 
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when reporting
> any issue.
> ::: Messages without supporting info will risk being sent to /dev/null

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From Kimmo.Lahteenlahti at nokia.com  Thu May 27 22:14:34 2010
From: Kimmo.Lahteenlahti at nokia.com (Kimmo.Lahteenlahti at nokia.com)
Date: Thu, 27 May 2010 22:14:34 +0200
Subject: Nagios host flapping on centos 5.4 ?
Message-ID: <13D20E8EEC29BE4092DC1896DC45A91656AC70AE50@NOK-EUMSG-04.mgdnok.nokia.com>

Hi,

I have Nagios installed on Ubuntu 8.04 running on Vmware Vsphere and Centos 5.4 running on IBM hs20 blade.
Both are identical setup and they are checking same servers.

But for some reason in centos almost half of the checked  host are flapping at any given time (40 host to check)
Biggest difference between this environments is that Ubuntu is using Nagios 3.0.6 and centos 3.2.1.

Have any of you noted similar behavior ?

I followed this guide when installing nagios to centos:
http://docs.cslabs.clarkson.edu/wiki/Install_Nagios_on_CentOS_5

And for some reason I can't stop nagios service in the Centos:
/etc/init.d/nagios stop
nagios (pid  2647) is running...
Stopping nagios:                                           [FAILED]

Or restart:
/etc/init.d/nagios restart
Stopping nagios:                                           [FAILED]
Starting nagios:

Restart via nagios.cmd works. Any idea why this is happening ?

-       Kimmo










-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From drew.weaver at thenap.com  Thu May 27 23:09:54 2010
From: drew.weaver at thenap.com (Drew Weaver)
Date: Thu, 27 May 2010 17:09:54 -0400
Subject: Sending NAGIOS alerts to Gmail addresses can get
 your mail server listed as suspect.
Message-ID: 

Hi there,

I noticed that a lot of our mail was ending up in users' junk/spam folder at GMail and it seems that if you send Nagios warning messages to Gmail they somehow assume that your server is malicious and spamming. Is it SOP to use a different SMTP server to deliver Nagios messages?

thanks,
-Drew



-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From maxhetrick at verizon.net  Thu May 27 23:25:31 2010
From: maxhetrick at verizon.net (Max Hetrick)
Date: Thu, 27 May 2010 17:25:31 -0400
Subject: Sending NAGIOS alerts to Gmail addresses can get
 your mail server listed as suspect.
In-Reply-To: 
References: 
Message-ID: <4BFEE34B.3050204@verizon.net>

Drew Weaver wrote:

> I noticed that a lot of our mail was ending up in users' junk/spam 
> folder at GMail and it seems that if you send Nagios warning messages to 
> Gmail they somehow assume that your server is malicious and spamming. Is 
> it SOP to use a different SMTP server to deliver Nagios messages?

I'd assume you'd have to take that up with GMail. Or check the spam 
filtering settings on the user accounts. I don't use GMail, but I assume 
there are junk spam filtering rules that you can do to classify mail on 
the settings, like all other e-mail clients and services.

Regards,
Max

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From gl at arlut.utexas.edu  Thu May 27 18:00:54 2010
From: gl at arlut.utexas.edu (Jay G. Scott)
Date: Thu, 27 May 2010 11:00:54 -0500
Subject: newbie -- checking log files
Message-ID: <20100527160054.GA6792@jgs.arlut.utexas.edu>


my boss says nagios (dunno which version he used) doesn't
monitor log files.

is that no longer true, i hope?

maybe i don't know what to google--but "nagios log files" is
telling me about nagios' log files.  not what i mean.  what i
want to know is whether nagios can be taught to monitor
syslog (for example) for nasty messages.

j.

-- 
Jay Scott		512-835-3553		gl at arlut.utexas.edu
Head of Sun Support, Sr. Operating Systems Specialist
Applied Research Labs, Computer Science Div.                   S224
University of Texas at Austin

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From perldork at webwizarddesign.com  Thu May 27 23:43:26 2010
From: perldork at webwizarddesign.com (Max)
Date: Thu, 27 May 2010 17:43:26 -0400
Subject: newbie -- checking log files
In-Reply-To: <20100527160054.GA6792@jgs.arlut.utexas.edu>
References: <20100527160054.GA6792@jgs.arlut.utexas.edu>
Message-ID: 

On Thu, May 27, 2010 at 12:00 PM, Jay G. Scott  wrote:
>
> my boss says nagios (dunno which version he used) doesn't
> monitor log files.
>
> is that no longer true, i hope?

http://www.google.com/#hl=en&q=Nagios+monitor+log+files

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From georgyboy at gmail.com  Thu May 27 23:50:44 2010
From: georgyboy at gmail.com (Jorge Gil)
Date: Thu, 27 May 2010 23:50:44 +0200
Subject: Sending NAGIOS alerts to Gmail addresses can get
	your mail server listed as suspect.
In-Reply-To: <4BFEE34B.3050204@verizon.net>
References: 
	<4BFEE34B.3050204@verizon.net>
Message-ID: 

Hi!

I use gmail, and get nagios alerts in my gmail account, but I dont remember
having done anything special to prevent nagios alerts ending up in spam.

After a while I set up a filter in the account to label the nagios incoming
mail, just for my comfort.

I dont think gmail lets you customize its spam filtering rules, but it does
let you mark emails in the spam folder as "not spam" emails. i guess that if
you add the sender address to your contacts, they will have less
possibilities to end up there. maybe also if you do a rule as i did for that
incoming mails, but these are just my happy ideas.

Good luck!
Jorge

On Thu, May 27, 2010 at 11:25 PM, Max Hetrick wrote:

>  Drew Weaver wrote:
>
> > I noticed that a lot of our mail was ending up in users' junk/spam
> > folder at GMail and it seems that if you send Nagios warning messages to
> > Gmail they somehow assume that your server is malicious and spamming. Is
> > it SOP to use a different SMTP server to deliver Nagios messages?
>
> I'd assume you'd have to take that up with GMail. Or check the spam
> filtering settings on the user accounts. I don't use GMail, but I assume
> there are junk spam filtering rules that you can do to classify mail on
> the settings, like all other e-mail clients and services.
>
> Regards,
> Max
>
>
> ------------------------------------------------------------------------------
>
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when
> reporting any issue.
> ::: Messages without supporting info will risk being sent to /dev/null
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From wwanghongrui at cebbank.com  Fri May 28 05:43:26 2010
From: wwanghongrui at cebbank.com (wwanghongrui)
Date: Fri, 28 May 2010 11:43:26 +0800
Subject: newbie -- checking log files
References: <20100527160054.GA6792@jgs.arlut.utexas.edu>
Message-ID: <201005281143253755289@cebbank.com>

You can see this http://labs.consol.de/lang/en/nagios/check_logfiles/
 




wwanghongrui
2010-05-28



???? Jay G. Scott
????? 2010-05-28 05:36:46
???? nagios-users
??? 
??? [Nagios-users] newbie -- checking log files

my boss says nagios (dunno which version he used) doesn't
monitor log files.
is that no longer true, i hope?
maybe i don't know what to google--but "nagios log files" is
telling me about nagios' log files.  not what i mean.  what i
want to know is whether nagios can be taught to monitor
syslog (for example) for nasty messages.
j.
-- 
Jay Scott 512-835-3553 gl at arlut.utexas.edu
Head of Sun Support, Sr. Operating Systems Specialist
Applied Research Labs, Computer Science Div.                   S224
University of Texas at Austin
------------------------------------------------------------------------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From mad at b-care.net  Fri May 28 08:15:49 2010
From: mad at b-care.net (=?ISO-8859-1?Q?Marc-Andr=E9?= Doll)
Date: Fri, 28 May 2010 08:15:49 +0200
Subject: IBM plugin
Message-ID: <1275027349.1507.7.camel@MADness>

Hi list,

I am looking for a plugin to check IBM blades and chassis the way
plugins like check_openmanage do from quite some time now. I tried
googling it and browsing NagiosExchange but I didn't find what I'm
looking for. I found plugins to check specific hardware on specific IBM
architecture but nothing really generalistic.

Does anyone have heard about or developed some "check_ibm" plugin ?

Thanks,


------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From ae at op5.se  Fri May 28 11:19:46 2010
From: ae at op5.se (Andreas Ericsson)
Date: Fri, 28 May 2010 11:19:46 +0200
Subject: Sending NAGIOS alerts to Gmail addresses can get
 your mail server listed as suspect.
In-Reply-To: 
References: 	<4BFEE34B.3050204@verizon.net>
	
Message-ID: <4BFF8AB2.60601@op5.se>

On 05/27/2010 11:50 PM, Jorge Gil wrote:
> Hi!
> 
> I use gmail, and get nagios alerts in my gmail account, but I dont remember
> having done anything special to prevent nagios alerts ending up in spam.
> 
> After a while I set up a filter in the account to label the nagios incoming
> mail, just for my comfort.
> 
> I dont think gmail lets you customize its spam filtering rules, but it does
> let you mark emails in the spam folder as "not spam" emails. i guess that if
> you add the sender address to your contacts, they will have less
> possibilities to end up there. maybe also if you do a rule as i did for that
> incoming mails, but these are just my happy ideas.
> 

GMail doesn't allow you to customize the spam-rules, but it does let you
bypass them. Only mail that's sent to inbox gets checked for spam, so any
filter which also adds "bypass inbox" to its rules will go straight to the
designated folder and never be caught by spamfilters.

-- 
Andreas Ericsson                   andreas.ericsson at op5.se
OP5 AB                             www.op5.se
Tel: +46 8-230225                  Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From smcafee at collaborativefusion.com  Fri May 28 12:18:48 2010
From: smcafee at collaborativefusion.com (Sean McAfee)
Date: Fri, 28 May 2010 06:18:48 -0400
Subject: Sending NAGIOS alerts to Gmail addresses can get
 your mail server listed as suspect.
In-Reply-To: 
References: 	<4BFEE34B.3050204@verizon.net>
	
Message-ID: <4BFF9888.4030103@collaborativefusion.com>

Jorge Gil wrote:
> Hi!
>  
> I use gmail, and get nagios alerts in my gmail account, but I dont 
> remember having done anything special to prevent nagios alerts ending up 
> in spam.
>  
> Good luck!
> Jorge

Same here.

OP, try checking yourself out on http://www.senderbase.org/.  It's 
likely that your servers ended up on an RBL for something else.

-- 
Sean McAfee
Senior Systems Engineer

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From jhein at netways.de  Fri May 28 12:45:54 2010
From: jhein at netways.de (Julian Hein)
Date: Fri, 28 May 2010 10:45:54 +0000
Subject: IBM plugin
In-Reply-To: <1275027349.1507.7.camel@MADness>
References: <1275027349.1507.7.camel@MADness>
Message-ID: 

Hi,

Here is one: http://www.monitoringexchange.org/p/794

Bye,
Julian


Am 28.05.10 08:15 schrieb "Marc-Andr? Doll" unter :

> Hi list,
> 
> I am looking for a plugin to check IBM blades and chassis the way
> plugins like check_openmanage do from quite some time now. I tried
> googling it and browsing NagiosExchange but I didn't find what I'm
> looking for. I found plugins to check specific hardware on specific IBM
> architecture but nothing really generalistic.
> 
> Does anyone have heard about or developed some "check_ibm" plugin ?
> 
> Thanks,
> 
> 
> ------------------------------------------------------------------------------
> 
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when reporting
> any issue. 
> ::: Messages without supporting info will risk being sent to /dev/null


------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From lists at xodus.org  Fri May 28 13:56:27 2010
From: lists at xodus.org (Marc Powell)
Date: Fri, 28 May 2010 06:56:27 -0500
Subject: Sending NAGIOS alerts to Gmail addresses can get
	your mail server listed as suspect.
In-Reply-To: 
References: 
Message-ID: <23005CFD-52DD-46D4-8A37-FE22100B153D@xodus.org>


On May 27, 2010, at 4:09 PM, Drew Weaver wrote:

> Hi there,
>  
> I noticed that a lot of our mail was ending up in users' junk/spam folder at GMail and it seems that if you send Nagios warning messages to Gmail they somehow assume that your server is malicious and spamming. Is it SOP to use a different SMTP server to deliver Nagios messages

Does your nagios server send the messages to gmail directly, and not through an SMTP relay? If so, are you following all the rules and expectations for it to be a mail server because to gmail, that's exactly what it is.

	- Is the SMTP server on the machine configured to HELO as a valid fully qualified domain name (hostname.yourdomain.com)?
	- Does hostname.yourdomain.com exist in the DNS and point to the outgoing public IP that gmail sees your message originate from?
	- Does that IP address have a reverse DNS entry of hostname.yourdomain.com?

You can answer most of this by looking at the Received: line in any of the messages where your system hands the message off to Google. 

At the very least a RDNS lookup of the IP should show hostname.yourdomain.com and a lookup of hostname.yourdomain.com should result in that same IP.

--
Marc


------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From drew.weaver at thenap.com  Fri May 28 14:17:54 2010
From: drew.weaver at thenap.com (Drew Weaver)
Date: Fri, 28 May 2010 08:17:54 -0400
Subject: Sending NAGIOS alerts to Gmail addresses can
	get	your mail server listed as suspect.
In-Reply-To: <23005CFD-52DD-46D4-8A37-FE22100B153D@xodus.org>
References: 
	<23005CFD-52DD-46D4-8A37-FE22100B153D@xodus.org>
Message-ID: 

No, the mail is relayed through our SMTP server.

And the RDNS and everything is fine, and it is even listed as 'Good' in senderbase.

thanks,
-Drew


-----Original Message-----
From: Marc Powell [mailto:lists at xodus.org] 
Sent: Friday, May 28, 2010 7:56 AM
To: Nagios Users List
Subject: Re: [Nagios-users] Sending NAGIOS alerts to Gmail addresses can get your mail server listed as suspect.


On May 27, 2010, at 4:09 PM, Drew Weaver wrote:

> Hi there,
>  
> I noticed that a lot of our mail was ending up in users' junk/spam folder at GMail and it seems that if you send Nagios warning messages to Gmail they somehow assume that your server is malicious and spamming. Is it SOP to use a different SMTP server to deliver Nagios messages

Does your nagios server send the messages to gmail directly, and not through an SMTP relay? If so, are you following all the rules and expectations for it to be a mail server because to gmail, that's exactly what it is.

	- Is the SMTP server on the machine configured to HELO as a valid fully qualified domain name (hostname.yourdomain.com)?
	- Does hostname.yourdomain.com exist in the DNS and point to the outgoing public IP that gmail sees your message originate from?
	- Does that IP address have a reverse DNS entry of hostname.yourdomain.com?

You can answer most of this by looking at the Received: line in any of the messages where your system hands the message off to Google. 

At the very least a RDNS lookup of the IP should show hostname.yourdomain.com and a lookup of hostname.yourdomain.com should result in that same IP.

--
Marc


------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From drew.weaver at thenap.com  Fri May 28 14:16:58 2010
From: drew.weaver at thenap.com (Drew Weaver)
Date: Fri, 28 May 2010 08:16:58 -0400
Subject: Sending NAGIOS alerts to Gmail addresses can get
 your mail server listed as suspect.
In-Reply-To: <4BFF9888.4030103@collaborativefusion.com>
References: 
	<4BFEE34B.3050204@verizon.net>
	
	<4BFF9888.4030103@collaborativefusion.com>
Message-ID: 

The score for our SMTP server is listed as 'Good' in Senderbase.

thanks,
-Drew


-----Original Message-----
From: Sean McAfee [mailto:smcafee at collaborativefusion.com] 
Sent: Friday, May 28, 2010 6:19 AM
To: Nagios Users List
Subject: Re: [Nagios-users] Sending NAGIOS alerts to Gmail addresses can get your mail server listed as suspect.

Jorge Gil wrote:
> Hi!
>  
> I use gmail, and get nagios alerts in my gmail account, but I dont 
> remember having done anything special to prevent nagios alerts ending up 
> in spam.
>  
> Good luck!
> Jorge

Same here.

OP, try checking yourself out on http://www.senderbase.org/.  It's 
likely that your servers ended up on an RBL for something else.

-- 
Sean McAfee
Senior Systems Engineer

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From f.hugh at comcast.net  Fri May 28 19:04:28 2010
From: f.hugh at comcast.net (f.hugh at comcast.net)
Date: Fri, 28 May 2010 17:04:28 +0000 (UTC)
Subject: Nagios and a custom webinject.pl
In-Reply-To: <1447110886.2421781275066253372.JavaMail.root@sz0051a.emeryville.ca.mail.comcast.net>
References: <1447110886.2421781275066253372.JavaMail.root@sz0051a.emeryville.ca.mail.comcast.net>
Message-ID: <1058556432.2421891275066268474.JavaMail.root@sz0051a.emeryville.ca.mail.comcast.net>

    We just deployed a new site that requires client certificates so I found a discussion thread on how to make a few changes to the script so that it would use the client certificate that I specified.  I left the original script intact along with all the nagios configs and named this new script webinjectcert.pl.  I wanted 2 separate scripts and configs so that I did not disrupt what I already had running successfully
    The new script works perfectly as long as I run it manually as sudo.  The problem is when I configure it for nagios with the same script, config file, and test case, it returns results that claim it could not log onto the site.  I have enabled debugging in the nagios.cfg, but I just can't see what the problem is.  I looked at the permissions on the new script and certificate to make sure that wasn't it and they look fine.  I am not really sure how to run it as the nagios user since I can't remember what the password is and don't want to make a mess and change it.
    Any ideas? 
    Paul 



------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From shadhin71 at gmail.com  Fri May 28 20:22:58 2010
From: shadhin71 at gmail.com (shadih rahman)
Date: Fri, 28 May 2010 14:22:58 -0400
Subject: Can an entity have more than one parent?
Message-ID: 

All,
   I am monitoring a set of vif behind csm.  I want to control the amount
notifications.  So basic idea is if the host is down don't bother sending
notification for the vif.  So, I fiugure I would just create parent child
relationship with the host serving the page and vif name.  However, one
problem is, all vifs are being served by two hosts.  So I have two
questions.

1)  Can a host have two parents?
2)  What happen if only one of those parents go down?

Please advise on these.  Thanks

-- 
Cordially,
Shadhin Rahman
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From perldork at webwizarddesign.com  Fri May 28 20:34:13 2010
From: perldork at webwizarddesign.com (Max)
Date: Fri, 28 May 2010 14:34:13 -0400
Subject: Can an entity have more than one parent?
In-Reply-To: 
References: 
Message-ID: 

On Fri, May 28, 2010 at 2:22 PM, shadih rahman  wrote:
> 1)? Can a host have two parents?

http://nagios.sourceforge.net/docs/3_0/objectdefinitions.html#host

> 2)? What happen if only one of those parents go down?

http://nagios.sourceforge.net/docs/3_0/networkreachability.html

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From gerard.lancellotti at mpsa.com  Mon May 31 08:53:55 2010
From: gerard.lancellotti at mpsa.com (gerard.lancellotti at mpsa.com)
Date: Mon, 31 May 2010 08:53:55 +0200
Subject: Access hosts and service for that you are contact
Message-ID: 


Hi all

I have multiple hosts and services that I monitor with Nagios. Each user
must view even the host and services for which there is contact. I defined
in the configuration files hosts and services the contact_groups user. This
works well with the hosts, but if the user requests the service_groups , he
has the following message:

                                                                           
                 Service Overview For All Service Groups                   
                                                                           
                                                                           
                                                                           
                                                                           









It appears as though you do not have permission to view information for any
                       of the hosts you requested...
   If you believe this is an error, check the HTTP server authentication
                    requirements for accessing this CGI
    and check the authorization options in your CGI configuration file.

Have anyone resolved this problem ?

Thanks for help

Cordialement

G?rard Lancellotti
Psa Peugeot Citro?n

DCSI/DSIN/INSI/ETSO/IPSL
T?l  01 30 19 93 90 / 06 80 90 45 90
Fax 01 30 19 77 80
E.Mail gerard.lancellotti at mpsa.com
PYPY10/106
case 60
Ce message peut contenir des informations confidentielles. S'il ne vous est
pas destin?, merci de le d?truire et d'informer imm?diatement son ?metteur.

Pour plus d'informations relatives ? la confidentialit? et ? la s?curit?
veuillez consulter http://disclaimer.psa-peugeot-citroen.com.


------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null




From xml.devel at gmail.com  Mon May 31 14:01:38 2010
From: xml.devel at gmail.com (Kumar, Ashish)
Date: Mon, 31 May 2010 17:31:38 +0530
Subject: UTF-8 characters in Nagios UI
In-Reply-To: 
References: 
	
Message-ID: 

On Tue, May 11, 2010 at 10:57 PM, Richard Lynch  wrote:

>  As I look at the ?Technical Monitoring Overview? frame on my laptop proof
> of concept install, there is no DOCTYPE, and the encoding is ISO-8859-1...
>
> #fail
>
>
Thanks for the idea but the frame status.cgi has UTF-8 encoding.  Then
again, as I mentioned in my first e-mail, Service section shows UTF-8
characters but Status Information does not and both of them reside on the
same page.

We upgraded to nagios 3.2.1, still no luck.  Perhaps nagios developers could
help with this?
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From lecb79 at gmail.com  Mon May 31 18:29:45 2010
From: lecb79 at gmail.com (Eduardo Barreto)
Date: Mon, 31 May 2010 13:29:45 -0300
Subject: DHCP
Message-ID: 

Hi ALL,

Anybody have any idea, how can I monitor dhcp leases, should I have to
install a NRPE plugin into the client to get the info from it?

Thanks in advance.

Eduardo Barreto.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From matgand at gmail.com  Mon May 31 18:54:04 2010
From: matgand at gmail.com (Mattia Gandolfi)
Date: Mon, 31 May 2010 18:54:04 +0200
Subject: DHCP
In-Reply-To: 
References: 
Message-ID: 

Hi,

I personally use check_dhcpd_leases (
http://exchange.nagios.org/directory/Plugins/Network-Protocols/DHCP-and-BOOTP/Check_Dhcpd_Leases/details).
It runs locally, so you nrpe to run it remotely.

A possible alternative would be to enable omshell on the dhcp server and
write a plugin to get info from it. I'm not aware such a plugin already
exists, but I would be happu if proven to be wrong :)

Cheers

Mattia

On Mon, May 31, 2010 at 6:29 PM, Eduardo Barreto  wrote:

> Hi ALL,
>
> Anybody have any idea, how can I monitor dhcp leases, should I have to
> install a NRPE plugin into the client to get the info from it?
>
> Thanks in advance.
>
> Eduardo Barreto.
>
>
> ------------------------------------------------------------------------------
>
>
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when
> reporting any issue.
> ::: Messages without supporting info will risk being sent to /dev/null
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From nttbroken at gmail.com  Mon May 31 14:33:43 2010
From: nttbroken at gmail.com (ntt broken)
Date: Mon, 31 May 2010 15:33:43 +0300
Subject: NagiosQL installation manual.
Message-ID: 

I'm looking for more detailed NagiosQL installation
manual/tutorial/reference/notes etc.
Right now I have only those:
http://www.nagiosql.org/faq/31-general-documentation/49-documentation-for-nagiosql-2x.html
http://www.berhorst.net/index.php/nagiosmanually (section 6)

I found the installation notes not clear or missing ? well, not for noobs.
I have working and kicking Nagios machine installed on RHE5 distro and I
want to have for the first time NagiosQL installed working with it.
I went through the Nagios installation plus without problem since its well
documented on the net.
if you have can you share some installation notes it would be great.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From zarrelli at linux.it  Mon May 31 22:38:18 2010
From: zarrelli at linux.it (Giorgio Zarrelli)
Date: Mon, 31 May 2010 22:38:18 +0200
Subject: NagiosQL installation manual.
In-Reply-To: 
References: 
Message-ID: 

I installes IT today. Didn't see any problems. Which problems did u  
meet?

Ciao,

Giorgio

Il giorno 31/mag/2010, alle ore 14.33, ntt broken  
 ha scritto:

> I'm looking for more detailed NagiosQL installation manual/tutorial/ 
> reference/notes etc.
> Right now I have only those:
> http://www.nagiosql.org/faq/31-general-documentation/49-documentation-for-nagiosql-2x.html
> http://www.berhorst.net/index.php/nagiosmanually (section 6)
>
> I found the installation notes not clear or missing ? well, not for  
> noobs.
> I have working and kicking Nagios machine installed on RHE5 distro  
> and I want to have for the first time NagiosQL installed working  
> with it.
> I went through the Nagios installation plus without problem since  
> its well documented on the net.
> if you have can you share some installation notes it would be great.
>
> --- 
> --- 
> --- 
> ---------------------------------------------------------------------
>
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when  
> reporting any issue.
> ::: Messages without supporting info will risk being sent to /dev/null
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
-------------- next part --------------
------------------------------------------------------------------------------

-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From patrick.morris at hp.com  Mon May 31 22:45:27 2010
From: patrick.morris at hp.com (Morris, Patrick)
Date: Mon, 31 May 2010 13:45:27 -0700
Subject: Access hosts and service for that you are contact
In-Reply-To: 
References: 
Message-ID: <4C041FE7.1090306@hp.com>

gerard.lancellotti at mpsa.com wrote:
> Hi all
>
> I have multiple hosts and services that I monitor with Nagios. Each user
> must view even the host and services for which there is contact. I defined
> in the configuration files hosts and services the contact_groups user. This
> works well with the hosts, but if the user requests the service_groups , he
> has the following message:
>
> It appears as though you do not have permission to view information for any
>                        of the hosts you requested...
>    If you believe this is an error, check the HTTP server authentication
>                     requirements for accessing this CGI
>     and check the authorization options in your CGI configuration file.
>
> Have anyone resolved this problem ?
>   

If a contact should be allowed to view all services in a servicegroup, 
that contact needs to be listed on all services in that group. 
Otherwise, the servicegroup view would be giving that person access to 
services they're not allowed to view.

In other words, it sounds like this error may be normal in your case.

------------------------------------------------------------------------------

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null