From Asif.Surangiwala at firstdata.com Wed Dec 1 16:48:28 2010
From: Asif.Surangiwala at firstdata.com (Surangiwala, Asif )
Date: Wed, 1 Dec 2010 08:48:28 -0700
Subject: check_openmanage plugin reporting Firmware out of date
Message-ID: <4A03B9D20B80A1428B53101F813BCA0704C1C975@WFDTDNPXMASRV01.1DC.COM>

I have Dell Open Manage Server Administrator 6.3.0 installed on some Dell R710's with PERC H700 controller. When I run the Nagios plugin check_openmanage, it reports the following:

Controller 0 [PERC H700 Integrated]: Firmware '12.10.0-0025' is out of date

The H700 is running the latest firmware 12.10.0-0025, check_openmanage plugin is v3.6.2 by Trond H. Amundsen. OMSA is running fine and is not complaining about any firmware issues.

The same 'Firmware out of date' warning is also given for H800 controllers on the R710's having it.

Is there an issue with the plugin's interaction with OMSA?

-----------------------------------------
The information in this message may be proprietary and/or confidential, and protected from disclosure. If the reader of this message is not the intended recipient, or an employee or agent responsible for delivering this message to the intended recipient, you are hereby notified that any dissemination, distribution or copying of this communication is strictly prohibited. If you have received this communication in error, please notify First Data immediately by replying to this message and deleting it from your computer.

------------------------------------------------------------------------------
Increase Visibility of Your 3D Game App & Earn a Chance To Win $500!
Tap into the largest installed PC base & get more eyes on your game by optimizing for Intel(R) Graphics Technology. Get started today with the Intel(R) Software Partner Program.
Five $500 cash prizes are up for grabs.
http://p.sf.net/sfu/intelisp-dev2dev
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null

From daniel.wittenberg.r0ko at statefarm.com Wed Dec 1 17:00:18 2010
From: daniel.wittenberg.r0ko at statefarm.com (Daniel Wittenberg)
Date: Wed, 1 Dec 2010 09:00:18 -0700
Subject: high latency
Message-ID: <31B0FE0A1A8166409E9DF35C6DEECB24052956DC@WPSCV6MM.OPR.STATEFARM.ORG>

I've been watching my latency graphs, and showing 2000 seconds for some service and host checks. What I don't understand is I still have idle time on the CPU, (quad processor) so I'm curious if the server isn't in trouble, why am I seeing such high latency? Or maybe I misunderstand how latency is calculated? I do have 9 service checks that are failing on about 700 hosts if that matters at all. Trying to tweak the performance to the max on this so any insight welcome.

Thanks,
Dan
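
On the question of how latency is calculated: the Nagios documentation describes check latency as the number of seconds a check ran behind its scheduled time. The following is only a toy model in Python, not Nagios's actual scheduler. The function name, the `max_parallel` limit, and the timings are all hypothetical. It shows one way latency can grow while the CPU stays idle: checks that spend their time waiting (for example on a timeout) hold a slot without using CPU, so later checks start late.

```python
import heapq


def simulate_latency(scheduled_times, durations, max_parallel):
    """Toy model: return each check's latency (actual start - scheduled start).

    Checks are processed in order of scheduled time, and the result list is
    in that same order. `max_parallel` is a hypothetical cap on how many
    checks may run at once.
    """
    # Min-heap of the times at which each execution slot becomes free.
    free_at = [0.0] * max_parallel
    heapq.heapify(free_at)

    latencies = []
    for scheduled, duration in sorted(zip(scheduled_times, durations)):
        slot_free = heapq.heappop(free_at)
        start = max(scheduled, slot_free)   # wait if no slot is free yet
        latencies.append(start - scheduled)
        heapq.heappush(free_at, start + duration)
    return latencies
```

With four 10-second checks all scheduled at time 0 and only two slots, `simulate_latency([0, 0, 0, 0], [10, 10, 10, 10], 2)` gives the last two checks a latency of 10 seconds each, although nothing in the model consumes CPU. Whether this is what happens on a given installation is a separate question that the model cannot answer.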
From MarkL at lmfj.com Wed Dec 1 17:03:33 2010
From: MarkL at lmfj.com (Mark A. Lappin)
Date: Wed, 1 Dec 2010 10:03:33 -0600
Subject: check_openmanage plugin reporting Firmware out of date
In-Reply-To: <4A03B9D20B80A1428B53101F813BCA0704C1C975@WFDTDNPXMASRV01.1DC.COM>
References: <4A03B9D20B80A1428B53101F813BCA0704C1C975@WFDTDNPXMASRV01.1DC.COM>
Message-ID: <0227B653B3DC82438B8291BC5218612F67375334D4@lmfjex07.lmfj.com>

I have noticed similar on some of my servers but to be fair to check_openmanage, the firmware it is reporting is out of date but when I look at OMSA directly it does not give me errors. If I run the Dell system scanner, it does identify that I need to update some of the firmware on the controllers.

Mark A. Lappin, CCNA, MCITP: Enterprise Administrator | Lee Michaels Fine Jewelry
Director of Information Technology
11314 Cloverland Ave | Baton Rouge, LA 70809
Ph: 225.291.9094 ext 245 | Fax: 225.368.3675 | Mobile: 225-362-2770
www.lmfj.com

________________________________
This communication is privileged and confidential. If you are not the intended recipient, please notify the sender by reply e-mail and destroy all copies of this communication.

From t.h.amundsen at usit.uio.no Wed Dec 1 17:08:11 2010
From: t.h.amundsen at usit.uio.no (Trond Hasle Amundsen)
Date: Wed, 01 Dec 2010 17:08:11 +0100
Subject: check_openmanage plugin reporting Firmware out of date
In-Reply-To: <4A03B9D20B80A1428B53101F813BCA0704C1C975@WFDTDNPXMASRV01.1DC.COM> (Asif Surangiwala's message of "Wed, 1 Dec 2010 08:48:28 -0700")
References: <4A03B9D20B80A1428B53101F813BCA0704C1C975@WFDTDNPXMASRV01.1DC.COM>
Message-ID: <15toc95zdas.fsf@tux.uio.no>

"Surangiwala, Asif " writes:

> I have Dell Open Manage Server Administrator 6.3.0 installed on some Dell
> R710's with PERC H700 controller. When I run the Nagios plugin
> check_openmanage, it reports the following:
>
> Controller 0 [PERC H700 Integrated]: Firmware '12.10.0-0025' is out of date
>
> The H700 is running the latest firmware 12.10.0-0025, check_openmanage plugin
> is v3.6.2 by Trond H. Amundsen. OMSA is running fine and is not complaining
> about any firmware issues.
>
> The same 'Firmware out of date' warning is also given for H800 controllers on
> the R710's having it.
>
> Is there an issue with the plugin's interaction with OMSA?

Hi Asif,

This is a bug in OMSA, not check_openmanage. OMSA is reporting that the firmware is too old while clearly it is not. Dell has stated that the bug will be fixed in the next version of OMSA. For more information, see the following thread on the Linux-Poweredge mailing list:

http://lists.us.dell.com/pipermail/linux-poweredge/2010-December/043713.html

As a workaround, I suggest using blacklisting to suppress the false warnings until OMSA 6.4.0 is released and deployed on your systems:

  check_openmanage -b ctrl_fw=all [..other options..]

Regards,
--
Trond H. Amundsen
Center for Information Technology Services, University of Oslo

From Asif.Surangiwala at firstdata.com Wed Dec 1 17:42:21 2010
From: Asif.Surangiwala at firstdata.com (Surangiwala, Asif )
Date: Wed, 1 Dec 2010 09:42:21 -0700
Subject: check_openmanage plugin reporting Firmware out of date
In-Reply-To: <15toc95zdas.fsf@tux.uio.no>
References: <4A03B9D20B80A1428B53101F813BCA0704C1C975@WFDTDNPXMASRV01.1DC.COM> <15toc95zdas.fsf@tux.uio.no>
Message-ID: <4A03B9D20B80A1428B53101F813BCA0704C1CA0A@WFDTDNPXMASRV01.1DC.COM>

Thanks, Trond.

Can we update the check_openmanage script to parse the "Minimum Required Firmware Version" and compare it with the current "Firmware Version" to overcome the OMSA bug?

Regards,
--Asif
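
The comparison proposed in this thread, the controller's "Firmware Version" against its "Minimum Required Firmware Version", could be sketched as follows. This is a minimal Python illustration, not code from check_openmanage, and it does not show how either string would be read from OMSA. The function names are hypothetical, and it assumes versions consist only of numeric fields separated by dots and a dash, as in '12.10.0-0025'.

```python
import re


def parse_fw_version(text):
    """Turn a version such as '12.10.0-0025' into a tuple of ints.

    Assumption: every field is numeric and fields are separated by '.' or '-'.
    Anything else raises ValueError rather than being guessed at.
    """
    return tuple(int(part) for part in re.split(r"[.-]", text.strip()))


def firmware_is_outdated(current, minimum):
    """True only if `current` is strictly older than `minimum`."""
    # Tuples compare field by field, so 12.3 sorts before 12.10,
    # which a plain string comparison would get wrong.
    return parse_fw_version(current) < parse_fw_version(minimum)
```

For example, `firmware_is_outdated("12.10.0-0025", "12.10.0-0025")` is False, so a controller whose firmware equals the minimum would not be flagged.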

From nagios at flatto.net Wed Dec 1 18:25:36 2010
From: nagios at flatto.net (Assaf Flatto)
Date: Wed, 01 Dec 2010 17:25:36 +0000
Subject: high latency
In-Reply-To: <31B0FE0A1A8166409E9DF35C6DEECB24052956DC@WPSCV6MM.OPR.STATEFARM.ORG>
References: <31B0FE0A1A8166409E9DF35C6DEECB24052956DC@WPSCV6MM.OPR.STATEFARM.ORG>
Message-ID: <4CF68510.6090400@flatto.net>

Dan,

There were a couple of discussions on the list that dealt with latency issues. Have you tried looking at the list archives about the topic?

Assaf

--
Never,Ever Cut A Deal With a Dragon

Next year I will be doing the London to Paris bike ride to raise money for the DogTrust (www.dogstrust.co.uk). Please Sponsor me at http://www.justgiving.com/Assaf-Flatto

From t.h.amundsen at usit.uio.no Wed Dec 1 19:34:39 2010
From: t.h.amundsen at usit.uio.no (Trond Hasle Amundsen)
Date: Wed, 01 Dec 2010 19:34:39 +0100
Subject: check_openmanage plugin reporting Firmware out of date
In-Reply-To: <4A03B9D20B80A1428B53101F813BCA0704C1CA0A@WFDTDNPXMASRV01.1DC.COM> (Asif Surangiwala's message of "Wed, 1 Dec 2010 09:42:21 -0700")
References: <4A03B9D20B80A1428B53101F813BCA0704C1C975@WFDTDNPXMASRV01.1DC.COM> <15toc95zdas.fsf@tux.uio.no> <4A03B9D20B80A1428B53101F813BCA0704C1CA0A@WFDTDNPXMASRV01.1DC.COM>
Message-ID: <15tfwuhz6io.fsf@tux.uio.no>

"Surangiwala, Asif " writes:

> Can we update the check_openmanage script to parse the "Minimum
> Required Firmware Version" and compare it with the current "Firmware
> Version" to overcome the OMSA bug?

It is entirely possible to mitigate this bug within the plugin, but I don't think that it's a good idea to let the plugin do all version parsings and ignore OMSA on a general basis.

I have created a version that works around this particular bug (version 3.6.2-p1) and made it available here:

http://folk.uio.no/trondham/software/omsa-fw-bug/

It simply ignores out-of-date firmware if the firmware and minimum firmware versions match those in question.
But in order for this to work, I also had to turn off checking the global health status, which inherits the non-critical status of the controller.

DISCLAIMER: This version is only intended as a temporary solution for users of OMSA 6.3.0 who struggle with the recent firmware bug and don't want to use blacklisting as a workaround. When OMSA 6.4.0 becomes available, you should upgrade OMSA and revert to a regular release of check_openmanage.

Regards,
--
Trond H. Amundsen
Center for Information Technology Services, University of Oslo

From cbeattie at geninfo.com Wed Dec 1 21:30:52 2010
From: cbeattie at geninfo.com (Chris Beattie)
Date: Wed, 01 Dec 2010 15:30:52 -0500
Subject: Nagios Core 3.2.3 host check retry interval
In-Reply-To: <1290183605.5449.15.camel@DevNagios.geninfo.com>
References: <4CE2FD13.7080504@op5.se><1290103745.29293.17.camel@DevNagios.geninfo.com> <1290183605.5449.15.camel@DevNagios.geninfo.com>
Message-ID: <1291235452.23318.14.camel@DevNagios.geninfo.com>

On Fri, 2010-11-19 at 11:20 -0500, Chris Beattie wrote:

> This time I'm trying a nearly-stock nagios.cfg file. The one I've been
> using predates Nagios 3.0. Though it's been updated some, it doesn't
> contain all the more-recent settings.

I was out of town for a bit.
This is still happening, but not all the time. Most of the host checks happen 70 seconds apart, but the too-closely spaced ones are usually 20 seconds apart. I don't know how long this has been the case. It turns out it doesn't usually result in a notification, so nobody's complaining.

[11-30-2010 17:13:03] SERVICE ALERT: bgcprodiceweb4d;Service: ScaleOut;CRITICAL;SOFT;1;SOSS: Not found
[11-30-2010 17:14:33] SERVICE ALERT: bgcprodiceweb4d;Service: AntiVirus;WARNING;SOFT;1;No data was received from host!
[11-30-2010 17:14:43] HOST ALERT: bgcprodiceweb4d;DOWN;SOFT;1;CRITICAL - 10.3.54.208: rta nan, lost 100%
[11-30-2010 17:15:03] HOST ALERT: bgcprodiceweb4d;UP;SOFT;2;OK - 10.3.54.208: rta 33.504ms, lost 0%

Nothing in this message is intended to make or accept an offer or to form a contract, except that an attachment that is an image of a contract bearing the signature of an officer of our company may be or become a contract. This message (including any attachments) is intended only for the use of the individual or entity to whom it is addressed. It may contain information that is non-public, proprietary, privileged, confidential, and exempt from disclosure under applicable law or may constitute as attorney work product. If you are not the intended recipient, we hereby notify you that any use, dissemination, distribution, or copying of this message is strictly prohibited. If you have received this message in error, please notify us immediately by telephone and delete this message immediately. Thank you.

From Sebastian.Ries at dtnet.de Thu Dec 2 16:31:09 2010
From: Sebastian.Ries at dtnet.de (Sebastian Ries)
Date: Thu, 02 Dec 2010 16:31:09 +0100
Subject: Check size of large directory
Message-ID: <1291303869.9611.31.camel@bofh.dtnet.de>

Hi

I need to check the size of a directory...

The problem is that this directory is very large (it needs to stay under 1TB, which is why I need the check) and I cannot move it to its own partition (the content is hard-linked).

So I tried to write the size of this directory into a file (the content does not change very often) and check the content of this file via Nagios.

Something like, daily:

# du -m -s $dir > $file

Does anyone know about a plugin that can do this?

Regards
Sebastian Ries

--
------------------------------------------------------------
DT Netsolution GmbH - Talaeckerstr. 30 - D-70437 Stuttgart
Tel: +49-711-849910-36 Fax: +49-711-849910-936
WEB: http://www.dtnet.de/
email: Sebastian.Ries at dtnet.de
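
The cached-size idea from this thread (a daily `du -m -s $dir > $file`, with Nagios reading the file instead of walking the directory) could look roughly like the Python sketch below. It is not an existing plugin. The function name, the thresholds, and the staleness limit `max_age_s` are all assumptions made for illustration. It relies only on the usual Nagios plugin exit codes (0 OK, 1 WARNING, 2 CRITICAL, 3 UNKNOWN) and on `du -m -s` writing the size in megabytes as the first field.

```python
import os
import time

OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3


def check_cached_size(cache_file, warn_mb, crit_mb, max_age_s=2 * 86400, now=None):
    """Return (exit_code, message) for the size recorded in cache_file.

    cache_file is expected to hold the output of `du -m -s DIR`,
    i.e. "<size in MB><whitespace><DIR>". max_age_s is a hypothetical
    guard so that a dead cron job does not look like a healthy directory.
    """
    now = time.time() if now is None else now
    try:
        age = now - os.path.getmtime(cache_file)
        with open(cache_file) as fh:
            size_mb = int(fh.read().split()[0])
    except (OSError, ValueError, IndexError) as exc:
        return UNKNOWN, f"UNKNOWN - cannot read {cache_file}: {exc}"

    if age > max_age_s:
        return UNKNOWN, f"UNKNOWN - {cache_file} is {int(age)}s old"

    perfdata = f"size={size_mb}MB;{warn_mb};{crit_mb}"
    if size_mb >= crit_mb:
        return CRITICAL, f"CRITICAL - {size_mb} MB used | {perfdata}"
    if size_mb >= warn_mb:
        return WARNING, f"WARNING - {size_mb} MB used | {perfdata}"
    return OK, f"OK - {size_mb} MB used | {perfdata}"
```

To use it as a plugin, a small wrapper would print the message and pass the code to `sys.exit()`. For a limit near 1 TB, thresholds of something like 900000 and 1000000 MB might be a starting point. Reading the cached file keeps the check fast, at the cost of the result being up to a day old.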

From maxhetrick at verizon.net Thu Dec 2 16:58:26 2010
From: maxhetrick at verizon.net (Max Hetrick)
Date: Thu, 02 Dec 2010 10:58:26 -0500
Subject: Check size of large directory
In-Reply-To: <1291303869.9611.31.camel@bofh.dtnet.de>
References: <1291303869.9611.31.camel@bofh.dtnet.de>
Message-ID: <4CF7C222.9040502@verizon.net>

On 12/02/2010 10:31 AM, Sebastian Ries wrote:
> Does anyone know about a plugin that can do this?

Check out the check_file plugins from the nagios-of-plugins.

http://www.openfusion.com.au/labs/nagios/

I'm pretty sure it will work on a directory the same as a file. Should be something like:

check_file -s -30MB -f /path/to/dir

Regards,
Max

From AHKAPLAN at PARTNERS.ORG Thu Dec 2 17:33:13 2010
From: AHKAPLAN at PARTNERS.ORG (Kaplan, Andrew H.)
Date: Thu, 2 Dec 2010 11:33:13 -0500
Subject: Check size of large directory
In-Reply-To: <1291303869.9611.31.camel@bofh.dtnet.de>
References: <1291303869.9611.31.camel@bofh.dtnet.de>
Message-ID:

Hi there --

The check_folder_size.sh script will probably suit your needs well.

The information in this e-mail is intended only for the person to whom it is addressed. If you believe this e-mail was sent to you in error and the e-mail contains patient information, please contact the Partners Compliance HelpLine at http://www.partners.org/complianceline .
If the e-mail was sent to you in error but does not contain patient information, please contact the sender and properly dispose of the e-mail.

From daniel.wittenberg.r0ko at statefarm.com Thu Dec 2 16:59:15 2010
From: daniel.wittenberg.r0ko at statefarm.com (Daniel Wittenberg)
Date: Thu, 2 Dec 2010 08:59:15 -0700
Subject: high latency
In-Reply-To: <4CF68510.6090400@flatto.net>
References: <31B0FE0A1A8166409E9DF35C6DEECB24052956DC@WPSCV6MM.OPR.STATEFARM.ORG> <4CF68510.6090400@flatto.net>
Message-ID: <31B0FE0A1A8166409E9DF35C6DEECB24052E45E5@WPSCV6MM.OPR.STATEFARM.ORG>

Yeah, for giggles I went back further through the archives last night and found stuff back to 2.x series, and not much has seemed to help. I killed some of my mis-behaving active checks, and that dropped to about 20 seconds, then went up to about 35-50. So while that's better, I have A LOT more hosts and service checks to add, and am afraid it'll go nuts when I dump more on. I think I've tried about all the config options I could find and some helped, some didn't seem to, but there should be plenty of horsepower on the machine to run this much faster so not sure why it's not.

Dan

From benny at bennyvision.com Thu Dec 2 17:46:29 2010
From: benny at bennyvision.com (C. Bensend)
Date: Thu, 2 Dec 2010 10:46:29 -0600
Subject: high latency
In-Reply-To: <31B0FE0A1A8166409E9DF35C6DEECB24052E45E5@WPSCV6MM.OPR.STATEFARM.ORG>
References: <31B0FE0A1A8166409E9DF35C6DEECB24052956DC@WPSCV6MM.OPR.STATEFARM.ORG> <4CF68510.6090400@flatto.net> <31B0FE0A1A8166409E9DF35C6DEECB24052E45E5@WPSCV6MM.OPR.STATEFARM.ORG>
Message-ID: <5d744ca63ee1f981e90502f67339c5b6.squirrel@webmail.stinkweasel.net>

> Yeah, for giggles I went back further through the archives last night
> and found stuff back to 2.x series, and not much has seemed to help. I
> killed some of my mis-behaving active checks, and that dropped to about
> 20 seconds, then went up to about 35-50. So while that's better, I have
> A LOT more hosts and service checks to add, and am afraid it'll go nuts
> when I dump more on. I think I've tried about all the config options I
> could find and some helped, some didn't seem to, but there should be
> plenty of horsepower on the machine to run this much faster so not sure
> why it's not.

Hey Dan,

I too have been wrestling alligators with service and host check latencies averaging around 60s, and increasing to 100+ (sometimes to 300) after a few reloads during the day.

This morning, I enabled the use_large_installation_tweaks option. As of a minute ago, my host check latency is now averaging 2.116s, and service check latency is averaging 0.748s.

I didn't see if you had tried this yet, it might be something to consider.

Benny

--
"No matter how many shorts we have in the system, my guards will be instructed to treat every surveillance camera malfunction as a full-scale emergency."
-- Peter Anspach's Evil Overlord List, #67

From daniel.wittenberg.r0ko at statefarm.com Thu Dec 2 18:05:25 2010
From: daniel.wittenberg.r0ko at statefarm.com (Daniel Wittenberg)
Date: Thu, 2 Dec 2010 10:05:25 -0700
Subject: high latency
In-Reply-To: <5d744ca63ee1f981e90502f67339c5b6.squirrel@webmail.stinkweasel.net>
References: <31B0FE0A1A8166409E9DF35C6DEECB24052956DC@WPSCV6MM.OPR.STATEFARM.ORG> <4CF68510.6090400@flatto.net> <31B0FE0A1A8166409E9DF35C6DEECB24052E45E5@WPSCV6MM.OPR.STATEFARM.ORG> <5d744ca63ee1f981e90502f67339c5b6.squirrel@webmail.stinkweasel.net>
Message-ID: <31B0FE0A1A8166409E9DF35C6DEECB24052E4705@WPSCV6MM.OPR.STATEFARM.ORG>

Yeah, been running that since day one, since when rollout is done we'll probably have about 18k servers and around 3 million service checks... I can probably post my relevant config options if someone wants to peek.

Dan

From ae at op5.se Thu Dec 2 18:16:35 2010
From: ae at op5.se (Andreas Ericsson)
Date: Thu, 02 Dec 2010 18:16:35 +0100
Subject: high latency
In-Reply-To: <31B0FE0A1A8166409E9DF35C6DEECB24052E45E5@WPSCV6MM.OPR.STATEFARM.ORG>
References: <31B0FE0A1A8166409E9DF35C6DEECB24052956DC@WPSCV6MM.OPR.STATEFARM.ORG> <4CF68510.6090400@flatto.net> <31B0FE0A1A8166409E9DF35C6DEECB24052E45E5@WPSCV6MM.OPR.STATEFARM.ORG>
Message-ID: <4CF7D473.9000108@op5.se>

On 12/02/2010 04:59 PM, Daniel Wittenberg wrote:
> Yeah, for giggles I went back further through the archives last night
> and found stuff back to 2.x series, and not much has seemed to help. I
> killed some of my mis-behaving active checks, and that dropped to about
> 20 seconds, then went up to about 35-50. So while that's better, I have
> A LOT more hosts and service checks to add, and am afraid it'll go nuts
> when I dump more on. I think I've tried about all the config options I
> could find and some helped, some didn't seem to, but there should be
> plenty of horsepower on the machine to run this much faster so not sure
> why it's not.
>
> Dan
>
> From: Assaf Flatto [mailto:nagios at flatto.net]
> Sent: Wednesday, December 01, 2010 11:26 AM
> To: Nagios Users List
> Cc: Daniel Wittenberg
> Subject: Re: [Nagios-users] high latency
>
> dan
>
> there were a couple of discussions on the list that dealt with latency
> issues.
>
> Have you tried looking at the list archives about the topic?
>
> Assaf
>
> On 01/12/10 16:00, Daniel Wittenberg wrote:
>
> I've been watching my latency graphs, and showing 2000 seconds for some
> service and host checks. What I don't understand is I still have idle
> time on the CPU, (quad processor) so I'm curious if the server isn't in
> trouble, why am I seeing such high latency? Or maybe I misunderstand
> how latency is calculated? I do have 9 service checks that are failing
> on about 700 hosts if that matters at all. Trying to tweak the
> performance to the max on this so any insight welcome.
>

Ditch your performance-data processing and see if that helps. You might also want to get rid of embedded perl. It's been known to cause really weird errors (although primarily memory leaks). You'll also want to get rid of obsessive host and service commands.

How large is your installation and what hardware and system are you running it on?

--
Andreas Ericsson andreas.ericsson at op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace.

From ae at op5.se Thu Dec 2 18:18:54 2010
From: ae at op5.se (Andreas Ericsson)
Date: Thu, 02 Dec 2010 18:18:54 +0100
Subject: high latency
In-Reply-To: <31B0FE0A1A8166409E9DF35C6DEECB24052E4705@WPSCV6MM.OPR.STATEFARM.ORG>
References: <31B0FE0A1A8166409E9DF35C6DEECB24052956DC@WPSCV6MM.OPR.STATEFARM.ORG> <4CF68510.6090400@flatto.net> <31B0FE0A1A8166409E9DF35C6DEECB24052E45E5@WPSCV6MM.OPR.STATEFARM.ORG> <5d744ca63ee1f981e90502f67339c5b6.squirrel@webmail.stinkweasel.net> <31B0FE0A1A8166409E9DF35C6DEECB24052E4705@WPSCV6MM.OPR.STATEFARM.ORG>
Message-ID: <4CF7D4FE.7090706@op5.se>

On 12/02/2010 06:05 PM, Daniel Wittenberg wrote:
> Yeah, been running that since day one, since when rollout is done we'll
> probably have about 18k servers and around 3 million service checks...
>

170 services per host? Sounds like an awful lot of switches. I'd use some cleverness to grab snmp-info once and parse the data afterwards if I were you.

For that kind of installation, you'll need to use a distributed setup of some sort. merlin, dnx and apparently mod-gearman should get you going in the right direction.

--
Andreas Ericsson andreas.ericsson at op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace.

From daniel.wittenberg.r0ko at statefarm.com Thu Dec 2 18:42:25 2010
From: daniel.wittenberg.r0ko at statefarm.com (Daniel Wittenberg)
Date: Thu, 2 Dec 2010 10:42:25 -0700
Subject: high latency
In-Reply-To: <4CF7D4FE.7090706@op5.se>
References: <31B0FE0A1A8166409E9DF35C6DEECB24052956DC@WPSCV6MM.OPR.STATEFARM.ORG> <4CF68510.6090400@flatto.net> <31B0FE0A1A8166409E9DF35C6DEECB24052E45E5@WPSCV6MM.OPR.STATEFARM.ORG> <5d744ca63ee1f981e90502f67339c5b6.squirrel@webmail.stinkweasel.net> <31B0FE0A1A8166409E9DF35C6DEECB24052E4705@WPSCV6MM.OPR.STATEFARM.ORG> <4CF7D4FE.7090706@op5.se>
Message-ID: <31B0FE0A1A8166409E9DF35C6DEECB24052E4794@WPSCV6MM.OPR.STATEFARM.ORG>

Not using SNMP for any of the checks, and most are passive checks. For the few active checks we are probably going to be using dnx. Embedded perl is interesting though, I hadn't tried that, thought it was supposed to help with performance. I don't think we have any obsessive stuff running right now.

Right now hardware is 4 proc vmware esx, 4GB RAM. For production there will be 12 of those boxes with the number of hosts being about 1200-1500 per nagios server.

Dan

From daniel.wittenberg.r0ko at statefarm.com Thu Dec 2 20:38:41 2010
From: daniel.wittenberg.r0ko at statefarm.com (Daniel Wittenberg)
Date: Thu, 2 Dec 2010 12:38:41 -0700
Subject: high latency
In-Reply-To: <31B0FE0A1A8166409E9DF35C6DEECB24052E4794@WPSCV6MM.OPR.STATEFARM.ORG>
References: <31B0FE0A1A8166409E9DF35C6DEECB24052956DC@WPSCV6MM.OPR.STATEFARM.ORG> <4CF68510.6090400@flatto.net> <31B0FE0A1A8166409E9DF35C6DEECB24052E45E5@WPSCV6MM.OPR.STATEFARM.ORG> <5d744ca63ee1f981e90502f67339c5b6.squirrel@webmail.stinkweasel.net> <31B0FE0A1A8166409E9DF35C6DEECB24052E4705@WPSCV6MM.OPR.STATEFARM.ORG> <4CF7D4FE.7090706@op5.se> <31B0FE0A1A8166409E9DF35C6DEECB24052E4794@WPSCV6MM.OPR.STATEFARM.ORG>
Message-ID: <31B0FE0A1A8166409E9DF35C6DEECB24052E48EC@WPSCV6MM.OPR.STATEFARM.ORG>

Someone else noticed that nagios is generating a ton of minor page faults. Is that normal, and could it be causing some of the latency in the checks? I've also got a tmpfs set up for the status.dat and the checkresults directory to ease some of the disk i/o since we're on a san-backed vm host.

I turned off embedded perl this morning and our latency has been holding at < 10 seconds so far, so that seemed to help a lot.

Dan
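
The tuning steps mentioned in this thread can be collected into a single nagios.cfg fragment. The following is only a sketch, assuming a Nagios 3.x main configuration file; the /dev/shm paths are hypothetical placeholders for whatever tmpfs mount is actually in use, and none of the values are taken from the posters' real configurations.

```cfg
# nagios.cfg (main configuration file), Nagios 3.x assumed

# Enabled by Benny; already in use on Dan's server
use_large_installation_tweaks=1

# Disabling embedded Perl is what brought Dan's latency below 10 seconds
enable_embedded_perl=0

# Andreas's suggestions: no performance-data processing,
# no obsessive host/service commands
process_performance_data=0
obsess_over_services=0
obsess_over_hosts=0

# Dan keeps status.dat and the check results on tmpfs
# (hypothetical paths; adjust to the local tmpfs mount)
status_file=/dev/shm/nagios/status.dat
check_result_path=/dev/shm/nagios/checkresults
```

Changing one setting at a time and watching the latency graphs between reloads makes it easier to tell which option actually helped, as happened with embedded Perl here.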

From malarie at processia.com Thu Dec 2 22:21:28 2010
From: malarie at processia.com (Maxime Alarie)
Date: Thu, 2 Dec 2010 16:21:28 -0500
Subject: Newbie Question.. Monitoring Lnux machines.
Message-ID: <62F2034CF68DFB45BAA4FB7466782DDC01945D5A@denali.processia2003.com>

Good day,

I have installed nagios on my centOS VM. I can monitor Windows machines and servers just fine using nsclient++.

How can I monitor a Linux server? There is no NSclient++ available, and the guide on monitoring linux machines is incomplete.

I have checked my localhost.cfg file, thinking I could rename it to LinuxServer.cfg, but it is quite empty:

define host {
    host_name   localhost
    use         linux-server
    alias       localhost
    address     127.0.0.1
    ; register 1
}

Any help is appreciated.

Regards

From daniel.wittenberg.r0ko at statefarm.com Thu Dec 2 22:32:06 2010
From: daniel.wittenberg.r0ko at statefarm.com (Daniel Wittenberg)
Date: Thu, 2 Dec 2010 14:32:06 -0700
Subject: Newbie Question.. Monitoring Lnux machines.
In-Reply-To: <62F2034CF68DFB45BAA4FB7466782DDC01945D5A@denali.processia2003.com>
References: <62F2034CF68DFB45BAA4FB7466782DDC01945D5A@denali.processia2003.com>
Message-ID: <31B0FE0A1A8166409E9DF35C6DEECB24052E4ACD@WPSCV6MM.OPR.STATEFARM.ORG>

Take a look at NRPE for Linux/Unix clients. There are RPMs available from rpmforge which will get you going.

Dan
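
A minimal NRPE setup along the lines suggested above might look like the following. This is a sketch under assumptions, not a tested configuration: the host name linuxbox1, both IP addresses, and the plugin path are hypothetical, and the generic-service template is assumed to exist as in the sample configuration shipped with Nagios.

```cfg
# --- On the monitored Linux host: nrpe.cfg ---
# Address of the Nagios server allowed to query this agent (hypothetical)
allowed_hosts=127.0.0.1,192.0.2.10
# Plugin path varies by distribution and package
command[check_load]=/usr/lib/nagios/plugins/check_load -w 15,10,5 -c 30,25,20

# --- On the Nagios server: object configuration ---
define command {
    command_name    check_nrpe
    command_line    $USER1$/check_nrpe -H $HOSTADDRESS$ -c $ARG1$
}

define host {
    use         linux-server
    host_name   linuxbox1
    alias       linuxbox1
    address     192.0.2.20
}

define service {
    use                  generic-service
    host_name            linuxbox1
    service_description  CPU Load
    check_command        check_nrpe!check_load
}
```

The argument after check_nrpe! must match a command[...] name defined in nrpe.cfg on the monitored host; the plugin itself runs on that host and only its result travels back to the Nagios server.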

From AHKAPLAN at PARTNERS.ORG Thu Dec 2 22:32:08 2010
From: AHKAPLAN at PARTNERS.ORG (Kaplan, Andrew H.)
Date: Thu, 2 Dec 2010 16:32:08 -0500
Subject: Newbie Question.. Monitoring Lnux machines.
In-Reply-To: <62F2034CF68DFB45BAA4FB7466782DDC01945D5A@denali.processia2003.com>
References: <62F2034CF68DFB45BAA4FB7466782DDC01945D5A@denali.processia2003.com>
Message-ID:

Hi there --

The NRPE, Nagios Remote Plugin Executor, software can be installed on the Linux systems along with the plugins that are also present on the server. The combination of the two will enable you to monitor the clients in question.

From randywhite30 at neb.rr.com Thu Dec 2 22:38:24 2010
From: randywhite30 at neb.rr.com (Randy White)
Date: Thu, 2 Dec 2010 15:38:24 -0600
Subject: Newbie Question.. Monitoring Lnux machines.
In-Reply-To: <31B0FE0A1A8166409E9DF35C6DEECB24052E4ACD@WPSCV6MM.OPR.STATEFARM.ORG>
References: <62F2034CF68DFB45BAA4FB7466782DDC01945D5A@denali.processia2003.com> <31B0FE0A1A8166409E9DF35C6DEECB24052E4ACD@WPSCV6MM.OPR.STATEFARM.ORG>
Message-ID: <002a01cb9269$40014ec0$c003ec40$@rr.com>

If the system is on the local network there is no reason to use the client. It can monitor if the system is up or down. I believe the client is just for if you're not on the local lan.

From daniel.wittenberg.r0ko at statefarm.com Thu Dec 2 22:46:06 2010
From: daniel.wittenberg.r0ko at statefarm.com (Daniel Wittenberg)
Date: Thu, 2 Dec 2010 14:46:06 -0700
Subject: Newbie Question.. Monitoring Lnux machines.
In-Reply-To: <002a01cb9269$40014ec0$c003ec40$@rr.com>
References: <62F2034CF68DFB45BAA4FB7466782DDC01945D5A@denali.processia2003.com> <31B0FE0A1A8166409E9DF35C6DEECB24052E4ACD@WPSCV6MM.OPR.STATEFARM.ORG> <002a01cb9269$40014ec0$c003ec40$@rr.com>
Message-ID: <31B0FE0A1A8166409E9DF35C6DEECB24052E4B0E@WPSCV6MM.OPR.STATEFARM.ORG>

Well, local lan really doesn't have to do with it. You CAN use SNMP for some of the checks, but in my experience you can't get everything from SNMP or as detailed info on some things, so I always use NRPE. If you are monitoring the local machine (localhost) then you obviously don't need an agent, but if you are monitoring other servers you'll need either SNMP or NRPE.

Dan

From steve at dedicatedserversaustralia.com.au Thu Dec 2 23:05:22 2010
From: steve at dedicatedserversaustralia.com.au (Steve Kemp)
Date: Fri, 3 Dec 2010 09:05:22 +1100
Subject: How can i show remote nagios results in one interface?
Message-ID: <036e01cb926d$042d2870$0c877950$@dedicatedserversaustralia.com.au>

We have 4 remotely located Nagios installs, each monitoring the same set of infrastructure.

Currently we have to log in to each one individually to see results etc.

What we are looking for is a solution that will show results from each location on the one page, and perhaps has a policy setup where if 2 out of the 4 locations report issues it sends a warning or if 3 or all four report an issue that it then sends a warning.

For example service xyz

NewYork      OK
London       WARN
Sydney       CRITICAL
Los Angeles  OK

This, it's hoped, will remove the need for us to receive up to 4 SMSes and 4 emails each time something happens to a server or service.

We have looked at passive results using NSCA but it doesn't display the results as we had hoped.

Anyone know of an addon or mod that will allow us to achieve this?

Regards

Steve Kemp
Dedicated Servers Australia
www.dedicatedserversaustralia.com.au
Tel: +61 7 30187567 Fax: +61 7 38476684
Support: https://accounts.dedicatedserversaustralia.com.au
Sales Email: dedaus.sales at dedicatedserversaustralia.com.au

From tyarusso at nagios.com Fri Dec 3 02:50:06 2010
From: tyarusso at nagios.com (Tony Yarusso)
Date: Thu, 02 Dec 2010 19:50:06 -0600
Subject: How can i show remote nagios results in one interface?
In-Reply-To: <036e01cb926d$042d2870$0c877950$@dedicatedserversaustralia.com.au>
References: <036e01cb926d$042d2870$0c877950$@dedicatedserversaustralia.com.au>
Message-ID: <1291341006.14908.3.camel@sudbury>

I can think of two:

1) MNTOS (free/Free) - see my video on http://library.nagios.com/library/products/nagioscore/documentation/289-using-mntos
2) Nagios Fusion (paid/proprietary) - see http://www.nagios.com/products/nagiosfusion

I'm not certain about the alerting functionality, but surely that could be done; I'm just not sure of the best approach for it.

--
Tony Yarusso
Technical Team
___
Nagios Enterprises, LLC
Email: tyarusso at nagios.com
Web: www.nagios.com

From maxs at webwizarddesign.com Fri Dec 3 03:02:36 2010
From: maxs at webwizarddesign.com (Max Schubert)
Date: Thu, 2 Dec 2010 21:02:36 -0500
Subject: How can i show remote nagios results in one interface?
In-Reply-To: <1291341006.14908.3.camel@sudbury>
References: <036e01cb926d$042d2870$0c877950$@dedicatedserversaustralia.com.au>
 <1291341006.14908.3.camel@sudbury>
Message-ID:

How about Thruk? http://www.thruk.org/

- Max

From pangrazi at gmail.com Fri Dec 3 04:25:54 2010
From: pangrazi at gmail.com (Greg Pangrazio)
Date: Thu, 2 Dec 2010 21:25:54 -0600
Subject: How can i show remote nagios results in one interface?
In-Reply-To:
References: <036e01cb926d$042d2870$0c877950$@dedicatedserversaustralia.com.au>
Message-ID:

I do this with one service check that looks at all 4 of the statuses, and there is a warning and a critical for each.
I will have to look up the name of the service check, but it used the status macros from the services; it might be check_multi.

Greg Pangrazio

From Sebastian.Ries at dtnet.de Fri Dec 3 07:29:54 2010
From: Sebastian.Ries at dtnet.de (Sebastian Ries)
Date: Fri, 03 Dec 2010 07:29:54 +0100
Subject: Check size of large directory
In-Reply-To:
References: <1291303869.9611.31.camel@bofh.dtnet.de>
Message-ID: <1291357794.19682.3.camel@bofh.dtnet.de>

Hi

> The check_folder_size.sh script will probably suit your needs well.

Yes, I found scripts for this.
The problem is that a du within this folder takes about 2 minutes. (That's why I said that it is a very large directory with many files.)

On the other hand, the content within this directory is generated once a day, so I wanted to run the "du" after generating the content and let nagios just check the result that was written to a file.

Regards
Sebastian Ries

--
------------------------------------------------------------
DT Netsolution GmbH - Talaeckerstr. 30 - D-70437 Stuttgart
Tel: +49-711-849910-36 Fax: +49-711-849910-936
WEB: http://www.dtnet.de/ email: Sebastian.Ries at dtnet.de

From trm.nagios at gmail.com Fri Dec 3 11:53:38 2010
From: trm.nagios at gmail.com (trm asn)
Date: Fri, 3 Dec 2010 16:23:38 +0530
Subject: Check New Host & Service added into Monitoring
Message-ID:

Dear List,

Is there any plug-in available for nagios to check the newly added Host/Services/Contacts and notify by email.

/\
dE

From schneemann at b1-systems.de Fri Dec 3 12:22:09 2010
From: schneemann at b1-systems.de (Christian Schneemann)
Date: Fri, 3 Dec 2010 12:22:09 +0100
Subject: Check New Host & Service added into Monitoring
In-Reply-To:
References:
Message-ID: <201012031222.09929.schneemann@b1-systems.de>

Hi,

On Friday, December 03, 2010 11:53:38 trm asn wrote:
> Dear List,
>
> Is there any plug-in available for nagios to check the newly added
> Host/Services/Contacts and notify by email.

Could you be more specific about what kind of plugin you are looking for?

Regards,
Christian

> /\
> dE

--
Christian Schneemann
Linux Consultant & Developer
Tel.: +49-175-7250665
Mail: schneemann at b1-systems.de

B1 Systems GmbH
Osterfeldstraße 7 / 85088 Vohburg / http://www.b1-systems.de
GF: Ralph Dehner / Unternehmenssitz: Vohburg / AG: Ingolstadt,HRB 3537

From ae at op5.se Fri Dec 3 12:22:08 2010
From: ae at op5.se (Andreas Ericsson)
Date: Fri, 03 Dec 2010 12:22:08 +0100
Subject: high latency
In-Reply-To: <31B0FE0A1A8166409E9DF35C6DEECB24052E4794@WPSCV6MM.OPR.STATEFARM.ORG>
References: <31B0FE0A1A8166409E9DF35C6DEECB24052956DC@WPSCV6MM.OPR.STATEFARM.ORG>
 <4CF68510.6090400@flatto.net>
 <31B0FE0A1A8166409E9DF35C6DEECB24052E45E5@WPSCV6MM.OPR.STATEFARM.ORG>
 <5d744ca63ee1f981e90502f67339c5b6.squirrel@webmail.stinkweasel.net>
 <31B0FE0A1A8166409E9DF35C6DEECB24052E4705@WPSCV6MM.OPR.STATEFARM.ORG>
 <4CF7D4FE.7090706@op5.se>
 <31B0FE0A1A8166409E9DF35C6DEECB24052E4794@WPSCV6MM.OPR.STATEFARM.ORG>
Message-ID: <4CF8D2E0.1060101@op5.se>

On 12/02/2010 06:42 PM, Daniel Wittenberg wrote:
>
> Embedded perl is interesting though, I hadn't tried that, thought it was
> supposed to help with performance.

In theory, it does. It probably does in practice too, but the problems associated with it make it "not worth it".

> I don't think we have any obsessive
> stuff running right now.
>

Check if you're not sure.

> Right now hardware is 4 proc vmware esx, 4GB RAM. For production there
> will be 12 of those boxes with the number of hosts being about 1200-1500
> per nagios server.
>

Virtual systems. Bleh. Anyways, if you're going to use a loadbalanced setup you should look into using Merlin. That way you get complete failover for free.

--
Andreas Ericsson andreas.ericsson at op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.

From ae at op5.se Fri Dec 3 12:25:34 2010
From: ae at op5.se (Andreas Ericsson)
Date: Fri, 03 Dec 2010 12:25:34 +0100
Subject: Check size of large directory
In-Reply-To: <1291357794.19682.3.camel@bofh.dtnet.de>
References: <1291303869.9611.31.camel@bofh.dtnet.de>
 <1291357794.19682.3.camel@bofh.dtnet.de>
Message-ID: <4CF8D3AE.3090801@op5.se>

On 12/03/2010 07:29 AM, Sebastian Ries wrote:
> Hi
>
>> The check_folder_size.sh script will probably suit your needs well.
>
> Yes, I found scripts for this.
> The problem is that a du within this folder takes about 2 minutes.
> (That's why I said that it is a very large directory with many files.)
>

In that case there's not much a plugin can do about it. The process is bound by IO and no amount of hacking will make it go noticeably faster.

> The other side is that the content within this directory is generated
> once a day so I wanted to run the "du" after generating the content of
> the file and let nagios just check the result that was written in a
> file.
>

Then I'd suggest you write a plugin yourself that does just that. Any old language should work for such a simple task, really.
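
A minimal sketch of such a plugin in Python, for illustration only. It assumes the daily job writes the output of "du -sk" to a file after the content is generated; the file path, the function name check_cached_du and the two-day staleness limit are hypothetical and not from this thread. Only the standard Nagios plugin exit codes (0 OK, 1 WARNING, 2 CRITICAL, 3 UNKNOWN) and the usual perfdata layout are relied on.

```python
#!/usr/bin/env python3
"""check_cached_du: report a directory size that a daily job already measured.

Assumed (hypothetical) producer, run once the content has been generated:
    du -sk /data/export > /var/tmp/export.du
"""
import os
import sys
import time

# Standard Nagios plugin exit codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3


def check_cached_du(path, warn_kb, crit_kb, max_age_s=2 * 86400, now=None):
    """Return (exit_code, message) for the cached `du -sk` result in `path`."""
    now = time.time() if now is None else now
    try:
        age = now - os.path.getmtime(path)
        with open(path) as fh:
            # `du -sk` prints "<size_in_kb>\t<directory>"; keep the first field.
            size_kb = int(fh.read().split()[0])
    except (OSError, ValueError, IndexError) as exc:
        return UNKNOWN, f"DU UNKNOWN - cannot read {path}: {exc}"
    if age > max_age_s:
        # A stale file means the daily job did not run; do not trust the number.
        return UNKNOWN, f"DU UNKNOWN - {path} is {int(age)}s old, result is stale"
    perf = f"|size={size_kb}KB;{warn_kb};{crit_kb}"
    if size_kb >= crit_kb:
        return CRITICAL, f"DU CRITICAL - {size_kb} KB{perf}"
    if size_kb >= warn_kb:
        return WARNING, f"DU WARNING - {size_kb} KB{perf}"
    return OK, f"DU OK - {size_kb} KB{perf}"


# Only act when all three arguments are given: FILE WARN_KB CRIT_KB.
if __name__ == "__main__" and len(sys.argv) == 4:
    code, message = check_cached_du(sys.argv[1], int(sys.argv[2]), int(sys.argv[3]))
    print(message)
    sys.exit(code)
```

The check itself never runs du, so it returns immediately; the staleness test is there so that a failed daily job shows up as UNKNOWN instead of yesterday's size being reported as current.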

--
Andreas Ericsson andreas.ericsson at op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.

From ae at op5.se Fri Dec 3 12:39:11 2010
From: ae at op5.se (Andreas Ericsson)
Date: Fri, 03 Dec 2010 12:39:11 +0100
Subject: high latency
In-Reply-To: <31B0FE0A1A8166409E9DF35C6DEECB24052E48EC@WPSCV6MM.OPR.STATEFARM.ORG>
References: <31B0FE0A1A8166409E9DF35C6DEECB24052956DC@WPSCV6MM.OPR.STATEFARM.ORG>
 <4CF68510.6090400@flatto.net>
 <31B0FE0A1A8166409E9DF35C6DEECB24052E45E5@WPSCV6MM.OPR.STATEFARM.ORG>
 <5d744ca63ee1f981e90502f67339c5b6.squirrel@webmail.stinkweasel.net>
 <31B0FE0A1A8166409E9DF35C6DEECB24052E4705@WPSCV6MM.OPR.STATEFARM.ORG>
 <4CF7D4FE.7090706@op5.se>
 <31B0FE0A1A8166409E9DF35C6DEECB24052E4794@WPSCV6MM.OPR.STATEFARM.ORG>
 <31B0FE0A1A8166409E9DF35C6DEECB24052E48EC@WPSCV6MM.OPR.STATEFARM.ORG>
Message-ID: <4CF8D6DF.5080306@op5.se>

On 12/02/2010 08:38 PM, Daniel Wittenberg wrote:
> Someone else noticed that nagios is generating a ton of minor page
> faults, and curious if that's normal and if that could be causing some
> of the latency in the checks?

define "a ton"

$ /usr/bin/time php -r 'echo "marsipulami\n";'
marsipulami
0.01user 0.01system 0:00.09elapsed 34%CPU (0avgtext+0avgdata 29104maxresident)k
10208inputs+0outputs (70major+1962minor)pagefaults 0swaps

That's with a reasonably simple program, and it generates 70 major and 1962 minor pagefaults.

> I've also got a tmpfs setup for the
> status.dat and the checkresults directory to ease some of the disk i/o
> since we're on a san-backed vm host.
>

That's good, although if you're using a virtual system you'll never know for sure if you're really using a ramdisk or not, since the host system might well use swap to store the ramdisk anyway.

> I turned off embedded perl this morning and our latency has been holding
> at < 10 seconds so far, so that seemed to help a lot.
>

Neat. Did it affect your pagefaults? If so, how?

--
Andreas Ericsson andreas.ericsson at op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.

From maxs at webwizarddesign.com Fri Dec 3 12:46:41 2010
From: maxs at webwizarddesign.com (Max Schubert)
Date: Fri, 3 Dec 2010 06:46:41 -0500
Subject: high latency
In-Reply-To: <4CF8D6DF.5080306@op5.se>
References: <31B0FE0A1A8166409E9DF35C6DEECB24052956DC@WPSCV6MM.OPR.STATEFARM.ORG>
 <4CF68510.6090400@flatto.net>
 <31B0FE0A1A8166409E9DF35C6DEECB24052E45E5@WPSCV6MM.OPR.STATEFARM.ORG>
 <5d744ca63ee1f981e90502f67339c5b6.squirrel@webmail.stinkweasel.net>
 <31B0FE0A1A8166409E9DF35C6DEECB24052E4705@WPSCV6MM.OPR.STATEFARM.ORG>
 <4CF7D4FE.7090706@op5.se>
 <31B0FE0A1A8166409E9DF35C6DEECB24052E4794@WPSCV6MM.OPR.STATEFARM.ORG>
 <31B0FE0A1A8166409E9DF35C6DEECB24052E48EC@WPSCV6MM.OPR.STATEFARM.ORG>
 <4CF8D6DF.5080306@op5.se>
Message-ID:

I find it interesting that a number of users get performance improvements with embedded perl off - we lose 20-40% polling capacity per poller with it off.

- Max

From ae at op5.se Fri Dec 3 12:58:38 2010
From: ae at op5.se (Andreas Ericsson)
Date: Fri, 03 Dec 2010 12:58:38 +0100
Subject: high latency
In-Reply-To:
References: <31B0FE0A1A8166409E9DF35C6DEECB24052956DC@WPSCV6MM.OPR.STATEFARM.ORG>
 <4CF68510.6090400@flatto.net>
 <31B0FE0A1A8166409E9DF35C6DEECB24052E45E5@WPSCV6MM.OPR.STATEFARM.ORG>
 <5d744ca63ee1f981e90502f67339c5b6.squirrel@webmail.stinkweasel.net>
 <31B0FE0A1A8166409E9DF35C6DEECB24052E4705@WPSCV6MM.OPR.STATEFARM.ORG>
 <4CF7D4FE.7090706@op5.se>
 <31B0FE0A1A8166409E9DF35C6DEECB24052E4794@WPSCV6MM.OPR.STATEFARM.ORG>
 <31B0FE0A1A8166409E9DF35C6DEECB24052E48EC@WPSCV6MM.OPR.STATEFARM.ORG>
 <4CF8D6DF.5080306@op5.se>
Message-ID: <4CF8DB6E.7060406@op5.se>

On 12/03/2010 12:46 PM, Max Schubert wrote:
> I find it interesting that a number of users get performance
> improvements with embedded perl off - we lose 20-40% polling capacity
> per poller with it off.
>

How do you mean that you're losing capacity? Does latency start to creep upwards or is load increasing?

Out of interest: how much memory does epn leak nowadays, and which perl version is it compiled against?

--
Andreas Ericsson andreas.ericsson at op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.

From maxs at webwizarddesign.com Fri Dec 3 14:03:15 2010
From: maxs at webwizarddesign.com (Max Schubert)
Date: Fri, 3 Dec 2010 08:03:15 -0500
Subject: high latency
In-Reply-To: <4CF8DB6E.7060406@op5.se>
References: <31B0FE0A1A8166409E9DF35C6DEECB24052956DC@WPSCV6MM.OPR.STATEFARM.ORG>
 <4CF68510.6090400@flatto.net>
 <31B0FE0A1A8166409E9DF35C6DEECB24052E45E5@WPSCV6MM.OPR.STATEFARM.ORG>
 <5d744ca63ee1f981e90502f67339c5b6.squirrel@webmail.stinkweasel.net>
 <31B0FE0A1A8166409E9DF35C6DEECB24052E4705@WPSCV6MM.OPR.STATEFARM.ORG>
 <4CF7D4FE.7090706@op5.se>
 <31B0FE0A1A8166409E9DF35C6DEECB24052E4794@WPSCV6MM.OPR.STATEFARM.ORG>
 <31B0FE0A1A8166409E9DF35C6DEECB24052E48EC@WPSCV6MM.OPR.STATEFARM.ORG>
 <4CF8D6DF.5080306@op5.se>
 <4CF8DB6E.7060406@op5.se>
Message-ID:

Latency increases much more quickly for us without epn, as execution times are noticeably longer per check. We use rhel 5.x, so the perl is 5.8.8.

We have semi-daily updates to our pollers and with epn that means cold restarts - memory leaks have not been noticeable given that scenario, but on test hosts or hosts where we are doing burn-ins it is negligible enough that we can go for 2-3 days with no memory issues - we always hit service latency thresholds first. 7 seconds is in general where we have to force a restart of our pollers to prevent metric collection and snmp delta calculation issues.

Max

From pangrazi at gmail.com Fri Dec 3 14:48:02 2010
From: pangrazi at gmail.com (Greg Pangrazio)
Date: Fri, 3 Dec 2010 07:48:02 -0600
Subject: How can i show remote nagios results in one interface?
In-Reply-To:
References: <036e01cb926d$042d2870$0c877950$@dedicatedserversaustralia.com.au>
Message-ID:

I was wrong, I use check_service_cluster
http://nagios.sourceforge.net/docs/3_0/clusters.html

Greg Pangrazio

On Thu, Dec 2, 2010 at 9:25 PM, Greg Pangrazio wrote:
> I do this with one service check that looks at all 4 of the statuses and
> there is a warning and a critical for each.
> I will have to look up the name
> of the service check but it used the status macros from the services it
> might be check_multi
> Greg Pangrazio

From trm.nagios at gmail.com Fri Dec 3 15:28:43 2010
From: trm.nagios at gmail.com (trm asn)
Date: Fri, 3 Dec 2010 19:58:43 +0530
Subject: Check New Host & Service added into Monitoring
In-Reply-To:
References:
Message-ID:

I am looking for a plugin which will check hosts.cfg, contacts.cfg and command.cfg and, if any new entry has been added to those files, send a notification to a specific email address.
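
For illustration, a minimal Python sketch of the detection step described above, not an existing plugin. It assumes the object files use the usual "define <type> { ... }" blocks and compares the object names found now against an earlier snapshot. The names NAME_KEYS, defined_objects and new_entries are hypothetical. Services are left out because they have no single naming directive, and sending the mail is left to Nagios notifications or a cron mailer.

```python
import re

# Directive that names each object type in a Nagios object definition.
NAME_KEYS = {"host": "host_name", "contact": "contact_name", "command": "command_name"}


def defined_objects(config_text):
    """Return the set of (object_type, name) pairs defined in the given config text."""
    found = set()
    obj_type = None  # type of the define block currently being read, if any
    for raw in config_text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # blank line or whole-line comment
        if obj_type is None:
            match = re.match(r"define\s+(\w+)\s*\{", line)
            if match:
                obj_type = match.group(1)
        elif line.startswith("}"):
            obj_type = None  # end of the current define block
        else:
            # Drop a trailing ';' comment, then split into directive and value.
            parts = line.split(";", 1)[0].split(None, 1)
            if len(parts) == 2 and parts[0] == NAME_KEYS.get(obj_type):
                found.add((obj_type, parts[1].strip()))
    return found


def new_entries(previous, current):
    """Objects present in the current set but absent from the previous snapshot."""
    return sorted(current - previous)


if __name__ == "__main__":
    before = "define host {\n  host_name web1\n}\n"
    after = before + "define host {\n  host_name db1 ; added today\n}\n"
    for obj_type, name in new_entries(defined_objects(before), defined_objects(after)):
        print(f"new {obj_type}: {name}")
```

A wrapper could store the previous set in a file between runs and return WARNING whenever new_entries is non-empty, so that the normal contact notification delivers the email.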

From daniel.wittenberg.r0ko at statefarm.com Fri Dec 3 16:23:54 2010
From: daniel.wittenberg.r0ko at statefarm.com (Daniel Wittenberg)
Date: Fri, 3 Dec 2010 08:23:54 -0700
Subject: How can i show remote nagios results in oneinterface?
In-Reply-To:
References: <036e01cb926d$042d2870$0c877950$@dedicatedserversaustralia.com.au>
Message-ID: <31B0FE0A1A8166409E9DF35C6DEECB24052E4E73@WPSCV6MM.OPR.STATEFARM.ORG>

Also take a look at multisite/livestatus. I'm currently using it to watch over 15 servers.

Dan

From: Greg Pangrazio [mailto:pangrazi at gmail.com]
Sent: Friday, December 03, 2010 7:48 AM
To: steve at dedicatedserversaustralia.com.au; Nagios Users List
Subject: Re: [Nagios-users] How can i show remote nagios results in oneinterface?

I was wrong, I use check_service_cluster
http://nagios.sourceforge.net/docs/3_0/clusters.html

Greg Pangrazio
From daniel.wittenberg.r0ko at statefarm.com Fri Dec 3 16:31:31 2010
From: daniel.wittenberg.r0ko at statefarm.com (Daniel Wittenberg)
Date: Fri, 3 Dec 2010 08:31:31 -0700
Subject: high latency
In-Reply-To: <4CF8D6DF.5080306@op5.se>
References: <31B0FE0A1A8166409E9DF35C6DEECB24052956DC@WPSCV6MM.OPR.STATEFARM.ORG><4CF68510.6090400@flatto.net><31B0FE0A1A8166409E9DF35C6DEECB24052E45E5@WPSCV6MM.OPR.STATEFARM.ORG> <5d744ca63ee1f981e90502f67339c5b6.squirrel@webmail.stinkweasel.net><31B0FE0A1A8166409E9DF35C6DEECB24052E4705@WPSCV6MM.OPR.STATEFARM.ORG><4CF7D4FE.7090706@op5.se> <31B0FE0A1A8166409E9DF35C6DEECB24052E4794@WPSCV6MM.OPR.STATEFARM.ORG> <31B0FE0A1A8166409E9DF35C6DEECB24052E48EC@WPSCV6MM.OPR.STATEFARM.ORG> <4CF8D6DF.5080306@op5.se>
Message-ID: <31B0FE0A1A8166409E9DF35C6DEECB24052E4E90@WPSCV6MM.OPR.STATEFARM.ORG>

Pagefaults - 20-30k. This seems to be the source of most of the cpu system time (understandably), which sits about 40-50%. So if I could reduce the pagefaults I think we could gain quite a bit of performance back.

I found one other huge issue... somehow in the generic service check, the check_interval was set to 5 minutes... however, normal_check_interval wasn't set at all and appeared to be checking every minute. I deleted check_interval and added normal_check_interval and that helped a ton, latency went down to 0.5-1.5 seconds. That was only running 2 active checks and about a dozen passive on 700 hosts. I then added back in the other 9 active checks and latency once again shot back up to about 2000 *sigh*.

I grabbed another vm and made it a dnx client and that seemed to help, but wish I could get the main server to handle more. Right now it has about 700 hosts and 12,100 service checks, of which about 7000 are active and the rest are passive. Oh, and we do have obsessive turned off.
I've even gone through as many configs as I could and removed the macros too until I can write a caching mech for the macro statements.

Any more ideas?

-----Original Message-----
From: Andreas Ericsson [mailto:ae at op5.se]
Sent: Friday, December 03, 2010 5:39 AM
To: Nagios Users List
Cc: Daniel Wittenberg
Subject: Re: [Nagios-users] high latency

On 12/02/2010 08:38 PM, Daniel Wittenberg wrote:
> Someone else noticed that nagios is generating a ton of minor page
> faults, and curious if that's normal and if that could be causing some
> of the latency in the checks?

define "a ton"

$ /usr/bin/time php -r 'echo "marsipulami\n";'
marsipulami
0.01user 0.01system 0:00.09elapsed 34%CPU (0avgtext+0avgdata 29104maxresident)k
10208inputs+0outputs (70major+1962minor)pagefaults 0swaps

That's with a reasonably simple program, and it generates 70 major and 1962 minor pagefaults.

> I've also got a tmpfs setup for the
> status.dat and the checkresults directory to ease some of the disk i/o
> since we're on a san-backed vm host.
>

That's good, although if you're using a virtual system you'll never know for sure if you're really using a ramdisk or not, since the host system might well use swap to store the ramdisk anyway.

> I turned off embedded perl this morning and our latency has been holding
> at < 10 seconds so far, so that seemed to help a lot.
>

Neat. Did it affect your pagefaults? If so, how?

--
Andreas Ericsson andreas.ericsson at op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace.
From AdcockJ at leoncountyfl.gov Fri Dec 3 16:30:37 2010
From: AdcockJ at leoncountyfl.gov (Jon Adcock)
Date: Fri, 03 Dec 2010 10:30:37 -0500
Subject: How can i show remote nagios results in one interface?
In-Reply-To: <1291341006.14908.3.camel@sudbury>
References: <036e01cb926d$042d2870$0c877950$@dedicatedserversaustralia.com.au> <1291341006.14908.3.camel@sudbury>
Message-ID: <4CF8C6CD020000750003080D@leoncountyfl.gov>

Tony,

I have installed MNTOS on two systems (troubleshooting) and I have the same issue with both. The nagios.xml file is being updated with data, but when I hit the webpage, there is nothing displayed (absolutely blank).

I downloaded a simple php page that lists a bunch of PHP configuration information (info.php), and saved that into the same directory as the mntos index.php file. The info.php page displays just fine, but I get a completely blank page when I hit the page: http///mntos/index.php

Do you have any ideas about where I should start looking for the problem?

Jon Adcock
Network Systems Administrator
MIS / Systems Team
Leon County
(850) 606-5500

>>> Tony Yarusso 12/2/2010 8:50 PM >>>
I can think of two:

1) MNTOS (free/Free) - see my video on http://library.nagios.com/library/products/nagioscore/documentation/289-using-mntos
2) Nagios Fusion (paid/proprietary) - see http://www.nagios.com/products/nagiosfusion

I'm not certain about the alerting functionality, but surely that could be done; I'm just not sure of the best approach for it.
--
Tony Yarusso
Technical Team
___
Nagios Enterprises, LLC
Email: tyarusso at nagios.com
Web: www.nagios.com
From daniel.wittenberg.r0ko at statefarm.com Fri Dec 3 16:32:58 2010
From: daniel.wittenberg.r0ko at statefarm.com (Daniel Wittenberg)
Date: Fri, 3 Dec 2010 08:32:58 -0700
Subject: Check size of large directory
In-Reply-To: <4CF8D3AE.3090801@op5.se>
References: <1291303869.9611.31.camel@bofh.dtnet.de> <1291357794.19682.3.camel@bofh.dtnet.de> <4CF8D3AE.3090801@op5.se>
Message-ID: <31B0FE0A1A8166409E9DF35C6DEECB24052E4E95@WPSCV6MM.OPR.STATEFARM.ORG>

Not sure how those other scripts are doing it, if using an ls maybe? As a test, can you do:

time find -type f |wc -l

See how long it takes for that?

Dan

-----Original Message-----
From: Andreas Ericsson [mailto:ae at op5.se]
Sent: Friday, December 03, 2010 5:26 AM
To: Nagios Users List
Subject: Re: [Nagios-users] Check size of large directory

On 12/03/2010 07:29 AM, Sebastian Ries wrote:
> Hi
>
>> The check_folder_size.sh script will probably suit your needs well.
>
> Yes, I found scripts for this.
> The Problem is that a du within this folder takes about 2 minutes.
> That's why I told that it is a very large directory with many files)
>

In that case there's not much a plugin can do about it. The process is bound by IO and no amount of hacking will make it go noticeably faster.

> The other side is that the content within this directory is generated
> once a day so I wanted to run the "du" after generating the content of
> the file and let nagios just check the result that was written in a
> file.
>

Then I'd suggest you write a plugin yourself that does just that. Any old language should work for such a simple task, really.

--
Andreas Ericsson andreas.ericsson at op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace.
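A plugin of the kind suggested above could look roughly like the following. It is a minimal sketch, not an existing plugin: it assumes the daily job writes the output of `du -sk` into a result file (so the first field is the size in KB), and the function name `check_cached_size`, the thresholds and the 36-hour staleness limit are all made up for illustration. The only convention relied on is the usual plugin contract of exit codes 0/1/2/3 for OK/WARNING/CRITICAL/UNKNOWN plus optional performance data after a `|`.

```python
import os
import time

OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3


def check_cached_size(path, warn_kb, crit_kb, max_age_s=36 * 3600, now=None):
    """Return (exit_code, message) based on a file holding `du -sk` output."""
    now = time.time() if now is None else now
    try:
        age = now - os.path.getmtime(path)
        with open(path) as handle:
            # du prints "<size><TAB><path>"; only the first field is needed
            size_kb = int(handle.read().split()[0])
    except (OSError, ValueError, IndexError) as err:
        return UNKNOWN, "UNKNOWN: cannot read %s (%s)" % (path, err)
    if age > max_age_s:
        # A stale result usually means the daily du job did not run
        return UNKNOWN, "UNKNOWN: %s is %d s old" % (path, age)
    perfdata = "size=%dKB;%d;%d" % (size_kb, warn_kb, crit_kb)
    if size_kb >= crit_kb:
        return CRITICAL, "CRITICAL: %d KB used | %s" % (size_kb, perfdata)
    if size_kb >= warn_kb:
        return WARNING, "WARNING: %d KB used | %s" % (size_kb, perfdata)
    return OK, "OK: %d KB used | %s" % (size_kb, perfdata)
```

To use it as a plugin, a small wrapper would print the message and exit with the returned code. The expensive `du` then runs once a day from the job that generates the content, and the check itself only reads one small file.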
From ae at op5.se Fri Dec 3 17:40:37 2010
From: ae at op5.se (Andreas Ericsson)
Date: Fri, 03 Dec 2010 17:40:37 +0100
Subject: Check New Host & Service added into Monitoring
In-Reply-To:
References:
Message-ID: <4CF91D85.4060002@op5.se>

On 12/03/2010 03:28 PM, trm asn wrote:
> I am looking for a plugin which will check the hosts.cfg, contacts.cfg,
> command.cfg and if found any new entry has been added into those files ,
> then will be notified to the specific email id .
>

Hope you find one.
It shouldn't be hard to write yourself if you don't though. Best of luck.

--
Andreas Ericsson andreas.ericsson at op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace.

From ae at op5.se Fri Dec 3 17:52:52 2010
From: ae at op5.se (Andreas Ericsson)
Date: Fri, 03 Dec 2010 17:52:52 +0100
Subject: high latency
In-Reply-To: <31B0FE0A1A8166409E9DF35C6DEECB24052E4E90@WPSCV6MM.OPR.STATEFARM.ORG>
References: <31B0FE0A1A8166409E9DF35C6DEECB24052956DC@WPSCV6MM.OPR.STATEFARM.ORG><4CF68510.6090400@flatto.net><31B0FE0A1A8166409E9DF35C6DEECB24052E45E5@WPSCV6MM.OPR.STATEFARM.ORG> <5d744ca63ee1f981e90502f67339c5b6.squirrel@webmail.stinkweasel.net><31B0FE0A1A8166409E9DF35C6DEECB24052E4705@WPSCV6MM.OPR.STATEFARM.ORG><4CF7D4FE.7090706@op5.se> <31B0FE0A1A8166409E9DF35C6DEECB24052E4794@WPSCV6MM.OPR.STATEFARM.ORG> <31B0FE0A1A8166409E9DF35C6DEECB24052E48EC@WPSCV6MM.OPR.STATEFARM.ORG> <4CF8D6DF.5080306@op5.se> <31B0FE0A1A8166409E9DF35C6DEECB24052E4E90@WPSCV6MM.OPR.STATEFARM.ORG>
Message-ID: <4CF92064.5040306@op5.se>

On 12/03/2010 04:31 PM, Daniel Wittenberg wrote:
> Pagefaults - 20-30k.
> This seems to be the source of most of the cpu
> system time (understandably), which sits about 40-50%. So if I could
> reduce the pagefaults I think we could gain quite a bit of performance
> back.
>

Over what period of time? Here's from a program running a mere 1.22s, showing 13k pagefaults. The majority of that time is *not* spent trying to load the swapped out mmap regions, but in delta chain lookups inside the program logic. And so the output:

$ time git repack
Counting objects: 397, done.
Delta compression using up to 4 threads.
Compressing objects: 100% (397/397), done.
Writing objects: 100% (397/397), done.
Total 397 (delta 238), reused 0 (delta 0)
0.28user 0.09system 0:01.22elapsed 30%CPU (0avgtext+0avgdata 20544maxresident)k
6368inputs+464outputs (297major+12959minor)pagefaults 0swaps

I really think you're misunderstanding what pagefaults are and how they work. Starting an X-server or openoffice.org is likely to generate somewhere around a million pagefaults each, simply because they use a lot of libraries, read a lot of config files, invoke a lot of helper programs and attempt to access various devices. 20-30k pagefaults is *nothing* for a cpu capable of executing a couple of billion instructions per second.

> I found one other huge issue...somehow in the generic service check, the
> check_interval was set to 5 minutes...however, normal_check_interval
> wasn't set at all and appeared to be checking every minute. I deleted
> check_interval and added normal_check_interval and that helped a ton,
> latency went down to 0.5-1.5 seconds. That was only running 2 active
> checks and about a dozen passive on 700 hosts. I then added back in the
> other 9 active checks and latency once again shot back up to about 2000
> *sigh*.
>

You're doing something weird. I'm 100% certain that this isn't Nagios' fault. Any chance you could share your config off-list? Remove passwords and addresses first if you like.

> I grabbed another vm and made it a dnx client and that seemed to help,
> but wish I could get the main server to handle more. Right now it has
> about 700 hosts and 12,100 service checks, of which about 7000 are
> active and rest are passive.
>

Umm... First you said you added 9 checks and that made the entire thing just blow up, and now you're running 7000 active checks. What checks are you running? If you sort by cpu usage in top, is there anyone that's really prominent?

--
Andreas Ericsson andreas.ericsson at op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace.
From daniel.wittenberg.r0ko at statefarm.com Fri Dec 3 17:57:54 2010
From: daniel.wittenberg.r0ko at statefarm.com (Daniel Wittenberg)
Date: Fri, 3 Dec 2010 09:57:54 -0700
Subject: high latency
In-Reply-To: <4CF92064.5040306@op5.se>
References: <31B0FE0A1A8166409E9DF35C6DEECB24052956DC@WPSCV6MM.OPR.STATEFARM.ORG><4CF68510.6090400@flatto.net><31B0FE0A1A8166409E9DF35C6DEECB24052E45E5@WPSCV6MM.OPR.STATEFARM.ORG> <5d744ca63ee1f981e90502f67339c5b6.squirrel@webmail.stinkweasel.net><31B0FE0A1A8166409E9DF35C6DEECB24052E4705@WPSCV6MM.OPR.STATEFARM.ORG><4CF7D4FE.7090706@op5.se> <31B0FE0A1A8166409E9DF35C6DEECB24052E4794@WPSCV6MM.OPR.STATEFARM.ORG> <31B0FE0A1A8166409E9DF35C6DEECB24052E48EC@WPSCV6MM.OPR.STATEFARM.ORG> <4CF8D6DF.5080306@op5.se> <31B0FE0A1A8166409E9DF35C6DEECB24052E4E90@WPSCV6MM.OPR.STATEFARM.ORG> <4CF92064.5040306@op5.se>
Message-ID: <31B0FE0A1A8166409E9DF35C6DEECB24052E5024@WPSCV6MM.OPR.STATEFARM.ORG>

Sorry for the confusion on that... I added 9 checks to *each* host, and there's about 700 hosts.

No, it's all the nagios daemon itself (nagios -uxd). It feels like if I add that many more checks it has a hard time doing the checks and processing the results, since if I either move the active checking to dnx or drop them completely the load and latency times drop.

Dan

> I grabbed another vm and made it a dnx client and that seemed to help,
> but wish I could get the main server to handle more. Right now it has
> about 700 hosts and 12,100 service checks, of which about 7000 are
> active and rest are passive.
>

Umm... First you said you added 9 checks and that made the entire thing just blow up, and now you're running 7000 active checks. What checks are you running? If you sort by cpu usage in top, is there anyone that's really prominent?

--
Andreas Ericsson andreas.ericsson at op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace.

From gavin at openfusion.com.au Fri Dec 3 18:10:29 2010
From: gavin at openfusion.com.au (Gavin Carr)
Date: Fri, 3 Dec 2010 17:10:29 +0000
Subject: Check size of large directory
In-Reply-To: <1291357794.19682.3.camel@bofh.dtnet.de>
References: <1291303869.9611.31.camel@bofh.dtnet.de> <1291357794.19682.3.camel@bofh.dtnet.de>
Message-ID: <20101203171027.GA5598@openfusion.com.au>

On Fri, Dec 03, 2010 at 07:29:54AM +0100, Sebastian Ries wrote:
>> The check_folder_size.sh script will probably suit your needs well.
>
>Yes, I found scripts for this.
>The Problem is that a du within this folder takes about 2 minutes.
>That's why I told that it is a very large directory with many files)
>
>The other side is that the content within this directory is generated
>once a day so I wanted to run the "du" after generating the content of
>the file and let nagios just check the result that was written in a
>file.

I'd put the smarts in your calculation script, e.g.

    DIR=/path/to/BIGDIR
    FLAG=$DIR/EEK_TOO_BIG
    LIMIT=12345678
    # clear any old flag; stop with exit status 3 if it cannot be removed
    rm -f "$FLAG" || exit 3
    # du -s prints "<size><TAB><path>"; keep only the size
    DIRSIZE=$(du -s "$DIR" | sed 's/\t.*$//')
    test "$DIRSIZE" -ge "$LIMIT" && touch "$FLAG"

or whatever, i.e. have your calc script check the limits and touch a file to signal that there's a problem. Your actual nagios test is then trivial.

Cheers,
Gavin

From daniel.wittenberg.r0ko at statefarm.com Fri Dec 3 19:59:28 2010
From: daniel.wittenberg.r0ko at statefarm.com (Daniel Wittenberg)
Date: Fri, 3 Dec 2010 11:59:28 -0700
Subject: high latency
In-Reply-To: <4CF8D2E0.1060101@op5.se>
References: <31B0FE0A1A8166409E9DF35C6DEECB24052956DC@WPSCV6MM.OPR.STATEFARM.ORG><4CF68510.6090400@flatto.net><31B0FE0A1A8166409E9DF35C6DEECB24052E45E5@WPSCV6MM.OPR.STATEFARM.ORG> <5d744ca63ee1f981e90502f67339c5b6.squirrel@webmail.stinkweasel.net> <31B0FE0A1A8166409E9DF35C6DEECB24052E4705@WPSCV6MM.OPR.STATEFARM.ORG> <4CF7D4FE.7090706@op5.se> <31B0FE0A1A8166409E9DF35C6DEECB24052E4794@WPSCV6MM.OPR.STATEFARM.ORG> <4CF8D2E0.1060101@op5.se>
Message-ID: <31B0FE0A1A8166409E9DF35C6DEECB24052E5186@WPSCV6MM.OPR.STATEFARM.ORG>

It appears that nagios spawns lots and lots of new procs for all the various tasks it does, check results and such. I was curious, wouldn't a model more like Apache work better?
Something like a queue for work, with worker processes that grab jobs off that queue, run a bunch of different jobs, then die, rather than just performing one task? That seems like it would still maintain stability and offer higher performance gains?

Dan

-----Original Message-----
From: Andreas Ericsson [mailto:ae at op5.se]
Sent: Friday, December 03, 2010 5:22 AM
To: Daniel Wittenberg
Cc: Nagios Users List
Subject: Re: [Nagios-users] high latency

On 12/02/2010 06:42 PM, Daniel Wittenberg wrote:
>
> Embedded perl is interesting though, I hadn't tried that, thought it was
> supposed to help with performance.

In theory, it does. It probably does in practice too, but the problems associated with it make it "not worth it".

> I don't think we have any obsessive
> stuff running right now.
>

Check if you're not sure.

> Right now hardware is 4 proc vmware esx, 4GB RAM. For production there
> will be 12 of those boxes with the number of hosts being about 1200-1500
> per nagios server.
>

Virtual systems. Bleh. Anyways, if you're going to use a loadbalanced setup you should look into using Merlin. That way you get complete failover for free.

--
Andreas Ericsson andreas.ericsson at op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace.
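The queue-and-workers model Dan asks about at the top of this message can be illustrated with a small sketch. It is not how Nagios schedules checks; it only shows the shape of the idea, using a fixed pool of worker threads that pull check commands from a shared queue. The names `run_check` and `run_checks` are hypothetical. Two simplifications are assumed: workers are not recycled after a number of jobs (the "then die" part is left out), and each plugin run still starts its own process.

```python
import subprocess
from concurrent.futures import ThreadPoolExecutor


def run_check(command, timeout=30):
    """Run one check command; return (exit_code, first_line_of_output)."""
    try:
        done = subprocess.run(command, stdout=subprocess.PIPE,
                              stderr=subprocess.STDOUT, timeout=timeout,
                              universal_newlines=True)
    except subprocess.TimeoutExpired:
        return 3, "UNKNOWN: check timed out"
    except OSError as err:
        # e.g. the plugin binary does not exist or is not executable
        return 3, "UNKNOWN: %s" % err
    lines = done.stdout.splitlines()
    return done.returncode, lines[0] if lines else ""


def run_checks(commands, workers=4):
    """Feed commands to a fixed pool of workers; results keep input order."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(run_check, commands))
```

The number of concurrent checks is capped by `workers`, so a burst of due checks queues up instead of spawning one helper per check.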
From mark.frost1 at pepsico.com Fri Dec 3 20:14:29 2010
From: mark.frost1 at pepsico.com (Frost, Mark {PBC})
Date: Fri, 3 Dec 2010 14:14:29 -0500
Subject: high latency
In-Reply-To: <4CF8D2E0.1060101@op5.se>
References: <31B0FE0A1A8166409E9DF35C6DEECB24052956DC@WPSCV6MM.OPR.STATEFARM.ORG><4CF68510.6090400@flatto.net><31B0FE0A1A8166409E9DF35C6DEECB24052E45E5@WPSCV6MM.OPR.STATEFARM.ORG> <5d744ca63ee1f981e90502f67339c5b6.squirrel@webmail.stinkweasel.net> <31B0FE0A1A8166409E9DF35C6DEECB24052E4705@WPSCV6MM.OPR.STATEFARM.ORG> <4CF7D4FE.7090706@op5.se> <31B0FE0A1A8166409E9DF35C6DEECB24052E4794@WPSCV6MM.OPR.STATEFARM.ORG> <4CF8D2E0.1060101@op5.se>
Message-ID:

Can the use of dependencies also be the cause of increased latencies? I too struggle with them and I'm running on lightly-loaded physical hardware.

We have 2 servers doing the checks sending back to a central server. Both distributed nodes use ocsp/ochp, but they do nothing more than append results to a file (i.e. it exits quickly). Results are handled outside of Nagios.

What's odd is that distserver 1 and distserver 2 are configured the same

distserver1:
Hosts Checked: 675
Services Checked: 4179
Active Service Latency: 0.000 / 3.155 / 0.382 sec
Active Service Execution Time: 0.000 / 60.038 / 0.145 sec

distserver2:
Hosts Checked: 261
Services Checked: 4289
Active Service Latency: 0.000 / 169.977 / 81.300 sec
Active Service Execution Time: 0.000 / 15.270 / 0.211 sec

yet as you can see, distserver2's latency is much higher and always has been. I tried turning off EPN yesterday on distserver2 and it had no discernable effect.
We added 400 new service checks yesterday on distserver2 (just more of the same checks we already do but on 26 new hosts) and the latency went from 35 to over 80.

The checks we do are very different (Windows, Linux, Unix, many are app-centric) so it's difficult to compare exactly what runs on distserver1 and distserver2, but given the jump yesterday, I'm wondering whether the fact that the checks on these new hosts are all built on dependencies has something to do with it. These hosts (Windows) have a basic check for NRPE and all other checks on the host are dependent on the NRPE check succeeding.

I have to move to all new Nagios servers very soon. I'm interested in Merlin, but given its non-production nature just yet, I'm hesitant to commit and I'm not sure if it will help me here.

Thanks

Mark

From Rick.Carter at umich.edu Fri Dec 3 21:22:24 2010
From: Rick.Carter at umich.edu (Rick Carter)
Date: Fri, 3 Dec 2010 15:22:24 -0500
Subject: Check New Host & Service added into Monitoring
In-Reply-To:
References:
Message-ID: <1FD4BE38-9179-4E05-BFD5-FFDE7DF8DDDD@umich.edu>

We keep those files under RCS revision control and make people check them out and in to add new information.
If we wanted to, I'd probably write an rcsdiff to see if anything's checked out that's not been checked back in.

Otherwise, two ideas come to mind:
- Create a daily cron job to check those files for a date newer than (last-checked-time) and email if they exist
*or*
- Create a daily cron job to diff those files against a saved version of the files and email any diffs it finds, and when done, copy the current files to the saved versions again.

On Dec 3, 2010, at 9:28 AM, trm asn wrote:

> I am looking for a plugin which will check the hosts.cfg, contacts.cfg, command.cfg and if found any new entry has been added into those files , then will be notified to the specific email id .
>
> On Fri, Dec 3, 2010 at 4:23 PM, trm asn wrote:
> Dear List,
>
> Is there any plug-in available for nagios to check the newly added Host/Services/Contacts and notify by email.
>
> /\
> dE

--
Rick Carter, Unix/Linux SysAdmin
University of Michigan, ITS System Support Team
"The best way to find out where you are from is find out where you are going and work backwards." - The Doctor
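The second idea in Rick's message (diff against a saved copy, report, then refresh the copy) could be sketched as below. This is an illustration under assumptions, not an existing plugin: the function name `report_changes` and the snapshot directory are hypothetical, saved copies are keyed by file name only (so two config files with the same base name would collide), and on the first run every line is reported as new. Mailing is left to cron, which normally mails a job's output to the job owner.

```python
import difflib
import os
import shutil


def report_changes(config_paths, snapshot_dir):
    """Diff each file against its saved copy, then refresh the copy.

    Returns the combined unified diff as one string ("" if nothing changed).
    """
    os.makedirs(snapshot_dir, exist_ok=True)
    report = []
    for path in config_paths:
        saved = os.path.join(snapshot_dir, os.path.basename(path))
        with open(path) as handle:
            current = handle.readlines()
        previous = []
        if os.path.exists(saved):
            with open(saved) as handle:
                previous = handle.readlines()
        report.extend(difflib.unified_diff(previous, current,
                                           fromfile=saved, tofile=path))
        # Save the current version so the next run only shows new changes
        shutil.copyfile(path, saved)
    return "".join(report)
```

A daily cron job would call this for hosts.cfg, contacts.cfg and command.cfg and print the result only when it is non-empty, so mail goes out only when something changed.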
From Rick.Carter at umich.edu Fri Dec 3 21:24:28 2010
From: Rick.Carter at umich.edu (Rick Carter)
Date: Fri, 3 Dec 2010 15:24:28 -0500
Subject: Check New Host & Service added into Monitoring
In-Reply-To: <1FD4BE38-9179-4E05-BFD5-FFDE7DF8DDDD@umich.edu>
References: <1FD4BE38-9179-4E05-BFD5-FFDE7DF8DDDD@umich.edu>
Message-ID: <36DFC306-DBE6-4882-948D-5327A800D735@umich.edu>

Actually I thought of further things I'd make such a cron job do (this could easily be a simple bash or perl script)...
- if the files have changed, check their times against the time nagios last restarted and warn if they are newer (as the changes aren't being picked up by the running nagios) and
- run a config check and warn if restarting would fail due to errors.

On Dec 3, 2010, at 3:22 PM, Rick Carter wrote:

> We keep those files under RCS revision control and make people check them out and in to add new information. If we wanted to, I'd probably write an rcsdiff to see if anything's checked out that's not been checked back in.
>
> Otherwise, two ideas come to mind:
> - Create a daily cron job to check those files for a date newer than (last-checked-time) and email if they exist
> *or*
> - Create a daily cron job to diff those files against a save version of the files and email any diffs it finds, and when done, copy the current files to the saved versions again.
>
> On Dec 3, 2010, at 9:28 AM, trm asn wrote:
>
>> I am looking for a plugin which will check the hosts.cfg, contacts.cfg, command.cfg and if found any new entry has been added into those files , then will be notified to the specific email id .
>>
>> On Fri, Dec 3, 2010 at 4:23 PM, trm asn wrote:
>> Dear List,
>>
>> Is there any plug-in available for nagios to check the newly added Host/Services/Contacts and notify by email.
>>
>> /\
>> dE
>
> --
> Rick Carter, Unix/Linux SysAdmin
> University of Michigan, ITS System Support Team
> "The best way to find out where you are from is find out where you are going and work backwards." - The Doctor

--
Rick Carter, Unix/Linux SysAdmin
University of Michigan, ITS System Support Team
"It's worse than you know." "It usually is."
(The Operator and Mal, _Serenity_) -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Increase Visibility of Your 3D Game App & Earn a Chance To Win $500! Tap into the largest installed PC base & get more eyes on your game by optimizing for Intel(R) Graphics Technology. Get started today with the Intel(R) Software Partner Program. Five $500 cash prizes are up for grabs. http://p.sf.net/sfu/intelisp-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From tyarusso at nagios.com Sat Dec 4 22:34:56 2010 From: tyarusso at nagios.com (Tony Yarusso) Date: Sat, 04 Dec 2010 15:34:56 -0600 Subject: How can i show remote nagios results in one interface? In-Reply-To: <4CF8C6CD020000750003080D@leoncountyfl.gov> References: <036e01cb926d$042d2870$0c877950$@dedicatedserversaustralia.com.au> <1291341006.14908.3.camel@sudbury> <4CF8C6CD020000750003080D@leoncountyfl.gov> Message-ID: <1291498496.2854.1.camel@sudbury> On Fri, 2010-12-03 at 10:30 -0500, Jon Adcock wrote: > The info.php page displays just fine, but I get a completely blank > page when I hit the page: http///mntos/index.php Do you > have any ideas about where I should start looking for the problem? Your web server's error log. 
For instance, for Apache on Ubuntu, it's /var/log/apache2/error.log

--
Tony Yarusso
Technical Team
___
Nagios Enterprises, LLC
Email: tyarusso at nagios.com
Web: www.nagios.com

From daniel.wittenberg.r0ko at statefarm.com Sun Dec 5 03:31:49 2010
From: daniel.wittenberg.r0ko at statefarm.com (Daniel Wittenberg)
Date: Sat, 4 Dec 2010 19:31:49 -0700
Subject: high latency
In-Reply-To:
References: <31B0FE0A1A8166409E9DF35C6DEECB24052956DC@WPSCV6MM.OPR.STATEFARM.ORG><4CF68510.6090400@flatto.net><31B0FE0A1A8166409E9DF35C6DEECB24052E45E5@WPSCV6MM.OPR.STATEFARM.ORG><5d744ca63ee1f981e90502f67339c5b6.squirrel@webmail.stinkweasel.net><31B0FE0A1A8166409E9DF35C6DEECB24052E4705@WPSCV6MM.OPR.STATEFARM.ORG><4CF7D4FE.7090706@op5.se><31B0FE0A1A8166409E9DF35C6DEECB24052E4794@WPSCV6MM.OPR.STATEFARM.ORG><31B0FE0A1A8166409E9DF35C6DEECB24052E48EC@WPSCV6MM.OPR.STATEFARM.ORG><4CF8D6DF.5080306@op5.se><4CF8DB6E.7060406@op5.se>
Message-ID: <31B0FE0A1A8166409E9DF35C6DEECB2405332591@WPSCV6MM.OPR.STATEFARM.ORG>

I did some testing today with epn on and off and it didn't seem to make any difference in our latency times. Not overly scientific, but it looked about the same running a few hours each way.

Dan

-----Original Message-----
From: Max Schubert [mailto:maxs at webwizarddesign.com]
Sent: Friday, December 03, 2010 7:03 AM
To: Andreas Ericsson; Nagios Users List
Subject: Re: [Nagios-users] high latency

Latency increases much more quickly for us without epn, as execution times are noticeably longer per check. We use rhel 5.x, so the perl is 5.8.8. We have semi-daily updates to our pollers, and with epn that means cold restarts. Memory leaks have not been noticeable given that scenario, but on test hosts or hosts where we are doing burn-ins the leak is negligible enough that we can go for 2-3 days with no memory issues; we always hit service latency thresholds first. 7 seconds is in general where we have to force a restart of our pollers to prevent metric collection and snmp delta calculation issues.

Max

On 12/3/10, Andreas Ericsson wrote:
> On 12/03/2010 12:46 PM, Max Schubert wrote:
>> I find it interesting that a number of users get performance
>> improvements with embedded perl off - we lose 20-40% polling capacity
>> per poller with it off.
>>
>
> How do you mean that you're losing capacity? Does latency start to creep
> upwards or is load increasing?
>
> Out of interest; How much memory does epn leak nowadays, and which perl
> version is it compiled against?
>
> --
> Andreas Ericsson andreas.ericsson at op5.se
> OP5 AB www.op5.se
> Tel: +46 8-230225 Fax: +46 8-230231
>
> Considering the successes of the wars on alcohol, poverty, drugs and
> terror, I think we should give some serious thought to declaring war
> on peace.

From AdcockJ at leoncountyfl.gov Sun Dec 5 05:19:40 2010
From: AdcockJ at leoncountyfl.gov (Jon Adcock)
Date: Sat, 04 Dec 2010 23:19:40 -0500
Subject: How can i show remote nagios results in one interface?
In-Reply-To: <1291498496.2854.1.camel@sudbury>
References: <036e01cb926d$042d2870$0c877950$@dedicatedserversaustralia.com.au> <1291341006.14908.3.camel@sudbury> <4CF8C6CD020000750003080D@leoncountyfl.gov> <1291498496.2854.1.camel@sudbury>
Message-ID: <4CFACC8C0200007500030890@leoncountyfl.gov>

Tony,

That was a good recommendation. I looked in the Apache2 error.log and found a bunch of XSLProcessor errors. I figured out that those errors mean that php5-xsl was not installed. After installing that package, MNTOS displayed properly.
Jon Adcock
Network Systems Administrator
MIS / Systems Team
Leon County
(850) 606-5500

>>> Tony Yarusso 12/4/2010 4:34 PM >>>
On Fri, 2010-12-03 at 10:30 -0500, Jon Adcock wrote:
> The info.php page displays just fine, but I get a completely blank
> page when I hit the page: http///mntos/index.php Do you
> have any ideas about where I should start looking for the problem?

Your web server's error log. For instance, for Apache on Ubuntu, it's /var/log/apache2/error.log

--
Tony Yarusso
Technical Team
___
Nagios Enterprises, LLC
Email: tyarusso at nagios.com
Web: www.nagios.com

From john at andrunas.net Sun Dec 5 20:58:50 2010
From: john at andrunas.net (John Andrunas)
Date: Sun, 5 Dec 2010 11:58:50 -0800
Subject: monitoring interface drops
Message-ID:

Does anyone have a good way of monitoring interface drops on cisco switches? I can use check_snmp to check drops, but since they are counters they only increase, so I can't set a simple threshold. What I need to do is alert if the current number is greater than it was at the last check.

--
John
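One way to read the request above (alert only when the drop counter has grown since the previous check) is a small wrapper that remembers the last reading in a state file. The following is a minimal sketch, not an existing Nagios plugin: the function name `check_counter`, the JSON state file and the choice of CRITICAL for any increase are all assumptions made for illustration, and the counter value itself would still have to come from an SNMP query such as the check_snmp call mentioned above.

```python
import json
import os

# Standard Nagios plugin exit codes (WARNING = 1 is unused in this sketch).
OK, CRITICAL, UNKNOWN = 0, 2, 3


def check_counter(current, state_path):
    """Compare a counter reading with the one saved by the previous run.

    `current` is the counter value read this time; `state_path` is a
    hypothetical file that stores the previous reading between runs.
    Returns a (status_code, message) tuple.
    """
    previous = None
    if os.path.exists(state_path):
        with open(state_path) as fh:
            previous = json.load(fh)["value"]

    # Always save the new reading so the next run compares against it.
    with open(state_path, "w") as fh:
        json.dump({"value": current}, fh)

    if previous is None:
        return UNKNOWN, "no previous reading yet"
    if current < previous:
        # Counters only go down after a wrap or a device reload.
        return UNKNOWN, "counter went backwards (wrap or reload?)"
    delta = current - previous
    if delta > 0:
        return CRITICAL, f"{delta} new drops since last check"
    return OK, "no new drops since last check"
```

A real plugin would print the message and exit with the status code, and would need one state file per host and interface so that readings do not overwrite each other.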
From ae at op5.se Mon Dec 6 12:06:24 2010
From: ae at op5.se (Andreas Ericsson)
Date: Mon, 06 Dec 2010 12:06:24 +0100
Subject: high latency
In-Reply-To:
References: <31B0FE0A1A8166409E9DF35C6DEECB24052956DC@WPSCV6MM.OPR.STATEFARM.ORG><4CF68510.6090400@flatto.net><31B0FE0A1A8166409E9DF35C6DEECB24052E45E5@WPSCV6MM.OPR.STATEFARM.ORG> <5d744ca63ee1f981e90502f67339c5b6.squirrel@webmail.stinkweasel.net> <31B0FE0A1A8166409E9DF35C6DEECB24052E4705@WPSCV6MM.OPR.STATEFARM.ORG> <4CF7D4FE.7090706@op5.se> <31B0FE0A1A8166409E9DF35C6DEECB24052E4794@WPSCV6MM.OPR.STATEFARM.ORG> <4CF8D2E0.1060101@op5.se>
Message-ID: <4CFCC3B0.6050706@op5.se>

On 12/03/2010 08:14 PM, Frost, Mark {PBC} wrote:
>
> Can the use of dependencies also be the cause of increased latencies?
>

If they're very deep, it's possible. Otherwise it really shouldn't matter all that much. It will of course add *some* load, but it shouldn't be enough to cause latency.

> I too struggle with them and I'm running on lightly-loaded physical hardware.
> We have 2 servers doing the checks sending back to a central server. Both
> distributed nodes use ocsp/ochp, but they do nothing more than append results
> to a file (i.e. it exits quickly). Results are handled outside of Nagios.
>

Try getting rid of the oc[sh]p commands and use Merlin, or google for "pnsca" or "persistent nsca". There's one available from op5's repositories that may or may not work, and there's one from somewhere else that they're apparently using to great effect. Even if it exits quickly, it's still executed serially, so checking halts for a small period of time for each and every check that runs.

> What's odd is that distserver 1 and distserver 2 are configured the same
>
> distserver1:
> Hosts Checked 675
> Services Checked: 4179
> Active Service Latency: 0.000 / 3.155 / 0.382 sec
> Active Service Execution Time: 0.000 / 60.038 / 0.145 sec
>
> distserver2:
> Hosts Checked: 261
> Services Checked: 4289
> Active Service Latency: 0.000 / 169.977 / 81.300 sec
> Active Service Execution Time: 0.000 / 15.270 / 0.211 sec
>
> yet as you can see, distserver2's latency is much higher and always has been.
> I tried turning off EPN yesterday on distserver2 and it had no discernible effect.
> We added 400 new service checks yesterday on distserver2 (just more of the same
> checks we already do but on 26 new hosts) and the latency went from 35 to over 80.
>

What kind of checks are you running? Some plugins draw a lot of cpu. Are any of the checks set to run in serial (grep for parallelize_check in your objects.cache file)? What version of Nagios are you running?

> The checks we do are very different (Windows, Linux, Unix, many are app-centric) so
> it's difficult to compare exactly what runs on distserver1 and distserver2, but given
> the jump that was taken yesterday, I'm wondering if the fact that the type of checks
> on these new hosts are all built on dependencies make me wonder if that doesn't
> have something to do with it. These hosts (Windows) have a basic check for NRPE
> and all other checks on the host are dependent on the NRPE check succeeding.
>
> I have to move to all new Nagios servers very soon. I'm interested in Merlin, but
> given its non-production nature just yet, I'm hesitant to commit and I'm not sure if
> it will help me here.
>

It's been running at our 400+ customers with very few problems for the past month. 0.9.1, released just yesterday, solves the known issues our customers have encountered. You might want to take a look at it again. There are some issues on FreeBSD though (was that you reporting them?). I just recently got a new laptop with better support for running virtual systems, so I'm downloading a FreeBSD 8.1 install dvd as we speak. Hopefully I'll have those issues sorted out before the end of the week.

--
Andreas Ericsson andreas.ericsson at op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.
From ae at op5.se Mon Dec 6 12:05:10 2010
From: ae at op5.se (Andreas Ericsson)
Date: Mon, 06 Dec 2010 12:05:10 +0100
Subject: high latency
In-Reply-To: <31B0FE0A1A8166409E9DF35C6DEECB24052E5186@WPSCV6MM.OPR.STATEFARM.ORG>
References: <31B0FE0A1A8166409E9DF35C6DEECB24052956DC@WPSCV6MM.OPR.STATEFARM.ORG><4CF68510.6090400@flatto.net><31B0FE0A1A8166409E9DF35C6DEECB24052E45E5@WPSCV6MM.OPR.STATEFARM.ORG> <5d744ca63ee1f981e90502f67339c5b6.squirrel@webmail.stinkweasel.net> <31B0FE0A1A8166409E9DF35C6DEECB24052E4705@WPSCV6MM.OPR.STATEFARM.ORG> <4CF7D4FE.7090706@op5.se> <31B0FE0A1A8166409E9DF35C6DEECB24052E4794@WPSCV6MM.OPR.STATEFARM.ORG> <4CF8D2E0.1060101@op5.se> <31B0FE0A1A8166409E9DF35C6DEECB24052E5186@WPSCV6MM.OPR.STATEFARM.ORG>
Message-ID: <4CFCC366.2070303@op5.se>

On 12/03/2010 07:59 PM, Daniel Wittenberg wrote:
> It appears that nagios spawns lots and lots of new procs for all the
> various tasks it does, check results and such. I was curious, wouldn't
> a model more like Apache work better? Something like, a queue for work,
> and have worker processes grab off that queue, run a bunch of different
> jobs, then die, rather than just performing one task? That seems like
> it would still maintain stability and offer higher performance gains ?
>

It probably would, and it's on the roadmap to rewrite those parts of Nagios to something similar to what you've described.

--
Andreas Ericsson andreas.ericsson at op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.

From ej_seg at hotmail.com Mon Dec 6 14:52:34 2010
From: ej_seg at hotmail.com (Rikard Dahlberg)
Date: Mon, 6 Dec 2010 13:52:34 +0000
Subject: Monitoring remote hosts
Message-ID:

Hello

I'm just starting with nagios and I'm trying to learn everything at once. At this moment I'm trying to get a remote windows 2008 server to be monitored. It's on a different network, so I've decided to use NSCA to monitor it via passive checks.
However, I get an error message at the remote server saying "Could not connect to: xxx.xxx.xxx.xxx:5667 10061: No connection could be made because the target machine actively refused it." I've checked the /usr/local/nagios/var/log/nagios.log and /var/log/syslog.log files and they come up blank.

I'm sure I've set up the same encryption on both sides and I'm sure that they both use the same password...
Any ideas how to start troubleshooting it? And since I'm new to nagios, please explain thoroughly :)

//Rikard

From nagios at flatto.net Mon Dec 6 15:19:45 2010
From: nagios at flatto.net (Assaf Flatto)
Date: Mon, 06 Dec 2010 14:19:45 +0000
Subject: Monitoring remote hosts
In-Reply-To:
References:
Message-ID: <4CFCF101.4030600@flatto.net>

Rikard,

At first glance, several factors may contribute to the issue you are having:

1) You may not have the nsca daemon running on the nagios server to receive the data sent from the submitting server. To solve that, look at the nsca documentation and see about setting up the receiving end.
2) The local linux firewall may be blocking the nsca port (as it is a non-standard port, you will need to open access for it specifically). Remember that NSCA uses TCP (port 5667 by default), not UDP, when opening the firewall.
3) Do you have any extra security on the nagios server in the form of third-party security software?

Assaf

On 06/12/10 13:52, Rikard Dahlberg wrote:
> Hello
>
> I'm just starting with nagios and im trying to learn everything at once.
> At this moment im trying to get a remote windows 2008 server to be
> monitored, its on a different network so i've decided to use NSCA to
> monitor it via passive checks.
> However I get an error message at the remote server saying "Could not
> connect to: xxx.xxx.xxx.xxx:5667 10061: No connection could be made
> because the target machine actively refused it." And i've checked in
> the /usr/local/nagios/var/log/nagios.log and the /var/log/syslog.log
> files and they come up blank.
>
> Im sure ive set up same encryption on both sides and im sure that they
> both use the same password...
> Any ideas how to start troubleshooting it? And since im new to
> nagios, please explain thouroly :)
>
> //Rikard

--
Never,Ever Cut A Deal With a Dragon

Next year I will be doing the London to Paris bike ride to raise money for the DogTrust (www.dogstrust.co.uk) . Please Sponsor me at http://www.justgiving.com/Assaf-Flatto

From dtuecks at googlemail.com Mon Dec 6 15:20:50 2010
From: dtuecks at googlemail.com (Daniel Tuecks)
Date: Mon, 6 Dec 2010 15:20:50 +0100
Subject: Monitoring remote hosts
In-Reply-To:
References:
Message-ID:

Hi Rikard,

is your nsca daemon running (on your nagios host)? You can check the process list via 'ps aux | grep nsca'. Furthermore you should try to connect to port 5667 (from localhost and/or your windows server): telnet localhost 5667. If you get connected you should verify your network setup (Firewalls? Do you use (x)inetd? If so, is the option 'only_from' configured?) ...
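The telnet test described above can also be scripted, which is handy when telnet is not installed on the Windows side. This is a minimal sketch under stated assumptions: `port_open` is a hypothetical helper name, Python is assumed to be available on the machine you test from, and 5667 is simply the port shown in the error message. A refused connection (the 10061 error) shows up here as `False`.

```python
import socket


def port_open(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds.

    A refused connection, an unreachable host or a timeout all raise
    OSError (or a subclass), so every failure is reported as False.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


# Example use on the nagios host (prints True only if something listens):
# print(port_open("localhost", 5667))
```

If the check succeeds from localhost but fails from the remote server, the daemon is listening and the problem is more likely a firewall or an access restriction in between.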
Daniel

2010/12/6 Rikard Dahlberg :
> Hello
>
> I'm just starting with nagios and im trying to learn everything at once.
> At this moment im trying to get a remote windows 2008 server to be
> monitored, its on a different network so i've decided to use NSCA to monitor
> it via passive checks.
> However I get an error message at the remote server saying "Could not
> connect to: xxx.xxx.xxx.xxx:5667 10061: No connection could be made because
> the target machine actively refused it." And i've checked in the
> /usr/local/nagios/var/log/nagios.log and the /var/log/syslog.log files and
> they come up blank.
>
> Im sure ive set up same encryption on both sides and im sure that they both
> use the same password...
> Any ideas how to start troubleshooting it? And since im new to nagios,
> please explain thouroly :)
>
> //Rikard

From rik.dahlberg at gmail.com Mon Dec 6 15:52:59 2010
From: rik.dahlberg at gmail.com (Rikard Dahlberg)
Date: Mon, 6 Dec 2010 14:52:59 +0000
Subject: Monitoring remote hosts
In-Reply-To:
References: ,
Message-ID:

Hey Daniel!

Wow, this really showed my error, I think. I can't connect either on localhost:5667 or from the server via telnet 5667...
Could I ask you to walk me through how to troubleshoot this? That would really make my day :)
Yes, I use xinetd. This is the config file:

# description: NSCA (Nagios Service Check Acceptor)
service nsca
{
    flags           = REUSE
    type            = UNLISTED
    port            = 5667
    socket_type     = stream
    wait            = no
    user            = nagiosadmin
    group           = nagios
    server          = /usr/sbin/nsca
    server_args     = -c /etc/nsca.cfg .inetd
    log_on_failure  += USERID
    disable         = no
    only_from       = 127.0.0.1 *.*.*.*
}

I'm a total newcomer to this. What should I change? :)

/Rikard

> Date: Mon, 6 Dec 2010 15:20:50 +0100
> From: dtuecks at googlemail.com
> To: nagios-users at lists.sourceforge.net
> Subject: Re: [Nagios-users] Monitoring remote hosts
>
> Hi Rikard,
>
> is your nsca daemon running (on your nagios host)? You can check the
> process list via 'ps aux | grep nsca'. Furthermore you should try to
> connect to port 5667 (from localhost and/or your windows server):
> telnet localhost 5667. If you get connected you should verify your
> network setup (Firewalls? Do you use (x)inetd? If so, is the option
> 'only_from' configured?) ...
>
> Daniel

From dtuecks at googlemail.com Mon Dec 6 19:44:27 2010
From: dtuecks at googlemail.com (Daniel Tuecks)
Date: Mon, 6 Dec 2010 19:44:27 +0100
Subject: Monitoring remote hosts
In-Reply-To:
References:
Message-ID:

Hi Rikard,

I assume you have checked the path and rights to all files. What seems to be wrong is your 'server_args' parameter.

    server_args = -c /etc/nsca.cfg .inetd

The option 'inetd' must be added via '--' and not '.'.

    server_args = -c /etc/nsca.cfg --inetd

I think that should do the trick. (Don't forget to restart xinetd).
If it still doesn't work, try to start your nsca in daemon mode. As user 'nagiosadmin', type

    /usr/sbin/nsca -c /etc/nsca.cfg -d

and repeat your tests.

Daniel

2010/12/6 Rikard Dahlberg :
>
> Hey daniel!
>
> Wow, this really showed my error i think. I cant connect either on
> localhost:5667 nor from the server via telnet 5667...
> Could i ask you if you could walk me through how to troubleshoot this? That
> would really make my day :)
> yes, i use xinetd
> this is the config file :
>
> # description: NSCA (Nagios Service Check Acceptor)
> service nsca
> {
>     flags           = REUSE
>     type            = UNLISTED
>     port            = 5667
>     socket_type     = stream
>     wait            = no
>     user            = nagiosadmin
>     group           = nagios
>     server          = /usr/sbin/nsca
>     server_args     = -c /etc/nsca.cfg .inetd
>     log_on_failure  += USERID
>     disable         = no
>     only_from       = 127.0.0.1 *.*.*.*
> }
>
> im a total tool coming to this, what should i change? :)
> /Rikard
::: Messages without supporting info will risk being sent to /dev/null

From iehrenwald at tripadvisor.com Mon Dec 6 20:02:46 2010
From: iehrenwald at tripadvisor.com (Ian Ehrenwald)
Date: Mon, 6 Dec 2010 14:02:46 -0500
Subject: Alerting based on past-to-current trends?
Message-ID:

Hello

I was wondering if there was a straight-forward way to alert based on an
average of past data plus a current perfdata entry. I understand I'm not
explaining it very well that way, so here is the real-world example I am
working with -

I am polling a set of machines via SNMP for CPU load every 1 minute
(looking at hrProcessorLoad). If the return value is at or above 95%, send
out a WARNING. If the return value is 98% or above, send out a CRITICAL.
The problem here is that it's OK for a process to take up 100% CPU for
multiple seconds, and sometimes that high CPU usage coincides with the SNMP
%CPU query, so I get a lot of false alerts.

Is there a way to use past perfdata in conjunction with the current
returned data to generate an average and send a WARNING or CRITICAL based
on that new number? I only care to get alerted from Nagios if, for example,
the %CPU has been at 100% for 5 minutes.

Or am I just way over-thinking this and should be monitoring 1m, 5m, 15m
UNIX load averages (which doesn't seem that accurate anyway)? What are
other people doing to monitor CPU usage and alert on abnormal long periods
of utilization?

Thanks for your help.

Ian Ehrenwald

From AHKAPLAN at PARTNERS.ORG Mon Dec 6 20:50:51 2010
From: AHKAPLAN at PARTNERS.ORG (Kaplan, Andrew H.)
Date: Mon, 6 Dec 2010 14:50:51 -0500
Subject: Determining what is causing a high load reported by check_load plugin
Message-ID:

Hi there --

We are running Nagios 3.1.2 server, and the client that is the subject of
this e-mail is running version 2.6 of the nrpe client.

The check_load plugin, version 1.4, is indicating the past three readings
are the following:

load average: 71.00, 71.00, 70.95 CRITICAL

The critical threshold of the plugin has been set to the 30, 25, 20
settings.

When I checked the client in question, the first thing I did was to run the
top command. The results are shown below:

CPU0 states: 0.0% user, 0.0% system, 0.0% nice, 100.0% idle
CPU1 states: 0.0% user, 0.0% system, 0.0% nice, 100.0% idle
CPU2 states: 1.0% user, 4.0% system, 0.0% nice, 93.0% idle
Mem: 2064324K av, 2032308K used, 32016K free, 0K shrd, 509924K buff
Swap: 2096472K av, 21432K used, 2075040K free 1035592K cached

The one thing that I noticed was the amount of free memory was at
thirty-two megabytes. I wanted to know if that was what was causing the
critical status to occur, or if there is something(s) else that I should
investigate.

Thanks.

The information in this e-mail is intended only for the person to whom it is addressed. If you believe this e-mail was sent to you in error and the e-mail contains patient information, please contact the Partners Compliance HelpLine at http://www.partners.org/complianceline .
If the e-mail was sent to you in error but does not contain patient information, please contact the sender and properly dispose of the e-mail.

From daniel.wittenberg.r0ko at statefarm.com Mon Dec 6 18:03:58 2010
From: daniel.wittenberg.r0ko at statefarm.com (Daniel Wittenberg)
Date: Mon, 6 Dec 2010 10:03:58 -0700
Subject: Monitoring remote hosts
In-Reply-To:
References: ,
Message-ID: <31B0FE0A1A8166409E9DF35C6DEECB2405332B0B@WPSCV6MM.OPR.STATEFARM.ORG>

You might want to check:

netstat -nap |grep 5667

That will show you what's actually listening on that port, if anything.
A port that is already taken is usually one big reason xinetd can't start
a service, for example when an RC script is starting up NSCA outside of
xinetd.

Dan

From: Rikard Dahlberg [mailto:rik.dahlberg at gmail.com]
Sent: Monday, December 06, 2010 8:53 AM
To: nagios-users at lists.sourceforge.net
Subject: Re: [Nagios-users] Monitoring remote hosts

Hey daniel!

Wow, this really showed my error i think. I cant connect either on localhost:5667 nor from the server via telnet 5667...
Could i ask you if you could walk me through how to troubleshoot this? That would really make my day :) yes, i use xinetd this is the config file : # description: NSCA (Nagios Service Check Acceptor) service nsca { flags = REUSE type = UNLISTED port = 5667 socket_type = stream wait = no user = nagiosadmin group = nagios server = /usr/sbin/nsca server_args = -c /etc/nsca.cfg .inetd log_on_failure += USERID disable = no only_from = 127.0.0.1 *.*.*.* } im a total tool coming to this, what should i change? :) /Rikard > Date: Mon, 6 Dec 2010 15:20:50 +0100 > From: dtuecks at googlemail.com > To: nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] Monitoring remote hosts > > Hi Rikard, > > is your nsca daemon running (on your nagios host?). You can check the > process list via 'ps aux | grep nsca'. Furthermore you should try to > connect to port 5667 (from localhost and/or your windows server): > telnet localhost 5667. If you get connected you should verify your > network setup (Firewalls? Do you use (x)inetd? If so, is the option > 'allow_only' configured?) ... > > Daniel > > 2010/12/6 Rikard Dahlberg : > > Hello > > > > I'm just starting with nagios and im trying to learn everything at once. > > At this moment im trying to get a remote windows 2008 server to be > > monitored, its on a different network so i've decided to use NSCA to monitor > > it via passive checks. > > However I get an error message at the remote server saying "Could not > > connect to: xxx.xxx.xxx.xxx:5667 10061: No connection could be made because > > the target machine actively refused it." And i've checked in the > > /usr/local/nagios/var/log/nagios.log and the /var/log/syslog.log files and > > they come up blank. > > > > Im sure ive set up same encryption on both sides and im sure that they both > > use the same password... > > Any ideas how to start troubleshooting it? 
And since im new to nagios, > > please explain thouroly :) > > > > //Rikard > > > > ------------------------------------------------------------------------ ------ > > What happens now with your Lotus Notes apps - do you make another costly > > upgrade, or settle for being marooned without product support? Time to move > > off Lotus Notes and onto the cloud with Force.com, apps are easier to build, > > use, and manage than apps on traditional platforms. Sign up for the Lotus > > Notes Migration Kit to learn more. http://p.sf.net/sfu/salesforce-d2d > > _______________________________________________ > > Nagios-users mailing list > > Nagios-users at lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when reporting > > any issue. > > ::: Messages without supporting info will risk being sent to /dev/null > > > > ------------------------------------------------------------------------ ------ > What happens now with your Lotus Notes apps - do you make another costly > upgrade, or settle for being marooned without product support? Time to move > off Lotus Notes and onto the cloud with Force.com, apps are easier to build, > use, and manage than apps on traditional platforms. Sign up for the Lotus > Notes Migration Kit to learn more. http://p.sf.net/sfu/salesforce-d2d > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null -------------- next part -------------- An HTML attachment was scrubbed... 
URL: -------------- next part -------------- ------------------------------------------------------------------------------ What happens now with your Lotus Notes apps - do you make another costly upgrade, or settle for being marooned without product support? Time to move off Lotus Notes and onto the cloud with Force.com, apps are easier to build, use, and manage than apps on traditional platforms. Sign up for the Lotus Notes Migration Kit to learn more. http://p.sf.net/sfu/salesforce-d2d -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mark.frost1 at pepsico.com Mon Dec 6 21:12:04 2010 From: mark.frost1 at pepsico.com (Frost, Mark {PBC}) Date: Mon, 6 Dec 2010 15:12:04 -0500 Subject: high latency In-Reply-To: <4CFCC3B0.6050706@op5.se> References: <31B0FE0A1A8166409E9DF35C6DEECB24052956DC@WPSCV6MM.OPR.STATEFARM.ORG><4CF68510.6090400@flatto.net><31B0FE0A1A8166409E9DF35C6DEECB24052E45E5@WPSCV6MM.OPR.STATEFARM.ORG> <5d744ca63ee1f981e90502f67339c5b6.squirrel@webmail.stinkweasel.net> <31B0FE0A1A8166409E9DF35C6DEECB24052E4705@WPSCV6MM.OPR.STATEFARM.ORG> <4CF7D4FE.7090706@op5.se> <31B0FE0A1A8166409E9DF35C6DEECB24052E4794@WPSCV6MM.OPR.STATEFARM.ORG> <4CF8D2E0.1060101@op5.se> <4CFCC3B0.6050706@op5.se> Message-ID: > -----Original Message----- > From: Andreas Ericsson [mailto:ae at op5.se] > Sent: Monday, December 06, 2010 6:06 AM > To: Nagios Users List > Cc: Frost, Mark {PBC} > Subject: Re: [Nagios-users] high latency > > On 12/03/2010 08:14 PM, Frost, Mark {PBC} wrote: > > > > I too struggle with them and I'm running on lightly-loaded physical hardware. > > We have 2 servers doing the checks sending back to a central server. 
Both > > distributed nodes use ocsp/ochp, but they do nothing more than append results > > to a file (i.e. it exits quickly). Results are handled outside of Nagios. > > > > Try getting rid of the oc[sh]p commands and use Merlin or google for "pnsca" or > "persistent nsca". There's one available from op5's repositories that may or may > not work, and there's one from somewhere else that they're apparently using to > great effect. > > Even if it exits quickly, it's still executed serially, so checking halts a > small period of time for each and every check that runs. Hmm. So then I'd be so curious why the 2 distservers which are both using oc[sh]p commands the same way have such radically different latencies. Either way, you're suggesting that having a NEB module handle the post-check work will eliminate the serialization. > > What's odd is that distserver 1 and distserver 2 are configured the same > > > > distserver1: > > Hosts Checked 675 > > Services Checked: 4179 > > Active Service Latency: 0.000 / 3.155 / 0.382 sec > > Active Service Execution Time: 0.000 / 60.038 / 0.145 sec > > > > distserver2: > > Hosts Checked: 261 > > Services Checked: 4289 > > Active Service Latency: 0.000 / 169.977 / 81.300 sec > > Active Service Execution Time: 0.000 / 15.270 / 0.211 sec > > > > yet as you can see, distserver2's latency is much higher and always has been. > > I tried turning off EPN yesterday on distserver2 and it had no discernable effect. > > We added 400 new service checks yesterday on distserver2 (just more of the same > > checks we already do but on 26 new hosts) and the latency went from 35 to over 80. > > > > What kind of checks are you running? Some plugins draw a lot of cpu. > Are any of the checks set to run in serial (grep for parallelize_check in your > objects.cache file). parallelize_check is set to 1 everywhere. Most things are NRPE checks (also NRPE to NSClient++). 
Some are locally running perl scripts and others are locally running things like check_http. > What version of Nagios are you running? > 3.2.1 > > The checks we do are very different (Windows, Linux, Unix, many are app-centric) so > > it's difficult to compare exactly what runs on distserver1 and distserver2, but given > > the jump that was taken yesterday, I'm wondering if the fact that the type of checks > > on these new hosts are all built on dependencies make me wonder if that doesn't > > have something to do with it. These hosts (Windows) have a basic check for NRPE > > and all other checks on the host are dependent on the NRPE check succeeding. > > > > I have to move to all new Nagios servers very soon. I'm interested in Merlin, but > > given its non-production nature just yet, I'm hesitant to commit and I'm not sure if > > it will help me here. > > > It's been running at our 400+ customers with very few problems for the past month. > 0.9.1, released just yesterday, solves the known issues our customers have > encountered. You might want to take a look at it again. There are some issues on > FreeBSD though (was that you reporting them?). I just recently got a new laptop > with better support for running virtual systems, so I'm downloading a FreeBSD 8.1 > install dvd as we speak. Hopefully I'll have those issues sorted out before the > end of the week. > > -- > Andreas Ericsson andreas.ericsson at op5.se Thanks, Andreas. I'm hoping to allocate sufficient resources on the new servers to be able to play with Merlin more there. Will I be able to have the performance data from a poller be sent up to a NOC for digestion by pnp4nagios? It may have been a long time ago, but I thought I remember seeing that performance data was not yet implemented. No we'd be using some flavor of SLES. 
Thanks Mark ------------------------------------------------------------------------------ What happens now with your Lotus Notes apps - do you make another costly upgrade, or settle for being marooned without product support? Time to move off Lotus Notes and onto the cloud with Force.com, apps are easier to build, use, and manage than apps on traditional platforms. Sign up for the Lotus Notes Migration Kit to learn more. http://p.sf.net/sfu/salesforce-d2d _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lists at xodus.org Mon Dec 6 22:26:23 2010 From: lists at xodus.org (Marc Powell) Date: Mon, 6 Dec 2010 15:26:23 -0600 Subject: Determining what is causing a high load reported by check_load plugin In-Reply-To: References: Message-ID: On Mon, Dec 6, 2010 at 1:50 PM, Kaplan, Andrew H. wrote: > Hi there -- > > We are running Nagios 3.1.2 server, and the client that is the subject of > this e-mail is running version 2.6 of the nrpe client. > > The check_load plugin, version 1.4, is indicating the past three readings > are the following: > > load average: 71.00, 71.00, 70.95 CRITICAL > > The critical threshold of the plugin has been set to the 30, 25, 20 > settings. > > When I checked the client in question, the first thing I did was to run the > top command. 
The results are shown below:
>
> CPU0 states: 0.0% user, 0.0% system, 0.0% nice, 100.0% idle
> CPU1 states: 0.0% user, 0.0% system, 0.0% nice, 100.0% idle
> CPU2 states: 1.0% user, 4.0% system, 0.0% nice, 93.0% idle
> Mem: 2064324K av, 2032308K used, 32016K free, 0K shrd, 509924K buff
> Swap: 2096472K av, 21432K used, 2075040K free 1035592K cached
>
> The one thing that I noticed was the amount of free memory was at
> thirty-two megabytes. I wanted to know if that was
> what was causing the critical status to occur, or if there is something(s)
> else that I should investigate.
>

Memory is not a factor in the load calculation, only the number of
processes running or waiting to run. For at least 15 minutes you had
approximately 71 processes either running or ready to run and waiting on
CPU resources. Running top/ps was the right thing to do but you really need
to do it when the problem is occurring to see what's actually using all the
CPU resources. There are far too many reasons why load could be high but it
should be easy for someone familiar with your system to figure it out (at
least generally) while in-the-act.

--
Marc

From daniel.wittenberg.r0ko at statefarm.com Mon Dec 6 22:39:45 2010
From: daniel.wittenberg.r0ko at statefarm.com (Daniel Wittenberg)
Date: Mon, 6 Dec 2010 14:39:45 -0700
Subject: Determining what is causing a high load reported by check_load plugin
In-Reply-To:
References:
Message-ID: <31B0FE0A1A8166409E9DF35C6DEECB2405332E6B@WPSCV6MM.OPR.STATEFARM.ORG>

In top, does it show the same load values? The status of your memory
shouldn't cause the nagios plugin to report high cpu. What does the uptime
command say? Try running the check_load script by hand on that host and
verify it returns the same results.

Dan

From: Marc Powell [mailto:lists at xodus.org]
Sent: Monday, December 06, 2010 3:26 PM
To: Nagios Users List
Subject: Re: [Nagios-users] Determining what is causing a high load reported by check_load plugin

On Mon, Dec 6, 2010 at 1:50 PM, Kaplan, Andrew H. wrote:

Hi there --

We are running Nagios 3.1.2 server, and the client that is the subject of
this e-mail is running version 2.6 of the nrpe client.

The check_load plugin, version 1.4, is indicating the past three readings
are the following:

load average: 71.00, 71.00, 70.95 CRITICAL

The critical threshold of the plugin has been set to the 30, 25, 20
settings.

When I checked the client in question, the first thing I did was to run the top command.
The results are shown below: CPU0 states: 0.0% user, 0.0% system, 0.0% nice, 100.0% idle CPU1 states: 0.0% user, 0.0% system, 0.0% nice, 100.0% idle CPU2 states: 1.0% user, 4.0% system, 0.0% nice, 93.0% idle Mem: 2064324K av, 2032308K used, 32016K free, 0K shrd, 509924K buff Swap: 2096472K av, 21432K used, 2075040K free 1035592K cached The one thing that I noticed was the amount of free memory was at thirty-two megabytes. I wanted to know if that was what was causing the critical status to occur, or if there is something(s) else that I should investigate. Memory is not a factor in the load calculation, only the number of processes running or waiting to run. For at least 15 minutes you had approximately 71 processes either running or ready to run and waiting on CPU resources. Running top/ps was the right thing to do but you really need to do it when the problem is occurring to see what's actually using all the CPU resources. There are far too many reasons why load could be high but it should be easy for someone familiar with your system to figure it out (at least generally) while in-the-act. -- Marc -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ What happens now with your Lotus Notes apps - do you make another costly upgrade, or settle for being marooned without product support? Time to move off Lotus Notes and onto the cloud with Force.com, apps are easier to build, use, and manage than apps on traditional platforms. Sign up for the Lotus Notes Migration Kit to learn more. http://p.sf.net/sfu/salesforce-d2d -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From jwiggins at salon.com Tue Dec 7 00:10:31 2010
From: jwiggins at salon.com (Jonathan Wiggins)
Date: Mon, 6 Dec 2010 15:10:31 -0800
Subject: question about additional plugins and/or changing default port on supplemented plugin
Message-ID: <55B6A786-2505-426A-8E54-213992A749E4@salon.com>

I want to find out if I can simply change the check_command for smtp, pop,
and imap to verify ports 587, 995, and 993 respectively.

On the nagios server, in commands.cfg, I thought perhaps I could just
change this:

# 'check_pop' command definition
define command{
        command_name    check_pop
        command_line    $USER1$/check_pop -H $HOSTADDRESS$ $ARG1$
        }

to...

# 'check_pop' command definition
define command{
        command_name    check_pop
        command_line    $USER1$/check_pop -p 995 -H $HOSTADDRESS$ $ARG1$
        }

but I got a critical alert almost immediately. iptables isn't running,
SELinux is disabled, and the hosts are on the same network.

I found a thread online that showed this:

command[check_pops]=/usr/lib/nagios/plugins/check_pop -p 995 -4 -w 10 -c 20

along with ...

command[check_imaps]=/usr/lib/nagios/plugins/check_imap -p 993 -4 -w 10 -c 20

Does that mean this person rolled their own? I tried looking on Nagios
Exchange for the secure plugins for IMAP, POP, and SMTP, and didn't see
them.

Thanks in advance

J

From ej_seg at hotmail.com Tue Dec 7 10:44:40 2010
From: ej_seg at hotmail.com (Rikard Dahlberg)
Date: Tue, 7 Dec 2010 09:44:40 +0000
Subject: Monitor disk via NSCA
Message-ID:

Hey all!

I want to thank you all for the lovely help I got with my previous
question. It was NSCA that was misconfigured on one line, or more
importantly, one complete line was gone :)

Now the NSCA passive checks work flawlessly, almost anyway. I can monitor
CPU, memory and services; the only thing I'm getting problems with is
hard-drive monitoring. These are the commands I've chosen, but the disk
command doesn't write anything out in nagios. Below is a sample from the
nagios .cfg file as well.

From what I've read, nagios treats passive check results just like the
results of an active check, so I believe I need a service defined for
every host, as I would for an active check?

Any idea what I've done wrong in the hard-drive config?

From NSClient:

[NSCA Commands]
CPU Load=alias_cpu
host_check=check_ok
Memory Usage=alias_mem
Uptime=alias_up
Drive space=alias_disk
Service check=alias_service

[External Alias]
alias_cpu=checkCPU warn=80 crit=90 time=5m time=1m time=30s
alias_cpu_ex=checkCPU warn=$ARG1$ crit=$ARG2$ time=5m time=1m time=30s
alias_disk=CheckDriveSize MinWarn=10% MinCrit=5% CheckAll FilterType=FIXED
alias_service=checkServiceState CheckAll
alias_process=checkProcState $ARG1$=started
alias_mem=checkMem MaxWarn=80% MaxCrit=90% ShowAll type=physical
alias_up=checkUpTime MinWarn=1d MinWarn=1h
alias_file_age=checkFile2 filter=out "file=$ARG1$" filter-written=>1d MaxWarn=1 MaxCrit=1 "syntax=%filename% %write%"
alias_file_size=checkFile2 filter=out "file=$ARG1$" filter-size=>$ARG2$ MaxWarn=1 MaxCrit=1 "syntax=%filename% %size%"
alias_file_size_in_dir=checkFile2 filter=out pattern=*.txt "file=$ARG1$" filter-size=>$ARG2$ MaxWarn=1 MaxCrit=1 "syntax=%filename% %size%"
alias_event_log_old=CheckEventLog file=application file=system filter=new filter=out MaxWarn=1 MaxCrit=1 filter-generated=>2d filter-severity==success filter-severity==informational truncate=800 unique descriptions "syntax=%severity%: %source%: %message% (%count%)"
alias_event_log_new=CheckEventLog file=application file=system MaxWarn=1 MaxCrit=1 "filter=generated gt -2d AND severity NOT IN ('success', 'informational')" truncate=800 unique descriptions "syntax=%severity%: %source%: %message% (%count%)"
alias_event_log=alias_event_log_new

From the host.cfg file from nagios:

define service{
        use                     generic-service
        host_name               ILSERVER
        service_description     C:\ Drive Space
        check_command           check_nt!USEDDISKSPACE!-l c -w 80 -c 90
        }

-------------- next part -------------- An HTML attachment was scrubbed...
URL: -------------- next part -------------- ------------------------------------------------------------------------------ What happens now with your Lotus Notes apps - do you make another costly upgrade, or settle for being marooned without product support? Time to move off Lotus Notes and onto the cloud with Force.com, apps are easier to build, use, and manage than apps on traditional platforms. Sign up for the Lotus Notes Migration Kit to learn more. http://p.sf.net/sfu/salesforce-d2d -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From AHKAPLAN at PARTNERS.ORG Tue Dec 7 14:49:02 2010 From: AHKAPLAN at PARTNERS.ORG (Kaplan, Andrew H.) Date: Tue, 7 Dec 2010 08:49:02 -0500 Subject: Determining what is causing a high load reportedby check_load plugin In-Reply-To: <31B0FE0A1A8166409E9DF35C6DEECB2405332E6B@WPSCV6MM.OPR.STATEFARM.ORG> References: <31B0FE0A1A8166409E9DF35C6DEECB2405332E6B@WPSCV6MM.OPR.STATEFARM.ORG> Message-ID: Hi there -- The load values that are displayed in top match those for the check_load plugin. This is the case whether the plugin is run either automatically or interactively. The output for the uptime command is shown below: 8:48am up 153 days, 23:21, 1 user, load average: 73.36, 73.29, 73.21 ________________________________ From: Daniel Wittenberg [mailto:daniel.wittenberg.r0ko at statefarm.com] Sent: Monday, December 06, 2010 4:40 PM To: Nagios Users List Subject: Re: [Nagios-users] Determining what is causing a high load reportedby check_load plugin In top, does it show the same load values? The status of your memory shouldn't cause the nagios plugin to report high cpu. What does the uptime command say? 
Try running the check_load script by hand on that host and verify it returns the same results.

Dan

From: Marc Powell [mailto:lists at xodus.org]
Sent: Monday, December 06, 2010 3:26 PM
To: Nagios Users List
Subject: Re: [Nagios-users] Determining what is causing a high load reported by check_load plugin

On Mon, Dec 6, 2010 at 1:50 PM, Kaplan, Andrew H. wrote:

Hi there --

We are running Nagios 3.1.2 server, and the client that is the subject of this e-mail is running version 2.6 of the nrpe client.

The check_load plugin, version 1.4, is indicating the past three readings are the following:

load average: 71.00, 71.00, 70.95 CRITICAL

The critical threshold of the plugin has been set to the 30, 25, 20 settings.

When I checked the client in question, the first thing I did was to run the top command. The results are shown below:

CPU0 states:  0.0% user,  0.0% system,  0.0% nice, 100.0% idle
CPU1 states:  0.0% user,  0.0% system,  0.0% nice, 100.0% idle
CPU2 states:  1.0% user,  4.0% system,  0.0% nice,  93.0% idle
Mem:  2064324K av, 2032308K used,   32016K free,       0K shrd,  509924K buff
Swap: 2096472K av,   21432K used, 2075040K free                 1035592K cached

The one thing that I noticed was the amount of free memory was at thirty-two megabytes. I wanted to know if that was what was causing the critical status to occur, or if there is something(s) else that I should investigate.

Memory is not a factor in the load calculation, only the number of processes running or waiting to run. For at least 15 minutes you had approximately 71 processes either running or ready to run and waiting on CPU resources. Running top/ps was the right thing to do but you really need to do it when the problem is occurring to see what's actually using all the CPU resources. There are far too many reasons why load could be high but it should be easy for someone familiar with your system to figure it out (at least generally) while in-the-act.
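
To see which processes stand behind a load figure like the one above, it can help to list the tasks the kernel counts toward it. What follows is a minimal sketch and not part of the original exchange. It assumes a Linux host with a procps-style ps, where the load average counts runnable tasks (state R) and also tasks in uninterruptible sleep (state D, typically blocked on disk or NFS I/O), which is one way load can be high while the CPUs sit idle. The function names are made up for illustration.

```shell
#!/bin/sh
# Sketch: show the load averages and the processes that count toward them.
# Assumption: Linux, where states R (runnable) and D (uninterruptible sleep)
# both contribute to the load average.

# Reads "STATE PID COMMAND" lines on stdin, prints only the R and D ones.
list_load_contributors() {
    awk '$1 ~ /^[RD]/'
}

# Reads the same input, prints how many lines are in state R or D.
count_load_contributors() {
    awk '$1 ~ /^[RD]/ { n++ } END { print n + 0 }'
}

# Print the kernel's own figures when /proc is available.
if [ -r /proc/loadavg ]; then
    cat /proc/loadavg
fi

# Separate -o options avoid the ambiguity of "-o a=,b=" on some ps builds.
if command -v ps >/dev/null 2>&1; then
    ps -e -o state= -o pid= -o comm= | list_load_contributors
fi
```

Run it while the load is actually high. A long list of D-state processes would point at stuck I/O rather than at CPU usage.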
--
Marc

The information in this e-mail is intended only for the person to whom it is addressed. If you believe this e-mail was sent to you in error and the e-mail contains patient information, please contact the Partners Compliance HelpLine at http://www.partners.org/complianceline . If the e-mail was sent to you in error but does not contain patient information, please contact the sender and properly dispose of the e-mail.

From ae at op5.se Tue Dec 7 15:43:48 2010
From: ae at op5.se (Andreas Ericsson)
Date: Tue, 07 Dec 2010 15:43:48 +0100
Subject: high latency
In-Reply-To:
References: <31B0FE0A1A8166409E9DF35C6DEECB24052956DC@WPSCV6MM.OPR.STATEFARM.ORG><4CF68510.6090400@flatto.net><31B0FE0A1A8166409E9DF35C6DEECB24052E45E5@WPSCV6MM.OPR.STATEFARM.ORG>
	<5d744ca63ee1f981e90502f67339c5b6.squirrel@webmail.stinkweasel.net>
	<31B0FE0A1A8166409E9DF35C6DEECB24052E4705@WPSCV6MM.OPR.STATEFARM.ORG>
	<4CF7D4FE.7090706@op5.se>
	<31B0FE0A1A8166409E9DF35C6DEECB24052E4794@WPSCV6MM.OPR.STATEFARM.ORG>
	<4CF8D2E0.1060101@op5.se>
	<4CFCC3B0.6050706@op5.se>
Message-ID: <4CFE4824.8000102@op5.se>

On 12/06/2010 09:12 PM, Frost, Mark {PBC} wrote:
>
>> -----Original Message-----
>> From: Andreas Ericsson [mailto:ae at op5.se]
>> Sent: Monday, December 06, 2010 6:06 AM
>> To: Nagios Users List
>> Cc: Frost, Mark {PBC}
>> Subject: Re: [Nagios-users] high latency
>>
>> On 12/03/2010 08:14 PM, Frost, Mark {PBC} wrote:
>>>
>>> I too struggle with them and I'm running on lightly-loaded physical hardware.
>>> We have 2 servers doing the checks sending back to a central server. Both
>>> distributed nodes use ocsp/ochp, but they do nothing more than append results
>>> to a file (i.e. it exits quickly). Results are handled outside of Nagios.
>>>
>>
>> Try getting rid of the oc[sh]p commands and use Merlin or google for "pnsca" or
>> "persistent nsca". There's one available from op5's repositories that may or may
>> not work, and there's one from somewhere else that they're apparently using to
>> great effect.
>>
>> Even if it exits quickly, it's still executed serially, so checking halts a
>> small period of time for each and every check that runs.
>
> Hmm. So then I'd be so curious why the 2 distservers which are both using
> oc[sh]p commands the same way have such radically different latencies.
>

Agreed. There must be other differences too.
Perhaps there's trouble resolving from one of the nodes? That usually makes checks run a helluva lot longer than they normally have to.

> Either way, you're suggesting that having a NEB module handle the
> post-check work will eliminate the serialization.
>

Yes. Sneaking a peek at what's needed in order for an event to get sent to master via an eventbroker compared to running an oc[sh]p command renders this, more or less:

broker module (nagios halts while this happens):
  Run a chain of 3-4 functions (increasing/decreasing stack size, pushing
  and popping registers etc).
  Copy 500-1000 bytes of memory from the process to the kernel.

OC?P command:
  fork() nagios, copying the complete stack and generating page tables for
  the heap (usually 1-2M).
  possibly fork() again, redoing the last step, unless large_install_...
  execve() the shell, loading a 4M binary and all its linked dependencies
  from disk. The kernel wipes the pages used by the fork()'ed and doubly
  fork()'ed Nagios and sets up new stack and heap tables for the shell.
  shell parses command-line (this is quite quick though)
  shell execve()'s the command you set as oc?p command, possibly searching
  through all files in all directories in your $PATH (which will be hot in
  the cache, but still), causing the kernel to once again destroy and set
  up all the memory tables.
  The command opens a file and puts the testresult there, issuing an fsync()
  and thus waiting for data to actually hit the disk before returning.
  The command exits, causing the kernel to destroy its allocated memory.
  Nagios reaps the command and moves on.

In terms of effort, the difference is sort of like either hopping on one leg along the entire Great Wall of China or walking to the kitchen and grabbing a beer.
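
The cost gap described above can be felt with a rough shell analogy. This is only a sketch and not Nagios code: a shell builtin stands in for the in-process broker call, and an external `true` binary launched through `env` stands in for the fork() plus execve() path of an oc[sh]p command. The function names and the iteration count are arbitrary choices made for illustration.

```shell
#!/bin/sh
# Rough analogy only: N in-process calls versus N fork()+execve() calls.

# Runs the shell builtin ":" n times; no new process is created.
in_process() {
    n=$1
    i=0
    while [ "$i" -lt "$n" ]; do
        :
        i=$((i + 1))
    done
    echo "$i"
}

# Runs the external "true" program n times; "env" forces a real process
# launch even in shells where "true" is a builtin.
fork_exec() {
    n=$1
    i=0
    while [ "$i" -lt "$n" ]; do
        env true
        i=$((i + 1))
    done
    echo "$i"
}

echo "in-process calls done: $(in_process 200)"
echo "fork+exec calls done:  $(fork_exec 200)"
```

Timing each function separately with a larger count, for example under the shell's `time`, should show the second loop taking far longer. The exact ratio depends on the machine and says nothing precise about Nagios itself.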
>>> What's odd is that distserver 1 and distserver 2 are configured the same
>>>
>>> distserver1:
>>> Hosts Checked 675
>>> Services Checked: 4179
>>> Active Service Latency: 0.000 / 3.155 / 0.382 sec
>>> Active Service Execution Time: 0.000 / 60.038 / 0.145 sec
>>>
>>> distserver2:
>>> Hosts Checked: 261
>>> Services Checked: 4289
>>> Active Service Latency: 0.000 / 169.977 / 81.300 sec
>>> Active Service Execution Time: 0.000 / 15.270 / 0.211 sec
>>>
>>> yet as you can see, distserver2's latency is much higher and always has been.
>>> I tried turning off EPN yesterday on distserver2 and it had no discernable effect.
>>> We added 400 new service checks yesterday on distserver2 (just more of the same
>>> checks we already do but on 26 new hosts) and the latency went from 35 to over 80.
>>>
>>
>> What kind of checks are you running? Some plugins draw a lot of cpu.
>> Are any of the checks set to run in serial (grep for parallelize_check in your
>> objects.cache file).
>
> parallelize_check is set to 1 everywhere.
>

Does one server have a lot of random service failures? On-demand hostchecks are still run in parallel.

> Most things are NRPE checks (also NRPE to NSClient++). Some are locally
> running perl scripts and others are locally running things like check_http.
> Shouldn't be all that much work for it though.
>
>> What version of Nagios are you running?
>>
>
> 3.2.1
>

I take it upgrading makes no difference?

>>> The checks we do are very different (Windows, Linux, Unix, many are app-centric) so
>>> it's difficult to compare exactly what runs on distserver1 and distserver2, but given
>>> the jump that was taken yesterday, I'm wondering if the fact that the type of checks
>>> on these new hosts are all built on dependencies make me wonder if that doesn't
>>> have something to do with it. These hosts (Windows) have a basic check for NRPE
>>> and all other checks on the host are dependent on the NRPE check succeeding.
>>>
>>> I have to move to all new Nagios servers very soon. I'm interested in Merlin, but
>>> given its non-production nature just yet, I'm hesitant to commit and I'm not sure if
>>> it will help me here.
>>>
>> It's been running at our 400+ customers with very few problems for the past month.
>> 0.9.1, released just yesterday, solves the known issues our customers have
>> encountered. You might want to take a look at it again. There are some issues on
>> FreeBSD though (was that you reporting them?). I just recently got a new laptop
>> with better support for running virtual systems, so I'm downloading a FreeBSD 8.1
>> install dvd as we speak. Hopefully I'll have those issues sorted out before the
>> end of the week.
>>
>> --
>> Andreas Ericsson andreas.ericsson at op5.se
>
> Thanks, Andreas. I'm hoping to allocate sufficient resources on the new servers
> to be able to play with Merlin more there.

It's quite resource-friendly actually. Well, compared to what you're running now it's positively feather-light.

> Will I be able to have the performance
> data from a poller be sent up to a NOC for digestion by pnp4nagios?

Yes, but you'll need the threadsafe version of Nagios you can obtain from either CVS or git://git.op5.org/nagios.git for performance-data to work. Actually, you need that for Merlin to work.

> It may have
> been a long time ago, but I thought I remember seeing that performance data was
> not yet implemented.
>

That was then. This is now :)

> No we'd be using some flavor of SLES.
>

Should work marvellously then.

--
Andreas Ericsson andreas.ericsson at op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace.
From hvdkooij at vanderkooij.org Tue Dec 7 16:10:37 2010
From: hvdkooij at vanderkooij.org (Hugo van der Kooij)
Date: Tue, 07 Dec 2010 17:10:37 +0200
Subject: Multiple parents in map
Message-ID: <9baaff7c0a1bfc1267704c79704a1158@vps517.directvps.nl>

Hi,

I have been digging through the archives but could not find a solution. In my view Nagios 3.2.3 is not showing nodes correctly in the map view.

I have 2 firewall nodes (FW1 and FW2) for the customer that I can check on a special TCP port. They are 2 cluster members at different physical locations.

Then I have 2 SMTP servers (SMTP1 and SMTP2) behind them. They are also distributed over both locations.

SMTP1 has the parents FW1 and FW2, and SMTP2 has the parents FW2 and FW1. (The list order is important.)

On the map both SMTP servers are behind FW2 and there are 2 blank spots behind FW1 in the Circular (Marked Up) map.

Will this be fixed in a future release?

Hugo.

--
hvdkooij at vanderkooij.org http://hugo.vanderkooij.org/
PGP/GPG? Use: http://hugo.vanderkooij.org/0x58F19981.asc

From daniel.wittenberg.r0ko at statefarm.com Tue Dec 7 15:11:02 2010
From: daniel.wittenberg.r0ko at statefarm.com (Daniel Wittenberg)
Date: Tue, 7 Dec 2010 07:11:02 -0700
Subject: Determining what is causing a high loadreportedby check_load plugin
In-Reply-To:
References: <31B0FE0A1A8166409E9DF35C6DEECB2405332E6B@WPSCV6MM.OPR.STATEFARM.ORG>
Message-ID: <31B0FE0A1A8166409E9DF35C6DEECB24053330F2@WPSCV6MM.OPR.STATEFARM.ORG>

So what are the first few processes listed in top? That should be what is causing your load then.

Dan
From AHKAPLAN at PARTNERS.ORG Tue Dec 7 16:25:49 2010
From: AHKAPLAN at PARTNERS.ORG (Kaplan, Andrew H.)
Date: Tue, 7 Dec 2010 10:25:49 -0500
Subject: Determining what is causing a highloadreportedby check_load plugin
In-Reply-To: <31B0FE0A1A8166409E9DF35C6DEECB24053330F2@WPSCV6MM.OPR.STATEFARM.ORG>
References: <31B0FE0A1A8166409E9DF35C6DEECB2405332E6B@WPSCV6MM.OPR.STATEFARM.ORG>
	<31B0FE0A1A8166409E9DF35C6DEECB24053330F2@WPSCV6MM.OPR.STATEFARM.ORG>
Message-ID:

Hi there --

The output shown below shows the top processes on the server:

439 processes: 438 sleeping, 1 running, 0 zombie, 0 stopped
CPU0 states: 19.0% user,  9.4% system,  0.0% nice, 71.0% idle
CPU1 states: 20.1% user, 13.0% system,  0.0% nice, 66.3% idle
CPU2 states: 27.1% user, 17.3% system,  0.0% nice, 55.0% idle
Mem:  2064324K av, 2013820K used,   50504K free,       0K shrd,  487764K buff
Swap: 2096472K av,   12436K used, 2084036K free                  976244K cached

  PID USER     PRI  NI  SIZE  RSS SHARE STAT %CPU %MEM   TIME COMMAND
 2398 root      15   0  1280 1280   824 R     1.9  0.0   0:00 top
 5648 root      22   0  1196 1196  1104 S     1.3  0.0   0:00 ASMProServer
    1 root      15   0   488  484   448 S     0.0  0.0   2:28 init
    2 root      0K   0     0    0     0 SW    0.0  0.0   0:00 migration_CPU0
    3 root      0K   0     0    0     0 SW    0.0  0.0   0:00 migration_CPU1
    4 root      0K   0     0    0     0 SW    0.0  0.0   0:00 migration_CPU2
    5 root      15   0     0    0     0 SW    0.0  0.0   0:03 keventd
    6 root      34  19     0    0     0 SWN   0.0  0.0  17:52 ksoftirqd_CPU0
    7 root      34  19     0    0     0 SWN   0.0  0.0  16:39 ksoftirqd_CPU1
    8 root      34  19     0    0     0 SWN   0.0  0.0  17:33 ksoftirqd_CPU2
    9 root      15   0     0    0     0 SW    0.0  0.0  28:22 kswapd
   10 root      15   0     0    0     0 SW    0.0  0.0  42:39 bdflush
   11 root      15   0     0    0     0 SW    0.0  0.0   3:08 kupdated
   12 root      25   0     0    0     0 SW    0.0  0.0   0:00 mdrecoveryd
   18 root      16   0     0    0     0 SW    0.0  0.0   0:00 scsi_eh_0
   21 root      15   0     0    0     0 SW    0.0  0.0   4:38 kjournald
  101 root      15   0     0    0     0 SW    0.0  0.0   0:00 khubd
  265 root      15   0     0    0     0 SW    0.0  0.0   0:03 kjournald
  266 root      15   0     0    0     0 SW    0.0  0.0   3:43 kjournald
  267 root      15   0     0    0     0 SW    0.0  0.0   0:04 kjournald
  268 root      15   0     0    0     0 SW    0.0  0.0   0:01 kjournald
  269 root      15   0     0    0     0 SW    0.0  0.0   0:11 kjournald
  270 root      15   0     0    0     0 SW    0.0  0.0   4:34 kjournald
  271 root      15   0     0    0     0 SW    0.0  0.0   4:28 kjournald
  272 root      15   0     0    0     0 SW    0.0  0.0   0:08 kjournald
  273 root      15   0     0    0     0 SW    0.0  0.0   0:14 kjournald
  274 root      15   0     0    0     0 SW    0.0  0.0   0:07 kjournald
  275 root      15   0     0    0     0 SW    0.0  0.0   1:14 kjournald
  805 root      15   0   588  576   532 S     0.0  0.0   1:39 syslogd
  810 root      15   0   448  432   432 S     0.0  0.0   0:00 klogd
  830 rpc       15   0   596  572   508 S     0.0  0.0   0:04 portmap
  858 rpcuser   19   0   708  608   608 S     0.0  0.0   0:00 rpc.statd
  970 root      15   0     0    0     0 SW    0.0  0.0   0:21 rpciod
  971 root      15   0     0    0     0 SW    0.0  0.0   0:00 lockd
  999 ntp       15   0  1812 1812  1732 S     0.0  0.0   5:04 ntpd
 1022 root      15   0   772  720   632 S     0.0  0.0   0:00 ypbind
 1024 root      15   0   772  720   632 S     0.0  0.0   1:16 ypbind

What caught my eye was the number of processes along with the number of sleeping processes. I tried running the kill command on the kjournald instances, but that did not appear to stop them.

Aside from rebooting the server, which can be done if necessary, what other approach can I try?
From rick.mangus+nagios at gmail.com Tue Dec 7 16:48:49 2010
From: rick.mangus+nagios at gmail.com (Rick Mangus)
Date: Tue, 7 Dec 2010 09:48:49 -0600
Subject: Determining what is causing a highloadreportedby check_load plugin
In-Reply-To:
References: <31B0FE0A1A8166409E9DF35C6DEECB2405332E6B@WPSCV6MM.OPR.STATEFARM.ORG>
	<31B0FE0A1A8166409E9DF35C6DEECB24053330F2@WPSCV6MM.OPR.STATEFARM.ORG>
Message-ID:

Kjournald is needed for journalling on ext3 filesystems. Be glad you didn't manage to kill them.

To find something that is running many many instances, try this:

"ps -ax -o cmd | sort | uniq -c | sort -n"

The output will be like so:

      3 [kjournald]
      3 [sh]
      5 -bash
      7 crond

The column on the left is the number of processes with that command line. I occasionally have 10,000 instances of nsca that simply need to be killed.

Do let us know what you find!

--Rick
From ihab24 at hotmail.com Tue Dec 7 17:09:45 2010
From: ihab24 at hotmail.com (Ihab Samara)
Date: Tue, 7 Dec 2010 18:09:45 +0200
Subject: NRPE
In-Reply-To:
References: <31B0FE0A1A8166409E9DF35C6DEECB2405332E6B@WPSCV6MM.OPR.STATEFARM.ORG>, <31B0FE0A1A8166409E9DF35C6DEECB24053330F2@WPSCV6MM.OPR.STATEFARM.ORG>,
Message-ID:

Hi List

I have a problem using check_nrpe:

The script on the remote machine is running this command:

#/bin/bash
output=`sudo /usr/sbin/lsof -X |grep tomcat| wc -l`
max=$1
if [ "$output" -lt "$max" ]; then
    echo "OK |value is "$output""
    exit 0;
else
    echo "CRITICAL value is "$output""
    exit 2;
fi

I have this line in the sudoers file:

nagios ALL=(ALL) NOPASSWD: /usr/sbin/lsof

I have this in nrpe.cfg:

command[check_lsof]=/usr/local/nagios/libexec/check_lsof.sh 256

When I run the command as the nagios user on the remote machine:

[nagios at serv_1]$ /usr/local/nagios/libexec/check_lsof.sh 256
OK |value is 132

When I run it from the Nagios server (remotely):

[root at healthy libexec]# ./check_nrpe -H 10.1.1.1 -c check_lsof
OK |value is 0

I've set the user in /etc/xinetd.d/nrpe to "nagios", and all the other checks are working fine.

Any thoughts?

From lomiz.mail at gmail.com Tue Dec 7 17:24:32 2010
From: lomiz.mail at gmail.com (Enrico Zimol)
Date: Tue, 7 Dec 2010 17:24:32 +0100
Subject: NRPE
In-Reply-To:
References: <31B0FE0A1A8166409E9DF35C6DEECB2405332E6B@WPSCV6MM.OPR.STATEFARM.ORG> <31B0FE0A1A8166409E9DF35C6DEECB24053330F2@WPSCV6MM.OPR.STATEFARM.ORG>
Message-ID:

>command[check_lsof]=sudo /usr/local/nagios/libexec/check_lsof.sh 256

Try this, with sudo

--
Enrico "lomiz" Zimol
http://www.lomiz.it

From mark.elsen at gmail.com Tue Dec 7 17:30:51 2010
From: mark.elsen at gmail.com (Mark Elsen)
Date: Tue, 7 Dec 2010 17:30:51 +0100
Subject: NRPE
In-Reply-To:
References: <31B0FE0A1A8166409E9DF35C6DEECB2405332E6B@WPSCV6MM.OPR.STATEFARM.ORG> <31B0FE0A1A8166409E9DF35C6DEECB24053330F2@WPSCV6MM.OPR.STATEFARM.ORG>
Message-ID:

> When I run it from the Nagios server (remotely):
> [root at healthy libexec]# ./check_nrpe -H 10.1.1.1 -c check_lsof
> OK |value is 0
>
> Ive set the user in /etc/xinetd.d/nrpe to "nagios", and all the other checks
> are working fine.
>
> Any thoughts?
>

- Search for 'sudo' in nrpe.cfg and make sure the sudo prefix is set, not commented out.
- Look for 'tty' in the sudoers file and make sure 'requiretty' is commented out.

M.
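
A note on the symptom in this thread: the posted script counts lines after the pipeline, so if sudo refuses to run (for instance under 'requiretty', since NRPE starts commands without a terminal) the count is simply 0 and the check still says OK. The following is only a sketch of one way to surface that failure, not what any poster ran. The function name check_lsof, the LSOF_CMD override and the open_files perfdata label are made up for illustration; -n is sudo's non-interactive flag, in sudo versions that have it.

```shell
#!/bin/bash
# Sketch: report UNKNOWN when sudo/lsof cannot run, instead of counting 0 lines.
# LSOF_CMD is a hypothetical override; the default mirrors the command from the thread.

check_lsof() {
    local max=$1
    local listing count

    # Run lsof on its own first, so its exit status is not masked by grep/wc.
    # Whether lsof exits non-zero for every failure mode is an assumption.
    if ! listing=$(${LSOF_CMD:-sudo -n /usr/sbin/lsof -X} 2>&1); then
        echo "UNKNOWN - could not run lsof: ${listing}"
        return 3
    fi

    # Count the lines mentioning tomcat, as the original script did.
    count=$(printf '%s\n' "$listing" | grep -c tomcat)

    if [ "$count" -lt "$max" ]; then
        echo "OK - ${count} open files | open_files=${count}"
        return 0
    fi
    echo "CRITICAL - ${count} open files | open_files=${count}"
    return 2
}
```

Saved as a plugin, the file would end with something like `check_lsof "$1"; exit $?`, and nrpe.cfg would keep passing the threshold (256 in the thread) as the first argument.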

From mark.frost1 at pepsico.com Tue Dec 7 20:20:59 2010
From: mark.frost1 at pepsico.com (Frost, Mark {PBC})
Date: Tue, 7 Dec 2010 14:20:59 -0500
Subject: high latency
In-Reply-To: <4CFE4824.8000102@op5.se>
References: <31B0FE0A1A8166409E9DF35C6DEECB24052956DC@WPSCV6MM.OPR.STATEFARM.ORG><4CF68510.6090400@flatto.net><31B0FE0A1A8166409E9DF35C6DEECB24052E45E5@WPSCV6MM.OPR.STATEFARM.ORG> <5d744ca63ee1f981e90502f67339c5b6.squirrel@webmail.stinkweasel.net> <31B0FE0A1A8166409E9DF35C6DEECB24052E4705@WPSCV6MM.OPR.STATEFARM.ORG> <4CF7D4FE.7090706@op5.se> <31B0FE0A1A8166409E9DF35C6DEECB24052E4794@WPSCV6MM.OPR.STATEFARM.ORG> <4CF8D2E0.1060101@op5.se> <4CFCC3B0.6050706@op5.se> <4CFE4824.8000102@op5.se>
Message-ID:

> -----Original Message-----
> From: Andreas Ericsson [mailto:ae at op5.se]
> Sent: Tuesday, December 07, 2010 9:44 AM
>
> > Hmm. So then I'd be so curious why the 2 distservers which are both using
> > oc[sh]p commands the same way have such radically different latencies.
> >
>
> Agreed. There must be other differences too. Perhaps there's trouble resolving
> from one of the nodes? That usually makes checks run a helluva lot longer than
> they normally have to.

I had another look. While I found a test host that I'd made that was deliberately unreachable, I found that when I removed it it made no difference. Execution times are significantly lower (min/max/avg) on the host with the high latencies than for the one with low latencies. I don't see any unresolvable hosts or, now, any unreachable hosts. Puzzling.

I've always wished there was an easy way to see which processes had high latencies from the web interface without having to view the status.dat file...

> > Either way, you're suggesting that having a NEB module handle the
> > post-check work will eliminate the serialization.
>
> Yes. Sneaking a peek at what's needed in order for an event to get sent to
> master via an eventbroker compared to running an oc[sh]p command renders
> this, more or less:
> [ good stuff snipped...]

Wow.

> In terms of effort, the difference is sort of like either hopping on one
> leg along the entire great wall of china or walking to the kitchen and grab
> a beer.
>
> > parallelize_check is set to 1 everywhere.
>
> Does one server have a lot of random service failures? On-demand hostchecks are
> still run in parallel.

I don't think so. Intermittent you mean? Not as far as I know or can see.

> > > What version of Nagios are you running?
> >
> > 3.2.1
>
> I take it upgrading makes no difference?

To 3.2.3? I'll probably try that on the new servers, but if things work out I may just move to Merlin + 3.2.4. I wasn't sure I saw anything in the 3.2.3 release that I found compelling for us at the time. As I say, this system now has fairly high visibility so just trying something like that would involve a rather painful internal change process. It's like piloting the QE2 -- I can't change course very quickly :-)

> > Thanks, Andreas. I'm hoping to allocate sufficient resources on the new servers
> > to be able to play with Merlin more there.
>
> It's quite resource-friendly actually. Well, compared to what you're running now
> it's positively feather-light.

I meant more like installing MySQL everywhere, building filesystems to hold the MySQL data, etc. Not so much like I need more memory or more CPUs. I don't remember seeing anything in the Merlin docs (maybe I missed it), but how large would the MySQL database need to be? Pretty small on each box, right? Like 500MB or less?

> > Will I be able to have the performance
> > data from a poller be sent up to a NOC for digestion by pnp4nagios?
>
> Yes, but you'll need the threadsafe version of Nagios you can obtain from either
> CVS or git://git.op5.org/nagios.git for performance-data to work. Actually, you
> need that for Merlin to work.

That's part of the plan. Any chance that the OP5 site will eventually be configured to allow git through a proxy? It's less convenient to use snapshot tarballs, but still workable, of course.

> > It may have
> > been a long time ago, but I thought I remember seeing that performance data was
> > not yet implemented.
> >
>
> That was then. This is now :)

Spifftacular!

> > No we'd be using some flavor of SLES.
> >
>
> Should work marvellously then.

Thanks as always for your help, Andreas.

Mark
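
Short of a web view, the per-check latency figures can be pulled out of status.dat with a small filter. This is a minimal sketch, assuming the Nagios 3 status.dat layout of 'hoststatus {' and 'servicestatus {' blocks that contain host_name=, service_description= and check_latency= lines. The function name top_latency is made up, and the location of status.dat depends on the status_file setting in nagios.cfg.

```shell
#!/bin/sh
# Sketch: list the checks with the highest check_latency found in status.dat.
# Usage: top_latency /path/to/status.dat [rows]

top_latency() {
    awk '
        # A host or service status block starts: reset the per-block fields.
        /^[ \t]*(host|service)status[ \t]*[{]/ {
            inblk = 1; host = ""; svc = "(host check)"; lat = ""; next
        }
        inblk && /^[ \t]*host_name=/           { sub(/^[ \t]*host_name=/, ""); host = $0; next }
        inblk && /^[ \t]*service_description=/ { sub(/^[ \t]*service_description=/, ""); svc = $0; next }
        inblk && /^[ \t]*check_latency=/       { sub(/^[ \t]*check_latency=/, ""); lat = $0; next }
        # End of block: emit latency, host and service as tab-separated fields.
        inblk && /^[ \t]*[}]/ {
            if (lat != "") printf "%s\t%s\t%s\n", lat, host, svc
            inblk = 0
        }
    ' "$1" | sort -t "$(printf '\t')" -k1,1nr | head -n "${2:-10}"
}
```

For example, `top_latency /usr/local/nagios/var/status.dat 20` would print the twenty slowest-to-start checks, assuming the default path of a source install.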

From daniel.wittenberg.r0ko at statefarm.com Tue Dec 7 20:35:02 2010
From: daniel.wittenberg.r0ko at statefarm.com (Daniel Wittenberg)
Date: Tue, 7 Dec 2010 12:35:02 -0700
Subject: high latency
In-Reply-To:
References: <31B0FE0A1A8166409E9DF35C6DEECB24052956DC@WPSCV6MM.OPR.STATEFARM.ORG><4CF68510.6090400@flatto.net><31B0FE0A1A8166409E9DF35C6DEECB24052E45E5@WPSCV6MM.OPR.STATEFARM.ORG><5d744ca63ee1f981e90502f67339c5b6.squirrel@webmail.stinkweasel.net><31B0FE0A1A8166409E9DF35C6DEECB24052E4705@WPSCV6MM.OPR.STATEFARM.ORG><4CF7D4FE.7090706@op5.se><31B0FE0A1A8166409E9DF35C6DEECB24052E4794@WPSCV6MM.OPR.STATEFARM.ORG><4CF8D2E0.1060101@op5.se><4CFCC3B0.6050706@op5.se><4CFE4824.8000102@op5.se>
Message-ID: <31B0FE0A1A8166409E9DF35C6DEECB24053887D8@WPSCV6MM.OPR.STATEFARM.ORG>

To watch this in a 'top'-like perspective, I wrote a query against livestatus. We're running the livestatus NEB module on all our servers, plus Multisite, so that we can watch all of them.

I've been doing some interesting performance testing with 4/6/8-processor VMs, and will be testing a physical box too, to see how many hosts/services they can handle before either the CPU falls over or latency gets too bad. So far the CPU load gets too high (for my comfort, which is generally a load of 2x the number of processors) before latency gets bad.

Dan

-----Original Message-----
From: Frost, Mark {PBC} [mailto:mark.frost1 at pepsico.com]
Sent: Tuesday, December 07, 2010 1:21 PM
To: Andreas Ericsson
Cc: Nagios Users List
Subject: Re: [Nagios-users] high latency

I've always wished there was an easy way to see which processes had high latencies from the web interface without having to view the status.dat file...

From AHKAPLAN at PARTNERS.ORG Tue Dec 7 20:33:43 2010
From: AHKAPLAN at PARTNERS.ORG (Kaplan, Andrew H.)
Date: Tue, 7 Dec 2010 14:33:43 -0500
Subject: Determining what is causing a high load reported by check_load plugin
In-Reply-To:
References: <31B0FE0A1A8166409E9DF35C6DEECB2405332E6B@WPSCV6MM.OPR.STATEFARM.ORG><31B0FE0A1A8166409E9DF35C6DEECB24053330F2@WPSCV6MM.OPR.STATEFARM.ORG>
Message-ID:

Hi there --

I ran the command you suggested and redirected its output to a file. When I checked the file, I noticed a large number of updatedb and slocate instances, some of which had been running since August of this year. When I tried to kill those processes, I ran into the same problem that I encountered with the kjournald instances.

I did some further investigating, and it turns out that a high number of updatedb and slocate processes is symptomatic of a corrupted filesystem. Accordingly, I rebooted the server and had it run fsck on all filesystems. The server is now up, and I will monitor it for the next week to see if the problem returns.

________________________________
From: Rick Mangus [mailto:rick.mangus+nagios at gmail.com]
Sent: Tuesday, December 07, 2010 10:49 AM
To: Nagios Users List
Subject: Re: [Nagios-users] Determining what is causing a high load reported by check_load plugin

Kjournald is needed for journalling on ext3 filesystems. Be glad you didn't manage to kill them.
To find something that is running many many instances, try this:

"ps -ax -o cmd | sort | uniq -c | sort -n"

The output will be like so:

      3 [kjournald]
      3 [sh]
      5 -bash
      7 crond

The column on the left is the number of processes with that command line. I occasionally have 10,000 instances of nsca that simply need to be killed.

Do let us know what you find!

--Rick

On Tue, Dec 7, 2010 at 9:25 AM, Kaplan, Andrew H. wrote:

Hi there --

The output shown below shows the top processes on the server:

439 processes: 438 sleeping, 1 running, 0 zombie, 0 stopped
CPU0 states: 19.0% user,  9.4% system, 0.0% nice, 71.0% idle
CPU1 states: 20.1% user, 13.0% system, 0.0% nice, 66.3% idle
CPU2 states: 27.1% user, 17.3% system, 0.0% nice, 55.0% idle
Mem:  2064324K av, 2013820K used,   50504K free, 0K shrd, 487764K buff
Swap: 2096472K av,   12436K used, 2084036K free           976244K cached

  PID USER     PRI  NI  SIZE  RSS SHARE STAT %CPU %MEM   TIME COMMAND
 2398 root      15   0  1280 1280   824 R     1.9  0.0   0:00 top
 5648 root      22   0  1196 1196  1104 S     1.3  0.0   0:00 ASMProServer
    1 root      15   0   488  484   448 S     0.0  0.0   2:28 init
    2 root      0K   0     0    0     0 SW    0.0  0.0   0:00 migration_CPU0
    3 root      0K   0     0    0     0 SW    0.0  0.0   0:00 migration_CPU1
    4 root      0K   0     0    0     0 SW    0.0  0.0   0:00 migration_CPU2
    5 root      15   0     0    0     0 SW    0.0  0.0   0:03 keventd
    6 root      34  19     0    0     0 SWN   0.0  0.0  17:52 ksoftirqd_CPU0
    7 root      34  19     0    0     0 SWN   0.0  0.0  16:39 ksoftirqd_CPU1
    8 root      34  19     0    0     0 SWN   0.0  0.0  17:33 ksoftirqd_CPU2
    9 root      15   0     0    0     0 SW    0.0  0.0  28:22 kswapd
   10 root      15   0     0    0     0 SW    0.0  0.0  42:39 bdflush
   11 root      15   0     0    0     0 SW    0.0  0.0   3:08 kupdated
   12 root      25   0     0    0     0 SW    0.0  0.0   0:00 mdrecoveryd
   18 root      16   0     0    0     0 SW    0.0  0.0   0:00 scsi_eh_0
   21 root      15   0     0    0     0 SW    0.0  0.0   4:38 kjournald
  101 root      15   0     0    0     0 SW    0.0  0.0   0:00 khubd
  265 root      15   0     0    0     0 SW    0.0  0.0   0:03 kjournald
  266 root      15   0     0    0     0 SW    0.0  0.0   3:43 kjournald
  267 root      15   0     0    0     0 SW    0.0  0.0   0:04 kjournald
  268 root      15   0     0    0     0 SW    0.0  0.0   0:01 kjournald
  269 root      15   0     0    0     0 SW    0.0  0.0   0:11 kjournald
  270 root      15   0     0    0     0 SW    0.0  0.0   4:34 kjournald
  271 root      15   0     0    0     0 SW    0.0  0.0   4:28 kjournald
  272 root      15   0     0    0     0 SW    0.0  0.0   0:08 kjournald
  273 root      15   0     0    0     0 SW    0.0  0.0   0:14 kjournald
  274 root      15   0     0    0     0 SW    0.0  0.0   0:07 kjournald
  275 root      15   0     0    0     0 SW    0.0  0.0   1:14 kjournald
  805 root      15   0   588  576   532 S     0.0  0.0   1:39 syslogd
  810 root      15   0   448  432   432 S     0.0  0.0   0:00 klogd
  830 rpc       15   0   596  572   508 S     0.0  0.0   0:04 portmap
  858 rpcuser   19   0   708  608   608 S     0.0  0.0   0:00 rpc.statd
  970 root      15   0     0    0     0 SW    0.0  0.0   0:21 rpciod
  971 root      15   0     0    0     0 SW    0.0  0.0   0:00 lockd
  999 ntp       15   0  1812 1812  1732 S     0.0  0.0   5:04 ntpd
 1022 root      15   0   772  720   632 S     0.0  0.0   0:00 ypbind
 1024 root      15   0   772  720   632 S     0.0  0.0   1:16 ypbind

What caught my eye was the number of processes along with the number of sleeping processes. I tried running the kill command on the kjournald instances, but that did not appear to stop them. Aside from rebooting the server, which can be done if necessary, what other approach can I try?

________________________________
From: Daniel Wittenberg [mailto:daniel.wittenberg.r0ko at statefarm.com]
Sent: Tuesday, December 07, 2010 9:11 AM
To: Nagios Users List
Subject: Re: [Nagios-users] Determining what is causing a high load reported by check_load plugin

So what are the first few processes listed in top? That should be what is causing your load then.

Dan

From: Kaplan, Andrew H. [mailto:AHKAPLAN at PARTNERS.ORG]
Sent: Tuesday, December 07, 2010 7:49 AM
To: Nagios Users List
Subject: Re: [Nagios-users] Determining what is causing a high load reported by check_load plugin

Hi there --

The load values that are displayed in top match those for the check_load plugin. This is the case whether the plugin is run either automatically or interactively.
The output for the uptime command is shown below:

8:48am up 153 days, 23:21, 1 user, load average: 73.36, 73.29, 73.21

________________________________
From: Daniel Wittenberg [mailto:daniel.wittenberg.r0ko at statefarm.com]
Sent: Monday, December 06, 2010 4:40 PM
To: Nagios Users List
Subject: Re: [Nagios-users] Determining what is causing a high load reported by check_load plugin

In top, does it show the same load values? The status of your memory shouldn't cause the nagios plugin to report high cpu. What does the uptime command say? Try running the check_load script by hand on that host and verify it returns the same results.

Dan

From: Marc Powell [mailto:lists at xodus.org]
Sent: Monday, December 06, 2010 3:26 PM
To: Nagios Users List
Subject: Re: [Nagios-users] Determining what is causing a high load reported by check_load plugin

On Mon, Dec 6, 2010 at 1:50 PM, Kaplan, Andrew H. wrote:

Hi there --

We are running Nagios 3.1.2 server, and the client that is the subject of this e-mail is running version 2.6 of the nrpe client.

The check_load plugin, version 1.4, is indicating the past three readings are the following:

load average: 71.00, 71.00, 70.95 CRITICAL

The critical threshold of the plugin has been set to the 30, 25, 20 settings.

When I checked the client in question, the first thing I did was to run the top command. The results are shown below:

CPU0 states: 0.0% user, 0.0% system, 0.0% nice, 100.0% idle
CPU1 states: 0.0% user, 0.0% system, 0.0% nice, 100.0% idle
CPU2 states: 1.0% user, 4.0% system, 0.0% nice,  93.0% idle
Mem:  2064324K av, 2032308K used,   32016K free, 0K shrd, 509924K buff
Swap: 2096472K av,   21432K used, 2075040K free          1035592K cached

The one thing that I noticed was the amount of free memory was at thirty-two megabytes. I wanted to know if that was what was causing the critical status to occur, or if there is something(s) else that I should investigate.

Memory is not a factor in the load calculation, only the number of processes running or waiting to run. For at least 15 minutes you had approximately 71 processes either running or ready to run and waiting on CPU resources. Running top/ps was the right thing to do but you really need to do it when the problem is occurring to see what's actually using all the CPU resources. There are far too many reasons why load could be high but it should be easy for someone familiar with your system to figure it out (at least generally) while in-the-act.

--
Marc
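
A note on checking this directly: on Linux the load average counts tasks that are runnable (state R) and also tasks in uninterruptible sleep (state D, usually waiting on disk I/O), which is one way a host can report a load of 71 while its CPUs sit idle. Whether that is what happened on the server in this thread is not confirmed by the posts. The two filters below are a sketch only; the names count_states and load_contributors are made up, and both read ps-style lines whose first field is the process state.

```shell
#!/bin/sh
# Sketch: summarise ps output by process state.
# Input lines are expected to start with the state field (R, S, D, Z, ...),
# optionally followed by modifier characters such as "+" or "N".

# Print "<count> <state letter>" pairs, most frequent state first.
count_states() {
    awk '{ n[substr($1, 1, 1)]++ } END { for (s in n) print n[s], s }' | sort -rn
}

# Keep only processes in state R or D, the ones Linux counts towards load.
load_contributors() {
    awk 'substr($1, 1, 1) == "R" || substr($1, 1, 1) == "D"'
}
```

Typical use would be `ps -e -o stat= | count_states` for an overview and `ps -e -o stat= -o pid= -o args= | load_contributors` to see which commands are involved, run while the load is actually high.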

From forrie at gmail.com Tue Dec 7 22:41:06 2010
From: forrie at gmail.com (Forrest Aldrich)
Date: Tue, 07 Dec 2010 16:41:06 -0500
Subject: Problem with hostgroup members definition
Message-ID: <4CFEA9F2.9080100@gmail.com>

I have a new Nagios 3 installation where I am trying to include several hosts in a hostgroup definition (to be efficient).

The error I'm getting (from the pre-flight check):

Error: Could not expand members specified in hostgroup ...

At first, I was referring to hosts by their "alias", which is defined in the host definition, but it appears I cannot. Next, I thought the line was too long. There are a total of 27 hosts.

I want to use a hostgroup definition and bind a check-host-alive to that group -- seems pretty efficient. Is there a better way to accomplish this?

I had another person double-check my work and we compared it to another known-working config, where I believe aliases were being used.

No special configuration was done; I just used the example configs and commented out a bunch of stuff or defined my own objects.

At the moment, I have several hosts that belong to a production group:

define hostgroup{
        hostgroup_name  prod-servers       ; The name of the hostgroup
        alias           Production Servers ; Long name of the group
        members         host1,host2, ... (omitted here for brevity)
}

We have 2 or 3 groups of servers that we want to configure for simple check-host-alive work, for some development integration with Nagios - it doesn't have to be sophisticated at the moment. The "production" group is the largest.

I redacted the changes and put a FQHN (1) in there and it seemed to work. Is there a better way to associate multiple hosts with a hostgroup?

Any pointers would be appreciated.

From ae at op5.se Tue Dec 7 23:42:28 2010
From: ae at op5.se (Andreas Ericsson)
Date: Tue, 07 Dec 2010 23:42:28 +0100
Subject: Problem with hostgroup members definition
In-Reply-To: <4CFEA9F2.9080100@gmail.com>
References: <4CFEA9F2.9080100@gmail.com>
Message-ID: <4CFEB854.5020003@op5.se>

On 12/07/2010 10:41 PM, Forrest Aldrich wrote:
> I have a new Nagios 3 installation where I am trying to include several
> hosts in a hostgroup definition (to be efficient).
>
> The error I'm getting (from the pre-flight check):
>
> Error: Could not expand members specified in hostgroup ...
>
> At first, I was referring to hosts by their "alias" which is defined in
> the host definition, but it appears I cannot.. Next, I thought the
> line was too long. There are a total of 27 hosts.
>

If the line is longer than 64K, then it's too long. Unless your hostnames are very very very very VERY long I doubt that's the problem.

> I want use a hostgroup definition and bind a check-host-alive to that
> group -- seems pretty efficient. Is there a better way to accomplish this.
>

But you can't. Hostgroups can't be used to assign host-checks. Use host templates for that. Your check-host-alive command should be the default that (very nearly) all templates use, and those that for some reason deviate from the standard check-alive thing should have their commands set in the host object definition.

> I had another person double check my work and we compared it to another
> known-working config, where I believe aliases were being used.
>

They weren't. Aliases have never worked to assign hosts to hostgroups.

> No special configuration was done, I just used the example configs and
> commented out a bunch of stuff or just defined my own objects.
>
> At the moment, I have several hosts that belong to a production group:
>
>
> define hostgroup{
>         hostgroup_name  prod-servers       ; The name of the hostgroup
>         alias           Production Servers ; Long name of the group
>         members         host1,host2, ... (omitted here for brevity)
> }
>
>
> We have 2 or 3 groups of servers that we want to configure for simple
> check-host-alive work, for some development integration with Nagios - it
> doesn't have to be sophisticated at the moment. The "production" group
> is the largest.
>
> I redacted the changes and put a FQHN (1) in there and it seemed to
> work. Is there a better way to associate multiple hosts with a hostgroup?
>

There are two ways to assign hosts to hostgroups. Well, three actually, but one of them happens to be an extension of the original two, so it doesn't really count.

The first is to set the 'members' variable in the hostgroup itself and on that line specify all the named hosts you want to include. This has to be done using the host_name variable from the object itself. I'm not sure if the parser groks spaces surrounding the comma yet. It didn't a while back, causing confusion in the user-ranks.

The second is to list the hostgroups using the hostgroups directive in the host object definitions themselves. Then you must link them to the hostgroup_name variable from the hostgroup objects.

The third way, which is an extension of the second, is to set the hostgroups variable in a host template and use that template in all the hosts you want to be part of the hostgroup. That way you can get away with specifying the check-host-alive command only once, and then override it from each host object if you wish.

> Any pointers would be appreciated.
>

www.nagios.org and then click your way to the online documentation. You should probably have started there before mailing here tbh, but I'm in a good mood today so I refrained from biting your head off. You can thank me for that later if you like.

--
Andreas Ericsson                   andreas.ericsson at op5.se
OP5 AB                             www.op5.se
Tel: +46 8-230225                  Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.
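
The first and third approaches described in this reply might look roughly like the fragment below. It is a sketch, not a complete configuration: host1 and host2 come from the original post, while the template name prod-host, the host host3 and the address 192.0.2.13 are hypothetical, and other directives a real host definition needs (contacts, notification settings and so on) are left out.

```
# First way: name the members in the hostgroup, using each host's host_name
# (not its alias). No spaces around the commas, to be on the safe side.
define hostgroup{
        hostgroup_name  prod-servers
        alias           Production Servers
        members         host1,host2
        }

# Third way: a template that carries both the check command and the hostgroup.
define host{
        name                prod-host          ; hypothetical template name
        check_command       check-host-alive
        max_check_attempts  5
        hostgroups          prod-servers       ; refers to hostgroup_name above
        register            0                  ; template only, not a real host
        }

# A host that uses the template joins prod-servers without being listed
# in the hostgroup's members line.
define host{
        use                 prod-host
        host_name           host3              ; hypothetical host
        alias               Production host 3
        address             192.0.2.13
        }
```

With the template in place, a host that needs a different check would set its own check_command in its host definition, which overrides the value inherited from the template.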
http://p.sf.net/sfu/salesforce-d2d _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From daniel.wittenberg.r0ko at statefarm.com Tue Dec 7 23:01:53 2010 From: daniel.wittenberg.r0ko at statefarm.com (Daniel Wittenberg) Date: Tue, 7 Dec 2010 15:01:53 -0700 Subject: Problem with hostgroup members definition In-Reply-To: <4CFEA9F2.9080100@gmail.com> References: <4CFEA9F2.9080100@gmail.com> Message-ID: <31B0FE0A1A8166409E9DF35C6DEECB240538899F@WPSCV6MM.OPR.STATEFARM.ORG> You could do it the other way, and in the host definition assign it to a hostgroup, using the "hostgroups" directive, and give it a comma-separated list of groups? We did it that way so basically the host definition file is all self-contained. Dan -----Original Message----- From: Forrest Aldrich [mailto:forrie at gmail.com] Sent: Tuesday, December 07, 2010 3:41 PM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] Problem with hostgroup members definition I have a new Nagios 3 installation where I am trying to include several hosts in a hostgroup definition (to be efficient). The error I'm getting (from the pre-flight check): Error: Could not expand members specified in hostgroup ... At first, I was referring to hosts by their "alias" which is defined in the host definition, but it appears I cannot.. Next, I thought the line was too long. There are a total of 27 hosts. I want use a hostgroup definition and bind a check-host-alive to that group -- seems pretty efficient. Is there a better way to accomplish this. I had another person double check my work and we compared it to another known-working config, where I believe aliases were being used. 
No special configuration was done, I just used the example configs and
commented out a bunch of stuff or just defined my own objects.

At the moment, I have several hosts that belong to a production group:

define hostgroup{
        hostgroup_name  prod-servers ; The name of the hostgroup
        alias           Production Servers ; Long name of the group
        members         host1,host2, ... (omitted here for brevity)
        }

We have 2 or 3 groups of servers that we want to configure for simple
check-host-alive work, for some development integration with Nagios - it
doesn't have to be sophisticated at the moment. The "production" group is
the largest.

I redacted the changes and put a FQHN (1) in there and it seemed to work.

Is there a better way to associate multiple hosts with a hostgroup?

Any pointers would be appreciated.

From ae at op5.se Tue Dec 7 23:56:40 2010
From: ae at op5.se (Andreas Ericsson)
Date: Tue, 07 Dec 2010 23:56:40 +0100
Subject: high latency
In-Reply-To:
References: <31B0FE0A1A8166409E9DF35C6DEECB24052956DC@WPSCV6MM.OPR.STATEFARM.ORG><4CF68510.6090400@flatto.net><31B0FE0A1A8166409E9DF35C6DEECB24052E45E5@WPSCV6MM.OPR.STATEFARM.ORG> <5d744ca63ee1f981e90502f67339c5b6.squirrel@webmail.stinkweasel.net> <31B0FE0A1A8166409E9DF35C6DEECB24052E4705@WPSCV6MM.OPR.STATEFARM.ORG> <4CF7D4FE.7090706@op5.se> <31B0FE0A1A8166409E9DF35C6DEECB24052E4794@WPSCV6MM.OPR.STATEFARM.ORG> <4CF8D2E0.1060101@op5.se> <4CFCC3B0.6050706@op5.se> <4CFE4824.8000102@op5.se>
Message-ID: <4CFEBBA8.6070509@op5.se>

On 12/07/2010 08:20 PM, Frost, Mark {PBC} wrote:
>
>> -----Original Message-----
>> From: Andreas Ericsson [mailto:ae at op5.se]
>> Sent: Tuesday, December 07, 2010 9:44 AM
>>
>>> Hmm. So then I'd be so curious why the 2 distservers which are both using
>>> oc[sh]p commands the same way have such radically different latencies.
>>>
>>
>> Agreed. There must be other differences too. Perhaps there's trouble resolving
>> from one of the nodes? That usually makes checks run a helluva lot longer than
>> they normally have to.
>
> I had another look. While I found a test host that I'd made that was
> deliberately unreachable, I found that when I removed it it made no
> difference. Execution times are significantly lower (min/max/avg) on
> the host with the high latencies than for the one with low latencies.
> I don't see any unresolvable hosts or now, any unreachable hosts.
> Puzzling.
>

Not necessarily unresolvable, but if you've configured a faulty primary
dns so it ticks over to the secondary one after the 10-15 second (or
whatever) timeout, that would obviously cause much higher execution times.

If load is very low and latency is very high on one system but not the
other it's very nearly always down to configuration differences.

> I've always wished there was an easy way to see which processes had
> high latencies from the web interface without having to view the status.dat
> file...
>

You won't like it, but... install merlin and enable database writing. Then
you can do 'select * from service where latency >= 10.0' and get a complete
list of it, although you probably want to grab only some of the fields,
such as host_name, service_description and check_command.

>>> Either way, you're suggesting that having a NEB module handle the
>>> post-check work will eliminate the serialization.
>
>> Yes. Sneaking a peek at what's needed in order for an event to get sent to
>> master via an eventbroker compared to running an oc[sh]p command renders
>> this, more or less:
>
>> [ good stuff snipped...]
>
> Wow.
>

Indeed. http://blogs.op5.org
The relevant post is still the topmost one.

>>>
>>> parallelize_check is set to 1 everywhere.
>>
>> Does one server have a lot of random service failures? On-demand hostchecks are
>> still run in parallel.
>
> I don't think so. Intermittent you mean? Not as far as I know or can see.
>

Check top alert producers and include soft states in the report and you
should see if there are gargantuan differences.

>>>> What version of Nagios are you running?
>>>
>>> 3.2.1
>>
>> I take it upgrading makes no difference?
>
> To 3.2.3? I'll probably try that on the new servers, but if things work out I may
> just move to Merlin + 3.2.4. I wasn't sure I saw anything in the 3.2.3 release that
> I found compelling for us at the time. As I say, this system now has fairly high
> visibility so just trying something like that would involve a rather painful
> internal change process. It's like piloting the QE2 -- I can't change
> course very quickly :-)
>

I quite understand. Let me know if you want me to hook you up with a
sales rep. We'll do the migration in half a day, if that.

>>> Thanks, Andreas. I'm hoping to allocate sufficient resources on the new servers
>>> to be able to play with Merlin more there.
>>
>> It's quite resource-friendly actually. Well, compared to what you're running now
>> it's positively feather-light.
>
> I meant more like installing MySQL everywhere, building filesystems to hold the
> MySQL data, etc. Not so much like I need more memory or more CPUs. I don't
> remember seeing anything in the Merlin docs (maybe I missed it), but how
> large would the MySQL database need to be? Pretty small on each box, right?
> Like 500MB or less?
>

You don't need to use a database at all if you don't want to. You can use
merlin for loadbalancing and redundancy and still use the old cgi's or
whatever for watching current status. I think reports will be a bit bugged
though, but that should be easy to patch in Merlin tbh.

>>> Will I be able to have the performance
>>> data from a poller be sent up to a NOC for digestion by pnp4nagios?
>>
>> Yes, but you'll need the threadsafe version of Nagios you can obtain from either
>> CVS or git://git.op5.org/nagios.git for performance-data to work. Actually, you
>> need that for Merlin to work.
>
> That's part of the plan. Any chance that the OP5 site will eventually be
> configured to allow git through a proxy? It's of course less convenient to
> use snapshot tarballs, but still workable, of course.
>

You mean through http? Doesn't it already? I think it's supposed to. I can
check up on that later. The gitweb page has links for grabbing latest
master as a tarball though. That might work as an interim solution.
>
>>> No we'd be using some flavor of SLES.
>>>
>>
>> Should work marvellously then.
>
> Thanks as always for your help, Andreas.
>

You're welcome.

-- 
Andreas Ericsson                   andreas.ericsson at op5.se
OP5 AB                             www.op5.se
Tel: +46 8-230225                  Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.

From PWilliamson at twgi.net Tue Dec 7 23:42:57 2010
From: PWilliamson at twgi.net (Paul Williamson)
Date: Tue, 7 Dec 2010 22:42:57 +0000
Subject: Indication of dial backup?
Message-ID:

How can I configure Nagios to recognize when a router is on dial backup
(or connected via the non-primary link)? I have about 300 locations and
would like to see when a system is not connected via the primary
interface. I realize I'll probably need to define a template of some sort
that all routers would fit into, but I'm not very familiar with how to
indicate that the condition is good (on primary) or it is bad (on dial
back up). I've looked at the Nagios Exchange and didn't find any plugin or
template.

Thanks,
Paul

From hvdkooij at vanderkooij.org Wed Dec 8 10:25:12 2010
From: hvdkooij at vanderkooij.org (Hugo van der Kooij)
Date: Wed, 08 Dec 2010 10:25:12 +0100
Subject: Indication of dial backup?
In-Reply-To:
References:
Message-ID:

On Tue, 7 Dec 2010 22:42:57 +0000, Paul Williamson wrote:
> How can I configure Nagios to recognize when a router is on dial
> backup (or connected via the non-primary link)? I have about 300
> locations and would like to see when a system is not connected via
> the primary interface. I realize I'll probably need to define a template
> of some sort that all routers would fit into, but I'm not very
> familiar with how to indicate that the condition is good (on primary)
> or it is bad (on dial back up). I've looked at the Nagios Exchange
> and didn't find any plugin or template.

Add an interface check for the dialup interface but negate the result. If
the interface is down everything is OK. But if the interface is up then
you should set the status to CRITICAL or WARNING.

You will need to add some effort of your own into this but this would be
roughly how I would add monitoring.
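
A minimal sketch of that idea in Nagios object configuration, assuming the
stock negate and check_ifoperstatus plugins from the nagios-plugins package
are installed under $USER1$. The command name, the "routers" hostgroup, the
SNMP community "public" and the ifIndex 5 are placeholders for illustration
only, not anything taken from Paul's setup:

```
# Hypothetical command: wrap the interface check in negate so that
# "interface up" (normally OK) is reported as CRITICAL and
# "interface down" (normally CRITICAL) is reported as OK.
define command {
        command_name    check_dial_backup_idle
        command_line    $USER1$/negate $USER1$/check_ifoperstatus -H $HOSTADDRESS$ -C $ARG1$ -k $ARG2$
        }

# Hypothetical service, attached to every router through a hostgroup.
# $ARG1$ = SNMP community, $ARG2$ = ifIndex of the dial backup interface.
define service {
        use                     generic-service
        hostgroup_name          routers
        service_description     Dial backup idle
        check_command           check_dial_backup_idle!public!5
        }
```

The ifIndex of the backup interface may well differ between router models,
in which case a single hostgroup-wide service like this will not be enough
and the index has to be supplied per host. If WARNING is preferred over
CRITICAL, see the output of negate --help for the state mappings your
version supports.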

Just consider the backup link as a service of each router and set it to
CRITICAL if the dialup interface is up.

Hugo.

-- 
hvdkooij at vanderkooij.org               http://hugo.vanderkooij.org/
PGP/GPG? Use: http://hugo.vanderkooij.org/0x58F19981.asc

From kenneho.ndu at gmail.com Wed Dec 8 13:52:13 2010
From: kenneho.ndu at gmail.com (Kenneth Holter)
Date: Wed, 8 Dec 2010 13:52:13 +0100
Subject: Nagios configuraion best practice
In-Reply-To:
References: <7f62d2420809150750g50e432d6xbcb9a1a954273a09@mail.gmail.com> <7f62d2420809160641y128fe757lc99915534d3a847d@mail.gmail.com>
Message-ID:

Hi all,

I'm picking up this old thread, as I'm about to start restructuring
parts of my Nagios configuration. In my current configuration I've
created host groups to which I've linked both hosts and services.
Example: I have a host group called "linux-servers", to which all
linux servers are linked. A number of service checks common for all
linux servers are also linked to this host group. When adding new
linux boxes I simply link them to the "linux-servers" host group. In a
similar fashion I have a host-group called "application-servers" to
which all application servers and related service checks are linked.
When adding a new application server, I make sure it links to both
"linux-servers" and "application-servers".

This setup works pretty well, but it really clutters up the host
groups web page. Is this the way others have structured their nagios
configuration?

Regards,
Kenneth

On Wed, Sep 17, 2008 at 2:55 PM, Kenneth Holter wrote:
> I should have been more precise regarding the cluttering of the display - it
> was the hostgroup display I was referring to. :/
>
> Would it be an idea to as much as possible use this configuration method:
> hosts --- hostgroups --- services? To elaborate some: Hosts are always
> connected to host groups, and the same for services. This way one will
> avoid linking services to specific hosts, making the service definitions
> more clean.
>
> Also, I'm thinking about having a host group called for example
> "linux-servers", to which all linux servers are linked. A number of service
> checks common for all linux servers are also linked to this host group. When
> adding new linux boxes I simply link them to the "linux-servers" host group,
> and add extra services checks where needed. In a similar fashion I would
> create a host-group called for example "dell-servers" to which all Dell
> hosts and related service checks are linked. New Dell servers are then
> linked to this host group.
>
> I'm thinking this may be a good idea, but would like to hear how others have
> structured their configuration and if there may be some hidden pitfalls in
> my proposed configuration method.
>
> On 9/16/08, dave stern - e-mail.pluribus.unum wrote:
>>
>> Not quite sure what you're trying to do. If you monitor a service, it's
>> going to be on the nagios service details page regardless. That's the
>> whole point. There are some tricks to slim down some other displays, e.g.
>> I define a whole bunch of services under a host without defining the
>> hosts these services run on explicitly. I just feed the IP address as
>> part of the check_command. This reduced the hostgroup grid page. I don't
>> think that will work for you.
>>
>> If you're asking how to most efficiently code this in your configs, the
>> timesaving tips page mentions ganging together services, i.e. you don't
>> need a service stanza for each host you want to run this on. Rather, use
>> a wildcard or group.
>>
>> define service{
>>         use         generic
>>         host_name   hosta, hostb
>>         hostgroup   special-ones,!webguy
>>         .....
>> }
>>
>> Run the service on hosts hosta, hostb, all hosts in the hostgroup,
>> "special-ones" but not "webguy"
>>
>> One other thing you should consider. Where will the plugins live? The
>> easy answer is to put them on an NFS-mounted partition that all hosts
>> can access and that you can easily update. This is a bad practise. What
>> happens when NFS breaks? You should really copy the plugins to each
>> host so they run locally.
>>
>> On Tue, Sep 16, 2008 at 8:05 AM, Kenneth Holter wrote:
>> >
>> > Thanks for the reply. I'll read the documents you listed.
>> >
>> > Meanwhile, I have a design question: What's the best way to set up a
>> > service check to be executed on a selected few (or maybe all for that
>> > matter) nodes? Say I need to monitor a particular process (let's call
>> > this service A) on a number of systems, how would I implement this? I
>> > guess one way would be to link service A to a hostgroup, say
>> > hostgroup A, and add the selected systems to that hostgroup. This,
>> > however, will somewhat clutter up the web display, so I'm looking for
>> > alternative approaches.
>> >
>> > Any suggestions?
>> >
>> > On 9/15/08, dave stern - e-mail.pluribus.unum wrote:
>> >>
>> >> Assuming you're already familiar with nagios, I'd look at the nagios
>> >> documentation in the following order:
>> >>
>> >> Under "Configuring nagios", look at "Object definitions" and make sure
>> >>   you understand all parameters and what would make sense in your
>> >>   environment
>> >> Under "Advanced Topics", look at "Timesaving tips for object
>> >>   definitions"
>> >> Under "Advanced Topics", look at "host and service dependencies"
>> >>
>> >> Under "Advanced Topics", look at both "Distributed Monitoring" and
>> >>   "Redundant and Failover Monitoring".
>> >> Finally, under Security and Performance Tuning, look at Large
>> >>   Installation Tweaks
>> >>
>> >> On Mon, Sep 15, 2008 at 10:23 AM, Kenneth Holter wrote:
>> >> > Hello all,
>> >> >
>> >> > I'm new to Nagios, and am planning on using Nagios for monitoring
>> >> > our network of Linux servers.
>> >> >
>> >> > Are there any best practice documents on how to manage the different
>> >> > definitions such as hostgroups, services, servicegroups, and so forth
>> >> > in an enterprise environment?
>> >> >
>> >> > Regards,
>> >> > Kenneth Holter

From malarie at processia.com Wed Dec 8 20:25:54 2010
From: malarie at processia.com (Maxime Alarie)
Date: Wed, 8 Dec 2010 14:25:54 -0500
Subject: NagiosXI licence exceeded.
Message-ID: <62F2034CF68DFB45BAA4FB7466782DDC019AE4D9@denali.processia2003.com>

Hi,

I run nagiosxi on CentOS.
I have by mistake created more than 7 entries with the FREE Licence. I
have deleted everything related to my test laptop in the directories and
restarted Nagios, but I can't access anything anymore.

I went to /usr/local/Nagios, ran grep -iR LAPTOP and erased every entry I
could find. I still can't use nagiosxi.

Is there a way to fix this?

Thanks

From dit.dash at gmail.com Wed Dec 8 21:12:10 2010
From: dit.dash at gmail.com (dave stern - e-mail.pluribus.unum)
Date: Wed, 8 Dec 2010 15:12:10 -0500
Subject: Nagios configuraion best practice
In-Reply-To:
References: <7f62d2420809150750g50e432d6xbcb9a1a954273a09@mail.gmail.com> <7f62d2420809160641y128fe757lc99915534d3a847d@mail.gmail.com>
Message-ID:

I think this is more a personal (or site) preference. But using host
groups can be very useful to e.g. ensure that
   all windows machines get thus-and-such service checks
   all RedHat machines get thus-and-such service checks
   etc.

In other words, a wildcard or hostgroup tag for a service check can
substantially reduce the size of your config, make it more readable and
ensure uniformity.

From a grouping perspective, i.e. if you tend to display by hostgroups,
it's often helpful to list all hosts once and only once.
However, in other environments, some find it very useful to list some (or
all) machines in multiple groups. This would allow you to easily view all
RHEL9 hosts, SLES10 hosts that have not yet been updated, hosts in room
123, etc. The downside of this is when hosts go red, you'll see a lot more
red...

From xurizaemon at gmail.com Wed Dec 8 21:41:26 2010
From: xurizaemon at gmail.com (chris burgess)
Date: Thu, 9 Dec 2010 09:41:26 +1300
Subject: Set notification periods for services from host entry?
Message-ID:

Using Nagios3 on Debian 5.0.7, hoping to disable notifications for a
couple of hosts which experience downtime (or, poor response times from my
monitoring host) during morning backups.

Have created a timeperiod to black out the backup period, and set the
notification_period on the relevant host entries. But still receiving
service alerts on the host's services.

Is there a way to override the service notification period by setting it in
the hosts entries? (this is what I expected to do, but it seems the service
takes precedence)

Is it required to configure a separate service (eg: http,
http-except-backups; smtp, smtp-except-backups) for the services? Should
this service extend the existing http service with "use"?

Suggestions on better ways to handle this welcome too - permitting slower
response times during the backup periods would be a good solution, as we'd
still get notified if the server *actually* fell over.

Hopefully these excerpts give all the relevant details of what I'm trying -
have left out anything which is stock (eg generic-host config which is
unchanged from OS default, 24x7 definitions etc). Aim is to receive
notifications for bar 24x7 and foo only during silent_backups.

Thanks!

# /etc/nagios3/conf.d/timeperiods_nagios2.cfg contains this
define timeperiod {
        timeperiod_name silent_backups
        alias           24x7 with exclusion for daily backups
        sunday          00:00-05:30,07:00-24:00
        monday          00:00-05:30,07:00-24:00
        tuesday         00:00-05:30,07:00-24:00
        wednesday       00:00-05:30,07:00-24:00
        thursday        00:00-05:30,07:00-24:00
        friday          00:00-05:30,07:00-24:00
        saturday        00:00-05:30,07:00-24:00
}

# /etc/nagios3/conf.d/hosts.cfg contains this
define host {
        use             generic-host;
        host_name       foo;
        address         foo.example.org;
        check_period    silent_backups;
}
define host {
        use             generic-host;
        host_name       bar;
        address         bar.example.org;
        check_period    24x7;
}

# /etc/nagios3/conf.d/services_nagios2.cfg
define service {
        hostgroup_name          smtp-servers
        service_description     SMTP
        check_command           check_smtp
        use                     generic-service
        notification_interval   0 ; set > 0 if you want to be renotified
}
define service {
        hostgroup_name          http-servers
        service_description     HTTP
        check_command           check_http
        use                     generic-service
        notification_interval   0 ; set > 0 if you want to be renotified
}

# /etc/nagios3/conf.d/hostgroups_nagios2.cfg
# A list of your web servers
define hostgroup {
        hostgroup_name  http-servers
        alias           HTTP servers
        members         foo, bar
}
define hostgroup {
        hostgroup_name  smtp-servers
        alias           SMTP servers
        members         foo, bar
}

From tyarusso at nagios.com Wed Dec 8 22:14:08 2010
From: tyarusso at nagios.com (Tony Yarusso)
Date: Wed, 08 Dec 2010 15:14:08 -0600
Subject: NagiosXI licence exceeded.
In-Reply-To: <62F2034CF68DFB45BAA4FB7466782DDC019AE4D9@denali.processia2003.com>
References: <62F2034CF68DFB45BAA4FB7466782DDC019AE4D9@denali.processia2003.com>
Message-ID: <1291842848.12189.1.camel@ubuntu-desktop.SSG5-Serial>

On Wed, 2010-12-08 at 14:25 -0500, Maxime Alarie wrote:
> I run nagiosxi on CentOS.

This list is intended for the open source Nagios Core, not XI. Could you
please post this on http://support.nagios.com/forum/viewforum.php?f=6
instead?

--
Tony Yarusso
Technical Team
___
Nagios Enterprises, LLC
Email: tyarusso at nagios.com
Web: www.nagios.com

From kaushalshriyan at gmail.com Thu Dec 9 10:04:21 2010
From: kaushalshriyan at gmail.com (Kaushal Shriyan)
Date: Thu, 9 Dec 2010 14:34:21 +0530
Subject: Check RAID Script
Message-ID:

Hi

I have the below Raid Controller Card on HP DL 360 G6 Server. Can someone
please suggest/guide and let me know if there is a Raid Check script for
this specific model

03:00.0 RAID bus controller: Hewlett-Packard Company Smart Array G6 controllers (rev 01)
        Subsystem: Hewlett-Packard Company Device 3245
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr+ Stepping- SERR- FastB2B- DisINTx+
        Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- SERR-
        Kernel driver in use: cciss
        Kernel modules: cciss

Thanks

Kaushal

From duncan at dcl.co.uk Thu Dec 9 10:43:43 2010
From: duncan at dcl.co.uk (Duncan Berriman)
Date: Thu, 9 Dec 2010 09:43:43 -0000
Subject: Check RAID Script
In-Reply-To:
References:
Message-ID: <118601cb9785$91900350$b4b009f0$@dcl.co.uk>

Use check_cciss (uses hpacucli which must be installed), raid checker for HP.
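
For readers unfamiliar with how such a wrapper works, here is a minimal sketch of the idea: take the text printed by the vendor CLI and map it to a Nagios state. This is not the check_cciss source. The function name `raid_state`, the regular expressions, and the set of "healthy" strings are assumptions, modelled on status lines of the form `Controller Status: OK` and `logicaldrive 1 (33.9 GB, RAID 1+0, OK)`. Actually invoking hpacucli is left out on purpose.

```python
import re

OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3  # standard Nagios plugin exit codes

# Assumption: these status values are treated as healthy.
HEALTHY = {"OK", "Not Configured"}

# Assumed line shapes, e.g. "Controller Status: OK"
STATUS_RE = re.compile(r"^\s*(\w[\w /]*Status):\s*(.+?)\s*$")
# e.g. "logicaldrive 1 (33.9 GB, RAID 1+0, OK)"
DRIVE_RE = re.compile(r"logicaldrive\s+(\d+)\s+\(([^)]*)\)")


def raid_state(text):
    """Map CLI status text to (exit_code, message)."""
    problems = []
    drives = 0
    for line in text.splitlines():
        status = STATUS_RE.match(line)
        if status and status.group(2) not in HEALTHY:
            problems.append(f"{status.group(1)}: {status.group(2)}")
        drive = DRIVE_RE.search(line)
        if drive:
            drives += 1
            # The last comma-separated field inside the parentheses is the state.
            state = drive.group(2).split(",")[-1].strip()
            if state != "OK":
                problems.append(f"logicaldrive {drive.group(1)}: {state}")
    if problems:
        return CRITICAL, "RAID CRITICAL: " + "; ".join(problems)
    if drives == 0:
        # Healthy controllers but no arrays seen: report UNKNOWN, not OK.
        return UNKNOWN, "RAID UNKNOWN: no logical drives found"
    return OK, f"RAID OK: {drives} logical drive(s) healthy"
```

Returning UNKNOWN when no logical drive is seen is a design choice of this sketch, so that a controller with no visible arrays does not silently pass as OK.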

Duncan

From Sebastian.Ries at dtnet.de Thu Dec 9 11:10:40 2010
From: Sebastian.Ries at dtnet.de (Sebastian Ries)
Date: Thu, 09 Dec 2010 11:10:40 +0100
Subject: Check RAID Script
In-Reply-To: <118601cb9785$91900350$b4b009f0$@dcl.co.uk>
References: <118601cb9785$91900350$b4b009f0$@dcl.co.uk>
Message-ID: <1291889440.16961.6.camel@bofh.dtnet.de>

Hi

On Thursday, 09.12.2010 at 09:43 +0000, Duncan Berriman wrote:
> Use check_cciss (uses hpacucli which must be installed), raid checker
> for HP.

Thanks, I was also looking for something like this, but:

maintain:/usr/lib/nagios/plugins# ./check_cciss -v
RAID OK: [Controller Status: OK Cache Status: OK Battery Status: OK
Controller Status: OK Cache Status: Not Configured Battery Status: OK]

maintain:/usr/lib/nagios/plugins# hpacucli controller all show

MSA20 at E0ARMLJ191 (sn: PAAACADMQUUNF7, csn: E0ARMLJ191)
Smart Array 6400 (sn: P57820R9SS32WO)

As the MSA20 is connected to the P6400 this one does not have disks
directly connected.
So is it guaranteed that both controllers are checked?
How can I configure which to check?

Regards
Sebastian Ries

--
DT Netsolution GmbH - Talaeckerstr. 30 - D-70437 Stuttgart
Tel: +49-711-849910-36 Fax: +49-711-849910-936
WEB: http://www.dtnet.de/
email: Sebastian.Ries at dtnet.de

From Sebastian.Ries at dtnet.de Thu Dec 9 11:37:39 2010
From: Sebastian.Ries at dtnet.de (Sebastian Ries)
Date: Thu, 09 Dec 2010 11:37:39 +0100
Subject: Check RAID Script
In-Reply-To: <1291889440.16961.6.camel@bofh.dtnet.de>
References: <118601cb9785$91900350$b4b009f0$@dcl.co.uk> <1291889440.16961.6.camel@bofh.dtnet.de>
Message-ID: <1291891059.16961.11.camel@bofh.dtnet.de>

On Thursday, 09.12.2010 at 11:10 +0100, Sebastian Ries wrote:
> Hi
>
> On Thursday, 09.12.2010 at 09:43 +0000, Duncan Berriman wrote:
> > Use check_cciss (uses hpacucli which must be installed), raid checker
> > for HP.
>
> Thanks, I was also looking for something like this, but:
>
> maintain:/usr/lib/nagios/plugins# ./check_cciss -v
> RAID OK: [Controller Status: OK Cache Status: OK Battery Status: OK
> Controller Status: OK Cache Status: Not Configured Battery Status: OK]

*args*
Now I realized that both controllers are mentioned - but NO array was
found :-/
This is typical for an MSA20 (which is an external shelf with its own
controller - which must be connected to another HP controller :-/)

Regards
Sebastian Ries

From ihab24 at hotmail.com Thu Dec 9 12:08:56 2010
From: ihab24 at hotmail.com (Ihab Samara)
Date: Thu, 9 Dec 2010 13:08:56 +0200
Subject: NRPE
In-Reply-To:
References: , , <31B0FE0A1A8166409E9DF35C6DEECB2405332E6B@WPSCV6MM.OPR.STATEFARM.ORG>, , <31B0FE0A1A8166409E9DF35C6DEECB24053330F2@WPSCV6MM.OPR.STATEFARM.ORG>, , ,
Message-ID:

It is working now.
The requiretty setting in sudoers solved it. I added this to sudoers, to
allow it for the nagios user only:

Defaults:nagios !requiretty

Thanks Mark for the help

Ihab

> From: mark.elsen at gmail.com
> Date: Tue, 7 Dec 2010 17:30:51 +0100
> To: nagios-users at lists.sourceforge.net
> Subject: Re: [Nagios-users] NRPE
>
> > Hi List
> >
> > I have a problem using check_nrpe:
> >
> > The script on the remote machine is running this command:
> >
> > #!/bin/bash
> > output=`sudo /usr/sbin/lsof -X |grep tomcat| wc -l`
> > max=$1
> > if [ "$output" -lt "$max" ]; then
> >     echo "OK |value is "$output""
> >     exit 0;
> > else
> >     echo "CRITICAL value is "$output""
> >     exit 2;
> > fi
> >
> > I have this line in sudoers file:
> >
> > nagios ALL=(ALL) NOPASSWD: /usr/sbin/lsof
> >
> > I have this in the nrpe.cfg:
> >
> > command[check_lsof]=/usr/local/nagios/libexec/check_lsof.sh 256
> >
> > When I run the command from the nagios user on the remote machine:
> >
> > [nagios at serv_1]$ /usr/local/nagios/libexec/check_lsof.sh 256
> > OK |value is 132
> >
> > When I run it from the Nagios server (remotely):
> > [root at healthy libexec]# ./check_nrpe -H 10.1.1.1 -c check_lsof
> > OK |value is 0
> >
> > I've set the user in /etc/xinetd.d/nrpe to "nagios", and all the other
> > checks are working fine.
> >
> > Any thoughts?
>
> - Search for 'sudo' in nrpe.cfg, make sure the sudo prefix is set,
>   not commented.
>
> - Look for 'tty' in the sudoers file, make sure 'requiretty' is commented.
>
> M.

From duncan at dcl.co.uk Thu Dec 9 12:37:36 2010
From: duncan at dcl.co.uk (Duncan Berriman)
Date: Thu, 9 Dec 2010 11:37:36 -0000
Subject: Check RAID Script
In-Reply-To: <1291889440.16961.6.camel@bofh.dtnet.de>
References: <118601cb9785$91900350$b4b009f0$@dcl.co.uk> <1291889440.16961.6.camel@bofh.dtnet.de>
Message-ID: <11d501cb9795$7a685410$6f38fc30$@dcl.co.uk>

Can't help there as never had that sort of config. I guess you can use
hpacucli to query the shelf so it should just be a matter of adapting the
script to suit (there may already be an option in the script).

On a simple raid controller direct to disks you get something like the
following as a response.

RAID OK: Smart Array 5i in Slot 0 array A logicaldrive 1 (33.9 GB, RAID 1+0, OK) array B logicaldrive 2 (16.9 GB, RAID 1+0, OK) array C logicaldrive 3 (67.8 GB, RAID 1+0, OK)

Duncan

From cbeattie at geninfo.com Thu Dec 9 15:21:03 2010
From: cbeattie at geninfo.com (Chris Beattie)
Date: Thu, 09 Dec 2010 09:21:03 -0500
Subject: Set notification periods for services from hostentry?
In-Reply-To:
References:
Message-ID: <1291904463.28504.10.camel@DevNagios.geninfo.com>

On Wed, 2010-12-08 at 15:41 -0500, chris burgess wrote:
> Is there a way to override the service notification period by setting
> it in the hosts entries? (this is what I expected to do, but it seems
> the service takes precedence)

You're on the right track, but facing backwards. If you set a timeperiod
in your service object definition, it will override the timeperiod set for
that service's host object. If you do *not* set the timeperiod on a
service, it will inherit it from its host object.

Check out the section on Implied Inheritance here:
http://nagios.sourceforge.net/docs/3_0/objectinheritance.html

From pslund at gmail.com Thu Dec 9 17:55:53 2010
From: pslund at gmail.com (Pär Åslund)
Date: Thu, 9 Dec 2010 17:55:53 +0100
Subject: check_memcached and hits/misses
Message-ID:

Hi,

I'm searching for a nagios check that looks at hits and misses.

I've been using check_memcached to check memcached status, and for now I
will keep using it to check that memcached is answering and has free
memory. However, check_memcached looks at the hits/misses ratio since
startup of the memcached process, whereas I need to know whether a sudden
increase in misses has occurred between polls.

Is anyone familiar with such a check for Nagios?
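
One way to get a per-interval figure, sketched here in Python rather than pointing at an existing plugin: store the previous `get_hits` and `get_misses` counters (both are reported by memcached's `stats` command) and compute the miss ratio from the difference between two polls. The state file path, the function names, and the 30%/50% thresholds are assumptions chosen for illustration only.

```python
import json
import socket

OK, WARNING, CRITICAL = 0, 1, 2  # standard Nagios plugin exit codes
STATE_FILE = "/var/tmp/check_memcached_delta.json"  # hypothetical location


def fetch_counters(host="127.0.0.1", port=11211, timeout=5.0):
    """Read get_hits and get_misses via the text-protocol 'stats' command."""
    with socket.create_connection((host, port), timeout=timeout) as sock:
        sock.sendall(b"stats\r\n")
        data = b""
        while not data.endswith(b"END\r\n"):
            chunk = sock.recv(4096)
            if not chunk:
                break
            data += chunk
    counters = {}
    for line in data.decode("ascii", "replace").splitlines():
        parts = line.split()
        if len(parts) == 3 and parts[0] == "STAT" and parts[1] in ("get_hits", "get_misses"):
            counters[parts[1]] = int(parts[2])
    return counters


def interval_miss_ratio(prev, curr):
    """Miss ratio between two samples, or None if it cannot be computed."""
    hits = curr["get_hits"] - prev["get_hits"]
    misses = curr["get_misses"] - prev["get_misses"]
    if hits < 0 or misses < 0:  # counters went backwards: memcached restarted
        return None
    total = hits + misses
    if total == 0:  # no GET traffic during the interval
        return None
    return misses / total


def evaluate(prev, curr, warn=0.30, crit=0.50):
    """Return (exit_code, plugin_output) for the interval between two samples."""
    ratio = interval_miss_ratio(prev, curr)
    if ratio is None:
        return OK, "MEMCACHED OK - no comparable GET traffic since last poll"
    perfdata = f"miss_ratio={ratio * 100:.1f}%;{warn * 100:.0f};{crit * 100:.0f}"
    if ratio >= crit:
        state, label = CRITICAL, "CRITICAL"
    elif ratio >= warn:
        state, label = WARNING, "WARNING"
    else:
        state, label = OK, "OK"
    return state, f"MEMCACHED {label} - {ratio * 100:.1f}% misses since last poll | {perfdata}"


def main():
    curr = fetch_counters()
    try:
        with open(STATE_FILE) as fh:
            prev = json.load(fh)
    except (OSError, ValueError):
        prev = curr  # first run: nothing to compare against yet
    with open(STATE_FILE, "w") as fh:
        json.dump(curr, fh)
    state, message = evaluate(prev, curr)
    print(message)
    return state
```

To use it as a plugin, end the file with `raise SystemExit(main())`. Keeping the comparison in `evaluate` separate from the network and file handling means the threshold logic can be tried with hand-made counter dictionaries before pointing it at a live server.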

Best regards,
-p

From TGFurnish at herffjones.com Thu Dec 9 19:16:41 2010
From: TGFurnish at herffjones.com (Furnish, Trever G)
Date: Thu, 9 Dec 2010 13:16:41 -0500
Subject: low-cost snmp-enabled temperature sensor?
Message-ID:

Can anyone recommend a low-cost external temperature sensor that doesn't
require the buyer to break out a soldering iron?

Preferably it would be SNMP-enabled so I can poll it from anything.
Power-over-ethernet would be great too.

The least expensive snmp-enabled sensor I've found so far is this one at
195 USD:
http://avtech.com/Products/Temperature_Monitors/TemPageR_3E.htm

I've also noted probes like this one at 15 USD:
http://www.ibuttonlink.com/t-sense.aspx

...but the systems in the site where I'd hook this up are all Windows,
so I'm not sure how I'd get from there into Nagios.

--
Trever Furnish, tgfurnish at herffjones.com
Herff Jones, Inc. Solutions Architect
Phone: 317.612.3519
Any sufficiently advanced technology is indistinguishable from Unix.

From AdcockJ at leoncountyfl.gov Thu Dec 9 19:41:28 2010
From: AdcockJ at leoncountyfl.gov (Jon Adcock)
Date: Thu, 09 Dec 2010 13:41:28 -0500
Subject: low-cost snmp-enabled temperature sensor?
In-Reply-To:
References:
Message-ID: <4D00DC880200007500030C1F@leoncountyfl.gov>

Trever,

We are successfully using Websensors
(http://www.eesensors.com/websensor.html). These models can monitor temp,
humidity and illumination, and with add-on pieces, they can monitor for the
presence of water, and electrical power voltage.

One of the nice things about these components is that they come with Nagios
check plugins and instructions on how to install/setup Nagios monitoring.
They provide the monitored details over IP (not SNMP), and you can see the
details at any time by using a browser to access the webpage
(http://<address>).

Jon Adcock
Network Systems Administrator
MIS / Systems Team
Leon County
(850) 606-5500

From TGFurnish at herffjones.com Thu Dec 9 19:50:31 2010
From: TGFurnish at herffjones.com (Furnish, Trever G)
Date: Thu, 9 Dec 2010 13:50:31 -0500
Subject: low-cost snmp-enabled temperature sensor?
In-Reply-To: <4D00DC880200007500030C1F@leoncountyfl.gov>
References: <4D00DC880200007500030C1F@leoncountyfl.gov>
Message-ID:

Thanks, Jon. It looks like those are in the $400+ range -- am I perhaps
not seeing a less-expensive version that you're using?

--
Trever

From maxhetrick at verizon.net Thu Dec 9 19:51:52 2010
From: maxhetrick at verizon.net (Max Hetrick)
Date: Thu, 09 Dec 2010 13:51:52 -0500
Subject: low-cost snmp-enabled temperature sensor?
In-Reply-To: References: Message-ID: <4D012548.8030004@verizon.net> On 12/09/2010 01:16 PM, Furnish, Trever G wrote: > Can anyone recommend a low-cost external temperature sensor that doesn't > require the buyer to break out a soldering iron? > > Preferably it would be SNMP-enabled so I can poll it from anything. > Power-over-ethernet would be great too. > > The least expensive snmp-enabled sensor I've found so far is this one at > 195 USD: > http://avtech.com/Products/Temperature_Monitors/TemPageR_3E.htm > > I've also noted probes like this one at 15 USD: > http://www.ibuttonlink.com/t-sense.aspx > > ...but the systems in the site where I'd hook this up are all Windows, > so I'm not sure how I'd get from there into Nagios. I use these guys here. http://store.enviromon.net/cart.php?target=product&product_id=255&category_id=78 They are a little more expensive, but are ethernet connected, and have no issues using SNMP to incorporate into Nagios. We have two sensors hooked up to it, one is temp/humidity and the other is a water sensor. Regards, Max ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From cmadams at hiwaay.net Thu Dec 9 20:04:48 2010 From: cmadams at hiwaay.net (Chris Adams) Date: Thu, 9 Dec 2010 13:04:48 -0600 Subject: low-cost snmp-enabled temperature sensor? In-Reply-To: References: Message-ID: <20101209190448.GD29478@hiwaay.net> Once upon a time, Furnish, Trever G said: > Can anyone recommend a low-cost external temperature sensor that doesn't > require the buyer to break out a soldering iron? > > Preferably it would be SNMP-enabled so I can poll it from anything. 
> Power-over-ethernet would be great too. Along the same lines, does anybody know of a current meter that doesn't cost a bunch? Ideally something like a clamp-on ammeter that can handle 20A. I'd like to monitor the current on a bunch of circuits without having to spend much money (right now, I just periodically use a hand-held clamp-on ammeter to see the "instant" current on each wire). -- Chris Adams Systems and Network Administrator - HiWAAY Internet Services I don't speak for anybody but myself - that's enough trouble. ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From chris at mowisp.net Thu Dec 9 20:00:53 2010 From: chris at mowisp.net (Christopher Tyler) Date: Thu, 09 Dec 2010 13:00:53 -0600 Subject: low-cost snmp-enabled temperature sensor? In-Reply-To: <4D012548.8030004@verizon.net> References: <4D012548.8030004@verizon.net> Message-ID: <4D012765.9080004@mowisp.net> Take a look at this: http://www.packetflux.com/index.php?main_page=product_info&cPath=6&products_id=24 Temperature unit is $69.96 and it requires the base unit ($99.95) as well, even so it's under $200 for the complete setup and you get relays to turn things on/off at desired ranges. Christopher Tyler Total Wireless Communications, LLC On 12/09/2010 12:51 PM, Max Hetrick wrote: > On 12/09/2010 01:16 PM, Furnish, Trever G wrote: >> Can anyone recommend a low-cost external temperature sensor that doesn't >> require the buyer to break out a soldering iron? >> >> Preferably it would be SNMP-enabled so I can poll it from anything. >> Power-over-ethernet would be great too. 
>>
>> The least expensive snmp-enabled sensor I've found so far is this one at
>> 195 USD:
>> http://avtech.com/Products/Temperature_Monitors/TemPageR_3E.htm
>>
>> I've also noted probes like this one at 15 USD:
>> http://www.ibuttonlink.com/t-sense.aspx
>>
>> ...but the systems in the site where I'd hook this up are all Windows,
>> so I'm not sure how I'd get from there into Nagios.
>
> I use these guys here.
>
> http://store.enviromon.net/cart.php?target=product&product_id=255&category_id=78
>
> They are a little more expensive, but are ethernet connected, and have
> no issues using SNMP to incorporate into Nagios.
>
> We have two sensors hooked up to it, one is temp/humidity and the other
> is a water sensor.
>
> Regards,
> Max

From jmoseley at corp.xanadoo.com Thu Dec 9 20:26:26 2010
From: jmoseley at corp.xanadoo.com (James Moseley)
Date: Thu, 9 Dec 2010 13:26:26 -0600
Subject: low-cost snmp-enabled temperature sensor?
In-Reply-To:
References:
Message-ID:

You might try the APC enviro sensor for about $225. They also make an
upgrade kit if you already own an APC smart UPS with a network card.
APC also offers their Netbotz line - a company they bought, but those
products are a bit more expensive.

--
James

On Thu, Dec 9, 2010 at 12:16 PM, Furnish, Trever G wrote:
> Can anyone recommend a low-cost external temperature sensor that doesn't
> require the buyer to break out a soldering iron?
>
> Preferably it would be SNMP-enabled so I can poll it from anything.
> Power-over-ethernet would be great too.
>
> The least expensive snmp-enabled sensor I've found so far is this one at
> 195 USD:
> http://avtech.com/Products/Temperature_Monitors/TemPageR_3E.htm
>
> I've also noted probes like this one at 15 USD:
> http://www.ibuttonlink.com/t-sense.aspx
>
> ...but the systems in the site where I'd hook this up are all Windows,
> so I'm not sure how I'd get from there into Nagios.
>
> --
> Trever Furnish, tgfurnish at herffjones.com
> Herff Jones, Inc. Solutions Architect
> Phone: 317.612.3519
> Any sufficiently advanced technology is indistinguishable from Unix.

From darren at brucetelecom.com Thu Dec 9 20:55:13 2010
From: darren at brucetelecom.com (Darren Hill)
Date: Thu, 09 Dec 2010 14:55:13 -0500
Subject: low-cost snmp-enabled temperature sensor?
In-Reply-To: <4D012765.9080004@mowisp.net>
References: <4D012548.8030004@verizon.net> <4D012765.9080004@mowisp.net>
Message-ID: <4D013421.7000809@brucetelecom.com>

Try these guys. We use their sensor hub pro rack mount/wall mount devices.
You can connect any external sensor, or use the smaller guys such as the
SH-2. They have built in sensors but you can connect any of the external
ones too.

On 12/9/2010 2:00 PM, Christopher Tyler wrote:
> Take a look at this:
>
> http://www.packetflux.com/index.php?main_page=product_info&cPath=6&products_id=24
>
> Temperature unit is $69.96 and it requires the base unit ($99.95) as
> well, even so it's under $200 for the complete setup and you get relays
> to turn things on/off at desired ranges.
>
> Christopher Tyler
> Total Wireless Communications, LLC
>
> On 12/09/2010 12:51 PM, Max Hetrick wrote:
>> On 12/09/2010 01:16 PM, Furnish, Trever G wrote:
>>> Can anyone recommend a low-cost external temperature sensor that doesn't
>>> require the buyer to break out a soldering iron?
>>>
>>> Preferably it would be SNMP-enabled so I can poll it from anything.
>>> Power-over-ethernet would be great too.
>>>
>>> The least expensive snmp-enabled sensor I've found so far is this one at
>>> 195 USD:
>>> http://avtech.com/Products/Temperature_Monitors/TemPageR_3E.htm
>>>
>>> I've also noted probes like this one at 15 USD:
>>> http://www.ibuttonlink.com/t-sense.aspx
>>>
>>> ...but the systems in the site where I'd hook this up are all Windows,
>>> so I'm not sure how I'd get from there into Nagios.
>> I use these guys here.
>>
>> http://store.enviromon.net/cart.php?target=product&product_id=255&category_id=78
>>
>> They are a little more expensive, but are ethernet connected, and have
>> no issues using SNMP to incorporate into Nagios.
>>
>> We have two sensors hooked up to it, one is temp/humidity and the other
>> is a water sensor.
>>
>> Regards,
>> Max

--
Darren Hill
Network Administrator
Bruce Telecom
e: darren at bmts.com
p: 519 368-1267
f: 519 368-1285

From darren at brucetelecom.com Thu Dec 9 21:16:26 2010
From: darren at brucetelecom.com (Darren Hill)
Date: Thu, 09 Dec 2010 15:16:26 -0500
Subject: low-cost snmp-enabled temperature sensor?
In-Reply-To: <4D012765.9080004@mowisp.net>
References: <4D012548.8030004@verizon.net> <4D012765.9080004@mowisp.net>
Message-ID: <4D01391A.4050700@brucetelecom.com>

Apparently the URL didn't paste in:

www.uptimedevices.com

On 12/9/2010 2:00 PM, Christopher Tyler wrote:
> Take a look at this:
>
> http://www.packetflux.com/index.php?main_page=product_info&cPath=6&products_id=24
>
> Temperature unit is $69.96 and it requires the base unit ($99.95) as
> well, even so it's under $200 for the complete setup and you get relays
> to turn things on/off at desired ranges.
>
> Christopher Tyler
> Total Wireless Communications, LLC
>
> On 12/09/2010 12:51 PM, Max Hetrick wrote:
>> On 12/09/2010 01:16 PM, Furnish, Trever G wrote:
>>> Can anyone recommend a low-cost external temperature sensor that doesn't
>>> require the buyer to break out a soldering iron?
>>>
>>> Preferably it would be SNMP-enabled so I can poll it from anything.
>>> Power-over-ethernet would be great too.
>>>
>>> The least expensive snmp-enabled sensor I've found so far is this one at
>>> 195 USD:
>>> http://avtech.com/Products/Temperature_Monitors/TemPageR_3E.htm
>>>
>>> I've also noted probes like this one at 15 USD:
>>> http://www.ibuttonlink.com/t-sense.aspx
>>>
>>> ...but the systems in the site where I'd hook this up are all Windows,
>>> so I'm not sure how I'd get from there into Nagios.
>> I use these guys here.
>>
>> http://store.enviromon.net/cart.php?target=product&product_id=255&category_id=78
>>
>> They are a little more expensive, but are ethernet connected, and have
>> no issues using SNMP to incorporate into Nagios.
>>
>> We have two sensors hooked up to it, one is temp/humidity and the other
>> is a water sensor.
>>
>> Regards,
>> Max

--
Darren Hill
Network Administrator
Bruce Telecom
e: darren at bmts.com
p: 519 368-1267
f: 519 368-1285

From nagios at ljnet.dk Fri Dec 10 07:43:34 2010
From: nagios at ljnet.dk (Leif J.)
Date: Fri, 10 Dec 2010 07:43:34 +0100
Subject: low-cost snmp-enabled temperature sensor?
In-Reply-To:
References:
Message-ID: <4D01CC16.9070708@ljnet.dk>

I have no idea what it will cost there, but I'm using:

http://www.hw-group.com/products/poseidon/poseidon_xxxx_en.html

Ready-made Nagios plugin.

/Leif

On 09-12-2010 19:16, Furnish, Trever G wrote:
> Can anyone recommend a low-cost external temperature sensor that doesn't
> require the buyer to break out a soldering iron?
>
> Preferably it would be SNMP-enabled so I can poll it from anything.
> Power-over-ethernet would be great too.
>
> The least expensive snmp-enabled sensor I've found so far is this one at
> 195 USD:
> http://avtech.com/Products/Temperature_Monitors/TemPageR_3E.htm
>
> I've also noted probes like this one at 15 USD:
> http://www.ibuttonlink.com/t-sense.aspx
>
> ...but the systems in the site where I'd hook this up are all Windows,
> so I'm not sure how I'd get from there into Nagios.
>
> --
> Trever Furnish, tgfurnish at herffjones.com
> Herff Jones, Inc. Solutions Architect
> Phone: 317.612.3519
> Any sufficiently advanced technology is indistinguishable from Unix.

From duncan at dcl.co.uk Fri Dec 10 10:45:02 2010
From: duncan at dcl.co.uk (Duncan Berriman)
Date: Fri, 10 Dec 2010 09:45:02 -0000
Subject: low-cost snmp-enabled temperature sensor?
In-Reply-To:
References:
Message-ID: <13bd01cb984e$eac437b0$c04ca710$@dcl.co.uk>

I'd recommend

http://www.omega.co.uk/ppt/pptsc.asp?ref=ithx-w_ithx-m

Fabulous product, plug it in and it works, used them for years.
Regards
Duncan

-----Original Message-----
From: Furnish, Trever G [mailto:TGFurnish at herffjones.com]
Sent: 09 December 2010 18:17
To: nagios-users at lists.sourceforge.net
Subject: [Nagios-users] low-cost snmp-enabled temperature sensor?

Can anyone recommend a low-cost external temperature sensor that doesn't
require the buyer to break out a soldering iron?

Preferably it would be SNMP-enabled so I can poll it from anything.
Power-over-ethernet would be great too.

The least expensive snmp-enabled sensor I've found so far is this one at
195 USD:
http://avtech.com/Products/Temperature_Monitors/TemPageR_3E.htm

I've also noted probes like this one at 15 USD:
http://www.ibuttonlink.com/t-sense.aspx

...but the systems in the site where I'd hook this up are all Windows,
so I'm not sure how I'd get from there into Nagios.

--
Trever Furnish, tgfurnish at herffjones.com
Herff Jones, Inc. Solutions Architect
Phone: 317.612.3519
Any sufficiently advanced technology is indistinguishable from Unix.

From rutger at blokje.net Fri Dec 10 12:30:37 2010
From: rutger at blokje.net (Rutger Blom)
Date: Fri, 10 Dec 2010 12:30:37 +0100
Subject: low-cost snmp-enabled temperature sensor?
In-Reply-To: <13bd01cb984e$eac437b0$c04ca710$@dcl.co.uk>
References: <13bd01cb984e$eac437b0$c04ca710$@dcl.co.uk>
Message-ID:

You could use the temp-sensors in your server. Not as reliable as an
external temp-sensor, but extremely cheap ;-)

Rutger

2010/12/10 Duncan Berriman
> I'd recommend
>
> http://www.omega.co.uk/ppt/pptsc.asp?ref=ithx-w_ithx-m
>
> Fabulous product, plug it in and it works, used them for years.
>
> Regards
> Duncan
>
> -----Original Message-----
> From: Furnish, Trever G [mailto:TGFurnish at herffjones.com]
> Sent: 09 December 2010 18:17
> To: nagios-users at lists.sourceforge.net
> Subject: [Nagios-users] low-cost snmp-enabled temperature sensor?
>
> Can anyone recommend a low-cost external temperature sensor that doesn't
> require the buyer to break out a soldering iron?
>
> Preferably it would be SNMP-enabled so I can poll it from anything.
> Power-over-ethernet would be great too.
>
> The least expensive snmp-enabled sensor I've found so far is this one at
> 195 USD:
> http://avtech.com/Products/Temperature_Monitors/TemPageR_3E.htm
>
> I've also noted probes like this one at 15 USD:
> http://www.ibuttonlink.com/t-sense.aspx
>
> ...but the systems in the site where I'd hook this up are all Windows, so
> I'm not sure how I'd get from there into Nagios.
>
> --
> Trever Furnish, tgfurnish at herffjones.com
> Herff Jones, Inc. Solutions Architect
> Phone: 317.612.3519
> Any sufficiently advanced technology is indistinguishable from Unix.
--
Rutger Blom
Luzernvägen 14
227 38 LUND
Sweden
Tel. +46 763 46 99 44
www.rutgerblom.com
about.me/rutgerblom

From ej_seg at hotmail.com Fri Dec 10 14:56:44 2010
From: ej_seg at hotmail.com (Rikard Dahlberg)
Date: Fri, 10 Dec 2010 13:56:44 +0000
Subject: Alert host down with passive check
Message-ID:

Heya guys!
Is there any possible way to configure Nagios to report a host as DOWN if
Nagios hasn't got any passive check result within, say, 5 minutes?
Can I change that somehow? For now, when a host dies in my current
configuration, it doesn't actually report it as DOWN, since I'm using
passive checks. If I haven't got a new passive check result in 5 minutes,
I want Nagios to automatically set it as DOWN...

Any idea please? :)

Regards
Rikard

From pangrazi at gmail.com Fri Dec 10 15:05:45 2010
From: pangrazi at gmail.com (Greg Pangrazio)
Date: Fri, 10 Dec 2010 08:05:45 -0600
Subject: Alert host down with passive check
In-Reply-To:
References:
Message-ID:

You are looking for "Freshness".

Check out http://nagios.sourceforge.net/docs/3_0/freshness.html

I use this with all of my passive checks.

Greg Pangrazio

On Fri, Dec 10, 2010 at 7:56 AM, Rikard Dahlberg wrote:
> Heya guys!
>
> Is there any possible way to configure Nagios to report a host as DOWN if
> Nagios hasn't got any passive check result within, say, 5 minutes?
> Can I change that somehow? For now, when a host dies in my current
> configuration, it doesn't actually report it as DOWN, since I'm using
> passive checks. If I haven't got a new passive check result in 5 minutes,
> I want Nagios to automatically set it as DOWN...
>
> Any idea please? :)
>
> Regards
> Rikard

From maxs at webwizarddesign.com Fri Dec 10 15:20:50 2010
From: maxs at webwizarddesign.com (Max Schubert)
Date: Fri, 10 Dec 2010 09:20:50 -0500
Subject: Alert host down with passive check
In-Reply-To:
References:
Message-ID:

Read up on freshness checking:

http://nagios.sourceforge.net/docs/3_0/freshness.html

max

On 12/10/10, Rikard Dahlberg wrote:
>
> Heya guys!
>
> Is there any possible way to configure Nagios to report a host as DOWN if
> Nagios hasn't got any passive check result within, say, 5 minutes?
> Can I change that somehow? For now, when a host dies in my current
> configuration, it doesn't actually report it as DOWN, since I'm using
> passive checks. If I haven't got a new passive check result in 5 minutes,
> I want Nagios to automatically set it as DOWN...
>
> Any idea please? :)
>
> Regards
> Rikard
>

--
Sent from my mobile device

From odenbach at uni-paderborn.de Fri Dec 10 15:56:54 2010
From: odenbach at uni-paderborn.de (Christopher Odenbach)
Date: Fri, 10 Dec 2010 15:56:54 +0100
Subject: low-cost snmp-enabled temperature sensor?
In-Reply-To:
References: <13bd01cb984e$eac437b0$c04ca710$@dcl.co.uk>
Message-ID: <4D023FB6.5090602@uni-paderborn.de>

Hi,

we use the appliance from Netways:

http://www.netways.de/de/units/monitoring_hardware/ueberwachung/messpc/

You can attach up to 4 temperature or humidity sensors and read the
values with snmp, http... Nagios plugin is available from there as well.

Costs about 270 EUR for 4 temperature sensors and the box.

Christopher

> -----Original Message-----
> From: Furnish, Trever G [mailto:TGFurnish at herffjones.com]
> Sent: 09 December 2010 18:17
> To: nagios-users at lists.sourceforge.net
> Subject: [Nagios-users] low-cost snmp-enabled temperature sensor?
>
> Can anyone recommend a low-cost external temperature sensor that doesn't
> require the buyer to break out a soldering iron?
>
> Preferably it would be SNMP-enabled so I can poll it from anything.
> Power-over-ethernet would be great too.
>
> The least expensive snmp-enabled sensor I've found so far is this one at
> 195 USD:
> http://avtech.com/Products/Temperature_Monitors/TemPageR_3E.htm
>
> I've also noted probes like this one at 15 USD:
> http://www.ibuttonlink.com/t-sense.aspx
>
> ...but the systems in the site where I'd hook this up are all Windows, so
> I'm not sure how I'd get from there into Nagios.
>
> --
> Trever Furnish, tgfurnish at herffjones.com
> Herff Jones, Inc. Solutions Architect
> Phone: 317.612.3519
> Any sufficiently advanced technology is indistinguishable from Unix.
>
> --
> Rutger Blom
> Luzernvägen 14
> 227 38 LUND
> Sweden
> Tel. +46 763 46 99 44
> www.rutgerblom.com
> about.me/rutgerblom

--
======================================================
Dipl.-Ing. Christopher Odenbach
Zentrum fuer Informations- und Medientechnologien
Universitaet Paderborn
Raum N5.122
odenbach at uni-paderborn.de
Tel.: +49 5251 60 5315
======================================================

From pangrazi at gmail.com Fri Dec 10 16:28:13 2010
From: pangrazi at gmail.com (Greg Pangrazio)
Date: Fri, 10 Dec 2010 09:28:13 -0600
Subject: Alert host down with passive check
In-Reply-To:
References:
Message-ID:

Please keep the list on the replies so others can learn from this as well.
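
For anyone following along: the service definition that follows calls a check_stale command that is not shown anywhere in this thread. A minimal sketch of how such a command is often defined, assuming the check_dummy plugin from the standard plugins package is installed in the $USER1$ directory (the command body is an assumption, not necessarily what is running on the poster's system):

```
# Hypothetical definition of the check_stale command referenced below.
# check_dummy exits with the state number passed as $ARG1$ and prints $ARG2$.
define command{
        command_name    check_stale
        command_line    $USER1$/check_dummy $ARG1$ $ARG2$
        }
```

With arguments like 2 and 'This Service is stale', the command would return CRITICAL with that text whenever Nagios runs it because the last passive result is older than freshness_threshold. Freshness checking also has to be switched on globally in nagios.cfg (check_service_freshness=1 for services, check_host_freshness=1 for hosts).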
define service{
        use                     generic-service
        host_name               HOST
        service_description     Logged In users
        active_checks_enabled   0
        passive_checks_enabled  1
        check_freshness         1
        freshness_threshold     600
        check_command           check_stale!2!'This Service is stale'
        }

Greg Pangrazio

On Fri, Dec 10, 2010 at 9:18 AM, Rikard Dahlberg wrote:
> Thanks.
>
> I've now edited nagios.cfg to use freshness and looked up the manual.. :)
> Would you mind giving me an example of the service and command? Mine still
> isn't working..
>
> Rikard
>
>> From: pangrazi at gmail.com
>> Date: Fri, 10 Dec 2010 08:05:45 -0600
>> To: nagios-users at lists.sourceforge.net
>> Subject: Re: [Nagios-users] Alert host down with passive check
>>
>> You are looking for "Freshness"
>>
>> check out http://nagios.sourceforge.net/docs/3_0/freshness.html
>>
>> I use this with all of my passive checks.
>>
>> Greg Pangrazio
>>
>> On Fri, Dec 10, 2010 at 7:56 AM, Rikard Dahlberg
>> wrote:
>> > Heya guys!
>> >
>> > Is there any possible way to configure Nagios to report a host as DOWN if
>> > Nagios hasn't got any passive check result within, say, 5 minutes?
>> > Can I change that somehow? For now, when a host dies in my current
>> > configuration, it doesn't actually report it as DOWN, since I'm using
>> > passive checks. If I haven't got a new passive check result in 5 minutes,
>> > I want Nagios to automatically set it as DOWN...
>> >
>> > Any idea please? :)
>> >
>> > Regards
>> > Rikard

From jim at jimavery.me.uk Fri Dec 10 17:26:44 2010
From: jim at jimavery.me.uk (Jim Avery)
Date: Fri, 10 Dec 2010 16:26:44 +0000
Subject: Alerting based on past-to-current trends?
In-Reply-To:
References:
Message-ID:

On 6 December 2010 19:02, Ian Ehrenwald wrote:
> Hello
> I was wondering if there was a straight-forward way to alert based on an
> average of past data plus a current perfdata entry. I understand I'm not
> explaining it very well that way, so here is the real-world example I am
> working with -
>
> I am polling a set of machines via SNMP for CPU load every 1 minute
> (looking at hrProcessorLoad). If the return value is at or above 95%,
> send out a WARNING. If the return value is 98% or above, send out a
> CRITICAL. The problem here is that it's OK for a process to take up 100%
> CPU for multiple seconds, and sometimes that high CPU usage coincides
> with the SNMP %CPU query, so I get a lot of false alerts.
>
> Is there a way to use past perfdata in conjunction with the current
> returned data to generate an average and send a WARNING or CRITICAL based
> on that new number? I only care to get alerted from Nagios if, for
> example, the %CPU has been at 100% for 5 minutes. Or am I just way
> over-thinking this and should be monitoring 1m, 5m, 15m UNIX load
> averages (which doesn't seem that accurate anyway)? What are other people
> doing to monitor CPU usage and alert on abnormally long periods of
> utilization?

Nagios will alert as soon as the plugin returns a non-OK status. You
can of course configure max_check_attempts and/or
first_notification_delay so that Nagios won't send a notification until
after a given time, but this won't stop it from appearing on the web
page for problem services straight away.

It would be great if you could get Nagios to display only hard status
alerts - I don't think you can though, not with ordinary Nagios Core
anyway. Some of the third-party Nagios front ends will do it, for
example you can configure the icons in NagVis to display only hard
alerts.

Cheers,

Jim
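
As a rough illustration of the max_check_attempts / first_notification_delay approach mentioned above, a service checked once a minute could be kept in a SOFT state for about five minutes before Nagios notifies. This is only a sketch: the host name and check command are hypothetical, the generic-service template is assumed to exist, and the intervals assume the default interval_length of 60 seconds.

```
define service{
        use                       generic-service       ; template assumed to exist
        host_name                 appserver01           ; hypothetical host
        service_description       CPU Load
        check_command             check_snmp_cpu!95!98  ; hypothetical command wrapping the hrProcessorLoad query
        check_interval            1                     ; normal checks every minute
        retry_interval            1                     ; re-check every minute while in a SOFT state
        max_check_attempts        5                     ; 5 consecutive non-OK results before a HARD state
        first_notification_delay  0                     ; raise this (in minutes) to hold the first notification back further
        }
```

Note that this is not an average: a single OK result in between resets the attempt counter, and the SOFT problem state is still visible in the web interface straight away.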
::: Messages without supporting info will risk being sent to /dev/null From jim at jimavery.me.uk Fri Dec 10 17:48:14 2010 From: jim at jimavery.me.uk (Jim Avery) Date: Fri, 10 Dec 2010 16:48:14 +0000 Subject: Monitor disk via NSCA In-Reply-To: References: Message-ID: On 7 December 2010 09:44, Rikard Dahlberg wrote: > Hey all! > > I want to thank you all for the lovely help i got in my previous errand. It > was NSCA that was missconfigured on one line, or more imporatly, one > complete line was gone :) Now the NSCA passive checks work flawlessly, > almost anyway. > I can monitor CPU, memory and services, the only thing im getting problems > with is hard-drive monitoring. > > These are the commands I've? chosen, but the disk command doesn't write > anything out in nagios. Down below are a sample from nagios .cfg file also. > From what i've read is that nagios treats the passive checks just as a > normal queue as from a active check, so i believe i need a service for every > host, as I would for a active check? > > Any idea what ive done wrong on the harddrive config? 
>
> From NSClient:
> [NSCA Commands]
> CPU Load=alias_cpu
> host_check=check_ok
> Memory Usage=alias_mem
> Uptime=alias_up
> Drive space=alias_disk
> Service check=alias_service
>
> [External Alias]
> alias_cpu=checkCPU warn=80 crit=90 time=5m time=1m time=30s
> alias_cpu_ex=checkCPU warn=$ARG1$ crit=$ARG2$ time=5m time=1m time=30s
> alias_disk=CheckDriveSize MinWarn=10% MinCrit=5% CheckAll FilterType=FIXED
> alias_service=checkServiceState CheckAll
> alias_process=checkProcState $ARG1$=started
> alias_mem=checkMem MaxWarn=80% MaxCrit=90% ShowAll type=physical
> alias_up=checkUpTime MinWarn=1d MinWarn=1h
> alias_file_age=checkFile2 filter=out "file=$ARG1$" filter-written=>1d MaxWarn=1 MaxCrit=1 "syntax=%filename% %write%"
> alias_file_size=checkFile2 filter=out "file=$ARG1$" filter-size=>$ARG2$ MaxWarn=1 MaxCrit=1 "syntax=%filename% %size%"
> alias_file_size_in_dir=checkFile2 filter=out pattern=*.txt "file=$ARG1$" filter-size=>$ARG2$ MaxWarn=1 MaxCrit=1 "syntax=%filename% %size%"
> alias_event_log_old=CheckEventLog file=application file=system filter=new filter=out MaxWarn=1 MaxCrit=1 filter-generated=>2d filter-severity==success filter-severity==informational truncate=800 unique descriptions "syntax=%severity%: %source%: %message% (%count%)"
> alias_event_log_new=CheckEventLog file=application file=system MaxWarn=1 MaxCrit=1 "filter=generated gt -2d AND severity NOT IN ('success', 'informational')" truncate=800 unique descriptions "syntax=%severity%: %source%: %message% (%count%)"
> alias_event_log=alias_event_log_new
>
>
> From the host.cfg file from nagios
> define service{
>         use                     generic-service
>         host_name               ILSERVER
>         service_description     C:\ Drive Space
>         check_command           check_nt!USEDDISKSPACE!-l c -w 80 -c 90
>         }
>

The service_description in the Nagios service definition should match the entry in your [nsca commands] section.

I would change the 's' in 'Drive space' to upper case to match your nagios service definition and scrub the "C:\" so both now read 'Drive Space'. The alias_disk check tests all fixed disks so it's not strictly correct to have the C:\ in the description.

The other problem is in your service definition you are using a different kind of disk space check for the check_command. It might work, but would give weird results if it does both active and passive checks. You should either set it to query the same kind of check as the passive one like this ..

        check_command           check_nrpe!-c alias_disk

or if the firewall prevents your Nagios server from doing active checks, you should instead configure freshness checking to alert if you haven't received a check recently.

Typically, for a passive service check like this you will want something like..

define service{
        use                     generic-service
        host_name               ILSERVER
        service_description     Drive Space
        check_freshness         1
        freshness_threshold     5400
        active_checks_enabled   0
        max_check_attempts      1
        check_command           check_dummy!3 "UNKNOWN: No passive check received lately from the monitored host!"
        }

Forgive me if I've missed anything. Although I do have a few passive checks configured here, typically I only use them for servers which are a right pain to log on to so I haven't double-checked all this is 100% correct!

hth,

Jim

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null

From jim at jimavery.me.uk Fri Dec 10 17:54:00 2010
From: jim at jimavery.me.uk (Jim Avery)
Date: Fri, 10 Dec 2010 16:54:00 +0000
Subject: Multiple parents in map
In-Reply-To: <9baaff7c0a1bfc1267704c79704a1158@vps517.directvps.nl>
References: <9baaff7c0a1bfc1267704c79704a1158@vps517.directvps.nl>
Message-ID:

On 7 December 2010 15:10, Hugo van der Kooij wrote:
> Hi,
>
> I have been digging into the archives but could not find a solution. But in my view Nagios 3.2.3 is not showing nodes correctly in the map view.
>
> I have 2 firewall nodes (FW1 and FW2) for the customer that I can check on a special TCP port. They are 2 cluster members on different physical locations.
>
> Then I have 2 SMTP servers (SMTP1 and SMTP2) behind them. They are also distributed over both locations.
>
> SMTP1 has the parents FW1 and FW2 and SMTP2 has the parents FW2 and FW1. (The list order is important.)
>
> On the map both SMTP servers are behind FW2 and there are 2 blank spots behind FW1 in the Circular (Marked Up) map.
>
> Will this be fixed in a future release?

I can't speak for the developers, but I doubt this will be fixed. It's more likely an alternative mapping method will replace the current Circular (Marked Up) map which doesn't scale well to large numbers of hosts.

There are alternative maps available now as add-ons for Nagios, for example NagVis includes an automap feature which might fit the bill.

http://www.nagvis.org/

hth,

Jim

From rand at meridian-enviro.com Fri Dec 10 18:11:12 2010
From: rand at meridian-enviro.com (Douglas K. Rand)
Date: Fri, 10 Dec 2010 11:11:12 -0600
Subject: low-cost snmp-enabled temperature sensor?
References:
Message-ID: <87mxodmu33.wl%rand@meridian-enviro.com>

I really like the 1-wire sensors. They are really small, really inexpensive, and really easy to connect together. What makes them useful for your question is this $100 unit that provides an Ethernet bridge.

http://www.edsproducts.com/OW-SERVER--1-Wire-to-Ethernet-Server_p_152.html

You get SNMP access to the devices, or, if you'd rather, an HTTP-hosted XML document that will give you readings from all the sensors on the network.

Don't be misled by the 3 1-wire ports, each of those can support a separate 1-wire network with many sensors.

I'm getting a lot of my 1-wire sensors from iButtonLink:

http://www.ibuttonlink.com/

We have a lot of the T-Sense sensors for $15 each.

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null

From jim at jimavery.me.uk Fri Dec 10 20:29:24 2010
From: jim at jimavery.me.uk (Jim Avery)
Date: Fri, 10 Dec 2010 19:29:24 +0000
Subject: Alerting based on past-to-current trends?
In-Reply-To:
References:
Message-ID:

On 10 December 2010 18:43, Rick Carter wrote:
> Hi Jim,
>
> I'm wondering if load average would get you where you want to be, as in a lot of cases, a CPU busy might not be a big deal unless the run queue is growing.
>
> My nagios-fu isn't good enough to tell you how to get that, but when I saw your message, I thought right away of the linux/unix:
>
> $ uptime
> 13:41  up 2 days, 18:11, 2 users, load averages: 0.31 0.25 0.24
>
> Where the 2nd load average is the 5-minute one.
>
> - Rick

Good point Rick, there is a check_load plugin, and you could indeed set appropriate thresholds to make it concentrate on the 15-minute value rather than the 5-minute or 1-minute values.

As to what 'load' actually means I'm not 100% sure. I've read http://www.teamquest.com/resources/gunther/display/5/index.htm a few times, and think it helps a bit! I even bought Gunther's book "Guerrilla Capacity Planning" but confess I haven't read anywhere near all of it.

I seem to recall reading somewhere that as a general rule of thumb if load is > 2 * the number of cpus, it's probably affecting performance. Certainly on my own Nagios server with 4 CPUs I find it's struggling whenever load is consistently > 10.

Cheers,

Jim

From Bret.Goodfellow at questar.com Fri Dec 10 23:39:04 2010
From: Bret.Goodfellow at questar.com (Bret Goodfellow)
Date: Fri, 10 Dec 2010 22:39:04 +0000
Subject: check_logs.pl doesn't return output on RHEL 6
Message-ID: <534585C15C30B048B7A3A335F9B6C95C0BAD1EE1@SLCEXMB02.corp.questar.com>

December 10, 2010. Just installed Red Hat EL 6 with Nagios. The plugin check_logs.pl returns no output upon execution. Under Red Hat EL 5, everything works great. Sorry about the lack of input here, but the simple answer is that no output is returned. All other plugins so far work fine.

Regards,
Bret Goodfellow

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null

From rik.dahlberg at gmail.com Sat Dec 11 08:53:48 2010
From: rik.dahlberg at gmail.com (Rikard Dahlberg)
Date: Sat, 11 Dec 2010 07:53:48 +0000
Subject: Alert host down with passive check
In-Reply-To:
References: , , ,
Message-ID:

Thanks for the fast answer.
Freshness seems to do the trick, to a point anyway.
I can see if the host's services die now, just like an active check when it's down. However, it doesn't report correctly on the host status.

Since freshness's last resort option is to ping the host IP address, and if the server goes down and is behind a firewall, the ping will just go to that gateway. Is there any way to "fake" nagios into thinking that the host is down if some service goes down?

Rikard

> From: pangrazi at gmail.com
> Date: Fri, 10 Dec 2010 09:28:13 -0600
> To: rik.dahlberg at gmail.com; nagios-users at lists.sourceforge.net
> Subject: Re: [Nagios-users] Alert host down with passive check
>
> Please keep the list on the replies so others can learn from this as well.
>
> define service{
>         use                     generic-service
>         host_name               HOST
>         service_description     Logged In users
>         active_checks_enabled   0;
>         passive_checks_enabled  1;
>         check_freshness         1;
>         freshness_threshold     600;
>         check_command           check_stale!2!'This Service is stale'
>         }
>
> Greg Pangrazio
>
> On Fri, Dec 10, 2010 at 9:18 AM, Rikard Dahlberg wrote:
> > Thanks.
> >
> > I've now edited nagios.cfg to use freshness and looked up the manual.. :)
> > Would you mind giving me an example of the service and command? Mine still isn't working..
> >
> > Rikard
> >
> >> From: pangrazi at gmail.com
> >> Date: Fri, 10 Dec 2010 08:05:45 -0600
> >> To: nagios-users at lists.sourceforge.net
> >> Subject: Re: [Nagios-users] Alert host down with passive check
> >>
> >> You are looking for "Freshness"
> >>
> >> check out http://nagios.sourceforge.net/docs/3_0/freshness.html
> >>
> >> I use this with all of my passive checks.
> >>
> >> Greg Pangrazio
> >>
> >> On Fri, Dec 10, 2010 at 7:56 AM, Rikard Dahlberg wrote:
> >> > Heya guys!
> >> >
> >> > Is there any possible way to configure nagios to report host as DOWN, if nagios hasn't got any passive check result within like 5 minutes?
> >> > Can I change that somehow? For now, when a host dies in my current configuration, it actually doesn't report it as DOWN, since I'm using passive checks. But if I haven't got a new passive check result in 5 minutes, I want nagios to automatically set it as DOWN...
> >> >
> >> > Any idea please? :)
> >> >
> >> > Regards
> >> > Rikard

From pangrazi at gmail.com Sat Dec 11 16:18:23 2010
From: pangrazi at gmail.com (Greg Pangrazio)
Date: Sat, 11 Dec 2010 09:18:23 -0600
Subject: Alert host down with passive check
In-Reply-To:
References:
Message-ID:

You can use a macro for another service but why don't you use that service as the host check?

On Dec 11, 2010 2:00 AM, "Rikard Dahlberg" wrote:
>
> Thanks for the fast answer.
> Freshness seems to do the trick, to a point anyway.
> I can see if the host's services die now, just like an active check when it's down. However, it doesn't report correctly on the host status.
>
> Since freshness's last resort option is to ping the host IP address, and if the server goes down and is behind a firewall, the ping will just go to that gateway. Is there any way to "fake" nagios into thinking that the host is down if some service goes down?
> Rikard
>
>> From: pangrazi at gmail.com
>> Date: Fri, 10 Dec 2010 09:28:13 -0600
>> To: rik.dahlberg at gmail.com; nagios-users at lists.sourceforge.net
>> Subject: Re: [Nagios-users] Alert host down with passive check
>>
>> Please keep the list on the replies so others can learn from this as well.
>>
>> define service{
>>         use                     generic-service
>>         host_name               HOST
>>         service_description     Logged In users
>>         active_checks_enabled   0;
>>         passive_checks_enabled  1;
>>         check_freshness         1;
>>         freshness_threshold     600;
>>         check_command           check_stale!2!'This Service is stale'
>>         }
>>
>> Greg Pangrazio
>>
>> On Fri, Dec 10, 2010 at 9:18 AM, Rikard Dahlberg wrote:
>> > Thanks.
>> >
>> > I've now edited nagios.cfg to use freshness and looked up the manual.. :)
>> > Would you mind giving me an example of the service and command? Mine still isn't working..
>> >
>> > Rikard
>> >
>> >> From: pangrazi at gmail.com
>> >> Date: Fri, 10 Dec 2010 08:05:45 -0600
>> >> To: nagios-users at lists.sourceforge.net
>> >> Subject: Re: [Nagios-users] Alert host down with passive check
>> >>
>> >> You are looking for "Freshness"
>> >>
>> >> check out http://nagios.sourceforge.net/docs/3_0/freshness.html
>> >>
>> >> I use this with all of my passive checks.
>> >>
>> >> Greg Pangrazio
>> >>
>> >> On Fri, Dec 10, 2010 at 7:56 AM, Rikard Dahlberg wrote:
>> >> > Heya guys!
>> >> >
>> >> > Is there any possible way to configure nagios to report host as DOWN, if nagios hasn't got any passive check result within like 5 minutes?
>> >> > Can I change that somehow? For now, when a host dies in my current configuration, it actually doesn't report it as DOWN, since I'm using passive checks. But if I haven't got a new passive check result in 5 minutes, I want nagios to automatically set it as DOWN...
>> >> >
>> >> > Any idea please? :)
>> >> >
>> >> > Regards
>> >> > Rikard

From mark.frost1 at pepsico.com Sat Dec 11 19:14:47 2010
From: mark.frost1 at pepsico.com (Frost, Mark {PBC})
Date: Sat, 11 Dec 2010 13:14:47 -0500
Subject: high latency
In-Reply-To: <4CFEBBA8.6070509@op5.se>
References: <31B0FE0A1A8166409E9DF35C6DEECB24052956DC@WPSCV6MM.OPR.STATEFARM.ORG><4CF68510.6090400@flatto.net><31B0FE0A1A8166409E9DF35C6DEECB24052E45E5@WPSCV6MM.OPR.STATEFARM.ORG> <5d744ca63ee1f981e90502f67339c5b6.squirrel@webmail.stinkweasel.net> <31B0FE0A1A8166409E9DF35C6DEECB24052E4705@WPSCV6MM.OPR.STATEFARM.ORG> <4CF7D4FE.7090706@op5.se> <31B0FE0A1A8166409E9DF35C6DEECB24052E4794@WPSCV6MM.OPR.STATEFARM.ORG> <4CF8D2E0.1060101@op5.se> <4CFCC3B0.6050706@op5.se> <4CFE4824.8000102@op5.se> <4CFEBBA8.6070509@op5.se>
Message-ID:

> -----Original Message-----
> From: Andreas Ericsson [mailto:ae at op5.se]
> Sent: Tuesday, December 07, 2010 5:57 PM
> To: Frost, Mark {PBC}
> Cc: Nagios Users List
> Subject: Re: [Nagios-users] high latency
>
> >
> > Any chance that the OP5 site will eventually be configured to allow git through a proxy? It's of course less convenient to use snapshot tarballs, but still workable, of course.
> >
>
> You mean through http? Doesn't it already? I think it's supposed to. I can check up on that later. The gitweb page has links for grabbing latest master as a tarball though. That might work as an interim solution.
>
> --
> Andreas Ericsson andreas.ericsson at op5.se
> OP5 AB www.op5.se
> Tel: +46 8-230225 Fax: +46 8-230231

Andreas,

It's just never worked for me and I thought you'd mentioned some time ago that OP5's git site just didn't support it.

I've validated that my version of git (1.7.1) will grab code from a public site via our corporate proxy using other public code (the proxy is setup via the $http_proxy environment variable):

$ git clone http://github.com/schacon/grack.git
Initialized empty Git repository in /home/mfrost0/src/grack/.git/
remote: Counting objects: 85, done.
remote: Compressing objects: 100% (45/45), done.
remote: Total 85 (delta 32), reused 80 (delta 31)
Unpacking objects: 100% (85/85), done.

but...

$ git clone http://git.op5.org/nagios/merlin.git merlin-src
Initialized empty Git repository in /home/mfrost0/src/merlin-src/.git/
fatal: http://git.op5.org/nagios/merlin.git/info/refs not found: did you run git update-server-info on the server?
$ git clone http://git.op5.org/nagios.git nagios-src
Initialized empty Git repository in /home/mfrost0/src/nagios-src/.git/
fatal: http://git.op5.org/nagios.git/info/refs not found: did you run git update-server-info on the server?

so, you know :-(

Thanks

Mark

From ae at op5.se Sun Dec 12 21:09:14 2010
From: ae at op5.se (Andreas Ericsson)
Date: Sun, 12 Dec 2010 21:09:14 +0100
Subject: high latency
In-Reply-To:
References: <31B0FE0A1A8166409E9DF35C6DEECB24052956DC@WPSCV6MM.OPR.STATEFARM.ORG><4CF68510.6090400@flatto.net><31B0FE0A1A8166409E9DF35C6DEECB24052E45E5@WPSCV6MM.OPR.STATEFARM.ORG> <5d744ca63ee1f981e90502f67339c5b6.squirrel@webmail.stinkweasel.net> <31B0FE0A1A8166409E9DF35C6DEECB24052E4705@WPSCV6MM.OPR.STATEFARM.ORG> <4CF7D4FE.7090706@op5.se> <31B0FE0A1A8166409E9DF35C6DEECB24052E4794@WPSCV6MM.OPR.STATEFARM.ORG> <4CF8D2E0.1060101@op5.se> <4CFCC3B0.6050706@op5.se> <4CFE4824.8000102@op5.se> <4CFEBBA8.6070509@op5.se>
Message-ID: <4D052BEA.1030200@op5.se>

On 12/11/2010 07:14 PM, Frost, Mark {PBC} wrote:
>> -----Original Message-----
>> From: Andreas Ericsson [mailto:ae at op5.se]
>> Sent: Tuesday, December 07, 2010 5:57 PM
>> To: Frost, Mark {PBC}
>> Cc: Nagios Users List
>> Subject: Re: [Nagios-users] high latency
>>
>>>
>>> Any chance that the OP5 site will eventually be configured to allow git through a proxy? It's of course less convenient to use snapshot tarballs, but still workable, of course.
>>>
>>
>> You mean through http? Doesn't it already? I think it's supposed to. I can check up on that later. The gitweb page has links for grabbing latest master as a tarball though. That might work as an interim solution.
>>
>> --
>> Andreas Ericsson andreas.ericsson at op5.se
>> OP5 AB www.op5.se
>> Tel: +46 8-230225 Fax: +46 8-230231
>
> Andreas,
>
> It's just never worked for me and I thought you'd mentioned some time ago that OP5's git site just didn't support it.
>
> I've validated that my version of git (1.7.1) will grab code from a public site via our corporate proxy using other public code (the proxy is setup via the $http_proxy environment variable):
>
> $ git clone http://github.com/schacon/grack.git
> Initialized empty Git repository in /home/mfrost0/src/grack/.git/
> remote: Counting objects: 85, done.
> remote: Compressing objects: 100% (45/45), done.
> remote: Total 85 (delta 32), reused 80 (delta 31)
> Unpacking objects: 100% (85/85), done.
>
> but...
>
> $ git clone http://git.op5.org/nagios/merlin.git merlin-src
> Initialized empty Git repository in /home/mfrost0/src/merlin-src/.git/
> fatal: http://git.op5.org/nagios/merlin.git/info/refs not found: did you run git update-server-info on the server?
> $ git clone http://git.op5.org/nagios.git nagios-src
> Initialized empty Git repository in /home/mfrost0/src/nagios-src/.git/
> fatal: http://git.op5.org/nagios.git/info/refs not found: did you run git update-server-info on the server?
>
> so, you know :-(
>

Aight. I'll look into it tomorrow when I get to work. It's supposed to work anyways.

--
Andreas Ericsson andreas.ericsson at op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace.

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null From t.h.amundsen at usit.uio.no Mon Dec 13 11:42:17 2010 From: t.h.amundsen at usit.uio.no (Trond Hasle Amundsen) Date: Mon, 13 Dec 2010 11:42:17 +0100 Subject: check_openmanage plugin reporting Firmware out of date In-Reply-To: <15tfwuhz6io.fsf@tux.uio.no> (Trond Hasle Amundsen's message of "Wed, 01 Dec 2010 19:34:39 +0100") References: <4A03B9D20B80A1428B53101F813BCA0704C1C975@WFDTDNPXMASRV01.1DC.COM> <15toc95zdas.fsf@tux.uio.no> <4A03B9D20B80A1428B53101F813BCA0704C1CA0A@WFDTDNPXMASRV01.1DC.COM> <15tfwuhz6io.fsf@tux.uio.no> Message-ID: <15tei9mq7hy.fsf@tux.uio.no> Trond Hasle Amundsen writes: > "Surangiwala, Asif " writes: > >> Can we update the check_openmanage script to parse the "Minimum >> Required Firmware Version" and compare it with the current "Firmware >> Version" to overcome the OMSA bug? > > It is entirely possible to mitigate this bug within the plugin, but I > don't think that it's a good idea to let the plugin do all version > parsings and ignore OMSA on a general basis. I have created a version > that works around this particular bug (version 3.6.2-p1) and made it > available here: > > http://folk.uio.no/trondham/software/omsa-fw-bug/ > > It simply ignores out-of-date firmware if the firmware and minimum > firmware versions match those in question. But in order for this to > work, I also had to turn off checking the global health status, which > inherits the non-critical status of the controller. > > DISCLAIMER: This version is only intended as a temporary solution for > users of OMSA 6.3.0 that struggles with the recent firmware bug, and > don't want to use blacklisting as a workaround. When OMSA 6.4.0 becomes > available, you should upgrade OMSA and revert to a regular release of > check_openmanage. Hi Asif, Dell has released OMSA 6.4.0, which fixes the firmware version parsing issue. 
I have also released a new version of check_openmanage that contains a few compatibility fixes for OMSA 6.4.0: http://folk.uio.no/trondham/software/check_openmanage.html#download Cheers, -- Trond H. Amundsen Center for Information Technology Services, University of Oslo ------------------------------------------------------------------------------ Oracle to DB2 Conversion Guide: Learn learn about native support for PL/SQL, new data types, scalar functions, improved concurrency, built-in packages, OCI, SQL*Plus, data movement tools, best practices and more. http://p.sf.net/sfu/oracle-sfdev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Eliot.Picken at wenaas.co.uk Mon Dec 13 12:02:45 2010 From: Eliot.Picken at wenaas.co.uk (Eliot.Picken at wenaas.co.uk) Date: Mon, 13 Dec 2010 11:02:45 +0000 Subject: AUTO: Eliot Picken is out of the office (returning 15/12/2010) Message-ID: I am out of the office until 15/12/2010. I am currently out of the office, and I will respond to your email upon my return. Your email has not been forwarded. Note: This is an automated response to your message "Re: [Nagios-users] check_openmanage plugin reporting Firmware out of date" sent on 12/13/2010 10:42:17 AM. This is the only notification you will receive while this person is away. ------------------------------------------------------------------------------ Oracle to DB2 Conversion Guide: Learn learn about native support for PL/SQL, new data types, scalar functions, improved concurrency, built-in packages, OCI, SQL*Plus, data movement tools, best practices and more. 

From hvdkooij at vanderkooij.org Mon Dec 13 12:58:46 2010
From: hvdkooij at vanderkooij.org (Hugo van der Kooij)
Date: Mon, 13 Dec 2010 12:58:46 +0100
Subject: check_logs.pl doesn't return output on RHEL 6
In-Reply-To: <534585C15C30B048B7A3A335F9B6C95C0BAD1EE1@SLCEXMB02.corp.questar.com>
References: <534585C15C30B048B7A3A335F9B6C95C0BAD1EE1@SLCEXMB02.corp.questar.com>
Message-ID: <940b39dba85aeeb5ba6195b43cce98a2@vps517.directvps.nl>

On Fri, 10 Dec 2010 22:39:04 +0000, Bret Goodfellow wrote:

> December 10, 2010. Just installed Red Hat EL 6 with Nagios. The plugin
> check_logs.pl returns no output upon execution. Under Red Hat EL 5,
> everything works great. Sorry about the lack of input here, but the
> simple answer is that no output is returned. All other plugins so far
> work fine.

It might be a Perl script that contains code that is not compatible with the Perl version included in RHEL 6.

Did you contact the $AUTHOR? What happens if you run it by hand?

Hugo.
-- 
hvdkooij at vanderkooij.org http://hugo.vanderkooij.org/
PGP/GPG? Use: http://hugo.vanderkooij.org/0x58F19981.asc

From malarie at processia.com Mon Dec 13 19:39:36 2010
From: malarie at processia.com (Maxime Alarie)
Date: Mon, 13 Dec 2010 13:39:36 -0500
Subject: NagiosXI licence exceeded.
In-Reply-To: <1291842848.12189.1.camel@ubuntu-desktop.SSG5-Serial>
References: <62F2034CF68DFB45BAA4FB7466782DDC019AE4D9@denali.processia2003.com> <1291842848.12189.1.camel@ubuntu-desktop.SSG5-Serial>
Message-ID: <62F2034CF68DFB45BAA4FB7466782DDC019AE7F5@denali.processia2003.com>

My bad, I am sorry. Thanks for pointing that out, and for the link, Tony.

-----Original Message-----
From: Tony Yarusso [mailto:tyarusso at nagios.com]
Sent: 08 December 2010 16:14
To: Nagios Users List; Maxime Alarie
Subject: Re: [Nagios-users] NagiosXI licence exceeded.

On Wed, 2010-12-08 at 14:25 -0500, Maxime Alarie wrote:
> I run nagiosxi on CentOS.

This list is intended for the open source Nagios Core, not XI. Could you please post this on http://support.nagios.com/forum/viewforum.php?f=6 instead?

-- 
Tony Yarusso
Technical Team
___
Nagios Enterprises, LLC
Email: tyarusso at nagios.com
Web: www.nagios.com

From stanb at panix.com Mon Dec 13 21:34:59 2010
From: stanb at panix.com (stan)
Date: Mon, 13 Dec 2010 15:34:59 -0500
Subject: distributed nagios ?
Message-ID: <20101213203459.GA13764@teddy.fas.com>

I have a Nagios instance that currently monitors about 70 machines. Now I have an additional network coming online that will "hide" behind a firewall using NAT. It seems to me that the way to deal with this is to install Nagios on one of the machines behind the firewall, and have that instance report its results back to my master Nagios instance.

I have tried a number of Google search terms, but have not come up with the one that describes this methodology. What is the Nagios terminology for this setup?

Any pointers to docs on how to set it up?

Thanks.

-- 
A: Because it messes up the order in which people normally read text.
Q: Why is top-posting such a bad thing?
A: Top-posting.
Q: What is the most annoying thing in e-mail?

From flyinvap at kabano.net Mon Dec 13 21:43:03 2010
From: flyinvap at kabano.net (Flyinvap)
Date: Mon, 13 Dec 2010 21:43:03 +0100
Subject: distributed nagios ?
In-Reply-To: <20101213203459.GA13764@teddy.fas.com>
References: <20101213203459.GA13764@teddy.fas.com>
Message-ID: <4D068557.7080601@kabano.net>

On 13/12/2010 21:34, stan wrote:
> Any pointers to docs on how to set it up?

Did you try "nagios distributed" in your search engine? You could read [0] to begin with.

[0] http://nagios.sourceforge.net/docs/3_0/distributed.html

-- 
Fly

From stanb at panix.com Mon Dec 13 22:26:48 2010
From: stanb at panix.com (stan)
Date: Mon, 13 Dec 2010 16:26:48 -0500
Subject: distributed nagios ?
In-Reply-To: <4D068557.7080601@kabano.net>
References: <20101213203459.GA13764@teddy.fas.com> <4D068557.7080601@kabano.net>
Message-ID: <20101213212648.GB15123@teddy.fas.com>

On Mon, Dec 13, 2010 at 09:43:03PM +0100, Flyinvap wrote:
> On 13/12/2010 21:34, stan wrote:
> > Any pointers to docs on how to set it up?
>
> Did you try "nagios distributed" in your search engine? You could
> read [0] to begin with.
>
> [0] http://nagios.sourceforge.net/docs/3_0/distributed.html
>

Thanks, that should help us get started.

-- 
A: Because it messes up the order in which people normally read text.
Q: Why is top-posting such a bad thing?
A: Top-posting.
Q: What is the most annoying thing in e-mail?

From James.Whittington at vc3.com Mon Dec 13 23:41:43 2010
From: James.Whittington at vc3.com (James Whittington)
Date: Mon, 13 Dec 2010 17:41:43 -0500
Subject: distributed nagios ?
In-Reply-To: <20101213212648.GB15123@teddy.fas.com>
References: <20101213203459.GA13764@teddy.fas.com> <4D068557.7080601@kabano.net> <20101213212648.GB15123@teddy.fas.com>
Message-ID: <9D512D9F7866FB458FBCA66B376FBF044EBAE668@VC3-EXCH-01.vc3.com>

Stan,

Some commercial packages with community open source versions can certainly ease the process of setting up distributed monitoring. In our case we picked Opsview - for its easy-to-use front end - with Nagios still on the backend - and a simplified distributed architecture:

http://www.opsview.com/downloads/download-opsview-community

There was a small learning curve, but with many great returns.

Good luck with the distributed monitoring,

James Whittington
VC3, Inc.

-----Original Message-----
From: stan [mailto:stanb at panix.com]
Sent: Monday, December 13, 2010 4:27 PM
To: Flyinvap
Cc: nagios-users at lists.sourceforge.net
Subject: Re: [Nagios-users] distributed nagios ?

On Mon, Dec 13, 2010 at 09:43:03PM +0100, Flyinvap wrote:
> On 13/12/2010 21:34, stan wrote:
> > Any pointers to docs on how to set it up?
>
> Did you try "nagios distributed"
> in your search engine? You could
> read [0] to begin with.
>
> [0] http://nagios.sourceforge.net/docs/3_0/distributed.html
>

Thanks, that should help us get started.

-- 
A: Because it messes up the order in which people normally read text.
Q: Why is top-posting such a bad thing?
A: Top-posting.
Q: What is the most annoying thing in e-mail?

From GJFRATER at bechtel.com Tue Dec 14 01:17:10 2010
From: GJFRATER at bechtel.com (Frater, Greg J)
Date: Mon, 13 Dec 2010 16:17:10 -0800
Subject: qpage - OT
Message-ID: <872CB0AEB377C240A112DD7C10B2592909A0CB87@wtps0171.amers.ibechtel.com>

Cheers All,

I know a number of people use qpage for sending alerts via modem as we do.
I'm hoping someone can help me with a qpage problem we're having. There does not seem to be a qpage mailing list, and as yet the developer has not responded to my email.

We are getting *random* alert pages that fail to send with the error below. I say random because I have not figured out any pattern or commonality between them (other than the obvious: they are all from Nagios, etc.). Roughly once a day a page will fail to send, but others before and after it will send fine; they all use the same send script. We've always had some problems, but they recently got worse after we replaced a failed PCI modem with a USB modem from US Robotics.

Is anyone else using USB modems, and if so, what brand/model? Has anyone seen problems like this? Is there a better forum or mailing list I could look to for help? Sorry for the off-topic post.

qpage error:
<502 MESSAGE REJECTED - STX OR EOT EXPECTED>

Thanks,
Greg Frater
System Admin

From benny at bennyvision.com Tue Dec 14 03:43:55 2010
From: benny at bennyvision.com (C.
Bensend)
Date: Mon, 13 Dec 2010 20:43:55 -0600
Subject: qpage - OT
In-Reply-To: <872CB0AEB377C240A112DD7C10B2592909A0CB87@wtps0171.amers.ibechtel.com>
References: <872CB0AEB377C240A112DD7C10B2592909A0CB87@wtps0171.amers.ibechtel.com>
Message-ID: <6162375473eb6aa87c524ebf6b1e9120.squirrel@webmail.stinkweasel.net>

> qpage error:
> <502 MESSAGE REJECTED - STX OR EOT EXPECTED>

It would have been nice to see your qpage.cf file... ;)

Be sure you have 'parity=even' in your config. When you run a test with the verbose and interactive flags set, does it fail five or six times before you get that message?

Benny

-- 
"I'm no meteorologist, but I'm pretty sure it's rainin' bitches!" -- Cleveland, "Family Guy"

From ae at op5.se Tue Dec 14 10:49:05 2010
From: ae at op5.se (Andreas Ericsson)
Date: Tue, 14 Dec 2010 10:49:05 +0100
Subject: distributed nagios ?
In-Reply-To: <20101213203459.GA13764@teddy.fas.com>
References: <20101213203459.GA13764@teddy.fas.com>
Message-ID: <4D073D91.5010003@op5.se>

On 12/13/2010 09:34 PM, stan wrote:
> I have a Nagios instance that currently monitors about 70 machines. Now I
> have an additional network coming online that will "hide" behind a firewall
> using NAT.
> It seems to me that the way to deal with this is to install
> Nagios on one of the machines behind the firewall, and have that instance
> report its results back to my master Nagios instance.
>
> I have tried a number of Google search terms, but have not come up with the
> one that describes this methodology. What is the Nagios terminology for
> this setup?
>

http://www.op5.org/community/plugin-inventory/op5-projects/merlin

> Any pointers to docs on how to set it up?
>

http://git.op5.org/git/?p=nagios/merlin.git;a=blob;f=HOWTO;hb=master
http://git.op5.org/git/?p=nagios/merlin.git;a=blob;f=README;hb=master
https://wiki.op5.org/merlin:start#guides

If I were you, I'd wait until tomorrow to install it, though, when 1.0.0 is released as stable. Reading up on the docs beforehand is still a good idea.

-- 
Andreas Ericsson andreas.ericsson at op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace.

From ihab24 at hotmail.com Tue Dec 14 11:45:54 2010
From: ihab24 at hotmail.com (Ihab Samara)
Date: Tue, 14 Dec 2010 12:45:54 +0200
Subject: Notfications control and grouping
In-Reply-To:
References: , , , , <31B0FE0A1A8166409E9DF35C6DEECB2405332E6B@WPSCV6MM.OPR.STATEFARM.ORG>, , , , <31B0FE0A1A8166409E9DF35C6DEECB24053330F2@WPSCV6MM.OPR.STATEFARM.ORG>, , , , , , ,
Message-ID:

Hi List,

We've got a relatively big environment, where Nagios is monitoring about 100 hosts and checking about 800 services. My question is about notifications. I am looking for a solution that will do the following for us:

1) After setting a dependency between several services, a notification is sent for only one of them. I need that notification to mention that the other services are CRITICAL as well.

2) When multiple hosts in the same host group are down at the same time, we get a notification for each one of them. We want to group these into one notification.

3) Limit notifications to a specific number within a period of time; for example, within 5 minutes, no more than 10 notifications can be sent.

Any thoughts will be helpful and appreciated.

Thanks
Ihab
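
Editor's note on item 3 above: the requested limit (at most 10 notifications within 5 minutes) can be expressed as a sliding-window counter. The following is only a minimal Python sketch of that logic, not a Nagios feature and not something proposed in the thread; the class name `NotificationLimiter` and its parameters are hypothetical, and a real setup would need to wrap the notification command and persist state between invocations.

```python
from collections import deque


class NotificationLimiter:
    """Hypothetical sliding-window limiter: allow at most max_count
    notifications in any window of window_seconds."""

    def __init__(self, max_count=10, window_seconds=300):
        self.max_count = max_count
        self.window_seconds = window_seconds
        self._sent = deque()  # timestamps of notifications already allowed

    def allow(self, now):
        """Return True if a notification at time `now` (seconds) may be sent."""
        # Forget notifications that have fallen out of the window.
        while self._sent and now - self._sent[0] >= self.window_seconds:
            self._sent.popleft()
        if len(self._sent) < self.max_count:
            self._sent.append(now)
            return True
        return False
```

A wrapper script could call `allow(time.time())` before invoking the real notification command and drop or queue the notification when it returns False.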

From mad at b-care.net Tue Dec 14 15:11:25 2010
From: mad at b-care.net (Marc-André Doll)
Date: Tue, 14 Dec 2010 15:11:25 +0100
Subject: JVM Monitoring
Message-ID: <1292335885.1679.12.camel@MADness>

Hi list,

I have to monitor some JVMs and I can't find plugins that fit exactly what I want/imagine.

I could use check_jmx, but I don't really want to install a JRE on my Nagios server.

Currently, I'm monitoring Tomcat servers with check_jmx4perl and I'm quite happy with it. Is it possible to configure/tweak the JVM or the J4P war to use it on a non-JEE server? Or am I doomed to install Java on my monitoring server?

Thanks for your help.

From ej_seg at hotmail.com Tue Dec 14 15:42:09 2010
From: ej_seg at hotmail.com (Rikard Dahlberg)
Date: Tue, 14 Dec 2010 14:42:09 +0000
Subject: Nagiosgraper
Message-ID:

Heya!

I'm currently trying to get a decent graphing system online to somewhat replace our Cacti. I know nagiosgraph doesn't really do that, but it does fit our needs :)

I'm trying to rewrite this config file to create nice shiny graphs for my monitored services, but so far no luck. This is the template for ping; if I wanted to change it to check, say, CPU load, how would I do so? Which variables do I need to change? That is what I can't figure out.
Worth knowing is that more or less our whole system is built on passive checks, which now (thanks a lot to Greg on this mailing list) work flawlessly. I have also edited nagios.cfg to enable performance graphing, so that is OK :)

#---
# NagiosGrapher Template for check_ping
# Author: Marius Hein
#---

define ngraph{
    service_name        PING
    graph_log_regex     loss = (\d+)
    graph_value         Loss
    graph_units         %
    graph_legend        Loss
    graph_legend_eol    none
    page                2 Loss
    rrd_plottype        LINE2
    rrd_color           ff0000
}

# Ping DEF RTA
define ngraph{
    service_name        PING
    graph_log_regex     rta = (\d+[,\.]\d+)
    graph_value         RTA
    graph_units         ms
    graph_legend        RTA
    page                1 RTA
    rrd_plottype        AREA
    rrd_color           00a000
}

# Ping VDEF, Average RTA
define ngraph{
    service_name        PING
    type                VDEF
    graph_value         vdef_ping_average
    graph_units
    graph_legend        RTA Average
    graph_calc          RTA,AVERAGE
    graph_legend_eol    LEFT
    page                1 RTA
    rrd_plottype        LINE1
    rrd_color           0000ff
    hide                no
}

From matlnx1983 at gmail.com Tue Dec 14 16:01:06 2010
From: matlnx1983 at gmail.com (Matias Damian)
Date: Tue, 14 Dec 2010 12:01:06 -0300
Subject: Cisco Monitoring Problem - Interface Down state not recognize
Message-ID:

Hi,

I have a problem when I try to monitor a Cisco device (a 2960 switch).
With my Nagios configuration, the CRITICAL state is never shown when the port/interface is down, because Nagios always receives an SNMP response and treats it as OK.

Doing some research, I found the following. When I run the same command that Nagios uses by hand from the Linux server's terminal, the device sends me the messages below, and Nagios does not know how to interpret them.

root at NAGIOS:# /usr/local/nagios/libexec/check_snmp -C COMUNIDAD -o ifOperStatus.10001 -r 1 -m RFC1213-MIB -H IP_HOST

If the interface is OK:

SNMP OK - 1 | RFC1213-MIB::ifOperStatus.10001=1

But if the interface is down:

SNMP OK - 2 | RFC1213-MIB::ifOperStatus.10002=2

So, as I understand it, 1 is "OK" and 2 is "down"; however, Nagios reports "OK" in both cases.

Thanks in advance for the help, and sorry for my English.

Matias (matlnx)

-- 
MatLnx at LU9CBL
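
Editor's note: the symptom described above is that the check reports OK whenever the SNMP query succeeds, whatever value comes back. A check has to map the returned ifOperStatus value to a plugin exit code itself. The following is a minimal Python sketch of that mapping, assuming the numeric value has already been fetched; the function name `if_oper_status_to_nagios` is hypothetical, and the meanings 1 = up, 2 = down, 3 = testing follow the standard MIB definition of ifOperStatus. It does not show how to configure check_snmp.

```python
# Standard Nagios plugin exit codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3

# ifOperStatus values as defined in RFC1213-MIB.
IF_OPER_STATUS = {1: "up", 2: "down", 3: "testing"}


def if_oper_status_to_nagios(value):
    """Map a numeric ifOperStatus value to (exit_code, message)."""
    name = IF_OPER_STATUS.get(value)
    if name == "up":
        return OK, "OK - interface is up (1)"
    if name == "down":
        return CRITICAL, "CRITICAL - interface is down (2)"
    if name == "testing":
        return WARNING, "WARNING - interface is in testing mode (3)"
    # Any other value is not covered by this sketch.
    return UNKNOWN, "UNKNOWN - unexpected ifOperStatus value %r" % (value,)
```

With this mapping, the value 2 from the second output above yields exit code 2 (CRITICAL) instead of OK.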

From Bret.Goodfellow at questar.com Tue Dec 14 16:12:24 2010
From: Bret.Goodfellow at questar.com (Bret Goodfellow)
Date: Tue, 14 Dec 2010 15:12:24 +0000
Subject: check_logs.pl doesn't return output on RHEL 6
In-Reply-To: <940b39dba85aeeb5ba6195b43cce98a2@vps517.directvps.nl>
References: <534585C15C30B048B7A3A335F9B6C95C0BAD1EE1@SLCEXMB02.corp.questar.com> <940b39dba85aeeb5ba6195b43cce98a2@vps517.directvps.nl>
Message-ID: <534585C15C30B048B7A3A335F9B6C95C0BAD268C@SLCEXMB02.corp.questar.com>

Thanks Hugo. Yes, I have sent an email to the author but haven't heard back yet. I have also run the script by hand; the shell output is below:

[root at server ~]# cd /usr/lib64/nagios/plugins/
[root at server plugins]# ./check_logs.pl -c /etc/nagios/check_logs_linux.cfg
[root at server plugins]#

It doesn't fail, but it also doesn't return anything. This script runs successfully on RHEL 4 and RHEL 5. If anyone has experienced this on RHEL 6, please let me know.

From: Hugo van der Kooij [mailto:hvdkooij at vanderkooij.org]
Sent: Monday, December 13, 2010 4:59 AM
To: Nagios Users List
Subject: Re: [Nagios-users] check_logs.pl doesn't return output on RHEL 6

On Fri, 10 Dec 2010 22:39:04 +0000, Bret Goodfellow wrote:

> December 10, 2010. Just installed Red Hat EL 6 with Nagios. The plugin
> check_logs.pl returns no output upon execution. Under Red Hat EL 5,
> everything works great. Sorry about the lack of input here, but the
> simple answer is that no output is returned. All other plugins so far
> work fine.

It might be a Perl script that contains code that is not compatible with the Perl version included in RHEL 6.

Did you contact the $AUTHOR? What happens if you run it by hand?

Hugo.
-- 
hvdkooij at vanderkooij.org http://hugo.vanderkooij.org/
PGP/GPG? Use: http://hugo.vanderkooij.org/0x58F19981.asc

From hvdkooij at vanderkooij.org Tue Dec 14 16:46:32 2010
From: hvdkooij at vanderkooij.org (Hugo van der Kooij)
Date: Tue, 14 Dec 2010 16:46:32 +0100
Subject: check_logs.pl doesn't return output on RHEL 6
In-Reply-To: <534585C15C30B048B7A3A335F9B6C95C0BAD268C@SLCEXMB02.corp.questar.com>
References: <534585C15C30B048B7A3A335F9B6C95C0BAD1EE1@SLCEXMB02.corp.questar.com> <940b39dba85aeeb5ba6195b43cce98a2@vps517.directvps.nl> <534585C15C30B048B7A3A335F9B6C95C0BAD268C@SLCEXMB02.corp.questar.com>
Message-ID: <9f31bb826ec096d02969a9abc69ae229@vps517.directvps.nl>

On Tue, 14 Dec 2010 15:12:24 +0000, Bret Goodfellow wrote:

> [root at server ~]# cd /usr/lib64/nagios/plugins/
> [root at server plugins]# ./check_logs.pl -c /etc/nagios/check_logs_linux.cfg
> [root at server plugins]#

I strongly suspect that this is a Perl issue, as RHEL 6 uses a newer Perl version that is not entirely identical to or compatible with the one used by RHEL 4 and RHEL 5.

Are you sure the other servers are x64 as well, BTW?

Hugo.
-- 
hvdkooij at vanderkooij.org http://hugo.vanderkooij.org/
PGP/GPG? Use: http://hugo.vanderkooij.org/0x58F19981.asc

From daniel.wittenberg.r0ko at statefarm.com Tue Dec 14 17:08:35 2010
From: daniel.wittenberg.r0ko at statefarm.com (Daniel Wittenberg)
Date: Tue, 14 Dec 2010 09:08:35 -0700
Subject: Return code of 127 is out of bounds - only on high cpu load though
In-Reply-To: <9f31bb826ec096d02969a9abc69ae229@vps517.directvps.nl>
References: <534585C15C30B048B7A3A335F9B6C95C0BAD1EE1@SLCEXMB02.corp.questar.com><940b39dba85aeeb5ba6195b43cce98a2@vps517.directvps.nl><534585C15C30B048B7A3A335F9B6C95C0BAD268C@SLCEXMB02.corp.questar.com> <9f31bb826ec096d02969a9abc69ae229@vps517.directvps.nl>
Message-ID: <31B0FE0A1A8166409E9DF35C6DEECB240542485D@WPSCV6MM.OPR.STATEFARM.ORG>

I noticed something odd the other day while stressing my servers. When I overload a server with too many hosts/checks, I start getting active check failures with the standard 127 code. But if I slowly reduce the number of hosts/checks, I'll get to a point where it starts working again. Has anyone else seen this, where "plugin may be missing" really isn't the problem and it only happens under high load?

Dan

From ae at op5.se Tue Dec 14 17:12:16 2010
From: ae at op5.se (Andreas Ericsson)
Date: Tue, 14 Dec 2010 17:12:16 +0100
Subject: Return code of 127 is out of bounds - only on high cpu load though
In-Reply-To: <31B0FE0A1A8166409E9DF35C6DEECB240542485D@WPSCV6MM.OPR.STATEFARM.ORG>
References: <534585C15C30B048B7A3A335F9B6C95C0BAD1EE1@SLCEXMB02.corp.questar.com><940b39dba85aeeb5ba6195b43cce98a2@vps517.directvps.nl><534585C15C30B048B7A3A335F9B6C95C0BAD268C@SLCEXMB02.corp.questar.com> <9f31bb826ec096d02969a9abc69ae229@vps517.directvps.nl> <31B0FE0A1A8166409E9DF35C6DEECB240542485D@WPSCV6MM.OPR.STATEFARM.ORG>
Message-ID: <4D079760.4000404@op5.se>

On 12/14/2010 05:08 PM, Daniel Wittenberg wrote:
> I noticed something odd the other day while stressing my servers. I
> noticed that when I overload it with too many hosts/checks, that I
> start getting active check failures with the standard 127 code. But,
> if I slowly reduce the number of hosts/checks, I'll get to a point
> where it starts working again. Anyone else seen this, where the
> "plugin may be missing" really isn't the problem and only on high
> load?
>

Does it happen for all checks or just checks of a certain type? Some checks share resources in a way that causes them to bomb out when too many of them are run simultaneously.
One possible error in such cases is the infamous 127 error. If it happens with compiled plugins from the standard plugins package, that's not the cause, though, as they're all fairly well behaved.

-- 
Andreas Ericsson andreas.ericsson at op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace.

From daniel.wittenberg.r0ko at statefarm.com Tue Dec 14 17:14:33 2010
From: daniel.wittenberg.r0ko at statefarm.com (Daniel Wittenberg)
Date: Tue, 14 Dec 2010 09:14:33 -0700
Subject: Return code of 127 is out of bounds - only on high cpu load though
In-Reply-To: <4D079760.4000404@op5.se>
References: <534585C15C30B048B7A3A335F9B6C95C0BAD1EE1@SLCEXMB02.corp.questar.com><940b39dba85aeeb5ba6195b43cce98a2@vps517.directvps.nl><534585C15C30B048B7A3A335F9B6C95C0BAD268C@SLCEXMB02.corp.questar.com> <9f31bb826ec096d02969a9abc69ae229@vps517.directvps.nl> <31B0FE0A1A8166409E9DF35C6DEECB240542485D@WPSCV6MM.OPR.STATEFARM.ORG> <4D079760.4000404@op5.se>
Message-ID: <31B0FE0A1A8166409E9DF35C6DEECB2405424884@WPSCV6MM.OPR.STATEFARM.ORG>

Yeah, the only two I'm testing with are check_nrpe and check_tcp, and all of them, on every server, start failing. Any idea what kind of shared resources it might be starving?
Dan -----Original Message----- From: Andreas Ericsson [mailto:ae at op5.se] Sent: Tuesday, December 14, 2010 10:12 AM To: Nagios Users List Cc: Daniel Wittenberg Subject: Re: [Nagios-users] Return code of 127 is out of bounds - only on high cpu load though On 12/14/2010 05:08 PM, Daniel Wittenberg wrote: > I noticed something odd the other day while stressing my servers. I > noticed that when I overload it with too many hosts/checks, that I > start getting active check failures with the standard 127 code. But, > if I slowly reduce the number of hosts/checks, I?ll get to a point > where it starts working again. Anyone else seen this, where the > ?plugin may be missing? really isn?t the problem and only on high > load? > Does it happpen for all checks or just checks of a certain type? Some checks share resources in a way that cause them to bomb out when too many of them are run simultaneously. One possible error in such cases is the infamous 127 error. If it happens with compiled plugins from the standard plugins package, that's not it though as they're all fairly well behaved. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace. ------------------------------------------------------------------------------ Lotusphere 2011 Register now for Lotusphere 2011 and learn how to connect the dots, take your collaborative environment to the next level, and enter the era of Social Business. http://p.sf.net/sfu/lotusphere-d2d _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From jim at jimavery.me.uk Tue Dec 14 18:04:12 2010
From: jim at jimavery.me.uk (Jim Avery)
Date: Tue, 14 Dec 2010 17:04:12 +0000
Subject: JVM Monitoring
In-Reply-To: <1292335885.1679.12.camel@MADness>
References: <1292335885.1679.12.camel@MADness>
Message-ID:

On 14 December 2010 14:11, Marc-André Doll wrote:
> Hi list,
>
> I have to monitor some JVM and I don't find plugins that fit exactly
> with what I want/imagine.
>
> I could use the check_jmx but I don't really want to install a JRE on my
> Nagios server.
>
> Currently, I'm monitoring Tomcat servers with check_jmx4perl and I'm
> quite happy with it. Is it possible to configure/tweak the JVM or the
> J4P war to use it on a non-JEE server? Or am I doomed to install java on
> my monitoring server?

I would think you could continue to use check_jmx4perl on the Tomcat server and get it to send the results back to Nagios as a passive check using send_nsca. You can either use the send_nsca which is built in to NSClient++, or there is a standalone binary send_nsca which is quite easy to use ( http://exchange.nagios.org/directory/Addons/Passive-Checks/NSCA-Win32-Client/details ). You will of course need to configure the nsca daemon on your Nagios server if you haven't done that already.

hth,
Jim

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null

From ae at op5.se Tue Dec 14 18:21:51 2010
From: ae at op5.se (Andreas Ericsson)
Date: Tue, 14 Dec 2010 18:21:51 +0100
Subject: Return code of 127 is out of bounds - only on high cpu load though
In-Reply-To: <31B0FE0A1A8166409E9DF35C6DEECB2405424884@WPSCV6MM.OPR.STATEFARM.ORG>
References: <534585C15C30B048B7A3A335F9B6C95C0BAD1EE1@SLCEXMB02.corp.questar.com><940b39dba85aeeb5ba6195b43cce98a2@vps517.directvps.nl><534585C15C30B048B7A3A335F9B6C95C0BAD268C@SLCEXMB02.corp.questar.com> <9f31bb826ec096d02969a9abc69ae229@vps517.directvps.nl> <31B0FE0A1A8166409E9DF35C6DEECB240542485D@WPSCV6MM.OPR.STATEFARM.ORG> <4D079760.4000404@op5.se> <31B0FE0A1A8166409E9DF35C6DEECB2405424884@WPSCV6MM.OPR.STATEFARM.ORG>
Message-ID: <4D07A7AF.5000002@op5.se>

On 12/14/2010 05:14 PM, Daniel Wittenberg wrote:
> Yeah, the only two I'm testing with are check_nrpe and check_tcp, and
> it's all of them on every server that start failing. Any idea what
> kind of shared resources it might be starving?
>

Not those two, no. They should be fairly well behaved, unless you're using check_tcp with ssl and your ssl library is acting up. I can't see how that could be happening though. What does the debug log tell you?

--
Andreas Ericsson andreas.ericsson at op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace.

From daniel.wittenberg.r0ko at statefarm.com Tue Dec 14 18:39:11 2010
From: daniel.wittenberg.r0ko at statefarm.com (Daniel Wittenberg)
Date: Tue, 14 Dec 2010 10:39:11 -0700
Subject: Return code of 127 is out of bounds - only on high cpu load though
In-Reply-To: <4D07A7AF.5000002@op5.se>
References: <534585C15C30B048B7A3A335F9B6C95C0BAD1EE1@SLCEXMB02.corp.questar.com><940b39dba85aeeb5ba6195b43cce98a2@vps517.directvps.nl><534585C15C30B048B7A3A335F9B6C95C0BAD268C@SLCEXMB02.corp.questar.com> <9f31bb826ec096d02969a9abc69ae229@vps517.directvps.nl> <31B0FE0A1A8166409E9DF35C6DEECB240542485D@WPSCV6MM.OPR.STATEFARM.ORG> <4D079760.4000404@op5.se> <31B0FE0A1A8166409E9DF35C6DEECB2405424884@WPSCV6MM.OPR.STATEFARM.ORG> <4D07A7AF.5000002@op5.se>
Message-ID: <31B0FE0A1A8166409E9DF35C6DEECB24054249CC@WPSCV6MM.OPR.STATEFARM.ORG>

I ran a full strace of the nagios daemon and its children, and it looks like enable_environment_macros was the cause:

[pid 20478] execve("/bin/sh", ["sh", "-c", . . . . . ] = -1 E2BIG (Argument list too long) <0.000337>
[pid 20478] exit_group(127) = ?

I turned them off and that fixes things. It potentially breaks some other things in our setup, but for right now it's working. Still odd that as I change the number of hosts the env changes enough to push it over the edge. I ran the same tests with only 300 hosts and it worked fine; it just looks like at about 500 or so something changes in the env.
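For anyone chasing the same symptom, here is a minimal sketch of how an oversized environment makes the exec of /bin/sh fail with E2BIG before any plugin runs, which is the failure in the strace output above (the child then exits with 127). It assumes Python 3 on a Unix-like host; the helper name try_exec_with_env and the variable name NAGIOS_FAKE_MEMBERS are made up for illustration and do not come from Nagios.

```python
import errno
import os
import subprocess


def try_exec_with_env(extra_env_bytes: int) -> int:
    """Run `/bin/sh -c 'exit 0'` with one padded environment variable.

    Returns 0 if the shell could be started, otherwise the errno of the
    failed exec (E2BIG when arguments plus environment are too large).
    """
    env = {
        "PATH": os.environ.get("PATH", "/bin:/usr/bin"),
        # Hypothetical stand-in for one very large macro value.
        "NAGIOS_FAKE_MEMBERS": "h" * extra_env_bytes,
    }
    try:
        subprocess.run(["/bin/sh", "-c", "exit 0"], env=env, check=False)
    except OSError as exc:
        # The exec itself failed; the shell never ran.
        return exc.errno
    return 0


if __name__ == "__main__":
    for size in (1024, 16 * 1024 * 1024):
        result = try_exec_with_env(size)
        label = "ok" if result == 0 else errno.errorcode[result]
        print(f"{size} bytes of extra environment: {label}")
```

The sizes are only illustrative: the real threshold depends on the kernel and the stack limit, so a setup can sit just under it with 300 hosts and tip over somewhere around 500.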
Always something fun in nagiosland :) Dan -----Original Message----- From: Andreas Ericsson [mailto:ae at op5.se] Sent: Tuesday, December 14, 2010 11:22 AM To: Nagios Users List Cc: Daniel Wittenberg Subject: Re: [Nagios-users] Return code of 127 is out of bounds - only on high cpu load though On 12/14/2010 05:14 PM, Daniel Wittenberg wrote: > Yeah, the only two I'm testing with are check_nrpe and check_tcp, and > it's all of them on every server that start failing. Any idea what > kind of shared resources it might be starving? > Not those two, no. They should be fairly well behaved, unless you're using check_tcp with ssl and your ssl library is acting up. I can't see how that could be happening though. What does the debug log tell you? -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace. ------------------------------------------------------------------------------ Lotusphere 2011 Register now for Lotusphere 2011 and learn how to connect the dots, take your collaborative environment to the next level, and enter the era of Social Business. http://p.sf.net/sfu/lotusphere-d2d _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mark.frost1 at pepsico.com Tue Dec 14 20:39:03 2010 From: mark.frost1 at pepsico.com (Frost, Mark {PBC}) Date: Tue, 14 Dec 2010 14:39:03 -0500 Subject: distributed nagios ? 
In-Reply-To: <4D073D91.5010003@op5.se> References: <20101213203459.GA13764@teddy.fas.com> <4D073D91.5010003@op5.se> Message-ID: > -----Original Message----- > From: Andreas Ericsson [mailto:ae at op5.se] > Sent: Tuesday, December 14, 2010 4:49 AM > To: nagios List; docrtp at yahoo.com > Subject: Re: [Nagios-users] distributed nagios ? > >> Any pointers to docs on how to set it up? >> > > http://git.op5.org/git/?p=nagios/merlin.git;a=blob;f=HOWTO;hb=master > http://git.op5.org/git/?p=nagios/merlin.git;a=blob;f=README;hb=master > https://wiki.op5.org/merlin:start#guides > > If I were you, I'd wait til tomorrow with installing it though, when 1.0.0 > is released as stable. Reading up on the docs and whatnot beforehand is > still a good idea though. > > -- > Andreas Ericsson andreas.ericsson at op5.se > OP5 AB www.op5.se > Tel: +46 8-230225 Fax: +46 8-230231 Hooray! Actually, I wanted to point out a few things I found when building the most recent version of merlin recently. At the heart of my issues is that our team is not allowed root access on these servers (long boring corporate story...) so I'm installing everything in an alternate tree. 1) There are a couple of hard-coded paths in ipc.c and node.c for the socket and the binlogs. I'm assuming that's intentional, but it does mean one has to manually edit the source files to point to different paths rather than specifying anything like that during the build process. 2) Because we're trying to put all the files into an alternate tree, the installation of 'mon' from install-merlin.sh didn't really work right. In our case, it made a lot more sense to change cp apps/mon.py $root_path/usr/bin/mon to cp apps/mon.py $bindir/mon otherwise it would put 'mon' in a really weird spot. I'm guessing these are design decisions on your part, but in case they're not, I thought I'd point them out. 
Thanks Mark ------------------------------------------------------------------------------ Lotusphere 2011 Register now for Lotusphere 2011 and learn how to connect the dots, take your collaborative environment to the next level, and enter the era of Social Business. http://p.sf.net/sfu/lotusphere-d2d _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Bret.Goodfellow at questar.com Tue Dec 14 22:57:03 2010 From: Bret.Goodfellow at questar.com (Bret Goodfellow) Date: Tue, 14 Dec 2010 21:57:03 +0000 Subject: check_logs.pl doesn't return output on RHEL 6 In-Reply-To: <9f31bb826ec096d02969a9abc69ae229@vps517.directvps.nl> References: <534585C15C30B048B7A3A335F9B6C95C0BAD1EE1@SLCEXMB02.corp.questar.com> <940b39dba85aeeb5ba6195b43cce98a2@vps517.directvps.nl> <534585C15C30B048B7A3A335F9B6C95C0BAD268C@SLCEXMB02.corp.questar.com> <9f31bb826ec096d02969a9abc69ae229@vps517.directvps.nl> Message-ID: <534585C15C30B048B7A3A335F9B6C95C0BAD49ED@SLCEXMB02.corp.questar.com> Yes, my servers are x64. The perl release on RHEL4 is v5.8.5, and the perl release on RHEL5 is v5.8.8, and the perl release on RHEL6 is v5.10.1. From: Hugo van der Kooij [mailto:hvdkooij at vanderkooij.org] Sent: Tuesday, December 14, 2010 8:47 AM To: Nagios Users List Subject: Re: [Nagios-users] check_logs.pl doesn't return output on RHEL 6 On Tue, 14 Dec 2010 15:12:24 +0000, Bret Goodfellow wrote: [root at server ~]# cd /usr/lib64/nagios/plugins/ [root at server plugins]# ./check_logs.pl -c /etc/nagios/check_logs_linux.cfg [root at server plugins]# I strongly suspect that this is a perl issue as RHEL 6 is using a newer perl version that is not entirely identical or compatible to the one used by RHEL 4 and RHEL 5. 
Are you sure the other servers are x64 as well BTW? Hugo. -- hvdkooij at vanderkooij.org http://hugo.vanderkooij.org/ PGP/GPG? Use: http://hugo.vanderkooij.org/0x58F19981.asc -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Lotusphere 2011 Register now for Lotusphere 2011 and learn how to connect the dots, take your collaborative environment to the next level, and enter the era of Social Business. http://p.sf.net/sfu/lotusphere-d2d -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Wed Dec 15 10:46:29 2010 From: ae at op5.se (Andreas Ericsson) Date: Wed, 15 Dec 2010 10:46:29 +0100 Subject: distributed nagios ? In-Reply-To: References: <20101213203459.GA13764@teddy.fas.com> <4D073D91.5010003@op5.se> Message-ID: <4D088E75.2050308@op5.se> On 12/14/2010 08:39 PM, Frost, Mark {PBC} wrote: > >> -----Original Message----- >> From: Andreas Ericsson [mailto:ae at op5.se] >> Sent: Tuesday, December 14, 2010 4:49 AM >> To: nagios List; docrtp at yahoo.com >> Subject: Re: [Nagios-users] distributed nagios ? >> >>> Any pointers to docs on how to set it up? >>> >> >> http://git.op5.org/git/?p=nagios/merlin.git;a=blob;f=HOWTO;hb=master >> http://git.op5.org/git/?p=nagios/merlin.git;a=blob;f=README;hb=master >> https://wiki.op5.org/merlin:start#guides >> >> If I were you, I'd wait til tomorrow with installing it though, when 1.0.0 >> is released as stable. Reading up on the docs and whatnot beforehand is >> still a good idea though. 
>> >> -- >> Andreas Ericsson andreas.ericsson at op5.se >> OP5 AB www.op5.se >> Tel: +46 8-230225 Fax: +46 8-230231 > > Hooray! > > Actually, I wanted to point out a few things I found when building the > most recent version of merlin recently. At the heart of my issues > is that our team is not allowed root access on these servers (long boring > corporate story...) so I'm installing everything in an alternate tree. > > 1) There are a couple of hard-coded paths in ipc.c and node.c for > the socket and the binlogs. I'm assuming that's intentional, but it > does mean one has to manually edit the source files to point to different > paths rather than specifying anything like that during the build process. > The socket location can be configured. Binlogs cannot. I'll amend that in the next release though. The core functionality is there, but there's no option to set it in the config files, which is kinda stupid. > 2) Because we're trying to put all the files into an alternate tree, the > installation of 'mon' from install-merlin.sh didn't really work right. In > our case, it made a lot more sense to change > > cp apps/mon.py $root_path/usr/bin/mon > > to > > cp apps/mon.py $bindir/mon > > otherwise it would put 'mon' in a really weird spot. > > I'm guessing these are design decisions on your part, but in case they're > not, I thought I'd point them out. > Yes. The install-merlin.sh script is designed to be usable from the rpm spec file, and it's meant to aid people who want to install everything in its default location. Would $root_path/$bindir/mon work for you? Since you can set $root_path to whatever you want, I suppose it should. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace. 
From ej_seg at hotmail.com Wed Dec 15 10:48:39 2010
From: ej_seg at hotmail.com (Rikard Dahlberg)
Date: Wed, 15 Dec 2010 09:48:39 +0000
Subject: Nagiosgraper
In-Reply-To:
References: , , ,
Message-ID:

Right, pnp4nagios seems slicker. However, after I installed it, I got this error. Did you get that also? If so, how did you work around it?

"PNP Error
Please check the Kohana documentation for information about the following error.
application/models/data.php [104]:
perfdata directory "/usr/local/pnp4nagios/var/perfdata/" is empty. Please check your Nagios config. Read FAQ online"

/Rik

From: PWilliamson at twgi.net
To: ej_seg at hotmail.com
Subject: RE: [Nagios-users] Nagiosgraper
Date: Tue, 14 Dec 2010 14:56:05 +0000

Just the normal documentation. I just need to "turn off" nagiosgrapher now. Setup on pnp4n is much nicer.

From: Rikard Dahlberg [mailto:ej_seg at hotmail.com]
Sent: Tuesday, December 14, 2010 9:51 AM
To: Paul Williamson
Subject: RE: [Nagios-users] Nagiosgraper

Thanks Paul, I might just look into that. Did you follow a specific guide or just the normal documentation?

/Rik

From: PWilliamson at twgi.net
To: ej_seg at hotmail.com
Subject: RE: [Nagios-users] Nagiosgraper
Date: Tue, 14 Dec 2010 14:52:04 +0000

I just went down the same road, and I think you'd be better served to go with pnp4nagios. It's what the "official" Nagios commercial distribution uses, so there's more hope for longer term support. Plus, setting up nagiosgrapher, while not bad, is way more involved than pnp4nagios.

Paul

From: Rikard Dahlberg [mailto:ej_seg at hotmail.com]
Sent: Tuesday, December 14, 2010 9:42 AM
To: nagios-users at lists.sourceforge.net
Subject: [Nagios-users] Nagiosgraper

Heya!

I'm currently trying to get a decent graphing system online to somewhat replace our Cacti. I know nagiosgraph doesn't really do that but it does fit our needs :)

I'm trying to rewrite this config file to create nice shiny graphs for my monitored services, but so far no luck. This is the template for ping, but if I wanted to change it to check, say, CPU load, how would I do so? Which variables do I need to change? This is what I can't figure out.

Worth knowing is that more or less our whole system is built on passive checks that now (thanks a lot to Greg on this mailing list) work flawlessly. I have also edited nagios.cfg to enable performance graphing, so that is OK :)

#---
# NagiosGrapher Template for check_ping
# Author: Marius Hein
#---
define ngraph{
        service_name            PING
        graph_log_regex         loss = (\d+)
        graph_value             Loss
        graph_units             %
        graph_legend            Loss
        graph_legend_eol        none
        page                    2 Loss
        rrd_plottype            LINE2
        rrd_color               ff0000
}

# Ping DEF RTA
define ngraph{
        service_name            PING
        graph_log_regex         rta = (\d+[,\.]\d+)
        graph_value             RTA
        graph_units             ms
        graph_legend            RTA
        page                    1 RTA
        rrd_plottype            AREA
        rrd_color               00a000
}

# Ping VDEF, Average RTA
define ngraph{
        service_name            PING
        type                    VDEF
        graph_value             vdef_ping_average
        graph_units
        graph_legend            RTA Average
        graph_calc              RTA,AVERAGE
        graph_legend_eol        LEFT
        page                    1 RTA
        rrd_plottype            LINE1
        rrd_color               0000ff
        hide                    no
}

-------------- next part --------------
An HTML attachment was scrubbed...
URL: -------------- next part -------------- ------------------------------------------------------------------------------ Lotusphere 2011 Register now for Lotusphere 2011 and learn how to connect the dots, take your collaborative environment to the next level, and enter the era of Social Business. http://p.sf.net/sfu/lotusphere-d2d -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Wed Dec 15 10:51:04 2010 From: ae at op5.se (Andreas Ericsson) Date: Wed, 15 Dec 2010 10:51:04 +0100 Subject: Return code of 127 is out of bounds - only on high cpu load though In-Reply-To: <31B0FE0A1A8166409E9DF35C6DEECB24054249CC@WPSCV6MM.OPR.STATEFARM.ORG> References: <534585C15C30B048B7A3A335F9B6C95C0BAD1EE1@SLCEXMB02.corp.questar.com><940b39dba85aeeb5ba6195b43cce98a2@vps517.directvps.nl><534585C15C30B048B7A3A335F9B6C95C0BAD268C@SLCEXMB02.corp.questar.com> <9f31bb826ec096d02969a9abc69ae229@vps517.directvps.nl> <31B0FE0A1A8166409E9DF35C6DEECB240542485D@WPSCV6MM.OPR.STATEFARM.ORG> <4D079760.4000404@op5.se> <31B0FE0A1A8166409E9DF35C6DEECB2405424884@WPSCV6MM.OPR.STATEFARM.ORG> <4D07A7AF.5000002@op5.se> <31B0FE0A1A8166409E9DF35C6DEECB24054249CC@WPSCV6MM.OPR.STATEFARM.ORG> Message-ID: <4D088F88.8060409@op5.se> On 12/14/2010 06:39 PM, Daniel Wittenberg wrote: > I ran a full strace of nagios daemon and children and it looks like it > was the enable_environment_macros that was causing: > > [pid 20478] execve("/bin/sh", ["sh", "-c", . . . . . ] = -1 E2BIG > (Argument list too long)<0.000337> > [pid 20478] exit_group(127) = ? > > I turned them off and that fixes things, but potentially breaks some > other things in our setup, but for right now it's working. 
Still odd > that as I change the number hosts the env changes enough to push over > the edge. I ran same tests with only 300 hosts and it worked fine, just > looks like at about 500 or so something changes in env. > Ah. One of the environment macros contains a list of all hostgroups the host is a member of, and unless I'm mistaken, a list of all member hosts of the "first hostgroup" that the host is a member of. Those lists can be huge so they can quickly fill up the environment variables. My guess is that you pushed it over the limit when you added more services. It's the same with 'servicegroups' and 'servicegroup members' for services btw. In short; Don't enable environment variables. In fact, I think I'll add a deprecation notice for it in the code and warn people that it'll be removed in early 2012, or at least modified so that the superhuge lists are no longer created as environment variables no matter if they're enabled or not. That should alleviate problems like this, which are quite frustrating to track down. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace. ------------------------------------------------------------------------------ Lotusphere 2011 Register now for Lotusphere 2011 and learn how to connect the dots, take your collaborative environment to the next level, and enter the era of Social Business. http://p.sf.net/sfu/lotusphere-d2d _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From jim at jimavery.me.uk Wed Dec 15 14:06:10 2010
From: jim at jimavery.me.uk (Jim Avery)
Date: Wed, 15 Dec 2010 13:06:10 +0000
Subject: Cisco Monitoring Problem - Interface Down state not recognize
In-Reply-To:
References:
Message-ID:

On 14 December 2010 15:01, Matias Damian wrote:
> Hi, I have a problem when I want to monitor a Cisco device (Switch 2960).
> When I configured Nagios, it never shows the CRITICAL state when the port /
> interface is down, because it always receives an SNMP message saying it's OK.
>
> Doing some research, I see this. When I run, from the terminal of the Linux
> server, the command with the string that Nagios uses "hardcoded" via SNMP,
> the device sends me this message, and Nagios doesn't know how to interpret it.
>
> root at NAGIOS:# /usr/local/nagios/libexec/check_snmp -C COMUNIDAD -o
> ifOperStatus.10001 -r 1 -m RFC1213-MIB -H IP_HOST
>
> If the interface is OK:
> SNMP OK - 1 | RFC1213-MIB::ifOperStatus.10001=1
>
> But if the interface is down:
> SNMP OK - 2 | RFC1213-MIB::ifOperStatus.10002=2
>
> So, I understand that 1 is "OK" and 2 is "down"; however, Nagios
> reports "OK" in both cases.
>
> Thanks in advance for the help, and sorry for my English.
> Matias (matlnx)

You need to specify the warning and critical ranges when you run the check_snmp plugin. For example, I guess the following might do what you want:

/usr/local/nagios/libexec/check_snmp -C COMUNIDAD -o ifOperStatus.10001 -r 1 -m RFC1213-MIB -H IP_HOST -c 0:1

I don't have Cisco equipment here though so I can't test it for sure.

See: http://nagiosplug.sourceforge.net/developer-guidelines.html#THRESHOLDFORMAT for information on how to specify the warning and critical threshold ranges. The syntax is a bit weird, but quite powerful, allowing you to be very specific about what ranges of values you will alert on.

For monitoring a specific interface, you might find the check_snmp_int plugin more useful.
You will find it at http://nagios.manubulon.com/ Whenever you run a plugin from the command line, be careful to run it as the same user which runs the Nagios daemon (usually the user 'nagios'). If you forget this then you will sometimes have problems with permissions when Nagios runs the plugin, and sometimes you will find the plugin creates a temporary file which then can't be written to (or overwritten) when you come to run the plugin under Nagios. I don't think this is too much of a problem with check_snmp, but it can be a real problem with check_snmp_int and various other plugins. hth, Jim ------------------------------------------------------------------------------ Lotusphere 2011 Register now for Lotusphere 2011 and learn how to connect the dots, take your collaborative environment to the next level, and enter the era of Social Business. http://p.sf.net/sfu/lotusphere-d2d _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mark.frost1 at pepsico.com Wed Dec 15 15:25:15 2010 From: mark.frost1 at pepsico.com (Frost, Mark {PBC}) Date: Wed, 15 Dec 2010 09:25:15 -0500 Subject: distributed nagios ? In-Reply-To: <4D088E75.2050308@op5.se> References: <20101213203459.GA13764@teddy.fas.com> <4D073D91.5010003@op5.se> <4D088E75.2050308@op5.se> Message-ID: > -----Original Message----- > From: Andreas Ericsson [mailto:ae at op5.se] > Sent: Wednesday, December 15, 2010 4:46 AM > > On 12/14/2010 08:39 PM, Frost, Mark {PBC} wrote: >> >> Hooray! >> >> Actually, I wanted to point out a few things I found when building the >> most recent version of merlin recently. At the heart of my issues >> is that our team is not allowed root access on these servers (long boring >> corporate story...) 
so I'm installing everything in an alternate tree. >> >> 1) There are a couple of hard-coded paths in ipc.c and node.c for >> the socket and the binlogs. I'm assuming that's intentional, but it >> does mean one has to manually edit the source files to point to different >> paths rather than specifying anything like that during the build process. >> > > The socket location can be configured. Binlogs cannot. I'll amend that in > the next release though. The core functionality is there, but there's no > option to set it in the config files, which is kinda stupid. "Binlogs cannot" meaning it can't be moved without modifying the code directly, right? Because that's what I did :-). >> 2) Because we're trying to put all the files into an alternate tree, the >> installation of 'mon' from install-merlin.sh didn't really work right. > Yes. The install-merlin.sh script is designed to be usable from the > rpm spec file, and it's meant to aid people who want to install > everything in its default location. Would $root_path/$bindir/mon > work for you? Since you can set $root_path to whatever you want, > I suppose it should. Yes, I believe that would work for me. I'm not setting $root_path at all. Thanks, Andreas. Mark ------------------------------------------------------------------------------ Lotusphere 2011 Register now for Lotusphere 2011 and learn how to connect the dots, take your collaborative environment to the next level, and enter the era of Social Business. http://p.sf.net/sfu/lotusphere-d2d _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Wed Dec 15 15:32:34 2010 From: ae at op5.se (Andreas Ericsson) Date: Wed, 15 Dec 2010 15:32:34 +0100 Subject: distributed nagios ? 
In-Reply-To: References: <20101213203459.GA13764@teddy.fas.com> <4D073D91.5010003@op5.se> <4D088E75.2050308@op5.se> Message-ID: <4D08D182.4010600@op5.se> On 12/15/2010 03:25 PM, Frost, Mark {PBC} wrote: >> -----Original Message----- >> From: Andreas Ericsson [mailto:ae at op5.se] >> Sent: Wednesday, December 15, 2010 4:46 AM >> >> On 12/14/2010 08:39 PM, Frost, Mark {PBC} wrote: >>> >>> Hooray! >>> >>> Actually, I wanted to point out a few things I found when building the >>> most recent version of merlin recently. At the heart of my issues >>> is that our team is not allowed root access on these servers (long boring >>> corporate story...) so I'm installing everything in an alternate tree. >>> >>> 1) There are a couple of hard-coded paths in ipc.c and node.c for >>> the socket and the binlogs. I'm assuming that's intentional, but it >>> does mean one has to manually edit the source files to point to different >>> paths rather than specifying anything like that during the build process. >>> >> >> The socket location can be configured. Binlogs cannot. I'll amend that in >> the next release though. The core functionality is there, but there's no >> option to set it in the config files, which is kinda stupid. > > "Binlogs cannot" meaning it can't be moved without modifying the code > directly, right? Because that's what I did :-). > That's correct. >>> 2) Because we're trying to put all the files into an alternate tree, the >>> installation of 'mon' from install-merlin.sh didn't really work right. > >> Yes. The install-merlin.sh script is designed to be usable from the >> rpm spec file, and it's meant to aid people who want to install >> everything in its default location. Would $root_path/$bindir/mon >> work for you? Since you can set $root_path to whatever you want, >> I suppose it should. > > Yes, I believe that would work for me. I'm not setting $root_path at all. > Sweet. I'll add something to that effect then. 
For us, it won't make a difference, but if it makes life easier for you, then that's just all the better.

--
Andreas Ericsson andreas.ericsson at op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.

From allanc at chickenandporn.com Wed Dec 15 15:32:58 2010
From: allanc at chickenandporn.com (Allan Clark)
Date: Wed, 15 Dec 2010 08:32:58 -0600
Subject: distributed nagios ?
In-Reply-To: <4D088E75.2050308@op5.se>
References: <20101213203459.GA13764@teddy.fas.com> <4D073D91.5010003@op5.se> <4D088E75.2050308@op5.se>
Message-ID:

On Wed, Dec 15, 2010 at 03:46, Andreas Ericsson wrote:
> On 12/14/2010 08:39 PM, Frost, Mark {PBC} wrote:
>> 2) Because we're trying to put all the files into an alternate tree, the
>> installation of 'mon' from install-merlin.sh didn't really work right. In
>> our case, it made a lot more sense to change
>>
>>       cp apps/mon.py $root_path/usr/bin/mon
>>
>> to
>>
>>       cp apps/mon.py $bindir/mon
>>
>> otherwise it would put 'mon' in a really weird spot.
>>
>> I'm guessing these are design decisions on your part, but in case they're
>> not, I thought I'd point them out.
>>
>
> Yes.
The install-merlin.sh script is designed to be usable from the
> rpm spec file, and it's meant to aid people who want to install
> everything in its default location. Would $root_path/$bindir/mon
> work for you? Since you can set $root_path to whatever you want,
> I suppose it should.

I thought the convention was $DESTDIR, which is also used by autotools, which is a fairly common tool in that domain. I'd strongly recommend marching to the same drummer when possible.

Allan
--
allanc at chickenandporn.com
http://linkedin.com/in/goldfish

From mark.frost1 at pepsico.com Wed Dec 15 15:36:22 2010
From: mark.frost1 at pepsico.com (Frost, Mark {PBC})
Date: Wed, 15 Dec 2010 09:36:22 -0500
Subject: converting distributed Nagios setup to Nagios+Merlin
Message-ID:

Our site currently uses a somewhat traditional distributed Nagios setup. I'm setting up merlin on some new Nagios servers and am looking at what configurations I'm going to want to change. As part of that, I realize that there are some Nagios config directives that I wanted some clarification on before I started changing things. I haven't seen these documented elsewhere (at least not that I could find).

I was looking for clarification on the following:

1) Obsessive (ocsp/ochp) configuration directives get turned off. Merlin does all that.
Plus ocsp/ochp is deemed detrimental to performance, making that another reason to turn it off.

2) Freshness checking. Nagios would probably still try to do this if I left it in, but there's no point since Merlin will also do this.

3) Passive/Active checks. If I understand things correctly, under Merlin everything is an active check. Or rather, anything that Nagios is supposed to run on some host or another is an active check. Things that are truly sent via NSCA from some monitored host out there would still be passive, but otherwise everything's configured to run actively and Merlin takes care of where it runs.

4) In a load balanced/redundant configuration (such as 'yoda' and 'obi1' in the HOWTO doc), which of 'yoda' or 'obi1' sends out notifications? Or do they both send them out but Merlin somehow only has one of them send it? I'm guessing that this is handled in the more traditional way where notifications are enabled on, say, 'yoda' but disabled on 'obi1'. If 'yoda' crashes, you manually enable the alerts via the command file on 'obi1'? It would of course be super-cool if Merlin handled all that :-).

5) Other parameters such as
   process_perf_data - still probably only on the master(s), but that's really up to how crazy we'd want to be.
   event handler settings - unchanged by this configuration
   retain status information - unchanged by this configuration

Thanks

Mark
From ae at op5.se Wed Dec 15 15:44:54 2010
From: ae at op5.se (Andreas Ericsson)
Date: Wed, 15 Dec 2010 15:44:54 +0100
Subject: distributed nagios ?
In-Reply-To:
References: <20101213203459.GA13764@teddy.fas.com> <4D073D91.5010003@op5.se> <4D088E75.2050308@op5.se>
Message-ID: <4D08D466.9010903@op5.se>

On 12/15/2010 03:32 PM, Allan Clark wrote:
> On Wed, Dec 15, 2010 at 03:46, Andreas Ericsson wrote:
>> On 12/14/2010 08:39 PM, Frost, Mark {PBC} wrote:
>>> 2) Because we're trying to put all the files into an alternate tree, the
>>> installation of 'mon' from install-merlin.sh didn't really work right. In
>>> our case, it made a lot more sense to change
>>>
>>> cp apps/mon.py $root_path/usr/bin/mon
>>>
>>> to
>>>
>>> cp apps/mon.py $bindir/mon
>>>
>>> otherwise it would put 'mon' in a really weird spot.
>>>
>>> I'm guessing these are design decisions on your part, but in case they're
>>> not, I thought I'd point them out.
>>>
>>
>> Yes. The install-merlin.sh script is designed to be usable from the
>> rpm spec file, and it's meant to aid people who want to install
>> everything in its default location. Would $root_path/$bindir/mon
>> work for you? Since you can set $root_path to whatever you want,
>> I suppose it should.
>
> I thought the convention was $DESTDIR, which is also used by
> autotools, which is a fairly common tool in that domain. I'd strongly
> recommend marching to the same drummer when possible.
>

It is when you're using make, but internally the install-merlin script uses $root_path.
So "make install DESTDIR=/some/random/path" will install everything to the root path /some/random/path. The autotools default path is called --dest-dir, which is what install-merlin.sh uses, even though I detest the autotools with an unsurpassed passion.

--
Andreas Ericsson andreas.ericsson at op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.

From Bret.Goodfellow at questar.com Wed Dec 15 18:28:41 2010
From: Bret.Goodfellow at questar.com (Bret Goodfellow)
Date: Wed, 15 Dec 2010 17:28:41 +0000
Subject: check_logs.pl doesn't return output on RHEL 6 - RESOLVED
In-Reply-To: <534585C15C30B048B7A3A335F9B6C95C0BAD49ED@SLCEXMB02.corp.questar.com>
References: <534585C15C30B048B7A3A335F9B6C95C0BAD1EE1@SLCEXMB02.corp.questar.com> <940b39dba85aeeb5ba6195b43cce98a2@vps517.directvps.nl> <534585C15C30B048B7A3A335F9B6C95C0BAD268C@SLCEXMB02.corp.questar.com> <9f31bb826ec096d02969a9abc69ae229@vps517.directvps.nl> <534585C15C30B048B7A3A335F9B6C95C0BAD49ED@SLCEXMB02.corp.questar.com>
Message-ID: <534585C15C30B048B7A3A335F9B6C95C0BAD4F57@SLCEXMB02.corp.questar.com>

Temporary SOLUTION!

FYI - when I installed Nagios on RHEL6 I brought down nagios-3.2.3-3.el6.rf.x86_64.rpm.
This package included the check_logs.pl (1-2-2007). I replaced check_logs.pl (1-2-2007) with a backlevel version check_logs.pl (6-7-2005). This version works on RHEL6! I am getting output now [messages ==> OK]. At this point in time I'm just going to use the older version of check_logs.pl (6-7-2005).

Thanks to all that responded.

Bret Goodfellow
Questar Gas
S.L.C., UT

From: Bret Goodfellow [mailto:Bret.Goodfellow at questar.com]
Sent: Tuesday, December 14, 2010 2:57 PM
To: Nagios Users List
Subject: Re: [Nagios-users] check_logs.pl doesn't return output on RHEL 6

Yes, my servers are x64. The perl release on RHEL4 is v5.8.5, and the perl release on RHEL5 is v5.8.8, and the perl release on RHEL6 is v5.10.1.

From: Hugo van der Kooij [mailto:hvdkooij at vanderkooij.org]
Sent: Tuesday, December 14, 2010 8:47 AM
To: Nagios Users List
Subject: Re: [Nagios-users] check_logs.pl doesn't return output on RHEL 6

On Tue, 14 Dec 2010 15:12:24 +0000, Bret Goodfellow wrote:

[root at server ~]# cd /usr/lib64/nagios/plugins/
[root at server plugins]# ./check_logs.pl -c /etc/nagios/check_logs_linux.cfg
[root at server plugins]#

I strongly suspect that this is a perl issue as RHEL 6 is using a newer perl version that is not entirely identical or compatible to the one used by RHEL 4 and RHEL 5.

Are you sure the other servers are x64 as well BTW?

Hugo.

--
hvdkooij at vanderkooij.org http://hugo.vanderkooij.org/
PGP/GPG? Use: http://hugo.vanderkooij.org/0x58F19981.asc
From MarkL at lmfj.com Wed Dec 15 18:49:04 2010
From: MarkL at lmfj.com (Mark A. Lappin)
Date: Wed, 15 Dec 2010 11:49:04 -0600
Subject: Make a Host Group not visible on web interface?
Message-ID: <0227B653B3DC82438B8291BC5218612F6737768389@lmfjex07.lmfj.com>

Good Morning All -

Is there any way to define a host group, assign hosts to that host group, but not have the membership or the host group show up under "Host Groups" on the web interface?

I have several things which run like services and several that don't. For the purposes of defining service checks, I would much rather set the services up to look at host groups instead of hosts, but I'm ending up with so many host groups that the host groups become almost meaningless in the web interface.

Example - I'm watching about 65 printers with Nagios. Different makes, different models. Each printer has any number of physical trays, but when it comes to monitoring them, they are all a bit different. One of the things I need to monitor is paper tray status. Some printers count manual feed tray as tray 1, some as tray 2, some as tray 3. Some have a tray 2 and tray 3 but not tray 1. So on my service checks, I'm applying this to all my network printers and then on the different tray checks, exclude certain printers. A little difficult to keep track of. I would like to put the printers in some host groups such as Tray1, Tray2, Tray3, etc, and set services to check those host groups rather than keep excluding hosts on the services. But host groups Tray1, Tray2, Tray3 are meaningless via the web interface so I would like them not to show.
Nagios Core - 3.2.3

ML

Mark A. Lappin, CCNA, MCITP: Enterprise Administrator | Lee Michaels Fine Jewelry
Director of Information Technology
11314 Cloverland Ave | Baton Rouge, LA 70809
Ph: 225.291.9094 ext 245 | Fax: 225.368.3675 | Mobile: 225-362-2770
www.lmfj.com

This communication is privileged and confidential. If you are not the intended recipient, please notify the sender by reply e-mail and destroy all copies of this communication.

From stanb at panix.com Wed Dec 15 21:44:43 2010
From: stanb at panix.com (stan)
Date: Wed, 15 Dec 2010 15:44:43 -0500
Subject: Help with check_smb error message
Message-ID: <20101215204443.GA14147@teddy.fas.com>

Let me start off by saying Windows is something I do not know much about. I am setting up a new Nagios instance to monitor a mainly Windows subnet, and I am trying to use the check_disk_smb plugin. I have it working against a Samba share on the Linux Nagios host, and I know the machine name and share name on the Windows machine, but when I run the plugin by hand, I get:

root at pm2v40:/etc/nagios3/conf.d# /usr/lib/nagios/plugins/check_disk_smb -H aw201b -s ia
Result from smbclient not suitable

Any idea what I am doing wrong here?

--
A: Because it messes up the order in which people normally read text.
Q: Why is top-posting such a bad thing?
A: Top-posting.
Q: What is the most annoying thing in e-mail?
From olourkin-nagios at yahoo.com Wed Dec 15 21:48:34 2010
From: olourkin-nagios at yahoo.com (olourkin-nagios at yahoo.com)
Date: Wed, 15 Dec 2010 12:48:34 -0800 (PST)
Subject: Make a Host Group not visible on web interface?
In-Reply-To: <0227B653B3DC82438B8291BC5218612F6737768389@lmfjex07.lmfj.com>
References: <0227B653B3DC82438B8291BC5218612F6737768389@lmfjex07.lmfj.com>
Message-ID: <342793.23413.qm@web30801.mail.mud.yahoo.com>

If you add "register 0" to your hostgroup definition it won't show up on the web interface, but you can still assign services to it as per normal. Very useful technique.

Cheers,
Erik Larkin

----- Original Message ----
From: Mark A. Lappin
To: Nagios Users List
Sent: Wed, December 15, 2010 9:49:04 AM
Subject: [Nagios-users] Make a Host Group not visible on web interface?

Good Morning All -

Is there anyway to define a host group, assign hosts to that host group but not have the membership or the host group show up on under "Host Groups" on the web interface?

I have several things which run like services and several that don't, for the purposes of defining service checks, I would much rather set the services up to look at host groups instead of hosts but I'm ending up with so many host groups that the host groups become almost meaningless in the web interface.

Example - I'm watching about 65 printers with Nagios.
Different makes, different models. Each printer has any number of physical trays but monitoring them, they are all a bit different. One of the things I need to monitor is paper tray status. Some printers count manual feed tray as tray 1, some as tray 2, some as tray 3. Some have a tray 2 and tray 3 but not tray 1. So on my service checks, I'm applying this to all my network printers and then on the different tray checks, exclude certain printers. A little difficult to keep track of. I would like to put the printers in some host groups such as Tray1, Tray2, Tray3, etc, set services to check those host groups rather than keep excluding hosts on the services. But host groups Tray1, Tray2, Tray3 are meaningless via the web interface so I would like them not to show.

Nagios Core - 3.2.3

ML

Mark A. Lappin, CCNA, MCITP: Enterprise Administrator | Lee Michaels Fine Jewelry
Director of Information Technology
11314 Cloverland Ave | Baton Rouge, LA 70809
Ph: 225.291.9094 ext 245 | Fax: 225.368.3675 | Mobile: 225-362-2770
www.lmfj.com

This communication is privileged and confidential. If you are not the intended recipient, please notify the sender by reply e-mail and destroy all copies of this communication.
From MarkL at lmfj.com Wed Dec 15 21:50:56 2010
From: MarkL at lmfj.com (Mark A. Lappin)
Date: Wed, 15 Dec 2010 14:50:56 -0600
Subject: Make a Host Group not visible on web interface?
In-Reply-To: <342793.23413.qm@web30801.mail.mud.yahoo.com>
References: <0227B653B3DC82438B8291BC5218612F6737768389@lmfjex07.lmfj.com> <342793.23413.qm@web30801.mail.mud.yahoo.com>
Message-ID: <0227B653B3DC82438B8291BC5218612F67377683D4@lmfjex07.lmfj.com>

>> Is there anyway to define a host group, assign hosts to that
>> host group but not have the membership or the host group show
>> up on under "Host Groups" on the web interface?

> add "register 0"
> to your hostgroup definition it won't show up on the web interface

Erik -

Great, thank you. I will give this a shot in a little while.

Mark

Mark A. Lappin, CCNA, MCITP: Enterprise Administrator | Lee Michaels Fine Jewelry
Director of Information Technology
11314 Cloverland Ave | Baton Rouge, LA 70809
Ph: 225.291.9094 ext 245 | Fax: 225.368.3675 | Mobile: 225-362-2770
www.lmfj.com

This communication is privileged and confidential. If you are not the intended recipient, please notify the sender by reply e-mail and destroy all copies of this communication.
From nagios at flatto.net Wed Dec 15 22:35:35 2010
From: nagios at flatto.net (Assaf Flatto)
Date: Wed, 15 Dec 2010 21:35:35 +0000
Subject: Help with check_smb error message
In-Reply-To: <20101215204443.GA14147@teddy.fas.com>
References: <20101215204443.GA14147@teddy.fas.com>
Message-ID: <4D0934A7.9010400@flatto.net>

stan wrote:
> Let me start off by saying Windows is something I do not know much about. I
> am setting up a new Nagios instance to monitor a mainly Windows subnet, and
> I am trying to use the check_disk_smb plugin. I have it working against a
> Samba share on the Linux Nagios host, and I know the machine name and share
> name on the Windows machine, but when I run the plugin by hand, I get:
>
> root at pm2v40:/etc/nagios3/conf.d# /usr/lib/nagios/plugins/check_disk_smb -H aw201b -s ia
> Result from smbclient not suitable
>
> Any idea what I am doing wrong here?
>
>

If you want to monitor Windows boxes, you may want to use the NSClient++ addon, which is an "NRPE for Windows" so to speak.

http://sourceforge.net/projects/nscplus/

Once you install that, connecting to and monitoring the Windows box filesystems becomes much easier.
Assaf

From stanb at panix.com Wed Dec 15 23:07:52 2010
From: stanb at panix.com (stan)
Date: Wed, 15 Dec 2010 17:07:52 -0500
Subject: execute_service_checks and distributed monitoring
Message-ID: <20101215220752.GA16319@teddy.fas.com>

We are trying to set up our first "child" Nagios instance. I have it working for checking some things in the child instance. We have configured the existing "parent" with some of the new checks. In reading the documentation, it is stated that we need to set execute_service_checks to 0 for the services that are actually monitored by the "child". My problem is that I do not understand where to put this directive to override the default, which is located in nagios.cfg and must be 1, as we are continuing to monitor all the existing services from it. I have tried several places, but they all seem to give me a syntax error when starting Nagios.

If it helps, the configuration for the "parent" was initially created using nagiosql.

Thanks.

--
A: Because it messes up the order in which people normally read text.
Q: Why is top-posting such a bad thing?
A: Top-posting.
Q: What is the most annoying thing in e-mail?
From nagios at flatto.net Wed Dec 15 23:20:05 2010
From: nagios at flatto.net (Assaf Flatto)
Date: Wed, 15 Dec 2010 22:20:05 +0000
Subject: execute_service_checks and distributed monitoring
In-Reply-To: <20101215220752.GA16319@teddy.fas.com>
References: <20101215220752.GA16319@teddy.fas.com>
Message-ID: <4D093F15.9020601@flatto.net>

stan wrote:
> We are trying to set up our first "child" Nagios instance. I have it
> working for checking some things in the child instance. We have configured
> the existing "parent" with some of the new checks. In reading the
> documentation, it is stated that we need to set execute_service_checks to 0
> for the services that are actually monitored by the "child". My problem is
> that I do not understand where to put this directive to override the
> default, which is located in nagios.cfg and must be 1, as we are continuing
> to monitor all the existing services from it. I have tried several places,
> but they all seem to give me a syntax error when starting Nagios.
>
> If it helps, the configuration for the "parent" was initially created using
> nagiosql.
>
> Thanks.
> > the definition is in the specific service definition on the Parent nagios http://nagios.sourceforge.net/docs/3_0/objectdefinitions.html#service That way the service is "passive" and will not be executed , but receive the status from the child nagios via the NSCA transfers. Assaf ------------------------------------------------------------------------------ Lotusphere 2011 Register now for Lotusphere 2011 and learn how to connect the dots, take your collaborative environment to the next level, and enter the era of Social Business. http://p.sf.net/sfu/lotusphere-d2d _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From bruce.edge at gmail.com Thu Dec 16 02:05:49 2010 From: bruce.edge at gmail.com (Bruce Edge) Date: Wed, 15 Dec 2010 17:05:49 -0800 Subject: CPU monitor for a single Linux user space process ? Message-ID: Rookie question here. Trying to determine nagios suitability for an embedded app. Can I monitor the CPU utilization for a single user space process on a Linux box with nagios? And, can I define an action if it exceeds a threshold? Thanks -Bruce ------------------------------------------------------------------------------ Lotusphere 2011 Register now for Lotusphere 2011 and learn how to connect the dots, take your collaborative environment to the next level, and enter the era of Social Business. http://p.sf.net/sfu/lotusphere-d2d _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
From mark.frost1 at pepsico.com Thu Dec 16 02:34:43 2010
From: mark.frost1 at pepsico.com (Frost, Mark {PBC})
Date: Wed, 15 Dec 2010 20:34:43 -0500
Subject: CPU monitor for a single Linux user space process ?
In-Reply-To:
References:
Message-ID:

> -----Original Message-----
> From: Bruce Edge [mailto:bruce.edge at gmail.com]
> Sent: Wednesday, December 15, 2010 8:06 PM
>
> Rookie question here. Trying to determine nagios suitability for an
> embedded app.
>
> Can I monitor the CPU utilization for a single user space process on a
> Linux box with nagios?
> And, can I define an action if it exceeds a threshold?
>
> Thanks
>
> -Bruce

Bruce,

I'm not sure that there's an existing check plugin that would do this (there might be). I can say that "yes" you can do this, it's just a question of what you're willing to do.

If I were to do this for our environment, I'd write a perl script that used the 'ps' command to look at the process and pull the 'pcpu' field (% cpu -- see the 'ps' man page) for that process. I'd also use the Nagios::Plugin perl module to make the Nagios side easier and probably report the actual pcpu value as performance data suitable for graphing.

You could then configure an event handler on that service check. That's essentially another script that gets called when the state changes on the check. This means it gets called any time the state changes, including when it goes to an "OK" state, so you need to have the script detect when it's called and potentially exit if it hasn't gone into a hard critical state (depending on what you want, actually). You can read up on event handlers in the Nagios documentation.

Mark
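The check Mark describes could be sketched roughly as follows. This is only an illustration, written in Python rather than Perl and without Nagios::Plugin; the function names, the PROC_CPU label and the argument order (pid, warning threshold, critical threshold) are assumptions made for the example, not part of any existing plugin. It shells out to `ps -o pcpu= -p PID` and reports the value as performance data. Note that on Linux `ps` computes pcpu as CPU time divided by how long the process has been running, so it is a lifetime average rather than an instantaneous reading.

```python
import subprocess

# Standard Nagios plugin exit codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3


def read_pcpu(pid):
    """Return the %CPU value that ps reports for one PID, as a float."""
    result = subprocess.run(
        ["ps", "-o", "pcpu=", "-p", str(pid)],
        capture_output=True, text=True, check=True,
    )
    return float(result.stdout.strip())


def evaluate(pcpu, warn, crit):
    """Map a %CPU reading onto an exit code and a state label."""
    if pcpu >= crit:
        return CRITICAL, "CRITICAL"
    if pcpu >= warn:
        return WARNING, "WARNING"
    return OK, "OK"


def format_output(pid, pcpu, warn, crit):
    """Build the plugin output line; the part after '|' is perfdata."""
    code, label = evaluate(pcpu, warn, crit)
    line = (f"PROC_CPU {label} - pid {pid} at {pcpu:.1f}% "
            f"| pcpu={pcpu:.1f}%;{warn};{crit}")
    return code, line


def main(argv):
    """argv is [pid, warn, crit]; returns the exit code for Nagios."""
    pid, warn, crit = int(argv[0]), float(argv[1]), float(argv[2])
    try:
        pcpu = read_pcpu(pid)
    except (subprocess.CalledProcessError, ValueError, OSError):
        # ps failed (for example, no such process) or printed nothing usable.
        print(f"PROC_CPU UNKNOWN - could not read %CPU for pid {pid}")
        return UNKNOWN
    code, line = format_output(pid, pcpu, warn, crit)
    print(line)
    return code

# As a plugin script, finish with: sys.exit(main(sys.argv[1:]))
```

Looking the process up by PID keeps the sketch short; a real check would more likely find the process by name or a pid file, since PIDs change across restarts.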
From stanb at panix.com Thu Dec 16 03:04:56 2010
From: stanb at panix.com (stan)
Date: Wed, 15 Dec 2010 21:04:56 -0500
Subject: Help with check_smb error message
In-Reply-To: <4D0934A7.9010400@flatto.net>
References: <20101215204443.GA14147@teddy.fas.com> <4D0934A7.9010400@flatto.net>
Message-ID: <20101216020456.GA21286@teddy.fas.com>

On Wed, Dec 15, 2010 at 09:35:35PM +0000, Assaf Flatto wrote:
> stan wrote:
> > Let me start off by saying Windows is something I do not know much about. I
> > am setting up a new Nagios instance to monitor a mainly Windows subnet, and
> > I am trying to use the check_disk_smb plugin. I have it working against a
> > Samba share on the Linux Nagios host, and I know the machine name and share
> > name on the Windows machine, but when I run the plugin by hand, I get:
> >
> > root at pm2v40:/etc/nagios3/conf.d# /usr/lib/nagios/plugins/check_disk_smb -H aw201b -s ia
> > Result from smbclient not suitable
> >
> > Any idea what I am doing wrong here?
> >
> >
>
> If you want to monitor windows boxes , you may want to use the
> nsclient++ addon - which is a " NRPE for windows" so to speak .
>
> http://sourceforge.net/projects/nscplus/
>
> Once you install that - connecting and monitoring the windows box
> filesystems becomes much easier .
>

Thanks, I am using this in other locations, but politically, I cannot install _anything_ on these boxes, so I am forced to use external tools.

--
A: Because it messes up the order in which people normally read text.
Q: Why is top-posting such a bad thing?
A: Top-posting.
Q: What is the most annoying thing in e-mail?
From mike-nagios at 5dninja.net Thu Dec 16 06:42:51 2010
From: mike-nagios at 5dninja.net (Mike Lindsey)
Date: Wed, 15 Dec 2010 21:42:51 -0800
Subject: CPU monitor for a single Linux user space process ?
In-Reply-To:
References:
Message-ID: <4D09A6DB.1080306@5dninja.net>

On 12/15/10 5:05 PM, Bruce Edge wrote:
> Rookie question here. Trying to determine nagios suitability for an
> embedded app.
>
> Can I monitor the CPU utilization for a single user space process on a
> Linux box with nagios?
> And, can I define an action if it exceeds a threshold?

Sounds like you need check_snmp_process.pl from here:
http://nagios.manubulon.com/snmp_process.html

I've been using it, it works quite well. It requires snmpd, but is basically the swiss army knife of user-space process monitoring.

To "define an action" you need to set up an event handler.
http://nagios.sourceforge.net/docs/3_0/eventhandlers.html

-- 
Mike Lindsey
From hvdkooij at vanderkooij.org Thu Dec 16 13:30:41 2010
From: hvdkooij at vanderkooij.org (Hugo van der Kooij)
Date: Thu, 16 Dec 2010 13:30:41 +0100
Subject: Help with check_smb error message
In-Reply-To: <20101215204443.GA14147@teddy.fas.com>
References: <20101215204443.GA14147@teddy.fas.com>
Message-ID: <484f004590534b9e1220c9350f8a66cb@vps517.directvps.nl>

On Wed, 15 Dec 2010 15:44:43 -0500, stan wrote:
>
> root at pm2v40:/etc/nagios3/conf.d#
> /usr/lib/nagios/plugins/check_disk_smb -H
> aw201b -s ia
> Result from smbclient not suitable
>
> Any idea, what I am doing wrong here?

What happens if you run smbclient by hand against your localhost and against a Windows machine, with identical parameters except the address?

My guess is that they use different identification schemes, so your smbclient is not able to log in to the Windows domain.

Hugo.

-- 
hvdkooij at vanderkooij.org http://hugo.vanderkooij.org/
PGP/GPG? Use: http://hugo.vanderkooij.org/0x58F19981.asc
From mariog at absi.be Thu Dec 16 14:02:29 2010
From: mariog at absi.be (Mario Garcia Ortiz)
Date: Thu, 16 Dec 2010 14:02:29 +0100
Subject: Fwd: check_disk_smb invalid share error on new version of nagiosp
Message-ID: <4D0A0DE5.60304@absi.be>

nagios-users at lists.sourceforge.net

Hello all,

I have upgraded nagios plugins to the latest version, 1.4.15, and the check of an SMB share doesn't work anymore. I use exactly the same commands as with check_disk_smb v1247 (nagios-plugins 1.4.12).

Is there a difference in the way the checks should be made? The help output (check_disk_smb -H) is basically identical in both versions. The only error I get is 'Invalid Share'. How could I debug this? I can of course copy and paste the old Perl script, but that's not the point of upgrading...

Here's the command used:

/check_disk_smb -H "hostname" -s "Bookings" -u users -p passwd
Domain=[DUMMY] OS=[Windows Server 200x 3790 Service Pack X] Server=[Windows Server 200x 5.x]
Disk ok - 8.16G (54%) free on "\\\\hostame\\Bookings"

In the new version I do exactly the same and I get:

Invalid Share: Bookings

kind regards
Mario G.
From hvdkooij at vanderkooij.org Thu Dec 16 14:39:58 2010
From: hvdkooij at vanderkooij.org (Hugo van der Kooij)
Date: Thu, 16 Dec 2010 14:39:58 +0100
Subject: Fwd: check_disk_smb invalid share error on new version of nagiosp
In-Reply-To: <4D0A0DE5.60304@absi.be>
References: <4D0A0DE5.60304@absi.be>
Message-ID: <07e247e5bff6971afa7a2cf864d19b9b@vps517.directvps.nl>

On Thu, 16 Dec 2010 14:02:29 +0100, Mario Garcia Ortiz wrote:
> I have upgraded nagios plugins to latest version 1.4.15
> the check of a smb share doesn't work anymore, i use exactly the same
> commands as in check_disk_smb v1247 (nagios-plugins 1.4.12)

Assuming you have both versions at hand, I would say that a diff of the two scripts might shed some light on this.

Hugo.

-- 
hvdkooij at vanderkooij.org http://hugo.vanderkooij.org/
PGP/GPG? Use: http://hugo.vanderkooij.org/0x58F19981.asc

From ej_seg at hotmail.com Thu Dec 16 14:57:38 2010
From: ej_seg at hotmail.com (Rikard Dahlberg)
Date: Thu, 16 Dec 2010 13:57:38 +0000
Subject: NSCA hangs?
Message-ID:

Heya guys, I once more need your help :)

It seems that my NSCA works absolutely perfectly for 2-3 days, with 17 hosts monitored, all sending passive checks at a 10 second interval.
(The problem may be here; I just want a second opinion.)

Anyway, it works really well for about 2-3 days, but then it appears to shut down. This is the error message I get from NSClient++:

2010-12-12 13:54:44: error:modules\NSCAAgent\NSCAThread.cpp:275: <<< Could not connect to: (keeping IP a secret) 10060: A connection attempt failed because the connected party did not properly respond after a period of time, or established connection failed because connected host has failed to respond.

In the syslog file I can only see that a packet was dropped because it was 32 seconds old. I can't get a real printout, because the syslog file is too large to work with and it hangs my PuTTY session. :)

Any ideas?

From mariog at absi.be Thu Dec 16 15:21:25 2010
From: mariog at absi.be (Mario Garcia Ortiz)
Date: Thu, 16 Dec 2010 15:21:25 +0100
Subject: different contact groups depending on time of day
Message-ID: <4D0A2065.4030101@absi.be>

Hello list,

Is it possible to send notifications to a certain contact group depending on the time? What I mean is: send notifications (SMS) to certain people during working hours, and to other people outside working hours and on weekends.

Thank you

-- 
Mario GARCIA ORTIZ
System Engineer
Neerstalsestwg. 42 chée.
de Neerstalle
B-1190 Brussels
Tel.: +32(0)2 333 40 00
mariog at absi.be
http://www.absi.be

The information contained in or attached to this email is confidential and may be privileged. If you have received it by mistake, please notify the sender by return e-mail and delete it from your system.

From terjet at funcom.com Thu Dec 16 15:21:39 2010
From: terjet at funcom.com (Terje Trane)
Date: Thu, 16 Dec 2010 15:21:39 +0100
Subject: Check that processes run as more than one user
Message-ID: <4D0A2073.70000@funcom.com>

I need to check on a Windows server that a process is running, but for two users. (So two instances should run, one as user1 and one as user2.)

I'm running nc_net on the server (which is running Windows Server 2008 R2) and can test for the process running using something like:

check_command check_nt!PROCSTATE -l myprogram.exe

This will say OK as long as at least one of the processes is running, but I need a warning/critical unless both of them are running.

The -l parameter can take more than one process name, so I tried doubling it (and even tripling it), but that seemed to work just as if it were only one.

check_command check_nt!PROCSTATE -l myprogram.exe,myprogram.exe

I cannot find anything in the docs about specifying a user name. Any ideas of how I can accomplish what I need?
From tktucker at gmail.com Thu Dec 16 15:47:26 2010
From: tktucker at gmail.com (Tom Tucker)
Date: Thu, 16 Dec 2010 09:47:26 -0500
Subject: Setting Scheduled Downtime Externally
Message-ID:

Is it possible to set the "schedule downtime" value for hosts externally (with curl, for example)? I found some 2007 posts on this same topic, but the PHP script that the poster wrote to support this is no longer accessible.

Thank you for your time and assistance.
From stanb at panix.com Thu Dec 16 16:11:40 2010
From: stanb at panix.com (stan)
Date: Thu, 16 Dec 2010 10:11:40 -0500
Subject: $OUTPUT$
Message-ID: <20101216151140.GA5489@teddy.fas.com>

We are working on our first Nagios parent/child system. As I understand it, on the child I need to invoke something that looks like:

/usr/share/nagios/sbin/submit_check_result $HOSTNAME$ '$SERVICEDESC$' $SERVICESTATE$ '$OUTPUT$'

That script then does something like this:

/usr/bin/printf "%s\t%s\t%s\t%s\n" "$1" "$2" "$return_code" "$4" | /usr/sbin/send_nsca pnoc -c /etc/send_nsca.cfg

My current issue is that $OUTPUT$ does not seem to be getting expanded. Instead, it is literally showing up on the Nagios runtime web screen as $OUTPUT$.

Any thoughts as to what we may be doing wrong here?

-- 
A: Because it messes up the order in which people normally read text.
Q: Why is top-posting such a bad thing?
A: Top-posting.
Q: What is the most annoying thing in e-mail?

From work at paul.dubuc.org Thu Dec 16 16:57:18 2010
From: work at paul.dubuc.org (Paul M.
Dubuc)
Date: Thu, 16 Dec 2010 10:57:18 -0500
Subject: different contact groups depending on time of day
In-Reply-To: <4D0A2065.4030101@absi.be>
References: <4D0A2065.4030101@absi.be>
Message-ID: <4D0A36DE.3070804@paul.dubuc.org>

Mario Garcia Ortiz wrote:
> Hello list,
> is it possible to send notification to a certain contact group depending
> on the time,
>
> what i mean, send notification (sms) to certain people between working
> hours and to other people outside working hours and weekends.
>
> thank you
>

Yes. Define contact objects with different host_notification_period and service_notification_period specifications.

http://nagios.sourceforge.net/docs/3_0/objectdefinitions.html#contact

From hvdkooij at vanderkooij.org Thu Dec 16 17:06:47 2010
From: hvdkooij at vanderkooij.org (Hugo van der Kooij)
Date: Thu, 16 Dec 2010 17:06:47 +0100
Subject: NSCA hangs?
In-Reply-To:
References:
Message-ID: <0191e1c78c8972cbf1fee41d9d0de907@vps517.directvps.nl>

On Thu, 16 Dec 2010 13:57:38 +0000, Rikard Dahlberg wrote:

It seems that my NSCA works absolutely perfectly for 2-3 days, with 17 hosts monitored, all sending passive checks at a 10 second interval. (The problem may be here, I just want a second opinion) Anyway, it works really well for about 2-3 days but then it appears to shut down.
This is the errormsg i get at the NSclient++: 2010-12-12 13:54:44: error:modules\NSCAAgent\NSCAThread.cpp:275:

From mike at summersault.com Thu Dec 16 17:09:59 2010
From: mike at summersault.com (Mike Neimoyer)
Date: Thu, 16 Dec 2010 11:09:59 -0500
Subject: different contact groups depending on time of day
In-Reply-To: <4D0A36DE.3070804@paul.dubuc.org>
References: <4D0A2065.4030101@absi.be> <4D0A36DE.3070804@paul.dubuc.org>
Message-ID: <4D0A39D7.7020309@summersault.com>

On 12/16/2010 10:57 AM, Paul M. Dubuc wrote:
>
> Mario Garcia Ortiz wrote:
>> what i mean, send notification (sms) to certain people between working
>> hours and to other people outside working hours and weekends.
>
> Yes. Define contact objects with different host_notification_period and
> service_notification_period specifications.
>
> http://nagios.sourceforge.net/docs/3_0/objectdefinitions.html#contact

As an addendum, also be sure to define your timeperiods (working_hours, weekends, after_hours, etc.), so that the notification_period entries in your contact objects refer to timeperiods that actually exist.
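
A minimal sketch of what such definitions might look like in Nagios 3 object configuration. The timeperiod names, hours, and contact names below are illustrative assumptions, not taken from this thread, and the contact definitions leave out the other directives a real contact needs (notification commands, notification options, pager or email address):

```
# Hypothetical timeperiods; adjust names and hours to your site.
define timeperiod{
    timeperiod_name working_hours
    alias           Working hours
    monday          09:00-17:00
    tuesday         09:00-17:00
    wednesday       09:00-17:00
    thursday        09:00-17:00
    friday          09:00-17:00
    }

define timeperiod{
    timeperiod_name after_hours
    alias           Nights and weekends
    monday          00:00-09:00,17:00-24:00
    tuesday         00:00-09:00,17:00-24:00
    wednesday       00:00-09:00,17:00-24:00
    thursday        00:00-09:00,17:00-24:00
    friday          00:00-09:00,17:00-24:00
    saturday        00:00-24:00
    sunday          00:00-24:00
    }

# Hypothetical contacts; remaining required directives omitted.
define contact{
    contact_name                 day_oncall
    host_notification_period     working_hours
    service_notification_period  working_hours
    }

define contact{
    contact_name                 night_oncall
    host_notification_period     after_hours
    service_notification_period  after_hours
    }
```

With both contacts in the contact group assigned to the host or service, Nagios should then filter notifications per contact according to these periods.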
http://nagios.sourceforge.net/docs/3_0/timeperiods.html

From ae at op5.se Thu Dec 16 17:21:43 2010
From: ae at op5.se (Andreas Ericsson)
Date: Thu, 16 Dec 2010 17:21:43 +0100
Subject: $OUTPUT$
In-Reply-To: <20101216151140.GA5489@teddy.fas.com>
References: <20101216151140.GA5489@teddy.fas.com>
Message-ID: <4D0A3C97.9010200@op5.se>

On 12/16/2010 04:11 PM, stan wrote:
> We are working on our first Nagios parent/child system. As I understand
> it, on the child I need to invoke something that looks like:
>
> /usr/share/nagios/sbin/submit_check_result $HOSTNAME$ '$SERVICEDESC$'
> $SERVICESTATE$ '$OUTPUT$'
>
> That script then does something like this:
>
> /usr/bin/printf "%s\t%s\t%s\t%s\n" "$1" "$2" "$return_code" "$4" |
> /usr/sbin/send_nsca pnoc -c /etc/send_nsca.cfg
>
> My current issue is that $OUTPUT$ does not seem to be getting expanded.
> Instead it is literally showing up on the Nagios run time web screen as
> $OUTPUT$.
>

Check the docs for what macros are available for obsessive host and service check commands.

Or use merlin. It was designed for things like this.

-- 
Andreas Ericsson andreas.ericsson at op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace.
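
For the submit_check_result approach quoted in this thread, here is a minimal shell sketch of such a wrapper. The function names state_to_code and format_result are made up for illustration; the send_nsca path, config file, and central host name (pnoc) are the ones quoted above. As far as the macros go, the Nagios 3 macro list names the plugin output macros $HOSTOUTPUT$ and $SERVICEOUTPUT$, so the fourth argument would presumably be '$SERVICEOUTPUT$' for a service check.

```shell
#!/bin/sh
# Sketch of a submit_check_result-style helper (function names are illustrative).

# Map a Nagios service state string to the numeric return code
# used in passive check results.
state_to_code() {
    case "$1" in
        OK)       echo 0 ;;
        WARNING)  echo 1 ;;
        CRITICAL) echo 2 ;;
        *)        echo 3 ;;   # UNKNOWN, or anything unexpected
    esac
}

# Build the tab-separated line that send_nsca reads on stdin:
# host <TAB> service description <TAB> return code <TAB> plugin output
format_result() {
    printf '%s\t%s\t%s\t%s\n' "$1" "$2" "$(state_to_code "$3")" "$4"
}

# Typical use inside the wrapper script, assuming the paths quoted in the thread:
#   format_result "$1" "$2" "$3" "$4" | /usr/sbin/send_nsca pnoc -c /etc/send_nsca.cfg
```

Keeping the formatting in its own function makes it easy to check the line on the child before anything is sent to the parent.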
From mad at b-care.net Thu Dec 16 17:43:17 2010
From: mad at b-care.net (Marc-André Doll)
Date: Thu, 16 Dec 2010 17:43:17 +0100
Subject: JVM Monitoring
In-Reply-To:
References: <1292335885.1679.12.camel@MADness>
Message-ID: <1292517797.1715.1.camel@MADness>

On Tue, 2010-12-14 at 17:04 +0000, Jim Avery wrote:
> On 14 December 2010 14:11, Marc-André Doll wrote:
> > Hi list,
> >
> > I have to monitor some JVM and I don't find plugins that fit exactly
> > with what I want/imagine.
> >
> > I could use the check_jmx but I don't really want to install a JRE on my
> > Nagios server.
> >
> > Currently, I'm monitoring Tomcat servers with check_jmx4perl and I'm
> > quite happy with it. Is it possible to configure/tweak the JVM or the
> > J4P war to use it on a non-JEE server? Or am I doomed to install java on
> > my monitoring server?
>
> I would think you could continue to use check_jmx4perl on the Tomcat
> server and get it to send the results back to Nagios as a passive
> check using send_nsca.
>
> You can either use the send_nsca which is built in to NSClient++, or
> there is a standalone binary send_nsca which is quite easy to use (
> http://exchange.nagios.org/directory/Addons/Passive-Checks/NSCA-Win32-Client/details
> ).
>
> You will of course need to configure the nsca daemon on your Nagios
> server if you haven't done that already.
>
> hth,
>
> Jim
>

Thank you Jim for your answer. I will use passive checks with NSCA.

Marc-André

From mad at b-care.net Thu Dec 16 17:48:28 2010
From: mad at b-care.net (Marc-André Doll)
Date: Thu, 16 Dec 2010 17:48:28 +0100
Subject: Setting Scheduled Downtime Externally
In-Reply-To:
References:
Message-ID: <1292518108.1715.6.camel@MADness>

On Thu, 2010-12-16 at 09:47 -0500, Tom Tucker wrote:
>
> Is it possible to set the "schedule downtime" value for hosts
> externally (curl for example)? I found some 2007 posts regarding this
> same topic, but the php script this individual wrote that would
> support such are no longer accessible.

Hi,

Yes, it is possible. I've done it to register periodic reboots. You have to send an HTTP POST request to the right URL. Look at the HTTP data sent when you set a scheduled downtime in the Nagios web interface, and mimic it with wget, curl, a PHP script, or similar.

Marc-André
From work at paul.dubuc.org Thu Dec 16 20:04:08 2010
From: work at paul.dubuc.org (Paul M. Dubuc)
Date: Thu, 16 Dec 2010 14:04:08 -0500
Subject: JVM Monitoring
In-Reply-To: <1292335885.1679.12.camel@MADness>
References: <1292335885.1679.12.camel@MADness>
Message-ID: <4D0A62A8.3070504@paul.dubuc.org>

Marc-André Doll wrote:
> Hi list,
>
> I have to monitor some JVM and I don't find plugins that fit exactly
> with what I want/imagine.
>
> I could use the check_jmx but I don't really want to install a JRE on my
> Nagios server.
>
> Currently, I'm monitoring Tomcat servers with check_jmx4perl and I'm
> quite happy with it. Is it possible to configure/tweak the JVM or the
> J4P war to use it on a non-JEE server? Or am I doomed to install java on
> my monitoring server?
>
>
> Thanks for your help.

I was just looking at the web page for check_jmx4perl at
http://exchange.nagios.org/directory/Plugins/Java-Applications-and-Servers/check_jmx4perl/details

It says: "No Java installation required on the Nagios host". Is this not true?

Paul Dubuc
From stanb at panix.com Thu Dec 16 20:04:41 2010
From: stanb at panix.com (stan)
Date: Thu, 16 Dec 2010 14:04:41 -0500
Subject: $OUTPUT$
In-Reply-To: <4D0A3C97.9010200@op5.se>
References: <20101216151140.GA5489@teddy.fas.com> <4D0A3C97.9010200@op5.se>
Message-ID: <20101216190441.GA11607@teddy.fas.com>

On Thu, Dec 16, 2010 at 05:21:43PM +0100, Andreas Ericsson wrote:
> On 12/16/2010 04:11 PM, stan wrote:
> > We are working on our first Nagios parent/child system. As I understand
> > it, on the child I need to invoke something that looks like:
> >
> > /usr/share/nagios/sbin/submit_check_result $HOSTNAME$ '$SERVICEDESC$'
> > $SERVICESTATE$ '$OUTPUT$'
> >
> > That script then does something like this:
> >
> > /usr/bin/printf "%s\t%s\t%s\t%s\n" "$1" "$2" "$return_code" "$4" |
> > /usr/sbin/send_nsca pnoc -c /etc/send_nsca.cfg
> >
> > My current issue is that $OUTPUT$ does not seem to be getting expanded.
> > Instead it is literally showing up on the Nagios run time web screen as
> > $OUTPUT$.
> >
>
> Check the docs for what macros are available for obsessive host and
> service check commands.
>
> Or use merlin. It was designed for things like this.
>

Yes, that (Merlin) has been recommended. Having spent 2 or 3 days getting nsca (close to) working, and having only a one-parent, one-(small)-child setup, I wonder if it is worth the time to change at this point. How much work is involved in setting Merlin up?

-- 
A: Because it messes up the order in which people normally read text.
Q: Why is top-posting such a bad thing?
A: Top-posting.
Q: What is the most annoying thing in e-mail?
From GJFRATER at bechtel.com Thu Dec 16 20:16:15 2010
From: GJFRATER at bechtel.com (Frater, Greg J)
Date: Thu, 16 Dec 2010 11:16:15 -0800
Subject: qpage - OT
In-Reply-To: <6162375473eb6aa87c524ebf6b1e9120.squirrel@webmail.stinkweasel.net>
References: <872CB0AEB377C240A112DD7C10B2592909A0CB87@wtps0171.amers.ibechtel.com> <6162375473eb6aa87c524ebf6b1e9120.squirrel@webmail.stinkweasel.net>
Message-ID: <872CB0AEB377C240A112DD7C10B2592909A0CB98@wtps0171.amers.ibechtel.com>

Thanks for the response Benny, I appreciate any help I can get.

>> qpage error:
>> <502 MESSAGE REJECTED - STX OR EOT EXPECTED>

>It would have been nice to see your qpage.cf file... ;)

That seems obvious; see below.

>Be sure you have 'parity=even' in your config. When you run a test with verbose and interactive flags set, do you fail five or six times before you get that message?

I've never tried the interactive flag; I will do so. As for the failures, when I had the retry count set to 20 it would fail 5 times in a row and then reset the modem or something (I can't fully interpret the logs), and then retry again, possibly 20 times, as in 20 sets of 5.

The interactive (-i) option seems to require a page to be sent right now. So far I have been unable to get a failure when sending a page manually, but I think I've only sent a small number (10-20) of pages manually. The only times it has failed so far are when it's running in daemon mode. Do you guys use USB modems with qpage? These problems got much worse after switching to a USB modem.
qpage.cf:

#
# QuickPage configuration file
#
administrator=xxxxxx at xxxxxxx.xxx
queuedir=/var/spool/qpage
pidfile=/var/spool/qpage/qpage.pid

modem=modem1
        device=/dev/modem

service=att
        device=modem1
        phone=9,18009094602
        baudrate=9600
        parity=even
        allowpid=yes
        #maxtries=6
        maxtries=3
        msgprefix=false
        #maxmsgsize=250
        maxmsgsize=500

From benny at bennyvision.com Thu Dec 16 20:28:49 2010
From: benny at bennyvision.com (C. Bensend)
Date: Thu, 16 Dec 2010 13:28:49 -0600
Subject: qpage - OT
In-Reply-To: <872CB0AEB377C240A112DD7C10B2592909A0CB98@wtps0171.amers.ibechtel.com>
References: <872CB0AEB377C240A112DD7C10B2592909A0CB87@wtps0171.amers.ibechtel.com> <6162375473eb6aa87c524ebf6b1e9120.squirrel@webmail.stinkweasel.net> <872CB0AEB377C240A112DD7C10B2592909A0CB98@wtps0171.amers.ibechtel.com>
Message-ID:

>>It would have been nice to see your qpage.cf file... ;)
> That seems obvious, see below
>
>>Be sure you have 'parity=even' in your config. When you run a test
> with verbose and interactive flags set, do you fail five or six times
> before you get that message?
>
> I've never tried the interactive flag, I will do so. As far as the
> failures go when I had the retry set to 20 it would to fail 5 times in a
> row and then reset the modem or something, I can't fully interpret the
> logs, and then retry again possible 20 times? as in 20 sets of 5.
> > The interactive (-i) option seems to require a page to be sent right > now. As of yet I have been unable to get a failure when sending a page > manually but I think I've really only sent a small number 10-20 pages > manually. The only times it has failed so far is when it's running in > daemon mode. Do you guys use USB modems with qpage? These problems got > much worse after switching to a USB modem. I didn't notice anything glaringly incorrect with your config... The reason I asked about parity is because I got the exact same error with Verizon and Sprint, except that qpage would decide that the page was not sent (when it had been), so it would retry five times (thereby sending five identical pages). That issue went away when I had a palm-forehead moment and added the 'parity=even' to my config. Benny -- "I'm no meteorologist, but I'm pretty sure it's rainin' bitches!" -- Cleveland, "Family Guy" ------------------------------------------------------------------------------ Lotusphere 2011 Register now for Lotusphere 2011 and learn how to connect the dots, take your collaborative environment to the next level, and enter the era of Social Business. http://p.sf.net/sfu/lotusphere-d2d _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
From stanb at panix.com Thu Dec 16 21:08:36 2010
From: stanb at panix.com (stan)
Date: Thu, 16 Dec 2010 15:08:36 -0500
Subject: $OUTPUT$
In-Reply-To: <4D0A3C97.9010200@op5.se>
References: <20101216151140.GA5489@teddy.fas.com> <4D0A3C97.9010200@op5.se>
Message-ID: <20101216200836.GA13763@teddy.fas.com>

On Thu, Dec 16, 2010 at 05:21:43PM +0100, Andreas Ericsson wrote:
> On 12/16/2010 04:11 PM, stan wrote:
> > We are working on our first Nagios parent child system. As I understand
> > it, on the child I need to invoke something that looks like:
> >
> > /usr/share/nagios/sbin/submit_check_result $HOSTNAME$ '$SERVICEDESC$'
> > $SERVICESTATE$ '$OUTPUT$'
> >
> > That script then does something like this:
> >
> > /usr/bin/printf "%s\t%s\t%s\t%s\n" "$1" "$2" "$return_code" "$4" |
> > /usr/sbin/send_nsca pnoc -c /etc/send_nsca.cfg
> >
> > My current issue is that $OUTPUT$ does not seem to be getting expanded.
> > Instead it is literally showing up on the Nagios run time web screen as
> > $OUTPUT$.
> >
>
> Check the docs for what macros are available for obsessive host and
> service check commands.
>
> Or use merlin. It was designed for things like this.
>

Closure for the list archives:

This suggestion led to a Google search for "Nagios Macros", which led to a
page listing the macros and "where" they are available. Turns out that the
actual macro is $HOSTOUTPUT$.

Question, why does the Nagios parser not complain about invalid macros?

--
A: Because it messes up the order in which people normally read text.
Q: Why is top-posting such a bad thing?
A: Top-posting.
Q: What is the most annoying thing in e-mail?
http://p.sf.net/sfu/lotusphere-d2d _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Thu Dec 16 21:55:14 2010 From: ae at op5.se (Andreas Ericsson) Date: Thu, 16 Dec 2010 21:55:14 +0100 Subject: $OUTPUT$ In-Reply-To: <20101216190441.GA11607@teddy.fas.com> References: <20101216151140.GA5489@teddy.fas.com> <4D0A3C97.9010200@op5.se> <20101216190441.GA11607@teddy.fas.com> Message-ID: <4D0A7CB2.50802@op5.se> On 12/16/2010 08:04 PM, stan wrote: > On Thu, Dec 16, 2010 at 05:21:43PM +0100, Andreas Ericsson wrote: >> On 12/16/2010 04:11 PM, stan wrote: >>> We are working on our first Nagios parent chiled system. As I understand >>> it, on the chile I need to invoke something that looks like: >>> >>> /usr/share/nagios/sbin/submit_check_result $HOSTNAME$ '$SERVICEDESC$' >>> $SERVICESTATE$ '$OUTPUT$' >>> >>> That script then does something like this: >>> >>> /usr/bin/printf "%s\t%s\t%s\t%s\n" "$1" "$2" "$return_code" "$4" | >>> /usr/sbin/send_nsca pnoc -c /etc/send_nsca.cfg >>> >>> Ny current issue is that $OUTPUT$ does ot seem to be getting expanded. >>> Insteadit is literaly showing up on the Nagios run time web screen as >>> $OUTPUT$. >>> >> >> Check the docs for what macros are available for obsessive host and >> service check commands. >> >> Or use merlin. It was designed for things like this. >> > Yes, that has been recomended (Merlin). Having spent 2 or 3 days getting > nsca (close to) working, and having only one parent, one (small) child > setup, I woner of it is worth the time to change at this point? > > How much work is involved in setting Merlin up? 
> Once the software is installed and you've configured one hostgroup to be handled by the poller, you type: mon node add type=poller hostgroup= on the master, and mon node add type=master and then restarting monitor and merlin. There's a more in-depth README available online at http://git.op5.org/git/?p=nagios/merlin.git;a=blob;f=HOWTO;hb=master and plenty of documentation in the wiki as well. Just google "op5 merlin" and you'll find it all. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace. ------------------------------------------------------------------------------ Lotusphere 2011 Register now for Lotusphere 2011 and learn how to connect the dots, take your collaborative environment to the next level, and enter the era of Social Business. http://p.sf.net/sfu/lotusphere-d2d _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Thu Dec 16 21:56:01 2010 From: ae at op5.se (Andreas Ericsson) Date: Thu, 16 Dec 2010 21:56:01 +0100 Subject: $OUTPUT$ In-Reply-To: <20101216200836.GA13763@teddy.fas.com> References: <20101216151140.GA5489@teddy.fas.com> <4D0A3C97.9010200@op5.se> <20101216200836.GA13763@teddy.fas.com> Message-ID: <4D0A7CE1.8080009@op5.se> On 12/16/2010 09:08 PM, stan wrote: > On Thu, Dec 16, 2010 at 05:21:43PM +0100, Andreas Ericsson wrote: >> On 12/16/2010 04:11 PM, stan wrote: >>> We are working on our first Nagios parent chiled system. 
As I understand
>>> it, on the child I need to invoke something that looks like:
>>>
>>> /usr/share/nagios/sbin/submit_check_result $HOSTNAME$ '$SERVICEDESC$'
>>> $SERVICESTATE$ '$OUTPUT$'
>>>
>>> That script then does something like this:
>>>
>>> /usr/bin/printf "%s\t%s\t%s\t%s\n" "$1" "$2" "$return_code" "$4" |
>>> /usr/sbin/send_nsca pnoc -c /etc/send_nsca.cfg
>>>
>>> My current issue is that $OUTPUT$ does not seem to be getting expanded.
>>> Instead it is literally showing up on the Nagios run time web screen as
>>> $OUTPUT$.
>>>
>>
>> Check the docs for what macros are available for obsessive host and
>> service check commands.
>>
>> Or use merlin. It was designed for things like this.
>>
> Closure for the list archives:
>
> This suggestion led to a Google search for "Nagios Macros", which led to a
> page listing the macros and "where" they are available. Turns out that the
> actual macro is $HOSTOUTPUT$.
>
> Question, why does the Nagios parser not complain about invalid macros?

Because they could be environment variables. The syntax for describing
macros is a bit fragile.

--
Andreas Ericsson                   andreas.ericsson at op5.se
OP5 AB                             www.op5.se
Tel: +46 8-230225                  Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.
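
A toy illustration of why a misspelled macro such as $OUTPUT$ can slip through and show up literally: an expander that substitutes only the names it knows and leaves every other $NAME$ token alone. This is a simplified sketch, not Nagios's actual macro code; the expand function and the macro table are hypothetical.

```python
import re

# Matches tokens of the form $NAME$ (upper-case letters, digits, underscore).
MACRO = re.compile(r"\$([A-Z0-9_]+)\$")


def expand(command_line, macros):
    """Replace $NAME$ tokens found in `macros`; leave unknown ones untouched."""
    def substitute(match):
        name = match.group(1)
        # Unknown names fall back to the original token, so no error is raised.
        return macros.get(name, match.group(0))
    return MACRO.sub(substitute, command_line)


# Hypothetical macro table for one service check result.
known = {"HOSTNAME": "AW210B", "SERVICEDESC": "RDP", "SERVICEOUTPUT": "PING OK"}

print(expand("$HOSTNAME$ '$SERVICEDESC$' '$OUTPUT$'", known))
print(expand("$HOSTNAME$ '$SERVICEDESC$' '$SERVICEOUTPUT$'", known))
```

The first call leaves '$OUTPUT$' in place because the name is not in the table; the second expands fully.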
From bruce.edge at gmail.com Thu Dec 16 23:36:00 2010
From: bruce.edge at gmail.com (Bruce Edge)
Date: Thu, 16 Dec 2010 14:36:00 -0800
Subject: CPU monitor for a single Linux user space process ?
In-Reply-To: <4D09A194.9060002@bettyscout.org>
References: <4D09A194.9060002@bettyscout.org>
Message-ID:

On Wed, Dec 15, 2010 at 9:20 PM, Mike Lindsey wrote:
> On 12/15/10 5:05 PM, Bruce Edge wrote:
>>
>> Rookie question here. Trying to determine nagios suitability for an
>> embedded app.
>>
>> Can I monitor the CPU utilization for a single user space process on a
>> Linux box with nagios?
>> And, can I define an action if it exceeds a threshold?
>
> Sounds like you need check_snmp_process.pl from here:
> http://nagios.manubulon.com/snmp_process.html
>
> I've been using it, it works quite well. It requires snmpd, but is
> basically the swiss army knife of user-space process monitoring.
>
> To "define an action" you need to set up an event handler.
> http://nagios.sourceforge.net/docs/3_0/eventhandlers.html

That's exactly what I need.
Would have taken me years to find that.

Thanks!

-Bruce

>
> --
> Mike Lindsey
>
::: Messages without supporting info will risk being sent to /dev/null From jim at jimavery.me.uk Thu Dec 16 23:41:12 2010 From: jim at jimavery.me.uk (Jim Avery) Date: Thu, 16 Dec 2010 22:41:12 +0000 Subject: JVM Monitoring In-Reply-To: <4D0A62A8.3070504@paul.dubuc.org> References: <1292335885.1679.12.camel@MADness> <4D0A62A8.3070504@paul.dubuc.org> Message-ID: On 16 December 2010 19:04, Paul M. Dubuc wrote: > I was just looking at the web page for check_jmx4perl at > > http://exchange.nagios.org/directory/Plugins/Java-Applications-and-Servers/check_jmx4perl/details > > It says that it requires "No Java installation required on the Nagios host". > > Is this not true? Maybe it is. I can't say I've tried it! ------------------------------------------------------------------------------ Lotusphere 2011 Register now for Lotusphere 2011 and learn how to connect the dots, take your collaborative environment to the next level, and enter the era of Social Business. http://p.sf.net/sfu/lotusphere-d2d _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From stanb at panix.com Thu Dec 16 23:45:14 2010 From: stanb at panix.com (stan) Date: Thu, 16 Dec 2010 17:45:14 -0500 Subject: Antoher distributed config question Message-ID: <20101216224514.GB17373@teddy.fas.com> Sorry for the torment of questions on this. We are trying to get this working with "end of year" money, so we are supposed to have it working before Jan 1 :-) We have the parent and child talking to each other using NSCA. We are getting updates on the child system on both the host detail, and service detail screens. On the parent system only the service detail screen is updating. 
The host detail screen continues to show "PENDING", with the red X that
indicates active checks are disabled.

I am looking at the data stream coming in on the pipe, and all I see are
messages like this:

[1292539309] PROCESS_SERVICE_CHECK_RESULT;AW210B;RDP;0;PING OK - Packet loss = 0%, RTA = 0.37 ms

Should I be seeing ones with something like HOST_SERVICE_CHECK_RESULT ?

--
A: Because it messes up the order in which people normally read text.
Q: Why is top-posting such a bad thing?
A: Top-posting.
Q: What is the most annoying thing in e-mail?

From listen at oberhausen-it.de Thu Dec 16 23:43:55 2010
From: listen at oberhausen-it.de (Daniel)
Date: Thu, 16 Dec 2010 23:43:55 +0100
Subject: Qugga BGP monitoring IPv6
Message-ID: <512491303.20101216234355@oberhausen-it.de>

Hey there,

can anyone tell me if it is possible to monitor the state of an IPv6
BGP session inside Quagga? I found some SNMP tools for Nagios, but they
can only handle IPv4 sessions.

--
Kind regards
Daniel
mailto:listen at oberhausen-it.de
From stanb at panix.com Fri Dec 17 00:22:12 2010
From: stanb at panix.com (stan)
Date: Thu, 16 Dec 2010 18:22:12 -0500
Subject: Antoher distributed config question
In-Reply-To: <20101216224514.GB17373@teddy.fas.com>
References: <20101216224514.GB17373@teddy.fas.com>
Message-ID: <20101216232212.GA18455@teddy.fas.com>

On Thu, Dec 16, 2010 at 05:45:14PM -0500, stan wrote:
> Sorry for the torrent of questions on this. We are trying to get this
> working with "end of year" money, so we are supposed to have it working
> before Jan 1 :-)
>
> We have the parent and child talking to each other using NSCA. We are
> getting updates on the child system on both the host detail, and service
> detail screens. On the parent system only the service detail screen is
> updating. The host detail screen continues to show "PENDING", with the red
> X that indicates active checks are disabled.
>
> I am looking at the data stream coming in on the pipe, and all I see are
> messages like this:
>
> [1292539309] PROCESS_SERVICE_CHECK_RESULT;AW210B;RDP;0;PING OK - Packet
> loss = 0%, RTA = 0.37 ms
>
> Should I be seeing ones with something like HOST_SERVICE_CHECK_RESULT ?

Sorry to follow up to my own email, but I meant to add this to the
original.

If it helps to understand my configuration, the child is an Ubuntu system,
and I started with their default configuration files, and used them as
examples. They look a lot different from the ones on the parent machine,
which were originally hand coded, and later imported into nagiosql.

--
A: Because it messes up the order in which people normally read text.
Q: Why is top-posting such a bad thing? A: Top-posting. Q: What is the most annoying thing in e-mail? ------------------------------------------------------------------------------ Lotusphere 2011 Register now for Lotusphere 2011 and learn how to connect the dots, take your collaborative environment to the next level, and enter the era of Social Business. http://p.sf.net/sfu/lotusphere-d2d _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From daniel.wittenberg.r0ko at statefarm.com Fri Dec 17 00:45:13 2010 From: daniel.wittenberg.r0ko at statefarm.com (Daniel Wittenberg) Date: Thu, 16 Dec 2010 16:45:13 -0700 Subject: CPU monitor for a single Linux user spaceprocess ? In-Reply-To: References: <4D09A194.9060002@bettyscout.org> Message-ID: <31B0FE0A1A8166409E9DF35C6DEECB240546F586@WPSCV6MM.OPR.STATEFARM.ORG> I also modified one of the other plugins to do a user-space, check_ps.pl I think was it, so I added a -u option that basically does a ps aux for all that user and then totals all the results. I can probably get a copy if you wanted it. Mainly because I don't use snmp for process checks is why I did it as a local check. Dan -----Original Message----- From: Bruce Edge [mailto:bruce.edge at gmail.com] Sent: Thursday, December 16, 2010 4:36 PM To: Mike Lindsey Cc: Nagios Users List Subject: Re: [Nagios-users] CPU monitor for a single Linux user spaceprocess ? On Wed, Dec 15, 2010 at 9:20 PM, Mike Lindsey wrote: > On 12/15/10 5:05 PM, Bruce Edge wrote: >> >> Rookie question here. Trying to determine nagios suitability for an >> embedded app. >> >> Can I monitor the CPU utilization for a single user space process on a >> Linux box with nagios? 
>> And, can I define an action if it exceeds a threshold? > > Sounds like you need check_snmp_process.pl from here: > http://nagios.manubulon.com/snmp_process.html > > I've been using it, it works quite well. ?It requires snmpd, but is > basically the swiss army knife of user-space process monitoring. > > To "define an action" you need to set up an event handler. > http://nagios.sourceforge.net/docs/3_0/eventhandlers.html That's exactly what I need. Would have taken me years to find that. Thanks! -Bruce > > -- > Mike Lindsey > > ------------------------------------------------------------------------------ Lotusphere 2011 Register now for Lotusphere 2011 and learn how to connect the dots, take your collaborative environment to the next level, and enter the era of Social Business. http://p.sf.net/sfu/lotusphere-d2d _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ Lotusphere 2011 Register now for Lotusphere 2011 and learn how to connect the dots, take your collaborative environment to the next level, and enter the era of Social Business. http://p.sf.net/sfu/lotusphere-d2d _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
From jorgeaaq at hotmail.com Fri Dec 17 03:16:29 2010
From: jorgeaaq at hotmail.com (Jorge Arenas)
Date: Thu, 16 Dec 2010 20:16:29 -0600
Subject: check_snmp
Message-ID:

Hi,

I just installed Nagios and I am following the basic guide on the site to
monitor a switch. I created the file and it works, but all switch ports are
always reported in OK status.

I checked the output of the command:

check_snmp -C public -H switch -r 1 .... etc

Even when the port reports a status of 2 (down), the "-r 1" parameter does
not work and the check reports "SNMP OK" for down ports.

I read the instructions but cannot find any information about the
-r (regex) parameter.

I found a workaround by changing "-r 1" to "-c 1:1", but I do not know
whether the documentation on the site is out of date or I am doing
something wrong.

Any ideas?

Thanks in advance,

Jorge Arenas
CSA Mexico
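
The "-c 1:1" workaround relies on the standard Nagios plugin threshold syntax, where a range names the acceptable values and anything outside it raises the alert (a leading "@" inverts that). A minimal sketch of the rule, assuming the range format described in the plugin development guidelines; the helper names are hypothetical and this is not check_snmp's own code.

```python
import math


def parse_range(spec):
    """Parse a Nagios-style range string into (low, high, inside)."""
    inside = spec.startswith("@")  # "@" means: alert when INSIDE the range
    if inside:
        spec = spec[1:]
    if ":" in spec:
        low_s, high_s = spec.split(":", 1)
    else:
        low_s, high_s = "0", spec  # a bare "10" means 0..10
    low = -math.inf if low_s == "~" else float(low_s or 0)
    high = math.inf if high_s == "" else float(high_s)
    return low, high, inside


def alerts(spec, value):
    """Return True if `value` should raise an alert for threshold `spec`."""
    low, high, inside = parse_range(spec)
    in_range = low <= value <= high
    return in_range if inside else not in_range


print(alerts("1:1", 1))  # port status 1
print(alerts("1:1", 2))  # port status 2 (down)
```

With "1:1" a status of 1 is inside the range and stays quiet, while 2 (down) falls outside it and alerts, which matches the behaviour of the workaround described above.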
::: Messages without supporting info will risk being sent to /dev/null From jpratt at norwich.edu Fri Dec 17 03:52:09 2010 From: jpratt at norwich.edu (James Pratt) Date: Thu, 16 Dec 2010 21:52:09 -0500 Subject: check_snmp In-Reply-To: References: Message-ID: <369C2BA4DE2C8F4A88BC422AD06C96BD325B@nuexchange.norwich.edu> you could try these instead of check_snmp , i always found check_snmp to be too limited... :\ http://nagios.manubulon.com From: Jorge Arenas [mailto:jorgeaaq at hotmail.com] Sent: Thursday, December 16, 2010 9:16 PM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] check_snmp Hi: i just install nagios and I am following the basic guide in the site to monitor a switch i create the file and work well, but all switch ports are reported in OK status always i check the output ok the command: check_snmp -C public -H switch -r 1 .... etc and even when the port reports status of 2 ( down) the "-r 1" parameter is not working and the report send "SNMP OK" for down ports I read the instruccions but i can not find any information regarding the -r (regex) parameter I Found a workaround changing the "-r 1" for the "-c 1:1" but I do not know if the documentation in the site is out-dated or i am making something wrong any ideas thanks in advance Jorge Arenas CSA Mexico -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Lotusphere 2011 Register now for Lotusphere 2011 and learn how to connect the dots, take your collaborative environment to the next level, and enter the era of Social Business. 
http://p.sf.net/sfu/lotusphere-d2d -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mad at b-care.net Fri Dec 17 08:19:52 2010 From: mad at b-care.net (=?ISO-8859-1?Q?Marc-Andr=E9?= Doll) Date: Fri, 17 Dec 2010 08:19:52 +0100 Subject: JVM Monitoring In-Reply-To: References: <1292335885.1679.12.camel@MADness> <4D0A62A8.3070504@paul.dubuc.org> Message-ID: <1292570392.1737.3.camel@MADness> On Thu, 2010-12-16 at 22:41 +0000, Jim Avery wrote: > On 16 December 2010 19:04, Paul M. Dubuc wrote: > > I was just looking at the web page for check_jmx4perl at > > > > http://exchange.nagios.org/directory/Plugins/Java-Applications-and-Servers/check_jmx4perl/details > > > > It says that it requires "No Java installation required on the Nagios host". > > > > Is this not true? Hi, Yes it is true. You don't need to install Java on your Nagios server to use check_jmx4perl. You just have to deploy a .war on your TomCat (and certainly GlassFish, I didn't try it) servers and then you just have to send some HTTP request to your server to obtain readings about your JVM. That's why I choose it. But for now, it seems it is only capable to check Java on JEE servers and not Java on its own as I need right now. Marc-Andr? ------------------------------------------------------------------------------ Lotusphere 2011 Register now for Lotusphere 2011 and learn how to connect the dots, take your collaborative environment to the next level, and enter the era of Social Business. 
From pitchfork at ederdrom.de Fri Dec 17 11:19:32 2010
From: pitchfork at ederdrom.de (Joerg Linge)
Date: Fri, 17 Dec 2010 11:19:32 +0100
Subject: JVM Monitoring
In-Reply-To: <1292570392.1737.3.camel@MADness>
References: <1292335885.1679.12.camel@MADness> <4D0A62A8.3070504@paul.dubuc.org> <1292570392.1737.3.camel@MADness>
Message-ID: <4D0B3934.1000602@ederdrom.de>

On 17.12.10 08:19, Marc-André Doll wrote:
> On Thu, 2010-12-16 at 22:41 +0000, Jim Avery wrote:
>> On 16 December 2010 19:04, Paul M. Dubuc wrote:
>>> I was just looking at the web page for check_jmx4perl at
>>>
>>> http://exchange.nagios.org/directory/Plugins/Java-Applications-and-Servers/check_jmx4perl/details
>>>
>>> It says that it requires "No Java installation required on the Nagios host".
>>>
>>> Is this not true?
>
> Hi,
>
> Yes, it is true. You don't need to install Java on your Nagios server to
> use check_jmx4perl. You just have to deploy a .war on your Tomcat (and
> certainly GlassFish, I didn't try it) servers and then you just have to
> send some HTTP requests to your server to obtain readings about your JVM.
>
> That's why I chose it. But for now, it seems it is only capable of
> checking Java on JEE servers and not Java on its own, as I need right now.
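
The HTTP round trip described above could look roughly like this on the Nagios side. It is only a sketch: the response shape assumed here (a JSON object with "status" and "value" fields, and "used"/"max" inside a heap memory value) follows the agent's JSON protocol as I understand it, and the sample payload and the helper name are made up.

```python
import json


def heap_usage_percent(response_text):
    """Turn an agent's JSON reply for a heap memory read into a percentage."""
    reply = json.loads(response_text)
    # Assumption: the agent reports its own status code inside the JSON body.
    if reply.get("status") != 200:
        raise ValueError("agent returned status %r" % reply.get("status"))
    heap = reply["value"]
    return 100.0 * heap["used"] / heap["max"]


# Hypothetical reply, as might come back from a request such as
# GET http://appserver:8080/<agent-context>/read/java.lang:type=Memory/HeapMemoryUsage
sample = '{"status": 200, "value": {"used": 52428800, "max": 209715200}}'
print(round(heap_usage_percent(sample), 1))
```

Only an HTTP client and a JSON parser are needed on the monitoring host, which is why no Java installation is required there.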
Hi Marc,

the jmx4perl agent is now called jolokia:

http://labs.consol.de/lang/de/jolokia/
http://www.jolokia.org/

There is also a JVM agent which can be used with Sun Java 6+:

http://www.jolokia.org/agent/jvm.html

Joerg

From stanb at panix.com Fri Dec 17 11:54:13 2010
From: stanb at panix.com (stan)
Date: Fri, 17 Dec 2010 05:54:13 -0500
Subject: Clarification on messages that NSCA shouls send
Message-ID: <20101217105413.GA31745@teddy.fas.com>

We are trying to get our first parent/child Nagios setup going. I have a
question about the traffic that I should see NSCA sending. I am seeing
service status updates, but no host status updates. I should see 2
different types of messages here, right? One for services, and one for
hosts. Is this correct?

A 2nd question: is there a way to see the fully expanded definition of a
given object (including all the nested "use" statements) somehow?

--
A: Because it messes up the order in which people normally read text.
Q: Why is top-posting such a bad thing?
A: Top-posting.
Q: What is the most annoying thing in e-mail?
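
On the two message types: as far as I know, send_nsca tells host results from service results purely by field count. A service result is four tab-separated fields, a host result is three (no service description), and on the receiving side they arrive as PROCESS_SERVICE_CHECK_RESULT and PROCESS_HOST_CHECK_RESULT respectively. A small sketch of the two line formats; the function names are made up, and the host and service values are taken from the log line quoted earlier in the thread.

```python
def service_line(host, service, return_code, output):
    """Line for a passive SERVICE result: four tab-separated fields."""
    return "%s\t%s\t%d\t%s\n" % (host, service, return_code, output)


def host_line(host, return_code, output):
    """Line for a passive HOST result: three fields, no service description."""
    return "%s\t%d\t%s\n" % (host, return_code, output)


# Each string would be written to send_nsca's standard input.
print(repr(service_line("AW210B", "RDP", 0, "PING OK")))
print(repr(host_line("AW210B", 0, "PING OK")))
```

If only service lines ever reach the parent, the child is probably not submitting host results at all; the usual places to look are obsess_over_hosts and ochp_command in the child's nagios.cfg.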
http://p.sf.net/sfu/lotusphere-d2d _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From michael.friedrich at univie.ac.at Fri Dec 17 18:33:33 2010 From: michael.friedrich at univie.ac.at (Michael Friedrich) Date: Fri, 17 Dec 2010 18:33:33 +0100 Subject: hostgroup with no members - Enhancement In-Reply-To: References: <31B0FE0A1A8166409E9DF35C6DEECB2404EEE87E@WPSCV6MM.OPR.STATEFARM.ORG> <4CCFBEE1.3040007@op5.se> <31B0FE0A1A8166409E9DF35C6DEECB2404EEEC37@WPSCV6MM.OPR.STATEFARM.ORG> <4CD01BB5.103@op5.se> <4CD14296.1030400@op5.se> <31B0FE0A1A8166409E9DF35C6DEECB240516D1F2@WPSCV6MM.OPR.STATEFARM.ORG> <4CE640A9.7080507@op5.se> <31B0FE0A1A8166409E9DF35C6DEECB240516D6E2@WPSCV6MM.OPR.STATEFARM.ORG> <4CF36248.1090401@op5.se> <4CF3A14B.7080802@op5.se> Message-ID: <4D0B9EED.60604@univie.ac.at> -------- Original Message -------- Subject: Re: [Nagios-users] hostgroup with no members - Enhancement From: Max Schubert To: Andreas Ericsson , Nagios Users List Date: 2010-11-29 15:07 > Thank you, Andreas - I actually really enjoy writing tests and > understand their value - will see if I can put a few in place using > tap over the next week or so and update the patch with them. considering the tests in t-tap - adding such a new config option breaks them. at least in test_timeperiods.c and test_nagios_config.c the variable needs to be added. int allow_empty_hostgroup_assignment; (sorry, I'm too lazy for git now) kind regards, Michael > On 11/29/10, Andreas Ericsson wrote: >> On 11/29/2010 09:20 AM, Andreas Ericsson wrote: >>> This looks nice. It's in my "compile and test" queue right now, so >>> assuming it works out ok it'll be committed before the week is out. >>> >> And now it's out there. 
More testing would be much appreciated though, >> but considering the patch is fairly well written I have few qualms >> about it. >> >> Cheers, and thanks again. >> >> -- >> Andreas Ericsson andreas.ericsson at op5.se >> OP5 AB www.op5.se >> Tel: +46 8-230225 Fax: +46 8-230231 >> >> Considering the successes of the wars on alcohol, poverty, drugs and >> terror, I think we should give some serious thought to declaring war >> on peace. >> > ------------------------------------------------------------------------------ > Increase Visibility of Your 3D Game App& Earn a Chance To Win $500! > Tap into the largest installed PC base& get more eyes on your game by > optimizing for Intel(R) Graphics Technology. Get started today with the > Intel(R) Software Partner Program. Five $500 cash prizes are up for grabs. > http://p.sf.net/sfu/intelisp-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- DI (FH) Michael Friedrich Vienna University Computer Center Universitaetsstrasse 7 A-1010 Vienna, Austria email: michael.friedrich at univie.ac.at phone: +43 1 4277 14359 fax: +43 1 4277 14338 web: http://www.univie.ac.at/zid http://www.aco.net Icinga Core& IDOUtils Developer http://www.icinga.org ------------------------------------------------------------------------------ Lotusphere 2011 Register now for Lotusphere 2011 and learn how to connect the dots, take your collaborative environment to the next level, and enter the era of Social Business. 

From Eliot.Picken at wenaas.co.uk Fri Dec 17 19:05:44 2010
From: Eliot.Picken at wenaas.co.uk (Eliot.Picken at wenaas.co.uk)
Date: Fri, 17 Dec 2010 18:05:44 +0000
Subject: AUTO: Eliot Picken is out of the office (returning 10/01/2011)
Message-ID:

I am out of the office until 10/01/2011.

I am currently out of the office, and your email has not been forwarded. I will be reading email periodically during my time out of the office.

Note: This is an automated response to your message "Re: [Nagios-users] hostgroup with no members - Enhancement" sent on 12/17/2010 5:33:33 PM.

This is the only notification you will receive while this person is away.

From tonvoon at gmail.com Fri Dec 17 22:02:11 2010
From: tonvoon at gmail.com (Ton Voon)
Date: Fri, 17 Dec 2010 21:02:11 +0000
Subject: hostgroup with no members - Enhancement
In-Reply-To: <4D0B9EED.60604@univie.ac.at>
References: <31B0FE0A1A8166409E9DF35C6DEECB2404EEE87E@WPSCV6MM.OPR.STATEFARM.ORG> <4CCFBEE1.3040007@op5.se> <31B0FE0A1A8166409E9DF35C6DEECB2404EEEC37@WPSCV6MM.OPR.STATEFARM.ORG> <4CD01BB5.103@op5.se> <4CD14296.1030400@op5.se> <31B0FE0A1A8166409E9DF35C6DEECB240516D1F2@WPSCV6MM.OPR.STATEFARM.ORG> <4CE640A9.7080507@op5.se> <31B0FE0A1A8166409E9DF35C6DEECB240516D6E2@WPSCV6MM.OPR.STATEFARM.ORG> <4CF36248.1090401@op5.se> <4CF3A14B.7080802@op5.se> <4D0B9EED.60604@univie.ac.at>
Message-ID:

On 17 Dec 2010, at 17:33, Michael Friedrich wrote:

> -------- Original Message --------
> Subject: Re: [Nagios-users] hostgroup with no members - Enhancement
> From: Max Schubert
> To: Andreas Ericsson, Nagios Users List
> Date: 2010-11-29 15:07
>> Thank you, Andreas - I actually really enjoy writing tests and
>> understand their value - will see if I can put a few in place using
>> tap over the next week or so and update the patch with them.
>
> considering the tests in t-tap - adding such a new config option breaks
> them.
>
> at least in test_timeperiods.c and test_nagios_config.c the variable
> needs to be added.
>
> int allow_empty_hostgroup_assignment;

I didn't really follow this thread earlier, so bear with me with the questions:

Does this have to be implemented as another variable? Why can't you always allow an empty hostgroup?

(In the interests of keeping it simple)

Ton

From mgagne at iweb.com Fri Dec 17 22:16:15 2010
From: mgagne at iweb.com (Mathieu Gagné)
Date: Fri, 17 Dec 2010 16:16:15 -0500
Subject: hostgroup with no members - Enhancement
In-Reply-To:
References: <31B0FE0A1A8166409E9DF35C6DEECB2404EEE87E@WPSCV6MM.OPR.STATEFARM.ORG> <4CCFBEE1.3040007@op5.se> <31B0FE0A1A8166409E9DF35C6DEECB2404EEEC37@WPSCV6MM.OPR.STATEFARM.ORG> <4CD01BB5.103@op5.se> <4CD14296.1030400@op5.se> <31B0FE0A1A8166409E9DF35C6DEECB240516D1F2@WPSCV6MM.OPR.STATEFARM.ORG> <4CE640A9.7080507@op5.se> <31B0FE0A1A8166409E9DF35C6DEECB240516D6E2@WPSCV6MM.OPR.STATEFARM.ORG> <4CF36248.1090401@op5.se> <4CF3A14B.7080802@op5.se> <4D0B9EED.60604@univie.ac.at>
Message-ID: <4D0BD31F.6080002@iweb.com>

On 12/17/10 4:02 PM, Ton Voon wrote:
>
> I didn't really follow this thread earlier, so bear with me with the
> questions:
>
> Does this have to be implemented as another variable? Why can't you
> always allow an empty hostgroup?
>
> (In the interests of keeping it simple)

I agree with Ton.

Why not add servicegroups and contactgroups while we're at it?

--
Mathieu

From Rick.Carter at umich.edu Fri Dec 17 22:23:15 2010
From: Rick.Carter at umich.edu (Rick Carter)
Date: Fri, 17 Dec 2010 16:23:15 -0500
Subject: hostgroup with no members - Enhancement
In-Reply-To: <4D0BD31F.6080002@iweb.com>
References: <31B0FE0A1A8166409E9DF35C6DEECB2404EEE87E@WPSCV6MM.OPR.STATEFARM.ORG> <4CCFBEE1.3040007@op5.se> <31B0FE0A1A8166409E9DF35C6DEECB2404EEEC37@WPSCV6MM.OPR.STATEFARM.ORG> <4CD01BB5.103@op5.se> <4CD14296.1030400@op5.se> <31B0FE0A1A8166409E9DF35C6DEECB240516D1F2@WPSCV6MM.OPR.STATEFARM.ORG> <4CE640A9.7080507@op5.se> <31B0FE0A1A8166409E9DF35C6DEECB240516D6E2@WPSCV6MM.OPR.STATEFARM.ORG> <4CF36248.1090401@op5.se> <4CF3A14B.7080802@op5.se> <4D0B9EED.60604@univie.ac.at> <4D0BD31F.6080002@iweb.com>
Message-ID: <00CC5F69-891E-4F7A-BD92-B0E7490CE4C5@umich.edu>

I can see issuing warnings (because sometimes in a case like this, someone's just made a typo), or, in future, having a setting in nagios.cfg for each type of group that controls whether an empty group is fatal, a warning, or ignored.

On Dec 17, 2010, at 4:16 PM, Mathieu Gagné wrote:

> On 12/17/10 4:02 PM, Ton Voon wrote:
>>
>> I didn't really follow this thread earlier, so bear with me with the
>> questions:
>>
>> Does this have to be implemented as another variable? Why can't you
>> always allow an empty hostgroup?
>>
>> (In the interests of keeping it simple)
>
> I agree with Ton.
>
> Why not adding servicegroups and contactgroups while we are there?
>
> --
> Mathieu

--
Rick Carter, Unix/Linux SysAdmin
University of Michigan, ITS System Support Team
"Bugs Bunny is who we want to be; Daffy Duck is probably who we are." - Chuck Jones

From patrick.morris at hp.com Sat Dec 18 05:50:47 2010
From: patrick.morris at hp.com (Morris, Patrick)
Date: Fri, 17 Dec 2010 20:50:47 -0800
Subject: Clarification on messages that NSCA shouls send
In-Reply-To: <20101217105413.GA31745@teddy.fas.com>
References: <20101217105413.GA31745@teddy.fas.com>
Message-ID: <4D0C3DA7.8020008@hp.com>

On 12/17/2010 2:54 AM, stan wrote:
> We are trying to get our first parent/child relationship Nagios setup
> going. I have a question about the traffic that I should see NSCA sending. I
> am seeing service status updates, but no host status updates.
>
> I should see 2 different types of messages here, right? One for services,
> and one for hosts, is this correct?

It is correct that the messages are different. However, nsca will only send the messages your configuration tells it to send. If you haven't configured Nagios to use nsca to send host status messages, it won't.

> A 2nd question: is there a way to see the fully expanded (including all the
> nested USE statements) definition of a given object, somehow?

Have you looked at the configuration section of the Nagios interface? That may show you what you're looking for.

From ae at op5.se Sat Dec 18 16:25:20 2010
From: ae at op5.se (Andreas Ericsson)
Date: Sat, 18 Dec 2010 16:25:20 +0100
Subject: Antoher distributed config question
In-Reply-To: <20101216224514.GB17373@teddy.fas.com>
References: <20101216224514.GB17373@teddy.fas.com>
Message-ID: <4D0CD260.4030705@op5.se>

On 12/16/2010 11:45 PM, stan wrote:
> Sorry for the torment of questions on this. We are trying to get this
> working with "end of year" money, so we are supposed to have it working
> before Jan 1 :-)
>
> We have the parent and child talking to each other using NSCA. We are
> getting updates on the child system on both the host detail, and service
> detail screens. On the parent system only the service detail screen is
> updating. The host detail screen continues to show "PENDING", with the red
> X that indicates active checks are disabled.
>
> I am looking at the data stream coming in on the pipe, and all I see are
> messages like this:
>
> [1292539309] PROCESS_SERVICE_CHECK_RESULT;AW210B;RDP;0;PING OK - Packet
> loss = 0%, RTA = 0.37 ms
>
> Should I be seeing ones with something like HOST_SERVICE_CHECK_RESULT ?
>

If you've set up the obsessive host check command too and not only the obsessive service check command, then yes. Otherwise, no. But you'll only see it when host checks are run, so if you've turned off scheduled host checks you won't see it very frequently.

--
Andreas Ericsson andreas.ericsson at op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.

From dylanklc at gmail.com Sun Dec 19 03:44:51 2010
From: dylanklc at gmail.com (k Dylan')
Date: Sun, 19 Dec 2010 10:44:51 +0800
Subject: Nagios-users Digest, Vol 55, Issue 14
In-Reply-To:
References:
Message-ID:

Hello, a question about Nagios notifications.

I use Nagios with Merlin. For testing, I wrote a notification plugin that exits with return code 2. Nagios has this config: notification_options is w,u,c,r,f,s and notification_period is 24*7, so Nagios should keep notifying indefinitely. But after a notification timeout, Nagios never notifies me again. At the same time, the Nagios core CGI shows the service's status as CRITICAL.

I checked nagios.log, and it contains nothing except startup and shutdown entries.

Has anybody had the same problem? How did you solve it?

best regards
dylanklc

From standalone.sysadmin at gmail.com Mon Dec 20 02:31:43 2010
From: standalone.sysadmin at gmail.com (Matt Simmons)
Date: Sun, 19 Dec 2010 20:31:43 -0500
Subject: exchange.nagios.org down?
Message-ID:

I'm having issues getting to the exchange, and according to the all-knowing, all-seeing powers that be (http://www.downforeveryoneorjustme.com/exchange.nagios.org), I'm not alone.

Any knowledge of the issue or ETR?

--Matt

--
LITTLE GIRL: But which cookie will you eat FIRST?
COOKIE MONSTER: Me think you have misconception of cookie-eating process.

From standalone.sysadmin at gmail.com Mon Dec 20 14:22:42 2010
From: standalone.sysadmin at gmail.com (Matt Simmons)
Date: Mon, 20 Dec 2010 08:22:42 -0500
Subject: exchange.nagios.org down?
In-Reply-To:
References:
Message-ID:

Must have been my bad luck. It seems to be up now.

Disregard complaining. Acquire plugins.

--Matt

On Sun, Dec 19, 2010 at 8:31 PM, Matt Simmons wrote:
> I'm having issues getting to the exchange, and according to the
> all-knowing, all-seeing powers that be
> (http://www.downforeveryoneorjustme.com/exchange.nagios.org), I'm not
> alone.
>
> Any knowledge of the issue or ETR?
>
> --Matt

--
LITTLE GIRL: But which cookie will you eat FIRST?
COOKIE MONSTER: Me think you have misconception of cookie-eating process.

From pascal.miquet at wanadoo.fr Mon Dec 20 15:07:35 2010
From: pascal.miquet at wanadoo.fr (Pascal Miquet)
Date: Mon, 20 Dec 2010 15:07:35 +0100
Subject: Email Notifications
Message-ID:

Hi,

I've installed Nagios 3 on a Debian box, and it seems that for some notifications I get the user name rather than the email address of the user attached to the notification.

Do you have any information on this?

Thanks for your help,
Pascal

From stanb at panix.com Mon Dec 20 15:45:28 2010
From: stanb at panix.com (stan)
Date: Mon, 20 Dec 2010 09:45:28 -0500
Subject: NSCA and host checks
Message-ID: <20101220144528.GA24567@teddy.fas.com>

I am working on getting a small distributed system up. I have the service checks going back to the master, but have not managed to get host checks passed back to the master yet.

With help from the list I found obsess_over_hosts and set it to 1. It appears that I also need an ochp command. I tried using the script I had set up for service checks, but it appears that the arguments passed to it are different. Here are the arguments that are being passed to the service check command:

Arguments:
# $1 = host_name (Short name of host that the service is
#      associated with)
# $2 = svc_description (Description of the service)
# $3 = state_string (A string representing the status of
#      the given service - "OK", "WARNING", "CRITICAL"
#      or "UNKNOWN")
# $4 = plugin_output (A text string that
#

What do the ones passed to the host check look like?

--
A: Because it messes up the order in which people normally read text.
Q: Why is top-posting such a bad thing?
A: Top-posting.
Q: What is the most annoying thing in e-mail?

From ae at op5.se Mon Dec 20 15:54:45 2010
From: ae at op5.se (Andreas Ericsson)
Date: Mon, 20 Dec 2010 15:54:45 +0100
Subject: NSCA and host checks
In-Reply-To: <20101220144528.GA24567@teddy.fas.com>
References: <20101220144528.GA24567@teddy.fas.com>
Message-ID: <4D0F6E35.4080105@op5.se>

On 12/20/2010 03:45 PM, stan wrote:
> I am working on getting a small distributed system up. I have the service
> checks going back to the master, but have not managed to get host checks
> passed back to the master yet.
>
> With help from the list I found obsess_over_hosts and set it to 1. It
> appears that I also need an ochp command. I tried using the script I had
> set up for service checks, but it appears that the arguments passed to it
> are different. Here are the arguments that are being passed to the
> service check command:
>
> Arguments:
> # $1 = host_name (Short name of host that the service is
> #      associated with)
> # $2 = svc_description (Description of the service)
> # $3 = state_string (A string representing the status of
> #      the given service - "OK", "WARNING", "CRITICAL"
> #      or "UNKNOWN")
> # $4 = plugin_output (A text string that
> #
> What do the ones passed to the host check look like?
>

The same, but without the service description.
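
As a hedged illustration of that answer: a host result handed to send_nsca carries three tab-separated fields (host name, return code, plugin output), where a service result carries four. The sketch below assumes an ochp_command that passes $HOSTNAME$, $HOSTSTATE$ and $HOSTOUTPUT$ as the three arguments. The function names are hypothetical and are not part of Nagios or NSCA.

```shell
#!/bin/sh
# Sketch only: host counterpart of a service submit script.
# Assumption: the caller pipes the resulting line into send_nsca.

# Map a host state string to the numeric code used for passive host checks
# (0 = UP, 1 = DOWN, 2 = UNREACHABLE). Fails on anything else.
host_state_to_code() {
    case "$1" in
        UP)          echo 0 ;;
        DOWN)        echo 1 ;;
        UNREACHABLE) echo 2 ;;
        *)           return 1 ;;
    esac
}

# Build one input line for send_nsca: host<TAB>code<TAB>output.
# Arguments: $1 = host_name, $2 = state_string, $3 = plugin_output
# Note that there is no svc_description argument for hosts.
format_host_result() {
    code=$(host_state_to_code "$2") || return 1
    printf '%s\t%s\t%s\n' "$1" "$code" "$3"
}
```

The line printed by format_host_result would then be piped to send_nsca in the same way the existing service script does it, with the host and config file options adjusted to the local setup.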

--
Andreas Ericsson andreas.ericsson at op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.

From gopearls42 at gmail.com Mon Dec 20 17:13:59 2010
From: gopearls42 at gmail.com (mark bradley)
Date: Mon, 20 Dec 2010 11:13:59 -0500
Subject: tracing nagios actions
Message-ID:

Hi,

I have a small-ish number of servers and I've tried to configure Nagios to warn me about disk-space running low. The problem is that, although disk space is above both warning and critical levels, I'm not getting any notifications.

The nagios.log file is silent on the topic and nagios -v does not produce any errors or warnings.

Is there a way to trace what actions nagios is considering (much like make -n) in order to debug this problem? Is there a debugging methodology defined somewhere? If it's in your head can you share?

Thanks,
Mark

From eric.berg at barclayscapital.com Mon Dec 20 17:02:54 2010
From: eric.berg at barclayscapital.com (eric.berg at barclayscapital.com)
Date: Mon, 20 Dec 2010 11:02:54 -0500
Subject: Nagios kept from restarting after reboot by lock file
Message-ID:

Gee, this seems like an annoying newbie problem, but if Nagios crashes or is killed (as on system reboot), it leaves a lock file around that prevents it from starting again until the lock file is manually removed.

I see this on Monday mornings after weekend reboots on a Red Hat Linux box:

nagios: Lockfile '/home/nagios/nagios/var/nagios.lock' looks like its already held by another instance of Nagios (PID 0). Bailing out...

Does anyone know if there's a config option or something else that obviates the need to write a wrapper script to check whether Nagios is really running and remove the lock file (it looks like Nagios already knows it's not running, by virtue of the value of the PID in this very message!) so that it can cleanly start up again?

Thanks.

Eric

_______________________________________________

This e-mail may contain information that is confidential, privileged or otherwise protected from disclosure. If you are not an intended recipient of this e-mail, do not duplicate or redistribute it by any means. Please delete it and any attachments and notify the sender that you have received it in error. Unless specifically indicated, this e-mail is not an offer to buy or sell or a solicitation to buy or sell any securities, investment products or other financial product or service, an official confirmation of any transaction, or an official statement of Barclays. Any views or opinions presented are solely those of the author and do not necessarily represent those of Barclays. This e-mail is subject to terms available at the following link: www.barcap.com/emaildisclaimer. By messaging with Barclays you consent to the foregoing. Barclays Capital is the investment banking division of Barclays Bank PLC, a company registered in England (number 1026167) with its registered office at 1 Churchill Place, London, E14 5HP. This email may relate to or be sent from other members of the Barclays Group.

_______________________________________________

From polifemos at conedsolutions.com Mon Dec 20 17:35:11 2010
From: polifemos at conedsolutions.com (Polifemo, Salvatore)
Date: Mon, 20 Dec 2010 11:35:11 -0500
Subject: tracing nagios actions
In-Reply-To:
References:
Message-ID: <5BE7D0404F28DC44B780100927AA4CB018664C39@whplex3.int.cecdes.net>

Are these Windows or *nix servers?

Either way, run the check command manually from a console and see what the results are.

Salvatore Polifemo
Sr. Systems Security Specialist
ConEdison Solutions
100 Summit Lake Drive
Valhalla, NY 10595

From: mark bradley [mailto:gopearls42 at gmail.com]
Sent: Monday, December 20, 2010 11:14 AM
To: nagios-users at lists.sourceforge.net
Subject: [Nagios-users] tracing nagios actions

Hi,

I have a small-ish number of servers and I've tried to configure Nagios to warn me about disk-space running low. The problem is that, although disk space is above both warning and critical levels, I'm not getting any notifications.

The nagios.log file is silent on the topic and nagios -v does not produce any errors or warnings.

Is there a way to trace what actions nagios is considering (much like make -n) in order to debug this problem? Is there a debugging methodology defined somewhere? If it's in your head can you share?

Thanks,
Mark

From eric.berg at barclayscapital.com Mon Dec 20 17:16:19 2010
From: eric.berg at barclayscapital.com (eric.berg at barclayscapital.com)
Date: Mon, 20 Dec 2010 11:16:19 -0500
Subject: Nagios kept from restarting after reboot by lock file
In-Reply-To:
References:
Message-ID:

Alternatively, could you recommend a good system/resource monitoring tool that would be able to let me know if nagios is down and restart it automatically?

_____________________________________________
From: Berg, Eric: IT (NYK)
Sent: Monday, December 20, 2010 11:03 AM
To: 'nagios-users at lists.sourceforge.net'
Subject: Nagios kept from restarting after reboot by lock file

Gee, this seems like an annoying newbie problem, but if Nagios crashes or is killed (as on system reboot), it leaves a lock file around that prevents it from starting again until the lock file is manually removed.

I see this on Monday mornings after weekend reboots on a Red Hat Linux box:

nagios: Lockfile '/home/nagios/nagios/var/nagios.lock' looks like its already held by another instance of Nagios (PID 0). Bailing out...

Does anyone know if there's a config option or something else that obviates the need to write a wrapper script to check whether Nagios is really running and remove the lock file (it looks like Nagios already knows it's not running, by virtue of the value of the PID in this very message!) so that it can cleanly start up again?

Thanks.

Eric

From gopearls42 at gmail.com Mon Dec 20 17:50:14 2010
From: gopearls42 at gmail.com (mark bradley)
Date: Mon, 20 Dec 2010 11:50:14 -0500
Subject: tracing nagios actions
In-Reply-To: <5BE7D0404F28DC44B780100927AA4CB018664C39@whplex3.int.cecdes.net>
References: <5BE7D0404F28DC44B780100927AA4CB018664C39@whplex3.int.cecdes.net>
Message-ID:

Hi Salvatore,

They're all Unix (Red Hat) servers. By check command do you mean nagios -v? I've done that and I do not get any errors.
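
For readers following the thread: "check command" in the quoted advice most likely refers to the plugin command line behind the service (here, the disk check) rather than to nagios -v, which only verifies the configuration files. A minimal sketch of running a plugin by hand and reading its exit status is shown below. It relies on the standard plugin convention (0 OK, 1 WARNING, 2 CRITICAL, 3 UNKNOWN); the helper name run_check is hypothetical.

```shell
#!/bin/sh
# Sketch only: run a check plugin manually and show how its exit status
# maps onto the standard plugin states.
run_check() {
    # Capture plugin output and exit status without aborting on failure.
    output=$("$@" 2>&1) && status=0 || status=$?
    case "$status" in
        0) state=OK ;;
        1) state=WARNING ;;
        2) state=CRITICAL ;;
        3) state=UNKNOWN ;;
        # Anything else is not a valid plugin status; 127 from the shell,
        # for example, means the command was not found.
        *) state=OUT-OF-RANGE ;;
    esac
    printf '%s (exit %s): %s\n' "$state" "$status" "$output"
    return "$status"
}

# Example invocation. The plugin path and thresholds are assumptions;
# copy the real ones from the command_line of your command definition:
# run_check /usr/local/nagios/libexec/check_disk -w 20% -c 10% -p /
```

Running the plugin as the same user the Nagios daemon runs as tends to give the most representative result, since permissions and environment can differ from an interactive root shell.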

Thanks,
Mark

On Mon, Dec 20, 2010 at 11:35 AM, Polifemo, Salvatore <polifemos at conedsolutions.com> wrote:

> Are these Windows or *nix servers?
>
> Either way, run the check command manually from a console and see what the
> results are.
>
> Salvatore Polifemo
> Sr. Systems Security Specialist
> ConEdison Solutions
> 100 Summit Lake Drive
> Valhalla, NY 10595
>
> From: mark bradley [mailto:gopearls42 at gmail.com]
> Sent: Monday, December 20, 2010 11:14 AM
> To: nagios-users at lists.sourceforge.net
> Subject: [Nagios-users] tracing nagios actions
>
> Hi,
>
> I have a small-ish number of servers and I've tried to configure Nagios to
> warn me about disk-space running low. The problem is that, although disk
> space is above both warning and critical levels I'm not getting any
> notifications.
>
> The nagios.log file is silent on the topic and nagios -v does not produce
> any errors or warnings.
>
> Is there a way to trace what actions nagios is considering (much like make
> -n) in order to debug this problem? Is there a debugging methodology defined
> somewhere? If it's in your head can you share?
>
> Thanks,
> Mark
From daniel.wittenberg.r0ko at statefarm.com Mon Dec 20 17:55:44 2010
From: daniel.wittenberg.r0ko at statefarm.com (Daniel Wittenberg)
Date: Mon, 20 Dec 2010 09:55:44 -0700
Subject: Nagios kept from restarting after reboot by lockfile
In-Reply-To:
References:
Message-ID: <31B0FE0A1A8166409E9DF35C6DEECB24054BBABE@WPSCV6MM.OPR.STATEFARM.ORG>

Couple of questions:

1) Why do you have to reboot your monitoring server weekly?
2) How is the reboot being done?

The reason I ask 2) is that the standard rc script will remove the lockfile when nagios is told to stop. So if you are having this problem, it sounds like you are not doing a clean shutdown and something could be wrong. Either way, I guess worst case, one way to handle this would be to put something like this in your /etc/rc.d/rc.local:

rm -f /var/lock/subsys/nagios

Assuming that's where your lockfile is.
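
A slightly more cautious variant of the unconditional rm can be sketched as follows: it removes the lock file only when no running process owns it. This is a minimal sketch, not something taken from the Nagios init script. The default path is the one from the error message quoted in this thread, the function names are made up for illustration, and it assumes the lock file holds the owning PID as plain text.

```shell
#!/bin/sh
# Sketch: remove a Nagios lock file only when no live process owns it.
# The default LOCKFILE is the path from the error message in this thread;
# that the file contains a plain-text PID is an assumption.
LOCKFILE="${LOCKFILE:-/home/nagios/nagios/var/nagios.lock}"

# Succeeds (exit 0) if the lock file names a process that is still running.
# Meant to run as root (for example from rc.local), so that kill -0 can
# probe processes owned by any user.
lock_is_live() {
    [ -f "$1" ] || return 1
    pid=$(tr -cd '0-9' < "$1")
    # An empty PID or PID 0 cannot belong to a running nagios instance.
    [ -n "$pid" ] && [ "$pid" -gt 0 ] || return 1
    kill -0 "$pid" 2>/dev/null
}

# Deletes the lock file unless a live process still holds it.
remove_stale_lock() {
    if lock_is_live "$1"; then
        echo "lock $1 is held by a running process; leaving it alone"
    else
        rm -f "$1"
    fi
}

remove_stale_lock "$LOCKFILE"
```

One limitation: if the PID recorded before the reboot has since been reused by an unrelated process, this sketch leaves the lock file in place.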
Dan

-----Original Message-----
From: eric.berg at barclayscapital.com [mailto:eric.berg at barclayscapital.com]
Sent: Monday, December 20, 2010 10:16 AM
To: eric.berg at barclayscapital.com; nagios-users at lists.sourceforge.net
Subject: Re: [Nagios-users] Nagios kept from restarting after reboot by lockfile

Alternatively, could you recommend a good system/resource monitoring tool that would be able to let me know if nagios is down and restart it automatically?

_____________________________________________
From: Berg, Eric: IT (NYK)
Sent: Monday, December 20, 2010 11:03 AM
To: 'nagios-users at lists.sourceforge.net'
Subject: Nagios kept from restarting after reboot by lock file

Gee, this seems like an annoying newbie problem, but if Nagios crashes or is killed (as on system reboot), it leaves a lock file around that prevents it from starting again until the lock file is manually removed. I see this on Monday mornings after weekend reboots on a Red Hat Linux box:

nagios: Lockfile '/home/nagios/nagios/var/nagios.lock' looks like its already held by another instance of Nagios (PID 0). Bailing out...

Does anyone know if there's a config option or something else that obviates the need to write a wrapper script to check whether Nagios is really running and remove the lock file (looks like Nagios already knows it's not running by virtue of the value of the PID in this very message!) so that it can cleanly start up again?

Thanks.

Eric
From a31modela at hotmail.com Mon Dec 20 18:13:41 2010
From: a31modela at hotmail.com (steve f)
Date: Mon, 20 Dec 2010 12:13:41 -0500
Subject: tracing nagios actions
In-Reply-To:
References: , <5BE7D0404F28DC44B780100927AA4CB018664C39@whplex3.int.cecdes.net>,
Message-ID:

Mark,

I think Salvatore means run the check manually from the command line. Make sure you run it as the nagios user, and also try setting the warning & critical values to something that will make it fail:

/usr/local/nagios/libexec > ./check_disk -w 50 -c 70 -p /home
DISK OK - free space: /home 440 MB (95% inode=99%);| /home=20MB;436;416;0;486

The -p just checks a specific path. ( FYI )

Steve
-------------- next part --------------
An HTML attachment was scrubbed...
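
When running a check by hand as suggested above, the exit status matters as much as the text the plugin prints, because by plugin convention 0, 1, 2 and 3 stand for OK, WARNING, CRITICAL and UNKNOWN. A minimal sketch of a wrapper that shows both follows; run_check is a hypothetical helper name, not part of Nagios or its plugins.

```shell
#!/bin/sh
# Sketch: run a plugin by hand and translate its exit status into the
# state name of the plugin convention. run_check is a hypothetical helper.
run_check() {
    output=$("$@" 2>&1)
    status=$?
    case $status in
        0) state=OK ;;
        1) state=WARNING ;;
        2) state=CRITICAL ;;
        3) state=UNKNOWN ;;
        *) state=OUT-OF-RANGE ;;   # outside the plugin convention
    esac
    printf '%s (exit %d): %s\n' "$state" "$status" "$output"
}

# Example with the plugin path and arguments from the message above.
# Run it as the nagios user (for instance via su or sudo) so that the
# environment and permissions match what the daemon sees.
# run_check /usr/local/nagios/libexec/check_disk -w 50 -c 70 -p /home
```

If the state shown here differs from what Nagios reports for the same service, the command Nagios actually runs is probably not the one typed by hand.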
From jcasale at activenetwerx.com Mon Dec 20 18:13:24 2010
From: jcasale at activenetwerx.com (Joseph L. Casale)
Date: Mon, 20 Dec 2010 17:13:24 +0000
Subject: Nagios kept from restarting after reboot by lock file
In-Reply-To:
References:
Message-ID:

>Alternatively, could you recommend a good system/resource monitoring tool that would be able to let me know if nagios is down and restart it automatically?

That's kind of funny...

Why are you compiling nagios on a package-based distro with existing and current _properly_ built packages? Look at rpmforge...
From polifemos at conedsolutions.com Mon Dec 20 18:36:14 2010
From: polifemos at conedsolutions.com (Polifemo, Salvatore)
Date: Mon, 20 Dec 2010 12:36:14 -0500
Subject: tracing nagios actions
In-Reply-To:
References: , <5BE7D0404F28DC44B780100927AA4CB018664C39@whplex3.int.cecdes.net>,
Message-ID: <5BE7D0404F28DC44B780100927AA4CB018664C3D@whplex3.int.cecdes.net>

Yes, run the actual command from the command line as Steve demonstrated.

Make sure which command is being used, and if you run the command with no parameters it will display the correct usage.

Salvatore Polifemo
Sr. Systems Security Specialist
ConEdison Solutions
100 Summit Lake Drive
Valhalla, NY 10595
-------------- next part --------------
An HTML attachment was scrubbed...

From daniel.wittenberg.r0ko at statefarm.com Mon Dec 20 18:41:31 2010
From: daniel.wittenberg.r0ko at statefarm.com (Daniel Wittenberg)
Date: Mon, 20 Dec 2010 10:41:31 -0700
Subject: tracing nagios actions
In-Reply-To: <5BE7D0404F28DC44B780100927AA4CB018664C3D@whplex3.int.cecdes.net>
References: , <5BE7D0404F28DC44B780100927AA4CB018664C39@whplex3.int.cecdes.net>, <5BE7D0404F28DC44B780100927AA4CB018664C3D@whplex3.int.cecdes.net>
Message-ID: <31B0FE0A1A8166409E9DF35C6DEECB24054BBB57@WPSCV6MM.OPR.STATEFARM.ORG>

Also, you probably want to make sure you run the command as the nagios user, or whatever user the service runs as, to make sure you are getting the right environment and permissions.

Dan
-------------- next part --------------
An HTML attachment was scrubbed...
From gopearls42 at gmail.com Mon Dec 20 19:19:34 2010
From: gopearls42 at gmail.com (mark bradley)
Date: Mon, 20 Dec 2010 13:19:34 -0500
Subject: tracing nagios actions
In-Reply-To:
References: <5BE7D0404F28DC44B780100927AA4CB018664C39@whplex3.int.cecdes.net>
Message-ID:

Ah, thanks for the clarification, Steve. And now I've found the problem, too. I'll document it here so that others may learn from my goofiness.

So, running the check by hand:

/usr/lib/nagios/plugins/check_by_ssh -q -H db1.xxx.com -i /etc/nagios/nagios_private/id_rsa -l cacti -n lh -s c1 -C '/usr/lib/nagios/plugins/check_disk -w 80 -c 95 -p /mnt/Backup'
DISK OK - free space: /mnt/Backup 13470 MB (18% inode=99%);| /mnt/Backup=58284MB;75515;75500;0;75595

It seems to be telling me that the disk is OK; however, df on the server shows:

Filesystem                        Size  Used Avail Use% Mounted on
/dev/mapper/SysVolGroup-lvBackup   74G   57G   14G  82% /mnt/Backup

In reality, then, 82% of the partition is used and the check was set to warn at 80 and critical at 95 -- why did the check not trigger?

Because the specification to check_disk, if using percentages of use, should have a % sign after the thresholds, as in:

/usr/lib/nagios/plugins/check_by_ssh -q -H db1.xxx.com -i /etc/nagios/nagios_private/id_rsa -l cacti -n lh -s c1 -C '/usr/lib/nagios/plugins/check_disk -w 80% -c 95% -p /mnt/Backup'

Best,
Mark
-------------- next part --------------
An HTML attachment was scrubbed...
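
One detail about the thresholds in the check_disk commands above: the plugin's -w and -c values are compared against free space, not used space. A bare number means that many units free (MB by default), and a number followed by % means that percentage of the disk free. The perfdata in the output above is consistent with this, since 75515 is 75595 minus 80 and 75500 is 75595 minus 95. The following is a small model of that comparison, not the plugin's code; disk_status and below are hypothetical names, and the figures are the free and total MB reported above.

```shell
#!/bin/sh
# Illustration only: a model of free-space thresholds in the style of
# check_disk. below and disk_status are hypothetical helpers.

# below FREE_MB PCT_FREE THRESHOLD
# Succeeds when free space is under THRESHOLD ("80" = units, "80%" = percent).
below() {
    case $3 in
        *%) [ "$2" -lt "${3%\%}" ] ;;   # percentage of the disk that is free
        *)  [ "$1" -lt "$3" ] ;;        # absolute units (MB here)
    esac
}

# disk_status FREE_MB TOTAL_MB WARN CRIT
disk_status() {
    free=$1; total=$2; warn=$3; crit=$4
    pct=$(( free * 100 / total ))       # integer percent of space free
    if below "$free" "$pct" "$crit"; then
        echo CRITICAL
    elif below "$free" "$pct" "$warn"; then
        echo WARNING
    else
        echo OK
    fi
}

disk_status 13470 75595 80 95       # OK: far more than 95 MB is free
disk_status 13470 75595 80% 95%     # CRITICAL: less than 95% is free
disk_status 13470 75595 20% 5%      # WARNING: under 20% free, over 5% free
```

In this model, alerting at 80% and 95% used corresponds to -w 20% -c 5%, because the thresholds count what is left rather than what is consumed.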
From ray at ganymede.org Mon Dec 20 19:18:26 2010
From: ray at ganymede.org (Ray Kiddy)
Date: Mon, 20 Dec 2010 10:18:26 -0800
Subject: tracing nagios actions
In-Reply-To: <5BE7D0404F28DC44B780100927AA4CB018664C3D@whplex3.int.cecdes.net>
References: , <5BE7D0404F28DC44B780100927AA4CB018664C39@whplex3.int.cecdes.net>, <5BE7D0404F28DC44B780100927AA4CB018664C3D@whplex3.int.cecdes.net>
Message-ID: <5A123C03-75FD-4254-B702-0293811904C5@ganymede.org>

On Dec 20, 2010, at 9:36 AM, Polifemo, Salvatore wrote:

> Yes, run the actual command from the command line as Steve demonstrated.
>
> Make sure which command is being used, and if you run the command with no parameters it will display the correct usage.
>
> Salvatore Polifemo
> Sr. Systems Security Specialist
> ConEdison Solutions
> 100 Summit Lake Drive
> Valhalla, NY 10595

Of course, one cannot tell what command is _actually_ being executed or which command _was_ actually executed. I pointed this out in a previous post (below). Apparently there are no workarounds for this.

- ray

Begin forwarded message:

> From: Ray Kiddy
> Date: November 17, 2010 9:42:59 AM PST
> To: Nagios Users List
> Subject: [Nagios-users] can log show actual command executed?
> Reply-To: Nagios Users List
>
> I am having a problem figuring out what is actually being executed from a service. Is there a way to get the nagios log to contain the actual command being executed?
>
> This is what I am seeing in the Nagios.log file:
>
> [1290013792] SERVICE ALERT: myhost.com;Special App;CRITICAL;SOFT;1;(Service Check Timed Out)
>
> This is what I see in the nagios.dat file:
>
> check_command=check_http!/myURL!alive
>
> So, this shows me what the command string is in the service.cfg. I cannot see, though, what the actual command line is at this moment in time. It turns out that this check_command corresponds (I think) to:
>
> check_http -u /myURL -s alive
>
> How would I know this, though, if the command definition had been changed or if it is using, because of a mis-spelling, a command I do not think it is using? If I go into the command.cfg and switch the order of parameters, for example, I see nothing in these logs that tells me what is doing what.
>
> I know the simplest answer is "You should not do that." But my point is that the log file does not have enough information to tell me what happened at a past moment of time. I would need the log information _and_ the state of the command definitions at that time. If a log does not show you what happened in the past, what is its purpose?
>
> I am having a problem with a particular web application. For some reason I put in the check and it fails. I execute the check_http that I _think_ this service is doing, and it gives me an OK. I ended up creating a custom executable that calls curl and fetches against the same URL and this now works fine. Kind of lame, though. I use check_http in about 100 other services. So, why is this one single service not working? An obvious answer is that I am not calling the command in the way I think I am. But if I look in the log to see what the service did, I can see what I _think_ it did based on what I can see in what I _think_ is the correct command definition. But I really do not know. I do not see a line like "check_http -u /myURL -s alive" in the log, so I cannot see if I am mis-reading things.
>
> Any suggestions?
>
> - ray
-------------- next part --------------
An HTML attachment was scrubbed...

From mikec at aggregateknowledge.com Mon Dec 20 19:28:34 2010
From: mikec at aggregateknowledge.com (Mike Chesnut)
Date: Mon, 20 Dec 2010 10:28:34 -0800
Subject: tracing nagios actions
In-Reply-To: <5A123C03-75FD-4254-B702-0293811904C5@ganymede.org>
References: , <5BE7D0404F28DC44B780100927AA4CB018664C39@whplex3.int.cecdes.net>, <5BE7D0404F28DC44B780100927AA4CB018664C3D@whplex3.int.cecdes.net> <5A123C03-75FD-4254-B702-0293811904C5@ganymede.org>
Message-ID: <4D0FA052.90502@aggregateknowledge.com>

> Of course, one cannot tell what command is _actually_ being executed or
> which command _was_ actually executed. I pointed this out in a previous
> post (below). Apparently there are no workarounds for this.
If I understand what you're asking about, I've used this to achieve it in the past: http://www.waggy.at/nagios/capture_plugin.htm ------------------------------------------------------------------------------ Lotusphere 2011 Register now for Lotusphere 2011 and learn how to connect the dots, take your collaborative environment to the next level, and enter the era of Social Business. http://p.sf.net/sfu/lotusphere-d2d _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gopearls42 at gmail.com Mon Dec 20 19:48:23 2010 From: gopearls42 at gmail.com (mark bradley) Date: Mon, 20 Dec 2010 13:48:23 -0500 Subject: tracing nagios actions In-Reply-To: <5A123C03-75FD-4254-B702-0293811904C5@ganymede.org> References: <5BE7D0404F28DC44B780100927AA4CB018664C39@whplex3.int.cecdes.net> <5BE7D0404F28DC44B780100927AA4CB018664C3D@whplex3.int.cecdes.net> <5A123C03-75FD-4254-B702-0293811904C5@ganymede.org> Message-ID: There's always strace(1) if you want to dive into the details ... Mark On Mon, Dec 20, 2010 at 1:18 PM, Ray Kiddy wrote: > > On Dec 20, 2010, at 9:36 AM, Polifemo, Salvatore wrote: > > Yes, run the actual command from the command line as Steve demonstrated. > > Make sure which command is being used, and if you run the command with no > parameters it will display the correct usage. > > *Salvatore Polifemo* > *Sr. Systems Security Specialist* > *ConEdison Solutions* > *100 Summit Lake Drive* > *Valhalla, NY 10595* > > > > Of course, one cannot tell what command is _actually_ being executed or > which command _was_ actually executed. I pointed this out in a previous post > (below). Apparently there are no workarounds for this. 
> > - ray > > Begin forwarded message: > > *From: *Ray Kiddy > *Date: *November 17, 2010 9:42:59 AM PST > *To: *Nagios Users List > *Subject: **[Nagios-users] can log show actual command executed?* > *Reply-To: *Nagios Users List > > > I am having a problem figuring out see what is actually being executed from > a service. Is there a way to get the nagios log to contain the actual > command being executed? > > This is what I am seeing in the Nagios.log file: > > [1290013792] SERVICE ALERT: myhost.com;Special > App;CRITICAL;SOFT;1;(Service Check Timed Out) > > This is what I see in the nagios.dat file: > > check_command=check_http!/myURL!alive > > So, this shows me what the command string is in the service.cfg. I cannot > see, though, what the actual command line is at this moment in time. It > turns out that this check_command corresponds (I think) to: > > check_http -u /myURL -s alive > > How would I know this, though, if the command definition had been changed > or if it is using, because of a mis-spelling, a command I do not think it is > using? If I go into the command.cfg and switch the order of parameters, for > example, I see nothing in these logs that tells me what is doing what. > > I know the simplest answer is "You should not do that." But my point is > that the log file does not have enough information to tell me what happened > at a past moment of time. I would need the log information _and_ the state > of the command definitions at that time. If a log does not show you what > happened in the past, what is its purpose? > > I am having a problem with a particular web application. For some reason I > put in the check and it fails. I execute the check_http that I _think_ this > service is doing, and it gives me an OK. I ended up creating a custom > executable that calls curl and fetches against the same URL and this now > works fine. Kind of lame, though. I use check_http in about 100 other > services. So, why is this one single service not working? 
An obvious answer > is that I am not calling the command in the way I think I am. But if I look > in the log to see what the service did, I can see what I _think_ it did > based on what I can see in what I _think_ is the correct command definition. > But I really do not know. I do not see a line like "check_http -u /myURL -s > alive" in the log, so, I cannot see if I am mis-reading things. > > Any suggestions? > > - ray > > > > *From:* steve f [mailto:a31modela at hotmail.com] > *Sent:* Monday, December 20, 2010 12:14 PM > *To:* nagios-users at lists.sourceforge.net > *Subject:* Re: [Nagios-users] tracing nagios actions > > > Mark, > > I think Salvatore means run the check manually from the command line , make > sure you run it as the nagios user and try setting tha warning & critical > values to something that will make it fail also: > > /usr/local/nagios/libexec > ./check_disk -w 50 -c 70 -p /home > DISK OK - free space: /home 440 MB (95% inode=99%);| > /home=20MB;436;416;0;486 > > The -p just checks a specific path. ( FYI ) > > Steve > > > > ------------------------------ > > Date: Mon, 20 Dec 2010 11:50:14 -0500 > From: gopearls42 at gmail.com > To: nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] tracing nagios actions > Hi Salvatore, > > They're all Unix (Redhat) servers. By check command do you mean nagios -v? > I've done that and I do not get an errors. > > Thanks, > Mark > > On Mon, Dec 20, 2010 at 11:35 AM, Polifemo, Salvatore < > polifemos at conedsolutions.com> wrote: > Are these Windows or *nix server? > > Either wau run the check command manually from a console and see what the > results are. > > > > *Salvatore Polifemo* > *Sr. 
Systems Security Specialist* > *ConEdison Solutions* > *100 Summit Lake Drive* > *Valhalla, NY 10595* > > *From:* mark bradley [mailto:gopearls42 at gmail.com] > *Sent:* Monday, December 20, 2010 11:14 AM > *To:* nagios-users at lists.sourceforge.net > *Subject:* [Nagios-users] tracing nagios actions > > Hi, > > I have a small-ish number of servers and I've tried to configure Nagios to > warn me about disk-space running low. The problem is that, although disk > space is above both warning and critical levels I'm not getting any > notifications. > > The nagios.log file is silent on the topic and nagios -v does not produce > any errors or warnings. > > Is there a way to trace what actions nagios is considering (much like make > -n) in order to debug this problem? Is there a debugging methodology defined > somewhere? If it's in your head can you share? > > Thanks, > Mark > > > > > ------------------------------------------------------------------------------ > Lotusphere 2011 > Register now for Lotusphere 2011 and learn how > to connect the dots, take your collaborative environment > to the next level, and enter the era of Social Business. > http://p.sf.net/sfu/lotusphere-d2d > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Lotusphere 2011 Register now for Lotusphere 2011 and learn how to connect the dots, take your collaborative environment to the next level, and enter the era of Social Business. 
http://p.sf.net/sfu/lotusphere-d2d -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From BChan at Shawcor.com Mon Dec 20 19:22:31 2010 From: BChan at Shawcor.com (Brian Chan) Date: Mon, 20 Dec 2010 13:22:31 -0500 Subject: AUTO: Chan, Brian is out of the office. (returning 01/04/2011) Message-ID: I am out of the office until 01/04/2011. I will respond to your message when I return. Note: This is an automated response to your message "Nagios-users Digest, Vol 55, Issue 16" sent on 12/20/2010 13:18:34. This is the only notification you will receive while this person is away. ------------------------------------------------------------------------------ Lotusphere 2011 Register now for Lotusphere 2011 and learn how to connect the dots, take your collaborative environment to the next level, and enter the era of Social Business. http://p.sf.net/sfu/lotusphere-d2d _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

From ray at ganymede.org Mon Dec 20 20:47:43 2010
From: ray at ganymede.org (Ray Kiddy)
Date: Mon, 20 Dec 2010 11:47:43 -0800
Subject: tracing nagios actions
In-Reply-To:
References: <5BE7D0404F28DC44B780100927AA4CB018664C39@whplex3.int.cecdes.net> <5BE7D0404F28DC44B780100927AA4CB018664C3D@whplex3.int.cecdes.net> <5A123C03-75FD-4254-B702-0293811904C5@ganymede.org>
Message-ID: <2B51BBC1-431B-46A1-945B-9E74A4BA220F@ganymede.org>

On Dec 20, 2010, at 10:48 AM, mark bradley wrote:

> There's always strace(1) if you want to dive into the details ...
>
> Mark

Ah. Unfortunately, I am on Mac OS X and not Linux.

On Dec 20, 2010, at 10:28 AM, Mike Chesnut wrote:

>> Of course, one cannot tell what command is _actually_ being executed or
>> which command _was_ actually executed. I pointed this out in a previous
>> post (below). Apparently there are no workarounds for this.
>
> If I understand what you're asking about, I've used this to achieve it
> in the past:
>
> http://www.waggy.at/nagios/capture_plugin.htm

This is interesting. It is unfortunate that one would have to put the plugin onto every command, but it is definitely a possibility.

Of course, nagios could itself have this kind of functionality. But that would be up to the implementors.

Thanks for the replies, though.

cheers - ray

>
> On Mon, Dec 20, 2010 at 1:18 PM, Ray Kiddy wrote:
>
> On Dec 20, 2010, at 9:36 AM, Polifemo, Salvatore wrote:
>
>> Yes, run the actual command from the command line as Steve demonstrated.
>>
>> Make sure which command is being used, and if you run the command with no parameters it will display the correct usage.
>>
>> Salvatore Polifemo
>> Sr. Systems Security Specialist
>> ConEdison Solutions
>> 100 Summit Lake Drive
>> Valhalla, NY 10595
>>
>
> Of course, one cannot tell what command is _actually_ being executed or which command _was_ actually executed. I pointed this out in a previous post (below).
Apparently there are no workarounds for this. > > - ray > > Begin forwarded message: > >> From: Ray Kiddy >> Date: November 17, 2010 9:42:59 AM PST >> To: Nagios Users List >> Subject: [Nagios-users] can log show actual command executed? >> Reply-To: Nagios Users List >> >> >> I am having a problem figuring out see what is actually being executed from a service. Is there a way to get the nagios log to contain the actual command being executed? >> >> This is what I am seeing in the Nagios.log file: >> >> [1290013792] SERVICE ALERT: myhost.com;Special App;CRITICAL;SOFT;1;(Service Check Timed Out) >> >> This is what I see in the nagios.dat file: >> >> check_command=check_http!/myURL!alive >> >> So, this shows me what the command string is in the service.cfg. I cannot see, though, what the actual command line is at this moment in time. It turns out that this check_command corresponds (I think) to: >> >> check_http -u /myURL -s alive >> >> How would I know this, though, if the command definition had been changed or if it is using, because of a mis-spelling, a command I do not think it is using? If I go into the command.cfg and switch the order of parameters, for example, I see nothing in these logs that tells me what is doing what. >> >> I know the simplest answer is "You should not do that." But my point is that the log file does not have enough information to tell me what happened at a past moment of time. I would need the log information _and_ the state of the command definitions at that time. If a log does not show you what happened in the past, what is its purpose? >> >> I am having a problem with a particular web application. For some reason I put in the check and it fails. I execute the check_http that I _think_ this service is doing, and it gives me an OK. I ended up creating a custom executable that calls curl and fetches against the same URL and this now works fine. Kind of lame, though. I use check_http in about 100 other services. 
So, why is this one single service not working? An obvious answer is that I am not calling the command in the way I think I am. But if I look in the log to see what the service did, I can see what I _think_ it did based on what I can see in what I _think_ is the correct command definition. But I really do not know. I do not see a line like "check_http -u /myURL -s alive" in the log, so, I cannot see if I am mis-reading things. >> >> Any suggestions? >> >> - ray >> > > > >> From: steve f [mailto:a31modela at hotmail.com] >> Sent: Monday, December 20, 2010 12:14 PM >> To: nagios-users at lists.sourceforge.net >> Subject: Re: [Nagios-users] tracing nagios actions >> >> Mark, >> >> I think Salvatore means run the check manually from the command line , make sure you run it as the nagios user and try setting tha warning & critical values to something that will make it fail also: >> >> /usr/local/nagios/libexec > ./check_disk -w 50 -c 70 -p /home >> DISK OK - free space: /home 440 MB (95% inode=99%);| /home=20MB;436;416;0;486 >> >> The -p just checks a specific path. ( FYI ) >> >> Steve >> >> >> >> >> Date: Mon, 20 Dec 2010 11:50:14 -0500 >> From: gopearls42 at gmail.com >> To: nagios-users at lists.sourceforge.net >> Subject: Re: [Nagios-users] tracing nagios actions >> >> Hi Salvatore, >> >> They're all Unix (Redhat) servers. By check command do you mean nagios -v? I've done that and I do not get an errors. >> >> Thanks, >> Mark >> >> On Mon, Dec 20, 2010 at 11:35 AM, Polifemo, Salvatore wrote: >> Are these Windows or *nix server? >> >> Either wau run the check command manually from a console and see what the results are. >> >> >> >> Salvatore Polifemo >> Sr. 
Systems Security Specialist >> ConEdison Solutions >> 100 Summit Lake Drive >> Valhalla, NY 10595 >> >> From: mark bradley [mailto:gopearls42 at gmail.com] >> Sent: Monday, December 20, 2010 11:14 AM >> To: nagios-users at lists.sourceforge.net >> Subject: [Nagios-users] tracing nagios actions >> >> Hi, >> >> I have a small-ish number of servers and I've tried to configure Nagios to warn me about disk-space running low. The problem is that, although disk space is above both warning and critical levels I'm not getting any notifications. >> >> The nagios.log file is silent on the topic and nagios -v does not produce any errors or warnings. >> >> Is there a way to trace what actions nagios is considering (much like make -n) in order to debug this problem? Is there a debugging methodology defined somewhere? If it's in your head can you share? >> >> Thanks, >> Mark >> > > > ------------------------------------------------------------------------------ > Lotusphere 2011 > Register now for Lotusphere 2011 and learn how > to connect the dots, take your collaborative environment > to the next level, and enter the era of Social Business. > http://p.sf.net/sfu/lotusphere-d2d > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > ------------------------------------------------------------------------------ > Lotusphere 2011 > Register now for Lotusphere 2011 and learn how > to connect the dots, take your collaborative environment > to the next level, and enter the era of Social Business. 
> http://p.sf.net/sfu/lotusphere-d2d_______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Lotusphere 2011 Register now for Lotusphere 2011 and learn how to connect the dots, take your collaborative environment to the next level, and enter the era of Social Business. http://p.sf.net/sfu/lotusphere-d2d -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From stanb at panix.com Mon Dec 20 21:39:27 2010 From: stanb at panix.com (stan) Date: Mon, 20 Dec 2010 15:39:27 -0500 Subject: NSCA and host checks In-Reply-To: <4D0F6E35.4080105@op5.se> References: <20101220144528.GA24567@teddy.fas.com> <4D0F6E35.4080105@op5.se> Message-ID: <20101220203927.GA1392@teddy.fas.com> On Mon, Dec 20, 2010 at 03:54:45PM +0100, Andreas Ericsson wrote: > On 12/20/2010 03:45 PM, stan wrote: > > I am working on geting a small distributed system up. I have the serviec > > checks going back to the master, but have not managed to get host checks > > passed back to the master yet. > > > > With help from the list I found: > > > > obsess_over_hosts and set it to 1. It appears that I also need an ochp > > command. 
I tried using the script I had set up for service checks, but it
> > appears that the arguments passed to this are different. Here are the
> > arguments that are being passed to the service check command:
> >
> > Arguments:
> > # $1 = host_name (Short name of host that the service is
> > # associated with)
> > # $2 = svc_description (Description of the service)
> > # $3 = state_string (A string representing the status of
> > # the given service - "OK", "WARNING", "CRITICAL"
> > # or "UNKNOWN")
> > # $4 = plugin_output (A text string that
> > #
> > What do the ones passed to the host check look like?
> >
>
> the same, but without the service description.

OK, here are my 2 command defs:

command_line /usr/share/nagios/sbin/submit_service_check_result $HOSTNAME$ '$SERVICEDESC$' $SERVICESTATEID$ '$SERVICEOUTPUT$'

and

/usr/share/nagios/sbin/submit_host_check_result $HOSTNAME$ $HOSTSTATEID$ '$HOSTOUTPUT$'

Do these look correct? And do the *STATEID's need translating to an integer before I pass them to send_nsca? I found an example on the web that used a case statement to do this, but I do not know if it is needed or not.

Thanks for the help.

--
A: Because it messes up the order in which people normally read text.
Q: Why is top-posting such a bad thing?
A: Top-posting.
Q: What is the most annoying thing in e-mail?

------------------------------------------------------------------------------
Lotusphere 2011
Register now for Lotusphere 2011 and learn how
to connect the dots, take your collaborative environment
to the next level, and enter the era of Social Business.
http://p.sf.net/sfu/lotusphere-d2d
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null

From andrew1.li at citi.com Tue Dec 21 00:30:32 2010
From: andrew1.li at citi.com (Li, Andrew1 )
Date: Tue, 21 Dec 2010 10:30:32 +1100
Subject: Email Notifications
In-Reply-To:
References:
Message-ID: <1292887832.1185.15.camel@localhost>

On Tue, 2010-12-21 at 01:07, Pascal Miquet wrote:
> Hi,
>
> I've installed Nagios 3 on a Debian box, and it seems that for some
> notifications, I've got the user name rather than the Email address of
> the user attached to the notification.

You may be missing the linkage between the contact in your service or host and your contact definition, where you define the email address.

Check out the cfg files that come with the distribution tarball; they are a pretty good "quick start" example.

Andrew

From eric.berg at barclayscapital.com Tue Dec 21 01:54:04 2010
From: eric.berg at barclayscapital.com (eric.berg at barclayscapital.com)
Date: Mon, 20 Dec 2010 19:54:04 -0500
Subject: Nagios kept from restarting after reboot by lock file
In-Reply-To:
References:
Message-ID:

If you had any idea how difficult it is just to do the most basic system administration tasks in the environment within which we're working, you'd be shaking your head in empathetic embarrassment. Filling out tickets and -- get this: our company has an official state for our tickets referred to as "Chasing", which is when, after entering your ticket into this multi-million dollar ticket tracking system, you then have to manually pick up the phone, email, or walk over to bug the guy who should now act on the ticket. It's a nightmare.

I built it. That was intended to express the irony of my search for a solution to keep my monitoring system up.

I'm shocked that nagios can't tell the difference between a pid file that refers to a running process and one that refers to one that's not running any more. It's the first thing about Nagios that's got my head scratching... besides the complex set of dependencies that you have to negotiate to do pretty much anything.

E

> -----Original Message-----
> From: Joseph L. Casale [mailto:jcasale at activenetwerx.com]
> Sent: Monday, December 20, 2010 12:13 PM
> To: 'Nagios Users List'
> Subject: Re: [Nagios-users] Nagios kept from restarting after
> reboot by lock file
>
> >Alternatively, could you recommend a good system/resource
> monitoring tool that would be able to let me know if nagios
> is down and restart it automatically?
>
> That's kind of funny...
> Why are you compiling nagios on a package based distro with
> existing and current _properly_ built
> packages?
>
> Look at rpmforge...
>
_______________________________________________

This e-mail may contain information that is confidential, privileged or otherwise protected from disclosure. If you are not an intended recipient of this e-mail, do not duplicate or redistribute it by any means. Please delete it and any attachments and notify the sender that you have received it in error. Unless specifically indicated, this e-mail is not an offer to buy or sell or a solicitation to buy or sell any securities, investment products or other financial product or service, an official confirmation of any transaction, or an official statement of Barclays.
Any views or opinions presented are solely those of the author and do not necessarily represent those of Barclays. This e-mail is subject to terms available at the following link: www.barcap.com/emaildisclaimer. By messaging with Barclays you consent to the foregoing. Barclays Capital is the investment banking division of Barclays Bank PLC, a company registered in England (number 1026167) with its registered office at 1 Churchill Place, London, E14 5HP. This email may relate to or be sent from other members of the Barclays Group.
_______________________________________________

From eric.berg at barclayscapital.com Tue Dec 21 01:58:47 2010
From: eric.berg at barclayscapital.com (eric.berg at barclayscapital.com)
Date: Mon, 20 Dec 2010 19:58:47 -0500
Subject: Nagios kept from restarting after reboot by lockfile
In-Reply-To: <31B0FE0A1A8166409E9DF35C6DEECB24054BBABE@WPSCV6MM.OPR.STATEFARM.ORG>
References: <31B0FE0A1A8166409E9DF35C6DEECB24054BBABE@WPSCV6MM.OPR.STATEFARM.ORG>
Message-ID:

We reboot all of our hosts on a weekly basis. I used to pride myself on keeping my boxes up as long as possible, but having spent years now supporting mission-critical financial production applications, I'm on board with the weekly reboots. Lets you know early if some system or app change is problematic.

Reboot is being done via a standard reboot command. I've looked around for rc scripts that might address this issue, but haven't found any. Got any pointers?

Regarding the rc.local solution, a) I'd prefer to solve the problem, not just address the symptoms, and b) elsewhere in this thread I've described the roadblocks that we have to doing anything at a system level. Yep, that's right, boys, we survive in the app developer layer within which we do not have root on these boxes. It's a tedious, time-consuming, frustrating, productivity-killing endeavor to do just about anything you can't do yourself.

So... got any sample RC scripts, or command line params to nagios to make it smart enough to know that the PID that is in its PID file isn't an active process?

Thanks.

Eric

> -----Original Message-----
> From: Daniel Wittenberg [mailto:daniel.wittenberg.r0ko at statefarm.com]
> Sent: Monday, December 20, 2010 11:56 AM
> To: Nagios Users List
> Subject: Re: [Nagios-users] Nagios kept from restarting after
> reboot by lockfile
>
> Couple questions
> 1) Why do you have to reboot your monitoring server weekly?
> 2) How is the reboot being done?
>
> Reason I ask 2) is because the standard rc script will remove the
> lockfile when nagios is told to stop. So if you are having this problem
> it sounds like you are not doing a clean shutdown and something could be
> wrong.
>
> Either way, I guess worst case one way to check for this would be to put
> something like this in your /etc/rc.d/rc.local:
> rm -f /var/lock/subsys/nagios
>
> Assuming that's where your lockfile is.
> > Dan > > > -----Original Message----- > From: eric.berg at barclayscapital.com > [mailto:eric.berg at barclayscapital.com] > Sent: Monday, December 20, 2010 10:16 AM > To: eric.berg at barclayscapital.com; nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] Nagios kept from restarting after > reboot by > lockfile > > Alternatively, could you recommend a good system/resource monitoring > tool that would be able to let me know if nagios is down and > restart it > automatically? > > _____________________________________________ > From: Berg, Eric: IT (NYK) > Sent: Monday, December 20, 2010 11:03 AM > To: 'nagios-users at lists.sourceforge.net' > Subject: Nagios kept from restarting after reboot by lock file > > Gee, this seems like an annoying newbie problem, but if Nagios crashes > or is killed (as on system reboot), it leaves a lock file around that > prevents it from starting again until the lock file is > manually removed. > > I see this on Monday mornings after weekend reboots on a Red Hat Linux > box: > > nagios: Lockfile '/home/nagios/nagios/var/nagios.lock' looks like its > already held by another instance of Nagios (PID 0). Bailing out... > > Does anyone know if there's a config option or something else that > obviates the need to write a wrapper scropt to check to see > if Nagios is > really running and remove the lock file (look slike Nagios > already knows > it's not running by virtue of the value of the PID inthis > very message!) > so that it can cleanly start up again? > > Thanks. > > Eric > > _______________________________________________ > > This e-mail may contain information that is confidential, > privileged or > otherwise protected from disclosure. If you are not an intended > recipient of this e-mail, do not duplicate or redistribute it by any > means. Please delete it and any attachments and notify the sender that > you have received it in error. 
Unless specifically indicated, this > e-mail is not an offer to buy or sell or a solicitation to buy or sell > any securities, investment products or other financial product or > service, an official confirmation of any transaction, or an official > statement of Barclays. Any views or opinions presented are > solely those > of the author and do not necessarily represent those of Barclays. This > e-mail is subject to terms available at the following link: > www.barcap.com/emaildisclaimer. By messaging with Barclays you consent > to the foregoing. Barclays Capital is the investment banking division > of Barclays Bank PLC, a company registered in England (number 1026167) > with its registered offic > e at 1 Churchill Place, London, E14 5HP. This email may relate to or > be sent from other members of the Barclays Group. > _______________________________________________ > > -------------------------------------------------------------- > ---------- > ------ > Lotusphere 2011 > Register now for Lotusphere 2011 and learn how > to connect the dots, take your collaborative environment > to the next level, and enter the era of Social Business. > http://p.sf.net/sfu/lotusphere-d2d > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > -------------------------------------------------------------- > ---------------- > Lotusphere 2011 > Register now for Lotusphere 2011 and learn how > to connect the dots, take your collaborative environment > to the next level, and enter the era of Social Business. 
> http://p.sf.net/sfu/lotusphere-d2d
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS
> when reporting any issue.
> ::: Messages without supporting info will risk being sent to /dev/null
>

From Stuart.Jones at health.wa.gov.au Tue Dec 21 06:40:11 2010
From: Stuart.Jones at health.wa.gov.au (Jones, Stuart)
Date: Tue, 21 Dec 2010 13:40:11 +0800
Subject: check_snmp
In-Reply-To:
References:
Message-ID:

Hello Jorge,

Here is an example of my Nagios - Cisco switch port status - monitor entry, see if it helps you:

define service{
        use                     generic-service
        host_name               Site-1 rtr Lo0 interface - r01c37
        service_description     Interface Fa0/1 connection to primary c6504 LAN switch sr002-c6504 - Link Status
        check_command           check_snmp!-C Public -o ifOperStatus.5 -r 1 -m RFC1213-MIB ; 1 is up, 2 is down, 3 is testing, 4 is unknown, 5 is dormant
        }

Rgds,

Stuart

________________________________
From: Jorge Arenas [mailto:jorgeaaq at hotmail.com]
Sent: Friday, 17 December 2010 10:16 AM
To: nagios-users at lists.sourceforge.net
Subject: [Nagios-users] check_snmp

Hi:

I just installed Nagios and I am following the basic guide on the site to monitor a switch.

I created the file and it works well, but all switch ports are always reported in OK status.

I checked the output of the command:

check_snmp -C public -H switch -r 1 .... etc

and even when the port reports a status of 2 (down), the "-r 1" parameter is not working and the report says "SNMP OK" for down ports.

I read the instructions but I cannot find any information regarding the -r (regex) parameter.

I found a workaround, changing the "-r 1" to "-c 1:1", but I do not know if the documentation on the site is out of date or I am doing something wrong.

Any ideas?

Thanks in advance,

Jorge Arenas
CSA Mexico

From mike-nagios at 5dninja.net Tue Dec 21 07:37:48 2010
From: mike-nagios at 5dninja.net (Mike Lindsey)
Date: Mon, 20 Dec 2010 22:37:48 -0800
Subject: Nagios kept from restarting after reboot by lock file
In-Reply-To:
References:
Message-ID: <4D104B3C.40604@5dninja.net>

On 12/20/10 8:16 AM, eric.berg at barclayscapital.com wrote:
> Alternatively, could you recommend a good system/resource monitoring tool that would be able to let me know if nagios is down and restart it automatically?
> Add a cronjob on a five (or whatever you're comfortable with) minute interval, similar to: #!/bin/bash PATH=/bin:/usr/bin:/usr/local/bin PID=`cat /home/nagios/nagios/var/nagios.lock` PIDTEST=`kill -0 ${PID} 2>&1 >/dev/null` if [ "${PIDTEST}" -eq "1" ] then rm /home/nagios/nagios/var/nagios.lock # INSERT RESTART COMMAND HERE echo "Killed Lockfile and restarted Nagios" | mail -s "Nagios restart `hostname`" your-email at here.com fi >>> Just be aware that it'll also trigger that if block, if nagios is running under a different username. You can check for that by doing some tests in the script with ps and grep. > _____________________________________________ > From: Berg, Eric: IT (NYK) > Sent: Monday, December 20, 2010 11:03 AM > To: 'nagios-users at lists.sourceforge.net' > Subject: Nagios kept from restarting after reboot by lock file > > Gee, this seems like an annoying newbie problem, but if Nagios crashes or is killed (as on system reboot), it leaves a lock file around that prevents it from starting again until the lock file is manually removed. > > I see this on Monday mornings after weekend reboots on a Red Hat Linux box: > > nagios: Lockfile '/home/nagios/nagios/var/nagios.lock' looks like its already held by another instance of Nagios (PID 0). Bailing out... Sounds like something in the shutdown process is throwing a 0 into the pid file, or the startup in the rc script is. Either way, you should never have a 0 in there, either the rc script is putting the wrong data in there, or it's reporting incorrectly. > Does anyone know if there's a config option or something else that obviates the need to write a wrapper scropt to check to see if Nagios is really running and remove the lock file (look slike Nagios already knows it's not running by virtue of the value of the PID inthis very message!) so that it can cleanly start up again? 
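The "tests with ps and grep" mentioned above could look roughly like the sketch below. It is not taken from anyone's init script: the helper names proc_name and lock_is_stale are made up for illustration, "nagios" as the command name is an assumption, and it relies on either a Linux-style /proc or a POSIX ps being available. Unlike kill -0, looking up the command name does not depend on which user owns the process.

```shell
#!/bin/sh
# Sketch only: proc_name and lock_is_stale are hypothetical helper names,
# and "nagios" as the expected command name is an assumption.

# Print the command name of PID $1, or nothing if there is no such process.
proc_name() {
    if [ -r "/proc/$1/comm" ]; then
        cat "/proc/$1/comm"                 # Linux
    else
        ps -p "$1" -o comm= 2>/dev/null     # POSIX ps fallback
    fi
}

# Succeed (exit 0) when lock file $1 exists but does not point at a running
# process whose command name is $2 (default: nagios).
lock_is_stale() {
    lockfile=$1
    expected=${2:-nagios}

    [ -f "$lockfile" ] || return 1          # no lock file, nothing to clean

    pid=$(tr -d '[:space:]' < "$lockfile")
    case $pid in
        ''|*[!0-9]*) return 0 ;;            # empty or non-numeric content
    esac
    [ "$pid" -gt 0 ] || return 0            # "PID 0" is never a real owner

    name=$(proc_name "$pid")
    [ "${name##*/}" != "$expected" ]        # stale unless it really is nagios
}

# Possible use near the top of a wrapper or init script:
#   lock=/home/nagios/nagios/var/nagios.lock
#   lock_is_stale "$lock" && rm -f "$lock"
```

Comparing the command name is still only a heuristic (another program could be called nagios), which is presumably why a fully portable check is hard to get right.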
--
Mike Lindsey

From ae at op5.se Tue Dec 21 08:34:27 2010
From: ae at op5.se (Andreas Ericsson)
Date: Tue, 21 Dec 2010 08:34:27 +0100
Subject: Nagios kept from restarting after reboot by lockfile
In-Reply-To:
References: <31B0FE0A1A8166409E9DF35C6DEECB24054BBABE@WPSCV6MM.OPR.STATEFARM.ORG>
Message-ID: <4D105883.6030400@op5.se>

On 12/21/2010 01:58 AM, eric.berg at barclayscapital.com wrote:
> We reboot all of our hosts on a weekly basis. I used to pride myself on keeping my boxes up as long as possible, but having spent years now supporting mission-critical financial production applications, I'm on board with the weekly reboots. Lets you know early if some system or app change is problematic.
>
> Reboot is being done via a standard reboot command.
>
> I've looked around for rc scripts that might address this issue, but haven't found any. Got any pointers?
>
> Regarding the rc.local solution, a) I'd prefer to solve the problem, not just address the symptoms, and b) elsewhere in this thread I've described the roadblocks that we have to doing anything at a system level. Yep, that's right, boys, we survive in the app developer layer within which we do not have root on these boxes. It's a tedious, time-consuming, frustrating, productivity-killing endeavor to do just about anything you can't do yourself.
>
> So....got any sample RC scripts, or command line params to nagios to make it smart enough to know that the PID that is in its PID file isn't an active process?

Depending on what system tools you've got installed, this should work decently. Set variables to proper values and add it to the top of your init script.

pid=$(cat "$lockfile")
# Remove the lockfile if no process with that pid can be signalled.
kill -0 "$pid" || rm -f "$lockfile"
# Remove it as well if the pid is not the last nagios pid listed by pidof.
nagiospid=$(pidof nagios | sed 's/.* //')
test "$pid" = "$nagiospid" || rm -f "$lockfile"

--
Andreas Ericsson andreas.ericsson at op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace.

From ae at op5.se Tue Dec 21 08:45:22 2010
From: ae at op5.se (Andreas Ericsson)
Date: Tue, 21 Dec 2010 08:45:22 +0100
Subject: Nagios kept from restarting after reboot by lock file
In-Reply-To:
References:
Message-ID: <4D105B12.7070600@op5.se>

On 12/21/2010 01:54 AM, eric.berg at barclayscapital.com wrote:
> If you had any idea how difficult it is just to do the most basic system administration tasks in the environment within which we're working, you'd be shaking your head in empathetic embarrassment. Filling out tickets and -- get this: our company has an official state for our tickets referred to as "Chasing", which is when, after entering your ticket into this multi-million dollar ticket tracking system, you then have to manually pick up the phone, email, or walk over to bug the guy who should now act on the ticket. It's a nightmare. I built it.

Sounds unpleasant, inefficient and costly.

> That was intended to express the irony of my search for a solution to keep my monitoring system up. I'm shocked that nagios can't tell the difference between a pid file that refers to a running process and one that refers to one that's not running any more.

It can, but it's damn near impossible to do portably from a script, and hard enough to do from C. It's easy enough to check that *some* sort of process is running with the pid in the lockfile, but in order to do it correctly, one should also check that it's a Nagios process, and that's the hard part.

The quick workaround is to add a boot script that runs before the Nagios startup script and unconditionally removes the lockfile. That's not a universal solution though, since the lockfile will remain if Nagios crashes.

> It's the first thing about Nagios that's got my head scratching...besides the complex set of dependencies that you have to negotiate to do pretty much anything.

Well, there are plenty of tools to help with configuration and such. Nacoma is one of them. Nagiosql is another.

--
Andreas Ericsson andreas.ericsson at op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231
From hvdkooij at vanderkooij.org Tue Dec 21 13:34:06 2010
From: hvdkooij at vanderkooij.org (Hugo van der Kooij)
Date: Tue, 21 Dec 2010 13:34:06 +0100
Subject: Nagios kept from restarting after reboot by lockfile
In-Reply-To:
References: <31B0FE0A1A8166409E9DF35C6DEECB24054BBABE@WPSCV6MM.OPR.STATEFARM.ORG>
Message-ID: <54dd583d75e0ad06f254d6cdb06bbd79@vps517.directvps.nl>

On Mon, 20 Dec 2010 19:58:47 -0500, wrote:
> Reboot is being done via a standard reboot command.

Don't use `reboot`. Use `shutdown -r` instead. And the system (including Nagios) should close down correctly.

But rebooting a monitoring server is rather silly in my view.

Hugo.

--
hvdkooij at vanderkooij.org http://hugo.vanderkooij.org/
PGP/GPG? Use: http://hugo.vanderkooij.org/0x58F19981.asc

From chrishudson at gmail.com Tue Dec 21 14:40:23 2010
From: chrishudson at gmail.com (Chris Hudson)
Date: Tue, 21 Dec 2010 07:40:23 -0600
Subject: Centreon installation question
Message-ID:

I don't know if anyone on the list uses Centreon web front-end, but I have a question. I have to install and set up a new Nagios server. I have already installed RHEL5, now I'm trying to get the base packages (Apache, PHP, MySQL) that are needed.

I have been trying to follow the directions here:
http://en.doc.centreon.com/Setup:Prerequisite/Centos/Fedora/RHEL
and here:
http://www.nagioswiki.com/wiki/index.php/Installing_Centreon_on_Centos_5

But they conflict and both bring me to the same point: the major packages don't seem to be there. I can set up Yum and install a simple package such as htop from their repository, but the major packages such as Apache, MySQL, and PHP don't seem to be there. I've even gone out manually to look for them, but they're not there:
http://apt.sw.be/redhat/el5/en/i386/dag/RPMS/
or here:
http://packages.sw.be/

Does anyone have any experience with installing Centreon?

Thanks,
Chris
From stanb at panix.com Tue Dec 21 14:43:13 2010
From: stanb at panix.com (stan)
Date: Tue, 21 Dec 2010 08:43:13 -0500
Subject: Not a valid parent
Message-ID: <20101221134313.GA22695@teddy.fas.com>

What causes this error message:

Error: 'pm2fwi' is not a valid parent for host 'pm2fw'!

These are the inside and outside of a firewall. They are in 2 different subnets, and 2 different domains. Still, this relationship is correct, and I would expect Nagios to allow me to define it.

What am I doing wrong?

--
A: Because it messes up the order in which people normally read text.
Q: Why is top-posting such a bad thing?
A: Top-posting.
Q: What is the most annoying thing in e-mail?

From mark.elsen at gmail.com Tue Dec 21 14:53:00 2010
From: mark.elsen at gmail.com (Mark Elsen)
Date: Tue, 21 Dec 2010 14:53:00 +0100
Subject: Not a valid parent
In-Reply-To: <20101221134313.GA22695@teddy.fas.com>
References: <20101221134313.GA22695@teddy.fas.com>
Message-ID:

> What causes this error message:
>
> Error: 'pm2fwi' is not a valid parent for host 'pm2fw'!
>
> These are the inside and outside of a firewall. They are in 2 different subnets, and 2 different domains. Still, this relationship is correct, and I would expect Nagios to allow me to define it.

- Make sure the 'host_name' in the parent's host definition in the Nagios configuration matches the name you gave as the parent. People sometimes use alternate names and overlook the fact that Nagios matches on host_name and does not use DNS names.

M.

From jim at jimavery.me.uk Tue Dec 21 14:58:40 2010
From: jim at jimavery.me.uk (Jim Avery)
Date: Tue, 21 Dec 2010 13:58:40 +0000
Subject: Not a valid parent
In-Reply-To: <20101221134313.GA22695@teddy.fas.com>
References: <20101221134313.GA22695@teddy.fas.com>
Message-ID:

On 21 December 2010 13:43, stan wrote:
> What causes this error message:
>
> Error: 'pm2fwi' is not a valid parent for host 'pm2fw'!
>
> These are the inside and outside of a firewall. They are in 2 different subnets, and 2 different domains. Still, this relationship is correct, and I would expect Nagios to allow me to define it.
>
> What am I doing wrong?

I think this usually means that the host 'pm2fwi' is not defined in your Nagios configuration. I expect you might be looking for a slight typo in the hostname, for example pm2fwl rather than pm2fwi or some such.

The kind of mistake I usually make is that I set up a new directory with host and service definitions in it, then forget to add a cfg_dir directive in nagios.cfg.
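A quick way to hunt for this kind of problem is sketched below. It is only an illustration, not a substitute for Nagios's own pre-flight check (nagios -v with the path to nagios.cfg): the helper names host_defined and skipped_files are made up, it assumes single-word host names, and it relies on the fact that a cfg_dir directive only picks up files whose names end in .cfg.

```shell
#!/bin/sh
# Sketch only: host_defined and skipped_files are hypothetical helper names.

# Succeed if a "define host" block in some *.cfg file under directory $2
# sets host_name to $1. Only *.cfg files are read, mirroring cfg_dir.
# Assumes host names without embedded spaces.
host_defined() {
    find "$2" -type f -name '*.cfg' -exec cat {} + |
    awk -v want="$1" '
        $1 == "define" && ($2 == "host{" || $2 == "host") { in_host = 1; next }
        index($0, "}")                                     { in_host = 0 }
        in_host && $1 == "host_name" && $2 == want         { found = 1 }
        END { exit !found }
    '
}

# List files under directory $1 that a cfg_dir directive would skip
# because their names do not end in .cfg.
skipped_files() {
    find "$1" -type f ! -name '*.cfg'
}

# Possible use (the directory is an assumption; use your own cfg_dir):
#   host_defined pm2fwi /usr/local/nagios/etc/objects || echo "pm2fwi is not defined"
#   skipped_files /usr/local/nagios/etc/objects
```

If host_defined fails for the parent while skipped_files prints something, a misnamed or unloaded file is a likely suspect.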
From stanb at panix.com Tue Dec 21 14:58:35 2010
From: stanb at panix.com (stan)
Date: Tue, 21 Dec 2010 08:58:35 -0500
Subject: Not a valid parent
In-Reply-To: <20101221134313.GA22695@teddy.fas.com>
References: <20101221134313.GA22695@teddy.fas.com>
Message-ID: <20101221135835.GA23145@teddy.fas.com>

On Tue, Dec 21, 2010 at 08:43:13AM -0500, stan wrote:
> What causes this error message:
>
> Error: 'pm2fwi' is not a valid parent for host 'pm2fw'!
>
> This is the inside and outside of a firewall. They are in 2 different subnets, and 2 different domains. Still, this relationship is correct, and I would expect Nagios to allow me to define it.
>
> What am I doing wrong?

Answer to my own question. I had named the config file p,2fwi,cfg, not pm2fwi.cfg

From daniel.wittenberg.r0ko at statefarm.com Tue Dec 21 15:23:17 2010
From: daniel.wittenberg.r0ko at statefarm.com (Daniel Wittenberg)
Date: Tue, 21 Dec 2010 07:23:17 -0700
Subject: Nagios kept from restarting after reboot by lockfile
In-Reply-To:
References: <31B0FE0A1A8166409E9DF35C6DEECB24054BBABE@WPSCV6MM.OPR.STATEFARM.ORG>
Message-ID: <31B0FE0A1A8166409E9DF35C6DEECB24054BC032@WPSCV6MM.OPR.STATEFARM.ORG>

So are you using the actual "reboot" command, not "shutdown -r now", which is a little friendlier? The standard nagios shutdown script should take care of cleaning those up for you. Otherwise putting something like:

rm -f ; service nagios start

in your rc.local would take care of it. But when you mention pid file, are you saying the PID file is still there, or the lock file? They are different things. Again though, if nagios is shut down properly you shouldn't be seeing that.

Dan

-----Original Message-----
From: eric.berg at barclayscapital.com [mailto:eric.berg at barclayscapital.com]
Sent: Monday, December 20, 2010 6:59 PM
To: nagios-users at lists.sourceforge.net
Subject: Re: [Nagios-users] Nagios kept from restarting after reboot by lockfile

We reboot all of our hosts on a weekly basis. I used to pride myself on keeping my boxes up as long as possible, but having spent years now supporting mission-critical financial production applications, I'm on board with the weekly reboots. Lets you know early if some system or app change is problematic.

Reboot is being done via a standard reboot command.

I've looked around for rc scripts that might address this issue, but haven't found any. Got any pointers?

Regarding the rc.local solution, a) I'd prefer to solve the problem, not just address the symptoms, and b) elsewhere in this thread I've described the roadblocks that we have to doing anything at a system level. Yep, that's right, boys, we survive in the app developer layer within which we do not have root on these boxes. It's a tedious, time-consuming, frustrating, productivity-killing endeavor to do just about anything you can't do yourself.

So....got any sample RC scripts, or command line params to nagios to make it smart enough to know that the PID that is in its PID file isn't an active process?

Thanks.

Eric

> -----Original Message-----
> From: Daniel Wittenberg [mailto:daniel.wittenberg.r0ko at statefarm.com]
> Sent: Monday, December 20, 2010 11:56 AM
> To: Nagios Users List
> Subject: Re: [Nagios-users] Nagios kept from restarting after reboot by lockfile
>
> Couple questions
> 1) Why do you have to reboot your monitoring server weekly?
> 2) How is the reboot being done?
>
> Reason I ask 2) is because the standard rc script will remove the lockfile when nagios is told to stop. So if you are having this problem it sounds like you are not doing a clean shutdown and something could be wrong.
>
> Either way, I guess worst case one way to check for this would be put something like this in your /etc/rc.d/rc.local:
> rm -f /var/lock/subsys/nagios
>
> Assuming that's where your lockfile is.
>
> Dan
>
> -----Original Message-----
> From: eric.berg at barclayscapital.com [mailto:eric.berg at barclayscapital.com]
> Sent: Monday, December 20, 2010 10:16 AM
> To: eric.berg at barclayscapital.com; nagios-users at lists.sourceforge.net
> Subject: Re: [Nagios-users] Nagios kept from restarting after reboot by lockfile
>
> Alternatively, could you recommend a good system/resource monitoring tool that would be able to let me know if nagios is down and restart it automatically?
From eric.berg at barclayscapital.com Tue Dec 21 16:18:49 2010
From: eric.berg at barclayscapital.com (eric.berg at barclayscapital.com)
Date: Tue, 21 Dec 2010 10:18:49 -0500
Subject: Nagios kept from restarting after reboot by lock file
In-Reply-To: <4D105B12.7070600@op5.se>
References: <4D105B12.7070600@op5.se>
Message-ID:

I didn't realize that it was so difficult to associate the process with the PID. I'm used to perl, and solutions like this are generally pretty portable.

Both your solution and Mike's make perfect sense. I was hoping that I had missed something and wanted to get the skinny before I start actually doing work to get around this. I'll just add the process checks to the wrapper script that runs Nagios.

WRT the complexity of Nagios and config editors, I've looked at several, and they do seem to provide some pretty straightforward configuration help. I don't think our environment is quite so simple. We've got about 4 major applications, each of which has multiple services associated with it as well as different support organizations, so we've got a fairly sophisticated setup.

To make matters even more interesting, for at least some of these services, the hosts on which they run -- and often the ports -- change on a regular basis, so we're looking toward writing some templated solutions to generating the configurations for Nagios...which, as you can imagine, provides some interesting challenges.

Anyway, thanks, folks, for your help. Much appreciated.

Eric

> -----Original Message-----
> From: Andreas Ericsson [mailto:ae at op5.se]
> Sent: Tuesday, December 21, 2010 2:45 AM
> To: Nagios Users List
> Cc: Berg, Eric: IT (NYK)
> Subject: Re: [Nagios-users] Nagios kept from restarting after reboot by lock file
>
> On 12/21/2010 01:54 AM, eric.berg at barclayscapital.com wrote:
> > If you had any idea how difficult it is just to do the most basic system administration tasks in the environment within which we're working, you'd be shaking your head in empathetic embarrassment. Filling out tickets and -- get this: our company has an official state for our tickets referred to as "Chasing", which is when, after entering your ticket into this multi-million dollar ticket tracking system, you then have to manually pick up the phone, email, or walk over to bug the guy who should now act on the ticket. It's a nightmare. I built it.
>
> Sounds unpleasant, inefficient and costly.
>
> > That was intended to express the irony of my search for a solution to keep my monitoring system up. I'm shocked that nagios can't tell the difference between a pid file that refers to a running process and one that refers to one that's not running any more.
>
> It can, but it's damn near impossible to do portably from a script, and hard enough to do from C. It's easy enough to check that *some* sort of process is running with the pid in the lockfile, but in order to do it correctly, one should also check that it's a Nagios process, and that's the hard part.
>
> The quick workaround is to add a boot script that runs before the Nagios startup script and unconditionally removes the lockfile. That's not a universal solution though, since the lockfile will remain if Nagios crashes.
>
> > It's the first thing about Nagios that's got my head scratching...besides the complex set of dependencies that you have to negotiate to do pretty much anything.
>
> Well, there are plenty of tools to help with configuration and such. Nacoma is one of them. Nagiosql is another.
Barclays Capital is the investment banking division of Barclays Bank PLC, a company registered in England (number 1026167) with its registered office at 1 Churchill Place, London, E14 5HP. This email may relate to or be sent from other members of the Barclays Group. _______________________________________________ ------------------------------------------------------------------------------ Lotusphere 2011 Register now for Lotusphere 2011 and learn how to connect the dots, take your collaborative environment to the next level, and enter the era of Social Business. http://p.sf.net/sfu/lotusphere-d2d _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From luc.maignan at winxpert.com Tue Dec 21 16:35:53 2010 From: luc.maignan at winxpert.com (Luc MAIGNAN) Date: Tue, 21 Dec 2010 16:35:53 +0100 Subject: Email Notifications Message-ID: <4D10C959.3010608@winxpert.com> Hi, I wonder if it is possible to have email notifications but JUST for a specified list of monitored services ? Thanks for any help ------------------------------------------------------------------------------ Forrester recently released a report on the Return on Investment (ROI) of Google Apps. They found a 300% ROI, 38%-56% cost savings, and break-even within 7 months. Over 3 million businesses have gone Google with Google Apps: an online email calendar, and document program that's accessible from your _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null From jpratt at norwich.edu Tue Dec 21 16:56:47 2010 From: jpratt at norwich.edu (James Pratt) Date: Tue, 21 Dec 2010 10:56:47 -0500 Subject: Email Notifications In-Reply-To: <4D10C959.3010608@winxpert.com> References: <4D10C959.3010608@winxpert.com> Message-ID: <369C2BA4DE2C8F4A88BC422AD06C96BD328B@nuexchange.norwich.edu> -----Original Message----- From: Luc MAIGNAN [mailto:luc.maignan at winxpert.com] Sent: Tuesday, December 21, 2010 10:36 AM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] Email Notifications Hi, I wonder if it is possible to have email notifications but JUST for a specified list of monitored services ? Thanks for any help ------------------------------------------------------------------------ ------ Hi - I'm pretty sure you can use : notifications_enabled 0 in the service or host definition for the ones you don't want notifications on ... then again, there are probably other ways as well.. cheers, James ------------------------------------------------------------------------------ Forrester recently released a report on the Return on Investment (ROI) of Google Apps. They found a 300% ROI, 38%-56% cost savings, and break-even within 7 months. Over 3 million businesses have gone Google with Google Apps: an online email calendar, and document program that's accessible from your browser. Read the Forrester report: http://p.sf.net/sfu/googleapps-sfnew _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
From luc.maignan at winxpert.com Tue Dec 21 17:01:55 2010
From: luc.maignan at winxpert.com (Luc MAIGNAN)
Date: Tue, 21 Dec 2010 17:01:55 +0100
Subject: Email Notifications
In-Reply-To: <369C2BA4DE2C8F4A88BC422AD06C96BD328B@nuexchange.norwich.edu>
References: <4D10C959.3010608@winxpert.com> <369C2BA4DE2C8F4A88BC422AD06C96BD328B@nuexchange.norwich.edu>
Message-ID: <4D10CF73.6030609@winxpert.com>

I wasn't clear enough, I think. Today all notifications go to one user, and I don't want to change this. But for a specified list of services only, I want to notify another user.

On 21/12/10 16:56, James Pratt wrote:
> -----Original Message-----
> From: Luc MAIGNAN [mailto:luc.maignan at winxpert.com]
> Sent: Tuesday, December 21, 2010 10:36 AM
> To: nagios-users at lists.sourceforge.net
> Subject: [Nagios-users] Email Notifications
>
> Hi,
>
> I wonder if it is possible to have email notifications but JUST for a
> specified list of monitored services ?
>
> Thanks for any help
>
> Hi -
>
> I'm pretty sure you can use :
>
> notifications_enabled 0
>
> in the service or host definition for the ones you don't want
> notifications on ... then again, there are probably other ways as well..
>
> cheers,
> James
From jpratt at norwich.edu Tue Dec 21 17:23:45 2010
From: jpratt at norwich.edu (James Pratt)
Date: Tue, 21 Dec 2010 11:23:45 -0500
Subject: Email Notifications
In-Reply-To: <4D10CF73.6030609@winxpert.com>
References: <4D10C959.3010608@winxpert.com> <369C2BA4DE2C8F4A88BC422AD06C96BD328B@nuexchange.norwich.edu> <4D10CF73.6030609@winxpert.com>
Message-ID: <369C2BA4DE2C8F4A88BC422AD06C96BD328D@nuexchange.norwich.edu>

Ok, no problem - create a new contact/contact group, then add it to whatever service or host definition(s) you want the notifications for, using "contact_groups"... like this:

define service {
    service_description    MyService
    check_period           24x7
    max_check_attempts     5
    contact_groups         linuxadmins,operations
    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
    ...
}

cheers,
James

-----Original Message-----
From: Luc MAIGNAN [mailto:luc.maignan at winxpert.com]
Sent: Tuesday, December 21, 2010 11:02 AM
To: nagios-users at lists.sourceforge.net
Subject: Re: [Nagios-users] Email Notifications

I wasn't clear enough, I think. Today all notifications go to one user, and I don't want to change this. But for a specified list of services only, I want to notify another user.

On 21/12/10 16:56, James Pratt wrote:
> -----Original Message-----
> From: Luc MAIGNAN [mailto:luc.maignan at winxpert.com]
> Sent: Tuesday, December 21, 2010 10:36 AM
> To: nagios-users at lists.sourceforge.net
> Subject: [Nagios-users] Email Notifications
>
> Hi,
>
> I wonder if it is possible to have email notifications but JUST for a
> specified list of monitored services ?
>
> Thanks for any help
>
> Hi -
>
> I'm pretty sure you can use :
>
> notifications_enabled 0
>
> in the service or host definition for the ones you don't want
> notifications on ... then again, there are probably other ways as well..
>
> cheers,
> James
From ccaswell at mcnc.org Tue Dec 21 17:48:14 2010
From: ccaswell at mcnc.org (Chris Caswell)
Date: Tue, 21 Dec 2010 11:48:14 -0500
Subject: Nagios Looking Glass Filters
Message-ID:

Has anyone had success with the NLG version 1.1.0b? I've been working with it and see some odd behavior while trying to set up host filters. I've read through the FILTERS_HOW_TO.txt file and was successful in adding one filter to the default set, but since then I have not been able to update the filters, no matter how I edit the server version of s3_filter_inc. The odd thing is that I can successfully change the names of the filters in the list, but can't seem to affect the actual content of the filters.

Thoughts?

Thanks.

Chris Caswell
MCNC (North Carolina Research and Education Network)
From Andrew.Fay at ajl.co.uk Tue Dec 21 18:08:56 2010
From: Andrew.Fay at ajl.co.uk (Andrew Fay)
Date: Tue, 21 Dec 2010 17:08:56 -0000
Subject: Nagios Windows Check_NT
Message-ID: <73B0897B46A48E429822E662D4530F886A5477@aber-exch-003.ajl.co.uk>

Hello,

Hopefully someone can help me out. I am setting up Nagios on an Ubuntu box to monitor Windows servers, and Check_NT is not playing ball. When it is scanning services it is coming up with:

NSClient++ Version   UNKNOWN   2010-12-21 17:04:06   0d 0h 0m 4s   4/4   Usage:check_nt -H host -v variable [-p port] [-w warning] [-c critical]
C:\ Drive Space      UNKNOWN   2010-12-21 17:05:31   0d 0h 0m 9s   4/4   check_nt: Could not parse arguments

I am using the package install, which is version 3.2.1. The test machine is just an XP desktop that I am trying to get Nagios to pick up. I have installed NSClient++-0.3.8-Win32 on the client machine with pretty much default settings, except that the only machine allowed to access it is the Ubuntu machine (given by name); taking this restriction out doesn't make a difference.

Commands I am using (which are just copied straight from templates):

define command {
    command_name    check_nt
    command_line    /usr/lib/nagios/plugins/check_nt -H $HOSTADDRESS$ -v $ARG1$ -v $ARG2$
}

define service{
    use                    generic-service
    host_name              computer1
    service_description    NSClient++ Version
    check_command          check_nt!CLIENTVERSION
}

Can anyone help?

Cheers,
Andy

o----------------------------------------------------------------------o
This Email has been scanned for viruses by Aberdeen Journals' Outbound Email Security Systems.
o----------------------------------------------------------------------o
They found a 300% ROI, 38%-56% cost savings, and break-even within 7 months. Over 3 million businesses have gone Google with Google Apps: an online email calendar, and document program that's accessible from your browser. Read the Forrester report: http://p.sf.net/sfu/googleapps-sfnew -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Andrew.Fay at ajl.co.uk Tue Dec 21 19:31:13 2010 From: Andrew.Fay at ajl.co.uk (Andrew Fay) Date: Tue, 21 Dec 2010 18:31:13 -0000 Subject: nagios mail alerts to exchange Message-ID: <73B0897B46A48E429822E662D4530F886A547B@aber-exch-003.ajl.co.uk> Does anyone know where I can get a guide on how to use postfix to mail a local exchange server - just now I have a local installation of postfix no config, the mails just bounce off the exchange server. I did it years ago and cannot for the life of me remember how - I think I was using postfix and fetchmail, Cheers, Andy o----------------------------------------------------------------------o This Email has been scanned for viruses by Aberdeen Journals' Outbound Email Security Systems. o----------------------------------------------------------------------o -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Forrester recently released a report on the Return on Investment (ROI) of Google Apps. They found a 300% ROI, 38%-56% cost savings, and break-even within 7 months. Over 3 million businesses have gone Google with Google Apps: an online email calendar, and document program that's accessible from your browser. 
From albrecht.dress at arcor.de Tue Dec 21 19:52:45 2010
From: albrecht.dress at arcor.de (Albrecht Dreß)
Date: Tue, 21 Dec 2010 19:52:45 +0100
Subject: Nagios Windows Check_NT
In-Reply-To: <73B0897B46A48E429822E662D4530F886A5477@aber-exch-003.ajl.co.uk> (from Andrew.Fay@ajl.co.uk on Tue Dec 21 18:08:56 2010)
References: <73B0897B46A48E429822E662D4530F886A5477@aber-exch-003.ajl.co.uk>
Message-ID: <1292957565.1761.1@antares>

On 21.12.10 18:08, Andrew Fay wrote:
> Check_NT is not playing ball, when it is scanning services it is coming
> up with :
[snipped description]

The following works for me in the commands.cfg...

# 'check_nt' command definition
define command{
    command_name    check_nt
    command_line    $USER1$/check_nt -H $HOSTADDRESS$ -p 12489 -v $ARG1$ $ARG2$
}

...and the xp machine's config:

define service{
    use                    hourly-service
    host_name              winxp-05, winxp-06
    service_description    C:\ Drive Space
    check_command          check_nt!USEDDISKSPACE!-l C -w 80 -c 90
}

The NSC.ini file on the xp boxes looks as follows (comments stripped):

connand_time
[modules]
CheckSystem.dll
CheckDisk.dll
NSClientListener.dll
NRPEListener.dll
SysTray.dll
CheckEventLog.dll
CheckHelpers.dll
CheckWMI.dll
[Settings]
allowed_hosts=/32
use_file=1
[log]
debug=0
file=C:\Programme\NSClient++\NSC.log
[NSClient]
[NRPE]
command_timeout=300
allow_arguments=1
allow_nasty_meta_chars=1
[Check System]
[External Script]
command_timeout=300
[External Scripts]
[External Alias]
[NSCA Agent]
[NSCA Commands]
[NRPE Handlers]
command[check_win_updates]=c:\\windows\\system32\\cscript.exe //NoLogo //T:300 C:\\Programme\\NSClient++\\plugins\\check_windows_updates.wsf /w:0 /c:1
[NRPE Client Handlers]
check_other=-H 192.168.0.1 -p 5666 -c remote_command -a arguments

Hth, Albrecht.
From jim at jimavery.me.uk Tue Dec 21 19:54:34 2010
From: jim at jimavery.me.uk (Jim Avery)
Date: Tue, 21 Dec 2010 18:54:34 +0000
Subject: Nagios Windows Check_NT
In-Reply-To: <73B0897B46A48E429822E662D4530F886A5477@aber-exch-003.ajl.co.uk>
References: <73B0897B46A48E429822E662D4530F886A5477@aber-exch-003.ajl.co.uk>
Message-ID:

On 21 December 2010 17:08, Andrew Fay wrote:
> command_line    /usr/lib/nagios/plugins/check_nt -H $HOSTADDRESS$ -v $ARG1$ -v $ARG2$

check_nt shouldn't need two "-v" arguments. Run:

/usr/local/nagios/libexec/check_nt --help

from the command line to see what arguments are expected. Then run the check_nt command at the command line and make sure it is returning the information you expect before configuring it in Nagios.

You'll find some example Nagios command definitions at:

http://nsclient.org/nscp/wiki/NSClientListener
From MarkL at lmfj.com Tue Dec 21 20:38:17 2010
From: MarkL at lmfj.com (Mark A. Lappin)
Date: Tue, 21 Dec 2010 13:38:17 -0600
Subject: nagios mail alerts to exchange
In-Reply-To: <73B0897B46A48E429822E662D4530F886A547B@aber-exch-003.ajl.co.uk>
References: <73B0897B46A48E429822E662D4530F886A547B@aber-exch-003.ajl.co.uk>
Message-ID: <0227B653B3DC82438B8291BC5218612F67377688B2@lmfjex07.lmfj.com>

> Does anyone know where I can get a guide on how to use postfix to
> mail a local exchange server - just now I have a local installation of
> postfix no config, the mails just bounce off the exchange server.
> I did it years ago and cannot for the life of me remember how - I think
> I was using postfix and fetchmail,

I'm not using postfix, but I have a few boxes running exim and some running sendmail. I have exim and sendmail both configured to use my exchange server as a smarthost for all mail. Exchange is set to accept all inbound mail connections from 192.168.0.0/16 and route and relay it accordingly.

I didn't do my own sendmail config, and I'm not too familiar with sendmail; but in exim I set it up for no local mail, and all mail is sent through a relay or smarthost. Postfix should have similar settings.

Mark

From: Andrew Fay [mailto:Andrew.Fay at ajl.co.uk]
Sent: Tuesday, December 21, 2010 12:31 PM
To: nagios-users at lists.sourceforge.net
Subject: [Nagios-users] nagios mail alerts to exchange

Cheers,
Andy

o----------------------------------------------------------------------o
This Email has been scanned for viruses by Aberdeen Journals' Outbound Email Security Systems.
o----------------------------------------------------------------------o

Mark A. Lappin, CCNA, MCITP: Enterprise Administrator | Lee Michaels Fine Jewelry
Director of Information Technology
11314 Cloverland Ave | Baton Rouge, LA 70809
Ph: 225.291.9094 ext 245 | Fax: 225.368.3675 | Mobile: 225-362-2770
www.lmfj.com

This communication is privileged and confidential. If you are not the intended recipient, please notify the sender by reply e-mail and destroy all copies of this communication.

From eric.berg at barclayscapital.com Tue Dec 21 20:57:49 2010
From: eric.berg at barclayscapital.com (eric.berg at barclayscapital.com)
Date: Tue, 21 Dec 2010 14:57:49 -0500
Subject: Nagios kept from restarting after reboot by lockfile
In-Reply-To: <31B0FE0A1A8166409E9DF35C6DEECB24054BC032@WPSCV6MM.OPR.STATEFARM.ORG>
References: <31B0FE0A1A8166409E9DF35C6DEECB24054BBABE@WPSCV6MM.OPR.STATEFARM.ORG> <31B0FE0A1A8166409E9DF35C6DEECB24054BC032@WPSCV6MM.OPR.STATEFARM.ORG>
Message-ID:

Good stuff, Dan. I was not aware of the differences between how the reboot and shutdown commands handle the reboot process. Turns out that we're doing a reboot -f, which explains why I have orphaned PID files lying around. I'm going to make the call right now that the fight to have 'reboot -f' changed to the plays-more-nicely-with-others "shutdown -r" is already lost, and I'm going to work around that in code.

Thanks for helping clarify this.

It's weird... when I run nagios and kill it with -9, it leaves the pid file intact, but when I restart it, it zeroes out the pid file and starts just fine. When I just kill it with the default kill signal, it removes the pid file.

In any case, I now know what the issues are and how to address this.

Thanks again very much for your help, guys. You are a feature of Nagios.

Eric

> -----Original Message-----
> From: Daniel Wittenberg [mailto:daniel.wittenberg.r0ko at statefarm.com]
> Sent: Tuesday, December 21, 2010 9:23 AM
> To: Nagios Users List
> Subject: Re: [Nagios-users] Nagios kept from restarting after reboot by lockfile
>
> So are you using the actual "reboot" command, not "shutdown -r now", which
> is a little friendlier? The standard nagios shutdown script should take
> care of cleaning those up for you. Otherwise putting something like:
> rm -f ; service nagios start
> in your rc.local would take care of it. But when you mention pid file,
> are you saying the PID file is still there, or the lock file? They are
> different things. Again though, if nagios is shut down properly
> you shouldn't be seeing that.
>
> Dan
>
> -----Original Message-----
> From: eric.berg at barclayscapital.com [mailto:eric.berg at barclayscapital.com]
> Sent: Monday, December 20, 2010 6:59 PM
> To: nagios-users at lists.sourceforge.net
> Subject: Re: [Nagios-users] Nagios kept from restarting after reboot by lockfile
>
> We reboot all of our hosts on a weekly basis. I used to pride myself on
> keeping my boxes up as long as possible, but having spent years now
> supporting mission-critical financial production applications, I'm on
> board with the weekly reboots. Lets you know early if some system or
> app change is problematic.
> > Reboot is being done via a standard reboot command. > > I've looked around for rc scripts that might address this issue, but > haven't found any. Got any pointers? > > Regarding the rc.local solution, a) I'd prefer to solve the > problem, not > just address the symptoms, and b) elsewhere in this thread I've > described the roadblocks that we have to doing anything a > system level. > Yep, that's right, boys, we survive in the app developer layer within > which we do not have root on these boxes. It's a tedious, > time-consuming, frustrating, productivity-killing endeavor to do just > about anything you can't do yourself. > > So....got any sample RC scripts, or command line params to nagios to > make it smart enough to know that the PID that is in it's PID > file isn't > an active process? > > Thanks. > > Eric > > > -----Original Message----- > > From: Daniel Wittenberg > [mailto:daniel.wittenberg.r0ko at statefarm.com] > > Sent: Monday, December 20, 2010 11:56 AM > > To: Nagios Users List > > Subject: Re: [Nagios-users] Nagios kept from restarting after > > reboot by lockfile > > > > Couple questions > > 1) Why do you have to reboot your monitoring server weekly? > > 2) How is the reboot being done? > > > > Reason I ask 2) is because the standard rc script will remove the > > lockfile when nagios is told to stop. So if you are having > > this problem > > is sounds like you are not doing a clean shutdown and > > something could be > > wrong. > > > > Either way, I guess worst case one way to check for this > would be put > > something like this in your /etc/rc.d/rc.local: > > rm -f /var/lock/subsys/nagios > > > > Assuming that's where your lockfile is. 
> > > > Dan > > > > > > -----Original Message----- > > From: eric.berg at barclayscapital.com > > [mailto:eric.berg at barclayscapital.com] > > Sent: Monday, December 20, 2010 10:16 AM > > To: eric.berg at barclayscapital.com; > nagios-users at lists.sourceforge.net > > Subject: Re: [Nagios-users] Nagios kept from restarting after > > reboot by > > lockfile > > > > Alternatively, could you recommend a good system/resource monitoring > > tool that would be able to let me know if nagios is down and > > restart it > > automatically? > > > > _____________________________________________ > > From: Berg, Eric: IT (NYK) > > Sent: Monday, December 20, 2010 11:03 AM > > To: 'nagios-users at lists.sourceforge.net' > > Subject: Nagios kept from restarting after reboot by > lock file > > > > Gee, this seems like an annoying newbie problem, but if > Nagios crashes > > or is killed (as on system reboot), it leaves a lock file > around that > > prevents it from starting again until the lock file is > > manually removed. > > > > I see this on Monday mornings after weekend reboots on a > Red Hat Linux > > box: > > > > nagios: Lockfile '/home/nagios/nagios/var/nagios.lock' > looks like its > > already held by another instance of Nagios (PID 0). Bailing out... > > > > Does anyone know if there's a config option or something else that > > obviates the need to write a wrapper scropt to check to see > > if Nagios is > > really running and remove the lock file (look slike Nagios > > already knows > > it's not running by virtue of the value of the PID inthis > > very message!) > > so that it can cleanly start up again? > > > > Thanks. > > > > Eric > > > > _______________________________________________ > > > > This e-mail may contain information that is confidential, > > privileged or > > otherwise protected from disclosure. If you are not an intended > > recipient of this e-mail, do not duplicate or redistribute it by any > > means. 
Over 3 million businesses have gone Google with Google Apps: an online email calendar, and document program that's accessible from your browser. Read the Forrester report: http://p.sf.net/sfu/googleapps-sfnew _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From work at paul.dubuc.org Tue Dec 21 21:56:50 2010 From: work at paul.dubuc.org (Paul M. Dubuc) Date: Tue, 21 Dec 2010 15:56:50 -0500 Subject: Nagios kept from restarting after reboot by lockfile In-Reply-To: References: <31B0FE0A1A8166409E9DF35C6DEECB24054BBABE@WPSCV6MM.OPR.STATEFARM.ORG> <31B0FE0A1A8166409E9DF35C6DEECB24054BC032@WPSCV6MM.OPR.STATEFARM.ORG> Message-ID: <4D111492.2010909@paul.dubuc.org> eric.berg at barclayscapital.com wrote: > > It's weird....when I run nagios and kill it with -9, it leaves the pid > file intact, but when I restart it, it zero's out the pid file and starts > just fine. when I just kill it with the default kill signal, it removes the > pid file. This isn't weird. That's how it should work. kill -9 sends an uncatchable, compulsory, kill signal (SIGKILL) to the process giving it no time to clean up before exiting. The default kill signal is SIGTERM, which can be caught and handled (or ignored) by the process. Restarting Nagios from the web interface, doesn't terminate and restart the process (the PID doesn't change), only re-initializes it. ------------------------------------------------------------------------------ Forrester recently released a report on the Return on Investment (ROI) of Google Apps. They found a 300% ROI, 38%-56% cost savings, and break-even within 7 months. 
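The SIGKILL/SIGTERM difference described above can be reproduced outside Nagios. What follows is only a minimal sketch in Python for a POSIX system: the child process is a hypothetical stand-in daemon, not Nagios itself, and the lock file name demo.lock is made up for the illustration.

```python
import os
import signal
import subprocess
import sys
import tempfile
import time

# Hypothetical stand-in daemon (not Nagios): it creates a lock file and
# removes it again when it receives SIGTERM.
CHILD = """
import os, signal, sys, time

lock = sys.argv[1]

def cleanup(signum, frame):
    # SIGTERM can be caught, so there is a chance to clean up before exiting.
    os.remove(lock)
    sys.exit(0)

signal.signal(signal.SIGTERM, cleanup)
open(lock, "w").close()
while True:
    time.sleep(0.1)
"""


def lock_survives(sig):
    """Start the stand-in daemon, send it `sig`, and report whether
    the lock file is still there after the process has gone."""
    with tempfile.TemporaryDirectory() as tmp:
        lock = os.path.join(tmp, "demo.lock")
        proc = subprocess.Popen([sys.executable, "-c", CHILD, lock])
        # Wait (up to 5 seconds) until the child has created its lock file.
        deadline = time.time() + 5
        while not os.path.exists(lock) and time.time() < deadline:
            time.sleep(0.05)
        proc.send_signal(sig)
        proc.wait()
        return os.path.exists(lock)


if __name__ == "__main__":
    print("lock left behind after SIGKILL:", lock_survives(signal.SIGKILL))
    print("lock left behind after SIGTERM:", lock_survives(signal.SIGTERM))
```

With SIGKILL the handler never runs, so the stale lock file should remain; with SIGTERM the handler gets to remove it. How Nagios itself treats a stale lock file on the next start is a separate question that this sketch does not model.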
From ravishankar.gundlapali at wipro.com Wed Dec 22 04:40:41 2010
From: ravishankar.gundlapali at wipro.com (ravishankar.gundlapali at wipro.com)
Date: Wed, 22 Dec 2010 09:10:41 +0530
Subject: Application monitoring using Nagios!!!
References: <9A51A0B8413A6645A4122F20579848987E924C7073@VANQUISH.sws.int.southernwater.co.uk>
 <9A51A0B8413A6645A4122F20579848987E924C7074@VANQUISH.sws.int.southernwater.co.uk>
Message-ID: <618F928F9E2D824AAB26DCAA0F25221F027A9BFA@HYD-MKD-MBX02.wipro.com>

Hi,

I am using Nagios Version 3.0.6 on a Linux server.

Please let me know whether we have an option of configuring the credentials so that Nagios will automatically log in to an application and send an alert on success or failure.

Thanks,
Ravi G

From ae at op5.se Wed Dec 22 09:56:36 2010
From: ae at op5.se (Andreas Ericsson)
Date: Wed, 22 Dec 2010 09:56:36 +0100
Subject: Application monitoring using Nagios!!!
In-Reply-To: <618F928F9E2D824AAB26DCAA0F25221F027A9BFA@HYD-MKD-MBX02.wipro.com>
References: <9A51A0B8413A6645A4122F20579848987E924C7073@VANQUISH.sws.int.southernwater.co.uk>
 <9A51A0B8413A6645A4122F20579848987E924C7074@VANQUISH.sws.int.southernwater.co.uk>
 <618F928F9E2D824AAB26DCAA0F25221F027A9BFA@HYD-MKD-MBX02.wipro.com>
Message-ID: <4D11BD44.4050103@op5.se>

On 12/22/2010 04:40 AM, ravishankar.gundlapali at wipro.com wrote:
> Hi,
>
> I am using Nagios Version 3.0.6 on a Linux server.
>

You might want to upgrade. Nagios 3.2.3 is the latest stable.

> Please let me know whether we have an option of configuring the
> credentials so that Nagios will automatically login to an application
> and send an alert on success or failure.
>

You do. RTFM for more detailed info.

--
Andreas Ericsson andreas.ericsson at op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231

Considering the successes of the wars on alcohol, poverty, drugs and
terror, I think we should give some serious thought to declaring war
on peace.

From uplink.team at gmail.com Wed Dec 22 09:42:20 2010
From: uplink.team at gmail.com (uplink.team at gmail.com)
Date: Wed, 22 Dec 2010 09:42:20 +0100
Subject: timeperiods / exclude daily timespan with "exception" / nagios 3.2.x ?
Message-ID:

Hi all,

I am trying to figure out the most elegant/readable way to specify a timeperiod (for notifications) which excludes a small timespan each day.

In the 3.2 docs [1] there is mention of the "exclude" directive (which does not reliably work yet, as stated in the release log for 3.2.0 [2]) and a - more intriguing - "[exception]" directive.

If I understand the former correctly, it's supposed to work like this (for creating a "blind spot" of 5 minutes daily):

define timeperiod{
        timeperiod_name exceptions_for_specific_service
        alias           Exception for specific service
        day 1 - -1      06:35-06:40     # 06:35 - 06:40 daily
        }

define timeperiod{
        timeperiod_name 24x7_exceptions_for_nlc_epichannel
        alias           24/7, but excluding 06:35 - 06:40 daily
        use             24x7
        exclude         exceptions_for_specific_service
        }

However, I fail at using the other directive, [exception]. How is this supposed to work? There seem to be no details or examples anywhere...

Can any of you please share a working example? Or point to some documentation I missed?

Thanks for any help!
Gustav

[1] http://nagios.sourceforge.net/docs/3_0/objectdefinitions.html#timeperiod
[2] http://www.nagios.org/projects/nagioscore/history/core-3x
"Known issue: Service checks that are defined with timeperiods that contain "exclude" directives are incorrectly re-scheduled. Don't use these for now - we'll get this fixed for 3.4"

From vasco.debian at gmail.com Wed Dec 22 13:54:37 2010
From: vasco.debian at gmail.com (Meghanand Acharekar)
Date: Wed, 22 Dec 2010 18:24:37 +0530
Subject: Nagios service check warning messages (..looks like it was orphaned..)
Message-ID:

Hi,

I can see lots of warning messages saying

....... looks like it was orphaned (results never came back). I'm scheduling an immediate check of the service...

(sometimes) in my nagios log. After doing a Google search I found this could be due to improper directive configuration. I'm checking many of my services via nrpe.

I just want to know what the possible reasons behind such warnings could be, and whether such warning messages can indicate a serious misconfiguration.
Regards,
Meghanand N. Acharekar

From Andrew.Fay at ajl.co.uk Wed Dec 22 14:03:09 2010
From: Andrew.Fay at ajl.co.uk (Andrew Fay)
Date: Wed, 22 Dec 2010 13:03:09 -0000
Subject: [SPAM] - Re: nagios mail alerts to exchange - Email found in subject
References: <73B0897B46A48E429822E662D4530F886A547B@aber-exch-003.ajl.co.uk>
 <0227B653B3DC82438B8291BC5218612F67377688B2@lmfjex07.lmfj.com>
Message-ID: <73B0897B46A48E429822E662D4530F886A5482@aber-exch-003.ajl.co.uk>

Thanks but I'd rather use postfix as it's the only linux mailing system I have a little experience with

Found this guide :

http://kurinchilamp.kurinchilion.com/2009/08/ubuntu-postfix-sendmail.html

But still not working

When I follow

root at greenland:/# telnet mail-server 25
Trying 172.24.8.11...
Connected to mail-server.domain.com.
Escape character is '^]'.
220 mailserver.domain.com Microsoft ESMTP MAIL Service, Version: 6.0.3790.3959 ready at Thu, 6 Aug 2009 12:36:38 -0400
helo gmail.com
250 mail.domain.com Hello [192.168.1.116]
mail from: tester at gmail.com
250 2.1.0 tester at gmail.com....Sender OK
rcpt to:receiver at gmail.com
250 2.1.5 receiver at gmail.com
data
354 Start mail input; end with <CRLF>.<CRLF>
testing email to google output

, yes

it all works fine but when I try and send an alert in Nagios I am getting :

Dec 21 18:54:22 NAGIOS-SERVER postfix/local[4318]: 906432AA8E: to=, relay=local, delay=7.6, delays=7.6/0.01/0/0.03, dsn=5.1.1, status=bounced (unknown user: "nagiosreports")

so it obviously isn't going out

any clues? : )

I have it set to satellite system,

Cheers,
Andy

-----Original Message-----
From: Mark A. Lappin [mailto:MarkL at lmfj.com]
Sent: 21 December 2010 19:38
To: Nagios Users List
Subject: [SPAM] - Re: [Nagios-users] nagios mail alerts to exchange - Email found in subject

> Does anyone know where I can get a guide on how to use postfix to
> mail a local exchange server - just now I have a local installation of
> postfix no config, the mails just bounce off the exchange server.
> I did it years ago and cannot for the life of me remember how - I think
> I was using postfix and fetchmail,

I'm not using postfix but have a few boxes running exim and some running sendmail. I have exim and sendmail both configured to use my exchange server as a smarthost for all mail. Exchange is set to accept all inbound mail connections from 192.168.0.0/16 and route & relay it accordingly.

I didn't do my own sendmail config, and I'm not too familiar with sendmail; but in exim I set it up for no local mail and all mail is sent through a relay or smarthost. Postfix should have similar settings.
Mark

From: Andrew Fay [mailto:Andrew.Fay at ajl.co.uk]
Sent: Tuesday, December 21, 2010 12:31 PM
To: nagios-users at lists.sourceforge.net
Subject: [Nagios-users] nagios mail alerts to exchange

Cheers,
Andy

From hvdkooij at vanderkooij.org Wed Dec 22 15:04:48 2010
From: hvdkooij at vanderkooij.org (Hugo van der Kooij)
Date: Wed, 22 Dec 2010 15:04:48 +0100
Subject: [SPAM] - Re: nagios mail alerts to exchange - Email found in subject
In-Reply-To: <73B0897B46A48E429822E662D4530F886A5482@aber-exch-003.ajl.co.uk>
References: <73B0897B46A48E429822E662D4530F886A547B@aber-exch-003.ajl.co.uk>
 <0227B653B3DC82438B8291BC5218612F67377688B2@lmfjex07.lmfj.com>
 <73B0897B46A48E429822E662D4530F886A5482@aber-exch-003.ajl.co.uk>
Message-ID: <836aeb5b58ae93a1734855389415d3b6@vps517.directvps.nl>

On Wed, 22 Dec 2010 13:03:09 -0000, "Andrew Fay" wrote:
> Thanks but I'd rather use postfix as it's the only linux mailing system
> I have a little experience with

Too little to be of much use. So what's the point?

> root at greenland:/# telnet mail-server 25
> Trying 172.24.8.11...
> Connected to mail-server.domain.com.
> Escape character is '^]'.
> 220 mailserver.domain.com Microsoft ESMTP MAIL Service, Version:
> 6.0.3790.3959 ready at Thu, 6 Aug 2009 12:36:38 -0400
> helo gmail.com
> 250 mail.domain.com Hello [192.168.1.116]
> mail from: tester at gmail.com
> 250 2.1.0 tester at gmail.com....Sender OK
> rcpt to:receiver at gmail.com
> 250 2.1.5 receiver at gmail.com
> data
> 354 Start mail input; end with <CRLF>.<CRLF>
> testing email to google output
> , yes
>
> it all works fine but when I try and send an alert in Nagios I am
> getting :
>
> Dec 21 18:54:22 NAGIOS-SERVER postfix/local[4318]: 906432AA8E:
> to=, relay=local, delay=7.6,
> delays=7.6/0.01/0/0.03, dsn=5.1.1, status=bounced (unknown user:
> "nagiosreports")

Obviously you have been testing eggs and are now trying to send apples. Your test should have been identical to something Nagios is trying. So talk to postfix instead of a remote SMTP server and try to mimic the same sender and recipient.

> so it obviously isn't going out
>
> any clues? : )

I think it is safe to say that you may not be the best person to configure this. If you have a business need for this then I guess hiring someone with sufficient knowledge is the best way to get this started.

Hugo.

--
hvdkooij at vanderkooij.org http://hugo.vanderkooij.org/
PGP/GPG? Use: http://hugo.vanderkooij.org/0x58F19981.asc
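The advice above (talk to the local postfix rather than a remote SMTP server, with the same sender and recipient Nagios would use) can be sketched in a few lines of Python using only the standard library. This is just one way to run such a test under stated assumptions: both addresses below are hypothetical placeholders, and localhost port 25 is assumed to be where the local postfix listens.

```python
import smtplib
from email.message import EmailMessage


def build_test_message(sender, recipient):
    """Build a small test mail with the same From/To that Nagios would use."""
    msg = EmailMessage()
    msg["From"] = sender
    msg["To"] = recipient
    msg["Subject"] = "Nagios notification path test"
    msg.set_content("Test message sent the same way a Nagios alert would be.")
    return msg


def send_via_local_mta(msg, host="localhost", port=25):
    """Hand the message to the local MTA (assumed to listen on host:port)."""
    with smtplib.SMTP(host, port, timeout=10) as smtp:
        smtp.send_message(msg)


if __name__ == "__main__":
    # Both addresses are placeholders; substitute the ones Nagios really uses.
    message = build_test_message("nagios@nagios.example.com",
                                 "nagiosreports@example.com")
    send_via_local_mta(message)
```

A successful hand-off here only means the local MTA accepted the message; whether it is then relayed or bounced still has to be read from the mail log, as in the postfix/local line quoted above.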
From MarkL at lmfj.com Wed Dec 22 15:23:37 2010
From: MarkL at lmfj.com (Mark A. Lappin)
Date: Wed, 22 Dec 2010 08:23:37 -0600
Subject: [SPAM] - Re: nagios mail alerts to exchange - Email found in subject
In-Reply-To: <73B0897B46A48E429822E662D4530F886A5482@aber-exch-003.ajl.co.uk>
References: <73B0897B46A48E429822E662D4530F886A547B@aber-exch-003.ajl.co.uk>
 <0227B653B3DC82438B8291BC5218612F67377688B2@lmfjex07.lmfj.com>
 <73B0897B46A48E429822E662D4530F886A5482@aber-exch-003.ajl.co.uk>
Message-ID: <0227B653B3DC82438B8291BC5218612F6737768938@lmfjex07.lmfj.com>

> Thanks but I'd rather use postfix as it's the only linux mailing system I have a little experience with

You don't have to switch; you can use anything you want or need to, you just have to configure it right.

> When I follow
> root at greenland:/# telnet mail-server 25
> Trying 172.24.8.11...
> Connected to mail-server.domain.com.
> Escape character is '^]'.
...
> it all works fine but when I try and send an alert in Nagios I am getting :

First thing, change the samples they give you to be your mail server, and tell it that the from and to addresses are the same things that Nagios is trying to use. That part is critical; otherwise you're not comparing apples to apples and you can't tell if you have an Exchange configuration or postfix configuration issue.

In Exchange you should check your Hub Transport Configuration for both the organization and server for how external mail is routed, domains you will accept mail locally for, domains you will relay for, IPs you will accept relay from.
In particular check your accepted domains configuration and your transport configuration.

>> Dec 21 18:54:22 NAGIOS-SERVER postfix/local[4318]: 906432AA8E:
>> to=, relay=local, delay=7.6, delays=7.6/0.01/0/0.03,
>> dsn=5.1.1, status=bounced (unknown user:"nagiosreports")
> so it obviously isn't going out
> any clues? : )
> I have it set to satellite system,

I don't have my Nagios box send mail as @mydomain.com; it sends it as Nagios at nagios.mydomain.com. I think I had some issues with recipient not found when I was trying to just do @mydomain.com, so I did this to make it work.

Check your config files and make sure everything seems reasonable: do you have the smarthost/relay host configured right, does it resolve right, did you give it just the computer name or did you give it a FQDN, have you tried switching it to an IP instead of a name to rule out a DNS lookup issue? Check your relay domain list as well; that's gotten me before.

Make sure nagiosreports at blah.co.uk, if it is in a domain local to your Exchange server, actually exists, either as a mailbox, a distribution group, etc., and that it has the correct SMTP aliases configured (and if it is a group, check who the allowed senders to it are; you probably can't restrict it to authenticated users only). Then try your telnet test using the same parameters to talk to Exchange to send mail as Nagios does.

Take Nagios out of the equation right now; get to where you can use some simple mail diagnostics on your Ubuntu box to send mail to yourself. Something simple like:

    mail -s Test123 me at myRealE-mailAddress.com
    Hello World
    .

Throw some tail -f's on your log files and see what happens. If that's not working, hit up Google or even some IRC channels and get that working first before you put Nagios back in the picture.

ML

Mark A. Lappin, CCNA, MCITP: Enterprise Administrator | Lee Michaels Fine Jewelry
Director of Information Technology
11314 Cloverland Ave | Baton Rouge, LA 70809
Ph: 225.291.9094 ext 245 | Fax: 225.368.3675 | Mobile: 225-362-2770
www.lmfj.com

From eric.berg at barclayscapital.com Wed Dec 22 15:45:19 2010
From: eric.berg at barclayscapital.com (eric.berg at barclayscapital.com)
Date: Wed, 22 Dec 2010 09:45:19 -0500
Subject: Nagios service check warning messages (..looks like it was orphaned..)
In-Reply-To:
References:
Message-ID:

I just ran into this yesterday. At first I thought it was a misconfiguration issue, but it turned out to be dueling nagios processes. Kill them all and start it anew. That worked for me.

Eric

________________________________
From: Meghanand Acharekar [mailto:vasco.debian at gmail.com]
Sent: Wednesday, December 22, 2010 7:55 AM
To: nagios-users at lists.sourceforge.net
Subject: [Nagios-users] Nagios service check warning messages (..looks like it was orphaned..)
Hi,

I can see lots of warning messages saying

....... looks like it was orphaned (results never came back). I'm scheduling an immediate check of the service...

(sometimes) in my nagios log, after doing Google search I found this could be due to improper directive configuration, I'm checking many of my services via nrpe.

I just want to know what could be the possible reasons behind such warnings. And is there possibility of serious misconfiguration on such warning messages.

Regards,
Meghanand N. Acharekar

From mh+nagios-users at zugschlus.de Wed Dec 22 15:17:05 2010
From: mh+nagios-users at zugschlus.de (Marc Haber)
Date: Wed, 22 Dec 2010 15:17:05 +0100
Subject: NAGIOS_ environment variables in a notification script
Message-ID: <20101222141705.GB24207@torres.zugschlus.de>

Hi,

I am trying to write a notification script which is called via the following command definition:

define command {
        command_name notify-service-by-email
        command_line /path/bin/notify --mail="$CONTACTEMAIL$"
}

To save myself from handing in all macros to the script via the command line, I'd like to use the macros that are written to the environment as NAGIOS_foo. Despite having set enable_environment_macros=1 in my nagios.cfg, the notification script only sees NAGIOS_PLUGIN=/path/bin/notify.

What am I doing wrong? I'm using Nagios 3.0.6 from Debian lenny.

Any hints will be appreciated.

Greetings
Marc

--
-----------------------------------------------------------------------------
Marc Haber         | "I don't trust Computers. They | Mail address in header
Mannheim, Germany  |  lose things."    Winona Ryder | Fon: *49 621 72739834
Nordisch by Nature |  How to make an American Quilt | Fax: *49 3221 2323190
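For the script side of the question above, reading the NAGIOS_ environment variables might look roughly like the following Python sketch. It assumes the variables do reach the script's environment, which is exactly what is not happening in the report above, so it illustrates the intended usage rather than a fix. The function name nagios_macros is made up for this example; CONTACTEMAIL, HOSTNAME, SERVICEDESC and SERVICESTATE are standard Nagios macro names.

```python
import os


def nagios_macros(environ=None):
    """Collect NAGIOS_* environment variables into a dict keyed by macro name
    (the NAGIOS_ prefix is dropped)."""
    environ = os.environ if environ is None else environ
    prefix = "NAGIOS_"
    return {key[len(prefix):]: value
            for key, value in environ.items()
            if key.startswith(prefix)}


if __name__ == "__main__":
    macros = nagios_macros()
    # Print a few macros a notification script typically needs.
    for name in ("CONTACTEMAIL", "HOSTNAME", "SERVICEDESC", "SERVICESTATE"):
        print(f"{name}={macros.get(name, '<not set>')}")
```

Running this as the notification command and looking at what it prints is also a quick way to see which NAGIOS_ variables actually arrive in the script's environment.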
They found a 300% ROI, 38%-56% cost savings, and break-even within 7 months. Over 3 million businesses have gone Google with Google Apps: an online email calendar, and document program that's accessible from your browser. Read the Forrester report: http://p.sf.net/sfu/googleapps-sfnew _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rick at screenscape.net Wed Dec 22 16:01:07 2010 From: rick at screenscape.net (Rick Munn) Date: Wed, 22 Dec 2010 11:01:07 -0400 Subject: PNP/MYSQL/RRDTool Message-ID: Hi, Is anyone using MYSQL to collect their PNP4Nagios Stats or is the Round Robin Database (RRDTool) the only way to do it? Thank you, Rick -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Forrester recently released a report on the Return on Investment (ROI) of Google Apps. They found a 300% ROI, 38%-56% cost savings, and break-even within 7 months. Over 3 million businesses have gone Google with Google Apps: an online email calendar, and document program that's accessible from your browser. Read the Forrester report: http://p.sf.net/sfu/googleapps-sfnew -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null From Andrew.Fay at ajl.co.uk Wed Dec 22 15:54:12 2010 From: Andrew.Fay at ajl.co.uk (Andrew Fay) Date: Wed, 22 Dec 2010 14:54:12 -0000 Subject: [SPAM] - Re: [SPAM] - Re: nagios mail alerts to exchange - Email found in subject - Email found in subject References: <73B0897B46A48E429822E662D4530F886A547B@aber-exch-003.ajl.co.uk><0227B653B3DC82438B8291BC5218612F67377688B2@lmfjex07.lmfj.com><73B0897B46A48E429822E662D4530F886A5482@aber-exch-003.ajl.co.uk> <836aeb5b58ae93a1734855389415d3b6@vps517.directvps.nl> Message-ID: <73B0897B46A48E429822E662D4530F886A5485@aber-exch-003.ajl.co.uk> I have sorted this myself now.. didn't tell postfix the $relay_host. I do have little experience with Linux but I am learning. Thanks for your input anyway. -----Original Message----- From: Hugo van der Kooij [mailto:hvdkooij at vanderkooij.org] Sent: 22 December 2010 14:05 To: Nagios Users List Subject: [SPAM] - Re: [Nagios-users] [SPAM] - Re: nagios mail alerts to exchange - Email found in subject - Email found in subject On Wed, 22 Dec 2010 13:03:09 -0000, "Andrew Fay" wrote: > Thanks but I'd rather use postfix as it's the only linux mailing > system > I have a little experience with Too little to be of much use. So what's the point? > root at greenland:/# telnet mail-server 25 > Trying 172.24.8.11... > Connected to mail-server.domain.com. > Escape character is '^]'. > 220 mailserver.domain.com Microsoft ESMTP MAIL Service, Version: > 6.0.3790.3959 ready at Thu, 6 Aug 2009 12:36:38 -0400 > helo gmail.com > 250 mail.domain.com Hello [192.168.1.116] > mail from: tester at gmail.com > 250 2.1.0 tester at gmail.com....Sender OK > rcpt to:receiver at gmail.com > 250 2.1.5 receiver at gmail.com > data > 354 Start mail input; end with . 
> testing email to google output
> , yes
>
> it all works fine but when I try and send an alert in Nagios I am
> getting :
>
> Dec 21 18:54:22 NAGIOS-SERVER postfix/local[4318]: 906432AA8E:
> to=, relay=local, delay=7.6,
> delays=7.6/0.01/0/0.03, dsn=5.1.1, status=bounced (unknown user:
> "nagiosreports")

Obviously you have been testing eggs and are now trying to send apples. Your test should have been identical to what Nagios is trying to do. So talk to postfix instead of a remote SMTP server, and try to mimic the same sender and recipient.

> so it obviously isn't going out
>
> any clues? : )

I think it is safe to say that you may not be the best person to configure this. If you have a business need for this, then I guess hiring someone with sufficient knowledge is the best way of getting this started.

Hugo.

--
hvdkooij at vanderkooij.org http://hugo.vanderkooij.org/
PGP/GPG? Use: http://hugo.vanderkooij.org/0x58F19981.asc
From pitchfork at ederdrom.de Wed Dec 22 17:34:49 2010
From: pitchfork at ederdrom.de (Joerg Linge)
Date: Wed, 22 Dec 2010 17:34:49 +0100
Subject: PNP/MYSQL/RRDTool
In-Reply-To:
References:
Message-ID: <4D1228A9.9030504@ederdrom.de>

Rick Munn wrote:
> Hi,
>
> Is anyone using MYSQL to collect their PNP4Nagios Stats or is the Round
> Robin Database (RRDTool) the only way to do it?

PNP4Nagios depends heavily on rrdtool, because the graphs are created by rrdtool too. It is not possible to use a storage engine other than rrdtool.

NagiosGrapherV2 might be a solution for you.

Joerg
From rick at screenscape.net Wed Dec 22 17:54:40 2010
From: rick at screenscape.net (Rick Munn)
Date: Wed, 22 Dec 2010 12:54:40 -0400
Subject: PNP/MYSQL/RRDTool
In-Reply-To: <4D1228A9.9030504@ederdrom.de>
References: <4D1228A9.9030504@ederdrom.de>
Message-ID:

Thanks Joerg, I'll check it out.

On Wed, Dec 22, 2010 at 12:34 PM, Joerg Linge wrote:
> Rick Munn wrote:
> > Hi,
> >
> > Is anyone using MYSQL to collect their PNP4Nagios Stats or is the Round
> > Robin Database (RRDTool) the only way to do it?
>
> PNP4Nagios depends heavily on rrdtool because the graphs are created by
> rrdtool too.
> Is not possible to use another storage engine then rrdtool.
>
> NagiosGrapherV2 might be a solution for you.
>
> Joerg
From daniel.wittenberg.r0ko at statefarm.com Wed Dec 22 19:32:21 2010
From: daniel.wittenberg.r0ko at statefarm.com (Daniel Wittenberg)
Date: Wed, 22 Dec 2010 11:32:21 -0700
Subject: max_check_attempts over time
Message-ID: <31B0FE0A1A8166409E9DF35C6DEECB240550B3E8@WPSCV6MM.OPR.STATEFARM.ORG>

We're looking at a service that has problems from time to time, and it gets auto-restarted when needed (Windows). What I've been looking for is a way to say that it needs 3 restarts in order to go critical, but only within a 1 hour time frame. Has anyone come up with a way to do time frames like this?

Dan
From mark.elsen at gmail.com Wed Dec 22 20:11:52 2010
From: mark.elsen at gmail.com (Mark Elsen)
Date: Wed, 22 Dec 2010 20:11:52 +0100
Subject: max_check_attempts over time
In-Reply-To: <31B0FE0A1A8166409E9DF35C6DEECB240550B3E8@WPSCV6MM.OPR.STATEFARM.ORG>
References: <31B0FE0A1A8166409E9DF35C6DEECB240550B3E8@WPSCV6MM.OPR.STATEFARM.ORG>
Message-ID:

> We're looking at a service that has problems from time-to-time, and it gets
> auto-restarted when needed (Windows). What I've been looking for is a way
> to say that it needs 3 restarts in order to go critical, but only within a 1
> hour time frame. Anyone come up with a way to do time frames like this?

Your problem is fuzzily defined: for example, 'needs 3 restarts to go critical' can be read in more than one way. Which reading is the correct one?

M.
From daniel.wittenberg.r0ko at statefarm.com Wed Dec 22 20:42:25 2010
From: daniel.wittenberg.r0ko at statefarm.com (Daniel Wittenberg)
Date: Wed, 22 Dec 2010 12:42:25 -0700
Subject: max_check_attempts over time
In-Reply-To:
References: <31B0FE0A1A8166409E9DF35C6DEECB240550B3E8@WPSCV6MM.OPR.STATEFARM.ORG>
Message-ID: <31B0FE0A1A8166409E9DF35C6DEECB240550B4A8@WPSCV6MM.OPR.STATEFARM.ORG>

I want to know if a service restarts 3 times in one hour.

Right now I can only find a way to do 3 times, ever, using max_check_attempts.

Better ?

Dan

-----Original Message-----
From: Mark Elsen [mailto:mark.elsen at gmail.com]
Sent: Wednesday, December 22, 2010 1:12 PM
To: Nagios Users List
Subject: Re: [Nagios-users] max_check_attempts over time

> We're looking at a service that has problems from time-to-time, and it gets
> auto-restarted when needed (Windows). What I've been looking for is a way
> to say that it needs 3 restarts ir order to go critical, but only within a 1
> hour time frame. Anyone come up with a way to do time frames like this?

Your problem is fuzzy defined : e.g : 'needs-3-restarts to go critical' , this can be be multi-explained. What is the correct one ?

M.
From mike-nagios at 5dninja.net Thu Dec 23 03:36:39 2010
From: mike-nagios at 5dninja.net (Mike Lindsey)
Date: Wed, 22 Dec 2010 18:36:39 -0800
Subject: NAGIOS_ environment variables in a notification script
In-Reply-To: <20101222141705.GB24207@torres.zugschlus.de>
References: <20101222141705.GB24207@torres.zugschlus.de>
Message-ID: <4D12B5B7.2010506@5dninja.net>

On 12/22/10 6:17 AM, Marc Haber wrote:
> Despite having set enable_environment_macros=1 in my nagios.cfg, the
> notification script only sees NAGIOS_PLUGIN=/path/bin/notify.
>
> What am I doing wrong?
>
> I'm using Nagios 3.0.6 from Debian lenny. Any hints will be appreciated.

enable_environment_macros should override use_large_installation_tweaks, which is the other setting that can disable environment macros. Perhaps your version is not acting as expected? See if you have u_l_i_t enabled, and if so, try disabling it.

If that isn't it, try setting debug_level=2 (and debug_file, etc). Restart and check the debug output to see if it's actually seeing the config directive. Perhaps you have a typo.

Then maybe set debug_level=32 and run a few notification tests (or just set it to 34 initially so you get notification and configuration debugging)...

Also, consider upgrading. Nagios 3.2+ is great.

--
Mike Lindsey

From Andrew.McIntyre at uk.enersys.com Thu Dec 23 04:01:07 2010
From: Andrew.McIntyre at uk.enersys.com (Andrew McIntyre)
Date: Thu, 23 Dec 2010 04:01:07 +0100
Subject: Andrew McIntyre is out of the office.
Message-ID:

I will be out of the office starting 12/23/2010 and will not return until 12/24/2010. I will respond to your message when I return.
From vasco.debian at gmail.com Thu Dec 23 07:20:32 2010
From: vasco.debian at gmail.com (Meghanand Acharekar)
Date: Thu, 23 Dec 2010 11:50:32 +0530
Subject: Nagios service check warning messages (..looks like it was orphaned..)
In-Reply-To:
References:
Message-ID:

Yes, I also tried this before; it could be one of the reasons. But for me the problem comes back after a few hours. I'm receiving only a few warning messages daily, around 15-20, while checking 300 services over 25 hosts.

Regards,
Meghanand N. Acharekar
" A proud Linux User "
Reg Linux User #397975
------------------------------------------
I was born free! No Gates and Windows can restrict my Freedom !!!

On Wed, Dec 22, 2010 at 8:15 PM, wrote:
> I just ran into this yesterday. At first I thought it was a
> misconfiguration issue, but it turned out to be dueling nagios processes.
> Kill them all and start it anew. That worked for me.
>
> Eric
>
> ------------------------------
> *From:* Meghanand Acharekar [mailto:vasco.debian at gmail.com]
> *Sent:* Wednesday, December 22, 2010 7:55 AM
> *To:* nagios-users at lists.sourceforge.net
> *Subject:* [Nagios-users] Nagios service check warning messages (..looks
> like it was orphaned..)
>
> Hi,
>
> I can see lots of warning messages saying
>
> ....... looks like it was orphaned (results never came back). I'm
> scheduling an immediate check of the service...
>
> (sometimes) in my nagios log, after doing Google search I found this could
> be due to improper directive configuration,
> I'm checking many of my services via nrpe.
>
> I just want to know what could be the possible reasons behind such
> warnings.
> And is there possibility of serious misconfiguration on such warning
> messages.
>
> Regards,
>
> Meghanand N. Acharekar
From jim at jimavery.me.uk Thu Dec 23 14:08:47 2010
From: jim at jimavery.me.uk (Jim Avery)
Date: Thu, 23 Dec 2010 13:08:47 +0000
Subject: timeperiods / exclude daily timespan with "exception" / nagios 3.2.x ?
In-Reply-To:
References:
Message-ID:

On 22 December 2010 08:42, wrote:
> Hi all,
> I am trying to figure out the most elegant/readable way to specify a
> timeperiod (for notifications), which exclude a small timespan each day.
> In the 3.2 docs [1] there is mention of the "exclude" directive (which does
> not reliably work yet, as stated in the releaselog for 3.2.0 [2]) and a -
> more intriguing - "[exception]" directive.
> If I understand the former correctly, it's supposed to work like this (for
> creating a "blind-spot" of 5 minutes daily):
> define timeperiod{
>         timeperiod_name exceptions_for_specific_service
>         alias           Exception for specific service
>         day 1 - -1      06:35-06:40     # 06:35 - 06:40 daily
>         }
>
> define timeperiod{
>         timeperiod_name 24x7_exceptions_for_nlc_epichannel
>         alias           24/7, but excluding 06:35 - 06:40 daily
>         use             24x7
>         exclude         exceptions_for_specific_service
>         }
>
>
> However, I fail at using the other directive [exception]. How is this
> supposed to work? There seems to be no details or examples anywhere...
> Can any of you please share a working example? Or point to some
> documentation I missed?
> Thanks for any help!
> Gustav
>
> [1] http://nagios.sourceforge.net/docs/3_0/objectdefinitions.html#timeperiod
> [2] http://www.nagios.org/projects/nagioscore/history/core-3x "Known issue:
> Service checks that are defined with timeperiods that contain "exclude"
> directives are incorrectly re-scheduled.
> Don't use these for now - we'll get this fixed for 3.4"

If you are just using the timeperiod for notifications (notification_period) and not for check scheduling (check_period), then I would hope the exclude directive will work ok. I confess I haven't tested it myself - I'm just going by what it says in your note [2].

The simple case I have here where I want to alert on all workdays between midnight and midnight, excluding weekends and bank holidays looks like this:

# U.K. holidays (actually just England and Wales) taken from
# http://www.direct.gov.uk/en/Governmentcitizensandrights/LivingintheUK/DG_073741
# Last updated 06/12/2010 by JRTA - will need updating again before Christmas 2011
define timeperiod{
        name            uk-holidays
        january 3       00:00-24:00     ; New Years Day
        april 22        00:00-24:00     ; Good Friday
        april 25        00:00-24:00     ; Easter Monday
        april 29        00:00-24:00     ; Royal Wedding
        may 2           00:00-24:00     ; Early May Bank Holiday
        may 30          00:00-24:00     ; Spring Bank Holiday
        august 29       00:00-24:00     ; Summer Bank Holiday
        december 27     00:00-24:00     ; Christmas
        december 28     00:00-24:00     ; Boxing Day
        register        0               ; This is a template
        }

# That was just a template. This is the actual timeperiod which we will use in an exclude directive later.
define timeperiod{
        timeperiod_name ukholidays
        alias           UK Holidays
        use             uk-holidays
        }

# workdays - all days which are not weekend or bank holidays
# note this includes the whole day midnight-midnight not just working hours
define timeperiod{
        timeperiod_name workdays
        alias           Workdays - whole days which are not bank hol or weekend
        monday          00:00-24:00
        tuesday         00:00-24:00
        wednesday       00:00-24:00
        thursday        00:00-24:00
        friday          00:00-24:00
        exclude         ukholidays
        }

And in the service definition:

        notification_period     workdays

I hope that helps.
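As a rough illustration of what the exclude directive in the workdays timeperiod is meant to express, here is a minimal Python sketch. It is not Nagios code and says nothing about how Nagios evaluates timeperiods internally; the function name in_workdays and the UK_HOLIDAYS set are hypothetical, and the year 2011 is an assumption applied to three of the dates listed above.

```python
from datetime import date, datetime

# Hypothetical subset of the holiday dates listed above; the year 2011 is assumed.
UK_HOLIDAYS = {date(2011, 4, 22), date(2011, 4, 25), date(2011, 4, 29)}


def in_workdays(moment, holidays=UK_HOLIDAYS):
    """Return True if `moment` is on a Monday-Friday that is not an excluded holiday."""
    is_weekday = moment.weekday() < 5        # Monday is 0, Friday is 4
    is_excluded = moment.date() in holidays  # plays the role of "exclude ukholidays"
    return is_weekday and not is_excluded


# A Thursday, the Good Friday holiday, and a Saturday.
for probe in (datetime(2011, 4, 21, 9, 0),
              datetime(2011, 4, 22, 9, 0),
              datetime(2011, 4, 23, 9, 0)):
    print(probe.date(), in_workdays(probe))
```

The point of the sketch is only that the weekday ranges define the base period and the excluded period is subtracted from it, so a holiday that falls on a weekday drops out.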
Cheers,

Jim

From jim at jimavery.me.uk Thu Dec 23 14:15:53 2010
From: jim at jimavery.me.uk (Jim Avery)
Date: Thu, 23 Dec 2010 13:15:53 +0000
Subject: max_check_attempts over time
In-Reply-To: <31B0FE0A1A8166409E9DF35C6DEECB240550B4A8@WPSCV6MM.OPR.STATEFARM.ORG>
References: <31B0FE0A1A8166409E9DF35C6DEECB240550B3E8@WPSCV6MM.OPR.STATEFARM.ORG> <31B0FE0A1A8166409E9DF35C6DEECB240550B4A8@WPSCV6MM.OPR.STATEFARM.ORG>
Message-ID:

On 22 December 2010 19:42, Daniel Wittenberg wrote:
> I want to know if a service restarts 3 times in one hour.
>
> Right now I can only find a way to do 3 times, ever, using max_check_attempts.
>
> Better ?
>
> Dan

If the service writes an event to the Windows event log each time it restarts then I would think it should be easy to query NSClient++ to alert if there are more than 3 such events in the last hour. See:

http://nsclient.org/nscp/wiki/CheckEventLog/CheckEventLog

Note the new syntax for CheckEventLog is a whole lot easier now than it was in previous versions, so if you're not using the up-to-date version of NSClient++, then I recommend you upgrade.

If you're not using NSClient++, please state what agent (if any) you are using.
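The "N restarts within the last hour" rule itself can be sketched as a sliding-window count, independent of where the restart timestamps come from. The following is a minimal Python sketch, not an NSClient++ or Nagios feature: the function names restarts_in_window and evaluate are hypothetical, how the timestamps are collected is left open, and the threshold of 3 follows the wording of the question. The only Nagios convention relied on is the plugin exit code (0 for OK, 2 for CRITICAL).

```python
from datetime import datetime, timedelta


def restarts_in_window(restart_times, now, window=timedelta(hours=1)):
    """Count the restart timestamps that fall inside the window ending at `now`."""
    cutoff = now - window
    return sum(1 for stamp in restart_times if cutoff <= stamp <= now)


def evaluate(restart_times, now, critical_at=3):
    """Return a (plugin exit code, message) pair: 2 is CRITICAL, 0 is OK."""
    count = restarts_in_window(restart_times, now)
    if count >= critical_at:
        return 2, "CRITICAL - %d restarts in the last hour" % count
    return 0, "OK - %d restarts in the last hour" % count


# Made-up restart times: 5, 20, 50 and 130 minutes before the check runs.
now = datetime(2010, 12, 23, 14, 0)
seen = [now - timedelta(minutes=m) for m in (5, 20, 50, 130)]
print(evaluate(seen, now))
```

Because old restarts fall out of the window on their own, the check recovers without any manual reset, which is the behaviour max_check_attempts alone does not give.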
hth,

Jim

From daniel.wittenberg.r0ko at statefarm.com Thu Dec 23 15:38:41 2010
From: daniel.wittenberg.r0ko at statefarm.com (Daniel Wittenberg)
Date: Thu, 23 Dec 2010 07:38:41 -0700
Subject: max_check_attempts over time
In-Reply-To:
References: <31B0FE0A1A8166409E9DF35C6DEECB240550B3E8@WPSCV6MM.OPR.STATEFARM.ORG><31B0FE0A1A8166409E9DF35C6DEECB240550B4A8@WPSCV6MM.OPR.STATEFARM.ORG>
Message-ID: <31B0FE0A1A8166409E9DF35C6DEECB240550B7E9@WPSCV6MM.OPR.STATEFARM.ORG>

What if it doesn't write to the eventlog? We have some strange apps here...

Dan

-----Original Message-----
From: Jim Avery [mailto:jim at jimavery.me.uk]
Sent: Thursday, December 23, 2010 7:16 AM
To: Nagios Users List
Subject: Re: [Nagios-users] max_check_attempts over time

On 22 December 2010 19:42, Daniel Wittenberg wrote:
> I want to know if a service restarts 3 times in one hour.
>
> Right now I can only find a way to do 3 times, ever, using max_check_attempts.
>
> Better ?
>
> Dan

If the service writes an event to the Windows event log each time it restarts then I would think it should be easy to query NSClient++ to alert if there are more than 3 such events in the last hour.
See:

http://nsclient.org/nscp/wiki/CheckEventLog/CheckEventLog

Note the new syntax for CheckEventLog is a whole lot easier now than it was in previous versions, so if you're not using the up-to-date version of NSClient++, then I recommend you upgrade.

If you're not using NSClient++, please state what agent (if any) you are using.

hth,

Jim

From himanshuarora7 at gmail.com Fri Dec 24 07:06:46 2010
From: himanshuarora7 at gmail.com (Himanshu Arora)
Date: Fri, 24 Dec 2010 11:36:46 +0530
Subject: CHECK_SVN
Message-ID:

Hi Luis,

Can you share your plugin here?
Date: Fri, 19 Nov 2010 16:27:13 -0200
From: Luis Gustavo de Andrade Jordão Carneiro
Subject: [Nagios-users] CHECK_SVN
To: "'nagios-users at lists.sourceforge.net'"
Message-ID: <6E3EBE71CB62274F8EB78338A6CDD9750207ADBF875A at ADEXC07A.intranet.local>
Content-Type: text/plain; charset="iso-8859-1"

Hello People!!

Can you help me? I installed the check_svn plugin in my Nagios 3.2.3. When I run check_svn from the command prompt, it works, but not when Nagios runs check_svn.

nagiosql:~# su - nagios
nagios at nagiosql:/usr/local/nagios/libexec$ ./check_svn -H 10.0.0.33 --https -U lcarneiro -P xxxx --dir /svn/aocsw_config
SVN OK: svn repository online - directory listing successful

But when the Nagios service runs it, it does not work. See:

SVN CRITICAL: Error connecting to svn server - OPTIONS of 'https://svnaero/svn/aocsw_desenv': Server certificate verification failed: issuer is not trusted (https://svnaero)

Commands.cfg:

$USER1$/check_svn -H $HOSTADDRESS$ --https -U $ARG1$ -P $ARG2$ --dir $ARG3$

Service:

$ARG1$ lcarneiro
$ARG2$ xxxxx
$ARG3$ /svn/aocsw_desenv
check_command check_svn!lcarneiro!xxxxx!/svn/aocsw_desenv

I have already accepted the certificate permanently (p) using svn info . How can I find out which user runs check_svn? I think this could be the problem, because I need to accept the certificate as the user that runs check_svn.

Luis Gustavo de A. J. Carneiro
From Andrew.McIntyre at uk.enersys.com Fri Dec 24 11:27:54 2010
From: Andrew.McIntyre at uk.enersys.com (Andrew McIntyre)
Date: Fri, 24 Dec 2010 11:27:54 +0100
Subject: Andrew McIntyre is out of the office.
Message-ID:

I will be out of the office starting 12/24/2010 and will not return until 01/04/2011. I will respond to your message when I return.
From mh+nagios-users at zugschlus.de Sun Dec 26 17:36:03 2010
From: mh+nagios-users at zugschlus.de (Marc Haber)
Date: Sun, 26 Dec 2010 17:36:03 +0100
Subject: NAGIOS_ environment variables in a notification script
In-Reply-To: <4D12B5B7.2010506@5dninja.net>
References: <20101222141705.GB24207@torres.zugschlus.de> <4D12B5B7.2010506@5dninja.net>
Message-ID: <20101226163603.GA24565@torres.zugschlus.de>

Hi Mike,

thanks for replying.

On Wed, Dec 22, 2010 at 06:36:39PM -0800, Mike Lindsey wrote:
> On 12/22/10 6:17 AM, Marc Haber wrote:
>> Despite having set enable_environment_macros=1 in my nagios.cfg, the
>> notification script only sees NAGIOS_PLUGIN=/path/bin/notify.
>>
>> What am I doing wrong?
>>
>> I'm using Nagios 3.0.6 from Debian lenny. Any hints will be appreciated.
> enable_environment_macros should override use_large_installation_tweaks,
> which is what can also disable environment macros. Perhaps your version
> is not acting as suspected? See if you have u_l_i_t enabled, and if so,
> try disabling it.

u_l_i_t is explicitly set to 0 in the Debian packages.

> If that isn't it, try setting debug_level=2 (and debug_file, etc).
> Restart and check the debug output to see if it's actually seeing the
> config directive. Perhaps you have a typo.
It doesn't comment on any configuration in the debug file.

> Then maybe set debug_level=32 and run a few notification tests (or just
> set it to 34 initially so you get notification and configuration
> debugging)...

I have set:
debug_level=34
debug_verbosity=2
debug_file=/var/lib/nagios3/nagios.debug
max_debug_file_size=1000000
but it is not too informative. Neither does it say how the configuration directives have been processed, nor does it comment on setting any environment variables (not even the NAGIOS_PLUGIN that _is_ passed to the environment). It does dump the raw notification command, but I know what Nagios does with that.

> Also, consider upgrading. Nagios 3.2+ is great.

At this time, I prefer to stick with what my distribution delivers and supports. Are there any security-relevant things that could convince me to take on the burden of doing my own security support, or is the environment issue fixed in Nagios 3.2?

Greetings
Marc

-- 
-----------------------------------------------------------------------------
Marc Haber         | "I don't trust Computers. They | Mail address in header
Mannheim, Germany  |  lose things."    Winona Ryder | Fon: *49 621 72739834
Nordisch by Nature |  How to make an American Quilt | Fax: *49 3221 2323190
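When the debug file says nothing about the environment, one way to see what a notification command really receives is to have it log its own environment. The following is a minimal sketch under that assumption; the function name dump_nagios_env and the log path in the usage comment are hypothetical, and only the NAGIOS_ prefix comes from the thread.

```shell
#!/bin/sh
# Append every NAGIOS_* environment variable visible to this process to a
# log file, so a notification command can be checked for missing macros.
# The function name is illustrative only.
dump_nagios_env() {
    out="$1"
    {
        # Timestamp header so repeated notifications can be told apart
        printf '=== %s ===\n' "$(date)"
        # Only the variables Nagios exports carry the NAGIOS_ prefix
        env | grep '^NAGIOS_' | sort
    } >> "$out"
}

# Called from the notification script, for example (hypothetical path):
#   dump_nagios_env /tmp/nagios-env.log
```

If the log shows only NAGIOS_PLUGIN, the variables are missing before the script starts, which points at the Nagios side rather than at the script.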
From talonx at gmail.com Mon Dec 27 08:04:59 2010
From: talonx at gmail.com (talonx at gmail.com)
Date: Mon, 27 Dec 2010 12:34:59 +0530
Subject: check_smtp doesn't support TLS?
Message-ID:

Hello,

I'm trying to set up the check_smtp plugin for a remote mail server. I downloaded the latest version of nagios-plugins, built and installed it. This is what I get when I invoke check_smtp from the command line (actual data removed):

> check_smtp -S -H -p -A LOGIN -U -P -v
HELOCMD: EHLO
220 mi9 ESMTP SG
sent AUTH LOGIN
received 530 5.7.0 Must issue a STARTTLS command first
sent QUIT
received 221 2.0.0 Bye
SMTP CRITICAL - invalid response received after AUTH LOGIN, 0.171 sec. response time, 221 2.0.0 Bye

I'm including the -S flag for it to use TLS. I couldn't see any other options that might be relevant. Any pointers?

Regards
Hrish
From Sebastian.Ries at dtnet.de Mon Dec 27 11:14:57 2010
From: Sebastian.Ries at dtnet.de (Sebastian Ries)
Date: Mon, 27 Dec 2010 11:14:57 +0100
Subject: Upgrade from Nagios 1.x
Message-ID: <1293444897.29774.14.camel@bofh.dtnet.de>

Hi

I have an old Nagios installation that makes heavy use of hostgroups. Hosts are arranged into these groups and contacts are assigned to the groups (which isn't allowed in Nagios 2 and newer). Does anyone know if there is a nice way to convert this config to the new format (contacts as part of the host, not part of the hostgroup)?

Regards
Sebastian Ries

-- 
------------------------------------------------------------
DT Netsolution GmbH - Talaeckerstr. 30 - D-70437 Stuttgart
Tel: +49-711-849910-36 Fax: +49-711-849910-936
WEB: http://www.dtnet.de/ email: Sebastian.Ries at dtnet.de
From diego.roccia at gmail.com Mon Dec 27 11:23:21 2010
From: diego.roccia at gmail.com (Diego Roccia)
Date: Mon, 27 Dec 2010 10:23:21 +0000
Subject: Upgrade from Nagios 1.x
In-Reply-To: <1293444897.29774.14.camel@bofh.dtnet.de>
References: <1293444897.29774.14.camel@bofh.dtnet.de>
Message-ID: <1923155408-1293445401-cardhu_decombobulator_blackberry.rim.net-1912366149-@b17.c7.bise7.blackberry>

I'd use host templates to replicate the configuration you need.

Sent from my BlackBerry® wireless device
From cbeattie at geninfo.com Mon Dec 27 15:55:13 2010
From: cbeattie at geninfo.com (Chris Beattie)
Date: Mon, 27 Dec 2010 09:55:13 -0500
Subject: check_smtp doesn't support TLS?
In-Reply-To:
References:
Message-ID: <4D18A8D1.9090808@geninfo.com>

talonx at gmail.com wrote:
> I'm trying to setup the check_smtp plugin for a remote mail server. I
> downloaded the latest version of nagios-plugins, built and installed
> it. This is what I get when I invoke check_smtp from the command line
> (actual data removed) -
>
> received 530 5.7.0 Must issue a STARTTLS command first
>
> I'm including the -S flag for it to use TLS. I couldn't see any other
> options that might be relevant. Any pointers?

I had something similar happen to me with SSL once. I compiled and installed the plugins, but I could not get SSL to work for the check_http command. I had installed the openssl package, but I had forgotten to install the openssl-devel package before I compiled the plugins.

When you ./configure the plugins, does it say "--with-gnutls: no" at the end? You may have to install the gnutls-devel package first.
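Besides reading the ./configure summary, an already installed plugin can be inspected with ldd, which lists the shared libraries a binary is linked against. The following is only a heuristic sketch: the function name links_tls_library and the plugin path in the usage comment are assumptions, and a statically linked build would not show up this way.

```shell
#!/bin/sh
# Read `ldd` output on stdin and succeed if it mentions an OpenSSL or
# GnuTLS shared library. The function name is illustrative only.
links_tls_library() {
    grep -Eq 'lib(ssl|gnutls)[^[:space:]]*\.so'
}

# Example (the plugin path is an assumption, adjust it to your install):
#   if ldd /usr/local/nagios/libexec/check_smtp | links_tls_library; then
#       echo "check_smtp is linked against a TLS library"
#   else
#       echo "no TLS library linked; install the -devel package and rebuild"
#   fi
```

Keeping the check as a filter on stdin means it can also be run against saved ldd output from another machine.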
Nothing in this message is intended to make or accept an offer or to form a contract, except that an attachment that is an image of a contract bearing the signature of an officer of our company may be or become a contract. This message (including any attachments) is intended only for the use of the individual or entity to whom it is addressed. It may contain information that is non-public, proprietary, privileged, confidential, and exempt from disclosure under applicable law or may constitute as attorney work product. If you are not the intended recipient, we hereby notify you that any use, dissemination, distribution, or copying of this message is strictly prohibited. If you have received this message in error, please notify us immediately by telephone and delete this message immediately. Thank you.

From MarkL at lmfj.com Mon Dec 27 16:42:37 2010
From: MarkL at lmfj.com (Mark A. Lappin)
Date: Mon, 27 Dec 2010 09:42:37 -0600
Subject: re-enable all notifications on hosts & services
Message-ID: <0227B653B3DC82438B8291BC5218612F6737768BE8@lmfjex07.lmfj.com>

We have notifications that were disabled via the web interface for hosts and for all services on those hosts. Dozens, it seems.
Is there an easy way to turn all notifications back on for all services and all hosts in one fell swoop? We had a lot of them turned off while we were tweaking commands and configuration, and I'm ready to have them all back on.

ML

Mark A. Lappin, CCNA, MCITP: Enterprise Administrator | Lee Michaels Fine Jewelry
Director of Information Technology
11314 Cloverland Ave | Baton Rouge, LA 70809
Ph: 225.291.9094 ext 245 | Fax: 225.368.3675 | Mobile: 225-362-2770
www.lmfj.com

This communication is privileged and confidential. If you are not the intended recipient, please notify the sender by reply e-mail and destroy all copies of this communication.

From cbeattie at geninfo.com Tue Dec 28 00:25:49 2010
From: cbeattie at geninfo.com (Chris Beattie)
Date: Mon, 27 Dec 2010 18:25:49 -0500
Subject: re-enable all notifications on hosts & services
In-Reply-To: <0227B653B3DC82438B8291BC5218612F6737768BE8@lmfjex07.lmfj.com>
References: <0227B653B3DC82438B8291BC5218612F6737768BE8@lmfjex07.lmfj.com>
Message-ID: <4D19207D.1030103@geninfo.com>

Mark A. Lappin wrote:
> interface for hosts and all services on hosts. Dozens it seems. Is
> there an easy way to turn all notifications back on for all services and
> all hosts in one fell swoop? We had a lot of them turned off while we

You can use Nagios' external commands to do that.

http://old.nagios.org/developerinfo/externalcommands/commandlist.php

If I didn't have to be too careful, I'd do something like this (apologies for the line wrapping):

#!/bin/sh
# Re-enable notifications for every host, and for all services on each
# host, by writing external commands to the Nagios command file.
now=`date +%s`
commandfile='/usr/local/nagios/var/rw/nagios.cmd'
statusfile='/usr/local/nagios/var/status.dat'

# Loop over the unique host names listed in status.dat
for i in `grep host_name $statusfile | sort --unique | sed "s/\thost_name=//"`
do
    /bin/printf "[%lu] ENABLE_HOST_NOTIFICATIONS;$i\n" $now > $commandfile
    /bin/printf "[%lu] ENABLE_HOST_SVC_NOTIFICATIONS;$i\n" $now > $commandfile
done
From it.toonz at gmail.com Tue Dec 28 13:03:53 2010
From: it.toonz at gmail.com (Toonz IT)
Date: Tue, 28 Dec 2010 17:33:53 +0530
Subject: monitoring windows event viewer.
Message-ID:

Is it possible to monitor specific event IDs, such as disk errors, from the Windows Event Viewer logs? We recently had a server hard disk error and we detected it a bit late! :-( Please let us know.

At present we just have basic monitoring like ping, disk space usage, etc. We are using FAN 2.0.

anth!
From m.pinotti at cineca.it Tue Dec 28 12:44:31 2010
From: m.pinotti at cineca.it (Maurizio Pinotti)
Date: Tue, 28 Dec 2010 12:44:31 +0100
Subject: Scheduling Queue stucked a few minutes after restart
Message-ID: <4D19CD9F.8050304@cineca.it>

Hello,

I have a really odd issue running Nagios: a few minutes after starting, the scheduling queue seems to freeze and no more active checks are performed. The queue remains stuck for hours until I manually restart Nagios. Passive checks are processed normally.

I'm running Nagios 3.0.6 (deb package) on a Debian lenny system. The hardware is an 8-core Xeon CPU with 16GB RAM. Nagios is monitoring about 1K hosts and 10K services.

Reverting the configuration to the "last known good configuration" did not help, and neither did rebooting the server or several Nagios restarts and reloads.

Fixes already tried:
- disabled all active host checks
- increased ulimit for the nagios user
- disabled all event handlers
- disabled all "obsess" stuff

Any help or hint would be appreciated.
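One way to confirm that active checks have stopped, rather than merely slowed down, is to count the entries in status.dat whose next_check time is already well in the past. The following is a rough sketch under the assumption that status.dat stores one tab-indented next_check=<epoch> line per host and service entry, as Nagios 3 does; the function name count_overdue_checks and the 600-second grace period are illustrative choices.

```shell
#!/bin/sh
# Count status.dat entries whose next_check time lies more than `grace`
# seconds before `now`. A count that keeps growing between runs suggests
# that active checks are no longer being executed.
# The function name is illustrative only.
count_overdue_checks() {
    statusfile="$1"
    now="$2"
    grace="$3"
    awk -F= -v now="$now" -v grace="$grace" '
        # status.dat lines look like "<tab>next_check=1293540271";
        # a value of 0 means no check is scheduled, so skip it
        $1 ~ /^[ \t]*next_check$/ && $2 > 0 && $2 + 0 < now - grace { n++ }
        END { print n + 0 }
    ' "$statusfile"
}

# Example, using the status_file path from the configuration in this message:
#   count_overdue_checks /nagios_fe/var/cache/nagios3/status.dat "$(date +%s)" 600
```

Running it a few times a minute apart shows whether the backlog is draining or frozen.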
nagios.cfg follows
*******************
log_file=/nagios_fe/var/log/nagios3/nagios.log
cfg_file=/etc/nagios3/commands.cfg
cfg_dir=/etc/nagios-plugins/config
cfg_dir=/nagios_fe/etc/cmon/nagios3
cfg_dir=/nagios_fe/etc/nagiosgrapher/nagios3
object_cache_file=/nagios_fe/var/cache/nagios3/objects.cache
precached_object_file=/nagios_fe/var/lib/nagios3/objects.precache
resource_file=/nagios_fe/etc/cmon/nagios3/macros.res
status_file=/nagios_fe/var/cache/nagios3/status.dat
status_update_interval=10
nagios_user=nagios
nagios_group=nagios
check_external_commands=1
command_check_interval=-1
command_file=/nagios_fe/var/lib/nagios3/rw/nagios.cmd
external_command_buffer_slots=4096
lock_file=/nagios_fe/var/run/nagios3/nagios3.pid
temp_file=/nagios_fe/var/cache/nagios3/nagios.tmp
temp_path=/tmp
event_broker_options=-1
log_rotation_method=d
log_archive_path=/nagios_fe/var/log/nagios3/archives
use_syslog=0
log_notifications=1
log_service_retries=0
log_host_retries=0
log_event_handlers=1
log_initial_states=0
log_external_commands=1
log_passive_checks=0
service_inter_check_delay_method=s
max_service_check_spread=30
service_interleave_factor=s
host_inter_check_delay_method=s
max_host_check_spread=30
max_concurrent_checks=0
check_result_reaper_frequency=10
max_check_result_reaper_time=30
check_result_path=/nagios_fe/var/lib/nagios3/spool/checkresults
max_check_result_file_age=3600
cached_host_check_horizon=15
cached_service_check_horizon=15
enable_predictive_host_dependency_checks=1
enable_predictive_service_dependency_checks=1
soft_state_dependencies=0
auto_reschedule_checks=0
auto_rescheduling_interval=30
auto_rescheduling_window=180
sleep_time=0.25
service_check_timeout=60
host_check_timeout=30
event_handler_timeout=30
notification_timeout=30
ocsp_timeout=5
perfdata_timeout=5
retain_state_information=1
state_retention_file=/nagios_fe/var/lib/nagios3/retention.dat
retention_update_interval=60
use_retained_program_state=1
use_retained_scheduling_info=1
retained_host_attribute_mask=0
retained_service_attribute_mask=0
retained_process_host_attribute_mask=0
retained_process_service_attribute_mask=0
retained_contact_host_attribute_mask=0
retained_contact_service_attribute_mask=0
interval_length=60
use_aggressive_host_checking=0
execute_service_checks=1
accept_passive_service_checks=1
execute_host_checks=1
accept_passive_host_checks=1
enable_notifications=1
enable_event_handlers=0
process_performance_data=1
service_perfdata_file=/nagios_fe/var/lib/nagiosgrapher/ngraph.pipe
service_perfdata_file_template=$HOSTNAME$\t$SERVICEDESC$\t$SERVICEOUTPUT$\t$SERVICEPERFDATA$\t$TIMET$\n
service_perfdata_file_mode=a
service_perfdata_file_processing_interval=5
service_perfdata_file_processing_command=ngraph-process-service-perfdata-pipe
obsess_over_services=0
obsess_over_hosts=0
translate_passive_host_checks=0
passive_host_checks_are_soft=0
check_for_orphaned_services=1
check_for_orphaned_hosts=1
check_service_freshness=1
service_freshness_check_interval=60
check_host_freshness=0
host_freshness_check_interval=60
additional_freshness_latency=15
enable_flap_detection=1
low_service_flap_threshold=5.0
high_service_flap_threshold=20.0
low_host_flap_threshold=5.0
high_host_flap_threshold=20.0
date_format=euro
p1_file=/usr/lib/nagios3/p1.pl
enable_embedded_perl=0
use_embedded_perl_implicitly=1
illegal_object_name_chars=`~!$%^&*|'"<>?,()=
illegal_macro_output_chars=`~$|'"<>
use_regexp_matching=0
use_true_regexp_matching=0
admin_email=root at localhost
admin_pager=pageroot at localhost
daemon_dumps_core=0
use_large_installation_tweaks=1
enable_environment_macros=0
debug_level=144
debug_verbosity=1
debug_file=/nagios_fe/var/log/nagios3/nagios.debug
max_debug_file_size=2000000000
*******************
From Sebastian.Ries at dtnet.de Tue Dec 28 14:23:45 2010
From: Sebastian.Ries at dtnet.de (Sebastian Ries)
Date: Tue, 28 Dec 2010 14:23:45 +0100
Subject: Plugin: nagiosVMware
Message-ID: <1293542625.8847.27.camel@bofh.dtnet.de>

Hi

I am trying to configure the nagiosVMware plugin I found at
https://www.monitoringexchange.org/inventory/Check-Plugins/Virtualization/VMWare-%2528ESX%2529/nagiosVMware

Generally it works, but I found that most of the time the CPU and memory checks give no result:

[nagios at nagios-check-vc ~]$ ./nsca_vmware.pl 5min
Processing host esx19.dtnet.de
... overcommit -1, cpuload -1 on 0 cpus, memtot -1, memfree -1
esx19 ESX-OVERCMMT 3 Memory overcommitment -1
esx19 ESX-CPU-LOAD 3 CPU load average -1 on 0 CPUs
esx19 ESX-MEMORY 3 Memory use -1 total -1 free
... mem/cpu took 8 seconds
... host esx19 took 8 seconds
Processing host esx20.dtnet.de

But sometimes it works:

[nagios at nagios-check-vc ~]$ ./nsca_vmware.pl 5min
Processing host esx19.dtnet.de
... overcommit 0.00, cpuload 0.19 on 8 cpus, memtot 32766, memfree 11680
esx19 ESX-OVERCMMT 0 Memory overcommitment 0%
esx19 ESX-CPU-LOAD 0 CPU load average 19% on 8 CPUs
esx19 ESX-MEMORY 0 Memory use 64% (21086 of 32766 MB used)
... mem/cpu took 8 seconds
... host esx19 took 8 seconds

The effect is the same with all ESX servers. All are ESX 4.0.

In the ....cmd file I get this error:

[2010-12-28 14:17:16.275 7324B90 warning 'App'] Closing Response processing in unexpected state: 3

Has anyone managed to run this without errors?

s_teeter: are you on this list?
(I did not find an email address)

Regards
Sebastian Ries

-- 
------------------------------------------------------------
DT Netsolution GmbH - Talaeckerstr. 30 - D-70437 Stuttgart
Tel: +49-711-849910-36 Fax: +49-711-849910-936
WEB: http://www.dtnet.de/ email: Sebastian.Ries at dtnet.de

From cbeattie at geninfo.com Tue Dec 28 15:13:14 2010
From: cbeattie at geninfo.com (Chris Beattie)
Date: Tue, 28 Dec 2010 09:13:14 -0500
Subject: Scheduling Queue stucked a few minutes after restart
In-Reply-To: <4D19CD9F.8050304@cineca.it>
References: <4D19CD9F.8050304@cineca.it>
Message-ID: <4D19F07A.1060306@geninfo.com>

Maurizio Pinotti wrote:
> I have a really odd issue running Nagios: a few minutes after starting the
> scheduling queue seems to freeze and no more active checks are performed. The
> queue remains stuck for hours until I have to manually restart Nagios.
>
> I'm running Nagios 3.0.6 (deb package) on a Debian lenny system. The hardware is

I think that is a bug in that version of Nagios. I had the same problem. It got fixed, but I still go look at my service checks every morning to make sure. Also, I see where the server guys acknowledge problems and then forget about them, heh heh.

There is a much newer version of Nagios available in lenny-backports.
I would give it a shot if you can.

http://packages.debian.org/source/lenny-backports/backports/nagios3

-- 
-Chris

From dmitry.leonenko at gmail.com Tue Dec 28 15:19:21 2010
From: dmitry.leonenko at gmail.com (Дмитрий Леоненко)
Date: Tue, 28 Dec 2010 16:19:21 +0200
Subject: Host/service escalation notification
Message-ID:

Hi.

I want to create several levels of host and service escalation. Say 3 levels.
In the notification I want to know at which escalation level this particular notification occurred. I can't find any variable reflecting the escalation level.

Thanks a lot!
Dmytro Leonenko

From cbeattie at geninfo.com Tue Dec 28 15:31:41 2010
From: cbeattie at geninfo.com (Chris Beattie)
Date: Tue, 28 Dec 2010 09:31:41 -0500
Subject: monitoring windows event viewer.
In-Reply-To:
References:
Message-ID: <4D19F4CD.7040408@geninfo.com>

Toonz IT wrote:
> Is it possible to monitor specific event ids like disk error, fro
> windows event viewer logs??

Yes, but you may have to use the NSClient++ agent on your Windows boxes and create custom commands to do it.

http://nsclient.org/nscp/wiki/CheckEventLog/CheckEventLog

Unfortunately, I deleted the Windows event log checks after I didn't need them any more, so I don't have a working example configuration to show you.

-- 
-Chris
From polifemos at conedsolutions.com Tue Dec 28 15:37:57 2010
From: polifemos at conedsolutions.com (Polifemo, Salvatore)
Date: Tue, 28 Dec 2010 09:37:57 -0500
Subject: monitoring windows event viewer.
In-Reply-To: <4D19F4CD.7040408@geninfo.com>
References: <4D19F4CD.7040408@geninfo.com>
Message-ID: <5BE7D0404F28DC44B780100927AA4CB018664C5C@whplex3.int.cecdes.net>

If you will be monitoring event logs, you may want to look at applications made for monitoring event logs. One application that works well for us is syslog-ng.

Salvatore Polifemo
Sr. Systems Security Specialist
ConEdison Solutions
100 Summit Lake Drive
Valhalla, NY 10595
------------------------------------------------------------------------ ------ Learn how Oracle Real Application Clusters (RAC) One Node allows customers to consolidate database storage, standardize their database environment, and, should the need arise, upgrade to a full multi-node Oracle RAC database without downtime or disruption http://p.sf.net/sfu/oracle-sfdevnl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ Learn how Oracle Real Application Clusters (RAC) One Node allows customers to consolidate database storage, standardize their database environment, and, should the need arise, upgrade to a full multi-node Oracle RAC database without downtime or disruption http://p.sf.net/sfu/oracle-sfdevnl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mh+nagios-users at zugschlus.de Tue Dec 28 15:38:28 2010 From: mh+nagios-users at zugschlus.de (Marc Haber) Date: Tue, 28 Dec 2010 15:38:28 +0100 Subject: Host/service escalation notification In-Reply-To: References: Message-ID: <20101228143828.GA25548@torres.zugschlus.de> On Tue, Dec 28, 2010 at 04:19:21PM +0200, ??????? ???????? wrote: > I want to create several levels of host and service escalation. Say 3 > levels. In notification I want to know on which escalation level this > particular notification occurred. Can't find any variable reflecting > escalation level. 
Escalation levels are connected to the notification number, and escalations can be kind of orthogonal. Do the notification number macros ($SERVICENOTIFICATIONNUMBER$ and/or $HOSTNOTIFICATIONNUMBER$) do the job?

Greetings
Marc

--
-----------------------------------------------------------------------------
Marc Haber         | "I don't trust Computers. They | Mail address in header
Mannheim, Germany  |  lose things."    Winona Ryder | Fon: *49 621 72739834
Nordisch by Nature |  How to make an American Quilt | Fax: *49 3221 2323190

From daniel.wittenberg.r0ko at statefarm.com Tue Dec 28 15:46:05 2010
From: daniel.wittenberg.r0ko at statefarm.com (Daniel Wittenberg)
Date: Tue, 28 Dec 2010 07:46:05 -0700
Subject: monitoring windows event viewer.
In-Reply-To: <5BE7D0404F28DC44B780100927AA4CB018664C5C@whplex3.int.cecdes.net>
References: <4D19F4CD.7040408@geninfo.com> <5BE7D0404F28DC44B780100927AA4CB018664C5C@whplex3.int.cecdes.net>
Message-ID: <31B0FE0A1A8166409E9DF35C6DEECB240555C774@WPSCV6MM.OPR.STATEFARM.ORG>

Doesn't syslog-ng just consolidate the logs? It doesn't really monitor anything, right?
Dan

-----Original Message-----
From: Polifemo, Salvatore [mailto:polifemos at conedsolutions.com]
Sent: Tuesday, December 28, 2010 8:38 AM
To: Nagios Users List
Subject: Re: [Nagios-users] monitoring windows event viewer.

If you will be monitoring event logs, you may want to look at applications made for monitoring event logs. One application that works well for us is syslog-ng.

Salvatore Polifemo
Sr. Systems Security Specialist
ConEdison Solutions
100 Summit Lake Drive
Valhalla, NY 10595

-----Original Message-----
From: Chris Beattie [mailto:cbeattie at geninfo.com]
Sent: Tuesday, December 28, 2010 9:32 AM
To: Nagios Users List
Subject: Re: [Nagios-users] monitoring windows event viewer.

Toonz IT wrote:
> Is it possible to monitor specific event ids like disk error, from
> windows event viewer logs??

Yes, but you may have to use the NSClient++ agent on your Windows boxes and create custom commands to do it.

http://nsclient.org/nscp/wiki/CheckEventLog/CheckEventLog

Unfortunately, I deleted the Windows event log checks after I didn't need them any more, so I don't have a working example configuration to show you.

--
-Chris

From polifemos at conedsolutions.com Tue Dec 28 15:56:33 2010
From: polifemos at conedsolutions.com (Polifemo, Salvatore)
Date: Tue, 28 Dec 2010 09:56:33 -0500
Subject: monitoring windows event viewer.
In-Reply-To: <31B0FE0A1A8166409E9DF35C6DEECB240555C774@WPSCV6MM.OPR.STATEFARM.ORG>
References: <4D19F4CD.7040408@geninfo.com> <5BE7D0404F28DC44B780100927AA4CB018664C5C@whplex3.int.cecdes.net> <31B0FE0A1A8166409E9DF35C6DEECB240555C774@WPSCV6MM.OPR.STATEFARM.ORG>
Message-ID: <5BE7D0404F28DC44B780100927AA4CB018664C5F@whplex3.int.cecdes.net>

One use of syslog is to set up rules and then take an action. We look for an error and then send out an email. Take a look at the syslog-ng forum.

Salvatore Polifemo
Sr. Systems Security Specialist
ConEdison Solutions
100 Summit Lake Drive
Valhalla, NY 10595

-----Original Message-----
From: Daniel Wittenberg [mailto:daniel.wittenberg.r0ko at statefarm.com]
Sent: Tuesday, December 28, 2010 9:46 AM
To: Nagios Users List
Subject: Re: [Nagios-users] monitoring windows event viewer.

Doesn't syslog-ng just consolidate the logs? It doesn't really monitor anything, right?
Dan

-----Original Message-----
From: Polifemo, Salvatore [mailto:polifemos at conedsolutions.com]
Sent: Tuesday, December 28, 2010 8:38 AM
To: Nagios Users List
Subject: Re: [Nagios-users] monitoring windows event viewer.

If you will be monitoring event logs, you may want to look at applications made for monitoring event logs. One application that works well for us is syslog-ng.

Salvatore Polifemo
Sr. Systems Security Specialist
ConEdison Solutions
100 Summit Lake Drive
Valhalla, NY 10595

-----Original Message-----
From: Chris Beattie [mailto:cbeattie at geninfo.com]
Sent: Tuesday, December 28, 2010 9:32 AM
To: Nagios Users List
Subject: Re: [Nagios-users] monitoring windows event viewer.

Toonz IT wrote:
> Is it possible to monitor specific event ids like disk error, from
> windows event viewer logs??

Yes, but you may have to use the NSClient++ agent on your Windows boxes and create custom commands to do it.

http://nsclient.org/nscp/wiki/CheckEventLog/CheckEventLog

Unfortunately, I deleted the Windows event log checks after I didn't need them any more, so I don't have a working example configuration to show you.

--
-Chris

From stanb at panix.com Tue Dec 28 23:02:08 2010
From: stanb at panix.com (stan)
Date: Tue, 28 Dec 2010 17:02:08 -0500
Subject: Disapering popup windows
Message-ID: <20101228220208.GA5294@teddy.fas.com>

I have a 3.1.0 instnace which ahs been in service for a long time. Over the
years, we have had some issues with the popup windows in the browser
flashing up for a fraction of a second and disapering. I have always
chrged this off to wierd browser behavior, however, I am now setting up a
child instance, and it is at 3.2.0.
Today I observed this ebhavior on the same browser, running on the smae
machine in the 3.1.0 instnace, but not in the 3.2.0 instnace. I am
reluctnat to upgrade the 3,1,0 instnace, as it is failry big, and in
production. Does nayone have any thoughts as to what might have chnaged
between these 2 versions that fixed this?

--
A: Because it messes up the order in which people normally read text.
Q: Why is top-posting such a bad thing?
A: Top-posting.
Q: What is the most annoying thing in e-mail?

From Sebastian.Ries at dtnet.de Wed Dec 29 11:07:17 2010
From: Sebastian.Ries at dtnet.de (Sebastian Ries)
Date: Wed, 29 Dec 2010 11:07:17 +0100
Subject: Plugin: nagiosVMware
In-Reply-To: <1293542625.8847.27.camel@bofh.dtnet.de>
References: <1293542625.8847.27.camel@bofh.dtnet.de>
Message-ID: <1293617237.20786.11.camel@bofh.dtnet.de>

Hi

> I am trying to Configure the nagiosVMware plugin I found on
> https://www.monitoringexchange.org/inventory/Check-Plugins/Virtualization/VMWare-%2528ESX%2529/nagiosVMware
> This is the same effect with all ESX servers. All are ESX4.0
>
> in the ....cmd file I get this error
> [2010-12-28 14:17:16.275 7324B90 warning 'App'] Closing Response
> processing in unexpected state: 3

OK, I found that this error is printed to stdout by resxtop, and therefore the plugin runs into an error while parsing the output.

As a workaround I put:
|grep -v \"Closing Response processing in unexpected state: 3\"
into the command within nsca_vmware.pl, so for me the problem is solved ;-)

Regards
Sebastian Ries

--
------------------------------------------------------------
DT Netsolution GmbH - Talaeckerstr. 30 - D-70437 Stuttgart
Tel: +49-711-849910-36 Fax: +49-711-849910-936
WEB: http://www.dtnet.de/ email: Sebastian.Ries at dtnet.de

From patrick.morris at hp.com Wed Dec 29 11:19:35 2010
From: patrick.morris at hp.com (Patrick Morris)
Date: Wed, 29 Dec 2010 02:19:35 -0800
Subject: Disapering popup windows
In-Reply-To: <20101228220208.GA5294@teddy.fas.com>
References: <20101228220208.GA5294@teddy.fas.com>
Message-ID: <4D1B0B37.5030808@hp.com>

On 12/28/2010 2:02 PM, stan wrote:
> I have a 3.1.0 instnace which ahs been in service for a long time. Over the
> years, we have had some issues with the popup windows in the browser
> flashing up for a fraction of a second and disapering. I have always
> chrged this off to wierd browser behavior, however, I am now setting up a
> child instance, and it is at 3.2.0.
>
> Today I observed this ebhavior on the same browser, running on the smae
> machine in the 3.1.0 instnace, but not in the 3.2.0 instnace. I am
> reluctnat to upgrade the 3,1,0 instnace, as it is failry big, and in
> production. Does nayone have any thoughts as to what might have chnaged
> between these 2 versions that fixed this?

Popup windows? I've never seen Nagios use them.

From diego.roccia at gmail.com Wed Dec 29 11:31:32 2010
From: diego.roccia at gmail.com (diego.roccia at gmail.com)
Date: Wed, 29 Dec 2010 11:31:32 +0100
Subject: Disapering popup windows
In-Reply-To: <20101228220208.GA5294@teddy.fas.com>
References: <20101228220208.GA5294@teddy.fas.com>
Message-ID:

Do you mean actual popup windows or overlay frames, like the ones used, for example, by pnp4nagios to embed graphs?

On Tue, Dec 28, 2010 at 11:02 PM, stan wrote:
> I have a 3.1.0 instnace which ahs been in service for a long time. Over the
> years, we have had some issues with the popup windows in the browser
> flashing up for a fraction of a second and disapering. I have always
> chrged this off to wierd browser behavior, however, I am now setting up a
> child instance, and it is at 3.2.0.
>
> Today I observed this ebhavior on the same browser, running on the smae
> machine in the 3.1.0 instnace, but not in the 3.2.0 instnace. I am
> reluctnat to upgrade the 3,1,0 instnace, as it is failry big, and in
> production. Does nayone have any thoughts as to what might have chnaged
> between these 2 versions that fixed this?
>
> --
> A: Because it messes up the order in which people normally read text.
> Q: Why is top-posting such a bad thing?
> A: Top-posting.
> Q: What is the most annoying thing in e-mail?
--
Diego Roccia
diego.roccia (at) gmail (dot) com

From diego.roccia at gmail.com Wed Dec 29 11:32:07 2010
From: diego.roccia at gmail.com (diego.roccia at gmail.com)
Date: Wed, 29 Dec 2010 11:32:07 +0100
Subject: Disapering popup windows
In-Reply-To:
References: <20101228220208.GA5294@teddy.fas.com>
Message-ID:

On Wed, Dec 29, 2010 at 11:31 AM, diego.roccia at gmail.com wrote:
> Do you mean actual popup windows or overlay frames, like the ones
> used, for example, by pnp4nagios to embed graphs?
>
> On Tue, Dec 28, 2010 at 11:02 PM, stan wrote:
>> I have a 3.1.0 instnace which ahs been in service for a long time. Over the
>> years, we have had some issues with the popup windows in the browser
>> flashing up for a fraction of a second and disapering. I have always
>> chrged this off to wierd browser behavior, however, I am now setting up a
>> child instance, and it is at 3.2.0.
>>
>> Today I observed this ebhavior on the same browser, running on the smae
>> machine in the 3.1.0 instnace, but not in the 3.2.0 instnace. I am
>> reluctnat to upgrade the 3,1,0 instnace, as it is failry big, and in
>> production. Does nayone have any thoughts as to what might have chnaged
>> between these 2 versions that fixed this?
>>
>> --
>> A: Because it messes up the order in which people normally read text.
>> Q: Why is top-posting such a bad thing?
>> A: Top-posting.
>> Q: What is the most annoying thing in e-mail?
>
> --
> Diego Roccia
> diego.roccia (at) gmail (dot) com

Oops, sorry for the top posting :)

--
Diego Roccia
diego.roccia (at) gmail (dot) com

From hvdkooij at vanderkooij.org Wed Dec 29 12:05:34 2010
From: hvdkooij at vanderkooij.org (Hugo van der Kooij)
Date: Wed, 29 Dec 2010 12:05:34 +0100
Subject: Disapering popup windows
In-Reply-To: <20101228220208.GA5294@teddy.fas.com>
References: <20101228220208.GA5294@teddy.fas.com>
Message-ID: <4d2a6d2c87528eb7d1934ffa337c5d78@vps517.directvps.nl>

On Tue, 28 Dec 2010 17:02:08 -0500, stan wrote:
> I have a 3.1.0 instnace which ahs been in service for a long time.

Please elaborate, and please fix your output before sending something to a mailing list with plenty of errors that make your message hard to read.

Hugo.

--
hvdkooij at vanderkooij.org http://hugo.vanderkooij.org/
PGP/GPG? Use: http://hugo.vanderkooij.org/0x58F19981.asc

From it.toonz at gmail.com Wed Dec 29 12:21:17 2010
From: it.toonz at gmail.com (Toonz IT)
Date: Wed, 29 Dec 2010 16:51:17 +0530
Subject: monitoring windows event viewer.
In-Reply-To: <4D19F4CD.7040408@geninfo.com>
References: <4D19F4CD.7040408@geninfo.com>
Message-ID:

Thank You, exactly what we wanted!! :-)

On Tue, Dec 28, 2010 at 8:01 PM, Chris Beattie wrote:
> Toonz IT wrote:
> > Is it possible to monitor specific event ids like disk error, from
> > windows event viewer logs??
>
> Yes, but you may have to use the NSClient++ agent on your Windows boxes
> and create custom commands to do it.
>
> http://nsclient.org/nscp/wiki/CheckEventLog/CheckEventLog
>
> Unfortunately, I deleted the Windows event log checks after I didn't
> need them any more, so I don't have a working example configuration to
> show you.
>
> --
> -Chris

From stanb at panix.com Wed Dec 29 15:15:30 2010
From: stanb at panix.com (stan)
Date: Wed, 29 Dec 2010 09:15:30 -0500
Subject: Disapering popup windows
In-Reply-To:
References: <20101228220208.GA5294@teddy.fas.com>
Message-ID: <20101229141530.GB25090@teddy.fas.com>

On Wed, Dec 29, 2010 at 11:31:32AM +0100, diego.roccia at gmail.com wrote:
> Do you mean actual popup windows or overlay frames, like the ones
> used, for example, by pnp4nagios to embed graphs?

I think I really mean pulldowns, such as you get when selecting an object type in the view configuration page.

--
A: Because it messes up the order in which people normally read text.
Q: Why is top-posting such a bad thing?
A: Top-posting.
Q: What is the most annoying thing in e-mail?
------------------------------------------------------------------------------ Learn how Oracle Real Application Clusters (RAC) One Node allows customers to consolidate database storage, standardize their database environment, and, should the need arise, upgrade to a full multi-node Oracle RAC database without downtime or disruption http://p.sf.net/sfu/oracle-sfdevnl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From stanb at panix.com Wed Dec 29 15:14:27 2010 From: stanb at panix.com (stan) Date: Wed, 29 Dec 2010 09:14:27 -0500 Subject: Disapering popup windows In-Reply-To: <4D1B0B37.5030808@hp.com> References: <20101228220208.GA5294@teddy.fas.com> <4D1B0B37.5030808@hp.com> Message-ID: <20101229141427.GA25090@teddy.fas.com> On Wed, Dec 29, 2010 at 02:19:35AM -0800, Patrick Morris wrote: > On 12/28/2010 2:02 PM, stan wrote: > > I have a 3.1.0 instnace which ahs been in service for a long time. Over the > > years, we have had some issues with the popup windows in the browser > > flashing up for a fraction of a second and disapering. I have always > > chrged this off to wierd browser behavior, however, I am now setting up a > > child instance, and it is at 3.2.0. > > > > Today I observed this ebhavior on the same browser, running on the smae > > machine in the 3.1.0 instnace, but not in the 3.2.0 instnace. I am > > reluctnat to upgrade the 3,1,0 instnace, as it is failry big, and in > > production. Does nayone have any thoughts as to what might have chnaged > > between these 2 versions that fixed this? > > Popup windows? I've never seen Nagios use them. > OK, perhaps my termionology s imprecise. 
Here is an example. From the main page go to

Configuration -> View Config

At that point you have a pulldown for Object type. On the old system, it
flashes up for less than a second and then disappears.

Make more sense now?

--
A: Because it messes up the order in which people normally read text.
Q: Why is top-posting such a bad thing?
A: Top-posting.
Q: What is the most annoying thing in e-mail?

From a31modela at hotmail.com Wed Dec 29 16:51:37 2010
From: a31modela at hotmail.com (steve f)
Date: Wed, 29 Dec 2010 10:51:37 -0500
Subject: A Question on Console Alerts for a Help Desk
Message-ID:

I have a Nagios 3.x configuration set up and will eventually have a very
large distributed configuration. So far in the test environment,
everything is working fine.

Here is a question....

If someone on our Help Desk is working on a server (one that has not
alerted in any way in Nagios) and decides to restart a service that Nagios
monitors, the restart will most likely cause an alert to show on the
Nagios web status screen. My concern is that someone on the other side of
the Help Desk doesn't know that someone is in the box, sees the alert on
the monitor screen, and dials into the box to see what is going on,
effectively having 2 people working on the box.
For this scenario I am concerned only with the monitoring screen, not
with any notifications.

I know that the initial person could go into Nagios and schedule downtime
for the host or service before they do anything, but in our environment
that would not always be possible.

The Help Desk uses a menu to stop & restart services, etc. Is it feasible
(realistic) for the menu command to add a snippet of code to put the
Nagios check for that service/host in ack mode for, say, 5 minutes, so
anyone who sees the alert on the screen would know that it's being
addressed?

Is there an easier way to do this?

FWIW, we don't do ANYTHING easy here..... :)

Thanks,
Steve

From jim at jimavery.me.uk Wed Dec 29 17:18:39 2010
From: jim at jimavery.me.uk (Jim Avery)
Date: Wed, 29 Dec 2010 16:18:39 +0000
Subject: A Question on Console Alerts for a Help Desk
In-Reply-To:
References:
Message-ID:

On 29 December 2010 15:51, steve f wrote:
> The Help Desk uses a menu to stop & restart services, etc. Is it feasible
> (realistic) for the menu command to add a snippet of code to put the nagios
> check for that service/host
> in ack mode for say 5 minutes so anyone who
> sees the alert on the screen would know that it's being addressed?
>
> Is there an easier way to do this?

There isn't an easy way to ack it for 5 minutes, as you can't ack it
until it's in a hard state. I would instead get your menu script to
schedule downtime in Nagios for that service for 5 minutes by
submitting an external command to Nagios. I would say this is a
feasible and realistic thing to do, yes. But I don't know how you
operate your distributed Nagios setup. So long as only one of your
Nagios servers is used for the web front end, this should be easy. If
various people use various of the distributed Nagios servers for their
web front end, it could be a challenge setting up your menu system to
submit the external command to the right server.

See:

http://nagios.sourceforge.net/docs/3_0/extcommands.html

hth,

Jim

From pitchfork at ederdrom.de Wed Dec 29 18:10:17 2010
From: pitchfork at ederdrom.de (Joerg Linge)
Date: Wed, 29 Dec 2010 18:10:17 +0100
Subject: A Question on Console Alerts for a Help Desk
In-Reply-To:
References:
Message-ID: <4D1B6B79.6090602@ederdrom.de>

Jim Avery wrote:
> On 29 December 2010 15:51, steve f wrote:
>
>> The Help Desk uses a menu to stop & restart services, etc.
>> Is it feasible (realistic) for the menu command to add a snippet of
>> code to put the nagios check for that service/host in ack mode for say
>> 5 minutes so anyone who sees the alert on the screen would know that
>> it's being addressed?
>>
>> Is there an easier way to do this?
>
> There isn't an easy way to ack it for 5 minutes, as you can't ack it
> until it's in a hard state. I would instead get your menu script to
> schedule downtime in Nagios for that service for 5 minutes by
> submitting an external command to Nagios. I would say this is a
> feasible and realistic thing to do, yes. But .. I don't know how you
> operate your distributed Nagios setup. So long as only one of your
> Nagios servers is used for the web front-end this should be easy. If
> various people use various of the distributed Nagios servers for their
> web front end it could be a challenge setting up your menu system to
> submit the external command to the right server.
>
> See:
>
> http://nagios.sourceforge.net/docs/3_0/extcommands.html

Instead of scheduling downtime, it's also possible to reschedule the next
service check to now + 15 minutes, for example.

http://old.nagios.org/developerinfo/externalcommands/commandinfo.php?command_id=30

Joerg
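
The downtime approach Jim describes could be sketched roughly as follows.
This is a minimal sketch, not a tested recipe: it assumes the command file
is /usr/local/nagios/var/rw/nagios.cmd (check command_file in nagios.cfg),
the helper name downtime_cmd is made up for illustration, and the host,
service and author in the usage comment are hypothetical. The field order
of SCHEDULE_SVC_DOWNTIME follows the external command documentation linked
in the thread.

```shell
#!/bin/sh
# Sketch only: the command-file path is an assumption, not a value from
# this thread. Override NAGIOS_CMD_FILE to match command_file in nagios.cfg.
NAGIOS_CMD_FILE=${NAGIOS_CMD_FILE:-/usr/local/nagios/var/rw/nagios.cmd}

# downtime_cmd HOST SERVICE MINUTES AUTHOR COMMENT [NOW]
# Prints one SCHEDULE_SVC_DOWNTIME line (fixed downtime, no trigger).
# NOW is seconds since the epoch; it defaults to the current time.
downtime_cmd() {
    dt_now=${6:-$(date +%s)}
    dt_len=$(( $3 * 60 ))
    dt_end=$(( dt_now + dt_len ))
    # [time] COMMAND;host;service;start;end;fixed;trigger_id;duration;author;comment
    printf '[%s] SCHEDULE_SVC_DOWNTIME;%s;%s;%s;%s;1;0;%s;%s;%s\n' \
        "$dt_now" "$1" "$2" "$dt_now" "$dt_end" "$dt_len" "$4" "$5"
}

# From the help desk menu, just before restarting the service
# (hypothetical host, service and author):
#   downtime_cmd web01 HTTP 5 helpdesk "restart from menu" > "$NAGIOS_CMD_FILE"
```

The menu script needs write access to the command file, and in a
distributed setup it has to write to the server whose web front end the
Help Desk is watching, as Jim points out.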

From stanb at panix.com Wed Dec 29 19:36:51 2010
From: stanb at panix.com (stan)
Date: Wed, 29 Dec 2010 13:36:51 -0500
Subject: A newbie configuration question
Message-ID: <20101229183651.GA31320@teddy.fas.com>

I am trying to rationalize a couple of Nagios configurations that have
2 different histories, and they seem more different than I think they should
be. So, first I want to get the big picture in my head, and decide what
seems to be a sensible configuration to support going forward.

Basically I have in mind something like this:

1. Define hosts in a file
2. Define services in a file
3. Define commands in a file
4. Aggregate hosts in groups of similar types in a hostgroups file
5. Aggregate services in groups of similar types in a servicegroups file

Now, where I start to get confused is that the service definitions
seem to have a field for one or more hostnames. Why is this?

--
A: Because it messes up the order in which people normally read text.
Q: Why is top-posting such a bad thing?
A: Top-posting.
Q: What is the most annoying thing in e-mail?

From stanb at panix.com Wed Dec 29 19:39:16 2010
From: stanb at panix.com (stan)
Date: Wed, 29 Dec 2010 13:39:16 -0500
Subject: Checking multiple TCP ports for a single status?
Message-ID: <20101229183916.GB31320@teddy.fas.com>

I think I need to verify that both ports 135 and 445 are available on some
Windows machines. As I understand it, both of these need to be up. I'd like
to make this a single check. Looks like check_tcp will only accept a single
-p argument. Is this correct? If so, is there a way I can AND two different
check_tcp runs, and report a single status back to Nagios?

--
A: Because it messes up the order in which people normally read text.
Q: Why is top-posting such a bad thing?
A: Top-posting.
Q: What is the most annoying thing in e-mail?

From daniel.wittenberg.r0ko at statefarm.com Wed Dec 29 19:56:24 2010
From: daniel.wittenberg.r0ko at statefarm.com (Daniel Wittenberg)
Date: Wed, 29 Dec 2010 11:56:24 -0700
Subject: Checking multiple TCP ports for a single status?
In-Reply-To: <20101229183916.GB31320@teddy.fas.com>
References: <20101229183916.GB31320@teddy.fas.com>
Message-ID: <31B0FE0A1A8166409E9DF35C6DEECB240555D0B9@WPSCV6MM.OPR.STATEFARM.ORG>

Write a simple script that does both checks and returns results?

Dan

-----Original Message-----
From: stan [mailto:stanb at panix.com]
Sent: Wednesday, December 29, 2010 12:39 PM
To: nagios List
Subject: [Nagios-users] Checking multiple TCP ports for a single status?

I think I need to verify that both ports 135 and 445 are available on some
Windows machines. As I understand it, both of these need to be up. I'd like
to make this a single check. Looks like check_tcp will only accept a single
-p argument. Is this correct? If so, is there a way I can AND two different
check_tcp runs, and report a single status back to Nagios?

--
A: Because it messes up the order in which people normally read text.
Q: Why is top-posting such a bad thing?
A: Top-posting.
Q: What is the most annoying thing in e-mail?

From pangrazi at gmail.com Wed Dec 29 20:00:24 2010
From: pangrazi at gmail.com (Greg Pangrazio)
Date: Wed, 29 Dec 2010 13:00:24 -0600
Subject: Checking multiple TCP ports for a single status?
In-Reply-To: <20101229183916.GB31320@teddy.fas.com>
References: <20101229183916.GB31320@teddy.fas.com>
Message-ID:

Check out the check_service_cluster command.

http://nagios.sourceforge.net/docs/3_0/clusters.html

I use this for what you are looking at.

Greg Pangrazio

On Wed, Dec 29, 2010 at 12:39 PM, stan wrote:
>
> I think I need to verify that both ports 135 and 445 are available on some
> Windows machines. As I understand it, both of these need to be up. I'd like
> to make this a single check. Looks like check_tcp will only accept a single
> -p argument. Is this correct? If so, is there a way I can AND two different
> check_tcp runs, and report a single status back to Nagios?
>
>
> --
> A: Because it messes up the order in which people normally read text.
> Q: Why is top-posting such a bad thing?
> A: Top-posting.
> Q: What is the most annoying thing in e-mail?

From a31modela at hotmail.com Wed Dec 29 20:04:08 2010
From: a31modela at hotmail.com (steve f)
Date: Wed, 29 Dec 2010 14:04:08 -0500
Subject: A Question on Console Alerts for a Help Desk
In-Reply-To: <4D1B6B79.6090602@ederdrom.de>
References: , , <4D1B6B79.6090602@ederdrom.de>
Message-ID:

Thanks very much for the replies. The external commands are the answer I
was looking for. After rereading my original posting, I realized I didn't
want to ack the problem but to schedule/reschedule the check. Your answers
will put me where I need to be.

Thanks Jim & Joerg

Happy New Year, all

Steve

> Date: Wed, 29 Dec 2010 18:10:17 +0100
> From: pitchfork at ederdrom.de
> To: nagios-users at lists.sourceforge.net
> Subject: Re: [Nagios-users] A Question on Console Alerts for a Help Desk
>
> Jim Avery wrote:
> > On 29 December 2010 15:51, steve f wrote:
> >
> >> The Help Desk uses a menu to stop & restart services, etc. Is it feasible
> >> (realistic) for the menu command to add a snippet of code to put the nagios
> >> check for that service/host in ack mode for say 5 minutes so anyone who
> >> sees the alert on the screen would know that it's being addressed?
> >>
> >> Is there an easier way to do this?
> >
> > There isn't an easy way to ack it for 5 minutes, as you can't ack it
> > until it's in a hard state. I would instead get your menu script to
> > schedule downtime in Nagios for that service for 5 minutes by
> > submitting an external command to Nagios. I would say this is a
> > feasible and realistic thing to do, yes. But .. I don't know how you
> > operate your distributed Nagios setup. So long as only one of your
> > Nagios servers is used for the web front-end this should be easy. If
> > various people use various of the distributed Nagios servers for their
> > web front end it could be a challenge setting up your menu system to
> > submit the external command to the right server.
> >
> > See:
> >
> > http://nagios.sourceforge.net/docs/3_0/extcommands.html
>
> Instead of scheduling downtime, it's also possible to reschedule the next
> service check to now + 15 minutes, for example.
>
> http://old.nagios.org/developerinfo/externalcommands/commandinfo.php?command_id=30
>
> Joerg

From dit.dash at gmail.com Wed Dec 29 20:15:14 2010
From: dit.dash at gmail.com (dave stern - e-mail.pluribus.unum)
Date: Wed, 29 Dec 2010 14:15:14 -0500
Subject: Checking multiple TCP ports for a single status?
In-Reply-To: <20101229183916.GB31320@teddy.fas.com>
References: <20101229183916.GB31320@teddy.fas.com>
Message-ID:

Wrapper?

$NAGIOS/libexec/check_tcp -H myhost -p 135 && $NAGIOS/libexec/check_tcp -H myhost -p 445

On Wed, Dec 29, 2010 at 1:39 PM, stan wrote:
>
> I think I need to verify that both ports 135 and 445 are available on some
> Windows machines. As I understand it, both of these need to be up. I'd like
> to make this a single check. Looks like check_tcp will only accept a single
> -p argument. Is this correct? If so, is there a way I can AND two different
> check_tcp runs, and report a single status back to Nagios?
>
>
> --
> A: Because it messes up the order in which people normally read text.
> Q: Why is top-posting such a bad thing?
> A: Top-posting.
> Q: What is the most annoying thing in e-mail?
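
The one-liner in this message stops at the first failing port and leaves
the output formatting to chance. A slightly fuller wrapper might look like
the sketch below. It is only a sketch under stated assumptions: the plugin
path /usr/local/nagios/libexec/check_tcp is a guess at a typical source
install (override CHECK_TCP), and the function name check_tcp_all is made
up for illustration. It checks every port, prints a single summary line,
and returns the highest exit code seen, so any CRITICAL port makes the
combined check CRITICAL.

```shell
#!/bin/sh
# Sketch only: the plugin path is an assumption; override CHECK_TCP as needed.
CHECK_TCP=${CHECK_TCP:-/usr/local/nagios/libexec/check_tcp}

# check_tcp_all HOST PORT [PORT...]
# Runs check_tcp once per port, prints a one-line summary, and returns the
# highest exit code seen (0 OK, 1 WARNING, 2 CRITICAL, 3 UNKNOWN).
check_tcp_all() {
    ct_host=$1
    shift
    ct_worst=0
    ct_summary=""
    for ct_port in "$@"; do
        # Capture the plugin's exit code without tripping "set -e".
        if "$CHECK_TCP" -H "$ct_host" -p "$ct_port" >/dev/null 2>&1; then
            ct_rc=0
        else
            ct_rc=$?
        fi
        if [ "$ct_rc" -gt "$ct_worst" ]; then
            ct_worst=$ct_rc
        fi
        ct_summary="$ct_summary port $ct_port rc=$ct_rc;"
    done
    echo "TCP ports on $ct_host:$ct_summary"
    return "$ct_worst"
}

# As a plugin script, called with the host address as its first argument:
#   check_tcp_all "$1" 135 445
#   exit $?
```

Taking the highest exit code treats UNKNOWN (3) as worse than CRITICAL (2);
that is a design choice of this sketch, not something the thread specifies.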

From jim at jimavery.me.uk Wed Dec 29 23:52:45 2010
From: jim at jimavery.me.uk (Jim Avery)
Date: Wed, 29 Dec 2010 22:52:45 +0000
Subject: A newbie configuration question
In-Reply-To: <20101229183651.GA31320@teddy.fas.com>
References: <20101229183651.GA31320@teddy.fas.com>
Message-ID:

On 29 December 2010 18:36, stan wrote:
> I am trying to rationalize a couple of Nagios configurations that have
> 2 different histories, and they seem more different than I think they should
> be. So, first I want to get the big picture in my head, and decide what
> seems to be a sensible configuration to support going forward.
>
> Basically I have in mind something like this.
>
> 1. Define hosts in a file
> 2. Define services in a file
> 3. Define commands in a file
> 4. Aggregate hosts in groups of similar types in a hostgroups file
> 5. Aggregate services in groups of similar types in a servicegroups file
>
>
> Now, where I start to get confused here is that the service definitions
> seem to have a field for one or more hostnames. Why is this?

Services are often quite agnostic as to what kind of host they relate
to. Take for example FTP.
Various host types will accept an FTP
connection, but the service definition for them will always be pretty
much the same.

I do pretty much what you have described there, but have a
sub-directory for each host type. For example, my servers-unix
directory contains a hosts.cfg with the host definitions in it, but
also users.cfg for checks on numbers of logged-on users, disks.cfg for
filesystem disk space checks, cpu.cfg for cpu% checks and so on.

I have a "services" directory for general-purpose services which are
used for lots of different host types: things like FTP as I mentioned
before, but also ping, telnet, http and a few others.

I'm not saying this is what you should do, but it (kind of usually!)
works for me.

I also have a "templates" directory where I put most of my templates.
To be honest mine needs a good tidy-up though, as I've been rather
inconsistent in how I decide what goes in the template and what in the
object definition.

From jim at jimavery.me.uk Wed Dec 29 23:55:16 2010
From: jim at jimavery.me.uk (Jim Avery)
Date: Wed, 29 Dec 2010 22:55:16 +0000
Subject: A Question on Console Alerts for a Help Desk
In-Reply-To: <4D1B6B79.6090602@ederdrom.de>
References: <4D1B6B79.6090602@ederdrom.de>
Message-ID:

On 29 December 2010 17:10, Joerg Linge wrote:
> Instead of scheduling downtime, it's also possible to reschedule the next
> service check to now + 15 minutes, for example.
>
> http://old.nagios.org/developerinfo/externalcommands/commandinfo.php?command_id=30
>
> Joerg

That's a neat trick! :-)

Thanks!

From Stuart.Jones at health.wa.gov.au Thu Dec 30 02:47:47 2010
From: Stuart.Jones at health.wa.gov.au (Jones, Stuart)
Date: Thu, 30 Dec 2010 09:47:47 +0800
Subject: Qugga BGP monitoring IPv6
In-Reply-To: <512491303.20101216234355@oberhausen-it.de>
References: <512491303.20101216234355@oberhausen-it.de>
Message-ID:

Hello Daniel,

What response do you get when you run an snmpwalk against your Quagga
system?

Regards
Stuart

-----Original Message-----
From: Daniel [mailto:listen at oberhausen-it.de]
Sent: Friday, 17 December 2010 6:44 AM
To: nagios-users at lists.sourceforge.net
Subject: [Nagios-users] Qugga BGP monitoring IPv6

Hey there,

can anyone tell me if it is possible to monitor the state of an IPv6
BGP session inside Quagga? I found some SNMP tools for Nagios, but they
can only handle IPv4 sessions.

--
Kind regards
Daniel
mailto:listen at oberhausen-it.de

From talonx at gmail.com Thu Dec 30 04:26:57 2010
From: talonx at gmail.com (Hrishikesh Barua)
Date: Thu, 30 Dec 2010 08:56:57 +0530
Subject: Nagios-users Digest, Vol 55, Issue 21
In-Reply-To:
References:
Message-ID:

Hi Chris,

Some tinkering led me to the same thing, and it got resolved after I

1. Installed the openssl-devel package, and
2. Ran ./configure with the --with-openssl= option and built it again.

Thanks!

Regards
Hrish

> Date: Mon, 27 Dec 2010 09:55:13 -0500
> From: Chris Beattie
> Subject: Re: [Nagios-users] check_smtp doesn't support TLS?
> To: Nagios Users List
> Message-ID: <4D18A8D1.9090808 at geninfo.com>
> Content-Type: text/plain; charset="iso-8859-1"; format="flowed"
>
> talonx at gmail.com wrote:
> > I'm trying to set up the check_smtp plugin for a remote mail server. I
> > downloaded the latest version of nagios-plugins, built and installed
> > it. This is what I get when I invoke check_smtp from the command line
> > (actual data removed) -
> >
> > received 530 5.7.0 Must issue a STARTTLS command first
> >
> > I'm including the -S flag for it to use TLS. I couldn't see any other
> > options that might be relevant. Any pointers?
> I had something similar happen to me with SSL once. I compiled and
> installed the plugins, but I could not get SSL to work for the
> check_http command. I had installed the openssl package, but I had
> forgotten to install the openssl-devel package before I compiled the
> plugins.
>
> When you ./configure the plugins, does it say "--with-gnutls: no" at the
> end? You may have to install the gnutls-devel package first.
>
>

From mssneah at yahoo.com Thu Dec 30 08:45:34 2010
From: mssneah at yahoo.com (moses neah)
Date: Wed, 29 Dec 2010 23:45:34 -0800 (PST)
Subject: Nagios-users Digest, Vol 55, Issue 21
In-Reply-To:
References:
Message-ID: <928350.93466.qm@web45209.mail.sp1.yahoo.com>

Hi All out there,

Can anybody help me? I want to achieve the following:

Nagios should send a notification when a host/service state goes critical,
down, etc.
The notification should be sent only once when a state changes.

Finally, I want to know how many ping packets it is ideal for Nagios to
send.

Thanking you in advance.

________________________________
From: "nagios-users-request at lists.sourceforge.net"
To: nagios-users at lists.sourceforge.net
Sent: Wed, December 29, 2010 2:14:35 PM
Subject: Nagios-users Digest, Vol 55, Issue 21

Today's Topics:

   1. Re: check_smtp doesn't support TLS? (Chris Beattie)
   2. re-enable all notifications on hosts & services (Mark A. Lappin)
   3. Re: re-enable all notifications on hosts & services (Chris Beattie)
   4. monitoring windows event viewer. (Toonz IT)
   5. Scheduling Queue stucked a few minutes after restart (Maurizio Pinotti)
   6. Plugin: nagiosVMware (Sebastian Ries)
   7. Re: Scheduling Queue stucked a few minutes after restart (Chris Beattie)
   8. Host/service escalation notification (??????? ????????)
   9. Re: monitoring windows event viewer. (Chris Beattie)
  10. Re: monitoring windows event viewer. (Polifemo, Salvatore)
  11. Re: Host/service escalation notification (Marc Haber)
  12. Re: monitoring windows event viewer. (Daniel Wittenberg)
  13. Re: monitoring windows event viewer. (Polifemo, Salvatore)
  14. Disapering popup windows (stan)
  15. Re: Plugin: nagiosVMware (Sebastian Ries)
  16. Re: Disapering popup windows (Patrick Morris)
  17. Re: Disapering popup windows (diego.roccia at gmail.com)
  18. Re: Disapering popup windows (diego.roccia at gmail.com)
  19. Re: Disapering popup windows (Hugo van der Kooij)
  20. Re: monitoring windows event viewer. (Toonz IT)
  21.
Re: Disapering popup windows (stan) ---------------------------------------------------------------------- Message: 1 Date: Mon, 27 Dec 2010 09:55:13 -0500 From: Chris Beattie Subject: Re: [Nagios-users] check_smtp doesn't support TLS? To: Nagios Users List Message-ID: <4D18A8D1.9090808 at geninfo.com> Content-Type: text/plain; charset="iso-8859-1"; format="flowed" talonx at gmail.com wrote: > I'm trying to setup the check_smtp plugin for a remote mail server. I > downloaded the latest version of nagios-plugins, built and installed > it. This is what I get when I invoke check_smtp from the command line > (actual data removed) - > > received 530 5.7.0 Must issue a STARTTLS command first > > I'm including the -S flag for it to use TLS. I couldn't see any other > options that might be relevant. Any pointers? I had something similar happen to me with SSL once. I compiled and installed the plugins, but I could not get SSL to work for the check_http command. I had installed the openssl package, but I had forgotten to install the openssl-devel package before I compiled the plugins. When you ./configure the plugins, does it say "--with-gnutls: no" at the end? You may have to install the gnutls-devel package first. Nothing in this message is intended to make or accept an offer or to form a contract, except that an attachment that is an image of a contract bearing the signature of an officer of our company may be or become a contract. This message (including any attachments) is intended only for the use of the individual or entity to whom it is addressed. It may contain information that is non-public, proprietary, privileged, confidential, and exempt from disclosure under applicable law or may constitute as attorney work product. If you are not the intended recipient, we hereby notify you that any use, dissemination, distribution, or copying of this message is strictly prohibited. 
If you have received this message in error, please notify us immediately by telephone and delete this message immediately. Thank you. ------------------------------ Message: 2 Date: Mon, 27 Dec 2010 09:42:37 -0600 From: "Mark A. Lappin" Subject: [Nagios-users] re-enable all notifications on hosts & services To: Nagios Users List Message-ID: <0227B653B3DC82438B8291BC5218612F6737768BE8 at lmfjex07.lmfj.com> Content-Type: text/plain; charset="us-ascii" We have a notifications on services which were disabled via the web interface for hosts and all services on hosts. Dozens it seems. Is there an easy way to turn all notifications back on for all services and all hosts in one fell swoop? We had a lot of htem turned off while we were tweaking commands and configuration and I'm ready to have them all back on.... ML Mark A. Lappin, CCNA, MCITP: Enterprise Administrator | Lee Michaels Fine Jewelry Director of Information Technology 11314 Cloverland Ave | Baton Rouge, LA 70809 Ph: 225.291.9094 ext 245 | Fax: 225.368.3675 | Mobile: 225-362-2770 www.lmfj.com [http://www.lmfj.com/images/lmfjsig.gif] ________________________________ This communication is privileged and confidential. If you are not the intended recipient, please notify the sender by reply e-mail and destroy all copies of this communication . -------------- next part -------------- An HTML attachment was scrubbed... ------------------------------ Message: 3 Date: Mon, 27 Dec 2010 18:25:49 -0500 From: Chris Beattie Subject: Re: [Nagios-users] re-enable all notifications on hosts & services To: Nagios Users List Message-ID: <4D19207D.1030103 at geninfo.com> Content-Type: text/plain; charset="iso-8859-1"; format="flowed" Mark A. Lappin wrote: > interface for hosts and all services on hosts. Dozens it seems. Is > there an easy way to turn all notifications back on for all services and > all hosts in one fell swoop? We had a lot of htem turned off while we You can use Nagios' external commands to do that. 
http://old.nagios.org/developerinfo/externalcommands/commandlist.php

If I didn't have to be too careful, I'd do something like this (apologies for the line wrapping):

#!/bin/sh
now=`date +%s`
commandfile='/usr/local/nagios/var/rw/nagios.cmd'
statusfile='/usr/local/nagios/var/status.dat'

for i in `grep host_name $statusfile | sort --unique | sed "s/\thost_name=//"`
do
    /bin/printf "[%lu] ENABLE_HOST_NOTIFICATIONS;$i\n" $now > $commandfile
    /bin/printf "[%lu] ENABLE_HOST_SVC_NOTIFICATIONS;$i\n" $now > $commandfile
done

------------------------------
Message: 4
Date: Tue, 28 Dec 2010 17:33:53 +0530
From: Toonz IT
Subject: [Nagios-users] monitoring windows event viewer.
To: Nagios Users List
Message-ID:
Content-Type: text/plain; charset="iso-8859-1"

Is it possible to monitor specific event ids like disk error, from windows event viewer logs?? We recently had a sever hard disk error and we detected it a bit late! :-( Please let us know.

At present we just have basic monitoring like ping, disk space usage etc... we are using FAN 2.0.

anth!
-------------- next part --------------
An HTML attachment was scrubbed...
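
Going back to the external-command script in Message 3 above: a slightly more defensive variant might look like the sketch below. It is only an illustration, not something posted in the thread. The function names (list_hosts, commands_for_host, enable_all_notifications) are made up for this example, and the paths in the usage comment are simply the ones from Message 3, so adjust them to your installation. The two external commands themselves are the ones used in the original script.

```shell
#!/bin/sh
# Sketch: re-enable host and service notifications for every host found in a
# Nagios status file by writing external commands to the command file.
# Function names are hypothetical; nothing runs until a function is called.

# Print each distinct host name found in a status.dat-style file.
# The lines of interest look like "<whitespace>host_name=web01".
list_hosts() {
    sed -n 's/^[[:space:]]*host_name=//p' "$1" | sort -u
}

# Print the two external commands for host $2, stamped with epoch time $1.
# Passing the host as a printf argument (not inside the format string)
# avoids surprises if a name ever contains a '%'.
commands_for_host() {
    printf '[%s] ENABLE_HOST_NOTIFICATIONS;%s\n' "$1" "$2"
    printf '[%s] ENABLE_HOST_SVC_NOTIFICATIONS;%s\n' "$1" "$2"
}

# Write the commands for all hosts in status file $1 to command file $2.
enable_all_notifications() {
    now=$(date +%s)
    list_hosts "$1" | while IFS= read -r host; do
        commands_for_host "$now" "$host"
    done >> "$2"
}

# Example call (paths as in Message 3; treat them as assumptions):
# enable_all_notifications /usr/local/nagios/var/status.dat \
#                          /usr/local/nagios/var/rw/nagios.cmd
```

Keeping the command generation separate from the write means you can run commands_for_host or list_hosts on their own first and review the output before anything is sent to the Nagios command pipe.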
------------------------------
Message: 5
Date: Tue, 28 Dec 2010 12:44:31 +0100
From: Maurizio Pinotti
Subject: [Nagios-users] Scheduling Queue stucked a few minutes after restart
To: nagios-users at lists.sourceforge.net
Message-ID: <4D19CD9F.8050304 at cineca.it>
Content-Type: text/plain; charset=UTF-8

hello,

I have a really odd issue running Nagios: a few minutes after starting, the scheduling queue seems to freeze and no more active checks are performed. The queue remains stuck for hours until I manually restart Nagios. Passive checks are processed normally.

I'm running Nagios 3.0.6 (deb package) on a Debian lenny system. The hardware is an 8-core Xeon CPU with 16GB RAM. Nagios is monitoring about 1K hosts and 10K services.

Reverting the configuration to the "last known good configuration" did not help, nor did rebooting the server or several Nagios restarts and reloads.

Already tried fixes:
- disabled all active host checks
- increased ulimit for nagios user
- disabled all event handlers
- disabled all "obsess" stuff

Any help or hint would be appreciated.
nagios.cfg follows
*******************
log_file=/nagios_fe/var/log/nagios3/nagios.log
cfg_file=/etc/nagios3/commands.cfg
cfg_dir=/etc/nagios-plugins/config
cfg_dir=/nagios_fe/etc/cmon/nagios3
cfg_dir=/nagios_fe/etc/nagiosgrapher/nagios3
object_cache_file=/nagios_fe/var/cache/nagios3/objects.cache
precached_object_file=/nagios_fe/var/lib/nagios3/objects.precache
resource_file=/nagios_fe/etc/cmon/nagios3/macros.res
status_file=/nagios_fe/var/cache/nagios3/status.dat
status_update_interval=10
nagios_user=nagios
nagios_group=nagios
check_external_commands=1
command_check_interval=-1
command_file=/nagios_fe/var/lib/nagios3/rw/nagios.cmd
external_command_buffer_slots=4096
lock_file=/nagios_fe/var/run/nagios3/nagios3.pid
temp_file=/nagios_fe/var/cache/nagios3/nagios.tmp
temp_path=/tmp
event_broker_options=-1
log_rotation_method=d
log_archive_path=/nagios_fe/var/log/nagios3/archives
use_syslog=0
log_notifications=1
log_service_retries=0
log_host_retries=0
log_event_handlers=1
log_initial_states=0
log_external_commands=1
log_passive_checks=0
service_inter_check_delay_method=s
max_service_check_spread=30
service_interleave_factor=s
host_inter_check_delay_method=s
max_host_check_spread=30
max_concurrent_checks=0
check_result_reaper_frequency=10
max_check_result_reaper_time=30
check_result_path=/nagios_fe/var/lib/nagios3/spool/checkresults
max_check_result_file_age=3600
cached_host_check_horizon=15
cached_service_check_horizon=15
enable_predictive_host_dependency_checks=1
enable_predictive_service_dependency_checks=1
soft_state_dependencies=0
auto_reschedule_checks=0
auto_rescheduling_interval=30
auto_rescheduling_window=180
sleep_time=0.25
service_check_timeout=60
host_check_timeout=30
event_handler_timeout=30
notification_timeout=30
ocsp_timeout=5
perfdata_timeout=5
retain_state_information=1
state_retention_file=/nagios_fe/var/lib/nagios3/retention.dat
retention_update_interval=60
use_retained_program_state=1
use_retained_scheduling_info=1
retained_host_attribute_mask=0
retained_service_attribute_mask=0
retained_process_host_attribute_mask=0
retained_process_service_attribute_mask=0
retained_contact_host_attribute_mask=0
retained_contact_service_attribute_mask=0
interval_length=60
use_aggressive_host_checking=0
execute_service_checks=1
accept_passive_service_checks=1
execute_host_checks=1
accept_passive_host_checks=1
enable_notifications=1
enable_event_handlers=0
process_performance_data=1
service_perfdata_file=/nagios_fe/var/lib/nagiosgrapher/ngraph.pipe
service_perfdata_file_template=$HOSTNAME$\t$SERVICEDESC$\t$SERVICEOUTPUT$\t$SERVICEPERFDATA$\t$TIMET$\n
service_perfdata_file_mode=a
service_perfdata_file_processing_interval=5
service_perfdata_file_processing_command=ngraph-process-service-perfdata-pipe
obsess_over_services=0
obsess_over_hosts=0
translate_passive_host_checks=0
passive_host_checks_are_soft=0
check_for_orphaned_services=1
check_for_orphaned_hosts=1
check_service_freshness=1
service_freshness_check_interval=60
check_host_freshness=0
host_freshness_check_interval=60
additional_freshness_latency=15
enable_flap_detection=1
low_service_flap_threshold=5.0
high_service_flap_threshold=20.0
low_host_flap_threshold=5.0
high_host_flap_threshold=20.0
date_format=euro
p1_file=/usr/lib/nagios3/p1.pl
enable_embedded_perl=0
use_embedded_perl_implicitly=1
illegal_object_name_chars=`~!$%^&*|'"<>?,()=
illegal_macro_output_chars=`~$|'"<>
use_regexp_matching=0
use_true_regexp_matching=0
admin_email=root at localhost
admin_pager=pageroot at localhost
daemon_dumps_core=0
use_large_installation_tweaks=1
enable_environment_macros=0
debug_level=144
debug_verbosity=1
debug_file=/nagios_fe/var/log/nagios3/nagios.debug
max_debug_file_size=2000000000
*******************

------------------------------
Message: 6
Date: Tue, 28 Dec 2010 14:23:45 +0100
From: Sebastian Ries
Subject: [Nagios-users] Plugin: nagiosVMware
To: nagios-users-ML
Message-ID: <1293542625.8847.27.camel at bofh.dtnet.de>
Content-Type: text/plain

Hi

I am trying to Configure the
nagiosVMware plugin I found on https://www.monitoringexchange.org/inventory/Check-Plugins/Virtualization/VMWare-%2528ESX%2529/nagiosVMware Generally it works but now I found that most of the time the CPU and MEM-checks give no result: [nagios at nagios-check-vc ~]$ ./nsca_vmware.pl 5min Processing host esx19.dtnet.de ... overcommit -1, cpuload -1 on 0 cpus, memtot -1, memfree -1 esx19 ESX-OVERCMMT 3 Memory overcommitment -1 esx19 ESX-CPU-LOAD 3 CPU load average -1 on 0 CPUs esx19 ESX-MEMORY 3 Memory use -1 total -1 free ... mem/cpu took 8 seconds ... host esx19 took 8 seconds Processing host esx20.dtnet.de but sometimes it works: [nagios at nagios-check-vc ~]$ ./nsca_vmware.pl 5min Processing host esx19.dtnet.de ... overcommit 0.00, cpuload 0.19 on 8 cpus, memtot 32766, memfree 11680 esx19 ESX-OVERCMMT 0 Memory overcommitment 0% esx19 ESX-CPU-LOAD 0 CPU load average 19% on 8 CPUs esx19 ESX-MEMORY 0 Memory use 64% (21086 of 32766 MB used) ... mem/cpu took 8 seconds ... host esx19 took 8 seconds This is the same effect with all ESX servers. All are ESX4.0 in the ....cmd file I get this error [2010-12-28 14:17:16.275 7324B90 warning 'App'] Closing Response processing in unexpected state: 3 Has anyone managed to run this without errors? s_teeter: are you on this list? (I did not find an email address) Regards Sebastian Ries -- ------------------------------------------------------------ DT Netsolution GmbH - Talaeckerstr. 
30 - D-70437 Stuttgart Tel: +49-711-849910-36 Fax: +49-711-849910-936 WEB: http://www.dtnet.de/ email: Sebastian.Ries at dtnet.de ------------------------------ Message: 7 Date: Tue, 28 Dec 2010 09:13:14 -0500 From: Chris Beattie Subject: Re: [Nagios-users] Scheduling Queue stucked a few minutes after restart To: Nagios Users List Message-ID: <4D19F07A.1060306 at geninfo.com> Content-Type: text/plain; charset="iso-8859-1"; format="flowed" Maurizio Pinotti wrote: > I have a really odd issue running Nagios: a few minutes after starting the > scheduling queue seems to freeze and no more active checks are performed. The > queue remains stucked for hours until I have to manually restart Nagios. > > I'm running Nagios 3.0.6 (deb package) on a Debian lenny system. The harware is I think that is a bug in that version of Nagios. I had the same problem. It got fixed, but I still go look at my service checks every morning to make sure. Also, I see where the server guys acknowledge problems and then forget about them, heh heh. There is a much newer version of Nagios available in lenny-backports. I would give it a shot if you can. http://packages.debian.org/source/lenny-backports/backports/nagios3 -- -Chris ------ Nothing in this message is intended to make or accept an offer or to form a contract, except that an attachment that is an image of a contract bearing the signature of an officer of our company may be or become a contract. This message (including any attachments) is intended only for the use of the individual or entity to whom it is addressed. It may contain information that is non-public, proprietary, privileged, confidential, and exempt from disclosure under applicable law or may constitute as attorney work product. If you are not the intended recipient, we hereby notify you that any use, dissemination, distribution, or copying of this message is strictly prohibited. 
If you have received this message in error, please notify us immediately by telephone and delete this message immediately. Thank you. ------------------------------ Message: 8 Date: Tue, 28 Dec 2010 16:19:21 +0200 From: ??????? ???????? Subject: [Nagios-users] Host/service escalation notification To: nagios-users at lists.sourceforge.net Message-ID: Content-Type: text/plain; charset="utf-8" Hi. I want to create several levels of host and service escalation. Say 3 levels. In notification I want to know on which escalation level this particular notification occurred. Can't find any variable reflecting escalation level. Thanks a lot! Dmytro Leonenko -------------- next part -------------- An HTML attachment was scrubbed... ------------------------------ Message: 9 Date: Tue, 28 Dec 2010 09:31:41 -0500 From: Chris Beattie Subject: Re: [Nagios-users] monitoring windows event viewer. To: Nagios Users List Message-ID: <4D19F4CD.7040408 at geninfo.com> Content-Type: text/plain; charset="iso-8859-1"; format="flowed" Toonz IT wrote: > Is it possible to monitor specific event ids like disk error, fro > windows event viewer logs?? Yes, but you may have to use the NSClient++ agent on your Windows boxes and create custom commands to do it. http://nsclient.org/nscp/wiki/CheckEventLog/CheckEventLog Unfortunately, I deleted the Windows event log checks after I didn't need them any more, so I don't have a working example configuration to show you. -- -Chris ------ Nothing in this message is intended to make or accept an offer or to form a contract, except that an attachment that is an image of a contract bearing the signature of an officer of our company may be or become a contract. This message (including any attachments) is intended only for the use of the individual or entity to whom it is addressed. It may contain information that is non-public, proprietary, privileged, confidential, and exempt from disclosure under applicable law or may constitute as attorney work product. 
If you are not the intended recipient, we hereby notify you that any use, dissemination, distribution, or copying of this message is strictly prohibited. If you have received this message in error, please notify us immediately by telephone and delete this message immediately. Thank you. ------------------------------ Message: 10 Date: Tue, 28 Dec 2010 09:37:57 -0500 From: "Polifemo, Salvatore" Subject: Re: [Nagios-users] monitoring windows event viewer. To: "Nagios Users List" Message-ID: <5BE7D0404F28DC44B780100927AA4CB018664C5C at whplex3.int.cecdes.net> Content-Type: text/plain; charset="us-ascii" If you will be monitoring event logs, you may want to look at applications made for monitoring event logs. One application that works well for us is syslog-ng. Salvatore Polifemo Sr. Systems Security Specialist ConEdison Solutions 100 Summit Lake Drive Valhalla, NY 10595 -----Original Message----- From: Chris Beattie [mailto:cbeattie at geninfo.com] Sent: Tuesday, December 28, 2010 9:32 AM To: Nagios Users List Subject: Re: [Nagios-users] monitoring windows event viewer. Toonz IT wrote: > Is it possible to monitor specific event ids like disk error, fro > windows event viewer logs?? Yes, but you may have to use the NSClient++ agent on your Windows boxes and create custom commands to do it. http://nsclient.org/nscp/wiki/CheckEventLog/CheckEventLog Unfortunately, I deleted the Windows event log checks after I didn't need them any more, so I don't have a working example configuration to show you. -- -Chris ------ Nothing in this message is intended to make or accept an offer or to form a contract, except that an attachment that is an image of a contract bearing the signature of an officer of our company may be or become a contract. This message (including any attachments) is intended only for the use of the individual or entity to whom it is addressed. 
It may contain information that is non-public, proprietary, privileged, confidential, and exempt from disclosure under applicable law or may constitute as attorney work product. If you are not the intended recipient, we hereby notify you that any use, dissemination, distribution, or copying of this message is strictly prohibited. If you have received this message in error, please notify us immediately by telephone and delete this message immediately. Thank you.

------------------------------
Message: 11
Date: Tue, 28 Dec 2010 15:38:28 +0100
From: Marc Haber
Subject: Re: [Nagios-users] Host/service escalation notification
To: Nagios Users List
Message-ID: <20101228143828.GA25548 at torres.zugschlus.de>
Content-Type: text/plain; charset=utf-8

On Tue, Dec 28, 2010 at 04:19:21PM +0200, ??????? ???????? wrote:
> I want to create several levels of host and service escalation. Say 3
> levels. In notification I want to know on which escalation level this
> particular notification occurred. Can't find any variable reflecting
> escalation level.

Escalation levels are connected to the notification number, and escalations can be kind of orthogonal. Do the notification number macros ($SERVICENOTIFICATIONNUMBER$ and/or $HOSTNOTIFICATIONNUMBER$) do the job?
Greetings Marc -- ----------------------------------------------------------------------------- Marc Haber | "I don't trust Computers. They | Mailadresse im Header Mannheim, Germany | lose things." Winona Ryder | Fon: *49 621 72739834 Nordisch by Nature | How to make an American Quilt | Fax: *49 3221 2323190 ------------------------------ Message: 12 Date: Tue, 28 Dec 2010 07:46:05 -0700 From: "Daniel Wittenberg" Subject: Re: [Nagios-users] monitoring windows event viewer. To: "Nagios Users List" Message-ID: <31B0FE0A1A8166409E9DF35C6DEECB240555C774 at WPSCV6MM.OPR.STATEFARM.ORG> Content-Type: text/plain; charset="us-ascii" Doesn't syslog-ng just consolidate the logs, it doesn't really monitor anything right? Dan -----Original Message----- From: Polifemo, Salvatore [mailto:polifemos at conedsolutions.com] Sent: Tuesday, December 28, 2010 8:38 AM To: Nagios Users List Subject: Re: [Nagios-users] monitoring windows event viewer. If you will be monitoring event logs, you may want to look at applications made for monitoring event logs. One application that works well for us is syslog-ng. Salvatore Polifemo Sr. Systems Security Specialist ConEdison Solutions 100 Summit Lake Drive Valhalla, NY 10595 -----Original Message----- From: Chris Beattie [mailto:cbeattie at geninfo.com] Sent: Tuesday, December 28, 2010 9:32 AM To: Nagios Users List Subject: Re: [Nagios-users] monitoring windows event viewer. Toonz IT wrote: > Is it possible to monitor specific event ids like disk error, fro > windows event viewer logs?? Yes, but you may have to use the NSClient++ agent on your Windows boxes and create custom commands to do it. http://nsclient.org/nscp/wiki/CheckEventLog/CheckEventLog Unfortunately, I deleted the Windows event log checks after I didn't need them any more, so I don't have a working example configuration to show you. 
-- -Chris

------------------------------
Message: 13
Date: Tue, 28 Dec 2010 09:56:33 -0500
From: "Polifemo, Salvatore"
Subject: Re: [Nagios-users] monitoring windows event viewer.
To: "Nagios Users List"
Message-ID: <5BE7D0404F28DC44B780100927AA4CB018664C5F at whplex3.int.cecdes.net>
Content-Type: text/plain; charset="us-ascii"

One use of syslog is to set up rules and then take an action. We look for an error and then send out an email. Take a look at the syslog-ng forum.

Salvatore Polifemo
Sr. Systems Security Specialist
ConEdison Solutions
100 Summit Lake Drive
Valhalla, NY 10595

-----Original Message-----
From: Daniel Wittenberg [mailto:daniel.wittenberg.r0ko at statefarm.com]
Sent: Tuesday, December 28, 2010 9:46 AM
To: Nagios Users List
Subject: Re: [Nagios-users] monitoring windows event viewer.

Doesn't syslog-ng just consolidate the logs, it doesn't really monitor anything right?

Dan

-----Original Message-----
From: Polifemo, Salvatore [mailto:polifemos at conedsolutions.com]
Sent: Tuesday, December 28, 2010 8:38 AM
To: Nagios Users List
Subject: Re: [Nagios-users] monitoring windows event viewer.

If you will be monitoring event logs, you may want to look at applications made for monitoring event logs.
One application that works well for us is syslog-ng. Salvatore Polifemo Sr. Systems Security Specialist ConEdison Solutions 100 Summit Lake Drive Valhalla, NY 10595 -----Original Message----- From: Chris Beattie [mailto:cbeattie at geninfo.com] Sent: Tuesday, December 28, 2010 9:32 AM To: Nagios Users List Subject: Re: [Nagios-users] monitoring windows event viewer. Toonz IT wrote: > Is it possible to monitor specific event ids like disk error, fro > windows event viewer logs?? Yes, but you may have to use the NSClient++ agent on your Windows boxes and create custom commands to do it. http://nsclient.org/nscp/wiki/CheckEventLog/CheckEventLog Unfortunately, I deleted the Windows event log checks after I didn't need them any more, so I don't have a working example configuration to show you. -- -Chris ------ Nothing in this message is intended to make or accept an offer or to form a contract, except that an attachment that is an image of a contract bearing the signature of an officer of our company may be or become a contract. This message (including any attachments) is intended only for the use of the individual or entity to whom it is addressed. It may contain information that is non-public, proprietary, privileged, confidential, and exempt from disclosure under applicable law or may constitute as attorney work product. If you are not the intended recipient, we hereby notify you that any use, dissemination, distribution, or copying of this message is strictly prohibited. If you have received this message in error, please notify us immediately by telephone and delete this message immediately. Thank you. 
------------------------------
Message: 14
Date: Tue, 28 Dec 2010 17:02:08 -0500
From: stan
Subject: [Nagios-users] Disapering popup windows
To: nagios List
Message-ID: <20101228220208.GA5294 at teddy.fas.com>
Content-Type: text/plain; charset=us-ascii

I have a 3.1.0 instance which has been in service for a long time. Over the years, we have had some issues with the popup windows in the browser flashing up for a fraction of a second and disappearing. I have always charged this off to weird browser behavior; however, I am now setting up a child instance, and it is at 3.2.0.

Today I observed this behavior on the same browser, running on the same machine, in the 3.1.0 instance but not in the 3.2.0 instance. I am reluctant to upgrade the 3.1.0 instance, as it is fairly big and in production. Does anyone have any thoughts as to what might have changed between these 2 versions that fixed this?

--
A: Because it messes up the order in which people normally read text.
Q: Why is top-posting such a bad thing?
A: Top-posting.
Q: What is the most annoying thing in e-mail?
------------------------------ Message: 15 Date: Wed, 29 Dec 2010 11:07:17 +0100 From: Sebastian Ries Subject: Re: [Nagios-users] Plugin: nagiosVMware To: nagios-users-ML Message-ID: <1293617237.20786.11.camel at bofh.dtnet.de> Content-Type: text/plain Hi > I am trying to Configure the nagiosVMware plugin I found on >https://www.monitoringexchange.org/inventory/Check-Plugins/Virtualization/VMWare-%2528ESX%2529/nagiosVMware >e > This is the same effect with all ESX servers. All are ESX4.0 > > in the ....cmd file I get this error > [2010-12-28 14:17:16.275 7324B90 warning 'App'] Closing Response > processing in unexpected state: 3 OK I found that this error is printed to stdout from resxtop and therefore the plugin comes into an error while parsing the output. As a workaround I put: |grep -v \"Closing Response processing in unexpected state: 3\" into the command within nsca_vmware.pl so for me the Problem is solved ;-) Regards Sebastian Ries -- ------------------------------------------------------------ DT Netsolution GmbH - Talaeckerstr. 30 - D-70437 Stuttgart Tel: +49-711-849910-36 Fax: +49-711-849910-936 WEB: http://www.dtnet.de/ email: Sebastian.Ries at dtnet.de ------------------------------ Message: 16 Date: Wed, 29 Dec 2010 02:19:35 -0800 From: Patrick Morris Subject: Re: [Nagios-users] Disapering popup windows To: nagios-users at lists.sourceforge.net Message-ID: <4D1B0B37.5030808 at hp.com> Content-Type: text/plain; charset=ISO-8859-1; format=flowed On 12/28/2010 2:02 PM, stan wrote: > I have a 3.1.0 instnace which ahs been in service for a long time. Over the > years, we have had some issues with the popup windows in the browser > flashing up for a fraction of a second and disapering. I have always > chrged this off to wierd browser behavior, however, I am now setting up a > child instance, and it is at 3.2.0. > > Today I observed this ebhavior on the same browser, running on the smae > machine in the 3.1.0 instnace, but not in the 3.2.0 instnace. 
I am > reluctnat to upgrade the 3,1,0 instnace, as it is failry big, and in > production. Does nayone have any thoughts as to what might have chnaged > between these 2 versions that fixed this? Popup windows? I've never seen Nagios use them. ------------------------------ Message: 17 Date: Wed, 29 Dec 2010 11:31:32 +0100 From: "diego.roccia at gmail.com" Subject: Re: [Nagios-users] Disapering popup windows To: Nagios Users List Message-ID: Content-Type: text/plain; charset=UTF-8 Do you mean actual popup windows or overlay frames, like the ones used, for example, by pnp4nagios to embed graphs? On Tue, Dec 28, 2010 at 11:02 PM, stan wrote: > I have a 3.1.0 instnace which ahs been in service for a long time. Over the > years, we have had some issues with the popup windows in the browser > flashing up for a fraction of a second and disapering. I have always > chrged this off to wierd browser behavior, however, I am now setting up a > child instance, and it is at 3.2.0. > > Today I observed this ebhavior on the same browser, running on the smae > machine in the 3.1.0 instnace, but not in the 3.2.0 instnace. I am > reluctnat to upgrade the 3,1,0 instnace, as it is failry big, and in > production. Does nayone have any thoughts as to what might have chnaged > between these 2 versions that fixed this? > > > -- > A: Because it messes up the order in which people normally read text. > Q: Why is top-posting such a bad thing? > A: Top-posting. > Q: What is the most annoying thing in e-mail? 
-- Diego Roccia diego.roccia (at) gmail (dot) com ------------------------------ Message: 18 Date: Wed, 29 Dec 2010 11:32:07 +0100 From: "diego.roccia at gmail.com" Subject: Re: [Nagios-users] Disapering popup windows To: Nagios Users List Message-ID: Content-Type: text/plain; charset=UTF-8 On Wed, Dec 29, 2010 at 11:31 AM, diego.roccia at gmail.com wrote: > Do you mean actual popup windows or overlay frames, like the ones > used, for example, by pnp4nagios to embed graphs? > > On Tue, Dec 28, 2010 at 11:02 PM, stan wrote: >> I have a 3.1.0 instnace which ahs been in service for a long time. Over the >> years, we have had some issues with the popup windows in the browser >> flashing up for a fraction of a second and disapering. I have always >> chrged this off to wierd browser behavior, however, I am now setting up a >> child instance, and it is at 3.2.0. >> >> Today I observed this ebhavior on the same browser, running on the smae >> machine in the 3.1.0 instnace, but not in the 3.2.0 instnace. I am >> reluctnat to upgrade the 3,1,0 instnace, as it is failry big, and in >> production. Does nayone have any thoughts as to what might have chnaged >> between these 2 versions that fixed this?
>> >> >> -- >> A: Because it messes up the order in which people normally read text. >> Q: Why is top-posting such a bad thing? >> A: Top-posting. >> Q: What is the most annoying thing in e-mail? >> > > > > -- > Diego Roccia > diego.roccia (at) gmail (dot) com > OPS, sorry for the top posting :) -- Diego Roccia diego.roccia (at) gmail (dot) com ------------------------------ Message: 19 Date: Wed, 29 Dec 2010 12:05:34 +0100 From: Hugo van der Kooij Subject: Re: [Nagios-users] Disapering popup windows To: Nagios Users List Message-ID: <4d2a6d2c87528eb7d1934ffa337c5d78 at vps517.directvps.nl> Content-Type: text/plain; charset=UTF-8; format=flowed On Tue, 28 Dec 2010 17:02:08 -0500, stan wrote: > I have a 3.1.0 instnace which ahs been in service for a long time. Please elaborate and please fix your output before sending something to a mailinglist with plenty of errors that make your message hard to read. Hugo. -- hvdkooij at vanderkooij.org http://hugo.vanderkooij.org/ PGP/GPG? Use: http://hugo.vanderkooij.org/0x58F19981.asc ------------------------------ Message: 20 Date: Wed, 29 Dec 2010 16:51:17 +0530 From: Toonz IT Subject: Re: [Nagios-users] monitoring windows event viewer.
To: Nagios Users List Message-ID: Content-Type: text/plain; charset="iso-8859-1" Thank You, exactly what we wanted!! :-) On Tue, Dec 28, 2010 at 8:01 PM, Chris Beattie wrote: > Toonz IT wrote: > > Is it possible to monitor specific event ids like disk error, fro > > windows event viewer logs?? > > Yes, but you may have to use the NSClient++ agent on your Windows boxes > and create custom commands to do it. > > http://nsclient.org/nscp/wiki/CheckEventLog/CheckEventLog > > Unfortunately, I deleted the Windows event log checks after I didn't > need them any more, so I don't have a working example configuration to > show you. > > > -- > -Chris > > ------ > > > Nothing in this message is intended to make or accept an offer or to form a > contract, except that an attachment that is an image of a contract bearing > the signature of an officer of our company may be or become a contract. This > message (including any attachments) is intended only for the use of the > individual or entity to whom it is addressed. It may contain information > that is non-public, proprietary, privileged, confidential, and exempt from > disclosure under applicable law or may constitute as attorney work product. > If you are not the intended recipient, we hereby notify you that any use, > dissemination, distribution, or copying of this message is strictly > prohibited. If you have received this message in error, please notify us > immediately by telephone and delete this message immediately. > > Thank you. 
------------------------------ Message: 21 Date: Wed, 29 Dec 2010 09:14:27 -0500 From: stan Subject: Re: [Nagios-users] Disapering popup windows To: Patrick Morris Cc: nagios-users at lists.sourceforge.net Message-ID: <20101229141427.GA25090 at teddy.fas.com> Content-Type: text/plain; charset=us-ascii On Wed, Dec 29, 2010 at 02:19:35AM -0800, Patrick Morris wrote: > On 12/28/2010 2:02 PM, stan wrote: > > I have a 3.1.0 instnace which ahs been in service for a long time. Over the > > years, we have had some issues with the popup windows in the browser > > flashing up for a fraction of a second and disapering. I have always > > chrged this off to wierd browser behavior, however, I am now setting up a > > child instance, and it is at 3.2.0. > > > > Today I observed this ebhavior on the same browser, running on the smae > > machine in the 3.1.0 instnace, but not in the 3.2.0 instnace. I am > > reluctnat to upgrade the 3,1,0 instnace, as it is failry big, and in > > production. Does nayone have any thoughts as to what might have chnaged > > between these 2 versions that fixed this? > > Popup windows? I've never seen Nagios use them. > OK, perhaps my terminology is imprecise.
Here is an example: from the main page, go to Configuration -> View Config. At that point you have a pulldown for Object type. On the old system, it flashes up for less than a second and then disappears. Make more sense now? -- A: Because it messes up the order in which people normally read text. Q: Why is top-posting such a bad thing? A: Top-posting. Q: What is the most annoying thing in e-mail? ------------------------------ End of Nagios-users Digest, Vol 55, Issue 21 ********************************************
From m.pinotti at cineca.it Thu Dec 30 10:38:41 2010 From: m.pinotti at cineca.it (Maurizio Pinotti) Date: Thu, 30 Dec 2010 10:38:41 +0100 Subject: Scheduling Queue stucked a few minutes after restart In-Reply-To: <4D19CD9F.8050304@cineca.it> References: <4D19CD9F.8050304@cineca.it> Message-ID: <4D1C5321.5090204@cineca.it> Hi Chris, thanks for your reply. I just upgraded to nagios 3.2.1-2~bpo50+1, but nothing has changed :'( From mssneah at yahoo.com Thu Dec 30 10:58:38 2010 From: mssneah at yahoo.com (moses neah) Date: Thu, 30 Dec 2010 01:58:38 -0800 (PST) Subject: Nagios notification In-Reply-To: References: Message-ID: <548011.96735.qm@web45206.mail.sp1.yahoo.com> Hi all out there, can anybody help me? I want to achieve the following: Nagios should send a notification when a host/service state goes critical, down, etc. The notification should be sent only once when a state changes. Finally, I want to know how many ping packets are ideal for Nagios to send. Thanking you in advance.
________________________________ From: "nagios-users-request at lists.sourceforge.net" To: nagios-users at lists.sourceforge.net Sent: Thu, December 30, 2010 7:45:45 AM Subject: Nagios-users Digest, Vol 55, Issue 22 Send Nagios-users mailing list submissions to nagios-users at lists.sourceforge.net To subscribe or unsubscribe via the World Wide Web, visit https://lists.sourceforge.net/lists/listinfo/nagios-users or, via email, send a message with subject or body 'help' to nagios-users-request at lists.sourceforge.net You can reach the person managing the list at nagios-users-owner at lists.sourceforge.net When replying, please edit your Subject line so it is more specific than "Re: Contents of Nagios-users digest..." Today's Topics: 1. Re: Disapering popup windows (stan) 2. A Question on Console Alerts for a Help Desk (steve f) 3. Re: A Question on Console Alerts for a Help Desk (Jim Avery) 4. Re: A Question on Console Alerts for a Help Desk (Joerg Linge) 5. A newbie configuration question (stan) 6. Checking multiple TCP ports for a single status? (stan) 7. Re: Checking multiple TCP ports for a single status? (Daniel Wittenberg) 8. Re: Checking multiple TCP ports for a single status? (Greg Pangrazio) 9. Re: A Question on Console Alerts for a Help Desk (steve f) 10. Re: Checking multiple TCP ports for a single status? (dave stern - e-mail.pluribus.unum) 11. Re: A newbie configuration question (Jim Avery) 12. Re: A Question on Console Alerts for a Help Desk (Jim Avery) 13. Re: Qugga BGP monitoring IPv6 (Jones, Stuart) 14. Re: Nagios-users Digest, Vol 55, Issue 21 (Hrishikesh Barua) 15. 
Re: Nagios-users Digest, Vol 55, Issue 21 (moses neah) ---------------------------------------------------------------------- Message: 1 Date: Wed, 29 Dec 2010 09:15:30 -0500 From: stan Subject: Re: [Nagios-users] Disapering popup windows To: "diego.roccia at gmail.com" Cc: Nagios Users List Message-ID: <20101229141530.GB25090 at teddy.fas.com> Content-Type: text/plain; charset=us-ascii On Wed, Dec 29, 2010 at 11:31:32AM +0100, diego.roccia at gmail.com wrote: > Do you mean actual popup windows or overlay frames, like the ones > used, for example, by pnp4nagios to embed graphs? > I think I really mean pulldows, such as you get when selecting an object type in the view configuration page. -- A: Because it messes up the order in which people normally read text. Q: Why is top-posting such a bad thing? A: Top-posting. Q: What is the most annoying thing in e-mail? ------------------------------ Message: 2 Date: Wed, 29 Dec 2010 10:51:37 -0500 From: steve f Subject: [Nagios-users] A Question on Console Alerts for a Help Desk To: Message-ID: Content-Type: text/plain; charset="iso-8859-1" I have a Nagios 3.x configuration set up and will eventually have a very large Distributed configuration. So far in the test environment, everything is working fine. Here is a question.... If someone on our Help Desk is working on a server ( that has not alerted in any way in Nagios ) and decides to restart a service that is a nagios monitored service, the restart of that service will most likely cause an alert to show on the nagios web status screen. My concern is that someone on the other side of the Help Desk doesn't know that someone is in the box, sees the alert on the monitor screen & dials into the box to see what is going on, effectively having 2 people working on the box. I am working off of the monitoring screen only and not any notification for this scenario. 
I know that the initial person could go into Nagios and schedule downtime for the host or service before they do anything, but in our environment that would not always be possible. The Help Desk uses a menu to stop & restart services, etc. Is it feasible ( realistic ) for the menu command to add a snippet of code to put the nagios check for that service/host in ack mode for say 5 minutes so anyone who sees the alert on the screen would know that it's being addressed? Is there an easier way to do this? FWIW, we don't do ANYTHING easy here..... :) Thanks, Steve ------------------------------ Message: 3 Date: Wed, 29 Dec 2010 16:18:39 +0000 From: Jim Avery Subject: Re: [Nagios-users] A Question on Console Alerts for a Help Desk To: Nagios Users List Message-ID: Content-Type: text/plain; charset=ISO-8859-1 On 29 December 2010 15:51, steve f wrote: > The Help Desk uses a menu to stop & restart services, etc. Is it feasible ( > realistic ) for the menu command to add a snippet of code to put the nagios > check for that service/host in ack mode for say 5 minutes so anyone who > sees the alert on the screen would know that it's being addressed? > > Is there an easier way to do this? There isn't an easy way to ack it for 5 minutes, as you can't ack it until it's in a hard state. I would instead get your menu script to schedule downtime in Nagios for that service for 5 minutes by submitting an external command to Nagios. I would say this is a feasible and realistic thing to do, yes. But .. I don't know how you operate your distributed Nagios setup. So long as only one of your Nagios servers is used for the web front-end this should be easy. If various people use various of the distributed Nagios servers for their web front end it could be a challenge setting up your menu system to submit the external command to the right server.
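A minimal sketch of what such a menu hook might look like, assuming Python is available on the box running the menu and that the Nagios command file sits at the default source-install path. The host name `web01`, the service `HTTP` and the author `helpdesk` are placeholders, not names from this thread. The field order follows the SCHEDULE_SVC_DOWNTIME entry in the Nagios 3 external command documentation.

```python
import time

# Assumed default path for a source install; adjust to the command_file
# setting in your nagios.cfg.
COMMAND_FILE = "/usr/local/nagios/var/rw/nagios.cmd"


def format_svc_downtime(host, service, minutes, author, comment, now=None):
    """Build one SCHEDULE_SVC_DOWNTIME external command line.

    Fields: host;service;start;end;fixed;trigger_id;duration;author;comment
    """
    start = int(time.time()) if now is None else int(now)
    duration = minutes * 60
    end = start + duration
    # fixed=1: downtime runs exactly from start to end.
    # trigger_id=0: not triggered by another downtime entry.
    return (f"[{start}] SCHEDULE_SVC_DOWNTIME;{host};{service};"
            f"{start};{end};1;0;{duration};{author};{comment}\n")


def submit(command_line, command_file=COMMAND_FILE):
    """Write the command to the Nagios external command pipe."""
    with open(command_file, "w") as pipe:
        pipe.write(command_line)
```

The menu script would call something like `submit(format_svc_downtime("web01", "HTTP", 5, "helpdesk", "restart via menu"))` just before restarting the service. In a distributed setup the write has to happen on (or be forwarded to) the server whose web front end the Help Desk watches.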
See: http://nagios.sourceforge.net/docs/3_0/extcommands.html hth, Jim ------------------------------ Message: 4 Date: Wed, 29 Dec 2010 18:10:17 +0100 From: Joerg Linge Subject: Re: [Nagios-users] A Question on Console Alerts for a Help Desk To: Nagios Users List Message-ID: <4D1B6B79.6090602 at ederdrom.de> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Jim Avery wrote: > On 29 December 2010 15:51, steve f wrote: > >> The Help Desk uses a menu to stop & restart services, etc. Is it feasible ( >> realistic ) for the menu command to add a snippet of code to put the nagios >> check for that service/host in ack mode for say 5 minutes so anyone who >> sees the alert on the screen would know that it's being addressed? >> >> Is there an easier way to do this? > > There isn't an easy way to ack it for 5 minutes, as you can't ack it > until it's in a hard state. I would instead get your menu script to > schedule downtime in Nagios for that service for 5 minutes by > submitting an external command to Nagios. I would say this is a > feasible and realistic thing to do, yes. But .. I don't know how you > operate your distributed Nagios setup. So long as only one of your > Nagios servers is used for the web front-end this should be easy. If > various people use various of the distributed Nagios servers for their > web front end it could be a challenge setting up your menu system to > submit the external command to the right server. > > See: > > http://nagios.sourceforge.net/docs/3_0/extcommands.html Instead of scheduling downtime, it's also possible to reschedule the next service checks to now + 15 minutes, for example.
http://old.nagios.org/developerinfo/externalcommands/commandinfo.php?command_id=30 Joerg ------------------------------ Message: 5 Date: Wed, 29 Dec 2010 13:36:51 -0500 From: stan Subject: [Nagios-users] A newbie configuration question To: nagios List Message-ID: <20101229183651.GA31320 at teddy.fas.com> Content-Type: text/plain; charset=us-ascii I am trying to rationalize a couple of Nagios configurations that have 2 different histories, and they seem more different than I think they should be. So, first I want to get the big picture in my head, and decide what seems to be a sensible configuration to support going forward. Basically I have in mind something like this. 1. Define hosts in a file 2. Define services in a file 3. Define commands in a file 4. Aggregate hosts in groups of similar types in a hostgroups file 5. Aggregate services in groups of similar types in a servicegroups file Now, where I start to get confused here is that the service definitions seem to have a field for one or more hostnames. Why is this? -- A: Because it messes up the order in which people normally read text. Q: Why is top-posting such a bad thing? A: Top-posting. Q: What is the most annoying thing in e-mail? ------------------------------ Message: 6 Date: Wed, 29 Dec 2010 13:39:16 -0500 From: stan Subject: [Nagios-users] Checking multiple TCP ports for a single status? To: nagios List Message-ID: <20101229183916.GB31320 at teddy.fas.com> Content-Type: text/plain; charset=us-ascii I think I need to verify that both ports 135 and 445 are available on some Windows machines. As I understand it, both of these need to be up. I'd like to make this a single check. Looks like check_tcp will only accept a single -p argument. Is this correct? If so, is there a way I can AND two different check_tcp runs, and report a single status back to Nagios? -- A: Because it messes up the order in which people normally read text. Q: Why is top-posting such a bad thing? A: Top-posting.
Q: What is the most annoying thing in e-mail? ------------------------------ Message: 7 Date: Wed, 29 Dec 2010 11:56:24 -0700 From: "Daniel Wittenberg" Subject: Re: [Nagios-users] Checking multiple TCP ports for a single status? To: "Nagios Users List" Message-ID: <31B0FE0A1A8166409E9DF35C6DEECB240555D0B9 at WPSCV6MM.OPR.STATEFARM.ORG> Content-Type: text/plain; charset="us-ascii" Write a simple script that does both checks and returns results? Dan -----Original Message----- From: stan [mailto:stanb at panix.com] Sent: Wednesday, December 29, 2010 12:39 PM To: nagios List Subject: [Nagios-users] Checking multiple TCP ports for a single status? I think I need to verify that both port 135, and 445 are avaialble on some Windows amchines. As I understand it, both of these need to be up. I'd like to make this a single check. Looks like check_tcp will only accept a single -p argument. Is this correct? If so, is there a way I can AND to different check_tcp runs, and report a single status back to Nagios? -- A: Because it messes up the order in which people normally read text. Q: Why is top-posting such a bad thing? A: Top-posting. Q: What is the most annoying thing in e-mail?
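The "simple script that does both checks" suggested in this thread could be sketched roughly as below. This is an illustration under assumptions, not a tested plugin: the path in `CHECK_TCP` is the default for a source install and may differ on your system, and the name `check_tcp_all` is made up for the example. It runs check_tcp once per port with the standard `-H` and `-p` options and reports the worst of the individual states, using the usual plugin exit codes (0 OK, 1 WARNING, 2 CRITICAL, 3 UNKNOWN).

```python
import subprocess

# Assumed location of the stock plugin; adjust for your install.
CHECK_TCP = "/usr/local/nagios/libexec/check_tcp"

# Rank plugin exit codes so that CRITICAL (2) outranks UNKNOWN (3).
SEVERITY = {0: 0, 1: 1, 3: 2, 2: 3}


def run_check(host, port):
    """Run check_tcp for one port; return (exit_code, first output line)."""
    try:
        proc = subprocess.run([CHECK_TCP, "-H", host, "-p", str(port)],
                              capture_output=True, text=True, timeout=30)
    except (OSError, subprocess.TimeoutExpired) as exc:
        return 3, f"port {port}: could not run check_tcp ({exc})"
    lines = proc.stdout.strip().splitlines()
    # Keep the status text, drop performance data after the pipe.
    text = lines[0].split("|")[0].strip() if lines else "no output"
    return proc.returncode, f"port {port}: {text}"


def combine(results):
    """Reduce [(code, text), ...] to the worst code and a joined summary."""
    normalized = [(code if code in SEVERITY else 3, text)
                  for code, text in results]
    worst = max((code for code, _ in normalized),
                key=SEVERITY.get, default=3)
    return worst, "; ".join(text for _, text in normalized)


def main(argv):
    if len(argv) < 3:
        print("usage: check_tcp_all HOST PORT [PORT ...]")
        return 3
    host, ports = argv[1], argv[2:]
    code, summary = combine([run_check(host, port) for port in ports])
    print(summary)
    return code
```

To use it as a plugin, the file would end with `import sys` and `sys.exit(main(sys.argv))`, and the Nagios command definition would pass `$HOSTADDRESS$ 135 445` as arguments. Unlike a shell `&&` chain, every port is checked even when the first one fails, so the status line names all failing ports.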
------------------------------ Message: 8 Date: Wed, 29 Dec 2010 13:00:24 -0600 From: Greg Pangrazio Subject: Re: [Nagios-users] Checking multiple TCP ports for a single status? To: Nagios Users List Message-ID: Content-Type: text/plain; charset=ISO-8859-1 check out check_service_cluster command. http://nagios.sourceforge.net/docs/3_0/clusters.html I use this for what you are looking at. Greg Pangrazio On Wed, Dec 29, 2010 at 12:39 PM, stan wrote: > > I think I need to verify that both port 135, and 445 are avaialble on some > Windows amchines. As I understand it, both of these need to be up. I'd like > to make this a single check. Looks like check_tcp will only accept a single > -p argument. Is this correct? If so, is there a way I can AND to different > check_tcp runs, and report a single status back to Nagios? > > > -- > A: Because it messes up the order in which people normally read text. > Q: Why is top-posting such a bad thing? > A: Top-posting. > Q: What is the most annoying thing in e-mail?
------------------------------ Message: 9 Date: Wed, 29 Dec 2010 14:04:08 -0500 From: steve f Subject: Re: [Nagios-users] A Question on Console Alerts for a Help Desk To: Message-ID: Content-Type: text/plain; charset="iso-8859-1" Thanks very much for the replies. This is the answer I was looking for with the external commands. After reading my original posting, I did realize I didn't want to ack it but schedule/reschedule the check. Your answers will put me where I need to be. Thanks Jim & Joerg Happy New Years All Steve > Date: Wed, 29 Dec 2010 18:10:17 +0100 > From: pitchfork at ederdrom.de > To: nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] A Question on Console Alerts for a Help Desk > > Jim Avery wrote: > > On 29 December 2010 15:51, steve f wrote: > > > >> The Help Desk uses a menu to stop & restart services, etc. Is it feasible ( > >> realistic ) for the menu command to add a snippet of code to put the nagios > >> check for that service/host in ack mode for say 5 minutes so anyone who > >> sees the alert on the screen would know that it's being addressed? > >> > >> Is there an easier way to do this? > > > > There isn't an easy way to ack it for 5 minutes, as you can't ack it > > until it's in a hard state. I would instead get your menu script to > > schedule downtime in Nagios for that service for 5 minutes by > > submitting an external command to Nagios. I would say this is a > > feasible and realistic thing to do, yes. But .. I don't know how you > > operate your distributed Nagios setup. So long as only one of your > > Nagios servers is used for the web front-end this should be easy. If > > various people use various of the distributed Nagios servers for their > > web front end it could be a challenge setting up your menu system to > > submit the external command to the right server.
> > > > See: > > > > http://nagios.sourceforge.net/docs/3_0/extcommands.html > > Instead of scheduling downtime, it's also possible to reschedule the next service > checks to now + 15 minutes, for example. > > http://old.nagios.org/developerinfo/externalcommands/commandinfo.php?command_id=30 > > Joerg ------------------------------ Message: 10 Date: Wed, 29 Dec 2010 14:15:14 -0500 From: "dave stern - e-mail.pluribus.unum" Subject: Re: [Nagios-users] Checking multiple TCP ports for a single status? To: Nagios Users List Message-ID: Content-Type: text/plain; charset=ISO-8859-1 Wrapper? $NAGIOS/libexec/check_tcp -H myhost -p 135 && $NAGIOS/libexec/check_tcp -H myhost -p 445 On Wed, Dec 29, 2010 at 1:39 PM, stan wrote: > > I think I need to verify that both port 135, and 445 are avaialble on some > Windows amchines. As I understand it, both of these need to be up. I'd like > to make this a single check. Looks like check_tcp will only accept a single > -p argument. Is this correct? If so, is there a way I can AND to different > check_tcp runs, and report a single status back to Nagios? > > > -- > A: Because it messes up the order in which people normally read text.
> Q: Why is top-posting such a bad thing? > A: Top-posting. > Q: What is the most annoying thing in e-mail? ------------------------------ Message: 11 Date: Wed, 29 Dec 2010 22:52:45 +0000 From: Jim Avery Subject: Re: [Nagios-users] A newbie configuration question To: Nagios Users List Message-ID: Content-Type: text/plain; charset=ISO-8859-1 On 29 December 2010 18:36, stan wrote: > I am trying to rationalize a couple of Nagios configurations that have > 2different histories, and they seem more different than I think they should > be. So, first I want to get the big picture n my head, and decide what > seems to be a sensible configuration to support going forward. > > Basicly I have in mind some thing like this. > > 1. Define hosts in a file > 2. Define services in a file > 3, Define commands in a file > 4. Aggregate hosts in groups of similar types in a hostgroups file > 5. Aggregate services in groups of similar types in a servicegroups file > > > Now, where I start to get confused here is that the service definitions > seem to have a filed for one or more hostnames. Why is this? Services are often quite agnostic as to what kind of host they relate to. Take for example FTP.
Various host types will accept an FTP connection but the service definition for them will always be pretty much the same. I do pretty much what you have described there, but have a sub-directory for each host type. For example, my servers-unix directory will contain a hosts.cfg with the hosts definitions in it, but also users.cfg for checks on numbers of logged on users, disks.cfg for filesystem disk space checks, cpu.cfg for cpu% checks and so on. I have a "services" directory for general-purpose services which are used for lots of different host types - things like FTP as I mentioned before, but also ping, telnet, http and a few others. I'm not saying this is what you should do, but it (kind of usually!) works for me. I also have a "templates" directory where I put most of my templates. To be honest mine needs a good tidy-up though, as I've been rather inconsistent in how I decide what goes in the template and what in the object definition. ------------------------------ Message: 12 Date: Wed, 29 Dec 2010 22:55:16 +0000 From: Jim Avery Subject: Re: [Nagios-users] A Question on Console Alerts for a Help Desk To: Nagios Users List Message-ID: Content-Type: text/plain; charset=ISO-8859-1 On 29 December 2010 17:10, Joerg Linge wrote: > Instead of scheduling downtime, it's also possible to reschedule the next service > checks to now + 15 minutes, for example. > > http://old.nagios.org/developerinfo/externalcommands/commandinfo.php?command_id=30 > > Joerg That's a neat trick! :-) Thanks! ------------------------------ Message: 13 Date: Thu, 30 Dec 2010 09:47:47 +0800 From: "Jones, Stuart" Subject: Re: [Nagios-users] Qugga BGP monitoring IPv6 To: "Nagios Users List" Message-ID: Content-Type: text/plain; charset="iso-8859-1" Hello Daniel, What response do you get when you undertake a snmpwalk of your quagga system?
Rgds
Stuart

-----Original Message-----
From: Daniel [mailto:listen at oberhausen-it.de]
Sent: Friday, 17 December 2010 6:44 AM
To: nagios-users at lists.sourceforge.net
Subject: [Nagios-users] Qugga BGP monitoring IPv6

Hey there,

can anyone tell me if it is possible to monitor the state of an IPv6
BGP session inside a quagga? I found some SNMP tools for Nagios but
they can only handle IPv4 sessions.

--
Kind regards
Daniel
mailto:listen at oberhausen-it.de

------------------------------

Message: 14
Date: Thu, 30 Dec 2010 08:56:57 +0530
From: Hrishikesh Barua
Subject: Re: [Nagios-users] Nagios-users Digest, Vol 55, Issue 21
To: nagios-users at lists.sourceforge.net
Message-ID:
Content-Type: text/plain; charset="iso-8859-1"

Hi Chris,

Some tinkering led me to the same thing - and it got resolved after I
1. Installed the openssl-devel package, and
2. Ran ./configure with the --with-openssl= option and built it again.

Thanks!

Regards
Hrish

> Date: Mon, 27 Dec 2010 09:55:13 -0500
> From: Chris Beattie
> Subject: Re: [Nagios-users] check_smtp doesn't support TLS?
> To: Nagios Users List
> Message-ID: <4D18A8D1.9090808 at geninfo.com>
> Content-Type: text/plain; charset="iso-8859-1"; format="flowed"
>
> talonx at gmail.com wrote:
> > I'm trying to setup the check_smtp plugin for a remote mail server.
> > I downloaded the latest version of nagios-plugins, built and installed
> > it. This is what I get when I invoke check_smtp from the command line
> > (actual data removed) -
> >
> > received 530 5.7.0 Must issue a STARTTLS command first
> >
> > I'm including the -S flag for it to use TLS. I couldn't see any other
> > options that might be relevant. Any pointers?
> I had something similar happen to me with SSL once. I compiled and
> installed the plugins, but I could not get SSL to work for the
> check_http command. I had installed the openssl package, but I had
> forgotten to install the openssl-devel package before I compiled the
> plugins.
>
> When you ./configure the plugins, does it say "--with-gnutls: no" at the
> end? You may have to install the gnutls-devel package first.

------------------------------

Message: 15
Date: Wed, 29 Dec 2010 23:45:34 -0800 (PST)
From: moses neah
Subject: Re: [Nagios-users] Nagios-users Digest, Vol 55, Issue 21
To: nagios-users at lists.sourceforge.net
Message-ID: <928350.93466.qm at web45209.mail.sp1.yahoo.com>
Content-Type: text/plain; charset="us-ascii"

Hi All out there,

Can anybody help me? I want to achieve the following:

Nagios should send a notification when a host/service state goes critical, down, etc.
A notification should be sent only once when a state changes.
Finally, I want to know how many ping packets are ideal for Nagios to send.

Thanking you in advance.
________________________________
From: "nagios-users-request at lists.sourceforge.net"
To: nagios-users at lists.sourceforge.net
Sent: Wed, December 29, 2010 2:14:35 PM
Subject: Nagios-users Digest, Vol 55, Issue 21

Send Nagios-users mailing list submissions to
    nagios-users at lists.sourceforge.net

To subscribe or unsubscribe via the World Wide Web, visit
    https://lists.sourceforge.net/lists/listinfo/nagios-users
or, via email, send a message with subject or body 'help' to
    nagios-users-request at lists.sourceforge.net

You can reach the person managing the list at
    nagios-users-owner at lists.sourceforge.net

When replying, please edit your Subject line so it is more specific
than "Re: Contents of Nagios-users digest..."

Today's Topics:

   1. Re: check_smtp doesn't support TLS? (Chris Beattie)
   2. re-enable all notifications on hosts & services (Mark A. Lappin)
   3. Re: re-enable all notifications on hosts & services (Chris Beattie)
   4. monitoring windows event viewer. (Toonz IT)
   5. Scheduling Queue stucked a few minutes after restart (Maurizio Pinotti)
   6. Plugin: nagiosVMware (Sebastian Ries)
   7. Re: Scheduling Queue stucked a few minutes after restart (Chris Beattie)
   8. Host/service escalation notification (??????? ????????)
   9. Re: monitoring windows event viewer. (Chris Beattie)
  10. Re: monitoring windows event viewer. (Polifemo, Salvatore)
  11. Re: Host/service escalation notification (Marc Haber)
  12. Re: monitoring windows event viewer. (Daniel Wittenberg)
  13. Re: monitoring windows event viewer. (Polifemo, Salvatore)
  14. Disapering popup windows (stan)
  15. Re: Plugin: nagiosVMware (Sebastian Ries)
  16. Re: Disapering popup windows (Patrick Morris)
  17. Re: Disapering popup windows (diego.roccia at gmail.com)
  18. Re: Disapering popup windows (diego.roccia at gmail.com)
  19. Re: Disapering popup windows (Hugo van der Kooij)
  20. Re: monitoring windows event viewer. (Toonz IT)
  21.
      Re: Disapering popup windows (stan)

----------------------------------------------------------------------

Message: 1
Date: Mon, 27 Dec 2010 09:55:13 -0500
From: Chris Beattie
Subject: Re: [Nagios-users] check_smtp doesn't support TLS?
To: Nagios Users List
Message-ID: <4D18A8D1.9090808 at geninfo.com>
Content-Type: text/plain; charset="iso-8859-1"; format="flowed"

talonx at gmail.com wrote:
> I'm trying to setup the check_smtp plugin for a remote mail server. I
> downloaded the latest version of nagios-plugins, built and installed
> it. This is what I get when I invoke check_smtp from the command line
> (actual data removed) -
>
> received 530 5.7.0 Must issue a STARTTLS command first
>
> I'm including the -S flag for it to use TLS. I couldn't see any other
> options that might be relevant. Any pointers?

I had something similar happen to me with SSL once. I compiled and
installed the plugins, but I could not get SSL to work for the
check_http command. I had installed the openssl package, but I had
forgotten to install the openssl-devel package before I compiled the
plugins.

When you ./configure the plugins, does it say "--with-gnutls: no" at the
end? You may have to install the gnutls-devel package first.

Nothing in this message is intended to make or accept an offer or to
form a contract, except that an attachment that is an image of a
contract bearing the signature of an officer of our company may be or
become a contract. This message (including any attachments) is intended
only for the use of the individual or entity to whom it is addressed. It
may contain information that is non-public, proprietary, privileged,
confidential, and exempt from disclosure under applicable law or may
constitute as attorney work product. If you are not the intended
recipient, we hereby notify you that any use, dissemination,
distribution, or copying of this message is strictly prohibited.
If you have received this message in error, please notify us
immediately by telephone and delete this message immediately. Thank you.

------------------------------

Message: 2
Date: Mon, 27 Dec 2010 09:42:37 -0600
From: "Mark A. Lappin"
Subject: [Nagios-users] re-enable all notifications on hosts & services
To: Nagios Users List
Message-ID: <0227B653B3DC82438B8291BC5218612F6737768BE8 at lmfjex07.lmfj.com>
Content-Type: text/plain; charset="us-ascii"

We have notifications which were disabled via the web interface for
hosts and all services on hosts. Dozens, it seems. Is there an easy way
to turn all notifications back on for all services and all hosts in one
fell swoop? We had a lot of them turned off while we were tweaking
commands and configuration and I'm ready to have them all back on....

ML

Mark A. Lappin, CCNA, MCITP: Enterprise Administrator | Lee Michaels Fine Jewelry
Director of Information Technology
11314 Cloverland Ave | Baton Rouge, LA 70809
Ph: 225.291.9094 ext 245 | Fax: 225.368.3675 | Mobile: 225-362-2770
www.lmfj.com

________________________________
This communication is privileged and confidential. If you are not the
intended recipient, please notify the sender by reply e-mail and destroy
all copies of this communication.

------------------------------

Message: 3
Date: Mon, 27 Dec 2010 18:25:49 -0500
From: Chris Beattie
Subject: Re: [Nagios-users] re-enable all notifications on hosts & services
To: Nagios Users List
Message-ID: <4D19207D.1030103 at geninfo.com>
Content-Type: text/plain; charset="iso-8859-1"; format="flowed"

Mark A. Lappin wrote:
> interface for hosts and all services on hosts. Dozens it seems. Is
> there an easy way to turn all notifications back on for all services and
> all hosts in one fell swoop? We had a lot of them turned off while we

You can use Nagios' external commands to do that.
http://old.nagios.org/developerinfo/externalcommands/commandlist.php

If I didn't have to be too careful, I'd do something like this
(apologies for the line wrapping):

#!/bin/sh
now=`date +%s`
commandfile='/usr/local/nagios/var/rw/nagios.cmd'
statusfile='/usr/local/nagios/var/status.dat'

for i in `grep host_name $statusfile | sort --unique | sed "s/\thost_name=//"`
do
    /bin/printf "[%lu] ENABLE_HOST_NOTIFICATIONS;$i\n" $now > $commandfile
    /bin/printf "[%lu] ENABLE_HOST_SVC_NOTIFICATIONS;$i\n" $now > $commandfile
done

------------------------------

Message: 4
Date: Tue, 28 Dec 2010 17:33:53 +0530
From: Toonz IT
Subject: [Nagios-users] monitoring windows event viewer.
To: Nagios Users List
Message-ID:
Content-Type: text/plain; charset="iso-8859-1"

Is it possible to monitor specific event IDs, like disk errors, from the
Windows event viewer logs?

We recently had a sever hard disk error and we detected it a bit late! :-(

Please let us know. At present we just have basic monitoring like ping,
disk space usage etc. We are using FAN 2.0.

anth!
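
A slightly more defensive take on the loop from Message 3, as a sketch
only. It assumes a Nagios 3 style status.dat in which each host appears
in a "hoststatus {" block containing a "host_name=" line; the function
name emit_enable_commands is made up for illustration and is not part of
Nagios. It prints the external commands to stdout instead of writing to
the command file, so the output can be checked first, and it reads host
names line by line so names containing spaces survive.

```shell
#!/bin/sh
# Sketch: print ENABLE_* external commands for every host in status.dat.
# Assumption: Nagios 3 status.dat layout with "hoststatus {" blocks.
# emit_enable_commands is a hypothetical helper name.

emit_enable_commands() {
    statusfile=$1   # path to status.dat
    now=$2          # Unix timestamp to put in the [brackets]
    # Only look at host_name lines inside hoststatus blocks, so every
    # host is emitted once and servicestatus/comment blocks are skipped.
    awk -v now="$now" '
        /^[ \t]*hoststatus[ \t]*\{/ { inhost = 1; next }
        /^[ \t]*\}/                 { inhost = 0; next }
        inhost && /^[ \t]*host_name=/ {
            sub(/^[ \t]*host_name=/, "")
            printf "[%s] ENABLE_HOST_NOTIFICATIONS;%s\n", now, $0
            printf "[%s] ENABLE_HOST_SVC_NOTIFICATIONS;%s\n", now, $0
        }
    ' "$statusfile"
}
```

With the paths from Message 3, one would review the output of
emit_enable_commands /usr/local/nagios/var/status.dat "$(date +%s)"
and then redirect it to /usr/local/nagios/var/rw/nagios.cmd. Both paths
depend on the installation.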
------------------------------

Message: 5
Date: Tue, 28 Dec 2010 12:44:31 +0100
From: Maurizio Pinotti
Subject: [Nagios-users] Scheduling Queue stucked a few minutes after restart
To: nagios-users at lists.sourceforge.net
Message-ID: <4D19CD9F.8050304 at cineca.it>
Content-Type: text/plain; charset=UTF-8

hello,

I have a really odd issue running Nagios: a few minutes after starting,
the scheduling queue seems to freeze and no more active checks are
performed. The queue remains stuck for hours until I manually restart
Nagios. Passive checks are processed normally.

I'm running Nagios 3.0.6 (deb package) on a Debian lenny system. The
hardware is an 8-core Xeon CPU with 16GB RAM.

Nagios is monitoring about 1K hosts and 10K services.

Reverting the configuration to the "last known good configuration" did
not help, neither did rebooting the server and several Nagios restarts
and reloads.

Already tried fixes:
- disabled all active host checks
- increased ulimit for nagios user
- disabled all event handlers
- disabled all "obsess" stuff

Any help or hint would be appreciated.
nagios.cfg follows

*******************
log_file=/nagios_fe/var/log/nagios3/nagios.log
cfg_file=/etc/nagios3/commands.cfg
cfg_dir=/etc/nagios-plugins/config
cfg_dir=/nagios_fe/etc/cmon/nagios3
cfg_dir=/nagios_fe/etc/nagiosgrapher/nagios3
object_cache_file=/nagios_fe/var/cache/nagios3/objects.cache
precached_object_file=/nagios_fe/var/lib/nagios3/objects.precache
resource_file=/nagios_fe/etc/cmon/nagios3/macros.res
status_file=/nagios_fe/var/cache/nagios3/status.dat
status_update_interval=10
nagios_user=nagios
nagios_group=nagios
check_external_commands=1
command_check_interval=-1
command_file=/nagios_fe/var/lib/nagios3/rw/nagios.cmd
external_command_buffer_slots=4096
lock_file=/nagios_fe/var/run/nagios3/nagios3.pid
temp_file=/nagios_fe/var/cache/nagios3/nagios.tmp
temp_path=/tmp
event_broker_options=-1
log_rotation_method=d
log_archive_path=/nagios_fe/var/log/nagios3/archives
use_syslog=0
log_notifications=1
log_service_retries=0
log_host_retries=0
log_event_handlers=1
log_initial_states=0
log_external_commands=1
log_passive_checks=0
service_inter_check_delay_method=s
max_service_check_spread=30
service_interleave_factor=s
host_inter_check_delay_method=s
max_host_check_spread=30
max_concurrent_checks=0
check_result_reaper_frequency=10
max_check_result_reaper_time=30
check_result_path=/nagios_fe/var/lib/nagios3/spool/checkresults
max_check_result_file_age=3600
cached_host_check_horizon=15
cached_service_check_horizon=15
enable_predictive_host_dependency_checks=1
enable_predictive_service_dependency_checks=1
soft_state_dependencies=0
auto_reschedule_checks=0
auto_rescheduling_interval=30
auto_rescheduling_window=180
sleep_time=0.25
service_check_timeout=60
host_check_timeout=30
event_handler_timeout=30
notification_timeout=30
ocsp_timeout=5
perfdata_timeout=5
retain_state_information=1
state_retention_file=/nagios_fe/var/lib/nagios3/retention.dat
retention_update_interval=60
use_retained_program_state=1
use_retained_scheduling_info=1
retained_host_attribute_mask=0
retained_service_attribute_mask=0
retained_process_host_attribute_mask=0
retained_process_service_attribute_mask=0
retained_contact_host_attribute_mask=0
retained_contact_service_attribute_mask=0
interval_length=60
use_aggressive_host_checking=0
execute_service_checks=1
accept_passive_service_checks=1
execute_host_checks=1
accept_passive_host_checks=1
enable_notifications=1
enable_event_handlers=0
process_performance_data=1
service_perfdata_file=/nagios_fe/var/lib/nagiosgrapher/ngraph.pipe
service_perfdata_file_template=$HOSTNAME$\t$SERVICEDESC$\t$SERVICEOUTPUT$\t$SERVICEPERFDATA$\t$TIMET$\n
service_perfdata_file_mode=a
service_perfdata_file_processing_interval=5
service_perfdata_file_processing_command=ngraph-process-service-perfdata-pipe
obsess_over_services=0
obsess_over_hosts=0
translate_passive_host_checks=0
passive_host_checks_are_soft=0
check_for_orphaned_services=1
check_for_orphaned_hosts=1
check_service_freshness=1
service_freshness_check_interval=60
check_host_freshness=0
host_freshness_check_interval=60
additional_freshness_latency=15
enable_flap_detection=1
low_service_flap_threshold=5.0
high_service_flap_threshold=20.0
low_host_flap_threshold=5.0
high_host_flap_threshold=20.0
date_format=euro
p1_file=/usr/lib/nagios3/p1.pl
enable_embedded_perl=0
use_embedded_perl_implicitly=1
illegal_object_name_chars=`~!$%^&*|'"<>?,()=
illegal_macro_output_chars=`~$|'"<>
use_regexp_matching=0
use_true_regexp_matching=0
admin_email=root at localhost
admin_pager=pageroot at localhost
daemon_dumps_core=0
use_large_installation_tweaks=1
enable_environment_macros=0
debug_level=144
debug_verbosity=1
debug_file=/nagios_fe/var/log/nagios3/nagios.debug
max_debug_file_size=2000000000
*******************

------------------------------

Message: 6
Date: Tue, 28 Dec 2010 14:23:45 +0100
From: Sebastian Ries
Subject: [Nagios-users] Plugin: nagiosVMware
To: nagios-users-ML
Message-ID: <1293542625.8847.27.camel at bofh.dtnet.de>
Content-Type: text/plain

Hi

I am trying to configure the
nagiosVMware plugin I found on https://www.monitoringexchange.org/inventory/Check-Plugins/Virtualization/VMWare-%2528ESX%2529/nagiosVMware Generally it works but now I found that most of the time the CPU and MEM-checks give no result: [nagios at nagios-check-vc ~]$ ./nsca_vmware.pl 5min Processing host esx19.dtnet.de ... overcommit -1, cpuload -1 on 0 cpus, memtot -1, memfree -1 esx19 ESX-OVERCMMT 3 Memory overcommitment -1 esx19 ESX-CPU-LOAD 3 CPU load average -1 on 0 CPUs esx19 ESX-MEMORY 3 Memory use -1 total -1 free ... mem/cpu took 8 seconds ... host esx19 took 8 seconds Processing host esx20.dtnet.de but sometimes it works: [nagios at nagios-check-vc ~]$ ./nsca_vmware.pl 5min Processing host esx19.dtnet.de ... overcommit 0.00, cpuload 0.19 on 8 cpus, memtot 32766, memfree 11680 esx19 ESX-OVERCMMT 0 Memory overcommitment 0% esx19 ESX-CPU-LOAD 0 CPU load average 19% on 8 CPUs esx19 ESX-MEMO