From nttbroken at gmail.com Tue Jun 1 11:07:56 2010 From: nttbroken at gmail.com (ntt broken) Date: Tue, 1 Jun 2010 12:07:56 +0300 Subject: NagiosQL installation manual. In-Reply-To: References: Message-ID: Hello there.. thanks for replying back. i'm almost in the end of the instllation. right now i have problem with HTML_Template_IT. in the beggining i succeded to do all of the "NagiosQL Installation: Checking requirements" in green except the mySQL thing. when i did reboot the mySQL changed to ok and green status but the HTML_Template_IT changed to red and not ok status and i cannot fix it. so it was working and all good from the NagiosQL side but it changeed status somehow becuase of the reboot. it tells me that HTML_Template_IT is installed (using "#pear list" command and also when i try to install it again) but i cannot change it to green again in the NagiosQL screen. uninstall and install again also don't work for me. i cannot use pear-go since its an offline linux red hat machine. but there were no problems with the pear installation. if you have an idea how can i continue it would be great. thanks again. also this reference exist (to add info to my previous message): http://www.simsonlai.org/installing-nagios-with-nagiosql-backend-administration/ On Mon, May 31, 2010 at 11:38 PM, Giorgio Zarrelli wrote: > I installes IT today. Didn't see any problems. Which problems did u meet? > > Ciao, > > Giorgio > > Il giorno 31/mag/2010, alle ore 14.33, ntt broken ha > scritto: > > I'm looking for more detailed NagiosQL installation > manual/tutorial/reference/notes etc. > Right now I have only those: > > > http://www.nagiosql.org/faq/31-general-documentation/49-documentation-for-nagiosql-2x.html > > http://www.berhorst.net/index.php/nagiosmanually (section 6) > > I found the installation notes not clear or missing ? well, not for noobs. > I have working and kicking Nagios machine installed on RHE5 distro and I > want to have for the first time NagiosQL installed working with it. > I went through the Nagios installation plus without problem since its well > documented on the net. > if you have can you share some installation notes it would be great. > > > ------------------------------------------------------------------------------ > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > > > ------------------------------------------------------------------------------ > > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From zarrelli at linux.it Tue Jun 1 12:58:32 2010 From: zarrelli at linux.it (Giorgio Zarrelli) Date: Tue, 1 Jun 2010 12:58:32 +0200 Subject: NagiosQL installation manual. In-Reply-To: References: Message-ID: Enable php error log, or create a php cli script using that pear component and check the paths IT is looking for IT Ciao, Giorgio Il giorno 01/giu/2010, alle ore 11.07, ntt broken ha scritto: > > Hello there.. > thanks for replying back. > > i'm almost in the end of the instllation. > right now i have problem with HTML_Template_IT. > in the beggining i succeded to do all of the "NagiosQL Installation: > Checking requirements" in green except the mySQL thing. > when i did reboot the mySQL changed to ok and green status but the > HTML_Template_IT changed to red and not ok status and i cannot fix it. > so it was working and all good from the NagiosQL side but it > changeed status somehow becuase of the reboot. > > it tells me that HTML_Template_IT is installed (using "#pear list" > command and also when i try to install it again) but i cannot change > it to green again in the NagiosQL screen. > uninstall and install again also don't work for me. > > i cannot use pear-go since its an offline linux red hat machine. but > there were no problems with the pear installation. > > if you have an idea how can i continue it would be great. > thanks again. > > also this reference exist (to add info to my previous message): > http://www.simsonlai.org/installing-nagios-with-nagiosql-backend-administration/ > > > > > On Mon, May 31, 2010 at 11:38 PM, Giorgio Zarrelli > wrote: > I installes IT today. Didn't see any problems. Which problems did u > meet? > > Ciao, > > Giorgio > > Il giorno 31/mag/2010, alle ore 14.33, ntt broken > ha scritto: > >> I'm looking for more detailed NagiosQL installation manual/tutorial/ >> reference/notes etc. >> Right now I have only those: >> http://www.nagiosql.org/faq/31-general-documentation/49-documentation-for-nagiosql-2x.html >> http://www.berhorst.net/index.php/nagiosmanually (section 6) >> >> I found the installation notes not clear or missing ? well, not fo >> r noobs. >> I have working and kicking Nagios machine installed on RHE5 distro >> and I want to have for the first time NagiosQL installed working >> with it. >> I went through the Nagios installation plus without problem since >> its well documented on the net. >> if you have can you share some installation notes it would be great. >> >> --- >> --- >> --- >> --------------------------------------------------------------------- >> >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when >> reporting any issue. >> ::: Messages without supporting info will risk being sent to /dev/ >> null > > --- > --- > --- > --------------------------------------------------------------------- > > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > --- > --- > --- > --------------------------------------------------------------------- > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mad at b-care.net Tue Jun 1 15:38:01 2010 From: mad at b-care.net (=?ISO-8859-1?Q?Marc-Andr=E9?= Doll) Date: Tue, 01 Jun 2010 15:38:01 +0200 Subject: IBM plugin In-Reply-To: References: Message-ID: <1275399481.1438.456.camel@MADness> Hi, and thank you for your answer. I had seen this plugin but it didn't match my expectations. I'm looking for something really like the check_openmanage plugin with a global test checking several MIBs at once and returning with non-OK state if one of them is in error. As I understand the check_snmp_IBM_Bladecenter plugin, I have to inspect the MIBs one by one. Le vendredi 28 mai 2010 ? 10:45 +0000, Julian Hein a ?crit : > Hi, > > Here is one: http://www.monitoringexchange.org/p/794 > > Bye, > Julian > > > Am 28.05.10 08:15 schrieb "Marc-Andr? Doll" unter : > > > Hi list, > > > > I am looking for a plugin to check IBM blades and chassis the way > > plugins like check_openmanage do from quite some time now. I tried > > googling it and browsing NagiosExchange but I didn't find what I'm > > looking for. I found plugins to check specific hardware on specific IBM > > architecture but nothing really generalistic. > > > > Does anyone have heard about or developed some "check_ibm" plugin ? > > > > Thanks, > > > > > > ------------------------------------------------------------------------------ > > > > _______________________________________________ > > Nagios-users mailing list > > Nagios-users at lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when reporting > > any issue. > > ::: Messages without supporting info will risk being sent to /dev/null > > > ------------------------------------------------------------------------------ > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Chris.Holt at london2012.com Wed Jun 2 10:17:33 2010 From: Chris.Holt at london2012.com (Chris Holt) Date: Wed, 2 Jun 2010 09:17:33 +0100 Subject: Advice on suitability of Nagios ans NSCA for a blended centalised/distributed model Message-ID: Hi, I'll start by admitting n00bness but I have googled a lot about nagios and I hope that this will not duplicate a previous mail. Be kind! I am looking to create a monitoring model that does not cleanly fit into anything I have seen, and before I spend days getting it all to (not) work I wanted to validate my plans. Basically, I have a lot of remote, temporary events sites popping up on ADSL or behind multiple NATs (ie outbound access to the Internet only) and going after a few weeks. On site infrastructure will go up very quickly and come down equally quickly [1]. What I want to be able to do is have a layout so that: - I can access a view of the local site devices site from a local event server and a view of all the sites from the central server - All the polling happens from the local site servers and the central server only pings the external ADSL IPs of each site to check if they are alive - All alerts are sent out from the central server via email, sms etc - Most checks will just be pings or receipt of syslog/snmp traps from local devices I am assuming I will need to play with NSCA and have the local server doing active checks, exporting stats to a file, and the central server doing passive checks, with NCSA syncing the stats to the central server. However, ADSL polling stats for each site need to be synced back to the local servers if they are available so the on site view shows those stats too Some things I am not sure about though: a) the line above b) that this is possible with only outbound access to the Internet from the local server c) how management of configured devices is kept in sync between the local and central servers d) how the central server can have suppression/correlation of events so that a remote site being down shows just "site X down" instead of 100 alerts about each element of the site being down The key part of this is that there can be really complicated parts to the setup, but the day to day operation must be as simple as possible, ie I can't expect users to have instructions like "log into linux, use vi to change configs, kill sighup and use rsync to the manager to update it" Is this just far too complicated an idea? Thanks in advance --Chris [1] on a side note on advice on how to best to get very simple non technical users to be able write monitoring configs would be appreciated -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From abdessamad at barakat.fr Wed Jun 2 13:42:12 2010 From: abdessamad at barakat.fr (Abdessamad BARAKAT) Date: Wed, 02 Jun 2010 13:42:12 +0200 Subject: Ndoutils block nagios Message-ID: <4C064394.50009@barakat.fr> Hi, After a electrical crash, I have a problem with ndoutils, nagios start and works correctly when ndo2db isn't started . When I start ndo2db, nagios blocks. When ndo2db is started , nagios connect to ndo2db: [1275478646] ndomod: Successfully reconnected to data sink! 113098 items lost, 5000 queued items to flush. I see this activity on the process nagios with strace: [pid 18603] write(7, "\n400:\n4=1275477698.622212\n174=MAW"..., 588 [pid 18604] <... poll resumed> ) = 0 (Timeout) [pid 18604] poll([{fd=4, events=POLLIN}], 1, 500) = 0 (Timeout) [pid 18604] poll([{fd=4, events=POLLIN}], 1, 500) = 0 (Timeout) [pid 18604] poll([{fd=4, events=POLLIN}], 1, 500) = 0 (Timeout) [pid 18604] poll([{fd=4, events=POLLIN}], 1, 500) = 0 (Timeout) Before the electrical crash , all is ok. I use : - nagios 3.0.6 - NDOMOD 1.4b7 (10-31-2007) Many thanks for any help / information ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From bertinelli.massimo at ansa.it Wed Jun 2 14:38:35 2010 From: bertinelli.massimo at ansa.it (Bertinelli Massimo) Date: Wed, 2 Jun 2010 14:38:35 +0200 Subject: R: Ndoutils block nagios Message-ID: I risolve the same problem using this method: stop ndo and nagios deleting the file ndo.sock on nagios/var directory Start ndo and nagios Bye Max ----- Messaggio originale ----- Da: Abdessamad BARAKAT A: nagios-users at lists.sourceforge.net Inviato: Wed Jun 02 13:42:12 2010 Oggetto: [Nagios-users] Ndoutils block nagios Hi, After a electrical crash, I have a problem with ndoutils, nagios start and works correctly when ndo2db isn't started . When I start ndo2db, nagios blocks. When ndo2db is started , nagios connect to ndo2db: [1275478646] ndomod: Successfully reconnected to data sink! 113098 items lost, 5000 queued items to flush. I see this activity on the process nagios with strace: [pid 18603] write(7, "\n400:\n4=1275477698.622212\n174=MAW"..., 588 [pid 18604] <... poll resumed> ) = 0 (Timeout) [pid 18604] poll([{fd=4, events=POLLIN}], 1, 500) = 0 (Timeout) [pid 18604] poll([{fd=4, events=POLLIN}], 1, 500) = 0 (Timeout) [pid 18604] poll([{fd=4, events=POLLIN}], 1, 500) = 0 (Timeout) [pid 18604] poll([{fd=4, events=POLLIN}], 1, 500) = 0 (Timeout) Before the electrical crash , all is ok. I use : - nagios 3.0.6 - NDOMOD 1.4b7 (10-31-2007) Many thanks for any help / information ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From abdessamad at barakat.fr Wed Jun 2 17:20:56 2010 From: abdessamad at barakat.fr (Abdessamad BARAKAT) Date: Wed, 02 Jun 2010 17:20:56 +0200 Subject: R: Ndoutils block nagios In-Reply-To: References: Message-ID: <4C0676D8.5080305@barakat.fr> The unix socket ndo.sock is removed automatically when I stop ndo. for information, my ndomod.cfg: instance_name=Central output_type=unixsocket output=/var/cache/nagios3/ndoutils.sock tcp_port=5669 output_buffer_items=5000 buffer_file=/var/cache/nagios3/ndoutils_mod.tmp file_rotation_interval=14400 file_rotation_timeout=60 reconnect_interval=15 reconnect_warning_interval=15 data_processing_options=-1 config_output_options=3 Thanks again for your help Bertinelli Massimo a ?crit : > I risolve the same problem using this method: > stop ndo and nagios > deleting the file ndo.sock on nagios/var directory > Start ndo and nagios > Bye > Max > > ----- Messaggio originale ----- > Da: Abdessamad BARAKAT > A: nagios-users at lists.sourceforge.net > Inviato: Wed Jun 02 13:42:12 2010 > Oggetto: [Nagios-users] Ndoutils block nagios > > Hi, > > After a electrical crash, I have a problem with ndoutils, nagios start > and works correctly when ndo2db isn't started . > > When I start ndo2db, nagios blocks. > > When ndo2db is started , nagios connect to ndo2db: > > [1275478646] ndomod: Successfully reconnected to data sink! 113098 > items lost, 5000 queued items to flush. > > I see this activity on the process nagios with strace: > > [pid 18603] write(7, "\n400:\n4=1275477698.622212\n174=MAW"..., 588 > > [pid 18604] <... poll resumed> ) = 0 (Timeout) > [pid 18604] poll([{fd=4, events=POLLIN}], 1, 500) = 0 (Timeout) > [pid 18604] poll([{fd=4, events=POLLIN}], 1, 500) = 0 (Timeout) > [pid 18604] poll([{fd=4, events=POLLIN}], 1, 500) = 0 (Timeout) > [pid 18604] poll([{fd=4, events=POLLIN}], 1, 500) = 0 (Timeout) > > Before the electrical crash , all is ok. > > I use : > > - nagios 3.0.6 > - NDOMOD 1.4b7 (10-31-2007) > > Many thanks for any help / information > > > > > ------------------------------------------------------------------------------ > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > > ------------------------------------------------------------------------ > > ------------------------------------------------------------------------------ > > > > ------------------------------------------------------------------------ > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From andrew at pagerduty.com Wed Jun 2 21:51:58 2010 From: andrew at pagerduty.com (Andrew Miklas) Date: Wed, 2 Jun 2010 12:51:58 -0700 Subject: Repeating failed notifications? Message-ID: <48D6B5C5-928C-401C-9E24-BCC3F0AE9B42@pagerduty.com> Hi, Is there any way to get Nagios to repeat failed notifications? In other words, can Nagios be configured so that if a notification script exits with a non-zero return value, the script will be run again after a short interval? For a bit of background, I'm working on a way to pass events from Nagios to the PagerDuty alerting system (www.pagerduty.com). PagerDuty collects events from external monitoring tools like Nagios and sends out phone calls & SMSes based on user-provided schedules and escalation chains. Right now, people who want to use PagerDuty to deliver their Nagios alerts must redirect their Nagios email to a PagerDuty-supplied email address. While this works reasonably well, we'd like to offer a plugin so we can better integrate with Nagios. I'm planning on doing this by writing a little Perl script that invokes the PagerDuty HTTP API. The script would be run as a Nagios notification command. One obvious downside to this approach is that if there's a network problem, notifications will be lost, hence the question above. -- Andrew (co-founder @ PagerDuty) ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Steven.Morrey at OneNeck.com Wed Jun 2 23:03:59 2010 From: Steven.Morrey at OneNeck.com (Morrey, Steven) Date: Wed, 2 Jun 2010 14:03:59 -0700 Subject: Repeating failed notifications? In-Reply-To: <48D6B5C5-928C-401C-9E24-BCC3F0AE9B42@pagerduty.com> References: <48D6B5C5-928C-401C-9E24-BCC3F0AE9B42@pagerduty.com> Message-ID: <8AB820857C43014DABCEBE7D42E9F6751A88F6E9FD@ONEWS06.oneneck.corp> I'm not sure about that but we did something similar here by creating a LUA script and using custom macros. The script reads an email template file, does a search & replace for the relevant values, then sends it. If the link was down it caches the contents to /tmp then each time it's called it sends the emails contained in that queue, it'll keep trying until it sends, after which each email is removed from the queue if and only if the send was successful. You could do something very similar with post requests, just cache them all the time & only delete them when they've actually completed. I'm looking at optimizing the process by just having the cache step during the notification process and then sending all the messages out via a daemon process that checks once per minute & cleans up the queue, this would free up resources for the rest of the system. Not sure about perl but in lua it's just under 50 lines of code to do all this. -----Original Message----- From: Andrew Miklas [mailto:andrew at pagerduty.com] Sent: Wednesday, June 02, 2010 12:52 PM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] Repeating failed notifications? Hi, Is there any way to get Nagios to repeat failed notifications? In other words, can Nagios be configured so that if a notification script exits with a non-zero return value, the script will be run again after a short interval? For a bit of background, I'm working on a way to pass events from Nagios to the PagerDuty alerting system (www.pagerduty.com). PagerDuty collects events from external monitoring tools like Nagios and sends out phone calls & SMSes based on user-provided schedules and escalation chains. Right now, people who want to use PagerDuty to deliver their Nagios alerts must redirect their Nagios email to a PagerDuty-supplied email address. While this works reasonably well, we'd like to offer a plugin so we can better integrate with Nagios. I'm planning on doing this by writing a little Perl script that invokes the PagerDuty HTTP API. The script would be run as a Nagios notification command. One obvious downside to this approach is that if there's a network problem, notifications will be lost, hence the question above. -- Andrew (co-founder @ PagerDuty) ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null Privileged/Confidential Information may be contained in this message or attachments hereto. Please advise immediately if you or your employer do not consent to Internet email for messages of this kind. Opinions, conclusions and other information in this message that do not relate to the official business of this company shall be understood as neither given nor endorsed by it. ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From michael.friedrich at univie.ac.at Wed Jun 2 23:29:53 2010 From: michael.friedrich at univie.ac.at (Michael Friedrich) Date: Wed, 02 Jun 2010 23:29:53 +0200 Subject: R: Ndoutils block nagios In-Reply-To: <4C0676D8.5080305@barakat.fr> References: <4C0676D8.5080305@barakat.fr> Message-ID: <4C06CD51.8000609@univie.ac.at> Abdessamad BARAKAT wrote: > buffer_file=/var/cache/nagios3/ndoutils_mod.tmp Try to remove this from disk in order to drop the waiting 5k items being put on the socket. Having a fresh startup of nagios and ndoutils, the config waits to be written, before some more realtime/config cleanups and after that, the historical cleanups leading from ndo2db.cfg might also interfere the normal insert/update procedure. this is when the socket gets blocking and ndomod is buffed with data it can't even send to the socket. Regarding 1.4b7 - consider getting the latest cvs head and patch the unique constraint bugfix on nagios-devel yourself onto mysql.sql It's not "the latest and greatest" ndoutils, just "the latest" though. Kind regards, Michael > file_rotation_interval=14400 > file_rotation_timeout=60 > reconnect_interval=15 > reconnect_warning_interval=15 > data_processing_options=-1 > config_output_options=3 > > > Thanks again for your help > > Bertinelli Massimo a ?crit : >> I risolve the same problem using this method: >> stop ndo and nagios >> deleting the file ndo.sock on nagios/var directory >> Start ndo and nagios >> Bye >> Max >> >> ----- Messaggio originale ----- >> Da: Abdessamad BARAKAT >> A: nagios-users at lists.sourceforge.net >> Inviato: Wed Jun 02 13:42:12 2010 >> Oggetto: [Nagios-users] Ndoutils block nagios >> >> Hi, >> >> After a electrical crash, I have a problem with ndoutils, nagios start >> and works correctly when ndo2db isn't started . >> >> When I start ndo2db, nagios blocks. >> >> When ndo2db is started , nagios connect to ndo2db: >> >> [1275478646] ndomod: Successfully reconnected to data sink! 113098 >> items lost, 5000 queued items to flush. >> >> I see this activity on the process nagios with strace: >> >> [pid 18603] write(7, "\n400:\n4=1275477698.622212\n174=MAW"..., 588 >> >> [pid 18604]<... poll resumed> ) = 0 (Timeout) >> [pid 18604] poll([{fd=4, events=POLLIN}], 1, 500) = 0 (Timeout) >> [pid 18604] poll([{fd=4, events=POLLIN}], 1, 500) = 0 (Timeout) >> [pid 18604] poll([{fd=4, events=POLLIN}], 1, 500) = 0 (Timeout) >> [pid 18604] poll([{fd=4, events=POLLIN}], 1, 500) = 0 (Timeout) >> >> Before the electrical crash , all is ok. >> >> I use : >> >> - nagios 3.0.6 >> - NDOMOD 1.4b7 (10-31-2007) >> >> Many thanks for any help / information >> >> >> >> >> ------------------------------------------------------------------------------ >> >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when >> reporting any issue. >> ::: Messages without supporting info will risk being sent to /dev/null >> >> >> ------------------------------------------------------------------------ >> >> ------------------------------------------------------------------------------ >> >> >> >> ------------------------------------------------------------------------ >> >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. >> ::: Messages without supporting info will risk being sent to /dev/null > > > ------------------------------------------------------------------------------ > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null -- DI (FH) Michael Friedrich michael.friedrich at univie.ac.at Tel: +43 1 4277 14359 Vienna University Computer Center Universitaetsstrasse 7 A-1010 Vienna, Austria ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From abdessamad at barakat.fr Thu Jun 3 08:38:45 2010 From: abdessamad at barakat.fr (Abdessamad BARAKAT) Date: Thu, 03 Jun 2010 08:38:45 +0200 Subject: R: Ndoutils block nagios In-Reply-To: <4C06CD51.8000609@univie.ac.at> References: <4C0676D8.5080305@barakat.fr> <4C06CD51.8000609@univie.ac.at> Message-ID: <4C074DF5.7010101@barakat.fr> Thanks for your reply. This file is also removed automatically, before I start ndo, It doesn't exists. Before this crash , all works like a charm. Do you think I really need to upgrade ? Michael Friedrich a ?crit : > Abdessamad BARAKAT wrote: >> buffer_file=/var/cache/nagios3/ndoutils_mod.tmp > > Try to remove this from disk in order to drop the waiting 5k items being > put on the socket. > > Having a fresh startup of nagios and ndoutils, the config waits to be > written, before some more realtime/config cleanups and after that, the > historical cleanups leading from ndo2db.cfg might also interfere the > normal insert/update procedure. this is when the socket gets blocking > and ndomod is buffed with data it can't even send to the socket. > > Regarding 1.4b7 - consider getting the latest cvs head and patch the > unique constraint bugfix on nagios-devel yourself onto mysql.sql > > It's not "the latest and greatest" ndoutils, just "the latest" though. > > Kind regards, > Michael > >> file_rotation_interval=14400 >> file_rotation_timeout=60 >> reconnect_interval=15 >> reconnect_warning_interval=15 >> data_processing_options=-1 >> config_output_options=3 >> >> >> Thanks again for your help >> >> Bertinelli Massimo a ?crit : >>> I risolve the same problem using this method: >>> stop ndo and nagios >>> deleting the file ndo.sock on nagios/var directory >>> Start ndo and nagios >>> Bye >>> Max >>> >>> ----- Messaggio originale ----- >>> Da: Abdessamad BARAKAT >>> A: nagios-users at lists.sourceforge.net >>> Inviato: Wed Jun 02 13:42:12 2010 >>> Oggetto: [Nagios-users] Ndoutils block nagios >>> >>> Hi, >>> >>> After a electrical crash, I have a problem with ndoutils, nagios start >>> and works correctly when ndo2db isn't started . >>> >>> When I start ndo2db, nagios blocks. >>> >>> When ndo2db is started , nagios connect to ndo2db: >>> >>> [1275478646] ndomod: Successfully reconnected to data sink! 113098 >>> items lost, 5000 queued items to flush. >>> >>> I see this activity on the process nagios with strace: >>> >>> [pid 18603] write(7, "\n400:\n4=1275477698.622212\n174=MAW"..., 588 >>> >>> [pid 18604]<... poll resumed> ) = 0 (Timeout) >>> [pid 18604] poll([{fd=4, events=POLLIN}], 1, 500) = 0 (Timeout) >>> [pid 18604] poll([{fd=4, events=POLLIN}], 1, 500) = 0 (Timeout) >>> [pid 18604] poll([{fd=4, events=POLLIN}], 1, 500) = 0 (Timeout) >>> [pid 18604] poll([{fd=4, events=POLLIN}], 1, 500) = 0 (Timeout) >>> >>> Before the electrical crash , all is ok. >>> >>> I use : >>> >>> - nagios 3.0.6 >>> - NDOMOD 1.4b7 (10-31-2007) >>> >>> Many thanks for any help / information >>> >>> >>> >>> >>> ------------------------------------------------------------------------------ >>> >>> _______________________________________________ >>> Nagios-users mailing list >>> Nagios-users at lists.sourceforge.net >>> https://lists.sourceforge.net/lists/listinfo/nagios-users >>> ::: Please include Nagios version, plugin version (-v) and OS when >>> reporting any issue. >>> ::: Messages without supporting info will risk being sent to /dev/null >>> >>> >>> ------------------------------------------------------------------------ >>> >>> ------------------------------------------------------------------------------ >>> >>> >>> >>> ------------------------------------------------------------------------ >>> >>> _______________________________________________ >>> Nagios-users mailing list >>> Nagios-users at lists.sourceforge.net >>> https://lists.sourceforge.net/lists/listinfo/nagios-users >>> ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. >>> ::: Messages without supporting info will risk being sent to /dev/null >> >> ------------------------------------------------------------------------------ >> >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. >> ::: Messages without supporting info will risk being sent to /dev/null > > ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ccoager at davisvision.com Thu Jun 3 15:52:25 2010 From: ccoager at davisvision.com (Cory Coager) Date: Thu, 03 Jun 2010 09:52:25 -0400 Subject: clustered solution, how to edit configs? Message-ID: <28154_1275573145_4C07B399_28154_1170_1_4C07B399.80804@davisvision.com> I'm planning on creating a nagios cluster using the following guidelines from http://nagios.sourceforge.net/docs/1_0/distributed.html . I was thinking of using multiple nagios agents that use a submit_check_result script to send the results to two frontends using nsca. Here is where I'm stuck though. I want to have the same configs on every server but have random checks enabled/disabled so the load is distributed. I also want to edit all the configs from one location in one directory. I don't want to search for service checks in multiple directories, e.g., where is service check23? Is it in folder2 or folder13? How can I accomplish this? Does someone have a solution already or do I need to do something creative? I was thinking I could edit all the configs from one location, submit them to a version control repository, the agents automatically download new configs from the repository via crontab and nagios is reload if and only if they pass sanity checks. Thats a nice start but I'm still stuck on, how do I distribute the load? How can I specify which checks are enabled on serverA while the rest are disabled with my requirements that I stated earlier. I don't mind scripting a solution to make this possible but I need some creative ideas. Hoping someone can help me out! ~Cory Coager ------------------------------------------------------------------------ The information contained in this communication is intended only for the use of the recipient(s) named above. It may contain information that is privileged or confidential, and may be protected by State and/or Federal Regulations. If the reader of this message is not the intended recipient, you are hereby notified that any dissemination, distribution, or copying of this communication, or any of its contents, is strictly prohibited. If you have received this communication in error, please return it to the sender immediately and delete the original message and any copy of it from your computer system. If you have any questions concerning this message, please contact the sender. ------------------------------------------------------------------------ ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Steven.Morrey at OneNeck.com Thu Jun 3 19:56:47 2010 From: Steven.Morrey at OneNeck.com (Morrey, Steven) Date: Thu, 3 Jun 2010 10:56:47 -0700 Subject: clustered solution, how to edit configs? In-Reply-To: <28154_1275573145_4C07B399_28154_1170_1_4C07B399.80804@davisvision.com> References: <28154_1275573145_4C07B399_28154_1170_1_4C07B399.80804@davisvision.com> Message-ID: <8AB820857C43014DABCEBE7D42E9F6751A88F6EC37@ONEWS06.oneneck.corp> Have you looked into DNX? -----Original Message----- From: Cory Coager [mailto:ccoager at davisvision.com] Sent: Thursday, June 03, 2010 6:52 AM To: Nagios Users List Subject: [Nagios-users] clustered solution, how to edit configs? I'm planning on creating a nagios cluster using the following guidelines from http://nagios.sourceforge.net/docs/1_0/distributed.html . I was thinking of using multiple nagios agents that use a submit_check_result script to send the results to two frontends using nsca. Here is where I'm stuck though. I want to have the same configs on every server but have random checks enabled/disabled so the load is distributed. I also want to edit all the configs from one location in one directory. I don't want to search for service checks in multiple directories, e.g., where is service check23? Is it in folder2 or folder13? How can I accomplish this? Does someone have a solution already or do I need to do something creative? I was thinking I could edit all the configs from one location, submit them to a version control repository, the agents automatically download new configs from the repository via crontab and nagios is reload if and only if they pass sanity checks. Thats a nice start but I'm still stuck on, how do I distribute the load? How can I specify which checks are enabled on serverA while the rest are disabled with my requirements that I stated earlier. I don't mind scripting a solution to make this possible but I need some creative ideas. Hoping someone can help me out! ~Cory Coager ------------------------------------------------------------------------ The information contained in this communication is intended only for the use of the recipient(s) named above. It may contain information that is privileged or confidential, and may be protected by State and/or Federal Regulations. If the reader of this message is not the intended recipient, you are hereby notified that any dissemination, distribution, or copying of this communication, or any of its contents, is strictly prohibited. If you have received this communication in error, please return it to the sender immediately and delete the original message and any copy of it from your computer system. If you have any questions concerning this message, please contact the sender. ------------------------------------------------------------------------ ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null Privileged/Confidential Information may be contained in this message or attachments hereto. Please advise immediately if you or your employer do not consent to Internet email for messages of this kind. Opinions, conclusions and other information in this message that do not relate to the official business of this company shall be understood as neither given nor endorsed by it. ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Steven.Morrey at OneNeck.com Thu Jun 3 19:58:27 2010 From: Steven.Morrey at OneNeck.com (Morrey, Steven) Date: Thu, 3 Jun 2010 10:58:27 -0700 Subject: clustered solution, how to edit configs? In-Reply-To: <28154_1275573145_4C07B399_28154_1170_1_4C07B399.80804@davisvision.com> References: <28154_1275573145_4C07B399_28154_1170_1_4C07B399.80804@davisvision.com> Message-ID: <8AB820857C43014DABCEBE7D42E9F6751A88F6EC3A@ONEWS06.oneneck.corp> Have you looked into DNX? Also I'm not positive but I think those instructions are for the 1.0 branch of Nagios which is relatively ancient. You should if at all possible upgrade to Nagios 3.x branch as soon as possible. -----Original Message----- From: Cory Coager [mailto:ccoager at davisvision.com] Sent: Thursday, June 03, 2010 6:52 AM To: Nagios Users List Subject: [Nagios-users] clustered solution, how to edit configs? I'm planning on creating a nagios cluster using the following guidelines from http://nagios.sourceforge.net/docs/1_0/distributed.html . I was thinking of using multiple nagios agents that use a submit_check_result script to send the results to two frontends using nsca. Here is where I'm stuck though. I want to have the same configs on every server but have random checks enabled/disabled so the load is distributed. I also want to edit all the configs from one location in one directory. I don't want to search for service checks in multiple directories, e.g., where is service check23? Is it in folder2 or folder13? How can I accomplish this? Does someone have a solution already or do I need to do something creative? I was thinking I could edit all the configs from one location, submit them to a version control repository, the agents automatically download new configs from the repository via crontab and nagios is reload if and only if they pass sanity checks. Thats a nice start but I'm still stuck on, how do I distribute the load? How can I specify which checks are enabled on serverA while the rest are disabled with my requirements that I stated earlier. I don't mind scripting a solution to make this possible but I need some creative ideas. Hoping someone can help me out! ~Cory Coager ------------------------------------------------------------------------ The information contained in this communication is intended only for the use of the recipient(s) named above. It may contain information that is privileged or confidential, and may be protected by State and/or Federal Regulations. If the reader of this message is not the intended recipient, you are hereby notified that any dissemination, distribution, or copying of this communication, or any of its contents, is strictly prohibited. If you have received this communication in error, please return it to the sender immediately and delete the original message and any copy of it from your computer system. If you have any questions concerning this message, please contact the sender. ------------------------------------------------------------------------ ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null Privileged/Confidential Information may be contained in this message or attachments hereto. Please advise immediately if you or your employer do not consent to Internet email for messages of this kind. Opinions, conclusions and other information in this message that do not relate to the official business of this company shall be understood as neither given nor endorsed by it. ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From td3201 at gmail.com Thu Jun 3 20:28:49 2010 From: td3201 at gmail.com (Terry) Date: Thu, 3 Jun 2010 13:28:49 -0500 Subject: check_yum issue Message-ID: Hello, I am trying to use check_yum: http://exchange.nagios.org/directory/Plugins/Uncategorized/Operating-Systems/Linux/Check_Yum/details It works great from the command line: [root at foo ~]# yum --security check-update Loaded plugins: dellsysid, rhnplugin, security Limiting package lists to security relevant ones Needed 4 of 11 packages, for security rhn-check.noarch 0.4.20-33.el5_5.2 rhel-x86_64-server-5 rhn-client-tools.noarch 0.4.20-33.el5_5.2 rhel-x86_64-server-5 rhn-setup.noarch 0.4.20-33.el5_5.2 rhel-x86_64-server-5 rhn-setup-gnome.noarch 0.4.20-33.el5_5.2 rhel-x86_64-server-5 [root at foo ~]# /usr/lib64/nagios/plugins/check_yum YUM CRITICAL: 4 Security Updates Available. 7 Non-Security Updates Available [root at foo ~]# echo $? 2 It returns this from nagios: [root at foo ~]# /usr/lib64/nagios/plugins/check_nrpe -H 10.0.0.2 -t 50 -c check_yum YUM OK: 0 Security Updates Available Here's my NRPE configuration: [root at bar ~]# cat /etc/nagios/nrpe.cfg | grep check_yum command[check_yum]=/usr/lib64/nagios/plugins/check_yum What am I missing here? ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From td3201 at gmail.com Thu Jun 3 20:40:19 2010 From: td3201 at gmail.com (Terry) Date: Thu, 3 Jun 2010 13:40:19 -0500 Subject: check_yum issue In-Reply-To: References: Message-ID: On Thu, Jun 3, 2010 at 1:28 PM, Terry wrote: > Hello, > > I am trying to use check_yum: > http://exchange.nagios.org/directory/Plugins/Uncategorized/Operating-Systems/Linux/Check_Yum/details > > It works great from the command line: > [root at foo ~]# yum --security check-update > Loaded plugins: dellsysid, rhnplugin, security > Limiting package lists to security relevant ones > Needed 4 of 11 packages, for security > > rhn-check.noarch > ? ? ? ? ? ? ? ? ? ? ? ? 0.4.20-33.el5_5.2 > ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? rhel-x86_64-server-5 > rhn-client-tools.noarch > ? ? ? ? ? ? ? ? ? ? ? ? 0.4.20-33.el5_5.2 > ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? rhel-x86_64-server-5 > rhn-setup.noarch > ? ? ? ? ? ? ? ? ? ? ? ? 0.4.20-33.el5_5.2 > ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? rhel-x86_64-server-5 > rhn-setup-gnome.noarch > ? ? ? ? ? ? ? ? ? ? ? ? 0.4.20-33.el5_5.2 > ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? rhel-x86_64-server-5 > [root at foo ~]# /usr/lib64/nagios/plugins/check_yum > YUM CRITICAL: 4 Security Updates Available. 7 Non-Security Updates Available > [root at foo ~]# echo $? > 2 > > It returns this from nagios: > [root at foo ~]# /usr/lib64/nagios/plugins/check_nrpe -H 10.0.0.2 -t 50 > -c check_yum > YUM OK: 0 Security Updates Available > > Here's my NRPE configuration: > [root at bar ~]# cat /etc/nagios/nrpe.cfg | grep check_yum > ? ? ? ?command[check_yum]=/usr/lib64/nagios/plugins/check_yum > > What am I missing here? > I think I fail here. This is a permissions issue as noted in the description of the plugin. Anyone doing something similar? If so, how is your solution architected? Thanks! ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gevery at gmail.com Thu Jun 3 20:38:07 2010 From: gevery at gmail.com (Gary Every) Date: Thu, 3 Jun 2010 11:38:07 -0700 Subject: check_yum issue In-Reply-To: References: Message-ID: It appears that root has permissions but whatever user the nagios instance is running doesn't. Start there g.; On Thu, Jun 3, 2010 at 11:28 AM, Terry wrote: > Hello, > > I am trying to use check_yum: > > http://exchange.nagios.org/directory/Plugins/Uncategorized/Operating-Systems/Linux/Check_Yum/details > > It works great from the command line: > [root at foo ~]# yum --security check-update > Loaded plugins: dellsysid, rhnplugin, security > Limiting package lists to security relevant ones > Needed 4 of 11 packages, for security > > rhn-check.noarch > 0.4.20-33.el5_5.2 > rhel-x86_64-server-5 > rhn-client-tools.noarch > 0.4.20-33.el5_5.2 > rhel-x86_64-server-5 > rhn-setup.noarch > 0.4.20-33.el5_5.2 > rhel-x86_64-server-5 > rhn-setup-gnome.noarch > 0.4.20-33.el5_5.2 > rhel-x86_64-server-5 > [root at foo ~]# /usr/lib64/nagios/plugins/check_yum > YUM CRITICAL: 4 Security Updates Available. 7 Non-Security Updates > Available > [root at foo ~]# echo $? > 2 > > It returns this from nagios: > [root at foo ~]# /usr/lib64/nagios/plugins/check_nrpe -H 10.0.0.2 -t 50 > -c check_yum > YUM OK: 0 Security Updates Available > > Here's my NRPE configuration: > [root at bar ~]# cat /etc/nagios/nrpe.cfg | grep check_yum > command[check_yum]=/usr/lib64/nagios/plugins/check_yum > > What am I missing here? > > > ------------------------------------------------------------------------------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Gary Every "Pay it Forward!" -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From michael.friedrich at univie.ac.at Thu Jun 3 20:43:38 2010 From: michael.friedrich at univie.ac.at (Michael Friedrich) Date: Thu, 03 Jun 2010 20:43:38 +0200 Subject: R: Ndoutils block nagios In-Reply-To: <4C074DF5.7010101@barakat.fr> References: <4C0676D8.5080305@barakat.fr> <4C06CD51.8000609@univie.ac.at> <4C074DF5.7010101@barakat.fr> Message-ID: <4C07F7DA.9000707@univie.ac.at> On 2010-06-03 08:38, Abdessamad BARAKAT wrote: > Before this crash , all works like a charm. Do you think I really need > to upgrade ? > I would just recommend it on performance purpose but if it worked for you, stay at 1.4b7 - 1.4b8 is shipping with several bugs which are not all resolved in 1.4b9 (from what I can tell looking at the code and comparing to Icinga IDOUtils). Regarding your problem I would run a sanity check on the database, checking if there's anything broken and ndo2db just fails on inserting data (long lasting connections to the rdbms slow down the overall sequential inserts). You might also use a little trick to remove the 5k items from buffer - you can set that in your ndomod.cfg - set it to 1, restart the core, change back and restart again. Never tried that, but at least those buffered items should be dropped out of the way. But I think the electrical incident harmed sth else, like the rdbms. Kind regards, Michael > > Michael Friedrich a ?crit : > >> Abdessamad BARAKAT wrote: >> >>> buffer_file=/var/cache/nagios3/ndoutils_mod.tmp >>> >> Try to remove this from disk in order to drop the waiting 5k items being >> put on the socket. >> >> Having a fresh startup of nagios and ndoutils, the config waits to be >> written, before some more realtime/config cleanups and after that, the >> historical cleanups leading from ndo2db.cfg might also interfere the >> normal insert/update procedure. this is when the socket gets blocking >> and ndomod is buffed with data it can't even send to the socket. >> >> Regarding 1.4b7 - consider getting the latest cvs head and patch the >> unique constraint bugfix on nagios-devel yourself onto mysql.sql >> >> It's not "the latest and greatest" ndoutils, just "the latest" though. >> >> Kind regards, >> Michael >> >> >>> file_rotation_interval=14400 >>> file_rotation_timeout=60 >>> reconnect_interval=15 >>> reconnect_warning_interval=15 >>> data_processing_options=-1 >>> config_output_options=3 >>> >>> >>> Thanks again for your help >>> >>> Bertinelli Massimo a ?crit : >>> >>>> I risolve the same problem using this method: >>>> stop ndo and nagios >>>> deleting the file ndo.sock on nagios/var directory >>>> Start ndo and nagios >>>> Bye >>>> Max >>>> >>>> ----- Messaggio originale ----- >>>> Da: Abdessamad BARAKAT >>>> A: nagios-users at lists.sourceforge.net >>>> Inviato: Wed Jun 02 13:42:12 2010 >>>> Oggetto: [Nagios-users] Ndoutils block nagios >>>> >>>> Hi, >>>> >>>> After a electrical crash, I have a problem with ndoutils, nagios start >>>> and works correctly when ndo2db isn't started . >>>> >>>> When I start ndo2db, nagios blocks. >>>> >>>> When ndo2db is started , nagios connect to ndo2db: >>>> >>>> [1275478646] ndomod: Successfully reconnected to data sink! 113098 >>>> items lost, 5000 queued items to flush. >>>> >>>> I see this activity on the process nagios with strace: >>>> >>>> [pid 18603] write(7, "\n400:\n4=1275477698.622212\n174=MAW"..., 588 >>>> >>>> [pid 18604]<... poll resumed> ) = 0 (Timeout) >>>> [pid 18604] poll([{fd=4, events=POLLIN}], 1, 500) = 0 (Timeout) >>>> [pid 18604] poll([{fd=4, events=POLLIN}], 1, 500) = 0 (Timeout) >>>> [pid 18604] poll([{fd=4, events=POLLIN}], 1, 500) = 0 (Timeout) >>>> [pid 18604] poll([{fd=4, events=POLLIN}], 1, 500) = 0 (Timeout) >>>> >>>> Before the electrical crash , all is ok. >>>> >>>> I use : >>>> >>>> - nagios 3.0.6 >>>> - NDOMOD 1.4b7 (10-31-2007) >>>> >>>> Many thanks for any help / information >>>> >>>> >>>> >>>> >>>> ------------------------------------------------------------------------------ >>>> >>>> _______________________________________________ >>>> Nagios-users mailing list >>>> Nagios-users at lists.sourceforge.net >>>> https://lists.sourceforge.net/lists/listinfo/nagios-users >>>> ::: Please include Nagios version, plugin version (-v) and OS when >>>> reporting any issue. >>>> ::: Messages without supporting info will risk being sent to /dev/null >>>> >>>> >>>> ------------------------------------------------------------------------ >>>> >>>> ------------------------------------------------------------------------------ >>>> >>>> >>>> >>>> ------------------------------------------------------------------------ >>>> >>>> _______________________________________________ >>>> Nagios-users mailing list >>>> Nagios-users at lists.sourceforge.net >>>> https://lists.sourceforge.net/lists/listinfo/nagios-users >>>> ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. >>>> ::: Messages without supporting info will risk being sent to /dev/null >>>> >>> ------------------------------------------------------------------------------ >>> >>> _______________________________________________ >>> Nagios-users mailing list >>> Nagios-users at lists.sourceforge.net >>> https://lists.sourceforge.net/lists/listinfo/nagios-users >>> ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. >>> ::: Messages without supporting info will risk being sent to /dev/null >>> >> >> > > ------------------------------------------------------------------------------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From raanders at cyber-office.net Fri Jun 4 00:38:48 2010 From: raanders at cyber-office.net (Roderick A. Anderson) Date: Thu, 03 Jun 2010 15:38:48 -0700 Subject: upgrade from 3.0.6 to 3.2.1 using rpmforge (yum) Message-ID: <4C082EF8.5000607@cyber-office.net> Are there any gotchas I need to look out for when upgrading via yum? CentOS 5.3 (man I better get the server updated.) I see a message in January about installing on CentOS and the recommendation of using the EPEL repo. Rpmforge, EPEL? Any advantages to either? TIA, Rod -- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jcasale at activenetwerx.com Fri Jun 4 02:51:21 2010 From: jcasale at activenetwerx.com (Joseph L. Casale) Date: Fri, 4 Jun 2010 00:51:21 +0000 Subject: upgrade from 3.0.6 to 3.2.1 using rpmforge (yum) In-Reply-To: <4C082EF8.5000607@cyber-office.net> References: <4C082EF8.5000607@cyber-office.net> Message-ID: >Are there any gotchas I need to look out for when upgrading via yum? Backup your configs, but the rpms from rpmforge will simply create .new files instead of clobbering your old files, then you can merge/rewrite anything needed. >I see a message in January about installing on CentOS and the >recommendation of using the EPEL repo. Rpmforge, EPEL? Any advantages >to either? Epel's nagios version is ancient, it only has useful plugins. One thing to keep in mind is the Base repo config in CentOS points to a "5" on the mirrors which is symlinked to the latest version. Continuing to use an old version like 5.3 without editing your repo cnofigs to point to 5.3 causes you to mix 5.5 rpms into your 5.3 installation which depending on what you install can be a problem... Hth, jlc ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From raanders at cyber-office.net Fri Jun 4 03:06:33 2010 From: raanders at cyber-office.net (Roderick A. Anderson) Date: Thu, 03 Jun 2010 18:06:33 -0700 Subject: upgrade from 3.0.6 to 3.2.1 using rpmforge (yum) In-Reply-To: References: <4C082EF8.5000607@cyber-office.net> Message-ID: <4C085199.6090709@cyber-office.net> Joseph L. Casale wrote: >> Are there any gotchas I need to look out for when upgrading via yum? > > Backup your configs, but the rpms from rpmforge will simply create .new > files instead of clobbering your old files, then you can merge/rewrite > anything needed. Thanks. I was thinking any and everything with "nagios" in the name but the config dirs/subdirs makes the most amount of sense. >> I see a message in January about installing on CentOS and the >> recommendation of using the EPEL repo. Rpmforge, EPEL? Any advantages >> to either? I put the EPEL repo on to the system then ran a "yum search nagios". Rpmforge should up but EPEL didn't I guess I should try an "yum info" for fun. > > Epel's nagios version is ancient, it only has useful plugins. I'll keep that in mind. > One thing to keep in mind is the Base repo config in CentOS points to > a "5" on the mirrors which is symlinked to the latest version. Continuing > to use an old version like 5.3 without editing your repo cnofigs to point > to 5.3 causes you to mix 5.5 rpms into your 5.3 installation which > depending on what you install can be a problem... Thanks again. \\||/ Rod -- > > Hth, > jlc > > ------------------------------------------------------------------------------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jcasale at activenetwerx.com Fri Jun 4 03:17:21 2010 From: jcasale at activenetwerx.com (Joseph L. Casale) Date: Fri, 4 Jun 2010 01:17:21 +0000 Subject: upgrade from 3.0.6 to 3.2.1 using rpmforge (yum) In-Reply-To: <4C085199.6090709@cyber-office.net> References: <4C082EF8.5000607@cyber-office.net> <4C085199.6090709@cyber-office.net> Message-ID: >I put the EPEL repo on to the system then ran a "yum search nagios". >Rpmforge should up but EPEL didn't I guess I should try an "yum info" >for fun. Keep in mind anytime you add a 3rd party repo, you should use a yum plugin like priorities to keep Base protected. http://wiki.centos.org/AdditionalResources/Repositories/RPMForge ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Fri Jun 4 10:37:15 2010 From: ae at op5.se (Andreas Ericsson) Date: Fri, 04 Jun 2010 10:37:15 +0200 Subject: clustered solution, how to edit configs? In-Reply-To: <28154_1275573145_4C07B399_28154_1170_1_4C07B399.80804@davisvision.com> References: <28154_1275573145_4C07B399_28154_1170_1_4C07B399.80804@davisvision.com> Message-ID: <4C08BB3B.80408@op5.se> On 06/03/2010 03:52 PM, Cory Coager wrote: > I'm planning on creating a nagios cluster using the following guidelines > from http://nagios.sourceforge.net/docs/1_0/distributed.html . I was > thinking of using multiple nagios agents that use a submit_check_result > script to send the results to two frontends using nsca. Here is where > I'm stuck though. > > I want to have the same configs on every server but have random checks > enabled/disabled so the load is distributed. I also want to edit all > the configs from one location in one directory. I don't want to search > for service checks in multiple directories, e.g., where is service > check23? Is it in folder2 or folder13? > > How can I accomplish this? Does someone have a solution already or do I > need to do something creative? There is DNX and there is Merlin. Both will sort of do what you want. If you use merlin, make sure you get it directly from git and use the 'next' branch. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace. ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From clee.hk at gmail.com Fri Jun 4 12:28:45 2010 From: clee.hk at gmail.com (Chris Lee) Date: Fri, 4 Jun 2010 18:28:45 +0800 Subject: Is this a nagios configuration issue? Message-ID: Dear All, I am using Nagios Core Version 3.2.1 on Fedora 12. I found the following message that in the system log file (/var/log/messages): Jun 2 12:10:14 fda12 setroubleshoot: SELinux is preventing /bin/ping access to a leaked /usr/local/nagios/var/spool/checkresults/check43t3S5 file descriptor. For complete SELinux messages. run sealert -l 5af60ca2-178e-40da-ac17-5b6fbb72db15 Is there anyone who has encountered this kind of messages before? I am wondering if this is the nagios configuration issue or it is the Fedora configuration issue. Regards, -- Chris ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ravishankar.gundlapali at wipro.com Sat Jun 5 04:52:11 2010 From: ravishankar.gundlapali at wipro.com (ravishankar.gundlapali at wipro.com) Date: Sat, 5 Jun 2010 08:22:11 +0530 Subject: Nagios not sending email notifications.. In-Reply-To: <3FEFD82C77911D45A5465CEEBEAC23CDCC0B0BDB@PNE-HJN-MBX02.wipro.com> References: <3FEFD82C77911D45A5465CEEBEAC23CDCC0B0BDB@PNE-HJN-MBX02.wipro.com> Message-ID: Hi , I am using Nagios 3.0.6 on Linux with Fedora installed on it. My Nagios suddenly stopped sending email notifications .When I tried sending manually using the command 'echo "subject" | mail -s "message" contactemail at whatever ' I am not receiving any email. Please someone guide me on this... Here is my service template. # Generic service definition template - This is NOT a real service, just a template! define service{ name generic-service ; The 'name' of this service template active_checks_enabled 1 ; Active service checks are enabled passive_checks_enabled 1 ; Passive service checks are enabled/accepted parallelize_check 1 ; Active service checks should be parallelized (disabling this can lead to major performance problems) obsess_over_service 1 ; We should obsess over this service (if necessary) check_freshness 0 ; Default is to NOT check service 'freshness' notifications_enabled 1 ; Service notifications are enabled event_handler_enabled 1 ; Service event handler is enabled flap_detection_enabled 1 ; Flap detection is enabled failure_prediction_enabled 1 ; Failure prediction is enabled process_perf_data 1 ; Process performance data retain_status_information 1 ; Retain status information across program restarts retain_nonstatus_information 1 ; Retain non-status information across program restarts is_volatile 0 ; The service is not volatile check_period 24x7 ; The service can be checked at any time of the day max_check_attempts 2 ; Re-check the service up to 3 times in order to determine its final (hard) state normal_check_interval 5 ; Check the service every 10 minutes under normal conditions retry_check_interval 1 ; Re-check the service every two minutes until a hard state can be determined # contact_groups stw_win_server,stw_network_devices,stw_app_server ; Notifications get sent out to everyone in the 'admins' group notification_options w,u,c ; Send notifications about warning, unknown, critical, and recovery events notification_interval 30 ; Re-notify about service problems every hour notification_period 24x7 ; Notifications can be sent out at any time register 0 ; DONT REGISTER THIS DEFINITION - ITS NOT A REAL SERVICE, JUST A TEMPLATE! } Thanks & Regards, Ravi G -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From asam30 at gmail.com Sat Jun 5 07:52:39 2010 From: asam30 at gmail.com (asam30 at gmail.com) Date: Sat, 5 Jun 2010 08:52:39 +0300 Subject: Nagios not sending email notifications.. In-Reply-To: References: <3FEFD82C77911D45A5465CEEBEAC23CDCC0B0BDB@PNE-HJN-MBX02.wipro.com> Message-ID: Nagios uses sendmail MTA to send emails out, please check your sendmail configuration and troubleshoot it. check to see if any emails in the queue On Sat, Jun 5, 2010 at 5:52 AM, wrote: > Hi , > > > > I am using Nagios 3.0.6 on Linux with Fedora installed on it. > > > > My Nagios suddenly stopped sending email notifications .When I tried sending manually using the command ?echo "subject" | mail -s "message" contactemail at whatever ? I am not receiving any email. > > > > Please someone guide me on this? > > > > Here is my service template. > > > > # Generic service definition template - This is NOT a real service, just a > template! > > > > define service{ > > name generic-service ; The > 'name' of this service template > > active_checks_enabled 1 ; Active > service checks are enabled > > passive_checks_enabled 1 ; Passive > service checks are enabled/accepted > > parallelize_check 1 ; Active > service checks should be parallelized (disabling this can lead to major > performance problems) > > obsess_over_service 1 ; We should > obsess over this service (if necessary) > > check_freshness 0 ; Default > is to NOT check service 'freshness' > > notifications_enabled 1 ; Service > notifications are enabled > > event_handler_enabled 1 ; Service > event handler is enabled > > flap_detection_enabled 1 ; Flap > detection is enabled > > failure_prediction_enabled 1 ; Failure > prediction is enabled > > process_perf_data 1 ; Process > performance data > > retain_status_information 1 ; Retain > status information across program restarts > > retain_nonstatus_information 1 ; Retain > non-status information across program restarts > > is_volatile 0 ; The > service is not volatile > > check_period 24x7 ; The > service can be checked at any time of the day > > max_check_attempts 2 ; Re-check > the service up to 3 times in order to determine its final (hard) state > > normal_check_interval 5 ; Check the > service every 10 minutes under normal conditions > > retry_check_interval 1 ; Re-check > the service every two minutes until a hard state can be determined > > # contact_groups > stw_win_server,stw_network_devices,stw_app_server ; Notifications get > sent out to everyone in the 'admins' group > > notification_options w,u,c ; Send > notifications about warning, unknown, critical, and recovery events > > notification_interval 30 ; Re-notify > about service problems every hour > > notification_period 24x7 ; > Notifications can be sent out at any time > > register 0 ; DONT > REGISTER THIS DEFINITION - ITS NOT A REAL SERVICE, JUST A TEMPLATE! > > } > > > > Thanks & Regards, > > Ravi G > > > ------------------------------------------------------------------------------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Shankar Asam -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ravishankar.gundlapali at wipro.com Sat Jun 5 08:03:41 2010 From: ravishankar.gundlapali at wipro.com (ravishankar.gundlapali at wipro.com) Date: Sat, 5 Jun 2010 11:33:41 +0530 Subject: Nagios not sending email notifications.. In-Reply-To: References: <3FEFD82C77911D45A5465CEEBEAC23CDCC0B0BDB@PNE-HJN-MBX02.wipro.com> Message-ID: Hi, I am able to see mail sent in /var/log/maillog. Queued mail for delivery Sendmail configuration is also fine. I am not able to figure out where is the problem. Thanks & Regards, Ravi G Wipro Support - STW - Wipro Technologies - India VOIP - 8540803, Desk : +91-20-39105657 , Cell - +91 9503029658 P please consider the environment - do you really need to print this email? From: asam30 at gmail.com [mailto:asam30 at gmail.com] Sent: Saturday, June 05, 2010 11:23 AM To: Nagios Users List Subject: Re: [Nagios-users] Nagios not sending email notifications.. Nagios uses sendmail MTA to send emails out, please check your sendmail configuration and troubleshoot it. check to see if any emails in the queue On Sat, Jun 5, 2010 at 5:52 AM, wrote: Hi , I am using Nagios 3.0.6 on Linux with Fedora installed on it. My Nagios suddenly stopped sending email notifications .When I tried sending manually using the command 'echo "subject" | mail -s "message" contactemail at whatever ' I am not receiving any email. Please someone guide me on this... Here is my service template. # Generic service definition template - This is NOT a real service, just a template! define service{ name generic-service ; The 'name' of this service template active_checks_enabled 1 ; Active service checks are enabled passive_checks_enabled 1 ; Passive service checks are enabled/accepted parallelize_check 1 ; Active service checks should be parallelized (disabling this can lead to major performance problems) obsess_over_service 1 ; We should obsess over this service (if necessary) check_freshness 0 ; Default is to NOT check service 'freshness' notifications_enabled 1 ; Service notifications are enabled event_handler_enabled 1 ; Service event handler is enabled flap_detection_enabled 1 ; Flap detection is enabled failure_prediction_enabled 1 ; Failure prediction is enabled process_perf_data 1 ; Process performance data retain_status_information 1 ; Retain status information across program restarts retain_nonstatus_information 1 ; Retain non-status information across program restarts is_volatile 0 ; The service is not volatile check_period 24x7 ; The service can be checked at any time of the day max_check_attempts 2 ; Re-check the service up to 3 times in order to determine its final (hard) state normal_check_interval 5 ; Check the service every 10 minutes under normal conditions retry_check_interval 1 ; Re-check the service every two minutes until a hard state can be determined # contact_groups stw_win_server,stw_network_devices,stw_app_server ; Notifications get sent out to everyone in the 'admins' group notification_options w,u,c ; Send notifications about warning, unknown, critical, and recovery events notification_interval 30 ; Re-notify about service problems every hour notification_period 24x7 ; Notifications can be sent out at any time register 0 ; DONT REGISTER THIS DEFINITION - ITS NOT A REAL SERVICE, JUST A TEMPLATE! } Thanks & Regards, Ravi G ------------------------------------------------------------------------ ------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Shankar Asam -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From james at linux-source.org Sat Jun 5 10:57:38 2010 From: james at linux-source.org (James Corteciano) Date: Sat, 5 Jun 2010 16:57:38 +0800 Subject: Nagios - Retention period for logrotate Message-ID: Dear List, I am running Nagios version 3.2.0 and the archive logs are located in /var/log/nagios/archive which doing log rotation everyday. How can I configure nagios that the archive logs retention period is six (6) months? Thank you. Regards, James -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From zarrelli at linux.it Sat Jun 5 11:59:16 2010 From: zarrelli at linux.it (Giorgio Zarrelli) Date: Sat, 5 Jun 2010 11:59:16 +0200 Subject: Nagios not sending email notifications.. In-Reply-To: References: <3FEFD82C77911D45A5465CEEBEAC23CDCC0B0BDB@PNE-HJN-MBX02.wipro.com> Message-ID: <001175DF-AF3E-452B-94EA-951129DA76D4@linux.it> Hi, It's not a Nagios problem. The problem is either at os level of your Nagios server (maybe you do not reaxh your mta or something wrong in your local to remote delivery) or it's a problem om you mta not accepting emails for delivery from Nagios server. Login to your Nagios server then su - nagios_user Then issue the following command mail and check if there are any bounced mails from your mta or wathever else. Ciao, Giorgio Il giorno 05/giu/2010, alle ore 04.52, ha scritto: > Hi , > > > > I am using Nagios 3.0.6 on Linux with Fedora installed on it. > > > > My Nagios suddenly stopped sending email notifications .When I tried > sending manually using the command ?echo "subject" | mail -s "messag > e" contactemail at whatever ? I am not receiving any email. > > > Please someone guide me on this? > > > > Here is my service template. > > > > # Generic service definition template - This is NOT a real service, > just a template! > > > > define service{ > > name generic-service ; > The 'name' of this service template > > active_checks_enabled 1 ; > Active service checks are enabled > > passive_checks_enabled 1 ; > Passive service checks are enabled/accepted > > parallelize_check 1 ; > Active service checks should be parallelized (disabling this can > lead to major performance problems) > > obsess_over_service 1 ; We > should obsess over this service (if necessary) > > check_freshness 0 ; > Default is to NOT check service 'freshness' > > notifications_enabled 1 ; > Service notifications are enabled > > event_handler_enabled 1 ; > Service event handler is enabled > > flap_detection_enabled 1 ; > Flap detection is enabled > > failure_prediction_enabled 1 ; > Failure prediction is enabled > > process_perf_data 1 ; > Process performance data > > retain_status_information 1 ; > Retain status information across program restarts > > retain_nonstatus_information 1 ; > Retain non-status information across program restarts > > is_volatile 0 ; > The service is not volatile > > check_period 24x7 ; > The service can be checked at any time of the day > > max_check_attempts 2 ; Re- > check the service up to 3 times in order to determine its final > (hard) state > > normal_check_interval 5 ; > Check the service every 10 minutes under normal conditions > > retry_check_interval 1 ; Re- > check the service every two minutes until a hard state can be > determined > > # contact_groups > stw_win_server,stw_network_devices,stw_app_server ; > Notifications get sent out to everyone in the 'admins' group > > notification_options w,u,c ; > Send notifications about warning, unknown, critical, and recovery > events > > notification_interval 30 ; Re- > notify about service problems every hour > > notification_period 24x7 ; > Notifications can be sent out at any time > > register 0 ; > DONT REGISTER THIS DEFINITION - ITS NOT A REAL SERVICE, JUST A > TEMPLATE! > > } > > > > Thanks & Regards, > > Ravi G > > --- > --- > --- > --------------------------------------------------------------------- > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lists at xodus.org Sat Jun 5 14:45:22 2010 From: lists at xodus.org (Marc Powell) Date: Sat, 5 Jun 2010 07:45:22 -0500 Subject: Nagios - Retention period for logrotate In-Reply-To: References: Message-ID: On Jun 5, 2010, at 3:57 AM, James Corteciano wrote: > Dear List, > > I am running Nagios version 3.2.0 and the archive logs are located in /var/log/nagios/archive which doing log rotation everyday. How can I configure nagios that the archive logs retention period is six (6) months? There isn't a configuration option for this. Nagios will keep the log files indefinitely for reporting (you can go back 2+ years through the web interface). You can cron a simple find -exec rm to do that but be aware that you won't have any reporting capability beyond what you keep. Also, because you specifically mention it, using logrotate will break reporting in nagios. The file naming convention must be exact for nagios to find the files. -- Marc ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From xml.devel at gmail.com Sat Jun 5 15:19:22 2010 From: xml.devel at gmail.com (Kumar, Ashish) Date: Sat, 5 Jun 2010 18:49:22 +0530 Subject: Is this a nagios configuration issue? In-Reply-To: References: Message-ID: > Jun 2 12:10:14 fda12 setroubleshoot: SELinux is preventing /bin/ping > access to a leaked > /usr/local/nagios/var/spool/checkresults/check43t3S5 file descriptor. > For complete SELinux messages. run sealert -l > 5af60ca2-178e-40da-ac17-5b6fbb72db15 > > This issue is related to SELinux. If you have no use for SELinux, apparently you do not, disable it in /etc/selinux/config SELINUX=disabled and reboot the system for changes to take effect. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From robert.wolfe at robertwolfe.org Sat Jun 5 13:24:30 2010 From: robert.wolfe at robertwolfe.org (Robert Wolfe) Date: Sat, 05 Jun 2010 07:24:30 -0400 Subject: **SPAM** Re: upgrade from 3.0.6 to 3.2.1 using rpmforge (yum) In-Reply-To: References: <4C082EF8.5000607@cyber-office.net> <4C085199.6090709@cyber-office.net> Message-ID: <1435e554fa6aadb4c7a0137121479f69@mail.wolfe.local> On Fri, 4 Jun 2010 01:17:21 +0000, "Joseph L. Casale" wrote: >>I put the EPEL repo on to the system then ran a "yum search nagios". >>Rpmforge should up but EPEL didn't I guess I should try an "yum info" >>for fun. > > Keep in mind anytime you add a 3rd party repo, you should use a yum plugin > like priorities to keep Base protected. This is why for anything like this, I always upgrade from source. ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From subscription at kkeane.com Sat Jun 5 16:02:53 2010 From: subscription at kkeane.com (Kevin Keane) Date: Sat, 5 Jun 2010 07:02:53 -0700 Subject: check_yum issue In-Reply-To: References: Message-ID: You would probably want to use sudo. Instead of having NRPE call check_yum directly, have it call sudo check_yum, and add check_yum for the Nagios user to your sudoers (make sure to not require a password, of course!) Be sure to keep the sudoers entry as restrictive as possible, or you may open a security hole. -----Original Message----- From: Terry [mailto:td3201 at gmail.com] Sent: Thursday, June 03, 2010 11:40 AM To: nagios-users at lists.sourceforge.net Subject: Re: [Nagios-users] check_yum issue On Thu, Jun 3, 2010 at 1:28 PM, Terry wrote: > Hello, > > I am trying to use check_yum: > http://exchange.nagios.org/directory/Plugins/Uncategorized/Operating-S > ystems/Linux/Check_Yum/details > > It works great from the command line: > [root at foo ~]# yum --security check-update Loaded plugins: dellsysid, > rhnplugin, security Limiting package lists to security relevant ones > Needed 4 of 11 packages, for security > > rhn-check.noarch > ? ? ? ? ? ? ? ? ? ? ? ? 0.4.20-33.el5_5.2 > ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? rhel-x86_64-server-5 > rhn-client-tools.noarch > ? ? ? ? ? ? ? ? ? ? ? ? 0.4.20-33.el5_5.2 > ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? rhel-x86_64-server-5 > rhn-setup.noarch > ? ? ? ? ? ? ? ? ? ? ? ? 0.4.20-33.el5_5.2 > ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? rhel-x86_64-server-5 > rhn-setup-gnome.noarch > ? ? ? ? ? ? ? ? ? ? ? ? 0.4.20-33.el5_5.2 > ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? rhel-x86_64-server-5 > [root at foo ~]# /usr/lib64/nagios/plugins/check_yum > YUM CRITICAL: 4 Security Updates Available. 7 Non-Security Updates > Available [root at foo ~]# echo $? > 2 > > It returns this from nagios: > [root at foo ~]# /usr/lib64/nagios/plugins/check_nrpe -H 10.0.0.2 -t 50 > -c check_yum YUM OK: 0 Security Updates Available > > Here's my NRPE configuration: > [root at bar ~]# cat /etc/nagios/nrpe.cfg | grep check_yum > ? ? ? ?command[check_yum]=/usr/lib64/nagios/plugins/check_yum > > What am I missing here? > I think I fail here. This is a permissions issue as noted in the description of the plugin. Anyone doing something similar? If so, how is your solution architected? Thanks! ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From james at linux-source.org Sat Jun 5 17:52:12 2010 From: james at linux-source.org (James Corteciano) Date: Sat, 5 Jun 2010 23:52:12 +0800 Subject: Nagios - Retention period for logrotate In-Reply-To: References: Message-ID: Hi Marc, Thanks for your response. What can you suggest for this kind of scenario as my boss requirement is to have the nagios logs remain within 6 months only? At this moment, I just set "log_rotation_method=n" in nagios.cfg and I made simple logrotation script as stated below. /var/logrotation.d/nagios: /var/log/nagios/*log { daily rotate 180 compress dateext missingok notifempty olddir /var/log/nagios/archives sharedscripts postrotate /sbin/service nagios reload > /dev/null 2>/dev/null || true endscript } Would appreciate your help about this. Thanks. Regards, James On Sat, Jun 5, 2010 at 8:45 PM, Marc Powell wrote: > > On Jun 5, 2010, at 3:57 AM, James Corteciano wrote: > > > Dear List, > > > > I am running Nagios version 3.2.0 and the archive logs are located in > /var/log/nagios/archive which doing log rotation everyday. How can I > configure nagios that the archive logs retention period is six (6) months? > > There isn't a configuration option for this. Nagios will keep the log files > indefinitely for reporting (you can go back 2+ years through the web > interface). > > You can cron a simple find -exec rm to do that but be aware that you won't > have any reporting capability beyond what you keep. > > Also, because you specifically mention it, using logrotate will break > reporting in nagios. The file naming convention must be exact for nagios to > find the files. > > -- > Marc > > > > ------------------------------------------------------------------------------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From zarrelli at linux.it Sat Jun 5 18:00:52 2010 From: zarrelli at linux.it (Giorgio Zarrelli) Date: Sat, 5 Jun 2010 18:00:52 +0200 Subject: Nagios - Retention period for logrotate In-Reply-To: References: Message-ID: Better to use log rotate and have the eldest log file mailed to an "archiving" mailbox, so you can recall them as you need. Ciao, Giorgio Il giorno 05/giu/2010, alle ore 14.45, Marc Powell ha scritto: > > On Jun 5, 2010, at 3:57 AM, James Corteciano wrote: > >> Dear List, >> >> I am running Nagios version 3.2.0 and the archive logs are located >> in /var/log/nagios/archive which doing log rotation everyday. How >> can I configure nagios that the archive logs retention period is >> six (6) months? > > There isn't a configuration option for this. Nagios will keep the > log files indefinitely for reporting (you can go back 2+ years > through the web interface). > > You can cron a simple find -exec rm to do that but be aware that you > won't have any reporting capability beyond what you keep. > > Also, because you specifically mention it, using logrotate will > break reporting in nagios. The file naming convention must be exact > for nagios to find the files. > > -- > Marc > > > --- > --- > --- > --------------------------------------------------------------------- > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From abdessamad at barakat.fr Sun Jun 6 02:53:20 2010 From: abdessamad at barakat.fr (Abdessamad BARAKAT) Date: Sun, 06 Jun 2010 02:53:20 +0200 Subject: [CLOSED] Re: R: Ndoutils block nagios In-Reply-To: <4C07F7DA.9000707@univie.ac.at> References: <4C0676D8.5080305@barakat.fr> <4C06CD51.8000609@univie.ac.at> <4C074DF5.7010101@barakat.fr> <4C07F7DA.9000707@univie.ac.at> Message-ID: <4C0AF180.9050507@barakat.fr> I have created a new database and now it's ok...it's seems to block on the old database(LOCK,..), mysqlcheck on the old database show "OK" Thanks a lot for your help Michael Friedrich a ?crit : > On 2010-06-03 08:38, Abdessamad BARAKAT wrote: >> Before this crash , all works like a charm. Do you think I really need >> to upgrade ? >> > > I would just recommend it on performance purpose but if it worked for > you, stay at 1.4b7 - 1.4b8 is shipping with several bugs which are not > all resolved in 1.4b9 (from what I can tell looking at the code and > comparing to Icinga IDOUtils). > > Regarding your problem I would run a sanity check on the database, > checking if there's anything broken and ndo2db just fails on inserting > data (long lasting connections to the rdbms slow down the overall > sequential inserts). > > You might also use a little trick to remove the 5k items from buffer - > you can set that in your ndomod.cfg - set it to 1, restart the core, > change back and restart again. Never tried that, but at least those > buffered items should be dropped out of the way. > > But I think the electrical incident harmed sth else, like the rdbms. > > Kind regards, > Michael > >> Michael Friedrich a ?crit : >> >>> Abdessamad BARAKAT wrote: >>> >>>> buffer_file=/var/cache/nagios3/ndoutils_mod.tmp >>>> >>> Try to remove this from disk in order to drop the waiting 5k items being >>> put on the socket. >>> >>> Having a fresh startup of nagios and ndoutils, the config waits to be >>> written, before some more realtime/config cleanups and after that, the >>> historical cleanups leading from ndo2db.cfg might also interfere the >>> normal insert/update procedure. this is when the socket gets blocking >>> and ndomod is buffed with data it can't even send to the socket. >>> >>> Regarding 1.4b7 - consider getting the latest cvs head and patch the >>> unique constraint bugfix on nagios-devel yourself onto mysql.sql >>> >>> It's not "the latest and greatest" ndoutils, just "the latest" though. >>> >>> Kind regards, >>> Michael >>> >>> >>>> file_rotation_interval=14400 >>>> file_rotation_timeout=60 >>>> reconnect_interval=15 >>>> reconnect_warning_interval=15 >>>> data_processing_options=-1 >>>> config_output_options=3 >>>> >>>> >>>> Thanks again for your help >>>> >>>> Bertinelli Massimo a ?crit : >>>> >>>>> I risolve the same problem using this method: >>>>> stop ndo and nagios >>>>> deleting the file ndo.sock on nagios/var directory >>>>> Start ndo and nagios >>>>> Bye >>>>> Max >>>>> >>>>> ----- Messaggio originale ----- >>>>> Da: Abdessamad BARAKAT >>>>> A: nagios-users at lists.sourceforge.net >>>>> Inviato: Wed Jun 02 13:42:12 2010 >>>>> Oggetto: [Nagios-users] Ndoutils block nagios >>>>> >>>>> Hi, >>>>> >>>>> After a electrical crash, I have a problem with ndoutils, nagios start >>>>> and works correctly when ndo2db isn't started . >>>>> >>>>> When I start ndo2db, nagios blocks. >>>>> >>>>> When ndo2db is started , nagios connect to ndo2db: >>>>> >>>>> [1275478646] ndomod: Successfully reconnected to data sink! 113098 >>>>> items lost, 5000 queued items to flush. >>>>> >>>>> I see this activity on the process nagios with strace: >>>>> >>>>> [pid 18603] write(7, "\n400:\n4=1275477698.622212\n174=MAW"..., 588 >>>>> >>>>> [pid 18604]<... poll resumed> ) = 0 (Timeout) >>>>> [pid 18604] poll([{fd=4, events=POLLIN}], 1, 500) = 0 (Timeout) >>>>> [pid 18604] poll([{fd=4, events=POLLIN}], 1, 500) = 0 (Timeout) >>>>> [pid 18604] poll([{fd=4, events=POLLIN}], 1, 500) = 0 (Timeout) >>>>> [pid 18604] poll([{fd=4, events=POLLIN}], 1, 500) = 0 (Timeout) >>>>> >>>>> Before the electrical crash , all is ok. >>>>> >>>>> I use : >>>>> >>>>> - nagios 3.0.6 >>>>> - NDOMOD 1.4b7 (10-31-2007) >>>>> >>>>> Many thanks for any help / information >>>>> >>>>> >>>>> >>>>> >>>>> ------------------------------------------------------------------------------ >>>>> >>>>> _______________________________________________ >>>>> Nagios-users mailing list >>>>> Nagios-users at lists.sourceforge.net >>>>> https://lists.sourceforge.net/lists/listinfo/nagios-users >>>>> ::: Please include Nagios version, plugin version (-v) and OS when >>>>> reporting any issue. >>>>> ::: Messages without supporting info will risk being sent to /dev/null >>>>> >>>>> >>>>> ------------------------------------------------------------------------ >>>>> >>>>> ------------------------------------------------------------------------------ >>>>> >>>>> >>>>> >>>>> ------------------------------------------------------------------------ >>>>> >>>>> _______________________________________________ >>>>> Nagios-users mailing list >>>>> Nagios-users at lists.sourceforge.net >>>>> https://lists.sourceforge.net/lists/listinfo/nagios-users >>>>> ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. >>>>> ::: Messages without supporting info will risk being sent to /dev/null >>>>> >>>> ------------------------------------------------------------------------------ >>>> >>>> _______________________________________________ >>>> Nagios-users mailing list >>>> Nagios-users at lists.sourceforge.net >>>> https://lists.sourceforge.net/lists/listinfo/nagios-users >>>> ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. >>>> ::: Messages without supporting info will risk being sent to /dev/null >>>> >>> >> ------------------------------------------------------------------------------ >> ThinkGeek and WIRED's GeekDad team up for the Ultimate >> GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the >> lucky parental unit. See the prize list and enter to win: >> http://p.sf.net/sfu/thinkgeek-promo >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. >> ::: Messages without supporting info will risk being sent to /dev/null > > > ------------------------------------------------------------------------------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From standalone.sysadmin at gmail.com Sun Jun 6 04:26:27 2010 From: standalone.sysadmin at gmail.com (Matt Simmons) Date: Sat, 5 Jun 2010 22:26:27 -0400 Subject: **SPAM** Re: upgrade from 3.0.6 to 3.2.1 using rpmforge (yum) In-Reply-To: <1435e554fa6aadb4c7a0137121479f69@mail.wolfe.local> References: <4C082EF8.5000607@cyber-office.net> <4C085199.6090709@cyber-office.net> <1435e554fa6aadb4c7a0137121479f69@mail.wolfe.local> Message-ID: If you're in the habit of compiling things from source on a distro that uses package management, it's not too hard to set up your own internal repository. With CentOS, it's farcically easy. The "hard part" is making an RPM, and even that just takes a little time to figure out. IBM's guide to packaging RPMs http://www.ibm.com/developerworks/library/l-rpm1/ Example spec file http://kmymoney2.sourceforge.net/phb/rpm-example.html Creating a local YUM repository http://www.g-loaded.eu/2005/12/11/local-yum-repository/ --Matt On Sat, Jun 5, 2010 at 7:24 AM, Robert Wolfe wrote: > On Fri, 4 Jun 2010 01:17:21 +0000, "Joseph L. Casale" > wrote: >>>I put the EPEL repo on to the system then ran a "yum search nagios". >>>Rpmforge should up but EPEL didn't ?I guess I should try an "yum info" >>>for fun. >> >> Keep in mind anytime you add a 3rd party repo, you should use a yum > plugin >> like priorities to keep Base protected. > > This is why for anything like this, I always upgrade from source. > > > > ------------------------------------------------------------------------------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. ?See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- LITTLE GIRL: But which cookie will you eat FIRST? COOKIE MONSTER: Me think you have misconception of cookie-eating process. ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lists at xodus.org Sun Jun 6 16:22:16 2010 From: lists at xodus.org (Marc Powell) Date: Sun, 6 Jun 2010 09:22:16 -0500 Subject: Nagios - Retention period for logrotate In-Reply-To: References: Message-ID: <4D2CAA2F-F63D-4024-942D-BD4B5E2A92C0@xodus.org> On Jun 5, 2010, at 10:52 AM, James Corteciano wrote: > Hi Marc, > > Thanks for your response. What can you suggest for this kind of scenario as my boss requirement is to have the nagios logs remain within 6 months only? The same as I suggested before; using find to remove log files older than 6 months in the archive directory. -- Marc ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From f.hugh at comcast.net Sun Jun 6 17:17:06 2010 From: f.hugh at comcast.net (PRP) Date: Sun, 6 Jun 2010 10:17:06 -0500 Subject: Nagios and a custom webinject.pl In-Reply-To: <1058556432.2421891275066268474.JavaMail.root@sz0051a.emeryville.ca.mail.comcast.net> References: <1447110886.2421781275066253372.JavaMail.root@sz0051a.emeryville.ca.mail.comcast.net> <1058556432.2421891275066268474.JavaMail.root@sz0051a.emeryville.ca.mail.comcast.net> Message-ID: <001d01cb058b$54222cf0$fc6686d0$@comcast.net> Well after some work I found a few issues with the webinject.pl script that I introduced. Anyone new to this world like me if you need to run tests as the nagios user use "sudo su nagios" That will give you a shell running under the nagios user. That took me some time to figure that out. Unfortunately, that did not solve my problem. I can run my script under the nagios user without issue, but if I allow the service to run it in the same nature, it fails. I am perplexed so if anyone has any ideas I would greatly appreciate it. -----Original Message----- From: f.hugh at comcast.net [mailto:f.hugh at comcast.net] Sent: Friday, May 28, 2010 12:04 PM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] Nagios and a custom webinject.pl We just deployed a new site that requires client certificates so I found a discussion thread on how to make a few changes to the script so that it would use the client certificate that I specified. I left the original script intact along with all the nagios configs and named this new script webinjectcert.pl. I wanted 2 separate scripts and configs so that I did not disrupt what I already had running successfully The new script works perfectly as long as I run it manually as sudo. The problem is when I configure it for nagios with the same script, config file, and test case, it returns results that claim it could not log onto the site. I have enabled debugging in the nagios.cfg, but I just can't see what the problem is. I looked at the permissions on the new script and certificate to make sure that wasn't it and they look fine. I am not really sure how to run it as the nagios user since I can't remember what the password is and don't want to make a mess and change it. Any ideas? Paul ---------------------------------------------------------------------------- -- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ian at griggle.net Mon Jun 7 12:36:11 2010 From: ian at griggle.net (Ian Orszaczki) Date: Mon, 7 Jun 2010 20:36:11 +1000 Subject: Nagios and a custom webinject.pl In-Reply-To: <001d01cb058b$54222cf0$fc6686d0$@comcast.net> References: <1447110886.2421781275066253372.JavaMail.root@sz0051a.emeryville.ca.mail.comcast.net> <1058556432.2421891275066268474.JavaMail.root@sz0051a.emeryville.ca.mail.comcast.net> <001d01cb058b$54222cf0$fc6686d0$@comcast.net> Message-ID: Hi, we use webinject.pl but have not made any major modifications to it. If you are interested in sharing your script I would be happy to test it out in our setup. Cheers, Ian O On Mon, Jun 7, 2010 at 1:17 AM, PRP wrote: > Well after some work I found a few issues with the webinject.pl script > that > I introduced. Anyone new to this world like me if you need to run tests as > the nagios user use "sudo su nagios" That will give you a shell running > under the nagios user. That took me some time to figure that out. > Unfortunately, that did not solve my problem. I can run my script under > the > nagios user without issue, but if I allow the service to run it in the same > nature, it fails. I am perplexed so if anyone has any ideas I would > greatly > appreciate it. > > -----Original Message----- > From: f.hugh at comcast.net [mailto:f.hugh at comcast.net] > Sent: Friday, May 28, 2010 12:04 PM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] Nagios and a custom webinject.pl > > We just deployed a new site that requires client certificates so I found > a discussion thread on how to make a few changes to the script so that it > would use the client certificate that I specified. I left the original > script intact along with all the nagios configs and named this new script > webinjectcert.pl. I wanted 2 separate scripts and configs so that I did > not > disrupt what I already had running successfully > The new script works perfectly as long as I run it manually as sudo. > The problem is when I configure it for nagios with the same script, config > file, and test case, it returns results that claim it could not log onto > the > site. I have enabled debugging in the nagios.cfg, but I just can't see > what > the problem is. I looked at the permissions on the new script and > certificate to make sure that wasn't it and they look fine. I am not > really > sure how to run it as the nagios user since I can't remember what the > password is and don't want to make a mess and change it. > Any ideas? > Paul > > > > > ---------------------------------------------------------------------------- > -- > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > > > ------------------------------------------------------------------------------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ian at griggle.net Mon Jun 7 12:39:26 2010 From: ian at griggle.net (Ian Orszaczki) Date: Mon, 7 Jun 2010 20:39:26 +1000 Subject: Nagios host flapping on centos 5.4 ? In-Reply-To: <13D20E8EEC29BE4092DC1896DC45A91656AC70AE50@NOK-EUMSG-04.mgdnok.nokia.com> References: <13D20E8EEC29BE4092DC1896DC45A91656AC70AE50@NOK-EUMSG-04.mgdnok.nokia.com> Message-ID: Hi, this might be a similar issue I had. I had tried to stop Nagios as the wrong user and then restarted as root. That left rogue Nagios processes running which meant inconsistent behaviour as both were running checks and writing results. I suggest killing all nagios processes manually and ensuring none are running and then starting nagios. Cheers, Ian O On Fri, May 28, 2010 at 6:14 AM, wrote: > Hi, > > I have Nagios installed on Ubuntu 8.04 running on Vmware Vsphere and Centos > 5.4 running on IBM hs20 blade. > Both are identical setup and they are checking same servers. > > But for some reason in centos almost half of the checked host are flapping > at any given time (40 host to check) > Biggest difference between this environments is that Ubuntu is using Nagios > 3.0.6 and centos 3.2.1. > > Have any of you noted similar behavior ? > > I followed this guide when installing nagios to centos: > *http://docs.cslabs.clarkson.edu/wiki/Install_Nagios_on_CentOS_5* > > And for some reason I can?t stop nagios service in the Centos: > /etc/init.d/nagios stop > nagios (pid 2647) is running... > Stopping nagios: [FAILED] > > Or restart: > /etc/init.d/nagios restart > Stopping nagios: [FAILED] > Starting nagios: > > Restart via nagios.cmd works. Any idea why this is happening ? > > > - Kimmo > > > > > > > > > > > > > > ------------------------------------------------------------------------------ > > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From shadhin71 at gmail.com Mon Jun 7 15:38:22 2010 From: shadhin71 at gmail.com (shadih rahman) Date: Mon, 7 Jun 2010 09:38:22 -0400 Subject: nrpe configuration help Message-ID: All, I need some suggestion for nrpe configuration. I have 3 different kind of architecture in my setup. I have 32 bit linux machine (plugins installed at /usr/lib/nagios/plugins directory) , 64 bit linux machine (plugins installed /usr/lib64/nagios/plugins directory), solaris machine (plugins installed at /opt/libexec directory) In my nrpe.conf file I would three definitions like below [check_something]=/usr/lib/nagios/plugins/check_something [check_something_x64]= /usr/lib64/nagios/plugins/check_something [cehck_something_unix]=/opt/libexec/check_somthing in my service definition, I would name them differently and call the command file, for example I would have a check disk, disk_x64, disk_unix. In commands.cfg file I would call them like command_name check_remote command_line $USER1$/check_nrpe -H $HOSTADDRESS$ -c $ARG1$ However, now new requirements came in, where disk, disk_x64, disk_unix must have same service name. I need to find a clever way define service disk and call different nrpe command based on architecture. Can someone please help me with this. Thanks -- Cordially, Shadhin Rahman -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From work at paul.dubuc.org Mon Jun 7 16:27:25 2010 From: work at paul.dubuc.org (Paul M. Dubuc) Date: Mon, 07 Jun 2010 10:27:25 -0400 Subject: nrpe configuration help In-Reply-To: References: Message-ID: <4C0D01CD.8090808@paul.dubuc.org> Could you define one wrapper service that executes one of the others based on an argument passed to it? shadih rahman wrote: > All, > I need some suggestion for nrpe configuration. I have 3 different > kind of architecture in my setup. I have 32 bit linux machine (plugins > installed at /usr/lib/nagios/plugins directory) , 64 bit linux machine > (plugins installed /usr/lib64/nagios/plugins directory), solaris machine > (plugins installed at /opt/libexec directory) > > In my nrpe.conf file I would three definitions like below > > [check_something]=/usr/lib/nagios/plugins/check_something > [check_something_x64]= /usr/lib64/nagios/plugins/check_something > [cehck_something_unix]=/opt/libexec/check_somthing > > > in my service definition, I would name them differently and call the > command file, for example I would have a check disk, disk_x64, > disk_unix. In commands.cfg file I would call them like > > command_name check_remote > command_line $USER1$/check_nrpe -H $HOSTADDRESS$ -c $ARG1$ > > > > However, now new requirements came in, where disk, disk_x64, disk_unix > must have same service name. I need to find a clever way define service > disk and call different nrpe command based on architecture. Can > someone please help me with this. Thanks > > > -- > Cordially, > Shadhin Rahman > > > > ------------------------------------------------------------------------------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > > > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From bparish at cognex.com Mon Jun 7 19:11:04 2010 From: bparish at cognex.com (Parish, Brent) Date: Mon, 7 Jun 2010 13:11:04 -0400 Subject: Nagios and a custom webinject.pl In-Reply-To: References: <1447110886.2421781275066253372.JavaMail.root@sz0051a.emeryville.ca.mail.comcast.net><1058556432.2421891275066268474.JavaMail.root@sz0051a.emeryville.ca.mail.comcast.net><001d01cb058b$54222cf0$fc6686d0$@comcast.net> Message-ID: Hi Paul. I really have no idea if this might be the problem, but it is an easy test to do. I would suggest trying to run the webinject from the command line like you have been but add a minus sign in the "su": e.g. sudo su - nagios The minus sign will force it to become more like a true login (the way cron or Nagios would likely run it) and not potentially carry over your current PATH and other environment variables. - Brent On Mon, Jun 7, 2010 at 1:17 AM, PRP wrote: Well after some work I found a few issues with the webinject.pl script that I introduced. Anyone new to this world like me if you need to run tests as the nagios user use "sudo su nagios" That will give you a shell running under the nagios user. That took me some time to figure that out. Unfortunately, that did not solve my problem. I can run my script under the nagios user without issue, but if I allow the service to run it in the same nature, it fails. I am perplexed so if anyone has any ideas I would greatly appreciate it. -----Original Message----- From: f.hugh at comcast.net [mailto:f.hugh at comcast.net] Sent: Friday, May 28, 2010 12:04 PM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] Nagios and a custom webinject.pl We just deployed a new site that requires client certificates so I found a discussion thread on how to make a few changes to the script so that it would use the client certificate that I specified. I left the original script intact along with all the nagios configs and named this new script webinjectcert.pl. I wanted 2 separate scripts and configs so that I did not disrupt what I already had running successfully The new script works perfectly as long as I run it manually as sudo. The problem is when I configure it for nagios with the same script, config file, and test case, it returns results that claim it could not log onto the site. I have enabled debugging in the nagios.cfg, but I just can't see what the problem is. I looked at the permissions on the new script and certificate to make sure that wasn't it and they look fine. I am not really sure how to run it as the nagios user since I can't remember what the password is and don't want to make a mess and change it. Any ideas? Paul -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From islandjewel at gmail.com Mon Jun 7 19:39:59 2010 From: islandjewel at gmail.com (Julie) Date: Mon, 7 Jun 2010 12:39:59 -0500 Subject: a test for an expired pw on account? Message-ID: Hi, Does anyone know if this exists - a way to test if an account has an expired password? have an account that requires the PW to be reset periodically, and need to know a week before it is about to expire so it can be reset before jobs fail.. thanks ~J~ -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mlitwin at stubhub.com Tue Jun 8 00:57:14 2010 From: mlitwin at stubhub.com (Litwin, Matthew) Date: Mon, 7 Jun 2010 16:57:14 -0600 Subject: monitoring for exactly 1 process on 3 hosts Message-ID: I need to monitor that a process is running on one of three hosts and that is not running on the other two. Is there a way to set up a service such that a check must return OK for only one host of a hostgroup and alarm if there is less or more than one instance running in that hostgroup? ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From stuart.browne at ausregistry.com.au Tue Jun 8 01:44:14 2010 From: stuart.browne at ausregistry.com.au (Stuart Browne) Date: Tue, 8 Jun 2010 09:44:14 +1000 Subject: a test for an expired pw on account? In-Reply-To: References: Message-ID: <8CEF048B9EC83748B1517DC64EA130FB3E3DA0663F@off-win2003-01.ausregistrygroup.local> As you've not said what system you're wanting to check account expirations on, it makes it a bit hard to give decent advice. On a Linux box, about the closest I can think of is to run 'passwd -S ' on every user on a box. This specific command only works for local accounts however. If in an ldap environment, then you could use ldapsearch to return each account and their expiry date (with a little creative math and knowing the policies). A similar method might be usable for an AD environment. More details, better answers. Give it a shot :) Stuart From: Julie [mailto:islandjewel at gmail.com] Sent: Tuesday, 8 June 2010 03:40 To: Nagios Users List Subject: [Nagios-users] a test for an expired pw on account? Hi, Does anyone know if this exists - a way to test if an account has an expired password? have an account that requires the PW to be reset periodically, and need to know a week before it is about to expire so it can be reset before jobs fail.. thanks ~J~ -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From stuart.browne at ausregistry.com.au Tue Jun 8 01:45:07 2010 From: stuart.browne at ausregistry.com.au (Stuart Browne) Date: Tue, 8 Jun 2010 09:45:07 +1000 Subject: monitoring for exactly 1 process on 3 hosts In-Reply-To: References: Message-ID: <8CEF048B9EC83748B1517DC64EA130FB3E3DA06640@off-win2003-01.ausregistrygroup.local> > -----Original Message----- > From: Litwin, Matthew [mailto:mlitwin at stubhub.com] > Sent: Tuesday, 8 June 2010 08:57 > > I need to monitor that a process is running on one of three hosts and > that is not running on the other two. Is there a way to set up a > service such that a check must return OK for only one host of a > hostgroup and alarm if there is less or more than one instance running > in that hostgroup? > ----------------------------------------------------------------------- Look at using check_proc (without notifications) combined with check_cluster (with notifications). Stuart ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ryan at u13.net Tue Jun 8 01:39:44 2010 From: ryan at u13.net (Ryan Rawdon) Date: Mon, 07 Jun 2010 19:39:44 -0400 Subject: check_rsync: (Service check did not exit properly) Message-ID: <4C0D8340.3050300@u13.net> Hey everyone, I recently installed Nagios on a new system and migrated all of my configs and everything over. Everything is working fine, except for check_rsync, which fails on all hosts with "(Service check did not exit properly)." I have tried disabling the embedded perl interpreter (even though it worked fine in the last install), but that didn't appear to do anything except change the output to "null". Running the plugin by hand works 100% fine. What else should I check? Here's the output with embedded perl (which is one of the things that led me to believe it was an embedded perl issue) [1275952035.062527] [016.0] [pid=2637] Attempting to run scheduled check of service 'RSync Server' on host 'vm-mx01.puttynuts.com': check options=0, latency=0.062000 [1275952035.062585] [016.0] [pid=2637] Checking service 'RSync Server' on host 'vm-mx01.puttynuts.com'... [1275952035.062635] [2320.2] [pid=2637] Raw Command Input: $USER1$/check_rsync -H $HOSTADDRESS$ [1275952035.062656] [2320.2] [pid=2637] Expanded Command Output: $USER1$/check_rsync -H $HOSTADDRESS$ [1275952035.062782] [016.1] [pid=2637] Check result output will be written to '/var/lib/nagios3/spool/checkresults/checkg48PqE' (fd=7) [1275952035.062934] [016.1] [pid=2637] ** Using Embedded Perl interpreter to run service check... [1275952035.063208] [016.0] [pid=2637] Embedded Perl failed to compile /usr/lib/nagios/plugins/check_rsync, compile error **ePN failed to compile /usr/lib/nagios/plugins/chec$ BEGIN failed--compilation aborted at (eval 1) line 39." at /usr/lib/nagios3/p1.pl line 161. - skipping plugin Here's the output with embedded perl turned up (and I've since cranked up the logging level a bit): [1275953233.045782] [016.0] [pid=25338] Attempting to run scheduled check of service 'RSync Server' on host 'vm-mx01.puttynuts.com': check options=0, latency=0.045000 [1275953233.045923] [016.0] [pid=25338] Checking service 'RSync Server' on host 'vm-mx01.puttynuts.com'... [1275953233.046002] [2320.2] [pid=25338] Raw Command Input: $USER1$/check_rsync -H $HOSTADDRESS$ [1275953233.046013] [2320.2] [pid=25338] Expanded Command Output: $USER1$/check_rsync -H $HOSTADDRESS$ [1275953233.046183] [016.1] [pid=25338] Check result output will be written to '/var/lib/nagios3/spool/checkresults/check6XvsV0' (fd=7) [1275953233.047328] [016.2] [pid=25338] Service check is executing in child process (pid=26948) [1275953233.098575] [016.2] [pid=26949] Moving temp check result file '/var/lib/nagios3/spool/checkresults/check6XvsV0' to queue file '/var/lib/nagios3/spool/checkresults/cpt5vVV'... and another excerpt: [1275953511.122166] [016.1] [pid=25338] Handling check result for service 'RSync Server' on host 'jester.u13.net'... [1275953511.122173] [016.0] [pid=25338] ** Handling check result for service 'RSync Server' on host 'jester.u13.net'... [1275953511.122179] [016.1] [pid=25338] HOST: jester.u13.net, SERVICE: RSync Server, CHECK TYPE: Active, OPTIONS: 0, SCHEDULED: Yes, RESCHEDULE: Yes, EXITED OK: Yes, RETURN CODE: 2, OUTPUT: (null) [1275953511.122209] [016.2] [pid=25338] Parsing check output... [1275953511.122215] [016.2] [pid=25338] Short Output: (null) [1275953511.122221] [016.2] [pid=25338] Long Output: NULL [1275953511.122227] [016.2] [pid=25338] Perf Data: NULL [1275953511.122233] [016.2] [pid=25338] ST: HARD CA: 3 MA: 3 CS: 2 LS: 2 LHS: 2 [1275953511.122240] [016.1] [pid=25338] Service is in a non-OK state! [1275953511.122246] [016.1] [pid=25338] Host is currently UP, so we'll recheck its state to make sure... [1275953511.122252] [016.1] [pid=25338] * Using last known host state: 0 [1275953511.122261] [016.1] [pid=25338] Current/Max Attempt(s): 3/3 [1275953511.122267] [016.1] [pid=25338] Service has reached max number of rechecks, so we'll handle the error... [1275953511.122274] [016.1] [pid=25338] Checking service 'RSync Server' on host 'jester.u13.net' for flapping... [1275953511.122280] [016.2] [pid=25338] LFT=5.00, HFT=20.00, CPC=0.00, PSC=0.00% [1275953511.122288] [016.1] [pid=25338] Service is not flapping (0.00% state change). [1275953511.122294] [016.1] [pid=25338] Checking host 'jester.u13.net' for flapping... [1275953511.122308] [016.2] [pid=25338] LFT=5.00, HFT=20.00, CPC=0.00, PSC=0.00% [1275953511.122316] [016.1] [pid=25338] Host is not flapping (0.00% state change). [1275953511.122360] [016.1] [pid=25338] Rescheduling next check of service at Mon Jun 7 23:36:45 2010 [1275953511.122380] [016.0] [pid=25338] Scheduling a non-forced, active check of service 'RSync Server' on host 'jester.u13.net' @ Mon Jun 7 23:36:45 2010 [1275953511.122389] [016.2] [pid=25338] Scheduling new service check event. [1275953511.122409] [016.1] [pid=25338] Deleted check result file '/var/lib/nagios3/spool/checkresults/cwgRtGi' [1275953511.122416] [016.2] [pid=25338] Found a check result (#4) to handle... ... and it is when it is running like this with embedded perl off which results in "(null)" (instead of "(Service check did not exit properly)")on my services summary page as well a critical state. I look forward to hearing what suggestions you might have Ryan ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ryan at u13.net Tue Jun 8 11:31:50 2010 From: ryan at u13.net (Ryan Rawdon) Date: Tue, 08 Jun 2010 05:31:50 -0400 Subject: check_rsync: (Service check did not exit properly) In-Reply-To: <4C0D8340.3050300@u13.net> References: <4C0D8340.3050300@u13.net> Message-ID: <4C0E0E06.4020509@u13.net> Some additional information - After poking around some more, I have two other plugins exhibiting the same behavior: check_irc.pl and check_mysql_replication. Again with these, running them manually from a shell as the nagios user executes and outputs just fine. Here's what is in a check result file for a check_rsync instance that I managed to grab before it was consumed: ### Active Check Result File ### file_time=1275988933 ### Nagios Service Check Result ### # Time: Tue Jun 8 09:22:13 2010 host_name=vm-mx01.puttynuts.com service_description=RSync Server check_type=0 check_options=0 scheduled_check=1 reschedule_check=1 latency=0.226000 start_time=1275988933.227030 finish_time=1275988933.280759 early_timeout=0 exited_ok=1 return_code=2 output=(null) Running with embedded perl off, I did not see that compilation error again for the rsync plugin, so I am leaving it using the external perl interpreter for now. On 06/07/2010 07:39 PM, Ryan Rawdon wrote: > Hey everyone, > > I recently installed Nagios on a new system and migrated all of my > configs and everything over. Everything is working fine, except for > check_rsync, which fails on all hosts with "(Service check did not exit > properly)." > > I have tried disabling the embedded perl interpreter (even though it > worked fine in the last install), but that didn't appear to do anything > except change the output to "null". > > Running the plugin by hand works 100% fine. > > What else should I check? Here's the output with embedded perl (which > is one of the things that led me to believe it was an embedded perl issue) > > [1275952035.062527] [016.0] [pid=2637] Attempting to run scheduled check > of service 'RSync Server' on host 'vm-mx01.puttynuts.com': check > options=0, latency=0.062000 > [1275952035.062585] [016.0] [pid=2637] Checking service 'RSync Server' > on host 'vm-mx01.puttynuts.com'... > [1275952035.062635] [2320.2] [pid=2637] Raw Command Input: > $USER1$/check_rsync -H $HOSTADDRESS$ > [1275952035.062656] [2320.2] [pid=2637] Expanded Command Output: > $USER1$/check_rsync -H $HOSTADDRESS$ > [1275952035.062782] [016.1] [pid=2637] Check result output will be > written to '/var/lib/nagios3/spool/checkresults/checkg48PqE' (fd=7) > [1275952035.062934] [016.1] [pid=2637] ** Using Embedded Perl > interpreter to run service check... > [1275952035.063208] [016.0] [pid=2637] Embedded Perl failed to compile > /usr/lib/nagios/plugins/check_rsync, compile error **ePN failed to > compile /usr/lib/nagios/plugins/chec$ > BEGIN failed--compilation aborted at (eval 1) line 39." at > /usr/lib/nagios3/p1.pl line 161. > - skipping plugin > > > Here's the output with embedded perl turned up (and I've since cranked > up the logging level a bit): > > [1275953233.045782] [016.0] [pid=25338] Attempting to run scheduled > check of service 'RSync Server' on host 'vm-mx01.puttynuts.com': check > options=0, latency=0.045000 > [1275953233.045923] [016.0] [pid=25338] Checking service 'RSync Server' > on host 'vm-mx01.puttynuts.com'... > [1275953233.046002] [2320.2] [pid=25338] Raw Command Input: > $USER1$/check_rsync -H $HOSTADDRESS$ > [1275953233.046013] [2320.2] [pid=25338] Expanded Command Output: > $USER1$/check_rsync -H $HOSTADDRESS$ > [1275953233.046183] [016.1] [pid=25338] Check result output will be > written to '/var/lib/nagios3/spool/checkresults/check6XvsV0' (fd=7) > [1275953233.047328] [016.2] [pid=25338] Service check is executing in > child process (pid=26948) > [1275953233.098575] [016.2] [pid=26949] Moving temp check result file > '/var/lib/nagios3/spool/checkresults/check6XvsV0' to queue file > '/var/lib/nagios3/spool/checkresults/cpt5vVV'... > > and another excerpt: > [1275953511.122166] [016.1] [pid=25338] Handling check result for > service 'RSync Server' on host 'jester.u13.net'... > [1275953511.122173] [016.0] [pid=25338] ** Handling check result for > service 'RSync Server' on host 'jester.u13.net'... > [1275953511.122179] [016.1] [pid=25338] HOST: jester.u13.net, SERVICE: > RSync Server, CHECK TYPE: Active, OPTIONS: 0, SCHEDULED: Yes, > RESCHEDULE: Yes, EXITED OK: Yes, RETURN CODE: 2, OUTPUT: (null) > [1275953511.122209] [016.2] [pid=25338] Parsing check output... > [1275953511.122215] [016.2] [pid=25338] Short Output: (null) > [1275953511.122221] [016.2] [pid=25338] Long Output: NULL > [1275953511.122227] [016.2] [pid=25338] Perf Data: NULL > [1275953511.122233] [016.2] [pid=25338] ST: HARD CA: 3 MA: 3 CS: 2 > LS: 2 LHS: 2 > [1275953511.122240] [016.1] [pid=25338] Service is in a non-OK state! > [1275953511.122246] [016.1] [pid=25338] Host is currently UP, so we'll > recheck its state to make sure... > [1275953511.122252] [016.1] [pid=25338] * Using last known host state: 0 > [1275953511.122261] [016.1] [pid=25338] Current/Max Attempt(s): 3/3 > [1275953511.122267] [016.1] [pid=25338] Service has reached max number > of rechecks, so we'll handle the error... > [1275953511.122274] [016.1] [pid=25338] Checking service 'RSync Server' > on host 'jester.u13.net' for flapping... > [1275953511.122280] [016.2] [pid=25338] LFT=5.00, HFT=20.00, CPC=0.00, > PSC=0.00% > [1275953511.122288] [016.1] [pid=25338] Service is not flapping (0.00% > state change). > [1275953511.122294] [016.1] [pid=25338] Checking host 'jester.u13.net' > for flapping... > [1275953511.122308] [016.2] [pid=25338] LFT=5.00, HFT=20.00, CPC=0.00, > PSC=0.00% > [1275953511.122316] [016.1] [pid=25338] Host is not flapping (0.00% > state change). > [1275953511.122360] [016.1] [pid=25338] Rescheduling next check of > service at Mon Jun 7 23:36:45 2010 > [1275953511.122380] [016.0] [pid=25338] Scheduling a non-forced, active > check of service 'RSync Server' on host 'jester.u13.net' @ Mon Jun 7 > 23:36:45 2010 > [1275953511.122389] [016.2] [pid=25338] Scheduling new service check event. > [1275953511.122409] [016.1] [pid=25338] Deleted check result file > '/var/lib/nagios3/spool/checkresults/cwgRtGi' > [1275953511.122416] [016.2] [pid=25338] Found a check result (#4) to > handle... > > > ... and it is when it is running like this with embedded perl off which > results in "(null)" (instead of "(Service check did not exit > properly)")on my services summary page as well a critical state. > > I look forward to hearing what suggestions you might have > > Ryan > > > > ------------------------------------------------------------------------------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rms at sibs.pt Tue Jun 8 11:27:40 2010 From: rms at sibs.pt (Rui Miguel Silva Seabra) Date: Tue, 08 Jun 2010 10:27:40 +0100 Subject: monitoring for exactly 1 process on 3 hosts In-Reply-To: References: Message-ID: <1275989260.22861.0.camel@localhost> Seg, 2010-06-07 ?s 16:57 -0600, Litwin, Matthew escreveu: > I need to monitor that a process is running on one of three hosts and that is not > running on the other two. Is there a way to set up a service such that a check > must return OK for only one host of a hostgroup and alarm if there is less or > more than one instance running in that hostgroup? I think you'll need a local test combining the results of all three, or remotely checking all three (via ssh, NRPE, etc...). Rui ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From felix at itim-cj.ro Tue Jun 8 13:27:07 2010 From: felix at itim-cj.ro (felix farcas) Date: Tue, 08 Jun 2010 14:27:07 +0300 Subject: new in nagios Message-ID: <4C0E290B.7080001@itim-cj.ro> Hello I'm new in Nagios. I'm looking for a documentation for configuring and starting nagios 3.2.1 on freebsd. As I saw on the Internet there are a lot of installation and configuration methods. Would you be so kind and give me the best direction. Thank you Felix -- Ing. drd. Farcas Felix National Institute of Research and Development of Isotopic and Molecular Technology, IT - Department - Cluj-Napoca, Romania ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Tue Jun 8 13:56:07 2010 From: ae at op5.se (Andreas Ericsson) Date: Tue, 08 Jun 2010 13:56:07 +0200 Subject: new in nagios In-Reply-To: <4C0E290B.7080001@itim-cj.ro> References: <4C0E290B.7080001@itim-cj.ro> Message-ID: <4C0E2FD7.8020006@op5.se> On 06/08/2010 01:27 PM, felix farcas wrote: > Hello > > I'm new in Nagios. I'm looking for a documentation for configuring and > starting nagios 3.2.1 on freebsd. > > As I saw on the Internet there are a lot of installation and > configuration methods. > > Would you be so kind and give me the best direction. > Download. Compile. Configure. Run. Failing that, you should probably try following one of the guides you found online and see if one of them works for you. When you run into a specific problem, google for the answer to that problem. If you fail to find any answer that works for you, come back here and ask again. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace. ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lomiz.mail at gmail.com Tue Jun 8 14:34:03 2010 From: lomiz.mail at gmail.com (Enrico Zimol) Date: Tue, 8 Jun 2010 14:34:03 +0200 Subject: new in nagios In-Reply-To: <4C0E290B.7080001@itim-cj.ro> References: <4C0E290B.7080001@itim-cj.ro> Message-ID: On 8 June 2010 13:27, felix farcas wrote: > Hello > > I'm new in Nagios. I'm looking for a documentation for configuring and > starting nagios 3.2.1 on freebsd. > > As I saw on the Internet there are a lot of installation and > configuration methods. > > Would you be so kind and give me the best direction. > The best direction is to know what is your situation. What kind of machine do you have to monitor? What OS? Printer? Do you need to monitor only on your lan or do you have to monitor Distributed/Redundant? Aswering to theese questions you'll find how to do on official documentation (I'm newbie too, and I found all what I need). If you'll have some problems reading that, ask on this ML :) Bye ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From m.borsani at it.net Tue Jun 8 15:08:35 2010 From: m.borsani at it.net (Marco Borsani) Date: Tue, 8 Jun 2010 15:08:35 +0200 Subject: R: Scom collector In-Reply-To: <007001cafda3$87233ad0$9569b070$@borsani@it.net> References: <007001cafda3$87233ad0$9569b070$@borsani@it.net> Message-ID: <4590.35535245693$1276002718@news.gmane.org> None? Thanks Marco Da: Marco Borsani [mailto:m.borsani at it.net] Inviato: gioved? 27 maggio 2010 15.50 A: 'Nagios Users Mailinglist' Oggetto: [Nagios-users] Scom collector Did anybody ever use the collector between Nagios and Scom ?? Regards Marco -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mark.elsen at gmail.com Tue Jun 8 15:23:50 2010 From: mark.elsen at gmail.com (Mark Elsen) Date: Tue, 8 Jun 2010 15:23:50 +0200 Subject: R: Scom collector In-Reply-To: <6093786533486931405@unknownmsgid> References: <6093786533486931405@unknownmsgid> Message-ID: > > Did ?anybody ever use the collector between Nagios and Scom ?? > > I do , M. ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jgauthier at lastar.com Tue Jun 8 15:50:08 2010 From: jgauthier at lastar.com (Jason Gauthier) Date: Tue, 8 Jun 2010 09:50:08 -0400 Subject: CGI Permissions issue Message-ID: <05512FF87225CE4B85EB365B7FE8A198013D65C2@server51.ctg.com> All, I am fighting with a permission issue I cannot seem to figure out. Here is the important pieces of my cgi configuration: use_authentication=1 default_user_name= authorized_for_system_information= authorized_for_system_commands=jack,jill,john,dave,bob,tim authorized_for_configuration_information=jack,jill,john,dave,bob,tim authorized_for_all_hosts=jack,jill,john,dave,bob,tim authorized_for_all_host_commands jack,jill,john,dave,bob,tim authorized_for_all_services=jack,jill,john,dave,bob,tim authorized_for_all_service_commands=jack,jill,john,dave,bob,tim When "dave" attempts to issue a service command (enable notifications) they receive this error: "Sorry, but you are not authorized to commit the specified command" None of the other users do. So, in an effort to troubleshoot, I removed ALL users except for dave. Dave still cannot process the command, but jack still can! When jack attempts to view the configuration, he is also denied now. This implies that the permissions are being acknowledged to some degree. I revert the changes back above. Jack can still execute commands (dave cannot), but both jack and dave can view the configuration. What is going on with dave? Can I enable some logging perhaps to help determine the root cause? Thanks! ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From danielhlockard at gmail.com Tue Jun 8 20:08:32 2010 From: danielhlockard at gmail.com (Daniel Lockard) Date: Tue, 8 Jun 2010 13:08:32 -0500 Subject: Migrating nagios Message-ID: I was wondering if it is possible to migrate uptime data from an old nagios install from a new one. I have an old nagios server, running Nagios Core 3.2.0, and have another box running the most recent version of NagiosXI, and I would like to get rid of the old Nagios Core box. However, I have two years of uptime data on the old Nagios Core box that I would like to preserve. Does anyone have suggestions for a best course of action here? Daniel H Lockard ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From pangrazi at gmail.com Tue Jun 8 21:39:03 2010 From: pangrazi at gmail.com (Greg Pangrazio) Date: Tue, 8 Jun 2010 14:39:03 -0500 Subject: check_snmp_mem.pl Numeric value for warning or critical Message-ID: I am trying to use the check_snmp_mem plugin v1.1 from nagios 3.2.0 installed from the Ubuntu repos. When i run it from the command line ie: /usr/bin/perl /usr/lib/nagios/plugins/check_snmp_mem.pl -H -C -w90,20 -c95,30 i get the following response Ram : 15%, Swap : 0% : ; OK when i try to run it as a defined service in nagios I get Current Status: UNKNOWN (for 0d 3h 13m 25s) Status Information: Numeric value for warning or critical ! Usage: /usr/lib/nagios/plugins/check_snmp_mem.pl [-v] -H -C [-2] Performance Data: (-l login -x passwd [-X pass -L ,]) [-p ] -w -c [-I|-N|-E] [-f] [-m] [-t ] [-V] #snmp Check Mem define command{ command_name check_snmp_mem command_line /usr/bin/perl /usr/lib/nagios/plugins/check_snmp_mem.pl -H $HOSTADDRESS -C $ARG1$ -w $ARG2$,$ARG3$ -c $ARG4$,$ARG5$ } define service{ host_name ; service_description Memory; use generic-service; contact_groups v3locity; check_command check_snmp_mem!lcinfra33!90!20!95!30 } anyone have any suggestions? Other snmp checks from this same family run just fine, i am using check_snmp_storage on other hosts. Greg Pangrazio ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jim at jimavery.me.uk Tue Jun 8 22:51:27 2010 From: jim at jimavery.me.uk (Jim Avery) Date: Tue, 8 Jun 2010 21:51:27 +0100 Subject: CGI Permissions issue In-Reply-To: <05512FF87225CE4B85EB365B7FE8A198013D65C2@server51.ctg.com> References: <05512FF87225CE4B85EB365B7FE8A198013D65C2@server51.ctg.com> Message-ID: On 8 June 2010 14:50, Jason Gauthier wrote: > All, > > > ?I am fighting with a permission issue I cannot seem to figure out. > > Here is the important pieces of my cgi configuration: > > use_authentication=1 > default_user_name= > authorized_for_system_information= > authorized_for_system_commands=jack,jill,john,dave,bob,tim > authorized_for_configuration_information=jack,jill,john,dave,bob,tim > authorized_for_all_hosts=jack,jill,john,dave,bob,tim > authorized_for_all_host_commands jack,jill,john,dave,bob,tim > authorized_for_all_services=jack,jill,john,dave,bob,tim > authorized_for_all_service_commands=jack,jill,john,dave,bob,tim > > When "dave" attempts to issue a service command (enable notifications) > they receive this error: > > "Sorry, but you are not authorized to commit the specified command" > > None of the other users do. ? ?So, in an effort to troubleshoot, I > removed ALL users except for dave. > Dave still cannot process the command, but jack still can! > > When jack attempts to view the configuration, he is also denied now. > This implies that the permissions are being acknowledged to some degree. > I revert the changes back above. ?Jack can still execute commands (dave > cannot), but both jack and dave can view the configuration. > > What is going on with dave? ?Can I enable some logging perhaps to help > determine the root cause? > > Thanks! > Bit of a long shot but I wonder if there's a non-printing character in your cgi.cfg somewhere where it shouldn't be. ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mmelin at gmail.com Wed Jun 9 09:32:11 2010 From: mmelin at gmail.com (Martin Melin) Date: Wed, 9 Jun 2010 09:32:11 +0200 Subject: CGI Permissions issue In-Reply-To: References: <05512FF87225CE4B85EB365B7FE8A198013D65C2@server51.ctg.com> Message-ID: On Tue, Jun 8, 2010 at 10:51 PM, Jim Avery wrote: > On 8 June 2010 14:50, Jason Gauthier wrote: >> All, >> >> >> ?I am fighting with a permission issue I cannot seem to figure out. >> >> Here is the important pieces of my cgi configuration: >> >> use_authentication=1 >> default_user_name= >> authorized_for_system_information= >> authorized_for_system_commands=jack,jill,john,dave,bob,tim >> authorized_for_configuration_information=jack,jill,john,dave,bob,tim >> authorized_for_all_hosts=jack,jill,john,dave,bob,tim >> authorized_for_all_host_commands jack,jill,john,dave,bob,tim >> authorized_for_all_services=jack,jill,john,dave,bob,tim >> authorized_for_all_service_commands=jack,jill,john,dave,bob,tim >> >> When "dave" attempts to issue a service command (enable notifications) >> they receive this error: >> >> "Sorry, but you are not authorized to commit the specified command" >> >> None of the other users do. ? ?So, in an effort to troubleshoot, I >> removed ALL users except for dave. >> Dave still cannot process the command, but jack still can! >> >> When jack attempts to view the configuration, he is also denied now. >> This implies that the permissions are being acknowledged to some degree. >> I revert the changes back above. ?Jack can still execute commands (dave >> cannot), but both jack and dave can view the configuration. >> >> What is going on with dave? ?Can I enable some logging perhaps to help >> determine the root cause? >> >> Thanks! >> You have a contact with a contact_name of "dave", that has can_submit_commands set to 0. This will take precedence over CGI permissions. Best regards Martin Melin ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at martinmelin.com Wed Jun 9 09:18:27 2010 From: nagios at martinmelin.com (Martin Melin) Date: Wed, 9 Jun 2010 09:18:27 +0200 Subject: CGI Permissions issue In-Reply-To: <05512FF87225CE4B85EB365B7FE8A198013D65C2@server51.ctg.com> References: <05512FF87225CE4B85EB365B7FE8A198013D65C2@server51.ctg.com> Message-ID: On Tue, Jun 8, 2010 at 15:50, Jason Gauthier wrote: > All, > > > I am fighting with a permission issue I cannot seem to figure out. > > Here is the important pieces of my cgi configuration: > > use_authentication=1 > default_user_name= > authorized_for_system_information= > authorized_for_system_commands=jack,jill,john,dave,bob,tim > authorized_for_configuration_information=jack,jill,john,dave,bob,tim > authorized_for_all_hosts=jack,jill,john,dave,bob,tim > authorized_for_all_host_commands jack,jill,john,dave,bob,tim > authorized_for_all_services=jack,jill,john,dave,bob,tim > authorized_for_all_service_commands=jack,jill,john,dave,bob,tim > > When "dave" attempts to issue a service command (enable notifications) > they receive this error: > > "Sorry, but you are not authorized to commit the specified command" > > None of the other users do. So, in an effort to troubleshoot, I > removed ALL users except for dave. > Dave still cannot process the command, but jack still can! > > When jack attempts to view the configuration, he is also denied now. > This implies that the permissions are being acknowledged to some degree. > I revert the changes back above. Jack can still execute commands (dave > cannot), but both jack and dave can view the configuration. > > What is going on with dave? Can I enable some logging perhaps to help > determine the root cause? > > Thanks! > You probably have a contact with the name "dave" and that has can_submit_commands set to 0. That will take precedence over CGI permissions. Regards Martin Melin -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From anhquankitty at gmail.com Wed Jun 9 10:39:15 2010 From: anhquankitty at gmail.com (Tong Anh Quan) Date: Wed, 9 Jun 2010 15:39:15 +0700 Subject: check_ganglia.py: UNKNOWN: Error while getting value "Host/value not found"? Message-ID: Hi, Follow instruction on ibm website, I want to use Nagios to alert on Ganglia metrics, details below: - ganglia-services.cfg: define command { command_name check_ganglia command_line $USER1$/check_ganglia.py -h $HOSTNAME$ -m $ARG1$ -w $ARG2$ -c $ARG3$ } define service { host_name localhost service_description disk_free check_command check_ganglia!disk_free!1700!1600 max_check_attempts 5 check_interval 1 retry_interval 1 check_period 24x7 notification_interval 30 } - Reload Nagios, wait a moment and I get the following status on the web: *CHECKGANGLIA UNKNOWN: Error while getting value "Host/value not found" * - I edited ganglia_host to hostname. - Run this plugin from command line works fine: */usr/lib64/nagios/plugins/check_ganglia.py -h adtech100 -m disk_free -w 1700 -c 1600 CHECKGANGLIA OK: disk_free is 48.60 * Can anyone help me? -- --- H?nh ph?c l? m?t ly Cafe v? nh?c Tr?nh --- -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at martinmelin.com Wed Jun 9 10:52:51 2010 From: nagios at martinmelin.com (Martin Melin) Date: Wed, 9 Jun 2010 10:52:51 +0200 Subject: check_ganglia.py: UNKNOWN: Error while getting value "Host/value not found"? In-Reply-To: References: Message-ID: On Wed, Jun 9, 2010 at 10:39, Tong Anh Quan wrote: > Hi, > > Follow instruction on ibm website, I want to use Nagios to alert on Ganglia > metrics, details below: > > - ganglia-services.cfg: > > define command { > command_name check_ganglia > command_line $USER1$/check_ganglia.py -h $HOSTNAME$ -m $ARG1$ -w $ARG2$ > -c $ARG3$ > } > > define service { > host_name localhost > service_description disk_free > check_command check_ganglia!disk_free!1700!1600 > max_check_attempts 5 > check_interval 1 > retry_interval 1 > check_period 24x7 > notification_interval 30 > } > > - Reload Nagios, wait a moment and I get the following status on the web: > *CHECKGANGLIA UNKNOWN: Error while getting value "Host/value not found" * > > - I edited ganglia_host to hostname. > > - Run this plugin from command line works fine: > */usr/lib64/nagios/plugins/check_ganglia.py -h adtech100 -m disk_free -w > 1700 -c 1600 > CHECKGANGLIA OK: disk_free is 48.60 > * > Can anyone help me? > -- > --- H?nh ph?c l? m?t ly Cafe v? nh?c Tr?nh --- > > > The service definition you sent will execute the plugin with -h localhost, but you show output with -h adtech100. Try running the plugin from the command line with -h localhost and you will probably get the same error. Regards Martin Melin -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Wed Jun 9 11:41:12 2010 From: nagios at flatto.net (Assaf Flatto) Date: Wed, 09 Jun 2010 10:41:12 +0100 Subject: a bit [OT] PNP4nagios help Message-ID: <4C0F61B8.7060001@flatto.net> Hello All Not sure this is the right place for this - but since many of us use pnp4nagios - i thought i might be able to get some advice , I've installed pnp4nagios and it worked well for more then a month , but now it seem it no longer generating graphs for any of the existing checks. I can see in the nagios debug file that the pnp script is executed , and hence i was expecting the xml to be generated . When i execute the script manually - perl -d /usr/local/pnp4nagios/libexec/process_perfdata.pl , i get the following perl error output , which is what i think stops my graphs from being created. Anyone ever encountered this issue ? or know whom/where i should post this query at ? Thanks Assaf Use of uninitialized value in concatenation (.) or string at /usr/local/pnp4nagios/libexec/process_perfdata.pl line 1098. at /usr/local/pnp4nagios/libexec/process_perfdata.pl line 1098 main::handle_signal('ALRM') called at (eval 10)[/usr/lib/perl5/vendor_perl/5.8.8/x86_64-linux-thread-multi/Term/ReadKey.pm:411] line 7 eval {...} called at (eval 10)[/usr/lib/perl5/vendor_perl/5.8.8/x86_64-linux-thread-multi/Term/ReadKey.pm:411] line 7 Term::ReadKey::ReadKey(0, 'GLOB(0x7aea60)') called at /usr/lib/perl5/site_perl/5.8.8/Term/ReadLine/readline.pm line 2086 readline::rl_getc called at /usr/lib/perl5/site_perl/5.8.8/Term/ReadLine/readline.pm line 2073 readline::getc_with_pending() called at /usr/lib/perl5/site_perl/5.8.8/Term/ReadLine/readline.pm line 1649 readline::readline(' DB<1> ') called at /usr/lib/perl5/site_perl/5.8.8/Term/ReadLine/Perl.pm line 11 Term::ReadLine::Perl::readline('Term::ReadLine::Perl=ARRAY(0xc242c0)', ' DB<1> ') called at /usr/lib/perl5/5.8.8/perl5db.pl line 6371 DB::readline(' DB<1> ') called at /usr/lib/perl5/5.8.8/perl5db.pl line 2203 DB::DB called at /usr/lib/perl5/5.8.8/perl5db.pl line 9425 DB::fake::at_exit() called at /usr/lib/perl5/5.8.8/perl5db.pl line 8997 DB::END() called at /usr/local/pnp4nagios/libexec/process_perfdata.pl line 0 eval {...} called at /usr/local/pnp4nagios/libexec/process_perfdata.pl line 0 Use of uninitialized value in concatenation (.) or string at /usr/local/pnp4nagios/libexec/process_perfdata.pl line 1098. at /usr/local/pnp4nagios/libexec/process_perfdata.pl line 1098 main::handle_signal('ALRM') called at (eval 10)[/usr/lib/perl5/vendor_perl/5.8.8/x86_64-linux-thread-multi/Term/ReadKey.pm:411] line 7 eval {...} called at (eval 10)[/usr/lib/perl5/vendor_perl/5.8.8/x86_64-linux-thread-multi/Term/ReadKey.pm:411] line 7 Term::ReadKey::ReadKey(0, 'GLOB(0x7aea60)') called at /usr/lib/perl5/site_perl/5.8.8/Term/ReadLine/readline.pm line 2086 readline::rl_getc called at /usr/lib/perl5/site_perl/5.8.8/Term/ReadLine/readline.pm line 2073 readline::getc_with_pending() called at /usr/lib/perl5/site_perl/5.8.8/Term/ReadLine/readline.pm line 1649 readline::readline(' DB<1> ') called at /usr/lib/perl5/site_perl/5.8.8/Term/ReadLine/Perl.pm line 11 Term::ReadLine::Perl::readline('Term::ReadLine::Perl=ARRAY(0xc242c0)', ' DB<1> ') called at /usr/lib/perl5/5.8.8/perl5db.pl line 6371 DB::readline(' DB<1> ') called at /usr/lib/perl5/5.8.8/perl5db.pl line 2203 DB::DB called at /usr/lib/perl5/5.8.8/perl5db.pl line 9425 DB::fake::at_exit() called at /usr/lib/perl5/5.8.8/perl5db.pl line 8997 DB::END() called at /usr/local/pnp4nagios/libexec/process_perfdata.pl line 0 eval {...} called at /usr/local/pnp4nagios/libexec/process_perfdata.pl line 0 -- Never,Ever Cut A Deal With a Dragon I am doing a Charity Bike ride On the 27 of June for the Capital to Coast Charity. Please help by Donating http://www.justgiving.com/Lovefilm-capital-to-coast ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jgauthier at lastar.com Wed Jun 9 13:41:52 2010 From: jgauthier at lastar.com (Jason Gauthier) Date: Wed, 9 Jun 2010 07:41:52 -0400 Subject: CGI Permissions issue In-Reply-To: References: <05512FF87225CE4B85EB365B7FE8A198013D65C2@server51.ctg.com> Message-ID: <05512FF87225CE4B85EB365B7FE8A19801423A90@server51.ctg.com> -----Original Message----- From: Martin Melin [mailto:mmelin at gmail.com] Sent: Wednesday, June 09, 2010 3:32 AM To: Nagios Users List Subject: Re: [Nagios-users] CGI Permissions issue On Tue, Jun 8, 2010 at 10:51 PM, Jim Avery wrote: > On 8 June 2010 14:50, Jason Gauthier wrote: >> All, >> >> >> ?I am fighting with a permission issue I cannot seem to figure out. >> >> Here is the important pieces of my cgi configuration: >> >> use_authentication=1 >> default_user_name= >> authorized_for_system_information= >> authorized_for_system_commands=jack,jill,john,dave,bob,tim >> authorized_for_configuration_information=jack,jill,john,dave,bob,tim >> authorized_for_all_hosts=jack,jill,john,dave,bob,tim >> authorized_for_all_host_commands jack,jill,john,dave,bob,tim >> authorized_for_all_services=jack,jill,john,dave,bob,tim >> authorized_for_all_service_commands=jack,jill,john,dave,bob,tim >> >> When "dave" attempts to issue a service command (enable >> notifications) they receive this error: >> >> "Sorry, but you are not authorized to commit the specified command" >> >> None of the other users do. ? ?So, in an effort to troubleshoot, I >> removed ALL users except for dave. >> Dave still cannot process the command, but jack still can! >> >> When jack attempts to view the configuration, he is also denied now. >> This implies that the permissions are being acknowledged to some degree. >> I revert the changes back above. ?Jack can still execute commands >> (dave cannot), but both jack and dave can view the configuration. >> >> What is going on with dave? ?Can I enable some logging perhaps to >> help determine the root cause? >> >> Thanks! >> >You have a contact with a contact_name of "dave", that has can_submit_commands set to 0. This will take precedence over CGI permissions. >Best regards >Martin Melin Nailed it! Thanks a lot! ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Wed Jun 9 15:14:34 2010 From: nagios at flatto.net (Assaf Flatto) Date: Wed, 09 Jun 2010 14:14:34 +0100 Subject: a bit [OT] PNP4nagios help In-Reply-To: <4C0F61B8.7060001@flatto.net> References: <4C0F61B8.7060001@flatto.net> Message-ID: <4C0F93BA.7040506@flatto.net> Assaf Flatto wrote: > Hello All > > Not sure this is the right place for this - but since many of us use > pnp4nagios - i thought i might be able to get some advice , I've > installed pnp4nagios and it worked well for more then a month , but now > it seem it no longer generating graphs for any of the existing checks. > > I can see in the nagios debug file that the pnp script is executed , > and hence i was expecting the xml to be generated . > When i execute the script manually - perl -d > /usr/local/pnp4nagios/libexec/process_perfdata.pl , i get the following > perl error output , which is what i think stops my graphs from being > created. > > Anyone ever encountered this issue ? or know whom/where i should post > this query at ? > > Thanks > > Assaf > > > Use of uninitialized value in concatenation (.) or string at > /usr/local/pnp4nagios/libexec/process_perfdata.pl line 1098. > at /usr/local/pnp4nagios/libexec/process_perfdata.pl line 1098 > main::handle_signal('ALRM') called at (eval > 10)[/usr/lib/perl5/vendor_perl/5.8.8/x86_64-linux-thread-multi/Term/ReadKey.pm:411] > line 7 > eval {...} called at (eval > 10)[/usr/lib/perl5/vendor_perl/5.8.8/x86_64-linux-thread-multi/Term/ReadKey.pm:411] > line 7 > Term::ReadKey::ReadKey(0, 'GLOB(0x7aea60)') called at > /usr/lib/perl5/site_perl/5.8.8/Term/ReadLine/readline.pm line 2086 > readline::rl_getc called at > /usr/lib/perl5/site_perl/5.8.8/Term/ReadLine/readline.pm line 2073 > readline::getc_with_pending() called at > /usr/lib/perl5/site_perl/5.8.8/Term/ReadLine/readline.pm line 1649 > readline::readline(' DB<1> ') called at > /usr/lib/perl5/site_perl/5.8.8/Term/ReadLine/Perl.pm line 11 > > Term::ReadLine::Perl::readline('Term::ReadLine::Perl=ARRAY(0xc242c0)', > ' DB<1> ') called at /usr/lib/perl5/5.8.8/perl5db.pl line 6371 > DB::readline(' DB<1> ') called at > /usr/lib/perl5/5.8.8/perl5db.pl line 2203 > DB::DB called at /usr/lib/perl5/5.8.8/perl5db.pl line 9425 > DB::fake::at_exit() called at /usr/lib/perl5/5.8.8/perl5db.pl > line 8997 > DB::END() called at > /usr/local/pnp4nagios/libexec/process_perfdata.pl line 0 > eval {...} called at > /usr/local/pnp4nagios/libexec/process_perfdata.pl line 0 > Use of uninitialized value in concatenation (.) or string at > /usr/local/pnp4nagios/libexec/process_perfdata.pl line 1098. > at /usr/local/pnp4nagios/libexec/process_perfdata.pl line 1098 > main::handle_signal('ALRM') called at (eval > 10)[/usr/lib/perl5/vendor_perl/5.8.8/x86_64-linux-thread-multi/Term/ReadKey.pm:411] > line 7 > eval {...} called at (eval > 10)[/usr/lib/perl5/vendor_perl/5.8.8/x86_64-linux-thread-multi/Term/ReadKey.pm:411] > line 7 > Term::ReadKey::ReadKey(0, 'GLOB(0x7aea60)') called at > /usr/lib/perl5/site_perl/5.8.8/Term/ReadLine/readline.pm line 2086 > readline::rl_getc called at > /usr/lib/perl5/site_perl/5.8.8/Term/ReadLine/readline.pm line 2073 > readline::getc_with_pending() called at > /usr/lib/perl5/site_perl/5.8.8/Term/ReadLine/readline.pm line 1649 > readline::readline(' DB<1> ') called at > /usr/lib/perl5/site_perl/5.8.8/Term/ReadLine/Perl.pm line 11 > > Term::ReadLine::Perl::readline('Term::ReadLine::Perl=ARRAY(0xc242c0)', > ' DB<1> ') called at /usr/lib/perl5/5.8.8/perl5db.pl line 6371 > DB::readline(' DB<1> ') called at > /usr/lib/perl5/5.8.8/perl5db.pl line 2203 > DB::DB called at /usr/lib/perl5/5.8.8/perl5db.pl line 9425 > DB::fake::at_exit() called at /usr/lib/perl5/5.8.8/perl5db.pl > line 8997 > DB::END() called at > /usr/local/pnp4nagios/libexec/process_perfdata.pl line 0 > eval {...} called at > /usr/local/pnp4nagios/libexec/process_perfdata.pl line 0 > > Maybe more information that may help in this case - the perfdata.log has a repeated entry : 2010-06-09 10:17:25 [19819] [0] *** TIMEOUT: Timeout after 5 Sec. **** 2010-06-09 10:17:25 [19819] [0] *** process_perfdata.pl terminated on signal ALRM -- Never,Ever Cut A Deal With a Dragon I am doing a Charity Bike ride On the 27 of June for the Capital to Coast Charity. Please help by Donating http://www.justgiving.com/Lovefilm-capital-to-coast ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From guidosh at gmail.com Wed Jun 9 19:38:10 2010 From: guidosh at gmail.com (Guy Waugh) Date: Wed, 9 Jun 2010 18:38:10 +0100 Subject: a bit [OT] PNP4nagios help In-Reply-To: <4C0F93BA.7040506@flatto.net> References: <4C0F61B8.7060001@flatto.net> <4C0F93BA.7040506@flatto.net> Message-ID: Hi Assaf, Have you restarted nagios lately? Have the permissions on the pnp4nagios files (or the directory they reside in) changed? Has anything changed in nagios.cfg that might affect this? How are you running pnp4nagios? Synchronous mode, bulk mode or bulk mode with NPCD? Are the nagios perfdata files being populated correctly etc.? Cheers, Guy. On 9 June 2010 14:14, Assaf Flatto wrote: > Assaf Flatto wrote: > > Hello All > > > > Not sure this is the right place for this - but since many of us use > > pnp4nagios - i thought i might be able to get some advice , I've > > installed pnp4nagios and it worked well for more then a month , but now > > it seem it no longer generating graphs for any of the existing checks. > > > > I can see in the nagios debug file that the pnp script is executed , > > and hence i was expecting the xml to be generated . > > When i execute the script manually - perl -d > > /usr/local/pnp4nagios/libexec/process_perfdata.pl , i get the following > > perl error output , which is what i think stops my graphs from being > > created. > > > > Anyone ever encountered this issue ? or know whom/where i should post > > this query at ? > > > > Thanks > > > > Assaf > > > > > > Use of uninitialized value in concatenation (.) or string at > > /usr/local/pnp4nagios/libexec/process_perfdata.pl line 1098. > > at /usr/local/pnp4nagios/libexec/process_perfdata.pl line 1098 > > main::handle_signal('ALRM') called at (eval > > > 10)[/usr/lib/perl5/vendor_perl/5.8.8/x86_64-linux-thread-multi/Term/ReadKey.pm:411] > > line 7 > > eval {...} called at (eval > > > 10)[/usr/lib/perl5/vendor_perl/5.8.8/x86_64-linux-thread-multi/Term/ReadKey.pm:411] > > line 7 > > Term::ReadKey::ReadKey(0, 'GLOB(0x7aea60)') called at > > /usr/lib/perl5/site_perl/5.8.8/Term/ReadLine/readline.pm line 2086 > > readline::rl_getc called at > > /usr/lib/perl5/site_perl/5.8.8/Term/ReadLine/readline.pm line 2073 > > readline::getc_with_pending() called at > > /usr/lib/perl5/site_perl/5.8.8/Term/ReadLine/readline.pm line 1649 > > readline::readline(' DB<1> ') called at > > /usr/lib/perl5/site_perl/5.8.8/Term/ReadLine/Perl.pm line 11 > > > > Term::ReadLine::Perl::readline('Term::ReadLine::Perl=ARRAY(0xc242c0)', > > ' DB<1> ') called at /usr/lib/perl5/5.8.8/perl5db.pl line 6371 > > DB::readline(' DB<1> ') called at > > /usr/lib/perl5/5.8.8/perl5db.pl line 2203 > > DB::DB called at /usr/lib/perl5/5.8.8/perl5db.pl line 9425 > > DB::fake::at_exit() called at /usr/lib/perl5/5.8.8/perl5db.pl > > line 8997 > > DB::END() called at > > /usr/local/pnp4nagios/libexec/process_perfdata.pl line 0 > > eval {...} called at > > /usr/local/pnp4nagios/libexec/process_perfdata.pl line 0 > > Use of uninitialized value in concatenation (.) or string at > > /usr/local/pnp4nagios/libexec/process_perfdata.pl line 1098. > > at /usr/local/pnp4nagios/libexec/process_perfdata.pl line 1098 > > main::handle_signal('ALRM') called at (eval > > > 10)[/usr/lib/perl5/vendor_perl/5.8.8/x86_64-linux-thread-multi/Term/ReadKey.pm:411] > > line 7 > > eval {...} called at (eval > > > 10)[/usr/lib/perl5/vendor_perl/5.8.8/x86_64-linux-thread-multi/Term/ReadKey.pm:411] > > line 7 > > Term::ReadKey::ReadKey(0, 'GLOB(0x7aea60)') called at > > /usr/lib/perl5/site_perl/5.8.8/Term/ReadLine/readline.pm line 2086 > > readline::rl_getc called at > > /usr/lib/perl5/site_perl/5.8.8/Term/ReadLine/readline.pm line 2073 > > readline::getc_with_pending() called at > > /usr/lib/perl5/site_perl/5.8.8/Term/ReadLine/readline.pm line 1649 > > readline::readline(' DB<1> ') called at > > /usr/lib/perl5/site_perl/5.8.8/Term/ReadLine/Perl.pm line 11 > > > > Term::ReadLine::Perl::readline('Term::ReadLine::Perl=ARRAY(0xc242c0)', > > ' DB<1> ') called at /usr/lib/perl5/5.8.8/perl5db.pl line 6371 > > DB::readline(' DB<1> ') called at > > /usr/lib/perl5/5.8.8/perl5db.pl line 2203 > > DB::DB called at /usr/lib/perl5/5.8.8/perl5db.pl line 9425 > > DB::fake::at_exit() called at /usr/lib/perl5/5.8.8/perl5db.pl > > line 8997 > > DB::END() called at > > /usr/local/pnp4nagios/libexec/process_perfdata.pl line 0 > > eval {...} called at > > /usr/local/pnp4nagios/libexec/process_perfdata.pl line 0 > > > > > Maybe more information that may help in this case - > > the perfdata.log has a repeated entry : > 2010-06-09 10:17:25 [19819] [0] *** TIMEOUT: Timeout after 5 Sec. **** > 2010-06-09 10:17:25 [19819] [0] *** process_perfdata.pl terminated on > signal ALRM > > > -- > Never,Ever Cut A Deal With a Dragon > > > I am doing a Charity Bike ride On the 27 of June for the > Capital to Coast Charity. Please help by Donating > http://www.justgiving.com/Lovefilm-capital-to-coast > > > > > ------------------------------------------------------------------------------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lmw94002 at hotmail.com Wed Jun 9 23:02:16 2010 From: lmw94002 at hotmail.com (Mathew Walker) Date: Wed, 9 Jun 2010 17:02:16 -0400 Subject: extra "checkresults" files being left behind In-Reply-To: References: <4C0F61B8.7060001@flatto.net>, <4C0F93BA.7040506@flatto.net>, Message-ID: I'm running Nagios on a little VPS box checking a few hosts/services (~50 checks). It's mostly a testing platform for me and checks in on my other test VPS systems. However I keep seeing the extra check results data files build up in /usr/local/nagios/var/spool/checkresults like: -rw------- 1 nagios nagios 249 Jun 7 23:45 checknbu01O -rw------- 1 nagios nagios 252 Jun 8 02:40 checkHxcsiJ -rw------- 1 nagios nagios 291 Jun 8 03:52 checkcyaOva -rw------- 1 nagios nagios 280 Jun 8 04:46 checknlLs4b -rw------- 1 nagios nagios 250 Jun 8 05:52 checkCMATnr -rw------- 1 nagios nagios 285 Jun 8 06:21 checkrblxgG -rw------- 1 nagios nagios 252 Jun 8 07:30 checkikZPk8 -rw------- 1 nagios nagios 285 Jun 8 09:14 check47NrJf -rw------- 1 nagios nagios 285 Jun 8 13:34 check4g81jo -rw------- 1 nagios nagios 249 Jun 8 15:15 checkvFH7JT Some days there will be one or two, some days there will be 30-50. The days w/ more entries seems to be the days with more alerts. The files will just build up and build up for months if I do not manually delete them. I've also seen my one server w/ a passive check, not properly update back to the dummy default value of OK on occassion. I've tried tweaking the various config variables like: max_check_result_file_age=3600, and check_result_reaper_*. I thought it may have been a performance issue with my little VPS, but the memory and CPU load (thanks Nagiosgraph), all seem pretty flat. My typically check interval is 5minutes. With only ~50 checks it shouldn't be THAT much load. Googled a bit and didn't come up with much relevant. Any thoughts? -- Mat W. - http://www.techadre.com _________________________________________________________________ The New Busy think 9 to 5 is a cute idea. Combine multiple calendars with Hotmail. http://www.windowslive.com/campaign/thenewbusy?tile=multicalendar&ocid=PID28326::T:WLMTAGL:ON:WL:en-US:WM_HMP:042010_5 -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mike-nagios at 5dninja.net Thu Jun 10 05:51:35 2010 From: mike-nagios at 5dninja.net (Mike Lindsey) Date: Wed, 09 Jun 2010 20:51:35 -0700 Subject: extra "checkresults" files being left behind In-Reply-To: References: <4C0F61B8.7060001@flatto.net>, <4C0F93BA.7040506@flatto.net>, Message-ID: <4C106147.7050401@5dninja.net> Mathew Walker wrote: > I'm running Nagios on a little VPS box checking a few hosts/services > (~50 checks). It's mostly a testing platform for me and checks in on my > other test VPS systems. > > However I keep seeing the extra check results data files build up in > /usr/local/nagios/var/spool/checkresults like: > -rw------- 1 nagios nagios 249 Jun 7 23:45 checknbu01O > -rw------- 1 nagios nagios 252 Jun 8 02:40 checkHxcsiJ > > Googled a bit and didn't come up with much relevant. Any thoughts? If I remember correctly, the parent nagios process writes out that file, then forks a child. The child then runs the check, updates that file and then creates a file with the same name, plus '.ok' in that directory, letting the parent process know the check is completed. So, take a look at the contents of several of those files, if you're lucky, you'll see that either they are for the same host, or the same service check. If so, there might be something in the way that host or service is getting polled that is causing the forked child to die. Also, if you're running a version older than 3.0rc1 (generally always a good thing to include the version of the tool you're useing, when asking for help) then you may want to upgrade, that version fixed a bug that might be related: "Fixed bug with not deleting old check result files that contained results for invalid host/service" -- Mike Lindsey ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sharadgana at gmail.com Thu Jun 10 08:07:38 2010 From: sharadgana at gmail.com (Sharad Ganapathy) Date: Thu, 10 Jun 2010 11:37:38 +0530 Subject: Query on check_http timeout option Message-ID: Hello, I want to understand the timeout option in check_http. From the help option, it states : -t, --timeout=INTEGER Seconds before connection times out (default: 10) I monitor a webservice to check for connectivity and also pass some parameters to get some content back. Usually the download transfer takes around 20 seconds and occasionally it takes well over a minute or two. I have configured my check in this fashion. $ROOT/libexec/nagios/check_http -u 'some URL' -I API Hostname -t 10 -c 20 -p 4080 -A 'Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.0; pagechecks mon)' I am seeing the check error out with the message 'Socket timeout after 10 seconds' quite often. When I manually connect ( telnet/curl) the URL the connection time is well below 10 seconds. Is the timeout parameter used to check the time it takes to establish a TCP connection or to govern the time the check took to complete ? Thanks Sharad -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From danielhlockard at gmail.com Thu Jun 10 09:30:04 2010 From: danielhlockard at gmail.com (Daniel Lockard) Date: Thu, 10 Jun 2010 02:30:04 -0500 Subject: Query on check_http timeout option In-Reply-To: References: Message-ID: I believe it is the time the check_http took to connect, and download the page. Is the total time when you manually connect using curl greater than 10 seconds? Daniel H Lockard On Thu, Jun 10, 2010 at 1:07 AM, Sharad Ganapathy wrote: > Hello, > I want to understand the timeout option in check_http. From the help option, > it states : > -t, --timeout=INTEGER > ?? ?Seconds before connection times out (default: 10) > I monitor a webservice to check for connectivity and also pass some > parameters to get some content back. Usually the download transfer takes > around 20 seconds and occasionally it takes well over a minute or two. I > have configured my check in this fashion. > $ROOT/libexec/nagios/check_http -u 'some URL' -I API Hostname ?-t 10 -c 20 > -p 4080 -A 'Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.0; pagechecks > mon)' > I am seeing the check error out with the message 'Socket timeout after 10 > seconds' quite often. > When I manually connect ( telnet/curl) the URL the connection time is well > below 10 seconds. > Is the timeout parameter used to check the time it takes to establish a TCP > connection or to govern the time the check ?took ?to complete ? > > Thanks > Sharad > ------------------------------------------------------------------------------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. ?See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sharadgana at gmail.com Thu Jun 10 10:55:34 2010 From: sharadgana at gmail.com (Sharad Ganapathy) Date: Thu, 10 Jun 2010 14:25:34 +0530 Subject: Query on check_http timeout option In-Reply-To: References: Message-ID: Daniel Lockard wrote: > I believe it is the time the check_http took to connect, and download > the page. > Is the total time when you manually connect using curl greater than 10 seconds? > > > Daniel H Lockard > > > Yes. Sometimes the total time ( time to connect + download the content) goes upto 1 minute. Sharad ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lists at xodus.org Thu Jun 10 14:46:28 2010 From: lists at xodus.org (Marc Powell) Date: Thu, 10 Jun 2010 07:46:28 -0500 Subject: Query on check_http timeout option In-Reply-To: References: Message-ID: On Jun 10, 2010, at 3:55 AM, Sharad Ganapathy wrote: > Yes. Sometimes the total time ( time to connect + download the content) > goes upto 1 minute. It can go as long as you want as long as you also increase service_check_timeout in nagios.cfg. -- Marc ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sharadgana at gmail.com Thu Jun 10 15:18:17 2010 From: sharadgana at gmail.com (Sharad Ganapathy) Date: Thu, 10 Jun 2010 18:48:17 +0530 Subject: Query on check_http timeout option In-Reply-To: References: Message-ID: On 10 June 2010 18:16, Marc Powell wrote: > > On Jun 10, 2010, at 3:55 AM, Sharad Ganapathy wrote: > > > Yes. Sometimes the total time ( time to connect + download the content) > > goes upto 1 minute. > > It can go as long as you want as long as you also increase > service_check_timeout in nagios.cfg. > > > Right . But the check times out in the host ( passive check). Nagios has never complained of not receiving info from this check ( UNKNOWN) state. My concern is whether the timeout in check_http applies to only the connection part in establishing a TCP connection or the overall completion of the check ( time to connect + connect download .. ) . Thanks Sharad -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Thu Jun 10 16:43:43 2010 From: nagios at flatto.net (Assaf Flatto) Date: Thu, 10 Jun 2010 15:43:43 +0100 Subject: a bit [OT] PNP4nagios help In-Reply-To: References: <4C0F61B8.7060001@flatto.net> <4C0F93BA.7040506@flatto.net> Message-ID: <4C10FA1F.6090609@flatto.net> Thanks for the help , but managed to solve the issue . The problem was that the pnp4 was looking in the wrong place for the rrd data provided by nagios . Once i fixed the paths - the graph resumed their work. Thanks for the "nudge" . Assaf Guy Waugh wrote: > Hi Assaf, > > Have you restarted nagios lately? > > Have the permissions on the pnp4nagios files (or the directory they > reside in) changed? > > Has anything changed in nagios.cfg that might affect this? > > How are you running pnp4nagios? Synchronous mode, bulk mode or bulk > mode with NPCD? Are the nagios perfdata files being populated > correctly etc.? > > Cheers, > Guy. > > On 9 June 2010 14:14, Assaf Flatto > wrote: > > Assaf Flatto wrote: > > Hello All > > > > Not sure this is the right place for this - but since many of us use > > pnp4nagios - i thought i might be able to get some advice , I've > > installed pnp4nagios and it worked well for more then a month , > but now > > it seem it no longer generating graphs for any of the existing > checks. > > > > I can see in the nagios debug file that the pnp script is > executed , > > and hence i was expecting the xml to be generated . > > When i execute the script manually - perl -d > > /usr/local/pnp4nagios/libexec/process_perfdata.pl > , i get the following > > perl error output , which is what i think stops my graphs from being > > created. > > > > Anyone ever encountered this issue ? or know whom/where i should > post > > this query at ? > > > > Thanks > > > > Assaf > > > > > > Use of uninitialized value in concatenation (.) or string at > > /usr/local/pnp4nagios/libexec/process_perfdata.pl > line 1098. > > at /usr/local/pnp4nagios/libexec/process_perfdata.pl > line 1098 > > main::handle_signal('ALRM') called at (eval > > > 10)[/usr/lib/perl5/vendor_perl/5.8.8/x86_64-linux-thread-multi/Term/ReadKey.pm:411] > > line 7 > > eval {...} called at (eval > > > 10)[/usr/lib/perl5/vendor_perl/5.8.8/x86_64-linux-thread-multi/Term/ReadKey.pm:411] > > line 7 > > Term::ReadKey::ReadKey(0, 'GLOB(0x7aea60)') called at > > /usr/lib/perl5/site_perl/5.8.8/Term/ReadLine/readline.pm > line 2086 > > readline::rl_getc called at > > /usr/lib/perl5/site_perl/5.8.8/Term/ReadLine/readline.pm > line 2073 > > readline::getc_with_pending() called at > > /usr/lib/perl5/site_perl/5.8.8/Term/ReadLine/readline.pm > line 1649 > > readline::readline(' DB<1> ') called at > > /usr/lib/perl5/site_perl/5.8.8/Term/ReadLine/Perl.pm line 11 > > > > > Term::ReadLine::Perl::readline('Term::ReadLine::Perl=ARRAY(0xc242c0)', > > ' DB<1> ') called at /usr/lib/perl5/5.8.8/perl5db.pl > line 6371 > > DB::readline(' DB<1> ') called at > > /usr/lib/perl5/5.8.8/perl5db.pl line 2203 > > DB::DB called at /usr/lib/perl5/5.8.8/perl5db.pl > line 9425 > > DB::fake::at_exit() called at > /usr/lib/perl5/5.8.8/perl5db.pl > > line 8997 > > DB::END() called at > > /usr/local/pnp4nagios/libexec/process_perfdata.pl > line 0 > > eval {...} called at > > /usr/local/pnp4nagios/libexec/process_perfdata.pl > line 0 > > Use of uninitialized value in concatenation (.) or string at > > /usr/local/pnp4nagios/libexec/process_perfdata.pl > line 1098. > > at /usr/local/pnp4nagios/libexec/process_perfdata.pl > line 1098 > > main::handle_signal('ALRM') called at (eval > > > 10)[/usr/lib/perl5/vendor_perl/5.8.8/x86_64-linux-thread-multi/Term/ReadKey.pm:411] > > line 7 > > eval {...} called at (eval > > > 10)[/usr/lib/perl5/vendor_perl/5.8.8/x86_64-linux-thread-multi/Term/ReadKey.pm:411] > > line 7 > > Term::ReadKey::ReadKey(0, 'GLOB(0x7aea60)') called at > > /usr/lib/perl5/site_perl/5.8.8/Term/ReadLine/readline.pm > line 2086 > > readline::rl_getc called at > > /usr/lib/perl5/site_perl/5.8.8/Term/ReadLine/readline.pm > line 2073 > > readline::getc_with_pending() called at > > /usr/lib/perl5/site_perl/5.8.8/Term/ReadLine/readline.pm > line 1649 > > readline::readline(' DB<1> ') called at > > /usr/lib/perl5/site_perl/5.8.8/Term/ReadLine/Perl.pm line 11 > > > > > Term::ReadLine::Perl::readline('Term::ReadLine::Perl=ARRAY(0xc242c0)', > > ' DB<1> ') called at /usr/lib/perl5/5.8.8/perl5db.pl > line 6371 > > DB::readline(' DB<1> ') called at > > /usr/lib/perl5/5.8.8/perl5db.pl line 2203 > > DB::DB called at /usr/lib/perl5/5.8.8/perl5db.pl > line 9425 > > DB::fake::at_exit() called at > /usr/lib/perl5/5.8.8/perl5db.pl > > line 8997 > > DB::END() called at > > /usr/local/pnp4nagios/libexec/process_perfdata.pl > line 0 > > eval {...} called at > > /usr/local/pnp4nagios/libexec/process_perfdata.pl > line 0 > > > > > Maybe more information that may help in this case - > > the perfdata.log has a repeated entry : > 2010-06-09 10:17:25 [19819] [0] *** TIMEOUT: Timeout after 5 Sec. **** > 2010-06-09 10:17:25 [19819] [0] *** process_perfdata.pl > terminated on > signal ALRM > > -- Never,Ever Cut A Deal With a Dragon I am doing a Charity Bike ride On the 27 of June for the Capital to Coast Charity. Please help by Donating http://www.justgiving.com/Lovefilm-capital-to-coast ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From 13.sward.13 at gmail.com Thu Jun 10 19:51:06 2010 From: 13.sward.13 at gmail.com (Scott Ward) Date: Thu, 10 Jun 2010 12:51:06 -0500 Subject: Large Installation Message-ID: We are looking to do an large installation of Nagios. Is it possible to monitor over 800 machines and over 14000 services? Has anyone tried doing anything like this? If you have how successful was it and how did you configure it? ~Rultax -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mark.elsen at gmail.com Thu Jun 10 20:53:54 2010 From: mark.elsen at gmail.com (Mark Elsen) Date: Thu, 10 Jun 2010 20:53:54 +0200 Subject: Large Installation In-Reply-To: References: Message-ID: > We are looking to do an large installation of Nagios. Is it possible to > monitor over 800 machines and over 14000 services? Works like a charm :-) > > Has anyone tried doing anything like this? If you have how successful was it > and how did you configure it? > Same as for a small installation of NAGIOS M. ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From standalone.sysadmin at gmail.com Thu Jun 10 21:23:09 2010 From: standalone.sysadmin at gmail.com (Matt Simmons) Date: Thu, 10 Jun 2010 15:23:09 -0400 Subject: Large Installation In-Reply-To: References: Message-ID: Make sure to read these pages: http://nagios.sourceforge.net/docs/3_0/tuning.html http://nagios.sourceforge.net/docs/3_0/largeinstalltweaks.html Also, if you're monitoring 800 machines across WANs, you might look into distributed monitoring: http://nagios.sourceforge.net/docs/3_0/distributed.html Let us know how it goes! --Matt BTW, what are you using for your config maintenance? On Thu, Jun 10, 2010 at 1:51 PM, Scott Ward <13.sward.13 at gmail.com> wrote: > We are looking to do an large installation of Nagios. Is it possible to > monitor over 800 machines and over 14000 services? > > Has anyone tried doing anything like this? If you have how successful was it > and how did you configure it? > > ~Rultax > > ------------------------------------------------------------------------------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. ?See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- LITTLE GIRL: But which cookie will you eat FIRST? COOKIE MONSTER: Me think you have misconception of cookie-eating process. ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From 13.sward.13 at gmail.com Thu Jun 10 21:34:22 2010 From: 13.sward.13 at gmail.com (Scott Ward) Date: Thu, 10 Jun 2010 14:34:22 -0500 Subject: Large Installation In-Reply-To: References: Message-ID: >Make sure to read these pages: > >http://nagios.sourceforge.net/docs/3_0/tuning.html >http://nagios.sourceforge.net/docs/3_0/largeinstalltweaks.html > >Also, if you're monitoring 800 machines across WANs, you might look >into distributed monitoring: >http://nagios.sourceforge.net/docs/3_0/distributed.html > >Let us know how it goes! Thanks for the links. So the distributive monitoring provided by the Nagios docs can handle what we're trying to do? I have read in a few places that Nagios has scalability issues. > >--Matt > >BTW, what are you using for your config maintenance? We haven't decided yet. Do you have any recommendations? ~S On Thu, Jun 10, 2010 at 2:23 PM, Matt Simmons wrote: > Make sure to read these pages: > > http://nagios.sourceforge.net/docs/3_0/tuning.html > http://nagios.sourceforge.net/docs/3_0/largeinstalltweaks.html > > Also, if you're monitoring 800 machines across WANs, you might look > into distributed monitoring: > http://nagios.sourceforge.net/docs/3_0/distributed.html > > Let us know how it goes! > > --Matt > > BTW, what are you using for your config maintenance? > > > On Thu, Jun 10, 2010 at 1:51 PM, Scott Ward <13.sward.13 at gmail.com> wrote: > > We are looking to do an large installation of Nagios. Is it possible to > > monitor over 800 machines and over 14000 services? > > > > Has anyone tried doing anything like this? If you have how successful was > it > > and how did you configure it? > > > > ~Rultax > > > > > ------------------------------------------------------------------------------ > > ThinkGeek and WIRED's GeekDad team up for the Ultimate > > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > > lucky parental unit. See the prize list and enter to win: > > http://p.sf.net/sfu/thinkgeek-promo > > _______________________________________________ > > Nagios-users mailing list > > Nagios-users at lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when > reporting > > any issue. > > ::: Messages without supporting info will risk being sent to /dev/null > > > > > > -- > LITTLE GIRL: But which cookie will you eat FIRST? > COOKIE MONSTER: Me think you have misconception of cookie-eating process. > > > ------------------------------------------------------------------------------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From trisha at rockyou.com Thu Jun 10 21:48:11 2010 From: trisha at rockyou.com (Trisha Hoang) Date: Thu, 10 Jun 2010 12:48:11 -0700 Subject: Strange fluctuation in load average Message-ID: Hi all, When I first installed nagios-3.2.0 with embedded perl enabled, nagios experienced increasing latency, starting at 1 sec and climbed upto 300 within a few hours until restarting nagios. I read on one of the older post suggesting to recompile nagios *without* embedded perl, and that resolved the latency issue, with latency consistently at less than 1 sec. However, ever since, the system load average has fluctuated wildly from 1 to 12 and down to say ... 3 within a minute. This fluctuation happens 3-10 minutes each time and calms down for ... say an hour. There doesn't seem to be any cron jobs that can cause this kind of load, and cpu (1-quad core) is usually at least 50% idle , with plenty of free memory, no IO blocks, on Centos 5-2. What's strange is with nagios compiled with embedded perl, the load was consistently at 2-4. Could this be nagios related? Please let me know if you need more information. -- Trisha Hoang | IT/Operations | Rockyou, Inc. | Phone: 408-472-3989 | AIM: rockyoutrisha -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From subscription at kkeane.com Thu Jun 10 21:55:45 2010 From: subscription at kkeane.com (Kevin Keane) Date: Thu, 10 Jun 2010 12:55:45 -0700 Subject: Large Installation In-Reply-To: References: Message-ID: Nagios does have some scalability issues, but for the most part you won't run into them until you get to truly huge installations. I can see three main scalability issues: config file maintenance and the need for one central server, and firewall issues. Config file maintenance can be improved to some extent with careful design of the config files, as well as tools. It is an issue that I am running into with a relatively small installation with 80+ hosts and 400+ services. My installation is highly heterogeneous and very dynamic, which makes config file maintenance a nightmare. Having to restart Nagios after a configuration change doesn't help either. On the other hand, a network with 2000 identical machines is probably going to be much easier to manage than my type of network. The central server is an obvious bottleneck. No matter how powerful the machine and the network connection, there are only so many checks results it can handle. Fortunately, Nagios doesn't require much horsepower. Distributed monitoring helps with this issue because the most expensive part of Nagios is running active checks. With distributed monitoring, the active checks can run on multiple smaller boxes, and then send the check results back as passive checks. Of course distributed monitoring compounds the config file maintenance issue, because you have to configure each check multiple times. The third issue is not directly a scalability issue. Nagios is built with the assumption of a local and mostly trusted network. It's non-trivial to securely get checks to work on remote machines without pretty gaping poking holes into firewalls, and/or frequently establishing and tearing down encrypted connections with the attendant processing load. There are some third-party solutions for this issue, though. From: Scott Ward [mailto:13.sward.13 at gmail.com] Sent: Thursday, June 10, 2010 12:34 PM To: Nagios Users List Subject: Re: [Nagios-users] Large Installation >Make sure to read these pages: > >http://nagios.sourceforge.net/docs/3_0/tuning.html >http://nagios.sourceforge.net/docs/3_0/largeinstalltweaks.html > >Also, if you're monitoring 800 machines across WANs, you might look >into distributed monitoring: >http://nagios.sourceforge.net/docs/3_0/distributed.html > >Let us know how it goes! Thanks for the links. So the distributive monitoring provided by the Nagios docs can handle what we're trying to do? I have read in a few places that Nagios has scalability issues. > >--Matt > >BTW, what are you using for your config maintenance? We haven't decided yet. Do you have any recommendations? ~S On Thu, Jun 10, 2010 at 2:23 PM, Matt Simmons > wrote: Make sure to read these pages: http://nagios.sourceforge.net/docs/3_0/tuning.html http://nagios.sourceforge.net/docs/3_0/largeinstalltweaks.html Also, if you're monitoring 800 machines across WANs, you might look into distributed monitoring: http://nagios.sourceforge.net/docs/3_0/distributed.html Let us know how it goes! --Matt BTW, what are you using for your config maintenance? On Thu, Jun 10, 2010 at 1:51 PM, Scott Ward <13.sward.13 at gmail.com> wrote: > We are looking to do an large installation of Nagios. Is it possible to > monitor over 800 machines and over 14000 services? > > Has anyone tried doing anything like this? If you have how successful was it > and how did you configure it? > > ~Rultax > > ------------------------------------------------------------------------------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- LITTLE GIRL: But which cookie will you eat FIRST? COOKIE MONSTER: Me think you have misconception of cookie-eating process. ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From standalone.sysadmin at gmail.com Thu Jun 10 22:12:07 2010 From: standalone.sysadmin at gmail.com (Matt Simmons) Date: Thu, 10 Jun 2010 16:12:07 -0400 Subject: Large Installation In-Reply-To: References: Message-ID: I can't say that I've solved the scalability problem, but I I don't have it, just because I've implemented a policy such that I never check any server over a WAN link, with the exception of another Nagios server (plus both ends of all of the WAN links themselves). This does require one Nagios server per site, but to me, that's an appealing idea anyway, because I don't have a single point of failure. Any of my Nagios installations could die completely, and I'd be alerted by the others, just like any one internet connection could die, and I'd still get alerts about it. In the event of a "weird" failure, I can pretty much construct the network diagram based on which links are reporting up, and from where. It does require a certain amount of configuration overhead, but most of that is done with templating anyway. I don't have my system laid out exactly like I want, but I'm implementing version control (subversion, in my case) and I have a different Nagios repository for each site. If I had more templates (or more shared configuration files), I would probably have a 'nagios-shared' repository, so I wouldn't have to replicate everything manually. As for the arrangement of my configs, it mostly follows this howto that I did a year ago: http://www.standalone-sysadmin.com/blog/2009/07/nagios-config/ Hope it can help someone --Matt On Thu, Jun 10, 2010 at 3:55 PM, Kevin Keane wrote: > Nagios does have some scalability issues, but for the most part you won?t > run into them until you get to truly huge installations. > > > > I can see three main scalability issues: config file maintenance and the > need for one central server, and firewall issues. > > > > Config file maintenance can be improved to some extent with careful design > of the config files, as well as tools. It is an issue that I am running into > with a relatively small installation with 80+ hosts and 400+ services. My > installation is highly heterogeneous and very dynamic, which makes config > file maintenance a nightmare. Having to restart Nagios after a configuration > change doesn?t help either. On the other hand, a network with 2000 identical > machines is probably going to be much easier to manage than my type of > network. > > > > The central server is an obvious bottleneck. No matter how powerful the > machine and the network connection, there are only so many checks results it > can handle. Fortunately, Nagios doesn?t require much horsepower. Distributed > monitoring helps with this issue because the most expensive part of Nagios > is running active checks. With distributed monitoring, the active checks can > run on multiple smaller boxes, and then send the check results back as > passive checks. > > > > Of course distributed monitoring compounds the config file maintenance > issue, because you have to configure each check multiple times. > > > > The third issue is not directly a scalability issue. Nagios is built with > the assumption of a local and mostly trusted network. It?s non-trivial to > securely get checks to work on remote machines without pretty gaping poking > holes into firewalls, and/or frequently establishing and tearing down > encrypted connections with the attendant processing load. There are some > third-party solutions for this issue, though. > > > > From: Scott Ward [mailto:13.sward.13 at gmail.com] > Sent: Thursday, June 10, 2010 12:34 PM > To: Nagios Users List > Subject: Re: [Nagios-users] Large Installation > > > >>Make sure to read these pages: >> >>http://nagios.sourceforge.net/docs/3_0/tuning.html >>http://nagios.sourceforge.net/docs/3_0/largeinstalltweaks.html >> >>Also, if you're monitoring 800 machines across WANs, you might look >>into distributed monitoring: >>http://nagios.sourceforge.net/docs/3_0/distributed.html >> >>Let us know how it goes! > > Thanks for the links.? So the distributive monitoring provided by the Nagios > docs can handle what we're trying to do?? I have read in a few places that > Nagios has scalability issues. > >> >>--Matt >> >>BTW, what are you using for your config maintenance? > > We haven't decided yet. Do you have any recommendations? > > > ~S > > On Thu, Jun 10, 2010 at 2:23 PM, Matt Simmons > wrote: > > Make sure to read these pages: > > http://nagios.sourceforge.net/docs/3_0/tuning.html > http://nagios.sourceforge.net/docs/3_0/largeinstalltweaks.html > > Also, if you're monitoring 800 machines across WANs, you might look > into distributed monitoring: > http://nagios.sourceforge.net/docs/3_0/distributed.html > > Let us know how it goes! > > --Matt > > BTW, what are you using for your config maintenance? > > On Thu, Jun 10, 2010 at 1:51 PM, Scott Ward <13.sward.13 at gmail.com> wrote: > >> We are looking to do an large installation of Nagios. Is it possible to >> monitor over 800 machines and over 14000 services? >> >> Has anyone tried doing anything like this? If you have how successful was >> it >> and how did you configure it? >> >> ~Rultax >> > >> >> ------------------------------------------------------------------------------ >> ThinkGeek and WIRED's GeekDad team up for the Ultimate >> GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the >> lucky parental unit. ?See the prize list and enter to win: >> http://p.sf.net/sfu/thinkgeek-promo >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when >> reporting >> any issue. >> ::: Messages without supporting info will risk being sent to /dev/null >> > > > -- > LITTLE GIRL: But which cookie will you eat FIRST? > COOKIE MONSTER: Me think you have misconception of cookie-eating process. > > ------------------------------------------------------------------------------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. ?See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > > > ------------------------------------------------------------------------------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. ?See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- LITTLE GIRL: But which cookie will you eat FIRST? COOKIE MONSTER: Me think you have misconception of cookie-eating process. ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From standalone.sysadmin at gmail.com Thu Jun 10 22:14:24 2010 From: standalone.sysadmin at gmail.com (Matt Simmons) Date: Thu, 10 Jun 2010 16:14:24 -0400 Subject: Strange fluctuation in load average In-Reply-To: References: Message-ID: When you say load average, do you mean the 1 minute moving average? And what are you using to display the load average? --Matt On Thu, Jun 10, 2010 at 3:48 PM, Trisha Hoang wrote: > Hi all, > When I first installed nagios-3.2.0 with embedded perl enabled, nagios > experienced increasing latency, starting at 1 sec and climbed upto 300 > within a few hours until restarting nagios. I read on one of the older post > suggesting to recompile nagios *without* embedded perl, and that resolved > the latency issue, with latency consistently at less than 1 sec. However, > ever since, the system load average has fluctuated wildly from 1 to 12 and > down to say ... 3 within a minute. This fluctuation happens 3-10 minutes > each time and calms down for ... say an hour. There doesn't seem to be any > cron jobs that can cause this kind of load, and cpu (1-quad core) is usually > at least 50% idle , with plenty of free memory, no IO blocks, on Centos 5-2. > What's strange is with nagios compiled with embedded perl, the load was > consistently at 2-4. > Could this be nagios related? Please let me know if you need more > information. > > -- > Trisha Hoang | IT/Operations | Rockyou, Inc. | Phone: 408-472-3989 | AIM: > rockyoutrisha > > ------------------------------------------------------------------------------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. ?See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- LITTLE GIRL: But which cookie will you eat FIRST? COOKIE MONSTER: Me think you have misconception of cookie-eating process. ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lmw94002 at hotmail.com Thu Jun 10 22:18:24 2010 From: lmw94002 at hotmail.com (Mathew Walker) Date: Thu, 10 Jun 2010 16:18:24 -0400 Subject: extra "checkresults" files being left behind In-Reply-To: <4C106147.7050401@5dninja.net> References: <4C0F61B8.7060001@flatto.net>, <4C0F93BA.7040506@flatto.net>, , , , <4C106147.7050401@5dninja.net> Message-ID: Nagios v3.2.0 And I see the check and check.ok files: -rw------- 1 nagios nagios 291 Jun 9 07:12 checkzGuzY7 -rw------- 1 nagios nagios 280 Jun 7 21:54 checkzjh6PZ -rw------- 1 nagios nagios 483 Jun 10 13:07 cxHWRxJ -rw------- 1 nagios nagios 0 Jun 10 13:07 cxHWRxJ.ok But the check* orphan files just keep showing up. They don't relate to a specific host or check. No real pattern to time, host, service, etc. I could understand if the system was hitting 100% memory or CPU... but the memory is pretty stable in the 50-70% used range. Load is nearly 0.00 across the board. The system is pretty much dedicated to my running nagios as a test box. -- Mat W. - http://www.techadre.com > Date: Wed, 9 Jun 2010 20:51:35 -0700 > From: mike-nagios at 5dninja.net > To: nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] extra "checkresults" files being left behind > > Mathew Walker wrote: > > I'm running Nagios on a little VPS box checking a few hosts/services > > (~50 checks). It's mostly a testing platform for me and checks in on my > > other test VPS systems. > > > > However I keep seeing the extra check results data files build up in > > /usr/local/nagios/var/spool/checkresults like: > > -rw------- 1 nagios nagios 249 Jun 7 23:45 checknbu01O > > -rw------- 1 nagios nagios 252 Jun 8 02:40 checkHxcsiJ > > > > Googled a bit and didn't come up with much relevant. Any thoughts? > > If I remember correctly, the parent nagios process writes out that file, > then forks a child. The child then runs the check, updates that file > and then creates a file with the same name, plus '.ok' in that > directory, letting the parent process know the check is completed. > > So, take a look at the contents of several of those files, if you're > lucky, you'll see that either they are for the same host, or the same > service check. If so, there might be something in the way that host or > service is getting polled that is causing the forked child to die. > > Also, if you're running a version older than 3.0rc1 (generally always a > good thing to include the version of the tool you're useing, when asking > for help) then you may want to upgrade, that version fixed a bug that > might be related: "Fixed bug with not deleting old check result files > that contained results for invalid host/service" > > -- > Mike Lindsey > > ------------------------------------------------------------------------------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null _________________________________________________________________ The New Busy is not the old busy. Search, chat and e-mail from your inbox. http://www.windowslive.com/campaign/thenewbusy?ocid=PID28326::T:WLMTAGL:ON:WL:en-US:WM_HMP:042010_3 -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From fseratti at iplan.com.ar Thu Jun 10 22:02:32 2010 From: fseratti at iplan.com.ar (Francisco Seratti) Date: Thu, 10 Jun 2010 17:02:32 -0300 Subject: Nagios and IBM Tivoli SRM integration Message-ID: Hello to everyone, I have a simple question: Did anyone succeed integrating Nagios with this Service Request Manager tool or any other IBM Tivoli software? My goal is to open new service requests in this framework automatically for every Nagios DOWN notification. Thank you very much in advance, any suggestion will be appreciated. Francisco. ESTE MENSAJE ES CONFIDENCIAL. Puede contener informaci?n amparada por el secreto profesional. Si usted ha recibido este e-mail por error, por favor comun?quenoslo inmediatamente v?a e-mail y tenga la amabilidad de eliminarlo de su sistema; no deber? copiar el mensaje ni divulgar su contenido a ninguna persona. Muchas gracias. THIS MESSAGE IS CONFIDENTIAL. It may also contain information that is privileged or otherwise legally exempt from disclosure. If you have received it by mistake please let us know by e-mail immediately and delete it from your system; should also not copy the message nor disclose its contents to anyone. Many thanks. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From trisha at rockyou.com Thu Jun 10 22:48:52 2010 From: trisha at rockyou.com (Trisha Hoang) Date: Thu, 10 Jun 2010 13:48:52 -0700 Subject: Strange fluctuation in load average In-Reply-To: References: Message-ID: I'm using uptime to obtain the load average. Here's a snippet of the values. 09:17:34 up 5 days, 16:06, 3 users, load average: 2.07, 2.61, 3.45 09:19:34 up 5 days, 16:08, 3 users, load average: 9.09, 4.78, 4.13 09:21:34 up 5 days, 16:10, 3 users, load average: 10.05, 6.69, 4.91 09:23:34 up 5 days, 16:12, 3 users, load average: 8.83, 7.08, 5.24 09:25:34 up 5 days, 16:14, 3 users, load average: 9.42, 8.26, 5.91 09:27:34 up 5 days, 16:16, 3 users, load average: 4.43, 6.66, 5.60 09:29:34 up 5 days, 16:18, 3 users, load average: 13.06, 8.85, 6.51 09:31:34 up 5 days, 16:20, 3 users, load average: 7.35, 8.61, 6.73 09:33:34 up 5 days, 16:22, 3 users, load average: 7.87, 7.96, 6.69 09:35:34 up 5 days, 16:24, 3 users, load average: 4.25, 6.94, 6.49 09:37:34 up 5 days, 16:26, 3 users, load average: 2.50, 5.34, 5.95 09:39:34 up 5 days, 16:28, 3 users, load average: 7.53, 6.21, 6.19 09:41:34 up 5 days, 16:30, 3 users, load average: 5.71, 6.11, 6.15 09:43:34 up 5 days, 16:32, 3 users, load average: 1.56, 4.39, 5.51 -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From td3201 at gmail.com Fri Jun 11 00:10:21 2010 From: td3201 at gmail.com (Terry) Date: Thu, 10 Jun 2010 17:10:21 -0500 Subject: check_yum issue In-Reply-To: References: Message-ID: On Sat, Jun 5, 2010 at 9:02 AM, Kevin Keane wrote: > You would probably want to use sudo. Instead of having NRPE call check_yum directly, have it call sudo check_yum, and add check_yum for the Nagios user to your sudoers (make sure to not require a password, of course!) > > Be sure to keep the sudoers entry as restrictive as possible, or you may open a security hole. > > -----Original Message----- > From: Terry [mailto:td3201 at gmail.com] > Sent: Thursday, June 03, 2010 11:40 AM > To: nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] check_yum issue > > On Thu, Jun 3, 2010 at 1:28 PM, Terry wrote: >> Hello, >> >> I am trying to use check_yum: >> http://exchange.nagios.org/directory/Plugins/Uncategorized/Operating-S >> ystems/Linux/Check_Yum/details >> >> It works great from the command line: >> [root at foo ~]# yum --security check-update Loaded plugins: dellsysid, >> rhnplugin, security Limiting package lists to security relevant ones >> Needed 4 of 11 packages, for security >> >> rhn-check.noarch >> ? ? ? ? ? ? ? ? ? ? ? ? 0.4.20-33.el5_5.2 >> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? rhel-x86_64-server-5 >> rhn-client-tools.noarch >> ? ? ? ? ? ? ? ? ? ? ? ? 0.4.20-33.el5_5.2 >> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? rhel-x86_64-server-5 >> rhn-setup.noarch >> ? ? ? ? ? ? ? ? ? ? ? ? 0.4.20-33.el5_5.2 >> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? rhel-x86_64-server-5 >> rhn-setup-gnome.noarch >> ? ? ? ? ? ? ? ? ? ? ? ? 0.4.20-33.el5_5.2 >> ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? ? rhel-x86_64-server-5 >> [root at foo ~]# /usr/lib64/nagios/plugins/check_yum >> YUM CRITICAL: 4 Security Updates Available. 7 Non-Security Updates >> Available [root at foo ~]# echo $? >> 2 >> >> It returns this from nagios: >> [root at foo ~]# /usr/lib64/nagios/plugins/check_nrpe -H 10.0.0.2 -t 50 >> -c check_yum YUM OK: 0 Security Updates Available >> >> Here's my NRPE configuration: >> [root at bar ~]# cat /etc/nagios/nrpe.cfg | grep check_yum >> ? ? ? ?command[check_yum]=/usr/lib64/nagios/plugins/check_yum >> >> What am I missing here? >> > > I think I fail here. ?This is a permissions issue as noted in the > description of the plugin. ? ?Anyone doing something similar? ?If so, > how is your solution architected? > > Thanks! > > ------------------------------------------------------------------------------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. ?See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > ------------------------------------------------------------------------------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. ?See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > I think I did one better, maybe. I am having nagios call check_by_ssh which uses a key that is specific for this command. On the remote side, I am configuring the authorized_hosts such as this: command="/usr/lib/nagios/plugins/check_yum" ssh-rsa AA..... The only thing this key can do is call check_yum on the remote end. ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Fri Jun 11 11:38:43 2010 From: ae at op5.se (Andreas Ericsson) Date: Fri, 11 Jun 2010 11:38:43 +0200 Subject: Large Installation In-Reply-To: References: Message-ID: <4C120423.70204@op5.se> On 06/10/2010 07:51 PM, Scott Ward wrote: > We are looking to do an large installation of Nagios. Is it possible to > monitor over 800 machines and over 14000 services? > > Has anyone tried doing anything like this? If you have how successful was it > and how did you configure it? > We have plenty of customers with far more than 1000 hosts. 800 should just be a matter of running Nagios on a decently beefy hardware. Don't attempt it with a virtual system though. They have notoriously crappy performance with multi-fork()'ing applications, and if you ever hit the swap, they'll degrade even further. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace. ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at martinmelin.com Fri Jun 11 11:48:17 2010 From: nagios at martinmelin.com (Martin Melin) Date: Fri, 11 Jun 2010 11:48:17 +0200 Subject: Large Installation In-Reply-To: References: Message-ID: On Thu, Jun 10, 2010 at 21:55, Kevin Keane wrote: > Config file maintenance can be improved to some extent with careful design > of the config files, as well as tools. It is an issue that I am running into > with a relatively small installation with 80+ hosts and 400+ services. My > installation is highly heterogeneous and very dynamic, which makes config > file maintenance a nightmare. Having to restart Nagios after a configuration > change doesn?t help either. On the other hand, a network with 2000 identical > machines is probably going to be much easier to manage than my type of > network. > Nitpicking or helpful tip, you decide: Nagios reloads config changes on SIGHUP, you don't have to do a restart. A full restart can take a while on a sufficiently sized installation so having to do one for every change would indeed be a PITA, but I've never seen a reload take more than a few seconds. Cheers Martin -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From 13.sward.13 at gmail.com Fri Jun 11 15:04:45 2010 From: 13.sward.13 at gmail.com (Scott Ward) Date: Fri, 11 Jun 2010 08:04:45 -0500 Subject: Large Installation In-Reply-To: References: Message-ID: We are going to be using distributed monitoring for sure. We just cannot decide whether we should use NDO to write directly to the database or us NSCA to send back to the master server. Any suggestions? Is there a frontend that actually uses the information in an NDO db? From what I've read it looks like the default Nagios front end uses text files. ~Scott Ward On Fri, Jun 11, 2010 at 4:48 AM, Martin Melin wrote: > On Thu, Jun 10, 2010 at 21:55, Kevin Keane wrote: > >> Config file maintenance can be improved to some extent with careful design >> of the config files, as well as tools. It is an issue that I am running into >> with a relatively small installation with 80+ hosts and 400+ services. My >> installation is highly heterogeneous and very dynamic, which makes config >> file maintenance a nightmare. Having to restart Nagios after a configuration >> change doesn?t help either. On the other hand, a network with 2000 identical >> machines is probably going to be much easier to manage than my type of >> network. >> > Nitpicking or helpful tip, you decide: Nagios reloads config changes on > SIGHUP, you don't have to do a restart. A full restart can take a while on a > sufficiently sized installation so having to do one for every change would > indeed be a PITA, but I've never seen a reload take more than a few seconds. > > Cheers > Martin > > > ------------------------------------------------------------------------------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Fri Jun 11 15:29:26 2010 From: ae at op5.se (Andreas Ericsson) Date: Fri, 11 Jun 2010 15:29:26 +0200 Subject: Large Installation In-Reply-To: References: Message-ID: <4C123A36.1030607@op5.se> On 06/11/2010 03:04 PM, Scott Ward wrote: > We are going to be using distributed monitoring for sure. We just cannot > decide whether we should use NDO to write directly to the database or us > NSCA to send back to the master server. Any suggestions? > > Is there a frontend that actually uses the information in an NDO db? From > what I've read it looks like the default Nagios front end uses text files. > Unless you desperately need performance data from satellite systems handled properly, I'd invite you to give Merlin and Ninja a try. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace. ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From shadhin71 at gmail.com Fri Jun 11 15:45:36 2010 From: shadhin71 at gmail.com (shadih rahman) Date: Fri, 11 Jun 2010 09:45:36 -0400 Subject: NDOUTILS - Duplicate lines for each service check in servicechecks table (ndoutils 1.4b9) In-Reply-To: <4AF9E48D.70108@univie.ac.at> References: <4AF9E48D.70108@univie.ac.at> Message-ID: Can someone point out the patch location. I have searched nagios-devel mailing list but could not find it. Thanks in advance. On Tue, Nov 10, 2009 at 6:09 PM, Michael Friedrich < michael.friedrich at univie.ac.at> wrote: > > > ?yvind Nordang wrote: > > Duplicate lines for each service check in servicechecks table > > (ndoutils 1.4b9) > > > > I have: > > Nagios 3.2.0 > > NDOutils 1.4b9 > > > > Is this a bug of feature? > I've attached the patch to nagios-devel, dunno when it will be fixed > then. But I have analyzed more queries, and there are several other > tables missing unique constraints and therefore causing duplicate rows - > e.g. systemcommands while testing perfdata output ... > > Maybe another patch will follow but first I will first on Icinga :-) > > Kind regards, > Michael > > > > > ------------------------------------------------------------------------------ > Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day > trial. Simplify your report design, integration and deployment - and focus > on > what you do best, core application coding. Discover what's new with > Crystal Reports now. http://p.sf.net/sfu/bobj-july > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Cordially, Shadhin Rahman -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mark.frost1 at pepsico.com Fri Jun 11 16:13:46 2010 From: mark.frost1 at pepsico.com (Frost, Mark {PBC}) Date: Fri, 11 Jun 2010 10:13:46 -0400 Subject: Merlin/Ninja perfdata status? In-Reply-To: <4C123A36.1030607@op5.se> References: <4C123A36.1030607@op5.se> Message-ID: > -----Original Message----- > From: Andreas Ericsson [mailto:ae at op5.se] > Sent: Friday, June 11, 2010 9:29 AM > To: Nagios Users List > Subject: Re: [Nagios-users] Large Installation > > > Unless you desperately need performance data from satellite systems > handled properly, I'd invite you to give Merlin and Ninja a try. Andreas, We're planning on a Nagios refresh/rearchitecture near the end of this year and I'm really hopeful that we might be able to move to Ninja/Merlin as they do a lot of things we'd really like to have. They also solve some issues we have with our current distributed system. I've been trying to pay attention to the latest developments in this area, but I may have missed something as changes are happening quickly. We do, however, rely pretty heavily on performance data. I think I saw someone had a hack to do it with Merlin, but it's not really part of Merlin right now which makes me not want to adopt it for a production Nagios installation. I recall a sort of Merlin roadmap for the rest of the year indicating that upcoming work was to better support distributed setups, if I remember correctly. Is there also work afoot to get perfdata into Merlin perhaps with the next release? I'm trying to build some test systems to try the current version of Merlin/Ninja to assess how "production ready" it might be for us by the end of the year when we need to make a decision. Thanks very much for all the hard work you and others at Op5 have put in to these tools. Mark ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rlemerlus at merethis.com Fri Jun 11 15:52:19 2010 From: rlemerlus at merethis.com (Romain Le Merlus) Date: Fri, 11 Jun 2010 15:52:19 +0200 Subject: Large Installation In-Reply-To: <4C123A36.1030607@op5.se> References: <4C123A36.1030607@op5.se> Message-ID: Hi Scott, You can also try Centreon software to manage your different pollers and configuration: http://www.centreon.com Here is an overview of the functioning: http://en.doc.centreon.com/CentreonArchitecture To see how it looks like, here is a web demo: http://demo.centreon.com Best regards. -- Romain LE MERLUS rlemerlus at merethis.com Tel. +33 (0)1 49 69 97 12 Mob. +33(0)6 85 05 02 82 -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From a31modela at hotmail.com Fri Jun 11 16:19:36 2010 From: a31modela at hotmail.com (steve f) Date: Fri, 11 Jun 2010 10:19:36 -0400 Subject: Nagios & Postemsg Message-ID: Hello All, I am currently looking for an alternative to using Tivoli , TEC & postemsg for a rather large ( 6000 + ) remote environment. I have had great success with Nagios in my small local/remote test environment and the obvious cost savings without having TEC anymore is huge. Can I use the existing postemsg tests that are running on the boxes and via I guess External Commands have Nagios process the messages? For those familiar with both Tivoli & Nagios, Is there anything that Tivoli gives me that I cant do with Nagios? I don't see it if there is. Thanks for the help, Steve _________________________________________________________________ The New Busy think 9 to 5 is a cute idea. Combine multiple calendars with Hotmail. http://www.windowslive.com/campaign/thenewbusy?tile=multicalendar&ocid=PID28326::T:WLMTAGL:ON:WL:en-US:WM_HMP:042010_5 -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Fri Jun 11 17:21:46 2010 From: ae at op5.se (Andreas Ericsson) Date: Fri, 11 Jun 2010 17:21:46 +0200 Subject: Merlin/Ninja perfdata status? In-Reply-To: References: <4C123A36.1030607@op5.se> Message-ID: <4C12548A.7090209@op5.se> On 06/11/2010 04:13 PM, Frost, Mark {PBC} wrote: >> -----Original Message----- >> From: Andreas Ericsson [mailto:ae at op5.se] >> Sent: Friday, June 11, 2010 9:29 AM >> To: Nagios Users List >> Subject: Re: [Nagios-users] Large Installation >> >> >> Unless you desperately need performance data from satellite systems >> handled properly, I'd invite you to give Merlin and Ninja a try. > > Andreas, > > We're planning on a Nagios refresh/rearchitecture near the end of this year > and I'm really hopeful that we might be able to move to Ninja/Merlin as they > do a lot of things we'd really like to have. They also solve some issues we > have with our current distributed system. > > I've been trying to pay attention to the latest developments in this area, but > I may have missed something as changes are happening quickly. > > We do, however, rely pretty heavily on performance data. I think I saw someone had > a hack to do it with Merlin, but it's not really part of Merlin right now which makes > me not want to adopt it for a production Nagios installation. > As do most of our customers. We ship pnp with our systems, and our users expect graphs to Just Work(tm). I've just started working on the performance data problem right now, although I might have to modify the Nagios core to do it in an elegant fashion, so perhaps you'll be forced to run a patched Nagios with your ninja/merlin setup. We shall see how it goes. > I recall a sort of Merlin roadmap for the rest of the year indicating that upcoming > work was to better support distributed setups, if I remember correctly. Is there also > work afoot to get perfdata into Merlin perhaps with the next release? > The next stable release of Merlin will support fully redundant and distributed setups, with configuration sync from master to poller and between peers. It's scheduled for stable internal releas by the end of september. I spend roughly 90% of my time at op5 working on Merlin right now, so development is indeed moving along rather rapidly. > I'm trying to build some test systems to try the current version of Merlin/Ninja to > assess how "production ready" it might be for us by the end of the year when we need > to make a decision. > By the end of the year it should be used in production at very nearly all our customers. If you're familiar with git, I'd recommend you to update (using git pull) on a daily basis. Things are moving quickly now, and the more feedback and testing help I can get, the sooner it will be ready. > Thanks very much for all the hard work you and others at Op5 have put in to these > tools. > Thanks. Always nice to be appreciated :) -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace. ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Fri Jun 11 17:36:51 2010 From: ae at op5.se (Andreas Ericsson) Date: Fri, 11 Jun 2010 17:36:51 +0200 Subject: Large Installation In-Reply-To: References: <4C123A36.1030607@op5.se> Message-ID: <4C125813.5040000@op5.se> On 06/11/2010 04:08 PM, Scott Ward wrote: > How does Merlin compare with NDO in terms of resource usage? > merlin is fairly lightweight. What little memory its uses resides primarily on the stack and fits well inside the stack of 1MiB. Here's the output of "ps wwaux | grep merlin" on a master system with two connected pollers. As you can see, "grep" consumes more memory than the merlin daemon does. This is with debug symbols compiled in btw, so it will be roughly half that when it's built for production. root 12286 0.0 0.2 61116 660 pts/0 R+ 17:29 0:00 grep -i merlin root 23236 0.0 0.7 50572 1856 ? S 13:56 0:01 /opt/monitor/op5/merlin/merlind -c /opt/monitor/op5/merlin/merlin.conf As for CPU usage, it's definitely more lightweight than NDO. A typical merlin daemon will basically idle away most of its time. It's the database that does the heavy lifting after all, so it's not that hard to make merlin itself lean and extremely quick. As for storage-space, it doesn't use nearly as much as ndoutils does, since we don't store the entire log and all status updates in the database, but only the current status and statechanges, where a statechange is defined as "either the state has changed, or the object went from soft to hard state", which is basically all we need to make reports look good. Since the logfiles are already partitioned by date, it was deemed a lot easier to write a super-fast parser for those instead and make that parser able to display html output. This is the helper we use in ninja, and it's working extremely well, showing interesting logdata in a matter of seconds. It will grow over time ofcourse, but while NDOUtils' database can grow to tens of gigabytes in a matter of months for a large network, merlin stores about 500MiB for a whole year for the same size network. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace. ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From perldork at webwizarddesign.com Fri Jun 11 18:03:08 2010 From: perldork at webwizarddesign.com (Max) Date: Fri, 11 Jun 2010 12:03:08 -0400 Subject: Nagios & Postemsg In-Reply-To: References: Message-ID: The main things you will not get from Nagios that you almost always get with Tivoli: * High recurring licensing fees * On-site Tivoli consultants :) Nagios does not give you "out of the box" the visualization dashboards that Tivoli has but with Nagviz you can you make very nice graphical dashboards at a much much lower cost to your company. Nagios also does not do auto-discovery out of the box but there are projects that give you that capabililty - again at a much lower cost. Distributed Nagios - there are a few choices, you will need to take the time to evaluate them all and choose the right one for you, but again, cost will be lower than Tivoli. The team I am on is building out a distributed architecture for Nagios based on our unique requirements - self service model where many SAs can all change configs on their schedule without our intervention, clustering, fast redistribution of hosts/services across pollers, centralized transparent (to the end user) command and control across all pollers. We are using some existing tools (Nagios and Merlin) and 4 developers and even then the TCO and maintenance cost will be magnitudes of order cheaper than Tivoli with much more functionality than most Tivoli shops offer. A polling model always has some challenges when it comes to scaling big but compared to Tivoli I think you will find Nagios to be both a lot more fun, a lot more flexible, a lot better fit, and, if politics don't interfere, your management should be much more happy with a fixed cost development price tag than the high $$ open ended maintenance costs of a commercial product like Tivoli. - Max On Fri, Jun 11, 2010 at 10:19 AM, steve f wrote: > Hello All, > > I am currently looking for an alternative to using Tivoli , TEC & postemsg > for a rather large ( 6000 + ) remote environment. > > I have had great success with Nagios in my small local/remote test > environment and the obvious cost savings without having TEC anymore is huge. > > Can I use the existing postemsg tests that are running on the boxes and via > I guess External Commands have Nagios process the messages? > > For those familiar with both Tivoli & Nagios, Is there anything that Tivoli > gives me that I cant do with Nagios? I don't see it if there is. > > > Thanks for the help, > > Steve > > ------------------------------ > The New Busy think 9 to 5 is a cute idea. Combine multiple calendars with > Hotmail. Get busy. > > > ------------------------------------------------------------------------------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From perldork at webwizarddesign.com Fri Jun 11 18:26:22 2010 From: perldork at webwizarddesign.com (Max) Date: Fri, 11 Jun 2010 12:26:22 -0400 Subject: Large Installation In-Reply-To: <4C125813.5040000@op5.se> References: <4C123A36.1030607@op5.se> <4C125813.5040000@op5.se> Message-ID: I can attest / confirm what Andreas states about the merlin daemon. BTW, Andreas, I just patched our code base to contain your 0.6.7 changes and I will be posting that on Github for you and anyone else interested to check out over the weekend. Our tests so far are showing that with the Merlin NEB and daemon on a poller we lose less than 10% capacity on the poller compared to the poller without the NEB module and Merlind - our test poller is running 10k active services checks and 1k active host checks in less than 5 minutes with polling headroom to spare. - Max -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From perldork at webwizarddesign.com Fri Jun 11 18:27:47 2010 From: perldork at webwizarddesign.com (Max) Date: Fri, 11 Jun 2010 12:27:47 -0400 Subject: Large Installation In-Reply-To: References: <4C123A36.1030607@op5.se> <4C125813.5040000@op5.se> Message-ID: Our changes to Merlin allow N pollers to all write to the same database without conflicts. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From zarrelli at linux.it Fri Jun 11 19:36:01 2010 From: zarrelli at linux.it (Giorgio Zarrelli) Date: Fri, 11 Jun 2010 19:36:01 +0200 Subject: Merlin/Ninja In-Reply-To: <4C12548A.7090209@op5.se> References: <4C123A36.1030607@op5.se> <4C12548A.7090209@op5.se> Message-ID: <26B29E7E-DA04-4A78-93E3-DD4F1796B3EF@linux.it> Well, Talking about Ninja, I installed on a Debian Lennt box. The installation process seemed a bit buggy and I see some problems like with scheduling scripts, but I find Ninja a useful tool. ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From subscription at kkeane.com Fri Jun 11 21:09:33 2010 From: subscription at kkeane.com (Kevin Keane) Date: Fri, 11 Jun 2010 12:09:33 -0700 Subject: Large Installation In-Reply-To: References: Message-ID: If you aren't actually using the data from NDO, there is little point in creating the DB. I would probably not use NDO to write directly from the satellites. Here is why: - Double the network traffic. The satellites have to send check results AND database writes. - Less reliable. How would you keep the master server from writing the same information to the DB that a satellite has just written, and messing up the data? - NDO can be a serious performance bottleneck; you wouldn't want your satellites to be a potential point of failure in terms of performance. - If the satellites are behind a firewall, it may not even be possible to write directly to the DB. From: Scott Ward [mailto:13.sward.13 at gmail.com] Sent: Friday, June 11, 2010 6:05 AM To: Nagios Users List Subject: Re: [Nagios-users] Large Installation We are going to be using distributed monitoring for sure. We just cannot decide whether we should use NDO to write directly to the database or us NSCA to send back to the master server. Any suggestions? Is there a frontend that actually uses the information in an NDO db? From what I've read it looks like the default Nagios front end uses text files. ~Scott Ward On Fri, Jun 11, 2010 at 4:48 AM, Martin Melin > wrote: On Thu, Jun 10, 2010 at 21:55, Kevin Keane > wrote: Config file maintenance can be improved to some extent with careful design of the config files, as well as tools. It is an issue that I am running into with a relatively small installation with 80+ hosts and 400+ services. My installation is highly heterogeneous and very dynamic, which makes config file maintenance a nightmare. Having to restart Nagios after a configuration change doesn't help either. On the other hand, a network with 2000 identical machines is probably going to be much easier to manage than my type of network. Nitpicking or helpful tip, you decide: Nagios reloads config changes on SIGHUP, you don't have to do a restart. A full restart can take a while on a sufficiently sized installation so having to do one for every change would indeed be a PITA, but I've never seen a reload take more than a few seconds. Cheers Martin ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From 13.sward.13 at gmail.com Sat Jun 12 03:03:57 2010 From: 13.sward.13 at gmail.com (Scott Ward) Date: Fri, 11 Jun 2010 20:03:57 -0500 Subject: Large Installation In-Reply-To: References: <4C123A36.1030607@op5.se> Message-ID: We have decided to go with Centreon. I let you know how it goes. ~Scott Ward On Fri, Jun 11, 2010 at 8:52 AM, Romain Le Merlus wrote: > Hi Scott, > > You can also try Centreon software to manage your different pollers and > configuration: > http://www.centreon.com > > Here is an overview of the functioning: > http://en.doc.centreon.com/CentreonArchitecture > > To see how it looks like, here is a web demo: > http://demo.centreon.com > > Best regards. > -- > Romain LE MERLUS > > rlemerlus at merethis.com > Tel. +33 (0)1 49 69 97 12 > Mob. +33(0)6 85 05 02 82 > > > ------------------------------------------------------------------------------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From dirk.schulz at kinzesberg.de Sat Jun 12 06:33:04 2010 From: dirk.schulz at kinzesberg.de (Dirk H. Schulz) Date: Sat, 12 Jun 2010 06:33:04 +0200 Subject: check_disk and Volumes > 10 TB Message-ID: <4C130E00.6030702@kinzesberg.de> Hi folks, I have run into a problem with check_disk. I have a volume with 14 TB that is 91 % free: > /dev/disk0s3 14Ti 1.2Ti 13Ti 9% /Volumes/EonStor1 check_disk states it is 0 % free: > check_disk -u GB /Volumes/EonStor1 > DISK OK - free space: /Volumes/EonStor1 0 GB (0% inode=91%);| > /Volumes/EonStor1=1276GB;;;0;14665 Is there a known limitation concerning the size of the volumes? With a volume < 2 TB I do not have this problem on the same machine. Is there something I can do to get around this? Any hint or help is appreciated. Dirk ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From standalone.sysadmin at gmail.com Sat Jun 12 13:16:47 2010 From: standalone.sysadmin at gmail.com (Matt Simmons) Date: Sat, 12 Jun 2010 07:16:47 -0400 Subject: check_disk and Volumes > 10 TB In-Reply-To: <4C130E00.6030702@kinzesberg.de> References: <4C130E00.6030702@kinzesberg.de> Message-ID: I've seen people run into problems like this when they're checking a machine that's 64 bit, and the monitoring host is 32. That's not, by chance, the case now, is it? --Matt On Sat, Jun 12, 2010 at 12:33 AM, Dirk H. Schulz wrote: > Hi folks, > > I have run into a problem with check_disk. I have a volume with 14 TB > that is 91 % free: >> /dev/disk0s3 ? ?14Ti ?1.2Ti ? 13Ti ? ? 9% ? ?/Volumes/EonStor1 > check_disk states it is 0 % free: >> check_disk -u GB /Volumes/EonStor1 >> DISK OK - free space: /Volumes/EonStor1 0 GB (0% inode=91%);| >> /Volumes/EonStor1=1276GB;;;0;14665 > Is there a known limitation concerning the size of the volumes? With a > volume < 2 TB I do not have this problem on the same machine. > > Is there something I can do to get around this? > > Any hint or help is appreciated. > > Dirk > > ------------------------------------------------------------------------------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. ?See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- LITTLE GIRL: But which cookie will you eat FIRST? COOKIE MONSTER: Me think you have misconception of cookie-eating process. ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From dirk.schulz at kinzesberg.de Sat Jun 12 13:59:11 2010 From: dirk.schulz at kinzesberg.de (Dirk H. Schulz) Date: Sat, 12 Jun 2010 13:59:11 +0200 Subject: check_disk and Volumes > 10 TB In-Reply-To: References: <4C130E00.6030702@kinzesberg.de> Message-ID: <4C13768F.6000901@kinzesberg.de> Am 12.06.10 13:16, schrieb Matt Simmons: > I've seen people run into problems like this when they're checking a > machine that's 64 bit, and the monitoring host is 32. That's not, by > chance, the case now, is it? > Well, the monitoring host is 32 bit, and the MacOS X 10.5. client machine claims to be i386. But the problem also occurs if I call check_disk manually on the machine where the volume is checked. The lines copied in my first mail where from such a manual run. So I am sure no 32/64Bit hassle is involved in this case. Thank you for your input! Dirk > --Matt > > On Sat, Jun 12, 2010 at 12:33 AM, Dirk H. Schulz > wrote: > >> Hi folks, >> >> I have run into a problem with check_disk. I have a volume with 14 TB >> that is 91 % free: >> >>> /dev/disk0s3 14Ti 1.2Ti 13Ti 9% /Volumes/EonStor1 >>> >> check_disk states it is 0 % free: >> >>> check_disk -u GB /Volumes/EonStor1 >>> DISK OK - free space: /Volumes/EonStor1 0 GB (0% inode=91%);| >>> /Volumes/EonStor1=1276GB;;;0;14665 >>> >> Is there a known limitation concerning the size of the volumes? With a >> volume< 2 TB I do not have this problem on the same machine. >> >> Is there something I can do to get around this? >> >> Any hint or help is appreciated. >> >> Dirk >> >> ------------------------------------------------------------------------------ >> ThinkGeek and WIRED's GeekDad team up for the Ultimate >> GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the >> lucky parental unit. See the prize list and enter to win: >> http://p.sf.net/sfu/thinkgeek-promo >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. >> ::: Messages without supporting info will risk being sent to /dev/null >> >> > > > ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From raanders at cyber-office.net Mon Jun 14 17:03:09 2010 From: raanders at cyber-office.net (Roderick A. Anderson) Date: Mon, 14 Jun 2010 08:03:09 -0700 Subject: [Q] Service config, go critical:HARD; Alert every 5 minutes until non-critical Message-ID: <4C1644AD.80401@cyber-office.net> My Nagios foo must leave a lot to be desired. 8-( I have not been able to figure out the correct combination of service definition settings to get a check to go CRITICAL:HARD on the first try (no SOFT alerts), and keep sending alerts (every five minutes) until the check/alert clears. Either the complete answer or a clue-stick whack (the settings I should be looking at) would be greatly appreciated. TIA, Rod -- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Mon Jun 14 17:11:12 2010 From: nagios at flatto.net (Assaf Flatto) Date: Mon, 14 Jun 2010 16:11:12 +0100 Subject: [Q] Service config, go critical:HARD; Alert every 5 minutes until non-critical In-Reply-To: <4C1644AD.80401@cyber-office.net> References: <4C1644AD.80401@cyber-office.net> Message-ID: <4C164690.3050105@flatto.net> Roderick A. Anderson wrote: > My Nagios foo must leave a lot to be desired. 8-( > > I have not been able to figure out the correct combination of service > definition settings to get a check to go CRITICAL:HARD on the first try > (no SOFT alerts), and keep sending alerts (every five minutes) until the > check/alert clears. > > Either the complete answer or a clue-stick whack (the settings I should > be looking at) would be greatly appreciated. > > > TIA, > Rod > What it your service definition ( template and service ) ? what is your escalation path ? and what is the alerts definitions of the contact ? Please post those details so we may help you . Assaf -- Never,Ever Cut A Deal With a Dragon I am doing a Charity Bike ride On the 27 of June for the Capital to Coast Charity. Please help by Donating http://www.justgiving.com/Lovefilm-capital-to-coast ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at martinmelin.com Mon Jun 14 17:40:49 2010 From: nagios at martinmelin.com (Martin Melin) Date: Mon, 14 Jun 2010 17:40:49 +0200 Subject: [Q] Service config, go critical:HARD; Alert every 5 minutes until non-critical In-Reply-To: <4C1644AD.80401@cyber-office.net> References: <4C1644AD.80401@cyber-office.net> Message-ID: On Mon, Jun 14, 2010 at 17:03, Roderick A. Anderson wrote: > > My Nagios foo must leave a lot to be desired. ?8-( > > I have not been able to figure out the correct combination of service > definition settings to get a check to go CRITICAL:HARD on the first try > (no SOFT alerts), and keep sending alerts (every five minutes) until the > check/alert clears. > > Either the complete answer or a clue-stick whack (the settings I should > be looking at) would be greatly appreciated. From?http://nagios.sourceforge.net/docs/3_0/objectdefinitions.html#service max_check_attempts:This directive is used to define the number of times that Nagios will retry the service check command if it returns any state other than an OK state. Setting this value to 1 will cause Nagios to generate an alert without retrying the service check again. notification_interval: This directive is used to define the number of "time units" to wait before re-notifying a contact that this service is still in a non-OK state. Unless you've changed the interval_length directive from the default value of 60, this number will mean minutes. If you set this value to 0, Nagios will not re-notify contacts about problems for this service - only one problem notification will be sent out. This should be enough to get you up and running :-) If you still have problems, post your config to the list. Regards Martin Melin ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From raanders at cyber-office.net Mon Jun 14 19:01:22 2010 From: raanders at cyber-office.net (Roderick A. Anderson) Date: Mon, 14 Jun 2010 10:01:22 -0700 Subject: [Q] Service config, go critical:HARD; Alert every 5 minutes until non-critical In-Reply-To: References: <4C1644AD.80401@cyber-office.net> Message-ID: <4C166062.4050606@cyber-office.net> Martin Merlin wrote: > On Mon, Jun 14, 2010 at 17:03, Roderick A. Anderson > wrote: >> My Nagios foo must leave a lot to be desired. 8-( >> >> I have not been able to figure out the correct combination of service >> definition settings to get a check to go CRITICAL:HARD on the first try >> (no SOFT alerts), and keep sending alerts (every five minutes) until the >> check/alert clears. >> >> Either the complete answer or a clue-stick whack (the settings I should >> be looking at) would be greatly appreciated. > > From http://nagios.sourceforge.net/docs/3_0/objectdefinitions.html#service Thanks Martin. No sure if I'd made it to this page. "The Book" is a little brief in it's description. > max_check_attempts:This directive is used to define the number of > times that Nagios will retry the service check command if it returns > any state other than an OK state. Setting this value to 1 will cause > Nagios to generate an alert without retrying the service check again. OK. This makes sense. (Whack one!) > notification_interval: This directive is used to define the number of > "time units" to wait before re-notifying a contact that this service > is still in a non-OK state. Unless you've changed the interval_length > directive from the default value of 60, this number will mean minutes. > If you set this value to 0, Nagios will not re-notify contacts about > problems for this service - only one problem notification will be sent > out. This too makes sense and again "The Book" is a little too brief in it's description. > This should be enough to get you up and running :-) I'll try it today. > If you still have problems, post your config to the list. I will if needed but it has nothing out of the ordinary. Uses the 'generic_service' template from a CentOS 5.x install of Nagios 3.0.6 plus check_interval of 5 (minutes). Nothing else special. (3.2.1 from Rpmforge will be coming in a week or so.) Again thanks, Rod -- > > Regards > Martin Melin > > ------------------------------------------------------------------------------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From zarrelli at linux.it Mon Jun 14 19:09:33 2010 From: zarrelli at linux.it (Giorgio Zarrelli) Date: Mon, 14 Jun 2010 19:09:33 +0200 Subject: [Q] Service config, go critical:HARD; Alert every 5 minutes until non-critical In-Reply-To: <4C1644AD.80401@cyber-office.net> References: <4C1644AD.80401@cyber-office.net> Message-ID: Hi, have a look at my little 2 cents on escalations, it will show you how the checks timings and notification work. http://www.zarrelli.org/blog/2010/04/26/nagios-notification-escalations-made-easy/ Ciao, Giorgio Il giorno 14/giu/2010, alle ore 17.03, "Roderick A. Anderson" ha scritto: > My Nagios foo must leave a lot to be desired. 8-( > > I have not been able to figure out the correct combination of service > definition settings to get a check to go CRITICAL:HARD on the first > try > (no SOFT alerts), and keep sending alerts (every five minutes) until > the > check/alert clears. > > Either the complete answer or a clue-stick whack (the settings I > should > be looking at) would be greatly appreciated. > > > TIA, > Rod > -- > > > --- > --- > --- > --------------------------------------------------------------------- > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From raanders at cyber-office.net Mon Jun 14 19:22:13 2010 From: raanders at cyber-office.net (Roderick A. Anderson) Date: Mon, 14 Jun 2010 10:22:13 -0700 Subject: [Q] Service config, go critical:HARD; Alert every 5 minutes until non-critical In-Reply-To: References: <4C1644AD.80401@cyber-office.net> Message-ID: <4C166545.90909@cyber-office.net> Giorgio Zarrelli wrote: > Hi, > > have a look at my little 2 cents on escalations, it will show you how > the checks timings and notification work. > > > http://www.zarrelli.org/blog/2010/04/26/nagios-notification-escalations-made-easy/ Thanks. I now remember a (your?) your posting about this but I had forgotten it while I was researching. This also it adds some more knowledge that I can use for what I would like to accomplish. "Bothersome alerts for the first hour then move to a really annoying alert interval after that. :-)" \\||/ Rod -- > > Ciao, > > Giorgio > > Il giorno 14/giu/2010, alle ore 17.03, "Roderick A. Anderson" > ha scritto: > >> My Nagios foo must leave a lot to be desired. 8-( >> >> I have not been able to figure out the correct combination of service >> definition settings to get a check to go CRITICAL:HARD on the first >> try >> (no SOFT alerts), and keep sending alerts (every five minutes) until >> the >> check/alert clears. >> >> Either the complete answer or a clue-stick whack (the settings I >> should >> be looking at) would be greatly appreciated. >> >> >> TIA, >> Rod >> -- >> >> >> --- >> --- >> --- >> --------------------------------------------------------------------- >> ThinkGeek and WIRED's GeekDad team up for the Ultimate >> GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the >> lucky parental unit. See the prize list and enter to win: >> http://p.sf.net/sfu/thinkgeek-promo >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when >> reporting any issue. >> ::: Messages without supporting info will risk being sent to /dev/null > > ------------------------------------------------------------------------------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From zarrelli at linux.it Mon Jun 14 19:32:58 2010 From: zarrelli at linux.it (Giorgio Zarrelli) Date: Mon, 14 Jun 2010 19:32:58 +0200 Subject: [Q] Service config, go critical:HARD; Alert every 5 minutes until non-critical In-Reply-To: <4C166545.90909@cyber-office.net> References: <4C1644AD.80401@cyber-office.net> <4C166545.90909@cyber-office.net> Message-ID: <2CBC50CB-B468-4E67-BA76-2A7CE68EB508@linux.it> You are right, I took one post I wrote here and I added some details. Now I am on the couch waiting for dinner but tomorrow I hope I can help you. Ciao, Giorgio Il giorno 14/giu/2010, alle ore 19.22, "Roderick A. Anderson" ha scritto: > Giorgio Zarrelli wrote: >> Hi, >> >> have a look at my little 2 cents on escalations, it will show you how >> the checks timings and notification work. >> >> >> http://www.zarrelli.org/blog/2010/04/26/nagios-notification-escalations-made-easy/ > > Thanks. I now remember a (your?) your posting about this but I had > forgotten it while I was researching. This also it adds some more > knowledge that I can use for what I would like to accomplish. > "Bothersome alerts for the first hour then move to a really annoying > alert interval after that. :-)" > > > \\||/ > Rod > -- >> >> Ciao, >> >> Giorgio >> >> Il giorno 14/giu/2010, alle ore 17.03, "Roderick A. Anderson" >> ha scritto: >> >>> My Nagios foo must leave a lot to be desired. 8-( >>> >>> I have not been able to figure out the correct combination of >>> service >>> definition settings to get a check to go CRITICAL:HARD on the first >>> try >>> (no SOFT alerts), and keep sending alerts (every five minutes) until >>> the >>> check/alert clears. >>> >>> Either the complete answer or a clue-stick whack (the settings I >>> should >>> be looking at) would be greatly appreciated. >>> >>> >>> TIA, >>> Rod >>> -- >>> >>> >>> --- >>> --- >>> --- >>> --- >>> ------------------------------------------------------------------ >>> ThinkGeek and WIRED's GeekDad team up for the Ultimate >>> GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the >>> lucky parental unit. See the prize list and enter to win: >>> http://p.sf.net/sfu/thinkgeek-promo >>> _______________________________________________ >>> Nagios-users mailing list >>> Nagios-users at lists.sourceforge.net >>> https://lists.sourceforge.net/lists/listinfo/nagios-users >>> ::: Please include Nagios version, plugin version (-v) and OS when >>> reporting any issue. >>> ::: Messages without supporting info will risk being sent to /dev/ >>> null >> >> --- >> --- >> --- >> --------------------------------------------------------------------- >> ThinkGeek and WIRED's GeekDad team up for the Ultimate >> GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the >> lucky parental unit. See the prize list and enter to win: >> http://p.sf.net/sfu/thinkgeek-promo >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when >> reporting any issue. >> ::: Messages without supporting info will risk being sent to /dev/ >> null > > > --- > --- > --- > --------------------------------------------------------------------- > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From raanders at cyber-office.net Mon Jun 14 19:36:14 2010 From: raanders at cyber-office.net (Roderick A. Anderson) Date: Mon, 14 Jun 2010 10:36:14 -0700 Subject: [Q] Service config, go critical:HARD; Alert every 5 minutes until non-critical In-Reply-To: <2CBC50CB-B468-4E67-BA76-2A7CE68EB508@linux.it> References: <4C1644AD.80401@cyber-office.net> <4C166545.90909@cyber-office.net> <2CBC50CB-B468-4E67-BA76-2A7CE68EB508@linux.it> Message-ID: <4C16688E.1000200@cyber-office.net> Giorgio Zarrelli wrote: > You are right, I took one post I wrote here and I added some details. > Now I am on the couch waiting for dinner but tomorrow I hope I can > help you. No big worry. I'm sure your web page will provide all I need. Enjoy dinner. \\||/ Rod -- > > Ciao, > > Giorgio > > Il giorno 14/giu/2010, alle ore 19.22, "Roderick A. Anderson" > ha scritto: > >> Giorgio Zarrelli wrote: >>> Hi, >>> >>> have a look at my little 2 cents on escalations, it will show you how >>> the checks timings and notification work. >>> >>> >>> http://www.zarrelli.org/blog/2010/04/26/nagios-notification-escalations-made-easy/ >> Thanks. I now remember a (your?) your posting about this but I had >> forgotten it while I was researching. This also it adds some more >> knowledge that I can use for what I would like to accomplish. >> "Bothersome alerts for the first hour then move to a really annoying >> alert interval after that. :-)" >> >> >> \\||/ >> Rod >> -- >>> Ciao, >>> >>> Giorgio >>> >>> Il giorno 14/giu/2010, alle ore 17.03, "Roderick A. Anderson" >>> ha scritto: >>>> My Nagios foo must leave a lot to be desired. 8-( >>>> >>>> I have not been able to figure out the correct combination of >>>> service >>>> definition settings to get a check to go CRITICAL:HARD on the first >>>> try >>>> (no SOFT alerts), and keep sending alerts (every five minutes) until >>>> the >>>> check/alert clears. >>>> >>>> Either the complete answer or a clue-stick whack (the settings I >>>> should >>>> be looking at) would be greatly appreciated. >>>> >>>> >>>> TIA, >>>> Rod >>>> -- >>>> >>>> >>>> --- >>>> --- >>>> --- >>>> --- >>>> ------------------------------------------------------------------ >>>> ThinkGeek and WIRED's GeekDad team up for the Ultimate >>>> GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the >>>> lucky parental unit. See the prize list and enter to win: >>>> http://p.sf.net/sfu/thinkgeek-promo >>>> _______________________________________________ >>>> Nagios-users mailing list >>>> Nagios-users at lists.sourceforge.net >>>> https://lists.sourceforge.net/lists/listinfo/nagios-users >>>> ::: Please include Nagios version, plugin version (-v) and OS when >>>> reporting any issue. >>>> ::: Messages without supporting info will risk being sent to /dev/ >>>> null >>> --- >>> --- >>> --- >>> --------------------------------------------------------------------- >>> ThinkGeek and WIRED's GeekDad team up for the Ultimate >>> GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the >>> lucky parental unit. See the prize list and enter to win: >>> http://p.sf.net/sfu/thinkgeek-promo >>> _______________________________________________ >>> Nagios-users mailing list >>> Nagios-users at lists.sourceforge.net >>> https://lists.sourceforge.net/lists/listinfo/nagios-users >>> ::: Please include Nagios version, plugin version (-v) and OS when >>> reporting any issue. >>> ::: Messages without supporting info will risk being sent to /dev/ >>> null >> >> --- >> --- >> --- >> --------------------------------------------------------------------- >> ThinkGeek and WIRED's GeekDad team up for the Ultimate >> GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the >> lucky parental unit. See the prize list and enter to win: >> http://p.sf.net/sfu/thinkgeek-promo >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when >> reporting any issue. >> ::: Messages without supporting info will risk being sent to /dev/null > > ------------------------------------------------------------------------------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From trisha at rockyou.com Tue Jun 15 04:18:40 2010 From: trisha at rockyou.com (Trisha Hoang) Date: Mon, 14 Jun 2010 19:18:40 -0700 Subject: Looking for an alternative user interface with more advanced features Message-ID: Hi, There are times when I need to disable notifications or submit downtime for *random* hosts/services that don't belong to any particular hostgroups/servicegroups, and the standard Nagios UI doesn't have this kind of feature. Would you recommend some tools out there that are stable, easy to install, easy to use, that have some of the more advanced features? Thank you. Trisha -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From standalone.sysadmin at gmail.com Tue Jun 15 05:39:19 2010 From: standalone.sysadmin at gmail.com (Matt Simmons) Date: Mon, 14 Jun 2010 23:39:19 -0400 Subject: Looking for an alternative user interface with more advanced features In-Reply-To: References: Message-ID: Do you mean that you can't do it if you go to Services or Hosts, or you mean that you really do want to disable notifications and downtime for *truly* random hosts? Because I don't think there's a whole lot of use cases matching that. --Matt On Mon, Jun 14, 2010 at 10:18 PM, Trisha Hoang wrote: > Hi, > There are times when I need to disable notifications or submit downtime for > *random* hosts/services that don't belong to any particular > hostgroups/servicegroups, and the standard Nagios UI doesn't have this kind > of feature. > Would you recommend some tools out there that are stable, easy to install, > easy to use, that have some of the more advanced features? > Thank you. > Trisha > > ------------------------------------------------------------------------------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. ?See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- LITTLE GIRL: But which cookie will you eat FIRST? COOKIE MONSTER: Me think you have misconception of cookie-eating process. ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lists at xodus.org Tue Jun 15 13:42:39 2010 From: lists at xodus.org (Marc Powell) Date: Tue, 15 Jun 2010 06:42:39 -0500 Subject: [Q] Service config, go critical:HARD; Alert every 5 minutes until non-critical In-Reply-To: <4C1644AD.80401@cyber-office.net> References: <4C1644AD.80401@cyber-office.net> Message-ID: <87989BF6-56AA-446A-BF99-B3FD998F9388@xodus.org> On Jun 14, 2010, at 10:03 AM, Roderick A. Anderson wrote: > My Nagios foo must leave a lot to be desired. 8-( > > I have not been able to figure out the correct combination of service > definition settings to get a check to go CRITICAL:HARD on the first try > (no SOFT alerts), and keep sending alerts (every five minutes) until the > check/alert clears. It sounds like you want to set 'is_volatile' for the service. -- Marc ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Naveen.R at philips.com Tue Jun 15 17:55:07 2010 From: Naveen.R at philips.com (R, Naveen) Date: Tue, 15 Jun 2010 17:55:07 +0200 Subject: Regarding receiving the mail from Nagios In-Reply-To: References: Message-ID: <8DF553C0530569419CAAFA4E3331C32BEB103FD588@NLCLUEXM09.connect1.local> Dear All, I newly configured Nagios and I'm very new to this tool. I have configured my email address (i.e. Naveen.r at philips.com), and when I try to send the test mail, it says [cid:image001.png at 01CB0CD0.D980F830] But till now I have not received any mails as such. I also tried changing the email address to my general email ( I.e. naveenkgr at gmail.com), but no go. Can you please guide me in receiving the mail to my inbox. Thank you in advance. Warm regards, Naveen.R., Philips Consumer Lifestyle, Philips Innovation Campus, Philips Electronics India Ltd, Manyata Tech Park, Nagavara, Bangalore - 560045. Ph: +91 80 40162000 Extn: 2718 Mobile: 9663320455 Email: naveen.r at philips.com intranet: http://pww.bangalore.philips.com, internet: http://www.bangalore.philips.com [cid:image002.png at 01CB0CD1.37BE7D00] ________________________________ The information contained in this message may be confidential and legally protected under applicable law. The message is intended solely for the addressee(s). If you are not the intended recipient, you are hereby notified that any use, forwarding, dissemination, or reproduction of this message is strictly prohibited and may be unlawful. If you are not the intended recipient, please contact the sender by return e-mail and destroy all copies of the original message. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image001.png Type: image/png Size: 7012 bytes Desc: image001.png URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image002.png Type: image/png Size: 4858 bytes Desc: image002.png URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From a31modela at hotmail.com Tue Jun 15 18:15:17 2010 From: a31modela at hotmail.com (steve f) Date: Tue, 15 Jun 2010 12:15:17 -0400 Subject: Regarding receiving the mail from Nagios In-Reply-To: <8DF553C0530569419CAAFA4E3331C32BEB103FD588@NLCLUEXM09.connect1.local> References: , <8DF553C0530569419CAAFA4E3331C32BEB103FD588@NLCLUEXM09.connect1.local> Message-ID: Have you verified that the server Nagios is running on can send a test email from the command line? Steve From: Naveen.R at philips.com To: nagios-users at lists.sourceforge.net Date: Tue, 15 Jun 2010 17:55:07 +0200 Subject: [Nagios-users] Regarding receiving the mail from Nagios Dear All, I newly configured Nagios and I?m very new to this tool. I have configured my email address (i.e. Naveen.r at philips.com), and when I try to send the test mail, it says But till now I have not received any mails as such. I also tried changing the email address to my general email ( I.e. naveenkgr at gmail.com), but no go. Can you please guide me in receiving the mail to my inbox. Thank you in advance. Warm regards, Naveen.R., Philips Consumer Lifestyle, Philips Innovation Campus, Philips Electronics India Ltd, Manyata Tech Park, Nagavara, Bangalore - 560045. Ph: +91 80 40162000 Extn: 2718 Mobile: 9663320455 Email: naveen.r at philips.com intranet: http://pww.bangalore.philips.com, internet: http://www.bangalore.philips.com The information contained in this message may be confidential and legally protected under applicable law. The message is intended solely for the addressee(s). If you are not the intended recipient, you are hereby notified that any use, forwarding, dissemination, or reproduction of this message is strictly prohibited and may be unlawful. If you are not the intended recipient, please contact the sender by return e-mail and destroy all copies of the original message. _________________________________________________________________ Hotmail is redefining busy with tools for the New Busy. Get more from your inbox. http://www.windowslive.com/campaign/thenewbusy?ocid=PID28326::T:WLMTAGL:ON:WL:en-US:WM_HMP:042010_2 -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image001.png Type: image/png Size: 7012 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image002.png Type: image/png Size: 4858 bytes Desc: not available URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From trisha at rockyou.com Tue Jun 15 19:14:14 2010 From: trisha at rockyou.com (Trisha Hoang) Date: Tue, 15 Jun 2010 10:14:14 -0700 Subject: Looking for an alternative user interface with more advanced features In-Reply-To: References: Message-ID: There are times that we need to take couple of hosts from *multiple* hostgroups for upgrade/testing. It gets to be time consuming commiting downtime for 20+ hosts one by one. Nagios only has features for either hostgroups and/or servicegroups but not a listing of nodes where users can pick and choose which hosts and services to enable/disable/downtime. On Mon, Jun 14, 2010 at 8:39 PM, Matt Simmons wrote: > Do you mean that you can't do it if you go to Services or Hosts, or > you mean that you really do want to disable notifications and downtime > for *truly* random hosts? Because I don't think there's a whole lot of > use cases matching that. > > --Matt > > > On Mon, Jun 14, 2010 at 10:18 PM, Trisha Hoang wrote: > > Hi, > > There are times when I need to disable notifications or submit downtime > for > > *random* hosts/services that don't belong to any particular > > hostgroups/servicegroups, and the standard Nagios UI doesn't have this > kind > > of feature. > > Would you recommend some tools out there that are stable, easy to > install, > > easy to use, that have some of the more advanced features? > > Thank you. > > Trisha > > > > > ------------------------------------------------------------------------------ > > ThinkGeek and WIRED's GeekDad team up for the Ultimate > > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > > lucky parental unit. See the prize list and enter to win: > > http://p.sf.net/sfu/thinkgeek-promo > > _______________________________________________ > > Nagios-users mailing list > > Nagios-users at lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when > reporting > > any issue. > > ::: Messages without supporting info will risk being sent to /dev/null > > > > > > -- > LITTLE GIRL: But which cookie will you eat FIRST? > COOKIE MONSTER: Me think you have misconception of cookie-eating process. > > > ------------------------------------------------------------------------------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Trisha Hoang | IT/Operations | Rockyou, Inc. | Phone: 408-472-3989 | AIM: rockyoutrisha -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From robert.wolfe at robertwolfe.org Tue Jun 15 19:58:01 2010 From: robert.wolfe at robertwolfe.org (Robert Wolfe) Date: Tue, 15 Jun 2010 13:58:01 -0400 Subject: Regarding receiving the mail from Nagios References: <8DF553C0530569419CAAFA4E3331C32BEB103FD588@NLCLUEXM09.connect1.local> Message-ID: <50BE9C7135A64147819E42376C136B2628FD@dc1.wolfe.local> Have you checked the logs on your SMTP server for any clues? ________________________________ From: R, Naveen [mailto:Naveen.R at philips.com] Sent: Tue 6/15/2010 11:55 AM To: Nagios Users List Subject: [Nagios-users] Regarding receiving the mail from Nagios Dear All, I newly configured Nagios and I'm very new to this tool. I have configured my email address (i.e. Naveen.r at philips.com), and when I try to send the test mail, it says But till now I have not received any mails as such. I also tried changing the email address to my general email ( I.e. naveenkgr at gmail.com), but no go. Can you please guide me in receiving the mail to my inbox. Thank you in advance. Warm regards, Naveen.R., Philips Consumer Lifestyle, Philips Innovation Campus, Philips Electronics India Ltd, Manyata Tech Park, Nagavara, Bangalore - 560045. Ph: +91 80 40162000 Extn: 2718 Mobile: 9663320455 Email: naveen.r at philips.com intranet: http://pww.bangalore.philips.com, internet: http://www.bangalore.philips.com cid:image001.png at 01CB0258.198B5390 ________________________________ The information contained in this message may be confidential and legally protected under applicable law. The message is intended solely for the addressee(s). If you are not the intended recipient, you are hereby notified that any use, forwarding, dissemination, or reproduction of this message is strictly prohibited and may be unlawful. If you are not the intended recipient, please contact the sender by return e-mail and destroy all copies of the original message. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image001.png Type: image/png Size: 7012 bytes Desc: image001.png URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image002.png Type: image/png Size: 4858 bytes Desc: image002.png URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Tue Jun 15 21:52:21 2010 From: patrick.morris at hp.com (patrick.morris at hp.com) Date: Tue, 15 Jun 2010 12:52:21 -0700 Subject: Looking for an alternative user interface with more advanced features In-Reply-To: References: Message-ID: <20100615195221.GQ3359@bakgwai.americas.hpqcorp.net> Hi Trisha! On Tue, 15 Jun 2010, Trisha Hoang wrote: > There are times that we need to take couple of hosts from *multiple* hostgroups for upgrade/testing. It gets to be time consuming commiting downtime for 20+ hosts one by one. Nagios only has features for either hostgroups and/or servicegroups but not a listing of nodes where users can pick and choose which hosts and services to enable/disable/downtime. You could always slap together a hostgroup that contains the hosts you want to put in downtime and reload the config. Alternatively, it probably wouldn't be hard to come up with a script that took a list of hosts (and maybe start/end times or durations) and submitted downtimes for those hosts via Nagios's external command interface. ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Tue Jun 15 22:14:48 2010 From: ae at op5.se (Andreas Ericsson) Date: Tue, 15 Jun 2010 22:14:48 +0200 Subject: Looking for an alternative user interface with more advanced features In-Reply-To: References: Message-ID: <4C17DF38.9060109@op5.se> On 06/15/2010 04:18 AM, Trisha Hoang wrote: > Hi, > There are times when I need to disable notifications or submit downtime for > *random* hosts/services that don't belong to any particular > hostgroups/servicegroups, and the standard Nagios UI doesn't have this kind > of feature. > Would you recommend some tools out there that are stable, easy to install, > easy to use, that have some of the more advanced features? That's a lot of easy for free tools with advanced features ;) Ninja has something along those lines if you're willing to run bleeding edge (I think). You can select multiple hosts, hostgroups, services or servicegroups and issue commands for them if you like. I think it's only in the bleeding edge versions though (meaning in our git repositories, which are readable for anyone that wants to clone them). It's been a few weeks since I worked on Ninja, so I can't say for sure. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace. ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From andrew1.li at citi.com Wed Jun 16 03:31:28 2010 From: andrew1.li at citi.com (Andrew Li) Date: Wed, 16 Jun 2010 11:31:28 +1000 Subject: Escalate after X warnings or criticals In-Reply-To: References: <4AF38E9F.7000305@createspace.com> Message-ID: <1276651887.17871.14.camel@localhost> Does anyone know if the notification count problem got fixed in 3.2.1? I had a read of the ChangeLog but it doesn't mention anything related to this problem since 3.0.6. Cheers, Andrew On Mon, 2009-11-09 at 08:55, Neil Ramsay wrote: > Hi Martin, > > The escalation_options don't take the state into consideration during > the notification count. So if you have an escalate rule on the 4th > notification and only escalate on Critical in the escalation_options > then following scenario is can occur: > You have 3 warning notifications and the 4th is Critical then it will > escalate as there have been 4 notifications and a Critical. I posted a > help request on this issue a week or two ago and would really like > this to be patched or built into the next update. > http://article.gmane.org/gmane.network.nagios.user/64997/match=escalation+state > > Cheers, > > Neil > > On Sat, Nov 7, 2009 at 12:56 AM, Martin Melin > wrote: > The existing escalation_options directive in escalation > definitions will likely get you this behavior without the need > for a patch. > > http://nagios.sourceforge.net/docs/3_0/escalations.html - see > the very bottom of this page as well as the object definition > documentation for escalation_options. > > Regards, > Martin Melin > > On Fri, Nov 6, 2009 at 3:49 AM, Mark Gius > wrote: > Currently, service notifications contain > "first/last_notification" > directives, that specify the range of notifications > that the escalation > should apply to. This method of escalation has a > weakness however. > > At my work, we let warnings go to the default contact > (which happens to > be email), and escalate to a pager chain on critical. > However, if a > service sits in WARNING for a length of time (which is > likely to happen > in the middle of the night), by the time the service > enters a CRITICAL > state the notification count exceeds our highest > escalation, and our > entire team gets paged immediately. > > What I'd like to see is the ability to distinguish > between a WARNING > notification and a CRITICAL notification in the > escalation, and set up > escalation chains that work based on the number of > CRITICAL's that have > been sent, as opposed to the total number of > notifications. > > I am planning on patching nagios to support this > behavior if there isn't > a way to achieve this behavior with the current > implementation. My plan > is to add a warning/critical count to service, add a > first/last > warning/critical state to service escalations, and add > the directives > "(first|last)_(warning|critical)_notification" to the > service escalation > configs. The idea is also to keep the current > behavior > (notification_count and first/last_notification would > still be present), > but allow finer grained control over when escalations > are sent out. > This way if somebody didn't want to use the finer > grained control their > behavior would stay the same. My current plan is to > match the > escalation if _any_ of the 3 notification ranges match > (all/warning/critical). > > Any advice on making this behavior happen with Nagios > as-is, or > suggestions/advice on the implementation are welcome. > > -Gius > ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mike-nagios at 5dninja.net Wed Jun 16 05:11:20 2010 From: mike-nagios at 5dninja.net (Mike Lindsey) Date: Tue, 15 Jun 2010 20:11:20 -0700 Subject: Escalate after X warnings or criticals In-Reply-To: <1276651887.17871.14.camel@localhost> References: <4AF38E9F.7000305@createspace.com> <1276651887.17871.14.camel@localhost> Message-ID: <4C1840D8.2070803@5dninja.net> If it hasn't, I'll be adding it myself and will be happy to submit my patches back. I've been needing this functionality for awhile, and was planning on rolling it in, in the next 2-3 months. Andrew Li wrote: > Does anyone know if the notification count problem got fixed in 3.2.1? > > I had a read of the ChangeLog but it doesn't mention anything related to > this problem since 3.0.6. -- Mike Lindsey ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Wed Jun 16 13:33:52 2010 From: nagios at flatto.net (Assaf Flatto) Date: Wed, 16 Jun 2010 12:33:52 +0100 Subject: Odd Entries in the Availability report Message-ID: <4C18B6A0.7020609@flatto.net> Hello List . I have encountered an odd occurrence in the Availability reports I produce . When i dig to see Full view , i have many entries of "Program restart" (attached below ) , but there are no matching records in the nagios log to indicate a reload or a restart was issues . Has any one ever experienced such behaviour from nagios previously ? I am using 3.2.0 from source on SLES 10.2 64 bit . Thanks 13-05-2010 00:00:00 13-05-2010 11:41:34 0d 11h 41m 34s SERVICE OK (HARD)HTTP OK: HTTP/1.1 200 OK - 95912 bytes in 0.272 second response time 13-05-2010 11:41:34 13-05-2010 11:41:34 0d 0h 0m 0s PROGRAM (RE)START Program restart 13-05-2010 11:41:34 13-05-2010 11:43:35 0d 0h 2m 1s PROGRAM (RE)START Program start 13-05-2010 11:43:35 13-05-2010 11:43:35 0d 0h 0m 0s PROGRAM (RE)START Program restart 13-05-2010 11:43:35 13-05-2010 12:05:45 0d 0h 22m 10s PROGRAM (RE)START Program start 13-05-2010 12:05:45 13-05-2010 12:05:45 0d 0h 0m 0s PROGRAM (RE)START Program restart 13-05-2010 12:05:45 13-05-2010 12:11:15 0d 0h 5m 30s PROGRAM (RE)START Program start 13-05-2010 12:11:15 13-05-2010 12:11:15 0d 0h 0m 0s PROGRAM (RE)START Program restart 13-05-2010 12:11:15 14-05-2010 00:00:00 0d 11h 48m 45s PROGRAM (RE)START Program start 14-05-2010 00:00:00 14-05-2010 10:26:13 0d 10h 26m 13s SERVICE OK (HARD)HTTP OK: HTTP/1.1 200 OK - 95552 bytes in 0.290 second response time -- Never,Ever Cut A Deal With a Dragon I am doing a Charity Bike ride On the 27 of June for the Capital to Coast Charity. Please help by Donating http://www.justgiving.com/Lovefilm-capital-to-coast ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From raanders at cyber-office.net Wed Jun 16 16:54:49 2010 From: raanders at cyber-office.net (Roderick A. Anderson) Date: Wed, 16 Jun 2010 07:54:49 -0700 Subject: Follow Up: [Q] Service config, go critical:HARD; Alert every 5 minutes until non-critical In-Reply-To: <4C1644AD.80401@cyber-office.net> References: <4C1644AD.80401@cyber-office.net> Message-ID: <4C18E5B9.7010100@cyber-office.net> Roderick A. Anderson wrote: > My Nagios foo must leave a lot to be desired. 8-( > > I have not been able to figure out the correct combination of service > definition settings to get a check to go CRITICAL:HARD on the first try > (no SOFT alerts), and keep sending alerts (every five minutes) until the > check/alert clears. > > Either the complete answer or a clue-stick whack (the settings I should > be looking at) would be greatly appreciated. My thanks to all for your suggestions and ideas. I now have it working correctly without even having to run a test. ;-) Had a hardware failure yesterday that triggered the Service. :-( It kept sending notifications every five minutes until the problem was solved. \\||/ Rod -- > > > TIA, > Rod ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mgius at createspace.com Wed Jun 16 19:25:10 2010 From: mgius at createspace.com (Gius, Mark) Date: Wed, 16 Jun 2010 13:25:10 -0400 Subject: Escalate after X warnings or criticals In-Reply-To: <4C1840D8.2070803@5dninja.net> References: <4AF38E9F.7000305@createspace.com> <1276651887.17871.14.camel@localhost> <4C1840D8.2070803@5dninja.net> Message-ID: <23193A17705DD246AFFFDF09B779F56F2519763670@EX-IAD6-B.ant.amazon.com> I had submitted a patch a while back that allows for distinguishing between warning and critical. I don't think it's going to be included in any 3.0.X releases, because it apparently breaks plugins that access Nagios' state data directly. I don't know whether or not my patch will be included in 3.2.x or higher releases. You can grab my patch (which I'm applying to vanilla 3.0.6 Nagios sources) out of this thread. At the time I wrote it, it also patched cleanly against HEAD, but I haven't kept up with it. http://article.gmane.org/gmane.network.nagios.devel/7083/ -Gius > -----Original Message----- > From: Mike Lindsey [mailto:mike-nagios at 5dninja.net] > Sent: Tuesday, June 15, 2010 8:11 PM > To: Nagios Users List > Subject: Re: [Nagios-users] Escalate after X warnings or criticals > > If it hasn't, I'll be adding it myself and will be happy to submit my > patches back. I've been needing this functionality for awhile, and was > planning on rolling it in, in the next 2-3 months. > > Andrew Li wrote: > > Does anyone know if the notification count problem got fixed in > 3.2.1? > > > > I had a read of the ChangeLog but it doesn't mention anything related > to > > this problem since 3.0.6. > > > -- > Mike Lindsey > > ----------------------------------------------------------------------- > ------- > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mgius at createspace.com Wed Jun 16 19:35:07 2010 From: mgius at createspace.com (Gius, Mark) Date: Wed, 16 Jun 2010 13:35:07 -0400 Subject: Escalate after X warnings or criticals In-Reply-To: <23193A17705DD246AFFFDF09B779F56F2519763670@EX-IAD6-B.ant.amazon.com> References: <4AF38E9F.7000305@createspace.com> <1276651887.17871.14.camel@localhost> <4C1840D8.2070803@5dninja.net> <23193A17705DD246AFFFDF09B779F56F2519763670@EX-IAD6-B.ant.amazon.com> Message-ID: <23193A17705DD246AFFFDF09B779F56F2519763695@EX-IAD6-B.ant.amazon.com> Erk, you're going to want this one, it includes directives for host states as well as unknown service states. It's got some docs in it as well, but the docs don't patch cleanly against 3.0.6. http://thread.gmane.org/gmane.network.nagios.devel/7083 -Gius > -----Original Message----- > From: Gius, Mark > Sent: Wednesday, June 16, 2010 10:25 AM > To: Nagios Users List > Subject: Re: [Nagios-users] Escalate after X warnings or criticals > > I had submitted a patch a while back that allows for distinguishing > between warning and critical. I don't think it's going to be included > in any 3.0.X releases, because it apparently breaks plugins that access > Nagios' state data directly. I don't know whether or not my patch will > be included in 3.2.x or higher releases. > > You can grab my patch (which I'm applying to vanilla 3.0.6 Nagios > sources) out of this thread. At the time I wrote it, it also patched > cleanly against HEAD, but I haven't kept up with it. > > http://article.gmane.org/gmane.network.nagios.devel/7083/ > > -Gius > > > -----Original Message----- > > From: Mike Lindsey [mailto:mike-nagios at 5dninja.net] > > Sent: Tuesday, June 15, 2010 8:11 PM > > To: Nagios Users List > > Subject: Re: [Nagios-users] Escalate after X warnings or criticals > > > > If it hasn't, I'll be adding it myself and will be happy to submit my > > patches back. I've been needing this functionality for awhile, and > was > > planning on rolling it in, in the next 2-3 months. > > > > Andrew Li wrote: > > > Does anyone know if the notification count problem got fixed in > > 3.2.1? > > > > > > I had a read of the ChangeLog but it doesn't mention anything > related > > to > > > this problem since 3.0.6. > > > > > > -- > > Mike Lindsey > > > > --------------------------------------------------------------------- > -- > > ------- > > ThinkGeek and WIRED's GeekDad team up for the Ultimate > > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > > lucky parental unit. See the prize list and enter to win: > > http://p.sf.net/sfu/thinkgeek-promo > > _______________________________________________ > > Nagios-users mailing list > > Nagios-users at lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when > > reporting any issue. > > ::: Messages without supporting info will risk being sent to > /dev/null > > ----------------------------------------------------------------------- > ------- > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From trisha at rockyou.com Wed Jun 16 19:43:11 2010 From: trisha at rockyou.com (Trisha Hoang) Date: Wed, 16 Jun 2010 10:43:11 -0700 Subject: Looking for an alternative user interface with more advanced features In-Reply-To: <4C17DF38.9060109@op5.se> References: <4C17DF38.9060109@op5.se> Message-ID: Thank you for all your suggestions. Sorry, I got no guts for 'bleeding edge' technology, and would like to sleep peacefully at night, though I will follow up on Ninja's development for future upgrades. Patrick's suggestions are great, simple, tried and true, and will fit our requirements. Thanks again. On Tue, Jun 15, 2010 at 1:14 PM, Andreas Ericsson wrote: > On 06/15/2010 04:18 AM, Trisha Hoang wrote: > > Hi, > > There are times when I need to disable notifications or submit downtime > for > > *random* hosts/services that don't belong to any particular > > hostgroups/servicegroups, and the standard Nagios UI doesn't have this > kind > > of feature. > > Would you recommend some tools out there that are stable, easy to > install, > > easy to use, that have some of the more advanced features? > > > That's a lot of easy for free tools with advanced features ;) > > Ninja has something along those lines if you're willing to run bleeding > edge (I think). You can select multiple hosts, hostgroups, services or > servicegroups and issue commands for them if you like. > > I think it's only in the bleeding edge versions though (meaning in our > git repositories, which are readable for anyone that wants to clone them). > It's been a few weeks since I worked on Ninja, so I can't say for sure. > > -- > Andreas Ericsson andreas.ericsson at op5.se > OP5 AB www.op5.se > Tel: +46 8-230225 Fax: +46 8-230231 > > Considering the successes of the wars on alcohol, poverty, drugs and > terror, I think we should give some serious thought to declaring war > on peace. > -- Trisha Hoang | IT/Operations | Rockyou, Inc. | Phone: 408-472-3989 | AIM: rockyoutrisha -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From robert.wolfe at robertwolfe.org Wed Jun 16 20:07:28 2010 From: robert.wolfe at robertwolfe.org (Robert Wolfe) Date: Wed, 16 Jun 2010 14:07:28 -0400 Subject: Looking for an alternative user interface with more advanced features In-Reply-To: <4C17DF38.9060109@op5.se> References: <4C17DF38.9060109@op5.se> Message-ID: <20100616140728.b93e9fb3.robert.wolfe@robertwolfe.org> On Tue, 15 Jun 2010 22:14:48 +0200 Andreas Ericsson wrote: > On 06/15/2010 04:18 AM, Trisha Hoang wrote: > > Hi, > > There are times when I need to disable notifications or submit downtime for > > *random* hosts/services that don't belong to any particular > > hostgroups/servicegroups, and the standard Nagios UI doesn't have this kind > > of feature. > > Would you recommend some tools out there that are stable, easy to install, > > easy to use, that have some of the more advanced features? > > > That's a lot of easy for free tools with advanced features ;) > > Ninja has something along those lines if you're willing to run bleeding > edge (I think). You can select multiple hosts, hostgroups, services or > servicegroups and issue commands for them if you like. > > I think it's only in the bleeding edge versions though (meaning in our > git repositories, which are readable for anyone that wants to clone them). > It's been a few weeks since I worked on Ninja, so I can't say for sure. What is this "Ninja" and where might one be able to obtain a copy of it? -- Robert Wolfe - MCP, MCTS, MCSA, MCSE, Linux+, LPIC-1, CCA Email: robert.wolfe at robertwolfe.org Website: http://robertwolfe.org BBS: telnet://robertwolfe.org ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From kyle.bader at gmail.com Wed Jun 16 21:04:58 2010 From: kyle.bader at gmail.com (Kyle Bader) Date: Wed, 16 Jun 2010 12:04:58 -0700 Subject: Looking for an alternative user interface with more advanced features In-Reply-To: <20100616140728.b93e9fb3.robert.wolfe@robertwolfe.org> References: <4C17DF38.9060109@op5.se> <20100616140728.b93e9fb3.robert.wolfe@robertwolfe.org> Message-ID: > What is this "Ninja" and where might one be able to obtain a copy of it? Their site explains it far better than I could: http://www.op5.org/community/projects/ninja -- Kyle ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From james at linux-source.org Thu Jun 17 08:13:33 2010 From: james at linux-source.org (James Corteciano) Date: Thu, 17 Jun 2010 14:13:33 +0800 Subject: Removing item Message-ID: Hi List, How to remove "Service Groups" item page on Nagios? I have no service group define for this. Thanks James -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From bangers at gmail.com Thu Jun 17 09:17:50 2010 From: bangers at gmail.com (Matthew Angelo) Date: Thu, 17 Jun 2010 17:17:50 +1000 Subject: Unregister a service definition Message-ID: Hi Nagios Users, I have an *extremely* modular configuration. My template structure is well defined and when it comes to monitoring a host, it's as simple as including it as a Member of a Hostgroup. An example Linux server would be: # cat ./linux/LINUXSERVER.cfg define host{ use PROD-SERVERS host_name LINUXSERVER.local.lan alias LINUXSERVER.local.lan address LINUXSERVER.local.lan hostgroups LINUX, HTTP, SMTP } And that's it. It auto inherits all service checks defined against LINUX, HTTP and SMTP. 95% of my LINUX servers have a "/apps" mount point. Therefore it makes sense to include a service check for it inside the LINUX host group and then inside the Host Definition for the remaining 5% 'unregister' that service check. Is it possible, inside LINUXSERVER2.cfg, to say "Yes I know I'm a member of hostgroup LINUX, but I *don't* want to inherit the service check "Disk /apps"? I tried: define service{ use GENERIC-SERVICE-TEMPLATE host_name LINUXSERVER2.local.lan service_description Disk /apps check_command null } But obviously this fails with an "Already defined" error. Appreciate any info. Thanks, Matthew. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From zarrelli at linux.it Thu Jun 17 11:19:45 2010 From: zarrelli at linux.it (Giorgio Zarrelli) Date: Thu, 17 Jun 2010 11:19:45 +0200 Subject: Unregister a service definition In-Reply-To: References: Message-ID: <458E8E64-8052-42EA-AA41-82A96ECA67DB@linux.it> Apply the service to the hostgroup AND in the definition of the service apply it ALSO to the hosts negating them (!) eg: hostgroup_name HTTP host_name !host-not-to-apply Ciao, Giorgio Il giorno 17/giu/2010, alle ore 09.17, Matthew Angelo ha scritto: > Hi Nagios Users, > > I have an *extremely* modular configuration. My template structure > is well defined and when it comes to monitoring a host, it's as > simple as including it as a Member of a Hostgroup. > > An example Linux server would be: > > > # cat ./linux/LINUXSERVER.cfg > define host{ > use PROD-SERVERS > host_name LINUXSERVER.local.lan > alias LINUXSERVER.local.lan > address LINUXSERVER.local.lan > hostgroups LINUX, HTTP, SMTP > } > > > And that's it. It auto inherits all service checks defined against > LINUX, HTTP and SMTP. > > 95% of my LINUX servers have a "/apps" mount point. Therefore it > makes sense to include a service check for it inside the LINUX host > group and then inside the Host Definition for the remaining 5% > 'unregister' that service check. > > Is it possible, inside LINUXSERVER2.cfg, to say "Yes I know I'm a > member of hostgroup LINUX, but I *don't* want to inherit the service > check "Disk /apps"? > > I tried: > > define service{ > use GENERIC-SERVICE-TEMPLATE > host_name LINUXSERVER2.local.lan > service_description Disk /apps > check_command null > } > > But obviously this fails with an "Already defined" error. > > > Appreciate any info. > > Thanks, > Matthew. > --- > --- > --- > --------------------------------------------------------------------- > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Thu Jun 17 11:50:08 2010 From: nagios at flatto.net (Assaf Flatto) Date: Thu, 17 Jun 2010 10:50:08 +0100 Subject: Removing item In-Reply-To: References: Message-ID: <4C19EFD0.7000509@flatto.net> James Corteciano wrote: > Hi List, > > How to remove "Service Groups" item page on Nagios? I have no service > group define for this. > > Thanks > > James in the share directory of your nagios install there is a file called side.html in it is the definition for the servicegroups view . if you comment it out - the link on the nagios gui will disappear -- Never,Ever Cut A Deal With a Dragon I am doing a Charity Bike ride On the 27 of June for the Capital to Coast Charity. Please help by Donating http://www.justgiving.com/Lovefilm-capital-to-coast ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Thu Jun 17 12:31:02 2010 From: ae at op5.se (Andreas Ericsson) Date: Thu, 17 Jun 2010 12:31:02 +0200 Subject: Unregister a service definition In-Reply-To: References: Message-ID: <4C19F966.9070104@op5.se> On 06/17/2010 09:17 AM, Matthew Angelo wrote: > Hi Nagios Users, > > I have an *extremely* modular configuration. My template structure is well > defined and when it comes to monitoring a host, it's as simple as including > it as a Member of a Hostgroup. > > An example Linux server would be: > > > # cat ./linux/LINUXSERVER.cfg > define host{ > use PROD-SERVERS > host_name LINUXSERVER.local.lan > alias LINUXSERVER.local.lan > address LINUXSERVER.local.lan > hostgroups LINUX, HTTP, SMTP > } > > > And that's it. It auto inherits all service checks defined against LINUX, > HTTP and SMTP. > > 95% of my LINUX servers have a "/apps" mount point. Therefore it makes > sense to include a service check for it inside the LINUX host group and then > inside the Host Definition for the remaining 5% 'unregister' that service > check. > > Is it possible, inside LINUXSERVER2.cfg, to say "Yes I know I'm a member of > hostgroup LINUX, but I *don't* want to inherit the service check "Disk > /apps"? > > I tried: > > define service{ > use GENERIC-SERVICE-TEMPLATE > host_name LINUXSERVER2.local.lan > service_description Disk /apps > check_command null > } > > But obviously this fails with an "Already defined" error. > This is patched in the 'dev' branch of git://git.op5.org/nagios.git which you will have to build from source. 'null' still has to be a valid check-command, but with the sources in that branch you should be able to override a service-check inherited from hostgroups. It's not very tested, but feel free to try it out. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace. ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From james at linux-source.org Thu Jun 17 12:35:57 2010 From: james at linux-source.org (James Corteciano) Date: Thu, 17 Jun 2010 18:35:57 +0800 Subject: Removing item In-Reply-To: <4C19EFD0.7000509@flatto.net> References: <4C19EFD0.7000509@flatto.net> Message-ID: Thanks Assaf On Thu, Jun 17, 2010 at 5:50 PM, Assaf Flatto wrote: > James Corteciano wrote: > > Hi List, > > > > How to remove "Service Groups" item page on Nagios? I have no service > > group define for this. > > > > Thanks > > > > James > in the share directory of your nagios install there is a file called > side.html > > in it is the definition for the servicegroups view . > > if you comment it out - the link on the nagios gui will disappear > > > > -- > Never,Ever Cut A Deal With a Dragon > > > I am doing a Charity Bike ride On the 27 of June for the > Capital to Coast Charity. Please help by Donating > http://www.justgiving.com/Lovefilm-capital-to-coast > > > > > ------------------------------------------------------------------------------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From vicjalan at gmail.com Thu Jun 17 16:34:41 2010 From: vicjalan at gmail.com (Victor Lanza) Date: Thu, 17 Jun 2010 10:34:41 -0400 Subject: File Monitoring Message-ID: <003c01cb0e2a$39552dd0$abff8970$@com> Hello, I'm trying to find the best way to monitor several user directories and files to see who accessed, modified, renamed, or deleted them. File access is also key because we want to know who is looking at files that they should not. I know that I can deny access to directories but I would have to create hundreds of shares and directories based on the scenarios that we have here. I'm not too interested in OS file monitoring, mostly user shared files and only on Windows. Has anyone done this with Nagios? Aside from watching the event log for audit traps I mean. Thanks, Victor -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gustavocamposaraujo at gmail.com Thu Jun 17 19:18:40 2010 From: gustavocamposaraujo at gmail.com (Gustavo Araujo) Date: Thu, 17 Jun 2010 14:18:40 -0300 Subject: Missing -l parameters. In-Reply-To: References: Message-ID: Hello everyone. I?m monitoring a lot of servers in my nagios. Today I?m trying to use the "check_nt" plugin. But when i try to use it i always get the error message. -- Gustavo Campos Araujo -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gustavocamposaraujo at gmail.com Thu Jun 17 19:34:09 2010 From: gustavocamposaraujo at gmail.com (Gustavo Araujo) Date: Thu, 17 Jun 2010 14:34:09 -0300 Subject: =?iso-8859-1?q?Missing_-l_parameters=2E_Detailed_i?= =?iso-8859-1?q?nformation_=28can=B4t_solve_the_problem=29_PLEASE_H?= =?iso-8859-1?q?ELP!!?= Message-ID: Hello everyone. I?m using ubuntu-server 10.4 with nagios 3.2 (from repository) Follow down my configurations. /etc/nagios-plugins/config/nt.cfg: define command { command_name check_nt command_line /usr/lib/nagios/plugins/check_nt -H '$HOSTADDRESS$' -v '$ARG1$' } define command { command_name check_nscp command_line /usr/lib/nagios/plugins/check_nt -H '$HOSTADDRESS$' -p 12489 -v '$ARG1$' } /etc/nagios3/conf.d/conf_nagios_afis.cfg define service{ host_name dpfmas01 service_description check_space use generic-service check_command check_nscp!USEDDISKSPACE! -l c -w 80 -c 90 } -- Gustavo Campos Araujo -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From smcafee at collaborativefusion.com Thu Jun 17 19:54:37 2010 From: smcafee at collaborativefusion.com (Sean McAfee) Date: Thu, 17 Jun 2010 13:54:37 -0400 Subject: =?iso-8859-1?q?Missing_-l_parameters=2E_Detailed_i?= =?iso-8859-1?q?nformation_=28can=B4t_solve_the_problem=29_PLEASE_HELP!!?= In-Reply-To: References: Message-ID: <4C1A615D.6000300@collaborativefusion.com> Gustavo Araujo wrote: > Hello everyone. > > I?m using ubuntu-server 10.4 with nagios 3.2 (from repository) > > Follow down my configurations. > > /etc/nagios-plugins/config/nt.cfg: > > define command { > command_name check_nt > command_line /usr/lib/nagios/plugins/check_nt -H > '$HOSTADDRESS$' -v '$ARG1$' > } > > define command { > command_name check_nscp > command_line /usr/lib/nagios/plugins/check_nt -H > '$HOSTADDRESS$' -p 12489 -v '$ARG1$' > } > > > /etc/nagios3/conf.d/conf_nagios_afis.cfg > > define service{ > host_name dpfmas01 > service_description check_space > use generic-service > check_command check_nscp!USEDDISKSPACE! -l c -w 80 -c 90 > } Command, not service, definitions contain the switches. Service definition's check_commands can only take !-delimited arguments, which are then used to populate the switches from the command definition. You can hardcode the drive, warning/critical levels, and port into the command definition, but I'd do the following: define command { command_name check_nt_disk command_line /usr/lib/nagios/plugins/check_nt -H '$HOSTADDRESS$' -p '$ARG1$' -v 'USEDDISKSPACE' -l '$ARG2$' -w '$ARG3$' -c '$ARG4$' } Which would mean your service definition should look like: define service{ host_name dpfmas01 service_description check_space use generic-service check_command check_nt_disk!12489!c!80!90 } Hope this helps, -- Sean McAfee Senior Systems Engineer ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jim at jimavery.me.uk Thu Jun 17 20:28:34 2010 From: jim at jimavery.me.uk (Jim Avery) Date: Thu, 17 Jun 2010 19:28:34 +0100 Subject: Missing -l parameters. In-Reply-To: References: Message-ID: On 17 June 2010 18:18, Gustavo Araujo wrote: > > Hello everyone. > > I?m monitoring a lot of servers in my nagios. > > Today I?m trying to use the "check_nt" plugin. > > But when i try to use it i always get the error message. What does the error message say? ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From kyle.bader at gmail.com Thu Jun 17 21:15:11 2010 From: kyle.bader at gmail.com (Kyle Bader) Date: Thu, 17 Jun 2010 12:15:11 -0700 Subject: File Monitoring In-Reply-To: <003c01cb0e2a$39552dd0$abff8970$@com> References: <003c01cb0e2a$39552dd0$abff8970$@com> Message-ID: Heyo, > I?m not too interested in OS file monitoring, mostly user shared files and > only on Windows. Has anyone done this with Nagios? Aside from watching the > event log for audit traps I mean. I haven't implemented it myself but stumbled upon this today and remembered your post: http://en.wikipedia.org/wiki/File_alteration_monitor Might be worth checking out, especially [1] if your doing it on windows: [1] http://msdn.microsoft.com/en-us/library/aa365261(VS.85).aspx -- Kyle ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From vicjalan at gmail.com Thu Jun 17 21:40:58 2010 From: vicjalan at gmail.com (Victor Lanza) Date: Thu, 17 Jun 2010 15:40:58 -0400 Subject: File Monitoring In-Reply-To: References: <003c01cb0e2a$39552dd0$abff8970$@com> Message-ID: On Thu, Jun 17, 2010 at 3:15 PM, Kyle Bader wrote: > Heyo, > > > I?m not too interested in OS file monitoring, mostly user shared files > and > > only on Windows. Has anyone done this with Nagios? Aside from watching > the > > event log for audit traps I mean. > > I haven't implemented it myself but stumbled upon this today and > remembered your post: > > http://en.wikipedia.org/wiki/File_alteration_monitor > > Might be worth checking out, especially [1] if your doing it on windows: > > [1] http://msdn.microsoft.com/en-us/library/aa365261(VS.85).aspx > > Thanks Kyle, I actually had not come across this solution however from a brief overview, it seems like it doesn't monitor file access which is what I also want to monitor similar to the windows Event Log. Products like tripwire, OSSEC, Samhain, etc only monitor file changes, deletes, renames, etc. and it seems like FAM falls into that category. I have looked into the FileSystemWatcher but it is not clear if it monitors file access as well. I wouldn't mind using that and tying nagios into it (via log_watch, or other means) Thanks, Victor > -- > > Kyle > > > ------------------------------------------------------------------------------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From andrew1.li at citi.com Fri Jun 18 02:42:55 2010 From: andrew1.li at citi.com (Andrew Li) Date: Fri, 18 Jun 2010 10:42:55 +1000 Subject: Escalate after X warnings or criticals In-Reply-To: <23193A17705DD246AFFFDF09B779F56F2519763670@EX-IAD6-B.ant.amazon.com> References: <4AF38E9F.7000305@createspace.com> <1276651887.17871.14.camel@localhost> <4C1840D8.2070803@5dninja.net> <23193A17705DD246AFFFDF09B779F56F2519763670@EX-IAD6-B.ant.amazon.com> Message-ID: <1276821774.17871.78.camel@localhost> On Thu, 2010-06-17 at 03:25, Gius, Mark wrote: > I had submitted a patch a while back that allows for distinguishing > between warning and critical. I don't think it's going to be included > in any 3.0.X releases, because it apparently breaks plugins that access > Nagios' state data directly. I don't know whether or not my patch will > be included in 3.2.x or higher releases. I had a look at 3.2.1, doesn't look like it's the patch is included :( > You can grab my patch (which I'm applying to vanilla 3.0.6 Nagios sources) > out of this thread. At the time I wrote it, it also patched cleanly > against HEAD, but I haven't kept up with it. > > http://article.gmane.org/gmane.network.nagios.devel/7083/ Thanks for the patch, it applies cleanly to 3.0.6 stable. I've read through it but have not yet tried using it. I think it's a good enhancement because it makes the escalation path more logical. Andrew ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From michael.friedrich at univie.ac.at Fri Jun 18 11:29:30 2010 From: michael.friedrich at univie.ac.at (Michael Friedrich) Date: Fri, 18 Jun 2010 11:29:30 +0200 Subject: Escalate after X warnings or criticals In-Reply-To: <1276821774.17871.78.camel@localhost> References: <4AF38E9F.7000305@createspace.com> <1276651887.17871.14.camel@localhost> <4C1840D8.2070803@5dninja.net> <23193A17705DD246AFFFDF09B779F56F2519763670@EX-IAD6-B.ant.amazon.com> <1276821774.17871.78.camel@localhost> Message-ID: <4C1B3C7A.3060501@univie.ac.at> Andrew Li wrote: > Thanks for the patch, it applies cleanly to 3.0.6 stable. I've read > through it but have not yet tried using it. > Hiren has been in contact with Mark on it the last months, in Icinga it works just fine now. > I think it's a good enhancement because it makes the escalation path > more logical. > Yep we too - that is why it has been slightly adapted and will be in Icinga 1.0.2 Kind regards, Michael -- DI (FH) Michael Friedrich michael.friedrich at univie.ac.at Tel: +43 1 4277 14359 Vienna University Computer Center Universitaetsstrasse 7 A-1010 Vienna, Austria ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gustavocamposaraujo at gmail.com Fri Jun 18 14:06:50 2010 From: gustavocamposaraujo at gmail.com (Gustavo Araujo) Date: Fri, 18 Jun 2010 09:06:50 -0300 Subject: =?iso-8859-1?q?Missing_-l_parameters=2E_Detailed_i?= =?iso-8859-1?q?nformation_=28can=B4t_solve_the_problem=29_PLEASE_H?= =?iso-8859-1?q?ELP!!?= In-Reply-To: <4C1A615D.6000300@collaborativefusion.com> References: <4C1A615D.6000300@collaborativefusion.com> Message-ID: Sean, Thanks a lot for your help. It is now working fine. Regards. 2010/6/17 Sean McAfee > Gustavo Araujo wrote: > > Hello everyone. > > > > I?m using ubuntu-server 10.4 with nagios 3.2 (from repository) > > > > Follow down my configurations. > > > > /etc/nagios-plugins/config/nt.cfg: > > > > define command { > > command_name check_nt > > command_line /usr/lib/nagios/plugins/check_nt -H > > '$HOSTADDRESS$' -v '$ARG1$' > > } > > > > define command { > > command_name check_nscp > > command_line /usr/lib/nagios/plugins/check_nt -H > > '$HOSTADDRESS$' -p 12489 -v '$ARG1$' > > } > > > > > > /etc/nagios3/conf.d/conf_nagios_afis.cfg > > > > define service{ > > host_name dpfmas01 > > service_description check_space > > use generic-service > > check_command check_nscp!USEDDISKSPACE! -l c -w 80 -c > 90 > > } > > Command, not service, definitions contain the switches. > > Service definition's check_commands can only take !-delimited arguments, > which are then used to populate the switches from the command definition. > > You can hardcode the drive, warning/critical levels, and port into the > command definition, but I'd do the following: > > define command { > command_name check_nt_disk > command_line /usr/lib/nagios/plugins/check_nt -H > '$HOSTADDRESS$' -p '$ARG1$' -v 'USEDDISKSPACE' -l '$ARG2$' -w '$ARG3$' > -c '$ARG4$' > } > > Which would mean your service definition should look like: > > define service{ > host_name dpfmas01 > service_description check_space > use generic-service > check_command check_nt_disk!12489!c!80!90 > } > > Hope this helps, > > -- > Sean McAfee > Senior Systems Engineer > > > ------------------------------------------------------------------------------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Gustavo Campos Araujo -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From robert.wolfe at robertwolfe.org Wed Jun 16 21:39:35 2010 From: robert.wolfe at robertwolfe.org (Robert Wolfe) Date: Wed, 16 Jun 2010 15:39:35 -0400 Subject: Looking for an alternative user interface with more advanced features References: <4C17DF38.9060109@op5.se><20100616140728.b93e9fb3.robert.wolfe@robertwolfe.org> Message-ID: <50BE9C7135A64147819E42376C136B262901@dc1.wolfe.local> Okay, I was looking for this, however, I was not able to find it. Either that or I just didn't look hard enough :) ________________________________ From: Kyle Bader [mailto:kyle.bader at gmail.com] Sent: Wed 6/16/2010 3:04 PM To: Nagios Users List Subject: Re: [Nagios-users] Looking for an alternative user interface with more advanced features > What is this "Ninja" and where might one be able to obtain a copy of it? Their site explains it far better than I could: http://www.op5.org/community/projects/ninja -- Kyle ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From raanders at cyber-office.net Fri Jun 18 23:02:12 2010 From: raanders at cyber-office.net (Roderick A. Anderson) Date: Fri, 18 Jun 2010 14:02:12 -0700 Subject: bandwidth monitoring and presentation? Message-ID: <4C1BDED4.8040501@cyber-office.net> Similar to what Solarwinds does? I'm back. Last August I asked about monitoring Tranzeo wireless radios and SNMP. Did some more searching and found my own posts :-) and a few more but nothing really substantial. More research and a bit of clarification of needs/desires leads me to think I need monitor bandwidth utilization and graph it. I found the check_bandwidth plug-in. Haven't tried it yet so was hoping some one on the list could offer some insight and as to what it will provide Nagios-wise. TIA, Rod -- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From dirk.schulz at kinzesberg.de Sat Jun 19 15:35:56 2010 From: dirk.schulz at kinzesberg.de (Dirk H. Schulz) Date: Sat, 19 Jun 2010 15:35:56 +0200 Subject: check_disk and Volumes > 10 TB In-Reply-To: References: <4C130E00.6030702@kinzesberg.de> Message-ID: <4C1CC7BC.8050905@kinzesberg.de> Am 12.06.10 13:16, schrieb Matt Simmons: > I've seen people run into problems like this when they're checking a > machine that's 64 bit, and the monitoring host is 32. That's not, by > chance, the case now, is it? > Well, the problem also arises if check_disk is run locally on the monitored host. Could it be that it is compiled for 32Bit and has to compiled for 64Bit? Dirk > --Matt > > On Sat, Jun 12, 2010 at 12:33 AM, Dirk H. Schulz > wrote: > >> Hi folks, >> >> I have run into a problem with check_disk. I have a volume with 14 TB >> that is 91 % free: >> >>> /dev/disk0s3 14Ti 1.2Ti 13Ti 9% /Volumes/EonStor1 >>> >> check_disk states it is 0 % free: >> >>> check_disk -u GB /Volumes/EonStor1 >>> DISK OK - free space: /Volumes/EonStor1 0 GB (0% inode=91%);| >>> /Volumes/EonStor1=1276GB;;;0;14665 >>> >> Is there a known limitation concerning the size of the volumes? With a >> volume< 2 TB I do not have this problem on the same machine. >> >> Is there something I can do to get around this? >> >> Any hint or help is appreciated. >> >> Dirk >> >> ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jim at jimavery.me.uk Sat Jun 19 17:17:28 2010 From: jim at jimavery.me.uk (Jim Avery) Date: Sat, 19 Jun 2010 16:17:28 +0100 Subject: bandwidth monitoring and presentation? In-Reply-To: <4C1BDED4.8040501@cyber-office.net> References: <4C1BDED4.8040501@cyber-office.net> Message-ID: On 18 June 2010 22:02, Roderick A. Anderson wrote: > Similar to what Solarwinds does? > > I'm back. ?Last August I asked about monitoring Tranzeo wireless radios > and SNMP. > > Did some more searching and found my own posts :-) and a few more but > nothing really substantial. > > More research and a bit of clarification of needs/desires leads me to > think I need monitor bandwidth utilization and graph it. ?I found the > check_bandwidth plug-in. ?Haven't tried it yet so was hoping some one on > the list could offer some insight and as to what it will provide > Nagios-wise. I'm not familiar with Tranzeo. I guess what you're trying to do might be similar to the plugins for Motorola/Symbol WS8100 wireless switches. You might be able to adapt those. http://www.monitoringexchange.org/inventory/Check-Plugins/Software/SNMP/Assorted-Nagios-Plugins ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From daniel-listas at gmx.net Mon Jun 21 04:56:42 2010 From: daniel-listas at gmx.net (Daniel Bareiro) Date: Sun, 20 Jun 2010 23:56:42 -0300 Subject: Using PNP4Nagios Message-ID: <20100621025642.GC20558@defiant.freesoftware> Hi, all! I'm testing PNP4Nagios, but for some reason I'm just looking graphics for services in the Nagios server. For the rest of the remote hosts, when I click on the some red star, I get something like this: Initalising (OK) Using /usr/local/nagios/share/perfdata/ (OK) RRDTool /usr/bin/rrdtool found. (OK) RRDTool /usr/bin/rrdtool is executable (OK) PHP Function proc_open is enabled (OK) PHP Function fpassthru is enabled (OK) PHP Function xml_parser_create is enabled (OK) PHP zlib Support found. (OK) PHP GD Support found. (OK) RRD Base Directory /usr/local/nagios/share/perfdata/ found. (OK) Hostname Router is set. (!) Directory /usr/local/nagios/share/perfdata/alderamin not found. Where Alderamin is the hostname. In the service status interface, the Nagios server shows no red stars for the host or for services. However I can access their information through the following URL: http://ws1/nagios/pnp/index.php These are the lines I've in /usr/local/nagios/etc/nagios.cfg for pnp4nagios: broker_module=/usr/local/nagios/bin/npcdmod.o process_performance_data=1 enable_environment_macros=1 host_perfdata_command=process-host-perfdata service_perfdata_command=process-service-perfdata In /usr/local/nagios/etc/objects/mynet.cfg I added "srv-pnp" to the service definitions in the directive "use" and I added "host-pnp" to the directive "use" in the hosts definitions. Also, I'm using "process_perf_data 0" in both hosts and services definitions. In /usr/local/nagios/etc/objects/templates.cfg I added the following definitions: # DGB - 20100620 define host { name host-pnp register 0 action_url /nagios/pnp/index.php?host=$HOSTNAME$ } define service { name srv-pnp register 0 action_url /nagios/pnp/index.php?host=$HOSTNAME$&srv=$SERVICEDESC$ } And in /usr/local/nagios/etc/object/commands.cfg I replace the existing definitions of process-host-perfdata and process-service-perfdata for the following: define command { command_name process-service-perfdata command_line /usr/bin/perl /usr/local/nagios/libexec/process_perfdata.pl } define command { command_name process-host-perfdata command_line /usr/bin/perl /usr/local/nagios/libexec/process_perfdata.pl -d HOSTPERFDATA } Am I doing something wrong or I am forgetting some step? Thanks in advance for your replies. Regards, Daniel -- Fingerprint: BFB3 08D6 B4D1 31B2 72B9 29CE 6696 BF1B 14E6 1D37 Powered by Debian GNU/Linux Lenny - Linux user #188.598 -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 197 bytes Desc: Digital signature URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jim at jimavery.me.uk Mon Jun 21 13:32:00 2010 From: jim at jimavery.me.uk (Jim Avery) Date: Mon, 21 Jun 2010 12:32:00 +0100 Subject: Using PNP4Nagios In-Reply-To: <20100621025642.GC20558@defiant.freesoftware> References: <20100621025642.GC20558@defiant.freesoftware> Message-ID: On 21 June 2010 03:56, Daniel Bareiro wrote: > Hi, all! > > I'm testing PNP4Nagios, but for some reason I'm just looking graphics > for services in the Nagios server. For the rest of the remote hosts, > when I click on the some red star, I get something like this: > > Initalising > (OK) Using /usr/local/nagios/share/perfdata/ > (OK) RRDTool /usr/bin/rrdtool found. > (OK) RRDTool /usr/bin/rrdtool is executable > (OK) PHP Function proc_open is enabled > (OK) PHP Function fpassthru is enabled > (OK) PHP Function xml_parser_create is enabled > (OK) PHP zlib Support found. > (OK) PHP GD Support found. > (OK) RRD Base Directory /usr/local/nagios/share/perfdata/ found. > (OK) Hostname Router is set. > (!) ?Directory /usr/local/nagios/share/perfdata/alderamin not found. > > Where Alderamin is the hostname. In the service status interface, the > Nagios server shows no red stars for the host or for services. However I > can access their information through the following URL: > > http://ws1/nagios/pnp/index.php > > > These are the lines I've in /usr/local/nagios/etc/nagios.cfg for > pnp4nagios: > > > broker_module=/usr/local/nagios/bin/npcdmod.o > > process_performance_data=1 > enable_environment_macros=1 > > host_perfdata_command=process-host-perfdata > service_perfdata_command=process-service-perfdata > > > In /usr/local/nagios/etc/objects/mynet.cfg I added "srv-pnp" to the > service definitions in the directive "use" and I added "host-pnp" to the > directive "use" in the hosts definitions. Also, I'm using > "process_perf_data 0" in both hosts and services definitions. The srv-pnp and host-pnp templates will set the process_perf_data directive like so: process_perf_data 1 If you override this by setting process_perf_data to 0 in the host or service definition, this will stop PNP from generating the graph! I recommend you remove the line which says "process_perf_data 0" from those service and host definitions in "mynet.cfg" which use the srv-pnp and host-pnp template. I hope that helps, Jim ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ryan.c.ash.lu4w at statefarm.com Mon Jun 21 15:20:49 2010 From: ryan.c.ash.lu4w at statefarm.com (Ryan C Ash) Date: Mon, 21 Jun 2010 06:20:49 -0700 Subject: New active checks stuck in PENDING Message-ID: Cliff notes: ALL active checks are not executing MY ENV: nagios 3.2.0, RHEL 5.4 I have an earlier post where I discussed this problem in detail. http://forums.meulie.net/viewtopic.php?f=59&t=6062&hilit=pending The services show up in the GUI but they are in a permanent PENDING. if I reload/restart the GUI indicates an updated scheduled check time but it never appears to occur, i.e I see nothing in nagios.log or nagios.debug. Here is my service definition: define service { service_description NG-ETL-LINUX-PERF_swap_used display_name Linux swap used servicegroups NG-ETL-LINUX-PERF_service_group hostgroup_name NG-ETL-LINUX-PERF check_command check_nrpe!check_swap passive_checks_enabled 0 active_checks_enabled 1 check_interval 5 normal_check_interval 5 max_check_attempts 1 check_period 24x7 ; The service can be checked at any time of the day } As you can see it doesn't use a template and it has only the bare minimum because I am trying to figure out what is wrong. Nagios.cfg entries: log_file=/opt/nagios/var/nagios.log cfg_dir=/opt/nagios/etc/objects/hosts cfg_dir=/opt/nagios/etc/objects/rel_hosts cfg_dir=/opt/nagios/etc/objects/services cfg_dir=/opt/nagios/etc/objects/host_groups cfg_dir=/opt/nagios/etc/objects/global object_cache_file=/opt/nagios/var/objects.cache precached_object_file=/opt/nagios/var/objects.precache resource_file=/opt/nagios/etc/resource.cfg status_file=/opt/nagios/var/status.dat status_update_interval=10 nagios_user=ccadmin nagios_group=ccadmin check_external_commands=1 command_check_interval=-1 command_file=/opt/nagios/var/rw/nagios.cmd external_command_buffer_slots=4096 lock_file=/opt/nagios/var/nagios.lock temp_file=/opt/nagios/var/nagios.tmp temp_path=/tmp event_broker_options=-1 broker_module=/opt/nagios/bin/ndomod-3x.o config_file=/opt/nagios/etc/ndomod.cfg broker_module=/usr/local/pnp4nagios/bin/npcdmod.o config_file=/usr/local/pnp4nagios/etc/npcd.cfg log_rotation_method=d log_archive_path=/opt/nagios/var/archives use_syslog=0 log_notifications=1 log_service_retries=1 log_host_retries=1 log_event_handlers=1 log_initial_states=0 log_external_commands=1 log_passive_checks=1 global_host_event_handler=troubleticket_host global_service_event_handler=troubleticket_service service_inter_check_delay_method=s max_service_check_spread=10 #ASH was 30 service_interleave_factor=s host_inter_check_delay_method=s max_host_check_spread=30 max_concurrent_checks=0 check_result_reaper_frequency=10 max_check_result_reaper_time=30 check_result_path=/opt/nagios/var/spool/checkresults max_check_result_file_age=3600 cached_host_check_horizon=15 cached_service_check_horizon=15 enable_predictive_host_dependency_checks=1 enable_predictive_service_dependency_checks=1 soft_state_dependencies=0 auto_reschedule_checks=0 auto_rescheduling_interval=30 auto_rescheduling_window=180 sleep_time=0.25 service_check_timeout=30 #ASH was 60 host_check_timeout=30 #ASH was 30 event_handler_timeout=30 notification_timeout=30 ocsp_timeout=5 perfdata_timeout=5 retain_state_information=1 state_retention_file=/opt/nagios/var/retention.dat retention_update_interval=60 use_retained_program_state=1 use_retained_scheduling_info=1 retained_host_attribute_mask=0 retained_service_attribute_mask=0 retained_process_host_attribute_mask=0 retained_process_service_attribute_mask=0 retained_contact_host_attribute_mask=0 retained_contact_service_attribute_mask=0 interval_length=60 check_for_updates=1 bare_update_check=0 use_aggressive_host_checking=0 execute_service_checks=1 accept_passive_service_checks=1 execute_host_checks=1 accept_passive_host_checks=1 enable_notifications=0 enable_event_handlers=1 process_performance_data=1 service_perfdata_file=/usr/local/pnp4nagios/var/service-perfdata service_perfdata_file_template=DATATYPE::SERVICEPERFDATA\tTIMET::$TIMET$ \tHOSTNAME::$HOSTNAME$\tSERVICEDESC::$SERVICEDESC$\tSERVICEPERFDATA::$SE RVICEPERFDATA$\tSERVICECHECKCOMMAND::$SERVICECHECKCOMMAND$\tHOSTSTATE::$ HOSTSTATE$\tHOSTSTATETYPE::$HOSTSTATETYPE$\tSERVICESTATE::$SERVICESTATE$ \tSERVICESTATETYPE::$SERVICESTATETYPE$ service_perfdata_file_mode=a service_perfdata_file_processing_interval=15 service_perfdata_file_processing_command=process-service-perfdata-file host_perfdata_file=/usr/local/pnp4nagios/var/host-perfdata host_perfdata_file_template=DATATYPE::HOSTPERFDATA\tTIMET::$TIMET$\tHOST NAME::$HOSTNAME$\tHOSTPERFDATA::$HOSTPERFDATA$\tHOSTCHECKCOMMAND::$HOSTC HECKCOMMAND$\tHOSTSTATE::$HOSTSTATE$\tHOSTSTATETYPE::$HOSTSTATETYPE$ host_perfdata_file_mode=a host_perfdata_file_processing_interval=15 host_perfdata_file_processing_command=process-host-perfdata-file obsess_over_services=1 ocsp_command=submit_check_result obsess_over_hosts=1 ochp_command=submit_host_check translate_passive_host_checks=0 passive_host_checks_are_soft=0 check_for_orphaned_services=1 check_for_orphaned_hosts=1 check_service_freshness=1 check_host_freshness=0 host_freshness_check_interval=60 additional_freshness_latency=15 enable_flap_detection=1 low_service_flap_threshold=5.0 high_service_flap_threshold=20.0 low_host_flap_threshold=5.0 high_host_flap_threshold=20.0 date_format=us p1_file=/opt/nagios/bin/p1.pl enable_embedded_perl=1 use_embedded_perl_implicitly=1 illegal_object_name_chars=`~!$%^&*|'"<>?,()= illegal_macro_output_chars=`~$&|'"<> use_regexp_matching=0 use_true_regexp_matching=0 admin_email=xxxx admin_pager=pageccadmin at localhost daemon_dumps_core=0 use_large_installation_tweaks=1 enable_environment_macros=1 debug_level=144 debug_verbosity=2 debug_file=/opt/nagios/var/nagios.debug max_debug_file_size=100000000 I really need help trying to determine why the active checks are not being executed. __________________________________ Ryan Ash State Farm Insurance Infrastructure Automation -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Nigel.Heaney at bgfl.org Mon Jun 21 15:55:20 2010 From: Nigel.Heaney at bgfl.org (Nigel Heaney) Date: Mon, 21 Jun 2010 14:55:20 +0100 Subject: automated response Message-ID: <11006211455.AA49542@bgfl.org> I am unavailable until 8th July. I will respond to email upon my return. ************************************************************* This email and any files transmitted with it are confidential and intended solely for the use of the individual or entity to whom they are addressed. If you have received this email in error please notify postmaster at bgfl.org The views expressed within this email are those of the individual, and not necessarily those of the organisation ************************************************************* ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From raanders at cyber-office.net Mon Jun 21 16:50:29 2010 From: raanders at cyber-office.net (Roderick A. Anderson) Date: Mon, 21 Jun 2010 07:50:29 -0700 Subject: bandwidth monitoring and presentation? In-Reply-To: References: <4C1BDED4.8040501@cyber-office.net> Message-ID: <4C1F7C35.8050605@cyber-office.net> On 06/19/2010 08:17 AM, Jim Avery wrote: > On 18 June 2010 22:02, Roderick A. Anderson wrote: >> Similar to what Solarwinds does? >> >> I'm back. Last August I asked about monitoring Tranzeo wireless radios >> and SNMP. >> >> Did some more searching and found my own posts :-) and a few more but >> nothing really substantial. >> >> More research and a bit of clarification of needs/desires leads me to >> think I need monitor bandwidth utilization and graph it. I found the >> check_bandwidth plug-in. Haven't tried it yet so was hoping some one on >> the list could offer some insight and as to what it will provide >> Nagios-wise. > > > I'm not familiar with Tranzeo. I guess what you're trying to do might > be similar to the plugins for Motorola/Symbol WS8100 wireless > switches. You might be able to adapt those. > > http://www.monitoringexchange.org/inventory/Check-Plugins/Software/SNMP/Assorted-Nagios-Plugins Thanks Jim. Turns out I was misinformed about what they were trying to monitor but I will keep this in mind for further reference. Rod -- > > ------------------------------------------------------------------------------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Naveen.R at philips.com Mon Jun 21 17:11:23 2010 From: Naveen.R at philips.com (R, Naveen) Date: Mon, 21 Jun 2010 17:11:23 +0200 Subject: Not receiving notification mails In-Reply-To: <4C1F7C35.8050605@cyber-office.net> References: <4C1BDED4.8040501@cyber-office.net> <4C1F7C35.8050605@cyber-office.net> Message-ID: <8DF553C0530569419CAAFA4E3331C32BEB13111B33@NLCLUEXM09.connect1.local> Dear All, I have configured Nagios xi in my machine, that is on windows xp, making use of VM Player Virtual Machine. I have configure it to track some websites and it is tracking correctly, but as last I'm not receiving any mails of this notification. I tried to send the test mail, and I'm receiving it correctly, but when the service is down/up or might be any, I'm not receiving any notification mails for that. So, please can you guide me how I go further in this. And also when I click on Apply configuration, it happens correctly and when I come to any apply configuration page it says "Warning: configuration file is out of date!" in red. Also could you tell me the difference between Nagios Xi and Nagios 3.0.6 version, because people who have installed 3.0.6 version, its working correctly for them, but not for me. :-( So please guide me in resolving this issues. Regards, Naveen The information contained in this message may be confidential and legally protected under applicable law. The message is intended solely for the addressee(s). If you are not the intended recipient, you are hereby notified that any use, forwarding, dissemination, or reproduction of this message is strictly prohibited and may be unlawful. If you are not the intended recipient, please contact the sender by return e-mail and destroy all copies of the original message. ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From shadhin71 at gmail.com Mon Jun 21 17:57:42 2010 From: shadhin71 at gmail.com (shadih rahman) Date: Mon, 21 Jun 2010 11:57:42 -0400 Subject: Not receiving notification mails In-Reply-To: <8DF553C0530569419CAAFA4E3331C32BEB13111B33@NLCLUEXM09.connect1.local> References: <4C1BDED4.8040501@cyber-office.net> <4C1F7C35.8050605@cyber-office.net> <8DF553C0530569419CAAFA4E3331C32BEB13111B33@NLCLUEXM09.connect1.local> Message-ID: There are three things you need to check 1) if Nagios server notification is enabled? 2) if service or host level notification is enabled? 3) if your contact definition has proper notification attribute? On Mon, Jun 21, 2010 at 11:11 AM, R, Naveen wrote: > Dear All, > I have configured Nagios xi in my machine, that is on windows xp, making > use of VM Player Virtual Machine. I have configure it to track some websites > and it is tracking correctly, but as last I'm not receiving any mails of > this notification. I tried to send the test mail, and I'm receiving it > correctly, but when the service is down/up or might be any, I'm not > receiving any notification mails for that. So, please can you guide me how I > go further in this. > > And also when I click on Apply configuration, it happens correctly and when > I come to any apply configuration page it says "Warning: configuration file > is out of date!" in red. > > Also could you tell me the difference between Nagios Xi and Nagios 3.0.6 > version, because people who have installed 3.0.6 version, its working > correctly for them, but not for me. :-( > > So please guide me in resolving this issues. > > Regards, > Naveen > > The information contained in this message may be confidential and legally > protected under applicable law. The message is intended solely for the > addressee(s). If you are not the intended recipient, you are hereby notified > that any use, forwarding, dissemination, or reproduction of this message is > strictly prohibited and may be unlawful. If you are not the intended > recipient, please contact the sender by return e-mail and destroy all copies > of the original message. > > > ------------------------------------------------------------------------------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Cordially, Shadhin Rahman -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mlitwin at stubhub.com Mon Jun 21 18:52:10 2010 From: mlitwin at stubhub.com (Litwin, Matthew) Date: Mon, 21 Jun 2010 10:52:10 -0600 Subject: Define dependencie in service defs? Message-ID: <4F5CCB68-C556-47C3-8052-AF86F39C3A3A@stubhub.com> Is there a way to define what service a service depends on within te service definition itself for cases where the dependency heirarchy is localized to the same host? Dependency definitions are terrific in that they are so flexible, but it would be nice to contain that functionailty within a single service definition in an effort to simplify configuration. ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Mon Jun 21 19:03:07 2010 From: nagios at flatto.net (Assaf Flatto) Date: Mon, 21 Jun 2010 18:03:07 +0100 Subject: Define dependencie in service defs? In-Reply-To: <4F5CCB68-C556-47C3-8052-AF86F39C3A3A@stubhub.com> References: <4F5CCB68-C556-47C3-8052-AF86F39C3A3A@stubhub.com> Message-ID: <4C1F9B4B.8090101@flatto.net> Litwin, Matthew wrote: > Is there a way to define what service a service depends on within te > service definition itself for cases where the dependency heirarchy is > localized to the same host? Dependency definitions are terrific in > that they are so flexible, but it would be nice to contain that > functionailty within a single service definition in an effort to > simplify configuration. > At the moment you must define it in a separate definition , but it does not have to be in a separate file . just put the service dependency definition below/above the service declaration and thus you have a more "easy to maintain" way to know what you set up and how you need it . You might want to suggest it as a feature for future development to be somewhat similar to the host parents definition , but i can see several issues with such an "in service" declaration . Assaf -- Never,Ever Cut A Deal With a Dragon I am doing a Charity Bike ride On the 27 of June for the Capital to Coast Charity. Please help by Donating http://www.justgiving.com/Lovefilm-capital-to-coast ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From daniel-listas at gmx.net Mon Jun 21 19:19:40 2010 From: daniel-listas at gmx.net (Daniel Bareiro) Date: Mon, 21 Jun 2010 14:19:40 -0300 Subject: Using PNP4Nagios In-Reply-To: References: <20100621025642.GC20558@defiant.freesoftware> Message-ID: <20100621171940.GD20558@defiant.freesoftware> Hi, Jim. On Monday, 21 June 2010 12:32:00 +0100, Jim Avery wrote: > > I'm testing PNP4Nagios, but for some reason I'm just looking > > graphics for services in the Nagios server. For the rest of the > > remote hosts, when I click on the some red star, I get something > > like this: > > > > Initalising > > (OK) Using /usr/local/nagios/share/perfdata/ > > (OK) RRDTool /usr/bin/rrdtool found. > > (OK) RRDTool /usr/bin/rrdtool is executable > > (OK) PHP Function proc_open is enabled > > (OK) PHP Function fpassthru is enabled > > (OK) PHP Function xml_parser_create is enabled > > (OK) PHP zlib Support found. > > (OK) PHP GD Support found. > > (OK) RRD Base Directory /usr/local/nagios/share/perfdata/ found. > > (OK) Hostname Router is set. > > (!) ?Directory /usr/local/nagios/share/perfdata/alderamin not found. > > > > Where Alderamin is the hostname. In the service status interface, > > the Nagios server shows no red stars for the host or for services. > > However I can access their information through the following URL: > > > > http://ws1/nagios/pnp/index.php > > > > > > These are the lines I've in /usr/local/nagios/etc/nagios.cfg for > > pnp4nagios: > > > > > > broker_module=/usr/local/nagios/bin/npcdmod.o > > > > process_performance_data=1 > > enable_environment_macros=1 > > > > host_perfdata_command=process-host-perfdata > > service_perfdata_command=process-service-perfdata > > > > > > In /usr/local/nagios/etc/objects/mynet.cfg I added "srv-pnp" to the > > service definitions in the directive "use" and I added "host-pnp" to > > the directive "use" in the hosts definitions. Also, I'm using > > "process_perf_data 0" in both hosts and services definitions. > The srv-pnp and host-pnp templates will set the process_perf_data > directive like so: > > process_perf_data 1 > > If you override this by setting process_perf_data to 0 in the host or > service definition, this will stop PNP from generating the graph! > > I recommend you remove the line which says "process_perf_data 0" from > those service and host definitions in "mynet.cfg" which use the > srv-pnp and host-pnp template. Good. This made the difference. Now I can see the graphics on all remote hosts but I can not see them on the host on which Nagios is installed. But I found why this is. I was missing add "host-pnp" and "srv-pnp" in the directive "use" in /usr/local/nagios/etc/objects/localhost.cfg file. There are services that are failing to be plotted but I believe that it is because they are not providing information for perfdata, as in the case of check_md_raid. In the "service state information" table for the check_md_raid service, the "performance data" is empty. But I don't believe that it has much sense, for this particular case, to make some type of graph. Perhaps for this case it may be desirable to set process_perf_data to 0. check_ssh and check_local_procs don't give information for perfdata either. On the other hand, I've a router whose host_name is "Router#1". It appears that the "#" was causing a conflict that would prevent the creation of the directory to keep track of charting. After removing this character, the problem was solved. For this router I'm checking three services: Enet0, PING, uptime. Of these three services, only PING (check_ping) is providing perfdata information. Uptime is obtained using check_snmp and in this case it would not have sense to plotting these values. But the values of the Enet0 interface (check_snmp_int [1]) can be useful. Thanks for your reply. Regards, Daniel [1] http://nagios.manubulon.com/snmp_int.html -- Fingerprint: BFB3 08D6 B4D1 31B2 72B9 29CE 6696 BF1B 14E6 1D37 Powered by Debian GNU/Linux Lenny - Linux user #188.598 -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 197 bytes Desc: Digital signature URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mlitwin at stubhub.com Tue Jun 22 05:18:01 2010 From: mlitwin at stubhub.com (Litwin, Matthew) Date: Mon, 21 Jun 2010 21:18:01 -0600 Subject: Define dependencie in service defs? In-Reply-To: <4C1F9B4B.8090101@flatto.net> References: <4F5CCB68-C556-47C3-8052-AF86F39C3A3A@stubhub.com> <4C1F9B4B.8090101@flatto.net> Message-ID: On Jun 21, 2010, at 10:03 AM, Assaf Flatto wrote: > Litwin, Matthew wrote: >> Is there a way to define what service a service depends on within te >> service definition itself for cases where the dependency heirarchy is >> localized to the same host? Dependency definitions are terrific in >> that they are so flexible, but it would be nice to contain that >> functionailty within a single service definition in an effort to >> simplify configuration. >> > At the moment you must define it in a separate definition , but it does > not have to be in a separate file . > just put the service dependency definition below/above the service > declaration and thus you have a more "easy to maintain" way to know what > you set up and how you need it . Thanks, that is actually a good and simple solution. I suppose if I employ the regular expression matching tool for configuration that would make it even more automated. Have you worked with that. It seems pretty new. ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jtillotson at techtarget.com Tue Jun 22 11:06:35 2010 From: jtillotson at techtarget.com (Tillotson, Jeff) Date: Tue, 22 Jun 2010 05:06:35 -0400 Subject: Service Escalation Timing Issue Message-ID: I've got a service that I've set up with the following requirements. E-mail a certain group after service has been down for 5 minutes. page when service has been down for 10 minutes. Then, page again after 30 minutes. I'm fairly certain my problem is with notification_interval in the service_escalation and that I'm misunderstanding this from the documentation: "When defining notification escalations, it is important to keep in mind that any contact groups that were members of "lower" escalations (i.e. those with lower notification number ranges) should also be included in "higher" escalation definitions. This should be done to ensure that anyone who gets notified of a problem continues to get notified as the problem is escalated." Following are the configuration options (I've snipped some options down): Nagios.cfg: interval_length=1 (One second) Template: define service{ name distrib-nevent-graph check_period 24x7 max_check_attempts 2 contact_groups no-one notification_options w,u,c,r notification_interval 60 notification_period 24x7 register 0 } Service: define service{ use distrib-nevent-graph hostgroup_name location-v7apache service_description v7apache-check } Service Escalation: define serviceescalation { hostgroup_name location-v7apache service_description v7apache-check first_notification 5 last_notification 0 notification_interval 1800 contact_groups nopage, core } define serviceescalation { hostgroup_name location-v7apache service_description v7apache-check first_notification 10 last_notification 0 notification_interval 1800 contact_groups page, nopage, core } ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Tue Jun 22 11:44:53 2010 From: nagios at flatto.net (Assaf Flatto) Date: Tue, 22 Jun 2010 10:44:53 +0100 Subject: Define dependencie in service defs? In-Reply-To: References: <4F5CCB68-C556-47C3-8052-AF86F39C3A3A@stubhub.com> <4C1F9B4B.8090101@flatto.net> Message-ID: <4C208615.5020305@flatto.net> Litwin, Matthew wrote: > On Jun 21, 2010, at 10:03 AM, Assaf Flatto wrote: > > >> Litwin, Matthew wrote: >> >>> Is there a way to define what service a service depends on within te >>> service definition itself for cases where the dependency heirarchy is >>> localized to the same host? Dependency definitions are terrific in >>> that they are so flexible, but it would be nice to contain that >>> functionailty within a single service definition in an effort to >>> simplify configuration. >>> >>> >> At the moment you must define it in a separate definition , but it does >> not have to be in a separate file . >> just put the service dependency definition below/above the service >> declaration and thus you have a more "easy to maintain" way to know what >> you set up and how you need it . >> > > Thanks, that is actually a good and simple solution. I suppose if I employ the regular expression matching tool for configuration that would make it even more automated. Have you worked with that. It seems pretty new. > I have not worked with the regex tool , But i must say i am pretty "old Fashioned " in the way i like my Nagios configuration - extremely readable to the novice and following the KISS policy. Let us know how it worked for you . -- Never,Ever Cut A Deal With a Dragon I am doing a Charity Bike ride On the 27 of June for the Capital to Coast Charity. Please help by Donating http://www.justgiving.com/Lovefilm-capital-to-coast ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Tue Jun 22 11:53:45 2010 From: nagios at flatto.net (Assaf Flatto) Date: Tue, 22 Jun 2010 10:53:45 +0100 Subject: Service Escalation Timing Issue In-Reply-To: References: Message-ID: <4C208829.4090300@flatto.net> Tillotson, Jeff wrote: > I've got a service that I've set up with the following requirements. E-mail a certain group after service has been down for 5 minutes. page when service has been down for 10 minutes. Then, page again after 30 minutes. I'm fairly certain my problem is with notification_interval in the service_escalation and that I'm misunderstanding this from the documentation: > "When defining notification escalations, it is important to keep in mind that any contact groups that were members of "lower" escalations (i.e. those with lower notification number ranges) should also be included in "higher" escalation definitions. This should be done to ensure that anyone who gets notified of a problem continues to get notified as the problem is escalated." > > > Following are the configuration options (I've snipped some options down): > > Nagios.cfg: > interval_length=1 (One second) > > Template: > > define service{ > name distrib-nevent-graph > check_period 24x7 > max_check_attempts 2 > contact_groups no-one > notification_options w,u,c,r > notification_interval 60 > notification_period 24x7 > register 0 > } > > Service: > define service{ > use distrib-nevent-graph > hostgroup_name location-v7apache > service_description v7apache-check > } > > Service Escalation: > define serviceescalation { > hostgroup_name location-v7apache > service_description v7apache-check > first_notification 5 > last_notification 0 > notification_interval 1800 > contact_groups nopage, core > } > define serviceescalation { > hostgroup_name location-v7apache > service_description v7apache-check > first_notification 10 > last_notification 0 > notification_interval 1800 > contact_groups page, nopage, core > } > > > If i am reading this right , you have your first notification sent after 2.5 hours . 1800sec = 30 minutes x 5 ( first notification) = 2.5 hours. you might want to change the interval to 300 . *first_notification*: This directive is a number that identifies the /first/ notification for which this escalation is effective. For instance, if you set this value to 3, this escalation will only be used if the service is in a non-OK state long enough for a third notification to go out. *notification_interval*: This directive is used to determine the interval at which notifications should be made while this escalation is valid. If you specify a value of 0 for the interval, Nagios will send the first notification when this escalation definition is valid, but will then prevent any more problem notifications from being sent out for the host. Notifications are sent out again until the host recovers. This is useful if you want to stop having notifications sent out after a certain amount of time. Note: If multiple escalation entries for a host overlap for one or more notification ranges, the smallest notification interval from all escalation entries is used. -- Never,Ever Cut A Deal With a Dragon I am doing a Charity Bike ride On the 27 of June for the Capital to Coast Charity. Please help by Donating http://www.justgiving.com/Lovefilm-capital-to-coast ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sandman42 at libero.it Tue Jun 22 12:36:56 2010 From: sandman42 at libero.it (sandman42 at libero.it) Date: Tue, 22 Jun 2010 12:36:56 +0200 (CEST) Subject: Where to store plugins in a standard nagios3 installation Message-ID: <8421888.884961277203016960.JavaMail.defaultUser@defaultHost> Hi, I'm testing nagios 3 on ubuntu 10.04. I've downloaded a plugin from nagios exchange, but where the file must be in order to be run? Particularly, I've downloaded check_cisco.pl from nagios exchange, and I've put it in /usr/lib/nagios/plugins I've also given a chmod 0755, and chown root:root. The problem is taht it is not recognized, i.e. in a .cfg file I've used it as check_command .pl check_cisco -h $HOSTADDRESS$ -c public -i BRI0 but when I do a /etc/init.d/nagios3 restart I have the following error: Error: Host check command 'check_cisco.pl -h $HOSTADDRESS$ -c public -i BRI0/0/0:1' specified for host 'ISDNRouter' is not defined anywhere! What have I done wrong???? Thanks in advance Francesco ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Tue Jun 22 13:01:07 2010 From: nagios at flatto.net (Assaf Flatto) Date: Tue, 22 Jun 2010 12:01:07 +0100 Subject: Where to store plugins in a standard nagios3 installation In-Reply-To: <8421888.884961277203016960.JavaMail.defaultUser@defaultHost> References: <8421888.884961277203016960.JavaMail.defaultUser@defaultHost> Message-ID: <4C2097F3.7080601@flatto.net> sandman42 at libero.it wrote: > Hi, > > I'm testing nagios 3 on ubuntu 10.04. > > I've downloaded a plugin from nagios exchange, but where the file must be in > order to be run? > > Particularly, I've downloaded check_cisco.pl from nagios exchange, and I've > put it in /usr/lib/nagios/plugins > I've also given a chmod 0755, and chown root:root. > > The problem is taht it is not recognized, i.e. in a .cfg file I've used it as > > > check_command .pl check_cisco -h $HOSTADDRESS$ -c public -i BRI0 > > but when I do a > > /etc/init.d/nagios3 restart > > I have the following error: > > Error: Host check command 'check_cisco.pl -h $HOSTADDRESS$ -c public -i > BRI0/0/0:1' specified for host 'ISDNRouter' is not defined anywhere! > > What have I done wrong???? > > Thanks in advance > > Francesco > The best place to put it is , in the same place that all other script are located. If you installed from source they should be in /usr/local/nagios/libexec the easy way to find this is looking in the resource.cfg file in the nagios /etc directory , the file has entry named $USER1$ = that path is where the plugins are located. Assaf -- Never,Ever Cut A Deal With a Dragon I am doing a Charity Bike ride On the 27 of June for the Capital to Coast Charity. Please help by Donating http://www.justgiving.com/Lovefilm-capital-to-coast ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jperrin at gmail.com Tue Jun 22 13:10:11 2010 From: jperrin at gmail.com (Jim Perrin) Date: Tue, 22 Jun 2010 07:10:11 -0400 Subject: Where to store plugins in a standard nagios3 installation In-Reply-To: <8421888.884961277203016960.JavaMail.defaultUser@defaultHost> References: <8421888.884961277203016960.JavaMail.defaultUser@defaultHost> Message-ID: On Tue, Jun 22, 2010 at 6:36 AM, sandman42 at libero.it wrote: > ? ? ?Error: Host check command 'check_cisco.pl -h $HOSTADDRESS$ -c public -i > BRI0/0/0:1' specified for host 'ISDNRouter' is not defined anywhere! > > What have I done wrong???? > Did you also define check_cisco in the check_commands.cfg or similar? The binary goes basically where you put it, or where the other plugin binaries are located, but you also have to define the command itself, not just the command for the host or service. You should have a commands.cfg or a misc_commands, or a check_commands.cfg or similar in your /etc/nagios/ dir somewhere. When you find it, you'll see what you're missing. -- During times of universal deceit, telling the truth becomes a revolutionary act. George Orwell ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jtillotson at techtarget.com Tue Jun 22 13:30:08 2010 From: jtillotson at techtarget.com (Jeff Tillotson) Date: Tue, 22 Jun 2010 07:30:08 -0400 Subject: Service Escalation Timing Issue In-Reply-To: <4C208829.4090300@flatto.net> References: <4C208829.4090300@flatto.net> Message-ID: <20100622113007.GA6136@mxyzptlk.knowledgestorm.com> On Tue, Jun 22, 2010 at 05:53:45AM -0400, Assaf Flatto wrote: >Tillotson, Jeff wrote: >> I've got a service that I've set up with the following requirements. E-mail a certain group after service has been down for 5 minutes. page when service has been down for 10 minutes. Then, page again after 30 minutes. I'm fairly certain my problem is with notification_interval in the service_escalation and that I'm misunderstanding this from the documentation: >> "When defining notification escalations, it is important to keep in mind that any contact groups that were members of "lower" escalations (i.e. those with lower notification number ranges) should also be included in "higher" escalation definitions. This should be done to ensure that anyone who gets notified of a problem continues to get notified as the problem is escalated." >> >> >> Following are the configuration options (I've snipped some options down): >> >> Nagios.cfg: >> interval_length=1 (One second) >> >> Template: >> >> define service{ >> name distrib-nevent-graph >> check_period 24x7 >> max_check_attempts 2 >> contact_groups no-one >> notification_options w,u,c,r >> notification_interval 60 >> notification_period 24x7 >> register 0 >> } >> >> Service: >> define service{ >> use distrib-nevent-graph >> hostgroup_name location-v7apache >> service_description v7apache-check >> } >> >> Service Escalation: >> define serviceescalation { >> hostgroup_name location-v7apache >> service_description v7apache-check >> first_notification 5 >> last_notification 0 >> notification_interval 1800 >> contact_groups nopage, core >> } >> define serviceescalation { >> hostgroup_name location-v7apache >> service_description v7apache-check >> first_notification 10 >> last_notification 0 >> notification_interval 1800 >> contact_groups page, nopage, core >> } >> >> >> >If i am reading this right , you have your first notification sent after >2.5 hours . > >1800sec = 30 minutes x 5 ( first notification) = 2.5 hours. > >you might want to change the interval to 300 . > Thanks for your response. If I change the interval to 300, than core and nopage get the notification every 5 minutes after the 5th notification. Then I page won't get the first alert until 30 minutes after the host is down (5 at 1min interval + 5 at 5min interval). What I really want is nopage and core to get notifications after service has been down for 5 minutes and than 30 minutes after. page to get notifications after service has been down for 10 minutes and 30 minutes after. I almost think the following will provide what I want but the documentation section I posted in my original post makes me think this is a bad idea. define serviceescalation { hostgroup_name location-v7apache service_description v7apache-check first_notification 5 last_notification 0 notification_interval 1800 contact_groups nopage, core } define serviceescalation { hostgroup_name location-v7apache service_description v7apache-check first_notification 10 last_notification 0 notification_interval 1800 contact_groups page } --Jeff ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Tue Jun 22 17:31:46 2010 From: nagios at flatto.net (Assaf Flatto) Date: Tue, 22 Jun 2010 16:31:46 +0100 Subject: Service Escalation Timing Issue In-Reply-To: <20100622113007.GA6136@mxyzptlk.knowledgestorm.com> References: <4C208829.4090300@flatto.net> <20100622113007.GA6136@mxyzptlk.knowledgestorm.com> Message-ID: <4C20D762.9090900@flatto.net> Jeff Tillotson wrote: > On Tue, Jun 22, 2010 at 05:53:45AM -0400, Assaf Flatto wrote: > >> Tillotson, Jeff wrote: >> >>> I've got a service that I've set up with the following requirements. E-mail a certain group after service has been down for 5 minutes. page when service has been down for 10 minutes. Then, page again after 30 minutes. I'm fairly certain my problem is with notification_interval in the service_escalation and that I'm misunderstanding this from the documentation: >>> "When defining notification escalations, it is important to keep in mind that any contact groups that were members of "lower" escalations (i.e. those with lower notification number ranges) should also be included in "higher" escalation definitions. This should be done to ensure that anyone who gets notified of a problem continues to get notified as the problem is escalated." >>> >>> >>> Following are the configuration options (I've snipped some options down): >>> >>> Nagios.cfg: >>> interval_length=1 (One second) >>> >>> Template: >>> >>> define service{ >>> name distrib-nevent-graph >>> check_period 24x7 >>> max_check_attempts 2 >>> contact_groups no-one >>> notification_options w,u,c,r >>> notification_interval 60 >>> notification_period 24x7 >>> register 0 >>> } >>> >>> Service: >>> define service{ >>> use distrib-nevent-graph >>> hostgroup_name location-v7apache >>> service_description v7apache-check >>> } >>> >>> Service Escalation: >>> define serviceescalation { >>> hostgroup_name location-v7apache >>> service_description v7apache-check >>> first_notification 5 >>> last_notification 0 >>> notification_interval 1800 >>> contact_groups nopage, core >>> } >>> define serviceescalation { >>> hostgroup_name location-v7apache >>> service_description v7apache-check >>> first_notification 10 >>> last_notification 0 >>> notification_interval 1800 >>> contact_groups page, nopage, core >>> } >>> >>> >>> >>> >> If i am reading this right , you have your first notification sent after >> 2.5 hours . >> >> 1800sec = 30 minutes x 5 ( first notification) = 2.5 hours. >> >> you might want to change the interval to 300 . >> >> > > Thanks for your response. > > If I change the interval to 300, than core and nopage get the > notification every 5 minutes after the 5th notification. Then I page > won't get the first alert until 30 minutes after the host is down > (5 at 1min interval + 5 at 5min interval). What I really want is nopage > and core to get notifications after service has been down for 5 minutes and > than 30 minutes after. page to get notifications after service has been > down for 10 minutes and 30 minutes after. > > I almost think the following will provide what I want but the > documentation section I posted in my original post makes me think this > is a bad idea. > > > define serviceescalation { > hostgroup_name location-v7apache > service_description v7apache-check > first_notification 5 > last_notification 0 > notification_interval 1800 > contact_groups nopage, core > } > define serviceescalation { > hostgroup_name location-v7apache > service_description v7apache-check > first_notification 10 > last_notification 0 > notification_interval 1800 > contact_groups page > } > > --Jeff > > I think this is what you need : define serviceescalation { hostgroup_name location-v7apache service_description v7apache-check first_notification 1 #(after 5 minutes) last_notification 0 notification_interval 300 contact_groups nopage, core } define serviceescalation { hostgroup_name location-v7apache service_description v7apache-check first_notification 6 # (5x 5minutes = 25 after the first notification) last_notification 0 notification_interval 300 contact_groups nopage, core } define serviceescalation { hostgroup_name location-v7apache service_description v7apache-check first_notification 2 #(2x5 minutes) last_notification 0 notification_interval 300 contact_groups page } define serviceescalation { hostgroup_name location-v7apache service_description v7apache-check first_notification 8 # (6 x 5minutes = 30 after the first notification) last_notification 0 notification_interval 300 contact_groups page } -- Never,Ever Cut A Deal With a Dragon I am doing a Charity Bike ride On the 27 of June for the Capital to Coast Charity. Please help by Donating http://www.justgiving.com/Lovefilm-capital-to-coast ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jtillotson at techtarget.com Tue Jun 22 18:44:14 2010 From: jtillotson at techtarget.com (Jeff Tillotson) Date: Tue, 22 Jun 2010 12:44:14 -0400 Subject: Service Escalation Timing Issue In-Reply-To: <4C20D762.9090900@flatto.net> References: <4C208829.4090300@flatto.net> <20100622113007.GA6136@mxyzptlk.knowledgestorm.com> <4C20D762.9090900@flatto.net> Message-ID: <20100622164414.GA8058@mxyzptlk.knowledgestorm.com> On Tue, Jun 22, 2010 at 11:31:46AM -0400, Assaf Flatto wrote: >Jeff Tillotson wrote: >> On Tue, Jun 22, 2010 at 05:53:45AM -0400, Assaf Flatto wrote: >> >>> Tillotson, Jeff wrote: >>> >>>> I've got a service that I've set up with the following requirements. E-mail a certain group after service has been down for 5 minutes. page when service has been down for 10 minutes. Then, page again after 30 minutes. I'm fairly certain my problem is with notification_interval in the service_escalation and that I'm misunderstanding this from the documentation: >>>> "When defining notification escalations, it is important to keep in mind that any contact groups that were members of "lower" escalations (i.e. those with lower notification number ranges) should also be included in "higher" escalation definitions. This should be done to ensure that anyone who gets notified of a problem continues to get notified as the problem is escalated." >> >I think this is what you need : > > >define serviceescalation { > hostgroup_name location-v7apache > service_description v7apache-check > first_notification 1 #(after 5 minutes) > last_notification 0 > notification_interval 300 > contact_groups nopage, core >} > >define serviceescalation { > hostgroup_name location-v7apache > service_description v7apache-check > first_notification 6 # (5x 5minutes = 25 after the first notification) > last_notification 0 > notification_interval 300 > contact_groups nopage, core >} > > >define serviceescalation { > hostgroup_name location-v7apache > service_description v7apache-check > first_notification 2 #(2x5 minutes) > last_notification 0 > notification_interval 300 > contact_groups page >} > > >define serviceescalation { > hostgroup_name location-v7apache > service_description v7apache-check > first_notification 8 # (6 x 5minutes = 30 after the first notification) > last_notification 0 > notification_interval 300 > contact_groups page >} > Thanks for you response and feedback. I got what I needed with the following configs. It make sense now but it wasn't obvious to me in the beginning. define serviceescalation { hostgroup_name location-v7apache service_description v7apache-check first_notification 5 # notification interval is set to 60s # so the first notification goes out # after five minutes last_notification 5 notification_interval 300 # It sets the notification interval # to 5 minutes contact_groups nopage, core } define serviceescalation { hostgroup_name location-v7apache service_description v7apache-check first_notification 6 # Because notification interval was # set to 5 minutes, this check # goes out 5 minutes later. last_notification 0 notification_interval 1800 # Set the notification interval to # 30 minutes. So the remaining # notifications go out at 30 minute # intervals. contact_groups nopage, core } ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mgius at createspace.com Tue Jun 22 18:13:41 2010 From: mgius at createspace.com (Gius, Mark) Date: Tue, 22 Jun 2010 09:13:41 -0700 Subject: Where to store plugins in a standard nagios3 installation In-Reply-To: <8421888.884961277203016960.JavaMail.defaultUser@defaultHost> References: <8421888.884961277203016960.JavaMail.defaultUser@defaultHost> Message-ID: <23193A17705DD246AFFFDF09B779F56F25198E120A@EX-IAD6-B.ant.amazon.com> We use two variables, USER1 and USER2 to reflect the location of the default Nagios plugins and any custom plugins that we write. So a sample command definition looks like: ## Ping check definition define command { command_name check-host-alive command_line $USER1$/check_ping -H $HOSTADDRESS$ -w 3000.0,80% -c 5000.0,100% -p 1 -4 } Where $USER1$ is defined as the location that our Nagios RPM installs plugins (/var/lib/Nagios/plugins I believe). Most of our custom plugins are executed entirely on our central Nagios server, so this setup works well for us. -Gius > -----Original Message----- > From: sandman42 at libero.it [mailto:sandman42 at libero.it] > Sent: Tuesday, June 22, 2010 3:37 AM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] Where to store plugins in a standard nagios3 > installation > > Hi, > > I'm testing nagios 3 on ubuntu 10.04. > > I've downloaded a plugin from nagios exchange, but where the file must > be in > order to be run? > > Particularly, I've downloaded check_cisco.pl from nagios exchange, and > I've > put it in /usr/lib/nagios/plugins > I've also given a chmod 0755, and chown root:root. > > The problem is taht it is not recognized, i.e. in a .cfg file I've used > it as > > > check_command .pl check_cisco -h $HOSTADDRESS$ -c public -i > BRI0 > > but when I do a > > /etc/init.d/nagios3 restart > > I have the following error: > > Error: Host check command 'check_cisco.pl -h $HOSTADDRESS$ -c > public -i > BRI0/0/0:1' specified for host 'ISDNRouter' is not defined anywhere! > > What have I done wrong???? > > Thanks in advance > > Francesco > > > > > > ----------------------------------------------------------------------- > ------- > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From darren at brucetelecom.com Tue Jun 22 20:05:02 2010 From: darren at brucetelecom.com (Darren Hill) Date: Tue, 22 Jun 2010 14:05:02 -0400 Subject: nagios/opsview with qpage In-Reply-To: <1274881667.2482.16.camel@localhost.localdomain> References: <4BFC0604.8020506@brucetelecom.com> <1274817654.2513.19.camel@localhost.localdomain> <4BFD1DA7.9090405@brucetelecom.com> <1274881667.2482.16.camel@localhost.localdomain> Message-ID: <4C20FB4E.6060701@brucetelecom.com> Hi, I have this contact script for my nagios/opsview to work. I have qpage paging manually if I just tell it to run qpage, but I don't want 2 contacts, one for hosts, one for services. I need a way for it to figure out if it's a host or service page and then run the command from there. Any ideas what I've got wrong? Running qpage manually works fine such as "qpage -dip darren this is a test". If I set it up to just use qpage and send it all, it pages but is all garbled as it includes variables for a service in a host page that does not work. Any help would be greatly appreciated. This is the standard opsview submit_sms script which I am attempting to modify to use qpage instead of sending an email. #!/usr/bin/perl -w # # # SYNTAX: my $usage = qq{ sms_darren Usage: sms_darren }; sub usage { if ($_ = shift) { print "Error: $_\n" } print $usage; exit 1; } use strict; use Switch; use lib "/usr/local/nagios/perl/lib"; sub ldie { $_ = shift; print $_.$/; exit 1 } my ($text, $message); if ( $ENV{NAGIOS_SERVICEDESC} ) { ## TODO Need to check for ack message as well ## ACKNOWLEDGE_SVC_PROBLEM $text = "$ENV{NAGIOS_SERVICEDESC} on $ENV{NAGIOS_HOSTNAME} is $ENV{NAGIOS_SERVICESTATE}: $ENV{NAGIOS_SERVICEOUTPUT} ( $ENV{NAGIOS_SHORTDATETIME})"; # $text = "$ENV{NAGIOS_HOSTNAME} $ENV{NAGIOS_SERVICEDESC} $ENV{NAGIOS_SERVICESTATE}: $ENV{NAGIOS_SERVICEOUTPUT} ($ENV{N AGIOS_SHORTDATETIME})"; } else { $text = "$ENV{NAGIOS_HOSTNAME} is $ENV{NAGIOS_HOSTSTATE}: $ENV{NAGIOS_HOSTOUTPUT} ($ENV{NAGIOS_SHORTDATETIME})"; } # Cleanup text $text =~ s/\n/ /g; my $maxchars = 135; $text = substr($text, 0, $maxchars); qpage -s nagios -p $CONTACTNAME$ echo -e $text ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From richard.gliebe at fhv.at Wed Jun 23 08:49:40 2010 From: richard.gliebe at fhv.at (Richard Gliebe) Date: Wed, 23 Jun 2010 08:49:40 +0200 Subject: check_snmp Cisco 7304 Message-ID: <4C21AE84.4050603@fhv.at> Hi all, I'm running some scripts, which are checking our Cisco Routers (PowerSupplies/Temperature/Fans) The standard Cisco Mibs for the 6500er Cisco series works fine, but not for the 7300 cisco series. What I need, are the mibs for temperature, Chassis Fans, and the mibs for the Powersupplies and the powersupply fans. I found the powersupply mibs with "snmpwalk", but not the mibs for the powersupply state (ok/Fault), also the mibs for the chassis fan(s). # snmpget -v1 -c .1.3.6.1.2.1.47.1.1.1.1.7.35 SNMPv2-SMI::mib-2.47.1.1.1.1.7.35 = STRING: "Cisco 7304 AC Power Supply 1" # snmpget -v1 -c .1.3.6.1.2.1.47.1.1.1.1.7.57 SNMPv2-SMI::mib-2.47.1.1.1.1.7.57 = STRING: "Cisco 7304 AC Power Supply 2" Searching www.mibdepot.com, www.cisco.com and google was without luck. Here is an output from one of our Cisco 6500 Routers: PowerSupply-1Fan OK 06-23-2010 08:41:22 967d 20h 5m 0s 1/3 Fan "Power Supply 2 Fan": OK PowerSupply-1State OK 06-23-2010 08:41:59 769d 7h 43m 9s 1/3 PowerSupply "Power Supply 1, WS-CAC-3000W": OK PowerSupply-2Fan OK 06-23-2010 08:42:34 967d 19h 21m 19s 1/3 Fan "Chassis Fan Tray 1": OK PowerSupply-2State OK 06-23-2010 08:43:10 967d 19h 21m 10s 1/3 PowerSupply "Power Supply 2, WS-CAC-3000W": OK TemperatureState OK 06-23-2010 08:43:46 61d 11h 53m 2s 1/3 Temperature (degrees Celsius) OK - 34 Any hints are welcome ;-) many thanks Richard ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Gerald.Ortner at gespag.at Wed Jun 23 09:48:15 2010 From: Gerald.Ortner at gespag.at (Ortner, Gerald) Date: Wed, 23 Jun 2010 09:48:15 +0200 Subject: check_snmp Cisco 7304 In-Reply-To: <4C21AE84.4050603@fhv.at> References: <4C21AE84.4050603@fhv.at> Message-ID: <13579FFE8B208F4DBA327EE25F804AAB83641112@swvbpheaglxmb02.health.local> Hi, Have a look at https://support.ipmonitor.com/mibs/CISCO-ENVMON-MIB/tree.aspx For Cisco 3701 temperature: SNMPv2-SMI::enterprises.9.9.13.1.3.1.2.1 = STRING: "Thermal Sensor 1" SNMPv2-SMI::enterprises.9.9.13.1.3.1.2.2 = STRING: "Thermal Sensor 2" SNMPv2-SMI::enterprises.9.9.13.1.3.1.3.1 = Gauge32: 20 SNMPv2-SMI::enterprises.9.9.13.1.3.1.3.2 = Gauge32: 23 Gerald -----Urspr?ngliche Nachricht----- Von: Richard Gliebe [mailto:richard.gliebe at fhv.at] Gesendet: Mittwoch, 23. Juni 2010 08:50 An: nagios-users Betreff: [Nagios-users] check_snmp Cisco 7304 Hi all, I'm running some scripts, which are checking our Cisco Routers (PowerSupplies/Temperature/Fans) The standard Cisco Mibs for the 6500er Cisco series works fine, but not for the 7300 cisco series. What I need, are the mibs for temperature, Chassis Fans, and the mibs for the Powersupplies and the powersupply fans. I found the powersupply mibs with "snmpwalk", but not the mibs for the powersupply state (ok/Fault), also the mibs for the chassis fan(s). # snmpget -v1 -c .1.3.6.1.2.1.47.1.1.1.1.7.35 SNMPv2-SMI::mib-2.47.1.1.1.1.7.35 = STRING: "Cisco 7304 AC Power Supply 1" # snmpget -v1 -c .1.3.6.1.2.1.47.1.1.1.1.7.57 SNMPv2-SMI::mib-2.47.1.1.1.1.7.57 = STRING: "Cisco 7304 AC Power Supply 2" Searching www.mibdepot.com, www.cisco.com and google was without luck. Here is an output from one of our Cisco 6500 Routers: PowerSupply-1Fan OK 06-23-2010 08:41:22 967d 20h 5m 0s 1/3 Fan "Power Supply 2 Fan": OK PowerSupply-1State OK 06-23-2010 08:41:59 769d 7h 43m 9s 1/3 PowerSupply "Power Supply 1, WS-CAC-3000W": OK PowerSupply-2Fan OK 06-23-2010 08:42:34 967d 19h 21m 19s 1/3 Fan "Chassis Fan Tray 1": OK PowerSupply-2State OK 06-23-2010 08:43:10 967d 19h 21m 10s 1/3 PowerSupply "Power Supply 2, WS-CAC-3000W": OK TemperatureState OK 06-23-2010 08:43:46 61d 11h 53m 2s 1/3 Temperature (degrees Celsius) OK - 34 Any hints are welcome ;-) many thanks Richard ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From richard.gliebe at fhv.at Wed Jun 23 10:43:15 2010 From: richard.gliebe at fhv.at (Richard Gliebe) Date: Wed, 23 Jun 2010 10:43:15 +0200 Subject: check_snmp Cisco 7304 In-Reply-To: <13579FFE8B208F4DBA327EE25F804AAB83641112@swvbpheaglxmb02.health.local> References: <4C21AE84.4050603@fhv.at> <13579FFE8B208F4DBA327EE25F804AAB83641112@swvbpheaglxmb02.health.local> Message-ID: <4C21C923.4040906@fhv.at> On 6/23/10 9:48 AM Ortner, Gerald wrote: > Hi, > Have a look at https://support.ipmonitor.com/mibs/CISCO-ENVMON-MIB/tree.aspx > For Cisco 3701 temperature: > SNMPv2-SMI::enterprises.9.9.13.1.3.1.2.1 = STRING: "Thermal Sensor 1" > SNMPv2-SMI::enterprises.9.9.13.1.3.1.2.2 = STRING: "Thermal Sensor 2" > SNMPv2-SMI::enterprises.9.9.13.1.3.1.3.1 = Gauge32: 20 > SNMPv2-SMI::enterprises.9.9.13.1.3.1.3.2 = Gauge32: 23 > Gerald this mibs doesn't match with cisco 7304 temperature: # snmpget -v1 -c 1.3.6.1.4.1.9.9.13.1.3.1.6 Error in packet Reason: (noSuchName) There is no such variable name in this MIB. Failed object: SNMPv2-SMI::enterprises.9.9.13.1.3.1.6 ... also our mibs thats my problem greets from vlbg ;-) thanks Richard ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From standalone.sysadmin at gmail.com Wed Jun 23 15:12:40 2010 From: standalone.sysadmin at gmail.com (Matt Simmons) Date: Wed, 23 Jun 2010 09:12:40 -0400 Subject: check_disk and Volumes > 10 TB In-Reply-To: <4C1CC7BC.8050905@kinzesberg.de> References: <4C130E00.6030702@kinzesberg.de> <4C1CC7BC.8050905@kinzesberg.de> Message-ID: If the underlying OSes are 32bit, then it needs to be 32 bit as well (indeed, if it were 64 bit, it probably wouldn't run). Does the issue show itself if you change the units that check_disk report in (for example: --units MB)? --Matt On Sat, Jun 19, 2010 at 9:35 AM, Dirk H. Schulz wrote: > Am 12.06.10 13:16, schrieb Matt Simmons: >> I've seen people run into problems like this when they're checking a >> machine that's 64 bit, and the monitoring host is 32. That's not, by >> chance, the case now, is it? >> > Well, the problem also arises if check_disk is run locally on the > monitored host. Could it be that it is compiled for 32Bit and has to > compiled for 64Bit? > > Dirk >> --Matt >> >> On Sat, Jun 12, 2010 at 12:33 AM, Dirk H. Schulz >> ?wrote: >> >>> Hi folks, >>> >>> I have run into a problem with check_disk. I have a volume with 14 TB >>> that is 91 % free: >>> >>>> /dev/disk0s3 ? ?14Ti ?1.2Ti ? 13Ti ? ? 9% ? ?/Volumes/EonStor1 >>>> >>> check_disk states it is 0 % free: >>> >>>> check_disk -u GB /Volumes/EonStor1 >>>> DISK OK - free space: /Volumes/EonStor1 0 GB (0% inode=91%);| >>>> /Volumes/EonStor1=1276GB;;;0;14665 >>>> >>> Is there a known limitation concerning the size of the volumes? With a >>> volume< ?2 TB I do not have this problem on the same machine. >>> >>> Is there something I can do to get around this? >>> >>> Any hint or help is appreciated. >>> >>> Dirk >>> >>> > > > ------------------------------------------------------------------------------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. ?See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- LITTLE GIRL: But which cookie will you eat FIRST? COOKIE MONSTER: Me think you have misconception of cookie-eating process. ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From felix at itim-cj.ro Thu Jun 24 08:56:56 2010 From: felix at itim-cj.ro (felix farcas) Date: Thu, 24 Jun 2010 09:56:56 +0300 Subject: file and directory rights Message-ID: <4C2301B8.8020801@itim-cj.ro> Hello My web interface is functioning fine. My configuration verifing command is functioning too. But at the web interface when trying to something at "hosts" the following message appear: The rights on the directory in freebsd are root:wheel but I added in nagios.cfg nagios_user=root nagios_group=wheel Do you have any suggestion? "Whoops!*Error: Could not read object configuration data! *Here are some things you should check in order to resolve this error: 1. Verify configuration options using the *-v* command-line option to check for errors. 2. Check the Nagios log file for messages relating to startup or status data errors. Make sure you read the documentation on installing, configuring and running Nagios thoroughly before continuing. If all else fails, try sending a message to one of the mailing lists." nagios -v /usr/local/etc/nagios/nagios.cfg Nagios Core 3.2.1 Copyright (c) 2009-2010 Nagios Core Development Team and Community Contributors Copyright (c) 1999-2009 Ethan Galstad Last Modified: 03-09-2010 License: GPL Website: http://www.nagios.org Reading configuration data... Read main config file okay... Processing object config file '/usr/local/etc/nagios/objects/commands.cfg'... Processing object config file '/usr/local/etc/nagios/objects/contacts.cfg'... Processing object config file '/usr/local/etc/nagios/objects/timeperiods.cfg'... Processing object config file '/usr/local/etc/nagios/objects/templates.cfg'... Processing object config file '/usr/local/etc/nagios/objects/localhost.cfg'... Read object config files okay... Running pre-flight check on configuration data... Checking services... Checked 8 services. Checking hosts... Checked 1 hosts. Checking host groups... Checked 1 host groups. Checking service groups... Checked 0 service groups. Checking contacts... Checked 1 contacts. Checking contact groups... Checked 1 contact groups. Checking service escalations... Checked 0 service escalations. Checking service dependencies... Checked 0 service dependencies. Checking host escalations... Checked 0 host escalations. Checking host dependencies... Checked 0 host dependencies. Checking commands... Checked 24 commands. Checking time periods... Checked 5 time periods. Checking for circular paths between hosts... Checking for circular host and service dependencies... Checking global event handlers... Checking obsessive compulsive processor commands... Checking misc settings... Total Warnings: 0 Total Errors: 0 Things look okay - No serious problems were detected during the pre-flight check Thank you Felix -------------- next part -------------- A non-text attachment was scrubbed... Name: smime.p7s Type: application/x-pkcs7-signature Size: 3092 bytes Desc: S/MIME Cryptographic Signature URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From dimitrigordeziani-Re5JQEeQqe8AvxtiuMwx3w at public.gmane.org Thu Jun 24 14:51:12 2010 From: dimitrigordeziani-Re5JQEeQqe8AvxtiuMwx3w at public.gmane.org (Dimitri Gordeziani) Date: Thu, 24 Jun 2010 15:51:12 +0300 Subject: Hello, can someone help me... In-Reply-To: References: Message-ID: Please can you reply me on this email if it is possible to make nagios plugin to monitor traffice in bits instead of bytes and how? thank you.. On Wed, Jun 23, 2010 at 8:26 PM, Dimitri Gordeziani < dimitrigordeziani-Re5JQEeQqe8AvxtiuMwx3w at public.gmane.org> wrote: > Hello, > > can someone help me? > > Im using nagious plugins for cacti to monitor vmware esxi (VPS server) > > the issue is that when I try to get net info(vm_net) for Virtuam Machine > located on this VPS host server, I get only (net_receive=0.00KB;; > net_send=0.00KB;;) - the command is example "perl /var/www/cacti/scripts/ > check_esx3.pl -H "VPS host IP" -N "VIrtual Machine Name" -u User -p > password -l NET" > > but as I guess(after investigated) this issue happens when there is network > trafic(for Virtual Machine) in bits(small/less traffic) and if there is the > traffice in KB,MB so then there is no any issue(the script is worknig fine > and gives me a result example "net_receive=122.12KB;; net_send=1210.12KB;;") > > that is why I would like if it is possible to change/replace the default > measure for networ activity from uom="kB"(kbyte) with uom="kb" or uom="b", > is it possible? > > Ive tried to change this value in ..../Nagios/Plugin.pm and > ...../Nagios/Plugin/Performance.pm files but without any success.. > > pleaes let me know if it is possible somehow to set default traffic measure > in bits and not bytes... otherwise I do not get any network info when there > is activity in bits(low network activity) and not in bytes/kB/MB (hight > network activity) > > thank you in advance.. > > > > > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagiosplug-help mailing list Nagiosplug-help-5NWGOfrQmneRv+LV9MX5uipxlwaOVQ5f at public.gmane.org https://lists.sourceforge.net/lists/listinfo/nagiosplug-help ::: Please include plugins version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mirde at oppy.com Thu Jun 24 21:55:40 2010 From: mirde at oppy.com (Mirza Dedic) Date: Thu, 24 Jun 2010 12:55:40 -0700 Subject: NSCA + NSClient Message-ID: Hi, I have NSCA configured on my Nagios host, and enabled the necessary plugins on NSClient++ to support NSCA, configure XINETD appropriately inside my NSClient config I have: [NSCA Commands] my_cpu_check=checkCPU warn=80 crit=90 time=20m time=10s time=4 my_mem_check=checkMem MaxWarn=80% MaxCrit=90% ShowAll type=page This is just for testing, I also have in my nagios.cfg: accept_passive_service_checks=1 accept_passive_host_checks=1 In my NSClient Log I can see: 2010-06-24 12:48:44: debug:modules\NSCAAgent\NSCAThread.cpp:205: Executing (from NSCA): my_cpu_check 2010-06-24 12:48:44: debug:NSClient++.cpp:1106: Injecting: checkCPU: warn=80, crit=90, time=20m, time=10s, time=4 2010-06-24 12:48:44: debug:NSClient++.cpp:1142: Injected Result: OK 'OK CPU Load ok.' 2010-06-24 12:48:44: debug:NSClient++.cpp:1143: Injected Performance Result: ''20m'=0%;80;90; '10s'=6%;80;90; '4'=0%;80;90; ' 2010-06-24 12:48:44: debug:modules\NSCAAgent\NSCAThread.cpp:205: Executing (from NSCA): my_mem_check 2010-06-24 12:48:44: debug:NSClient++.cpp:1106: Injecting: checkMem: MaxWarn=80%, MaxCrit=90%, ShowAll, type=page 2010-06-24 12:48:44: debug:NSClient++.cpp:1142: Injected Result: OK 'OK: page file: 8.82G' 2010-06-24 12:48:44: debug:NSClient++.cpp:1143: Injected Performance Result: ''page file %'=45%;80;90; 'page file'=8.81G;15.6;17.59;0;19.5; ' What I want to know is, until now I have been using active checks, and for some servers I want to use passive_checks as well, so that the server updates Nagios. If I have active checks defined within my Nagios installation such as: define service{ use generic-service host_name van-mail01 service_description D - Disk Space check_command check_nt_disk!D!98!99 } Can I modify this to also read from the passive_check, and what would my [NSCA Commands] definition look like? Does the first part have to reflect the service description? How does the information coming from the NSCA Client get mapped to my configured checks? Thank you. The Oppenheimer Group ---- CONFIDENTIAL This message is for the designated recipient only and may contain privileged, proprietary, or otherwise private information. If you have received it in error, please notify the sender immediately and delete the original. Any other use of the email by you is prohibited. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ryan.c.ash.lu4w at statefarm.com Thu Jun 24 22:07:34 2010 From: ryan.c.ash.lu4w at statefarm.com (Ryan C Ash) Date: Thu, 24 Jun 2010 13:07:34 -0700 Subject: NSCA + NSClient In-Reply-To: References: Message-ID: The short answer is yes, the service description you configure on the client nsca message needs to match that of the service description on your nagios server. If you want to migrate to a passive check you need to disable active checks and enable passive ones. You can leave the check command in place and incorporate freshness checks to force an active check if the passive fail. For me I don't want to do active if passive fail. I would rather cut a ticket "service stale". The nagios doc clearly shows how to configure passive service checks so give it a read. So you need "my_cpu_check" to be a service description so nagios knows what to match that incoming nsca message to. Ash From: Mirza Dedic [mailto:mirde at oppy.com] Sent: Thursday, June 24, 2010 2:56 PM To: 'Nagios-Users' Subject: [Nagios-users] NSCA + NSClient Hi, I have NSCA configured on my Nagios host, and enabled the necessary plugins on NSClient++ to support NSCA, configure XINETD appropriately inside my NSClient config I have: [NSCA Commands] my_cpu_check=checkCPU warn=80 crit=90 time=20m time=10s time=4 my_mem_check=checkMem MaxWarn=80% MaxCrit=90% ShowAll type=page This is just for testing, I also have in my nagios.cfg: accept_passive_service_checks=1 accept_passive_host_checks=1 In my NSClient Log I can see: 2010-06-24 12:48:44: debug:modules\NSCAAgent\NSCAThread.cpp:205: Executing (from NSCA): my_cpu_check 2010-06-24 12:48:44: debug:NSClient++.cpp:1106: Injecting: checkCPU: warn=80, crit=90, time=20m, time=10s, time=4 2010-06-24 12:48:44: debug:NSClient++.cpp:1142: Injected Result: OK 'OK CPU Load ok.' 2010-06-24 12:48:44: debug:NSClient++.cpp:1143: Injected Performance Result: ''20m'=0%;80;90; '10s'=6%;80;90; '4'=0%;80;90; ' 2010-06-24 12:48:44: debug:modules\NSCAAgent\NSCAThread.cpp:205: Executing (from NSCA): my_mem_check 2010-06-24 12:48:44: debug:NSClient++.cpp:1106: Injecting: checkMem: MaxWarn=80%, MaxCrit=90%, ShowAll, type=page 2010-06-24 12:48:44: debug:NSClient++.cpp:1142: Injected Result: OK 'OK: page file: 8.82G' 2010-06-24 12:48:44: debug:NSClient++.cpp:1143: Injected Performance Result: ''page file %'=45%;80;90; 'page file'=8.81G;15.6;17.59;0;19.5; ' What I want to know is, until now I have been using active checks, and for some servers I want to use passive_checks as well, so that the server updates Nagios. If I have active checks defined within my Nagios installation such as: define service{ use generic-service host_name van-mail01 service_description D - Disk Space check_command check_nt_disk!D!98!99 } Can I modify this to also read from the passive_check, and what would my [NSCA Commands] definition look like? Does the first part have to reflect the service description? How does the information coming from the NSCA Client get mapped to my configured checks? Thank you. The Oppenheimer Group ---- CONFIDENTIAL This message is for the designated recipient only and may contain privileged, proprietary, or otherwise private information. If you have received it in error, please notify the sender immediately and delete the original. Any other use of the email by you is prohibited. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mirde at oppy.com Thu Jun 24 22:56:24 2010 From: mirde at oppy.com (Mirza Dedic) Date: Thu, 24 Jun 2010 13:56:24 -0700 Subject: NSCA + NSClient In-Reply-To: References: Message-ID: Thanks, after reading the 3.x on passive_checks I get how to configure the service. Now, what would be the benefit of having active/passive checks enabled for a service? Say, it takes <5 minutes for Nagios to process my 80 hosts/600 services, if the service that I am looking to enable passive checks on as well is checked near the end of the 5 minute mark, wouldn't it get an update much sooner having passive checks enabled? That said, NSClient sending the information to Nagios, logically this sounds like it should work like that, or based off: check_result_reaper_frequency=5 max_check_result_reaper_time=30 So within a max of 30 seconds, I should be able to see if that service is UP/DOWN in the Nagios (or the op5 Ninja) interface? Are passive checks spread out like active checks on say, when Nagios starts? Basically, I want to have the alerting tight as possible, if I login to my IIS server and stop the IISADMIN service, I want to be alerted within those 0-30 seconds based on the reaper frequency. The box that I put Nagios on has enough CPU/RAM and fast enough subsystem I/O to build this type of configuration, but I want to make sure the logic above is correct. Thanks. From: Ryan C Ash [mailto:ryan.c.ash.lu4w at statefarm.com] Sent: June/24/2010 1:08 PM To: Nagios-Users Subject: Re: [Nagios-users] NSCA + NSClient The short answer is yes, the service description you configure on the client nsca message needs to match that of the service description on your nagios server. If you want to migrate to a passive check you need to disable active checks and enable passive ones. You can leave the check command in place and incorporate freshness checks to force an active check if the passive fail. For me I don't want to do active if passive fail. I would rather cut a ticket "service stale". The nagios doc clearly shows how to configure passive service checks so give it a read. So you need "my_cpu_check" to be a service description so nagios knows what to match that incoming nsca message to. Ash From: Mirza Dedic [mailto:mirde at oppy.com] Sent: Thursday, June 24, 2010 2:56 PM To: 'Nagios-Users' Subject: [Nagios-users] NSCA + NSClient Hi, I have NSCA configured on my Nagios host, and enabled the necessary plugins on NSClient++ to support NSCA, configure XINETD appropriately inside my NSClient config I have: [NSCA Commands] my_cpu_check=checkCPU warn=80 crit=90 time=20m time=10s time=4 my_mem_check=checkMem MaxWarn=80% MaxCrit=90% ShowAll type=page This is just for testing, I also have in my nagios.cfg: accept_passive_service_checks=1 accept_passive_host_checks=1 In my NSClient Log I can see: 2010-06-24 12:48:44: debug:modules\NSCAAgent\NSCAThread.cpp:205: Executing (from NSCA): my_cpu_check 2010-06-24 12:48:44: debug:NSClient++.cpp:1106: Injecting: checkCPU: warn=80, crit=90, time=20m, time=10s, time=4 2010-06-24 12:48:44: debug:NSClient++.cpp:1142: Injected Result: OK 'OK CPU Load ok.' 2010-06-24 12:48:44: debug:NSClient++.cpp:1143: Injected Performance Result: ''20m'=0%;80;90; '10s'=6%;80;90; '4'=0%;80;90; ' 2010-06-24 12:48:44: debug:modules\NSCAAgent\NSCAThread.cpp:205: Executing (from NSCA): my_mem_check 2010-06-24 12:48:44: debug:NSClient++.cpp:1106: Injecting: checkMem: MaxWarn=80%, MaxCrit=90%, ShowAll, type=page 2010-06-24 12:48:44: debug:NSClient++.cpp:1142: Injected Result: OK 'OK: page file: 8.82G' 2010-06-24 12:48:44: debug:NSClient++.cpp:1143: Injected Performance Result: ''page file %'=45%;80;90; 'page file'=8.81G;15.6;17.59;0;19.5; ' What I want to know is, until now I have been using active checks, and for some servers I want to use passive_checks as well, so that the server updates Nagios. If I have active checks defined within my Nagios installation such as: define service{ use generic-service host_name van-mail01 service_description D - Disk Space check_command check_nt_disk!D!98!99 } Can I modify this to also read from the passive_check, and what would my [NSCA Commands] definition look like? Does the first part have to reflect the service description? How does the information coming from the NSCA Client get mapped to my configured checks? Thank you. The Oppenheimer Group ---- CONFIDENTIAL This message is for the designated recipient only and may contain privileged, proprietary, or otherwise private information. If you have received it in error, please notify the sender immediately and delete the original. Any other use of the email by you is prohibited. The Oppenheimer Group ---- CONFIDENTIAL This message is for the designated recipient only and may contain privileged, proprietary, or otherwise private information. If you have received it in error, please notify the sender immediately and delete the original. Any other use of the email by you is prohibited. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mlitwin at stubhub.com Fri Jun 25 00:16:46 2010 From: mlitwin at stubhub.com (Litwin, Matthew) Date: Thu, 24 Jun 2010 16:16:46 -0600 Subject: Help - I just blew away my configs Message-ID: I just blew away all my nagios config files. Nagios is still running. Is there any way I can make nagios spit up the configs that were loaded from the command line or is all hope lost? ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mgius at createspace.com Fri Jun 25 01:11:06 2010 From: mgius at createspace.com (Gius, Mark) Date: Thu, 24 Jun 2010 19:11:06 -0400 Subject: Help - I just blew away my configs In-Reply-To: References: Message-ID: <23193A17705DD246AFFFDF09B779F56F25199A026B@EX-IAD6-B.ant.amazon.com> A pretty significant portion of the configurations are stored in the objects cache (/var/log/Nagios/objects.cache for me). This won't be as clean as your configs (and I'm not sure Nagios can use this file as a config directly), but you should be able to recover a pretty good amount of your running configs from there, and start reconstructing the config files. May I be the first to suggest placing your configuration in revision control, which will help mitigate this problem in the future. I would suggest Subversion (http://subversion.tigris.org/) or git (http://git-scm.com/). Good luck! -Gius > -----Original Message----- > From: Litwin, Matthew [mailto:mlitwin at stubhub.com] > Sent: Thursday, June 24, 2010 3:17 PM > To: Nagios Users List > Subject: [Nagios-users] Help - I just blew away my configs > > I just blew away all my nagios config files. Nagios is still running. > Is there any way I can make nagios spit up the configs that were loaded > from the command line or is all hope lost? > ----------------------------------------------------------------------- > ------- > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From scarley at gmi-mr.com Fri Jun 25 03:52:59 2010 From: scarley at gmi-mr.com (Sean Carley) Date: Thu, 24 Jun 2010 18:52:59 -0700 Subject: Help - I just blew away my configs In-Reply-To: <23193A17705DD246AFFFDF09B779F56F25199A026B@EX-IAD6-B.ant.amazon.com> References: <23193A17705DD246AFFFDF09B779F56F25199A026B@EX-IAD6-B.ant.amazon.com> Message-ID: You can also try /nagios/cgi-bin/config.cgi, not sure if that shows the running or on-disk configs. -s -----Original Message----- From: Gius, Mark [mailto:mgius at createspace.com] Sent: Thursday, June 24, 2010 4:11 PM To: Nagios Users List Subject: Re: [Nagios-users] Help - I just blew away my configs A pretty significant portion of the configurations are stored in the objects cache (/var/log/Nagios/objects.cache for me). This won't be as clean as your configs (and I'm not sure Nagios can use this file as a config directly), but you should be able to recover a pretty good amount of your running configs from there, and start reconstructing the config files. May I be the first to suggest placing your configuration in revision control, which will help mitigate this problem in the future. I would suggest Subversion (http://subversion.tigris.org/) or git (http://git-scm.com/). Good luck! -Gius > -----Original Message----- > From: Litwin, Matthew [mailto:mlitwin at stubhub.com] > Sent: Thursday, June 24, 2010 3:17 PM > To: Nagios Users List > Subject: [Nagios-users] Help - I just blew away my configs > > I just blew away all my nagios config files. Nagios is still running. > Is there any way I can make nagios spit up the configs that were loaded > from the command line or is all hope lost? > ----------------------------------------------------------------------- > ------- > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------ ------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From perldork at webwizarddesign.com Fri Jun 25 06:47:01 2010 From: perldork at webwizarddesign.com (Max) Date: Fri, 25 Jun 2010 00:47:01 -0400 Subject: Help - I just blew away my configs In-Reply-To: References: <23193A17705DD246AFFFDF09B779F56F25199A026B@EX-IAD6-B.ant.amazon.com> Message-ID: Shows running settings (config.cgi) Objects.cache contains a flattened version of all the configuration objects your configuration tree had; the primary loss is that there are no templates in objects.cache, so you will have to recreate those. Isave a copy of objects.cache in a very safe place off server and save a copy on server - nagios has a command line option that will let you start it using an objects.cache formatted file ( search for pre-cached configs)m If you had plugins in your config tree those are lost, cgi.cfg settings are lost, nagios.cfg settings are lost, and resource.cfg settings are lost - minus resource.cfg the rest are pretty eady to redo (as resource.cfg might have had passwords or other unique info. Definitely put your configs in svn, cvs, git or another version control system as you re-create your config templates - and makes sure you backup your vcs repository daily. Max On 6/24/10, Sean Carley wrote: > You can also try /nagios/cgi-bin/config.cgi, not sure if that shows the > running or on-disk configs. > > -s > > > > -----Original Message----- > From: Gius, Mark [mailto:mgius at createspace.com] > Sent: Thursday, June 24, 2010 4:11 PM > To: Nagios Users List > Subject: Re: [Nagios-users] Help - I just blew away my configs > > A pretty significant portion of the configurations are stored in the > objects cache (/var/log/Nagios/objects.cache for me). This won't be as > clean as your configs (and I'm not sure Nagios can use this file as a > config directly), but you should be able to recover a pretty good amount > of your running configs from there, and start reconstructing the config > files. > > May I be the first to suggest placing your configuration in revision > control, which will help mitigate this problem in the future. I would > suggest Subversion (http://subversion.tigris.org/) or git > (http://git-scm.com/). > > Good luck! > > -Gius > >> -----Original Message----- >> From: Litwin, Matthew [mailto:mlitwin at stubhub.com] >> Sent: Thursday, June 24, 2010 3:17 PM >> To: Nagios Users List >> Subject: [Nagios-users] Help - I just blew away my configs >> >> I just blew away all my nagios config files. Nagios is still running. >> Is there any way I can make nagios spit up the configs that were > loaded >> from the command line or is all hope lost? >> > ----------------------------------------------------------------------- >> ------- >> ThinkGeek and WIRED's GeekDad team up for the Ultimate >> GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the >> lucky parental unit. See the prize list and enter to win: >> http://p.sf.net/sfu/thinkgeek-promo >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when >> reporting any issue. >> ::: Messages without supporting info will risk being sent to /dev/null > > ------------------------------------------------------------------------ > ------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > ------------------------------------------------------------------------------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From roy at karlsbakk.net Thu Jun 24 19:20:42 2010 From: roy at karlsbakk.net (Roy Sigurd Karlsbakk) Date: Thu, 24 Jun 2010 19:20:42 +0200 (CEST) Subject: wiki down? Message-ID: <20285738.139.1277400042080.JavaMail.root@zimbra> hi all seems something is rather bad with http://wiki.nagios.org/ - anyone here with access to the box? Vennlige hilsener / Best regards roy -- Roy Sigurd Karlsbakk (+47) 97542685 roy at karlsbakk.net http://blogg.karlsbakk.net/ -- I all pedagogikk er det essensielt at pensum presenteres intelligibelt. Det er et element?rt imperativ for alle pedagoger ? unng? eksessiv anvendelse av idiomer med fremmed opprinnelse. I de fleste tilfeller eksisterer adekvate og relevante synonymer p? norsk. ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From roy at karlsbakk.net Fri Jun 25 13:38:26 2010 From: roy at karlsbakk.net (Roy Sigurd Karlsbakk) Date: Fri, 25 Jun 2010 13:38:26 +0200 (CEST) Subject: wiki down? In-Reply-To: <20285738.139.1277400042080.JavaMail.root@zimbra> References: <20285738.139.1277400042080.JavaMail.root@zimbra> Message-ID: <24264921.40.1277465906150.JavaMail.root@zimbra> ----- Original Message ----- > hi all > > seems something is rather bad with http://wiki.nagios.org/ - anyone > here with access to the box? > Whoever's running the nagios server monitoring wiki.nagios.org must be sleeping or something. Is the Nagios project dying? Vennlige hilsener / Best regards roy -- Roy Sigurd Karlsbakk (+47) 97542685 roy at karlsbakk.net http://blogg.karlsbakk.net/ -- I all pedagogikk er det essensielt at pensum presenteres intelligibelt. Det er et element?rt imperativ for alle pedagoger ? unng? eksessiv anvendelse av idiomer med fremmed opprinnelse. I de fleste tilfeller eksisterer adekvate og relevante synonymer p? norsk. ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Fri Jun 25 14:07:40 2010 From: nagios at flatto.net (Assaf Flatto) Date: Fri, 25 Jun 2010 13:07:40 +0100 Subject: wiki down? In-Reply-To: <24264921.40.1277465906150.JavaMail.root@zimbra> References: <24264921.40.1277465906150.JavaMail.root@zimbra> Message-ID: <4C249C0C.9080506@flatto.net> Roy Sigurd Karlsbakk wrote: > ----- Original Message ----- > >> hi all >> >> seems something is rather bad with http://wiki.nagios.org/ - anyone >> here with access to the box? >> >> > > Whoever's running the nagios server monitoring wiki.nagios.org must be sleeping or something. Is the Nagios project dying? > > Vennlige hilsener / Best regards > > roy > -- > Roy Sigurd Karlsbakk > (+47) 97542685 > roy at karlsbakk.net > http://blogg.karlsbakk.net/ > The Ip of the server points to a Us located server ., they may have not woken up yet , or they are having a HW issue . -- Never,Ever Cut A Deal With a Dragon I am doing a Charity Bike ride On the 27 of June for the Capital to Coast Charity. Please help by Donating http://www.justgiving.com/Lovefilm-capital-to-coast ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From etorres at dap.es Fri Jun 25 14:46:15 2010 From: etorres at dap.es (Esteban Torres) Date: Fri, 25 Jun 2010 14:46:15 +0200 Subject: messages for boss Message-ID: <20100625144615.ee4a6199.etorres@dap.es> I want a user to receive all down of machines, but only receives the services down I say. define contact{ contact_name User alias User1 service_notification_period 24x7 host_notification_period 24x7 service_notification_options c,r host_notification_options d,r service_notification_commands notify-service-by-email host_notification_commands notify-host-by-email email user at domain.com } In all this host so: define host{ host_name Host alias Host address 192.168.1.5 check_command check-host-alive check_interval 5 retry_interval 1 max_check_attempts 3 check_period 24x7 process_perf_data 0 retain_nonstatus_information 0 contact_groups admins-linux contacts User notification_interval 300 notification_period 24x7 notification_options d,u,r } define service{ hostgroup_name ping-servers service_description PING check_command check_ping!100.0,20%!500.0,60% max_check_attempts 3 check_interval 5 retry_interval 1 check_period 24x7 notification_interval 300 notification_period 24x7 notification_options w,u,c,r flap_detection_enabled 0 } define hostgroup { hostgroup_name ping-servers alias Pingable servers members Host } But it sends all down from all hosts, but also sends all critical of all services. As I have to do? ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From drew.weaver at thenap.com Fri Jun 25 15:48:30 2010 From: drew.weaver at thenap.com (Drew Weaver) Date: Fri, 25 Jun 2010 09:48:30 -0400 Subject: Is there a way to set a hard limit on notifications per a single event? Message-ID: Hi, I am wondering if there is a way to set a hard limit on notifications for a change in status for a host/service. Basically it will send no more than say 3 emails about a problem. thanks, -Drew -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Fri Jun 25 15:52:49 2010 From: nagios at flatto.net (Assaf Flatto) Date: Fri, 25 Jun 2010 14:52:49 +0100 Subject: Is there a way to set a hard limit on notifications per a single event? In-Reply-To: References: Message-ID: <4C24B4B1.5050509@flatto.net> Drew Weaver wrote: > > Hi, > > > > I am wondering if there is a way to set a hard limit on notifications > for a change in status for a host/service. > > > > Basically it will send no more than say 3 emails about a problem. > > > > thanks, > > -Drew > > you can use the escalations to do what you need http://nagios.sourceforge.net/docs/3_0/escalations.html just define an escalation for the service itself and limit the alerts. Assaf -- Never,Ever Cut A Deal With a Dragon I am doing a Charity Bike ride On the 27 of June for the Capital to Coast Charity. Please help by Donating http://www.justgiving.com/Lovefilm-capital-to-coast ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ryan.c.ash.lu4w at statefarm.com Fri Jun 25 15:51:56 2010 From: ryan.c.ash.lu4w at statefarm.com (Ryan C Ash) Date: Fri, 25 Jun 2010 06:51:56 -0700 Subject: messages for boss In-Reply-To: <20100625144615.ee4a6199.etorres@dap.es> References: <20100625144615.ee4a6199.etorres@dap.es> Message-ID: <0E224CCCAB9A374293F138E39621972C0E22AB1A@WPSCV6MN.OPR.STATEFARM.ORG> Disable notification on all services except those you are interested in. notifications_enabled = 0 That is a solution...might not be the best. -----Original Message----- From: Esteban Torres [mailto:etorres at dap.es] Sent: Friday, June 25, 2010 7:46 AM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] messages for boss I want a user to receive all down of machines, but only receives the services down I say. define contact{ contact_name User alias User1 service_notification_period 24x7 host_notification_period 24x7 service_notification_options c,r host_notification_options d,r service_notification_commands notify-service-by-email host_notification_commands notify-host-by-email email user at domain.com } In all this host so: define host{ host_name Host alias Host address 192.168.1.5 check_command check-host-alive check_interval 5 retry_interval 1 max_check_attempts 3 check_period 24x7 process_perf_data 0 retain_nonstatus_information 0 contact_groups admins-linux contacts User notification_interval 300 notification_period 24x7 notification_options d,u,r } define service{ hostgroup_name ping-servers service_description PING check_command check_ping!100.0,20%!500.0,60% max_check_attempts 3 check_interval 5 retry_interval 1 check_period 24x7 notification_interval 300 notification_period 24x7 notification_options w,u,c,r flap_detection_enabled 0 } define hostgroup { hostgroup_name ping-servers alias Pingable servers members Host } But it sends all down from all hosts, but also sends all critical of all services. As I have to do? ------------------------------------------------------------------------ ------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Max.Williams at mflow.com Fri Jun 25 15:50:41 2010 From: Max.Williams at mflow.com (Max Williams) Date: Fri, 25 Jun 2010 14:50:41 +0100 Subject: check_openmanage: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage Message-ID: <0357196CB603794BB76F4F6B373F27330506EACE95@SERVER.ddnglobal.local> Hi, After adding more storage to a couple of our servers we are getting this error: [root at host ~]# /usr/lib64/nagios/plugins/check_openmanage -C password -b ctrl_driver=0,1,2 -b ctrl_fw=0,1,2 -b intr=0 -H host2 Temperature Probe 1 in enclosure 3 [MD1000] is Inactive C at ( max) EMM 1 in enclosure 3 [MD1000] needs attention: Not Installed INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage line 2312. INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage line 2312. INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage line 2318. INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage line 2318. INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage line 2318. INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage line 2318. [root at host ~]# We didn't get this error before adding a new cabinet of disks which now brings the total up to 47 (2x internal disk and 3x full MD1000s). Has any one else come across this error? I am not perl literate so not sure how to debug or fix this. Cheers, Max -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From t.h.amundsen at usit.uio.no Fri Jun 25 17:02:07 2010 From: t.h.amundsen at usit.uio.no (Trond Hasle Amundsen) Date: Fri, 25 Jun 2010 17:02:07 +0200 Subject: check_openmanage: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage In-Reply-To: <0357196CB603794BB76F4F6B373F27330506EACE95@SERVER.ddnglobal.local> (Max Williams's message of "Fri, 25 Jun 2010 14:50:41 +0100") References: <0357196CB603794BB76F4F6B373F27330506EACE95@SERVER.ddnglobal.local> Message-ID: <15tocezgnps.fsf@tux.uio.no> Max Williams writes: > Hi, > > After adding more storage to a couple of our servers we are getting this error: > > > > [root at host ~]# /usr/lib64/nagios/plugins/check_openmanage -C password -b > ctrl_driver=0,1,2 -b ctrl_fw=0,1,2 -b intr=0 -H host2 > > Temperature Probe 1 in enclosure 3 [MD1000] is Inactive C at ( max) > > EMM 1 in enclosure 3 [MD1000] needs attention: Not Installed > > INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/ > plugins/check_openmanage line 2312. > > INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/ > plugins/check_openmanage line 2312. > > INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/ > plugins/check_openmanage line 2318. > > INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/ > plugins/check_openmanage line 2318. > > INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/ > plugins/check_openmanage line 2318. > > INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/ > plugins/check_openmanage line 2318. > > [root at host ~]# > > > > We didn?t get this error before adding a new cabinet of disks which now brings > the total up to 47 (2x internal disk and 3x full MD1000s). > > Has any one else come across this error? I am not perl literate so not sure how > to debug or fix this. Hi Max, This is interesting. I've never seen "Inactive" temperature sensors in external enclosures. Also, that the plugin reports missing EMMs seems like a misfeature. Can you post the output from the following commands: On the monitored host: omreport storage enclosure controller= enclosure= info=temps omreport storage enclosure controller= enclosure= info=emms Replace with controller/enclosure pairs. You'll get the enclosure and controller IDs with commands omreport storage controller omreport storage enclosure Also, since you're checking with SNMP, I'll need the output from an snmpwalk of the enclosures wrt. temperatures and EMMs. From the Nagios server: snmpwalk -v2c -c 1.3.6.1.4.1.674.10893.1.20.130.11 snmpwalk -v2c -c 1.3.6.1.4.1.674.10893.1.20.130.13 If you are uncomfortable with posting this information on the mailinglist, feel free to email me directly. Debug output from the plugin could also be useful: check_openmanage -H -C -d Cheers, -- Trond H. Amundsen Center for Information Technology Services, University of Oslo ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Max.Williams at mflow.com Fri Jun 25 18:04:40 2010 From: Max.Williams at mflow.com (Max Williams) Date: Fri, 25 Jun 2010 17:04:40 +0100 Subject: check_openmanage: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage In-Reply-To: <15tocezgnps.fsf@tux.uio.no> References: <0357196CB603794BB76F4F6B373F27330506EACE95@SERVER.ddnglobal.local> <15tocezgnps.fsf@tux.uio.no> Message-ID: <0357196CB603794BB76F4F6B373F27330506EACE96@SERVER.ddnglobal.local> Thanks for the reply, Trond! Both of the new enclosures show the same output so perhaps these just have a different configuration to the others we have here. Here is the output you asked for..... # omreport storage enclosure controller=1 enclosure=1:0 info=temps List of Temperature Probes in Enclosure 1 ID : 0 Status : Ok Name : Temperature Probe 0 State : Ready Reading : 31 C Minimum Warning Threshold : -2 C Maximum Warning Threshold : 65 C Minimum Failure Threshold : -2 C Maximum Failure Threshold : 65 C ID : 1 Status : Unknown Name : Temperature Probe 1 State : Inactive Reading : Not Applicable Minimum Warning Threshold : Not Applicable Maximum Warning Threshold : Not Applicable Minimum Failure Threshold : Not Applicable Maximum Failure Threshold : Not Applicable ID : 2 Status : Ok Name : Temperature Probe 2 State : Ready Reading : 26 C Minimum Warning Threshold : 8 C Maximum Warning Threshold : 50 C Minimum Failure Threshold : 3 C Maximum Failure Threshold : 55 C ID : 3 Status : Ok Name : Temperature Probe 3 State : Ready Reading : 25 C Minimum Warning Threshold : 8 C Maximum Warning Threshold : 50 C Minimum Failure Threshold : 3 C Maximum Failure Threshold : 55 C =================================================================== # omreport storage enclosure controller=1 enclosure=1:0 info=emms List of Enclosure Management Modules in Enclosure 1 ID : 0 Status : Ok Name : EMM 0 State : Ready Part Number : 0JT517A02 Firmware Version : A.04 SCSI Rate : Not Applicable Bus Protocol : Not Available ID : 1 Status : Unknown Name : EMM 1 State : Not Installed Part Number : Firmware Version : SCSI Rate : Not Applicable Bus Protocol : Not Available =================================================================== # snmpwalk -v2c -c password host 1.3.6.1.4.1.674.10893.1.20.130.11 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.1.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.1.2 = INTEGER: 2 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.1.3 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.1.4 = INTEGER: 4 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.1.5 = INTEGER: 5 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.1.6 = INTEGER: 6 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.1.7 = INTEGER: 7 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.1.8 = INTEGER: 8 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.1.9 = INTEGER: 9 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.1.10 = INTEGER: 10 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.1.11 = INTEGER: 11 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.1.12 = INTEGER: 12 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.2.1 = STRING: "Temperature Probe 0" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.2.2 = STRING: "Temperature Probe 1" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.2.3 = STRING: "Temperature Probe 2" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.2.4 = STRING: "Temperature Probe 3" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.2.5 = STRING: "Temperature Probe 0" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.2.6 = STRING: "Temperature Probe 1" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.2.7 = STRING: "Temperature Probe 2" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.2.8 = STRING: "Temperature Probe 3" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.2.9 = STRING: "Temperature Probe 0" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.2.10 = STRING: "Temperature Probe 1" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.2.11 = STRING: "Temperature Probe 2" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.2.12 = STRING: "Temperature Probe 3" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.3.1 = STRING: "DELL" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.3.2 = STRING: "DELL" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.3.3 = STRING: "DELL" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.3.4 = STRING: "DELL" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.3.5 = STRING: "DELL" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.3.6 = STRING: "DELL" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.3.7 = STRING: "DELL" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.3.8 = STRING: "DELL" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.3.9 = STRING: "DELL" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.3.10 = STRING: "DELL" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.3.11 = STRING: "DELL" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.3.12 = STRING: "DELL" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.4.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.4.2 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.4.3 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.4.4 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.4.5 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.4.6 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.4.7 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.4.8 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.4.9 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.4.10 = INTEGER: 9 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.4.11 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.4.12 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.6.1 = STRING: "celsius" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.6.2 = STRING: "celsius" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.6.3 = STRING: "celsius" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.6.4 = STRING: "celsius" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.6.5 = STRING: "celsius" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.6.6 = STRING: "celsius" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.6.7 = STRING: "celsius" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.6.8 = STRING: "celsius" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.6.9 = STRING: "celsius" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.6.10 = STRING: "celsius" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.6.11 = STRING: "celsius" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.6.12 = STRING: "celsius" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.7.1 = INTEGER: -2 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.7.2 = INTEGER: -2 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.7.3 = INTEGER: 8 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.7.4 = INTEGER: 8 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.7.5 = INTEGER: -2 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.7.6 = INTEGER: -2 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.7.7 = INTEGER: 8 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.7.8 = INTEGER: 8 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.7.9 = INTEGER: -2 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.7.11 = INTEGER: 8 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.7.12 = INTEGER: 8 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.8.1 = INTEGER: -2 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.8.2 = INTEGER: -2 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.8.3 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.8.4 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.8.5 = INTEGER: -2 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.8.6 = INTEGER: -2 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.8.7 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.8.8 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.8.9 = INTEGER: -2 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.8.11 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.8.12 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.9.1 = INTEGER: 65 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.9.2 = INTEGER: 65 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.9.3 = INTEGER: 50 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.9.4 = INTEGER: 50 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.9.5 = INTEGER: 65 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.9.6 = INTEGER: 65 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.9.7 = INTEGER: 50 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.9.8 = INTEGER: 50 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.9.9 = INTEGER: 65 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.9.11 = INTEGER: 50 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.9.12 = INTEGER: 50 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.10.1 = INTEGER: 65 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.10.2 = INTEGER: 65 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.10.3 = INTEGER: 55 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.10.4 = INTEGER: 55 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.10.5 = INTEGER: 65 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.10.6 = INTEGER: 65 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.10.7 = INTEGER: 55 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.10.8 = INTEGER: 55 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.10.9 = INTEGER: 65 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.10.11 = INTEGER: 55 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.10.12 = INTEGER: 55 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.11.1 = INTEGER: 31 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.11.2 = INTEGER: 32 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.11.3 = INTEGER: 24 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.11.4 = INTEGER: 24 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.11.5 = INTEGER: 31 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.11.6 = INTEGER: 33 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.11.7 = INTEGER: 25 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.11.8 = INTEGER: 25 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.11.9 = INTEGER: 31 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.11.11 = INTEGER: 26 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.11.12 = INTEGER: 25 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.12.1 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.12.2 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.12.3 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.12.4 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.12.5 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.12.6 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.12.7 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.12.8 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.12.9 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.12.10 = INTEGER: 2 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.12.11 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.12.12 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.13.1 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.13.2 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.13.3 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.13.4 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.13.5 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.13.6 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.13.7 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.13.8 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.13.9 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.13.10 = INTEGER: 2 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.13.11 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.13.12 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.14.1 = STRING: "\\1\\0\\1\\0" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.14.2 = STRING: "\\1\\0\\1\\1" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.14.3 = STRING: "\\1\\0\\1\\2" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.14.4 = STRING: "\\1\\0\\1\\3" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.14.5 = STRING: "\\1\\0\\0\\0" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.14.6 = STRING: "\\1\\0\\0\\1" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.14.7 = STRING: "\\1\\0\\0\\2" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.14.8 = STRING: "\\1\\0\\0\\3" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.14.9 = STRING: "\\1\\1\\0\\0" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.14.10 = STRING: "\\1\\1\\0\\1" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.14.11 = STRING: "\\1\\1\\0\\2" SNMPv2-SMI::enterprises.674.10893.1.20.130.11.1.14.12 = STRING: "\\1\\1\\0\\3" ===================================================================== # snmpwalk -v2c -c password host 1.3.6.1.4.1.674.10893.1.20.130.13 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.1.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.1.2 = INTEGER: 2 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.1.3 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.1.4 = INTEGER: 4 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.1.5 = INTEGER: 5 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.1.6 = INTEGER: 6 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.2.1 = STRING: "EMM 0" SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.2.2 = STRING: "EMM 1" SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.2.3 = STRING: "EMM 0" SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.2.4 = STRING: "EMM 1" SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.2.5 = STRING: "EMM 0" SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.2.6 = STRING: "EMM 1" SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.3.1 = STRING: "DELL" SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.3.2 = STRING: "DELL" SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.3.3 = STRING: "DELL" SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.3.4 = STRING: "DELL" SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.3.5 = STRING: "DELL" SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.3.6 = STRING: "DELL" SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.4.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.4.2 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.4.3 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.4.4 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.4.5 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.4.6 = INTEGER: 5 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.6.1 = STRING: "0JT517A02" SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.6.2 = STRING: "0JT517A02" SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.6.3 = STRING: "0JT517A02" SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.6.4 = STRING: "0JT517A02" SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.6.5 = STRING: "0JT517A02" SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.7.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.7.2 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.7.3 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.7.4 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.7.5 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.7.6 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.8.1 = STRING: "A.04" SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.8.2 = STRING: "A.04" SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.8.3 = STRING: "A.04" SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.8.4 = STRING: "A.04" SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.8.5 = STRING: "A.04" SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.10.1 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.10.2 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.10.3 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.10.4 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.10.5 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.10.6 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.11.1 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.11.2 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.11.3 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.11.4 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.11.5 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.11.6 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.12.1 = STRING: "\\1\\0\\1\\0" SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.12.2 = STRING: "\\1\\0\\1\\1" SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.12.3 = STRING: "\\1\\0\\0\\0" SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.12.4 = STRING: "\\1\\0\\0\\1" SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.12.5 = STRING: "\\1\\1\\0\\0" SNMPv2-SMI::enterprises.674.10893.1.20.130.13.1.12.6 = STRING: "\\1\\1\\0\\1" Let me know if you need anything else. Best Regards, Max Williams -----Original Message----- From: Trond Hasle Amundsen [mailto:t.h.amundsen at usit.uio.no] Sent: 25 June 2010 16:02 To: Nagios Users List Subject: Re: [Nagios-users] check_openmanage: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage Max Williams writes: > Hi, > > After adding more storage to a couple of our servers we are getting this error: > > > > [root at host ~]# /usr/lib64/nagios/plugins/check_openmanage -C password -b > ctrl_driver=0,1,2 -b ctrl_fw=0,1,2 -b intr=0 -H host2 > > Temperature Probe 1 in enclosure 3 [MD1000] is Inactive C at ( max) > > EMM 1 in enclosure 3 [MD1000] needs attention: Not Installed > > INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/ > plugins/check_openmanage line 2312. > > INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/ > plugins/check_openmanage line 2312. > > INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/ > plugins/check_openmanage line 2318. > > INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/ > plugins/check_openmanage line 2318. > > INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/ > plugins/check_openmanage line 2318. > > INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/ > plugins/check_openmanage line 2318. > > [root at host ~]# > > > > We didn?t get this error before adding a new cabinet of disks which now brings > the total up to 47 (2x internal disk and 3x full MD1000s). > > Has any one else come across this error? I am not perl literate so not sure how > to debug or fix this. Hi Max, This is interesting. I've never seen "Inactive" temperature sensors in external enclosures. Also, that the plugin reports missing EMMs seems like a misfeature. Can you post the output from the following commands: On the monitored host: omreport storage enclosure controller= enclosure= info=temps omreport storage enclosure controller= enclosure= info=emms Replace with controller/enclosure pairs. You'll get the enclosure and controller IDs with commands omreport storage controller omreport storage enclosure Also, since you're checking with SNMP, I'll need the output from an snmpwalk of the enclosures wrt. temperatures and EMMs. From the Nagios server: snmpwalk -v2c -c 1.3.6.1.4.1.674.10893.1.20.130.11 snmpwalk -v2c -c 1.3.6.1.4.1.674.10893.1.20.130.13 If you are uncomfortable with posting this information on the mailinglist, feel free to email me directly. Debug output from the plugin could also be useful: check_openmanage -H -C -d Cheers, -- Trond H. Amundsen Center for Information Technology Services, University of Oslo ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From etorres at dap.es Fri Jun 25 19:03:58 2010 From: etorres at dap.es (etorres at dap.es) Date: Fri, 25 Jun 2010 19:03:58 +0200 (CEST) Subject: messages for boss In-Reply-To: <0E224CCCAB9A374293F138E39621972C0E22AB1A@WPSCV6MN.OPR.STATEFARM.ORG> References: <20100625144615.ee4a6199.etorres@dap.es> <0E224CCCAB9A374293F138E39621972C0E22AB1A@WPSCV6MN.OPR.STATEFARM.ORG> Message-ID: <62101.10.160.5.67.1277485438.squirrel@correoweb.dap.es> if disable notifications for all services. Not notify the other, no? I think I will try, with: servicescalation hostescalation > Disable notification on all services except those you are interested in. > > notifications_enabled = 0 > > That is a solution...might not be the best. > > -----Original Message----- > From: Esteban Torres [mailto:etorres at dap.es] > Sent: Friday, June 25, 2010 7:46 AM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] messages for boss > > > I want a user to receive all down of machines, but only receives the > services down I say. > > define contact{ > contact_name User > alias User1 > service_notification_period 24x7 > host_notification_period 24x7 > service_notification_options c,r > host_notification_options d,r > service_notification_commands notify-service-by-email > host_notification_commands notify-host-by-email > email user at domain.com > } > > In all this host so: > > define host{ > host_name Host > alias Host > address 192.168.1.5 > > check_command check-host-alive > check_interval 5 > retry_interval 1 > max_check_attempts 3 > check_period 24x7 > process_perf_data 0 > retain_nonstatus_information 0 > contact_groups admins-linux > contacts User > notification_interval 300 > notification_period 24x7 > notification_options d,u,r > } > > define service{ > hostgroup_name ping-servers > service_description PING > check_command check_ping!100.0,20%!500.0,60% > max_check_attempts 3 > check_interval 5 > retry_interval 1 > check_period 24x7 > notification_interval 300 > notification_period 24x7 > notification_options w,u,c,r > flap_detection_enabled 0 > } > > define hostgroup { > hostgroup_name ping-servers > alias Pingable servers > members Host > } > > > > But it sends all down from all hosts, but also sends all critical of all > services. > > As I have to do? > > > > ------------------------------------------------------------------------ > ------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's > Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the > prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > ------------------------------------------------------------------------------ > ThinkGeek and WIRED's GeekDad team up for the Ultimate > GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the > lucky parental unit. See the prize list and enter to win: > http://p.sf.net/sfu/thinkgeek-promo > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From t.h.amundsen at usit.uio.no Fri Jun 25 19:20:28 2010 From: t.h.amundsen at usit.uio.no (Trond Hasle Amundsen) Date: Fri, 25 Jun 2010 19:20:28 +0200 Subject: check_openmanage: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage In-Reply-To: <0357196CB603794BB76F4F6B373F27330506EACE96@SERVER.ddnglobal.local> (Max Williams's message of "Fri, 25 Jun 2010 17:04:40 +0100") References: <0357196CB603794BB76F4F6B373F27330506EACE95@SERVER.ddnglobal.local> <15tocezgnps.fsf@tux.uio.no> <0357196CB603794BB76F4F6B373F27330506EACE96@SERVER.ddnglobal.local> Message-ID: <15tk4pnghb7.fsf@tux.uio.no> Max Williams writes: > Both of the new enclosures show the same output so perhaps these just > have a different configuration to the others we have here. Yes. I suspect that the is related to one EMM not being installed. My guess is that the inactive temperature sensor is located in the EMM, but there is no way to tell since neither the omreport output nor the SNMP output reveals the location of the temperature sensors. Or perhaps the EMM is needed to activate the sensor. We always order our MD1000s with 2 EMMs, so this is something that I haven't had the opportunity to test. I have created a test version for you to try. This version should: * report inactive temperature sensors as OK * report EMMs with state "Not Installed" as OK In addition it checks that the reading from the sensors are in fact digits before attempting to print the values. The test version is located here: http://folk.uio.no/trondham/software/beta/ Try it with the '-d' option to see that it reports these things properly. Cheers, -- Trond H. Amundsen Center for Information Technology Services, University of Oslo ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From darren at brucetelecom.com Fri Jun 25 20:55:17 2010 From: darren at brucetelecom.com (Darren Hill) Date: Fri, 25 Jun 2010 14:55:17 -0400 Subject: compiling nagios plugins on sun cobalt In-Reply-To: <1274817654.2513.19.camel@localhost.localdomain> References: <4BFC0604.8020506@brucetelecom.com> <1274817654.2513.19.camel@localhost.localdomain> Message-ID: <4C24FB95.3080406@brucetelecom.com> Hi, I'm on a sun cobalt box, and having trouble compiling the nagios plugins. I have --without-pgsql, but ldaps and check_smart_ide keep failing. What are the commands to disable configuring the plugins without those so it doesn't fail on make? Thanks ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mlitwin at stubhub.com Fri Jun 25 22:05:26 2010 From: mlitwin at stubhub.com (Litwin, Matthew) Date: Fri, 25 Jun 2010 14:05:26 -0600 Subject: groups of hostgroups? Message-ID: <09EDEE59-6A79-45A1-B4D0-FF8F5B334EF9@stubhub.com> It doesn't appear that there is a way to have a way to include hostgroups in other hostgroups, but is there some other way to get this behavior? Since my environment has several dozen types of servers in our environment, it would be helpful to define a "class" of host somehow rather than having servers be listed explicitly in multiple hostgroups. Any ideas? ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Fri Jun 25 23:04:52 2010 From: nagios at flatto.net (Assaf Flatto) Date: Fri, 25 Jun 2010 22:04:52 +0100 Subject: groups of hostgroups? In-Reply-To: <09EDEE59-6A79-45A1-B4D0-FF8F5B334EF9@stubhub.com> References: <09EDEE59-6A79-45A1-B4D0-FF8F5B334EF9@stubhub.com> Message-ID: <4C2519F4.50608@flatto.net> Please read the definitions of hostgroups again and you'll see that what you need is already included in the nagios conf http://nagios.sourceforge.net/docs/3_0/objectdefinitions.html#hostgroup Litwin, Matthew wrote: > It doesn't appear that there is a way to have a way to include hostgroups in other hostgroups, but is there some other way to get this behavior? Since my environment has several dozen types of servers in our environment, it would be helpful to define a "class" of host somehow rather than having servers be listed explicitly in multiple hostgroups. Any ideas? ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From work at paul.dubuc.org Fri Jun 25 23:11:03 2010 From: work at paul.dubuc.org (Paul M. Dubuc) Date: Fri, 25 Jun 2010 17:11:03 -0400 Subject: groups of hostgroups? In-Reply-To: <09EDEE59-6A79-45A1-B4D0-FF8F5B334EF9@stubhub.com> References: <09EDEE59-6A79-45A1-B4D0-FF8F5B334EF9@stubhub.com> Message-ID: <4C251B67.5040205@paul.dubuc.org> Litwin, Matthew wrote: > It doesn't appear that there is a way to have a way to include hostgroups > in other hostgroups, but is there some other way to get this behavior? > Since my environment has several dozen types of servers in our environment, > it would be helpful to define a "class" of host somehow rather than having > servers be listed explicitly in multiple hostgroups. Any ideas? I use templates to add hosts and services to groups. If the definition inherits from more than one template the 'hostgroups' or 'servicegroups' specifier will replace whatever was specified previously unless you prefix the group name with a plus sign (+). Then it adds the group to whatever other groups are specified: define hostgroup{ hostgroup_name HG_ALPHA ... } define host{ name alpha-host register 0 ; this is a template hostgroups +HG_ALPHA ... } define hostgroup{ hostgroup_name HG_BETA ... } # # Nagios service definition template used by services in this config file # define host{ name beta-host register 0 ; this is a template use alpha-host hostgroups +HG_BETA } Now any host that uses the beta-host template is put in both the HG_BETA hostgroup and the HG_ALPHA hostgroup. This effectively puts the HG_BETA group within the HG_ALPA group. Hope this helps. Same thing can be done with servicegroups of course. ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From roy at karlsbakk.net Sat Jun 26 19:07:45 2010 From: roy at karlsbakk.net (Roy Sigurd Karlsbakk) Date: Sat, 26 Jun 2010 19:07:45 +0200 (CEST) Subject: wiki down? In-Reply-To: <4C249C0C.9080506@flatto.net> References: <4C249C0C.9080506@flatto.net> Message-ID: <6661588.34.1277572065602.JavaMail.root@zimbra> > The Ip of the server points to a Us located server ., > > they may have not woken up yet , or they are having a HW issue . Well, it's still down. Vennlige hilsener / Best regards roy -- Roy Sigurd Karlsbakk (+47) 97542685 roy at karlsbakk.net http://blogg.karlsbakk.net/ -- I all pedagogikk er det essensielt at pensum presenteres intelligibelt. Det er et element?rt imperativ for alle pedagoger ? unng? eksessiv anvendelse av idiomer med fremmed opprinnelse. I de fleste tilfeller eksisterer adekvate og relevante synonymer p? norsk. ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From wwanghongrui at cebbank.com Mon Jun 28 05:15:43 2010 From: wwanghongrui at cebbank.com (wwanghongrui) Date: Mon, 28 Jun 2010 11:15:43 +0800 Subject: check_oracle work issue Message-ID: <201006281115417657553@cebbank.com> Hi, My nagios server is SUSE10-SP2 + NAGIOS 3.2.0 + nagios-plugins-1.4.13 . I edit the ORACLE_HOME and other environment in /usr/local/nagios/libexec/check_oracle, and add tns items on tnsnames.ora . Then, I use this plugin to check oracle server which running on Windows. I run the command "/usr/local/nagios/libexec/check_oracle --tns 10.1.88.9", it result OK - reply time 10 msec from 10.1.88.9. But, when I add service to nagios, nagios always return "No TNS Listener on 10.1.88.9". My nagios configuration is like below: define host{ host_name AAA use generic-host alias AAA_10.1.88.9 address 10.1.88.9 hostgroups Windows_Servers check_command check-host-alive max_check_attempts 3 check_interval 6 active_checks_enabled 1 check_period 24x7 contact_groups admins, Supervisors notification_interval 0 notification_period 24x7 notification_options d,u,r,f,s notifications_enabled 1 } define service{ host_name AAA service_description check_AAA_oracle use generic-service servicegroups Linux-Services-Group check_command check_oracle_tnsping!10.1.88.9 max_check_attempts 3 normal_check_interval 6 retry_check_interval 1 active_checks_enabled 1 check_period 24x7 notification_interval 0 notification_period 24x7 notification_options w,u,c,r,f,s notifications_enabled 1 contact_groups admins } define command{ command_name check_oracle_tnsping command_line $USER1$/check_oracle --tns $ARG1$ } Do you have some ideas on resolve this issue? Sorry my bad English. Regards Hrwang mail:wwanghongrui at cebbank.com 2010-06-28 -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Max.Williams at mflow.com Mon Jun 28 11:21:19 2010 From: Max.Williams at mflow.com (Max Williams) Date: Mon, 28 Jun 2010 10:21:19 +0100 Subject: check_openmanage: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage In-Reply-To: <15tk4pnghb7.fsf@tux.uio.no> References: <0357196CB603794BB76F4F6B373F27330506EACE95@SERVER.ddnglobal.local> <15tocezgnps.fsf@tux.uio.no> <0357196CB603794BB76F4F6B373F27330506EACE96@SERVER.ddnglobal.local> <15tk4pnghb7.fsf@tux.uio.no> Message-ID: <0357196CB603794BB76F4F6B373F27330506EACE97@SERVER.ddnglobal.local> Thanks for the really fast response! Here is the output, the inactive temperature probe is sorted but the missing EMM still produces an alert: [root at host1 ~]# ./check_openmanage -v check_openmanage 3.5.9-beta1 Copyright (C) 2010 Trond H. Amundsen License GPLv3+: GNU GPL version 3 or later This is free software: you are free to change and redistribute it. There is NO WARRANTY, to the extent permitted by law. Written by Trond H. Amundsen [root at host1 ~]# ./check_openmanage -C password -d -H host2 System: PowerEdge 2950 ServiceTag: JY5CB4J OMSA version: unknown BIOS/date: 2.5.0 09/12/2008 Plugin version: 3.5.9-beta1 ----------------------------------------------------------------------------- Storage Components ============================================================================= STATE | ID | MESSAGE TEXT ---------+----------+-------------------------------------------------------- WARNING | 0 | Controller 0 [PERC 6/i Integrated]: Firmware '6.1.1-0047' is out of date OK | 0 | Controller 0 [PERC 6/i Integrated] is Degraded WARNING | 1 | Controller 1 [PERC 6/E Adapter]: Firmware '6.1.1-0047' is out of date OK | 1 | Controller 1 [PERC 6/E Adapter] is Degraded OK | 0:0:0:0 | Physical Disk 0:0:0 [146GB] on ctrl 0 is Online OK | 0:0:0:1 | Physical Disk 0:0:1 [146GB] on ctrl 0 is Online OK | 1:0:1:14 | Physical Disk 0:1:14 [1.0TB] on ctrl 1 is Online OK | 1:0:1:13 | Physical Disk 0:1:13 [1.0TB] on ctrl 1 is Online OK | 1:0:1:12 | Physical Disk 0:1:12 [1.0TB] on ctrl 1 is Online OK | 1:0:1:11 | Physical Disk 0:1:11 [1.0TB] on ctrl 1 is Online OK | 1:0:1:10 | Physical Disk 0:1:10 [1.0TB] on ctrl 1 is Online OK | 1:0:1:9 | Physical Disk 0:1:9 [1.0TB] on ctrl 1 is Online OK | 1:0:1:8 | Physical Disk 0:1:8 [1.0TB] on ctrl 1 is Online OK | 1:0:1:7 | Physical Disk 0:1:7 [1.0TB] on ctrl 1 is Online OK | 1:0:1:6 | Physical Disk 0:1:6 [1.0TB] on ctrl 1 is Online OK | 1:0:1:5 | Physical Disk 0:1:5 [1.0TB] on ctrl 1 is Online OK | 1:0:1:4 | Physical Disk 0:1:4 [1.0TB] on ctrl 1 is Online OK | 1:0:1:3 | Physical Disk 0:1:3 [1.0TB] on ctrl 1 is Online OK | 1:0:1:2 | Physical Disk 0:1:2 [1.0TB] on ctrl 1 is Online OK | 1:0:1:1 | Physical Disk 0:1:1 [1.0TB] on ctrl 1 is Online OK | 1:0:1:0 | Physical Disk 0:1:0 [1.0TB] on ctrl 1 is Online OK | 1:0:0:14 | Physical Disk 0:0:14 [1.0TB] on ctrl 1 is Online OK | 1:0:0:13 | Physical Disk 0:0:13 [1.0TB] on ctrl 1 is Online OK | 1:0:0:12 | Physical Disk 0:0:12 [1.0TB] on ctrl 1 is Online OK | 1:0:0:11 | Physical Disk 0:0:11 [1.0TB] on ctrl 1 is Online OK | 1:0:0:10 | Physical Disk 0:0:10 [1.0TB] on ctrl 1 is Online OK | 1:0:0:9 | Physical Disk 0:0:9 [1.0TB] on ctrl 1 is Online OK | 1:0:0:8 | Physical Disk 0:0:8 [1.0TB] on ctrl 1 is Online OK | 1:0:0:7 | Physical Disk 0:0:7 [1.0TB] on ctrl 1 is Online OK | 1:0:0:6 | Physical Disk 0:0:6 [1.0TB] on ctrl 1 is Online OK | 1:0:0:5 | Physical Disk 0:0:5 [1.0TB] on ctrl 1 is Online OK | 1:0:0:4 | Physical Disk 0:0:4 [1.0TB] on ctrl 1 is Online OK | 1:0:0:3 | Physical Disk 0:0:3 [1.0TB] on ctrl 1 is Online OK | 1:0:0:2 | Physical Disk 0:0:2 [1.0TB] on ctrl 1 is Online OK | 1:0:0:1 | Physical Disk 0:0:1 [1.0TB] on ctrl 1 is Online OK | 1:0:0:0 | Physical Disk 0:0:0 [1.0TB] on ctrl 1 is Online OK | 1:1:0:14 | Physical Disk 1:0:14 [2.0TB] on ctrl 1 is Online OK | 1:1:0:13 | Physical Disk 1:0:13 [2.0TB] on ctrl 1 is Online OK | 1:1:0:12 | Physical Disk 1:0:12 [2.0TB] on ctrl 1 is Online OK | 1:1:0:11 | Physical Disk 1:0:11 [2.0TB] on ctrl 1 is Online OK | 1:1:0:10 | Physical Disk 1:0:10 [2.0TB] on ctrl 1 is Online OK | 1:1:0:9 | Physical Disk 1:0:9 [2.0TB] on ctrl 1 is Online OK | 1:1:0:8 | Physical Disk 1:0:8 [2.0TB] on ctrl 1 is Online OK | 1:1:0:7 | Physical Disk 1:0:7 [2.0TB] on ctrl 1 is Online OK | 1:1:0:6 | Physical Disk 1:0:6 [2.0TB] on ctrl 1 is Online OK | 1:1:0:5 | Physical Disk 1:0:5 [2.0TB] on ctrl 1 is Online OK | 1:1:0:4 | Physical Disk 1:0:4 [2.0TB] on ctrl 1 is Online OK | 1:1:0:3 | Physical Disk 1:0:3 [2.0TB] on ctrl 1 is Online OK | 1:1:0:2 | Physical Disk 1:0:2 [2.0TB] on ctrl 1 is Online OK | 1:1:0:1 | Physical Disk 1:0:1 [2.0TB] on ctrl 1 is Online OK | 1:1:0:0 | Physical Disk 1:0:0 [2.0TB] on ctrl 1 is Online OK | 0:0 | Logical drive '/dev/sda' [RAID-1, 136.12 GB] is Ready OK | 1:0 | Logical drive '/dev/sdb' [RAID-6, 26068.00 GB] is Ready OK | 1:1 | Logical drive '/dev/sdc' [RAID-6, 24212.50 GB] is Ready OK | 0:0 | Cache battery 0 in controller 0 is Ready OK | 1:0 | Cache battery 0 in controller 1 is Ready OK | 0:0 | Connector 0 [SAS] on controller 0 is Ready OK | 0:1 | Connector 1 [SAS] on controller 0 is Ready OK | 1:0 | Connector 0 [SAS] on controller 1 is Ready OK | 1:1 | Connector 1 [SAS] on controller 1 is Ready OK | 0:0:0 | Enclosure 0:0:0 [Backplane] on controller 0 is Ready OK | 1:0:1 | Enclosure 1:0:1 [MD1000] on controller 1 is Ready OK | 1:0:0 | Enclosure 1:0:0 [MD1000] on controller 1 is Ready OK | 1:1:0 | Enclosure 1:1:0 [MD1000] on controller 1 is Ready OK | 1:0:1:1 | Fan 1 in enclosure 1 [MD1000] is Ready (speed=slow) OK | 1:0:1:2 | Fan 2 in enclosure 1 [MD1000] is Ready (speed=slow) OK | 1:0:1:3 | Fan 3 in enclosure 1 [MD1000] is Ready (speed=slow) OK | 1:0:1:4 | Fan 4 in enclosure 1 [MD1000] is Ready (speed=slow) OK | 1:0:0:1 | Fan 1 in enclosure 2 [MD1000] is Ready (speed=slow) OK | 1:0:0:2 | Fan 2 in enclosure 2 [MD1000] is Ready (speed=slow) OK | 1:0:0:3 | Fan 3 in enclosure 2 [MD1000] is Ready (speed=slow) OK | 1:0:0:4 | Fan 4 in enclosure 2 [MD1000] is Ready (speed=slow) OK | 1:1:0:1 | Fan 1 in enclosure 3 [MD1000] is Ready (speed=slow) OK | 1:1:0:2 | Fan 2 in enclosure 3 [MD1000] is Ready (speed=slow) OK | 1:1:0:3 | Fan 3 in enclosure 3 [MD1000] is Ready (speed=slow) OK | 1:1:0:4 | Fan 4 in enclosure 3 [MD1000] is Ready (speed=slow) OK | 1:0:1:1 | Power Supply 1 in enclosure 1 [MD1000] is Ready OK | 1:0:1:2 | Power Supply 2 in enclosure 1 [MD1000] is Ready OK | 1:0:0:1 | Power Supply 1 in enclosure 2 [MD1000] is Ready OK | 1:0:0:2 | Power Supply 2 in enclosure 2 [MD1000] is Ready OK | 1:1:0:1 | Power Supply 1 in enclosure 3 [MD1000] is Ready OK | 1:1:0:2 | Power Supply 2 in enclosure 3 [MD1000] is Ready OK | 1:0:1:0 | Temperature Probe 0 in enclosure 1 [MD1000]: 31 C (65 max) OK | 1:0:1:1 | Temperature Probe 1 in enclosure 1 [MD1000]: 32 C (65 max) OK | 1:0:1:2 | Temperature Probe 2 in enclosure 1 [MD1000]: 24 C (55 max) OK | 1:0:1:3 | Temperature Probe 3 in enclosure 1 [MD1000]: 24 C (55 max) OK | 1:0:0:0 | Temperature Probe 0 in enclosure 2 [MD1000]: 31 C (65 max) OK | 1:0:0:1 | Temperature Probe 1 in enclosure 2 [MD1000]: 33 C (65 max) OK | 1:0:0:2 | Temperature Probe 2 in enclosure 2 [MD1000]: 25 C (55 max) OK | 1:0:0:3 | Temperature Probe 3 in enclosure 2 [MD1000]: 25 C (55 max) OK | 1:1:0:0 | Temperature Probe 0 in enclosure 3 [MD1000]: 31 C (65 max) OK | 1:1:0:1 | Temperature Probe 1 in enclosure 3 [MD1000] is Inactive OK | 1:1:0:2 | Temperature Probe 2 in enclosure 3 [MD1000]: C ( max) OK | 1:1:0:3 | Temperature Probe 3 in enclosure 3 [MD1000]: C ( max) OK | 1:0:1:0 | EMM 0 in enclosure 1 [MD1000] is Ready OK | 1:0:1:1 | EMM 1 in enclosure 1 [MD1000] is Ready OK | 1:0:0:0 | EMM 0 in enclosure 2 [MD1000] is Ready OK | 1:0:0:1 | EMM 1 in enclosure 2 [MD1000] is Ready OK | 1:1:0:0 | EMM 0 in enclosure 3 [MD1000] is Ready CRITICAL | 1:1:0:1 | EMM 1 in enclosure 3 [MD1000] needs attention: Not Installed ----------------------------------------------------------------------------- Chassis Components ============================================================================= STATE | ID | MESSAGE TEXT ---------+------+------------------------------------------------------------ OK | 1 | Memory module 1 [DIMM1, 4096 MB] is Ok OK | 2 | Memory module 2 [DIMM2, 4096 MB] is Ok OK | 3 | Memory module 3 [DIMM3, 4096 MB] is Ok OK | 4 | Memory module 4 [DIMM4, 4096 MB] is Ok OK | 5 | Memory module 5 [DIMM5, 4096 MB] is Ok OK | 6 | Memory module 6 [DIMM6, 4096 MB] is Ok OK | 7 | Memory module 7 [DIMM7, 4096 MB] is Ok OK | 8 | Memory module 8 [DIMM8, 4096 MB] is Ok OK | 1 | Chassis fan 1 [System Board FAN 1 RPM]: 8700 OK | 2 | Chassis fan 2 [System Board FAN 2 RPM]: 8850 OK | 3 | Chassis fan 3 [System Board FAN 3 RPM]: 8775 OK | 4 | Chassis fan 4 [System Board FAN 4 RPM]: 8400 OK | 0 | Power Supply 0 [AC]: Presence detected OK | 1 | Power Supply 1 [AC]: Presence detected OK | 0 | Temperature Probe 0 [System Board Ambient Temp] reads 16 C (min=8/3, max=42/47) OK | 0 | Processor 0 [Intel Xeon E5420 2.50GHz] is Present OK | 0 | Voltage sensor 0 [CPU1 VCORE] is Good OK | 1 | Voltage sensor 1 [System Board CPU VTT] is Good OK | 2 | Voltage sensor 2 [System Board 1.5V PG] is Good OK | 3 | Voltage sensor 3 [System Board 1.8V PG] is Good OK | 4 | Voltage sensor 4 [System Board 3.3V PG] is Good OK | 5 | Voltage sensor 5 [System Board 5V PG] is Good OK | 6 | Voltage sensor 6 [Riser 1.5V PXH PG] is Good OK | 7 | Voltage sensor 7 [Riser 5V Riser PG] is Good OK | 8 | Voltage sensor 8 [System Board Backplane PG] is Good OK | 9 | Voltage sensor 9 [System Board Linear PG] is Good OK | 10 | Voltage sensor 10 [System Board 0.9V PG] is Good OK | 11 | Voltage sensor 11 [System Board 0.9V Over Volt] is Good OK | 12 | Voltage sensor 12 [System Board CPU Power Fault] is Good OK | 13 | Voltage sensor 13 [PS 1 Voltage 1] is 262.000 V OK | 14 | Voltage sensor 14 [PS 2 Voltage 2] is 260.000 V OK | 0 | Battery probe 0 [System Board CMOS Battery] is Presence Detected OK | 0 | Amperage probe 0 [PS 1 Current 1] reads 0.6 A OK | 1 | Amperage probe 1 [PS 2 Current 2] reads 0.6 A OK | 2 | Amperage probe 2 [System Board System Level] reads 264 W OK | 0 | Chassis intrusion 0 detection: Ok (Not Breached) ----------------------------------------------------------------------------- Other messages ============================================================================= STATE | MESSAGE TEXT ---------+------------------------------------------------------------------- OK | ESM log health is Ok (less than 80% full) INTERNAL ERROR: Use of uninitialized value in sprintf at ./check_openmanage line 2329. INTERNAL ERROR: Use of uninitialized value in sprintf at ./check_openmanage line 2329. INTERNAL ERROR: Use of uninitialized value in sprintf at ./check_openmanage line 2329. INTERNAL ERROR: Use of uninitialized value in sprintf at ./check_openmanage line 2329.[root at host1 ~]# ==================================================================================================================================================== -----Original Message----- From: Trond Hasle Amundsen [mailto:t.h.amundsen at usit.uio.no] Sent: 25 June 2010 18:20 To: Nagios Users List Subject: Re: [Nagios-users] check_openmanage: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage Max Williams writes: > Both of the new enclosures show the same output so perhaps these just > have a different configuration to the others we have here. Yes. I suspect that the is related to one EMM not being installed. My guess is that the inactive temperature sensor is located in the EMM, but there is no way to tell since neither the omreport output nor the SNMP output reveals the location of the temperature sensors. Or perhaps the EMM is needed to activate the sensor. We always order our MD1000s with 2 EMMs, so this is something that I haven't had the opportunity to test. I have created a test version for you to try. This version should: * report inactive temperature sensors as OK * report EMMs with state "Not Installed" as OK In addition it checks that the reading from the sensors are in fact digits before attempting to print the values. The test version is located here: http://folk.uio.no/trondham/software/beta/ Try it with the '-d' option to see that it reports these things properly. Cheers, -- Trond H. Amundsen Center for Information Technology Services, University of Oslo ------------------------------------------------------------------------------ ThinkGeek and WIRED's GeekDad team up for the Ultimate GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the lucky parental unit. See the prize list and enter to win: http://p.sf.net/sfu/thinkgeek-promo _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From wwanghongrui at cebbank.com Mon Jun 28 11:22:00 2010 From: wwanghongrui at cebbank.com (wwanghongrui) Date: Mon, 28 Jun 2010 17:22:00 +0800 Subject: how to fix excessive latency Message-ID: <201006281721596568296@cebbank.com> Hi,guys~ Our nagios server envrionment: Nagios3.2.0 + Suse10-sp2 x86_64 + 8 GB mem + 4 x ( Xeon(R) CPU E7420 @ 2.13GHz ) We have 500+ active check hosts and 3k+ active check services. I have adjust some perfomance parameters in nagios.cfg, like below: use_large_installation_tweaks=1 child_processes_fork_twice=0 enable_environment_macros=0 check_result_reaper_frequency=5 max_check_result_reaper_time=30 But, The nagios performance is still bad, like below: Services Actively Checked:Time FrameServices Checked <= 1 minute:271 (9.4%) <= 5 minutes:1749 (60.4%) <= 15 minutes:2824 (97.4%) <= 1 hour:2898 (100.0%) Since program start: 2869 (99.0%) MetricMin.Max.Average Check Execution Time: 0.09 sec32.23 sec1.113 sec Check Latency:1.12 sec212.59 sec116.329 sec Percent State Change:0.00%23.88%0.05% Hosts Acrively Checked:Time FrameHosts Checked <= 1 minute:32 (5.5%) <= 5 minutes:419 (71.5%) <= 15 minutes:586 (100.0%) <= 1 hour:586 (100.0%) Since program start: 586 (100.0%) MetricMin.Max.Average Check Execution Time: 0.08 sec4.29 sec3.035 sec Check Latency:0.00 sec135.25 sec116.420 sec Percent State Change:0.00%11.32%0.09% How could I find which services check or hosts check cause this seriously check latency? Regards HongRui Wang mail: wwanghongrui at cebbank.com 2010-06-28 -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From standalone.sysadmin at gmail.com Mon Jun 28 14:29:42 2010 From: standalone.sysadmin at gmail.com (Matt Simmons) Date: Mon, 28 Jun 2010 08:29:42 -0400 Subject: wiki down? In-Reply-To: <6661588.34.1277572065602.JavaMail.root@zimbra> References: <4C249C0C.9080506@flatto.net> <6661588.34.1277572065602.JavaMail.root@zimbra> Message-ID: If only there were some kind of software available to let us know when websites were down... On Sat, Jun 26, 2010 at 1:07 PM, Roy Sigurd Karlsbakk wrote: >> The Ip of the server points to a Us located server ., >> >> they may have not woken up yet , or they are having a HW issue . > > Well, it's still down. > > Vennlige hilsener / Best regards > > roy > -- > Roy Sigurd Karlsbakk > (+47) 97542685 > roy at karlsbakk.net > http://blogg.karlsbakk.net/ > -- > I all pedagogikk er det essensielt at pensum presenteres intelligibelt. Det er et element?rt imperativ for alle pedagoger ? unng? eksessiv anvendelse av idiomer med fremmed opprinnelse. I de fleste tilfeller eksisterer adekvate og relevante synonymer p? norsk. > > ------------------------------------------------------------------------------ > This SF.net email is sponsored by Sprint > What will you do first with EVO, the first 4G phone? > Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null -- LITTLE GIRL: But which cookie will you eat FIRST? COOKIE MONSTER: Me think you have misconception of cookie-eating process. ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From perldork at webwizarddesign.com Mon Jun 28 14:34:48 2010 From: perldork at webwizarddesign.com (Max) Date: Mon, 28 Jun 2010 08:34:48 -0400 Subject: wiki down? In-Reply-To: References: <4C249C0C.9080506@flatto.net> <6661588.34.1277572065602.JavaMail.root@zimbra> Message-ID: On Mon, Jun 28, 2010 at 8:29 AM, Matt Simmons wrote: > If only there were some kind of software available to let us know when > websites were down... Or people to respond to alerts from the software :) ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From t.h.amundsen at usit.uio.no Mon Jun 28 16:21:29 2010 From: t.h.amundsen at usit.uio.no (Trond Hasle Amundsen) Date: Mon, 28 Jun 2010 16:21:29 +0200 Subject: check_openmanage: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage In-Reply-To: <0357196CB603794BB76F4F6B373F27330506EACE97@SERVER.ddnglobal.local> (Max Williams's message of "Mon, 28 Jun 2010 10:21:19 +0100") References: <0357196CB603794BB76F4F6B373F27330506EACE95@SERVER.ddnglobal.local> <15tocezgnps.fsf@tux.uio.no> <0357196CB603794BB76F4F6B373F27330506EACE96@SERVER.ddnglobal.local> <15tk4pnghb7.fsf@tux.uio.no> <0357196CB603794BB76F4F6B373F27330506EACE97@SERVER.ddnglobal.local> Message-ID: <15td3vbfdau.fsf@tux.uio.no> Max Williams writes: > Here is the output, the inactive temperature probe is sorted but the > missing EMM still produces an alert: > > OK | 1:1:0:1 | Temperature Probe 1 in enclosure 3 [MD1000] is Inactive This one works as expected :) > OK | 1:1:0:2 | Temperature Probe 2 in enclosure 3 [MD1000]: C ( max) > OK | 1:1:0:3 | Temperature Probe 3 in enclosure 3 [MD1000]: C ( max) Hmm... something strange going on here. I wonder why this happens, in the SNMP output you attached previously the values are there. Anyway, I've added some extra checking in the code to make it report better if the reading is unavailable for some reason. It should now report simply: Temperature Probe 0 in enclosure 2:0:0 [MD1000] is Ready if the temp reading is not an integer and OMSA reports the status as OK. > CRITICAL | 1:1:0:1 | EMM 1 in enclosure 3 [MD1000] needs attention: Not Installed Ah.. I misread the SNMP output.. The status is "Unknown" when reported by omreport, but "Other" when reported with SNMP. One little annoying difference between the two.. The output should be: EMM 0 in enclosure 2:0:0 [MD1000] is Not Installed with an OK state. I've created a second test version: http://folk.uio.no/trondham/software/beta/check_openmanage Please give this one a try and see if it performs better. Cheers, -- Trond H. Amundsen Center for Information Technology Services, University of Oslo ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Max.Williams at mflow.com Mon Jun 28 17:17:34 2010 From: Max.Williams at mflow.com (Max Williams) Date: Mon, 28 Jun 2010 16:17:34 +0100 Subject: check_openmanage: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage In-Reply-To: <15td3vbfdau.fsf@tux.uio.no> References: <0357196CB603794BB76F4F6B373F27330506EACE95@SERVER.ddnglobal.local> <15tocezgnps.fsf@tux.uio.no> <0357196CB603794BB76F4F6B373F27330506EACE96@SERVER.ddnglobal.local> <15tk4pnghb7.fsf@tux.uio.no> <0357196CB603794BB76F4F6B373F27330506EACE97@SERVER.ddnglobal.local> <15td3vbfdau.fsf@tux.uio.no> Message-ID: <0357196CB603794BB76F4F6B373F27330506EACE9B@SERVER.ddnglobal.local> Excellent, sorted, everything reports as OK now. Thanks so much Trond, amazing support and an amazingly useful plugin! Best Regards, Max Williams -----Original Message----- From: Trond Hasle Amundsen [mailto:t.h.amundsen at usit.uio.no] Sent: 28 June 2010 15:21 To: Nagios Users List Subject: Re: [Nagios-users] check_openmanage: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage Max Williams writes: > Here is the output, the inactive temperature probe is sorted but the > missing EMM still produces an alert: > > OK | 1:1:0:1 | Temperature Probe 1 in enclosure 3 [MD1000] is Inactive This one works as expected :) > OK | 1:1:0:2 | Temperature Probe 2 in enclosure 3 [MD1000]: C ( max) > OK | 1:1:0:3 | Temperature Probe 3 in enclosure 3 [MD1000]: C ( max) Hmm... something strange going on here. I wonder why this happens, in the SNMP output you attached previously the values are there. Anyway, I've added some extra checking in the code to make it report better if the reading is unavailable for some reason. It should now report simply: Temperature Probe 0 in enclosure 2:0:0 [MD1000] is Ready if the temp reading is not an integer and OMSA reports the status as OK. > CRITICAL | 1:1:0:1 | EMM 1 in enclosure 3 [MD1000] needs attention: Not Installed Ah.. I misread the SNMP output.. The status is "Unknown" when reported by omreport, but "Other" when reported with SNMP. One little annoying difference between the two.. The output should be: EMM 0 in enclosure 2:0:0 [MD1000] is Not Installed with an OK state. I've created a second test version: http://folk.uio.no/trondham/software/beta/check_openmanage Please give this one a try and see if it performs better. Cheers, -- Trond H. Amundsen Center for Information Technology Services, University of Oslo ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From standalone.sysadmin at gmail.com Mon Jun 28 17:17:49 2010 From: standalone.sysadmin at gmail.com (Matt Simmons) Date: Mon, 28 Jun 2010 11:17:49 -0400 Subject: wiki down? In-Reply-To: References: <4C249C0C.9080506@flatto.net> <6661588.34.1277572065602.JavaMail.root@zimbra> Message-ID: Bah! If you don't have an event handler that fences the misbehaving machine at the first sign of trouble, you're not trying hard enough ;-) On Mon, Jun 28, 2010 at 8:34 AM, Max wrote: > On Mon, Jun 28, 2010 at 8:29 AM, Matt Simmons > wrote: >> If only there were some kind of software available to let us know when >> websites were down... > > Or people to respond to alerts from the software :) > > ------------------------------------------------------------------------------ > This SF.net email is sponsored by Sprint > What will you do first with EVO, the first 4G phone? > Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- LITTLE GIRL: But which cookie will you eat FIRST? COOKIE MONSTER: Me think you have misconception of cookie-eating process. ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From t.h.amundsen at usit.uio.no Mon Jun 28 18:16:36 2010 From: t.h.amundsen at usit.uio.no (Trond Hasle Amundsen) Date: Mon, 28 Jun 2010 18:16:36 +0200 Subject: check_openmanage: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/check_openmanage In-Reply-To: <0357196CB603794BB76F4F6B373F27330506EACE9B@SERVER.ddnglobal.local> (Max Williams's message of "Mon, 28 Jun 2010 16:17:34 +0100") References: <0357196CB603794BB76F4F6B373F27330506EACE95@SERVER.ddnglobal.local> <15tocezgnps.fsf@tux.uio.no> <0357196CB603794BB76F4F6B373F27330506EACE96@SERVER.ddnglobal.local> <15tk4pnghb7.fsf@tux.uio.no> <0357196CB603794BB76F4F6B373F27330506EACE97@SERVER.ddnglobal.local> <15td3vbfdau.fsf@tux.uio.no> <0357196CB603794BB76F4F6B373F27330506EACE9B@SERVER.ddnglobal.local> Message-ID: <15tzkyfdtej.fsf@tux.uio.no> Max Williams writes: > Excellent, sorted, everything reports as OK now. Good. I'll try to make a release with these changes in the next couple of days. > Thanks so much Trond, amazing support and an amazingly useful plugin! Glad you like it, Max. Thanks for reporting this issue :) Cheers, -- Trond H. Amundsen Center for Information Technology Services, University of Oslo ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From shadhin71 at gmail.com Mon Jun 28 18:56:58 2010 From: shadhin71 at gmail.com (shadih rahman) Date: Mon, 28 Jun 2010 12:56:58 -0400 Subject: how to fix excessive latency In-Reply-To: <201006281721596568296@cebbank.com> References: <201006281721596568296@cebbank.com> Message-ID: There is something definitely not right here. We have about 10000 checks and the performance is lot better. Anyhow we are using the following values check_result_reaper_frequency=10 max_check_result_reaper_time=20 You should enabled debug mode and check the debug logs. Are you writing to any backend database? Are you using nsca to transfer service information to remote location. what is the value of your status_update_interval? what is your external_command_buffer_slots? 2010/6/28 wwanghongrui > Hi,guys~ > > Our nagios server envrionment: Nagios3.2.0 + Suse10-sp2 x86_64 + 8 GB mem + > 4 x ( Xeon(R) CPU E7420 @ 2.13GHz ) > We have 500+ active check hosts and 3k+ active check services. I have > adjust some perfomance parameters in nagios.cfg, like below: > use_large_installation_tweaks=1 > child_processes_fork_twice=0 > enable_environment_macros=0 > check_result_reaper_frequency=5 > max_check_result_reaper_time=30 > > But, The nagios performance is still bad, like below: > > Services Actively Checked: > Time Frame Services Checked <= 1 minute: 271 (9.4%) <= 5 minutes: 1749 > (60.4%) <= 15 minutes: 2824 (97.4%) <= 1 hour: 2898 (100.0%) Since program > start: 2869 (99.0%) Metric Min. Max. Average Check Execution Time: 0.09 > sec 32.23 sec 1.113 sec Check Latency: 1.12 sec 212.59 sec 116.329 sec Percent > State Change: 0.00% 23.88% 0.05% > Hosts Acrively Checked: > Time Frame Hosts Checked <= 1 minute: 32 (5.5%) <= 5 minutes: 419 > (71.5%) <= 15 minutes: 586 (100.0%) <= 1 hour: 586 (100.0%) Since program > start: 586 (100.0%) Metric Min. Max. Average Check Execution Time: 0.08 > sec 4.29 sec 3.035 sec Check Latency: 0.00 sec 135.25 sec 116.420 sec Percent > State Change: 0.00% 11.32% 0.09% > > How could I find which services check or hosts check cause this seriously > check latency? > > > Regards > > HongRui Wang > mail: wwanghongrui at cebbank.com > 2010-06-28 > > > > > ------------------------------------------------------------------------------ > This SF.net email is sponsored by Sprint > What will you do first with EVO, the first 4G phone? > Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Cordially, Shadhin Rahman -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mirde-h1ER1paUyZg at public.gmane.org Tue Jun 29 01:52:00 2010 From: mirde-h1ER1paUyZg at public.gmane.org (Mirza Dedic) Date: Mon, 28 Jun 2010 16:52:00 -0700 Subject: Cannot get pnp4nagios graphs using NSClient++ & check_nrpe ? Message-ID: Checking disk space using NSClient++ (NRPE) and check_nrpe (2.12), trying to get RRD graphs (pnp4nagios): Configuration: NSClient++ v.0.3.8.76 (2010-05-27 x64) Check_nrpe 2.12 Nagios Merlin 0.6.7-beta2sp1 Nagios 3.2.1 Ninja 1.0 PNP 0.4.14 RRDTool 1.2.19 Error log in perfdata.log: 2010-06-28 16:10:39 [16465] [1] process_perfdata.pl-0.4.14 starting in DEFAULT Mode 2010-06-28 16:10:39 [16465] [1] Found Performance Data for van-mail01 / DISK__F ('F: %'=52%;5;3; 'F:'=194.62G;20;12;0;399.99;) 2010-06-28 16:10:39 [16465] [1] Invalid Perfdata detected 2010-06-28 16:10:39 [16465] [1] PNP exiting (runtime 0.00175s) ... Perfdata has been enabled in nagios.cfg process_performance_data=1 host_perfdata_command=process-host-perfdata service_perfdata_command=process-service-perfdata host_perfdata_file_mode=a service_perfdata_file_mode=a Also, my commands for perfdata: define command { command_name process-service-perfdata command_line /usr/bin/perl /usr/local/nagios/libexec/process_perfdata.pl } define command { command_name process-host-perfdata command_line /usr/bin/perl /usr/local/nagios/libexec/process_perfdata.pl -d HOSTPERFDATA } Finally, the command configuration for the service: define command{ command_name check_nrpe_disk command_line $USER1$/check_nrpe -H $HOSTADDRESS$ -u -n -p XXXXX -t 30 -c CheckDriveSize -a ShowAll MinWarnFree=$ARG2$ MinCritFree=$ARG3$ Drive="$ARG1$" } I recently switched from check_nt to check_nrpe, and I removed my .xml and .rrd files from the service checks that got switched from nt to nrpe. Shouldn't the pnp4nagios use the default.php to create the perfdata output? I don't see how the performance data is invalid? I can confirm that the RRD graphs work for other services, such as ping. Any help would be appreciated. The Oppenheimer Group ---- CONFIDENTIAL This message is for the designated recipient only and may contain privileged, proprietary, or otherwise private information. If you have received it in error, please notify the sender immediately and delete the original. Any other use of the email by you is prohibited. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first -------------- next part -------------- _______________________________________________ Nagiosplug-help mailing list Nagiosplug-help-5NWGOfrQmneRv+LV9MX5uipxlwaOVQ5f at public.gmane.org https://lists.sourceforge.net/lists/listinfo/nagiosplug-help ::: Please include plugins version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From bangers at gmail.com Tue Jun 29 02:34:29 2010 From: bangers at gmail.com (Matthew Angelo) Date: Tue, 29 Jun 2010 10:34:29 +1000 Subject: Assign contact_group to a host without notifications Message-ID: Hi Nagios Users, We have a super modular config. Essentially [almost] all Service Checks are defined to HostGroups, and then Hosts merely assign themselve to that HostGroup. # # # HostGroup { # LINUX_SERVER # check_cpu # check_memory # check_disk # } # # # Host { # use TEAM1 # name MY_LINUXSERVER1 # hostgroup LINUX_SERVER # } # # "use TEAM1" is a Host Template definition which defines contact_group and notification period. How do I expand on this to allow another team (contact_group) read-only access or visibility into the Host service checks for MY_LINUXSERVER1. *without* notifiying them? I added: contact_groups +TEAM2 to the host definition. However it is now also *alerting* to TEAM2 which I don't want. Think of TEAM1 as "LINUX team" and TEAM2 as the Application team which want visibility into a server, but not be alerted if disk space starts filling up on the Server itself. Thanks -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mirde at oppy.com Tue Jun 29 02:37:18 2010 From: mirde at oppy.com (Mirza Dedic) Date: Mon, 28 Jun 2010 17:37:18 -0700 Subject: [op5-users] Cannot get pnp4nagios graphs using NSClient++ & check_nrpe ? In-Reply-To: References: Message-ID: Checking disk space using NSClient++ (NRPE) and check_nrpe (2.12), trying to get RRD graphs (pnp4nagios): Configuration: NSClient++ v.0.3.8.76 (2010-05-27 x64) Check_nrpe 2.12 Nagios Merlin 0.6.7-beta2sp1 Nagios 3.2.1 Ninja 1.0 PNP 0.4.14 RRDTool 1.2.19 Error log in perfdata.log: 2010-06-28 16:10:39 [16465] [1] process_perfdata.pl-0.4.14 starting in DEFAULT Mode 2010-06-28 16:10:39 [16465] [1] Found Performance Data for van-mail01 / DISK__F ('F: %'=52%;5;3; 'F:'=194.62G;20;12;0;399.99;) 2010-06-28 16:10:39 [16465] [1] Invalid Perfdata detected 2010-06-28 16:10:39 [16465] [1] PNP exiting (runtime 0.00175s) ... Perfdata has been enabled in nagios.cfg process_performance_data=1 host_perfdata_command=process-host-perfdata service_perfdata_command=process-service-perfdata host_perfdata_file_mode=a service_perfdata_file_mode=a Also, my commands for perfdata: define command { command_name process-service-perfdata command_line /usr/bin/perl /usr/local/nagios/libexec/process_perfdata.pl } define command { command_name process-host-perfdata command_line /usr/bin/perl /usr/local/nagios/libexec/process_perfdata.pl -d HOSTPERFDATA } Finally, the command configuration for the service: define command{ command_name check_nrpe_disk command_line $USER1$/check_nrpe -H $HOSTADDRESS$ -u -n -p XXXXX -t 30 -c CheckDriveSize -a ShowAll MinWarnFree=$ARG2$ MinCritFree=$ARG3$ Drive="$ARG1$" } I recently switched from check_nt to check_nrpe, and I removed my .xml and .rrd files from the service checks that got switched from nt to nrpe. Shouldn't the pnp4nagios use the default.php to create the perfdata output? I don't see how the performance data is invalid? I can confirm that the RRD graphs work for other services, such as ping. Any help would be appreciated. The Oppenheimer Group ---- CONFIDENTIAL This message is for the designated recipient only and may contain privileged, proprietary, or otherwise private information. If you have received it in error, please notify the sender immediately and delete the original. Any other use of the email by you is prohibited. The Oppenheimer Group ---- CONFIDENTIAL This message is for the designated recipient only and may contain privileged, proprietary, or otherwise private information. If you have received it in error, please notify the sender immediately and delete the original. Any other use of the email by you is prohibited. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From wwanghongrui at cebbank.com Tue Jun 29 03:57:47 2010 From: wwanghongrui at cebbank.com (=?utf-8?B?d3dhbmdob25ncnVp?=) Date: Tue, 29 Jun 2010 09:57:47 +0800 Subject: =?utf-8?q?how_to_fix_excessive_latency?= References: <201006281721596568296@cebbank.com> Message-ID: <201006290957462960388@cebbank.com> Thanks your reply. We are writing to mysql database by ndoutils.We don't use nsca. About external_command_buffer_slots, we don't set it up. status_update_interval =15 I use vmstate to capture system performance,like below.Maybe the bottleneck is not at system. procs -----------memory---------- ---swap-- -----io---- -system-- -----cpu------ r b swpd free buff cache si so bi bo in cs us sy id wa st 1 0 160 239708 289248 6031924 0 0 1 29 0 0 2 3 94 1 0 1 0 160 242168 289248 6031924 0 0 0 0 260 1023 0 6 94 0 0 1 0 160 246912 289248 6031924 0 0 0 392 291 1044 0 6 93 1 0 1 0 160 246696 289248 6031924 0 0 0 100 265 1056 0 6 93 0 0 2 0 160 243604 289248 6035008 0 0 4668 0 598 1324 1 7 91 1 0 1 0 160 245276 289248 6035008 0 0 32 0 265 1403 0 6 93 0 0 1 0 160 245268 289248 6035008 0 0 0 0 253 1187 0 6 94 0 0 1 1 160 245548 289248 6035008 0 0 0 4728 887 1759 0 6 88 5 0 1 1 160 246288 289248 6036036 0 0 0 1740 1065 1103 1 6 87 6 0 0 1 160 247368 289248 6036036 0 0 0 1720 1086 2252 1 3 90 6 0 0 0 160 247492 289248 6036036 0 0 0 980 984 539 4 0 90 6 0 0 0 160 247624 289248 6036036 0 0 0 0 254 330 0 0 100 0 0 0 0 160 247624 289248 6036036 0 0 0 5420 622 342 0 0 97 3 0 0 0 160 247844 289248 6036036 0 0 0 0 254 312 0 0 100 0 0 0 0 160 247844 289248 6036036 0 0 0 0 254 317 0 0 100 0 0 0 0 160 247984 289248 6036036 0 0 0 0 254 313 0 0 100 0 0 0 0 160 247984 289248 6036036 0 0 0 0 254 315 0 0 100 0 0 0 0 160 248260 289248 6036036 0 0 0 352 362 317 0 0 99 1 0 0 0 160 248260 289248 6036036 0 0 0 0 306 303 0 0 100 0 0 1 0 160 248876 289248 6036036 0 0 0 100 270 367 0 0 99 0 0 5 0 160 233840 289248 6036036 0 0 0 0 341 1490 6 8 86 0 0 5 0 160 187468 289248 6036036 0 0 0 4 866 2736 9 22 69 0 0 4 1 160 171508 289248 6036036 0 0 0 5352 837 2205 3 20 76 1 0 procs -----------memory---------- ---swap-- -----io---- -system-- -----cpu------ r b swpd free buff cache si so bi bo in cs us sy id wa st 4 0 160 175172 289248 6036036 0 0 0 568 453 2091 1 15 83 0 0 3 0 160 154108 289248 6036036 0 0 0 0 427 3456 1 20 79 0 0 5 0 160 125684 289248 6036036 0 0 0 4 469 2620 1 19 80 0 0 9 0 160 146712 289248 6036036 0 0 0 0 603 2272 4 26 70 0 0 6 0 160 168804 289248 6036036 0 0 0 0 668 2784 9 27 64 0 0 4 0 160 181032 289248 6036036 0 0 0 1164 736 2654 4 25 70 1 0 1 0 160 210728 289248 6036036 0 0 0 0 465 2152 5 19 76 0 0 1 0 160 211216 289248 6036036 0 0 0 0 294 837 0 6 94 0 0 1 0 160 216644 289248 6036036 0 0 0 0 293 954 0 7 93 0 0 1 0 160 227320 289248 6036036 0 0 0 0 285 943 0 8 92 0 0 1 0 160 238864 289248 6036036 0 0 0 576 343 2308 1 8 91 1 0 1 2 160 233660 289248 6039120 0 0 2252 100 393 1046 1 6 92 1 0 1 0 160 239548 289248 6039120 0 0 984 3316 571 1055 1 6 92 1 0 1 0 160 240084 289248 6039120 0 0 0 0 253 998 0 6 94 0 0 1 0 160 239968 289248 6039120 0 0 0 0 253 990 0 6 93 0 0 1 1 160 240388 289248 6039120 0 0 0 1956 781 1111 0 6 89 4 0 1 1 160 240256 289248 6039120 0 0 0 1828 1088 1452 1 6 87 6 0 1 2 160 239648 289248 6039120 0 0 0 1620 1038 1614 1 6 87 6 0 1 1 160 240028 289248 6039120 0 0 0 1700 1065 1459 0 6 85 9 0 1 1 160 239912 289248 6039120 0 0 0 2512 1211 1623 0 6 87 6 0 1 1 160 240648 289248 6039120 0 0 4 2880 1380 1128 0 5 87 7 0 1 0 160 241124 289248 6039120 0 0 0 84 499 1024 0 6 93 0 0 1 0 160 241000 289248 6039120 0 0 0 296 287 1757 1 6 93 1 0 procs -----------memory---------- ---swap-- -----io---- -system-- -----cpu------ r b swpd free buff cache si so bi bo in cs us sy id wa st 3 0 160 241808 289248 6039120 0 0 0 0 253 1630 1 6 93 0 0 1 0 160 241800 289248 6039120 0 0 0 0 253 977 0 6 94 0 0 1 0 160 241880 289248 6039120 0 0 0 0 253 989 0 6 94 0 0 3 0 160 218192 289248 6039120 0 0 0 100 350 1810 3 14 83 0 0 4 0 160 181560 289248 6039120 0 0 0 5792 957 2948 6 21 72 1 0 6 0 160 182036 289248 6040148 0 0 0 0 853 2947 7 22 70 0 0 4 0 160 187860 289248 6040148 0 0 0 0 564 2748 12 25 64 0 0 4 0 160 202880 289248 6040148 0 0 0 0 432 2336 5 22 73 0 0 5 0 160 189956 289248 6040148 0 0 0 416 824 2762 7 24 69 1 0 2 0 160 195912 289248 6041176 0 0 52 1224 789 2332 5 15 78 2 0 1 0 160 205060 289248 6041176 0 0 0 8 343 1718 2 8 90 0 0 1 0 160 205076 289248 6041176 0 0 0 0 320 1177 0 6 93 0 0 1 0 160 213844 289248 6041176 0 0 0 0 315 1100 0 7 92 0 0 1 0 160 226900 289248 6041176 0 0 0 0 305 1210 0 8 92 0 0 2 0 160 227188 289248 6041176 0 0 0 956 556 901 0 4 92 3 0 1 0 160 228924 289248 6041176 0 0 0 0 294 1034 1 6 93 0 0 1 0 160 229740 289248 6041176 0 0 0 0 292 1235 1 6 93 0 0 1 0 160 230228 289248 6041176 0 0 0 0 287 1696 1 6 93 0 0 3 1 160 230456 289248 6041176 0 0 0 128 288 1307 1 6 93 0 0 1 1 160 228756 289248 6042204 0 0 3052 4944 921 1673 5 7 84 4 0 1 1 160 229004 289248 6042204 0 0 0 1676 1061 1122 1 6 87 6 0 1 1 160 229004 289248 6042204 0 0 0 1672 1081 1093 0 6 87 6 0 1 1 160 230788 289248 6042204 0 0 0 1856 1171 1198 1 6 87 6 0 Regards HongRui Wang Mail:wwanghongrui at cebbank.com 2010-06-29 ???? shadih rahman ????? 2010-06-29 00:57:24 ???? wwanghongrui; Nagios Users List ??? ??? Re: [Nagios-users] how to fix excessive latency There is something definitely not right here. We have about 10000 checks and the performance is lot better. Anyhow we are using the following values check_result_reaper_frequency=10 max_check_result_reaper_time=20 You should enabled debug mode and check the debug logs. Are you writing to any backend database? Are you using nsca to transfer service information to remote location. what is the value of your status_update_interval? what is your external_command_buffer_slots? 2010/6/28 wwanghongrui Hi,guys~ Our nagios server envrionment: Nagios3.2.0 + Suse10-sp2 x86_64 + 8 GB mem + 4 x ( Xeon(R) CPU E7420 @ 2.13GHz ) We have 500+ active check hosts and 3k+ active check services. I have adjust some perfomance parameters in nagios.cfg, like below: use_large_installation_tweaks=1 child_processes_fork_twice=0 enable_environment_macros=0 check_result_reaper_frequency=5 max_check_result_reaper_time=30 But, The nagios performance is still bad, like below: Services Actively Checked:Time FrameServices Checked <= 1 minute:271 (9.4%) <= 5 minutes:1749 (60.4%) <= 15 minutes:2824 (97.4%) <= 1 hour:2898 (100.0%) Since program start: 2869 (99.0%) MetricMin.Max.Average Check Execution Time: 0.09 sec32.23 sec1.113 sec Check Latency:1.12 sec212.59 sec116.329 sec Percent State Change:0.00%23.88%0.05% Hosts Acrively Checked:Time FrameHosts Checked <= 1 minute:32 (5.5%) <= 5 minutes:419 (71.5%) <= 15 minutes:586 (100.0%) <= 1 hour:586 (100.0%) Since program start: 586 (100.0%) MetricMin.Max.Average Check Execution Time: 0.08 sec4.29 sec3.035 sec Check Latency:0.00 sec135.25 sec116.420 sec Percent State Change:0.00%11.32%0.09% How could I find which services check or hosts check cause this seriously check latency? Regards HongRui Wang mail: wwanghongrui at cebbank.com 2010-06-28 ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Cordially, Shadhin Rahman -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mirde at oppy.com Tue Jun 29 03:58:04 2010 From: mirde at oppy.com (Mirza Dedic) Date: Mon, 28 Jun 2010 18:58:04 -0700 Subject: [op5-users] Cannot get pnp4nagios graphs using NSClient++ & check_nrpe ? In-Reply-To: References: Message-ID: As far as I am aware, the perfdata is correct? 'F:'=194.62G;20;12;0;399.99; | | | | | | | |----|--|-----|--|-|-|----- * label |--|-----|--|-|-|----- * current value |-----|--|-|-|----- unit ( UOM = UNIT of Measurement ) |--|-|-|----- warning threshold |-|-|----- critical threshold |-|----- minimum value |----- maximum value From: Mirza Dedic Sent: June/28/2010 5:37 PM To: 'Nagios Users List' Subject: RE: [op5-users] Cannot get pnp4nagios graphs using NSClient++ & check_nrpe ? Checking disk space using NSClient++ (NRPE) and check_nrpe (2.12), trying to get RRD graphs (pnp4nagios): Configuration: NSClient++ v.0.3.8.76 (2010-05-27 x64) Check_nrpe 2.12 Nagios Merlin 0.6.7-beta2sp1 Nagios 3.2.1 Ninja 1.0 PNP 0.4.14 RRDTool 1.2.19 Error log in perfdata.log: 2010-06-28 16:10:39 [16465] [1] process_perfdata.pl-0.4.14 starting in DEFAULT Mode 2010-06-28 16:10:39 [16465] [1] Found Performance Data for van-mail01 / DISK__F ('F: %'=52%;5;3; 'F:'=194.62G;20;12;0;399.99;) 2010-06-28 16:10:39 [16465] [1] Invalid Perfdata detected 2010-06-28 16:10:39 [16465] [1] PNP exiting (runtime 0.00175s) ... Perfdata has been enabled in nagios.cfg process_performance_data=1 host_perfdata_command=process-host-perfdata service_perfdata_command=process-service-perfdata host_perfdata_file_mode=a service_perfdata_file_mode=a Also, my commands for perfdata: define command { command_name process-service-perfdata command_line /usr/bin/perl /usr/local/nagios/libexec/process_perfdata.pl } define command { command_name process-host-perfdata command_line /usr/bin/perl /usr/local/nagios/libexec/process_perfdata.pl -d HOSTPERFDATA } Finally, the command configuration for the service: define command{ command_name check_nrpe_disk command_line $USER1$/check_nrpe -H $HOSTADDRESS$ -u -n -p XXXXX -t 30 -c CheckDriveSize -a ShowAll MinWarnFree=$ARG2$ MinCritFree=$ARG3$ Drive="$ARG1$" } I recently switched from check_nt to check_nrpe, and I removed my .xml and .rrd files from the service checks that got switched from nt to nrpe. Shouldn't the pnp4nagios use the default.php to create the perfdata output? I don't see how the performance data is invalid? I can confirm that the RRD graphs work for other services, such as ping. Any help would be appreciated. The Oppenheimer Group ---- CONFIDENTIAL This message is for the designated recipient only and may contain privileged, proprietary, or otherwise private information. If you have received it in error, please notify the sender immediately and delete the original. Any other use of the email by you is prohibited. The Oppenheimer Group ---- CONFIDENTIAL This message is for the designated recipient only and may contain privileged, proprietary, or otherwise private information. If you have received it in error, please notify the sender immediately and delete the original. Any other use of the email by you is prohibited. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jwellband at gmail.com Tue Jun 29 04:18:15 2010 From: jwellband at gmail.com (Jason W.) Date: Mon, 28 Jun 2010 22:18:15 -0400 Subject: Additional states in Nagios Message-ID: (I've tried Googling for the answer, but there seems to be some ambiguity in defining terms - even in the Nagios docs) I've got Nagios monitoring a bunch of things on our servers and I also have events being sent to Nagios via passive checks. This is all useful information to us as sysadmins, but there is a difference in criticality, e.g. is is down, is it about to go down, or is it purely informational? The latter is what I am writing about. Currently, there are two "states" we use - WARNING and CRITICAL. This is the ambiguous part since the docs refer to states as HARD or SOFT, but the plugin API docs refer to WARNING and CRITICAL as states. I realize there is also UNKNOWN, but with non-technical people occasionally looking at our Nagios, that may lead them astray... Is there a way to get more states, e.g. INFORMATION? This would allow one to sort by state in the web interface. Currently, we use WARNING for most informational messages, so there is a mashup of "Service X is about to die" and "Server Y did something you may want to know about" I am guessing not without hacking the source, but I can dream ;) Thoughts & comments appreciated - even if it's to say I'm Doing it Wrong. -- HTH, YMMV, HANW :) Jason The path to enlightenment is /usr/bin/enlightenment. ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From subscription at kkeane.com Tue Jun 29 05:05:40 2010 From: subscription at kkeane.com (Kevin Keane) Date: Mon, 28 Jun 2010 20:05:40 -0700 Subject: Additional states in Nagios In-Reply-To: References: Message-ID: Actually, there are four states reported by plugins: OK, WARNING, CRITICAL and UNKNOWN. Services will have the same four states. There are also three states that hosts can have: UP, DOWN, UNREACHABLE. UP, DOWN and unreachable depends on the state reported by the plugin, as well as the state of parents. http://nagios.sourceforge.net/docs/3_0/hostchecks.html HARD and SOFT states are separate from all of that. You can have a soft warning or a hard warning, and a soft critical or a hard critical. http://nagios.sourceforge.net/docs/3_0/statetypes.html OK, WARNING, CRITICAL and UNKNOWN are the actual state of whatever you are monitoring. The plugins decide which state it is. HARD, SOFT, as well as UP or DOWN, are computed by Nagios based on the status reported by the plugins. Exactly how Nagios does that is configurable. -----Original Message----- From: Jason W. [mailto:jwellband at gmail.com] Sent: Monday, June 28, 2010 7:18 PM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] Additional states in Nagios (I've tried Googling for the answer, but there seems to be some ambiguity in defining terms - even in the Nagios docs) I've got Nagios monitoring a bunch of things on our servers and I also have events being sent to Nagios via passive checks. This is all useful information to us as sysadmins, but there is a difference in criticality, e.g. is is down, is it about to go down, or is it purely informational? The latter is what I am writing about. Currently, there are two "states" we use - WARNING and CRITICAL. This is the ambiguous part since the docs refer to states as HARD or SOFT, but the plugin API docs refer to WARNING and CRITICAL as states. I realize there is also UNKNOWN, but with non-technical people occasionally looking at our Nagios, that may lead them astray... Is there a way to get more states, e.g. INFORMATION? This would allow one to sort by state in the web interface. Currently, we use WARNING for most informational messages, so there is a mashup of "Service X is about to die" and "Server Y did something you may want to know about" I am guessing not without hacking the source, but I can dream ;) Thoughts & comments appreciated - even if it's to say I'm Doing it Wrong. -- HTH, YMMV, HANW :) Jason The path to enlightenment is /usr/bin/enlightenment. ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From NOC at eurofmc.com Tue Jun 29 07:40:18 2010 From: NOC at eurofmc.com (Network Operation Center FMC Luxemburg) Date: Tue, 29 Jun 2010 07:40:18 +0200 Subject: IP and hostname mapping control Message-ID: <4C298742.6060306@eurofmc.com> Hi everybody, I'm looking for a way to check out the mapping between a hostname and IP address. Example: IP 192.168.0.1 exists and if the hostname foo.mylan.com is not associated with this IP, I would have an alarm. Indeed the script below returns no alarm : define host { use unix-server host_name foo.mylan.com display_name foo address 192.168.0.1 check_command check_http } define service { use local-service host_name foo.mylan.com service_description HTTP local check_command check_http } Any idea? Thanks a lot Fran?ois -- Network Operation Center LUXEMBURG -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sandman42 at libero.it Tue Jun 29 09:58:35 2010 From: sandman42 at libero.it (sandman42 at libero.it) Date: Tue, 29 Jun 2010 09:58:35 +0200 (CEST) Subject: Monitoring traffic Message-ID: <13086732.342931277798315188.JavaMail.defaultUser@defaultHost> Hi, I'd need to keep traffic on a UMTS router monitored, i.e. I'd need to set up something that counts bytes passed and gives an alarm when a particular thresold is reached. The router gives no SNMP information about it. Is there any other way to do that with nagios? Thanks ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Tue Jun 29 14:23:46 2010 From: ae at op5.se (Andreas Ericsson) Date: Tue, 29 Jun 2010 14:23:46 +0200 Subject: how to fix excessive latency In-Reply-To: <201006290957462960388@cebbank.com> References: <201006281721596568296@cebbank.com> <201006290957462960388@cebbank.com> Message-ID: <4C29E5D2.6060203@op5.se> On 06/29/2010 03:57 AM, wwanghongrui wrote: > Thanks your reply. We are writing to mysql database by ndoutils.We don't use nsca. About external_command_buffer_slots, we don't set it up. > status_update_interval =15 > > I use vmstate to capture system performance,like below.Maybe the bottleneck is not at system. > Endeavour to not run Nagios on a virtual server. If you must use a virtual server, make very sure that your checkresult spooldirectory and status data files are on a ramdisk, or you will certainly run into trouble. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace. ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jim at jimavery.me.uk Tue Jun 29 14:36:37 2010 From: jim at jimavery.me.uk (Jim Avery) Date: Tue, 29 Jun 2010 13:36:37 +0100 Subject: Assign contact_group to a host without notifications In-Reply-To: References: Message-ID: On 29 June 2010 01:34, Matthew Angelo wrote: > Hi Nagios Users, > > We have a super modular config.? Essentially [almost] all Service Checks are > defined to HostGroups, and then Hosts merely assign themselve to that > HostGroup. > > > # > # > # HostGroup { > # ??? LINUX_SERVER > # ??? ??? check_cpu > # ??? ??? check_memory > # ??? ??? check_disk > # } > # > # > # Host { > # ??? use TEAM1 > # ??? name MY_LINUXSERVER1 > # ??? hostgroup LINUX_SERVER > # } > # > # > > "use TEAM1" is a Host Template definition which defines contact_group and > notification period. > > > How do I expand on this to allow another team (contact_group) read-only > access or visibility into the Host service checks for MY_LINUXSERVER1. > *without* notifiying them? > > I added: > > contact_groups????????? +TEAM2 > > to the host definition.? However it is now also *alerting* to TEAM2 which I > don't want. > > > Think of TEAM1 as "LINUX team" and TEAM2 as the Application team which want > visibility into a server, but not be alerted if disk space starts filling up > on the Server itself. I think whatever you do is going to be a bit messy. It will probably involve setting up two Nagios contact definitions for each physical user (which is what I do in similar circumstances). For example I would set up the contacts which your TEAM2 users log in with to all have host_notification_options = n and service_notification_options = n. I would then set up separate contacts (for example if the user logs in with "Fred", set up a contact called "Fred-notify". This 2nd contact can have host_notification_options and service_notification_options = y. You will then need to structure your contact groups so for example contact group "TEAM2" contains user "Fred" and contact group "TEAM2-notify" contains user "Fred-notify". Those hosts you want TEAM2 to just see, you set contactgroups = "+TEAM2" in the relevant host template. Those hosts you want them to see and get notifications for, you set contactgroups = "+TEAM2,TEAM2-notify" in the relevant host template. Note that the user never has actually to log in using "Fred-notify" so it doesn't need an entry in your htpasswd config - we only set that one up for the notifications. You might be able to do something a little tidier using a custom notification script, but doing that might make your configs less easy for anyone coming from outside to understand. I hope that helps rather than confuses things for you! Cheers, Jim ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From paul.weaver at bbc.co.uk Tue Jun 29 13:05:23 2010 From: paul.weaver at bbc.co.uk (Paul Weaver) Date: Tue, 29 Jun 2010 12:05:23 +0100 Subject: IP and hostname mapping control In-Reply-To: <4C298742.6060306@eurofmc.com> References: <4C298742.6060306@eurofmc.com> Message-ID: <5C44E813F7481A46980A2F6778680B1A02B8EBB1@bbcxues16.national.core.bbc.co.uk> We have a service that checks dns addresses. We run it against some of our internal DNS servers (we're checking the DNS server is resolving, so the check belongs on the dns server rather than the host you're interested in). define service{ use infrastructure-dns-service host_name dc1025,dc1026,dc1030 service_description DNS-myserver check_command check_dns!myserver.com!4.5.6.7 } define command{ command_name check_dns command_line $USER1$/check_dns -H $ARG1$ -s $HOSTADDRESS$ -a $ARG2$ } (I believe check_dns is a standard plugin) So you'd have: define service { use local-service host_name local.dns.server service_description DNS resolving foo.mylan.com check_command check_dns!foo.mylan.com!192.168.0.1 } -- The probability of someone watching you is proportional to the stupidity of your action. Paul Weaver Systems Development Engineer BBC FM&T BETG TDD SDT ________________________________ From: Network Operation Center FMC Luxemburg [mailto:NOC at eurofmc.com] Sent: 29 June 2010 06:40 To: Nagios Users List Subject: [Nagios-users] IP and hostname mapping control Hi everybody, I'm looking for a way to check out the mapping between a hostname and IP address. Example: IP 192.168.0.1 exists and if the hostname foo.mylan.com is not associated with this IP, I would have an alarm. Indeed the script below returns no alarm : define host { use unix-server host_name foo.mylan.com display_name foo address 192.168.0.1 check_command check_http } define service { use local-service host_name foo.mylan.com service_description HTTP local check_command check_http } Any idea? Thanks a lot Fran?ois -- Network Operation Center LUXEMBURG http://www.bbc.co.uk/ This e-mail (and any attachments) is confidential and may contain personal views which are not the views of the BBC unless specifically stated. If you have received it in error, please delete it from your system. Do not use, copy or disclose the information in any way nor act in reliance on it and notify the sender immediately. Please note that the BBC monitors e-mails sent or received. Further communication will signify your consent to this. ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jim at jimavery.me.uk Tue Jun 29 14:11:28 2010 From: jim at jimavery.me.uk (Jim Avery) Date: Tue, 29 Jun 2010 13:11:28 +0100 Subject: Additional states in Nagios In-Reply-To: References: Message-ID: On 29 June 2010 03:18, Jason W. wrote: > (I've tried Googling for the answer, but there seems to be some > ambiguity in defining terms - even in the Nagios docs) > > I've got Nagios monitoring a bunch of things on our servers and I also > have events being sent to Nagios via passive checks. This is all > useful information to us as sysadmins, but there is a difference in > criticality, e.g. is is down, is it about to go down, or is it purely > informational? > > The latter is what I am writing about. Currently, there are two > "states" we use - WARNING and CRITICAL. This is the ambiguous part > since the docs refer to states as HARD or SOFT, but the plugin API > docs refer to WARNING and CRITICAL as states. I realize there is also > UNKNOWN, but with non-technical people occasionally looking at our > Nagios, that may lead them astray... > > Is there a way to get more states, e.g. INFORMATION? ?This would allow > one to sort by state in the web interface. Currently, we use WARNING > for most informational messages, so there is a mashup of "Service X is > about to die" and "Server Y did something you may want to know about" I don't believe there is. If you want to make sure that Nagios always records the text for a specific kind of check, you can set the "is_volatile" directive to "1". For services which normally report the same information each time they are checked this could be overkill, but for passive checks which are event-driven, for example SNMP Trap handling it is very useful. If you haven't already found it, it's worth taking a look at the documentation on volatile services: http://nagios.sourceforge.net/docs/3_0/volatileservices.html Cheers, Jim ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jim at jimavery.me.uk Tue Jun 29 14:51:33 2010 From: jim at jimavery.me.uk (Jim Avery) Date: Tue, 29 Jun 2010 13:51:33 +0100 Subject: Monitoring traffic In-Reply-To: <13086732.342931277798315188.JavaMail.defaultUser@defaultHost> References: <13086732.342931277798315188.JavaMail.defaultUser@defaultHost> Message-ID: On 29 June 2010 08:58, sandman42 at libero.it wrote: > Hi, > > I'd need to keep traffic on a UMTS router monitored, i.e. I'd need to set up > something that counts bytes passed and gives an alarm when a particular > thresold is reached. > > The router gives no SNMP information about it. > > Is there any other way to do that with nagios? > > Thanks What options are available to you for querying the counter on the router? If, for example, there is a command-line interface you could use then you can at a pinch script something around that. There are a bunch of old AS400 plugins which I think do that kind of thing at http://exchange.nagios.org/directory/Plugins/Hardware/Server-Hardware/IBM/AS400/details - it might give you some ideas on how to go about it. ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From bparish at cognex.com Tue Jun 29 14:42:34 2010 From: bparish at cognex.com (Parish, Brent) Date: Tue, 29 Jun 2010 08:42:34 -0400 Subject: Additional states in Nagios In-Reply-To: References: Message-ID: Kevin gave a GREAT answer - succinct and yet informative. It sounds like he answered the first part of the question - clearing the ambiguity of the states. I interpreted the second part (the dream) as the desire to have Nagios differentiating between informational messages and things that perhaps require action (alarms). I don't think there is any single 'catch all' solution to this, I suppose it really depends on your environment, admin team, etc. For example, in your company, ALL alerts from S.M.A.R.T. disks might deserve immediate attention. In my world, we take the ostrich approach to those (just kidding, and don't flay me for perpetuating the myth of ostrich heads and sand). I personally use a combination of things to tune the alerts. For example, with printers alerting on low toner, I set the frequency of alerts to once every 24 hours, so as not to flood people with non critical messages. For disk alerts that come in as 'unknown' state, I have set the retry time high to avoid extra alarms getting sent just because network latency is high (thus returning the unknown state). I have also modified the plugins to strip out messages/states that are (to us here) strictly informational and not worth alarming on. And for any alert that comes in, you can always just 'acknowledge' it through the CGIs to hush it if it is strictly informational - it will alarm again (depending on your setup) if/when it changes state again (for better or worse). Lastly, though it is a TON of work, you can rebuild the entire alerting process. I store user preferences in a MySQL database and let the individual admins change those through a CGI. Then I send ALL Nagios alerts through that processor which matches up the alert, time of day, host, service, etc against the user prefs to decide who gets alerted and how. When you do something like that, you can then define alternative methods of alerting. For example, I get alerted on disks at warning level during business hours, but not until critical level off hours. In addition, I have the alerts just going to email during business hours, but I also send via instant messenger and to a home email address in off hours. You could use the same intelligence to split out what are strictly informational messages vs. what are real alerts. Ooops, I said lastly, didn't I? Another thought: maybe you could send all alerts to an Exchange (group) mailbox, and use Exchange rules to filter the informational messages vs. real alerts and send those on to individuals. Just my 2 cents. - Brent -----Original Message----- From: Kevin Keane [mailto:subscription at kkeane.com] Sent: Monday, June 28, 2010 11:06 PM To: Nagios Users List Subject: Re: [Nagios-users] Additional states in Nagios Actually, there are four states reported by plugins: OK, WARNING, CRITICAL and UNKNOWN. Services will have the same four states. There are also three states that hosts can have: UP, DOWN, UNREACHABLE. UP, DOWN and unreachable depends on the state reported by the plugin, as well as the state of parents. http://nagios.sourceforge.net/docs/3_0/hostchecks.html HARD and SOFT states are separate from all of that. You can have a soft warning or a hard warning, and a soft critical or a hard critical. http://nagios.sourceforge.net/docs/3_0/statetypes.html OK, WARNING, CRITICAL and UNKNOWN are the actual state of whatever you are monitoring. The plugins decide which state it is. HARD, SOFT, as well as UP or DOWN, are computed by Nagios based on the status reported by the plugins. Exactly how Nagios does that is configurable. -----Original Message----- From: Jason W. [mailto:jwellband at gmail.com] Sent: Monday, June 28, 2010 7:18 PM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] Additional states in Nagios (I've tried Googling for the answer, but there seems to be some ambiguity in defining terms - even in the Nagios docs) I've got Nagios monitoring a bunch of things on our servers and I also have events being sent to Nagios via passive checks. This is all useful information to us as sysadmins, but there is a difference in criticality, e.g. is is down, is it about to go down, or is it purely informational? The latter is what I am writing about. Currently, there are two "states" we use - WARNING and CRITICAL. This is the ambiguous part since the docs refer to states as HARD or SOFT, but the plugin API docs refer to WARNING and CRITICAL as states. I realize there is also UNKNOWN, but with non-technical people occasionally looking at our Nagios, that may lead them astray... Is there a way to get more states, e.g. INFORMATION? This would allow one to sort by state in the web interface. Currently, we use WARNING for most informational messages, so there is a mashup of "Service X is about to die" and "Server Y did something you may want to know about" I am guessing not without hacking the source, but I can dream ;) Thoughts & comments appreciated - even if it's to say I'm Doing it Wrong. -- HTH, YMMV, HANW :) Jason The path to enlightenment is /usr/bin/enlightenment. ------------------------------------------------------------------------ ------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------ ------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From perldork at webwizarddesign.com Tue Jun 29 14:02:33 2010 From: perldork at webwizarddesign.com (Max) Date: Tue, 29 Jun 2010 08:02:33 -0400 Subject: IP and hostname mapping control In-Reply-To: <4C298742.6060306@eurofmc.com> References: <4C298742.6060306@eurofmc.com> Message-ID: On Tue, Jun 29, 2010 at 1:40 AM, Network Operation Center FMC Luxemburg wrote: > Hi everybody, > > I'm looking for a way to check out the mapping between a hostname and IP > address. > > Example: IP 192.168.0.1 exists and if the hostname foo.mylan.com is not > associated with this IP, I would have an alarm. > > Indeed the script below returns no alarm : Take a look at check_dns. - Max ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From zarrelli at linux.it Tue Jun 29 15:38:00 2010 From: zarrelli at linux.it (Giorgio Zarrelli) Date: Tue, 29 Jun 2010 15:38:00 +0200 Subject: Monitoring traffic In-Reply-To: References: <13086732.342931277798315188.JavaMail.defaultUser@defaultHost> Message-ID: <3A01E24F-78DC-4A4A-B74C-A221E3EEF8BE@linux.it> Try an snmpwalk and check the output Ciao, Giorgio Il giorno 29/giu/2010, alle ore 14:51, Jim Avery ha scritto: > On 29 June 2010 08:58, sandman42 at libero.it wrote: >> Hi, >> >> I'd need to keep traffic on a UMTS router monitored, i.e. I'd need to set up >> something that counts bytes passed and gives an alarm when a particular >> thresold is reached. >> >> The router gives no SNMP information about it. >> >> Is there any other way to do that with nagios? >> >> Thanks > > > What options are available to you for querying the counter on the router? > > If, for example, there is a command-line interface you could use then > you can at a pinch script something around that. There are a bunch of > old AS400 plugins which I think do that kind of thing at > http://exchange.nagios.org/directory/Plugins/Hardware/Server-Hardware/IBM/AS400/details > - it might give you some ideas on how to go about it. > > ------------------------------------------------------------------------------ > This SF.net email is sponsored by Sprint > What will you do first with EVO, the first 4G phone? > Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jim at jimavery.me.uk Tue Jun 29 14:42:37 2010 From: jim at jimavery.me.uk (Jim Avery) Date: Tue, 29 Jun 2010 13:42:37 +0100 Subject: IP and hostname mapping control In-Reply-To: <4C298742.6060306@eurofmc.com> References: <4C298742.6060306@eurofmc.com> Message-ID: On 29 June 2010 06:40, Network Operation Center FMC Luxemburg wrote: > Hi everybody, > > I'm looking for a way to check out the mapping between a hostname and IP > address. > > Example: IP 192.168.0.1 exists and if the hostname foo.mylan.com is not > associated with this IP, I would have an alarm. > > Indeed the script below returns no alarm : > > define host { > ??? use???????????? unix-server > ??? host_name??? ?? foo.mylan.com > ??? display_name??? foo > ??? address??? ???? 192.168.0.1 > ??? check_command?? check_http > } > > define service { > ??? use??? ??? ??? local-service > ??? host_name??? ? foo.mylan.com > ??? service_description?? HTTP local > ??? check_command? check_http > } If you are referring to name/address resolution by DNS, then does the check_dns plugin do what you want? ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From zarrelli at linux.it Tue Jun 29 15:36:49 2010 From: zarrelli at linux.it (Giorgio Zarrelli) Date: Tue, 29 Jun 2010 15:36:49 +0200 Subject: how to fix excessive latency In-Reply-To: <4C29E5D2.6060203@op5.se> References: <201006281721596568296@cebbank.com> <201006290957462960388@cebbank.com> <4C29E5D2.6060203@op5.se> Message-ID: <7A4B8E4E-4269-43EA-8414-49C4DD2C7A1A@linux.it> I agree, better not to use Nagios on virtual machine. The I/O layer of vms have poor performances. Ciao, Giorgio Il giorno 29/giu/2010, alle ore 14:23, Andreas Ericsson ha scritto: > On 06/29/2010 03:57 AM, wwanghongrui wrote: >> Thanks your reply. We are writing to mysql database by ndoutils.We don't use nsca. About external_command_buffer_slots, we don't set it up. >> status_update_interval =15 >> >> I use vmstate to capture system performance,like below.Maybe the bottleneck is not at system. >> > > Endeavour to not run Nagios on a virtual server. If you must use a virtual server, > make very sure that your checkresult spooldirectory and status data files are on > a ramdisk, or you will certainly run into trouble. > > -- > Andreas Ericsson andreas.ericsson at op5.se > OP5 AB www.op5.se > Tel: +46 8-230225 Fax: +46 8-230231 > > Considering the successes of the wars on alcohol, poverty, drugs and > terror, I think we should give some serious thought to declaring war > on peace. > > ------------------------------------------------------------------------------ > This SF.net email is sponsored by Sprint > What will you do first with EVO, the first 4G phone? > Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sandman42 at libero.it Tue Jun 29 16:29:10 2010 From: sandman42 at libero.it (sandman42 at libero.it) Date: Tue, 29 Jun 2010 16:29:10 +0200 (CEST) Subject: R: Re: Monitoring traffic Message-ID: <16578040.431541277821750463.JavaMail.defaultUser@defaultHost> >----Messaggio originale---- >Da: jim at jimavery.me.uk >Data: 29/06/2010 14.51 >> The router gives no SNMP information about it. >What options are available to you for querying the counter on the router? It is possible to send a string from the router to a particular port of a remote server. Is there any way to make nagios act as a tcp server so it receives notifications about it? Thanks ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From perldork at webwizarddesign.com Tue Jun 29 16:52:31 2010 From: perldork at webwizarddesign.com (Max) Date: Tue, 29 Jun 2010 10:52:31 -0400 Subject: how to fix excessive latency In-Reply-To: <7A4B8E4E-4269-43EA-8414-49C4DD2C7A1A@linux.it> References: <201006281721596568296@cebbank.com> <201006290957462960388@cebbank.com> <4C29E5D2.6060203@op5.se> <7A4B8E4E-4269-43EA-8414-49C4DD2C7A1A@linux.it> Message-ID: Clock skew can be an issue as well depending on the virtualization platform. On 6/29/10, Giorgio Zarrelli wrote: > I agree, better not to use Nagios on virtual machine. The I/O layer of vms > have poor performances. > > Ciao, > > Giorgio > > Il giorno 29/giu/2010, alle ore 14:23, Andreas Ericsson ha > scritto: > >> On 06/29/2010 03:57 AM, wwanghongrui wrote: >>> Thanks your reply. We are writing to mysql database by ndoutils.We don't >>> use nsca. About external_command_buffer_slots, we don't set it up. >>> status_update_interval =15 >>> >>> I use vmstate to capture system performance,like below.Maybe the >>> bottleneck is not at system. >>> >> >> Endeavour to not run Nagios on a virtual server. If you must use a virtual >> server, >> make very sure that your checkresult spooldirectory and status data files >> are on >> a ramdisk, or you will certainly run into trouble. >> >> -- >> Andreas Ericsson andreas.ericsson at op5.se >> OP5 AB www.op5.se >> Tel: +46 8-230225 Fax: +46 8-230231 >> >> Considering the successes of the wars on alcohol, poverty, drugs and >> terror, I think we should give some serious thought to declaring war >> on peace. >> >> ------------------------------------------------------------------------------ >> This SF.net email is sponsored by Sprint >> What will you do first with EVO, the first 4G phone? >> Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when >> reporting any issue. >> ::: Messages without supporting info will risk being sent to /dev/null > > ------------------------------------------------------------------------------ > This SF.net email is sponsored by Sprint > What will you do first with EVO, the first 4G phone? > Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From zarrelli at linux.it Tue Jun 29 16:48:39 2010 From: zarrelli at linux.it (Giorgio Zarrelli) Date: Tue, 29 Jun 2010 16:48:39 +0200 Subject: R: Re: Monitoring traffic In-Reply-To: <16578040.431541277821750463.JavaMail.defaultUser@defaultHost> References: <16578040.431541277821750463.JavaMail.defaultUser@defaultHost> Message-ID: <3BAF1A3C-E84B-4804-A9F6-D19B9E9A8A95@linux.it> You can use netcat for this purpose and pipe whatever you watt from the net to Nagios Ciao, Giorgio Il giorno 29/giu/2010, alle ore 16:29, "sandman42 at libero.it" ha scritto: >> ----Messaggio originale---- >> Da: jim at jimavery.me.uk >> Data: 29/06/2010 14.51 >>> The router gives no SNMP information about it. >> What options are available to you for querying the counter on the router? > > It is possible to send a string from the router to a particular port of a > remote server. > > Is there any way to make nagios act as a tcp server so it receives > notifications about it? > > Thanks > > > ------------------------------------------------------------------------------ > This SF.net email is sponsored by Sprint > What will you do first with EVO, the first 4G phone? > Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From subscription at kkeane.com Tue Jun 29 15:06:06 2010 From: subscription at kkeane.com (Kevin Keane) Date: Tue, 29 Jun 2010 06:06:06 -0700 Subject: IP and hostname mapping control In-Reply-To: <4C298742.6060306@eurofmc.com> References: <4C298742.6060306@eurofmc.com> Message-ID: You could use the check_dns plugin with the -a argument. From: Network Operation Center FMC Luxemburg [mailto:NOC at eurofmc.com] Sent: Monday, June 28, 2010 10:40 PM To: Nagios Users List Subject: [Nagios-users] IP and hostname mapping control Hi everybody, I'm looking for a way to check out the mapping between a hostname and IP address. Example: IP 192.168.0.1 exists and if the hostname foo.mylan.com is not associated with this IP, I would have an alarm. Indeed the script below returns no alarm : define host { use unix-server host_name foo.mylan.com display_name foo address 192.168.0.1 check_command check_http } define service { use local-service host_name foo.mylan.com service_description HTTP local check_command check_http } Any idea? Thanks a lot Fran?ois -- Network Operation Center LUXEMBURG -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mirde at oppy.com Tue Jun 29 23:37:09 2010 From: mirde at oppy.com (Mirza Dedic) Date: Tue, 29 Jun 2010 14:37:09 -0700 Subject: Help with creating a pnp4nagios template Message-ID: Hi, I was hoping someone out there can help me here, below is my perfdata output for checking disk space on remote Win32/64 servers that have NSClient++ installed. The perfdata received is in the format of: 'C:'=35.62G;3.99;1.99;0;39.98; The command used to query the C:\ is: define command{ command_name check_nrpe_disk command_line $USER1$/check_nrpe -H $HOSTADDRESS$ -u -n -p 12778 -t 30 -c CheckDriveSize -a Drive=$ARG1$ ShowAll=long MinWarnFree=$ARG2$ MinCritFree=$ARG3$ } For my MinWarnFree and MinCritFree, I define a % to warn/crit for minimum allowed space. Can someone give me an example of a rrd template I would use for this so that I can build off it? It would be greatly appreciated, thank you. The Oppenheimer Group ---- CONFIDENTIAL This message is for the designated recipient only and may contain privileged, proprietary, or otherwise private information. If you have received it in error, please notify the sender immediately and delete the original. Any other use of the email by you is prohibited. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From wwanghongrui at cebbank.com Wed Jun 30 02:57:56 2010 From: wwanghongrui at cebbank.com (wwanghongrui) Date: Wed, 30 Jun 2010 08:57:56 +0800 Subject: how to fix excessive latency References: <201006281721596568296@cebbank.com>, <201006290957462960388@cebbank.com> Message-ID: <201006300857559212907@cebbank.com> I am sorry for my bad english. My nagios server is not running in virtual server. Nagios3.2.0 + Suse10-sp2 x86_64 + 8 GB mem + 4 x ( Xeon(R) CPU E7420 @ 2.13GHz ), I think this hardware is enough. "I use vmstate to capture system performance", vmstate is a command in SUSE10,not a virtual server. My configuration is like below,I don't know which parameter should I optimize,could you give me suggestions, thanks~ cfg_file=/usr/local/nagios/etc/hosts.cfg cfg_file=/usr/local/nagios/etc/services.cfg cfg_file=/usr/local/nagios/etc/misccommands.cfg cfg_file=/usr/local/nagios/etc/checkcommands.cfg cfg_file=/usr/local/nagios/etc/contactgroups.cfg cfg_file=/usr/local/nagios/etc/contacts.cfg cfg_file=/usr/local/nagios/etc/hostgroups.cfg cfg_file=/usr/local/nagios/etc/servicegroups.cfg cfg_file=/usr/local/nagios/etc/timeperiods.cfg cfg_file=/usr/local/nagios/etc/escalations.cfg cfg_file=/usr/local/nagios/etc/dependencies.cfg cfg_file=/usr/local/nagios/etc/hostextinfo.cfg cfg_file=/usr/local/nagios/etc/serviceextinfo.cfg cfg_file=/usr/local/nagios/etc/meta_commands.cfg cfg_file=/usr/local/nagios/etc/meta_contactgroup.cfg cfg_file=/usr/local/nagios/etc/meta_contact.cfg cfg_file=/usr/local/nagios/etc/meta_dependencies.cfg cfg_file=/usr/local/nagios/etc/meta_escalations.cfg cfg_file=/usr/local/nagios/etc/meta_hostgroup.cfg cfg_file=/usr/local/nagios/etc/meta_host.cfg cfg_file=/usr/local/nagios/etc/meta_services.cfg cfg_file=/usr/local/nagios/etc/meta_timeperiod.cfg resource_file=/usr/local/nagios/etc//resource.cfg log_file=/usr/local/nagios/var/nagios.log temp_file=/usr/local/nagios/var/nagios.tmp status_file=/usr/local/nagios/var/status.log p1_file=/usr/local/nagios/bin/p1.pl status_update_interval=15 nagios_user=nagios nagios_group=nagios enable_notifications=1 execute_service_checks=1 accept_passive_service_checks=1 execute_host_checks=1 accept_passive_host_checks=1 enable_event_handlers=1 log_rotation_method=d log_archive_path=/usr/local/nagios/var/archives/ check_external_commands=1 command_check_interval=1s command_file=/usr/local/nagios/var/rw/nagios.cmd lock_file=/usr/local/nagios/var/nagios.lock retain_state_information=1 retention_update_interval=60 use_retained_program_state=1 use_retained_scheduling_info=1 use_syslog=1 log_notifications=1 log_service_retries=1 log_host_retries=1 log_event_handlers=1 log_initial_states=1 log_external_commands=1 sleep_time=1 service_inter_check_delay_method=s service_interleave_factor=s max_concurrent_checks=2000 service_reaper_frequency=5 interval_length=60 use_agressive_host_checking=1 enable_flap_detection=0 low_service_flap_threshold=25.0 high_service_flap_threshold=50.0 low_host_flap_threshold=25.0 high_host_flap_threshold=50.0 service_check_timeout=60 host_check_timeout=10 event_handler_timeout=30 notification_timeout=30 ocsp_timeout=5 ochp_timeout=5 perfdata_timeout=5 process_performance_data=1 host_perfdata_command=107 service_perfdata_command=process-service-perfdata host_perfdata_file=/usr/local/pnp4nagios/var/host-perfdata service_perfdata_file=/usr/local/pnp4nagios/var/service-perfdata host_perfdata_file_template=DATATYPE::HOSTPERFDATA TIMET::$TIMET$ HOSTNAME::$HOSTNAME$ HOSTPERFDATA::$HOSTPERFDATA$ HOSTCHECKCOMMAND::$HOSTCHECKCOMMAND$ HOSTSTATE::$HOSTSTATE$ HOSTSTATETYPE::$HOSTSTATETYPE$ service_perfdata_file_template=DATATYPE::SERVICEPERFDATA TIMET::$TIMET$ HOSTNAME::$HOSTNAME$ SERVICEDESC::$SERVICEDESC$SERVICEPERFDATA::$SERVICEPERFDATA$ SERVICECHECKCOMMAND::$SERVICECHECKCOMMAND$ HOSTSTATE::$HOSTSTATE$ HOSTSTATETYPE::$HOSTSTATETYPE$ SERVICESTATE::$SERVICESTATE$ SERVICESTATETYPE::$SERVICESTATETYPE$ host_perfdata_file_mode=a service_perfdata_file_mode=a host_perfdata_file_processing_interval=30 service_perfdata_file_processing_interval=30 host_perfdata_file_processing_command=process-host-perfdata-file service_perfdata_file_processing_command=process-service-perfdata-file check_service_freshness=1 date_format=euro illegal_object_name_chars=~!$%^&*"|'<>?,()= illegal_macro_output_chars=`~$^&"|'<> admin_email=admin admin_pager=admin at localhost broker_module=/usr/local/nagios/bin/ndomod-3x.o config_file=/usr/local/nagios/etc/ndomod.cfg event_broker_options=-1 use_large_installation_tweaks=1 child_processes_fork_twice=0 enable_environment_macros=0 debug_file=/usr/local/centreon/log/Debug-Graphs.log debug_level=-1 max_debug_file_size=600000000 check_result_reaper_frequency=10 max_check_result_reaper_time=20 Regards HongRui Wang Mail:wwanghongrui at cebbank.com 2010-06-30 wwanghongrui 2010-06-30 ???? Andreas Ericsson ????? 2010-06-29 20:24:12 ???? wwanghongrui; Nagios Users List ??? shadih rahman ??? Re: [Nagios-users] how to fix excessive latency On 06/29/2010 03:57 AM, wwanghongrui wrote: > Thanks your reply. We are writing to mysql database by ndoutils.We don't use nsca. About external_command_buffer_slots, we don't set it up. > status_update_interval =15 > > I use vmstate to capture system performance,like below.Maybe the bottleneck is not at system. > Endeavour to not run Nagios on a virtual server. If you must use a virtual server, make very sure that your checkresult spooldirectory and status data files are on a ramdisk, or you will certainly run into trouble. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From bangers at gmail.com Wed Jun 30 07:21:54 2010 From: bangers at gmail.com (Matthew Angelo) Date: Wed, 30 Jun 2010 15:21:54 +1000 Subject: Modifying normal_check_interval breaks nagiosgrapher Message-ID: Hi, While testing and deploying Nagios I used the following service definition: max_check_attempts 1 normal_check_interval 1 retry_check_interval 1 Everything was configured and tested including nagiosgrapher (rrdtool). It passed proof and concept and we have now moved it into production. We tested this against a small set of 200 servers. Then we went to our "production" values of: max_check_attempts 3 normal_check_interval 5 retry_check_interval 1 Data collection still works, but when we try to view a graph it's basically empty data (nothing drawn) for the period in which the new normal_check_internal was changed. If we change it back to 1 from 5, it starts to display data correctly again. How do I *keep* the new normal_check_internal value, but make it work again? Googling led me to this: http://forum.centreon.com/archive/index.php/t-4506.html which essentially says nagios.cfg:normal_check_interval and nrpe.cfg:step are tied together and if you change normal_check_internal you need need to lose all your data. The existing data isn't super critical. We're happy to lose it. I tried removing ./var/rrd directory (well rename) but then all the graphs broke with 'no such device' error. What other directories do I need to delete to clear all nagiosgrapher/rrdtool data? Cheers -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From velinsky at fzu.cz Wed Jun 30 14:20:26 2010 From: velinsky at fzu.cz (Otakar Velinsky) Date: Wed, 30 Jun 2010 14:20:26 +0200 (CEST) Subject: check_ntp_time Message-ID: Dear Madams/Sirs, we use free version of your product-nagios that does not work as we need. I would like to ask you for advice regarding configuration of check_command - check_ntp_time . In line format is check_ntp_time O.K. - linux-mn52:/usr/local/nagios/etc/objects # /usr/local/nagios/libexec/check_ntp_time -H sie127.fzu.cz -w 0.5 -c 1 NTP CRITICAL: Offset 44,3129 secs|offset=44,312900s;0,000000;1,000000; or - linux-mn52:/usr/local/nagios/etc/objects # /usr/local/nagios/libexec/check_ntp_time -H sie127.fzu.cz -w 0.5 -c 60 NTP WARNING: Offset 42,12555 secs|offset=42,125550s;0,000000;60,000000; or - linux-mn52:/usr/local/nagios/etc/objects # /usr/local/nagios/libexec/check_ntp_time -H sie127.fzu.cz -w 50 -c 60 NTP OK: Offset 42,103537 secs|offset=42,103537s;50,000000;60,000000; Can you let me know the Example Definition - define service, eventually define command, define host, please? I apologize for wasting your time and would be glad of any answer. Thank you in advance. Sincerely yours, Otakar Velinsky *************************************** Institute of Physics Czech Academy of Sciences Cukrovarnick? 10/112 16200 Praha 6 Czech Republic e-mail: velinsky at fzu.cz -- -------------- next part -------------- ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From roy at karlsbakk.net Wed Jun 30 14:45:14 2010 From: roy at karlsbakk.net (Roy Sigurd Karlsbakk) Date: Wed, 30 Jun 2010 14:45:14 +0200 (CEST) Subject: wiki down? In-Reply-To: References: Message-ID: <7250942.9.1277901914254.JavaMail.root@zimbra> This is getting silly - can someone please fix that wiki, or should we all move over to Icinga and do some real work for once? ----- Original Message ----- > Bah! If you don't have an event handler that fences the misbehaving > machine at the first sign of trouble, you're not trying hard enough > ;-) > > > On Mon, Jun 28, 2010 at 8:34 AM, Max > wrote: > > On Mon, Jun 28, 2010 at 8:29 AM, Matt Simmons > > wrote: > >> If only there were some kind of software available to let us know > >> when > >> websites were down... > > > > Or people to respond to alerts from the software :) > > > > ------------------------------------------------------------------------------ > > This SF.net email is sponsored by Sprint > > What will you do first with EVO, the first 4G phone? > > Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first > > _______________________________________________ > > Nagios-users mailing list > > Nagios-users at lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when > > reporting any issue. > > ::: Messages without supporting info will risk being sent to > > /dev/null > > > > > > -- > LITTLE GIRL: But which cookie will you eat FIRST? > COOKIE MONSTER: Me think you have misconception of cookie-eating > process. > > ------------------------------------------------------------------------------ > This SF.net email is sponsored by Sprint > What will you do first with EVO, the first 4G phone? > Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null -- Vennlige hilsener / Best regards roy -- Roy Sigurd Karlsbakk (+47) 97542685 roy at karlsbakk.net http://blogg.karlsbakk.net/ -- I all pedagogikk er det essensielt at pensum presenteres intelligibelt. Det er et element?rt imperativ for alle pedagoger ? unng? eksessiv anvendelse av idiomer med fremmed opprinnelse. I de fleste tilfeller eksisterer adekvate og relevante synonymer p? norsk. ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From matgand at gmail.com Wed Jun 30 15:33:47 2010 From: matgand at gmail.com (Mattia Gandolfi) Date: Wed, 30 Jun 2010 15:33:47 +0200 Subject: Sync states in failover scenario Message-ID: Hi all, I'm running Nagios 3.2.1 on two RHEL boxes in a HA failover setup. Both the master and the slave run with execute_service_checks=1, only the master has enable_notifications=1. The slave checks the master's status every minute, and in case of error notifications get enabled. I guess a pretty standard configuration... If I manually disable notifications for a host or service, and the master goes away for wathever reason, the slave enables notifications globally, but it has no knowledge of hosts on wich notifications were disabled on the master, so I get tons of alarms for those hosts. Is there a way to sync states between the master and the slave? Thanks Cheers Mattia -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From robert.wolfe at robertwolfe.org Wed Jun 30 14:37:54 2010 From: robert.wolfe at robertwolfe.org (Robert Wolfe) Date: Wed, 30 Jun 2010 08:37:54 -0400 Subject: check_ntp_time References: Message-ID: <50BE9C7135A64147819E42376C136B262942@dc1.wolfe.local> Otakar, the plugin configuration in Nagios appears to follow the same basic command line pattern as other Nagios plugins. ____ _ _______ / __ \ | / / ___/ Robert Wolfe / /_/ / | /| / /\__ \ Robert Wolfe Software / _, _/| |/ |/ /___/ / Web : http://www.robertwolfe.org /_/ |_| |__/|__//____/ Debian Blog : http://debian.robertwolfe.org ________________________________ From: Otakar Velinsky [mailto:velinsky at fzu.cz] Sent: Wed 6/30/2010 8:20 AM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] check_ntp_time Dear Madams/Sirs, we use free version of your product-nagios that does not work as we need. I would like to ask you for advice regarding configuration of check_command - check_ntp_time . In line format is check_ntp_time O.K. - linux-mn52:/usr/local/nagios/etc/objects # /usr/local/nagios/libexec/check_ntp_time -H sie127.fzu.cz -w 0.5 -c 1 NTP CRITICAL: Offset 44,3129 secs|offset=44,312900s;0,000000;1,000000; or - linux-mn52:/usr/local/nagios/etc/objects # /usr/local/nagios/libexec/check_ntp_time -H sie127.fzu.cz -w 0.5 -c 60 NTP WARNING: Offset 42,12555 secs|offset=42,125550s;0,000000;60,000000; or - linux-mn52:/usr/local/nagios/etc/objects # /usr/local/nagios/libexec/check_ntp_time -H sie127.fzu.cz -w 50 -c 60 NTP OK: Offset 42,103537 secs|offset=42,103537s;50,000000;60,000000; Can you let me know the Example Definition - define service, eventually define command, define host, please? I apologize for wasting your time and would be glad of any answer. Thank you in advance. Sincerely yours, Otakar Velinsky *************************************** Institute of Physics Czech Academy of Sciences Cukrovarnick? 10/112 16200 Praha 6 Czech Republic e-mail: velinsky at fzu.cz -- ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From matgand at gmail.com Wed Jun 30 16:40:17 2010 From: matgand at gmail.com (Mattia Gandolfi) Date: Wed, 30 Jun 2010 16:40:17 +0200 Subject: Sync state in failover scenario Message-ID: Hi all, I'm running Nagios 3.2.1 on two RHEL boxes in a HA failover setup. Both the master and the slave run with execute_service_checks=1, only the master has enable_notifications=1. The slave checks the master's status every minute, and in case of error notifications get enabled. I guess a pretty standard configuration... If I manually disable notifications for a host or service, and the master goes away for wathever reason, the slave enables notifications globally, but it has no knowledge of hosts on wich notifications were disabled on the master, so I get tons of alarms for those hosts. Is there a way to sync states between the master and the slave? Thanks Cheers Mattia -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From zarrelli at linux.it Wed Jun 30 17:27:15 2010 From: zarrelli at linux.it (Giorgio Zarrelli) Date: Wed, 30 Jun 2010 17:27:15 +0200 Subject: Sync state in failover scenario In-Reply-To: References: Message-ID: <8062CE39-F3C4-4103-8E9F-FB7B9B3C16D9@linux.it> Use drdb? Ciao, Giorgio Il giorno 30/giu/2010, alle ore 16:40, Mattia Gandolfi ha scritto: > Hi all, > > I'm running Nagios 3.2.1 on two RHEL boxes in a HA failover setup. > Both the master and the slave run with execute_service_checks=1, only the master has enable_notifications=1. The slave checks the master's status every minute, and in case of error notifications get enabled. I guess a pretty standard configuration... > > If I manually disable notifications for a host or service, and the master goes away for wathever reason, the slave enables notifications globally, but it has no knowledge of hosts on wich notifications were disabled on the master, so I get tons of alarms for those hosts. > > Is there a way to sync states between the master and the slave? > > Thanks > > Cheers > > Mattia > ------------------------------------------------------------------------------ > This SF.net email is sponsored by Sprint > What will you do first with EVO, the first 4G phone? > Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From perldork at webwizarddesign.com Wed Jun 30 17:56:10 2010 From: perldork at webwizarddesign.com (Max) Date: Wed, 30 Jun 2010 11:56:10 -0400 Subject: wiki down? In-Reply-To: <7250942.9.1277901914254.JavaMail.root@zimbra> References: <7250942.9.1277901914254.JavaMail.root@zimbra> Message-ID: On Wed, Jun 30, 2010 at 8:45 AM, Roy Sigurd Karlsbakk wrote: > This is getting silly - can someone please fix that wiki, or should we all move over to Icinga and do some real work for once? Why would you assume that any of us are not doing real work? ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Wed Jun 30 18:03:28 2010 From: nagios at flatto.net (Assaf Flatto) Date: Wed, 30 Jun 2010 17:03:28 +0100 Subject: wiki down? In-Reply-To: References: <7250942.9.1277901914254.JavaMail.root@zimbra> Message-ID: <4C2B6AD0.6050002@flatto.net> Max wrote: > On Wed, Jun 30, 2010 at 8:45 AM, Roy Sigurd Karlsbakk wrote: > >> This is getting silly - can someone please fix that wiki, or should we all move over to Icinga and do some real work for once? >> > > Why would you assume that any of us are not doing real work? > > > Or not working on Icinga also ?? -- Never,Ever Cut A Deal With a Dragon ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mgius at createspace.com Wed Jun 30 18:23:38 2010 From: mgius at createspace.com (Gius, Mark) Date: Wed, 30 Jun 2010 12:23:38 -0400 Subject: wiki down? In-Reply-To: <7250942.9.1277901914254.JavaMail.root@zimbra> References: <7250942.9.1277901914254.JavaMail.root@zimbra> Message-ID: <23193A17705DD246AFFFDF09B779F56F2519A89769@EX-IAD6-B.ant.amazon.com> I wasn't even aware that there was a wiki until this email thread. What exactly are you missing that the wiki contains? I've never needed anything other than the docs that ship with Nagios. -Gius > -----Original Message----- > From: Roy Sigurd Karlsbakk [mailto:roy at karlsbakk.net] > Sent: Wednesday, June 30, 2010 5:45 AM > To: Nagios Users List > Subject: Re: [Nagios-users] wiki down? > > This is getting silly - can someone please fix that wiki, or should we > all move over to Icinga and do some real work for once? > > ----- Original Message ----- > > Bah! If you don't have an event handler that fences the misbehaving > > machine at the first sign of trouble, you're not trying hard enough > > ;-) > > > > > > On Mon, Jun 28, 2010 at 8:34 AM, Max > > wrote: > > > On Mon, Jun 28, 2010 at 8:29 AM, Matt Simmons > > > wrote: > > >> If only there were some kind of software available to let us know > > >> when > > >> websites were down... > > > > > > Or people to respond to alerts from the software :) > > > > > > ------------------------------------------------------------------- > ----------- > > > This SF.net email is sponsored by Sprint > > > What will you do first with EVO, the first 4G phone? > > > Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first > > > _______________________________________________ > > > Nagios-users mailing list > > > Nagios-users at lists.sourceforge.net > > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > > ::: Please include Nagios version, plugin version (-v) and OS when > > > reporting any issue. > > > ::: Messages without supporting info will risk being sent to > > > /dev/null > > > > > > > > > > > -- > > LITTLE GIRL: But which cookie will you eat FIRST? > > COOKIE MONSTER: Me think you have misconception of cookie-eating > > process. > > > > --------------------------------------------------------------------- > --------- > > This SF.net email is sponsored by Sprint > > What will you do first with EVO, the first 4G phone? > > Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first > > _______________________________________________ > > Nagios-users mailing list > > Nagios-users at lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when > > reporting any issue. > > ::: Messages without supporting info will risk being sent to > /dev/null > > -- > Vennlige hilsener / Best regards > > roy > -- > Roy Sigurd Karlsbakk > (+47) 97542685 > roy at karlsbakk.net > http://blogg.karlsbakk.net/ > -- > I all pedagogikk er det essensielt at pensum presenteres intelligibelt. > Det er et element?rt imperativ for alle pedagoger ? unng? eksessiv > anvendelse av idiomer med fremmed opprinnelse. I de fleste tilfeller > eksisterer adekvate og relevante synonymer p? norsk. > > ----------------------------------------------------------------------- > ------- > This SF.net email is sponsored by Sprint > What will you do first with EVO, the first 4G phone? > Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From tyarusso at nagios.com Wed Jun 30 18:49:43 2010 From: tyarusso at nagios.com (Tony Yarusso) Date: Wed, 30 Jun 2010 11:49:43 -0500 Subject: wiki down? In-Reply-To: <20285738.139.1277400042080.JavaMail.root@zimbra> References: <20285738.139.1277400042080.JavaMail.root@zimbra> Message-ID: <4C2B75A7.8070008@nagios.com> On 06/24/2010 12:20 PM, Roy Sigurd Karlsbakk wrote: > hi all > > seems something is rather bad with http://wiki.nagios.org/ - anyone here with access to the box? > Thanks for the note - Ethan's looking into it now. (He just got back from another trip.) -- Tony Yarusso Technical Team ___ Nagios Enterprises, LLC Email: tyarusso at nagios.com Web: www.nagios.com ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From a31modela at hotmail.com Wed Jun 30 20:12:32 2010 From: a31modela at hotmail.com (steve f) Date: Wed, 30 Jun 2010 14:12:32 -0400 Subject: Can I Configure the NRPE Plugins without check_pgsql? Message-ID: So I am installing the Nagios NRPE addon on a SuSE 8 server. When I try to do the nagios-plugins-1.4.14 configure, I error out every time for missing pg_config_manual.h If I am not running pgsql on the box, is there a way to get the plugins compiled? I know I can do the config with the --with-pgsql=DIR option to point to the correct location but how can I not install the check_pgsql plugin? Can I remove it from the plugins dir in the nagios-plugins-1.4.14/plugins dir before I do the configure? Can I ignore the error for this during the ./configure & just continue on with the configure? ( I assume when the error happens, it stops the config at that point) gcc -DLOCALEDIR=\"/usr/local/nagios/share/locale\" -DHAVE_CONFIG_H -I. -I.. -I.. -I../lib -I../gl -I../intl -I/usr/include/ldap -I/usr/include/pgsql -I/usr/include -DNP_VERSION='"1.4.14"' -g -O2 -MT check_pgsql.o -MD -MP -MF .deps/check_pgsql.Tpo -c -o check_pgsql.o check_pgsql.c check_pgsql.c:40:30: pg_config_manual.h: No such file or directory make[2]: *** [check_pgsql.o] Error 1 make[2]: Leaving directory `/home/tech/fiedle/nagios_downloads/nagios-plugins-1.4.14/plugins' make[1]: *** [all-recursive] Error 1 make[1]: Leaving directory `/home/tech/fiedle/nagios_downloads/nagios-plugins-1.4.14' make: *** [all] Error 2 Thanks, Steve _________________________________________________________________ The New Busy is not the too busy. Combine all your e-mail accounts with Hotmail. http://www.windowslive.com/campaign/thenewbusy?tile=multiaccount&ocid=PID28326::T:WLMTAGL:ON:WL:en-US:WM_HMP:042010_4 -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From darren at brucetelecom.com Wed Jun 30 20:22:01 2010 From: darren at brucetelecom.com (Darren Hill) Date: Wed, 30 Jun 2010 14:22:01 -0400 Subject: Can I Configure the NRPE Plugins without check_pgsql? In-Reply-To: References: Message-ID: <4C2B8B49.4070804@brucetelecom.com> ./configure --without-pgsql On 6/30/2010 2:12 PM, steve f wrote: > > > So I am installing the Nagios NRPE addon on a SuSE 8 server. When I > try to do the nagios-plugins-1.4.14 configure, I error out every time > for missing pg_config_manual.h > > If I am not running pgsql on the box, is there a way to get the > plugins compiled? I know I can do the config with the > --with-pgsql=DIR option to point to the correct location but how can > I not install the check_pgsql plugin? Can I remove it from the > plugins dir in the nagios-plugins-1.4.14/plugins dir before I do the > configure? > > Can I ignore the error for this during the ./configure & just continue > on with the configure? ( I assume when the error happens, it stops > the config at that point) > > gcc -DLOCALEDIR=\"/usr/local/nagios/share/locale\" -DHAVE_CONFIG_H -I. > -I.. -I.. -I../lib -I../gl -I../intl -I/usr/include/ldap > -I/usr/include/pgsql -I/usr/include -DNP_VERSION='"1.4.14"' -g -O2 > -MT check_pgsql.o -MD -MP -MF .deps/check_pgsql.Tpo -c -o > check_pgsql.o check_pgsql.c > check_pgsql.c:40:30: pg_config_manual.h: No such file or directory > make[2]: *** [check_pgsql.o] Error 1 > make[2]: Leaving directory > `/home/tech/fiedle/nagios_downloads/nagios-plugins-1.4.14/plugins' > make[1]: *** [all-recursive] Error 1 > make[1]: Leaving directory > `/home/tech/fiedle/nagios_downloads/nagios-plugins-1.4.14' > make: *** [all] Error 2 > > Thanks, > > Steve > > > ------------------------------------------------------------------------ > The New Busy is not the too busy. Combine all your e-mail accounts > with Hotmail. Get busy. > > > > > ------------------------------------------------------------------------------ > This SF.net email is sponsored by Sprint > What will you do first with EVO, the first 4G phone? > Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first > > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From a31modela at hotmail.com Wed Jun 30 20:49:54 2010 From: a31modela at hotmail.com (steve f) Date: Wed, 30 Jun 2010 14:49:54 -0400 Subject: Can I Configure the NRPE Plugins without check_pgsql? In-Reply-To: <4C2B8B49.4070804@brucetelecom.com> References: , <4C2B8B49.4070804@brucetelecom.com> Message-ID: That did it ! Thanks Darren Steve Date: Wed, 30 Jun 2010 14:22:01 -0400 From: darren at brucetelecom.com To: nagios-users at lists.sourceforge.net Subject: Re: [Nagios-users] Can I Configure the NRPE Plugins without check_pgsql? ./configure --without-pgsql On 6/30/2010 2:12 PM, steve f wrote: So I am installing the Nagios NRPE addon on a SuSE 8 server. When I try to do the nagios-plugins-1.4.14 configure, I error out every time for missing pg_config_manual.h If I am not running pgsql on the box, is there a way to get the plugins compiled? I know I can do the config with the --with-pgsql=DIR option to point to the correct location but how can I not install the check_pgsql plugin? Can I remove it from the plugins dir in the nagios-plugins-1.4.14/plugins dir before I do the configure? Can I ignore the error for this during the ./configure & just continue on with the configure? ( I assume when the error happens, it stops the config at that point) gcc -DLOCALEDIR=\"/usr/local/nagios/share/locale\" -DHAVE_CONFIG_H -I. -I.. -I.. -I../lib -I../gl -I../intl -I/usr/include/ldap -I/usr/include/pgsql -I/usr/include -DNP_VERSION='"1.4.14"' -g -O2 -MT check_pgsql.o -MD -MP -MF .deps/check_pgsql.Tpo -c -o check_pgsql.o check_pgsql.c check_pgsql.c:40:30: pg_config_manual.h: No such file or directory make[2]: *** [check_pgsql.o] Error 1 make[2]: Leaving directory `/home/tech/fiedle/nagios_downloads/nagios-plugins-1.4.14/plugins' make[1]: *** [all-recursive] Error 1 make[1]: Leaving directory `/home/tech/fiedle/nagios_downloads/nagios-plugins-1.4.14' make: *** [all] Error 2 Thanks, Steve The New Busy is not the too busy. Combine all your e-mail accounts with Hotmail. Get busy. ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null _________________________________________________________________ The New Busy is not the old busy. Search, chat and e-mail from your inbox. http://www.windowslive.com/campaign/thenewbusy?ocid=PID28326::T:WLMTAGL:ON:WL:en-US:WM_HMP:042010_3 -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.net email is sponsored by Sprint What will you do first with EVO, the first 4G phone? Visit sprint.com/first -- http://p.sf.net/sfu/sprint-com-first -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null