From yu.watanabe at jp.fujitsu.com Tue Dec 1 15:51:36 2009 From: yu.watanabe at jp.fujitsu.com (Yu Watanabe) Date: Tue, 01 Dec 2009 23:51:36 +0900 Subject: Nagios is logging "Service Check Timed Out" for certain service In-Reply-To: <765d77c80911250509y70fa57dfo6b8bdb99799efef8@mail.gmail.com> References: <765d77c80911250509y70fa57dfo6b8bdb99799efef8@mail.gmail.com> Message-ID: <200912011451.AA01287@S2007337.jp.fujitsu.com> Hello Jim. Thank you for the reply and sorry for the late reply from me. Well , my situation was using plugin that scans through the syslog file and whenever any regular expression match occurs it sends an passive check alert to nagios. Weird thing was there was existing log file but nagios plugin itself was not executed. Yu Watanabe Jim Avery ????????: >2009/11/25 Yu Watanabe : >> Hello All. >> >> I am gathering information for a problem I had experienced in nagios v2.10. >> >> I was running Nagios on RHEL 4.5. The story is that Nagios suddenly started returing "Service Check Timed Out" >> on a certain service, not on other services. The time that "Service Check Timed Out" started occuring was >> 23:13:48. >> >> Has anyone had such experience? > > >Yes. I occasionally see this behaviour for certain SNMP checks when >the SNMP daemon on the montitored host stops working properly. > >Which service is causing the problem and which ones seem to be still OK? > ********************************************************************** FUJITSU SOCIAL SCIENCE LABORATORY LIMITED(FUJITSU SSL) System Infrastructure Development Department Telecom Sstems Business Group 2 Yu Watanabe E-Mail : yu.watanabe at jp.fujitsu.com Phone : 044-739-1427 +5283 / 7166-5283 ********************************************************************** ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From tommy.mogensen at uni-c.dk Tue Dec 1 16:14:39 2009 From: tommy.mogensen at uni-c.dk (Tommy Mogensen) Date: Tue, 1 Dec 2009 16:14:39 +0100 Subject: Hardware requirements Message-ID: <0DA7049604D4BD44839DFA90DBC7C1BB635194@LGBEXCHANGE01.unic.local> Hi Nagios-experts I am looking for a free system able to monitor 3000-5000 hosts (Mainly cisco routers, switches and ap's) via ping and snmp. I would prefer to run everything on one server if possible. I have received a good deal on a machine with 16G ram, 2xSSD-disks (maybe in raid-0), and 2xIntel E5520-CPUs. I would appreciate your input regarding the performance issues should I use Nagios for this system. Is this configuration powerful enough, what are the limiting parts of the setup and are some of the pieces unnecessary? I.e. I could go for one cpu (4 cores) if nagios does not support threading to 8 cores....or is the main bottleneck that I do not run it distributed? Regards, Tommy ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From shadhin71 at gmail.com Tue Dec 1 16:44:46 2009 From: shadhin71 at gmail.com (shadih rahman) Date: Tue, 1 Dec 2009 10:44:46 -0500 Subject: Hardware requirements In-Reply-To: <0DA7049604D4BD44839DFA90DBC7C1BB635194@LGBEXCHANGE01.unic.local> References: <0DA7049604D4BD44839DFA90DBC7C1BB635194@LGBEXCHANGE01.unic.local> Message-ID: <6db4a4200912010744g64ebf99fx60829444b05ffa20@mail.gmail.com> I have a quad core 2GHZ and 4 GB memory machine and I am getting the following performance. Service Check Execution Time:0.00 / 60.01 / 0.381 sec Service Check Latency:0.00 / 15.93 / 0.156 sec Host Check Execution Time:0.01 / 30.01 / 0.583 sec Host Check Latency:0.00 / 712.08 / 1.051 sec I have ndoUtils running in the background with long retention time. I have no performance issue. I had seen some issue in the past but that had to do with configuration issue. Please read the performance tuning section of the Nagios doc. Thanks On Tue, Dec 1, 2009 at 10:14 AM, Tommy Mogensen wrote: > Hi Nagios-experts > > I am looking for a free system able to monitor 3000-5000 hosts (Mainly > cisco routers, switches and ap's) via ping and snmp. I would prefer to > run everything on one server if possible. > > I have received a good deal on a machine with 16G ram, 2xSSD-disks > (maybe in raid-0), and 2xIntel E5520-CPUs. I would appreciate your input > regarding the performance issues should I use Nagios for this system. > > Is this configuration powerful enough, what are the limiting parts of > the setup and are some of the pieces unnecessary? I.e. I could go for > one cpu (4 cores) if nagios does not support threading to 8 cores....or > is the main bottleneck that I do not run it distributed? > > Regards, > Tommy > > > > ------------------------------------------------------------------------------ > Join us December 9, 2009 for the Red Hat Virtual Experience, > a free event focused on virtualization and cloud computing. > Attend in-depth sessions from your desk. Your couch. Anywhere. > http://p.sf.net/sfu/redhat-sfdev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Cordially, Shadhin Rahman -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Steve.Onotsky at broadridge.com Tue Dec 1 16:38:14 2009 From: Steve.Onotsky at broadridge.com (Onotsky, Steve x55328) Date: Tue, 1 Dec 2009 10:38:14 -0500 Subject: Hardware requirements In-Reply-To: <0DA7049604D4BD44839DFA90DBC7C1BB635194@LGBEXCHANGE01.unic.local> References: <0DA7049604D4BD44839DFA90DBC7C1BB635194@LGBEXCHANGE01.unic.local> Message-ID: <30A417B62E7EE448B3864ADC881CEFD60AB3B252@missemsa01.bsg.ad.adp.com> > -----Original Message----- > From: Tommy Mogensen [mailto:tommy.mogensen at uni-c.dk] > Sent: December 1, 2009 10:15 > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] Hardware requirements > > > I have received a good deal on a machine with 16G ram, 2xSSD-disks > (maybe in raid-0), and 2xIntel E5520-CPUs. I would appreciate your input > regarding the performance issues should I use Nagios for this system. I would strongly recommend that you not put Nagios on the SSDs; the amount of writes/deletes would burn up the flash cells on the drives very quickly, especially with the number of hosts that you plan to monitor. For the OS, yes, I recommend SSDs (just not swap space), but for applications that tend to change data frequently, I recommend you get a couple of platter drives. Cheers, Steve. This message and any attachments are intended only for the use of the addressee and may contain information that is privileged and confidential. If the reader of the message is not the intended recipient or an authorized representative of the intended recipient, you are hereby notified that any dissemination of this communication is strictly prohibited. If you have received this communication in error, please notify us immediately by e-mail and delete the message and any attachments from your system. ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jim at jimavery.me.uk Tue Dec 1 17:59:49 2009 From: jim at jimavery.me.uk (Jim Avery) Date: Tue, 1 Dec 2009 16:59:49 +0000 Subject: Nagios is logging "Service Check Timed Out" for certain service In-Reply-To: <200912011451.AA01287@S2007337.jp.fujitsu.com> References: <765d77c80911250509y70fa57dfo6b8bdb99799efef8@mail.gmail.com> <200912011451.AA01287@S2007337.jp.fujitsu.com> Message-ID: <765d77c80912010859w11a4d758iec8b8e810c65112f@mail.gmail.com> 2009/12/1 Yu Watanabe : > Hello Jim. > > Thank you for the reply and sorry for the late reply from me. > > Well , my situation was using plugin that scans through the syslog file and > whenever any regular expression match occurs it sends an passive check alert to nagios. > > Weird thing was there was existing log file but nagios plugin itself > was not executed. > > Yu Watanabe Is your problem solved now? If not, the first thing which I would mention is that if it is a passive check you almost certainly want to disable active checks for that service in Nagios. Cheers, Jim ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mitsuto at gmail.com Tue Dec 1 20:13:31 2009 From: mitsuto at gmail.com (Marcel) Date: Tue, 1 Dec 2009 17:13:31 -0200 Subject: check_http checking with verbose output breaks performance data Message-ID: <2dfcbd1b0912011113y4bc96679v5449ec9d94413cf0@mail.gmail.com> I've had a need of getting the full output from plugins to appear at the extended status information page: http://path.to.nagios/nagios/cgi-bin/extinfo.cgi?type=2&host=host_name&service=service_description Then I've realized that performance data is being mangled by this verbose command_line. Is this a bug? Or I should not use verbose output for check_http in production? Current Status: OK (for 0d 0h 0m 47s) Status Information:GET /mfa/ad/script/ User-Agent: check_http/v2053 (nagios-plugins 1.4.13) Connection: close Host: ad.adnetwork.com.br http://174.132.205.30:80/mfa/ad/script/ Performance Data:s=101|c=120|p=2|br=1|? HTTP/1.0 s=101|c=120|p=2|br=1|? is 664 characters STATUS: HTTP/1.1 200 OK **** HEADER **** Date: Tue, 01 Dec 2009 19:06:45 GMT Server: Resin/2.1.17 P3P: CP=" PSA CONo OUR ONL NOI BUS", policyref="/w3c/p3p.xml" Pragma: no-cache Cache-Control: no-cache Expires: 0 Content-Type: text/html Set-Cookie: mfa=24292751; path=/; expires=Sun, 30-Nov-2014 19:06:45 GMT Set-Cookie: limpa=1; path=/; expires=Wed, 02-Dec-2009 02:06:45 GMT Set-Cookie: c120=770|101|-999|1259694406260; path=/; expires=Sun, 30-Nov-2014 19:06:45 GMT Connection: close **** CONTENT **** document.write('>'); HTTP OK: Status line output matched "HTTP/1.1 200" HTTP OK HTTP/1.1 200 OK - 0.074 second response time |time=0.074327s;1.000000;5.000000;0.000000 size=664B;;;0 Current Attempt:1/3 (HARD state) Last Check Time:01-12-2009 17:06:46 Check Type:ACTIVE Check Latency / Duration:0.196 / 0.086 seconds Next Scheduled Check: 01-12-2009 17:11:46 Last State Change:01-12-2009 17:06:46 Last Notification:N/A (notification 0) Is This Service Flapping?N/A In Scheduled Downtime? NO Last Update:01-12-2009 17:07:28 ( 0d 0h 0m 5s ago) -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mitsuto at gmail.com Tue Dec 1 20:18:31 2009 From: mitsuto at gmail.com (Marcel) Date: Tue, 1 Dec 2009 17:18:31 -0200 Subject: check_http checking with verbose output breaks performance data In-Reply-To: <2dfcbd1b0912011113y4bc96679v5449ec9d94413cf0@mail.gmail.com> References: <2dfcbd1b0912011113y4bc96679v5449ec9d94413cf0@mail.gmail.com> Message-ID: <2dfcbd1b0912011118s4e6e0ab5h2e0361c4e6de1c1d@mail.gmail.com> sorry for replying my own post. This particular application has it's query string separator as "|", then the mangled perfdata. Is it possible to perfdata to parses only the last segment separated by "|" ? Thanks, On Tue, Dec 1, 2009 at 5:13 PM, Marcel wrote: > I've had a need of getting the full output from plugins to appear at the > extended status information page: > > > http://path.to.nagios/nagios/cgi-bin/extinfo.cgi?type=2&host=host_name&service=service_description > > Then I've realized that performance data is being mangled by this verbose > command_line. > > Is this a bug? Or I should not use verbose output for check_http in > production? > > > Current Status: > OK > (for 0d 0h 0m 47s) Status Information:GET /mfa/ad/script/ > User-Agent: check_http/v2053 (nagios-plugins 1.4.13) > Connection: close > Host: ad.adnetwork.com.br > > > http://174.132.205.30:80/mfa/ad/script/ Performance Data:s=101|c=120|p=2|br=1|? > HTTP/1.0 s=101|c=120|p=2|br=1|? is 664 characters STATUS: HTTP/1.1 200 OK > **** HEADER **** Date: Tue, 01 Dec 2009 19:06:45 GMT Server: Resin/2.1.17 > P3P: CP=" PSA CONo OUR ONL NOI BUS", policyref="/w3c/p3p.xml" Pragma: > no-cache Cache-Control: no-cache Expires: 0 Content-Type: text/html > Set-Cookie: mfa=24292751; path=/; expires=Sun, 30-Nov-2014 19:06:45 GMT > Set-Cookie: limpa=1; path=/; expires=Wed, 02-Dec-2009 02:06:45 GMT > Set-Cookie: c120=770|101|-999|1259694406260; path=/; expires=Sun, > 30-Nov-2014 19:06:45 GMT Connection: close **** CONTENT **** > document.write(' target=_blank> src=http://www.adserver.com.br/120/bt_mfa.gif border=0 alt=>'); HTTP > OK: Status line output matched "HTTP/1.1 200" HTTP OK HTTP/1.1 200 OK - > 0.074 second response time |time=0.074327s;1.000000;5.000000;0.000000 > size=664B;;;0 Current Attempt:1/3 (HARD state) Last Check Time:01-12-2009 > 17:06:46 Check Type:ACTIVE Check Latency / Duration:0.196 / 0.086 seconds Next > Scheduled Check: 01-12-2009 17:11:46 Last State Change:01-12-2009 > 17:06:46 Last Notification:N/A (notification 0) Is This Service Flapping? > N/A In Scheduled Downtime? > NO > Last Update:01-12-2009 17:07:28 ( 0d 0h 0m 5s ago) > > > > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From akpgeek at gmail.com Tue Dec 1 22:17:42 2009 From: akpgeek at gmail.com (akp geek) Date: Tue, 1 Dec 2009 16:17:42 -0500 Subject: Nagios email notifications Message-ID: <2024a9fb0912011317s220f744fya4e15c0df38990a7@mail.gmail.com> Dear All - I have installed nagios and it has been working fine. I am getting email notifications also. But I would like to edit those notifications and customize it. In the emails that I receive now, I don't get any subject in the email. Can you please point me / guide me? Regards -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mitsuto at gmail.com Tue Dec 1 23:05:40 2009 From: mitsuto at gmail.com (Marcel) Date: Tue, 1 Dec 2009 20:05:40 -0200 Subject: Nagios email notifications In-Reply-To: <2024a9fb0912011317s220f744fya4e15c0df38990a7@mail.gmail.com> References: <2024a9fb0912011317s220f744fya4e15c0df38990a7@mail.gmail.com> Message-ID: <2dfcbd1b0912011405j763ed849mffcff4d811073203@mail.gmail.com> http://nagios.sourceforge.net/docs/3_0/macros.html >From this document you should be able to tweak your notification commands to send customized email notifications. Cheers, On Tue, Dec 1, 2009 at 7:17 PM, akp geek wrote: > Dear All - > > I have installed nagios and it has been working fine. I am > getting email notifications also. But I would like to edit those > notifications and customize it. In the emails that I receive now, I don't > get any subject in the email. Can you please point me / guide me? > > Regards > > > ------------------------------------------------------------------------------ > Join us December 9, 2009 for the Red Hat Virtual Experience, > a free event focused on virtualization and cloud computing. > Attend in-depth sessions from your desk. Your couch. Anywhere. > http://p.sf.net/sfu/redhat-sfdev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mmelin at gmail.com Tue Dec 1 23:18:08 2009 From: mmelin at gmail.com (Martin Melin) Date: Tue, 1 Dec 2009 23:18:08 +0100 Subject: Nagios email notifications In-Reply-To: <2024a9fb0912011317s220f744fya4e15c0df38990a7@mail.gmail.com> References: <2024a9fb0912011317s220f744fya4e15c0df38990a7@mail.gmail.com> Message-ID: The command definition for your notification command is where you want to look. It probably pipes a big echo into mail, if you add an -s switch to the mail command you can set a subject. Regards, Martin Melin On Tue, Dec 1, 2009 at 10:17 PM, akp geek wrote: > Dear All - > > I have installed nagios and it has been working fine. I am > getting email notifications also. But I would like to edit those > notifications and customize it. In the emails that I receive now, I don't > get any subject in the email. Can you please point me / guide me? > > Regards > > > ------------------------------------------------------------------------------ > Join us December 9, 2009 for the Red Hat Virtual Experience, > a free event focused on virtualization and cloud computing. > Attend in-depth sessions from your desk. Your couch. Anywhere. > http://p.sf.net/sfu/redhat-sfdev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From yu.watanabe at jp.fujitsu.com Wed Dec 2 03:03:02 2009 From: yu.watanabe at jp.fujitsu.com (Yu Watanabe) Date: Wed, 02 Dec 2009 11:03:02 +0900 Subject: Nagios is logging "Service Check Timed Out" for certain service In-Reply-To: <765d77c80912010859w11a4d758iec8b8e810c65112f@mail.gmail.com> References: <765d77c80912010859w11a4d758iec8b8e810c65112f@mail.gmail.com> Message-ID: <200912020203.AA01289@S2007337.jp.fujitsu.com> Hello Jim Uh, I'm afraid the problem isn't solved yet. First of all , my explanation wasn't enough. The sequence of the syslog checking was the following: 1. Scan through the syslog file (Active Check) 2. If any error exist, transfer to passive service in Nagios (Passive Check) So I have defined 2 services in Nagios , 1 Active service check and 1 Passive service check. However the problem was above 1 wasn't executed from Nagios. Yu Watanabe Jim Avery ????????: >2009/12/1 Yu Watanabe : >> Hello Jim. >> >> Thank you for the reply and sorry for the late reply from me. >> >> Well , my situation was using plugin that scans through the syslog file and >> whenever any regular expression match occurs it sends an passive check alert to nagios. >> >> Weird thing was there was existing log file but nagios plugin itself >> was not executed. >> >> Yu Watanabe > >Is your problem solved now? If not, the first thing which I would >mention is that if it is a passive check you almost certainly want to >disable active checks for that service in Nagios. > >Cheers, > >Jim > ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From juki.emma at gmail.com Wed Dec 2 06:54:00 2009 From: juki.emma at gmail.com (Juki) Date: Wed, 2 Dec 2009 08:54:00 +0300 Subject: Integrate Nagios with Netcool Message-ID: <7545d7d20912012154h50e3491dx38df8a016ced858e@mail.gmail.com> Hello people, I have a Nagios (v3.2.0) installation running on openSUSE 11.1 with nagio-plugins (v1.4.14) and nrpe (v2.12) on the target hosts. I would like to integrate Nagios with Netcool so that Nagios alerts can be forwarded to Netcool. I have read somewhere that its possible to do so by sending SNMP traps (when an alarm occurs) to Netcool using net-SNMP, however, I do not know how to achieve this. Anyone on the list that has implemented this before? Or some kind of a step-by-step guide that I could make use of? Thanks in advance. Juki -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lists at cloned.org.uk Wed Dec 2 10:38:47 2009 From: lists at cloned.org.uk (john) Date: Wed, 2 Dec 2009 09:38:47 +0000 (GMT) Subject: nagios3 only doing one check attempt In-Reply-To: <65F22868-33D9-4DF0-B78D-75EF346FB2A7@ena.com> References: <65F22868-33D9-4DF0-B78D-75EF346FB2A7@ena.com> Message-ID: On Mon, 30 Nov 2009, Marc Powell wrote: > On Nov 30, 2009, at 12:05 PM, john wrote: > >> normal_check_interval and retry_check_interval were renamed to >> check_interval and retry_interval respectively. Now to go and update all >> my config files :/ > > They're interchangeable at this point. Either will produce the exact same results. > > else if(!strcmp(variable,"check_interval") || !strcmp(variable,"normal_check_interval")){ > temp_host->check_interval=strtod(value,NULL); > temp_host->have_check_interval=TRUE; > } > else if(!strcmp(variable,"retry_interval") || !strcmp(variable,"retry_check_interval")){ > temp_host->retry_interval=strtod(value,NULL); > temp_host->have_retry_interval=TRUE; > } You are right. I'm currently changing the config one line at a time to see what else I added fixed it so I can do all the other services! john ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From juki.emma at gmail.com Wed Dec 2 14:46:14 2009 From: juki.emma at gmail.com (Juki) Date: Wed, 2 Dec 2009 16:46:14 +0300 Subject: Exact same swap + root / partition info showing up for different hosts(?) In-Reply-To: <0B28F2D0-1F1C-48EB-A37F-596C2C1E78C6@ena.com> References: <208830.99896.qm@web31904.mail.mud.yahoo.com> <0B28F2D0-1F1C-48EB-A37F-596C2C1E78C6@ena.com> Message-ID: <7545d7d20912020546r5086b111o982a5f4d5b6982d7@mail.gmail.com> Hi list, I am facing a similar problem as posted by Bill. I'm running Nagios-3.2.0 on openSUSE 11.1 and monitoring a number of hosts using NRPE v2.12. When I look at the web GUI, it reports/shows the exact stats about disk size/usage, no. of users, swap usage etc.. for the monitored hosts (target machines) and the localhost (Nagios server). The check_disk plugin is installed and running on the target (remote) hosts. I also tested this on each of the remote hosts and it returns the right results as shown; *bash-3.00$ /usr/local/nagios/libexec/check_disk /var/opt/BGw/Server1 DISK OK - free space: /var/opt/BGw/Server1 41643 MB (80% inode=99%);| /var/opt/BGw/Server1=10192MB;;;0;54272* However, from the web GUI, it shows the exact stats as those of the localhost. This means it is reporting the localhosts's statistics and not those of the remote (target) hosts. Anyone with an idea about how to go about this? Regards, Juki -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From pangrazi at gmail.com Wed Dec 2 15:06:06 2009 From: pangrazi at gmail.com (Greg Pangrazio) Date: Wed, 2 Dec 2009 08:06:06 -0600 Subject: Exact same swap + root / partition info showing up for different hosts(?) In-Reply-To: <7545d7d20912020546r5086b111o982a5f4d5b6982d7@mail.gmail.com> References: <208830.99896.qm@web31904.mail.mud.yahoo.com> <0B28F2D0-1F1C-48EB-A37F-596C2C1E78C6@ena.com> <7545d7d20912020546r5086b111o982a5f4d5b6982d7@mail.gmail.com> Message-ID: Without your config file I cannot be sure, but likely the services are either using the wrong command ie not the nrpe version, or your check command is not configured properly.. Can you provide your host and service definitions as well as the command configuration for the commands you are using? Greg Pangrazio pangrazi at gmail.com On Wed, Dec 2, 2009 at 7:46 AM, Juki wrote: > Hi list, > > I am facing a similar problem as posted by Bill. > > I'm running Nagios-3.2.0 on openSUSE 11.1 and monitoring a number of hosts > using NRPE v2.12. When I look at the web GUI, it reports/shows > the exact stats about disk size/usage, no. of users, swap usage etc.. for > the monitored hosts (target machines) and the localhost (Nagios server). > > The check_disk plugin is installed and running on the target (remote) hosts. > I also tested this on each of the remote hosts and it returns the > right results as shown; > > bash-3.00$ /usr/local/nagios/libexec/check_disk /var/opt/BGw/Server1 > DISK OK - free space: /var/opt/BGw/Server1 41643 MB (80% inode=99%);| > /var/opt/BGw/Server1=10192MB;;;0;54272 > > > However, from the web GUI, it shows the exact stats as those of the > localhost. This means it is reporting the localhosts's statistics and not > those > of the remote (target) hosts. > > Anyone with an idea about how to go about this? > > > Regards, > Juki > > > > ------------------------------------------------------------------------------ > Join us December 9, 2009 for the Red Hat Virtual Experience, > a free event focused on virtualization and cloud computing. > Attend in-depth sessions from your desk. Your couch. Anywhere. > http://p.sf.net/sfu/redhat-sfdev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From eemerson at safecore.com Wed Dec 2 14:46:59 2009 From: eemerson at safecore.com (Eric Emerson) Date: Wed, 2 Dec 2009 08:46:59 -0500 Subject: Time Conversion Bug (Was: no email notifications sent) In-Reply-To: <46D85130-C9D5-4871-B622-D2F11FDB5311@opsera.com> References: <1228334472.5351.0@antares> <46D85130-C9D5-4871-B622-D2F11FDB5311@opsera.com> Message-ID: Hi, Have not seen anything on this in a bit, is there patch available for this (Version 3.1.2)? I would hate to hit this again next year :) Take it Easy Eric On Tue, Nov 3, 2009 at 8:09 AM, Ton Voon wrote: > Hi Albrecht, > > On 3 Dec 2008, at 20:01, Albrecht Dre? wrote: > > > Am 01.12.08 21:59 schrieb(en) Albrecht Dre?: > >> I have a self-compiled nagios 3.0.5 running on a 64-bit Xeon box > >> with Ubuntu 8.04 LTS. Everything went perfectly (including > >> notifications) until I upgraded from 3.0.1 (iirc) to 3.0.5 during a > >> bigger service downtime (inter alia shifting the box into an other > >> network etc). > > > > After adding tons of debug messages, I was finally able to track > > down the problem. The problem is that I added a german holiday in > > the form > > > > october 3 00:00-00:00 > > > > In function (all inbase/utils.c) check_time_against_period(), the > > start time was calculated by calculate_time_from_day_of_month() as > > 1222988400 (Fri Oct 3 01:00:00 2008), but the end time as > > 1222984800 (Fri Oct 3 00:00:00 2008), which is /before/ the start, > > and therefore the time span was expanded to a whole year, i.e. the > > end now was Oct 3, 2009... > > > > Looking deeper into function calculate_time_from_day_of_month(), > > this is apparently caused by a wrong usage of the call to mktime(), > > as the field tm_isdst is /not/ initialised properly. The observed 1 > > hour offset seems to come from an earlier call to this function > > which returned an active dst flag. > > > > According to the IEEE Std 1003.1 [1], the field tm_isdst /is/ used > > in the time conversion, and should be set to -1 if the dst status is > > unknown (you may want to write a small test app, setting this flag > > to -1, 0 or 1 to observe the effect). As mktime() is used > > frequently in the code without properly initialising this field, > > this seems to be a systematic bug which can trigger interesting > > effects in nagios. > > > > Any opinions? > > As you are probably aware, this change has caused the DST bug that has > affected a lot of users of Nagios 3.2.0. > > I have only reverted one instance of the tm_isdst field, which was > specifically causing the rescheduling problem. There is a testcase in > HEAD ( download from > http://nagios.sourceforge.net/download/cvs/nagios-HEAD.tar.gz > && ./configure --enable-libtap && make all && make test) which tests > the DST bug (see t-tap/check_timeperiods.c). > > However, I think some of the other isdst settings maybe causing other > bugs, though I'm not sure. > > It would be helpful if you could add some test for the problem you > have fixed, to prove that the functionality you expect continues to > work. I'd be happy to apply to CVS if you have an updated > check_timeperiods.c file. > > Ton > > > > ------------------------------------------------------------------------------ > Come build with us! The BlackBerry(R) Developer Conference in SF, CA > is the only developer event you need to attend this year. Jumpstart your > developing skills, take BlackBerry mobile applications to market and stay > ahead of the curve. Join us from November 9 - 12, 2009. Register now! > http://p.sf.net/sfu/devconference > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From juki.emma at gmail.com Wed Dec 2 15:21:04 2009 From: juki.emma at gmail.com (Juki) Date: Wed, 2 Dec 2009 17:21:04 +0300 Subject: Exact same swap + root / partition info showing up for different hosts(?) In-Reply-To: References: <208830.99896.qm@web31904.mail.mud.yahoo.com> <0B28F2D0-1F1C-48EB-A37F-596C2C1E78C6@ena.com> <7545d7d20912020546r5086b111o982a5f4d5b6982d7@mail.gmail.com> Message-ID: <7545d7d20912020621n231f1257k5281c5fcd4db941a@mail.gmail.com> Hi Greg, My configuration is as below; *For host definition;* define host{ host_name emm4 use generic-host alias Old BGW address 10.151.28.32 check_command check-host-alive check_period 24x7 check_interval 5 contact_groups admins retry_interval 1 max_check_attempts 10 notification_interval 120 notification_period 24x7 notification_options d,u,r } *For service definitions;* define service{ use generic-service host_name emm4 service_description Users is_volatile 0 check_period 24x7 max_check_attempts 3 normal_check_interval 3 retry_check_interval 1 contact_groups admins notification_interval 120 notification_period 24x7 notification_options w,u,c,r check_command check_users!5!10 } define service{ use generic-service host_name emm4 service_description Load is_volatile 0 check_period 24x7 max_check_attempts 3 normal_check_interval 3 retry_check_interval 1 contact_groups admins notification_interval 120 notification_period 24x7 notification_options w,u,c,r check_command check_load!5.0,4.0,3.0!10.0,6.0,4.0 } define service{ use generic-service host_name emm4 service_description Total Processes is_volatile 0 check_period 24x7 max_check_attempts 3 normal_check_interval 3 retry_check_interval 1 contact_groups admins notification_interval 120 notification_period 24x7 notification_options w,u,c,r check_command check_procs!250!400!RSZDT } define service{ use generic-service host_name emm4 service_description Disk Usage is_volatile 0 check_period 24x7 max_check_attempts 3 normal_check_interval 3 retry_check_interval 1 contact_groups admins notification_interval 120 notification_period 24x7 notification_options w,u,c,r check_command check_disk!/ } *And finally the command definitions; * # 'check_users' command definition define command{ command_name check_users command_line $USER1$/check_users -w $ARG1$ -c $ARG2$ } # 'check_load' command definition define command{ command_name check_load command_line $USER1$/check_load -w $ARG1$ -c $ARG2$ } # 'check_procs' command definition define command{ command_name check_procs command_line $USER1$/check_procs -w $ARG1$ -c $ARG2$ -s $ARG3$ } # 'check_disk' command definition define command{ command_name check_disk command_line $USER1$/check_disk -w $ARG1$ -c $ARG2$ -p $ARG3$ } Regards, Juki 2009/12/2 Greg Pangrazio > Without your config file I cannot be sure, but likely the services are > either using the wrong command ie not the nrpe version, or your check > command is not configured properly.. > > Can you provide your host and service definitions as well as the > command configuration for the commands you are using? > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From pangrazi at gmail.com Wed Dec 2 15:30:48 2009 From: pangrazi at gmail.com (Greg Pangrazio) Date: Wed, 2 Dec 2009 08:30:48 -0600 Subject: Exact same swap + root / partition info showing up for different hosts(?) In-Reply-To: <7545d7d20912020621n231f1257k5281c5fcd4db941a@mail.gmail.com> References: <208830.99896.qm@web31904.mail.mud.yahoo.com> <0B28F2D0-1F1C-48EB-A37F-596C2C1E78C6@ena.com> <7545d7d20912020546r5086b111o982a5f4d5b6982d7@mail.gmail.com> <7545d7d20912020621n231f1257k5281c5fcd4db941a@mail.gmail.com> Message-ID: The commands as you have them defined execute the check commands on the local machine. First you need to verify that in the nrpe config on the remote hosts you have the check users command defined. Then you need to define a command on the monitoring station similar to define command{ command_name check_users_nrpe; command_line $USER1$/check_nrpe -H $HOSTADDRESS$ -c check_users } That is why you are getting the results for the local machine when it is run via nagios. The NRPE docs are located here http://nagios.sourceforge.net/docs/nrpe/NRPE.pdf Greg Pangrazio pangrazi at gmail.com On Wed, Dec 2, 2009 at 8:21 AM, Juki wrote: > Hi Greg, > > My configuration is as below; > > For host definition; > > define host{ > ??????? host_name?????????????? emm4 > ??????? use???????????????????? generic-host > ??????? alias?????????????????? Old BGW > ??????? address???????????????? 10.151.28.32 > ??????? check_command?????????? check-host-alive > ??????? check_period??????????? 24x7 > ??????? check_interval????????? 5 > ??????? contact_groups????????? admins > ??????? retry_interval????????? 1 > ??????? max_check_attempts????? 10 > ??????? notification_interval?? 120 > ??????? notification_period???? 24x7 > ??????? notification_options??? d,u,r > ??????? } > > > For service definitions; > > define service{ > ??????? use???????????????????????????? generic-service > > ??????? host_name?????????????????????? emm4 > ??????? service_description???????????? Users > ??????? is_volatile???????????????????? 0 > ??????? check_period??????????????????? 24x7 > ??????? max_check_attempts????????????? 3 > ??????? normal_check_interval?????????? 3 > ??????? retry_check_interval??????????? 1 > ??????? contact_groups????????????????? admins > ??????? notification_interval?????????? 120 > ??????? notification_period???????????? 24x7 > ??????? notification_options??????????? w,u,c,r > ??????? check_command?????????????????? check_users!5!10 > ??????? } > > define service{ > ??????? use???????????????????????????? generic-service > > ??????? host_name?????????????????????? emm4 > ??????? service_description???????????? Load > ??????? is_volatile???????????????????? 0 > ??????? check_period??????????????????? 24x7 > ??????? max_check_attempts????????????? 3 > ??????? normal_check_interval?????????? 3 > ??????? retry_check_interval??????????? 1 > ??????? contact_groups????????????????? admins > ??????? notification_interval?????????? 120 > ??????? notification_period???????????? 24x7 > ??????? notification_options??????????? w,u,c,r > ??????? check_command?????????????????? check_load!5.0,4.0,3.0!10.0,6.0,4.0 > ??????? } > > define service{ > ??????? use???????????????????????????? generic-service > > ??????? host_name?????????????????????? emm4 > ??????? service_description???????????? Total Processes > ??????? is_volatile???????????????????? 0 > ??????? check_period??????????????????? 24x7 > ??????? max_check_attempts????????????? 3 > ??????? normal_check_interval?????????? 3 > ??????? retry_check_interval??????????? 1 > ??????? contact_groups????????????????? admins > ??????? notification_interval?????????? 120 > ??????? notification_period???????????? 24x7 > ??????? notification_options??????????? w,u,c,r > ??????? check_command?????????????????? check_procs!250!400!RSZDT > ??????? } > > define service{ > ??????? use???????????????????????????? generic-service > > ??????? host_name?????????????????????? emm4 > ??????? service_description???????????? Disk Usage > ??????? is_volatile???????????????????? 0 > ??????? check_period??????????????????? 24x7 > ??????? max_check_attempts????????????? 3 > ??????? normal_check_interval?????????? 3 > ??????? retry_check_interval??????????? 1 > ??????? contact_groups????????????????? admins > ??????? notification_interval?????????? 120 > ??????? notification_period???????????? 24x7 > ??????? notification_options??????????? w,u,c,r > ??????? check_command?????????????????? check_disk!/ > ??????? } > > > And finally the command definitions; > > > # 'check_users' command definition > define command{ > ??????? command_name??? check_users > ??????? command_line??? $USER1$/check_users -w $ARG1$ -c $ARG2$ > ??????? } > > # 'check_load' command definition > define command{ > ??????? command_name??? check_load > ??????? command_line??? $USER1$/check_load -w $ARG1$ -c $ARG2$ > ??????? } > > # 'check_procs' command definition > define command{ > ??????? command_name??? check_procs > ??????? command_line??? $USER1$/check_procs -w $ARG1$ -c $ARG2$ -s $ARG3$ > ??????? } > > # 'check_disk' command definition > define command{ > ??????? command_name??? check_disk > ??????? command_line??? $USER1$/check_disk -w $ARG1$ -c $ARG2$ -p $ARG3$ > ??????? } > > > Regards, > Juki > > > > 2009/12/2 Greg Pangrazio >> >> Without your config file I cannot be sure, but likely the services are >> either using the wrong command ie not the nrpe version, or your check >> command is not configured properly.. >> >> Can you provide your host and service definitions as well as the >> command configuration for the commands you are using? > > > > ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From subscription at kkeane.com Wed Dec 2 15:44:44 2009 From: subscription at kkeane.com (Kevin Keane) Date: Wed, 2 Dec 2009 06:44:44 -0800 Subject: Exact same swap + root / partition info showing up for different hosts(?) In-Reply-To: References: <208830.99896.qm@web31904.mail.mud.yahoo.com> <0B28F2D0-1F1C-48EB-A37F-596C2C1E78C6@ena.com> <7545d7d20912020546r5086b111o982a5f4d5b6982d7@mail.gmail.com> <7545d7d20912020621n231f1257k5281c5fcd4db941a@mail.gmail.com> Message-ID: <724C3B2F1C5EB44D9108E471EC5E99332F2F135D35@akechi-denki.ad.nctechcenter.com> In addition, I notice a mismatch in the Disk service with the corresponding check_disk command definition. The check_disk command is configured to require three arguments, but the service only supplies a single argument (and a wrong one at that - the first argument should be a warning level, not a directory). In the command definition, you see the arguments as $ARG1$, $ARG2$ and $ARG3$. In the service, these correspond to whatever you find between the ! characters. So the actual command that gets called is: check_disk -w / -c -p where in reality you would want something like check_disk -w 80 -c 90 -p / (Off the top of my head, I'm not sure if the warning and critical levels refer to percent free or percent used, so the numbers might be wrong). > -----Original Message----- > From: Greg Pangrazio [mailto:pangrazi at gmail.com] > Sent: Wednesday, December 02, 2009 6:31 AM > To: Juki > Cc: Nagios > Subject: Re: [Nagios-users] Exact same swap + root / partition info > showing up for different hosts(?) > > The commands as you have them defined execute the check commands on > the local machine. First you need to verify that in the nrpe config > on the remote hosts you have the check users command defined. Then > you need to define a command on the monitoring station similar to > > define command{ > command_name check_users_nrpe; > command_line $USER1$/check_nrpe -H $HOSTADDRESS$ -c check_users > } > > That is why you are getting the results for the local machine when it > is run via nagios. The NRPE docs are located here > http://nagios.sourceforge.net/docs/nrpe/NRPE.pdf > > Greg Pangrazio > pangrazi at gmail.com > > > > > > On Wed, Dec 2, 2009 at 8:21 AM, Juki wrote: > > Hi Greg, > > > > My configuration is as below; > > > > For host definition; > > > > define host{ > > ??????? host_name?????????????? emm4 > > ??????? use???????????????????? generic-host > > ??????? alias?????????????????? Old BGW > > ??????? address???????????????? 10.151.28.32 > > ??????? check_command?????????? check-host-alive > > ??????? check_period??????????? 24x7 > > ??????? check_interval????????? 5 > > ??????? contact_groups????????? admins > > ??????? retry_interval????????? 1 > > ??????? max_check_attempts????? 10 > > ??????? notification_interval?? 120 > > ??????? notification_period???? 24x7 > > ??????? notification_options??? d,u,r > > ??????? } > > > > > > For service definitions; > > > > define service{ > > ??????? use???????????????????????????? generic-service > > > > ??????? host_name?????????????????????? emm4 > > ??????? service_description???????????? Users > > ??????? is_volatile???????????????????? 0 > > ??????? check_period??????????????????? 24x7 > > ??????? max_check_attempts????????????? 3 > > ??????? normal_check_interval?????????? 3 > > ??????? retry_check_interval??????????? 1 > > ??????? contact_groups????????????????? admins > > ??????? notification_interval?????????? 120 > > ??????? notification_period???????????? 24x7 > > ??????? notification_options??????????? w,u,c,r > > ??????? check_command?????????????????? check_users!5!10 > > ??????? } > > > > define service{ > > ??????? use???????????????????????????? generic-service > > > > ??????? host_name?????????????????????? emm4 > > ??????? service_description???????????? Load > > ??????? is_volatile???????????????????? 0 > > ??????? check_period??????????????????? 24x7 > > ??????? max_check_attempts????????????? 3 > > ??????? normal_check_interval?????????? 3 > > ??????? retry_check_interval??????????? 1 > > ??????? contact_groups????????????????? admins > > ??????? notification_interval?????????? 120 > > ??????? notification_period???????????? 24x7 > > ??????? notification_options??????????? w,u,c,r > > ??????? check_command > check_load!5.0,4.0,3.0!10.0,6.0,4.0 > > ??????? } > > > > define service{ > > ??????? use???????????????????????????? generic-service > > > > ??????? host_name?????????????????????? emm4 > > ??????? service_description???????????? Total Processes > > ??????? is_volatile???????????????????? 0 > > ??????? check_period??????????????????? 24x7 > > ??????? max_check_attempts????????????? 3 > > ??????? normal_check_interval?????????? 3 > > ??????? retry_check_interval??????????? 1 > > ??????? contact_groups????????????????? admins > > ??????? notification_interval?????????? 120 > > ??????? notification_period???????????? 24x7 > > ??????? notification_options??????????? w,u,c,r > > ??????? check_command?????????????????? check_procs!250!400!RSZDT > > ??????? } > > > > define service{ > > ??????? use???????????????????????????? generic-service > > > > ??????? host_name?????????????????????? emm4 > > ??????? service_description???????????? Disk Usage > > ??????? is_volatile???????????????????? 0 > > ??????? check_period??????????????????? 24x7 > > ??????? max_check_attempts????????????? 3 > > ??????? normal_check_interval?????????? 3 > > ??????? retry_check_interval??????????? 1 > > ??????? contact_groups????????????????? admins > > ??????? notification_interval?????????? 120 > > ??????? notification_period???????????? 24x7 > > ??????? notification_options??????????? w,u,c,r > > ??????? check_command?????????????????? check_disk!/ > > ??????? } > > > > > > And finally the command definitions; > > > > > > # 'check_users' command definition > > define command{ > > ??????? command_name??? check_users > > ??????? command_line??? $USER1$/check_users -w $ARG1$ -c $ARG2$ > > ??????? } > > > > # 'check_load' command definition > > define command{ > > ??????? command_name??? check_load > > ??????? command_line??? $USER1$/check_load -w $ARG1$ -c $ARG2$ > > ??????? } > > > > # 'check_procs' command definition > > define command{ > > ??????? command_name??? check_procs > > ??????? command_line??? $USER1$/check_procs -w $ARG1$ -c $ARG2$ -s > $ARG3$ > > ??????? } > > > > # 'check_disk' command definition > > define command{ > > ??????? command_name??? check_disk > > ??????? command_line??? $USER1$/check_disk -w $ARG1$ -c $ARG2$ -p > $ARG3$ > > ??????? } > > > > > > Regards, > > Juki > > > > > > > > 2009/12/2 Greg Pangrazio > >> > >> Without your config file I cannot be sure, but likely the services > are > >> either using the wrong command ie not the nrpe version, or your > check > >> command is not configured properly.. > >> > >> Can you provide your host and service definitions as well as the > >> command configuration for the commands you are using? > > > > > > > > > ----------------------------------------------------------------------- > ------- > Join us December 9, 2009 for the Red Hat Virtual Experience, > a free event focused on virtualization and cloud computing. > Attend in-depth sessions from your desk. Your couch. Anywhere. > http://p.sf.net/sfu/redhat-sfdev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From akpgeek at gmail.com Wed Dec 2 18:12:10 2009 From: akpgeek at gmail.com (akp geek) Date: Wed, 2 Dec 2009 12:12:10 -0500 Subject: Nagios email notifications In-Reply-To: References: <2024a9fb0912011317s220f744fya4e15c0df38990a7@mail.gmail.com> Message-ID: <2024a9fb0912020912w2c7fd951r36a8204b7215bf67@mail.gmail.com> Thank you.. It worked fine. I used the mailx instead of mail. If I use mail -s , subject is missing in the notification Regards On Tue, Dec 1, 2009 at 5:18 PM, Martin Melin wrote: > The command definition for your notification command is where you want to > look. It probably pipes a big echo into mail, if you add an -s switch to the > mail command you can set a subject. > > Regards, > Martin Melin > > On Tue, Dec 1, 2009 at 10:17 PM, akp geek wrote: > >> Dear All - >> >> I have installed nagios and it has been working fine. I am >> getting email notifications also. But I would like to edit those >> notifications and customize it. In the emails that I receive now, I don't >> get any subject in the email. Can you please point me / guide me? >> >> Regards >> >> >> ------------------------------------------------------------------------------ >> Join us December 9, 2009 for the Red Hat Virtual Experience, >> a free event focused on virtualization and cloud computing. >> Attend in-depth sessions from your desk. Your couch. Anywhere. >> http://p.sf.net/sfu/redhat-sfdev2dev >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when >> reporting any issue. >> ::: Messages without supporting info will risk being sent to /dev/null >> > > > > ------------------------------------------------------------------------------ > Join us December 9, 2009 for the Red Hat Virtual Experience, > a free event focused on virtualization and cloud computing. > Attend in-depth sessions from your desk. Your couch. Anywhere. > http://p.sf.net/sfu/redhat-sfdev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Terry.Chow at hk.fortisnl.com Thu Dec 3 09:38:57 2009 From: Terry.Chow at hk.fortisnl.com (Terry.Chow at hk.fortisnl.com) Date: Thu, 3 Dec 2009 16:38:57 +0800 Subject: Nagios with MRTG In-Reply-To: <2dfcbd1b0911261250g7a8bcc49r7f84a9080170d103@mail.gmail.com> References: <2dfcbd1b0911261250g7a8bcc49r7f84a9080170d103@mail.gmail.com> Message-ID: Hello, Thank you for your reply, but seems the check_mrtg plugin can get the MRTG data and show the value in the Nagios. However, it seems no way to use the MRTG data and create a graph like MRTG, and embedded into Nagios. That is when I click the monitored interface, it will show the MRTG graph instead of Nagios format. Thank you. Terry Chow System Engineer Fortis (Hong Kong) Limited 28/F Fortis Bank Tower 77-79 Gloucester Road Hong Kong Tel : +852-3653-0823 Email : terry.chow at hk.fortisnl.com From: webknowledge at gmail.com [mailto:webknowledge at gmail.com] On Behalf Of Marcel Sent: Friday, November 27, 2009 4:51 AM To: Chow Terry Cc: nagios-users at lists.sourceforge.net Subject: Re: [Nagios-users] Nagios with MRTG There is a check_mrtg plugin, if i recall correctly, you can check against values on the RRD. On Thu, Nov 26, 2009 at 7:07 AM, wrote: Dear all, I am currently using Nagios 3 in Solaris 10 X86 version, it is fine. Also I have install MRTG on the server and configure to collect one switch interface traffic for testing. Is it possible to integrate MRTG graph about the switch interface traffic into Nagios? So that I can use Nagios portal to view the switch uptime, or other alert and also view the bandwidth usage of the switch in MRTG format. Thank you. Terry Chow System Engineer Fortis (Hong Kong) Limited 28/F Fortis Bank Tower 77-79 Gloucester Road Hong Kong Tel : +852-3653-0823 Email : terry.chow at hk.fortisnl.com ******** This message (including any attachments ) is confidential and is intended solely for the use of the individual or entity to whom it is addressed. If you have received this message by mistake please notify the sender by return email and delete this message from your system. Any unauthorised use or dissemination of this message in whole or in part is strictly prohibited. ******** ------------------------------------------------------------------------ ------ Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day trial. Simplify your report design, integration and deployment - and focus on what you do best, core application coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ******** This message (including any attachments ) is confidential and is intended solely for the use of the individual or entity to whom it is addressed. If you have received this message by mistake please notify the sender by return email and delete this message from your system. Any unauthorised use or dissemination of this message in whole or in part is strictly prohibited. ******** -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Gerald.Ortner at gespag.at Thu Dec 3 10:37:50 2009 From: Gerald.Ortner at gespag.at (Ortner, Gerald) Date: Thu, 3 Dec 2009 10:37:50 +0100 Subject: WG: Nagios with MRTG Message-ID: <13579FFE8B208F4DBA327EE25F804AAB086B847C@swvbpheaglxmb02.health.local> Von: Ortner, Gerald Gesendet: Donnerstag, 03. Dezember 2009 10:37 An: 'Terry.Chow at hk.fortisnl.com' Betreff: AW: [Nagios-users] Nagios with MRTG http://www.pnp4nagios.org/ Von: Terry.Chow at hk.fortisnl.com [mailto:Terry.Chow at hk.fortisnl.com] Gesendet: Donnerstag, 03. Dezember 2009 09:39 An: mitsuto at gmail.com Cc: nagios-users at lists.sourceforge.net Betreff: Re: [Nagios-users] Nagios with MRTG Hello, Thank you for your reply, but seems the check_mrtg plugin can get the MRTG data and show the value in the Nagios. However, it seems no way to use the MRTG data and create a graph like MRTG, and embedded into Nagios. That is when I click the monitored interface, it will show the MRTG graph instead of Nagios format. Thank you. Terry Chow System Engineer Fortis (Hong Kong) Limited 28/F Fortis Bank Tower 77-79 Gloucester Road Hong Kong Tel : +852-3653-0823 Email : terry.chow at hk.fortisnl.com From: webknowledge at gmail.com [mailto:webknowledge at gmail.com] On Behalf Of Marcel Sent: Friday, November 27, 2009 4:51 AM To: Chow Terry Cc: nagios-users at lists.sourceforge.net Subject: Re: [Nagios-users] Nagios with MRTG There is a check_mrtg plugin, if i recall correctly, you can check against values on the RRD. On Thu, Nov 26, 2009 at 7:07 AM, > wrote: Dear all, I am currently using Nagios 3 in Solaris 10 X86 version, it is fine. Also I have install MRTG on the server and configure to collect one switch interface traffic for testing. Is it possible to integrate MRTG graph about the switch interface traffic into Nagios? So that I can use Nagios portal to view the switch uptime, or other alert and also view the bandwidth usage of the switch in MRTG format. Thank you. Terry Chow System Engineer Fortis (Hong Kong) Limited 28/F Fortis Bank Tower 77-79 Gloucester Road Hong Kong Tel : +852-3653-0823 Email : terry.chow at hk.fortisnl.com ******** This message (including any attachments ) is confidential and is intended solely for the use of the individual or entity to whom it is addressed. If you have received this message by mistake please notify the sender by return email and delete this message from your system. Any unauthorised use or dissemination of this message in whole or in part is strictly prohibited. ******** ------------------------------------------------------------------------------ Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day trial. Simplify your report design, integration and deployment - and focus on what you do best, core application coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ******** This message (including any attachments ) is confidential and is intended solely for the use of the individual or entity to whom it is addressed. If you have received this message by mistake please notify the sender by return email and delete this message from your system. Any unauthorised use or dissemination of this message in whole or in part is strictly prohibited. ******** -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From texner at eoipso.com Thu Dec 3 09:09:15 2009 From: texner at eoipso.com (Tobias Exner) Date: Thu, 03 Dec 2009 09:09:15 +0100 Subject: check_log multiple pattern and filtering Message-ID: <4B17722B.4030904@eoipso.com> Hi list, 1. any suggestions how to check multiple pattern with one command? example: check_log -F /var/adm/messages -O /var/adm/nagios_messages -q "error|connection timed out|SCSI transport failed" 2. Is there a way to ignore lines from the result? example: check_log -F /var/adm/messages -O /var/adm/nagios_messages -q "error" This will search for all errors. But what can I do to ignore lines with with spezial errors like Dec 3 07:26:23 SERVER rmclomv: [ID 431010 kern.error] CPU_FAN @ MB.P0.F0.RS has FAILED. regards, Tobias ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From juki.emma at gmail.com Thu Dec 3 15:03:11 2009 From: juki.emma at gmail.com (Juki) Date: Thu, 3 Dec 2009 17:03:11 +0300 Subject: Exact same swap + root / partition info showing up for different hosts(?) - Resolved! Message-ID: <7545d7d20912030603u1979b973q155d08b135bad9af@mail.gmail.com> Hello again, I followed the documentation that Greg advised and also the additions that Kevin suggested and everything worked out just fine using the check_nrpe plugin. A snippet of some definitions are; *Service Definition;* define service{ use generic-service host_name eapp03 service_description Users is_volatile 0 check_period 24x7 max_check_attempts 6 normal_check_interval 3 retry_check_interval 1 contact_groups admins notification_interval 120 notification_period 24x7 notification_options w,u,c,r check_command check_nrpe!check_users } define service{ use generic-service host_name eapp03 service_description Load is_volatile 0 check_period 24x7 max_check_attempts 6 normal_check_interval 3 retry_check_interval 1 contact_groups admins notification_interval 120 notification_period 24x7 notification_options w,u,c,r check_command check_nrpe!check_load *Command Definition;* # 'check_users' command definition define command{ command_name check_users command_line $USER1$/check_users -w $ARG1$ -c $ARG2$ } # 'check_load' command definition define command{ command_name check_load command_line $USER1$/check_load -w $ARG1$ -c $ARG2$ } *Host Definition;* # Application server host definition define host{ host_name eapp03 use generic-host alias Server 3 address xx.xx.xx.xx check_command check-host-alive check_period 24x7 check_interval 5 contact_groups admins retry_interval 1 max_check_attempts 10 notification_interval 120 notification_period 24x7 notification_options d,u,r } Thanks everyone for the help - much appreciated! Regards, Juki -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Thu Dec 3 16:17:28 2009 From: marc at ena.com (Marc Powell) Date: Thu, 3 Dec 2009 09:17:28 -0600 Subject: check_log multiple pattern and filtering In-Reply-To: <4B17722B.4030904@eoipso.com> References: <4B17722B.4030904@eoipso.com> Message-ID: <6B456F61-8890-4165-9AD5-054B31EFF018@ena.com> On Dec 3, 2009, at 2:09 AM, Tobias Exner wrote: > 1. > > any suggestions how to check multiple pattern with one command? > > > example: > > check_log -F /var/adm/messages -O /var/adm/nagios_messages -q > "error|connection timed out|SCSI transport failed" check_log uses egrep to search for the pattern. Your example will work. You should be able to test this -- $ ./check_log -F /var/log/messages -O /tmp/foo.log -q "notfound|winbind|this really works" (313) < Dec 3 08:59:57 noctools sshd[10928]: pam_winbind(sshd:account): request failed $ echo $? 2 Remember the check_log only parses lines seen _after_ each successive run. > 2. > > Is there a way to ignore lines from the result? > > example: > > check_log -F /var/adm/messages -O /var/adm/nagios_messages -q "error" > > This will search for all errors. > But what can I do to ignore lines with with spezial errors like Not with check_log but you can with check_log2.pl -- $ ./check_log2.pl --help check_log2.pl (nagios-plugins 1.4.3) 1.2 The nagios plugins come with ABSOLUTELY NO WARRANTY. You may redistribute copies of the plugins under the terms of the GNU General Public License. For more information about these matters, see the file named COPYING. Scan arbitrary log files for regular expression matches. Usage: check_log2.pl -l -s -p [-n ] -c | --critical Usage: check_log2.pl [ -v | --version ] Usage: check_log2.pl [ -h | --help ] -l, --logfile= The log file to be scanned -s, --seekfile= The temporary file to store the seek position of the last scan -p, --pattern= The regular expression to scan for in the log file -n, --negpattern= The regular expression to skip in the log file -- Marc ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From texner at eoipso.com Thu Dec 3 16:31:29 2009 From: texner at eoipso.com (Tobias Exner) Date: Thu, 03 Dec 2009 16:31:29 +0100 Subject: check_log multiple pattern and filtering In-Reply-To: <6B456F61-8890-4165-9AD5-054B31EFF018@ena.com> References: <4B17722B.4030904@eoipso.com> <6B456F61-8890-4165-9AD5-054B31EFF018@ena.com> Message-ID: <4B17D9D1.6000603@eoipso.com> Marc, thank you... point 1 is now clear for me.. point 2 I checked my plugins version. It's "1.4.13,REV=2009.04.26" from blastwave. Is there an other repository available or do I have to compile it all by my self? regards, Tobias Marc Powell schrieb: > On Dec 3, 2009, at 2:09 AM, Tobias Exner wrote: > > >> 1. >> >> any suggestions how to check multiple pattern with one command? >> >> >> example: >> >> check_log -F /var/adm/messages -O /var/adm/nagios_messages -q >> "error|connection timed out|SCSI transport failed" >> > > check_log uses egrep to search for the pattern. Your example will work. You should be able to test this -- > > $ ./check_log -F /var/log/messages -O /tmp/foo.log -q "notfound|winbind|this really works" > (313) < Dec 3 08:59:57 noctools sshd[10928]: pam_winbind(sshd:account): request failed > $ echo $? > 2 > > Remember the check_log only parses lines seen _after_ each successive run. > > >> 2. >> >> Is there a way to ignore lines from the result? >> >> example: >> >> check_log -F /var/adm/messages -O /var/adm/nagios_messages -q "error" >> >> This will search for all errors. >> But what can I do to ignore lines with with spezial errors like >> > > Not with check_log but you can with check_log2.pl -- > > $ ./check_log2.pl --help > check_log2.pl (nagios-plugins 1.4.3) 1.2 > The nagios plugins come with ABSOLUTELY NO WARRANTY. You may redistribute > copies of the plugins under the terms of the GNU General Public License. > For more information about these matters, see the file named COPYING. > > Scan arbitrary log files for regular expression matches. > > Usage: check_log2.pl -l -s -p [-n ] -c | --critical > Usage: check_log2.pl [ -v | --version ] > Usage: check_log2.pl [ -h | --help ] > > -l, --logfile= > The log file to be scanned > -s, --seekfile= > The temporary file to store the seek position of the last scan > -p, --pattern= > The regular expression to scan for in the log file > -n, --negpattern= > The regular expression to skip in the log file > > -- > Marc > > > > > ------------------------------------------------------------------------------ > Join us December 9, 2009 for the Red Hat Virtual Experience, > a free event focused on virtualization and cloud computing. > Attend in-depth sessions from your desk. Your couch. Anywhere. > http://p.sf.net/sfu/redhat-sfdev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Gerhard.Lausser at consol.de Thu Dec 3 17:42:56 2009 From: Gerhard.Lausser at consol.de (Gerhard Lausser) Date: Thu, 3 Dec 2009 17:42:56 +0100 Subject: check_log multiple pattern and filtering In-Reply-To: <4B17722B.4030904@eoipso.com> References: <4B17722B.4030904@eoipso.com> Message-ID: <9C62550D6CB24ED59BE7AE9A127442AE@int.consol.de> Hi, echo "Dec 3 07:26:23 SERVER rmclomv: [ID 431010 kern.error] POWER_FAN @MB.P0.F0.RS has FAILED." >> messages echo "Dec 3 07:26:23 SERVER rmclomv: [ID 431010 kern.error] CPU_FAN @MB.P0.F0.RS has FAILED." >> messages echo "Dec 3 07:26:23 SERVER rmclomv: [ID 431010 kern.error] CPU_FAN @MB.P0.F0.RS has FAILED." >> messages echo "Dec 3 07:26:23 SERVER rmclomv: [ID 431010 kern.error] CPU_FAN @MB.P0.F0.RS has FAILED." >> messages echo "Dec 3 07:26:23 SERVER rmclomv: [ID 431010 kern.error] DISK_FAN @MB.P0.F0.RS has FAILED." >> messages $ check_logfiles --tag miscerrors --logfile messages --criticalpattern "error|connection timed out|SCSI transport failed" --criticalexception "CPU_FAN @MB.P0.F0.RS has FAILED" --report long CRITICAL - (2 errors in check_logfiles.protocol-2009-12-03-17-34-46) - Dec 3 07:26:23 SERVER rmclomv: [ID 431010 kern.error] DISK_FAN @MB.P0.F0.RS has FAILED. ...|miscerrors_lines=5 miscerrors_warnings=0 miscerrors_criticals=2 miscerrors_unknowns=0 tag miscerrors CRITICAL Dec 3 07:26:23 SERVER rmclomv: [ID 431010 kern.error] POWER_FAN @MB.P0.F0.RS has FAILED. Dec 3 07:26:23 SERVER rmclomv: [ID 431010 kern.error] DISK_FAN @MB.P0.F0.RS has FAILED. with check_logfiles you can define special cases (criticalexceptions) which do not count even if they match in the first place. As you see from the performance data miscerrors_lines=5 5 lines of the messages-file were scanned. The last match is shown in the 1st line of the plugin's output. With the option --report long you get the complete list of all matched lines. (2 errors in check_logfiles.protocol-2009-12-03-17-34-46) means, the matched lines were also written in a protocol file for later analysis. This can be switched off with --noprotocol. You find the plugin at http://labs.consol.de/nagios/check_logfiles Cheers, Gerhard > -----Urspr?ngliche Nachricht----- > Von: Tobias Exner [mailto:texner at eoipso.com] > Gesendet: Donnerstag, 3. Dezember 2009 09:09 > An: Nagios-Users Mailinglist > Betreff: [Nagios-users] check_log multiple pattern and filtering > > Hi list, > > > > 1. > > any suggestions how to check multiple pattern with one command? > > > example: > > check_log -F /var/adm/messages -O /var/adm/nagios_messages -q > "error|connection timed out|SCSI transport failed" > > > > > 2. > > Is there a way to ignore lines from the result? > > example: > > check_log -F /var/adm/messages -O /var/adm/nagios_messages -q "error" > > This will search for all errors. > But what can I do to ignore lines with with spezial errors like > > Dec 3 07:26:23 SERVER rmclomv: [ID 431010 kern.error] CPU_FAN @ > MB.P0.F0.RS has FAILED. > > > > > regards, > > Tobias > > > -------------------------------------------------------------- > ---------------- > Join us December 9, 2009 for the Red Hat Virtual Experience, > a free event focused on virtualization and cloud computing. > Attend in-depth sessions from your desk. Your couch. Anywhere. > http://p.sf.net/sfu/redhat-sfdev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS > when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Sascha.Runschke at gfkl.com Thu Dec 3 17:32:06 2009 From: Sascha.Runschke at gfkl.com (Sascha.Runschke at gfkl.com) Date: Thu, 3 Dec 2009 17:32:06 +0100 Subject: Antwort: Hardware requirements In-Reply-To: <0DA7049604D4BD44839DFA90DBC7C1BB635194@LGBEXCHANGE01.unic.local> References: <0DA7049604D4BD44839DFA90DBC7C1BB635194@LGBEXCHANGE01.unic.local> Message-ID: "Tommy Mogensen" schrieb am 01.12.2009 16:14:39: > I am looking for a free system able to monitor 3000-5000 hosts (Mainly > cisco routers, switches and ap's) via ping and snmp. I would prefer to > run everything on one server if possible. > > I have received a good deal on a machine with 16G ram, 2xSSD-disks > (maybe in raid-0), and 2xIntel E5520-CPUs. I would appreciate your input > regarding the performance issues should I use Nagios for this system. > > Is this configuration powerful enough, what are the limiting parts of > the setup and are some of the pieces unnecessary? I.e. I could go for > one cpu (4 cores) if nagios does not support threading to 8 cores....or > is the main bottleneck that I do not run it distributed? It depends on how you want to implement the monitoring. First of all you need to break down the services on a per minute basis. The number of checks does not matter, but in which timeframe matters a lot. If you plan on running 5.000 checks with 5 minute interval - all is cool. If you are running 5.000 checks with 1 minute interval - you will need to tweak your nagios server a lot. If you are just using nagios for pure monitoring, then you have more then enough power. If you are looking for using ndo for visualisation with NagVis for example and/or graphing performance data with PNP4Nagios, then I/O is your biggest obstacle. Using SSDs for a DB is a two-fold sword - they runlike hell, but SSDs melt in hell ;) It really depends on the quality of the SSDs. If they are enterprise drives, they should be good to go for 1-2 years until they drop dead. If they are midline drives, you will burn them quite fast. The most important thing to have for a nagios installation with local NDO and/or PNP4Nagios togeter with massive services is a BBWC (battery backed write cache) of at least 512MB to aggregate the written blocks and put it on 100% write / 0% read cache. This speeds up NDO/PNP like tenfold. Read performance is neglectable for nagios, write performance is all that matters. I'm running 2.000+ service checks per minute on 5GB, 2x 2GHz QuadCore machine with 2 local 10K hdd's in RAID1 with a local mysql DB for NDO, performance graphing of around 1.200 services with PNP4Nagios and hosting NagVis for visualisation via mysql on the same server. Average load over 1 month is ~2.7 with peaks going up to load 8 aprox. Before the use of a BBWC I had an average load of 8 with peaks around 20 due to I/O retention. Regards Sascha GFKL Financial Services AG Vorstand: Dr. Peter J?nsch (Vors.), J?rgen Baltes, Dr. Tom Haverkamp Vorsitzender des Aufsichtsrats: Dr. Georg F. Thoma Sitz: Limbecker Platz 1, 45127 Essen, Amtsgericht Essen, HRB 13522 -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Thu Dec 3 18:21:10 2009 From: marc at ena.com (Marc Powell) Date: Thu, 3 Dec 2009 11:21:10 -0600 Subject: check_log multiple pattern and filtering In-Reply-To: <4B17D9D1.6000603@eoipso.com> References: <4B17722B.4030904@eoipso.com> <6B456F61-8890-4165-9AD5-054B31EFF018@ena.com> <4B17D9D1.6000603@eoipso.com> Message-ID: On Dec 3, 2009, at 9:31 AM, Tobias Exner wrote: > Marc, > > thank you... > > point 1 is now clear for me.. > > > point 2 > > I checked my plugins version. > It's "1.4.13,REV=2009.04.26" from blastwave. > Is there an other repository available or do I have to compile it all by my self? I don't use repositories and always compile from source myself. I have none that I can recommend. -- Marc ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gareth.beale at boeing.com Fri Dec 4 00:49:09 2009 From: gareth.beale at boeing.com (Gareth Beale) Date: Thu, 03 Dec 2009 15:49:09 -0800 Subject: Arbitrary alert processing Message-ID: <4B184E75.8040003@boeing.com> I'm looking at replacing a system that was used to monitor production systems. It performed two basic functions: 1. Monitor hosts (using ping) and alert the operations staff if a host went down. 2. Receive arbitrary text messages sent to it and display them, possibly with an audio alert, on the console. This was achieved by using remote copy to put a text file (with several delimited fields) in a specific directory on the monitoring system. A process checked the directory periodically and displayed any messages found there, after interpreting the fields in the text file. Nagios is a no-brainer for the first function, and although it wouldn't be hard to write a simple script to do part 2, I'd like to incorporate it into nagios. One of the reasons that I'm going to use nagios in the first place is that it is browser based. If the support staff get a call at home, they can display the nagios dashboard and see exactly what the operations staff are seeing. So I'd like the text alerts to be there too. I'm not aware of a plugin that provides this capability, nor have I seen popups for the nagios pages. I am aware of the Win-popups but I want the messaging incorporated into the nagios display. I know too that host comments can be added by external commands, but comments are not very visible or "loud". I hope that is clear. I can follow up if not. Any suggestions? ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Fri Dec 4 09:22:52 2009 From: patrick.morris at hp.com (Morris, Patrick) Date: Fri, 04 Dec 2009 00:22:52 -0800 Subject: Arbitrary alert processing In-Reply-To: <4B184E75.8040003@boeing.com> References: <4B184E75.8040003@boeing.com> Message-ID: <4B18C6DC.107@hp.com> Maybe this will work for you: http://nagios.sourceforge.net/docs/3_0/cgiincludes.html Gareth Beale wrote: > I'm looking at replacing a system that was used to monitor production > systems. > > It performed two basic functions: > > 1. Monitor hosts (using ping) and alert the operations staff if a host > went down. > 2. Receive arbitrary text messages sent to it and display them, possibly > with an audio alert, on the console. > This was achieved by using remote copy to put a text file (with > several delimited fields) in a specific directory on the monitoring > system. A process checked the directory periodically and displayed any > messages found there, after interpreting the fields in the text file. > > Nagios is a no-brainer for the first function, and although it wouldn't > be hard to write a simple script to do part 2, I'd like to incorporate > it into nagios. One of the reasons that I'm going to use nagios in the > first place is that it is browser based. If the support staff get a call > at home, they can display the nagios dashboard and see exactly what the > operations staff are seeing. So I'd like the text alerts to be there > too. I'm not aware of a plugin that provides this capability, nor have I > seen popups for the nagios pages. I am aware of the Win-popups but I > want the messaging incorporated into the nagios display. I know too that > host comments can be added by external commands, but comments are not > very visible or "loud". > > I hope that is clear. I can follow up if not. Any suggestions? > > > > ------------------------------------------------------------------------------ > Join us December 9, 2009 for the Red Hat Virtual Experience, > a free event focused on virtualization and cloud computing. > Attend in-depth sessions from your desk. Your couch. Anywhere. > http://p.sf.net/sfu/redhat-sfdev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gareth.beale at boeing.com Fri Dec 4 16:24:04 2009 From: gareth.beale at boeing.com (Gareth Beale) Date: Fri, 04 Dec 2009 07:24:04 -0800 Subject: Arbitrary alert processing In-Reply-To: <4B18C6DC.107@hp.com> References: <4B184E75.8040003@boeing.com> <4B18C6DC.107@hp.com> Message-ID: <4B192994.2050206@boeing.com> Looks promising, thanks. I will try it out. Gareth Morris, Patrick wrote: > Maybe this will work for you: > > http://nagios.sourceforge.net/docs/3_0/cgiincludes.html > > Gareth Beale wrote: >> I'm looking at replacing a system that was used to monitor production >> systems. >> >> It performed two basic functions: >> >> 1. Monitor hosts (using ping) and alert the operations staff if a >> host went down. >> 2. Receive arbitrary text messages sent to it and display them, >> possibly with an audio alert, on the console. >> This was achieved by using remote copy to put a text file (with >> several delimited fields) in a specific directory on the monitoring >> system. A process checked the directory periodically and displayed >> any messages found there, after interpreting the fields in the text >> file. >> >> Nagios is a no-brainer for the first function, and although it >> wouldn't be hard to write a simple script to do part 2, I'd like to >> incorporate it into nagios. One of the reasons that I'm going to use >> nagios in the first place is that it is browser based. If the support >> staff get a call at home, they can display the nagios dashboard and >> see exactly what the operations staff are seeing. So I'd like the >> text alerts to be there too. I'm not aware of a plugin that provides >> this capability, nor have I seen popups for the nagios pages. I am >> aware of the Win-popups but I want the messaging incorporated into >> the nagios display. I know too that host comments can be added by >> external commands, but comments are not very visible or "loud". >> >> I hope that is clear. I can follow up if not. Any suggestions? >> >> >> >> ------------------------------------------------------------------------------ >> >> Join us December 9, 2009 for the Red Hat Virtual Experience, >> a free event focused on virtualization and cloud computing. Attend >> in-depth sessions from your desk. Your couch. Anywhere. >> http://p.sf.net/sfu/redhat-sfdev2dev >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when >> reporting any issue. ::: Messages without supporting info will risk >> being sent to /dev/null >> > > ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Fri Dec 4 17:34:47 2009 From: marc at ena.com (Marc Powell) Date: Fri, 4 Dec 2009 10:34:47 -0600 Subject: Arbitrary alert processing In-Reply-To: <4B184E75.8040003@boeing.com> References: <4B184E75.8040003@boeing.com> Message-ID: <54D30E51-A0C7-4F52-AA5F-92D40EBD3B49@ena.com> On Dec 3, 2009, at 5:49 PM, Gareth Beale wrote: > I'm looking at replacing a system that was used to monitor production > systems. > > It performed two basic functions: > 2. Receive arbitrary text messages sent to it and display them, possibly > with an audio alert, on the console. > This was achieved by using remote copy to put a text file (with > several delimited fields) in a specific directory on the monitoring > system. A process checked the directory periodically and displayed any > messages found there, after interpreting the fields in the text file. > > Nagios is a no-brainer for the first function, and although it wouldn't > be hard to write a simple script to do part 2, I'd like to incorporate > it into nagios. You could treat these as a typical critical event. If you have a script that parses the file and determines that something is alertable, you're already mostly there. Create a new host/service definition; give it a generic name relevant to the type of alerts it will display; enable passive checks for it and set is_volatile. Have your script submit a passive CRITICAL or WARNING result to nagios based on it's alert logic. Each time this happens, it will show up as an event that must be acknowledged or reset to clear. -- Marc ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From fvanhee at gmail.com Fri Dec 4 19:10:19 2009 From: fvanhee at gmail.com (Vanhee Frederik) Date: Fri, 04 Dec 2009 19:10:19 +0100 Subject: Hardware requirements In-Reply-To: <0DA7049604D4BD44839DFA90DBC7C1BB635194@LGBEXCHANGE01.unic.local> References: <0DA7049604D4BD44839DFA90DBC7C1BB635194@LGBEXCHANGE01.unic.local> Message-ID: <4B19508B.30504@gmail.com> Hello, make sure you have fast disks. CPU and Ram looks ok. If you are using ndo, then there's much disk access for the database. Nagios still stores everything in text files, so there's much disk activity involved there too. I have put my disks like this : (2000 hosts, 12000 services) 2 disks in mirror for the OS 2 disks in mirror for the application (Nagios) 4 disks in raid 1+0 for the database (NDO) Maybe this is a little overkill, but since disks are quite cheap nowadays I did it like this. I use Nagios + NagiosGrapher + NDO + Nagvis for visualization. Regards, Frederik Tommy Mogensen schreef: > Hi Nagios-experts > > I am looking for a free system able to monitor 3000-5000 hosts (Mainly > cisco routers, switches and ap's) via ping and snmp. I would prefer to > run everything on one server if possible. > > I have received a good deal on a machine with 16G ram, 2xSSD-disks > (maybe in raid-0), and 2xIntel E5520-CPUs. I would appreciate your input > regarding the performance issues should I use Nagios for this system. > > Is this configuration powerful enough, what are the limiting parts of > the setup and are some of the pieces unnecessary? I.e. I could go for > one cpu (4 cores) if nagios does not support threading to 8 cores....or > is the main bottleneck that I do not run it distributed? > > Regards, > Tommy > > > ------------------------------------------------------------------------------ > Join us December 9, 2009 for the Red Hat Virtual Experience, > a free event focused on virtualization and cloud computing. > Attend in-depth sessions from your desk. Your couch. Anywhere. > http://p.sf.net/sfu/redhat-sfdev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From israel at frontierflying.com Fri Dec 4 22:29:25 2009 From: israel at frontierflying.com (Israel Brewster) Date: Fri, 4 Dec 2009 12:29:25 -0900 Subject: Nagios with MRTG In-Reply-To: References: <2dfcbd1b0911261250g7a8bcc49r7f84a9080170d103@mail.gmail.com> Message-ID: <65A7FFC1-A4C9-4181-AA98-0D4F185B6FCB@frontierflying.com> Probably the most straightforward way is to use the notes_url host directive to link to the MRTG generated graphs from nagios. Doesn't precisely "embed" the graphs in nagios, but at least you can access them from the nagios interface with just one click. For my installation, I've simply placed a link to the MRTG graphs in the nagios sidebar, but while perhaps more work, I would think using the notes_url (and optional notes_icon) directives would be a bit nicer. On Dec 2, 2009, at 11:38 PM, wrote: > Hello, > > Thank you for your reply, but seems the check_mrtg > plugin can get the MRTG data and show the value in the Nagios. > However, it seems no way to use the MRTG data and create a graph > like MRTG, and embedded into Nagios. > > That is when I click the monitored interface, it > will show the MRTG graph instead of Nagios format. > > Thank you. > > Terry Chow > System Engineer > Fortis (Hong Kong) Limited > 28/F Fortis Bank Tower > 77-79 Gloucester Road > Hong Kong > Tel : +852-3653-0823 > Email : terry.chow at hk.fortisnl.com > > From: webknowledge at gmail.com [mailto:webknowledge at gmail.com] On > Behalf Of Marcel > Sent: Friday, November 27, 2009 4:51 AM > To: Chow Terry > Cc: nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] Nagios with MRTG > > There is a check_mrtg plugin, if i recall correctly, you can check > against values on the RRD. > > On Thu, Nov 26, 2009 at 7:07 AM, wrote: > Dear all, > > I am currently using Nagios 3 in Solaris 10 X86 > version, it is fine. Also I have install MRTG on the server and > configure to collect one switch interface traffic for testing. > > Is it possible to integrate MRTG graph about the > switch interface traffic into Nagios? So that I can use Nagios > portal to view the switch uptime, or other alert and also view the > bandwidth usage of the switch in MRTG format. > > Thank you. > > Terry Chow > System Engineer > Fortis (Hong Kong) Limited > 28/F Fortis Bank Tower > 77-79 Gloucester Road > Hong Kong > Tel : +852-3653-0823 > Email : terry.chow at hk.fortisnl.com > > ******** > This message (including any attachments ) is confidential and is > intended solely for the use of the individual or entity to whom it > is addressed. If you have received this message by mistake please > notify the sender by return email and delete this message from your > system. Any unauthorised use or dissemination of this message in > whole or in part is strictly prohibited. > ******** > > ------------------------------------------------------------------------------ > Let Crystal Reports handle the reporting - Free Crystal Reports 2008 > 30-Day > trial. Simplify your report design, integration and deployment - and > focus on > what you do best, core application coding. Discover what's new with > Crystal Reports now. http://p.sf.net/sfu/bobj-july > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > ******** > This message (including any attachments ) is confidential and is > intended solely for the use of the individual or entity to whom it > is addressed. If you have received this message by mistake please > notify the sender by return email and delete this message from your > system. Any unauthorised use or dissemination of this message in > whole or in part is strictly prohibited. > ******** > ------------------------------------------------------------------------------ > Join us December 9, 2009 for the Red Hat Virtual Experience, > a free event focused on virtualization and cloud computing. > Attend in-depth sessions from your desk. Your couch. Anywhere. > http://p.sf.net/sfu/redhat-sfdev2dev_______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ----------------------------------------------- Israel Brewster Computer Support Technician II Frontier Flying Service Inc. 5245 Airport Industrial Rd Fairbanks, AK 99709 (907) 450-7250 x293 ----------------------------------------------- -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: Israel Brewster.vcf Type: text/directory Size: 417 bytes Desc: not available URL: -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From smahesh at alcatel-lucent.com Mon Dec 7 06:36:44 2009 From: smahesh at alcatel-lucent.com (MAHESH, SIDDACHETTY M (SIDDACHETTY M)) Date: Sun, 6 Dec 2009 23:36:44 -0600 Subject: need expert advice/suggestions Message-ID: <6440EB3D9B6A5646B7B2B52D1A25B270307AD3EF@USNAVSXCHMBSA2.ndc.alcatel-lucent.com> Hi list, I am trying out Nagios v3. I followed the documentation and it was easy getting the system up and running. Now that the base system is working as expected, I am trying to improve the configuration to avoid duplication of information using templates and clean up the configuration. So, I would like your feedback/suggestions on my approach. Let me explain the scenario. I have multiple hosts that can run a combination of applications. Each application can have one or more services associated with it. My idea is as follows: 1. Each application is mapped to a service group template (APP_xyz_SVC_GRP. The services in that application are associated with that application service group. define servicegroup { servicegroup_name APP_xyz_SVC_GRP register 0 } define service { name SSH use generic-service servicegroups APP_xyz_SVC_GRP service_description SSH check_command check_ssh register 0 } These application level templates do not change often. 2. Each host has a service group associated with it that is based on the actual applications installed on that host. The host.cfg for that host looks like define host { use linux-server host_name HOST_abc address 10.10.10.10 hostgroups HOSTGROUP_123 } define servicegroup { servicegroup_name HOST_abc_SVCS servicegroup_members APP_xyz_SVC_GRP, APP_aaa_SVC_GRP, APP_bbb_SVC_GRP } The list of hosts and applications are dynamic and can change over time. 1. The first problem I see is that the host.cfg requires the actual service definition - I cannot use the template service SSH directly. So, for each application service, I need to add the following lines to my host.cfg define service { use SSH host_name HOST_abc servicegroups HOST_abc_SVCS } Is there a solution that can avoid the above service redefinition? 2. servicegroup definition does not include a host_name or similar directive - that would have simplified adding all services in the servicegroup to a host. Is there a way to achieve this - this will resolve problem #1 above. Basically auto-instantiate (or register) the template services in a group with associated host. Note that both the hosts and applications running on the hosts are dynamic. The applications are fixed - they are part of a limited set. The eventual end goal is to auto-generate the nagios configuration. For this, I need the basic building blocks in place. In case someone has encountered this problem before and has a better solution/design, please let me know. Thanks, Mahesh ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From tdondich at lilacnetworks.com Mon Dec 7 07:03:03 2009 From: tdondich at lilacnetworks.com (Taylor Dondich) Date: Sun, 06 Dec 2009 22:03:03 -0800 Subject: need expert advice/suggestions In-Reply-To: <6440EB3D9B6A5646B7B2B52D1A25B270307AD3EF@USNAVSXCHMBSA2.ndc.alcatel-lucent.com> References: <6440EB3D9B6A5646B7B2B52D1A25B270307AD3EF@USNAVSXCHMBSA2.ndc.alcatel-lucent.com> Message-ID: <4B1C9A97.9030205@lilacnetworks.com> I hate to toot our own horn here in the Nagios Users mailing list (not the reason why I joined it). However, our configuration tool really does support what you are trying to do. Lilac Configurator has support for templates, but we take it one step further. Unlike Nagios, we support attaching services to host templates. So when you create a new host and have it inherit from a template, it brings in all services attached to that template. We do the same for escalations and depedencies. Something we feel Nagios should have, but doesn't. So our tool supports it then exports it to a configuration format that Nagios understands. Give it a whirl, I think it may help out your configuration in the way you desire. The url is at: http://www.lilacplatform.com -- Taylor Dondich (tdondich at lilacnetworks.com) CEO at Lilac Networks (http://www.lilacnetworks.com) Provider of quality support for open source monitoring solutions View our open source Nagios Configuration Tool at http://www.lilacplatform.com MAHESH, SIDDACHETTY M (SIDDACHETTY M) wrote: > Hi list, > > I am trying out Nagios v3. I followed the documentation and it was easy getting the system up and running. > > Now that the base system is working as expected, I am trying to improve the configuration to avoid duplication of information using templates and clean up the configuration. So, I would like your feedback/suggestions on my approach. > > Let me explain the scenario. I have multiple hosts that can run a combination of applications. Each application can have one or more services associated with it. > > My idea is as follows: > 1. Each application is mapped to a service group template (APP_xyz_SVC_GRP. The services in that application are associated with that application service group. > > define servicegroup { > servicegroup_name APP_xyz_SVC_GRP > register 0 > } > > define service { > name SSH > use generic-service > servicegroups APP_xyz_SVC_GRP > service_description SSH > check_command check_ssh > register 0 > } > > These application level templates do not change often. > > > 2. Each host has a service group associated with it that is based on the actual applications installed on that host. The host.cfg for that host looks like > > define host { > use linux-server > host_name HOST_abc > address 10.10.10.10 > hostgroups HOSTGROUP_123 > } > > define servicegroup { > servicegroup_name HOST_abc_SVCS > servicegroup_members APP_xyz_SVC_GRP, APP_aaa_SVC_GRP, APP_bbb_SVC_GRP > } > > > The list of hosts and applications are dynamic and can change over time. > > 1. The first problem I see is that the host.cfg requires the actual service definition - I cannot use the template service SSH directly. So, for each application service, I need to add the following lines to my host.cfg > > define service { > use SSH > host_name HOST_abc > servicegroups HOST_abc_SVCS > } > > Is there a solution that can avoid the above service redefinition? > > 2. servicegroup definition does not include a host_name or similar directive - that would have simplified adding all services in the servicegroup to a host. Is there a way to achieve this - this will resolve problem #1 above. Basically auto-instantiate (or register) the template services in a group with associated host. > > > Note that both the hosts and applications running on the hosts are dynamic. The applications are fixed - they are part of a limited set. The eventual end goal is to auto-generate the nagios configuration. For this, I need the basic building blocks in place. > > In case someone has encountered this problem before and has a better solution/design, please let me know. > > Thanks, > Mahesh > > > ------------------------------------------------------------------------------ > Join us December 9, 2009 for the Red Hat Virtual Experience, > a free event focused on virtualization and cloud computing. > Attend in-depth sessions from your desk. Your couch. Anywhere. > http://p.sf.net/sfu/redhat-sfdev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From durket at highwire.stanford.edu Mon Dec 7 16:32:43 2009 From: durket at highwire.stanford.edu (Michael Durket) Date: Mon, 7 Dec 2009 07:32:43 -0800 Subject: Scheduled downtime, availability report and dependencies Message-ID: Suppose I have 2 services, A and B. Further suppose that A depends on B. Now if I need to take B out of service for some maintenance, and schedule downtime in Nagios for B, will the availability report for A during the time that B's scheduled downtime occurs indicate that A's outage was scheduled? ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From doepain at gmail.com Mon Dec 7 18:13:01 2009 From: doepain at gmail.com (dOE) Date: Mon, 7 Dec 2009 12:13:01 -0500 Subject: Nagios monitor for VMware ESXi (free edition) Message-ID: I have created a monitor using the "check_esxwbem.py", but it returns an "OK" and *null* It is not doing what it is intended to do. The script pulls the hardware resources of the host server through WBEM. I know WBEM is working because I am able to pull this information from HP SIM, but I want Nagios to be my one stop shop for monitoring. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jpratt at norwich.edu Mon Dec 7 18:19:21 2009 From: jpratt at norwich.edu (James Pratt) Date: Mon, 7 Dec 2009 12:19:21 -0500 Subject: Nagios monitor for VMware ESXi (free edition) In-Reply-To: References: Message-ID: <24B6509E4191AF44B60A24EAA3B4AD49361A95@nuexchg.norwich.edu> >> -----Original Message----- >> From: dOE [mailto:doepain at gmail.com] >> Sent: Monday, December 07, 2009 12:13 PM >> To: Nagios User-List >> Subject: [Nagios-users] Nagios monitor for VMware ESXi (free edition) >> >> I have created a monitor using the "check_esxwbem.py", but it returns an "OK" and >> null It is not doing what it is intended to do. >> The script pulls the hardware resources of the host server through WBEM. I know >> WBEM is working because I am able to pull this information from HP SIM, but I want >> Nagios to be my one stop shop for monitoring. What exactly are you trying to monitor? Raid? I have some snmp stuff for using the HP agents on ESX, but I can't help without more info. Regards, jamie ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From doepain at gmail.com Mon Dec 7 18:25:57 2009 From: doepain at gmail.com (dOE) Date: Mon, 7 Dec 2009 12:25:57 -0500 Subject: Nagios monitor for VMware ESXi (free edition) In-Reply-To: <24B6509E4191AF44B60A24EAA3B4AD49361A95@nuexchg.norwich.edu> References: <24B6509E4191AF44B60A24EAA3B4AD49361A95@nuexchg.norwich.edu> Message-ID: >From the VMware Infrastructure client under "Health Check" you ca see vital hardware health statistics such as processor, memory, hard drives power supplies, and fans. The information is gathered by WBEM, and can be done with HP SIM too. The script is able to poll this information and return the values of all of the hardware under one monitor. That is good enough for me because if a memory module goes bad the monitor will go red and prompt us to investigate cause. I am trying to do this with ESX*i* which does not support SNMP, not ESX. On Mon, Dec 7, 2009 at 12:19 PM, James Pratt wrote: > > > >> -----Original Message----- > >> From: dOE [mailto:doepain at gmail.com] > >> Sent: Monday, December 07, 2009 12:13 PM > >> To: Nagios User-List > >> Subject: [Nagios-users] Nagios monitor for VMware ESXi (free edition) > >> > >> I have created a monitor using the "check_esxwbem.py", but it returns > an "OK" and > >> null It is not doing what it is intended to do. > >> The script pulls the hardware resources of the host server through > WBEM. I know > >> WBEM is working because I am able to pull this information from HP > SIM, but I want > >> Nagios to be my one stop shop for monitoring. > > What exactly are you trying to monitor? Raid? I have some snmp stuff for > using the HP agents on ESX, but I can't help without more info. > > Regards, > jamie > > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Join us December 9, 2009 for the Red Hat Virtual Experience, a free event focused on virtualization and cloud computing. Attend in-depth sessions from your desk. Your couch. Anywhere. http://p.sf.net/sfu/redhat-sfdev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Mon Dec 7 22:29:19 2009 From: patrick.morris at hp.com (patrick.morris at hp.com) Date: Mon, 7 Dec 2009 13:29:19 -0800 Subject: need expert advice/suggestions In-Reply-To: <4B1C9A97.9030205@lilacnetworks.com> References: <6440EB3D9B6A5646B7B2B52D1A25B270307AD3EF@USNAVSXCHMBSA2.ndc.alcatel-lucent.com> <4B1C9A97.9030205@lilacnetworks.com> Message-ID: <20091207212919.GO5494@bakgwai.americas.hpqcorp.net> Hi Taylor! On Mon, 07 Dec 2009, Taylor Dondich wrote: > I hate to toot our own horn here in the Nagios Users mailing list (not > the reason why I joined it). However, our configuration tool really > does support what you are trying to do. Lilac Configurator has support > for templates, but we take it one step further. Unlike Nagios, we > support attaching services to host templates. So when you create a new > host and have it inherit from a template, it brings in all services > attached to that template. We do the same for escalations and > depedencies. Something we feel Nagios should have, but doesn't. So our > tool supports it then exports it to a configuration format that Nagios > understands. Give it a whirl, I think it may help out your > configuration in the way you desire. For what it's worth, Nagios *does* support this. We routinely use templates which assign a hostgroup to a host, and that hostgroup will have a set of standard check for that type of host assigned to it. When a new host gets added, all it takes is a "use some_host_template" and all the standard services we run on that type of host just show up. ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From tdondich at lilacnetworks.com Mon Dec 7 23:02:53 2009 From: tdondich at lilacnetworks.com (Taylor Dondich) Date: Mon, 07 Dec 2009 14:02:53 -0800 Subject: need expert advice/suggestions In-Reply-To: <20091207212919.GO5494@bakgwai.americas.hpqcorp.net> References: <6440EB3D9B6A5646B7B2B52D1A25B270307AD3EF@USNAVSXCHMBSA2.ndc.alcatel-lucent.com> <4B1C9A97.9030205@lilacnetworks.com> <20091207212919.GO5494@bakgwai.americas.hpqcorp.net> Message-ID: <4B1D7B8D.3060203@lilacnetworks.com> Yes, I I consider that a workaround, as many people have hostgroups just for this purpose, where I see hostgroups as more of a grouping of devices for visibility sake. So both are solutions, but I think assigning services (checks) directly to a host template is a stronger solution, as it may not clutter up your management UI with potentially strange hostgroup names. Taylor patrick.morris at hp.com wrote: > Hi Taylor! > > On Mon, 07 Dec 2009, Taylor Dondich wrote: > > >> I hate to toot our own horn here in the Nagios Users mailing list (not >> the reason why I joined it). However, our configuration tool really >> does support what you are trying to do. Lilac Configurator has support >> for templates, but we take it one step further. Unlike Nagios, we >> support attaching services to host templates. So when you create a new >> host and have it inherit from a template, it brings in all services >> attached to that template. We do the same for escalations and >> depedencies. Something we feel Nagios should have, but doesn't. So our >> tool supports it then exports it to a configuration format that Nagios >> understands. Give it a whirl, I think it may help out your >> configuration in the way you desire. >> > > For what it's worth, Nagios *does* support this. We routinely use > templates which assign a hostgroup to a host, and that hostgroup will > have a set of standard check for that type of host assigned to it. When > a new host gets added, all it takes is a "use some_host_template" and > all the standard services we run on that type of host just show up. > -- Taylor Dondich (tdondich at lilacnetworks.com) CEO at Lilac Networks (http://www.lilacnetworks.com) Provider of quality support for open source monitoring solutions View our open source Nagios Configuration Tool at http://www.lilacplatform.com ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From cedric.jeanneret at camptocamp.com Tue Dec 8 11:40:00 2009 From: cedric.jeanneret at camptocamp.com (Cedric Jeanneret) Date: Tue, 8 Dec 2009 11:40:00 +0100 Subject: NSCA strange behaviour Message-ID: <20091208114000.2872e363@saya.wrk.lsn.camptocamp.com> Hello, I'm having troubles with NSCA. What we have : - about 47 passive hosts - about 220 passive services Versions : all are redhat servers, with: - NSCA 2.7.2 (latest one) - Nagios 3.1.2 We have a single "nagios aggregator", which collect all NSCA status from the other hosts. What's happening: a host was reinstalled yesterday (say client22), and now it seems NSCA daemon on the aggregator (say server01) doesn't seem to collect data. What I've done: - tcpdump on both client22 and server01, both show me traffic between them, on NSCA default port (5667) - checked iptables rules, all is ok (as tcpdump shows me traffic, that's a confirmation) - trying to push status by hand from client22 to server01; ALL packets are sent successfully """1 data packet(s) sent to host successfully.""". I've done this with a loop like that: for i in $(seq 1000); do /usr/local/bin/submit_ochp $(hostname -f) UP 'Host is up'; sleep 2; done - Enbling debug for nsca on server01 doesn't show me anything interesting. I just don't see where nsca catch up client22 status, and it keeps on saying : Warning: The results of host 'client22.domain.lt' are stale by 0d 0h 2m 0s (threshold=0d 0h 6m 0s). I'm forcing an immediate check of the host. On another hand, it shows me: [1260267958.216051] [016.1] [pid=23191] Check results for service 'Cron service' on host 'client22.domain.lt' are fresh. I really don't know where to find a solution, neither where is the real problem. We have another network with about 200 passive hosts and over 350 passive services, and it works fine. The only differences are : - the working network is debian-only - the working network's NSCA server doesn't do anything else than central nagios server. server01 does some other stuff, like syslog server and collectd server... maybe there's a bottleneck in there, but I can't be sure about that. Does anyone of you have an idea ? Thank you in advance. Best regards, C. -- C?dric Jeanneret | System Administrator 021 619 10 32 | Camptocamp SA cedric.jeanneret at camptocamp.com | PSE-A / EPFL -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 197 bytes Desc: not available URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gmartin at gmartin.org Tue Dec 8 13:36:47 2009 From: gmartin at gmartin.org (gmartin) Date: Tue, 8 Dec 2009 07:36:47 -0500 Subject: need expert advice/suggestions In-Reply-To: <20091207212919.GO5494@bakgwai.americas.hpqcorp.net> References: <6440EB3D9B6A5646B7B2B52D1A25B270307AD3EF@USNAVSXCHMBSA2.ndc.alcatel-lucent.com> <4B1C9A97.9030205@lilacnetworks.com> <20091207212919.GO5494@bakgwai.americas.hpqcorp.net> Message-ID: On Mon, Dec 7, 2009 at 4:29 PM, wrote: > Hi Taylor! > > For what it's worth, Nagios *does* support this. We routinely use > templates which assign a hostgroup to a host, and that hostgroup will > have a set of standard check for that type of host assigned to it. When > a new host gets added, all it takes is a "use some_host_template" and > all the standard services we run on that type of host just show up. > Patrick, can you explain this a bit further or point me towards another post that does the same. Sounds like an interesting feature I want to explore. Thanks -- \\Greg -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From pangrazi at gmail.com Tue Dec 8 15:08:03 2009 From: pangrazi at gmail.com (Greg Pangrazio) Date: Tue, 8 Dec 2009 08:08:03 -0600 Subject: NSCA strange behaviour In-Reply-To: <20091208114000.2872e363@saya.wrk.lsn.camptocamp.com> References: <20091208114000.2872e363@saya.wrk.lsn.camptocamp.com> Message-ID: Do all of your clients fail, or just the new one? Greg Pangrazio pangrazi at gmail.com On Tue, Dec 8, 2009 at 4:40 AM, Cedric Jeanneret wrote: > Hello, > > I'm having troubles with NSCA. > What we have : > > - about 47 passive hosts > - about 220 passive services > > Versions : all are redhat servers, with: > - NSCA 2.7.2 (latest one) > - Nagios 3.1.2 > > We have a single "nagios aggregator", which collect all NSCA status from the other hosts. > > What's happening: > a host was reinstalled yesterday (say client22), and now it seems NSCA daemon on the aggregator (say server01) doesn't seem to collect data. > > What I've done: > > - tcpdump on both client22 and server01, both show me traffic between them, on NSCA default port (5667) > > - checked iptables rules, all is ok (as tcpdump shows me traffic, that's a confirmation) > > - trying to push status by hand from client22 to server01; ALL packets are sent successfully """1 data packet(s) sent to host successfully.""". I've done this with a loop like that: > for i in $(seq 1000); do /usr/local/bin/submit_ochp $(hostname -f) UP 'Host is up'; sleep 2; done > > - Enbling debug for nsca on server01 doesn't show me anything interesting. I just don't see where nsca catch up client22 status, and it keeps on saying : > Warning: The results of host 'client22.domain.lt' are stale by 0d 0h 2m 0s (threshold=0d 0h 6m 0s). ?I'm forcing an immediate check of the host. > > On another hand, it shows me: > [1260267958.216051] [016.1] [pid=23191] Check results for service 'Cron service' on host 'client22.domain.lt' are fresh. > > > I really don't know where to find a solution, neither where is the real problem. We have another network with about 200 passive hosts and over 350 passive services, and it works fine. > > The only differences are : > - the working network is debian-only > - the working network's NSCA server doesn't do anything else than central nagios server. server01 does some other stuff, like syslog server and collectd server... maybe there's a bottleneck in there, but I can't be sure about that. > > Does anyone of you have an idea ? > > Thank you in advance. > > Best regards, > > C. > > > > -- > C?dric Jeanneret ? ? ? ? ? ? ? ? | ?System Administrator > 021 619 10 32 ? ? ? ? ? ? ? ? ? ?| ?Camptocamp SA > cedric.jeanneret at camptocamp.com ?| ?PSE-A / EPFL > > ------------------------------------------------------------------------------ > Return on Information: > Google Enterprise Search pays you back > Get the facts. > http://p.sf.net/sfu/google-dev2dev > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From cedric.jeanneret at camptocamp.com Tue Dec 8 15:14:25 2009 From: cedric.jeanneret at camptocamp.com (Cedric Jeanneret) Date: Tue, 8 Dec 2009 15:14:25 +0100 Subject: NSCA strange behaviour In-Reply-To: References: <20091208114000.2872e363@saya.wrk.lsn.camptocamp.com> Message-ID: <20091208151425.33af0e89@saya.wrk.lsn.camptocamp.com> Hello, As far as I can see, only new one. Even if they are just reinstalled (client22 was in nagios config before it was reinstalled....). Best regards, C. On Tue, 8 Dec 2009 08:08:03 -0600 Greg Pangrazio wrote: > Do all of your clients fail, or just the new one? > > Greg Pangrazio > pangrazi at gmail.com > > > > > On Tue, Dec 8, 2009 at 4:40 AM, Cedric Jeanneret > wrote: > > Hello, > > > > I'm having troubles with NSCA. > > What we have : > > > > - about 47 passive hosts > > - about 220 passive services > > > > Versions : all are redhat servers, with: > > - NSCA 2.7.2 (latest one) > > - Nagios 3.1.2 > > > > We have a single "nagios aggregator", which collect all NSCA status from the other hosts. > > > > What's happening: > > a host was reinstalled yesterday (say client22), and now it seems NSCA daemon on the aggregator (say server01) doesn't seem to collect data. > > > > What I've done: > > > > - tcpdump on both client22 and server01, both show me traffic between them, on NSCA default port (5667) > > > > - checked iptables rules, all is ok (as tcpdump shows me traffic, that's a confirmation) > > > > - trying to push status by hand from client22 to server01; ALL packets are sent successfully """1 data packet(s) sent to host successfully.""". I've done this with a loop like that: > > for i in $(seq 1000); do /usr/local/bin/submit_ochp $(hostname -f) UP 'Host is up'; sleep 2; done > > > > - Enbling debug for nsca on server01 doesn't show me anything interesting. I just don't see where nsca catch up client22 status, and it keeps on saying : > > Warning: The results of host 'client22.domain.lt' are stale by 0d 0h 2m 0s (threshold=0d 0h 6m 0s). ?I'm forcing an immediate check of the host. > > > > On another hand, it shows me: > > [1260267958.216051] [016.1] [pid=23191] Check results for service 'Cron service' on host 'client22.domain.lt' are fresh. > > > > > > I really don't know where to find a solution, neither where is the real problem. We have another network with about 200 passive hosts and over 350 passive services, and it works fine. > > > > The only differences are : > > - the working network is debian-only > > - the working network's NSCA server doesn't do anything else than central nagios server. server01 does some other stuff, like syslog server and collectd server... maybe there's a bottleneck in there, but I can't be sure about that. > > > > Does anyone of you have an idea ? > > > > Thank you in advance. > > > > Best regards, > > > > C. > > > > > > > > -- > > C?dric Jeanneret ? ? ? ? ? ? ? ? | ?System Administrator > > 021 619 10 32 ? ? ? ? ? ? ? ? ? ?| ?Camptocamp SA > > cedric.jeanneret at camptocamp.com ?| ?PSE-A / EPFL > > > > ------------------------------------------------------------------------------ > > Return on Information: > > Google Enterprise Search pays you back > > Get the facts. > > http://p.sf.net/sfu/google-dev2dev > > > > _______________________________________________ > > Nagios-users mailing list > > Nagios-users at lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > > ::: Messages without supporting info will risk being sent to /dev/null > > -- C?dric Jeanneret | System Administrator 021 619 10 32 | Camptocamp SA cedric.jeanneret at camptocamp.com | PSE-A / EPFL -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 197 bytes Desc: not available URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From pangrazi at gmail.com Tue Dec 8 15:31:59 2009 From: pangrazi at gmail.com (Greg Pangrazio) Date: Tue, 8 Dec 2009 08:31:59 -0600 Subject: NSCA strange behaviour In-Reply-To: <20091208151425.33af0e89@saya.wrk.lsn.camptocamp.com> References: <20091208114000.2872e363@saya.wrk.lsn.camptocamp.com> <20091208151425.33af0e89@saya.wrk.lsn.camptocamp.com> Message-ID: It sounds like there is something that changed with the re-install. Is the IP address of the system the same? Did you pick the same encryption type in the nsca config? Can you diff the nsca config with a working host? Greg Pangrazio pangrazi at gmail.com On Tue, Dec 8, 2009 at 8:14 AM, Cedric Jeanneret wrote: > Hello, > > As far as I can see, only new one. Even if they are just reinstalled (client22 was in nagios config before it was reinstalled....). > > Best regards, > > C. > > On Tue, 8 Dec 2009 08:08:03 -0600 > Greg Pangrazio wrote: > >> Do all of your clients fail, or just the new one? >> >> Greg Pangrazio >> pangrazi at gmail.com >> >> >> >> >> On Tue, Dec 8, 2009 at 4:40 AM, Cedric Jeanneret >> wrote: >> > Hello, >> > >> > I'm having troubles with NSCA. >> > What we have : >> > >> > - about 47 passive hosts >> > - about 220 passive services >> > >> > Versions : all are redhat servers, with: >> > - NSCA 2.7.2 (latest one) >> > - Nagios 3.1.2 >> > >> > We have a single "nagios aggregator", which collect all NSCA status from the other hosts. >> > >> > What's happening: >> > a host was reinstalled yesterday (say client22), and now it seems NSCA daemon on the aggregator (say server01) doesn't seem to collect data. >> > >> > What I've done: >> > >> > - tcpdump on both client22 and server01, both show me traffic between them, on NSCA default port (5667) >> > >> > - checked iptables rules, all is ok (as tcpdump shows me traffic, that's a confirmation) >> > >> > - trying to push status by hand from client22 to server01; ALL packets are sent successfully """1 data packet(s) sent to host successfully.""". I've done this with a loop like that: >> > for i in $(seq 1000); do /usr/local/bin/submit_ochp $(hostname -f) UP 'Host is up'; sleep 2; done >> > >> > - Enbling debug for nsca on server01 doesn't show me anything interesting. I just don't see where nsca catch up client22 status, and it keeps on saying : >> > Warning: The results of host 'client22.domain.lt' are stale by 0d 0h 2m 0s (threshold=0d 0h 6m 0s). ?I'm forcing an immediate check of the host. >> > >> > On another hand, it shows me: >> > [1260267958.216051] [016.1] [pid=23191] Check results for service 'Cron service' on host 'client22.domain.lt' are fresh. >> > >> > >> > I really don't know where to find a solution, neither where is the real problem. We have another network with about 200 passive hosts and over 350 passive services, and it works fine. >> > >> > The only differences are : >> > - the working network is debian-only >> > - the working network's NSCA server doesn't do anything else than central nagios server. server01 does some other stuff, like syslog server and collectd server... maybe there's a bottleneck in there, but I can't be sure about that. >> > >> > Does anyone of you have an idea ? >> > >> > Thank you in advance. >> > >> > Best regards, >> > >> > C. >> > >> > >> > >> > -- >> > C?dric Jeanneret ? ? ? ? ? ? ? ? | ?System Administrator >> > 021 619 10 32 ? ? ? ? ? ? ? ? ? ?| ?Camptocamp SA >> > cedric.jeanneret at camptocamp.com ?| ?PSE-A / EPFL >> > >> > ------------------------------------------------------------------------------ >> > Return on Information: >> > Google Enterprise Search pays you back >> > Get the facts. >> > http://p.sf.net/sfu/google-dev2dev >> > >> > _______________________________________________ >> > Nagios-users mailing list >> > Nagios-users at lists.sourceforge.net >> > https://lists.sourceforge.net/lists/listinfo/nagios-users >> > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. >> > ::: Messages without supporting info will risk being sent to /dev/null >> > > > > -- > C?dric Jeanneret ? ? ? ? ? ? ? ? | ?System Administrator > 021 619 10 32 ? ? ? ? ? ? ? ? ? ?| ?Camptocamp SA > cedric.jeanneret at camptocamp.com ?| ?PSE-A / EPFL > > ------------------------------------------------------------------------------ > Return on Information: > Google Enterprise Search pays you back > Get the facts. > http://p.sf.net/sfu/google-dev2dev > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lmw94002 at hotmail.com Tue Dec 8 16:57:54 2009 From: lmw94002 at hotmail.com (Mathew Walker) Date: Tue, 8 Dec 2009 10:57:54 -0500 Subject: need expert advice/suggestions In-Reply-To: References: <6440EB3D9B6A5646B7B2B52D1A25B270307AD3EF@USNAVSXCHMBSA2.ndc.alcatel-lucent.com>, <4B1C9A97.9030205@lilacnetworks.com>, <20091207212919.GO5494@bakgwai.americas.hpqcorp.net>, Message-ID: We make extensive use of hostgroups for templating. We define a "OS" hostgroup. A simple example would be "Windows" or "Linux" host groups where we monitor CPU/Load, Root/C disk space, and memory. The Windows group has EventLog & RDP checks, while Linux groups will monitor the SSH port. Then we use nested groups for something like "DB Servers" (dbservers.cfg) where we have the parent group defined and the subgroups of "MSSQL" and "MYSQL" hostgroups. Then each smaller group has a few common checks we run such as base TCP Ports and even simple test queries against the default databases. Ironically we had very view checks to the actual host.cfg files, but add most individual one-off "role" checks to the group file. -- Mat W. - http://www.techadre.com Date: Tue, 8 Dec 2009 07:36:47 -0500 From: gmartin at gmartin.org To: patrick.morris at hp.com; nagios-users at lists.sourceforge.net Subject: Re: [Nagios-users] need expert advice/suggestions On Mon, Dec 7, 2009 at 4:29 PM, wrote: Hi Taylor! For what it's worth, Nagios *does* support this. We routinely use templates which assign a hostgroup to a host, and that hostgroup will have a set of standard check for that type of host assigned to it. When a new host gets added, all it takes is a "use some_host_template" and all the standard services we run on that type of host just show up. Patrick, can you explain this a bit further or point me towards another post that does the same. Sounds like an interesting feature I want to explore. Thanks -- \\Greg _________________________________________________________________ Windows Live Hotmail gives you a free,exclusive gift. http://www.microsoft.com/windows/windowslive/hotmail_bl1/hotmail_bl1.aspx?ocid=PID23879::T:WLMTAGL:ON:WL:en-ww:WM_IMHM_7:092009 -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mitsuto at gmail.com Tue Dec 8 17:07:31 2009 From: mitsuto at gmail.com (Marcel) Date: Tue, 8 Dec 2009 14:07:31 -0200 Subject: need expert advice/suggestions In-Reply-To: References: <6440EB3D9B6A5646B7B2B52D1A25B270307AD3EF@USNAVSXCHMBSA2.ndc.alcatel-lucent.com> <4B1C9A97.9030205@lilacnetworks.com> <20091207212919.GO5494@bakgwai.americas.hpqcorp.net> Message-ID: <2dfcbd1b0912080807q671ff8d9ic6353541d77ebb43@mail.gmail.com> define hostgroup{ hostgroup_name some_name members some_host,another_one,!router } define service{ use some-generic-template hostgroup_name some_name check_command check_something (...) } define host{ use some-generic-template host_name some_name hostgroup some_name } On Tue, Dec 8, 2009 at 10:36 AM, gmartin wrote: > > > On Mon, Dec 7, 2009 at 4:29 PM, wrote: > >> Hi Taylor! >> >> For what it's worth, Nagios *does* support this. We routinely use >> templates which assign a hostgroup to a host, and that hostgroup will >> have a set of standard check for that type of host assigned to it. When >> a new host gets added, all it takes is a "use some_host_template" and >> all the standard services we run on that type of host just show up. >> > > Patrick, can you explain this a bit further or point me towards another > post that does the same. Sounds like an interesting feature I want to > explore. > > Thanks > > -- > \\Greg > > > > ------------------------------------------------------------------------------ > Return on Information: > Google Enterprise Search pays you back > Get the facts. > http://p.sf.net/sfu/google-dev2dev > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From cedric.jeanneret at camptocamp.com Tue Dec 8 17:13:37 2009 From: cedric.jeanneret at camptocamp.com (Cedric Jeanneret) Date: Tue, 8 Dec 2009 17:13:37 +0100 Subject: NSCA strange behaviour In-Reply-To: References: <20091208114000.2872e363@saya.wrk.lsn.camptocamp.com> <20091208151425.33af0e89@saya.wrk.lsn.camptocamp.com> Message-ID: <20091208171337.5f6459a0@saya.wrk.lsn.camptocamp.com> Hello again, In fact, configuration files are dployed via puppet(http://reductivelabs.com/trac/puppet/wiki), so all files (should be) are the same. I'll check it, but as puppet runs on every hosts, they all should have the same files. IP addresses are the same (fixed IP, fixed ports). I'll check files once again. If anyone has another idea... Thank you. Best regards, C. On Tue, 8 Dec 2009 08:31:59 -0600 Greg Pangrazio wrote: > It sounds like there is something that changed with the re-install. > > Is the IP address of the system the same? > > Did you pick the same encryption type in the nsca config? > > Can you diff the nsca config with a working host? > Greg Pangrazio > pangrazi at gmail.com > > > > > > On Tue, Dec 8, 2009 at 8:14 AM, Cedric Jeanneret > wrote: > > Hello, > > > > As far as I can see, only new one. Even if they are just reinstalled (client22 was in nagios config before it was reinstalled....). > > > > Best regards, > > > > C. > > > > On Tue, 8 Dec 2009 08:08:03 -0600 > > Greg Pangrazio wrote: > > > >> Do all of your clients fail, or just the new one? > >> > >> Greg Pangrazio > >> pangrazi at gmail.com > >> > >> > >> > >> > >> On Tue, Dec 8, 2009 at 4:40 AM, Cedric Jeanneret > >> wrote: > >> > Hello, > >> > > >> > I'm having troubles with NSCA. > >> > What we have : > >> > > >> > - about 47 passive hosts > >> > - about 220 passive services > >> > > >> > Versions : all are redhat servers, with: > >> > - NSCA 2.7.2 (latest one) > >> > - Nagios 3.1.2 > >> > > >> > We have a single "nagios aggregator", which collect all NSCA status from the other hosts. > >> > > >> > What's happening: > >> > a host was reinstalled yesterday (say client22), and now it seems NSCA daemon on the aggregator (say server01) doesn't seem to collect data. > >> > > >> > What I've done: > >> > > >> > - tcpdump on both client22 and server01, both show me traffic between them, on NSCA default port (5667) > >> > > >> > - checked iptables rules, all is ok (as tcpdump shows me traffic, that's a confirmation) > >> > > >> > - trying to push status by hand from client22 to server01; ALL packets are sent successfully """1 data packet(s) sent to host successfully.""". I've done this with a loop like that: > >> > for i in $(seq 1000); do /usr/local/bin/submit_ochp $(hostname -f) UP 'Host is up'; sleep 2; done > >> > > >> > - Enbling debug for nsca on server01 doesn't show me anything interesting. I just don't see where nsca catch up client22 status, and it keeps on saying : > >> > Warning: The results of host 'client22.domain.lt' are stale by 0d 0h 2m 0s (threshold=0d 0h 6m 0s). ?I'm forcing an immediate check of the host. > >> > > >> > On another hand, it shows me: > >> > [1260267958.216051] [016.1] [pid=23191] Check results for service 'Cron service' on host 'client22.domain.lt' are fresh. > >> > > >> > > >> > I really don't know where to find a solution, neither where is the real problem. We have another network with about 200 passive hosts and over 350 passive services, and it works fine. > >> > > >> > The only differences are : > >> > - the working network is debian-only > >> > - the working network's NSCA server doesn't do anything else than central nagios server. server01 does some other stuff, like syslog server and collectd server... maybe there's a bottleneck in there, but I can't be sure about that. > >> > > >> > Does anyone of you have an idea ? > >> > > >> > Thank you in advance. > >> > > >> > Best regards, > >> > > >> > C. > >> > > >> > > >> > > >> > -- > >> > C?dric Jeanneret ? ? ? ? ? ? ? ? | ?System Administrator > >> > 021 619 10 32 ? ? ? ? ? ? ? ? ? ?| ?Camptocamp SA > >> > cedric.jeanneret at camptocamp.com ?| ?PSE-A / EPFL > >> > > >> > ------------------------------------------------------------------------------ > >> > Return on Information: > >> > Google Enterprise Search pays you back > >> > Get the facts. > >> > http://p.sf.net/sfu/google-dev2dev > >> > > >> > _______________________________________________ > >> > Nagios-users mailing list > >> > Nagios-users at lists.sourceforge.net > >> > https://lists.sourceforge.net/lists/listinfo/nagios-users > >> > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > >> > ::: Messages without supporting info will risk being sent to /dev/null > >> > > > > > > > -- > > C?dric Jeanneret ? ? ? ? ? ? ? ? | ?System Administrator > > 021 619 10 32 ? ? ? ? ? ? ? ? ? ?| ?Camptocamp SA > > cedric.jeanneret at camptocamp.com ?| ?PSE-A / EPFL > > > > ------------------------------------------------------------------------------ > > Return on Information: > > Google Enterprise Search pays you back > > Get the facts. > > http://p.sf.net/sfu/google-dev2dev > > > > _______________________________________________ > > Nagios-users mailing list > > Nagios-users at lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > > ::: Messages without supporting info will risk being sent to /dev/null > > -- C?dric Jeanneret | System Administrator 021 619 10 32 | Camptocamp SA cedric.jeanneret at camptocamp.com | PSE-A / EPFL -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 197 bytes Desc: not available URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From cedric.jeanneret at camptocamp.com Tue Dec 8 17:29:13 2009 From: cedric.jeanneret at camptocamp.com (Cedric Jeanneret) Date: Tue, 8 Dec 2009 17:29:13 +0100 Subject: NSCA strange behaviour In-Reply-To: References: <20091208114000.2872e363@saya.wrk.lsn.camptocamp.com> <20091208151425.33af0e89@saya.wrk.lsn.camptocamp.com> Message-ID: <20091208172913.77d3fca6@saya.wrk.lsn.camptocamp.com> I just rsync-ed a complet config from a working host, then: for file in $(grep -lr working-client *); do sed -i 's/working-client/client22/g' $i done ... and it doesn't work any better. More over, puppet doesn't want to change anything... I'm stuck... :( On Tue, 8 Dec 2009 08:31:59 -0600 Greg Pangrazio wrote: > It sounds like there is something that changed with the re-install. > > Is the IP address of the system the same? > > Did you pick the same encryption type in the nsca config? > > Can you diff the nsca config with a working host? > Greg Pangrazio > pangrazi at gmail.com > > > > > > On Tue, Dec 8, 2009 at 8:14 AM, Cedric Jeanneret > wrote: > > Hello, > > > > As far as I can see, only new one. Even if they are just reinstalled (client22 was in nagios config before it was reinstalled....). > > > > Best regards, > > > > C. > > > > On Tue, 8 Dec 2009 08:08:03 -0600 > > Greg Pangrazio wrote: > > > >> Do all of your clients fail, or just the new one? > >> > >> Greg Pangrazio > >> pangrazi at gmail.com > >> > >> > >> > >> > >> On Tue, Dec 8, 2009 at 4:40 AM, Cedric Jeanneret > >> wrote: > >> > Hello, > >> > > >> > I'm having troubles with NSCA. > >> > What we have : > >> > > >> > - about 47 passive hosts > >> > - about 220 passive services > >> > > >> > Versions : all are redhat servers, with: > >> > - NSCA 2.7.2 (latest one) > >> > - Nagios 3.1.2 > >> > > >> > We have a single "nagios aggregator", which collect all NSCA status from the other hosts. > >> > > >> > What's happening: > >> > a host was reinstalled yesterday (say client22), and now it seems NSCA daemon on the aggregator (say server01) doesn't seem to collect data. > >> > > >> > What I've done: > >> > > >> > - tcpdump on both client22 and server01, both show me traffic between them, on NSCA default port (5667) > >> > > >> > - checked iptables rules, all is ok (as tcpdump shows me traffic, that's a confirmation) > >> > > >> > - trying to push status by hand from client22 to server01; ALL packets are sent successfully """1 data packet(s) sent to host successfully.""". I've done this with a loop like that: > >> > for i in $(seq 1000); do /usr/local/bin/submit_ochp $(hostname -f) UP 'Host is up'; sleep 2; done > >> > > >> > - Enbling debug for nsca on server01 doesn't show me anything interesting. I just don't see where nsca catch up client22 status, and it keeps on saying : > >> > Warning: The results of host 'client22.domain.lt' are stale by 0d 0h 2m 0s (threshold=0d 0h 6m 0s). ?I'm forcing an immediate check of the host. > >> > > >> > On another hand, it shows me: > >> > [1260267958.216051] [016.1] [pid=23191] Check results for service 'Cron service' on host 'client22.domain.lt' are fresh. > >> > > >> > > >> > I really don't know where to find a solution, neither where is the real problem. We have another network with about 200 passive hosts and over 350 passive services, and it works fine. > >> > > >> > The only differences are : > >> > - the working network is debian-only > >> > - the working network's NSCA server doesn't do anything else than central nagios server. server01 does some other stuff, like syslog server and collectd server... maybe there's a bottleneck in there, but I can't be sure about that. > >> > > >> > Does anyone of you have an idea ? > >> > > >> > Thank you in advance. > >> > > >> > Best regards, > >> > > >> > C. > >> > > >> > > >> > > >> > -- > >> > C?dric Jeanneret ? ? ? ? ? ? ? ? | ?System Administrator > >> > 021 619 10 32 ? ? ? ? ? ? ? ? ? ?| ?Camptocamp SA > >> > cedric.jeanneret at camptocamp.com ?| ?PSE-A / EPFL > >> > > >> > ------------------------------------------------------------------------------ > >> > Return on Information: > >> > Google Enterprise Search pays you back > >> > Get the facts. > >> > http://p.sf.net/sfu/google-dev2dev > >> > > >> > _______________________________________________ > >> > Nagios-users mailing list > >> > Nagios-users at lists.sourceforge.net > >> > https://lists.sourceforge.net/lists/listinfo/nagios-users > >> > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > >> > ::: Messages without supporting info will risk being sent to /dev/null > >> > > > > > > > -- > > C?dric Jeanneret ? ? ? ? ? ? ? ? | ?System Administrator > > 021 619 10 32 ? ? ? ? ? ? ? ? ? ?| ?Camptocamp SA > > cedric.jeanneret at camptocamp.com ?| ?PSE-A / EPFL > > > > ------------------------------------------------------------------------------ > > Return on Information: > > Google Enterprise Search pays you back > > Get the facts. > > http://p.sf.net/sfu/google-dev2dev > > > > _______________________________________________ > > Nagios-users mailing list > > Nagios-users at lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > > ::: Messages without supporting info will risk being sent to /dev/null > > -- C?dric Jeanneret | System Administrator 021 619 10 32 | Camptocamp SA cedric.jeanneret at camptocamp.com | PSE-A / EPFL -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 197 bytes Desc: not available URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mitsuto at gmail.com Tue Dec 8 17:34:43 2009 From: mitsuto at gmail.com (Marcel) Date: Tue, 8 Dec 2009 14:34:43 -0200 Subject: NSCA strange behaviour In-Reply-To: <20091208171337.5f6459a0@saya.wrk.lsn.camptocamp.com> References: <20091208114000.2872e363@saya.wrk.lsn.camptocamp.com> <20091208151425.33af0e89@saya.wrk.lsn.camptocamp.com> <20091208171337.5f6459a0@saya.wrk.lsn.camptocamp.com> Message-ID: <2dfcbd1b0912080834n19c491f5h3111ff9c651068e4@mail.gmail.com> check openssl versions and compatibility. On Tue, Dec 8, 2009 at 2:13 PM, Cedric Jeanneret < cedric.jeanneret at camptocamp.com> wrote: > Hello again, > > In fact, configuration files are dployed via puppet( > http://reductivelabs.com/trac/puppet/wiki), so all files (should be) are > the same. I'll check it, but as puppet runs on every hosts, they all should > have the same files. > IP addresses are the same (fixed IP, fixed ports). > > I'll check files once again. > > If anyone has another idea... > > Thank you. > > Best regards, > > C. > > On Tue, 8 Dec 2009 08:31:59 -0600 > Greg Pangrazio wrote: > > > It sounds like there is something that changed with the re-install. > > > > Is the IP address of the system the same? > > > > Did you pick the same encryption type in the nsca config? > > > > Can you diff the nsca config with a working host? > > Greg Pangrazio > > pangrazi at gmail.com > > > > > > > > > > > > On Tue, Dec 8, 2009 at 8:14 AM, Cedric Jeanneret > > wrote: > > > Hello, > > > > > > As far as I can see, only new one. Even if they are just reinstalled > (client22 was in nagios config before it was reinstalled....). > > > > > > Best regards, > > > > > > C. > > > > > > On Tue, 8 Dec 2009 08:08:03 -0600 > > > Greg Pangrazio wrote: > > > > > >> Do all of your clients fail, or just the new one? > > >> > > >> Greg Pangrazio > > >> pangrazi at gmail.com > > >> > > >> > > >> > > >> > > >> On Tue, Dec 8, 2009 at 4:40 AM, Cedric Jeanneret > > >> wrote: > > >> > Hello, > > >> > > > >> > I'm having troubles with NSCA. > > >> > What we have : > > >> > > > >> > - about 47 passive hosts > > >> > - about 220 passive services > > >> > > > >> > Versions : all are redhat servers, with: > > >> > - NSCA 2.7.2 (latest one) > > >> > - Nagios 3.1.2 > > >> > > > >> > We have a single "nagios aggregator", which collect all NSCA status > from the other hosts. > > >> > > > >> > What's happening: > > >> > a host was reinstalled yesterday (say client22), and now it seems > NSCA daemon on the aggregator (say server01) doesn't seem to collect data. > > >> > > > >> > What I've done: > > >> > > > >> > - tcpdump on both client22 and server01, both show me traffic > between them, on NSCA default port (5667) > > >> > > > >> > - checked iptables rules, all is ok (as tcpdump shows me traffic, > that's a confirmation) > > >> > > > >> > - trying to push status by hand from client22 to server01; ALL > packets are sent successfully """1 data packet(s) sent to host > successfully.""". I've done this with a loop like that: > > >> > for i in $(seq 1000); do /usr/local/bin/submit_ochp $(hostname -f) > UP 'Host is up'; sleep 2; done > > >> > > > >> > - Enbling debug for nsca on server01 doesn't show me anything > interesting. I just don't see where nsca catch up client22 status, and it > keeps on saying : > > >> > Warning: The results of host 'client22.domain.lt' are stale by 0d > 0h 2m 0s (threshold=0d 0h 6m 0s). I'm forcing an immediate check of the > host. > > >> > > > >> > On another hand, it shows me: > > >> > [1260267958.216051] [016.1] [pid=23191] Check results for service > 'Cron service' on host 'client22.domain.lt' are fresh. > > >> > > > >> > > > >> > I really don't know where to find a solution, neither where is the > real problem. We have another network with about 200 passive hosts and over > 350 passive services, and it works fine. > > >> > > > >> > The only differences are : > > >> > - the working network is debian-only > > >> > - the working network's NSCA server doesn't do anything else than > central nagios server. server01 does some other stuff, like syslog server > and collectd server... maybe there's a bottleneck in there, but I can't be > sure about that. > > >> > > > >> > Does anyone of you have an idea ? > > >> > > > >> > Thank you in advance. > > >> > > > >> > Best regards, > > >> > > > >> > C. > > >> > > > >> > > > >> > > > >> > -- > > >> > C?dric Jeanneret | System Administrator > > >> > 021 619 10 32 | Camptocamp SA > > >> > cedric.jeanneret at camptocamp.com | PSE-A / EPFL > > >> > > > >> > > ------------------------------------------------------------------------------ > > >> > Return on Information: > > >> > Google Enterprise Search pays you back > > >> > Get the facts. > > >> > http://p.sf.net/sfu/google-dev2dev > > >> > > > >> > _______________________________________________ > > >> > Nagios-users mailing list > > >> > Nagios-users at lists.sourceforge.net > > >> > https://lists.sourceforge.net/lists/listinfo/nagios-users > > >> > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > > >> > ::: Messages without supporting info will risk being sent to > /dev/null > > >> > > > > > > > > > > -- > > > C?dric Jeanneret | System Administrator > > > 021 619 10 32 | Camptocamp SA > > > cedric.jeanneret at camptocamp.com | PSE-A / EPFL > > > > > > > ------------------------------------------------------------------------------ > > > Return on Information: > > > Google Enterprise Search pays you back > > > Get the facts. > > > http://p.sf.net/sfu/google-dev2dev > > > > > > _______________________________________________ > > > Nagios-users mailing list > > > Nagios-users at lists.sourceforge.net > > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > > > ::: Messages without supporting info will risk being sent to /dev/null > > > > > > -- > C?dric Jeanneret | System Administrator > 021 619 10 32 | Camptocamp SA > cedric.jeanneret at camptocamp.com | PSE-A / EPFL > > > ------------------------------------------------------------------------------ > Return on Information: > Google Enterprise Search pays you back > Get the facts. > http://p.sf.net/sfu/google-dev2dev > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From cedric.jeanneret at camptocamp.com Tue Dec 8 17:39:18 2009 From: cedric.jeanneret at camptocamp.com (Cedric Jeanneret) Date: Tue, 8 Dec 2009 17:39:18 +0100 Subject: NSCA strange behaviour In-Reply-To: <2dfcbd1b0912080834n19c491f5h3111ff9c651068e4@mail.gmail.com> References: <20091208114000.2872e363@saya.wrk.lsn.camptocamp.com> <20091208151425.33af0e89@saya.wrk.lsn.camptocamp.com> <20091208171337.5f6459a0@saya.wrk.lsn.camptocamp.com> <2dfcbd1b0912080834n19c491f5h3111ff9c651068e4@mail.gmail.com> Message-ID: <20091208173918.63a5c63c@saya.wrk.lsn.camptocamp.com> Hello Marcel, well, same on server01 and client22... working-server has an earlier one, but it works on this. NSCA doesn't show me any encryption error (encryption method and passphrase are correct on both ends) :/ Regards, C. On Tue, 8 Dec 2009 14:34:43 -0200 Marcel wrote: > check openssl versions and compatibility. > > On Tue, Dec 8, 2009 at 2:13 PM, Cedric Jeanneret < > cedric.jeanneret at camptocamp.com> wrote: > > > Hello again, > > > > In fact, configuration files are dployed via puppet( > > http://reductivelabs.com/trac/puppet/wiki), so all files (should be) are > > the same. I'll check it, but as puppet runs on every hosts, they all should > > have the same files. > > IP addresses are the same (fixed IP, fixed ports). > > > > I'll check files once again. > > > > If anyone has another idea... > > > > Thank you. > > > > Best regards, > > > > C. > > > > On Tue, 8 Dec 2009 08:31:59 -0600 > > Greg Pangrazio wrote: > > > > > It sounds like there is something that changed with the re-install. > > > > > > Is the IP address of the system the same? > > > > > > Did you pick the same encryption type in the nsca config? > > > > > > Can you diff the nsca config with a working host? > > > Greg Pangrazio > > > pangrazi at gmail.com > > > > > > > > > > > > > > > > > > On Tue, Dec 8, 2009 at 8:14 AM, Cedric Jeanneret > > > wrote: > > > > Hello, > > > > > > > > As far as I can see, only new one. Even if they are just reinstalled > > (client22 was in nagios config before it was reinstalled....). > > > > > > > > Best regards, > > > > > > > > C. > > > > > > > > On Tue, 8 Dec 2009 08:08:03 -0600 > > > > Greg Pangrazio wrote: > > > > > > > >> Do all of your clients fail, or just the new one? > > > >> > > > >> Greg Pangrazio > > > >> pangrazi at gmail.com > > > >> > > > >> > > > >> > > > >> > > > >> On Tue, Dec 8, 2009 at 4:40 AM, Cedric Jeanneret > > > >> wrote: > > > >> > Hello, > > > >> > > > > >> > I'm having troubles with NSCA. > > > >> > What we have : > > > >> > > > > >> > - about 47 passive hosts > > > >> > - about 220 passive services > > > >> > > > > >> > Versions : all are redhat servers, with: > > > >> > - NSCA 2.7.2 (latest one) > > > >> > - Nagios 3.1.2 > > > >> > > > > >> > We have a single "nagios aggregator", which collect all NSCA status > > from the other hosts. > > > >> > > > > >> > What's happening: > > > >> > a host was reinstalled yesterday (say client22), and now it seems > > NSCA daemon on the aggregator (say server01) doesn't seem to collect data. > > > >> > > > > >> > What I've done: > > > >> > > > > >> > - tcpdump on both client22 and server01, both show me traffic > > between them, on NSCA default port (5667) > > > >> > > > > >> > - checked iptables rules, all is ok (as tcpdump shows me traffic, > > that's a confirmation) > > > >> > > > > >> > - trying to push status by hand from client22 to server01; ALL > > packets are sent successfully """1 data packet(s) sent to host > > successfully.""". I've done this with a loop like that: > > > >> > for i in $(seq 1000); do /usr/local/bin/submit_ochp $(hostname -f) > > UP 'Host is up'; sleep 2; done > > > >> > > > > >> > - Enbling debug for nsca on server01 doesn't show me anything > > interesting. I just don't see where nsca catch up client22 status, and it > > keeps on saying : > > > >> > Warning: The results of host 'client22.domain.lt' are stale by 0d > > 0h 2m 0s (threshold=0d 0h 6m 0s). I'm forcing an immediate check of the > > host. > > > >> > > > > >> > On another hand, it shows me: > > > >> > [1260267958.216051] [016.1] [pid=23191] Check results for service > > 'Cron service' on host 'client22.domain.lt' are fresh. > > > >> > > > > >> > > > > >> > I really don't know where to find a solution, neither where is the > > real problem. We have another network with about 200 passive hosts and over > > 350 passive services, and it works fine. > > > >> > > > > >> > The only differences are : > > > >> > - the working network is debian-only > > > >> > - the working network's NSCA server doesn't do anything else than > > central nagios server. server01 does some other stuff, like syslog server > > and collectd server... maybe there's a bottleneck in there, but I can't be > > sure about that. > > > >> > > > > >> > Does anyone of you have an idea ? > > > >> > > > > >> > Thank you in advance. > > > >> > > > > >> > Best regards, > > > >> > > > > >> > C. > > > >> > > > > >> > > > > >> > > > > >> > -- > > > >> > C?dric Jeanneret | System Administrator > > > >> > 021 619 10 32 | Camptocamp SA > > > >> > cedric.jeanneret at camptocamp.com | PSE-A / EPFL > > > >> > > > > >> > > > ------------------------------------------------------------------------------ > > > >> > Return on Information: > > > >> > Google Enterprise Search pays you back > > > >> > Get the facts. > > > >> > http://p.sf.net/sfu/google-dev2dev > > > >> > > > > >> > _______________________________________________ > > > >> > Nagios-users mailing list > > > >> > Nagios-users at lists.sourceforge.net > > > >> > https://lists.sourceforge.net/lists/listinfo/nagios-users > > > >> > ::: Please include Nagios version, plugin version (-v) and OS when > > reporting any issue. > > > >> > ::: Messages without supporting info will risk being sent to > > /dev/null > > > >> > > > > > > > > > > > > > -- > > > > C?dric Jeanneret | System Administrator > > > > 021 619 10 32 | Camptocamp SA > > > > cedric.jeanneret at camptocamp.com | PSE-A / EPFL > > > > > > > > > > ------------------------------------------------------------------------------ > > > > Return on Information: > > > > Google Enterprise Search pays you back > > > > Get the facts. > > > > http://p.sf.net/sfu/google-dev2dev > > > > > > > > _______________________________________________ > > > > Nagios-users mailing list > > > > Nagios-users at lists.sourceforge.net > > > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > > > ::: Please include Nagios version, plugin version (-v) and OS when > > reporting any issue. > > > > ::: Messages without supporting info will risk being sent to /dev/null > > > > > > > > > > -- > > C?dric Jeanneret | System Administrator > > 021 619 10 32 | Camptocamp SA > > cedric.jeanneret at camptocamp.com | PSE-A / EPFL > > > > > > ------------------------------------------------------------------------------ > > Return on Information: > > Google Enterprise Search pays you back > > Get the facts. > > http://p.sf.net/sfu/google-dev2dev > > > > _______________________________________________ > > Nagios-users mailing list > > Nagios-users at lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when > > reporting any issue. > > ::: Messages without supporting info will risk being sent to /dev/null > > -- C?dric Jeanneret | System Administrator 021 619 10 32 | Camptocamp SA cedric.jeanneret at camptocamp.com | PSE-A / EPFL -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 197 bytes Desc: not available URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Tue Dec 8 17:52:50 2009 From: marc at ena.com (Marc Powell) Date: Tue, 8 Dec 2009 10:52:50 -0600 Subject: NSCA strange behaviour In-Reply-To: <20091208114000.2872e363@saya.wrk.lsn.camptocamp.com> References: <20091208114000.2872e363@saya.wrk.lsn.camptocamp.com> Message-ID: On Dec 8, 2009, at 4:40 AM, Cedric Jeanneret wrote: > - Enbling debug for nsca on server01 doesn't show me anything interesting. I just don't see where nsca catch up client22 status, and it keeps on saying : > Warning: The results of host 'client22.domain.lt' are stale by 0d 0h 2m 0s (threshold=0d 0h 6m 0s). I'm forcing an immediate check of the host. > > On another hand, it shows me: > [1260267958.216051] [016.1] [pid=23191] Check results for service 'Cron service' on host 'client22.domain.lt' are fresh. What do they show? What you've quoted here doesn't come from NSCA (that I can tell from a simple grep). -- nsca-2.7.2]# grep -r 'are fresh' * nsca-2.7.2]# I'm sure that comes from nagios via nagios.log, not NSCA. Are you sure you're looking in the right place? It's typically in /var/log/messages. How are you running nsca? daemon mode or via inetd? If inetd, is inetd rejecting the connection? -- Marc ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rperezm at uci.cu Tue Dec 8 18:13:37 2009 From: rperezm at uci.cu (ReynierPM) Date: Tue, 08 Dec 2009 12:13:37 -0500 Subject: check_ssh and check_ping not working with NRPE on remote host Message-ID: <4B1E8941.5000003@uci.cu> Hi every: I've experimenting some problems when trying to use check_ssh and check_ping plugins. The configuration at remote server (where NRPE resides) is as follow: command[check_ssh]=/usr/local/nagios/libexec/check_ssh command[check_ping]=/usr/local/nagios/libexec/check_ping!100.0,20%!500.0,60% And then at server where .cfg resides the config is this one: define service{ use generic-service host_name bacula-server service_description SSH check_command check_nrpe!check_ssh notifications_enabled 0 } define service{ use generic-service host_name bacula-server service_description PING check_command check_nrpe!check_ping } The error for check_ping command is this: NRPE: Unable to read output and for check_ssh is this: check_ssh: Could not parse arguments What I'm doing wrong or what I'm not doing? -- Cheers and thanks in advance ReynierPM ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From cedric.jeanneret at camptocamp.com Tue Dec 8 18:19:02 2009 From: cedric.jeanneret at camptocamp.com (Cedric Jeanneret) Date: Tue, 8 Dec 2009 18:19:02 +0100 Subject: NSCA strange behaviour In-Reply-To: References: <20091208114000.2872e363@saya.wrk.lsn.camptocamp.com> Message-ID: <20091208181902.344ab911@saya.wrk.lsn.camptocamp.com> Hello Marc, Indeed, the "are fresh" comes from nagios.log. ok, done. NSCA is running in daemon mode, iptables is opened for nsca port, and connections can go through it (tcpdump shows it to me, in both directions). Setting "debug=1" in nsca.cfg seems to do nothing more in /var/log/messages (redhat server). I just see "down" hosts passing through (results for ... are stalled - forcing immedia check...). I set up debug for nagios itself, but it really seems to be a problem at NSCA level. Regards, C. On Tue, 8 Dec 2009 10:52:50 -0600 Marc Powell wrote: > > On Dec 8, 2009, at 4:40 AM, Cedric Jeanneret wrote: > > > - Enbling debug for nsca on server01 doesn't show me anything interesting. I just don't see where nsca catch up client22 status, and it keeps on saying : > > Warning: The results of host 'client22.domain.lt' are stale by 0d 0h 2m 0s (threshold=0d 0h 6m 0s). I'm forcing an immediate check of the host. > > > > On another hand, it shows me: > > [1260267958.216051] [016.1] [pid=23191] Check results for service 'Cron service' on host 'client22.domain.lt' are fresh. > > What do they show? What you've quoted here doesn't come from NSCA (that I can tell from a simple grep). -- > > nsca-2.7.2]# grep -r 'are fresh' * > nsca-2.7.2]# > > I'm sure that comes from nagios via nagios.log, not NSCA. Are you sure you're looking in the right place? It's typically in /var/log/messages. > > How are you running nsca? daemon mode or via inetd? If inetd, is inetd rejecting the connection? > > -- > Marc > > > ------------------------------------------------------------------------------ > Return on Information: > Google Enterprise Search pays you back > Get the facts. > http://p.sf.net/sfu/google-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null -- C?dric Jeanneret | System Administrator 021 619 10 32 | Camptocamp SA cedric.jeanneret at camptocamp.com | PSE-A / EPFL -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 197 bytes Desc: not available URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Tue Dec 8 18:23:14 2009 From: patrick.morris at hp.com (Morris, Patrick) Date: Tue, 08 Dec 2009 09:23:14 -0800 Subject: check_ssh and check_ping not working with NRPE on remote host In-Reply-To: <4B1E8941.5000003@uci.cu> References: <4B1E8941.5000003@uci.cu> Message-ID: <4B1E8B82.8070001@hp.com> ReynierPM wrote: > Hi every: > I've experimenting some problems when trying to use check_ssh and > check_ping plugins. The configuration at remote server (where NRPE > resides) is as follow: > > command[check_ssh]=/usr/local/nagios/libexec/check_ssh > command[check_ping]=/usr/local/nagios/libexec/check_ping!100.0,20%!500.0,60% > > And then at server where .cfg resides the config is this one: > define service{ > use generic-service > host_name bacula-server > service_description SSH > check_command check_nrpe!check_ssh > notifications_enabled 0 > } > > define service{ > use generic-service > host_name bacula-server > service_description PING > check_command check_nrpe!check_ping > } > > The error for check_ping command is this: > NRPE: Unable to read output > and for check_ssh is this: > check_ssh: Could not parse arguments > > What I'm doing wrong or what I'm not doing? > For check_ssh, you aren't passing any arguments, and it requires at least a host: [pmorris at sf3-bb6 ~]$ /opt/nagios/lib/plugins/check_ssh check_ssh: Could not parse arguments Usage:check_ssh [-46] [-t ] [-r ] [-p ] For check_ping, you're not passing the arguments correctly. The "!" syntax only works in Nagios command definitions, and does not apply to NRPE. ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From luc.maignan at winxpert.com Tue Dec 8 18:32:32 2009 From: luc.maignan at winxpert.com (Luc MAIGNAN) Date: Tue, 08 Dec 2009 18:32:32 +0100 Subject: check_disk_smb problem Message-ID: <4B1E8DB0.2040104@winxpert.com> Hi, I have a NAS with a SMB share enable on it. I want to monitor it using check_disk_smb plugin, but it gives me the error : "Result from smbclient not suitable" Has someone an idea ? BR ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From pangrazi at gmail.com Tue Dec 8 18:32:06 2009 From: pangrazi at gmail.com (Greg Pangrazio) Date: Tue, 8 Dec 2009 11:32:06 -0600 Subject: check_ssh and check_ping not working with NRPE on remote host In-Reply-To: <4B1E8B82.8070001@hp.com> References: <4B1E8941.5000003@uci.cu> <4B1E8B82.8070001@hp.com> Message-ID: I use commands similar to what you are doing, did you enable command processing from remote submissions? In the windows version it says something about "don't blame me" Greg Pangrazio pangrazi at gmail.com On Tue, Dec 8, 2009 at 11:23 AM, Morris, Patrick wrote: > ReynierPM wrote: >> Hi every: >> I've experimenting some problems when trying to use check_ssh and >> check_ping plugins. The configuration at remote server (where NRPE >> resides) is as follow: >> >> command[check_ssh]=/usr/local/nagios/libexec/check_ssh >> command[check_ping]=/usr/local/nagios/libexec/check_ping!100.0,20%!500.0,60% >> >> And then at server where .cfg resides the config is this one: >> define service{ >> ? ? ? ? ?use ? ? ? ? ? ? ? ? ? ? ? ? ? ? generic-service >> ? ? ? ? ?host_name ? ? ? ? ? ? ? ? ? ? ? bacula-server >> ? ? ? ? ?service_description ? ? ? ? ? ? SSH >> ? ? ? ? ?check_command ? ? ? ? ? ? ? ? ? check_nrpe!check_ssh >> ? ? ? ? ?notifications_enabled ? ? ? ? ? 0 >> } >> >> define service{ >> ? ? ? ? ?use ? ? ? ? ? ? ? ? ? ? ? ? ? ? generic-service >> ? ? ? ? ?host_name ? ? ? ? ? ? ? ? ? ? ? bacula-server >> ? ? ? ? ?service_description ? ? ? ? ? ? PING >> ? ? ? ? ?check_command ? ? ? ? ? ? ? ? ? check_nrpe!check_ping >> } >> >> The error for check_ping command is this: >> NRPE: Unable to read output >> and for check_ssh is this: >> check_ssh: Could not parse arguments >> >> What I'm doing wrong or what I'm not doing? >> > For check_ssh, you aren't passing any arguments, and it requires at > least a host: > > [pmorris at sf3-bb6 ~]$ /opt/nagios/lib/plugins/check_ssh > check_ssh: Could not parse arguments > Usage:check_ssh [-46] [-t ] [-r ] [-p ] > > > For check_ping, you're not passing the arguments correctly. The "!" > syntax only works in Nagios command definitions, and does not apply to NRPE. > > > > ------------------------------------------------------------------------------ > Return on Information: > Google Enterprise Search pays you back > Get the facts. > http://p.sf.net/sfu/google-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rperezm at uci.cu Tue Dec 8 18:41:05 2009 From: rperezm at uci.cu (ReynierPM) Date: Tue, 08 Dec 2009 12:41:05 -0500 Subject: check_ssh and check_ping not working with NRPE on remote host In-Reply-To: References: <4B1E8941.5000003@uci.cu> <4B1E8B82.8070001@hp.com> Message-ID: <4B1E8FB1.6060408@uci.cu> Greg Pangrazio wrote: > I use commands similar to what you are doing, did you enable command > processing from remote submissions? What you mean with "command processing remote submissions"? I can't understand this part. > In the windows version it says something about "don't blame me" I don't know what you mean with "windows version" but I found this line: # COMMAND ARGUMENT PROCESSING # This option determines whether or not the NRPE daemon will allow clients # to specify arguments to commands that are executed. This option only works # if the daemon was configured with the --enable-command-args configure script # option. # # *** ENABLING THIS OPTION IS A SECURITY RISK! *** # Read the SECURITY file for information on some of the security implications # of enabling this variable. # # Values: 0=do not allow arguments, 1=allow command arguments dont_blame_nrpe=0 So if I enable thi, which are the main security risks? -- Cheers ReynierPM ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From wallis at aps.anl.gov Tue Dec 8 18:41:21 2009 From: wallis at aps.anl.gov (David Wallis) Date: Tue, 08 Dec 2009 11:41:21 -0600 Subject: check_disk_smb problem In-Reply-To: <4B1E8DB0.2040104@winxpert.com> References: <4B1E8DB0.2040104@winxpert.com> Message-ID: <4B1E8FC1.1080500@aps.anl.gov> Luc MAIGNAN wrote: > Hi, > > I have a NAS with a SMB share enable on it. > > I want to monitor it using check_disk_smb plugin, but it gives me the > error : > > "Result from smbclient not suitable" > > Has someone an idea ? > > You're giving check_disk_smb some parameters that your NAS device doesn't like, or you're omitting one or more parameters that it needs. You need to carefully go over your service definition and check command, and run the actual command from the command line to see how the NAS device is responding. I believe that that check command accepts a '-v' (verbose output) option that will probably help you. -- David Wallis Information Technology Advanced Photon Source Argonne National Laboratory ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rperezm at uci.cu Tue Dec 8 18:44:15 2009 From: rperezm at uci.cu (ReynierPM) Date: Tue, 08 Dec 2009 12:44:15 -0500 Subject: check_ssh and check_ping not working with NRPE on remote host In-Reply-To: <4B1E8B82.8070001@hp.com> References: <4B1E8941.5000003@uci.cu> <4B1E8B82.8070001@hp.com> Message-ID: <4B1E906F.1090802@uci.cu> Morris, Patrick wrote: > ReynierPM wrote: > For check_ssh, you aren't passing any arguments, and it requires at > least a host: > > [pmorris at sf3-bb6 ~]$ /opt/nagios/lib/plugins/check_ssh > check_ssh: Could not parse arguments > Usage:check_ssh [-46] [-t ] [-r ] [-p ] > Ok, I fixed this one > For check_ping, you're not passing the arguments correctly. The "!" > syntax only works in Nagios command definitions, and does not apply to > NRPE. For this, where I can find detailed info about parameters? When I run the command ./check_ping this is returned: check_ping: Could not parse arguments Usage:check_ping -H -w ,% -c ,% [-p packets] [-t timeout] [-4|-6] But some of this are no so clear for me. -- Cheers ReynierPM ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From pangrazi at gmail.com Tue Dec 8 18:42:38 2009 From: pangrazi at gmail.com (Greg Pangrazio) Date: Tue, 8 Dec 2009 11:42:38 -0600 Subject: check_ssh and check_ping not working with NRPE on remote host In-Reply-To: <4B1E8FB1.6060408@uci.cu> References: <4B1E8941.5000003@uci.cu> <4B1E8B82.8070001@hp.com> <4B1E8FB1.6060408@uci.cu> Message-ID: The security risks are that there is a potential for remote command execution on the system. This is exactly the section I was refering to. Greg Pangrazio pangrazi at gmail.com On Tue, Dec 8, 2009 at 11:41 AM, ReynierPM wrote: > Greg Pangrazio wrote: >> >> I use commands similar to what you are doing, did you enable command >> processing from remote submissions? > > What you mean with "command processing remote submissions"? I can't > understand this part. > >> In the windows version it says something about "don't blame me" > > I don't know what you mean with "windows version" but I found this line: > > # COMMAND ARGUMENT PROCESSING > # This option determines whether or not the NRPE daemon will allow clients > # to specify arguments to commands that are executed. ?This option only > works > # if the daemon was configured with the --enable-command-args configure > script > # option. > # > # *** ENABLING THIS OPTION IS A SECURITY RISK! *** > # Read the SECURITY file for information on some of the security > implications > # of enabling this variable. > # > # Values: 0=do not allow arguments, 1=allow command arguments > > dont_blame_nrpe=0 > > So if I enable thi, which are the main security risks? > -- > Cheers > ReynierPM > ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Tue Dec 8 18:44:41 2009 From: patrick.morris at hp.com (Morris, Patrick) Date: Tue, 08 Dec 2009 09:44:41 -0800 Subject: need expert advice/suggestions In-Reply-To: References: <6440EB3D9B6A5646B7B2B52D1A25B270307AD3EF@USNAVSXCHMBSA2.ndc.alcatel-lucent.com> <4B1C9A97.9030205@lilacnetworks.com> <20091207212919.GO5494@bakgwai.americas.hpqcorp.net> Message-ID: <4B1E9089.5020102@hp.com> gmartin wrote: > > > On Mon, Dec 7, 2009 at 4:29 PM, > wrote: > > Hi Taylor! > > For what it's worth, Nagios *does* support this. We routinely use > templates which assign a hostgroup to a host, and that hostgroup will > have a set of standard check for that type of host assigned to it. > When > a new host gets added, all it takes is a "use some_host_template" and > all the standard services we run on that type of host just show up. > > > Patrick, can you explain this a bit further or point me towards > another post that does the same. Sounds like an interesting feature I > want to explore. Sure. Here's an example off the top of my head: # Generic template to base other types of host on. # Sets up some baseline defaults for all hosts define host { name generic-host notifications_enabled 1 event_handler_enabled 1 flap_detection_enabled 1 process_perf_data 1 retain_status_information 1 retain_nonstatus_information 1 obsess_over_host 1 check_command check_host_alive check_interval 0 check_freshness 0 max_check_attempts 10 notification_interval 15 notification_period 24x7 notification_options d,u,r,f,s } # Linux hosts use this template define host { name linux-host hostgroups +systems,linux_servers use generic-host contact_groups sysadm,sysadm-oncall } #Windows hosts use this template define host { name windows-host hostgroups +systems,windows_servers use generic-host contact_groups winadm,winadm_oncall } # Check SSH on Linux hosts define service { use generic_service hostgroup_name linux_servers service_description SSH contact_groups sysadm,sysadm_oncall check_command check_trap } # Check telnet on Windows hosts (bad example, but if we ran telnet on Windows, this would work) define service { use generic_service hostgroup_name windows_servers service_description TELNET contact_groups winadm,winadm_oncall check_command check_telnet } # Check for SNMP traps on all hosts define service { use generic_service hostgroup_name systems service_description TRAP contact_groups sysadm,sysadm_oncall check_command check_trap } define host { host_name mylinuxhost use linux-host address mylinuxhost.example.com } define host { host_name mywindowshost use windows-host address mywindowshost.example.com } Note that the last two definitions are the only "real" ones. Those set up two hosts: one Linux, one Windows. Since they're defined using the templates set up earlier, they get the checks appropriate to their host type (SSH for Linux, Telnet for Windows), and all hosts get the "TRAP" check assigned to them. Our real configs here are much more complex and use a larger number of hostgroups, but basically we've set up templates for each hardware/OS combination we use, so that a standard set of checks ends up applied to, say, all of our C-class blade servers that run Red Hat Linux, while a different set of checks gets assigned to DL380 Windows servers, and still other checks get assigned to Cisco 6000-series routers running IOS. We then assign hostgroups at the host level for any applications that need monitoring. We *could* do that with templates, too, but made a design decision to use the templates for the hardware/OS-level checks, and to assign to app-level stuff on the hosts. ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rperezm at uci.cu Tue Dec 8 18:46:05 2009 From: rperezm at uci.cu (ReynierPM) Date: Tue, 08 Dec 2009 12:46:05 -0500 Subject: check_ssh and check_ping not working with NRPE on remote host In-Reply-To: References: <4B1E8941.5000003@uci.cu> <4B1E8B82.8070001@hp.com> <4B1E8FB1.6060408@uci.cu> Message-ID: <4B1E90DD.3080404@uci.cu> Greg Pangrazio wrote: > The security risks are that there is a potential for remote command > execution on the system. This is exactly the section I was refering > to. > Concerning this, what's your recommendation? I'm newbie on this topics and want to learn from those who are gurus of the theme -- Cheers ReynierPM ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Tue Dec 8 18:48:19 2009 From: patrick.morris at hp.com (Morris, Patrick) Date: Tue, 08 Dec 2009 09:48:19 -0800 Subject: check_ssh and check_ping not working with NRPE on remote host In-Reply-To: <4B1E906F.1090802@uci.cu> References: <4B1E8941.5000003@uci.cu> <4B1E8B82.8070001@hp.com> <4B1E906F.1090802@uci.cu> Message-ID: <4B1E9163.8090504@hp.com> ReynierPM wrote: > Morris, Patrick wrote: > >> For check_ping, you're not passing the arguments correctly. The "!" >> syntax only works in Nagios command definitions, and does not apply to >> NRPE. >> > > For this, where I can find detailed info about parameters? When I run > the command ./check_ping this is returned: > > check_ping: Could not parse arguments > Usage:check_ping -H -w ,% -c ,% > [-p packets] [-t timeout] [-4|-6] > > But some of this are no so clear for me. > Try check_ping -h ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From pangrazi at gmail.com Tue Dec 8 18:50:53 2009 From: pangrazi at gmail.com (Greg Pangrazio) Date: Tue, 8 Dec 2009 11:50:53 -0600 Subject: check_ssh and check_ping not working with NRPE on remote host In-Reply-To: <4B1E90DD.3080404@uci.cu> References: <4B1E8941.5000003@uci.cu> <4B1E8B82.8070001@hp.com> <4B1E8FB1.6060408@uci.cu> <4B1E90DD.3080404@uci.cu> Message-ID: Some of this depends upon the security of the system. Maybe on a host with protected information you should run a standalone version of Nagios and submit the results to the main server, just like a distributed setup. Likely for most instances this is not a problem especially if you run a firewall on the system and restrict the nsca port to only your nagios server. In the version i am running there is a section for allowed hosts and this should only contain your Nagios server, and the password should be difficult to guess and SSL should be enabled. Greg Pangrazio pangrazi at gmail.com 847-707-7933 (c) 847-973-0307 (h) On Tue, Dec 8, 2009 at 11:46 AM, ReynierPM wrote: > Greg Pangrazio wrote: >> >> The security risks are that there is a potential for remote command >> execution on the system. ?This is exactly the section I was refering >> to. >> > Concerning this, what's your recommendation? I'm newbie on this topics and > want to learn from those who are gurus of the theme > -- > Cheers > ReynierPM > ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From wallis at aps.anl.gov Tue Dec 8 18:56:47 2009 From: wallis at aps.anl.gov (David Wallis) Date: Tue, 08 Dec 2009 11:56:47 -0600 Subject: check_ssh and check_ping not working with NRPE on remote host In-Reply-To: <4B1E8941.5000003@uci.cu> References: <4B1E8941.5000003@uci.cu> Message-ID: <4B1E935F.70903@aps.anl.gov> ReynierPM wrote: > Hi every: > I've experimenting some problems when trying to use check_ssh and > check_ping plugins. The configuration at remote server (where NRPE > resides) is as follow: > > command[check_ssh]=/usr/local/nagios/libexec/check_ssh > command[check_ping]=/usr/local/nagios/libexec/check_ping!100.0,20%!500.0,60% > > And then at server where .cfg resides the config is this one: > define service{ > use generic-service > host_name bacula-server > service_description SSH > check_command check_nrpe!check_ssh > notifications_enabled 0 > } > > define service{ > use generic-service > host_name bacula-server > service_description PING > check_command check_nrpe!check_ping > } > > The error for check_ping command is this: > NRPE: Unable to read output > and for check_ssh is this: > check_ssh: Could not parse arguments > > What I'm doing wrong or what I'm not doing? > It's not clear why you're using check_nrpe for these two services... that's probably not what you want to do. Check_ssh and check_ping should be run on the Nagios server itself. And, as another member pointed out, you've got the command line options messed up for both service checks: Usage: check_ping -H -w ,% -c ,% [-p packets] [-t timeout] [-4|-6] Usage: check_ssh [-46] [-t ] [-r ] [-p ] -- David Wallis Information Technology Advanced Photon Source Argonne National Laboratory -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Tue Dec 8 19:09:07 2009 From: marc at ena.com (Marc Powell) Date: Tue, 8 Dec 2009 12:09:07 -0600 Subject: NSCA strange behaviour In-Reply-To: <20091208181902.344ab911@saya.wrk.lsn.camptocamp.com> References: <20091208114000.2872e363@saya.wrk.lsn.camptocamp.com> <20091208181902.344ab911@saya.wrk.lsn.camptocamp.com> Message-ID: <536DC192-3F76-40E3-8295-3E4EF81A5999@ena.com> On Dec 8, 2009, at 11:19 AM, Cedric Jeanneret wrote: > Hello Marc, > > Indeed, the "are fresh" comes from nagios.log. ok, done. > > NSCA is running in daemon mode, iptables is opened for nsca port, and connections can go through it (tcpdump shows it to me, in both directions). > > Setting "debug=1" in nsca.cfg seems to do nothing more in /var/log/messages (redhat server). I just see "down" hosts passing through (results for ... are stalled - forcing immedia check...). There will be lots of output. I'm pretty sure that NSCA logs to syslog's 'daemon' facility using the 'debug','err', and 'info' priorities. Find out what file those are being logged to in /etc/syslog.conf. You're not going to get much traction on this issue until you can see that NSCA output. You could also try grepping your log files for 'Listening for connections' to find where it's going (if it it's currently being logged by syslog). -- Marc ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rperezm at uci.cu Tue Dec 8 19:25:22 2009 From: rperezm at uci.cu (ReynierPM) Date: Tue, 08 Dec 2009 13:25:22 -0500 Subject: check_ssh and check_ping not working with NRPE on remote host In-Reply-To: References: <4B1E8941.5000003@uci.cu> <4B1E8B82.8070001@hp.com> <4B1E8FB1.6060408@uci.cu> <4B1E90DD.3080404@uci.cu> Message-ID: <4B1E9A12.4010501@uci.cu> Greg Pangrazio wrote: > Some of this depends upon the security of the system. Maybe on a host > with protected information you should run a standalone version of > Nagios and submit the results to the main server, just like a > distributed setup. This server have Bacula installed and attached to this exists a SAN disk with 2 TB where backups are saved. This is the only information "sensible" that I have on this server. > Likely for most instances this is not a problem especially if you run > a firewall on the system and restrict the nsca port to only your > nagios server. I'm running iptables on each of my server and if with NCSA you mean port 5666 the it's the only open port and of course those related to services installed on this server > In the version i am running there is a section for allowed hosts and > this should only contain your Nagios server, and the password should > be difficult to guess and SSL should be enabled. Could you please post a example of this configuration? Cheers -- ReynierPM ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rperezm at uci.cu Tue Dec 8 19:27:34 2009 From: rperezm at uci.cu (ReynierPM) Date: Tue, 08 Dec 2009 13:27:34 -0500 Subject: check_ssh and check_ping not working with NRPE on remote host In-Reply-To: <4B1E935F.70903@aps.anl.gov> References: <4B1E8941.5000003@uci.cu> <4B1E935F.70903@aps.anl.gov> Message-ID: <4B1E9A96.1000608@uci.cu> David Wallis wrote: > ReynierPM wrote: > > Usage: check_ping -H -w ,% -c > ,% [-p packets] [-t timeout] [-4|-6] Ok I'm trying to fix this one too but can't understand some of parameters need to be passed to check_ping command. Could any one point me to documentation about this? Right now I'm getting this error: was not set, but because I unknown what "wrta" means I can fix the error. -- Cheers ReynierPM ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From breandan at dezendorf.com Tue Dec 8 19:28:30 2009 From: breandan at dezendorf.com (Breandan Dezendorf) Date: Tue, 8 Dec 2009 13:28:30 -0500 Subject: Async Nagios Logging Message-ID: <3d9f8fe80912081028h6d360c7epc2c284ac851cb329@mail.gmail.com> I'm trying to figure out how to make nagios write it's logs asynchronously. I can push the logs into syslog, which I can force into async mode, but the format of logs it generates don't match up with the logs in nagios.log, and initial tests to have the CGI's read those logs have been fruitless. For what it's worth, I'm getting about 700 check results a second to the main nagios instances, on RHEL 5. I'm also running pnp4nagios and NDOUtils. Anyone have any suggestions for making the main daemon write in an async fashion? -- Breandan Dezendorf breandan at dezendorf.com bwdezend at gmail.com ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From wallis at aps.anl.gov Tue Dec 8 19:35:05 2009 From: wallis at aps.anl.gov (David Wallis) Date: Tue, 08 Dec 2009 12:35:05 -0600 Subject: check_ssh and check_ping not working with NRPE on remote host In-Reply-To: <4B1E9A96.1000608@uci.cu> References: <4B1E8941.5000003@uci.cu> <4B1E935F.70903@aps.anl.gov> <4B1E9A96.1000608@uci.cu> Message-ID: <4B1E9C59.1040100@aps.anl.gov> ReynierPM wrote: > David Wallis wrote: >> ReynierPM wrote: >> >> Usage: check_ping -H -w ,% -c >> ,% [-p packets] [-t timeout] [-4|-6] > > Ok I'm trying to fix this one too but can't understand some of > parameters need to be passed to check_ping command. Could any one > point me to documentation about this? Right now I'm getting this > error: was not set, but because I unknown what "wrta" means I > can fix the error. > is the Round-Trip Average travel time of the ping packet, in millisecond. The "-w" (warning level) and "-c" (critical level) options each take a pair of values... the RTA in milliseconds, and packet loss rate, in percent. So, a typical command line would look like: check_ping -H 192.168.1.1 -w 400,20% -c 800,40% -- David Wallis Information Technology Advanced Photon Source Argonne National Laboratory 630.252.7375 ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rperezm at uci.cu Tue Dec 8 19:42:12 2009 From: rperezm at uci.cu (ReynierPM) Date: Tue, 08 Dec 2009 13:42:12 -0500 Subject: check_ssh and check_ping not working with NRPE on remote host In-Reply-To: <4B1E9C59.1040100@aps.anl.gov> References: <4B1E8941.5000003@uci.cu> <4B1E935F.70903@aps.anl.gov> <4B1E9A96.1000608@uci.cu> <4B1E9C59.1040100@aps.anl.gov> Message-ID: <4B1E9E04.60504@uci.cu> David Wallis wrote: > ReynierPM wrote: > > is the Round-Trip Average travel time of the ping packet, in > millisecond. The "-w" (warning level) and "-c" (critical level) options > each take a pair of values... the RTA in milliseconds, and packet loss > rate, in percent. So, a typical command line would look like: > > check_ping -H 192.168.1.1 -w 400,20% -c 800,40% > Thx to all of yours for take some time to help me, now it working fine and I really understood what those parameters means, anyway I'm looking for a guide to see every one of this parameters and can't find one, could any tell me where to find one? -- Cheers ReynierPM ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From wallis at aps.anl.gov Tue Dec 8 19:43:48 2009 From: wallis at aps.anl.gov (David Wallis) Date: Tue, 08 Dec 2009 12:43:48 -0600 Subject: check_ssh and check_ping not working with NRPE on remote host In-Reply-To: <4B1E9E04.60504@uci.cu> References: <4B1E8941.5000003@uci.cu> <4B1E935F.70903@aps.anl.gov> <4B1E9A96.1000608@uci.cu> <4B1E9C59.1040100@aps.anl.gov> <4B1E9E04.60504@uci.cu> Message-ID: <4B1E9E64.8090105@aps.anl.gov> ReynierPM wrote: > David Wallis wrote: >> ReynierPM wrote: >> >> is the Round-Trip Average travel time of the ping packet, in >> millisecond. The "-w" (warning level) and "-c" (critical level) >> options each take a pair of values... the RTA in milliseconds, and >> packet loss rate, in percent. So, a typical command line would look >> like: >> >> check_ping -H 192.168.1.1 -w 400,20% -c 800,40% >> > > Thx to all of yours for take some time to help me, now it working fine > and I really understood what those parameters means, anyway I'm > looking for a guide to see every one of this parameters and can't find > one, could any tell me where to find one? > Any check_* command will give you fairly detailed help if you run it with the "-h" option. -- David Wallis Information Technology Advanced Photon Source Argonne National Laboratory ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From cedric.jeanneret at camptocamp.com Tue Dec 8 20:07:28 2009 From: cedric.jeanneret at camptocamp.com (=?UTF-8?Q?C=C3=A9dric_Jeanneret?=) Date: Tue, 8 Dec 2009 20:07:28 +0100 Subject: NSCA strange behaviour In-Reply-To: <536DC192-3F76-40E3-8295-3E4EF81A5999@ena.com> References: <20091208114000.2872e363@saya.wrk.lsn.camptocamp.com> <20091208181902.344ab911@saya.wrk.lsn.camptocamp.com> <536DC192-3F76-40E3-8295-3E4EF81A5999@ena.com> Message-ID: <7f569b520912081107n6f7dbb0eq18dbd04ab7bc3750@mail.gmail.com> Oh, thank you. I'll do that tomorrow and see what can be used here. On Tue, Dec 8, 2009 at 7:09 PM, Marc Powell wrote: > > On Dec 8, 2009, at 11:19 AM, Cedric Jeanneret wrote: > >> Hello Marc, >> >> Indeed, the "are fresh" comes from nagios.log. ok, done. >> >> NSCA is running in daemon mode, iptables is opened for nsca port, and connections can go through it (tcpdump shows it to me, in both directions). >> >> Setting "debug=1" in nsca.cfg seems to do nothing more in /var/log/messages (redhat server). I just see "down" hosts passing through (results for ... are stalled - forcing immedia check...). > > There will be lots of output. I'm pretty sure that NSCA logs to syslog's 'daemon' facility using the 'debug','err', and 'info' priorities. Find out what file those are being logged to in /etc/syslog.conf. You're not going to get much traction on this issue until you can see that NSCA output. You could also try grepping your log files for 'Listening for connections' to find where it's going (if it it's currently being logged by syslog). > > -- > Marc > > > > ------------------------------------------------------------------------------ > Return on Information: > Google Enterprise Search pays you back > Get the facts. > http://p.sf.net/sfu/google-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Tue Dec 8 20:34:35 2009 From: patrick.morris at hp.com (patrick.morris at hp.com) Date: Tue, 8 Dec 2009 11:34:35 -0800 Subject: check_ssh and check_ping not working with NRPE on remote host In-Reply-To: <4B1E9E04.60504@uci.cu> References: <4B1E8941.5000003@uci.cu> <4B1E935F.70903@aps.anl.gov> <4B1E9A96.1000608@uci.cu> <4B1E9C59.1040100@aps.anl.gov> <4B1E9E04.60504@uci.cu> Message-ID: <20091208193435.GU5494@bakgwai.americas.hpqcorp.net> On Tue, 08 Dec 2009, ReynierPM wrote: > David Wallis wrote: > > ReynierPM wrote: > > > > is the Round-Trip Average travel time of the ping packet, in > > millisecond. The "-w" (warning level) and "-c" (critical level) options > > each take a pair of values... the RTA in milliseconds, and packet loss > > rate, in percent. So, a typical command line would look like: > > > > check_ping -H 192.168.1.1 -w 400,20% -c 800,40% > > > > Thx to all of yours for take some time to help me, now it working fine > and I really understood what those parameters means, anyway I'm looking > for a guide to see every one of this parameters and can't find one, > could any tell me where to find one? Again, try check_ping -h ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rperezm at uci.cu Tue Dec 8 20:41:58 2009 From: rperezm at uci.cu (ReynierPM) Date: Tue, 08 Dec 2009 14:41:58 -0500 Subject: Nagios does not send email notifications Message-ID: <4B1EAC06.1010803@uci.cu> Hi every: I have problems now with email notification unless I didn't receive one. I check Nagios Event Logs and see this: [12-08-2009 14:53:28] Warning: Attempting to execute the command "/usr/bin/printf "%b" "***** Nagios *****\n\nNotification Type: PROBLEM\n\nService: Bacula Dir Daemon\nHost: bacula\nAddress: 10.128.50.11\nState: UNKNOWN\n\nDate/Time: Tue Dec 8 14:53:28 CST 2009\n\nAdditional Info:\n\nRemote command execution failed: Permission denied, please try again." | /bin/mail -s "** PROBLEM Service Alert: bacula/Bacula Dir Daemon is UNKNOWN **" rperezm at uci.cu" resulted in a return code of 127. Make sure the script or binary you are trying to execute actually exists... But the mail never arrive to my Inbox. Why? My config is as follow: define command{ command_name notify-host-by-email command_line /usr/bin/printf "%b" "***** Nagios *****\n\nNotification Type: $NOTIFICATIONTYPE$\nHost: $HOSTNAME$\nState: $HOSTSTATE$\nAddress: $HOSTADDRESS$\nInfo: $HOSTOUTPUT$\n\nDate/Time: $LONGDATETIME$\n" | /bin/mail -s "** $NOTIFICATIONTYPE$ Host Alert: $HOSTNAME$ is $HOSTSTATE$ **" $CONTACTEMAIL$ } # 'notify-service-by-email' command definition define command{ command_name notify-service-by-email command_line /usr/bin/printf "%b" "***** Nagios *****\n\nNotification Type: $NOTIFICATIONTYPE$\n\nService: $SERVICEDESC$\nHost: $HOSTALIAS$\nAddress: $HOSTADDRESS$\nState: $SERVICESTATE$\n\nDate/Time: $LONGDATETIME$\n\nAdditional Info:\n\n$SERVICEOUTPUT$" | /bin/mail -s "** $NOTIFICATIONTYPE$ Service Alert: $HOSTALIAS$/$SERVICEDESC$ is $SERVICESTATE$ **" $CONTACTEMAIL$ } Cheers and thanks in advance -- ReynierPM ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Tue Dec 8 21:10:54 2009 From: marc at ena.com (Marc Powell) Date: Tue, 8 Dec 2009 14:10:54 -0600 Subject: check_ssh and check_ping not working with NRPE on remote host In-Reply-To: <4B1E9E64.8090105@aps.anl.gov> References: <4B1E8941.5000003@uci.cu> <4B1E935F.70903@aps.anl.gov> <4B1E9A96.1000608@uci.cu> <4B1E9C59.1040100@aps.anl.gov> <4B1E9E04.60504@uci.cu> <4B1E9E64.8090105@aps.anl.gov> Message-ID: <1303B7DF-5D0A-4C52-A8E4-83ECE6CD914B@ena.com> On Dec 8, 2009, at 12:43 PM, David Wallis wrote: > ReynierPM wrote: >> David Wallis wrote: >> Thx to all of yours for take some time to help me, now it working fine >> and I really understood what those parameters means, anyway I'm >> looking for a guide to see every one of this parameters and can't find >> one, could any tell me where to find one? >> > > Any check_* command will give you fairly detailed help if you run it > with the "-h" option. And online versions for the standard plugins are available at http://nagiosplugins.org/man. -- Marc ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mitsuto at gmail.com Tue Dec 8 21:26:43 2009 From: mitsuto at gmail.com (Marcel) Date: Tue, 8 Dec 2009 18:26:43 -0200 Subject: Nagios does not send email notifications In-Reply-To: <4B1EAC06.1010803@uci.cu> References: <4B1EAC06.1010803@uci.cu> Message-ID: <2dfcbd1b0912081226m1a9de431icd076ac41da714b6@mail.gmail.com> either you don't have printf of mail. On Tue, Dec 8, 2009 at 5:41 PM, ReynierPM wrote: > Hi every: > I have problems now with email notification unless I didn't receive one. > I check Nagios Event Logs and see this: > > [12-08-2009 14:53:28] Warning: Attempting to execute the command > "/usr/bin/printf "%b" "***** Nagios *****\n\nNotification Type: > PROBLEM\n\nService: Bacula Dir Daemon\nHost: bacula\nAddress: > 10.128.50.11\nState: UNKNOWN\n\nDate/Time: Tue Dec 8 14:53:28 CST > 2009\n\nAdditional Info:\n\nRemote command execution failed: Permission > denied, please try again." | /bin/mail -s "** PROBLEM Service Alert: > bacula/Bacula Dir Daemon is UNKNOWN **" rperezm at uci.cu" resulted in a > return code of 127. Make sure the script or binary you are trying to > execute actually exists... > > But the mail never arrive to my Inbox. Why? My config is as follow: > > define command{ > command_name notify-host-by-email > command_line /usr/bin/printf "%b" "***** Nagios > *****\n\nNotification > Type: $NOTIFICATIONTYPE$\nHost: $HOSTNAME$\nState: $HOSTSTATE$\nAddress: > $HOSTADDRESS$\nInfo: $HOSTOUTPUT$\n\nDate/Time: $LONGDATETIME$\n" | > /bin/mail -s "** $NOTIFICATIONTYPE$ Host Alert: $HOSTNAME$ is > $HOSTSTATE$ **" $CONTACTEMAIL$ > } > > # 'notify-service-by-email' command definition > define command{ > command_name notify-service-by-email > command_line /usr/bin/printf "%b" "***** Nagios > *****\n\nNotification > Type: $NOTIFICATIONTYPE$\n\nService: $SERVICEDESC$\nHost: > $HOSTALIAS$\nAddress: $HOSTADDRESS$\nState: $SERVICESTATE$\n\nDate/Time: > $LONGDATETIME$\n\nAdditional Info:\n\n$SERVICEOUTPUT$" | /bin/mail -s > "** $NOTIFICATIONTYPE$ Service Alert: $HOSTALIAS$/$SERVICEDESC$ is > $SERVICESTATE$ **" $CONTACTEMAIL$ > } > > Cheers and thanks in advance > -- > ReynierPM > > > ------------------------------------------------------------------------------ > Return on Information: > Google Enterprise Search pays you back > Get the facts. > http://p.sf.net/sfu/google-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mitsuto at gmail.com Tue Dec 8 21:26:49 2009 From: mitsuto at gmail.com (Marcel) Date: Tue, 8 Dec 2009 18:26:49 -0200 Subject: Nagios does not send email notifications In-Reply-To: <2dfcbd1b0912081226m1a9de431icd076ac41da714b6@mail.gmail.com> References: <4B1EAC06.1010803@uci.cu> <2dfcbd1b0912081226m1a9de431icd076ac41da714b6@mail.gmail.com> Message-ID: <2dfcbd1b0912081226y465328fag60edf54c3a4727be@mail.gmail.com> *or* On Tue, Dec 8, 2009 at 6:26 PM, Marcel wrote: > either you don't have printf of mail. > > On Tue, Dec 8, 2009 at 5:41 PM, ReynierPM wrote: > >> Hi every: >> I have problems now with email notification unless I didn't receive one. >> I check Nagios Event Logs and see this: >> >> [12-08-2009 14:53:28] Warning: Attempting to execute the command >> "/usr/bin/printf "%b" "***** Nagios *****\n\nNotification Type: >> PROBLEM\n\nService: Bacula Dir Daemon\nHost: bacula\nAddress: >> 10.128.50.11\nState: UNKNOWN\n\nDate/Time: Tue Dec 8 14:53:28 CST >> 2009\n\nAdditional Info:\n\nRemote command execution failed: Permission >> denied, please try again." | /bin/mail -s "** PROBLEM Service Alert: >> bacula/Bacula Dir Daemon is UNKNOWN **" rperezm at uci.cu" resulted in a >> return code of 127. Make sure the script or binary you are trying to >> execute actually exists... >> >> But the mail never arrive to my Inbox. Why? My config is as follow: >> >> define command{ >> command_name notify-host-by-email >> command_line /usr/bin/printf "%b" "***** Nagios >> *****\n\nNotification >> Type: $NOTIFICATIONTYPE$\nHost: $HOSTNAME$\nState: $HOSTSTATE$\nAddress: >> $HOSTADDRESS$\nInfo: $HOSTOUTPUT$\n\nDate/Time: $LONGDATETIME$\n" | >> /bin/mail -s "** $NOTIFICATIONTYPE$ Host Alert: $HOSTNAME$ is >> $HOSTSTATE$ **" $CONTACTEMAIL$ >> } >> >> # 'notify-service-by-email' command definition >> define command{ >> command_name notify-service-by-email >> command_line /usr/bin/printf "%b" "***** Nagios >> *****\n\nNotification >> Type: $NOTIFICATIONTYPE$\n\nService: $SERVICEDESC$\nHost: >> $HOSTALIAS$\nAddress: $HOSTADDRESS$\nState: $SERVICESTATE$\n\nDate/Time: >> $LONGDATETIME$\n\nAdditional Info:\n\n$SERVICEOUTPUT$" | /bin/mail -s >> "** $NOTIFICATIONTYPE$ Service Alert: $HOSTALIAS$/$SERVICEDESC$ is >> $SERVICESTATE$ **" $CONTACTEMAIL$ >> } >> >> Cheers and thanks in advance >> -- >> ReynierPM >> >> >> ------------------------------------------------------------------------------ >> Return on Information: >> Google Enterprise Search pays you back >> Get the facts. >> http://p.sf.net/sfu/google-dev2dev >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when >> reporting any issue. >> ::: Messages without supporting info will risk being sent to /dev/null >> > > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From pangrazi at gmail.com Tue Dec 8 21:28:52 2009 From: pangrazi at gmail.com (Greg Pangrazio) Date: Tue, 8 Dec 2009 14:28:52 -0600 Subject: Nagios does not send email notifications In-Reply-To: <4B1EAC06.1010803@uci.cu> References: <4B1EAC06.1010803@uci.cu> Message-ID: Can you send a command using the "mail" command via the command line? If you don't have mail configured on the server nagios cannot send mail. Greg Pangrazio pangrazi at gmail.com On Tue, Dec 8, 2009 at 1:41 PM, ReynierPM wrote: > Hi every: > I have problems now with email notification unless I didn't receive one. > ? I check Nagios Event Logs and see this: > > [12-08-2009 14:53:28] Warning: Attempting to execute the command > "/usr/bin/printf "%b" "***** Nagios *****\n\nNotification Type: > PROBLEM\n\nService: Bacula Dir Daemon\nHost: bacula\nAddress: > 10.128.50.11\nState: UNKNOWN\n\nDate/Time: Tue Dec 8 14:53:28 CST > 2009\n\nAdditional Info:\n\nRemote command execution failed: Permission > denied, please try again." | /bin/mail -s "** PROBLEM Service Alert: > bacula/Bacula Dir Daemon is UNKNOWN **" rperezm at uci.cu" resulted in a > return code of 127. Make sure the script or binary you are trying to > execute actually exists... > > But the mail never arrive to my Inbox. Why? My config is as follow: > > define command{ > ? ? ? ?command_name ? ?notify-host-by-email > ? ? ? ?command_line ? ?/usr/bin/printf "%b" "***** Nagios *****\n\nNotification > Type: $NOTIFICATIONTYPE$\nHost: $HOSTNAME$\nState: $HOSTSTATE$\nAddress: > $HOSTADDRESS$\nInfo: $HOSTOUTPUT$\n\nDate/Time: $LONGDATETIME$\n" | > /bin/mail -s "** $NOTIFICATIONTYPE$ Host Alert: $HOSTNAME$ is > $HOSTSTATE$ **" $CONTACTEMAIL$ > } > > # 'notify-service-by-email' command definition > define command{ > ? ? ? ?command_name ? ?notify-service-by-email > ? ? ? ?command_line ? ?/usr/bin/printf "%b" "***** Nagios *****\n\nNotification > Type: $NOTIFICATIONTYPE$\n\nService: $SERVICEDESC$\nHost: > $HOSTALIAS$\nAddress: $HOSTADDRESS$\nState: $SERVICESTATE$\n\nDate/Time: > $LONGDATETIME$\n\nAdditional Info:\n\n$SERVICEOUTPUT$" | /bin/mail -s > "** $NOTIFICATIONTYPE$ Service Alert: $HOSTALIAS$/$SERVICEDESC$ is > $SERVICESTATE$ **" $CONTACTEMAIL$ > } > > Cheers and thanks in advance > -- > ReynierPM > > ------------------------------------------------------------------------------ > Return on Information: > Google Enterprise Search pays you back > Get the facts. > http://p.sf.net/sfu/google-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Tue Dec 8 21:41:00 2009 From: marc at ena.com (Marc Powell) Date: Tue, 8 Dec 2009 14:41:00 -0600 Subject: Nagios does not send email notifications In-Reply-To: <4B1EAC06.1010803@uci.cu> References: <4B1EAC06.1010803@uci.cu> Message-ID: On Dec 8, 2009, at 1:41 PM, ReynierPM wrote: > Hi every: > I have problems now with email notification unless I didn't receive one. > I check Nagios Event Logs and see this: > > [12-08-2009 14:53:28] Warning: Attempting to execute the command > "/usr/bin/printf "%b" "***** Nagios *****\n\nNotification Type: > PROBLEM\n\nService: Bacula Dir Daemon\nHost: bacula\nAddress: > 10.128.50.11\nState: UNKNOWN\n\nDate/Time: Tue Dec 8 14:53:28 CST > 2009\n\nAdditional Info:\n\nRemote command execution failed: Permission > denied, please try again." | /bin/mail -s "** PROBLEM Service Alert: > bacula/Bacula Dir Daemon is UNKNOWN **" rperezm at uci.cu" resulted in a > return code of 127. Make sure the script or binary you are trying to > execute actually exists... > > But the mail never arrive to my Inbox. Why? Because it wasn't sent. Either /usr/bin/printf doesn't exist or /bin/mail doesn't exist. Make sure the packages that provide them for your system are installed. Was nagios compiled on this same box? Did you change this command in any way after install? Nagios is pretty good about detecting these kinds of things during the ./configure phase and/or your package maintainer should have detected it. -- Marc ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rperezm at uci.cu Tue Dec 8 22:09:57 2009 From: rperezm at uci.cu (ReynierPM) Date: Tue, 08 Dec 2009 16:09:57 -0500 Subject: Nagios does not send email notifications In-Reply-To: References: <4B1EAC06.1010803@uci.cu> Message-ID: <4B1EC0A5.1020208@uci.cu> Marc Powell wrote: > On Dec 8, 2009, at 1:41 PM, ReynierPM wrote: > > > Because it wasn't sent. Either /usr/bin/printf doesn't exist or > /bin/mail doesn't exist. Make sure the packages that provide them for > your system are installed. > > Was nagios compiled on this same box? Did you change this command in > any way after install? Nagios is pretty good about detecting these > kinds of things during the ./configure phase and/or your package > maintainer should have detected it. > Yes, I compiled without touch anything. Maybe my system has nothing installed able to send emails. I've installed Postfix and configure it as Satellite but I think this is just the MTA and not the program who send emails. I'm wrong? How I could check this requirement? -- Cheers ReynierPM ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Tue Dec 8 22:35:53 2009 From: marc at ena.com (Marc Powell) Date: Tue, 8 Dec 2009 15:35:53 -0600 Subject: Nagios does not send email notifications In-Reply-To: <4B1EC0A5.1020208@uci.cu> References: <4B1EAC06.1010803@uci.cu> <4B1EC0A5.1020208@uci.cu> Message-ID: On Dec 8, 2009, at 3:09 PM, ReynierPM wrote: > Yes, I compiled without touch anything. Maybe my system has nothing installed able to send emails. I've installed Postfix and configure it as Satellite but I think this is just the MTA and not the program who send emails. I'm wrong? How I could check this requirement? You could use ls to see if they exist -- $ ls -l /usr/bin/printf /bin/mail -rwxr-xr-x 1 root mail 84856 Jan 7 2007 /bin/mail -rwxr-xr-x 1 root root 31496 Jan 21 2009 /usr/bin/printf You could query your package system to verify if it's been installed. How to do so depends on your distribution. Under Redhat and clones it's -- [mpowell at noctools ~]$ rpm -q --whatprovides /usr/bin/printf coreutils-5.97-19.el5 [mpowell at noctools ~]$ rpm -q --whatprovides /bin/mail mailx-8.1.1-44.2.2 [mpowell at noctools ~]$ rpm -qa | egrep 'coreutils|mailx' coreutils-5.97-19.el5 mailx-8.1.1-44.2.2 -- Marc ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Tue Dec 8 22:37:28 2009 From: patrick.morris at hp.com (patrick.morris at hp.com) Date: Tue, 8 Dec 2009 13:37:28 -0800 Subject: Nagios does not send email notifications In-Reply-To: <4B1EC0A5.1020208@uci.cu> References: <4B1EAC06.1010803@uci.cu> <4B1EC0A5.1020208@uci.cu> Message-ID: <20091208213728.GW5494@bakgwai.americas.hpqcorp.net> Hi ReynierPM! On Tue, 08 Dec 2009, ReynierPM wrote: > Marc Powell wrote: > > On Dec 8, 2009, at 1:41 PM, ReynierPM wrote: > > > > > > Because it wasn't sent. Either /usr/bin/printf doesn't exist or > > /bin/mail doesn't exist. Make sure the packages that provide them for > > your system are installed. > > > > Was nagios compiled on this same box? Did you change this command in > > any way after install? Nagios is pretty good about detecting these > > kinds of things during the ./configure phase and/or your package > > maintainer should have detected it. > > > > Yes, I compiled without touch anything. Maybe my system has nothing > installed able to send emails. I've installed Postfix and configure it > as Satellite but I think this is just the MTA and not the program who > send emails. I'm wrong? How I could check this requirement? The first step would be to check what was suggested earlier: do /usr/bin/printf and /bin/mail exist? ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From joliver at john-oliver.net Tue Dec 8 23:04:44 2009 From: joliver at john-oliver.net (John Oliver) Date: Tue, 8 Dec 2009 14:04:44 -0800 Subject: check_http, SSL, and DoD Message-ID: <20091208220444.GA13993@ns.sdsitehosting.net> Does anyone have a hack to let check_http -S work on DoD hosts? [joliver at services4 ~]$ openssl s_client -connect infosec.navy.mil:443 CONNECTED(00000003) depth=0 /C=US/O=U.S. Government/OU=DoD/OU=PKI/OU=USN/CN=infosec.navy.mil verify error:num=20:unable to get local issuer certificate verify return:1 depth=0 /C=US/O=U.S. Government/OU=DoD/OU=PKI/OU=USN/CN=infosec.navy.mil verify error:num=27:certificate not trusted verify return:1 depth=0 /C=US/O=U.S. Government/OU=DoD/OU=PKI/OU=USN/CN=infosec.navy.mil verify error:num=21:unable to verify the first certificate verify return:1 12244:error:14094410:SSL routines:SSL3_READ_BYTES:sslv3 alert handshake failure:s3_pkt.c:1053:SSL alert number 40 12244:error:140790E5:SSL routines:SSL23_WRITE:ssl handshake failure:s23_lib.c:188: It would need to be trust DoD root and intermediate certs, and probably to present a client certificate as well. I suppose getting it to accept the "handshake failure" as success would be a stopgap. -- *********************************************************************** * John Oliver http://www.john-oliver.net/ * * * *********************************************************************** ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Steve_Fiedler at PepBoys.com Tue Dec 8 23:43:58 2009 From: Steve_Fiedler at PepBoys.com (Steve_Fiedler at PepBoys.com) Date: Tue, 8 Dec 2009 17:43:58 -0500 Subject: ? on Notification of threshold being exceeded Message-ID: Hello All, Is it possible to set up a check for nagios where an admin is notified when a threshold value has been exceeded in a specific time frame? For example, send me a notification when a check_ping has failed 20 times during the past hour for a server but only send me the notification 1 time for that hour time frame. We get 100's or 1000s of emails from Tivoli for alerts and want to somehow keep track of the alerts but only send out the notification when deemed critical. Thought Nagios could do it better than Tivoli. :) Hope I explained that correctly, Thanks, Steve Disclaimer: The information contained in this communication is confidential and only for the use of the intended addressee(s). If you have received this communication in error, any disclosure or use of such information is strictly prohibited. Please notify the sender immediately and destroy all copies. Thank you. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rperezm at uci.cu Wed Dec 9 02:23:22 2009 From: rperezm at uci.cu (ReynierPM) Date: Tue, 08 Dec 2009 20:23:22 -0500 Subject: Nagios does not send email notifications In-Reply-To: <20091208213728.GW5494@bakgwai.americas.hpqcorp.net> References: <4B1EAC06.1010803@uci.cu> <4B1EC0A5.1020208@uci.cu> <20091208213728.GW5494@bakgwai.americas.hpqcorp.net> Message-ID: <4B1EFC0A.6030401@uci.cu> patrick.morris at hp.com wrote: > Hi ReynierPM! > The first step would be to check what was suggested earlier: do > /usr/bin/printf and /bin/mail exist? > Hi Patrick: /usr/bin/printf -> exists /bin/mail -> doesn't exists What I need to install to get mail functionalities on this server? -- Cheers ReynierPM ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Wed Dec 9 10:05:52 2009 From: nagios at flatto.net (Assaf Flatto) Date: Wed, 09 Dec 2009 09:05:52 +0000 Subject: ? on Notification of threshold being exceeded In-Reply-To: References: Message-ID: <4B1F6870.9000501@flatto.net> Hello have you checked the options of escalations ? http://nagios.sourceforge.net/docs/3_0/escalations.html I think this can solve your problem Assaf Steve_Fiedler at PepBoys.com wrote: > Hello All, > > Is it possible to set up a check for nagios where an admin is notified > when a threshold value has been exceeded in a specific time frame? > > For example, send me a notification when a check_ping has failed 20 > times during the past hour for a server but only send me the > notification 1 time for that hour time frame. > > We get 100's or 1000s of emails from Tivoli for alerts and want to > somehow keep track of the alerts but only send out the notification > when deemed critical. Thought Nagios could do it better than Tivoli. :) > > Hope I explained that correctly, > > Thanks, > Steve > ------------------------------------------------------------------------ > *Disclaimer: The information contained in this communication is > confidential and only for the use of the intended addressee(s). If you > have received this communication in error, any disclosure or use of > such information is strictly prohibited. Please notify the sender > immediately and destroy all copies. Thank you.* > ------------------------------------------------------------------------ > ** > ** > ------------------------------------------------------------------------ > ** > ------------------------------------------------------------------------------ > Return on Information: > Google Enterprise Search pays you back > Get the facts. > http://p.sf.net/sfu/google-dev2dev > ** > ** > ** > ------------------------------------------------------------------------ > ** > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null** ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jinxi at STACHANOV.COM Wed Dec 9 11:41:56 2009 From: jinxi at STACHANOV.COM (Jinxi Cheng) Date: Wed, 9 Dec 2009 11:41:56 +0100 Subject: availability reports in Nagios2 Message-ID: <4806478B83689B42BFEB1B0746572F3535259D@priamus.Stachanov.local> Hi, I have Nagios2 installed and configured. It seems everything is running fine. Can some one explains why I get "program start. Normal program termination" in the logs? I see this when I create a availability report. It used to be "HTTP OK" in Nagios (older version). What does this message "program start. Normal program termination" mean? Also, availability report does not seems to work correctly in Nagios2. Here is copy of the log: Event Start Time Event End TimeEvent DurationEvent/State Type Event/State Information 2009-08-25 13:08:35 2009-08-25 13:08:360d 0h 0m 1s PROGRAM (RE)START Program start 2009-08-25 13:08:36 2009-08-25 13:09:120d 0h 0m 36s PROGRAM END Abnormal program termination 2009-08-25 13:09:12 2009-08-25 13:09:130d 0h 0m 1s PROGRAM (RE)START Program start 2009-08-25 13:09:13 2009-08-25 13:10:330d 0h 1m 20s PROGRAM END Abnormal program termination 2009-08-25 13:10:33 2009-08-25 13:17:470d 0h 7m 14s PROGRAM (RE)START Program start 2009-08-25 13:17:47 2009-08-25 13:17:470d 0h 0m 0s PROGRAM END Normal program termination Thanks in advance Jinxi -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Wed Dec 9 14:13:02 2009 From: marc at ena.com (Marc Powell) Date: Wed, 9 Dec 2009 07:13:02 -0600 Subject: availability reports in Nagios2 In-Reply-To: <4806478B83689B42BFEB1B0746572F3535259D@priamus.Stachanov.local> References: <4806478B83689B42BFEB1B0746572F3535259D@priamus.Stachanov.local> Message-ID: On Dec 9, 2009, at 4:41 AM, Jinxi Cheng wrote: > Hi, > I have Nagios2 installed and configured. It seems everything is running fine. Can some one explains why I get ?program start. Normal program termination? in the logs? You're apparently starting and stopping nagios. > I see this when I create a availability report. It used to be ?HTTP OK? in Nagios (older version). What does this message ?program start. Normal program termination? mean? Two log entries -- [timestamp] starting... [timestamp] shutting down... > Also, availability report does not seems to work correctly in Nagios2. Be more specific. I'm not aware of any issues with them. You are installing an old version of nagios though. If you have no specific requirements, you should be installing nagios-3. > Here is copy of the log: > Event Start Time Event End TimeEvent DurationEvent/State Type Event/State Information > 2009-08-25 13:08:35 2009-08-25 13:08:360d 0h 0m 1s PROGRAM (RE)START Program start This corresponds to 'starting...' in the logs. > 2009-08-25 13:08:36 2009-08-25 13:09:120d 0h 0m 36s PROGRAM END Abnormal program termination This corresponds to 'Bailing out (reason...)" in the logs. > 2009-08-25 13:17:47 2009-08-25 13:17:470d 0h 0m 0s PROGRAM END Normal program termination This corresponds to 'shutting down...' in the logs. -- Marc ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rperezm at uci.cu Wed Dec 9 14:43:12 2009 From: rperezm at uci.cu (ReynierPM) Date: Wed, 09 Dec 2009 08:43:12 -0500 Subject: Nagios does not send email notifications In-Reply-To: References: <4B1EAC06.1010803@uci.cu> <4B1EC0A5.1020208@uci.cu> Message-ID: <4B1FA970.2010601@uci.cu> Marc Powell wrote: > On Dec 8, 2009, at 3:09 PM, ReynierPM wrote: > > > You could use ls to see if they exist -- > > $ ls -l /usr/bin/printf /bin/mail > -rwxr-xr-x 1 root mail 84856 Jan 7 2007 /bin/mail > -rwxr-xr-x 1 root root 31496 Jan 21 2009 /usr/bin/printf > > You could query your package system to verify if it's been installed. How to do so depends on your distribution. Under Redhat and clones it's -- > > [mpowell at noctools ~]$ rpm -q --whatprovides /usr/bin/printf > coreutils-5.97-19.el5 > > [mpowell at noctools ~]$ rpm -q --whatprovides /bin/mail > mailx-8.1.1-44.2.2 > > [mpowell at noctools ~]$ rpm -qa | egrep 'coreutils|mailx' > coreutils-5.97-19.el5 > mailx-8.1.1-44.2.2 > Well I check all and the file exists at /usr/bin/mail so my services definitions is as follow define command{ command_name notify-host-by-email command_line /usr/bin/printf "%b" "***** Nagios *****\n\nNotification Type: $NOTIFICATIONTYPE$\nHost: $HOSTNAME$\nState: $HOSTSTATE$\nAddress: $HOSTADDRESS$\nInfo: $HOSTOUTPUT$\n\nDate/Time: $LONGDATETIME$\n" | /usr/bin/mail -s "** $NOTIFICATIONTYPE$ Host Alert: $HOSTNAME$ is $HOSTSTATE$ **" $CONTACTEMAIL$ } # 'notify-service-by-email' command definition define command{ command_name notify-service-by-email command_line /usr/bin/printf "%b" "***** Nagios *****\n\nNotification Type: $NOTIFICATIONTYPE$\n\nService: $SERVICEDESC$\nHost: $HOSTALIAS$\nAddress: $HOSTADDRESS$\nState: $SERVICESTATE$\n\nDate/Time: $LONGDATETIME$\n\nAdditional Info:\n\n$SERVICEOUTPUT$" | /usr/bin/mail -s "** $NOTIFICATIONTYPE$ Service Alert: $HOSTALIAS$/$SERVICEDESC$ is $SERVICESTATE$ **" $CONTACTEMAIL$ } But now checking logs I get this error: [12-09-2009 07:53:34] SERVICE NOTIFICATION: nagiosadmin;bacula-server;Bacula Dir Daemon;UNKNOWN;notify-service-by-email;Remote command execution failed: Permission denied, please try again. How can I fix this one? Cheers -- ReynierPM ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From cedric.jeanneret at camptocamp.com Wed Dec 9 16:39:16 2009 From: cedric.jeanneret at camptocamp.com (Cedric Jeanneret) Date: Wed, 9 Dec 2009 16:39:16 +0100 Subject: NSCA strange behaviour In-Reply-To: References: <20091208114000.2872e363@saya.wrk.lsn.camptocamp.com> Message-ID: <20091209163916.5636ca1e@saya.wrk.lsn.camptocamp.com> Hello again I've made some other tests: - I changed encryption algo on server01 and client22, restarted nagios&nsca on server01, restarted nagios on client22. - let it run like that, checking logs for something. As I thought, nsca begins to output lot of error regarding version client and/or encryption method and/or password. Nice. - I put back the right encryption method on server01 but NOT on client22. I should have seen a pattern like "Received invalid packet".... but nothing. I forced some status update from client22: for i in $(seq 1000); do echo -n 'host '; /usr/local/bin/submit_ochp $(hostname -f) "UP" "Host is up"; sleep 2; done Nothing. TCPDump shows me traffic, in and out for both hosts... Last test, I stopped nagios&nsca on server01, removed nsca.dump objects.cache retention.dat files, and start again nsca&nagios.... If we let the fact that all my hosts are "pending", my client22 doesn't push any status... nothing in nsca logs. Any other idea ? It's like the nsca daemon ignore (without any output) client22 queries. and client22 doesn't know about it. Best regards, C. -- C?dric Jeanneret | System Administrator 021 619 10 32 | Camptocamp SA cedric.jeanneret at camptocamp.com | PSE-A / EPFL -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 197 bytes Desc: not available URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mitsuto at gmail.com Wed Dec 9 16:58:53 2009 From: mitsuto at gmail.com (Marcel) Date: Wed, 9 Dec 2009 13:58:53 -0200 Subject: ? on Notification of threshold being exceeded In-Reply-To: References: Message-ID: <2dfcbd1b0912090758n5221f250j6d58b533e218b6fe@mail.gmail.com> Hello, On Tue, Dec 8, 2009 at 8:43 PM, wrote: > Hello All, > > Is it possible to set up a check for nagios where an admin is notified when > a threshold value has been exceeded in a specific time frame? > Sure. With a proper service check or with escalations. > For example, send me a notification when a check_ping has failed 20 times > during the past hour for a server but only send me the notification 1 time > for that hour time frame. > I'd setup a ping service that has max_check_attempts set to 20 and normal_check_interval set to 300 - That way you'd only receive notifications in one hour of ping errors. > We get 100's or 1000s of emails from Tivoli for alerts and want to somehow > keep track of the alerts but only send out the notification when deemed > critical. Thought Nagios could do it better than Tivoli. :) > If Tivoli floods your mailbox, and all of them are false positives, then you have a Tivoli setup problem, or a network related issue. Nagios maybe at the same point of view of Tivoli, if you're having network issues. HTH, Marcel -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jcall at verio.net Thu Dec 10 00:06:25 2009 From: jcall at verio.net (Jonathan Call) Date: Wed, 9 Dec 2009 18:06:25 -0500 Subject: Nagios2 process overwhelmed by NSCA daemon? Message-ID: <04F3233F47E2714CB7431AE913E57E7703AB6DA5@IAD-WPRD-XCHB02.corp.verio.net> I recently added two new slaves to a distributed Nagios system. The central server now passively processes 17,000+ service checks on 3000+ servers. It's been over an hour and a half since I brought those new slaves online and I have about 150 hosts still stuck in 'Pending' and about 1300 services in the same state. In addition to that it seems that the service check results from the other slaves that were working normally are now arbitrarily disappearing. For example, on one host three of the service checks have been updated relatively recently (i.e. 5-30 minutes ago) but three other service checks haven't been updated for almost an hour. The slaves all appear operational and the hosts are being checked on time. Is it possible I've overwhelmed Nagios' ability to process data from the NSCA daemon or struck some internal Nagios bottleneck? Any suggestions would be appreciated. Jonathan This email message is intended for the use of the person to whom it has been sent, and may contain information that is confidential or legally protected. If you are not the intended recipient or have received this message in error, you are not authorized to copy, distribute, or otherwise use this message or its attachments. Please notify the sender immediately by return e-mail and permanently delete this message and any attachments. Verio, Inc. makes no warranty that this email is error or virus free. Thank you. ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From pangrazi at gmail.com Thu Dec 10 15:25:47 2009 From: pangrazi at gmail.com (Greg Pangrazio) Date: Thu, 10 Dec 2009 08:25:47 -0600 Subject: Nagios2 process overwhelmed by NSCA daemon? In-Reply-To: <04F3233F47E2714CB7431AE913E57E7703AB6DA5@IAD-WPRD-XCHB02.corp.verio.net> References: <04F3233F47E2714CB7431AE913E57E7703AB6DA5@IAD-WPRD-XCHB02.corp.verio.net> Message-ID: Are you running the full nagios on the "slaves"? Do the checks seem to be working on those hosts? Greg Pangrazio pangrazi at gmail.com On Wed, Dec 9, 2009 at 5:06 PM, Jonathan Call wrote: > I recently added two new slaves to a distributed Nagios system. The > central server now passively processes 17,000+ service checks on 3000+ > servers. > > It's been over an hour and a half since I brought those new slaves > online and I have about 150 hosts still stuck in 'Pending' and about > 1300 services in the same state. In addition to that it seems that the > service check results from the other slaves that were working normally > are now arbitrarily disappearing. For example, on one host three of the > service checks have been updated relatively recently (i.e. 5-30 minutes > ago) but three other service checks haven't been updated for almost an > hour. The slaves all appear operational and the hosts are being checked > on time. Is it possible I've overwhelmed Nagios' ability to process data > from the NSCA daemon or struck some internal Nagios bottleneck? Any > suggestions would be appreciated. > > Jonathan > > > This email message is intended for the use of the person to whom it has been sent, and may contain information that is confidential or legally protected. If you are not the intended recipient or have received this message in error, you are not authorized to copy, distribute, or otherwise use this message or its attachments. Please notify the sender immediately by return e-mail and permanently delete this message and any attachments. Verio, Inc. makes no warranty that this email is error or virus free. ?Thank you. > > ------------------------------------------------------------------------------ > Return on Information: > Google Enterprise Search pays you back > Get the facts. > http://p.sf.net/sfu/google-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mitsuto at gmail.com Thu Dec 10 15:29:15 2009 From: mitsuto at gmail.com (Marcel) Date: Thu, 10 Dec 2009 12:29:15 -0200 Subject: Nagios2 process overwhelmed by NSCA daemon? In-Reply-To: <04F3233F47E2714CB7431AE913E57E7703AB6DA5@IAD-WPRD-XCHB02.corp.verio.net> References: <04F3233F47E2714CB7431AE913E57E7703AB6DA5@IAD-WPRD-XCHB02.corp.verio.net> Message-ID: <2dfcbd1b0912100629m70bc939egbae060ec6ed496ab@mail.gmail.com> In my last job, I was dealing with a nagios install a little bit over than yours, On Wed, Dec 9, 2009 at 9:06 PM, Jonathan Call wrote: > I recently added two new slaves to a distributed Nagios system. The > central server now passively processes 17,000+ service checks on 3000+ > servers. > > It's been over an hour and a half since I brought those new slaves > online and I have about 150 hosts still stuck in 'Pending' and about > 1300 services in the same state. In addition to that it seems that the > service check results from the other slaves that were working normally > are now arbitrarily disappearing. For example, on one host three of the > service checks have been updated relatively recently (i.e. 5-30 minutes > ago) but three other service checks haven't been updated for almost an > hour. The slaves all appear operational and the hosts are being checked > on time. Is it possible I've overwhelmed Nagios' ability to process data > from the NSCA daemon or struck some internal Nagios bottleneck? Any > suggestions would be appreciated. > With 4K servers and just over 24K service checks, with 12 or 13 distributed servers. Well, I've ran into many kinds of problems because of nagios poor design of distributed monitoring setup. Appears that distributed setup was done almost as a poor patch just to have to overcome some limitation . We ended up doing some custom passive plugins. They were built to send status information updates just in case of state change. In that way the load on NSCA side was very much reduced (it was Load Balanced with a Virtual IP, batch updates, but problems would still occur). This set of plugins were a little hard to mantain, because configuration of each server needed to be at the monitored server, puppet ftw. All checks were logged and later synchronized with ndo to have last checks history. NDO and the database schema has had to be modified too. The volume of inserts was way too high to be handled correctly in a timely manner, recurrent restarts of the database causing staled results, every sort of problem in managing those systems, even after a thorough tunning of the database. After adding logic to update only when state change ocurred, and another batch update to update last check and the fields that needed to be updated with last check information, the database load was normalized and scalability could be proven. So what I'd suggest to you, is to first tweak with the large installation procedures, tmpfs for the status.dat, objects.cache, retention.dat, setting batch jobs to send_nsca output to central/master nagios instance, and so on. Also, you can do some nagios setup magic aswell, having distributed nodes checking in a frequency (normal_check_interval) different than central nagios expects, say, setup central nagios to wait for status information on 30 minutes frequency, but have the distributed nodes to send them at 15 minutes freq., something like that. For what I know, it's really a cumbersome job to have enterprise scalability nagios configuration. For tiny and trivial installs it's like using Zennoss or Zabbixx, but with a lot of extra configuration-files pain. I think that no other competitor's tool (Z*bbnn*ssxx) would scale too when you need enterprise huge installs, so nagios is a little ahead and gives flexibility, but with an associated cost that scares anyone (ending up buying another tool to much less for much more). That's why I've liked Gab?s Jean's Shinken approach to have scalability and to ease interoperability with puppet. That would be the ?bber-super-mega-ultra tool. Also, with nginx and asynchronicity of front-end, back-end, and checks, would end up with the most robust, easy, enterprise NMS. So, G?an, continue on that path to have your Shinken working with backcompatibility with nagios setups, but also think ahead on design to have puppet integrated to handle configuration convergence (maybe eventhandlers too?). Cheers, M -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From shadhin71 at gmail.com Thu Dec 10 16:18:45 2009 From: shadhin71 at gmail.com (shadih rahman) Date: Thu, 10 Dec 2009 10:18:45 -0500 Subject: check_snmp with regular expression Message-ID: <6db4a4200912100718l53e3b3fbr7a8ca67aaf0c3517@mail.gmail.com> List, I am trying to use check_snmp plugin with the following regular expression and I am getting an error, can someone point out what am I doing wrong. Thanks /usr/lib64/nagios/plugins/check_snmp -H hostname -C community -o .1.3.6.1.2.1.1.6.0 -r "^*.some string*$" Could Not Compile Regular Expressioncheck_snmp: Could not parse arguments -- Cordially, Shadhin Rahman -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From pangrazi at gmail.com Thu Dec 10 16:28:07 2009 From: pangrazi at gmail.com (Greg Pangrazio) Date: Thu, 10 Dec 2009 09:28:07 -0600 Subject: check_snmp with regular expression In-Reply-To: <6db4a4200912100718l53e3b3fbr7a8ca67aaf0c3517@mail.gmail.com> References: <6db4a4200912100718l53e3b3fbr7a8ca67aaf0c3517@mail.gmail.com> Message-ID: did you mean "^*.some string.*$" notice the period before the second * Greg Pangrazio pangrazi at gmail.com On Thu, Dec 10, 2009 at 9:18 AM, shadih rahman wrote: > List, > ?? I am trying to use check_snmp plugin with the following regular > expression and I am getting an error, can someone point out what am I doing > wrong.? Thanks > > > > /usr/lib64/nagios/plugins/check_snmp -H hostname -C community -o > .1.3.6.1.2.1.1.6.0 -r "^*.some string*$" > > Could Not Compile Regular Expressioncheck_snmp: Could not parse arguments > > > > > -- > Cordially, > Shadhin Rahman > > ------------------------------------------------------------------------------ > Return on Information: > Google Enterprise Search pays you back > Get the facts. > http://p.sf.net/sfu/google-dev2dev > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mmelin at gmail.com Thu Dec 10 16:34:40 2009 From: mmelin at gmail.com (Martin Melin) Date: Thu, 10 Dec 2009 16:34:40 +0100 Subject: check_snmp with regular expression In-Reply-To: <6db4a4200912100718l53e3b3fbr7a8ca67aaf0c3517@mail.gmail.com> References: <6db4a4200912100718l53e3b3fbr7a8ca67aaf0c3517@mail.gmail.com> Message-ID: It looks like you're trying to match "some string", no matter where it appears in the document. In that case, anchoring to line beginning and end is just extra work. Simply match on "some string", and you're good to go. The asterisk is a modifier to the dot, so it needs to come after that. So the regex you pasted should probably be "^.*some string.*$", but this is functionally equivalent to "some string". Regards, Martin Melin On Thu, Dec 10, 2009 at 4:18 PM, shadih rahman wrote: > List, > I am trying to use check_snmp plugin with the following regular > expression and I am getting an error, can someone point out what am I doing > wrong. Thanks > > > > /usr/lib64/nagios/plugins/check_snmp -H hostname -C community -o > .1.3.6.1.2.1.1.6.0 -r "^*.some string*$" > > Could Not Compile Regular Expressioncheck_snmp: Could not parse arguments > > > > > -- > Cordially, > Shadhin Rahman > > > ------------------------------------------------------------------------------ > Return on Information: > Google Enterprise Search pays you back > Get the facts. > http://p.sf.net/sfu/google-dev2dev > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jcall at verio.net Thu Dec 10 17:39:29 2009 From: jcall at verio.net (Jonathan Call) Date: Thu, 10 Dec 2009 11:39:29 -0500 Subject: Nagios2 process overwhelmed by NSCA daemon? In-Reply-To: References: <04F3233F47E2714CB7431AE913E57E7703AB6DA5@IAD-WPRD-XCHB02.corp.verio.net> Message-ID: <04F3233F47E2714CB7431AE913E57E7703AB6E06@IAD-WPRD-XCHB02.corp.verio.net> Yes, Full Nagios is running on the slaves. They use OCP_daemon to pass on data to the central server since the NSCA client can't hack the load. They seem to be sending data properly to the NSCA daemon. Part of the issue I've tracked down to the status.cgi. The central server appears to be underpowered when it comes to both having Nagios process data AND have several people pounding out host/service status queries from the web interface. I will be adding another CPU to see if this helps, however I'm dismayed that Nagios on the central server doesn't seem to be reporting any errors, or indicating that there is any problem processing passive results. Nagios just starts to lose the data at a certain point. Jonathan > -----Original Message----- > From: Greg Pangrazio [mailto:pangrazi at gmail.com] > Sent: Thursday, December 10, 2009 7:26 AM > To: Jonathan Call > Cc: nagios-user Mailinglist > Subject: Re: [Nagios-users] Nagios2 process overwhelmed by NSCA daemon? > > Are you running the full nagios on the "slaves"? Do the checks seem > to be working on those hosts? > > Greg Pangrazio > pangrazi at gmail.com > > > > > > On Wed, Dec 9, 2009 at 5:06 PM, Jonathan Call wrote: > > I recently added two new slaves to a distributed Nagios system. The > > central server now passively processes 17,000+ service checks on > 3000+ > > servers. > > > > It's been over an hour and a half since I brought those new slaves > > online and I have about 150 hosts still stuck in 'Pending' and about > > 1300 services in the same state. In addition to that it seems that > the > > service check results from the other slaves that were working > normally > > are now arbitrarily disappearing. For example, on one host three of > the > > service checks have been updated relatively recently (i.e. 5-30 > minutes > > ago) but three other service checks haven't been updated for almost > an > > hour. The slaves all appear operational and the hosts are being > checked > > on time. Is it possible I've overwhelmed Nagios' ability to process > data > > from the NSCA daemon or struck some internal Nagios bottleneck? Any > > suggestions would be appreciated. > > > > Jonathan > > > > > > This email message is intended for the use of the person to whom it > has been sent, and may contain information that is confidential or > legally protected. If you are not the intended recipient or have > received this message in error, you are not authorized to copy, > distribute, or otherwise use this message or its attachments. Please > notify the sender immediately by return e-mail and permanently delete > this message and any attachments. Verio, Inc. makes no warranty that > this email is error or virus free. Thank you. > > > > --------------------------------------------------------------------- > --------- > > Return on Information: > > Google Enterprise Search pays you back > > Get the facts. > > http://p.sf.net/sfu/google-dev2dev > > _______________________________________________ > > Nagios-users mailing list > > Nagios-users at lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > > ::: Messages without supporting info will risk being sent to > /dev/null > > This email message is intended for the use of the person to whom it has been sent, and may contain information that is confidential or legally protected. If you are not the intended recipient or have received this message in error, you are not authorized to copy, distribute, or otherwise use this message or its attachments. Please notify the sender immediately by return e-mail and permanently delete this message and any attachments. Verio, Inc. makes no warranty that this email is error or virus free. Thank you. ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From cristoir at gmail.com Thu Dec 10 18:08:46 2009 From: cristoir at gmail.com (Christopher McAtackney) Date: Thu, 10 Dec 2009 17:08:46 +0000 Subject: Nagios as a Service Resiliency Manager Message-ID: Hi all, I have a need to control an Active / Passive pair of components and was wondering if anyone had tackled this problem with Nagios? The scenario is as follows; Host A has SERVICE_1 installed and running. Host B has SERVICE_2 installed, but not running. The desired functionality is to detect when SERVICE_1 is not running (or that Host A is down / unreachable), and then to start SERVICE_2 on Host B. I believe I can do this with Nagios by defining an event handler on SERVICE_1 which will make the appropriate call to start SERVICE_2 on Host B Would it make sense to store the relationship between SERVICE_1 and Host B / SERVICE_2 as a service macro, e.g. $_SERVICE_PASSIVE_HOSTNAME, $_SERVICE_PASSIVE_SERVICENAME? There are too many scenarios in which the SERVICE_1 might come back up to try automate the switching off of SERVICE_2 I believe, e.g. if someone pulled a network cable on Host A accidently, then plugged it in 15 minutes later - during which time Nagios detects that it is down and so starts up SERVICE_2. The user then plugs the network lead back in and now we have two Active instances running - which is what we specifically wanted to avoid. Even if Nagios detects that the primary component is up, it's still too late because any Active / Active overlap will cause problems for this particular application. I can't think of any way to automate that side of things - but does the general concept of having Nagios start up a Passive partner make sense? Thanks for any insight you have, Chris ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mitsuto at gmail.com Thu Dec 10 22:58:46 2009 From: mitsuto at gmail.com (Marcel) Date: Thu, 10 Dec 2009 19:58:46 -0200 Subject: Nagios as a Service Resiliency Manager In-Reply-To: References: Message-ID: <2dfcbd1b0912101358n2a537c37oc20c248534b3bbf6@mail.gmail.com> Maybe this would help: http://onlamp.com/onlamp/2006/05/25/self-healing-networks.html On Thu, Dec 10, 2009 at 3:08 PM, Christopher McAtackney wrote: > Hi all, > > I have a need to control an Active / Passive pair of components and > was wondering if anyone had tackled this problem with Nagios? > > The scenario is as follows; > > Host A has SERVICE_1 installed and running. Host B has SERVICE_2 > installed, but not running. > > The desired functionality is to detect when SERVICE_1 is not running > (or that Host A is down / unreachable), and then to start SERVICE_2 on > Host B. > > I believe I can do this with Nagios by defining an event handler on > SERVICE_1 which will make the appropriate call to start SERVICE_2 on > Host B > > Would it make sense to store the relationship between SERVICE_1 and > Host B / SERVICE_2 as a service macro, e.g. > $_SERVICE_PASSIVE_HOSTNAME, $_SERVICE_PASSIVE_SERVICENAME? > > There are too many scenarios in which the SERVICE_1 might come back up > to try automate the switching off of SERVICE_2 I believe, e.g. if > someone pulled a network cable on Host A accidently, then plugged it > in 15 minutes later - during which time Nagios detects that it is down > and so starts up SERVICE_2. The user then plugs the network lead back > in and now we have two Active instances running - which is what we > specifically wanted to avoid. Even if Nagios detects that the primary > component is up, it's still too late because any Active / Active > overlap will cause problems for this particular application. > > I can't think of any way to automate that side of things - but does > the general concept of having Nagios start up a Passive partner make > sense? > > Thanks for any insight you have, > > Chris > > > ------------------------------------------------------------------------------ > Return on Information: > Google Enterprise Search pays you back > Get the facts. > http://p.sf.net/sfu/google-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From cris.daniluk at gmail.com Fri Dec 11 04:03:34 2009 From: cris.daniluk at gmail.com (Cris Daniluk) Date: Thu, 10 Dec 2009 22:03:34 -0500 Subject: obsessive acknowledgment processing Message-ID: Hi, We are currently forwarding checks from multiple Nagios sites into a central location to create a consolidated view for our operations team. Some sites have their own operations teams as well who acknowledge issues from time to time. I set up a contact attached to all services and created a simple notification command that fires an external command on the central server. This works great for checks with notifications enabled, but if notifications are disabled for the service, it obviously does not forward the acknowledement. I looked for an obvious way to work around this but did not find one. Is there anything that works similar to ocsp but includes acknowledgments? Thanks, Cris -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From chris.blake at tauspace.com Fri Dec 11 11:04:16 2009 From: chris.blake at tauspace.com (Chris Blake) Date: Fri, 11 Dec 2009 12:04:16 +0200 Subject: Unable to log in to Nagios Message-ID: Greetings community, I have recently installed Nagios by following this tutorial, to the letter : http://nagios.sourceforge.net/docs/3_0/quickstart-fedora.html However, when I log into http://localhost/nagios, and enter the user nagiosadmin and the password I set, it just keeps bouncing back to the login screen. I have run through the setup instructions a number of times thinking I may have missed something, but I covered everything again and I`m still not able to log in. When I try to access the forums (http://nagios.meulie.net/) I get a page not found error. I am running Centos 5.2 i386. Can someone please point me in the right direction to help solve this, or simply blurt out the answer for me if you know it :) Thank you in advance for your prompt reply and your willingness to help. -- Regards Chris Blake TAU SPACE Operations Manager ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mmelin at gmail.com Fri Dec 11 11:18:15 2009 From: mmelin at gmail.com (Martin Melin) Date: Fri, 11 Dec 2009 11:18:15 +0100 Subject: Unable to log in to Nagios In-Reply-To: References: Message-ID: >From the quickstart: "5) Configure the Web Interface [...] Create a nagiosadmin account for logging into the Nagios web interface. Remember the password you assign to this account - you'll need it later. htpasswd -c /usr/local/nagios/etc/htpasswd.users nagiosadmin [...]" Your htpasswd file probably either does not contain what you think it does, or is not where it should be. Try re-creating it and using a very simple password. Regards, Martin Melin On Fri, Dec 11, 2009 at 11:04 AM, Chris Blake wrote: > > Greetings community, > > I have recently installed Nagios by following this tutorial, to the > letter : http://nagios.sourceforge.net/docs/3_0/quickstart-fedora.html > > However, when I log into http://localhost/nagios, and enter the user > nagiosadmin and the password I set, it just keeps bouncing back to the > login screen. > > I have run through the setup instructions a number of times thinking I > may have missed something, but I covered everything again and I`m > still not able to log in. > > When I try to access the forums (http://nagios.meulie.net/) I get a > page not found error. > > I am running Centos 5.2 i386. > > Can someone please point me in the right direction to help solve this, > or simply blurt out the answer for me if you know it :) > > Thank you in advance for your prompt reply and your willingness to help. > > -- > Regards > > Chris Blake > TAU SPACE > Operations Manager > > ------------------------------------------------------------------------------ > Return on Information: > Google Enterprise Search pays you back > Get the facts. > http://p.sf.net/sfu/google-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From chris.blake at tauspace.com Fri Dec 11 11:36:05 2009 From: chris.blake at tauspace.com (Chris Blake) Date: Fri, 11 Dec 2009 12:36:05 +0200 Subject: Unable to log in to Nagios In-Reply-To: References: Message-ID: n Fri, Dec 11, 2009 at 12:18 PM, Martin Melin wrote: > From the quickstart: > > "5) Configure the Web Interface > > [...] > > Create a nagiosadmin account for logging into the Nagios web > interface. Remember the password you assign to this account - you'll > need it later. > > htpasswd -c /usr/local/nagios/etc/htpasswd.users nagiosadmin > > [...]" > > Your htpasswd file probably either does not contain what you think it > does, or is not where it should be. Try re-creating it and using a > very simple password. > > Regards, > Martin Melin Hi Martin, thank you for your reply. I re-did those steps a number of times already, restarting Apache each time. I used a simple password, 123, and this is the result of that command : [root at nagios ~]# cat /usr/local/nagios/etc/htpassword.users nagiosadmin:as6UMfOBi.VLI I am still getting bounced back to the login screen. Is the problem perhaps occurring with Apache ? -- Regards Chris Blake TAU SPACE Operations Manager Phone: 011 807 8300 ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From cristoir at gmail.com Fri Dec 11 11:41:56 2009 From: cristoir at gmail.com (Christopher McAtackney) Date: Fri, 11 Dec 2009 10:41:56 +0000 Subject: Nagios as a Service Resiliency Manager In-Reply-To: <2dfcbd1b0912101358n2a537c37oc20c248534b3bbf6@mail.gmail.com> References: <2dfcbd1b0912101358n2a537c37oc20c248534b3bbf6@mail.gmail.com> Message-ID: That's an interesting link - but unfortunately I don't think it really covers the situation where a host goes down or becomes unreachable. It may be the case that Nagios is not suitable for this purpose, but I thought I would check on here in case anyone had done anything like this previously. Cheers, Chris 2009/12/10 Marcel : > Maybe this would help: > http://onlamp.com/onlamp/2006/05/25/self-healing-networks.html > > On Thu, Dec 10, 2009 at 3:08 PM, Christopher McAtackney > wrote: >> >> Hi all, >> >> I have a need to control an Active / Passive pair of components and >> was wondering if anyone had tackled this problem with Nagios? >> >> The scenario is as follows; >> >> Host A has SERVICE_1 installed and running. Host B has SERVICE_2 >> installed, but not running. >> >> The desired functionality is to detect when SERVICE_1 is not running >> (or that Host A is down / unreachable), and then to start SERVICE_2 on >> Host B. >> >> I believe I can do this with Nagios by defining an event handler on >> SERVICE_1 which will make the appropriate call to start SERVICE_2 on >> Host B >> >> Would it make sense to store the relationship between SERVICE_1 and >> Host B / SERVICE_2 as a service macro, e.g. >> $_SERVICE_PASSIVE_HOSTNAME, $_SERVICE_PASSIVE_SERVICENAME? >> >> There are too many scenarios in which the SERVICE_1 might come back up >> to try automate the switching off of SERVICE_2 I believe, e.g. if >> someone pulled a network cable on Host A accidently, then plugged it >> in 15 minutes later - during which time Nagios detects that it is down >> and so starts up SERVICE_2. The user then plugs the network lead back >> in and now we have two Active instances running - which is what we >> specifically wanted to avoid. Even if Nagios detects that the primary >> component is up, it's still too late because any Active / Active >> overlap will cause problems for this particular application. >> >> I can't think of any way to automate that side of things - but does >> the general concept of having Nagios start up a Passive partner make >> sense? >> >> Thanks for any insight you have, >> >> Chris >> >> >> ------------------------------------------------------------------------------ >> Return on Information: >> Google Enterprise Search pays you back >> Get the facts. >> http://p.sf.net/sfu/google-dev2dev >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when >> reporting any issue. >> ::: Messages without supporting info will risk being sent to /dev/null > > ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From chris.blake at tauspace.com Fri Dec 11 12:56:53 2009 From: chris.blake at tauspace.com (Chris Blake) Date: Fri, 11 Dec 2009 13:56:53 +0200 Subject: Unable to log in to Nagios In-Reply-To: References: Message-ID: On Fri, Dec 11, 2009 at 12:18 PM, Martin Melin wrote: > From the quickstart: > > "5) Configure the Web Interface > > [...] > > Create a nagiosadmin account for logging into the Nagios web > interface. Remember the password you assign to this account - you'll > need it later. > > htpasswd -c /usr/local/nagios/etc/htpasswd.users nagiosadmin > > [...]" > > Your htpasswd file probably either does not contain what you think it > does, or is not where it should be. Try re-creating it and using a > very simple password. > > Regards, > Martin Melin Hi again, I followed a bunch of instructions relating to configuring Apache directives and setting things in nagios.conf, and I`m still not able to log in. I am really confused now, when I tail the logs for Apache I get this when trying to log in : [Fri Dec 11 13:20:09 2009] [error] [client 192.168.2.102] (2)No such file or directory: Could not open password file: /usr/local/nagios/etc/htpasswd.users [Fri Dec 11 13:20:09 2009] [error] [client 192.168.2.102] access to /nagios/ failed, reason: verification of user id 'nagiosadmin' not configured I have the following permissions set up : [root at zabbix etc]# pwd /usr/local/nagios/etc [root at zabbix etc]# ls -l total 88 -rw-rw-r-- 1 nagios nagios 11408 Dec 11 13:16 cgi.cfg -rwxrwxrwx 1 nagios nagios 26 Dec 11 12:52 htpassword.users -rw-rw-r-- 1 nagios nagios 43776 Dec 11 10:36 nagios.cfg drwxrwxr-x 2 nagios nagios 4096 Dec 11 10:36 objects -rw-rw---- 1 nagios nagios 1340 Dec 11 10:36 resource.cfg [root at zabbix etc]# Can someone please help me, I have gone through a few articles and I am obviously missing something as to why I cannot log in Thank you. -- Regards Chris Blake TAU SPACE Operations Manag ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Sascha.Runschke at gfkl.com Fri Dec 11 13:05:23 2009 From: Sascha.Runschke at gfkl.com (Sascha.Runschke at gfkl.com) Date: Fri, 11 Dec 2009 13:05:23 +0100 Subject: Alternatives to NSCA? Message-ID: Greetings, does anybody know of a NSCA replacement, which does the job without the need of unsupported libraries? The requirement of libmcrypt makes NSCA a real hassle to use for us, since we have to manually install third party libraries - which we really do not want to. RedHat won't support libmcrypt, as its development is long dead. Any alternatives out there, my google searches didn't come up with anything sadly. Or might it be easier to patch NSCA to use libgcrypt for example, which is available in every distribution? Regards Sascha GFKL Financial Services AG Vorstand: Dr. Peter J?nsch (Vors.), J?rgen Baltes, Dr. Tom Haverkamp Vorsitzender des Aufsichtsrats: Dr. Georg F. Thoma Sitz: Limbecker Platz 1, 45127 Essen, Amtsgericht Essen, HRB 13522 -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mmelin at gmail.com Fri Dec 11 13:08:28 2009 From: mmelin at gmail.com (Martin Melin) Date: Fri, 11 Dec 2009 13:08:28 +0100 Subject: Unable to log in to Nagios In-Reply-To: References: Message-ID: The line: htpasswd -c /usr/local/nagios/etc/htpasswd.users nagiosadmin should be exactly that. htpasswd is not a typo of htpassword ;) As you can see from your ls output, your htpasswd.users file is actually called htpasswORd.users - if you rename the file, you should be good to go. Regards, Martin Melin On Fri, Dec 11, 2009 at 12:56 PM, Chris Blake wrote: > On Fri, Dec 11, 2009 at 12:18 PM, Martin Melin wrote: >> From the quickstart: >> >> "5) Configure the Web Interface >> >> [...] >> >> Create a nagiosadmin account for logging into the Nagios web >> interface. Remember the password you assign to this account - you'll >> need it later. >> >> htpasswd -c /usr/local/nagios/etc/htpasswd.users nagiosadmin >> >> [...]" >> >> Your htpasswd file probably either does not contain what you think it >> does, or is not where it should be. Try re-creating it and using a >> very simple password. >> >> Regards, >> Martin Melin > > Hi again, > > I followed a bunch of instructions relating to configuring Apache > directives and setting things in nagios.conf, and I`m still not able > to log in. > > I am really confused now, when I tail the logs for Apache I get this > when trying to log in : > > > [Fri Dec 11 13:20:09 2009] [error] [client 192.168.2.102] (2)No such > file or directory: Could not open password file: > /usr/local/nagios/etc/htpasswd.users > [Fri Dec 11 13:20:09 2009] [error] [client 192.168.2.102] access to > /nagios/ failed, reason: verification of user id 'nagiosadmin' not > configured > > I have the following permissions set up : > > [root at zabbix etc]# pwd > /usr/local/nagios/etc > [root at zabbix etc]# ls -l > total 88 > -rw-rw-r-- 1 nagios nagios 11408 Dec 11 13:16 cgi.cfg > -rwxrwxrwx 1 nagios nagios ? ?26 Dec 11 12:52 htpassword.users > -rw-rw-r-- 1 nagios nagios 43776 Dec 11 10:36 nagios.cfg > drwxrwxr-x 2 nagios nagios ?4096 Dec 11 10:36 objects > -rw-rw---- 1 nagios nagios ?1340 Dec 11 10:36 resource.cfg > [root at zabbix etc]# > > Can someone please help me, I have gone through a few articles and I > am obviously missing something as to why I cannot log in > > Thank you. > > -- > Regards > > Chris Blake > TAU SPACE > Operations Manag > > ------------------------------------------------------------------------------ > Return on Information: > Google Enterprise Search pays you back > Get the facts. > http://p.sf.net/sfu/google-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Deborah.Martin at Kognitio.com Fri Dec 11 13:12:16 2009 From: Deborah.Martin at Kognitio.com (Deborah Martin) Date: Fri, 11 Dec 2009 12:12:16 -0000 Subject: Unable to log in to Nagios Message-ID: <84836290D5AD43418C40DCF0C4A54ED3015DB908@kogex02.pmpl.co.uk> Chris, The file name the error is referring to is htpasswd.users [Fri Dec 11 13:20:09 2009] [error] [client 192.168.2.102] (2)No such file or directory: Could not open password file: /usr/local/nagios/etc/htpasswd.users But your directory listing shows -rwxrwxrwx 1 nagios nagios 26 Dec 11 12:52 htpassword.users Rename the file to htpasswd.users and that may resolve it! Regards, Deborah -----Original Message----- From: Chris Blake [mailto:chris.blake at tauspace.com] Sent: 11 December 2009 11:57 To: nagios-users at lists.sourceforge.net Subject: Re: [Nagios-users] Unable to log in to Nagios On Fri, Dec 11, 2009 at 12:18 PM, Martin Melin wrote: > From the quickstart: > > "5) Configure the Web Interface > > [...] > > Create a nagiosadmin account for logging into the Nagios web > interface. Remember the password you assign to this account - you'll > need it later. > > htpasswd -c /usr/local/nagios/etc/htpasswd.users nagiosadmin > > [...]" > > Your htpasswd file probably either does not contain what you think it > does, or is not where it should be. Try re-creating it and using a > very simple password. > > Regards, > Martin Melin Hi again, I followed a bunch of instructions relating to configuring Apache directives and setting things in nagios.conf, and I`m still not able to log in. I am really confused now, when I tail the logs for Apache I get this when trying to log in : [Fri Dec 11 13:20:09 2009] [error] [client 192.168.2.102] (2)No such file or directory: Could not open password file: /usr/local/nagios/etc/htpasswd.users [Fri Dec 11 13:20:09 2009] [error] [client 192.168.2.102] access to /nagios/ failed, reason: verification of user id 'nagiosadmin' not configured I have the following permissions set up : [root at zabbix etc]# pwd /usr/local/nagios/etc [root at zabbix etc]# ls -l total 88 -rw-rw-r-- 1 nagios nagios 11408 Dec 11 13:16 cgi.cfg -rwxrwxrwx 1 nagios nagios 26 Dec 11 12:52 htpassword.users -rw-rw-r-- 1 nagios nagios 43776 Dec 11 10:36 nagios.cfg drwxrwxr-x 2 nagios nagios 4096 Dec 11 10:36 objects -rw-rw---- 1 nagios nagios 1340 Dec 11 10:36 resource.cfg [root at zabbix etc]# Can someone please help me, I have gone through a few articles and I am obviously missing something as to why I cannot log in Thank you. -- Regards Chris Blake TAU SPACE Operations Manag ---------------------------------------------------------------------------- -- Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null *************************************************************************** This email and any files transmitted with it are confidential and intended solely for the use of the individual or entity to whom they are addressed. Any unauthorised distribution or copying is strictly prohibited. Whilst Kognitio Limited takes steps to prevent the transmission of viruses via e-mail, we can not guarantee that any email or attachment is free from computer viruses and you are strongly advised to undertake your own anti-virus precautions. Kognitio grants no warranties regarding performance, use or quality of any e-mail or attachment and undertakes no liability for loss or damage, howsoever caused. Kognitio Limited, a company registered in England and Wales. Registered number 0212 7833. Registered Office: 3a Waterside Park, Cookham Road, Bracknell, Berks, RG12 1RB. VAT number 864 4378 92. Kognitio Inc, a company incorporated in Delaware, principal office 180 North Stetson, Suite 3500, Chicago, IL 60601, USA *************************************************************************** -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From chris.blake at tauspace.com Fri Dec 11 13:15:02 2009 From: chris.blake at tauspace.com (Chris Blake) Date: Fri, 11 Dec 2009 14:15:02 +0200 Subject: Unable to log in to Nagios In-Reply-To: <84836290D5AD43418C40DCF0C4A54ED3015DB908@kogex02.pmpl.co.uk> References: <84836290D5AD43418C40DCF0C4A54ED3015DB908@kogex02.pmpl.co.uk> Message-ID: On Fri, Dec 11, 2009 at 2:12 PM, Deborah Martin wrote: > Chris, > > The file name the error is referring to is htpasswd.users > > [Fri Dec 11 13:20:09 2009] [error] [client 192.168.2.102] (2)No such file or > directory: Could not open password file: > /usr/local/nagios/etc/htpasswd.users > > But your directory listing shows > > -rwxrwxrwx 1 nagios nagios??? 26 Dec 11 12:52 htpassword.users > > Rename the file to htpasswd.users and that may resolve it! > > Regards, > Deborah > Thank you Deborah, I am quite embarrassed about this oversight...lol...it was pointed out by Martin as well. Thank you both, I am now able to access Nagios -- Regards Chris Blake TAU SPACE Operations Manager ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From chris.blake at tauspace.com Fri Dec 11 13:12:56 2009 From: chris.blake at tauspace.com (Chris Blake) Date: Fri, 11 Dec 2009 14:12:56 +0200 Subject: Unable to log in to Nagios In-Reply-To: References: Message-ID: On Fri, Dec 11, 2009 at 2:08 PM, Martin Melin wrote: > The line: > > htpasswd -c /usr/local/nagios/etc/htpasswd.users nagiosadmin > > should be exactly that. htpasswd is not a typo of htpassword ;) > > As you can see from your ls output, your htpasswd.users file is > actually called htpasswORd.users - if you rename the file, you should > be good to go. > > Regards, > Martin Melin Oh my hat !! All this and I was overlooking a spelling mistake :( Aaaaaaaaaaaaaaaaaaaaaaaaarghhh !!!! Thank you for pointing out the error of my ways, much appreciated. I am now able to access Nagios. -- Regards Chris Blake TAU SPACE Operations Manager ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From schneemann at b1-systems.de Fri Dec 11 13:17:17 2009 From: schneemann at b1-systems.de (Christian Schneemann) Date: Fri, 11 Dec 2009 13:17:17 +0100 Subject: Unable to log in to Nagios In-Reply-To: References: Message-ID: <200912111317.17431.schneemann@b1-systems.de> On Friday 11 December 2009 12:56:53 Chris Blake wrote: > On Fri, Dec 11, 2009 at 12:18 PM, Martin Melin wrote: > [Fri Dec 11 13:20:09 2009] [error] [client 192.168.2.102] (2)No such > file or directory: Could not open password file: > /usr/local/nagios/etc/htpasswd.users > [Fri Dec 11 13:20:09 2009] [error] [client 192.168.2.102] access to > /nagios/ failed, reason: verification of user id 'nagiosadmin' not > configured > > I have the following permissions set up : > > [root at zabbix etc]# pwd > /usr/local/nagios/etc > [root at zabbix etc]# ls -l > total 88 > -rw-rw-r-- 1 nagios nagios 11408 Dec 11 13:16 cgi.cfg > -rwxrwxrwx 1 nagios nagios 26 Dec 11 12:52 htpassword.users > -rw-rw-r-- 1 nagios nagios 43776 Dec 11 10:36 nagios.cfg > drwxrwxr-x 2 nagios nagios 4096 Dec 11 10:36 objects > -rw-rw---- 1 nagios nagios 1340 Dec 11 10:36 resource.cfg > [root at zabbix etc]# > > Can someone please help me, I have gone through a few articles and I > am obviously missing something as to why I cannot log in Hi, your webserver looks for /usr/local/nagios/etc/htpasswd.users, but your file is named htpassword.users. Just rename your htpassword.users to htpasswd.users and everything should be fine. Greetings, Christian > > Thank you. > -- Christian Schneemann Geschaeftsfuehrer: Ralph Dehner Technical Tester & Writer Unternehmenssitz: Vohburg B1 Systems GmbH Amtsgericht: Ingolstadt Handelsregister: HRB 3537 EMail: schneemann at b1-systems.de http://www.b1-systems.de Adresse: B1 Systems GmbH, Osterfeldstra?e 7, 85088 Vohburg GPG: http://pgpkeys.pca.dfn.de/pks/lookup?op=get&search=0x2FA8643A41BDAB81 ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From pangrazi at gmail.com Fri Dec 11 15:20:37 2009 From: pangrazi at gmail.com (Greg Pangrazio) Date: Fri, 11 Dec 2009 08:20:37 -0600 Subject: Alternatives to NSCA? In-Reply-To: References: Message-ID: I just used nsca with encryption turned off and if you need the security on your local network wrap it in ssl via stunnel. this works on my rhel5.2 boxes and my Ubuntu servers (which have libmcrypt in repo) Greg Pangrazio pangrazi at gmail.com On Fri, Dec 11, 2009 at 6:05 AM, wrote: > Greetings, > > does anybody know of a NSCA replacement, which does the job without the need > of unsupported libraries? The requirement of libmcrypt makes NSCA a real > hassle > to use for us, since we have to manually install third party libraries - > which we really > do not want to. RedHat won't support libmcrypt, as its development is long > dead. > > Any alternatives out there, my google searches didn't come up with anything > sadly. > > Or might it be easier to patch NSCA to use libgcrypt for example, which is > available > in every distribution? > > Regards > ? ? ? ? Sascha > > > GFKL Financial Services AG > Vorstand: Dr. Peter J?nsch (Vors.), J?rgen Baltes, Dr. Tom Haverkamp > Vorsitzender des Aufsichtsrats: Dr. Georg F. Thoma > Sitz: Limbecker Platz 1, 45127 Essen, Amtsgericht Essen, HRB 13522 > ------------------------------------------------------------------------------ > Return on Information: > Google Enterprise Search pays you back > Get the facts. > http://p.sf.net/sfu/google-dev2dev > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From daniel at danielemmanuelfeinsmith.com Fri Dec 11 15:22:23 2009 From: daniel at danielemmanuelfeinsmith.com (Daniel Emmanuel Feinsmith) Date: Fri, 11 Dec 2009 09:22:23 -0500 Subject: [Nagios-devel] Alternatives to NSCA? In-Reply-To: References: Message-ID: <1255F147-D2CC-4DBE-93D6-D0D3A691CB4B@danielemmanuelfeinsmith.com> Sascha, Why don't you create a libmcrypt.a and statically link it into your nsca, thereby not requiring mcrypt as an external library at runtime? Daniel. ===================================================== Check out Brooklyn for Nagios 2.0 on the iPhone App Store! Now Supporting SSL and Host/Service Acknowledgements. ===================================================== On Dec 11, 2009, at 7:05 AM, Sascha.Runschke at gfkl.com wrote: > Greetings, > > does anybody know of a NSCA replacement, which does the job without the need > of unsupported libraries? The requirement of libmcrypt makes NSCA a real hassle > to use for us, since we have to manually install third party libraries - which we really > do not want to. RedHat won't support libmcrypt, as its development is long dead. > > Any alternatives out there, my google searches didn't come up with anything sadly. > > Or might it be easier to patch NSCA to use libgcrypt for example, which is available > in every distribution? > > Regards > Sascha > > > GFKL Financial Services AG > Vorstand: Dr. Peter J?nsch (Vors.), J?rgen Baltes, Dr. Tom Haverkamp > Vorsitzender des Aufsichtsrats: Dr. Georg F. Thoma > Sitz: Limbecker Platz 1, 45127 Essen, Amtsgericht Essen, HRB 13522------------------------------------------------------------------------------ > Return on Information: > Google Enterprise Search pays you back > Get the facts. > http://p.sf.net/sfu/google-dev2dev > _______________________________________________ > Nagios-devel mailing list > Nagios-devel at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-devel -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From cedric.jeanneret at camptocamp.com Fri Dec 11 16:29:20 2009 From: cedric.jeanneret at camptocamp.com (Cedric Jeanneret) Date: Fri, 11 Dec 2009 16:29:20 +0100 Subject: NSCA strange behaviour In-Reply-To: References: <20091208114000.2872e363@saya.wrk.lsn.camptocamp.com> Message-ID: <20091211162920.6f65f080@saya.wrk.lsn.camptocamp.com> Hello, problem solved, it was indeed a version problem (minor number).. redhat has released a new nsca client/server package, removing a patch, and that made it incompatible with previous version. See: http://www.mail-archive.com/nagios-users at lists.sourceforge.net/msg19402.html and related patch: http://cvs.fedoraproject.org/viewvc/EL-5/nsca/nsca-increase_max_plugin_output_length.patch?revision=1.1&view=markup&sortby=log Best regards, C. On Tue, 8 Dec 2009 10:52:50 -0600 Marc Powell wrote: > > On Dec 8, 2009, at 4:40 AM, Cedric Jeanneret wrote: > > > - Enbling debug for nsca on server01 doesn't show me anything interesting. I just don't see where nsca catch up client22 status, and it keeps on saying : > > Warning: The results of host 'client22.domain.lt' are stale by 0d 0h 2m 0s (threshold=0d 0h 6m 0s). I'm forcing an immediate check of the host. > > > > On another hand, it shows me: > > [1260267958.216051] [016.1] [pid=23191] Check results for service 'Cron service' on host 'client22.domain.lt' are fresh. > > What do they show? What you've quoted here doesn't come from NSCA (that I can tell from a simple grep). -- > > nsca-2.7.2]# grep -r 'are fresh' * > nsca-2.7.2]# > > I'm sure that comes from nagios via nagios.log, not NSCA. Are you sure you're looking in the right place? It's typically in /var/log/messages. > > How are you running nsca? daemon mode or via inetd? If inetd, is inetd rejecting the connection? > > -- > Marc > > > ------------------------------------------------------------------------------ > Return on Information: > Google Enterprise Search pays you back > Get the facts. > http://p.sf.net/sfu/google-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null -- C?dric Jeanneret | System Administrator 021 619 10 32 | Camptocamp SA cedric.jeanneret at camptocamp.com | PSE-A / EPFL -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 197 bytes Desc: not available URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From bluesman at bluesman.it Fri Dec 11 17:17:18 2009 From: bluesman at bluesman.it (Diego Roccia) Date: Fri, 11 Dec 2009 17:17:18 +0100 Subject: Modifying custom variables via external commands Message-ID: <4B22708E.2060203@bluesman.it> -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Hi all, I'm having some problem in changing custom object variables via external commands. Using nagios 3.0.6 and 3.2.0 but it seems to ignore *almost* completely my command. > define service{ > name generic-service > [...] > _PRIORITY 1 > _SUPPORT_TEAM Tech Db - Generic > action_url /wiki/doku.php?id=procedures:$SERVICEDESC$ > notes priority:$_SERVICEPRIORITY$
Support Team(s):$_SERVICESUPPORT_TEAM$ > } In the debug I see: > [1260547711.216663] [128.2] [pid=18837] Raw command entry: [1260547696] CHANGE_CUSTOM_SVC_VAR;blinkongdbslave1;SSH Service;_PRIORITY;1 > [1260547711.216970] [001.0] [pid=18837] process_external_command2() > [1260547711.216974] [128.1] [pid=18837] External Command Type: 148 > [1260547711.216979] [128.1] [pid=18837] Command Entry Time: 1260547696 > [1260547711.216983] [128.1] [pid=18837] Command Arguments: blinkongdbslave1;SSH Service;_PRIORITY;1 but the value of the variable is not being changed. Any ideas? thanks in advance Diego -----BEGIN PGP SIGNATURE----- Version: GnuPG v2.0.12 (GNU/Linux) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/ iQEcBAEBAgAGBQJLInCOAAoJEMJ8KUrKt830Z04H/1Bmq4xNjdZEfqXvrP8kSEar 7w9+KnPb0SvtLwIEAG9HWk2sdqcJg4qhq9vnsu079XWBjeCNFRrFW37fUqhi2Oio 5t5flyJuatDGFhFUA+w8oRImhiTmi0/QKzYeJSzMuXlgSFqUDlvQ6JlPlDL6go2z R5fJBSC4vbwhGlxcLhdswXV/FVbzNc363P7IBQJbjI9mL1r1gk+rPx2pDoqKvjig K5hI25OVmXe+QTOEKY4HqeBwu0dx9LiSTvdk17WzwsKjqlj4WwCZ41OZm9VqEpO1 TGfyhiQ92yw3rft/M5oI/AoaW0TTPMgs3wY3TFv6UQsZhmv/LTsAFNt7SsKHkyw= =Y84l -----END PGP SIGNATURE----- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gmartin at gmartin.org Fri Dec 11 18:20:33 2009 From: gmartin at gmartin.org (gmartin) Date: Fri, 11 Dec 2009 12:20:33 -0500 Subject: Nagios as a Service Resiliency Manager In-Reply-To: References: <2dfcbd1b0912101358n2a537c37oc20c248534b3bbf6@mail.gmail.com> Message-ID: Chris, great thing about Nagios is it enables creative solution like this. I'd love to see you try it and report back on how it works for you. On 12/11/09, Christopher McAtackney wrote: > That's an interesting link - but unfortunately I don't think it really > covers the situation where a host goes down or becomes unreachable. It > may be the case that Nagios is not suitable for this purpose, but I > thought I would check on here in case anyone had done anything like > this previously. > > Cheers, > Chris > > 2009/12/10 Marcel : >> Maybe this would help: >> http://onlamp.com/onlamp/2006/05/25/self-healing-networks.html >> >> On Thu, Dec 10, 2009 at 3:08 PM, Christopher McAtackney >> >> wrote: >>> >>> Hi all, >>> >>> I have a need to control an Active / Passive pair of components and >>> was wondering if anyone had tackled this problem with Nagios? >>> >>> The scenario is as follows; >>> >>> Host A has SERVICE_1 installed and running. Host B has SERVICE_2 >>> installed, but not running. >>> >>> The desired functionality is to detect when SERVICE_1 is not running >>> (or that Host A is down / unreachable), and then to start SERVICE_2 on >>> Host B. >>> >>> I believe I can do this with Nagios by defining an event handler on >>> SERVICE_1 which will make the appropriate call to start SERVICE_2 on >>> Host B >>> >>> Would it make sense to store the relationship between SERVICE_1 and >>> Host B / SERVICE_2 as a service macro, e.g. >>> $_SERVICE_PASSIVE_HOSTNAME, $_SERVICE_PASSIVE_SERVICENAME? >>> >>> There are too many scenarios in which the SERVICE_1 might come back up >>> to try automate the switching off of SERVICE_2 I believe, e.g. if >>> someone pulled a network cable on Host A accidently, then plugged it >>> in 15 minutes later - during which time Nagios detects that it is down >>> and so starts up SERVICE_2. The user then plugs the network lead back >>> in and now we have two Active instances running - which is what we >>> specifically wanted to avoid. Even if Nagios detects that the primary >>> component is up, it's still too late because any Active / Active >>> overlap will cause problems for this particular application. >>> >>> I can't think of any way to automate that side of things - but does >>> the general concept of having Nagios start up a Passive partner make >>> sense? >>> >>> Thanks for any insight you have, >>> >>> Chris >>> >>> >>> ------------------------------------------------------------------------------ >>> Return on Information: >>> Google Enterprise Search pays you back >>> Get the facts. >>> http://p.sf.net/sfu/google-dev2dev >>> _______________________________________________ >>> Nagios-users mailing list >>> Nagios-users at lists.sourceforge.net >>> https://lists.sourceforge.net/lists/listinfo/nagios-users >>> ::: Please include Nagios version, plugin version (-v) and OS when >>> reporting any issue. >>> ::: Messages without supporting info will risk being sent to /dev/null >> >> > > ------------------------------------------------------------------------------ > Return on Information: > Google Enterprise Search pays you back > Get the facts. > http://p.sf.net/sfu/google-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Sent from my mobile device \\Greg ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mitsuto at gmail.com Fri Dec 11 18:26:34 2009 From: mitsuto at gmail.com (Marcel) Date: Fri, 11 Dec 2009 15:26:34 -0200 Subject: Alternatives to NSCA? In-Reply-To: References: Message-ID: <2dfcbd1b0912110926g10b23dadyec383042674e48d@mail.gmail.com> Once upon a time, I've tweaked syslog-ng to write to the command pipe. That worked like a charm. On Fri, Dec 11, 2009 at 10:05 AM, wrote: > Greetings, > > does anybody know of a NSCA replacement, which does the job without the > need > of unsupported libraries? The requirement of libmcrypt makes NSCA a real > hassle > to use for us, since we have to manually install third party libraries - > which we really > do not want to. RedHat won't support libmcrypt, as its development is long > dead. > > Any alternatives out there, my google searches didn't come up with anything > sadly. > > Or might it be easier to patch NSCA to use libgcrypt for example, which is > available > in every distribution? > > Regards > Sascha > > > GFKL Financial Services AG > Vorstand: Dr. Peter J?nsch (Vors.), J?rgen Baltes, Dr. Tom Haverkamp > Vorsitzender des Aufsichtsrats: Dr. Georg F. Thoma > Sitz: Limbecker Platz 1, 45127 Essen, Amtsgericht Essen, HRB 13522 > > ------------------------------------------------------------------------------ > Return on Information: > Google Enterprise Search pays you back > Get the facts. > http://p.sf.net/sfu/google-dev2dev > > _______________________________________________ > Nagios-devel mailing list > Nagios-devel at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-devel > > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-devel mailing list Nagios-devel at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-devel From chris at mowisp.net Fri Dec 11 18:33:06 2009 From: chris at mowisp.net (Christopher Tyler) Date: Fri, 11 Dec 2009 11:33:06 -0600 Subject: Notifications not being sent Message-ID: <4B228252.60400@mowisp.net> First, thank you all for any help that you can provide. Here is the problem that I am having, I hope someone can help point out something obvious that I have missed. 1) Hosts are being monitored just fine, status map shows red when a host goes down, event log shows alerts like it should. 2) Timeframes are set for 24/7 3) All host and service notifications are enabled 4) email address is correct 5) All hosts services are, at a minimum, set for w,c,u,r 6) If I go to a service or host and click on the link "Send custom host notification" it will send the notification without "forcing" it so I know that the timeframes are being read correctly and this also verifies the email address is correct and the mail send is working. 7) The event log doesn't show that it's even trying to send a notification when something alerts. Running Nagios Version 3.2.0 And here is a snippet from my configuration files, all hosts/services are set up identically: define contact { contact_name nagiosadmin alias Nagios Admin host_notification_options d,u,r,f,n service_notification_options w,u,c,r,f,n email support at mowisp.com pager chris at mowisp.net host_notification_period 24x7 service_notification_period 24x7 host_notification_commands notify-host-by-email service_notification_commands notify-service-by-email } define contactgroup { contactgroup_name admins alias Nagios Administrators members nagiosadmin } define host { name Default_timeperiod_interval_5 register 0 max_check_attempts 2 check_interval 5 retry_interval 1 notification_interval 60 notification_options d,u,r,f,s,n active_checks_enabled 1 passive_checks_enabled 0 notifications_enabled 1 check_freshness 0 } define service { name Default_timeperiod_interval_5 register 0 max_check_attempts 2 check_interval 5 retry_interval 1 notification_interval 60 notification_options w,u,c,r,f,s,n active_checks_enabled 1 passive_checks_enabled 0 notifications_enabled 1 check_freshness 0 } define timeperiod { timeperiod_name 24x7 alias 24 Hours A Day, 7 Days A Week sunday 00:00-24:00 monday 00:00-24:00 tuesday 00:00-24:00 wednesday 00:00-24:00 thursday 00:00-24:00 friday 00:00-24:00 saturday 00:00-24:00 } define host { host_name Miller Silo 900 Sector 1 address 10.90.16.1 check_command check-host-alive notification_interval 60 notification_options d,u,r,f,n max_check_attempts 3 active_checks_enabled 1 passive_checks_enabled 0 notifications_enabled 1 check_period 24x7 notification_period 24x7 contact_groups admins parents Miller Silo CMM use Default_timeperiod_interval_5 } define service { service_description check_ping check_command check_ping!20,80%!30,90% host_name Miller Silo 900 Sector 1 check_period 24x7 notification_period 24x7 contact_groups admins notification_interval 60 notification_options w,u,c,r,f,n max_check_attempts 3 check_interval 5 retry_interval 1 active_checks_enabled 1 passive_checks_enabled 0 notifications_enabled 1 check_freshness 0 freshness_threshold 86400 } ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Fri Dec 11 18:52:21 2009 From: patrick.morris at hp.com (Morris, Patrick) Date: Fri, 11 Dec 2009 09:52:21 -0800 Subject: Notifications not being sent In-Reply-To: <4B228252.60400@mowisp.net> References: <4B228252.60400@mowisp.net> Message-ID: <4B2286D5.3030102@hp.com> Christopher Tyler wrote: > First, thank you all for any help that you can provide. Here is the > problem that I am having, I hope someone can help point out something > obvious that I have missed. > > 1) Hosts are being monitored just fine, status map shows red when a host > goes down, event log shows alerts like it should. > 2) Timeframes are set for 24/7 > 3) All host and service notifications are enabled > 4) email address is correct > 5) All hosts services are, at a minimum, set for w,c,u,r > 6) If I go to a service or host and click on the link "Send custom host > notification" it will send the notification without "forcing" it so I > know that the timeframes are being read correctly and this also verifies > the email address is correct and the mail send is working. > 7) The event log doesn't show that it's even trying to send a > notification when something alerts. > Take the "N" (do not notify) option out of your contact configs. ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mattias.ryrlen at op5.com Fri Dec 11 19:15:12 2009 From: mattias.ryrlen at op5.com (=?ISO-8859-1?Q?Mattias_Ryrl=E9n?=) Date: Fri, 11 Dec 2009 18:15:12 +0000 Subject: [Nagios-devel] Alternatives to NSCA? In-Reply-To: <2dfcbd1b0912110926g10b23dadyec383042674e48d@mail.gmail.com> References: <2dfcbd1b0912110926g10b23dadyec383042674e48d@mail.gmail.com> Message-ID: <8357ce520912111015x5a663ebdtf7b07d5712cfbc4b@mail.gmail.com> i did that with syslog to, and with netcat (but then you don't get encryption though) i think i still have my scripts (they are prob searchable in the list also) if someone is interested On Fri, Dec 11, 2009 at 5:26 PM, Marcel wrote: > Once upon a time, I've tweaked syslog-ng to write to the command pipe. That > worked like a charm. > > On Fri, Dec 11, 2009 at 10:05 AM, wrote: > >> Greetings, >> >> does anybody know of a NSCA replacement, which does the job without the >> need >> of unsupported libraries? The requirement of libmcrypt makes NSCA a real >> hassle >> to use for us, since we have to manually install third party libraries - >> which we really >> do not want to. RedHat won't support libmcrypt, as its development is long >> dead. >> >> Any alternatives out there, my google searches didn't come up with >> anything sadly. >> >> Or might it be easier to patch NSCA to use libgcrypt for example, which is >> available >> in every distribution? >> >> Regards >> Sascha >> >> >> GFKL Financial Services AG >> Vorstand: Dr. Peter J?nsch (Vors.), J?rgen Baltes, Dr. Tom Haverkamp >> Vorsitzender des Aufsichtsrats: Dr. Georg F. Thoma >> Sitz: Limbecker Platz 1, 45127 Essen, Amtsgericht Essen, HRB 13522 >> >> ------------------------------------------------------------------------------ >> Return on Information: >> Google Enterprise Search pays you back >> Get the facts. >> http://p.sf.net/sfu/google-dev2dev >> >> _______________________________________________ >> Nagios-devel mailing list >> Nagios-devel at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-devel >> >> > > > ------------------------------------------------------------------------------ > Return on Information: > Google Enterprise Search pays you back > Get the facts. > http://p.sf.net/sfu/google-dev2dev > > _______________________________________________ > Nagios-devel mailing list > Nagios-devel at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-devel > > -- V?nliga h?lsningar / Best Regards Mattias Ryrl?n __________________________ op5 AB F?rsta L?nggatan 19 SE-413 27 G?teborg Mobil: +46 735-17 70 99 Support: +46 31-774 09 24 www.op5.com -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From vikramms123 at gmail.com Sat Dec 12 07:33:55 2009 From: vikramms123 at gmail.com (VIKRAM MS) Date: Sat, 12 Dec 2009 12:03:55 +0530 Subject: Monitor Cisco Switches Message-ID: <2e22dff70912112233w7ecefc40s3b66f54bf0cc72d0@mail.gmail.com> Hello, I am trying to monitor Cisco switches and routers. I am able to ping and check the uptime of the switch. 1) But for the port 1 link status, I am getting "SNMP CRITICAL - *down(2)*" error. 2) For port 1 bandwidth usage, the error is "check_mrtgtraf: Unable to open MRTG log file" I have installed MRTG, but I am not able to go any further. Also for the http://localhost/mrtg/10.111.100.102_43.html check, I am getting the following error *404 Not found The requested URL /mrtg/10.111.100.102_43.html was not found on this server. * I have checked for assistance in http://www.mail-archive.com/nagios-users at lists.sourceforge.net/ but in vain. I am using Nagios Core version 3.2.0. Thanks in advance Regards Vikram -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rperezm at uci.cu Sat Dec 12 15:43:54 2009 From: rperezm at uci.cu (ReynierPM) Date: Sat, 12 Dec 2009 09:43:54 -0500 Subject: Monitoring Telesync Switches is possible? Message-ID: <4B23AC2A.6070009@uci.cu> Hi every: As the subject said: it's possible to monitoring a Allied Telesync switches? I have almost 3 of them in my infraestructure and need to be monitored but don't know how to do this. Any help? -- Cheers ReynierPM ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rperezm at uci.cu Sat Dec 12 16:10:21 2009 From: rperezm at uci.cu (ReynierPM) Date: Sat, 12 Dec 2009 10:10:21 -0500 Subject: Problems checking external host In-Reply-To: <24B6509E4191AF44B60A24EAA3B4AD492D4556@nuexchg.norwich.edu> References: <4AFA3E25.3030003@uci.cu> <4AFAF49C.1070701@hp.com><4AFC721D.8050108@uci.cu> <4AFCE407.1020503@hp.com> <4B01656A.3020300@uci.cu> <24B6509E4191AF44B60A24EAA3B4AD492D453F@nuexchg.norwich.edu> <4B0169D1.7030201@uci.cu> <24B6509E4191AF44B60A24EAA3B4AD492D4556@nuexchg.norwich.edu> Message-ID: <4B23B25D.8010402@uci.cu> James Pratt wrote: > > http://www.zdnetasia.com/techguide/opensource/0,39044899,62052006,00.htm > some > create > > No problem - As root, just run: > > su - nagios -c "ssh nagios at remotebox.com" (replace your target hostname > here, or use IP) > > let it connect and accept the key, then do the same on the target box in > reverse, so that both sides have the key in ~/.ssh/authorized_keys > files. > Hi: It's me again trying to configure check_by_ssh but without success. See, I follow this tutorial[1] but when I've done can't login to remote server from Nagios server. Let me explain a bit what I do: 1) Login into my Nagios server as "root" not as "nagios" user 2) Run this commands: ssh-keygen -t rsa1 (for SSH1, I think I don't need this but just run for precaution) ssh-keygen -t dsa ssh-keygen -t rsa 3) Copy the generated files to the remote server scp ~/.ssh/*.pub nagios at 10.128.50.11:/home/nagios/ 4) Run this others commands: cat identity.pub >>~/.ssh/authorized_keys cat id_dsa.pub >>~/.ssh/authorized_keys cat id_rsa.pub >>~/.ssh/authorized_keys rm identity.pub id_dsa.pub id_rsa.pub Now when I try to login from Nagios server to the remote server (10.128.50.11) I always need to enter the password. I try as "root" and also as "nagios" (meaning ssh root at 10.128.50.11, ssh nagios at 10.128.50.11). Why? The curiosity came to me and I check the file authorized_keys at remote host and have this: 2048 35 31537320408745229838365562405624946802370792096499059223774165383570113281161048240756249546198805679184056103143919830145818642104082292170996730416929422264174662938941716685989426016074582046007764918772604041829437044357969148541210017569485061724990330392006573284601283454700329897647888326315719461278230886781115132496222294195579706117375955677922834002228681170251111807857141282704805088831501704787050993949809146632808041890108774648791697895838722205506992426654008098461046497741222563633988038536169891094257004960432390755965669333326650500537312297715834727417885056386391177047203249702515327707761 root at monitoring ssh-dss AAAAB3NzaC1kc3MAAACBAPCKZyo6kPGMyGuWMhF6I/HcbY/2h0C2mIp0eMsnwi5nh1nT93VcJZL+hZd6etsDMzXSfN9EbQKlvXUKr3O05Ce8WBbesP7sYngR8ZfApZzUG+cnbia4XU9bf4KeA70UYSN9MWQWh0yvTfLJOX3X0ER0yQrNwVbiD3cwpyWMjGR7AAAAFQCQlSos9XFsf7o/sqYXE+E2NStJowAAAIEA8BmrviwMVaRT8Dg0L6h3ugViKlM+h2Ka4g1oO0mP+6wlZ1tf8+1p7bS2AZTLHsVdT8JdDt4kXr0h9A2+OHCyIZtIkwnJfgppjZri2wNsL6xBe/8YoNRAjuT28gsyYhm3Y1z7x4MTaii9KADotO/Pzc4QSj8RfNRdXKgMWBysEkMAAACATN+wyWnkYoHnskIkVofKuUckLE2VloyIrRl+ZJtV0mkC2PJ8/7nuT/qbqQGucI/60xqApjUH5BvXkUt7rm+aiGSL3s4ehRGfgsqp6BnzuzSJMCCWJQCPzXt0qTh/2l4wcxLqxtItKBxFHpPCh4ltV1jsxseCAoJiIH6GRHt5k1M= root at monitoring ssh-rsa AAAAB3NzaC1yc2EAAAABIwAAAQEAvIvPpR2k3br05Yel6LHdziEp6uLx53gsTiSPko9tCuj26dxwJUg1Pt1LrNKObApdA0QWoLVXUmZx/MFicCvqND9Mj93nCSwZ9fN8MRlea5DNDpJORE2NPjmV5IlxX9S3qLDhkp1bXrqLS556sipxXigDZlvCJ/nHa4ZCdFRek2pT7vNVNA8E/wxu38zCnCDLFmmq73r+Sf+8Ud/whBBWWAIrQgGcP1oQ1MTo+rMYJSudof4CWAS9IWV3TI1yLg9EJK0CpzHVIYReo0QZzgin8op70/mx09OQsDCxZD/Ht9D3NTFxiTByRgtU//SzCJyLZigyeJODdEDr3PiK7+f4Nw== root at monitoring As you can see all have "root at monitoring" at the end. What is the problem? Does this have anything to do? [1] http://hocuspokus.net/2008/01/ssh-shared-key-setup-ssh-logins-without-passwords/comment-page-1 -- Cheers and thanks in advance ReynierPM ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rperezm at uci.cu Sat Dec 12 17:00:45 2009 From: rperezm at uci.cu (ReynierPM) Date: Sat, 12 Dec 2009 11:00:45 -0500 Subject: Problems checking external host In-Reply-To: <1260632861.3303.192.camel@jimsworktop.packetalk.net> References: <4AFA3E25.3030003@uci.cu> <4AFAF49C.1070701@hp.com> <4AFC721D.8050108@uci.cu> <4AFCE407.1020503@hp.com> <4B01656A.3020300@uci.cu> <24B6509E4191AF44B60A24EAA3B4AD492D453F@nuexchg.norwich.edu> <4B0169D1.7030201@uci.cu> <24B6509E4191AF44B60A24EAA3B4AD492D4556@nuexchg.norwich.edu> <4B23B25D.8010402@uci.cu> <1260632861.3303.192.camel@jimsworktop.packetalk.net> Message-ID: <4B23BE2D.20906@uci.cu> Jim McNamara wrote: Thanks for your fast reply Jim and it's a great entry I understood all and now know why my config doesn't work. > Nagios runs as user nagios (usually), not as root, so that is the > beginning of the problem. It is also possible that the remote host > doesn't accept key based authentication, but the normal ssh daemon will > accept keys ahead of passwords. > > On the nagios box, give yourself a shell as user nagios. depending on > your permissions, you may need to specify /bin/bash or /bin/sh for your > shell. Then you can generate the key with the ssh-keygen command. That > needs to be done as user nagios. You also don't need to create 3 keys. > That isn't the source of the problem, the limit on the number of keys is > likely in the thousands, but the "default" key on most linuxes is > ~/.ssh/id_rsa. So generate that without a password at a size that works > for you. Use man ssh-keygen if anything I'm saying about this is > unclear. I just have a little problem here. How can I became "nagios" if I login as "root"? I mean after login as "root" I get a "root" shell, how I can became "nagios" from this point? -- Cheers ReynierPM ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jim at packetalk.net Sat Dec 12 16:47:41 2009 From: jim at packetalk.net (Jim McNamara) Date: Sat, 12 Dec 2009 10:47:41 -0500 Subject: Problems checking external host In-Reply-To: <4B23B25D.8010402@uci.cu> References: <4AFA3E25.3030003@uci.cu> <4AFAF49C.1070701@hp.com> <4AFC721D.8050108@uci.cu> <4AFCE407.1020503@hp.com> <4B01656A.3020300@uci.cu> <24B6509E4191AF44B60A24EAA3B4AD492D453F@nuexchg.norwich.edu> <4B0169D1.7030201@uci.cu> <24B6509E4191AF44B60A24EAA3B4AD492D4556@nuexchg.norwich.edu> <4B23B25D.8010402@uci.cu> Message-ID: <1260632861.3303.192.camel@jimsworktop.packetalk.net> On Sat, 2009-12-12 at 10:10 -0500, ReynierPM wrote: > James Pratt wrote: > > > > http://www.zdnetasia.com/techguide/opensource/0,39044899,62052006,00.htm > > some > > create > > > > No problem - As root, just run: > > > > su - nagios -c "ssh nagios at remotebox.com" (replace your target hostname > > here, or use IP) > > > > let it connect and accept the key, then do the same on the target box in > > reverse, so that both sides have the key in ~/.ssh/authorized_keys > > files. > > > > Hi: > It's me again trying to configure check_by_ssh but without success. See, > I follow this tutorial[1] but when I've done can't login to remote > server from Nagios server. Let me explain a bit what I do: > > 1) Login into my Nagios server as "root" not as "nagios" user > 2) Run this commands: > ssh-keygen -t rsa1 (for SSH1, I think I don't need this but just run > for precaution) > ssh-keygen -t dsa > ssh-keygen -t rsa > 3) Copy the generated files to the remote server > scp ~/.ssh/*.pub nagios at 10.128.50.11:/home/nagios/ > 4) Run this others commands: > cat identity.pub >>~/.ssh/authorized_keys > cat id_dsa.pub >>~/.ssh/authorized_keys > cat id_rsa.pub >>~/.ssh/authorized_keys > rm identity.pub id_dsa.pub id_rsa.pub > > Now when I try to login from Nagios server to the remote server > (10.128.50.11) I always need to enter the password. I try as "root" and > also as "nagios" (meaning ssh root at 10.128.50.11, ssh > nagios at 10.128.50.11). Why? > > The curiosity came to me and I check the file authorized_keys at remote > host and have this: > > 2048 35 > 31537320408745229838365562405624946802370792096499059223774165383570113281161048240756249546198805679184056103143919830145818642104082292170996730416929422264174662938941716685989426016074582046007764918772604041829437044357969148541210017569485061724990330392006573284601283454700329897647888326315719461278230886781115132496222294195579706117375955677922834002228681170251111807857141282704805088831501704787050993949809146632808041890108774648791697895838722205506992426654008098461046497741222563633988038536169891094257004960432390755965669333326650500537312297715834727417885056386391177047203249702515327707761 > root at monitoring > ssh-dss > AAAAB3NzaC1kc3MAAACBAPCKZyo6kPGMyGuWMhF6I/HcbY/2h0C2mIp0eMsnwi5nh1nT93VcJZL+hZd6etsDMzXSfN9EbQKlvXUKr3O05Ce8WBbesP7sYngR8ZfApZzUG+cnbia4XU9bf4KeA70UYSN9MWQWh0yvTfLJOX3X0ER0yQrNwVbiD3cwpyWMjGR7AAAAFQCQlSos9XFsf7o/sqYXE+E2NStJowAAAIEA8BmrviwMVaRT8Dg0L6h3ugViKlM+h2Ka4g1oO0mP+6wlZ1tf8+1p7bS2AZTLHsVdT8JdDt4kXr0h9A2+OHCyIZtIkwnJfgppjZri2wNsL6xBe/8YoNRAjuT28gsyYhm3Y1z7x4MTaii9KADotO/Pzc4QSj8RfNRdXKgMWBysEkMAAACATN+wyWnkYoHnskIkVofKuUckLE2VloyIrRl+ZJtV0mkC2PJ8/7nuT/qbqQGucI/60xqApjUH5BvXkUt7rm+aiGSL3s4ehRGfgsqp6BnzuzSJMCCWJQCPzXt0qTh/2l4wcxLqxtItKBxFHpPCh4ltV1jsxseCAoJiIH6GRHt5k1M= > root at monitoring > ssh-rsa > AAAAB3NzaC1yc2EAAAABIwAAAQEAvIvPpR2k3br05Yel6LHdziEp6uLx53gsTiSPko9tCuj26dxwJUg1Pt1LrNKObApdA0QWoLVXUmZx/MFicCvqND9Mj93nCSwZ9fN8MRlea5DNDpJORE2NPjmV5IlxX9S3qLDhkp1bXrqLS556sipxXigDZlvCJ/nHa4ZCdFRek2pT7vNVNA8E/wxu38zCnCDLFmmq73r+Sf+8Ud/whBBWWAIrQgGcP1oQ1MTo+rMYJSudof4CWAS9IWV3TI1yLg9EJK0CpzHVIYReo0QZzgin8op70/mx09OQsDCxZD/Ht9D3NTFxiTByRgtU//SzCJyLZigyeJODdEDr3PiK7+f4Nw== > root at monitoring > > As you can see all have "root at monitoring" at the end. What is the > problem? Does this have anything to do? > > [1] > http://hocuspokus.net/2008/01/ssh-shared-key-setup-ssh-logins-without-passwords/comment-page-1 Nagios runs as user nagios (usually), not as root, so that is the beginning of the problem. It is also possible that the remote host doesn't accept key based authentication, but the normal ssh daemon will accept keys ahead of passwords. On the nagios box, give yourself a shell as user nagios. depending on your permissions, you may need to specify /bin/bash or /bin/sh for your shell. Then you can generate the key with the ssh-keygen command. That needs to be done as user nagios. You also don't need to create 3 keys. That isn't the source of the problem, the limit on the number of keys is likely in the thousands, but the "default" key on most linuxes is ~/.ssh/id_rsa. So generate that without a password at a size that works for you. Use man ssh-keygen if anything I'm saying about this is unclear. Do copy the id_rsa.pub (or id_dsa.pub, or whatever the public part is) to the remote box, and dump it into the nagios ~/.ssh/authorized_keys file as you did before. You can erase the previous entries you made, unless you want root on the the nagios box to be able to ssh into the remote box as user nagios. Back on the nagios monitoring box, again become user nagios with a shell, and do: ssh -i ~/.ssh/id_rsa nagios at 10.128.50.11 It will ask you to accept the identity of the remote host, once you've done that, you should have shell access as user nagios on the remote box. After you've accepted the key, the nagios daemon can now make that connection whenever it needs to. Here is the generic check_by_ssh config that I use, notice that the key is specifically being called to designate the identity file. # 'check_ssh_disk' command definition define command{ command_name check_ssh_disk command_line $USER1$/check_by_ssh -H $HOSTADDRESS$ \ -i /usr/local/nagios/.ssh/id_rsa \ -C "$USER1$/check_disk -w $ARG1$ -c $ARG2$ -p $ARG3$" } -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From shacky83 at gmail.com Sun Dec 13 00:56:28 2009 From: shacky83 at gmail.com (shacky) Date: Sun, 13 Dec 2009 00:56:28 +0100 Subject: Interface for control room Message-ID: <7fedbc910912121556k317d3d59k30127b4d2d030aaa@mail.gmail.com> Hi. Could you tell me what interface (web-based is better) to view the services status on a wall-screen in a control room? Thank you very much! Bye. ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From juki.emma at gmail.com Sun Dec 13 13:35:35 2009 From: juki.emma at gmail.com (Juki) Date: Sun, 13 Dec 2009 15:35:35 +0300 Subject: Problems checking external host In-Reply-To: <4B23BE2D.20906@uci.cu> References: <4AFA3E25.3030003@uci.cu> <4AFC721D.8050108@uci.cu> <4AFCE407.1020503@hp.com> <4B01656A.3020300@uci.cu> <24B6509E4191AF44B60A24EAA3B4AD492D453F@nuexchg.norwich.edu> <4B0169D1.7030201@uci.cu> <24B6509E4191AF44B60A24EAA3B4AD492D4556@nuexchg.norwich.edu> <4B23B25D.8010402@uci.cu> <1260632861.3303.192.camel@jimsworktop.packetalk.net> <4B23BE2D.20906@uci.cu> Message-ID: <7545d7d20912130435s697f1c9bped014021401e8f00@mail.gmail.com> 2009/12/12 ReynierPM rperezm at uci.cu > I just have a little problem here. How can I became "nagios" if I login > as "root"? I mean after login as "root" I get a "root" shell, how I can > became "nagios" from this point? > Do; # su - nagios -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From kasper at nordal-lund.dk Sun Dec 13 14:15:22 2009 From: kasper at nordal-lund.dk (Kasper Nordal Lund) Date: Sun, 13 Dec 2009 14:15:22 +0100 Subject: Interface for control room In-Reply-To: <7fedbc910912121556k317d3d59k30127b4d2d030aaa@mail.gmail.com> References: <7fedbc910912121556k317d3d59k30127b4d2d030aaa@mail.gmail.com> Message-ID: <4B24E8EA.90609@nordal-lund.dk> shacky wrote: > Hi. > > Could you tell me what interface (web-based is better) to view the > services status on a wall-screen in a control room? > I do it with a little HTML script like this: Nagios Monitor Site Then i install the autoauth and the default zoom level addons in firefox. Then open the above script in firefox, make it remember the credentials, zoom in to about 220%, press F11 and there it is :) I then usually save the site as the default start page. This has worked for me for several years, i think it gives a really good overview. /Kasper > Thank you very much! > Bye. > > ------------------------------------------------------------------------------ > Return on Information: > Google Enterprise Search pays you back > Get the facts. > http://p.sf.net/sfu/google-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From pangrazi at gmail.com Sun Dec 13 15:14:50 2009 From: pangrazi at gmail.com (Greg Pangrazio) Date: Sun, 13 Dec 2009 08:14:50 -0600 Subject: Monitor Cisco Switches In-Reply-To: <2e22dff70912112233w7ecefc40s3b66f54bf0cc72d0@mail.gmail.com> References: <2e22dff70912112233w7ecefc40s3b66f54bf0cc72d0@mail.gmail.com> Message-ID: For the first error, do you have something plugged into port 1 on the switch and is the port in the Up/UP state? For the check_mrtg plugin you must have mrtg installed and graphing utilization before you can configure the alterts via nagios. You should check out the MRTG site for more information on that. Greg Pangrazio pangrazi at gmail.com On Sat, Dec 12, 2009 at 12:33 AM, VIKRAM MS wrote: > Hello, > > ???????????? I am trying to monitor Cisco switches and routers. I am able to > ping and check the uptime of the switch. > 1) But for the port 1 link status, I am getting "SNMP CRITICAL - *down(2)*" > error. > > 2) For port 1 bandwidth usage, the error is "check_mrtgtraf: Unable to open > MRTG log file" > ??????????? I have installed MRTG, but I am not able to go any further. Also > for the http://localhost/mrtg/10.111.100.102_43.html check, I am getting the > following error > 404 Not found > The requested URL /mrtg/10.111.100.102_43.html was not found on this server. > > I have checked for assistance in > http://www.mail-archive.com/nagios-users at lists.sourceforge.net/ but in vain. > I am using Nagios Core version 3.2.0. > Thanks in advance > > Regards > Vikram > > > ------------------------------------------------------------------------------ > Return on Information: > Google Enterprise Search pays you back > Get the facts. > http://p.sf.net/sfu/google-dev2dev > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rlemerlus at merethis.com Sun Dec 13 13:42:11 2009 From: rlemerlus at merethis.com (Romain Le Merlus) Date: Sun, 13 Dec 2009 13:42:11 +0100 Subject: Interface for control room In-Reply-To: <7fedbc910912121556k317d3d59k30127b4d2d030aaa@mail.gmail.com> References: <7fedbc910912121556k317d3d59k30127b4d2d030aaa@mail.gmail.com> Message-ID: <8d9ba4010912130442k4ed30057q4dbea52bb722a72b@mail.gmail.com> Hi, Centreon is a web based interface for Nagios. http://www.centreon.com/ Here is some screenshots and a demo platform, you may find what you are looking for: http://www.centreon.com/centreon-screenshots-en.html http://demo.centreon.com/ You can also see on this demo the Centreon MAP extension, it is an advanced cartography module. Can be very useful in a control room ;) Best regards. -- Romain LE MERLUS -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From doepain at gmail.com Mon Dec 14 03:20:55 2009 From: doepain at gmail.com (dOE) Date: Sun, 13 Dec 2009 21:20:55 -0500 Subject: Nagios Plug-Ins Message-ID: I have inherited a Nagios 3.0.3 server, and have been trying to get acquainted with it. How could I tell which plugin version I have, and is it worth upgrading the plugins and Nagios to the latest? -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mitsuto at gmail.com Mon Dec 14 03:28:28 2009 From: mitsuto at gmail.com (Marcel) Date: Mon, 14 Dec 2009 00:28:28 -0200 Subject: Nagios Plug-Ins In-Reply-To: References: Message-ID: <2dfcbd1b0912131828l159cdf80o6ae941ba41c27d4a@mail.gmail.com> # /opt/nagios/libexec/check_http --version check_http v2053 (nagios-plugins 1.4.13) # On Mon, Dec 14, 2009 at 12:20 AM, dOE wrote: > I have inherited a Nagios 3.0.3 server, and have been trying to get > acquainted with it. How could I tell which plugin version I have, and is it > worth upgrading the plugins and Nagios to the latest? > > > ------------------------------------------------------------------------------ > Return on Information: > Google Enterprise Search pays you back > Get the facts. > http://p.sf.net/sfu/google-dev2dev > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From dermoth at aei.ca Mon Dec 14 05:22:48 2009 From: dermoth at aei.ca (Thomas Guyot-Sionnest) Date: Sun, 13 Dec 2009 23:22:48 -0500 Subject: Nagios2 process overwhelmed by NSCA daemon? In-Reply-To: <04F3233F47E2714CB7431AE913E57E7703AB6DA5@IAD-WPRD-XCHB02.corp.verio.net> References: <04F3233F47E2714CB7431AE913E57E7703AB6DA5@IAD-WPRD-XCHB02.corp.verio.net> Message-ID: <4B25BD98.50204@aei.ca> On 09/12/09 06:06 PM, Jonathan Call wrote: > I recently added two new slaves to a distributed Nagios system. The > central server now passively processes 17,000+ service checks on 3000+ > servers. > > It's been over an hour and a half since I brought those new slaves > online and I have about 150 hosts still stuck in 'Pending' and about > 1300 services in the same state. In addition to that it seems that the > service check results from the other slaves that were working normally > are now arbitrarily disappearing. For example, on one host three of the > service checks have been updated relatively recently (i.e. 5-30 minutes > ago) but three other service checks haven't been updated for almost an > hour. The slaves all appear operational and the hosts are being checked > on time. Is it possible I've overwhelmed Nagios' ability to process data > from the NSCA daemon or struck some internal Nagios bottleneck? Any > suggestions would be appreciated. Hummmm Very interesting. Which Nagios version are you using? This sounds a lot like a problem I encountered a few years ago with passive checks. I had about 50-60 servers returning cron-scheduled check results to the Nagios server. 120 results ain't that much, but is seemed that with all the servers fully time-synced (using NTP) out of these ~120 results I was often missing some of them, which would eventually cause false-alarm due to stale services. I could easily reproduce the problem by feeding lots of results to Nagios right when I was expecting a batch of passive results - this would cause random results to be dropped. I spent some time trying to debug this but I couldn't figure our where commands were dropped. My primary target was the ring buffer used by the command reaper. As far as I can remember I tested with version of Nagios ranging from 2.3 to 2.5; I never tried with recent version If you're running a recent version of nagios what do you get for "Used/High/Total Command Buffers" in the "nagiostats" command output? (you can also get these numbers from the web interface, "Performance Info" in the left bar.). If it seems to be maxed out, you may try setting "command_check_interval" to "-1" and raising the "external_command_buffer_slots" option in nagios.cfg. If you're still having this problem with Nagios v3 and up I might try to reproduce this as well, and maybe I'll be able to figure out what's wrong this time. -- Thomas ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From dermoth at aei.ca Mon Dec 14 05:37:21 2009 From: dermoth at aei.ca (Thomas Guyot-Sionnest) Date: Sun, 13 Dec 2009 23:37:21 -0500 Subject: Nagios as a Service Resiliency Manager In-Reply-To: References: Message-ID: <4B25C101.1080809@aei.ca> On 10/12/09 12:08 PM, Christopher McAtackney wrote: > Hi all, > > I have a need to control an Active / Passive pair of components and > was wondering if anyone had tackled this problem with Nagios? > > The scenario is as follows; > > Host A has SERVICE_1 installed and running. Host B has SERVICE_2 > installed, but not running. > > The desired functionality is to detect when SERVICE_1 is not running > (or that Host A is down / unreachable), and then to start SERVICE_2 on > Host B. > > I believe I can do this with Nagios by defining an event handler on > SERVICE_1 which will make the appropriate call to start SERVICE_2 on > Host B > > Would it make sense to store the relationship between SERVICE_1 and > Host B / SERVICE_2 as a service macro, e.g. > $_SERVICE_PASSIVE_HOSTNAME, $_SERVICE_PASSIVE_SERVICENAME? > > There are too many scenarios in which the SERVICE_1 might come back up > to try automate the switching off of SERVICE_2 I believe, e.g. if > someone pulled a network cable on Host A accidently, then plugged it > in 15 minutes later - during which time Nagios detects that it is down > and so starts up SERVICE_2. The user then plugs the network lead back > in and now we have two Active instances running - which is what we > specifically wanted to avoid. Even if Nagios detects that the primary > component is up, it's still too late because any Active / Active > overlap will cause problems for this particular application. > > I can't think of any way to automate that side of things - but does > the general concept of having Nagios start up a Passive partner make > sense? Short answer: not really. You're talking about clustering here, and clustering has its very own set of problems than Nagios was never meant to solve. You should rather spend your time looking at a real clustering solution like Linux-HA (I used this one but I know there's other OSS clustering software around...). Once you have your cluster set up then it makes sense to monitor the services *and* the cluster software using Nagios. For failover services I find the easiest way is you use a shared IP (IP that moves from one server to the other along with the services - this is very easy to add once the cluster is set-up) so you always look for the service where it's supposed to be running. If a shared IP isn't an option just monitor the service on both servers and use check_cluster to detect across all nodes. I'm not saying that you can't achieve this using Nagios... It might actually work for very simplistic scenarios but even in that case you may end up accidentally running the service on both servers if you're not very careful (something that a clustering software sill not let happen). You have to take into account not only every possible failure scenarios but also every possible thing a human could be doing at the same time your handlers try to recover the service! If kind of like reinventing the wheel, but not even using the right tools :) -- Thomas ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rperezm at uci.cu Mon Dec 14 15:13:08 2009 From: rperezm at uci.cu (ReynierPM) Date: Mon, 14 Dec 2009 09:13:08 -0500 Subject: The famous message: "NRPE: Unable to read output " Message-ID: <4B2647F4.8090501@uci.cu> Hi every: Yesterday I configure Nagios to monitoring a new installed server. After install, configure and running xinetd, nrpe and nagios-plugins on the remote server and startup Nagios on main server I get the error: "NRPE: Unable to read output", can any tell me the possible causes for this error? I run this: root at monitoring:~# /usr/local/nagios/libexec/check_nrpe -H 10.128.50.5 NRPE v2.12 And as you can see all seem to be OK on both sides. -- Cheers and thanks in advance ReynierPM ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From schneemann at b1-systems.de Mon Dec 14 15:34:51 2009 From: schneemann at b1-systems.de (Christian Schneemann) Date: Mon, 14 Dec 2009 15:34:51 +0100 Subject: The famous message: "NRPE: Unable to read output " In-Reply-To: <4B2647F4.8090501@uci.cu> References: <4B2647F4.8090501@uci.cu> Message-ID: <200912141534.51577.schneemann@b1-systems.de> On Monday 14 December 2009 15:13:08 ReynierPM wrote: > Hi every: > Yesterday I configure Nagios to monitoring a new installed server. After > install, configure and running xinetd, nrpe and nagios-plugins on the > remote server and startup Nagios on main server I get the error: "NRPE: > Unable to read output", can any tell me the possible causes for this error? > > I run this: > root at monitoring:~# /usr/local/nagios/libexec/check_nrpe -H 10.128.50.5 > NRPE v2.12 > > And as you can see all seem to be OK on both sides. > Have a look in the logs, maybe nrpe on the clientside cannot execute the Plugin you want to call through nrpe. Greetings, Christian -- Christian Schneemann Geschaeftsfuehrer: Ralph Dehner Technical Tester & Writer Unternehmenssitz: Vohburg B1 Systems GmbH Amtsgericht: Ingolstadt Mobil: +49-(0)-1757250665 Handelsregister: HRB 3537 EMail: schneemann at b1-systems.de http://www.b1-systems.de Adresse: B1 Systems GmbH, Osterfeldstra?e 7, 85088 Vohburg GPG: http://pgpkeys.pca.dfn.de/pks/lookup?op=get&search=0x2FA8643A41BDAB81 ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rperezm at uci.cu Mon Dec 14 15:38:49 2009 From: rperezm at uci.cu (ReynierPM) Date: Mon, 14 Dec 2009 09:38:49 -0500 Subject: The famous message: "NRPE: Unable to read output " In-Reply-To: <200912141534.51577.schneemann@b1-systems.de> References: <4B2647F4.8090501@uci.cu> <200912141534.51577.schneemann@b1-systems.de> Message-ID: <4B264DF9.5070509@uci.cu> Christian Schneemann wrote: > On Monday 14 December 2009 15:13:08 ReynierPM wrote: > > Have a look in the logs, maybe nrpe on the clientside cannot execute the > Plugin you want to call through nrpe. > > Greetings, > Christian > I found the mistake (was mine): I compile nagios-plugins as "root" and forget to change permissions on /usr/local/nagios/libexec directory and also files. After make: chown nagios.nagios /usr/local/nagios/libexec chown -R nagios.nagios /usr/local/nagios/libexec All works fine -- Cheers ReynierPM ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From shadhin71 at gmail.com Mon Dec 14 15:54:24 2009 From: shadhin71 at gmail.com (shadih rahman) Date: Mon, 14 Dec 2009 09:54:24 -0500 Subject: benchmark question Message-ID: <6db4a4200912140654geae25bs4e408232dee92373@mail.gmail.com> List, Is there any information as to what is the maximum number of services one can run on a single box with single instance of Nagios? I did not find any concrete data on this. I am running a single instance of nagios on a quad core 2.5 GHZ machine with with 4 Gigs of RAM. I have total of 7359 service check running on this box. I have also ndoutils running on the same box as backend. My total service checks is going to five fold very soon with a lot of nrpe checks. Now, my question is should I run multiple instance of nagios on the same box or a single instance will be able to handle about 30000 service checks? I already tested a dev environment with multiple instance of nagios with some hacking to init.d script, it started and monitored fine. What would be the preferred method of running nagios, single instance on a box or multiple instance on a box when we are dealing with high number of service checks? Please comment on this. Thanks -- Cordially, Shadhin Rahman -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From doepain at gmail.com Mon Dec 14 16:38:26 2009 From: doepain at gmail.com (dOE) Date: Mon, 14 Dec 2009 10:38:26 -0500 Subject: Nagios Plug-Ins In-Reply-To: <2dfcbd1b0912131828l159cdf80o6ae941ba41c27d4a@mail.gmail.com> References: <2dfcbd1b0912131828l159cdf80o6ae941ba41c27d4a@mail.gmail.com> Message-ID: Thank you! On Sun, Dec 13, 2009 at 9:28 PM, Marcel wrote: > # /opt/nagios/libexec/check_http --version > check_http v2053 (nagios-plugins 1.4.13) > # > > On Mon, Dec 14, 2009 at 12:20 AM, dOE wrote: > >> I have inherited a Nagios 3.0.3 server, and have been trying to get >> acquainted with it. How could I tell which plugin version I have, and is it >> worth upgrading the plugins and Nagios to the latest? >> >> >> ------------------------------------------------------------------------------ >> Return on Information: >> Google Enterprise Search pays you back >> Get the facts. >> http://p.sf.net/sfu/google-dev2dev >> >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when >> reporting any issue. >> ::: Messages without supporting info will risk being sent to /dev/null >> > > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jbroughton at truecos.com Mon Dec 14 17:06:11 2009 From: jbroughton at truecos.com (Jayson Broughton) Date: Mon, 14 Dec 2009 09:06:11 -0700 Subject: Monitoring Telesync Switches is possible? In-Reply-To: <4B23AC2A.6070009@uci.cu> References: <4B23AC2A.6070009@uci.cu> Message-ID: <1260806771.12432.77.camel@localhost.localdomain> ReynierPM, We have alot of 12-24port Allied Telesyn switch's, but all of them are un-managed switches. If the switch you have is 'managed' (you can log into an ip address, or get snmp information from the switch) then nagios can monitor it with snmp. But in my experience, I have been unable to monitor un-managed switch's. I guess a workaround would be to monitor the port of a managed switch that the allied telesyn is plugged into for status (up/down). Jayson On Sat, 2009-12-12 at 09:43 -0500, ReynierPM wrote: > Hi every: > As the subject said: it's possible to monitoring a Allied Telesync > switches? I have almost 3 of them in my infraestructure and need to be > monitored but don't know how to do this. Any help? The information in this electronic mail message and any attached files is confidential and may be legally privileged. If you are not the intended recipient, delete this message and contact the sender immediately. Access to this message by anyone other than its intended recipient is unauthorized. You must not use or disseminate this information as it is proprietary property of the True companies. Communications on or through the True companies' computer systems may be monitored or recorded to secure effective system operation and for other lawful purposes. Thank you. ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From dfulton at nuvox.com Mon Dec 14 17:28:08 2009 From: dfulton at nuvox.com (Fulton, David) Date: Mon, 14 Dec 2009 11:28:08 -0500 Subject: benchmark question In-Reply-To: <6db4a4200912140654geae25bs4e408232dee92373@mail.gmail.com> References: <6db4a4200912140654geae25bs4e408232dee92373@mail.gmail.com> Message-ID: Do you have another box? I don't know of much of, if any, of an advantage to running multiple instances. The SMP will usually not help there because the other cores will be occupied executing plug-ins. If you had another box you could go with a distributed environment instead. Perhaps using SVN to keep record of config changes. From: shadih rahman [mailto:shadhin71 at gmail.com] Sent: Monday, December 14, 2009 9:54 AM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] benchmark question List, Is there any information as to what is the maximum number of services one can run on a single box with single instance of Nagios? I did not find any concrete data on this. I am running a single instance of nagios on a quad core 2.5 GHZ machine with with 4 Gigs of RAM. I have total of 7359 service check running on this box. I have also ndoutils running on the same box as backend. My total service checks is going to five fold very soon with a lot of nrpe checks. Now, my question is should I run multiple instance of nagios on the same box or a single instance will be able to handle about 30000 service checks? I already tested a dev environment with multiple instance of nagios with some hacking to init.d script, it started and monitored fine. What would be the preferred method of running nagios, single instance on a box or multiple instance on a box when we are dealing with high number of service checks? Please comment on this. Thanks -- Cordially, Shadhin Rahman This email and any attachments ("Message") may contain legally privileged and/or confidential information. If you are not the addressee, or if this Message has been addressed to you in error, you are not authorized to read, copy, or distribute it, and we ask that you please delete it (including all copies) and notify the sender by return email. Delivery of this Message to any person other than the intended recipient(s) shall not be deemed a waiver of confidentiality and/or a privilege. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Mon Dec 14 17:29:00 2009 From: patrick.morris at hp.com (Morris, Patrick) Date: Mon, 14 Dec 2009 08:29:00 -0800 Subject: benchmark question In-Reply-To: <6db4a4200912140654geae25bs4e408232dee92373@mail.gmail.com> References: <6db4a4200912140654geae25bs4e408232dee92373@mail.gmail.com> Message-ID: <4B2667CC.8090107@hp.com> shadih rahman wrote: > List, > Is there any information as to what is the maximum number of > services one can run on a single box with single instance of Nagios? > I did not find any concrete data on this. > > I am running a single instance of nagios on a quad core 2.5 GHZ > machine with with 4 Gigs of RAM. > > I have total of 7359 service check running on this box. I have also > ndoutils running on the same box as backend. > > My total service checks is going to five fold very soon with a lot of > nrpe checks. > > Now, my question is should I run multiple instance of nagios on the > same box or a single instance will be able to handle about 30000 > service checks? In my experience, you'll run into issues right around 8,000 checks or so, depending no output verbosity, on a fairly stock setup. This is due to the size of the pipe used to temporarily store check results, at least on Linux (and you may be seeing it already with your number of checks). Around that point, even with quite a bit of tuning, the check results will fill the pipe in less than a second, which is the minimum amount of time I've been able to configure Nagios to flush it. When that happens, latencies go through the roof. Distributing the checks doesn't solve the problem if you're still sending the results to a centralized Nagios instance, since one machine still needs to process all of them. I'm working through this situation now, and it's looking like it may take a custom kernel with a larger pipe size to handle it. ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From dario.bestetti at opservices.com.br Mon Dec 14 17:53:04 2009 From: dario.bestetti at opservices.com.br (Dario B. Bestetti (OpServices)) Date: Mon, 14 Dec 2009 14:53:04 -0200 (BRST) Subject: benchmark question In-Reply-To: <18333628.44241260809361265.JavaMail.root@mail.opservices.com.br> References: <18333628.44241260809361265.JavaMail.root@mail.opservices.com.br> Message-ID: <31275029.44311260809584268.JavaMail.root@mail.opservices.com.br> ----- "Patrick Morris" escreveu: > shadih rahman wrote: > > List, > > Is there any information as to what is the maximum number of > > services one can run on a single box with single instance of Nagios? > > > I did not find any concrete data on this. > > > > I am running a single instance of nagios on a quad core 2.5 GHZ > > machine with with 4 Gigs of RAM. > > > > I have total of 7359 service check running on this box. I have also > > > ndoutils running on the same box as backend. > > > > My total service checks is going to five fold very soon with a lot > of > > nrpe checks. > > > > Now, my question is should I run multiple instance of nagios on the > > > same box or a single instance will be able to handle about 30000 > > service checks? > > In my experience, you'll run into issues right around 8,000 checks or > > so, depending no output verbosity, on a fairly stock setup. This is > due > to the size of the pipe used to temporarily store check results, at > least on Linux (and you may be seeing it already with your number of > checks). Around that point, even with quite a bit of tuning, the check > > results will fill the pipe in less than a second, which is the minimum > > amount of time I've been able to configure Nagios to flush it. When > that > happens, latencies go through the roof. > > Distributing the checks doesn't solve the problem if you're still > sending the results to a centralized Nagios instance, since one > machine > still needs to process all of them. > > I'm working through this situation now, and it's looking like it may > take a custom kernel with a larger pipe size to handle > it. > > ------------------------------------------------------------------------------ > Return on Information: > Google Enterprise Search pays you back > Get the facts. > http://p.sf.net/sfu/google-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null Shadih, we have a customer running over 20.000 service checks in a single box. The box is 2x Intel Dual-core Xeon 3Ghz with 8Gb RAM. The latency is around 0.7s. The checks are performed between 3min and 5min. They have around 40 simultaneous users. _________________________________________________ Dario B. Bestetti OpServices R. Luciana de Abreu, 471 - Sala 403 Porto Alegre, RS - CEP 90570-060 Fone 55(51)32753588 Mobile 55(51)81518218 Fax 55(51)32753588 Email dario.bestetti at opservices.com.br "In God we trust, the rest we monitor ..." _________________________________________________ ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gael.cheron at free.fr Mon Dec 14 18:06:10 2009 From: gael.cheron at free.fr (Gael Cheron) Date: Mon, 14 Dec 2009 18:06:10 +0100 Subject: Interface for control room In-Reply-To: <4B24E8EA.90609@nordal-lund.dk> References: <7fedbc910912121556k317d3d59k30127b4d2d030aaa@mail.gmail.com> <4B24E8EA.90609@nordal-lund.dk> Message-ID: <8480e60f0912140906o5cddf711g8f0425933e939d28@mail.gmail.com> I do this with NagVis. Quite simple and powerfull. You can create new maps and add your own iconset (big blinking animated gif!) http://www.nagvis.org/ Regards, Ga?l 2009/12/13 Kasper Nordal Lund > shacky wrote: > > Hi. > > > > Could you tell me what interface (web-based is better) to view the > > services status on a wall-screen in a control room? > > > I do it with a little HTML script like this: > > "http://www.w3.org/TR/xhtml1/xhtml1-strict.dtd"> > > > > Nagios Monitor Site > > > > > > > > Then i install the autoauth and the default zoom level addons in firefox. > > Then open the above script in firefox, make it remember the credentials, > zoom in to about 220%, press F11 and there it is :) > > I then usually save the site as the default start page. > > This has worked for me for several years, i think it gives a really good > overview. > > /Kasper > > > Thank you very much! > > Bye. > > > > > ------------------------------------------------------------------------------ > > Return on Information: > > Google Enterprise Search pays you back > > Get the facts. > > http://p.sf.net/sfu/google-dev2dev > > _______________________________________________ > > Nagios-users mailing list > > Nagios-users at lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > > ::: Messages without supporting info will risk being sent to /dev/null > > > > > > > > ------------------------------------------------------------------------------ > Return on Information: > Google Enterprise Search pays you back > Get the facts. > http://p.sf.net/sfu/google-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Edwin.Zoeller at ama-assn.org Mon Dec 14 17:43:11 2009 From: Edwin.Zoeller at ama-assn.org (Edwin Zoeller) Date: Mon, 14 Dec 2009 10:43:11 -0600 Subject: Question - Scheduled Downtime Message-ID: When a service or host is put in scheduled downtime does the monitor still execute against the service it's checking? OR does it stop? I am having a big debate here at work on this. I say, as what I understand, is that when it is scheduled for downtime, no monitor is sent out for that service. Please someone correct me if I am wrong and explain what really happens so I can end this dispute. Thanks, Ed P Please consider the environment before printing this e-mail -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From SLofland at slco.org Mon Dec 14 18:29:29 2009 From: SLofland at slco.org (Scott Lofland) Date: Mon, 14 Dec 2009 10:29:29 -0700 Subject: Question - Scheduled Downtime In-Reply-To: References: Message-ID: <30C3BD2973EB154C9487948AEB8D11340417C93D@slcmail02.slcounty.org> I've currently got a server in my console in scheduled downtime(been powered off for a couple days). Checks are still performed against the machine regularly and show it in a critical state in the console(CRITICAL - Socket timeout after 30 seconds ) but no alerts are sent out for it. -Scott From: Edwin Zoeller [mailto:Edwin.Zoeller at ama-assn.org] Sent: Monday, December 14, 2009 9:43 AM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] Question - Scheduled Downtime Importance: High When a service or host is put in scheduled downtime does the monitor still execute against the service it's checking? OR does it stop? I am having a big debate here at work on this. I say, as what I understand, is that when it is scheduled for downtime, no monitor is sent out for that service. Please someone correct me if I am wrong and explain what really happens so I can end this dispute. Thanks, Ed P Please consider the environment before printing this e-mail -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Mon Dec 14 19:01:52 2009 From: patrick.morris at hp.com (Morris, Patrick) Date: Mon, 14 Dec 2009 10:01:52 -0800 Subject: Question - Scheduled Downtime In-Reply-To: References: Message-ID: <4B267D90.4070504@hp.com> Edwin Zoeller wrote: > When a service or host is put in scheduled downtime does the monitor > still execute against the service it's checking? OR does it stop? I am > having a big debate here at work on this. I say, as what I understand, > is that when it is scheduled for downtime, no monitor is sent out for > that service. > > Please someone correct me if I am wrong and explain what really > happens so I can end this dispute. > You're wrong. Scheduling downtime only disables notifications. You can easily verify this by watching your logs during downtimes: You'll still see a host or service go offline, but no notification will go out. ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jcall at verio.net Mon Dec 14 19:41:08 2009 From: jcall at verio.net (Jonathan Call) Date: Mon, 14 Dec 2009 13:41:08 -0500 Subject: Nagios2 process overwhelmed by NSCA daemon? In-Reply-To: <4B25BD98.50204@aei.ca> References: <04F3233F47E2714CB7431AE913E57E7703AB6DA5@IAD-WPRD-XCHB02.corp.verio.net> <4B25BD98.50204@aei.ca> Message-ID: <04F3233F47E2714CB7431AE913E57E7703B5C18D@IAD-WPRD-XCHB02.corp.verio.net> See responses inline: > -----Original Message----- > From: Thomas Guyot-Sionnest [mailto:dermoth at aei.ca] > Sent: Sunday, December 13, 2009 9:23 PM > To: Jonathan Call > Cc: nagios-user Mailinglist > Subject: Re: [Nagios-users] Nagios2 process overwhelmed by NSCA daemon? > > On 09/12/09 06:06 PM, Jonathan Call wrote: > > I recently added two new slaves to a distributed Nagios system. The > > central server now passively processes 17,000+ service checks on > 3000+ > > servers. > > > > It's been over an hour and a half since I brought those new slaves > > online and I have about 150 hosts still stuck in 'Pending' and about > > 1300 services in the same state. In addition to that it seems that > the > > service check results from the other slaves that were working > normally > > are now arbitrarily disappearing. For example, on one host three of > the > > service checks have been updated relatively recently (i.e. 5-30 > minutes > > ago) but three other service checks haven't been updated for almost > an > > hour. The slaves all appear operational and the hosts are being > checked > > on time. Is it possible I've overwhelmed Nagios' ability to process > data > > from the NSCA daemon or struck some internal Nagios bottleneck? Any > > suggestions would be appreciated. > > Hummmm Very interesting. Which Nagios version are you using? Nagios 2.12 (May 19, 2008) on FreeBSD 6.3 > > This sounds a lot like a problem I encountered a few years ago with > passive checks. I had about 50-60 servers returning cron-scheduled > check > results to the Nagios server. 120 results ain't that much, but is > seemed > that with all the servers fully time-synced (using NTP) out of these > ~120 results I was often missing some of them, which would eventually > cause false-alarm due to stale services. > > I could easily reproduce the problem by feeding lots of results to > Nagios right when I was expecting a batch of passive results - this > would cause random results to be dropped. I spent some time trying to > debug this but I couldn't figure our where commands were dropped. My > primary target was the ring buffer used by the command reaper. As far > as > I can remember I tested with version of Nagios ranging from 2.3 to 2.5; > I never tried with recent version > > If you're running a recent version of nagios what do you get for > "Used/High/Total Command Buffers" in the "nagiostats" command output? > (you can also get these numbers from the web interface, "Performance > Info" in the left bar.). If it seems to be maxed out, you may try > setting "command_check_interval" to "-1" and raising the > "external_command_buffer_slots" option in nagios.cfg. > Buffer report from Nagiostats: Used/High/Total Command Buffers: 25 / 4096 / 4096 Used/High/Total Check Result Buffers: 0 / 4096 / 4096 Nagios config: command_check_interval=-1 external_command_buffer_slots=4096 > > If you're still having this problem with Nagios v3 and up I might try > to > reproduce this as well, and maybe I'll be able to figure out what's > wrong this time. Upgrading to Nagios v3 is being considered but isn't possible at this time. As I mentioned to someone else on this thread, it seems that having a large number of queries (status.cgi) being run against the web interface seems to provoke poor performance from the central server, this is even after we switched the main objects.cache and status.dat files to a memory disk. Jonathan This email message is intended for the use of the person to whom it has been sent, and may contain information that is confidential or legally protected. If you are not the intended recipient or have received this message in error, you are not authorized to copy, distribute, or otherwise use this message or its attachments. Please notify the sender immediately by return e-mail and permanently delete this message and any attachments. Verio, Inc. makes no warranty that this email is error or virus free. Thank you. ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From benny at bennyvision.com Mon Dec 14 20:17:05 2009 From: benny at bennyvision.com (C. Bensend) Date: Mon, 14 Dec 2009 13:17:05 -0600 Subject: Anyone testing Microsoft Volume Shadow Copy? Message-ID: <3d756a733b83a4283e25eaaf78e56f05.squirrel@webmail.stinkweasel.net> Hey folks, I am hitting a situation where certain Windows 2003 Server file servers will stop doing shadow copies on their share volumes. The service itself starts up and shuts down periodically like it should, but a new shadow copy is *not* created as viewed in Pervious Versions from a Windows box. Looking at the vssadmin.exe command on the Windows server, I see I can: vssadmin List Shadows And it prints out a listing of the existing shadow copies. However, without awk and grep (and no, I cannot install cygwin or anything like that), I don't have a clue how to process it. Is anyone already testing Volume Shadow Copy? Does anyone know of performance counters or anything I can use via the Nagios EventLog agent or NSClient++ to check to see that new shadow copies are being created? I'm running out of Google on this one... Thanks! Benny -- "It's not all about getting up and putting four slices of kickass in a two slice toaster." -- ark86, on Fazed.net ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jpratt at norwich.edu Mon Dec 14 20:25:01 2009 From: jpratt at norwich.edu (James Pratt) Date: Mon, 14 Dec 2009 14:25:01 -0500 Subject: Anyone testing Microsoft Volume Shadow Copy? In-Reply-To: <3d756a733b83a4283e25eaaf78e56f05.squirrel@webmail.stinkweasel.net> References: <3d756a733b83a4283e25eaaf78e56f05.squirrel@webmail.stinkweasel.net> Message-ID: <24B6509E4191AF44B60A24EAA3B4AD493627AD@nuexchg.norwich.edu> >> -----Original Message----- >> From: C. Bensend [mailto:benny at bennyvision.com] >> Sent: Monday, December 14, 2009 2:17 PM >> To: nagios-users at lists.sourceforge.net >> Subject: [Nagios-users] Anyone testing Microsoft Volume Shadow Copy? >> >> >> Hey folks, >> >> I am hitting a situation where certain Windows 2003 Server file >> servers will stop doing shadow copies on their share volumes. The >> service itself starts up and shuts down periodically like it >> should, but a new shadow copy is *not* created as viewed in Pervious >> Versions from a Windows box. >> >> Looking at the vssadmin.exe command on the Windows server, I see >> I can: >> >> >> vssadmin List Shadows >> >> >> And it prints out a listing of the existing shadow copies. >> However, without awk and grep (and no, I cannot install cygwin or >> anything like that), I don't have a clue how to process it. >> >> Is anyone already testing Volume Shadow Copy? Does anyone know >> of performance counters or anything I can use via the Nagios EventLog >> agent or NSClient++ to check to see that new shadow copies are being >> created? >> >> I'm running out of Google on this one... >> >> Thanks! >> >> Benny >> >> >> -- >> "It's not all about getting up and putting four slices of kickass >> in a two slice toaster." -- ark86, on Fazed.net Hi, this may help, but I cannot tell for sure in your particular case. I've installed it on a few servers here to resolve win2k3 VSS-related backup errors/issues. http://support.microsoft.com/kb/940349 cheers, James ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From benny at bennyvision.com Mon Dec 14 20:34:18 2009 From: benny at bennyvision.com (C. Bensend) Date: Mon, 14 Dec 2009 13:34:18 -0600 Subject: Anyone testing Microsoft Volume Shadow Copy? In-Reply-To: <24B6509E4191AF44B60A24EAA3B4AD493627AD@nuexchg.norwich.edu> References: <3d756a733b83a4283e25eaaf78e56f05.squirrel@webmail.stinkweasel.net> <24B6509E4191AF44B60A24EAA3B4AD493627AD@nuexchg.norwich.edu> Message-ID: >>> -----Original Message----- >>> From: C. Bensend [mailto:benny at bennyvision.com] >>> I am hitting a situation where certain Windows 2003 Server file >>> servers will stop doing shadow copies on their share volumes. The >>> service itself starts up and shuts down periodically like it >>> should, but a new shadow copy is *not* created as viewed in Pervious >>> Versions from a Windows box. >>> >>> Looking at the vssadmin.exe command on the Windows server, I see >>> I can: >>> >>> >>> vssadmin List Shadows >>> >>> >>> And it prints out a listing of the existing shadow copies. >>> However, without awk and grep (and no, I cannot install cygwin or >>> anything like that), I don't have a clue how to process it. >>> >>> Is anyone already testing Volume Shadow Copy? Does anyone know >>> of performance counters or anything I can use via the Nagios EventLog >>> agent or NSClient++ to check to see that new shadow copies are being >>> created? >>> >>> I'm running out of Google on this one... >>> >>> Thanks! >>> >>> Benny >>> >>> >>> -- >>> "It's not all about getting up and putting four slices of kickass >>> in a two slice toaster." -- ark86, on Fazed.net > > > Hi, this may help, but I cannot tell for sure in your particular case. > I've installed it on a few servers here to resolve win2k3 VSS-related > backup errors/issues. > > http://support.microsoft.com/kb/940349 Hey James, Thanks for the article... I'll pass that along to the Windows guys and have them read it over. However, problem or not, I'd still like to monitor the creation of shadow copies... So even if this would fix their problem, the original question stands - how to monitor shadow copies. :) Thanks much! Benny -- "It's not all about getting up and putting four slices of kickass in a two slice toaster." -- ark86, on Fazed.net ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From natxo.asenjo at gmail.com Mon Dec 14 20:52:48 2009 From: natxo.asenjo at gmail.com (Natxo Asenjo) Date: Mon, 14 Dec 2009 20:52:48 +0100 Subject: Anyone testing Microsoft Volume Shadow Copy? In-Reply-To: <3d756a733b83a4283e25eaaf78e56f05.squirrel@webmail.stinkweasel.net> References: <3d756a733b83a4283e25eaaf78e56f05.squirrel@webmail.stinkweasel.net> Message-ID: <90f6e8270912141152l68761fedk97bb235f8f49ac90@mail.gmail.com> On Mon, Dec 14, 2009 at 8:17 PM, C. Bensend wrote: > > ? Looking at the vssadmin.exe command on the Windows server, I see > I can: > > > vssadmin List Shadows > > > ? And it prints out a listing of the existing shadow copies. > However, without awk and grep (and no, I cannot install cygwin or > anything like that), I don't have a clue how to process it. you'll have to parse the output of that command with the tools at hand: vbscript or powershell, even and oldfashined bat file. Write your own plugin, it is not that hard. Once you have the script, you have it executed by nrpe (nsclient++ has an nrpe client). -- natxo ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Mon Dec 14 20:57:57 2009 From: marc at ena.com (Marc Powell) Date: Mon, 14 Dec 2009 13:57:57 -0600 Subject: Nagios2 process overwhelmed by NSCA daemon? In-Reply-To: <04F3233F47E2714CB7431AE913E57E7703B5C18D@IAD-WPRD-XCHB02.corp.verio.net> References: <04F3233F47E2714CB7431AE913E57E7703AB6DA5@IAD-WPRD-XCHB02.corp.verio.net> <4B25BD98.50204@aei.ca> <04F3233F47E2714CB7431AE913E57E7703B5C18D@IAD-WPRD-XCHB02.corp.verio.net> Message-ID: On Dec 14, 2009, at 12:41 PM, Jonathan Call wrote: > See responses inline: >> >> On 09/12/09 06:06 PM, Jonathan Call wrote: >>> I recently added two new slaves to a distributed Nagios system. The >>> central server now passively processes 17,000+ service checks on >> 3000+ >>> servers. > Buffer report from Nagiostats: > Used/High/Total Command Buffers: 25 / 4096 / 4096 > Used/High/Total Check Result Buffers: 0 / 4096 / 4096 > > Nagios config: > command_check_interval=-1 > external_command_buffer_slots=4096 You're hitting this limit and you should increase it. Try doubling it, then monitor. -- Marc ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From yinyang at eburg.com Tue Dec 15 01:56:32 2009 From: yinyang at eburg.com (Gordon Messmer) Date: Mon, 14 Dec 2009 16:56:32 -0800 Subject: Notification period inheritance problem Message-ID: <4B26DEC0.7010607@eburg.com> The documentation on "Implied Inheritance"[1] indicates that a service with no notification period will use the notification period of the associated host definition. This doesn't seem to work when a service is applied to a hostgroup. In my configuration files, I define a generic service template of my own. It has no notification period. I also define a host template for each customer which defines the hostgroup and notification period which is appropriate for each host. Finally, I define a service which will apply to all hosts within that hostgroup. When I run "nagios -v", I get warnings that no notification period is given: Warning: Service 'root partition' on host 'customer1-samba' has no notification time period defined! I've received some pages from hosts when checks timed out during a period when no notications should be sent. Is this a bug, or am I making some mistake? Shouldn't the service be inheriting the value from the host? Where would I start looking if it were a bug and I wanted to fix it? define service{ use generic-service name my-generic-service ; The 'name' of this service template notification_period null ; Use the period from the host register 0 } define host{ name customer1-linux-server use my-generic-server hostgroups my-linux-servers notification_period my-workhours register 0 } define service{ use my-generic-service hostgroup_name my-linux-servers service_description root partition check_command ssh_disk!20%!10%!/ } 1: http://nagios.sourceforge.net/docs/3_0/objectinheritance.html ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From amoran at apple.com Tue Dec 15 02:25:04 2009 From: amoran at apple.com (Andy Moran) Date: Mon, 14 Dec 2009 17:25:04 -0800 Subject: nrpe producing odd garbage in output? Message-ID: Run remotely, odd garbage returned: obfuscatedserver:~ amoran$ sudo -u nagios /opt/local/libexec/nagios/check_nrpe -H obfuscatedclient -c check_load -a 5.0,4.0,3.0 20.0,15.0,10.0 WARNING - load average: 5.28, 5.00, 4.78|load1=5.285;5.000;20.000;0; load5=5.003;4.000;15.000;0; load15=4.785;3.000;10.000;0; ?R[??h???+?? obfuscatedserver:~ amoran$ run locally, no garbage: obfuscatedclient:~ amoran$ sudo -u nagios /opt/local/libexec/nagios/check_load -w 5.0,4.0,3.0 -c 20.0,15.0,10.0 WARNING - load average: 4.46, 4.77, 4.74|load1=4.461;5.000;20.000;0; load5=4.774;4.000;15.000;0; load15=4.735;3.000;10.000;0; obfuscatedclient:~ amoran$ So it would seem the garbage is coming from nrpe. I've compiled nrpe using 2.8.1 and it's running on Snow Leopard (OS X 10.6). Anyone seen anything like this? --Andy -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From justinp at norchemlab.com Tue Dec 15 04:12:34 2009 From: justinp at norchemlab.com (Justin Pryzby) Date: Mon, 14 Dec 2009 20:12:34 -0700 Subject: nrpe producing odd garbage in output? In-Reply-To: References: Message-ID: <20091215031234.GA28769@norchemlab.com> On Mon, Dec 14, 2009 at 05:25:04PM -0800, Andy Moran wrote: > > Run remotely, odd garbage returned: > > obfuscatedserver:~ amoran$ sudo -u nagios /opt/local/libexec/nagios/check_nrpe -H obfuscatedclient -c check_load -a 5.0,4.0,3.0 20.0,15.0,10.0 > WARNING - load average: 5.28, 5.00, 4.78|load1=5.285;5.000;20.000;0; load5=5.003;4.000;15.000;0; load15=4.785;3.000;10.000;0; > ?R[??h???+?? > obfuscatedserver:~ amoran$ > > run locally, no garbage: > > obfuscatedclient:~ amoran$ sudo -u nagios /opt/local/libexec/nagios/check_load -w 5.0,4.0,3.0 -c 20.0,15.0,10.0 > WARNING - load average: 4.46, 4.77, 4.74|load1=4.461;5.000;20.000;0; load5=4.774;4.000;15.000;0; load15=4.735;3.000;10.000;0; > obfuscatedclient:~ amoran$ > > > So it would seem the garbage is coming from nrpe. I've compiled nrpe using 2.8.1 and it's running on Snow Leopard (OS X 10.6). Anyone seen anything like this? Yes, I noticed it in the web interface, and hadn't gotten around to digging into it. Thanks for bringing it up. It seems that upgrading to 2.12 will resolve it: 2.12 - 03/10/2008 ----------------- - Fix for unterminated multiline plugin (garbage) output (Krzysztof Oledzki) Justin ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mmelin at gmail.com Tue Dec 15 09:52:34 2009 From: mmelin at gmail.com (Martin Melin) Date: Tue, 15 Dec 2009 09:52:34 +0100 Subject: Notification period inheritance problem In-Reply-To: <4B26DEC0.7010607@eburg.com> References: <4B26DEC0.7010607@eburg.com> Message-ID: Have you tried leaving out the notification_period line from the service definition? I think, but don't have time to verify, that by saying "notification_period null" you are actually overriding all inherited values for notification_period and setting it to null instead. Best regards, Martin Melin On Tue, Dec 15, 2009 at 1:56 AM, Gordon Messmer wrote: > The documentation on "Implied Inheritance"[1] indicates that a service > with no notification period will use the notification period of the > associated host definition. ?This doesn't seem to work when a service is > applied to a hostgroup. > > In my configuration files, I define a generic service template of my > own. ?It has no notification period. ?I also define a host template for > each customer which defines the hostgroup and notification period which > is appropriate for each host. ?Finally, I define a service which will > apply to all hosts within that hostgroup. > > When I run "nagios -v", I get warnings that no notification period is given: > Warning: Service 'root partition' on host 'customer1-samba' has no > notification time period defined! > > I've received some pages from hosts when checks timed out during a > period when no notications should be sent. > > Is this a bug, or am I making some mistake? ?Shouldn't the service be > inheriting the value from the host? ?Where would I start looking if it > were a bug and I wanted to fix it? > > > > define service{ > ? ? ? ? use ? ? ? ? ? ? ? ? ? ? ? ? ? ? generic-service > ? ? ? ? name ? ? ? ? ? ? ? ? ? ? ? ? ? ?my-generic-service ; The 'name' > of this service template > ? ? ? ? notification_period ? ? ? ? ? ? null ? ? ? ? ? ?; Use the > period from the host > ? ? ? ? register ? ? ? ? ? ? ? ? ? ? ? ?0 > ? ? ? ? } > > define host{ > ? ? ? ? name ? ? ? ? ? ?customer1-linux-server > ? ? ? ? use ? ? ? ? ? ? my-generic-server > ? ? ? ? hostgroups ? ? ?my-linux-servers > ? ? ? ? notification_period ? ? my-workhours > ? ? ? ? register ? ? ? ?0 > ? ? ? ? } > > define ?service{ > ? ? ? ? use ? ? ? ? ? ? ? ? ? ? my-generic-service > ? ? ? ? hostgroup_name ? ? ? ? ?my-linux-servers > ? ? ? ? service_description ? ? root partition > ? ? ? ? check_command ? ? ? ? ? ssh_disk!20%!10%!/ > ? ? ? ? } > > > 1: http://nagios.sourceforge.net/docs/3_0/objectinheritance.html > > ------------------------------------------------------------------------------ > Return on Information: > Google Enterprise Search pays you back > Get the facts. > http://p.sf.net/sfu/google-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From david.dumortier at linagora.com Tue Dec 15 14:07:34 2009 From: david.dumortier at linagora.com (David Dumortier) Date: Tue, 15 Dec 2009 14:07:34 +0100 Subject: benchmark question In-Reply-To: <31275029.44311260809584268.JavaMail.root@mail.opservices.com.br> References: <31275029.44311260809584268.JavaMail.root@mail.opservices.com.br> Message-ID: <3d74b035291e4d317dc3e8582b079659.squirrel@intranet.linagora.com> Hi Patrick, hi all, Dario B. Bestetti (OpServices) a ?crit : > > > ----- "Patrick Morris" escreveu: > >> shadih rahman wrote: >> > List, >> > Is there any information as to what is the maximum number of >> > services one can run on a single box with single instance of Nagios? >> >> > I did not find any concrete data on this. >> > >> > I am running a single instance of nagios on a quad core 2.5 GHZ >> > machine with with 4 Gigs of RAM. >> > >> > I have total of 7359 service check running on this box. I have also >> >> > ndoutils running on the same box as backend. >> > >> > My total service checks is going to five fold very soon with a lot >> of >> > nrpe checks. >> > >> > Now, my question is should I run multiple instance of nagios on the >> >> > same box or a single instance will be able to handle about 30000 >> > service checks? >> >> In my experience, you'll run into issues right around 8,000 checks or >> >> so, depending no output verbosity, on a fairly stock setup. This is >> due >> to the size of the pipe used to temporarily store check results, at >> least on Linux (and you may be seeing it already with your number of >> checks). Around that point, even with quite a bit of tuning, the check >> >> results will fill the pipe in less than a second, which is the minimum >> >> amount of time I've been able to configure Nagios to flush it. When >> that >> happens, latencies go through the roof. >> >> Distributing the checks doesn't solve the problem if you're still >> sending the results to a centralized Nagios instance, since one >> machine >> still needs to process all of them. >> >> I'm working through this situation now, and it's looking like it may >> take a custom kernel with a larger pipe size to handle >> it. >> >> > Shadih, we have a customer running over 20.000 service checks in a single > box. > > The box is 2x Intel Dual-core Xeon 3Ghz with 8Gb RAM. The latency is > around 0.7s. The checks are performed between 3min and 5min. They have > around 40 simultaneous users. > > _________________________________________________ > Dario B. Bestetti OpServices I had to install a bunch of nagios for 400000 polls. We ran some tests on bi-Xeon Dual-Core with 4Go. It seems the limit of Nagios 3 is around 25000 services by server, but we didn't use the interface and the recording to database is with Perl scripts of our own. Taking 20000 services as a limit seem to be the right thing to do, perhaps less as 15000 with NDO. Regards, -- David Dumortier LINAGORA Service Management Monitoring ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From dario.bestetti at opservices.com.br Tue Dec 15 14:37:05 2009 From: dario.bestetti at opservices.com.br (Dario B. Bestetti (OpServices)) Date: Tue, 15 Dec 2009 11:37:05 -0200 (BRST) Subject: benchmark question In-Reply-To: <3d74b035291e4d317dc3e8582b079659.squirrel@intranet.linagora.com> References: <3d74b035291e4d317dc3e8582b079659.squirrel@intranet.linagora.com> Message-ID: <12553564.50681260884225266.JavaMail.root@mail.opservices.com.br> ----- "David Dumortier" escreveu: > Hi Patrick, hi all, > > Dario B. Bestetti (OpServices) a ?crit : > > > > > > ----- "Patrick Morris" escreveu: > > > >> shadih rahman wrote: > >> > List, > >> > Is there any information as to what is the maximum number of > >> > services one can run on a single box with single instance of > Nagios? > >> > >> > I did not find any concrete data on this. > >> > > >> > I am running a single instance of nagios on a quad core 2.5 GHZ > >> > machine with with 4 Gigs of RAM. > >> > > >> > I have total of 7359 service check running on this box. I have > also > >> > >> > ndoutils running on the same box as backend. > >> > > >> > My total service checks is going to five fold very soon with a > lot > >> of > >> > nrpe checks. > >> > > >> > Now, my question is should I run multiple instance of nagios on > the > >> > >> > same box or a single instance will be able to handle about 30000 > >> > service checks? > >> > >> In my experience, you'll run into issues right around 8,000 checks > or > >> > >> so, depending no output verbosity, on a fairly stock setup. This > is > >> due > >> to the size of the pipe used to temporarily store check results, > at > >> least on Linux (and you may be seeing it already with your number > of > >> checks). Around that point, even with quite a bit of tuning, the > check > >> > >> results will fill the pipe in less than a second, which is the > minimum > >> > >> amount of time I've been able to configure Nagios to flush it. > When > >> that > >> happens, latencies go through the roof. > >> > >> Distributing the checks doesn't solve the problem if you're still > >> sending the results to a centralized Nagios instance, since one > >> machine > >> still needs to process all of them. > >> > >> I'm working through this situation now, and it's looking like it > may > >> take a custom kernel with a larger pipe size to handle > >> it. > >> > >> > > Shadih, we have a customer running over 20.000 service checks in a > single > > box. > > > > The box is 2x Intel Dual-core Xeon 3Ghz with 8Gb RAM. The latency > is > > around 0.7s. The checks are performed between 3min and 5min. They > have > > around 40 simultaneous users. > > > > _________________________________________________ > > Dario B. Bestetti OpServices > > I had to install a bunch of nagios for 400000 polls. We ran some tests > on > bi-Xeon Dual-Core with 4Go. It seems the limit of Nagios 3 is around > 25000 > services by server, but we didn't use the interface and the recording > to > database is with Perl scripts of our own. Taking 20000 services as a > limit > seem to be the right thing to do, perhaps less as 15000 with NDO. > > Regards, > -- > David Dumortier > LINAGORA > Service Management Monitoring > > > > ------------------------------------------------------------------------------ > Return on Information: > Google Enterprise Search pays you back > Get the facts. > http://p.sf.net/sfu/google-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null I forgot to tell that we did run some stress tests in this customer with up to 30.000 services and Nagios supported it very well. We also use our own broker to store data in a MySQL database. Regards, Dario ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From shacky83 at gmail.com Tue Dec 15 16:34:40 2009 From: shacky83 at gmail.com (shacky) Date: Tue, 15 Dec 2009 16:34:40 +0100 Subject: Interface for control room In-Reply-To: <8480e60f0912140906o5cddf711g8f0425933e939d28@mail.gmail.com> References: <7fedbc910912121556k317d3d59k30127b4d2d030aaa@mail.gmail.com> <4B24E8EA.90609@nordal-lund.dk> <8480e60f0912140906o5cddf711g8f0425933e939d28@mail.gmail.com> Message-ID: <7fedbc910912150734l7fb517fxef3d8cacdc2bf017@mail.gmail.com> > I do this with NagVis. Quite simple and powerfull. You can create new maps > and add your own iconset (big blinking animated gif!) > http://www.nagvis.org/ Thank you very much. I will try Centreon and Nagvis. The complication I forgot to tell you in my first post is that I need to monitor at least three different Nagios servers. How can I solve this requirement? ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Tue Dec 15 16:45:05 2009 From: ae at op5.se (Andreas Ericsson) Date: Tue, 15 Dec 2009 16:45:05 +0100 Subject: Interface for control room In-Reply-To: <7fedbc910912150734l7fb517fxef3d8cacdc2bf017@mail.gmail.com> References: <7fedbc910912121556k317d3d59k30127b4d2d030aaa@mail.gmail.com> <4B24E8EA.90609@nordal-lund.dk> <8480e60f0912140906o5cddf711g8f0425933e939d28@mail.gmail.com> <7fedbc910912150734l7fb517fxef3d8cacdc2bf017@mail.gmail.com> Message-ID: <4B27AF01.8090806@op5.se> On 12/15/2009 04:34 PM, shacky wrote: >> I do this with NagVis. Quite simple and powerfull. You can create new maps >> and add your own iconset (big blinking animated gif!) >> http://www.nagvis.org/ > > Thank you very much. > I will try Centreon and Nagvis. > > The complication I forgot to tell you in my first post is that I need > to monitor at least three different Nagios servers. > How can I solve this requirement? > Get three different monitors and hook up with virtual desktops to one computer. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace. ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rlemerlus at merethis.com Tue Dec 15 16:47:40 2009 From: rlemerlus at merethis.com (Romain Le Merlus) Date: Tue, 15 Dec 2009 16:47:40 +0100 Subject: Interface for control room In-Reply-To: <7fedbc910912150734l7fb517fxef3d8cacdc2bf017@mail.gmail.com> References: <7fedbc910912121556k317d3d59k30127b4d2d030aaa@mail.gmail.com> <4B24E8EA.90609@nordal-lund.dk> <8480e60f0912140906o5cddf711g8f0425933e939d28@mail.gmail.com> <7fedbc910912150734l7fb517fxef3d8cacdc2bf017@mail.gmail.com> Message-ID: <8d9ba4010912150747x7836ac8du89b9390770d818b2@mail.gmail.com> On Tue, Dec 15, 2009 at 4:34 PM, shacky wrote: > The complication I forgot to tell you in my first post is that I need > to monitor at least three different Nagios servers. > How can I solve this requirement? > You can administrate your different Nagios poller through Centreon frontend. Communication between them is based on SSH. Each host definition is link to a poller, so the configuration dispatch is automatic. -- Romain LE MERLUS -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Return on Information: Google Enterprise Search pays you back Get the facts. http://p.sf.net/sfu/google-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From yinyang at eburg.com Tue Dec 15 18:33:53 2009 From: yinyang at eburg.com (Gordon Messmer) Date: Tue, 15 Dec 2009 09:33:53 -0800 Subject: Notification period inheritance problem In-Reply-To: References: <4B26DEC0.7010607@eburg.com> Message-ID: <4B27C881.6030401@eburg.com> On 12/15/2009 12:52 AM, Martin Melin wrote: > Have you tried leaving out the notification_period line from the > service definition? I think, but don't have time to verify, that by > saying "notification_period null" you are actually overriding all > inherited values for notification_period and setting it to null > instead. > That's kind of the point. I want the service to inherit its notification period from the hosts it checks. In order to do that, its own notification period has to be null. ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Tue Dec 15 19:49:09 2009 From: marc at ena.com (Marc Powell) Date: Tue, 15 Dec 2009 12:49:09 -0600 Subject: Notification period inheritance problem In-Reply-To: <4B27C881.6030401@eburg.com> References: <4B26DEC0.7010607@eburg.com> <4B27C881.6030401@eburg.com> Message-ID: On Dec 15, 2009, at 11:33 AM, Gordon Messmer wrote: > On 12/15/2009 12:52 AM, Martin Melin wrote: >> Have you tried leaving out the notification_period line from the >> service definition? I think, but don't have time to verify, that by >> saying "notification_period null" you are actually overriding all >> inherited values for notification_period and setting it to null >> instead. >> > > That's kind of the point. I want the service to inherit its > notification period from the hosts it checks. In order to do that, its > own notification period has to be null. No, in order to do that, the notification_period needs to be unspecified. null != unspecified. Because you're explicitly setting it to null, you break the implied inheritance. To take advantage of the implied inheritance, you need to leave out the directive entirely as indicated above. -- Marc ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From yinyang at eburg.com Tue Dec 15 21:04:06 2009 From: yinyang at eburg.com (Gordon Messmer) Date: Tue, 15 Dec 2009 12:04:06 -0800 Subject: Notification period inheritance problem In-Reply-To: References: <4B26DEC0.7010607@eburg.com> <4B27C881.6030401@eburg.com> Message-ID: <4B27EBB6.7050403@eburg.com> On 12/15/2009 10:49 AM, Marc Powell wrote: > > No, in order to do that, the notification_period needs to be unspecified. null != unspecified. Because you're explicitly setting it to null, you break the implied inheritance. > > To take advantage of the implied inheritance, you need to leave out the directive entirely as indicated above. > Quoting from the documentation: The following table lists the object variables that will be implicitly inherited from related objects if you don't explicitly specify their value in your object definition or inherit them from a template ...so I need to not inherit the value from my template. How do I do that? In some cases you may not want your host, service, or contact definitions to inherit values of string variables from the templates they reference. If this is the case, you can specify "*null*" (without quotes) as the value of the variable that you do not want to inherit. If I don't specify a value, it'll be inherited. Right? How will that help me not specify a value for the service? -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Tue Dec 15 22:38:41 2009 From: marc at ena.com (Marc Powell) Date: Tue, 15 Dec 2009 15:38:41 -0600 Subject: Notification period inheritance problem In-Reply-To: <4B27EBB6.7050403@eburg.com> References: <4B26DEC0.7010607@eburg.com> <4B27C881.6030401@eburg.com> <4B27EBB6.7050403@eburg.com> Message-ID: <85963A63-74F3-4292-8472-CE9DC45F22AD@ena.com> On Dec 15, 2009, at 2:04 PM, Gordon Messmer wrote: > On 12/15/2009 10:49 AM, Marc Powell wrote: >> >> No, in order to do that, the notification_period needs to be unspecified. null != unspecified. Because you're explicitly setting it to null, you break the implied inheritance. >> >> To take advantage of the implied inheritance, you need to leave out the directive entirely as indicated above. >> >> > > Quoting from the documentation: > > The following table lists the object variables that will be implicitly inherited from related objects if you don't explicitly specify their value in your object definition or inherit them from a template > > ...so I need to not inherit the value from my template. How do I do that? Don't specify the directive in the template. > In some cases you may not want your host, service, or contact definitions to inherit values of string variables from the templates they reference. If this is the case, you can specify "null" (without quotes) as the value of the variable that you do not want to inherit. You're mixing sections of the documentation to try to make a connection that doesn't exist. Again, specified with a value of 'null' does not equate to unspecified. Looking at the code, if you specify the value in any way, including a value of 'null', it breaks the implied inheritance. xodtemplate.h -- #define XODTEMPLATE_NULL "null" xdata/xodtemplate.c -- else if(!strcmp(variable,"notification_period")){ if(strcmp(value,XODTEMPLATE_NULL)){ if((temp_service->notification_period=(char *)strdup(value))==NULL) result=ERROR; } temp_service->have_notification_period=TRUE; } 'If the variable is "notification_period" and the value is not the word "null", save the value unless the value is blank (error otherwise). If the value is the word "null" or something else, set the service have_notification_period flag to TRUE." Further down -- /* services inherit notification period from host if not already specified */ if(temp_service->have_notification_period==FALSE && temp_host->have_notification_period==TRUE && temp_host->notification_period!=NULL){ temp_service->notification_period=(char *)strdup(temp_host->notification_period); temp_service->have_notification_period=TRUE; } "If the service 'have_notification_period' flag is not TRUE and the host and the host has a notification period, use that for the service." As you can see, you fail that first condition. > If I don't specify a value, it'll be inherited. Right? How will that help me not specify a value for the service? If you don't specify the value in the template, there's nothing to inherit from the template. If the directive is also not specified in the service{} definition, the template engine will look to the value in the host{} definition. -- Marc ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From PetterborgCa at ldschurch.org Tue Dec 15 23:38:22 2009 From: PetterborgCa at ldschurch.org (Cary Petterborg) Date: Tue, 15 Dec 2009 15:38:22 -0700 Subject: Nagios 3.2 and max_concurrent_checks=0 Message-ID: I'm doing some testing for migrating our installation to Nagios3.2. The test server is running 3.2 on an 8 CPU box with 34,000 active service checks and 3,000 active host checks. The initial configuration file had max_concurrent_checks=0, but latency was about 9,000 seconds. I changed it to max_concurrent_checks=200 and the latency went down to about 7,000 seconds. I then set it to 2,000 and the latency dropped to about 200 seconds. I currently have it set to 100,000 and latency has not changed from about 200 seconds. >From all the documentation I have seen, if max_concurrent_checks is set to zero, there should be no limit on the number of concurrent checks, but this doesn't appear to be the case. Is there some other part of the configuration that I'm missing which would make max_concurrent_checks=0 be limited instead of unlimited? Cary Petterborg ICS Monitoring The Church of Jesus Christ of Latter-day Saints Office Phone: 801-240-8267 Email: petterborgca at ldschurch.org NOTICE: This email message is for the sole use of the intended recipient(s) and may contain confidential and privileged information. Any unauthorized review, use, disclosure or distribution is prohibited. If you are not the intended recipient, please contact the sender by reply email and destroy all copies of the original message. ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ampranti at gmail.com Tue Dec 15 23:57:07 2009 From: ampranti at gmail.com (Brandino Andreas) Date: Wed, 16 Dec 2009 00:57:07 +0200 Subject: Nagios 30 seconds initial delay Message-ID: <1595932556.20091216005707@gmail.com> Hi all, I am using Nagios 3.2.0 (I just upgrade from early 3.0 releases) Every time I restart nagios I face a 30 seconds delay before the various pages appear for first time (not before starting checks, 30 seconds before displaying pages!!!) When I click a page I get the error "Error: Could not read host and service status information!" . After 30 seconds, all pages appear again!! - I don't have duplicated nagios service running - This delay didn't show up to older versions - My linux is Debian 5.0.3 (stable) - This error appears every time I restart nagios Any idea what can cause this delay?? Thank you <> --- - - - --- <> Brandino Andreas ampranti at gmail.com <> --- - - - --- <> ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Edwin.Zoeller at ama-assn.org Wed Dec 16 00:24:19 2009 From: Edwin.Zoeller at ama-assn.org (Edwin Zoeller) Date: Tue, 15 Dec 2009 17:24:19 -0600 Subject: Question - Scripting to stop monitor Message-ID: Is it possible and if so can someone share with a script that will stop a monitor script for a certain time period then restart. Just putting the service in scheduled downtime still lets runs and I don't want to access the service being monitored. Thanks, Ed -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From perldork at webwizarddesign.com Wed Dec 16 00:25:36 2009 From: perldork at webwizarddesign.com (Max) Date: Tue, 15 Dec 2009 18:25:36 -0500 Subject: Nagios 30 seconds initial delay In-Reply-To: <1595932556.20091216005707@gmail.com> References: <1595932556.20091216005707@gmail.com> Message-ID: On Tue, Dec 15, 2009 at 5:57 PM, Brandino Andreas wrote: > When I click a page I get the error "Error: Could not read host and > service status information!" . After 30 seconds, all pages appear > again!! > > - I don't have duplicated nagios service running > - This delay didn't show up to older versions > - My linux is Debian 5.0.3 (stable) > - This error appears every time I restart nagios We typically see about the same start up time, both on 3.0.3 and 3.2.0 on hosts with 1-2k hosts and ~10k services where we have retention.dat files that are weeks or months old and we are using regular expressions and service -> hostgroup mappings extensively. - Max ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From yinyang at eburg.com Wed Dec 16 00:26:50 2009 From: yinyang at eburg.com (Gordon Messmer) Date: Tue, 15 Dec 2009 15:26:50 -0800 Subject: Notification period inheritance problem In-Reply-To: <85963A63-74F3-4292-8472-CE9DC45F22AD@ena.com> References: <4B26DEC0.7010607@eburg.com> <4B27C881.6030401@eburg.com> <4B27EBB6.7050403@eburg.com> <85963A63-74F3-4292-8472-CE9DC45F22AD@ena.com> Message-ID: <4B281B3A.9040803@eburg.com> On 12/15/2009 01:38 PM, Marc Powell wrote: > On Dec 15, 2009, at 2:04 PM, Gordon Messmer wrote: > >> Quoting from the documentation: >> >> The following table lists the object variables that will be implicitly inherited from related objects if you don't explicitly specify their value in your object definition or inherit them from a template >> >> ...so I need to not inherit the value from my template. How do I do that? >> > Don't specify the directive in the template. > Which means that I have to expand the template where I null-ify the notification period with the values from any of the templates it "uses". That seems like a minor inconvenience, but it's still uglier than it ought to be. When I do so, I still get the warning from "nagios -v". I'll test it to make sure that it's working. >> In some cases you may not want your host, service, or contact definitions to inherit values of string variables from the templates they reference. If this is the case, you can specify "null" (without quotes) as the value of the variable that you do not want to inherit. >> > You're mixing sections of the documentation to try to make a connection that doesn't exist. If that's so, then at the very least the documentation should be updated to note that this won't work. > Again, specified with a value of 'null' does not equate to unspecified. Looking at the code, if you specify the value in any way, including a value of 'null', it breaks the implied inheritance. > > xodtemplate.h -- > > #define XODTEMPLATE_NULL "null" > > xdata/xodtemplate.c -- > > else if(!strcmp(variable,"notification_period")){ > if(strcmp(value,XODTEMPLATE_NULL)){ > if((temp_service->notification_period=(char *)strdup(value))==NULL) > result=ERROR; > } > temp_service->have_notification_period=TRUE; > } > That sure looks like a bug to me. Even the indentation looks weird to me, is that the normal style? I can't imagine why "null" would be considered a valid notification period. If have_notification_period=TRUE were in the preceding block, the documentation would make more sense, and implied inheritance would be a lot easier to use. Do you think a patch to modify this behavior would be accepted? Thanks for the explanation. I appreciate your help. ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From shacky83 at gmail.com Wed Dec 16 01:32:52 2009 From: shacky83 at gmail.com (shacky) Date: Wed, 16 Dec 2009 01:32:52 +0100 Subject: NRPE refused connect Message-ID: <7fedbc910912151632j24ab45aameb95b16d589b4da3@mail.gmail.com> Hi. I installed the check_nrpe plugin on the Nagios server and NRPE running as daemon on the host I have to monitor, both compiled with SSL support and the dh.h file created and saved in the include/ directory on the NRPE host. [root at monitored-host ~]# /opt/nrpe/bin/nrpe NRPE - Nagios Remote Plugin Executor Copyright (c) 1999-2008 Ethan Galstad (nagios at nagios.org) Version: 2.12 Last Modified: 03-10-2008 License: GPL v2 with exemptions (-l for more info) SSL/TLS Available: Anonymous DH Mode, OpenSSL 0.9.6 or higher required TCP Wrappers Available nagios-server:/# /usr/local/nagios/libexec/check_nrpe Incorrect command line arguments supplied NRPE Plugin for Nagios Copyright (c) 1999-2008 Ethan Galstad (nagios at nagios.org) Version: 2.12 Last Modified: 03-10-2008 License: GPL v2 with exemptions (-l for more info) SSL/TLS Available: Anonymous DH Mode, OpenSSL 0.9.6 or higher required The IP address of the Nagios server is specified in the "allowed_hosts" declaration in the nrpe.cfg file: [root at monitored-host ~]# grep allowed_hosts /opt/nrpe/etc/nrpe.cfg allowed_hosts=192.168.10.250 The problem is that if I try to execute the check_nrpe command to test the NRPE daemon on the monitored host, I receive this error: nagios-server:/# /usr/local/nagios/libexec/check_nrpe -H 192.168.10.18 CHECK_NRPE: Error - Could not complete SSL handshake. On the monitored host I see that the IP address of the Nagios server is allowed and then refused: [root at monitored-host ~]# tail /var/log/messages Dec 16 01:24:27 monitored-host nrpe[25047]: INFO: SSL/TLS initialized. All network traffic will be encrypted. Dec 16 01:24:27 monitored-host nrpe[25048]: Starting up daemon Dec 16 01:24:27 monitored-host nrpe[25048]: Warning: Daemon is configured to accept command arguments from clients! Dec 16 01:24:27 monitored-host nrpe[25048]: Listening for connections on port 5666 Dec 16 01:24:27 monitored-host nrpe[25048]: Allowing connections from: 192.168.10.250 Dec 16 01:27:01 monitored-host nrpe[25063]: refused connect from 192.168.10.250 (192.168.10.250) What is the problem? It is not a firewall problem because the connection works, and it does not seems to be a SSL related problem because it does not work even if I try the check command disabling SSL with -n (and the NRPE daemon runned with -n also), and it is quite curious to have two opposite log messages. Could you help me please? I worked all the afternoon trying to let it work, but it does not work... Thank you very much!! Bye. ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From vikramms123 at gmail.com Wed Dec 16 03:37:03 2009 From: vikramms123 at gmail.com (VIKRAM MS) Date: Wed, 16 Dec 2009 08:07:03 +0530 Subject: Problem installing MRTG Message-ID: <2e22dff70912151837i3a9eb54ej6291c1319ad8f930@mail.gmail.com> Hi, 1) I dont have anything plugged to port 1. So I cannot monitor any port unless it is plugged to something. 2) Now I have this trouble while installing MRTG. I followed the steps given in http://oss.oetiker.ch/mrtg/doc/mrtg-unix-guide.en.html. Should I create any of those folders? [root at localhost bin]# pwd /usr/local/src/mrtg-2.16.2/bin [root at localhost bin]# ls cfgmaker cfgmaker~ indexmaker indexmaker~ mrtg mrtg~ mrtg-traffic-sum rateup rateup.o [root at localhost bin]# cfgmaker --global 'WorkDir: /home/httpd/mrtg' --global 'Options[_]: bits,growright' --output /home/mrtg/cfg/mrtg.cfg xyz at 12.345.67.890 ERROR: creating /home/mrtg/cfg/mrtg.cfg: No such file or directory [root at localhost bin]# Thanks Viki On Sun, Dec 13, 2009 at 7:44 PM, Greg Pangrazio wrote: > For the first error, do you have something plugged into port 1 on the > switch and is the port in the Up/UP state? > > > For the check_mrtg plugin you must have mrtg installed and graphing > utilization before you can configure the alterts via nagios. You > should check out the MRTG site for more information on that. > > > Greg Pangrazio > pangrazi at gmail.com > > > > > > On Sat, Dec 12, 2009 at 12:33 AM, VIKRAM MS wrote: > > Hello, > > > > I am trying to monitor Cisco switches and routers. I am able > to > > ping and check the uptime of the switch. > > 1) But for the port 1 link status, I am getting "SNMP CRITICAL - > *down(2)*" > > error. > > > > 2) For port 1 bandwidth usage, the error is "check_mrtgtraf: Unable to > open > > MRTG log file" > > I have installed MRTG, but I am not able to go any further. > Also > > for the http://localhost/mrtg/10.111.100.102_43.html check, I am getting > the > > following error > > 404 Not found > > The requested URL /mrtg/10.111.100.102_43.html was not found on this > server. > > > > I have checked for assistance in > > http://www.mail-archive.com/nagios-users at lists.sourceforge.net/ but in > vain. > > I am using Nagios Core version 3.2.0. > > Thanks in advance > > > > Regards > > Vikram > > > > > > > ------------------------------------------------------------------------------ > > Return on Information: > > Google Enterprise Search pays you back > > Get the facts. > > http://p.sf.net/sfu/google-dev2dev > > > > _______________________________________________ > > Nagios-users mailing list > > Nagios-users at lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when > reporting > > any issue. > > ::: Messages without supporting info will risk being sent to /dev/null > > > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Wed Dec 16 04:57:16 2009 From: patrick.morris at hp.com (Morris, Patrick) Date: Tue, 15 Dec 2009 19:57:16 -0800 Subject: Problem installing MRTG In-Reply-To: <2e22dff70912151837i3a9eb54ej6291c1319ad8f930@mail.gmail.com> References: <2e22dff70912151837i3a9eb54ej6291c1319ad8f930@mail.gmail.com> Message-ID: <4B285A9C.6080406@hp.com> VIKRAM MS wrote: > Hi, > > 1) I dont have anything plugged to port 1. So I cannot > monitor any port unless it is plugged to something. > > 2) Now I have this trouble while installing MRTG. I followed > the steps given in > http://oss.oetiker.ch/mrtg/doc/mrtg-unix-guide.en.html. > Should I create any of those folders? > > [root at localhost bin]# pwd > /usr/local/src/mrtg-2.16.2/bin > [root at localhost bin]# ls > cfgmaker cfgmaker~ indexmaker indexmaker~ mrtg mrtg~ > mrtg-traffic-sum rateup rateup.o > [root at localhost bin]# cfgmaker --global 'WorkDir: /home/httpd/mrtg' > --global 'Options[_]: bits,growright' --output /home/mrtg/cfg/mrtg.cfg > xyz at 12.345.67.890 > ERROR: creating /home/mrtg/cfg/mrtg.cfg: No such file or directory > [root at localhost bin]# Yes, if you're going to tell mrtg to place your config in a specific directory with the --output directive, that directory should exist. And, again, this really isn't the right place to look for help on setting up MRTG. You may want to try http://oss.oetiker.ch/mrtg/support/ ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mmelin at gmail.com Wed Dec 16 08:03:40 2009 From: mmelin at gmail.com (Martin Melin) Date: Wed, 16 Dec 2009 08:03:40 +0100 Subject: Notification period inheritance problem In-Reply-To: <4B281B3A.9040803@eburg.com> References: <4B26DEC0.7010607@eburg.com> <4B27C881.6030401@eburg.com> <4B27EBB6.7050403@eburg.com> <85963A63-74F3-4292-8472-CE9DC45F22AD@ena.com> <4B281B3A.9040803@eburg.com> Message-ID: I don't understand why this is confusing. You have a service template that defines a timeperiod. All services that use that template will inherit the notification_period value from the template, if they don't specify a notification_period themselves. By setting "notification_period null" in your service definition, you are explicitly overriding all forms of inheritance and setting the notification_period to null, which happens to be a special notification_period. So, the issue you are running into is due to you expecting implied inheritance to take precedence over values defined in the service definition itself. This would break the documented and expected behavior of inheritance, and a lot of people's configurations. Now, I understand how you could run into this issue. But take a look at the documentation again: >>> Quoting from the documentation: >>> >>> The following table lists the object variables that will be implicitly inherited from related objects if you don't explicitly specify their value in your object definition or inherit them from a template The implied inheritance variables will only be inherited from related objects, if they are not specified in either the object definition or template. In your case, notification_period is defined in your template, so the implied inheritance is already overridden. You can't bring that back by also explicitly defining a notification_period in your service definition, even if that happens to be null, as null is also a valid value for notification_period. Something that definitely would work is for you to define another service template without a notification_period value, and use that for checks you want implied inheritance from. Something that might work, but unfortunately I can't try it as I've already spent too much time on this email :-), is to set notification_period to null in the template, which could mean that the service definition will behave as though the notification_period value is unset in the template. This would allow you to let the "null notification period" template inherit from your normal template. Best regards, Martin Melin On Wed, Dec 16, 2009 at 12:26 AM, Gordon Messmer wrote: > On 12/15/2009 01:38 PM, Marc Powell wrote: >> On Dec 15, 2009, at 2:04 PM, Gordon Messmer wrote: >> >>> Quoting from the documentation: >>> >>> The following table lists the object variables that will be implicitly inherited from related objects if you don't explicitly specify their value in your object definition or inherit them from a template >>> >>> ...so I need to not inherit the value from my template. ?How do I do that? >>> >> Don't specify the directive in the template. >> > > Which means that I have to expand the template where I null-ify the > notification period with the values from any of the templates it > "uses". ?That seems like a minor inconvenience, but it's still uglier > than it ought to be. ?When I do so, I still get the warning from "nagios > -v". ?I'll test it to make sure that it's working. > >>> In some cases you may not want your host, service, or contact definitions to inherit values of string variables from the templates they reference. If this is the case, you can specify "null" (without quotes) as the value of the variable that you do not want to inherit. >>> >> You're mixing sections of the documentation to try to make a connection that doesn't exist. > > If that's so, then at the very least the documentation should be updated > to note that this won't work. > >> Again, specified with a value of 'null' does not equate to unspecified. Looking at the code, if you specify the value in any way, including a value of 'null', it breaks the implied inheritance. >> >> xodtemplate.h -- >> >> #define XODTEMPLATE_NULL ? ? ? ? ? ? ? ? ?"null" >> >> xdata/xodtemplate.c -- >> >> ? ? ? ? ?else if(!strcmp(variable,"notification_period")){ >> ? ? ? ? ? ? ?if(strcmp(value,XODTEMPLATE_NULL)){ >> ? ? ? ? ? ? ? ? ?if((temp_service->notification_period=(char *)strdup(value))==NULL) >> ? ? ? ? ? ? ? ? ? ? ?result=ERROR; >> ? ? ? ? ? ? ? ? ? ? ?} >> ? ? ? ? ? ? ?temp_service->have_notification_period=TRUE; >> ? ? ? ? ? ? ? ? ?} >> > > That sure looks like a bug to me. ?Even the indentation looks weird to > me, is that the normal style? ?I can't imagine why "null" would be > considered a valid notification period. ?If > have_notification_period=TRUE were in the preceding block, the > documentation would make more sense, and implied inheritance would be a > lot easier to use. ?Do you think a patch to modify this behavior would > be accepted? > > Thanks for the explanation. ?I appreciate your help. > > > ------------------------------------------------------------------------------ > This SF.Net email is sponsored by the Verizon Developer Community > Take advantage of Verizon's best-in-class app development support > A streamlined, 14 day to market process makes app distribution fast and easy > Join now and get one step closer to millions of Verizon customers > http://p.sf.net/sfu/verizon-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From yinyang at eburg.com Wed Dec 16 11:05:35 2009 From: yinyang at eburg.com (Gordon Messmer) Date: Wed, 16 Dec 2009 02:05:35 -0800 Subject: Notification period inheritance problem In-Reply-To: References: <4B26DEC0.7010607@eburg.com> <4B27C881.6030401@eburg.com> <4B27EBB6.7050403@eburg.com> <85963A63-74F3-4292-8472-CE9DC45F22AD@ena.com> <4B281B3A.9040803@eburg.com> Message-ID: <4B28B0EF.3070002@eburg.com> On 12/15/2009 11:03 PM, Martin Melin wrote: > I don't understand why this is confusing. > ... > By setting "notification_period null" in your service definition, you > are explicitly overriding all forms of inheritance and setting the > notification_period to null, which happens to be a special > notification_period. > Is that so? What does the special value do? In all of the documentation, I only see "null" mentioned once. In a section labeled "Cancelling Inheritance of String Values". The documentation indicates that "null" can be used to prevent inheriting a value. Neither of those things say to me, the user, that "null" is a special notification period. > So, the issue you are running into is due to you expecting implied > inheritance to take precedence over values defined in the service > definition itself. Only where the value is "null", which is documented only to prevent the inheritance of a value. > This would break the documented and expected > behavior of inheritance, and a lot of people's configurations. > I'm sure that when you tell me what the "null" notification period does, I'll understand how. For now, I don't, because it doesn't seem to be documented. > Something that might work, but unfortunately I can't try it as I've > already spent too much time on this email :-), is to set > notification_period to null in the template, which could mean that the > service definition will behave as though the notification_period value > is unset in the template. This would allow you to let the "null > notification period" template inherit from your normal template. > I'm not sure if you read my original email, but that's EXACTLY what I did. Tell me again how you don't understand why this is confusing. Snark aside, I appreciate the help and discussion. However, looking at the documentation again only convinces me more that the documentation and the code don't match up. Your confusion, too, lends credence to my position, IMO. ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mmelin at gmail.com Wed Dec 16 13:43:59 2009 From: mmelin at gmail.com (Martin Melin) Date: Wed, 16 Dec 2009 13:43:59 +0100 Subject: Notification period inheritance problem In-Reply-To: <4B28B0EF.3070002@eburg.com> References: <4B26DEC0.7010607@eburg.com> <4B27C881.6030401@eburg.com> <4B27EBB6.7050403@eburg.com> <85963A63-74F3-4292-8472-CE9DC45F22AD@ena.com> <4B281B3A.9040803@eburg.com> <4B28B0EF.3070002@eburg.com> Message-ID: On Wed, Dec 16, 2009 at 11:05 AM, Gordon Messmer wrote: > On 12/15/2009 11:03 PM, Martin Melin wrote: >> I don't understand why this is confusing. >> > ... >> By setting "notification_period null" in your service definition, you >> are explicitly overriding all forms of inheritance and setting the >> notification_period to null, which happens to be a special >> notification_period. >> > > Is that so? ?What does the special value do? ?In all of the > documentation, I only see "null" mentioned once. ?In a section labeled > "Cancelling Inheritance of String Values". ?The documentation indicates > that "null" can be used to prevent inheriting a value. ?Neither of those > things say to me, the user, that "null" is a special notification period. > >> So, the issue you are running into is due to you expecting implied >> inheritance to take precedence over values defined in the service >> definition itself. > > Only where the value is "null", which is documented only to prevent the > inheritance of a value. > >> This would break the documented and expected >> behavior of inheritance, and a lot of people's configurations. >> > > I'm sure that when you tell me what the "null" notification period does, > I'll understand how. ?For now, I don't, because it doesn't seem to be > documented. null is a value for a notification period that does not have any effect. Most config's I've seen have a timeperiod named "none", configured to never match. This accomplishes the same thing. Using null does not do anything special as far as inheritance is concerned - you could define a timeperiod call "foobar" that never matches, and use that instead of null. The problem is that both you and I got the impression that 'null' might instead reset a variable so that it behaves the same as being undefined, i.e. would not propagate to an implementing definition from a template. > >> Something that might work, but unfortunately I can't try it as I've >> already spent too much time on this email :-), is to set >> notification_period to null in the template, which could mean that the >> service definition will behave as though the notification_period value >> is unset in the template. This would allow you to let the "null >> notification period" template inherit from your normal template. >> > > I'm not sure if you read my original email, but that's EXACTLY what I > did. ?Tell me again how you don't understand why this is confusing. Interesting. I thought that you were setting the notification_period in the service definition, but I see now that it is in the service template. My mistake. > > Snark aside, I appreciate the help and discussion. ?However, looking at > the documentation again only convinces me more that the documentation > and the code don't match up. ?Your confusion, too, lends credence to my > position, IMO. I agree that the documentation can be clarified to explain that the value 'null' does not, as one might think, reset a variable to undefined. Best regards, Martin Melin ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ampranti at gmail.com Thu Dec 17 09:07:41 2009 From: ampranti at gmail.com (Brandino Andreas) Date: Thu, 17 Dec 2009 10:07:41 +0200 Subject: Nagios 30 seconds initial delay In-Reply-To: <137B4E3B8F30074EADD080AA8A6E2A93129324FACE@ZDE070.lenze.com> References: <137B4E3B8F30074EADD080AA8A6E2A93129324FACE@ZDE070.lenze.com> Message-ID: <541023876.20091217100741@gmail.com> Hi, I am not using NDO. Wednesday, December 16, 2009, 9:51:29 AM, you wrote: > Hi, > are you using ndo ? If yes, disable ndo and see if the problem still > persists. > Thomas > On Wed, 2009-12-16 at 00:22 +0000, Brandino Andreas wrote: >> Hi all, >> I am using Nagios 3.2.0 (I just upgrade from early 3.0 releases) >> Every time I restart nagios I face a 30 seconds delay before the >> various pages appear for first time (not before starting checks, 30 >> seconds before displaying pages!!!) >> >> When I click a page I get the error "Error: Could not read host and >> service status information!" . After 30 seconds, all pages appear >> again!! >> >> - I don't have duplicated nagios service running >> - This delay didn't show up to older versions >> - My linux is Debian 5.0.3 (stable) >> - This error appears every time I restart nagios >> >> Any idea what can cause this delay?? >> Thank you >> >> >> <> --- - - - --- <> >> Brandino Andreas >> ampranti at gmail.com >> <> --- - - - --- <> >> >> >> ------------------------------------------------------------------------------ >> This SF.Net email is sponsored by the Verizon Developer Community >> Take advantage of Verizon's best-in-class app development support >> A streamlined, 14 day to market process makes app distribution fast and easy >> Join now and get one step closer to millions of Verizon customers >> http://p.sf.net/sfu/verizon-dev2dev >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. >> ::: Messages without supporting info will risk being sent to /dev/null <> --- - - - --- <> Brandino Andreas ampranti at gmail.com <> --- - - - --- <> ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ampranti at gmail.com Thu Dec 17 09:12:12 2009 From: ampranti at gmail.com (Brandino Andreas) Date: Thu, 17 Dec 2009 10:12:12 +0200 Subject: Nagios 30 seconds initial delay In-Reply-To: References: <1595932556.20091216005707@gmail.com> Message-ID: <1786557943.20091217101212@gmail.com> I have much less hosts and services (for the moment). After deleting "retention.dat" I still face the same delay... Wednesday, December 16, 2009, 1:25:36 AM, you wrote: > On Tue, Dec 15, 2009 at 5:57 PM, Brandino Andreas wrote: >> When I click a page I get the error "Error: Could not read host and >> service status information!" . After 30 seconds, all pages appear >> again!! >> >> - I don't have duplicated nagios service running >> - This delay didn't show up to older versions >> - My linux is Debian 5.0.3 (stable) >> - This error appears every time I restart nagios > We typically see about the same start up time, both on 3.0.3 and 3.2.0 > on hosts with 1-2k hosts and ~10k services where we have retention.dat > files that are weeks or months old and we are using regular > expressions and service -> hostgroup mappings extensively. > - Max <> --- - - - --- <> Brandino Andreas ampranti at gmail.com <> --- - - - --- <> ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From benny at bennyvision.com Thu Dec 17 14:44:48 2009 From: benny at bennyvision.com (C. Bensend) Date: Thu, 17 Dec 2009 07:44:48 -0600 Subject: Anyone testing Microsoft Volume Shadow Copy? In-Reply-To: <90f6e8270912141152l68761fedk97bb235f8f49ac90@mail.gmail.com> References: <3d756a733b83a4283e25eaaf78e56f05.squirrel@webmail.stinkweasel.net> <90f6e8270912141152l68761fedk97bb235f8f49ac90@mail.gmail.com> Message-ID: <8bfd7eeae7b802d42cf149886d320e9d.squirrel@webmail.stinkweasel.net> > On Mon, Dec 14, 2009 at 8:17 PM, C. Bensend wrote: >> >> ? Looking at the vssadmin.exe command on the Windows server, I see >> I can: >> >> >> vssadmin List Shadows >> >> >> ? And it prints out a listing of the existing shadow copies. >> However, without awk and grep (and no, I cannot install cygwin or >> anything like that), I don't have a clue how to process it. > > you'll have to parse the output of that command with the tools at > hand: vbscript or powershell, even and oldfashined bat file. Write > your own plugin, it is not that hard. Once you have the script, you > have it executed by nrpe (nsclient++ has an nrpe client). Just an FYI, after I spent an hour or two learning VBscript and writing a shell wrapper to parse the output, I found: http://exchange.nagios.org/directory/Plugins/%2A-Plugin-Packages/Nagios-Plugin-Collection/details which includes a test for Shadow Copies. Sigh. Oh well, good learning experience. :) Benny -- "It's not all about getting up and putting four slices of kickass in a two slice toaster." -- ark86, on Fazed.net ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From EWScott at scotborders.gov.uk Thu Dec 17 16:52:39 2009 From: EWScott at scotborders.gov.uk (Scott, Ewan) Date: Thu, 17 Dec 2009 15:52:39 +0000 Subject: conftest errors on configure of source nagios-3.2.0 Message-ID: <771645F4ADF2C9449B7E6AE630C720018BC4BD22E8@HQ-MAIL-01.scotborders.gov.uk> Hi I am trying to install nagios3 from source onto a ubuntu server. When running the ./configure statement I get the following error in the config.log. (Other errors for conftest then follow. ) . . . configure:2230: test -s conftest.o configure:2233: $? = 0 configure:2251: result: none needed configure:2269: gcc -c -g -O2 conftest.c >&5 conftest.c:2: error: expected '=', ',', ';', 'asm' or '__attribute__' before 'me ' configure:2275: $? = 1 configure: failed program was: | #ifndef __cplusplus | choke me | #endif configure:2408: checking whether make sets $(MAKE) . . . I have checked the packages installed and the basic configuration and I believe these to be as required. I have googled extensively and beyond checking the packages installed I can't see what changes I can make. Any help gratefully received. Supporting information listed below. Regards Ewan 1. Ununtu server This is 64-bit. root at sbc-omega-test:~/downloads/nagios-3.2.0# uname -a Linux sbc-omega-test 2.6.31-14-server #48-Ubuntu SMP Fri Oct 16 15:07:34 UTC 2009 x86_64 GNU/Linux Install type - LAMP Packages then installed with apt-get: apache2 build-essential libgd2-xpm-dev 1. Users and groups root at sbc-omega-test:~/downloads/nagios-3.2.0# egrep 'nagios|www' /etc/passwd www-data:x:33:33:www-data:/var/www:/bin/sh nagios:x:1001:1001::/home/nagios:/bin/sh root at sbc-omega-test:~/downloads/nagios-3.2.0# egrep nag /etc/group nagios:x:1001: nagcmd:x:1002:www-data,nagios 1. nagios install info version - 3.2.0 ===== #gcc -v ====== Using built-in specs. Target: x86_64-linux-gnu Configured with: ../src/configure -v --with-pkgversion='Ubuntu 4.4.1-4ubuntu8' --with-bugurl=file:///usr/share/doc/gcc-4.4/README.Bugs --enable-languages=c,c++,fortran,objc,obj-c++ --prefix=/usr --enable-shared --enable-multiarch --enable-linker-build-id --with-system-zlib --libexecdir=/usr/lib --without-included-gettext --enable-threads=posix --with-gxx-include-dir=/usr/include/c++/4.4 --program-suffix=-4.4 --enable-nls --enable-clocale=gnu --enable-libstdcxx-debug --enable-objc-gc --disable-werror --with-arch-32=i486 --with-tune=generic --enable-checking=release --build=x86_64-linux-gnu --host=x86_64-linux-gnu --target=x86_64-linux-gnu Thread model: posix gcc version 4.4.1 (Ubuntu 4.4.1-4ubuntu8) root at sbc-omega-test:~/downloads/nagios-3.2.0# config.log file (part) ================ This file contains any messages produced by compilers while running configure, to aid debugging if configure makes a mistake. It was created by configure, which was generated by GNU Autoconf 2.59. Invocation command line was $ ./configure --with-command-group=nagcmd ## --------- ## ## Platform. ## ## --------- ## hostname = sbc-omega-test uname -m = x86_64 uname -r = 2.6.31-14-server uname -s = Linux uname -v = #48-Ubuntu SMP Fri Oct 16 15:07:34 UTC 2009 /usr/bin/uname -p = unknown /bin/uname -X = unknown /bin/arch = unknown /usr/bin/arch -k = unknown /usr/convex/getsysinfo = unknown hostinfo = unknown /bin/machine = unknown /usr/bin/oslevel = unknown /bin/universe = unknown PATH: /usr/local/sbin PATH: /usr/local/bin PATH: /usr/sbin PATH: /usr/bin PATH: /sbin PATH: /bin ## ----------- ## ## Core tests. ## ## ----------- ## configure:1355: checking for a BSD-compatible install configure:1410: result: /usr/bin/install -c configure:1429: checking build system type configure:1447: result: x86_64-unknown-linux-gnu configure:1455: checking host system type configure:1469: result: x86_64-unknown-linux-gnu configure:1524: checking for gcc configure:1540: found /usr/bin/gcc configure:1550: result: gcc configure:1794: checking for C compiler version configure:1797: gcc --version &5 gcc (Ubuntu 4.4.1-4ubuntu8) 4.4.1 Copyright (C) 2009 Free Software Foundation, Inc. This is free software; see the source for copying conditions. There is NO warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. configure:1800: $? = 0 configure:1802: gcc -v &5 Using built-in specs. Target: x86_64-linux-gnu Configured with: ../src/configure -v --with-pkgversion='Ubuntu 4.4.1-4ubuntu8' - -with-bugurl=file:///usr/share/doc/gcc-4.4/README.Bugs --enable-languages=c,c++, fortran,objc,obj-c++ --prefix=/usr --enable-shared --enable-multiarch --enable-l inker-build-id --with-system-zlib --libexecdir=/usr/lib --without-included-gette xt --enable-threads=posix --with-gxx-include-dir=/usr/include/c++/4.4 --program- suffix=-4.4 --enable-nls --enable-clocale=gnu --enable-libstdcxx-debug --enable- objc-gc --disable-werror --with-arch-32=i486 --with-tune=generic --enable-checki ng=release --build=x86_64-linux-gnu --host=x86_64-linux-gnu --target=x86_64-linu x-gnu Thread model: posix gcc version 4.4.1 (Ubuntu 4.4.1-4ubuntu8) configure:1805: $? = 0 configure:1807: gcc -V &5 gcc: '-V' option must have argument configure:1810: $? = 1 configure:1833: checking for C compiler default output file name configure:1836: gcc conftest.c >&5 configure:1839: $? = 0 configure:1885: result: a.out configure:1890: checking whether the C compiler works configure:1896: ./a.out configure:1899: $? = 0 configure:1916: result: yes configure:1923: checking whether we are cross compiling configure:1925: result: no configure:1928: checking for suffix of executables configure:1930: gcc -o conftest conftest.c >&5 configure:1933: $? = 0 configure:1958: result: configure:1964: checking for suffix of object files configure:1985: gcc -c conftest.c >&5 configure:1988: $? = 0 configure:2010: result: o configure:2014: checking whether we are using the GNU C compiler configure:2038: gcc -c conftest.c >&5 configure:2044: $? = 0 configure:2048: test -z || test ! -s conftest.err configure:2051: $? = 0 configure:2054: test -s conftest.o configure:2057: $? = 0 configure:2070: result: yes configure:2076: checking whether gcc accepts -g configure:2097: gcc -c -g conftest.c >&5 configure:2103: $? = 0 configure:2107: test -z || test ! -s conftest.err configure:2110: $? = 0 configure:2113: test -s conftest.o configure:2116: $? = 0 configure:2127: result: yes configure:2144: checking for gcc option to accept ANSI C configure:2214: gcc -c -g -O2 conftest.c >&5 configure:2220: $? = 0 configure:2224: test -z || test ! -s conftest.err configure:2227: $? = 0 configure:2230: test -s conftest.o configure:2233: $? = 0 configure:2251: result: none needed configure:2269: gcc -c -g -O2 conftest.c >&5 conftest.c:2: error: expected '=', ',', ';', 'asm' or '__attribute__' before 'me ' configure:2275: $? = 1configure: failed program was: | #ifndef __cplusplus | choke me | #endif configure:2408: checking whether make sets $(MAKE) configure:2428: result: yes configure:2439: checking for strip configure:2457: found /usr/bin/strip configure:2470: result: /usr/bin/strip configure:2484: checking how to run the C preprocessor configure:2519: gcc -E conftest.c configure:2525: $? = 0 configure:2557: gcc -E conftest.c conftest.c:9:28: error: ac_nonexistent.h: No such file or directory configure:2563: $? = 1 configure: failed program was: | /* confdefs.h. */ | | #define PACKAGE_NAME "" | #define PACKAGE_TARNAME "" | #define PACKAGE_VERSION "" | #define PACKAGE_STRING "" | #define PACKAGE_BUGREPORT "" | /* end confdefs.h. */ | #include configure:2602: result: gcc -E configure:2626: gcc -E conftest.c configure:2632: $? = 0 configure:2664: gcc -E conftest.c conftest.c:9:28: error: ac_nonexistent.h: No such file or directory configure:2670: $? = 1 configure: failed program was: | /* confdefs.h. */ | | #define PACKAGE_NAME "" | #define PACKAGE_TARNAME "" | #define PACKAGE_VERSION "" | #define PACKAGE_STRING "" | #define PACKAGE_BUGREPORT "" | /* end confdefs.h. */ | #include configure:2714: checking for egrep configure:2724: result: grep -E configure:2729: checking for ANSI C header files configure:2754: gcc -c -g -O2 conftest.c >&5 configure:2760: $? = 0 configure:2764: test -z || test ! -s conftest.err configure:2767: $? = 0 configure:2770: test -s conftest.o configure:2773: $? = 0 configure:2862: gcc -o conftest -g -O2 conftest.c >&5 conftest.c: In function 'main': conftest.c:26: warning: incompatible implicit declaration of built-in function ' exit' configure:2865: $? = 0 configure:2867: ./conftest configure:2870: $? = 0 configure:2885: result: yes . . . . ###other error lines: conftest.c:9:28: error: ac_nonexistent.h: No such file or directory conftest.c:68:22: error: pthreads.h: No such file or directory conftest.c:34:22: error: pthreads.h: No such file or directory conftest.c:71:20: error: socket.h: No such file or directory conftest.c:37:20: error: socket.h: No such file or directory conftest.c:87:17: error: uio.h: No such file or directory conftest.c:53:17: error: uio.h: No such file or directory . . . . ###end of file: #define USE_NANOSLEEP 1 #define USE_STATUSMAP 1 #define USE_STATUSWRL 1 #define USE_TRENDS 1 #define USE_XCDDEFAULT 1 #define USE_XDDDEFAULT 1 #define USE_XODTEMPLATE 1 #define USE_XPDDEFAULT 1 #define USE_XRDDEFAULT 1 #define USE_XSDDEFAULT 1 configure: exit 0 1. dpkg installed package info adduser install apache2 install apache2-mpm-prefork install apache2-utils install apache2.2-bin install apache2.2-common install apparmor install apparmor-utils install apport install apport-symptoms install apt install apt-transport-https install apt-utils install aptitude install at install base-files install base-passwd install bash install bash-completion install bind9-host install binutils install bsdmainutils install bsdutils install build-essential install busybox-initramfs install byobu install bzip2 install ca-certificates install command-not-found install command-not-found-data install console-setup install console-terminus install coreutils install cpio install cpp install cpp-4.4 install cron install dash install debconf install debconf-i18n install debianutils install defoma install dhcp3-client install dhcp3-common install diff install dmidecode install dmsetup install dnsutils install dosfstools install dpkg install dpkg-dev install e2fslibs install e2fsprogs install ed install eject install exim4 install exim4-base install exim4-config install exim4-daemon-light install fakeroot install file install findutils install finger install fontconfig install fontconfig-config install friendly-recovery install ftp install ftpd install fuse-utils install g++ install g++-4.4 install gcc install gcc-4.4 install gcc-4.4-base install geoip-database install gettext-base install gnupg install gpgv install grep install groff-base install grub-common install grub-pc install guile-1.8-libs install gzip install hdparm install hicolor-icon-theme install hostname install ifupdown install info install initramfs-tools install initscripts install insserv install install-info install installation-report install iproute install iptables install iputils-arping install iputils-ping install iputils-tracepath install iso-codes install kbd install klibc-utils install landscape-common install language-pack-en install language-pack-en-base install language-selector-common install laptop-detect install less install libacl1 install libapache2-mod-php5 install libapparmor-perl install libapparmor1 install libapr1 install libaprutil1 install libaprutil1-dbd-sqlite3 install libaprutil1-ldap install libatk1.0-0 install libatk1.0-data install libatm1 install libattr1 install libavahi-client3 install libavahi-common-data install libavahi-common3 install libbind9-50 install libblkid1 install libbsd0 install libbz2-1.0 install libc-bin install libc-dev-bin install libc6 install libc6-dev install libcairo2 install libcap2 install libclass-accessor-perl install libcomerr2 install libcompress-bzip2-perl install libcups2 install libcurl3-gnutls install libcwidget3 install libdatrie1 install libdb4.6 install libdb4.7 install libdbd-mysql-perl install libdbi-perl install libdbus-1-3 install libdbus-glib-1-2 install libdevmapper1.02.1 install libdirectfb-1.2-0 install libdns50 install libedit2 install libelf1 install libept0 install libexpat1 install libexpat1-dev install libffi5 install libfont-afm-perl install libfontconfig1 install libfontconfig1-dev install libfontenc1 install libfreetype6 install libfreetype6-dev install libfribidi0 install libfuse2 install libgc1c2 install libgcc1 install libgcrypt11 install libgd2-xpm install libgd2-xpm-dev install libgdbm3 install libgeoip1 install libglade2-0 install libglib2.0-0 install libglib2.0-data install libgmp3c2 install libgnutls26 install libgomp1 install libgpg-error0 install libgpm2 install libgsasl7 install libgssapi-krb5-2 install libgtk2.0-0 install libgtk2.0-bin install libgtk2.0-common install libhtml-format-perl install libhtml-parser-perl install libhtml-tagset-perl install libhtml-template-perl install libhtml-tree-perl install libidn11 install libio-string-perl install libisc50 install libisccc50 install libisccfg50 install libiw29 install libjasper1 install libjpeg62 install libjpeg62-dev install libjs-jquery install libk5crypto3 install libkeyutils1 install libklibc install libkrb5-3 install libkrb5support0 install libldap-2.4-2 install liblocale-gettext-perl install liblockfile1 install libltdl7 install liblwres50 install libmagic1 install libmailtools-perl install libmailutils2 install libmpfr1ldbl install libmysqlclient15off install libmysqlclient16 install libncurses5 install libncursesw5 install libnet-daemon-perl install libnewt0.52 install libnl1 install libntfs-3g54 install libntlm0 install libpam-modules install libpam-runtime install libpam0g install libpango1.0-0 install libpango1.0-common install libparse-debianchangelog-perl install libparted1.8-12 install libpcap0.8 install libpci3 install libpcre3 install libpcsclite1 install libpixman-1-0 install libplrpc-perl install libpng12-0 install libpng12-dev install libpopt0 install libpthread-stubs0 install libpthread-stubs0-dev install libpython2.6 install libreadline5 install libreadline6 install librpc-xml-perl install libsasl2-2 install libsasl2-modules install libselinux1 install libsepol1 install libsigc++-2.0-0c2a install libslang2 install libsqlite3-0 install libss2 install libssl0.9.8 install libstdc++6 install libstdc++6-4.4-dev install libsub-name-perl install libsysfs2 install libt1-5 install libtasn1-3 install libterm-readkey-perl install libtext-charwidth-perl install libtext-iconv-perl install libtext-wrapi18n-perl install libthai-data install libthai0 install libtiff4 install libtimedate-perl install libts-0.0-0 install libudev0 install liburi-perl install libusb-0.1-4 install libuser1 install libuuid1 install libwrap0 install libwww-perl install libx11-6 install libx11-data install libx11-dev install libxapian15 install libxau-dev install libxau6 install libxcb-render-util0 install libxcb-render0 install libxcb1 install libxcb1-dev install libxcomposite1 install libxcursor1 install libxdamage1 install libxdmcp-dev install libxdmcp6 install libxext6 install libxfixes3 install libxfont1 install libxft2 install libxi6 install libxinerama1 install libxml-parser-perl install libxml2 install libxmuu1 install libxpm-dev install libxpm4 install libxrandr2 install libxrender1 install linux-firmware install linux-headers-2.6.31-14 install linux-headers-2.6.31-14-server install linux-headers-server install linux-image-2.6.31-14-server install linux-image-server install linux-libc-dev install linux-server install locales install lockfile-progs install login install logrotate install lsb-base install lsb-release install lshw install lsof install ltrace install lvm2 install lzma install mailutils install make install makedev install man-db install manpages install mawk install memtest86+ install mime-support install mlocate install module-init-tools install mount install mountall install mtr-tiny install mysql-client-5.1 install mysql-common install mysql-server install mysql-server-5.1 install mysql-server-core-5.1 install nano install ncurses-base install ncurses-bin install net-tools install netbase install netcat install netcat-traditional install ntfs-3g install ntpdate install openbsd-inetd install openssh-client install openssh-server install openssl install os-prober install parted install passwd install patch install pciutils install perl install perl-base install perl-modules install php5 install php5-common install php5-gd install php5-mysql deinstall pkg-config install popularity-contest install powermgmt-base install ppp install pppconfig install pppoeconf install procps install psmisc install python install python-apport install python-apt install python-central install python-dbus install python-gdbm install python-gnupginterface install python-gobject install python-httplib2 install python-launchpadlib install python-lazr-restfulclient install python-lazr-uri install python-minimal install python-newt install python-oauth install python-openssl install python-pam install python-pexpect install python-pkg-resources install python-problem-report install python-pycurl install python-serial install python-simplejson install python-smartpm install python-support install python-twisted-bin install python-twisted-core install python-wadllib install python-zope.interface install python2.6 install python2.6-minimal install readline-common install rsync install rsyslog install screen install sed install sgml-base install shared-mime-info install ssh install ssl-cert install strace install sudo install sysv-rc install sysvinit-utils install tar install tasksel install tasksel-data install tcpd install tcpdump install telnet install telnetd install time install tsconf install ttf-dejavu install ttf-dejavu-core install ttf-dejavu-extra install tzdata install ubuntu-keyring install ubuntu-minimal install ubuntu-serverguide install ubuntu-standard install ucf install udev install ufw install update-inetd install update-manager-core install update-notifier-common install upstart install usbutils install usermode deinstall util-linux install uuid-runtime install vim install vim-common install vim-runtime install vim-tiny install w3m install watershed install wget install whiptail install wireless-crda install wireless-tools install wpasupplicant install x-ttcidfont-conf install x11-common install x11proto-core-dev install x11proto-input-dev install x11proto-kb-dev install xauth install xfonts-encodings install xfonts-utils install xkb-data install xml-core install xtrans-dev install zlib1g install zlib1g-dev install === end ==== ********************************************************************** This email and any files transmitted with it are privileged, confidential and subject to copyright. Any unauthorised use or disclosure of any part of this email is prohibited. If you are not the intended recipient please inform the sender immediately; you should then delete the email and remove any copies from your system. The views or opinions expressed in this communication may not necessarily be those of Scottish Borders Council. Please be advised that Scottish Borders Council's incoming and outgoing email is subject to regular monitoring and any email may require to be disclosed by the Council under the provisions of the Freedom of Information (Scotland) Act 2002. ********************************************************************** -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Thu Dec 17 18:34:27 2009 From: marc at ena.com (Marc Powell) Date: Thu, 17 Dec 2009 11:34:27 -0600 Subject: conftest errors on configure of source nagios-3.2.0 In-Reply-To: <771645F4ADF2C9449B7E6AE630C720018BC4BD22E8@HQ-MAIL-01.scotborders.gov.uk> References: <771645F4ADF2C9449B7E6AE630C720018BC4BD22E8@HQ-MAIL-01.scotborders.gov.uk> Message-ID: On Dec 17, 2009, at 9:52 AM, Scott, Ewan wrote: > Hi > > I am trying to install nagios3 from source onto a ubuntu server. When running the ./configure statement I get the following error in the config.log. (Other errors for conftest then follow. ) > > . > . > . > configure:2230: test -s conftest.o > configure:2233: $? = 0 > configure:2251: result: none needed > configure:2269: gcc -c -g -O2 conftest.c >&5 > conftest.c:2: error: expected '=', ',', ';', 'asm' or '__attribute__' before 'me > ' > configure:2275: $? = 1 > configure: failed program was: > | #ifndef __cplusplus > | choke me > | #endif > configure:2408: checking whether make sets $(MAKE) > . > . > . > Why do you think this is a problem? Looks normal to me. Not everything configure tests will succeed. That's kind of the point of configure; to figure out what it can and can't do and what options/features/programs are available to it on the current system. Did configure bail or complete successfully? If it bailed, post the output leading up to the bail. -- Marc ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From dermoth at aei.ca Fri Dec 18 05:03:44 2009 From: dermoth at aei.ca (Thomas Guyot-Sionnest) Date: Thu, 17 Dec 2009 23:03:44 -0500 Subject: Nagios 30 seconds initial delay In-Reply-To: <1595932556.20091216005707@gmail.com> References: <1595932556.20091216005707@gmail.com> Message-ID: <4B2AFF20.4060102@aei.ca> On 15/12/09 05:57 PM, Brandino Andreas wrote: > Hi all, > I am using Nagios 3.2.0 (I just upgrade from early 3.0 releases) > Every time I restart nagios I face a 30 seconds delay before the > various pages appear for first time (not before starting checks, 30 > seconds before displaying pages!!!) > > When I click a page I get the error "Error: Could not read host and > service status information!" . After 30 seconds, all pages appear > again!! > > - I don't have duplicated nagios service running > - This delay didn't show up to older versions > - My linux is Debian 5.0.3 (stable) > - This error appears every time I restart nagios > > Any idea what can cause this delay?? What is your status_update_interval (in nagios.cfg)? I'm guessing if it's set to 30 seconds then maybe status.dat isn't getting created until the first 30 seconds pass, which would explain this. Unless you have a large number of services you can set this pretty low without impacting performance, and a lower value gives better responsiveness on the CGI. -- Thomas ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ampranti at gmail.com Fri Dec 18 09:22:46 2009 From: ampranti at gmail.com (Brandino Andreas) Date: Fri, 18 Dec 2009 10:22:46 +0200 Subject: Nagios 30 seconds initial delay In-Reply-To: <4B2AFF20.4060102@aei.ca> References: <1595932556.20091216005707@gmail.com> <4B2AFF20.4060102@aei.ca> Message-ID: <1331728827.20091218102246@gmail.com> Hi all, my status_update_interval value: "status_update_interval=10" Nagios need 35 seconds to show up... Friday, December 18, 2009, 6:03:44 AM, you wrote: > On 15/12/09 05:57 PM, Brandino Andreas wrote: >> Hi all, >> I am using Nagios 3.2.0 (I just upgrade from early 3.0 releases) >> Every time I restart nagios I face a 30 seconds delay before the >> various pages appear for first time (not before starting checks, 30 >> seconds before displaying pages!!!) >> >> When I click a page I get the error "Error: Could not read host and >> service status information!" . After 30 seconds, all pages appear >> again!! >> >> - I don't have duplicated nagios service running >> - This delay didn't show up to older versions >> - My linux is Debian 5.0.3 (stable) >> - This error appears every time I restart nagios >> >> Any idea what can cause this delay?? > What is your status_update_interval (in nagios.cfg)? > I'm guessing if it's set to 30 seconds then maybe status.dat isn't > getting created until the first 30 seconds pass, which would explain this. > Unless you have a large number of services you can set this pretty low > without impacting performance, and a lower value gives better > responsiveness on the CGI. <> --- - - - --- <> Brandino Andreas ampranti at gmail.com <> --- - - - --- <> ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ampranti at gmail.com Fri Dec 18 13:25:01 2009 From: ampranti at gmail.com (Brandino Andreas) Date: Fri, 18 Dec 2009 14:25:01 +0200 Subject: Nagios 30 seconds initial delay In-Reply-To: <137B4E3B8F30074EADD080AA8A6E2A9312931C73F1@ZDE070.lenze.com> References: <1595932556.20091216005707@gmail.com> <4B2AFF20.4060102@aei.ca> <1331728827.20091218102246@gmail.com> <137B4E3B8F30074EADD080AA8A6E2A9312931C73F1@ZDE070.lenze.com> Message-ID: <76277075.20091218142501@gmail.com> Exactly what I was thinking! I just asked, because older versions haven't introduce that delay. Moreover, I was searching if this was a "normal" nagios delay or a misconfiguration on my side (interoperability with some package installed or something I can't think at the moment) Friday, December 18, 2009, 12:13:53 PM, you wrote: > Hi, > I'm experiencing exactly the same problem. I always thought, that > this behaviour is caused by ndo (which seems not to be the case). > Looking into the nagios log I recognized that the initial state > info is loaded within 9 seconds into nagios - so the cgi should be > ready to go. It's somehow strange: you cannot view the tactical > overview, host or service information. But during that 30 s you can have a look at the Alert history. > Perhaps nagios needs the missing 20s to build up some internal data > structures, plan the scheduling or so ? > Thomas > Von: Brandino Andreas [mailto:ampranti at gmail.com] > Gesendet: Freitag, 18. Dezember 2009 09:23 > An: Thomas Guyot-Sionnest > Cc: nagios-users at lists.sourceforge.net > Betreff: Re: [Nagios-users] Nagios 30 seconds initial delay > Hi all, > my status_update_interval value: "status_update_interval=10" > Nagios need 35 seconds to show up... > Friday, December 18, 2009, 6:03:44 AM, you wrote: >> On 15/12/09 05:57 PM, Brandino Andreas wrote: >>> Hi all, >>> I am using Nagios 3.2.0 (I just upgrade from early 3.0 releases) >>> Every time I restart nagios I face a 30 seconds delay before the >>> various pages appear for first time (not before starting checks, 30 >>> seconds before displaying pages!!!) >>> >>> When I click a page I get the error "Error: Could not read host and >>> service status information!" . After 30 seconds, all pages appear >>> again!! >>> >>> - I don't have duplicated nagios service running >>> - This delay didn't show up to older versions >>> - My linux is Debian 5.0.3 (stable) >>> - This error appears every time I restart nagios >>> >>> Any idea what can cause this delay?? >> What is your status_update_interval (in nagios.cfg)? >> I'm guessing if it's set to 30 seconds then maybe status.dat isn't >> getting created until the first 30 seconds pass, which would explain this. >> Unless you have a large number of services you can set this pretty low >> without impacting performance, and a lower value gives better >> responsiveness on the CGI. > <> --- - - - --- <> > Brandino Andreas > ampranti at gmail.com > <> --- - - - --- <> > ------------------------------------------------------------------------------ > This SF.Net email is sponsored by the Verizon Developer Community > Take advantage of Verizon's best-in-class app development support A > streamlined, 14 day to market process makes app distribution fast > and easy Join now and get one step closer to millions of Verizon > customers http://p.sf.net/sfu/verizon-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null <> --- - - - --- <> Brandino Andreas ampranti at gmail.com <> --- - - - --- <> ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Fri Dec 18 15:08:05 2009 From: marc at ena.com (Marc Powell) Date: Fri, 18 Dec 2009 08:08:05 -0600 Subject: conftest errors on configure of source nagios-3.2.0 In-Reply-To: <771645F4ADF2C9449B7E6AE630C720018BC4BD22EB@HQ-MAIL-01.scotborders.gov.uk> References: <771645F4ADF2C9449B7E6AE630C720018BC4BD22E8@HQ-MAIL-01.scotborders.gov.uk> <771645F4ADF2C9449B7E6AE630C720018BC4BD22EB@HQ-MAIL-01.scotborders.gov.uk> Message-ID: <3E4404A2-9B38-41D1-A7C9-B24B9744C587@ena.com> Please always respond on list so that everyone sees, follows and benefits from the conversation. More below... On Dec 18, 2009, at 4:23 AM, Scott, Ewan wrote: > Marc > Thanks for this. The log file shows a normal end -see below. So, from what you say, I should just continue with the install. Yes, if the console output from configure indicated so (I'll bet it did). > My inexperience with linux compiling I'm afraid. I didn't want to end up with problems further down the line that could be traced back to the compile. *nod*. You should rarely, if ever, have to look at config.log. Only if configure fails or shows some type of error and you know specifically what to look for in config.log to help figure out why it failed. Generally the ./configure console output will tell you what you need to know though. Note that 99% of that output is informational. Developers tend to make the things that are really bad really stand out. -- Marc ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From kgoutos at libertymgt.com Fri Dec 18 14:40:19 2009 From: kgoutos at libertymgt.com (Goutos, Kevin) Date: Fri, 18 Dec 2009 08:40:19 -0500 Subject: Nagios Alerts per Group Message-ID: Hello everyone, I'm using Nagios 3.2 to monitor approximately 100 servers, switches, and routers. I have them broken down into multiple groups. Basically, my problem is that when the network goes down for example, I'm getting massive texts from every single device going down, when I would much rather just say "A host in the 'Router' group is down". Is there a way to do this? I would really appreciate it! Kevin -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Fri Dec 18 16:44:15 2009 From: nagios at flatto.net (Assaf Flatto) Date: Fri, 18 Dec 2009 15:44:15 +0000 Subject: Nagios Alerts per Group In-Reply-To: References: Message-ID: <4B2BA34F.1000106@flatto.net> hello kevin what you need to set up is dependencies and parent/child relationships . have a look here http://nagios.sourceforge.net/docs/3_0/dependencies.html Assaf Goutos, Kevin wrote: > > Hello everyone, > > I?m using Nagios 3.2 to monitor approximately 100 servers, switches, > and routers. I have them broken down into multiple groups. Basically, > my problem is that when the network goes down for example, I?m getting > massive texts from every single device going down, when I would much > rather just say ?A host in the ?Router? group is down?. Is there a way > to do this? I would really appreciate it! > > Kevin > > ------------------------------------------------------------------------ > > ------------------------------------------------------------------------------ > This SF.Net email is sponsored by the Verizon Developer Community > Take advantage of Verizon's best-in-class app development support > A streamlined, 14 day to market process makes app distribution fast and easy > Join now and get one step closer to millions of Verizon customers > http://p.sf.net/sfu/verizon-dev2dev > ------------------------------------------------------------------------ > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Fri Dec 18 17:14:09 2009 From: marc at ena.com (Marc Powell) Date: Fri, 18 Dec 2009 10:14:09 -0600 Subject: Nagios Alerts per Group In-Reply-To: <4B2BA34F.1000106@flatto.net> References: <4B2BA34F.1000106@flatto.net> Message-ID: On Dec 18, 2009, at 9:44 AM, Assaf Flatto wrote: > hello kevin > > what you need to set up is dependencies and parent/child relationships . > > have a look here > http://nagios.sourceforge.net/docs/3_0/dependencies.html Parent/child is easier than dependencies and should be sufficient... http://nagios.sourceforge.net/docs/3_0/networkreachability.html -- Marc ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From doepain at gmail.com Sat Dec 19 16:45:24 2009 From: doepain at gmail.com (dOE) Date: Sat, 19 Dec 2009 10:45:24 -0500 Subject: Nagios monitor for VMware ESXi (free edition) In-Reply-To: References: <24B6509E4191AF44B60A24EAA3B4AD49361A95@nuexchg.norwich.edu> Message-ID: I am hoping to monitor some hardware with the HP agents that are pre-built in the ESXi (free edition) build we are using. On Mon, Dec 7, 2009 at 12:25 PM, dOE wrote: > From the VMware Infrastructure client under "Health Check" you ca see vital > hardware health statistics such as processor, memory, hard drives power > supplies, and fans. The information is gathered by WBEM, and can be done > with HP SIM too. The script is able to poll this information and return the > values of all of the hardware under one monitor. That is good enough for me > because if a memory module goes bad the monitor will go red and prompt us to > investigate cause. I am trying to do this with ESX*i* which does not > support SNMP, not ESX. > > > On Mon, Dec 7, 2009 at 12:19 PM, James Pratt wrote: > >> >> >> >> -----Original Message----- >> >> From: dOE [mailto:doepain at gmail.com] >> >> Sent: Monday, December 07, 2009 12:13 PM >> >> To: Nagios User-List >> >> Subject: [Nagios-users] Nagios monitor for VMware ESXi (free edition) >> >> >> >> I have created a monitor using the "check_esxwbem.py", but it returns >> an "OK" and >> >> null It is not doing what it is intended to do. >> >> The script pulls the hardware resources of the host server through >> WBEM. I know >> >> WBEM is working because I am able to pull this information from HP >> SIM, but I want >> >> Nagios to be my one stop shop for monitoring. >> >> What exactly are you trying to monitor? Raid? I have some snmp stuff for >> using the HP agents on ESX, but I can't help without more info. >> >> Regards, >> jamie >> >> > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From natxo.asenjo at gmail.com Sun Dec 20 14:56:31 2009 From: natxo.asenjo at gmail.com (Natxo Asenjo) Date: Sun, 20 Dec 2009 14:56:31 +0100 Subject: Nagios monitor for VMware ESXi (free edition) In-Reply-To: References: <24B6509E4191AF44B60A24EAA3B4AD49361A95@nuexchg.norwich.edu> Message-ID: <90f6e8270912200556v28925254y5903430df01275d1@mail.gmail.com> On Sat, Dec 19, 2009 at 4:45 PM, dOE wrote: > I am hoping to monitor some hardware with the HP agents that are pre-built > in the ESXi (free edition) build we are using. we use http://labs.consol.de/lang/en/nagios/check_hpasm/ to check all our proliant servers (regardless of operating system). It works very fine, but you need to compile the plugin yourself. natxo ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rperezm at uci.cu Sun Dec 20 20:46:03 2009 From: rperezm at uci.cu (ReynierPM) Date: Sun, 20 Dec 2009 14:46:03 -0500 Subject: Nagios monitor for VMware ESXi (free edition) In-Reply-To: <90f6e8270912200556v28925254y5903430df01275d1@mail.gmail.com> References: <24B6509E4191AF44B60A24EAA3B4AD49361A95@nuexchg.norwich.edu> <90f6e8270912200556v28925254y5903430df01275d1@mail.gmail.com> Message-ID: <4B2E7EFB.5070305@uci.cu> On 12/20/2009 8:56 am, Natxo Asenjo wrote: > On Sat, Dec 19, 2009 at 4:45 PM, dOE wrote: >> I am hoping to monitor some hardware with the HP agents that are pre-built >> in the ESXi (free edition) build we are using. > > we use http://labs.consol.de/lang/en/nagios/check_hpasm/ to check all > our proliant servers (regardless of operating system). It works very > fine, but you need to compile the plugin yourself. > Nice, I'll use on some Proliant Servers I have but what about Blade? I have 5 Blade and need to check it, any way? -- Saludos Ing. Reynier P?rez Mira ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From yu.watanabe at jp.fujitsu.com Mon Dec 21 02:26:42 2009 From: yu.watanabe at jp.fujitsu.com (Yu Watanabe) Date: Mon, 21 Dec 2009 10:26:42 +0900 Subject: Nagios is logging "Service Check Timed Out" for certain service In-Reply-To: <765d77c80912010859w11a4d758iec8b8e810c65112f@mail.gmail.com> References: <765d77c80912010859w11a4d758iec8b8e810c65112f@mail.gmail.com> Message-ID: <200912210126.AA01354@S2007337.jp.fujitsu.com> Hello Jim. I still cannnot solve why this had occured. So , would you please give us an advice? In the previous mail , I have realized that the detail about the situation was not articulated. Following are the details about the situation: 1. About the log checking process: There is one active check and one passive service check to do the job. The active check service scans the log file using the pre defined regexpression. If there are any lines that matches the regex, it send the result to the passive service. 2. Only the active check for log checkin process returns "Service Timed Out" All the other services but the corresponding active check were returning proper check result. 3. The active check itself was not executed. This, I found out by writing a debug line in the active check plugin. I wrote a debug line at the very top of the source code but even though the check time came , the debug log wasn't created. The most suspicious fact is 2 and 3. If the the reason was completely dependent on the plugin, there must be some kind of debug log created, but since the plugin itself was not executed , it is becoming a little bit tricky. Would it possibe to here your opinion about this ? Thank you Yu Watanabe Jim Avery ????????: >2009/12/1 Yu Watanabe : >> Hello Jim. >> >> Thank you for the reply and sorry for the late reply from me. >> >> Well , my situation was using plugin that scans through the syslog file and >> whenever any regular expression match occurs it sends an passive check alert to nagios. >> >> Weird thing was there was existing log file but nagios plugin itself >> was not executed. >> >> Yu Watanabe > >Is your problem solved now? If not, the first thing which I would >mention is that if it is a passive check you almost certainly want to >disable active checks for that service in Nagios. > >Cheers, > >Jim > >------------------------------------------------------------------------------ >Join us December 9, 2009 for the Red Hat Virtual Experience, >a free event focused on virtualization and cloud computing. >Attend in-depth sessions from your desk. Your couch. Anywhere. >http://p.sf.net/sfu/redhat-sfdev2dev >_______________________________________________ >Nagios-users mailing list >Nagios-users at lists.sourceforge.net >https://lists.sourceforge.net/lists/listinfo/nagios-users >::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. >::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From yinyang at eburg.com Mon Dec 21 09:17:28 2009 From: yinyang at eburg.com (Gordon Messmer) Date: Mon, 21 Dec 2009 00:17:28 -0800 Subject: Notification period inheritance problem In-Reply-To: References: <4B26DEC0.7010607@eburg.com> <4B27C881.6030401@eburg.com> <4B27EBB6.7050403@eburg.com> <85963A63-74F3-4292-8472-CE9DC45F22AD@ena.com> <4B281B3A.9040803@eburg.com> <4B28B0EF.3070002@eburg.com> Message-ID: <4B2F2F18.4050803@eburg.com> On 12/16/2009 04:43 AM, Martin Melin wrote: > On Wed, Dec 16, 2009 at 11:05 AM, Gordon Messmer wrote: > >> >> I'm sure that when you tell me what the "null" notification period does, >> I'll understand how. For now, I don't, because it doesn't seem to be >> documented. >> > null is a value for a notification period that does not have any > effect. Most config's I've seen have a timeperiod named "none", > configured to never match. This accomplishes the same thing. I appreciate the time you've taken to answer my questions, but you're wrong. The services which I've configured which have a "null" notification period will send notices regardless of the time, not never as you've suggested. > Using > null does not do anything special as far as inheritance is concerned - > Yes, I noticed that. That is why I brought the issue up to begin with. My question is: why not? The documentation states that services will inherit the hosts' notification time if the service definition has no time specified. It also suggests that "null" is a special value that prevents inheritance. It isn't documented to do anything other than prevent inheritance. Why shouldn't that create an unspecified value? I'm no longer asking anyone to explain the behavior. The code was already pointed out, so I understand what /is/ happening. My question is (and always was) whether or not the behavior should be considered a bug. I'm convinced that it should. ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jim at jimavery.me.uk Mon Dec 21 14:30:53 2009 From: jim at jimavery.me.uk (Jim Avery) Date: Mon, 21 Dec 2009 13:30:53 +0000 Subject: Nagios is logging "Service Check Timed Out" for certain service In-Reply-To: <200912210126.AA01354@S2007337.jp.fujitsu.com> References: <765d77c80912010859w11a4d758iec8b8e810c65112f@mail.gmail.com> <200912210126.AA01354@S2007337.jp.fujitsu.com> Message-ID: <765d77c80912210530g4ccd0f3csbe8241e8d5c63330@mail.gmail.com> 2009/12/21 Yu Watanabe : > Hello Jim. > > I still cannnot solve why this had occured. > So , would you please give us an advice? > > In the previous mail , I have realized that > the detail about the situation was not articulated. > > Following are the details about the situation: > > 1. About the log checking process: > ?There is one active check and one passive service check to do the job. > > ?The active check service scans the log file using the pre defined regexpression. > ?If there are any lines that matches the regex, it send the result to the passive service. > > 2. Only the active check for log checkin process returns "Service Timed Out" > ?All the other services but the corresponding active check were returning proper > ?check result. > > 3. The active check itself was not executed. > ?This, I found out by writing a debug line in the active check plugin. > ?I wrote a debug line at the very top of the source code but even though the > ?check time came , the debug log wasn't created. > > The most suspicious fact is 2 and 3. If the the reason was completely dependent on the plugin, > there must be some kind of debug log created, but since the plugin itself was not > executed , it is becoming a little bit tricky. > > Would it possibe to here your opinion about this ? > > Thank you > Yu Watanabe I am sorry Yu, but I do not see why you have an active check service sending results to a passive check service. If you could provide the relevant service and command definitions, the plugins used and any other scripts and configs which are relevant, it might be easier to understand. Cheers, Jim ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From doepain at gmail.com Mon Dec 21 16:15:09 2009 From: doepain at gmail.com (dOE) Date: Mon, 21 Dec 2009 10:15:09 -0500 Subject: Nagios monitor for VMware ESXi (free edition) In-Reply-To: <90f6e8270912200556v28925254y5903430df01275d1@mail.gmail.com> References: <24B6509E4191AF44B60A24EAA3B4AD49361A95@nuexchg.norwich.edu> <90f6e8270912200556v28925254y5903430df01275d1@mail.gmail.com> Message-ID: Could you post an example of how you compile this plugin? On Sun, Dec 20, 2009 at 8:56 AM, Natxo Asenjo wrote: > On Sat, Dec 19, 2009 at 4:45 PM, dOE wrote: > > I am hoping to monitor some hardware with the HP agents that are > pre-built > > in the ESXi (free edition) build we are using. > > we use http://labs.consol.de/lang/en/nagios/check_hpasm/ to check all > our proliant servers (regardless of operating system). It works very > fine, but you need to compile the plugin yourself. > > natxo > > > ------------------------------------------------------------------------------ > This SF.Net email is sponsored by the Verizon Developer Community > Take advantage of Verizon's best-in-class app development support > A streamlined, 14 day to market process makes app distribution fast and > easy > Join now and get one step closer to millions of Verizon customers > http://p.sf.net/sfu/verizon-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jtata at gpworldwide.com Mon Dec 21 17:01:59 2009 From: jtata at gpworldwide.com (Tata, Joseph) Date: Mon, 21 Dec 2009 11:01:59 -0500 Subject: Problems With Availability Reports Message-ID: I am having problems with availability reports in Nagios 2.x. There are two that are of concern to my manager: First Nagios does not seem to be taking into account scheduled downtime against the Total Uptime percentage, this is throwing off our reporting numbers. Second the availability reports for Service Groups do not display scheduled downtime at all. My questions are what if anything can be done to fix these issues, and are there any packages/plug ins which offer more robust reporting? What else are Nagios users doing to deal with reporting issues? Thanks. Joseph P. Tata Systems Administrator General Physics Corporation (GP) www.gpworldwide.com -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From yinyang at eburg.com Mon Dec 21 19:33:32 2009 From: yinyang at eburg.com (Gordon Messmer) Date: Mon, 21 Dec 2009 10:33:32 -0800 Subject: Notification period inheritance problem In-Reply-To: References: Message-ID: <4B2FBF7C.7030008@eburg.com> On 12/21/2009 08:09 AM, Tata, Joseph wrote: > Which version of Nagios are you using? 2.x does not seem to inherit > properties or define services for hostgroups/servicegroups as > described in the documentation for 3.x. nagios-3.2.0 > Secondly services have their own notification periods which can be > different from the hosts they run on. You should have a service > template as well as a host template which defines these. I know that. I want the services to use the host's notification time. In order for that to happen, they need to not have a notification time of their own. The documentation seems to suggest that "null" would accomplish this, but the code does something slightly different. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From natxo.asenjo at gmail.com Mon Dec 21 21:40:45 2009 From: natxo.asenjo at gmail.com (Natxo Asenjo) Date: Mon, 21 Dec 2009 21:40:45 +0100 Subject: Nagios monitor for VMware ESXi (free edition) In-Reply-To: References: <24B6509E4191AF44B60A24EAA3B4AD49361A95@nuexchg.norwich.edu> <90f6e8270912200556v28925254y5903430df01275d1@mail.gmail.com> Message-ID: <90f6e8270912211240t60d7b75cl3687dfd4a439c7b1@mail.gmail.com> On Mon, Dec 21, 2009 at 4:15 PM, dOE wrote: > Could you post an example of how you compile this plugin? Check the installation instructions on the link I posted. natxo ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From kasper at nordal-lund.dk Tue Dec 22 08:47:53 2009 From: kasper at nordal-lund.dk (Kasper Lund) Date: Tue, 22 Dec 2009 08:47:53 +0100 Subject: Problems with notification interval. Message-ID: Hi List. I have just upgraded my nagios from 2.12 to 3.20 - everything seems to be running just fine except for my notification interval. Usually i use 0 as interval as i only want to receive the message once, but nagios does not accept this option or any other. I get the message every 5 minuttes no matter what i write in the notification_interval option. I have tried with 120 and 240 as well, i still get it every 5 minuttes :( Here is my service definition file: define service { host_name dkcopenh010p-DHCP service_description Fixed_Drives check_command NRPE_Check_Disk!15%!10% is_volatile 1 max_check_attempts 3 check_interval 5 retry_interval 3 passive_checks_enabled 1 check_period 24x7 check_freshness 1 freshness_threshold 0 low_flap_threshold 0 high_flap_threshold 0 notification_interval 240 notification_period 24x7 notification_options w,u,r,c,f notifications_enabled 1 contact_groups windows-admins register 1 } Where could my problem be. I suspect that some of the options i use may be deprecated or something similar? Thanks in advance. /Kasper ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From EWScott at scotborders.gov.uk Tue Dec 22 11:18:00 2009 From: EWScott at scotborders.gov.uk (Scott, Ewan) Date: Tue, 22 Dec 2009 10:18:00 +0000 Subject: conftest errors on configure of source nagios-3.2.0 In-Reply-To: <3E4404A2-9B38-41D1-A7C9-B24B9744C587@ena.com> References: <771645F4ADF2C9449B7E6AE630C720018BC4BD22E8@HQ-MAIL-01.scotborders.gov.uk> <771645F4ADF2C9449B7E6AE630C720018BC4BD22EB@HQ-MAIL-01.scotborders.gov.uk> <3E4404A2-9B38-41D1-A7C9-B24B9744C587@ena.com> Message-ID: <771645F4ADF2C9449B7E6AE630C720018BC4BD22EF@HQ-MAIL-01.scotborders.gov.uk> Marc Noted for future. Thanks again. Ewan -----Original Message----- From: Marc Powell [mailto:marc at ena.com] Sent: 18 December 2009 14:08 To: Nagios Mailinglist Subject: Re: [Nagios-users] conftest errors on configure of source nagios-3.2.0 Please always respond on list so that everyone sees, follows and benefits from the conversation. More below... On Dec 18, 2009, at 4:23 AM, Scott, Ewan wrote: > Marc > Thanks for this. The log file shows a normal end -see below. So, from what you say, I should just continue with the install. Yes, if the console output from configure indicated so (I'll bet it did). > My inexperience with linux compiling I'm afraid. I didn't want to end up with problems further down the line that could be traced back to the compile. *nod*. You should rarely, if ever, have to look at config.log. Only if configure fails or shows some type of error and you know specifically what to look for in config.log to help figure out why it failed. Generally the ./configure console output will tell you what you need to know though. Note that 99% of that output is informational. Developers tend to make the things that are really bad really stand out. -- Marc ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ********************************************************************** This email and any files transmitted with it are privileged, confidential and subject to copyright. Any unauthorised use or disclosure of any part of this email is prohibited. If you are not the intended recipient please inform the sender immediately; you should then delete the email and remove any copies from your system. The views or opinions expressed in this communication may not necessarily be those of Scottish Borders Council. Please be advised that Scottish Borders Council's incoming and outgoing email is subject to regular monitoring and any email may require to be disclosed by the Council under the provisions of the Freedom of Information (Scotland) Act 2002. ********************************************************************** ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From stanb at panix.com Tue Dec 22 16:08:31 2009 From: stanb at panix.com (stan) Date: Tue, 22 Dec 2009 10:08:31 -0500 Subject: Monitoring Wimdows machines Message-ID: <20091222150831.GA843@teddy.fas.com> We have a fair sized Nagios installation that currently only monitors UNIX machines, and we want to add some Windows clients, primarily XP. We started down the road using NSClient++ using the check_nt Nagios plugin, but haven't been able to communicate with any XP boxes. The NSClient++ web page suggests using the NRPE method which has us wondering if the check_nt method even works. Has anyone had success with either of these methods and if so which? Perhaps there is another tool that someone can recommend. Thanks in advance, Stan Brown -- A: Because it messes up the order in which people normally read text. Q: Why is top-posting such a bad thing? A: Top-posting. Q: What is the most annoying thing in e-mail? ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Tue Dec 22 17:22:17 2009 From: nagios at flatto.net (Assaf Flatto) Date: Tue, 22 Dec 2009 16:22:17 +0000 Subject: Monitoring Wimdows machines In-Reply-To: <20091222150831.GA843@teddy.fas.com> References: <20091222150831.GA843@teddy.fas.com> Message-ID: <4B30F239.5090500@flatto.net> I've had success on both counts , and the check_nt works with no issues. points to consider that might be causing the problem : in the nsc.ini , did you allow the nagios server in the allowed hosts , did you specify the port that you initiate the checks on ? if you did any modification to the nsc.ini , did you restart the nsclient++ service ? have you disabled or allowed connection on the port on the internal windows firewall? Assaf stan wrote: > We have a fair sized Nagios installation that currently only monitors UNIX > machines, and we want to add some Windows clients, primarily XP. We started > down the road using NSClient++ using the check_nt Nagios plugin, but > haven't been able to communicate with any XP boxes. The NSClient++ web > page suggests using the NRPE method which has us wondering if the check_nt > method even works. Has anyone had success with either of these methods and > if so which? Perhaps there is another tool that someone can recommend. > > Thanks in advance, > Stan Brown > > ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From natxo.asenjo at gmail.com Tue Dec 22 17:32:08 2009 From: natxo.asenjo at gmail.com (Natxo Asenjo) Date: Tue, 22 Dec 2009 17:32:08 +0100 Subject: Monitoring Wimdows machines In-Reply-To: <20091222150831.GA843@teddy.fas.com> References: <20091222150831.GA843@teddy.fas.com> Message-ID: <90f6e8270912220832l42c93708y93b9cb16a50cd67d@mail.gmail.com> On Tue, Dec 22, 2009 at 4:08 PM, stan wrote: > We have a fair sized Nagios installation that currently only monitors UNIX > machines, and we want to add some Windows clients, primarily XP. We started > down the road using NSClient++ using the check_nt Nagios plugin, but > haven't been able to communicate with any XP boxes. ?The NSClient++ web > page suggests using the NRPE method which has us wondering if the check_nt > method even works. ?Has anyone had success with either of these methods and > if so which? ?Perhaps there is another tool that someone can recommend. nsclient++ works fine with xp. You need to check the ini file and verifying you are using the correct ports on the xp machines and the nagios server. Check the xp firewall as well, standard since sp 2 the xp machines are firewalled (unless you have already disabled that). -- natxo ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From natxo.asenjo at gmail.com Tue Dec 22 17:34:31 2009 From: natxo.asenjo at gmail.com (Natxo Asenjo) Date: Tue, 22 Dec 2009 17:34:31 +0100 Subject: NRPE refused connect In-Reply-To: <7fedbc910912151632j24ab45aameb95b16d589b4da3@mail.gmail.com> References: <7fedbc910912151632j24ab45aameb95b16d589b4da3@mail.gmail.com> Message-ID: <90f6e8270912220834t4447d657gf03e9c7f987468cb@mail.gmail.com> did you restart the nrpe service/daemon on the client after modifying the allowed hosts parameter? -- Groeten, J.Asenjo On Wed, Dec 16, 2009 at 1:32 AM, shacky wrote: > Hi. > > I installed the check_nrpe plugin on the Nagios server and NRPE > running as daemon on the host I have to monitor, both compiled with > SSL support and the dh.h file created and saved in the include/ > directory on the NRPE host. > > [root at monitored-host ~]# /opt/nrpe/bin/nrpe > NRPE - Nagios Remote Plugin Executor > Copyright (c) 1999-2008 Ethan Galstad (nagios at nagios.org) > Version: 2.12 > Last Modified: 03-10-2008 > License: GPL v2 with exemptions (-l for more info) > SSL/TLS Available: Anonymous DH Mode, OpenSSL 0.9.6 or higher required > TCP Wrappers Available > > nagios-server:/# /usr/local/nagios/libexec/check_nrpe > Incorrect command line arguments supplied > NRPE Plugin for Nagios > Copyright (c) 1999-2008 Ethan Galstad (nagios at nagios.org) > Version: 2.12 > Last Modified: 03-10-2008 > License: GPL v2 with exemptions (-l for more info) > SSL/TLS Available: Anonymous DH Mode, OpenSSL 0.9.6 or higher required > > The IP address of the Nagios server is specified in the > "allowed_hosts" declaration in the nrpe.cfg file: > > [root at monitored-host ~]# grep allowed_hosts /opt/nrpe/etc/nrpe.cfg > allowed_hosts=192.168.10.250 > > The problem is that if I try to execute the check_nrpe command to test > the NRPE daemon on the monitored host, I receive this error: > > nagios-server:/# /usr/local/nagios/libexec/check_nrpe -H 192.168.10.18 > CHECK_NRPE: Error - Could not complete SSL handshake. > > On the monitored host I see that the IP address of the Nagios server > is allowed and then refused: > > [root at monitored-host ~]# tail /var/log/messages > Dec 16 01:24:27 monitored-host nrpe[25047]: INFO: SSL/TLS initialized. > All network traffic will be encrypted. > Dec 16 01:24:27 monitored-host nrpe[25048]: Starting up daemon > Dec 16 01:24:27 monitored-host nrpe[25048]: Warning: Daemon is > configured to accept command arguments from clients! > Dec 16 01:24:27 monitored-host nrpe[25048]: Listening for connections > on port 5666 > Dec 16 01:24:27 monitored-host nrpe[25048]: Allowing connections from: > 192.168.10.250 > Dec 16 01:27:01 monitored-host nrpe[25063]: refused connect from > 192.168.10.250 (192.168.10.250) > > What is the problem? > It is not a firewall problem because the connection works, and it does > not seems to be a SSL related problem because it does not work even if > I try the check command disabling SSL with -n (and the NRPE daemon > runned with -n also), and it is quite curious to have two opposite log > messages. > > Could you help me please? I worked all the afternoon trying to let it > work, but it does not work... > > Thank you very much!! > Bye. > > ------------------------------------------------------------------------------ > This SF.Net email is sponsored by the Verizon Developer Community > Take advantage of Verizon's best-in-class app development support > A streamlined, 14 day to market process makes app distribution fast and easy > Join now and get one step closer to millions of Verizon customers > http://p.sf.net/sfu/verizon-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From stanb at panix.com Tue Dec 22 17:40:02 2009 From: stanb at panix.com (stan) Date: Tue, 22 Dec 2009 11:40:02 -0500 Subject: Monitoring Wimdows machines In-Reply-To: <4B30F239.5090500@flatto.net> References: <20091222150831.GA843@teddy.fas.com> <4B30F239.5090500@flatto.net> Message-ID: <20091222164002.GA3228@teddy.fas.com> On Tue, Dec 22, 2009 at 04:22:17PM +0000, Assaf Flatto wrote: > I've had success on both counts , and the check_nt works with no issues. > > points to consider that might be causing the problem : > in the nsc.ini , did you allow the nagios server in the allowed hosts , > did you specify the port that you initiate the checks on ? > > if you did any modification to the nsc.ini , did you restart the > nsclient++ service ? > > have you disabled or allowed connection on the port on the internal > windows firewall? > > Assaf > Yes, we have done all of these. We do live in a world with way too many firewalls. What ports should we check with nmap between the What else can you think of that we might need to check? -- A: Because it messes up the order in which people normally read text. Q: Why is top-posting such a bad thing? A: Top-posting. Q: What is the most annoying thing in e-mail? ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From xml.devel at gmail.com Tue Dec 22 18:24:14 2009 From: xml.devel at gmail.com (Kumar, Ashish) Date: Tue, 22 Dec 2009 22:54:14 +0530 Subject: Monitoring Wimdows machines In-Reply-To: <20091222164002.GA3228@teddy.fas.com> References: <20091222150831.GA843@teddy.fas.com> <4B30F239.5090500@flatto.net> <20091222164002.GA3228@teddy.fas.com> Message-ID: <633e02960912220924x14774a22hfacb709788686727@mail.gmail.com> > Yes, we have done all of these. We do live in a world with way too many > firewalls. What ports should we check with nmap between the > > What else can you think of that we might need to check? > Try this: edit nsc.ini, in the log section uncomment debug and file options, e.g. [log] debug=1 file=NSC.log restart nsclient++. from nagios server run a command like $ check_nt -H $HOST -p $PORT -s $PASSWORD -v UPTIME post the output of check_nt command and output of NSC.log, if any. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From michael at medin.name Tue Dec 22 18:05:46 2009 From: michael at medin.name (Michael Medin) Date: Tue, 22 Dec 2009 18:05:46 +0100 Subject: Monitoring Wimdows machines In-Reply-To: <20091222164002.GA3228@teddy.fas.com> References: <20091222150831.GA843@teddy.fas.com> <4B30F239.5090500@flatto.net> <20091222164002.GA3228@teddy.fas.com> Message-ID: <4B30FC6A.7080705@medin.name> Read the guide it should have the information you need to get it setup. http://nsclient.org/nscp/wiki/doc/usage/nagios/nrpe // Michael Medin stan wrote: > On Tue, Dec 22, 2009 at 04:22:17PM +0000, Assaf Flatto wrote: > >> I've had success on both counts , and the check_nt works with no issues. >> >> points to consider that might be causing the problem : >> in the nsc.ini , did you allow the nagios server in the allowed hosts , >> did you specify the port that you initiate the checks on ? >> >> if you did any modification to the nsc.ini , did you restart the >> nsclient++ service ? >> >> have you disabled or allowed connection on the port on the internal >> windows firewall? >> >> Assaf >> >> > > Yes, we have done all of these. We do live in a world with way too many > firewalls. What ports should we check with nmap between the > > What else can you think of that we might need to check? > > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From stanb at panix.com Tue Dec 22 21:06:05 2009 From: stanb at panix.com (stan) Date: Tue, 22 Dec 2009 15:06:05 -0500 Subject: Monitoring Wimdows machines In-Reply-To: <633e02960912220924x14774a22hfacb709788686727@mail.gmail.com> References: <20091222150831.GA843@teddy.fas.com> <4B30F239.5090500@flatto.net> <20091222164002.GA3228@teddy.fas.com> <633e02960912220924x14774a22hfacb709788686727@mail.gmail.com> Message-ID: <20091222200605.GA7864@teddy.fas.com> On Tue, Dec 22, 2009 at 10:54:14PM +0530, Kumar, Ashish wrote: > > Yes, we have done all of these. We do live in a world with way too many > > firewalls. What ports should we check with nmap between the > > > > What else can you think of that we might need to check? > > > > Try this: > > edit nsc.ini, in the log section uncomment debug and file options, e.g. > [log] > debug=1 > file=NSC.log > > restart nsclient++. > > from nagios server run a command like > $ check_nt -H $HOST -p $PORT -s $PASSWORD -v UPTIME > > post the output of check_nt command and output of NSC.log, if any. To clse the thread. We had 2 problesm. We had odd networking, and the nagios machine is dual hiomed. We had put the wrong IP in the config on the Windows machine, but fixing that did not make it work. We had to tell the windows client what IP to bind on (it's only interface BTW). It is a virtualbox machine, with a bridged interface from the host, so mayvbe that had something to do with it. Thanks for the help! -- A: Because it messes up the order in which people normally read text. Q: Why is top-posting such a bad thing? A: Top-posting. Q: What is the most annoying thing in e-mail? ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From HBruenjes at nowis.de Tue Dec 22 22:02:50 2009 From: HBruenjes at nowis.de (=?ISO-8859-1?B?SGVubmluZyBCcvxuamVz?=) Date: Tue, 22 Dec 2009 22:02:50 +0100 Subject: =?iso-8859-1?q?Henning_Br=FCnjes_ist_au=DFer_Haus?= =?iso-8859-1?q?=2E?= Message-ID: Ich werde ab 22.12.2009 nicht im B?ro sein. Ich kehre zur?ck am 04.01.2010. Ich werde Ihre Nachricht nach meiner R?ckkehr beantworten. In dringenden F?llen schreiben Sie bitte eine E-mail an system-services at nowis.de * * * * * NOWIS - Nordwest-Informationssysteme GmbH & Co. KG DISCLAIMER * * * * * Der Inhalt dieser E-Mail ist vertraulich und ausschlie?lich f?r den bezeichneten Adressaten bestimmt. Wenn Sie nicht der vorgesehene Adressat dieser E-Mail oder dessen Vertreter sein sollten, so beachten Sie bitte, da? jede Form der Kenntnisnahme, Ver?ffentlichung, Vervielf?ltigung oder Weitergabe des Inhalts dieser E-Mail unzul?ssig ist. Wir bitten Sie, sich in diesem Fall mit dem Absender der E-Mail in Verbindung zu setzen. NOWIS - Nordwest-Informationssysteme GmbH & Co.KG - Mittelkamp 110-118 - D-26125 Oldenburg Telefon: +49 / (0)441 / 3907-0, Fax: +49 / (0)441 / 3907-175 - email: info at nowis.de - http://www.nowis.de Handelsregister: Amtsgericht Oldenburg HRB 3608 - Gesch?ftsf?hrer: L?der Wohltmann, Dr. Uwe Vaihinger - Pers?nlich haftende Gesellschafterin: NOWIS Verwaltungs GmbH, Sitz: Oldenburg, Handelsregister: Amtsgericht Oldenburg - HRB 4566, Gesch?ftsf?hrer: Dr.-Ing. Uwe Vaihinger - Vorsitzender des Aufsichtsrates: Klaus-Dietrich Schrepp * * * * * ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jim at jimavery.me.uk Wed Dec 23 17:56:19 2009 From: jim at jimavery.me.uk (Jim Avery) Date: Wed, 23 Dec 2009 16:56:19 +0000 Subject: Problems With Availability Reports In-Reply-To: References: Message-ID: <765d77c80912230856m62dbadbdv485a1e603f46ffe5@mail.gmail.com> 2009/12/21 Tata, Joseph : > I am having problems with availability reports in Nagios 2.x.? There are two > that are of concern to my manager: > First Nagios does not seem to be taking into account scheduled downtime > against the Total Uptime percentage, this is throwing off our reporting > numbers. > Second the availability reports for Service Groups do not display scheduled > downtime at all. > > My questions are what if anything can be done to fix these issues, and are > there any packages/plug ins which offer more robust reporting?? What else > are Nagios users doing to deal with reporting issues? There were some bugs in the availability reports in Nagios 2.x. I can't rememeber if they were fixed in later versions of Nagios 2.x or in 3.x. I would recommend upgrading to Nagios 3.x but I see that in Nagios 3.2 the availability report doesn't account for scheduled downtime either so it wouldn't help you with your second problem. I'm sorry I don't suppose that helps much. Jim ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lee at dropio.com Wed Dec 23 22:58:14 2009 From: lee at dropio.com (Lee Azzarello) Date: Wed, 23 Dec 2009 16:58:14 -0500 Subject: Dynamic host, hostgroup and service addition/subtraction Message-ID: <16b031c0912231358t261861fbq30cb614011e04e81@mail.gmail.com> I have an application that requires new cluster nodes be created and destroyed to handle load spikes. I would like this additional capacity to be represented in nagios during it's lifetime. Currently my installation requires manually adding host definitions and dependent hostgroup and service configurations. Is there a system in Nagios 3 to change the host definitions without editing a configuration file, redeploying the configs and restarting the nagios service? -- _______________ Lee Azzarello drop.io staff hacker ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Wed Dec 23 23:44:29 2009 From: patrick.morris at hp.com (patrick.morris at hp.com) Date: Wed, 23 Dec 2009 14:44:29 -0800 Subject: Dynamic host, hostgroup and service addition/subtraction In-Reply-To: <16b031c0912231358t261861fbq30cb614011e04e81@mail.gmail.com> References: <16b031c0912231358t261861fbq30cb614011e04e81@mail.gmail.com> Message-ID: <20091223224429.GT17169@bakgwai.americas.hpqcorp.net> Hi Lee! On Wed, 23 Dec 2009, Lee Azzarello wrote: > I have an application that requires new cluster nodes be created and > destroyed to handle load spikes. I would like this additional > capacity to be represented in nagios during it's lifetime. Currently > my installation requires manually adding host definitions and > dependent hostgroup and service configurations. Is there a system in > Nagios 3 to change the host definitions without editing a > configuration file, redeploying the configs and restarting the nagios > service? It sounds like you're asking if you can modify the config without modifying the config, and the answer to that would be mostly no, though there are some things you can do through external commands. Those are all pretty well spelled-out in the docs. A restart for a config change isn't necessary, though. Sending Nagios a HUP will cause it to reload its configs without a full restart, and it would be a relatively simple matter, I suspect, to just have a config dir defined and drop hosts in it and pull them out (sending the HUP signal afterward, of course) as needed. ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mmelin at gmail.com Thu Dec 24 00:59:16 2009 From: mmelin at gmail.com (Martin Melin) Date: Thu, 24 Dec 2009 00:59:16 +0100 Subject: Dynamic host, hostgroup and service addition/subtraction In-Reply-To: <16b031c0912231358t261861fbq30cb614011e04e81@mail.gmail.com> References: <16b031c0912231358t261861fbq30cb614011e04e81@mail.gmail.com> Message-ID: We have a config with similar characteristics (EC2 app) Set up templates so that the only unique information in your host definition is the IP address, then have your scaling scripts add and remove definitions for your nodes. For easy management, I suggest having one host definition per .cfg file in a subdirectory. Have cron send Nagios a HUP to reload the config every 5 or 10 minutes, and you should be good to go. Use regex matching with host and service group names to get nice dynamic group memberships etc. Good luck! Best regards, Martin Melin On Wed, Dec 23, 2009 at 10:58 PM, Lee Azzarello wrote: > I have an application that requires new cluster nodes be created and > destroyed to handle load spikes. ?I would like this additional > capacity to be represented in nagios during it's lifetime. Currently > my installation requires manually adding host definitions and > dependent hostgroup and service configurations. Is there a system in > Nagios 3 to change the host definitions without editing a > configuration file, redeploying the configs and restarting the nagios > service? > > -- > _______________ > Lee Azzarello > drop.io staff hacker > > ------------------------------------------------------------------------------ > This SF.Net email is sponsored by the Verizon Developer Community > Take advantage of Verizon's best-in-class app development support > A streamlined, 14 day to market process makes app distribution fast and easy > Join now and get one step closer to millions of Verizon customers > http://p.sf.net/sfu/verizon-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jim at jimavery.me.uk Thu Dec 24 14:09:16 2009 From: jim at jimavery.me.uk (Jim Avery) Date: Thu, 24 Dec 2009 13:09:16 +0000 Subject: When to HUP and when to restart? Message-ID: <765d77c80912240509m7abb7a0em7388fddc72338291@mail.gmail.com> Thanks to Patrick mentioning you can send a HUP to get Nagios to reload it's config, (how on earth did I now know that??), it got me wondering... When, if at all, do I need to do a full restart of the Nagios daemon? Cheers and Happy Christmas everyone. Jim ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jim at jimavery.me.uk Thu Dec 24 14:14:49 2009 From: jim at jimavery.me.uk (Jim Avery) Date: Thu, 24 Dec 2009 13:14:49 +0000 Subject: Problems with notification interval. In-Reply-To: References: Message-ID: <765d77c80912240514m9e970a2t4634d386684121f4@mail.gmail.com> 2009/12/22 Kasper Lund : > Hi List. > > I have just upgraded my nagios from 2.12 to 3.20 - everything seems to be > running just fine except for my notification interval. > > Usually i use 0 as interval as i only want to receive the message once, > but nagios does not accept this option or any other. I get the message > every 5 minuttes no matter what i write in the notification_interval > option. > > I have tried with 120 and 240 as well, i still get it every 5 minuttes :( > > Here is my service definition file: > > define service { > ? ? ? ?host_name ? ? ? ? ? ? ? ? ? ? ? dkcopenh010p-DHCP > ? ? ? ?service_description ? ? ? ? ? ? Fixed_Drives > ? ? ? ?check_command ? ? ? ? ? ? ? ? ? NRPE_Check_Disk!15%!10% > ? ? ? ?is_volatile ? ? ? ? ? ? ? ? ? ? 1 > ? ? ? ?max_check_attempts ? ? ? ? ? ? ?3 > ? ? ? ?check_interval ? ? ? ? ? ? ? ? ?5 > ? ? ? ?retry_interval ? ? ? ? ? ? ? ? ?3 > ? ? ? ?passive_checks_enabled ? ? ? ? ?1 > ? ? ? ?check_period ? ? ? ? ? ? ? ? ? ?24x7 > ? ? ? ?check_freshness ? ? ? ? ? ? ? ? 1 > ? ? ? ?freshness_threshold ? ? ? ? ? ? 0 > ? ? ? ?low_flap_threshold ? ? ? ? ? ? ?0 > ? ? ? ?high_flap_threshold ? ? ? ? ? ? 0 > ? ? ? ?notification_interval ? ? ? ? ? 240 > ? ? ? ?notification_period ? ? ? ? ? ? 24x7 > ? ? ? ?notification_options ? ? ? ? ? ?w,u,r,c,f > ? ? ? ?notifications_enabled ? ? ? ? ? 1 > ? ? ? ?contact_groups ? ? ? ? ? ? ? ? ?windows-admins > ? ? ? ?register ? ? ? ? ? ? ? ? ? ? ? ?1 > ? ? ? ?} > > Where could my problem be. I suspect that some of the options i use may be > deprecated or something similar? I think the problem lies in your "is_volatile 1" directive. Normally you would want "is_volatile 0". I don't know why this would behave any different between your old and new setups. Cheers, Jim ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From averyjim at gmail.com Thu Dec 24 14:32:20 2009 From: averyjim at gmail.com (Jim Avery) Date: Thu, 24 Dec 2009 13:32:20 +0000 Subject: When to HUP and when to restart? Message-ID: <765d77c80912240532y17a638c5yddc3d001bdb530ac@mail.gmail.com> Thanks to Patrick mentioning you can send a HUP to get Nagios to reload it's config, (how on earth did I now know that??), it got me wondering... When, if at all, do I need to do a full restart of the Nagios daemon? Cheers and Happy Christmas everyone. Jim (p.s. I'm sorry if this is the second time you've seen this. I've been getting bounce notifications when posting to the nagios-users list so am trying again from my gmail address). ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From averyjim at gmail.com Thu Dec 24 14:38:10 2009 From: averyjim at gmail.com (Jim Avery) Date: Thu, 24 Dec 2009 13:38:10 +0000 Subject: Problems with notification interval. In-Reply-To: References: Message-ID: <765d77c80912240538x493b7c2bmc45c50bf121c8c75@mail.gmail.com> 2009/12/22 Kasper Lund : > Hi List. > > I have just upgraded my nagios from 2.12 to 3.20 - everything seems to be > running just fine except for my notification interval. > > Usually i use 0 as interval as i only want to receive the message once, > but nagios does not accept this option or any other. I get the message > every 5 minuttes no matter what i write in the notification_interval > option. > > I have tried with 120 and 240 as well, i still get it every 5 minuttes :( > > Here is my service definition file: > > define service { > ? ? ? ?host_name ? ? ? ? ? ? ? ? ? ? ? dkcopenh010p-DHCP > ? ? ? ?service_description ? ? ? ? ? ? Fixed_Drives > ? ? ? ?check_command ? ? ? ? ? ? ? ? ? NRPE_Check_Disk!15%!10% > ? ? ? ?is_volatile ? ? ? ? ? ? ? ? ? ? 1 > ? ? ? ?max_check_attempts ? ? ? ? ? ? ?3 > ? ? ? ?check_interval ? ? ? ? ? ? ? ? ?5 > ? ? ? ?retry_interval ? ? ? ? ? ? ? ? ?3 > ? ? ? ?passive_checks_enabled ? ? ? ? ?1 > ? ? ? ?check_period ? ? ? ? ? ? ? ? ? ?24x7 > ? ? ? ?check_freshness ? ? ? ? ? ? ? ? 1 > ? ? ? ?freshness_threshold ? ? ? ? ? ? 0 > ? ? ? ?low_flap_threshold ? ? ? ? ? ? ?0 > ? ? ? ?high_flap_threshold ? ? ? ? ? ? 0 > ? ? ? ?notification_interval ? ? ? ? ? 240 > ? ? ? ?notification_period ? ? ? ? ? ? 24x7 > ? ? ? ?notification_options ? ? ? ? ? ?w,u,r,c,f > ? ? ? ?notifications_enabled ? ? ? ? ? 1 > ? ? ? ?contact_groups ? ? ? ? ? ? ? ? ?windows-admins > ? ? ? ?register ? ? ? ? ? ? ? ? ? ? ? ?1 > ? ? ? ?} I think the problem lies in your "is_volatile 1" directive. Normally you would want "is_volatile 0". I don't know why this would behave any different between your old and new setups. Cheers, Jim ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jcall at verio.net Thu Dec 24 21:56:48 2009 From: jcall at verio.net (Jonathan Call) Date: Thu, 24 Dec 2009 15:56:48 -0500 Subject: When to HUP and when to restart? In-Reply-To: <765d77c80912240532y17a638c5yddc3d001bdb530ac@mail.gmail.com> References: <765d77c80912240532y17a638c5yddc3d001bdb530ac@mail.gmail.com> Message-ID: <04F3233F47E2714CB7431AE913E57E7703B5CA0A@IAD-WPRD-XCHB02.corp.verio.net> If you?re using the embedded Perl interpreter a restart is probably better since the interpreter leaks memory. If you have a very large solution (thousands of service checks) a restart will take a considerable amount of time so a HUP would probably be wise in that situation. Jonathan > -----Original Message----- > From: Jim Avery [mailto:averyjim at gmail.com] > Sent: Thursday, December 24, 2009 6:32 AM > To: nagios List > Subject: [Nagios-users] When to HUP and when to restart? > > Thanks to Patrick mentioning you can send a HUP to get Nagios to > reload it's config, (how on earth did I now know that??), it got me > wondering... > > When, if at all, do I need to do a full restart of the Nagios daemon? > > Cheers and Happy Christmas everyone. > > Jim > > (p.s. I'm sorry if this is the second time you've seen this. I've been > getting bounce notifications when posting to the nagios-users list so > am trying again from my gmail address). > > ----------------------------------------------------------------------- > ------- > This SF.Net email is sponsored by the Verizon Developer Community > Take advantage of Verizon's best-in-class app development support > A streamlined, 14 day to market process makes app distribution fast and > easy > Join now and get one step closer to millions of Verizon customers > http://p.sf.net/sfu/verizon-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null This email message is intended for the use of the person to whom it has been sent, and may contain information that is confidential or legally protected. If you are not the intended recipient or have received this message in error, you are not authorized to copy, distribute, or otherwise use this message or its attachments. Please notify the sender immediately by return e-mail and permanently delete this message and any attachments. Verio, Inc. makes no warranty that this email is error or virus free. Thank you. ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From averyjim at gmail.com Fri Dec 25 12:05:28 2009 From: averyjim at gmail.com (Jim Avery) Date: Fri, 25 Dec 2009 11:05:28 +0000 Subject: When to HUP and when to restart? In-Reply-To: <04F3233F47E2714CB7431AE913E57E7703B5CA0A@IAD-WPRD-XCHB02.corp.verio.net> References: <765d77c80912240532y17a638c5yddc3d001bdb530ac@mail.gmail.com> <04F3233F47E2714CB7431AE913E57E7703B5CA0A@IAD-WPRD-XCHB02.corp.verio.net> Message-ID: <765d77c80912250305s56027345u93aaf169f7bb2b0b@mail.gmail.com> 2009/12/24 Jonathan Call : > If you?re using the embedded Perl interpreter a restart is probably better since the interpreter leaks memory. > > If you have a very large solution (thousands of service checks) a restart will take a considerable amount of time so a HUP would probably be wise in that situation. Thank you - it will make a huge difference not to have to restart once or twice a day. Thanks to 6000 or so service checks and the NDO back-end a restart takes a minute or two. Cheers, Jim ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ampranti at gmail.com Fri Dec 25 12:28:35 2009 From: ampranti at gmail.com (Brandino Andreas) Date: Fri, 25 Dec 2009 13:28:35 +0200 Subject: Service dependencies Message-ID: <876561375.20091225132835@gmail.com> Hi all and happy holidays! Lets assume that a host has three service checks (ping, uptime and cpu). If for some reason connection fails, a mail for each service is being sent (with random order). Is any way when "uptime" or "cpu" check fails (not when is critical), to force a "ping service" check and finally send a notification only for that service?? This behavior can be achieved somehow? With services dependencies can I force a check to be performed at specific points? Thank you <> --- - - - --- <> Brandino Andreas ampranti at gmail.com <> --- - - - --- <> ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From yu.watanabe at jp.fujitsu.com Fri Dec 25 13:05:09 2009 From: yu.watanabe at jp.fujitsu.com (Yu Watanabe) Date: Fri, 25 Dec 2009 21:05:09 +0900 Subject: Nagios is logging "Service Check Timed Out" for certain service In-Reply-To: <765d77c80912210530g4ccd0f3csbe8241e8d5c63330@mail.gmail.com> References: <765d77c80912210530g4ccd0f3csbe8241e8d5c63330@mail.gmail.com> Message-ID: <200912251205.AA01375@S2007337.jp.fujitsu.com> Hello Jim. Sorry for the late reply. I will send you the plugins that is related to this service. We had made a customization from the original one. The command definition would be the following. $USER1$/check_syslog_gw.pl -l "$ARG1$" -s /tmp/$HOSTNAME$.tmp -x /usr/local/groundwork/var/log/syslog-ng/$HOSTNAME$.reg -b $HOSTNAME$ Thank you Yu Watanabe Jim Avery さんは書きました: >2009/12/21 Yu Watanabe : >> Hello Jim. >> >> I still cannnot solve why this had occured. >> So , would you please give us an advice? >> >> In the previous mail , I have realized that >> the detail about the situation was not articulated. >> >> Following are the details about the situation: >> >> 1. About the log checking process: >> ?There is one active check and one passive service check to do the job. >> >> ?The active check service scans the log file using the pre defined regexpression. >> ?If there are any lines that matches the regex, it send the result to the passive service. >> >> 2. Only the active check for log checkin process returns "Service Timed Out" >> ?All the other services but the corresponding active check were returning proper >> ?check result. >> >> 3. The active check itself was not executed. >> ?This, I found out by writing a debug line in the active check plugin. >> ?I wrote a debug line at the very top of the source code but even though the >> ?check time came , the debug log wasn't created. >> >> The most suspicious fact is 2 and 3. If the the reason was completely dependent on the plugin, >> there must be some kind of debug log created, but since the plugin itself was not >> executed , it is becoming a little bit tricky. >> >> Would it possibe to here your opinion about this ? >> >> Thank you >> Yu Watanabe > >I am sorry Yu, but I do not see why you have an active check service >sending results to a passive check service. > >If you could provide the relevant service and command definitions, the >plugins used and any other scripts and configs which are relevant, it >might be easier to understand. > >Cheers, > >Jim -------------- next part -------------- A non-text attachment was scrubbed... Name: check_syslog_gw.zip Type: application/x-zip-compressed Size: 5561 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: exception_filter.zip Type: application/x-zip-compressed Size: 510 bytes Desc: not available URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Fri Dec 25 15:19:08 2009 From: marc at ena.com (Marc Powell) Date: Fri, 25 Dec 2009 08:19:08 -0600 Subject: Service dependencies In-Reply-To: <876561375.20091225132835@gmail.com> References: <876561375.20091225132835@gmail.com> Message-ID: <181A586A-7DC3-4DBD-BF9B-FF90AF1E4C8F@ena.com> On Dec 25, 2009, at 5:28 AM, Brandino Andreas wrote: > Hi all and happy holidays! > > Lets assume that a host has three service checks (ping, uptime and cpu). If for some reason > connection fails, a mail for each service is being sent (with random > order). > Is any way when "uptime" or "cpu" check fails (not when is critical), to > force a "ping service" check and finally send a notification only for that > service?? > > This behavior can be achieved somehow? With services dependencies can > I force a check to be performed at specific points? No service dependencies needed. Get rid of the ping service check and make it the host check. If a service on a host fails, the host is checked. If the host check returns non-OK, notifications for services are suppressed and only the host down notification is sent. -- Marc ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From kasper at nordal-lund.dk Sun Dec 27 17:29:09 2009 From: kasper at nordal-lund.dk (Kasper Nordal Lund) Date: Sun, 27 Dec 2009 17:29:09 +0100 Subject: Problems with notification interval. In-Reply-To: <765d77c80912240514m9e970a2t4634d386684121f4@mail.gmail.com> References: <765d77c80912240514m9e970a2t4634d386684121f4@mail.gmail.com> Message-ID: <4B378B55.5020706@nordal-lund.dk> Jim Avery wrote: > > I think the problem lies in your "is_volatile 1" directive. Normally > you would want "is_volatile 0". > > > Thanks Jim, that did the trick. ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From guifre.bosch at gmail.com Mon Dec 28 11:38:41 2009 From: guifre.bosch at gmail.com (Guifre Bosch Fabregas) Date: Mon, 28 Dec 2009 11:38:41 +0100 Subject: Problems to run my Apache with nagios Message-ID: <630294ec0912280238ge158cc8y4c47f95beff323e4@mail.gmail.com> I see this error in all my pages in nagios: *It appears as though you do not have permission to view information for any of the hosts you requested... If you believe this is an error, check the HTTP server authentication requirements for accessing this CGI and check the authorization options in your CGI configuration file. * When i install nagios i put: useradd -m nagios groupadd nagios usermod -a -G nagcmd nagios usermod -a -G nagcmd apache tar xzf nagios-3.2.0.tar.gz cd nagios-3.2.0 ./configure --with-command-group=nagcmd --prefix=/nagios make all && make install && make install-init && make install-config && make install-commandmode && make install-webconf htpasswd -c /nagios/etc/htpasswd.users admin tar xzf nagios-plugins-1.4.14.tar.gz cd nagios-plugins-1.4.14 ./configure --with-nagios-user=nagios --with-nagios-group=nagcmd --prefix=/nagios make && make install chcon -R -t httpd_sys_content_t /nagios/sbin/ chcon -R -t httpd_sys_content_t /nagios/share/ Where is my problem??? -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ravirajpatil at tataelxsi.co.in Mon Dec 28 13:30:02 2009 From: ravirajpatil at tataelxsi.co.in (Raviraj Patil) Date: Mon, 28 Dec 2009 18:00:02 +0530 Subject: error nagios-3.2 and nagios- pluggins-1.4.11 on a linux box server runnning fedora 6. Message-ID: <4B38A4CA.7040209@tataelxsi.co.in> On the basis of nagios quickstart guide, i install without error nagios-3.2 and nagios- pluggins-1.4.11 on a linux box server runnning fedora 6. But when i v?rify the sample nagios configuration files via the following command /usr/local/nagios/bin/nagios -v /usr/local/nagios/etc/nagios.cfg I get this error : /usr/local/nagios/bin/nagios : error while loading the shared libraries: libltdl.so.3: cannot open shared object file: No such file or directory. Any help to resolve this problem will be greatly appreciated. -Raviraj ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From joydeep at infoservices.in Mon Dec 28 14:44:51 2009 From: joydeep at infoservices.in (joydeep at infoservices.in) Date: Mon, 28 Dec 2009 14:44:51 +0100 Subject: error nagios-3.2 and nagios- pluggins-1.4.11 on a linux box server runnning fedora 6. In-Reply-To: <4B38A4CA.7040209@tataelxsi.co.in> References: <4B38A4CA.7040209@tataelxsi.co.in> Message-ID: <4bb47cf8b2016cd09697a61f19911980@infoservices.in> On Mon, 28 Dec 2009 18:00:02 +0530, Raviraj Patil wrote: > I get this error : /usr/local/nagios/bin/nagios : error while loading > the shared libraries: libltdl.so.3: cannot open shared object file: > No such file or directory. Any help to resolve this problem will be > greatly appreciated. Do you have libltdl package in your box ? ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From israel at frontierflying.com Mon Dec 28 18:35:00 2009 From: israel at frontierflying.com (Israel Brewster) Date: Mon, 28 Dec 2009 08:35:00 -0900 Subject: Nagios stops updating Message-ID: <5DA6B11E-4F77-4CAF-9A21-5D8FFF160EE1@frontierflying.com> I want to say this is a FAQ, as I seem to recall seeing something about this behavior before, but I didn't find anything with a quick search, and I don't thing there was any real solution posted, so I'm going to ask now. I am running Nagios 3.2.0 on OpenBSD 4.6. I have found that from time to time nagios will simply stop updating. No errors will be produced, and the nagios log will show normal operation right up until it stops updating. After that, nothing. I have implemented a work around of a script that checks for this and restarts nagios if needed, but obviously this is a sub-optimal solution. Has anyone seen this behavior before that might have some idea how I can fix it? Or does anyone have a suggestion as to how I can go about debugging it? Thanks. ----------------------------------------------- Israel Brewster Computer Support Technician II Frontier Flying Service Inc. 5245 Airport Industrial Rd Fairbanks, AK 99709 (907) 450-7250 x293 ----------------------------------------------- -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: Israel Brewster.vcf Type: text/directory Size: 417 bytes Desc: not available URL: -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Mon Dec 28 19:00:08 2009 From: patrick.morris at hp.com (Morris, Patrick) Date: Mon, 28 Dec 2009 10:00:08 -0800 Subject: Problems to run my Apache with nagios In-Reply-To: <630294ec0912280238ge158cc8y4c47f95beff323e4@mail.gmail.com> References: <630294ec0912280238ge158cc8y4c47f95beff323e4@mail.gmail.com> Message-ID: <4B38F228.4080503@hp.com> Guifre Bosch Fabregas wrote: > I see this error in all my pages in nagios: > > *It appears as though you do not have permission to view information > for any of the hosts you requested... > If you believe this is an error, check the HTTP server authentication > requirements for accessing this CGI > and check the authorization options in your CGI configuration file. > * > When i install nagios i put: > > useradd -m nagios > groupadd nagios > > usermod -a -G nagcmd nagios > usermod -a -G nagcmd apache > > tar xzf nagios-3.2.0.tar.gz > cd nagios-3.2.0 > ./configure --with-command-group=nagcmd --prefix=/nagios > make all && make install && make install-init && make install-config > && make install-commandmode && make install-webconf > > htpasswd -c /nagios/etc/htpasswd.users admin > > tar xzf nagios-plugins-1.4.14.tar.gz > cd nagios-plugins-1.4.14 > ./configure --with-nagios-user=nagios --with-nagios-group=nagcmd > --prefix=/nagios > make && make install > > chcon -R -t httpd_sys_content_t /nagios/sbin/ > chcon -R -t httpd_sys_content_t /nagios/share/ > > > > Where is my problem??? "admin" will need to be a contact on all the hosts and services you want to view, or you need to specify it as being authorized to view all hosts and services in your cgi.cfg. ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Mon Dec 28 19:05:27 2009 From: patrick.morris at hp.com (Morris, Patrick) Date: Mon, 28 Dec 2009 10:05:27 -0800 Subject: When to HUP and when to restart? In-Reply-To: <765d77c80912250305s56027345u93aaf169f7bb2b0b@mail.gmail.com> References: <765d77c80912240532y17a638c5yddc3d001bdb530ac@mail.gmail.com> <04F3233F47E2714CB7431AE913E57E7703B5CA0A@IAD-WPRD-XCHB02.corp.verio.net> <765d77c80912250305s56027345u93aaf169f7bb2b0b@mail.gmail.com> Message-ID: <4B38F367.8000301@hp.com> Jim Avery wrote: > 2009/12/24 Jonathan Call : > >> If you?re using the embedded Perl interpreter a restart is probably better since the interpreter leaks memory. >> >> If you have a very large solution (thousands of service checks) a restart will take a considerable amount of time so a HUP would probably be wise in that situation. >> > > > Thank you - it will make a huge difference not to have to restart once > or twice a day. Thanks to 6000 or so service checks and the NDO > back-end a restart takes a minute or two. > It should probably be noted that even a reload can take a while if you've got NDO on the back end, because it still needs to do all the database voodoo that happens when you restart. The upside to a SIGHUP, though, is that Nagios just stops doing anything during that period, but won't throw an error page like it does on a full restart. FWIW, we use embedded Perl here and thousands of service checks, and rarely, if ever, need to issue a full restart. ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ampranti at gmail.com Mon Dec 28 20:26:24 2009 From: ampranti at gmail.com (Brandino Andreas) Date: Mon, 28 Dec 2009 21:26:24 +0200 Subject: Service dependencies In-Reply-To: <181A586A-7DC3-4DBD-BF9B-FF90AF1E4C8F@ena.com> References: <876561375.20091225132835@gmail.com> <181A586A-7DC3-4DBD-BF9B-FF90AF1E4C8F@ena.com> Message-ID: <1405731610.20091228212624@gmail.com> Are you sure that it works that way?? I disabled notifications for "ping" service, but still I receive notifications from other services although the host is down.... Do I have to enable any other option? Thank you Regards Friday, December 25, 2009, 4:19:08 PM, you wrote: > On Dec 25, 2009, at 5:28 AM, Brandino Andreas wrote: >> Hi all and happy holidays! >> >> Lets assume that a host has three service checks (ping, uptime and cpu). If for some reason >> connection fails, a mail for each service is being sent (with random >> order). >> Is any way when "uptime" or "cpu" check fails (not when is critical), to >> force a "ping service" check and finally send a notification only for that >> service?? >> >> This behavior can be achieved somehow? With services dependencies can >> I force a check to be performed at specific points? > No service dependencies needed. Get rid of the ping service check > and make it the host check. If a service on a host fails, the host > is checked. If the host check returns non-OK, notifications for > services are suppressed and only the host down notification is sent. > -- > Marc > ------------------------------------------------------------------------------ > This SF.Net email is sponsored by the Verizon Developer Community > Take advantage of Verizon's best-in-class app development support > A streamlined, 14 day to market process makes app distribution fast and easy > Join now and get one step closer to millions of Verizon customers > http://p.sf.net/sfu/verizon-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null <> --- - - - --- <> Brandino Andreas ampranti at gmail.com <> --- - - - --- <> ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mmelin at gmail.com Mon Dec 28 21:20:18 2009 From: mmelin at gmail.com (Martin Melin) Date: Mon, 28 Dec 2009 21:20:18 +0100 Subject: Service dependencies In-Reply-To: <1405731610.20091228212624@gmail.com> References: <876561375.20091225132835@gmail.com> <181A586A-7DC3-4DBD-BF9B-FF90AF1E4C8F@ena.com> <1405731610.20091228212624@gmail.com> Message-ID: A service that happens to be a check_ping is not the same thing as the host check command. The host check command should be a plugin that determines whether or not the host is alive - usually a ping, but can be something else. See http://nagios.sourceforge.net/docs/3_0/hostchecks.html for more info on this, but as Marc explained, if you set up your host definition with a correct check_command, what will happen when a host goes down is this (without setting up regularly scheduled host checks): 1. A service check fails on the host 2. Nagios immediately schedules a host check for the host 3. If the host check_command fails, notifications for all services on the host are suppressed until the host check_command clears which is what you want in this case. Regards Martin Melin On Mon, Dec 28, 2009 at 8:26 PM, Brandino Andreas wrote: > Are you sure that it works that way?? > I disabled notifications for "ping" service, but still I receive > notifications from other services although the host is down.... > > Do I have to enable any other option? > > Thank you > Regards > > > > Friday, December 25, 2009, 4:19:08 PM, you wrote: > >> On Dec 25, 2009, at 5:28 AM, Brandino Andreas wrote: > >>> Hi all and happy holidays! >>> >>> Lets assume that a host has three service checks (ping, uptime and cpu). If for some reason >>> connection fails, a mail for each service is being sent (with random >>> order). >>> Is any way when "uptime" or "cpu" check fails (not when is critical), to >>> force a "ping service" check and finally send a notification only for that >>> service?? >>> >>> This behavior can be achieved somehow? With services dependencies can >>> I force a check to be performed at specific points? > >> No service dependencies needed. Get rid of the ping service check >> and make it the host check. If a service on a host fails, the host >> is checked. If the host check returns non-OK, notifications for >> services are suppressed and only the host down notification is sent. > >> -- >> Marc > > >> ------------------------------------------------------------------------------ >> This SF.Net email is sponsored by the Verizon Developer Community >> Take advantage of Verizon's best-in-class app development support >> A streamlined, 14 day to market process makes app distribution fast and easy >> Join now and get one step closer to millions of Verizon customers >> http://p.sf.net/sfu/verizon-dev2dev >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. >> ::: Messages without supporting info will risk being sent to /dev/null > > <> --- ?- ? ?- ? ?- ?--- <> > ? ?Brandino Andreas > ? ?ampranti at gmail.com > <> --- ?- ? ?- ? ?- ?--- <> > > > ------------------------------------------------------------------------------ > This SF.Net email is sponsored by the Verizon Developer Community > Take advantage of Verizon's best-in-class app development support > A streamlined, 14 day to market process makes app distribution fast and easy > Join now and get one step closer to millions of Verizon customers > http://p.sf.net/sfu/verizon-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ravirajpatil at tataelxsi.co.in Tue Dec 29 08:16:15 2009 From: ravirajpatil at tataelxsi.co.in (Raviraj Patil) Date: Tue, 29 Dec 2009 12:46:15 +0530 Subject: error nagios-3.2 and nagios- pluggins-1.4.11 on a linux In-Reply-To: References: Message-ID: <4B39ACBF.2070609@tataelxsi.co.in> Yes,I have libltdl installed by ./configure , make & make install command from the freeradius-server-2.1.7 folder. linltdl comes along with the freeradius-server-2.1.7 server. My locate output is: [root at tataelxs-f8ba86 bin]# locate libltdl.so.* /home/rbp/freeradius-server-2.1.7/libltdl/.libs/libltdl.so.3 /home/rbp/freeradius-server-2.1.7/libltdl/.libs/libltdl.so.3.1.4 /usr/local/lib/libltdl.so.3 /usr/local/lib/libltdl.so.3.1.4 Thanks Raviraj joydeep at infoservices.in wrote: > box server runnning fedora 6. > MIME-Version: 1.0 > Date: Mon, 28 Dec 2009 14:44:51 +0100 > From: > Cc: > In-Reply-To: <4B38A4CA.7040209 at tataelxsi.co.in> > References: <4B38A4CA.7040209 at tataelxsi.co.in> > Message-ID: <4bb47cf8b2016cd09697a61f19911980 at infoservices.in> > X-Sender: joydeep at infoservices.in > User-Agent: RoundCube Webmail/0.3.1 > Content-Transfer-Encoding: 8bit > Content-Type: text/plain; charset=UTF-8 > > > On Mon, 28 Dec 2009 18:00:02 +0530, Raviraj Patil > > wrote: > > > > >> I get this error : /usr/local/nagios/bin/nagios : error while loading >> > > >> the shared libraries: libltdl.so.3: cannot open shared object file: >> > > >> No such file or directory. Any help to resolve this problem will be >> > > >> greatly appreciated. >> > > > > Do you have libltdl package in your box ? > > > > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Quentin.Marquez at experian.com Tue Dec 29 10:24:15 2009 From: Quentin.Marquez at experian.com (Marquez, Quentin) Date: Tue, 29 Dec 2009 09:24:15 -0000 Subject: error nagios-3.2 and nagios- pluggins-1.4.11 on a linux In-Reply-To: <4B39ACBF.2070609@tataelxsi.co.in> References: <4B39ACBF.2070609@tataelxsi.co.in> Message-ID: <28D49132D5CF31498C9C381879963AD736C849@exch-mon-msg01.uk.experian.local> Hi Raviraj, It looks like your are missing the path to your libltdl in your linker. I would suggest to add /usr/local/lib in /etc/ld.so.conf and run ldconfig. man ldconfig for more info Cheers, Quentin From: Raviraj Patil [mailto:ravirajpatil at tataelxsi.co.in] Sent: 29 December 2009 08:16 To: nagios-users at lists.sourceforge.net Subject: Re: [Nagios-users] error nagios-3.2 and nagios- pluggins-1.4.11 on a linux Yes,I have libltdl installed by ./configure , make & make install command from the freeradius-server-2.1.7 folder. linltdl comes along with the freeradius-server-2.1.7 server. My locate output is: [root at tataelxs-f8ba86 bin]# locate libltdl.so.* /home/rbp/freeradius-server-2.1.7/libltdl/.libs/libltdl.so.3 /home/rbp/freeradius-server-2.1.7/libltdl/.libs/libltdl.so.3.1.4 /usr/local/lib/libltdl.so.3 /usr/local/lib/libltdl.so.3.1.4 Thanks Raviraj joydeep at infoservices.in wrote: box server runnning fedora 6. MIME-Version: 1.0 Date: Mon, 28 Dec 2009 14:44:51 +0100 From: Cc: In-Reply-To: <4B38A4CA.7040209 at tataelxsi.co.in> References: <4B38A4CA.7040209 at tataelxsi.co.in> Message-ID: <4bb47cf8b2016cd09697a61f19911980 at infoservices.in> X-Sender: joydeep at infoservices.in User-Agent: RoundCube Webmail/0.3.1 Content-Transfer-Encoding: 8bit Content-Type: text/plain; charset=UTF-8 On Mon, 28 Dec 2009 18:00:02 +0530, Raviraj Patil wrote: I get this error : /usr/local/nagios/bin/nagios : error while loading the shared libraries: libltdl.so.3: cannot open shared object file: No such file or directory. Any help to resolve this problem will be greatly appreciated. Do you have libltdl package in your box ? This e-mail has come from Experian, the only business to have been twice named the UK's 'Business of the Year? =================================================================================== Information in this e-mail and any attachments is confidential, and may not be copied or used by anyone other than the addressee, nor disclosed to any third party without our permission. There is no intention to create any legally binding contract or other binding commitment through the use of this electronic communication unless it is issued in accordance with the Experian Limited standard terms and conditions of purchase or other express written agreement between Experian Limited and the recipient. Although Experian has taken reasonable steps to ensure that this communication and any attachments are free from computer virus, you are advised to take your own steps to ensure that they are actually virus free. Companies Act information: Registered name: Experian Limited Registered office: Landmark House, Experian Way, NG2 Business Park, Nottingham, NG80 1ZZ, United Kingdom Place of registration: England and Wales Registered number: 653331 -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From brentgclarklist at gmail.com Tue Dec 29 12:38:29 2009 From: brentgclarklist at gmail.com (Brent Clark) Date: Tue, 29 Dec 2009 13:38:29 +0200 Subject: peer review debian check Message-ID: <4B39EA35.7040809@gmail.com> Hiya I was hoping someone would be kind to peer review my service check. I got a nrpe check to check that the debian packages on my servers are up to date. I would like for this check to run once a day. This is what I was able to come up with. define service{ use generic-service hostgroup_name debian-packages service_description Debian Packages check_command check_nrpe!check_debian_updates notification_interval 1440 is_volatile 0 check_period 24x7 normal_check_interval 5 retry_check_interval 1 max_check_attempts 10 notification_period 24x7 } Is this correct, if not, would someone be kind enough to point me on the correct path so that i can achieve this check. Kind Regards Brent Clark ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jim at jimavery.me.uk Tue Dec 29 13:46:45 2009 From: jim at jimavery.me.uk (Jim Avery) Date: Tue, 29 Dec 2009 12:46:45 +0000 Subject: peer review debian check In-Reply-To: <4B39EA35.7040809@gmail.com> References: <4B39EA35.7040809@gmail.com> Message-ID: <765d77c80912290446h7fc29a7ybe88b50d4409c21f@mail.gmail.com> 2009/12/29 Brent Clark : > Hiya > > I was hoping someone would be kind to peer review my service check. > > I got a nrpe check to check that the debian packages on my servers are > up to date. I would like for this check to run once a day. > > This is what I was able to come up with. > > define service{ > ? ? ? ? use ? ? ? ? ? ? ? ? ? ? generic-service > ? ? ? ? hostgroup_name ? ? ? ? ?debian-packages > ? ? ? ? service_description ? ? Debian Packages > ? ? ? ? check_command ? ? ? ? ? check_nrpe!check_debian_updates > ? ? ? ? notification_interval ? 1440 > ? ? ? ? is_volatile ? ? ? ? ? ? 0 > ? ? ? ? check_period ? ? ? ? ? ?24x7 > ? ? ? ? normal_check_interval ? 5 > ? ? ? ? retry_check_interval ? ?1 > ? ? ? ? max_check_attempts ? ? ?10 > ? ? ? ? notification_period ? ? 24x7 > ? ? ? ? } > > Is this correct, if not, would someone be kind enough to point me on the > correct path so that i can achieve this check. It looks okay to me. As with any change to your config, you need to run "/usr/local/nagios/bin/nagios -v /usr/local/nagios/etc/nagios.cfg" to verify your configuration and make sure there are no warnings or errors. You could try running the check command from the command-line on your Nagios server (while logged in as the nagios user) to make sure it is likely to work okay: /usr/local/nagios/libexec/check_nrpe -H serverA -u check_debian_updates (replace "serverA" with the hostname or IP address of a host you want to test this on). If this test doesn't work, then you may need to look at your nrpe setup. Personally I wouldn't run this kind of test as frequently as this, but if your Nagios server isn't too stressed I don't suppose it matters much one way or the other. hth, Jim ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mmelin at gmail.com Tue Dec 29 13:24:09 2009 From: mmelin at gmail.com (Martin Melin) Date: Tue, 29 Dec 2009 13:24:09 +0100 Subject: peer review debian check In-Reply-To: <4B39EA35.7040809@gmail.com> References: <4B39EA35.7040809@gmail.com> Message-ID: Hello, This service definition will, assuming a default interval_length, execute the check_debian_updates NRPE check every 5 minutes but only notify you every 24 hours in case of failure. For this case, I'd suggest switching the values of notification_interval and check_interval (normal_check_interval is AFAIK v2, is that what you're running?) so that the check is executed once every 24 hours, but that you're notified every 5 minutes if the check returns non-OK. I don't know how your NRPE check is set up, but you might want to reconsider your max_check_attempts as well. If the check simply returns non-OK if there are updates available, you probably don't want to re-check for updates 10 times before notifying. I would also set flap_detection_enabled to 0 because you don't want Nagios to detect flapping with a check like this (otherwise the flap detection logic will consider the last 21 days of results) HTH, Martin Melin On Tue, Dec 29, 2009 at 12:38 PM, Brent Clark wrote: > Hiya > > I was hoping someone would be kind to peer review my service check. > > I got a nrpe check to check that the debian packages on my servers are > up to date. I would like for this check to run once a day. > > This is what I was able to come up with. > > define service{ > ? ? ? ? use ? ? ? ? ? ? ? ? ? ? generic-service > ? ? ? ? hostgroup_name ? ? ? ? ?debian-packages > ? ? ? ? service_description ? ? Debian Packages > ? ? ? ? check_command ? ? ? ? ? check_nrpe!check_debian_updates > ? ? ? ? notification_interval ? 1440 > ? ? ? ? is_volatile ? ? ? ? ? ? 0 > ? ? ? ? check_period ? ? ? ? ? ?24x7 > ? ? ? ? normal_check_interval ? 5 > ? ? ? ? retry_check_interval ? ?1 > ? ? ? ? max_check_attempts ? ? ?10 > ? ? ? ? notification_period ? ? 24x7 > ? ? ? ? } > > Is this correct, if not, would someone be kind enough to point me on the > correct path so that i can achieve this check. > > Kind Regards > Brent Clark > > ------------------------------------------------------------------------------ > This SF.Net email is sponsored by the Verizon Developer Community > Take advantage of Verizon's best-in-class app development support > A streamlined, 14 day to market process makes app distribution fast and easy > Join now and get one step closer to millions of Verizon customers > http://p.sf.net/sfu/verizon-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From georgyboy at gmail.com Tue Dec 29 16:01:13 2009 From: georgyboy at gmail.com (Jorge Gil) Date: Tue, 29 Dec 2009 16:01:13 +0100 Subject: I get emails but I can't fill them with any information from the specific alarms Message-ID: <605203aa0912290701u738c0c6bm56db55fbd2178891@mail.gmail.com> Hi all: First of all, thank you very much for your support to other users, I have gone thru hundreds of your emails, but still I couldnt find the solution to my case... I have set up email notifications in nagios, and I do get emails, but I cant make Nagios send me any information like $hostname$ in the body of the email. When I tried the typical configuration for nagios, like: # 'notify-service-by-email' command definition define command{ command_name notify-service-by-email command_line /usr/bin/printf "%b" "***** Nagios *****\n\nNotification Type: $NOTIFICATIONTYPE$\n\nService: $SERVICEDESC$\nHost: $HOSTALIAS$\nAddress: $HOSTADDRESS$\nState: $SERVICESTATE$\n\nDate/Time: $LONGDATETIME$\n\nAdditional Info:\n\n$SERVICEOUTPUT$" | /usr/bin/mail -s "** $NOTIFICATIONTYPE$ Service Alert: $HOSTALIAS$/$SERVICEDESC$ is $SERVICESTATE$ **" $CONTACTEMAIL$ } i wasnt getting emails. Now a simplified version of the command: # 'notify-service-by-email' command definition define command{ command_name notify-service-by-email command_line printf "%b" "$HOSTADDRESS$" | /usr/bin/mail -s Subject_of_email my_email_address } does send me emails, but it doesnt translate the hostaddress, i get just a dollar sign in the body of the email. I have tried to put hostaddress in small letters, with and without quotation marks, redone all the possible files that call the alert... and I am lost. What can be wrong, please? How can I make Nagios translate those variables into the actual information they contain? (for the service checks, they actually work, I see the state change in the Nagios webserver, individually for all the hosts that I set up) Thanks a lot, Jorge -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mmelin at gmail.com Tue Dec 29 16:09:46 2009 From: mmelin at gmail.com (Martin Melin) Date: Tue, 29 Dec 2009 16:09:46 +0100 Subject: I get emails but I can't fill them with any information from the specific alarms In-Reply-To: <605203aa0912290701u738c0c6bm56db55fbd2178891@mail.gmail.com> References: <605203aa0912290701u738c0c6bm56db55fbd2178891@mail.gmail.com> Message-ID: If your nagios.cfg sets enable_environment_macros to 0, that would be the problem. Otherwise, possibly HOSTADDRESS does not have a value for this service. Have you tried sending just $SERVICESTATE$ instead? (a service notification will always have a value for the macro) Regards Martin Melin On Tue, Dec 29, 2009 at 4:01 PM, Jorge Gil wrote: > Hi all: > > First of all, thank you very much for your support to other users, I have > gone thru hundreds of your emails, but still I couldnt find the solution to > my case... > > I have set up email notifications in nagios, and I do get emails, but I cant > make Nagios send me any information like $hostname$ in the body of the > email. When I tried the typical configuration for nagios, like: > > > # 'notify-service-by-email' command definition > > define command{ > > command_name notify-service-by-email > > command_line /usr/bin/printf "%b" "***** Nagios *****\n\nNotification Type: > $NOTIFICATIONTYPE$\n\nService: $SERVICEDESC$\nHost: $HOSTALIAS$\nAddress: > $HOSTADDRESS$\nState: $SERVICESTATE$\n\nDate/Time: > $LONGDATETIME$\n\nAdditional Info:\n\n$SERVICEOUTPUT$" | /usr/bin/mail -s > "** $NOTIFICATIONTYPE$ Service Alert: $HOSTALIAS$/$SERVICEDESC$ is > $SERVICESTATE$ **" $CONTACTEMAIL$ > > } > > > > i wasnt getting emails. > > Now a simplified version of the command: > > > > # 'notify-service-by-email' command definition > > define command{ > > command_name notify-service-by-email > > command_line printf "%b" "$HOSTADDRESS$" | /usr/bin/mail -s Subject_of_email > my_email_address > > } > > does send me emails, but it doesnt translate the hostaddress, i get just?a > dollar sign in the body of the email. > > I have tried to put hostaddress in small letters, with and without quotation > marks, redone all the possible files that call the alert... and I am lost. > > What can be wrong, please? How can I make Nagios translate those variables > into the actual information they contain? (for the service checks, they > actually work, I see the state change in the Nagios webserver, individually > for all the hosts that I set up) > > Thanks a lot, > > Jorge > > ------------------------------------------------------------------------------ > This SF.Net email is sponsored by the Verizon Developer Community > Take advantage of Verizon's best-in-class app development support > A streamlined, 14 day to market process makes app distribution fast and easy > Join now and get one step closer to millions of Verizon customers > http://p.sf.net/sfu/verizon-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From georgyboy at gmail.com Tue Dec 29 16:25:47 2009 From: georgyboy at gmail.com (Jorge Gil) Date: Tue, 29 Dec 2009 16:25:47 +0100 Subject: I get emails but I can't fill them with any information from the specific alarms In-Reply-To: References: <605203aa0912290701u738c0c6bm56db55fbd2178891@mail.gmail.com> Message-ID: <605203aa0912290725h6c02e71v91d15c24d01beb91@mail.gmail.com> Hi again! Thank you for the quick answer, to begin with... enable_environment_macros was already 1... I have tried $SERVICESTATE$ ($hoststate$ for the host notification), and i got "WARNING" in the email body... you cant believe how happy that has made me! I copy one service definition define host{ use generic-switch host_name N7-Rack2 alias Backbone N7 icon_image multilayer_switch.gif statusmap_image multilayer_switch.gd2 address 192.168.4.8 parents ns500.bne.es } define service{ use generic-service host_name nagios, BNS31, BNS32, BNS49, BNS50, E1-MUSEO, E1-MUSICA_1, E1-MUSICA_2, E1-PROCESO_1, E1-PROCESO_2, E1-PROCESO_3, E1-PUBLICACIONES, E1-Ref_Biblio1, E1-Ref_Biblio2, E1-BELLAS_ARTES, E1-GERENCIA_1, E1-GERENCIA_2, E1-JARDIN_NORTE, E1-JARDIN_SUR, E1-JORGE_JUAN, E1-LABORATORIO, E1-MANUSCRITOS, E1-Salon italiano, E1-SALON_LECTURA, E1-UCI, N7-Rack2, N7-Rack3, ns500.bne.es, Salida LAN1 privado, Salida LAN2 publico, router alcobendas principal, Rediris, Router interministerial en BNE, Router interministerial en Cultura, Balanceador minos.bne.es, Bluecoat 1, Bluecoat 2, SW1 Fujitsu, SW2 Fujitsu, SW3 Fujitsu, SW4 Fujitsu (and a few lines below...) service_description PING check_command check_ping!200.0,20%!600.0,60% normal_check_interval 5 retry_check_interval 1 } and what i see in the nagios webserver: Command Name Command Line check-host-alive $USER1$/check_ping -H $HOSTADDRESS$ -w 3000.0,80% -c 5000.0,100% -p 5 check_dhcp $USER1$/check_dhcp $ARG1$ check_ftp $USER1$/check_ftp -H $HOSTADDRESS$ $ARG1$ check_hpjd $USER1$/check_hpjd -H $HOSTADDRESS$ $ARG1$ check_http $USER1$/check_http -I $HOSTADDRESS$ $ARG1$ check_imap $USER1$/check_imap -H $HOSTADDRESS$ $ARG1$ check_local_disk $USER1$/check_disk -w $ARG1$ -c $ARG2$ -p $ARG3$ check_local_load $USER1$/check_load -w $ARG1$ -c $ARG2$ check_local_mrtgtraf $USER1$/check_mrtgtraf -F $ARG1$ -a $ARG2$ -w $ARG3$ -c $ARG4$ -e $ARG5$ check_local_procs $USER1$/check_procs -w $ARG1$ -c $ARG2$ -s $ARG3$ check_local_swap $USER1$/check_swap -w $ARG1$ -c $ARG2$ check_local_users $USER1$/check_users -w $ARG1$ -c $ARG2$ check_nt $USER1$/check_nt -H $HOSTADDRESS$ -p 12489 -v $ARG1$ $ARG2$ check_ping $USER1$/check_ping -H $HOSTADDRESS$ -w $ARG1$ -c $ARG2$ -p 5 check_pop $USER1$/check_pop -H $HOSTADDRESS$ $ARG1$ check_smtp $USER1$/check_smtp -H $HOSTADDRESS$ $ARG1$ check_snmp $USER1$/check_snmp -H $HOSTADDRESS$ $ARG1$ check_ssh $USER1$/check_ssh $ARG1$ $HOSTADDRESS$ check_tcp $USER1$/check_tcp -H $HOSTADDRESS$ -p $ARG1$ $ARG2$ check_udp $USER1$/check_udp -H $HOSTADDRESS$ -p $ARG1$ $ARG2$ notify-host-by-email printf $hoststate$|mail -s Testhostminusculas2 georgyboy at gmail.com notify-service-by-email echo $SERVICESTATE$|mail -s Testservicesincomillas georgyboy at gmail.com process-host-perfdata /usr/bin/printf "%b" "$LASTHOSTCHECK$\t$HOSTNAME$\t$HOSTSTATE$\t$HOSTATTEMPT$\t$HOSTSTATETYPE$\t$HOSTEXECUTIONTIME$\t$HOSTOUTPUT$\t$HOSTPERFDATA$\n" >> /usr/local/nagios/var/host-perfdata.out process-service-perfdata /usr/bin/printf "%b" "$LASTSERVICECHECK$\t$HOSTNAME$\t$SERVICEDESC$\t$SERVICESTATE$\t$SERVICEATTEMPT$\t$SERVICESTATETYPE$\t$SERVICEEXECUTIONTIME$\t$SERVICELATENCY$\t$SERVICEOUTPUT$\t$SERVICEPERFDATA$\n" >> /usr/local/nagios/var/service-perfdata.out any idea how to get all those dollar values apart of that, please? thanks so much! jorge On 12/29/09, Martin Melin wrote: > > If your nagios.cfg sets enable_environment_macros to 0, that would be > the problem. > > Otherwise, possibly HOSTADDRESS does not have a value for this > service. Have you tried sending just $SERVICESTATE$ instead? (a > service notification will always have a value for the macro) > > Regards > Martin Melin > > On Tue, Dec 29, 2009 at 4:01 PM, Jorge Gil wrote: > > Hi all: > > > > First of all, thank you very much for your support to other users, I have > > gone thru hundreds of your emails, but still I couldnt find the solution > to > > my case... > > > > I have set up email notifications in nagios, and I do get emails, but I > cant > > make Nagios send me any information like $hostname$ in the body of the > > email. When I tried the typical configuration for nagios, like: > > > > > > # 'notify-service-by-email' command definition > > > > define command{ > > > > command_name notify-service-by-email > > > > command_line /usr/bin/printf "%b" "***** Nagios *****\n\nNotification > Type: > > $NOTIFICATIONTYPE$\n\nService: $SERVICEDESC$\nHost: $HOSTALIAS$\nAddress: > > $HOSTADDRESS$\nState: $SERVICESTATE$\n\nDate/Time: > > $LONGDATETIME$\n\nAdditional Info:\n\n$SERVICEOUTPUT$" | /usr/bin/mail -s > > "** $NOTIFICATIONTYPE$ Service Alert: $HOSTALIAS$/$SERVICEDESC$ is > > $SERVICESTATE$ **" $CONTACTEMAIL$ > > > > } > > > > > > > > i wasnt getting emails. > > > > Now a simplified version of the command: > > > > > > > > # 'notify-service-by-email' command definition > > > > define command{ > > > > command_name notify-service-by-email > > > > command_line printf "%b" "$HOSTADDRESS$" | /usr/bin/mail -s > Subject_of_email > > my_email_address > > > > } > > > > does send me emails, but it doesnt translate the hostaddress, i get just > a > > dollar sign in the body of the email. > > > > I have tried to put hostaddress in small letters, with and without > quotation > > marks, redone all the possible files that call the alert... and I am > lost. > > > > What can be wrong, please? How can I make Nagios translate those > variables > > into the actual information they contain? (for the service checks, they > > actually work, I see the state change in the Nagios webserver, > individually > > for all the hosts that I set up) > > > > Thanks a lot, > > > > Jorge > > > > > ------------------------------------------------------------------------------ > > This SF.Net email is sponsored by the Verizon Developer Community > > Take advantage of Verizon's best-in-class app development support > > A streamlined, 14 day to market process makes app distribution fast and > easy > > Join now and get one step closer to millions of Verizon customers > > http://p.sf.net/sfu/verizon-dev2dev > > _______________________________________________ > > Nagios-users mailing list > > Nagios-users at lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when > reporting > > any issue. > > ::: Messages without supporting info will risk being sent to /dev/null > > > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Tue Dec 29 17:15:59 2009 From: marc at ena.com (Marc Powell) Date: Tue, 29 Dec 2009 10:15:59 -0600 Subject: I get emails but I can't fill them with any information from the specific alarms In-Reply-To: References: <605203aa0912290701u738c0c6bm56db55fbd2178891@mail.gmail.com> Message-ID: <181682F9-1674-4EA4-AB2C-DB02C536BFC0@ena.com> On Dec 29, 2009, at 9:09 AM, Martin Melin wrote: > If your nagios.cfg sets enable_environment_macros to 0, that would be > the problem. Actually, this is unrelated. The standard way of passing macros in the command_line doesn't rely on them being available as environment variables. -- Marc ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Tue Dec 29 17:23:05 2009 From: marc at ena.com (Marc Powell) Date: Tue, 29 Dec 2009 10:23:05 -0600 Subject: I get emails but I can't fill them with any information from the specific alarms In-Reply-To: <605203aa0912290701u738c0c6bm56db55fbd2178891@mail.gmail.com> References: <605203aa0912290701u738c0c6bm56db55fbd2178891@mail.gmail.com> Message-ID: <3F7F28F5-5947-44BA-A98C-FDE71AD84EB4@ena.com> On Dec 29, 2009, at 9:01 AM, Jorge Gil wrote: > # 'notify-service-by-email' command definition > > define command{ > > command_name notify-service-by-email > > command_line /usr/bin/printf "%b" "***** Nagios *****\n\nNotification Type: $NOTIFICATIONTYPE$\n\nService: $SERVICEDESC$\nHost: $HOSTALIAS$\nAddress: $HOSTADDRESS$\nState: $SERVICESTATE$\n\nDate/Time: $LONGDATETIME$\n\nAdditional Info:\n\n$SERVICEOUTPUT$" | /usr/bin/mail -s "** $NOTIFICATIONTYPE$ Service Alert: $HOSTALIAS$/$SERVICEDESC$ is $SERVICESTATE$ **" $CONTACTEMAIL$ > > } This looks typical. > i wasnt getting emails. Emails for what and how were they being instigated? Did you have a service failure and not receive a notification? Does nagios.log show NOTIFICATION entries? Does /var/log/maillog show an attempt to send? > Now a simplified version of the command: > > > # 'notify-service-by-email' command definition > > define command{ > > command_name notify-service-by-email > > command_line printf "%b" "$HOSTADDRESS$" | /usr/bin/mail -s Subject_of_email my_email_address > > } > > does send me emails, but it doesnt translate the hostaddress, i get just a dollar sign in the body of the email. A '$' in the body means that you either have a typo in the $MACRONAME$ or the macro isn't valid for the type of notification you are trying to send. $HOSTADDRESS$ is valid for all notification types so you should verify you don't have a typo in the macro name. http://nagios.sourceforge.net/docs/3_0/macrolist.html An alternative is that you aren't actually calling these as notification commands but as event_handlers. Using a notification command as an event_handler is not proper. It should be listed in the appropriate contact{} definition as host/service_notification_command (off the top of my head). -- Marc ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From richard.gliebe at fhv.at Thu Dec 31 08:42:11 2009 From: richard.gliebe at fhv.at (Richard Gliebe) Date: Thu, 31 Dec 2009 08:42:11 +0100 Subject: Cisco Aironet 1200 AccessPoints Message-ID: <4B3C55D3.1080808@fhv.at> Hi all, I want to monitor our Cisco Aironet 1200 AccessPoints with snmp. particularly how many users are connected to each AP and the traffic which is passing the radio interface. Has someone done this or are there some templates available? Thanks in advance Richard Uah, we are running nagios 3.0.6 on a FreeBSD 7.2-STABLE box. nagios-3.0.6_3 Extremely powerful network monitoring system nagios-plugins-1.4.13,1 Plugins for Nagios nagios-radauth-plugin-1.00_1 Nagios plugin for checking radius server ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null