From aleksey.tsalolikhin at gmail.com Thu Dec 1 00:11:19 2005 From: aleksey.tsalolikhin at gmail.com (Aleksey Tsalolikhin) Date: Wed, 30 Nov 2005 15:11:19 -0800 Subject: trouble getting started: getting error 127 out of bounds on check_http Message-ID: <72a19efd0511301511p5b5851a3t2f3bd3e40f612096@mail.gmail.com> Hi. I am trying to install Nagios for the first time. I have a host, "hockey", I want to run an HTTP check on. I've got everything configured correctly near as I can tell, but I am getting: Return code of 127 for check of service 'HTTP' on host 'hockey' was out of bounds. Make sure the plugin you're trying to run actually exists. Please help. $ ls -l /usr/local/nagios/etc/*cfg -rw-r--r-- 1 nagios nagios 17496 Nov 29 18:06 /usr/local/nagios/etc/cgi.cfg -rw-r--r-- 1 nagios nagios 4607 Nov 29 19:04 /usr/local/nagios/etc/checkcommands.cfg -rw-r--r-- 1 nagios nagios 789 Nov 29 19:12 /usr/local/nagios/etc/contactgroups.cfg -rw-r--r-- 1 nagios nagios 1076 Nov 29 19:11 /usr/local/nagios/etc/contacts.cfg -rw-r--r-- 1 nagios nagios 667 Nov 29 19:14 /usr/local/nagios/etc/dependencies.cfg -rw-r--r-- 1 nagios nagios 1589 Nov 29 19:13 /usr/local/nagios/etc/escalations.cfg -rw-r--r-- 1 nagios nagios 832 Nov 29 19:03 /usr/local/nagios/etc/hostgroups.cfg -rw-r--r-- 1 nagios nagios 2196 Nov 29 19:02 /usr/local/nagios/etc/hosts.cfg -rw-r--r-- 1 nagios nagios 4246 Nov 29 18:36 /usr/local/nagios/etc/misccommands.cfg -rw-r--r-- 1 nagios nagios 21329 Nov 30 16:44 /usr/local/nagios/etc/nagios.cfg -rw-r--r-- 1 nagios nagios 3069 Nov 29 18:36 /usr/local/nagios/etc/resource.cfg -rw-r--r-- 1 nagios nagios 2907 Nov 29 18:50 /usr/local/nagios/etc/services.cfg -rw-r--r-- 1 nagios nagios 1589 Nov 29 18:37 /usr/local/nagios/etc/timeperiods.cfg $ checkcommands.cfg contains: # 'check_http' command definition define command{ command_name check_http command_line $USER1$/check_http -H $HOSTADDRESS$ } services.cfg contains: define service{ use generic-service ; Name of service template to use host_name hockey service_description HTTP is_volatile 0 check_period 24x7 max_check_attempts 3 normal_check_interval 5 retry_check_interval 1 contact_groups admins notification_interval 120 notification_period 24x7 notification_options w,u,c,r check_command check_http } resource.cfg contains: $USER1$=/usr/local/nagios/libexec I can run check_http manually: $ id uid=504(nagios) gid=505(nagios) groups=505(nagios) $ /usr/local/nagios/libexec/check_http -H hockey HTTP OK HTTP/1.1 200 OK - 14678 bytes in 0.035 seconds |time=0.035396s;;;0.000000 size=14678B;;;0 $ nagios -v looks happy: Total Warnings: 0 Total Errors: 0 Everything under /usr/local/nagios is owned by user nagios and group nagios, except for the lockfile, /usr/local/nagios/var/nagios.lock, which is owned by root, probably from the start-up script. Everything seems to be configured correctly. However, service checks are failing: [1133392048] Warning: Return code of 127 for check of service 'HTTP' on host 'hockey' was out of bounds. Make sure the plugin you're trying to run actually exists. Please advise. I've read FAQ ID F0017 and searched the mailing list archives and Google. $ uname -a Linux my.node.name 2.4.21-32.0.1.ELsmp #1 SMP Tue May 17 17:52:23 EDT 2005 i686 i686 i386 GNU/Linux $ Thanks, Aleksey ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mfatihakbulut at gmail.com Thu Dec 1 00:31:12 2005 From: mfatihakbulut at gmail.com (Mehmet Fatih AKBULUT) Date: Thu, 1 Dec 2005 01:31:12 +0200 Subject: trouble getting started: getting error 127 out of bounds on check_http In-Reply-To: <72a19efd0511301511p5b5851a3t2f3bd3e40f612096@mail.gmail.com> References: <72a19efd0511301511p5b5851a3t2f3bd3e40f612096@mail.gmail.com> Message-ID: hi Aleykey. i myself have the same problem for check_ldaps. it says : return code of 127 is out of bounds. plugin may be missing :p and looking for help for 2 days in mailing lists. and received just RTFM (read the fuckin manual) so far. i think people in here dont like to help others much :) if u find or receive anything helps you solve this problem please forward it to me too. because i really need the solution too :) (ps: u defined generic-service in your service.cfg ? as i see u sent a well prepared mail body to the mailing list. but in your service.cfg u just mentioned the service description. maybe this is not the reason of your problem. its just an idea) -------------- next part -------------- An HTML attachment was scrubbed... URL: From aleksey.tsalolikhin at gmail.com Thu Dec 1 00:42:31 2005 From: aleksey.tsalolikhin at gmail.com (Aleksey Tsalolikhin) Date: Wed, 30 Nov 2005 15:42:31 -0800 Subject: trouble getting started: getting error 127 out of bounds on check_http In-Reply-To: <72a19efd0511301511p5b5851a3t2f3bd3e40f612096@mail.gmail.com> References: <72a19efd0511301511p5b5851a3t2f3bd3e40f612096@mail.gmail.com> Message-ID: <72a19efd0511301542l1d2e53d8ge1471057e2290c90@mail.gmail.com> I have more date: strace shows the following: 32076 execve("/bin/sh", ["sh", "-c", "/check_http -H 70.86.102.98"], [/* 25 vars */]) = 0 32076 execve("/check_http", ["/check_http", "-H", "70.86.102.98"], [/* 25 vars */]) = -1 ENOENT (No such file or directory) 32076 stat64("/check_http", 0xbfffba40) = -1 ENOENT (No such file or directory) 32076 stat64("/check_http", 0xbfffb8a0) = -1 ENOENT (No such file or directory) 32076 write(2, "sh: line 1: /check_http: No such"..., 51) = 51 so looks like the path isn't getting passed along. Please advise. On 11/30/05, Aleksey Tsalolikhin wrote: > Hi. I am trying to install Nagios for the first time. I have a host, > "hockey", I want to run an HTTP check on. I've got everything > configured correctly near as I can tell, but I am getting: > > Return code of 127 for check of service 'HTTP' on host 'hockey' was > out of bounds. Make sure the plugin you're trying to run actually > exists. > > Please help. > > $ ls -l /usr/local/nagios/etc/*cfg > -rw-r--r-- 1 nagios nagios 17496 Nov 29 18:06 > /usr/local/nagios/etc/cgi.cfg > -rw-r--r-- 1 nagios nagios 4607 Nov 29 19:04 > /usr/local/nagios/etc/checkcommands.cfg > -rw-r--r-- 1 nagios nagios 789 Nov 29 19:12 > /usr/local/nagios/etc/contactgroups.cfg > -rw-r--r-- 1 nagios nagios 1076 Nov 29 19:11 > /usr/local/nagios/etc/contacts.cfg > -rw-r--r-- 1 nagios nagios 667 Nov 29 19:14 > /usr/local/nagios/etc/dependencies.cfg > -rw-r--r-- 1 nagios nagios 1589 Nov 29 19:13 > /usr/local/nagios/etc/escalations.cfg > -rw-r--r-- 1 nagios nagios 832 Nov 29 19:03 > /usr/local/nagios/etc/hostgroups.cfg > -rw-r--r-- 1 nagios nagios 2196 Nov 29 19:02 > /usr/local/nagios/etc/hosts.cfg > -rw-r--r-- 1 nagios nagios 4246 Nov 29 18:36 > /usr/local/nagios/etc/misccommands.cfg > -rw-r--r-- 1 nagios nagios 21329 Nov 30 16:44 > /usr/local/nagios/etc/nagios.cfg > -rw-r--r-- 1 nagios nagios 3069 Nov 29 18:36 > /usr/local/nagios/etc/resource.cfg > -rw-r--r-- 1 nagios nagios 2907 Nov 29 18:50 > /usr/local/nagios/etc/services.cfg > -rw-r--r-- 1 nagios nagios 1589 Nov 29 18:37 > /usr/local/nagios/etc/timeperiods.cfg > $ > > checkcommands.cfg contains: > > # 'check_http' command definition > define command{ > command_name check_http > command_line $USER1$/check_http -H $HOSTADDRESS$ > } > > > services.cfg contains: > > define service{ > use generic-service ; Name > of service template to use > > host_name hockey > service_description HTTP > is_volatile 0 > check_period 24x7 > max_check_attempts 3 > normal_check_interval 5 > retry_check_interval 1 > contact_groups admins > notification_interval 120 > notification_period 24x7 > notification_options w,u,c,r > check_command check_http > } > > > resource.cfg contains: > > $USER1$=/usr/local/nagios/libexec > > > > I can run check_http manually: > $ id > uid=504(nagios) gid=505(nagios) groups=505(nagios) > $ /usr/local/nagios/libexec/check_http -H hockey > HTTP OK HTTP/1.1 200 OK - 14678 bytes in 0.035 seconds > |time=0.035396s;;;0.000000 size=14678B;;;0 > $ > > nagios -v looks happy: > > Total Warnings: 0 > Total Errors: 0 > > > > Everything under /usr/local/nagios is owned by user nagios and group > nagios, except for the lockfile, > /usr/local/nagios/var/nagios.lock, which is owned by root, probably > from the start-up script. > > Everything seems to be configured correctly. However, service checks > are failing: > > [1133392048] Warning: Return code of 127 for check of service 'HTTP' > on host 'hockey' was out of bounds. Make sure the plugin you're trying > to run actually exists. > > Please advise. I've read FAQ ID F0017 and searched the mailing list > archives and Google. > > $ uname -a > Linux my.node.name 2.4.21-32.0.1.ELsmp #1 SMP Tue May 17 17:52:23 EDT > 2005 i686 i686 i386 GNU/Linux > $ > > Thanks, > Aleksey > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From s.shipway at auckland.ac.nz Thu Dec 1 01:00:54 2005 From: s.shipway at auckland.ac.nz (Steve Shipway) Date: Thu, 1 Dec 2005 13:00:54 +1300 Subject: trouble getting started: getting error 127 out of bounds on check_http In-Reply-To: <72a19efd0511301542l1d2e53d8ge1471057e2290c90@mail.gmail.com> References: <72a19efd0511301542l1d2e53d8ge1471057e2290c90@mail.gmail.com> Message-ID: <05c801c5f60a$4c759f90$01fea8c0@itss.auckland.ac.nz> >sh: line 1: /check_http: No such file or directory ... >> Return code of 127 for check of service 'HTTP' on host 'hockey' was >> out of bounds. Make sure the plugin you're trying to run actually >> exists. ... >> # 'check_http' command definition >> define command{ >> command_name check_http >> command_line $USER1$/check_http -H $HOSTADDRESS$ >> } This shows that your Nagios installation is trying to call '/check_http', which does not exist. A non-existant or non-executable plugin will always give you a code of 127 (which is an exec failure). Nagios is telling you that the plugin does not exist where defined. Looking at your definition of check_http, it would seem that you have maybe forgotten to set $USER1$ ? You have said $USER1$/check_http, but the error emssage is for /check_http. Check your definitions, and run the config check on your entire setup. Some people get more polite responses from the mailing list than others due to the way they asked, and the information they give :) Steve ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From aleksey.tsalolikhin at gmail.com Thu Dec 1 01:07:46 2005 From: aleksey.tsalolikhin at gmail.com (Aleksey Tsalolikhin) Date: Wed, 30 Nov 2005 16:07:46 -0800 Subject: trouble getting started: getting error 127 out of bounds on check_http In-Reply-To: <05c801c5f60a$4c759f90$01fea8c0@itss.auckland.ac.nz> References: <72a19efd0511301542l1d2e53d8ge1471057e2290c90@mail.gmail.com> <05c801c5f60a$4c759f90$01fea8c0@itss.auckland.ac.nz> Message-ID: <72a19efd0511301607w35e0b7d7nd945aa08b667b854@mail.gmail.com> On 11/30/05, Steve Shipway wrote: > > This shows that your Nagios installation is trying to call '/check_http', > which does not exist. A non-existant or non-executable plugin will always > give you a code of 127 (which is an exec failure). Nagios is telling you > that the plugin does not exist where defined. > > Looking at your definition of check_http, it would seem that you have maybe > forgotten to set $USER1$ ? You have said $USER1$/check_http, but the error > emssage is for /check_http. Check your definitions, and run the config check > on your entire setup. Dear Steve, Thank you for your reply. I do have USER1 set in my resource.cfg: nagios at mynode$ grep USER1 resource.cfg # Nagios supports up to 32 $USERx$ macros ($USER1$ through $USER32$) # Sets $USER1$ to be the path to the plugins $USER1$=/usr/local/nagios/libexec nagios at mynode$ However this does not seem to be getting passed along. I've resorted to replacing $USER1$ with /usr/local/nagios/libexec in my checkcommands.cfg which got me to the next stage (check_http is now working and I am playing with configuring additional plug-ins). I don't think it will hurt me that I've bypassed the macro. If I am mistaken, please let me know. Truly, -at ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From chris at chrisburgess.com.au Thu Dec 1 01:39:01 2005 From: chris at chrisburgess.com.au (Chris Burgess) Date: Thu, 01 Dec 2005 11:39:01 +1100 Subject: HTTP WARNING: HTTP/1.1 401 Unauthorized In-Reply-To: <60e826da0511301630r264df7ddx3d0924b184f539df@mail.gmail.com> References: <60e826da0511301630r264df7ddx3d0924b184f539df@mail.gmail.com> Message-ID: On 1/12/05 11:30 AM, "Rene Nelson" wrote: > > My HTTP check on a windows 2003 IIS 6 server suddenly started reporting the > following. HTTP WARNING: HTTP/1.1 401 Unauthorized > > Does anyone have any suggestion as to the cause? Thanks in advance for any > help. > > When going to the URL via lynx I get: > > You are not authorized to view this page > > You do not have permission to view this directory or page using the > credentials that you supplied because your Web browser is sending a > WWW-Authenticate header field that the Web server is not configured to > accept. > > This worked fine until I restarted nagios today. Any suggestions? > > Hi Rene, If you can't view the page from Lynx, I would say it's not likely related to Nagios. Checking the Event Viewer on the IIS server might turn up clues that might help. The following link has a bit more info. http://www.chat11.com/You_Are_Not_Authorized_To_View_This_Page_Error_From_II S_When_Using_Perl_And_LWP_To_Browse_Localhost Cheers, Chris ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From neririn at gmail.com Thu Dec 1 01:30:16 2005 From: neririn at gmail.com (Rene Nelson) Date: Wed, 30 Nov 2005 17:30:16 -0700 Subject: HTTP WARNING: HTTP/1.1 401 Unauthorized Message-ID: <60e826da0511301630r264df7ddx3d0924b184f539df@mail.gmail.com> My HTTP check on a windows 2003 IIS 6 server suddenly started reporting the following. HTTP WARNING: HTTP/1.1 401 Unauthorized Does anyone have any suggestion as to the cause? Thanks in advance for any help. When going to the URL via lynx I get: You are not authorized to view this page You do not have permission to view this directory or page using the credentials that you supplied because your Web browser is sending a WWW-Authenticate header field that the Web server is not configured to accept. This worked fine until I restarted nagios today. Any suggestions? -------------- next part -------------- An HTML attachment was scrubbed... URL: From s.shipway at auckland.ac.nz Thu Dec 1 01:44:29 2005 From: s.shipway at auckland.ac.nz (Steve Shipway) Date: Thu, 1 Dec 2005 13:44:29 +1300 Subject: trouble getting started: getting error 127 out of bounds on check_http In-Reply-To: <72a19efd0511301607w35e0b7d7nd945aa08b667b854@mail.gmail.com> References: <72a19efd0511301607w35e0b7d7nd945aa08b667b854@mail.gmail.com> Message-ID: <05ce01c5f610$638cab00$01fea8c0@itss.auckland.ac.nz> >> Looking at your definition of check_http, it would seem that >you have maybe forgotten to set $USER1$ ? ... >nagios at mynode$ grep USER1 resource.cfg ># Nagios supports up to 32 $USERx$ macros ($USER1$ through $USER32$) ># Sets $USER1$ to be the path to the plugins >$USER1$=/usr/local/nagios/libexec > >However this does not seem to be getting passed along. Check to make sure that the resource.cfg file location is correctly defined in your nagios.cfg. If nagios can't find it, then it will not be processed. The part shown above seems to be the correct syntax. However, you've found the obvious workaround, of specifying the paths explicitly in the commands.cfg files. This should cause no problems in operation, although it may make the system very slightly more work to manage. Steve ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From aleksey.tsalolikhin at gmail.com Thu Dec 1 01:48:48 2005 From: aleksey.tsalolikhin at gmail.com (Aleksey Tsalolikhin) Date: Wed, 30 Nov 2005 16:48:48 -0800 Subject: trouble getting started: getting error 127 out of bounds on check_http In-Reply-To: <05ce01c5f610$638cab00$01fea8c0@itss.auckland.ac.nz> References: <72a19efd0511301607w35e0b7d7nd945aa08b667b854@mail.gmail.com> <05ce01c5f610$638cab00$01fea8c0@itss.auckland.ac.nz> Message-ID: <72a19efd0511301648g669c16acu52da825c9bf0939c@mail.gmail.com> On 11/30/05, Steve Shipway wrote: > > Check to make sure that the resource.cfg file location is correctly defined > in your nagios.cfg. If nagios can't find it, then it will not be processed. > The part shown above seems to be the correct syntax. Sure enough, I had resource.cfg commented out in nagios.cfg. Mystery explained. Thanks, Steve! Best, -at ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From s.shipway at auckland.ac.nz Thu Dec 1 01:48:47 2005 From: s.shipway at auckland.ac.nz (Steve Shipway) Date: Thu, 1 Dec 2005 13:48:47 +1300 Subject: HTTP WARNING: HTTP/1.1 401 Unauthorized In-Reply-To: <60e826da0511301630r264df7ddx3d0924b184f539df@mail.gmail.com> References: <60e826da0511301630r264df7ddx3d0924b184f539df@mail.gmail.com> Message-ID: <05cf01c5f610$fd00e0d0$01fea8c0@itss.auckland.ac.nz> >My HTTP check on a windows 2003 IIS 6 server suddenly started reporting the following. >HTTP WARNING: HTTP/1.1 401 Unauthorized Your web server being monitored is giving an 'unauthorised' to your check command. This is probably because the IIS6 server has been reconfigured to require authentication to view the page. Since you get the same from lynx (and not a prompt for uername/password) I would guess that someone has configured it to require the 'Evil-Microsoft-NTLM' Authentication... nothing can support this sort of authentication except IE due to various patents and secrets (and the fact that it hooks into your windows client). Get the IIS6 admin to enable normal authentication as well or instead, or disable authentication altogether. You can pass auth credentials to the check_http plugin, if necessary, but it can only support basic authentication. Steve ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From aleksey.tsalolikhin at gmail.com Thu Dec 1 03:23:26 2005 From: aleksey.tsalolikhin at gmail.com (Aleksey Tsalolikhin) Date: Wed, 30 Nov 2005 18:23:26 -0800 Subject: trouble with check_mysql and USER macros Message-ID: <72a19efd0511301823r135b0166ra98503096df8e153@mail.gmail.com> Hi again. I am trying to configure the check_mysql plugin to check my database. In checkcommands.cfg, I've put the following command line for check_mysql so I can specify the IP address, database name, username and password: command_line $USER1$/check_mysql -H $ARG1$ -d $ARG2$ -u $ARG3$ -p $ARG4$ In services.cfg, I put the following in the service definition for this service: check_command check_mysql!192.168.160.2!mydbname!mydbusername!mydbpassword and restarted Nagios, and it worked. The Nagios web page displayed the database statistics. However, when I tried changing that to check_command check_mysql!192.168.160.2!mydbname!mydbusername!$USER4$ and setting $USER4$ in resource.cfg, I get error Access denied for user 'mydbusername'@'192.168.160.1' (using password: YES) note: 192.168.160.1 is the ip address of the machine Nagios is running on. Different from the machine that the database is on. very strange that the ip address changed. I double checked services.cfg and 192.168.160.2 is still in there. Am I approaching this the right way in terms of configuring it? I am going to replace check_mysql with a little script that logs the arguments it is called with, so I can get a better idea of what's happenning. New Nagios User. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From hvdkooij at vanderkooij.org Thu Dec 1 07:47:26 2005 From: hvdkooij at vanderkooij.org (Hugo van der Kooij) Date: Thu, 1 Dec 2005 07:47:26 +0100 (CET) Subject: trouble getting started: getting error 127 out of bounds on check_http In-Reply-To: References: <72a19efd0511301511p5b5851a3t2f3bd3e40f612096@mail.gmail.com> Message-ID: On Thu, 1 Dec 2005, Mehmet Fatih AKBULUT wrote: > hi Aleykey. > i myself have the same problem for check_ldaps. > it says : return code of 127 is out of bounds. plugin may be missing :p > and looking for help for 2 days in mailing lists. > and received just RTFM (read the fuckin manual) so far. > i think people in here dont like to help others much :) > if u find or receive anything helps you solve this problem please forward it > to me too. > because i really need the solution too :) Did YOU install a check_ldaps command? Because it is not a default service in one of the sample configs I have seen untill today. As shown in the manual (http://nagios.sourceforge.net/docs/2_0/beginners.html) you need time to install this or hire someone to do it for you. The nagios setup is quite logical but you need to put in work, a lot of work before it runs with all the service checks because some service checks must be written by you if no one else did it. You should not take things for granted or assume they are present just because you like them to be. Untill now you have not shown to actually understand how Nagios works or how you need to configure it. Hugo. -- I hate duplicates. Just reply to the relevant mailinglist. hvdkooij at vanderkooij.org http://hvdkooij.xs4all.nl/ Don't meddle in the affairs of magicians, for they are subtle and quick to anger. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From hvdkooij at vanderkooij.org Thu Dec 1 07:54:24 2005 From: hvdkooij at vanderkooij.org (Hugo van der Kooij) Date: Thu, 1 Dec 2005 07:54:24 +0100 (CET) Subject: trouble with check_mysql and USER macros In-Reply-To: <72a19efd0511301823r135b0166ra98503096df8e153@mail.gmail.com> References: <72a19efd0511301823r135b0166ra98503096df8e153@mail.gmail.com> Message-ID: On Wed, 30 Nov 2005, Aleksey Tsalolikhin wrote: > command_line $USER1$/check_mysql -H $ARG1$ -d $ARG2$ -u > $ARG3$ -p $ARG4$ Why not use $USER4$ here? > check_command > check_mysql!192.168.160.2!mydbname!mydbusername!mydbpassword > check_command > check_mysql!192.168.160.2!mydbname!mydbusername!$USER4$ > > and setting $USER4$ in resource.cfg, I get error I think not all files will support variables the way you think. But this time you have passed a password named $USER4$ instead of the variable. Hugo. -- I hate duplicates. Just reply to the relevant mailinglist. hvdkooij at vanderkooij.org http://hvdkooij.xs4all.nl/ Don't meddle in the affairs of magicians, for they are subtle and quick to anger. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From charles at bueche.ch Thu Dec 1 09:08:50 2005 From: charles at bueche.ch (Charles Bueche) Date: Thu, 01 Dec 2005 09:08:50 +0100 Subject: wanted feature : event list Message-ID: <1133424530.12404.5.camel@bluez.bueche.ch> Hi, we are using nagios to monitor about 250 services on 25 hosts. So far so good, nagios is OK for the task. We are checking how to implement system monitoring, eg sending events in nagios for stuff that happens outside its service list. Mainly, stuff that are easier to monitor from outside like Sun disksuite, smartmontools alarms, and all these asynchronous events that can be generated from scripts and tools. Something I miss is a kind of even list, so I could see what events came. Any hint ? Charles -- Charles Bueche sand, snow, wave, wind and net -surfer ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mfatihakbulut at gmail.com Thu Dec 1 11:04:21 2005 From: mfatihakbulut at gmail.com (Mehmet Fatih AKBULUT) Date: Thu, 1 Dec 2005 12:04:21 +0200 Subject: trouble getting started: getting error 127 out of bounds on check_http In-Reply-To: References: <72a19efd0511301511p5b5851a3t2f3bd3e40f612096@mail.gmail.com> Message-ID: hi Hugo. root at localhost# ls check_breeze check_jabber check_rpc check_by_ssh "check_ldaps" check_sensors check_dhcp check_load check_simap check_dig check_log check_smtp check_disk check_mailq check_snmp check_disk_smb check_mrtg check_spop check_dns check_mrtgtraf check_ssh check_dummy check_mssql.sh check_ssmtp check_file_age check_nagios check_swap check_flexlm check_nntp check_tcp check_fping check_nntps check_time check_ftp check_nt check_udp check_game check_ntp check_udp2 check_hpjd check_nwstat check_ups check_http check_oracle check_users check_icmp check_overcr check_wave check_ifoperstatus check_ping negate check_ifstatus check_pop urlize check_imap check_procs utils.pm check_ircd check_real utils.sh root at localhost# check_ldaps seems to be there but look below : root at localhost# ./check_ldaps -help bash: ./check_ldaps: No such file or directory root at localhost# dont know why getting this error. -------------- next part -------------- An HTML attachment was scrubbed... URL: From fernando.shayani at bsb.politec.com.br Thu Dec 1 13:25:26 2005 From: fernando.shayani at bsb.politec.com.br (Fernando Shayani) Date: Thu, 1 Dec 2005 09:25:26 -0300 Subject: Nagios HANGS scheduling info Message-ID: <1964B929D8FF4D4DB06854A75D0FD3730C9AF027@bsbserv028.bsb.politec.com.br> Nagios STATD isn't running. I will enable Orphaned Service and check it out again. I will also upgrade to Beta 6 Thanks Fernando Shayani fernando.shayani at bsb.politec.com.br (61) 3038-6951 POLITEC - Bras?lia - DF _____ From: Eli Stair [mailto:estair at ilm.com] Sent: ter?a-feira, 29 de novembro de 2005 14:57 To: Fernando Shayani; nagios-users at lists.sourceforge.net Subject: RE: [Nagios-users] Nagios HANGS scheduling info Try enabling the orphaned service checks if you haven't already. /eli -----Original Message----- From: nagios-users-admin at lists.sourceforge.net on behalf of Fernando Shayani Sent: Tue 11/29/2005 6:17 AM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] Nagios HANGS scheduling info A lot of times (1 or 2 times a week!) Nagios stops checking the services. Then I go to SCHEDULING QUEUE and noticed that are tests scheduled to check but it didn't run. Tests that usually runs fine, but sometimes, different tests doesn't run. Then I have to RESTART nagios to fix it. Does anyone could help me? Thanks Fernando Shayani fernando.shayani at bsb.politec.com.br (61) 3038-6951 POLITEC -------------- next part -------------- An HTML attachment was scrubbed... URL: From prashant at mac-india.com Fri Dec 2 02:00:11 2005 From: prashant at mac-india.com (Prashant Nadgir) Date: Thu, 1 Dec 2005 17:00:11 -0800 Subject: help in config Message-ID: <00be01c5f6db$bf1a26c0$441010c1@prashant> hello, first time i am configuring nagios 2.-0b5 on redhat linux 9. When checking configuration, ./nagios -v nagios.cfg getting error "check-host-alive has already been defined" "could not register command ( config file name ../minimal.cfg) please help me thanks prashant -------------- next part -------------- An HTML attachment was scrubbed... URL: From hvdkooij at vanderkooij.org Thu Dec 1 13:23:14 2005 From: hvdkooij at vanderkooij.org (Hugo van der Kooij) Date: Thu, 1 Dec 2005 13:23:14 +0100 (CET) Subject: help in config In-Reply-To: <00be01c5f6db$bf1a26c0$441010c1@prashant> References: <00be01c5f6db$bf1a26c0$441010c1@prashant> Message-ID: On Thu, 1 Dec 2005, Prashant Nadgir wrote: > first time i am configuring nagios 2.-0b5 on redhat linux 9. > When checking configuration, ./nagios -v nagios.cfg getting error > "check-host-alive has already been defined" > "could not register command ( config file name ../minimal.cfg) Out of the box there are some duplicates. You need to check which ones you want to remove. But as you need to define the lot any way it is in fact a good safeguard to prevent uncontrolled Nagios boxes from popping up. Hugo. -- I hate duplicates. Just reply to the relevant mailinglist. hvdkooij at vanderkooij.org http://hvdkooij.xs4all.nl/ Don't meddle in the affairs of magicians, for they are subtle and quick to anger. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From hvdkooij at vanderkooij.org Thu Dec 1 13:17:47 2005 From: hvdkooij at vanderkooij.org (Hugo van der Kooij) Date: Thu, 1 Dec 2005 13:17:47 +0100 (CET) Subject: trouble getting started: getting error 127 out of bounds on check_http In-Reply-To: References: <72a19efd0511301511p5b5851a3t2f3bd3e40f612096@mail.gmail.com> Message-ID: On Thu, 1 Dec 2005, Mehmet Fatih AKBULUT wrote: > root at localhost# ./check_ldaps -help > bash: ./check_ldaps: No such file or directory > root at localhost# > > dont know why getting this error. No one knows either. Because we have no idea where you got it nor if all requirements are met on your system. We know absolutetly nothing about your setup unless you tell us explicitly. But I suggest you go and try to read this script yourself. At the moment we are spoonfeeding you and I will not bother again untill you show some initiative and find things out for yourself. If you can not fix minor things like this then nagios is not the tool you should install. Hire someone to do these things for you and build you a nice shiny system that you can just point and click at. Hugo. -- I hate duplicates. Just reply to the relevant mailinglist. hvdkooij at vanderkooij.org http://hvdkooij.xs4all.nl/ Don't meddle in the affairs of magicians, for they are subtle and quick to anger. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From richard.gliebe at fhv.at Thu Dec 1 13:44:40 2005 From: richard.gliebe at fhv.at (Richard Gliebe) Date: Thu, 01 Dec 2005 13:44:40 +0100 Subject: Foundry Switches Message-ID: <1133441080.871.25.camel@glr-nb.dh.uclv.net> Hi, I have to monitor the temperature from our Foundry FastIron 4802 Switches via check_snmp. I found a mib (.1.3.6.1.4.1.1991.1.1.1.1.18.0) but it's return the value "72", so I thinks that meens "Fahrenheit". Is there a mib for Celsius available? thanks Richard -- Richard Gliebe Fachhochschule Vorarlberg GmbH / University for Applied Science Information Services Hochschulstra?e 1, A-6850 Dornbirn Telefon +43 / (0)5572 / 792-2207 E-Mail: richard.gliebe at fhv.at ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mfatihakbulut at gmail.com Thu Dec 1 13:41:14 2005 From: mfatihakbulut at gmail.com (Mehmet Fatih AKBULUT) Date: Thu, 1 Dec 2005 14:41:14 +0200 Subject: trouble getting started: getting error 127 out of bounds on check_http In-Reply-To: References: <72a19efd0511301511p5b5851a3t2f3bd3e40f612096@mail.gmail.com> Message-ID: dont know why people are so angry. and dont know how many times i received that message : "hire someone to do it for you" so far. Hugo and others that post in the same way, i am telling this to you all : firstly u act like u'll help me, but then in the second or third message strangely most of you post such an upsetting mail. u dont have to answer my thread. yes i am in need of help cos i heard about nagios 2-3 weeks ago and i am not an expert on such programs. i am willing to learn but as 50cent says in his song : "progress is a slow process". therefore i will really need help sometimes when i come across with an interrupt. you may see my posts in mailing list and u want to help me. but please dont do that favor if you at the end will send such a reply you did to me. just delete the mail from inbox. cos such thing u wrote is not much pleasent. anyway. Regards. Bye -------------- next part -------------- An HTML attachment was scrubbed... URL: From Thomas.Zimmer at oppenheim.de Thu Dec 1 13:54:06 2005 From: Thomas.Zimmer at oppenheim.de (Thomas.Zimmer at oppenheim.de) Date: Thu, 1 Dec 2005 13:54:06 +0100 Subject: AW: trouble getting started: getting error 127 out of bounds on check_http Message-ID: <9A834A8DCEC68648B20701E1CDE10925F8491B@OKWPMXS01.de.sal-opp.net> don?t you understand? you are spamming this list with useless questions! many of them have been responded on the same list in the past, or are covered in the documentation of nagios or in the faq?s! No Greetz. Thomas Zimmer Produktservice & Betrieb Betrieb & Support Sal. Oppenheim jr. & Cie., Frankfurt a. Main Internet: http://www.oppenheim.de E-Mail: thomas.zimmer at oppenheim.de -----Urspr?ngliche Nachricht----- Von: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] Im Auftrag von Mehmet Fatih AKBULUT Gesendet: Donnerstag, 1. Dezember 2005 13:41 An: nagios-users at lists.sourceforge.net Cc: hvdkooij at vanderkooij.org Betreff: Re: [Nagios-users] trouble getting started: getting error 127 out of bounds on check_http dont know why people are so angry. and dont know how many times i received that message : "hire someone to do it for you" so far. Hugo and others that post in the same way, i am telling this to you all : firstly u act like u'll help me, but then in the second or third message strangely most of you post such an upsetting mail. u dont have to answer my thread. yes i am in need of help cos i heard about nagios 2-3 weeks ago and i am not an expert on such programs. i am willing to learn but as 50cent says in his song : "progress is a slow process". therefore i will really need help sometimes when i come across with an interrupt. you may see my posts in mailing list and u want to help me. but please dont do that favor if you at the end will send such a reply you did to me. just delete the mail from inbox. cos such thing u wrote is not much pleasent. anyway. Regards. Bye -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: Zimmer, Thomas (Produktservices und Betrieb).vcf Type: application/octet-stream Size: 254 bytes Desc: not available URL: From barbereau at gmail.com Thu Dec 1 16:02:07 2005 From: barbereau at gmail.com (=?ISO-8859-1?Q?S=E9bastien_Barbereau?=) Date: Thu, 1 Dec 2005 16:02:07 +0100 Subject: Foundry Switches In-Reply-To: <1133441080.871.25.camel@glr-nb.dh.uclv.net> References: <1133441080.871.25.camel@glr-nb.dh.uclv.net> Message-ID: <4e0e33ee0512010702h4216d81eu7f9890669be545e0@mail.gmail.com> >From the foundry mib file: snChasActualTemperature OBJECT-TYPE SYNTAX INTEGER (-110..250) ACCESS read-only STATUS mandatory DESCRIPTION "Temperature of the chassis. Each unit is 0.5 degrees Celcius. Only management module built with temperature sensor hardware is applicable. For those non-applicable management module, it returns no-such-name." ::= { snChasGen 18 } So should be 72 is 36?C On 12/1/05, Richard Gliebe wrote: > > Hi, > > I have to monitor the temperature from our Foundry FastIron 4802 > Switches via check_snmp. > > I found a mib (.1.3.6.1.4.1.1991.1.1.1.1.18.0) but it's return the value > "72", so I thinks that meens "Fahrenheit". > > Is there a mib for Celsius available? > > thanks > Richard > > -- > Richard Gliebe > Fachhochschule Vorarlberg GmbH / University for Applied Science > Information Services > > Hochschulstra?e 1, A-6850 Dornbirn > Telefon +43 / (0)5572 / 792-2207 > E-Mail: richard.gliebe at fhv.at > > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log > files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_idv37&alloc_id865&opclick > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: From marc at ena.com Thu Dec 1 16:20:16 2005 From: marc at ena.com (Marc Powell) Date: Thu, 1 Dec 2005 09:20:16 -0600 Subject: trouble getting started: getting error 127 out of bounds on check_http Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Aleksey Tsalolikhin > Sent: Wednesday, November 30, 2005 6:49 PM > To: nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] Re: trouble getting started: getting error 127 > out of bounds on check_http > > On 11/30/05, Steve Shipway wrote: > > > > Check to make sure that the resource.cfg file location is correctly > defined > > in your nagios.cfg. If nagios can't find it, then it will not be > processed. > > The part shown above seems to be the correct syntax. > > Sure enough, I had resource.cfg commented out in nagios.cfg. Mystery > explained. Thanks, Steve! Thank _you_ for the very detailed problem description. You covered almost all possible ways that issue could have come about. -- Marc ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Thu Dec 1 16:30:03 2005 From: marc at ena.com (Marc Powell) Date: Thu, 1 Dec 2005 09:30:03 -0600 Subject: trouble getting started: getting error 127 out of bounds on check_http Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Mehmet Fatih AKBULUT > Sent: Thursday, December 01, 2005 4:04 AM > To: nagios-users at lists.sourceforge.net > Cc: hvdkooij at vanderkooij.org > Subject: Re: [Nagios-users] trouble getting started: getting error 127 out > of bounds on check_http > > hi Hugo. > > root at localhost# ls > check_breeze check_jabber check_rpc > check_by_ssh "check_ldaps" check_sensors [ls contents removed] > > check_ldaps seems to be there but look below : > > root at localhost# ./check_ldaps -help > bash: ./check_ldaps: No such file or directory > root at localhost# > > dont know why getting this error. In the ls output above, the program is "check_ldaps" (with quotes!). Did you add those while writing the e-mail or is the file *really* named *"check_ldaps"*? If it is, it shouldn't be and you need to rename it. That would be user error that no-one would think to ask about since it's very basic. -- Marc ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From andrea.gabellini at telecomitalia.sm Thu Dec 1 16:43:19 2005 From: andrea.gabellini at telecomitalia.sm (Andrea Gabellini) Date: Thu, 01 Dec 2005 16:43:19 +0100 Subject: statuswml Message-ID: <438F1A17.60707@telecomitalia.sm> Hi, I'm using nagios tray, that use the statuswml cgi to get status information. This cgi doesn't report problems on services which are passive check. The normal status cgi instead report this problem. I don't know if this is a bug or a feature. I wrote this little patch that report passive check problems also. Andrea --- cgi/statuswml.c.orig 2005-08-28 16:01:22.417798000 +0200 +++ cgi/statuswml.c 2005-08-28 16:27:02.270983000 +0200 @@ -1397,8 +1397,8 @@ if(display_type==DISPLAY_UNHANDLED_PROBLEMS){ if(temp_hoststatus->problem_has_been_acknowledged==TRUE) continue; - if(temp_hoststatus->checks_enabled==FALSE) - continue; +// if(temp_hoststatus->checks_enabled==FALSE) +// continue; if(temp_hoststatus->notifications_enabled==FALSE) continue; if(temp_hoststatus->scheduled_downtime_depth>0) @@ -1446,8 +1446,8 @@ if(display_type==DISPLAY_UNHANDLED_PROBLEMS){ if(temp_servicestatus->problem_has_been_acknowledged==TRUE) continue; - if(temp_servicestatus->checks_enabled==FALSE) - continue; +// if(temp_servicestatus->checks_enabled==FALSE) +// continue; if(temp_servicestatus->notifications_enabled==FALSE) continue; if(temp_servicestatus->scheduled_downtime_depth>0) -- --------------------------------------- If you and your partner always agree, one of you is unnecessary. --------------------------------------- Ing. Andrea Gabellini Email: andrea.gabellini at telecomitalia.sm Tel: 0549 886111 (Italy) Tel. +378 0549 886111 (International) Telecom Italia San Marino S.p.A. Strada degli Angariari, 3 47891 Rovereta Repubblic of San Marino http://www.omniway.sm http://www.telecomitalia.sm ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at nagios.org Thu Dec 1 17:32:52 2005 From: nagios at nagios.org (Ethan Galstad) Date: Thu, 01 Dec 2005 10:32:52 -0600 Subject: statuswml In-Reply-To: <438F1A17.60707@telecomitalia.sm> References: <438F1A17.60707@telecomitalia.sm> Message-ID: <438ED154.30244.80D090@nagios.nagios.org> Thanks! Patch will be in CVS shortly. On 1 Dec 2005 at 16:43, Andrea Gabellini wrote: > Hi, > > I'm using nagios tray, that use the statuswml cgi to get status information. This cgi doesn't report > problems on services which are passive check. The normal status cgi instead report this problem. I > don't know if this is a bug or a feature. I wrote this little patch that report passive check > problems also. > > Andrea > > > > > --- cgi/statuswml.c.orig 2005-08-28 16:01:22.417798000 +0200 > +++ cgi/statuswml.c 2005-08-28 16:27:02.270983000 +0200 > @@ -1397,8 +1397,8 @@ > if(display_type==DISPLAY_UNHANDLED_PROBLEMS){ > if(temp_hoststatus->problem_has_been_acknowledged==TRUE) > continue; > - if(temp_hoststatus->checks_enabled==FALSE) > - continue; > +// if(temp_hoststatus->checks_enabled==FALSE) > +// continue; > if(temp_hoststatus->notifications_enabled==FALSE) > continue; > if(temp_hoststatus->scheduled_downtime_depth>0) > @@ -1446,8 +1446,8 @@ > if(display_type==DISPLAY_UNHANDLED_PROBLEMS){ > if(temp_servicestatus->problem_has_been_acknowledged==TRUE) > continue; > - if(temp_servicestatus->checks_enabled==FALSE) > - continue; > +// if(temp_servicestatus->checks_enabled==FALSE) > +// continue; > if(temp_servicestatus->notifications_enabled==FALSE) > continue; > if(temp_servicestatus->scheduled_downtime_depth>0) > > -- > > --------------------------------------- > If you and your partner always agree, one of you is unnecessary. > --------------------------------------- > Ing. Andrea Gabellini > Email: andrea.gabellini at telecomitalia.sm > Tel: 0549 886111 (Italy) > Tel. +378 0549 886111 (International) > > Telecom Italia San Marino S.p.A. > Strada degli Angariari, 3 > 47891 Rovereta > Repubblic of San Marino > > http://www.omniway.sm http://www.telecomitalia.sm > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > Ethan Galstad, Nagios Developer --- Email: nagios at nagios.org Website: http://www.nagios.org ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Thu Dec 1 17:30:42 2005 From: marc at ena.com (Marc Powell) Date: Thu, 1 Dec 2005 10:30:42 -0600 Subject: return code of 127 is out of bounds > check_ldaps error. Message-ID: Please always reply on-list unless specifically asked otherwise. That's normal etiquette for any mailing list and especially important for a support list. > -----Original Message----- > From: Mehmet Fatih AKBULUT [mailto:mfatihakbulut at gmail.com] > Sent: Thursday, December 01, 2005 9:45 AM > To: Marc Powell > Subject: return code of 127 is out of bounds > check_ldaps error. > > hi. > > root at localhost# pwd > /usr/local/nagios/libexec > root at localhost# ls > check_breeze check_fping check_jabber > check_nntps check_rpc check_time > check_by_ssh check_ftp check_ldaps [ls output chopped] > root at localhost# ./check_ldaps -help > bash: ./check_ldaps: No such file or directory > root at localhost# > > > yes i added /* "" */ to remark that check_ldaps plugin seems to be there. > but actually its just a symbolic link i think. Yes, it should be a symlink to check_ldap which you don't seem to have. >From the plugin REQUIREMENTS file -- "check_ldap: - Requires the LDAP libraries available from http://www.openldap.org Lib: libldap, liblber Redhat Source: openldap-1.2.9-6.i386.rpm, openldap-devel-1.2.9-6.i386.rpm" Does your system meet the pre-requisite to have that plugin compile? Did ./configure complain or say anything about LDAP? Was it compiled (i.e. do you see it compiled in the nagios-plugins-[version]/plugins directory? If you see check_ldap (not check_ldap.c or check_ldap.o) in that plugins directory, try running it from there. If it works, manually copy it to your libexec directory. During ./configure you should be able to find lines like the following -- checking for main in -lldap... yes checking for ldap_set_option... yes checking for ldap_init... yes checking for ldap_set_option... (cached) yes checking for ldap_get_option... yes checking for ldap_start_tls_s... yes You _probably_ see something like the following right now -- WARNING: Skipping LDAP plugin WARNING: install LDAP libs to compile this plugin (see REQUIREMENTS). > although i reinstall and install nagios-plugins for couple of times i > still dont have a real check_ldap plugin. > always i get a symbolic link which doesnt work for me. > cos each time it gives the same error. In a case like this, basic troubleshooting should have been to see where the symlink pointed to and investigate why _that_ file, not check_ldaps, didn't exist. > and the second problem is this : (which is i think the most weird one:p) > check this : > > root at localhost# pwd > /root > root at localhost# ls [/root directory listing removed] > root at localhost# > > as you see theres no folder named nagios-plugin !!! > > now look at this! : > > root at localhost# locate check_ldap > /root/nagios-plugins-1.4.1/plugins/check_ldap.c > /usr/local/nagios/libexec/check_ldaps > root at localhost# > > this freaks me out! when i search for check_ldap, there seems to be a > folder named nagios-plugin under root folder which is proved not to exists > there as u see above. ??? > > same for ls -al ... theres no nagios-plugin folder :( > nagios really makes me feel sick! > dont know what to do ... This is certainly not a nagios issue and there's no way it could be. Your OS hasn't updated its [s]locate database. Most OS's have a cron job that updates it daily. Try running 'updatedb' to force a refresh of the database. As far as nagios making you feel sick, so far the only real nagios related issue you've shown has been an apparent lack of meeting spelled out requirements. Admittedly though, the Makefile probably shouldn't create the check_ldaps symlink if check_ldap can't be compiled (if it doesn't already have that logic). Other than that it's a lack of your familiarization with your chosen OS, something that we assume is a given as a technical community (rightly so). > (ps: do u use check_ldaps ? is there a real check_ldap [not a symbolic > one] plugin in your libexec folder ?) I do not. Yes there is since I met the pre-requisites. -- Marc ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From eht-dummy at gmx.net Thu Dec 1 17:47:44 2005 From: eht-dummy at gmx.net (Alexander Rudolf) Date: Thu, 1 Dec 2005 17:47:44 +0100 (MET) Subject: (no subject) Message-ID: <27078.1133455664@www15.gmx.net> Hello! I?m using NSClient on my windows server to get some perfomance counter data. As long as I ask standard counters all is ok. But I have to ask some of the SQL Server Counters (german version) and this dosnt work. I expect the problem is with the ":" character. I dont get an error but a "0" value. [plugins]#./check_nt -H 192.168.xxx.yyy -p 12345 -v COUNTER -l "\SQLServer:Allgemeine Statistik\Benutzerverbindungen","%.f" 0 | %.f=0,000000%;0,000000;0,000000; this works: [plugins]#./check_nt -H 192.168.xxx.yyy -p 12345 -v COUNTER -l "\Prozessor(_Total)\Prozessorzeit (%)","%.f %%" 59 % | '%.f %%'=59,000000%;0,000000;0,000000; Versions: nagios-plugins-1.4.1-1.1.fc3.rf NSClient++-0.2.4g.zip Thanks in advance! Alex -- Telefonieren Sie schon oder sparen Sie noch? NEU: GMX Phone_Flat http://www.gmx.net/de/go/telefonie ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From andrea.gabellini at telecomitalia.sm Thu Dec 1 16:37:39 2005 From: andrea.gabellini at telecomitalia.sm (Andrea Gabellini) Date: Thu, 01 Dec 2005 16:37:39 +0100 Subject: Patch for daemon-init Message-ID: <438F18C3.8040006@telecomitalia.sm> Hi, I've some problems starting nagios with daemon-init. The problem is the su command on it. Looking in the man, I think the correct sintax is the one in the following patch. With this change I have no problem. Andrea --- daemon-init.orig 2005-12-01 16:31:17.629258000 +0100 +++ daemon-init 2005-12-01 16:31:56.815603000 +0100 @@ -114,7 +114,7 @@ echo "Starting network monitor: nagios" $NagiosBin -v $NagiosCfgFile > /dev/null 2>&1; if [ $? -eq 0 ]; then - su - $NagiosUser -c "touch $NagiosVarDir/nagios.log $NagiosRetentionFile" + su -c "touch $NagiosVarDir/nagios.log $NagiosRetentionFile" $NagiosUser rm -f $NagiosCommandFile touch $NagiosRunFile chown $NagiosUser:$NagiosGroup $NagiosRunFile -- --------------------------------------- Have a nice day and so have I. --------------------------------------- Ing. Andrea Gabellini Email: andrea.gabellini at telecomitalia.sm Tel: 0549 886111 (Italy) Tel. +378 0549 886111 (International) Telecom Italia San Marino S.p.A. Strada degli Angariari, 3 47891 Rovereta Repubblic of San Marino http://www.omniway.sm http://www.telecomitalia.sm ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From drussell at mpifix.com Thu Dec 1 19:59:57 2005 From: drussell at mpifix.com (Danny Russell) Date: Thu, 1 Dec 2005 11:59:57 -0700 Subject: Nagios 2.04b Not checking services in a timely manner Message-ID: <028B223BB24E5443A9784BBE1CC5EBE06F5C49@Lisa.mpifix.com> My Nagios server is not checking services as fast as I have set them to in the config. For example, I have a service that when it is in a non-okay status I want the check to occur every minute until it recovers (the retry check interval). So I set the service to look like this: define service { name Midnight Installer Status use --template ARGIS Customer Service check_command check_nrpe_mi_status retry_check_interval 1 notification_period 24x7 register 0 notification_options u,c,r stalking_options o,w,u,c contact_groups TechOpsSupport } define service { name --template ARGIS Customer Service max_check_attempts 5 normal_check_interval 10 retry_check_interval 3 active_checks_enabled 1 check_period 24x7 process_perf_data 1 retain_status_information 1 retain_nonstatus_information 1 notification_interval 30 notification_period ShopHours notifications_enabled 1 register 0 notification_options u,c,r servicegroups ARGIS Customer Services contact_groups TechOpsSupport } Why would the checks not be happening on the schedule? Here is the number of checks I am doing: # Active Host / Service Checks: 73 / 831 Thanks! -------------- next part -------------- An HTML attachment was scrubbed... URL: From marc at ena.com Thu Dec 1 18:18:42 2005 From: marc at ena.com (Marc Powell) Date: Thu, 1 Dec 2005 11:18:42 -0600 Subject: trouble getting started: getting error 127 out of bounds on check_http Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Mehmet Fatih AKBULUT > Sent: Wednesday, November 30, 2005 5:31 PM > To: Aleksey Tsalolikhin > Cc: nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] trouble getting started: getting error 127 out > of bounds on check_http > > hi Aleykey. > i myself have the same problem for check_ldaps. > it says : return code of 127 is out of bounds. plugin may be missing :p > and looking for help for 2 days in mailing lists. > and received just RTFM (read the fuckin manual) so far. > i think people in here dont like to help others much :) As a general statement, the amount of help received is commensurate with the amount of work put into solving the problem before coming here _and_ the amount of detail about the actual problem that is provided. Remember that of the 2000+ users on this list, all but a tiny handful are Nagios users such as yourself. While many of us have a true desire to help, we generally have little patience for questions that lack information and therefore require a lot of back-and-forth e-mails, questions where it's pretty clear that much hasn't been done to try to solve the problem outside the list or questions that have been asked time and again. It implies that your time is more valuable than ours and I guarantee that it's not. I'm not singling you out by any means. If you haven't read it before, the document at http://www.catb.org/~esr/faqs/smart-questions.html, though old, will give you a better understanding of what the general expectations are. You may notice that I almost always answer questions in one of two ways -- 1) a brief explanation and link to the relevant documentation or 2) by asking more questions that if answered, should lead to the source of the problem and it's resolution. I do this with the hope that the OP will learn from the experience and can take away more than just an answer but rather a process to figure out answers to other questions on their own. That's a good thing because it's more likely they'll bring fewer questions here and become more self-sufficient. -- Marc ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jwhardeman at gmail.com Fri Dec 2 01:18:55 2005 From: jwhardeman at gmail.com (Joseph Hardeman) Date: Thu, 1 Dec 2005 19:18:55 -0500 Subject: Limiting Views of Hosts Message-ID: Hi Everyone, I am wondering if there is a way to allow people who login to only view a specific set of hosts. This way I can have one Nagios system checking multiple clients but the clients can only see thier hosts and thier host results. I am running the newest version 2.0b6. Thanks in advance. Joseph -------------- next part -------------- An HTML attachment was scrubbed... URL: From steve.wray at cwa.co.nz Fri Dec 2 01:17:41 2005 From: steve.wray at cwa.co.nz (Steve Wray) Date: Fri, 02 Dec 2005 13:17:41 +1300 Subject: Limiting Views of Hosts In-Reply-To: References: Message-ID: <438F92A5.20406@cwa.co.nz> Joseph Hardeman wrote: > Hi Everyone, > > I am wondering if there is a way to allow people who login to only view > a specific set of hosts. This way I can have one Nagios system checking > multiple clients but the clients can only see thier hosts and thier host > results. I am running the newest version 2.0b6. IIRC... If you have a login for them in the htpasswd file and create contact entries for them, but don't list them in the cgi.cfg, then they should be able to view only the servers for which you've listed them as contacts. -- There is nothing more important to good government than good education. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From aleksey.tsalolikhin at gmail.com Fri Dec 2 01:22:43 2005 From: aleksey.tsalolikhin at gmail.com (Aleksey Tsalolikhin) Date: Thu, 1 Dec 2005 16:22:43 -0800 Subject: trouble with check_mysql and USER macros In-Reply-To: References: <72a19efd0511301823r135b0166ra98503096df8e153@mail.gmail.com> Message-ID: <72a19efd0512011622r16fd2811t1a15a314c0b4d50b@mail.gmail.com> On 11/30/05, Hugo van der Kooij wrote: > On Wed, 30 Nov 2005, Aleksey Tsalolikhin wrote: > > > [In check_mysql definition in checkcommands.cfg:] > > command_line $USER1$/check_mysql -H $ARG1$ -d $ARG2$ -u > > $ARG3$ -p $ARG4$ > > Why not use $USER4$ here? My consideration was that then I'd only be able to check one database, the one that password fits. But I only HAVE one database right now, so this works; thank you. > I think not all files will support variables the way you think. But this > time you have passed a password named $USER4$ instead of the variable. Yup. I am not going to ask why $USER4$ works in checkcommands.cfg but not in services.cfg. :) Looking at http://nagios.sourceforge.net/docs/1_0/macros.html it seems like the $USERn$ macros should work across the board, but I guess it refers to Nagios internal processes rather than configuration files. Thanks! Now on to add more modules... Happily, -at ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From todd_barbera at wgbh.org Fri Dec 2 01:25:41 2005 From: todd_barbera at wgbh.org (Todd Barbera) Date: Thu, 01 Dec 2005 19:25:41 -0500 Subject: Limiting Views of Hosts References: Message-ID: <001701c5f6d6$edeb9240$0d7810ac@wgbh.org> Yes. Set them up as a contact and only include them on the hosts you want them to see. Here's an example of one of my host configuration files: # 'acsdb' host definition define host{ use sunservers host_name acsdb alias acsdb address 192.168.40.40 contact_groups sys-admins,localonline } 'sys-admins' is setup for everything. 'localonline' only gets a subset of hosts that those folks are concerned with. Todd ----- Original Message ----- From: Joseph Hardeman To: nagios-users at lists.sourceforge.net Sent: Thursday, December 01, 2005 7:18 PM Subject: [Nagios-users] Limiting Views of Hosts Hi Everyone, I am wondering if there is a way to allow people who login to only view a specific set of hosts. This way I can have one Nagios system checking multiple clients but the clients can only see thier hosts and thier host results. I am running the newest version 2.0b6. Thanks in advance. Joseph -------------- next part -------------- An HTML attachment was scrubbed... URL: From Jesse.Caldwell at Colorado.EDU Thu Dec 1 23:01:42 2005 From: Jesse.Caldwell at Colorado.EDU (Jesse Caldwell) Date: Thu, 01 Dec 2005 15:01:42 -0700 Subject: changes from 2.0b4 to b5/b6 Message-ID: <200512012201.jB1M1gkk089015@matrix.Colorado.EDU> Hello, I've noticed a couple of changes from 2.0b4 and was wondering if anyone else has seen the same. The first is that in b5/b6, I lost the "View Trends For This Service" and "View Alert Histogram For This Service" links in extinfo.cgi. I don't know what changed between b4 and b5 to bring this about, since I can't find where USE_TRENDS and USE_HISTOGRAM are defined any differently than in b4. At any rate, defining USE_TRENDS and USE_HISTOGRAM in include/cgiutils.h brings back the extinfo.cgi that I'm used to. The second issue is that b5 and b6 take a long time to start service checks after startup, usually about 1.5 min, where b4 would start service checks almost immediately. This isn't causing any problems, just wondering what the heck it's doing. The only issue I had with 2.0b4 was that the parent nagios process would consume cpu resources once a week or so. Nagios would continue to function normally, albeit at > 90% cpu utilization. Killing nagios with KILL and restarting was the only way to get it to behave. I (obviously) haven't run b6 long enough to know if this is still an issue. We're running FreeBSD 5.4-RELEASE-p1, and nagios is configured with: ./configure --localstatedir=/foghorn/data/nagios --prefix=/usr/local/nagios --with-cgiurl=/nagios/cgi-bin --with-default-comments --with-default-downtime --with-default-perfdata --with-default-retention --with-default-status --with-htmurl=/nagios --with-init-dir=/etc/rc.d --with-mail=/usr/bin/mail --with-nagios-grp=nagios --with-nagios-user=nagios --with-template-extinfo --with-template-objects --with-gd-lib=/usr/local/gd/lib --with-gd-inc=/usr/local/gd/include The configuration for nagios has remained static while trying all three betas. Regards, Jesse ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at leangen.net Fri Dec 2 01:54:15 2005 From: nagios at leangen.net (David Leangen) Date: Fri, 02 Dec 2005 09:54:15 +0900 Subject: Dealing with unwanted questions (was: trouble getting started...) In-Reply-To: References: Message-ID: <1133484855.25001.19.camel@sonoda.bioscene.co.jp> Thought I'd add my 2 cents... This seems to be a recurring problem. > As a general statement, the amount of help received is commensurate > with the amount of work put into solving the problem before coming > here _and_ the amount of detail about the actual problem that is > provided. Excellent reply, Marc. I've been on quite a few lists in various OS communities, and this one, although very helpful, can be quite hostile at times. I certainly understand the frustration of those wanting to help, but who are overloaded by simple questions that have already been answered many times before. On the other hand, though, I also understand others who have trouble with basic things. Is there not some way to chide people a little more gently? For instance, perhaps a new "newbie" list could be created, so people who are particularly annoyed with receiving newbie mails could simply avoid that list? Or maybe we could have some kind of standard statement that we copy- paste that points to the guidelines for posting to the list. No matter how much we want the problem not to exist, there will always be people who post questions that others may be frustrated with. Quite frankly, some of the replies, although understandable, have shifted a bit to the nasty side. I just think that there must be a better way of dealing with the problem other than replying with a nasty tone. Anyway, just a few ideas... ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From todd_barbera at wgbh.org Fri Dec 2 02:21:26 2005 From: todd_barbera at wgbh.org (Todd Barbera) Date: Thu, 01 Dec 2005 20:21:26 -0500 Subject: Dealing with unwanted questions (was: trouble getting started...) References: <1133484855.25001.19.camel@sonoda.bioscene.co.jp> Message-ID: <004c01c5f6de$b7bcbf20$0d7810ac@wgbh.org> I think this section of the documentation says it all: Advice for Beginners ---------------------------------------------------------------------------- ---- Congrats on choosing to try Nagios! Nagios is quite powerful and flexible, but unfortunately its not very friendly to newbies. Why? Because it takes a lot of work to get it installed and configured properly. That being said, if you stick with it and manage to get it up and running, you'll never want to be without it. :-) Here are some very important things to keep in mind for those of you who are first-time users of Nagios: Relax - its going to take some time. Don't expect to be able to compile Nagios and start it up right off the bat. Its not that easy. In fact, its pretty difficult. If you don't want to spend time learning how things work and getting things running smoothly, don't bother using this software. Instead, pay someone to monitor your network for you or hire someone to install Nagios for you. :-) Read the documentation. Nagios is difficult enough to configure when you've got a good grasp of what's going on, and nearly impossible if you don't. Do yourself a favor and read before blindly attempting to install and run Nagios. If you're the type who doesn't want to take the time to read the documentation, you'll probably find that others won't find the time to help you out when you have problems. RTFM. Use the sample config files. Sample configuration files are provided with Nagios. Look at them, modify them for your particular setup and test them! The sample files are just that - samples. There's a very good chance that they won't work for you without modifications. Sample config files can be found in the sample-config/ subdirectory of the Nagios distribution. Seek the help of others. If you've read the documentation, reviewed the sample config files, and are still having problems, try sending a descriptive email message describing your problems to the nagios-users mailing list. Due to the amount of work that I have to do for this project, I am unable to answer most of the questions that get sent directly to me, so your best source of help is going to be the mailing list. If you've done some background reading and you provide a good problem description, odds are that someone will give you some pointers on getting things working properly. ---------------- If you're not willing to at least do some up front reading, i.e., setup my system for me, then you're truly out of your league in dealing with this software or any other software. Get a COTS solution and have a consultant set it up for you and make sure you have a support contract so that someone can tell you what to do when things go wrong. A certain amount of responsibility falls upon the individual who wishes to embark on setting up a system. They need to accept that. JMTC, Todd ----- Original Message ----- From: "David Leangen" To: Sent: Thursday, December 01, 2005 7:54 PM Subject: [Nagios-users] Dealing with unwanted questions (was: trouble getting started...) > > Thought I'd add my 2 cents... This seems to be a recurring problem. > > > As a general statement, the amount of help received is commensurate > > with the amount of work put into solving the problem before coming > > here _and_ the amount of detail about the actual problem that is > > provided. > > Excellent reply, Marc. > > I've been on quite a few lists in various OS communities, and this one, > although very helpful, can be quite hostile at times. > > I certainly understand the frustration of those wanting to help, but who > are overloaded by simple questions that have already been answered many > times before. > > On the other hand, though, I also understand others who have trouble > with basic things. > > > Is there not some way to chide people a little more gently? > > > For instance, perhaps a new "newbie" list could be created, so people > who are particularly annoyed with receiving newbie mails could simply > avoid that list? > > Or maybe we could have some kind of standard statement that we copy- > paste that points to the guidelines for posting to the list. > > No matter how much we want the problem not to exist, there will always > be people who post questions that others may be frustrated with. Quite > frankly, some of the replies, although understandable, have shifted a > bit to the nasty side. I just think that there must be a better way of > dealing with the problem other than replying with a nasty tone. > > > Anyway, just a few ideas... > > > > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From al at its-lehmann.de Thu Dec 1 23:25:42 2005 From: al at its-lehmann.de (Arno Lehmann) Date: Thu, 01 Dec 2005 23:25:42 +0100 Subject: trouble getting started: getting error 127 out of bounds on check_http In-Reply-To: References: <72a19efd0511301511p5b5851a3t2f3bd3e40f612096@mail.gmail.com> Message-ID: <438F7866.5050601@its-lehmann.de> Hello, well, this is the time for me to write a few sentences, too. On 01.12.2005 13:41, Mehmet Fatih AKBULUT wrote: > dont know why people are so angry. We are not angry. Basically, many people here, including myself, simply don't don't answer to questions that could either be avoided by actually reading the manual or trying to understand the problem. I'm simply tired of reading "Nagios doesn't work and I get an output out of bounds message". Whoever wants to operate a system like Nagios really needs the ability to understand these messages and to find the reasons for it. And, while many people on the lists are very helpful, no-one will give an introduction to unix(like) OSes and Nagios via email again and again. > and dont know how many times i received that message : "hire someone to > do it for you" so far. Because that seems to be the most useful advice people can give you, assuming you want a working Nagios installation. > Hugo and others that post in the same way, i am telling this to you all : > firstly u act like u'll help me, but then in the second or third message > strangely most of you post such an upsetting mail. I see things differently: First people point out that you have to do lots of work yourself, and also supply more useful information, but if after the second or third mail you don't do, most people don't want to spend their time with someone who doesn't show the will to learn and work himself (This is not especially directed at you, there have been many cases before and others will come after us...). > u dont have to answer my thread. yes i am in need of help cos i heard > about nagios 2-3 weeks ago and i am not an expert on such programs. Right, but becoming not even an expert, only a succesful user, you will have to lear quite a lot. Nobody can make that part easier for you. > i am > willing to learn but as 50cent says in his song : "progress is a slow > process". therefore i will really need help sometimes when i come across > with an interrupt. Then, try to make as much progess as possible yourself. > you may see my posts in mailing list and u want to help me. > but please dont do that favor if you at the end will send such a reply > you did to me. just delete the mail from inbox. > cos such thing u wrote is not much pleasent. > anyway. In other words, you only want answers like "do this, do that, press here, turn that screw, and everything's fine"?. Sorry, but that's not how things work. Considering check_ldaps: You obviously didn't read the release notes that come with the plugins. You obviously didn't try the plugin from the shell. You didn't tell us which check_ldaps you use, and you didn't give any information showing that you actually checked your installation yourself - paths, file rights, command output, and makefile output, for example. hese hints should help you a long way towards understanding why check_ldaps doesn't work for you, by the way. Have a nice day, Arno > Regards. > Bye -- IT-Service Lehmann al at its-lehmann.de Arno Lehmann http://www.its-lehmann.de ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From VINAY_SHARMA at advanex.co.jp Fri Dec 2 02:58:40 2005 From: VINAY_SHARMA at advanex.co.jp (VINAY_SHARMA at advanex.co.jp) Date: Fri, 2 Dec 2005 10:58:40 +0900 Subject: help in config Message-ID: Just remove duplicate entry for check host alive and see other defined command.its just a matter of setting nagios correctaly.and you can define the other commands as you line for nagios. Thanks & regards ************************************** Vinay Sharma Information Systems and Solutions-Associates Advanex Inc (www.advanex.co.jp) Fon : 813-3822-5863 Fax : 813-5815-7881 Email : vinay_sharma at advanex.co.jp "Prashant Nadgir" ??: "nagios-users" ???: cc: nagios-users-admin at lists.sour ??: [Nagios-users] help in config ceforge.net 2005/12/02 10:00 hello, first time i am configuring nagios 2.-0b5 on redhat linux 9. When checking configuration, ./nagios -v nagios.cfg getting error "check-host-alive has already been defined" "could not register command ( config file name ../minimal.cfg) please help me thanks prashant ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From hvdkooij at vanderkooij.org Fri Dec 2 07:05:27 2005 From: hvdkooij at vanderkooij.org (Hugo van der Kooij) Date: Fri, 2 Dec 2005 07:05:27 +0100 (CET) Subject: changes from 2.0b4 to b5/b6 In-Reply-To: <200512012201.jB1M1gkk089015@matrix.Colorado.EDU> References: <200512012201.jB1M1gkk089015@matrix.Colorado.EDU> Message-ID: On Thu, 1 Dec 2005, Jesse Caldwell wrote: > The second issue is that b5 and b6 take a long time to start service > checks after startup, usually about 1.5 min, where b4 would start > service checks almost immediately. This isn't causing any problems, > just wondering what the heck it's doing. I think you just got lucky. As I run 2.0b4 for a while but most times it may take several minutes before the first service check is performed. Hugo. -- I hate duplicates. Just reply to the relevant mailinglist. hvdkooij at vanderkooij.org http://hvdkooij.xs4all.nl/ Don't meddle in the affairs of magicians, for they are subtle and quick to anger. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jun.guo at tangerinetech.com Fri Dec 2 10:54:37 2005 From: jun.guo at tangerinetech.com (Jun Guo) Date: Fri, 2 Dec 2005 17:54:37 +0800 Subject: search plugin for bayonne Message-ID: <2AFB08AC9E50C9419CD4F9704890B59506BB8B@domain01.tangerinetech.com> Is there any plugin for bayonne? It's usefull for me. Thanks. -------------- next part -------------- An HTML attachment was scrubbed... URL: From richard.gliebe at fhv.at Fri Dec 2 08:04:20 2005 From: richard.gliebe at fhv.at (Richard Gliebe) Date: Fri, 02 Dec 2005 08:04:20 +0100 Subject: Foundry Switches In-Reply-To: <4e0e33ee0512010702h4216d81eu7f9890669be545e0@mail.gmail.com> References: <1133441080.871.25.camel@glr-nb.dh.uclv.net> <4e0e33ee0512010702h4216d81eu7f9890669be545e0@mail.gmail.com> Message-ID: <1133507061.894.13.camel@glr-nb.dh.uclv.net> On Thu, 2005-12-01 at 16:02 +0100, S?bastien Barbereau wrote: Hi there: > From the foundry mib file: > snChasActualTemperature OBJECT-TYPE > SYNTAX INTEGER (-110..250) > ACCESS read-only > STATUS mandatory > DESCRIPTION > "Temperature of the chassis. Each unit is 0.5 degrees Celcius. Only > management module built with temperature sensor hardware is > applicable. For those non-applicable management module, it returns > no-such-name." > ::= { snChasGen 18 } > > So should be 72 is 36?C I've read this already on http://www.mibdepot.com, but I'm looking for a mib, which is showing me the *exact* "?C". Richard ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mail at razor.dk Fri Dec 2 15:32:58 2005 From: mail at razor.dk (Christian Joergensen) Date: Fri, 02 Dec 2005 15:32:58 +0100 Subject: Nagios Users drinking game? In-Reply-To: References: Message-ID: <1133533978.1254.9.camel@localhost> On fre, 2005-12-02 at 08:12 -0600, Marion Keith wrote: > I think there should be a Nagios Users drinking game similiar to the > Snort-Users drinking game. Everyone should be required to read the rules > before unpacking Nagios and grieving. I suspect I caused most of the rules > until I RTFM and used my thinker. The Advice for Beginners scared me > enough. Could you elaborate? -- Christian Joergensen | void factorize_primes_in_linear_time () { http://www.razor.dk/ | // todo: implement function | } -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 189 bytes Desc: This is a digitally signed message part URL: From MKeith at ragansmith.com Fri Dec 2 15:12:30 2005 From: MKeith at ragansmith.com (Marion Keith) Date: Fri, 2 Dec 2005 08:12:30 -0600 Subject: Nagios Users drinking game? In-Reply-To: <1133484855.25001.19.camel@sonoda.bioscene.co.jp> References: <1133484855.25001.19.camel@sonoda.bioscene.co.jp> Message-ID: I think there should be a Nagios Users drinking game similiar to the Snort-Users drinking game. Everyone should be required to read the rules before unpacking Nagios and grieving. I suspect I caused most of the rules until I RTFM and used my thinker. The Advice for Beginners scared me enough. Marion ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From MKeith at ragansmith.com Fri Dec 2 16:52:52 2005 From: MKeith at ragansmith.com (Marion Keith) Date: Fri, 2 Dec 2005 09:52:52 -0600 Subject: Nagios Users drinking game? In-Reply-To: <1133533978.1254.9.camel@localhost> References: <1133533978.1254.9.camel@localhost> Message-ID: -----Original Message----- From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net]On Behalf Of Christian Joergensen Sent: Friday, December 02, 2005 8:33 AM To: MKeith at ragansmith.com Cc: nagios-users at lists.sourceforge.net Subject: Re: [Nagios-users] Nagios Users drinking game? On fre, 2005-12-02 at 08:12 -0600, Marion Keith wrote: > I think there should be a Nagios Users drinking game similiar to the > Snort-Users drinking game. Everyone should be required to read the rules > before unpacking Nagios and grieving. I suspect I caused most of the rules > until I RTFM and used my thinker. The Advice for Beginners scared me > enough. Could you elaborate? Here -> http://www.snort.org/docs/faq/1Q05/node136.html ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From pharrison at mainstream-tech.com Fri Dec 2 18:08:18 2005 From: pharrison at mainstream-tech.com (Paul Harrison) Date: Fri, 02 Dec 2005 11:08:18 -0600 Subject: Limiting number of notifications Message-ID: <1133543298.6283.2.camel@pharrison.mainstream-tech.com> Hello all, I am needing some info on limiting the number of notifications that are sent out on a service failure. In the notification.cfg and service.cfg I have tried using last_notification, but this causes an error when verifying the config files. Anyone out there doing something like this, and if so, would you share??? Thanks Paul Harrison ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Deborah.Martin at Kognitio.com Fri Dec 2 18:12:59 2005 From: Deborah.Martin at Kognitio.com (Deborah Martin) Date: Fri, 2 Dec 2005 17:12:59 -0000 Subject: Limiting number of notifications Message-ID: Paul, I think the option 'last_notification' is only valid when defining escalations (you haven't mentioned whether you are or not). The docs will tell you more on how this works. http://nagios.sourceforge.net/docs/2_0/xodtemplate.html look under service escalations or host escalations. regards, deborah -----Original Message----- From: Paul Harrison [mailto:pharrison at mainstream-tech.com] Sent: 02 December 2005 17:08 To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] Limiting number of notifications Hello all, I am needing some info on limiting the number of notifications that are sent out on a service failure. In the notification.cfg and service.cfg I have tried using last_notification, but this causes an error when verifying the config files. Anyone out there doing something like this, and if so, would you share??? Thanks Paul Harrison ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ************************************************************************ This email and any files transmitted with it are confidential and intended solely for the use of the individual or entity to whom they are addressed. Any unauthorised distribution or copying is strictly prohibited. Whilst Kognitio Limited takes steps to prevent the transmission of viruses via e-mail, we can not guarantee that any email or attachment is free from computer viruses and you are strongly advised to undertake your own anti-virus precautions. Kognitio grants no warranties regarding performance, use or quality of any e-mail or attachment and undertakes no liability for loss or damage, howsoever caused. *********************************************************************** ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From enediel at hotmail.com Fri Dec 2 21:04:31 2005 From: enediel at hotmail.com (enediel gonzalez) Date: Fri, 02 Dec 2005 20:04:31 +0000 Subject: recurrent downtime Message-ID: Hello I've tried to configure nagios with recurrent downtime, in the configuration I need to declare that everyday from 6pm to 8pm, one server will be unavailable. I got one srcript sched_downtime.tar.gz without get a real result yet. Thanks in advance for any answer Regards Enediel Linux user 398956 Debian GNU/Linux ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From al at its-lehmann.de Fri Dec 2 21:27:07 2005 From: al at its-lehmann.de (Arno Lehmann) Date: Fri, 02 Dec 2005 21:27:07 +0100 Subject: recurrent downtime In-Reply-To: References: Message-ID: <4390AE1B.7050008@its-lehmann.de> Hello, On 02.12.2005 21:04, enediel gonzalez wrote: > Hello > I've tried to configure nagios with recurrent downtime, in the > configuration I need to declare that everyday from 6pm to 8pm, one > server will be unavailable. And what did you try? Timeperiods for notificatiosn? Or would you prefer a complete notification setup? > I got one srcript sched_downtime.tar.gz without get a real result yet. And what is the unreal result? Currently, I'm not very good in remote mind reading, and I'm afraid nobody else is. Have you done anything to achieve what you want, or do you want someone to set up your monitoring for you? Arno > Thanks in advance for any answer > Regards > Enediel > Linux user 398956 > Debian GNU/Linux > > > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log > files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. ::: Messages without supporting info will risk > being sent to /dev/null > -- IT-Service Lehmann al at its-lehmann.de Arno Lehmann http://www.its-lehmann.de ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From tim.washburn at replacements.com Fri Dec 2 21:54:52 2005 From: tim.washburn at replacements.com (Timothy Washburn) Date: Fri, 2 Dec 2005 15:54:52 -0500 Subject: Cluster Drive Questions Message-ID: <56FB69819290E143870CF1073BA7AA7910639D5A@RL2KEXCH1.replacements.com> We have several Windows 2000 File and Print clusters. We would like to monitor the disk space on each of the cluster drives and have it go into warning and critical states as the space gets low. The problem we have is that the drives are only owned by one of the cluster nodes at a time and therefore can only be checked against one of the nodes. Has anyone implmented something similiar to this or does anyone have any thoughts on how to make this possible? We are using Nagios 1.2. -------------- next part -------------- An HTML attachment was scrubbed... URL: From rouilj at cs.umb.edu Fri Dec 2 22:24:40 2005 From: rouilj at cs.umb.edu (John P. Rouillard) Date: Fri, 02 Dec 2005 16:24:40 -0500 Subject: Should scheduling host downtime schedule downtime for services (nagios 2.0b4) Message-ID: <200512022124.jB2LOgYr013883@mx1.cs.umb.edu> Hi all: I'm running nagios 2.0b4. When you schedule downtime for a host shouldn't downtime be scheduled for all its services also? I realize that a host down event will stop the polling and notification on its services, but what effect does this have (if any) on the availability reports? -- rouilj John Rouillard =========================================================================== My employers don't acknowledge my existence much less my opinions. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rouilj at cs.umb.edu Fri Dec 2 22:38:00 2005 From: rouilj at cs.umb.edu (John P. Rouillard) Date: Fri, 02 Dec 2005 16:38:00 -0500 Subject: Cluster Drive Questions In-Reply-To: Your message of "Fri, 02 Dec 2005 15:54:52 EST." <56FB69819290E143870CF1073BA7AA7910639D5A@RL2KEXCH1.replacements.com> References: <56FB69819290E143870CF1073BA7AA7910639D5A@RL2KEXCH1.replacements.com> Message-ID: <200512022138.jB2Lc06u015901@mx1.cs.umb.edu> In message <56FB69819290E143870CF1073BA7AA7910639D5A at RL2KEXCH1.replacements.com >, "Timothy Washburn" writes: >We have several Windows 2000 File and Print clusters. We would like to >monitor the disk space on each of the cluster drives and have it go into >warning and critical states as the space gets low. The problem we have >is that the drives are only owned by one of the cluster nodes at a time >and therefore can only be checked against one of the nodes. Has anyone >implmented something similiar to this or does anyone have any thoughts >on how to make this possible? We are using Nagios 1.2. Hmm this would be easier with nagios 2 as you could access the current data more easily. What I would do is create 4 services: cluster disk check on host1, host2, host3 and host4. set them up to not notify. The plugin should return ok (maybe with a message, "cluster disk not connected") or unknown depending on the way you check the 4 services. Then on the nagios box create a service: correlate cluster disk check and write a plugin that acts like check_cluster but requires three unknown and 1 non unknown status. Return the non-unknown status: E.G. warning returns warning. If you get more than 1 unknown status throw up some other error. If you get no "known" status, throw an error. If your plugin exits with an OK status on a non-connected cluster drives then I think you can use check_cluster directly. -- rouilj John Rouillard =========================================================================== My employers don't acknowledge my existence much less my opinions. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From al at its-lehmann.de Fri Dec 2 22:44:33 2005 From: al at its-lehmann.de (Arno Lehmann) Date: Fri, 02 Dec 2005 22:44:33 +0100 Subject: Should scheduling host downtime schedule downtime for services (nagios 2.0b4) In-Reply-To: <200512022124.jB2LOgYr013883@mx1.cs.umb.edu> References: <200512022124.jB2LOgYr013883@mx1.cs.umb.edu> Message-ID: <4390C041.5050004@its-lehmann.de> Hello, On 02.12.2005 22:24, John P. Rouillard wrote: > Hi all: > > I'm running nagios 2.0b4. When you schedule downtime for a host > shouldn't downtime be scheduled for all its services also? > > I realize that a host down event will stop the polling and > notification on its services, but what effect does this have (if any) > on the availability reports? If I understand you correctly, the result is the expected one: Scheduled host downtime results in a scheduled downtime in the availability report. For the services of this host, the same happens: They are calculated as "scheduled warning" , "sched. unk." or "critical". Looks bad in the report to a customer, but it represents what actually happens, I think. One side-note: Using 2.0b4, I get (kind of) funny output from the service availability report (slightly edited, TAB-formatted): State Type / Reason Time % Total Time % Known Time OK Unscheduled 0d 7h 55m 21s 17.002% 17.002% Scheduled 0d 11h 56m 32s 25.629% 25.629% Total 0d 19h 51m 53s 42.631% 42.631% WARNING Unscheduled 0d 0h 0m 0s -35.59% 0.000% Scheduled 0d 0h 0m 0s 35.599% 0.000% Total 0d 0h 0m 0s 0.000% 0.000% UNKNOWN Unscheduled 0d 10h 8m 39s 21.770% 21.770% Scheduled 0d 16h 35m 18s 35.599% 35.599% Total 1d 2h 43m 57s 57.369% 57.369% CRITIC. Unscheduled 0d 0h 0m 0s 0.000% 0.000% Scheduled 0d 0h 0m 0s 0.000% 0.000% Total 0d 0h 0m 0s 0.000% 0.000% Undet. N. Not Running 0d 0h 0m 0s 0.000% Insuff. Data 0d 0h 0m 0s 0.000% Total 0d 0h 0m 0s 0.000% All Total 1d 22h 35m 50s 100.00% 100.000% Do you notice the -35% in column "Percent total time", row "Warning / unsched."? Can someone else verify this (keep in mind this is a host with lots of scheduled downtimes!), or is this even a known bug which I overlooked? Arno > -- rouilj > John Rouillard > =========================================================================== > My employers don't acknowledge my existence much less my opinions. > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- IT-Service Lehmann al at its-lehmann.de Arno Lehmann http://www.its-lehmann.de ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From msugano at uolinc.com Fri Dec 2 22:49:53 2005 From: msugano at uolinc.com (Marcel Mitsuto Fucatu Sugano) Date: Fri, 02 Dec 2005 19:49:53 -0200 Subject: Should scheduling host downtime schedule downtime for services (nagios 2.0b4) In-Reply-To: <200512022124.jB2LOgYr013883@mx1.cs.umb.edu> References: <200512022124.jB2LOgYr013883@mx1.cs.umb.edu> Message-ID: <1133560193.31165.34.camel@localhost> Hi John, I've asked this a week ago, and agree with you about the dependency of services within a host. Here is the way that another John (Jollet) kindly answered: --quote-- make a check template, like the "generic service" template...set passive checks enabled to 0 for it, then in each service for that box, replace "generic service" with that non-passive one. --quote-- I think you already knew this. But it is good because this could turn into a wish for new features, does anyone agree with this? I've seen posts sent by you at the nagios-devel list. Can you place this as a feature request? Good weekend to all, Marcel. On Fri, 2005-12-02 at 16:24 -0500, John P. Rouillard wrote: > Hi all: > > I'm running nagios 2.0b4. When you schedule downtime for a host > shouldn't downtime be scheduled for all its services also? > > I realize that a host down event will stop the polling and > notification on its services, but what effect does this have (if any) > on the availability reports? > > -- rouilj > John Rouillard > =========================================================================== > My employers don't acknowledge my existence much less my opinions. > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null -- Marcel Mitsuto Fucatu Sugano Universo Online S.A. -- http://www.uol.com.br ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Sat Dec 3 06:07:59 2005 From: marc at ena.com (Marc Powell) Date: Fri, 2 Dec 2005 23:07:59 -0600 Subject: recurrent downtime Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of enediel gonzalez > Sent: Friday, December 02, 2005 2:05 PM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] recurrent downtime > > Hello > I've tried to configure nagios with recurrent downtime, in the > configuration > I need to declare that everyday from 6pm to 8pm, one server will be > unavailable. How about specifying a check_period that doesn't include those hours. If it's still down after 8 pm you'll get a notification. -- Marc ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mfatihakbulut at gmail.com Sat Dec 3 12:12:17 2005 From: mfatihakbulut at gmail.com (Mehmet Fatih AKBULUT) Date: Sat, 3 Dec 2005 13:12:17 +0200 Subject: Getting sms and ContactGroup Problems. Message-ID: hi all. my first question is this : i have a GSM Modem and i want to receive sms' through it when a critical problem occurs in any one of the services i check by nagios. after pluging this device into my computer, what next should i do so as to start receiving sms' when anything goes wrong ? do i need to write any code for this ? (Service definition i mean?) (ps: if theres a howto or doc page please send link to me or if not please give some details on achieving this goal.) and the second one : i want to have 2 different contact groups: one is for workhours and the other is outside the workhours. to briefly explain this : i dont want to receive any mail from nagios in workhours (will send to someone else), but if something goes wrong outside the workhours this time the nagios should send the mails just to me (or me and the other guy that also receive in workhours). how do i accomplish this :p any help will be greatly appreciated ;) Regards, Bye (ps: since being a newbie on nagios, open for any idea or suggestion :p) -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris at chrisburgess.com.au Sat Dec 3 12:32:21 2005 From: chris at chrisburgess.com.au (Chris Burgess) Date: Sat, 03 Dec 2005 22:32:21 +1100 Subject: Getting sms and ContactGroup Problems. In-Reply-To: References: Message-ID: On 3/12/05 10:12 PM, "Mehmet Fatih AKBULUT" wrote: > hi all. > my first question is this : > i have a GSM Modem and i want to receive sms' through it when a critical > problem occurs in any one of the services i check by nagios. > after pluging this device into my computer, what next should i do so as to > start receiving sms' when anything goes wrong ? > do i need to write any code for this ? (Service definition i mean?) > (ps: if theres a howto or doc page please send link to me or if not please > give some details on achieving this goal.) Hi, You have a few options, try here: http://www.nagios.org/faqs/viewfaq.php?faq_id=65 http://www.nagios.org/faqs/viewfaq.php?faq_id=279 The list archive is also very useful: http://www.nagiosexchange.org/nagios-users.34.0.html Cheers, Chris ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Mark.Pearson at uftwf.org Sat Dec 3 15:45:30 2005 From: Mark.Pearson at uftwf.org (Mark Pearson) Date: Sat, 3 Dec 2005 09:45:30 -0500 Subject: Using NRPE to call remote event_handlers Message-ID: <932A04DB269BFF4F886C2ACD54C62CAC02C6AFFB@uftwfexchsvr.uftmaster.org> I am using a sudo in the script as I can run the script that is in the nrpe.cfg on the machine in question however, when I try to run it via the check_nrpe command the remote script does not seem to run. 1) How does everyone else restart services on remotely monitored computers? i.e. the Nagios monitor is running on a computer in NY and the remote computer is in CA. If the httpd service stops how would you use nagios to restart the service? 2) Is there a way I can debug the remote client to see where the problem is? Thanks for the reply. _____ From: moshe sharon [mailto:moshesharon at gmail.com] Sent: Tuesday, November 22, 2005 1:44 AM To: Nagios Users Mailing List Subject: Re: [Nagios-users] Using NRPE to call remote event_handlers Hello First nrpe only returns one line from output so if your restarting procedure returns multiple lines you wont see all the lines from the check_nrpe. second i see you are passing arguments with -a did you enable nrpe with command arguments ? and final you can call sudo before your script "/usr/bin/sudo /usr/local/nagios/libexec/check_disk ..." other then that seems you doing it perfectly fine Moshe Sharon ******************************************************************************* The views, opinions, and judgments expressed in this message are solely those of the author. The message contents have not been reviewed or approved by the UFT Welfare Fund. ******************************************************************************* -------------- next part -------------- An HTML attachment was scrubbed... URL: From hvdkooij at vanderkooij.org Sat Dec 3 16:02:46 2005 From: hvdkooij at vanderkooij.org (Hugo van der Kooij) Date: Sat, 3 Dec 2005 16:02:46 +0100 (CET) Subject: Using NRPE to call remote event_handlers In-Reply-To: <932A04DB269BFF4F886C2ACD54C62CAC02C6AFFB@uftwfexchsvr.uftmaster.org> References: <932A04DB269BFF4F886C2ACD54C62CAC02C6AFFB@uftwfexchsvr.uftmaster.org> Message-ID: On Sat, 3 Dec 2005, Mark Pearson wrote: > 1) How does everyone else restart services on remotely monitored > computers? i.e. the Nagios monitor is running on a computer in NY and the > remote computer is in CA. If the httpd service stops how would you use > nagios to restart the service? Are serious about not having access to the system? This could be done with SSH. Frankly a Nagios system should be accessible on the command prompt by anyone maintaining the system. Hugo. -- I hate duplicates. Just reply to the relevant mailinglist. hvdkooij at vanderkooij.org http://hvdkooij.xs4all.nl/ Don't meddle in the affairs of magicians, for they are subtle and quick to anger. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Mark.Pearson at uftwf.org Sat Dec 3 16:13:21 2005 From: Mark.Pearson at uftwf.org (Mark Pearson) Date: Sat, 3 Dec 2005 10:13:21 -0500 Subject: Using NRPE to call remote event_handlers Message-ID: <932A04DB269BFF4F886C2ACD54C62CAC02C6AFFE@uftwfexchsvr.uftmaster.org> Hugo, Not sure if you don't understand what I mean or that perhaps presented this incorrectly. I have access to all my systems, however I would like nagios to automatically attempt to restart a service in the event it stops. What do you do if a service stops? -----Original Message----- From: Hugo van der Kooij [mailto:hvdkooij at vanderkooij.org] Sent: Saturday, December 03, 2005 10:03 AM To: 'Nagios Users Mailing List' Subject: RE: [Nagios-users] Using NRPE to call remote event_handlers On Sat, 3 Dec 2005, Mark Pearson wrote: > 1) How does everyone else restart services on remotely monitored > computers? i.e. the Nagios monitor is running on a computer in NY and the > remote computer is in CA. If the httpd service stops how would you use > nagios to restart the service? Are serious about not having access to the system? This could be done with SSH. Frankly a Nagios system should be accessible on the command prompt by anyone maintaining the system. Hugo. -- I hate duplicates. Just reply to the relevant mailinglist. hvdkooij at vanderkooij.org http://hvdkooij.xs4all.nl/ Don't meddle in the affairs of magicians, for they are subtle and quick to anger. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ******************************************************************************* The views, opinions, and judgments expressed in this message are solely those of the author. The message contents have not been reviewed or approved by the UFT Welfare Fund. ******************************************************************************* ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Mark.Pearson at uftwf.org Sat Dec 3 16:03:58 2005 From: Mark.Pearson at uftwf.org (Mark Pearson) Date: Sat, 3 Dec 2005 10:03:58 -0500 Subject: Using NRPE to call remote event_handlers Message-ID: <932A04DB269BFF4F886C2ACD54C62CAC02C6AFFC@uftwfexchsvr.uftmaster.org> Oh and yes I did configure nrpe with the enable commands. In fact, I use args with nrpe on plugins but this problem is with the event handlers. Here is a copy of my event handler: But like I mentioned before, this script works running locally but via check_nrpe from the central monitoring machine it doesn't work. #!/bin/sh # # Event handler script for restarting the web server on the local machine # # Note: This script will only restart the web server if the service is # retried 3 times (in a "soft" state) or if the web service somehow # manages to fall into a "hard" error state. # set -x # What state is the HTTP service in? case "$1" in OK) ;; WARNING) ;; UNKNOWN) ;; CRITICAL) # Is this a "soft" or a "hard" state? case "$2" in SOFT) case "$3" in 2) echo -n "Restarting Krang service (2rd soft critical state)..." sudo /etc/rc.d/init.d/krang restart ;; esac ;; HARD) echo -n "This server would be failing over to the other node right now" | /usr/bin/logger -t $TAG # echo -n "Failing over to backup node ..." # sudo /usr/lib/heartbeat/hb_standby ;; esac ;; esac exit 0 _____ From: Mark Pearson [mailto:Mark.Pearson at uftwf.org] Sent: Saturday, December 03, 2005 9:46 AM To: 'Nagios Users Mailing List' Subject: RE: [Nagios-users] Using NRPE to call remote event_handlers I am using a sudo in the script as I can run the script that is in the nrpe.cfg on the machine in question however, when I try to run it via the check_nrpe command the remote script does not seem to run. 1) How does everyone else restart services on remotely monitored computers? i.e. the Nagios monitor is running on a computer in NY and the remote computer is in CA. If the httpd service stops how would you use nagios to restart the service? 2) Is there a way I can debug the remote client to see where the problem is? Thanks for the reply. _____ From: moshe sharon [mailto:moshesharon at gmail.com] Sent: Tuesday, November 22, 2005 1:44 AM To: Nagios Users Mailing List Subject: Re: [Nagios-users] Using NRPE to call remote event_handlers Hello First nrpe only returns one line from output so if your restarting procedure returns multiple lines you wont see all the lines from the check_nrpe. second i see you are passing arguments with -a did you enable nrpe with command arguments ? and final you can call sudo before your script "/usr/bin/sudo /usr/local/nagios/libexec/check_disk ..." other then that seems you doing it perfectly fine Moshe Sharon **************************************************************************** *** The views, opinions, and judgments expressed in this message are solely those of the author. The message contents have not been reviewed or approved by the UFT Welfare Fund. **************************************************************************** *** ******************************************************************************* The views, opinions, and judgments expressed in this message are solely those of the author. The message contents have not been reviewed or approved by the UFT Welfare Fund. ******************************************************************************* -------------- next part -------------- An HTML attachment was scrubbed... URL: From vladimir at gopb.ru Sat Dec 3 17:13:04 2005 From: vladimir at gopb.ru (Vladimir Potapov) Date: Sat, 3 Dec 2005 19:13:04 +0300 Subject: Send SMS alert via mobile Message-ID: <1549786643.20051203191304@gopb.ru> Hello everyone! I'm using nagios for monitoring some servers and want to receive nagios notifications via SMS. I know that some software can send SMS via internet, but sometimes SMS comes with delay or never comes because internet channel in down. It's not a good solution for me. A more good solution - send SMS via mobile phone connected to monitoring server. This can do by SCMxx, but it support very old phone models, which are removed from manufacture. Can anyone recommend command line tool for Linux/*BSD, which can send SMS from mobile phone and support more newer mobile phones? ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From andrea.gabellini at telecomitalia.sm Sat Dec 3 18:03:29 2005 From: andrea.gabellini at telecomitalia.sm (Andrea Gabellini) Date: Sat, 03 Dec 2005 18:03:29 +0100 Subject: Send SMS alert via mobile In-Reply-To: <1549786643.20051203191304@gopb.ru> References: <1549786643.20051203191304@gopb.ru> Message-ID: <4391CFE1.1090907@telecomitalia.sm> http://smstools.meinemullemaus.de/ Vladimir Potapov wrote: > Hello everyone! > > I'm using nagios for monitoring some servers and want to receive > nagios notifications via SMS. I know that some software can send SMS > via internet, but sometimes SMS comes with delay or never comes > because internet channel in down. It's not a good solution for me. > A more good solution - send SMS via mobile phone connected to > monitoring server. This can do by SCMxx, but it support very old phone > models, which are removed from manufacture. > > Can anyone recommend command line tool for Linux/*BSD, which can send > SMS from mobile phone and support more newer mobile phones? > > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > -- --------------------------------------- Hard work has a future payoff. Laziness pays off now. --------------------------------------- Ing. Andrea Gabellini Email: andrea.gabellini at telecomitalia.sm Tel: 0549 886111 (Italy) Tel. +378 0549 886111 (International) Telecom Italia San Marino S.p.A. Strada degli Angariari, 3 47891 Rovereta Repubblic of San Marino http://www.omniway.sm http://www.telecomitalia.sm ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From moshesharon at gmail.com Sat Dec 3 19:47:57 2005 From: moshesharon at gmail.com (moshe sharon) Date: Sat, 3 Dec 2005 20:47:57 +0200 Subject: Send SMS alert via mobile In-Reply-To: <4391CFE1.1090907@telecomitalia.sm> References: <1549786643.20051203191304@gopb.ru> <4391CFE1.1090907@telecomitalia.sm> Message-ID: <88ad29240512031047i3a308644n3cd42f2f534f3be2@mail.gmail.com> Hello I'm using gnokii with nokia phone connected to COM1 and it works perfect. but i think this utility only supports nokia phone Moshe Sharon -------------- next part -------------- An HTML attachment was scrubbed... URL: From hvdkooij at vanderkooij.org Sat Dec 3 21:20:44 2005 From: hvdkooij at vanderkooij.org (Hugo van der Kooij) Date: Sat, 3 Dec 2005 21:20:44 +0100 (CET) Subject: Using NRPE to call remote event_handlers In-Reply-To: <932A04DB269BFF4F886C2ACD54C62CAC02C6AFFE@uftwfexchsvr.uftmaster.org> References: <932A04DB269BFF4F886C2ACD54C62CAC02C6AFFE@uftwfexchsvr.uftmaster.org> Message-ID: On Sat, 3 Dec 2005, Mark Pearson wrote: > Not sure if you don't understand what I mean or that perhaps presented this > incorrectly. I have access to all my systems, however I would like nagios > to automatically attempt to restart a service in the event it stops. > > What do you do if a service stops? Write a monitor script and put it in a crontab entry. Run it say every 15 minutes Hugo. -- I hate duplicates. Just reply to the relevant mailinglist. hvdkooij at vanderkooij.org http://hvdkooij.xs4all.nl/ Don't meddle in the affairs of magicians, for they are subtle and quick to anger. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From az at whoever.org Sun Dec 4 00:31:36 2005 From: az at whoever.org (Az) Date: Sun, 04 Dec 2005 09:31:36 +1000 Subject: Cluster Drive Questions In-Reply-To: <56FB69819290E143870CF1073BA7AA7910639D5A@RL2KEXCH1.replacements.com> References: <56FB69819290E143870CF1073BA7AA7910639D5A@RL2KEXCH1.replacements.com> Message-ID: <43922AD8.5090005@whoever.org> Timothy Washburn wrote: > We have several Windows 2000 File and Print clusters. We would like > to monitor the disk space on each of the cluster drives and have it go > into warning and critical states as the space gets low. The problem > we have is that the drives are only owned by one of the cluster nodes > at a time and therefore can only be checked against one of the nodes. > Has anyone implmented something similiar to this or does anyone have > any thoughts on how to make this possible? We are using Nagios 1.2. There's probably 100 ways to skin this cat. The way we've done these sorts of setups in v1.2/1.3 are... * A host definition for each physical cluster node (eg ahost01a, ahost01b...) * Add the usual suspects for service checks to each physical (eg local disk, cpu, memory, swap...). * A host definition for the virtual node(s) in the cluster (eg ahost01-print, ahost01-file...). * Add the clustered service checks to the virtuals, including a check_cluster one as well (eg only the disks/Windows service checks that relate to that clustered service). This would only work if you have a separate service IP for each of the virtuals. Cheers. -------------- next part -------------- An HTML attachment was scrubbed... URL: From damian at workgroupsolutions.com Sun Dec 4 02:57:35 2005 From: damian at workgroupsolutions.com (Damian Mendoza) Date: Sat, 3 Dec 2005 17:57:35 -0800 Subject: NRPE with Mac OS/X Server Message-ID: <0C941442AC84A8449448BA2207DD4F4D045182@core01.workgroupsolutions.com> Hi, Has anyone successfully compiled NRPE for the MAC OS/X servers? I need to monitor a number of OS/X servers with Nagios server. Can I compile it on one MAC OS/X server and copy the compiled NRPE program to the MAC OS/X servers? Or do I need to install the Mac OS/X development software on each OS/X server and compile each individually? Thanks, Damian -------------- next part -------------- An HTML attachment was scrubbed... URL: From hvdkooij at vanderkooij.org Sun Dec 4 09:47:52 2005 From: hvdkooij at vanderkooij.org (Hugo van der Kooij) Date: Sun, 4 Dec 2005 09:47:52 +0100 (CET) Subject: NRPE with Mac OS/X Server In-Reply-To: <0C941442AC84A8449448BA2207DD4F4D045182@core01.workgroupsolutions.com> References: <0C941442AC84A8449448BA2207DD4F4D045182@core01.workgroupsolutions.com> Message-ID: On Sat, 3 Dec 2005, Damian Mendoza wrote: > Can I compile it on one MAC OS/X server and copy the compiled NRPE > program to the MAC OS/X servers? Or do I need to install the Mac OS/X > development software on each OS/X server and compile each individually? I gess you can copy binaries. without the need of a development kit on each system. Just make sure they have the dependencies for the binaries covered. Having said that I have 0 hours of hands-on experience with OS/X. Hugo. -- I hate duplicates. Just reply to the relevant mailinglist. hvdkooij at vanderkooij.org http://hvdkooij.xs4all.nl/ Don't meddle in the affairs of magicians, for they are subtle and quick to anger. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From asmoore at edge.net Sun Dec 4 18:39:51 2005 From: asmoore at edge.net (Alex Moore) Date: Sun, 4 Dec 2005 11:39:51 -0600 Subject: trouble with check_mysql and USER macros In-Reply-To: <72a19efd0512011622r16fd2811t1a15a314c0b4d50b@mail.gmail.com> References: <72a19efd0511301823r135b0166ra98503096df8e153@mail.gmail.com> <72a19efd0512011622r16fd2811t1a15a314c0b4d50b@mail.gmail.com> Message-ID: <20051204113951.0000480b@sws602> On Thu, 1 Dec 2005 16:22:43 -0800 Aleksey Tsalolikhin wrote: > My consideration was that then I'd only be able to check one database, > the one that password fits. > But I only HAVE one database right now, so this works; thank you. By database, I assume that you meant SQL Server. The script just counts the login spids > 50. It only connects to the default database, master and just needs the ability to login. I created a SQL Server login for the nagios daemon user on all SQL Servers that I want to monitor. With the default permissions of nothing, the nagios user cannot connect to anything but master with public permissions. Did I miss something? Alex ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From fmml at cedval.org Sun Dec 4 20:05:11 2005 From: fmml at cedval.org (Francois Meehan) Date: Sun, 4 Dec 2005 14:05:11 -0500 (EST) Subject: monitoring an Exchange server with NC_net, passive mode... Message-ID: <3068.192.168.41.122.1133723111.squirrel@whoami7.cedval.org> Hi all, I was asked the "daunting task" of monitoring a remote Exchange server with Nagios. My first idea is to use NC_net in passive mode, using WMI counters. As anyone tried a similar set up? If so, can you share your experience? Regards, Francois Meehan ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From s.shipway at auckland.ac.nz Sun Dec 4 21:54:26 2005 From: s.shipway at auckland.ac.nz (Steve Shipway) Date: Mon, 5 Dec 2005 09:54:26 +1300 Subject: monitoring an Exchange server with NC_net, passive mode... In-Reply-To: <3068.192.168.41.122.1133723111.squirrel@whoami7.cedval.org> References: <3068.192.168.41.122.1133723111.squirrel@whoami7.cedval.org> Message-ID: <005101c5f914$ea323bf0$01fea8c0@itss.auckland.ac.nz> >I was asked the "daunting task" of monitoring a remote >Exchange server with Nagios. > >My first idea is to use NC_net in passive mode, using WMI counters. > >As anyone tried a similar set up? If so, can you share your >experience? Yes, we do this. We also use the same agent (actually pNSclient 2.0.1, but NC_net does the same thing) to obtain numbers to graph with MRTG. Works fine, you just need to pick the right counters. We check for inbound/outbound queue length, SMTP, services up, and message throughput. Steve ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From s.shipway at auckland.ac.nz Sun Dec 4 21:59:22 2005 From: s.shipway at auckland.ac.nz (Steve Shipway) Date: Mon, 5 Dec 2005 09:59:22 +1300 Subject: Cluster Drive Questions In-Reply-To: <56FB69819290E143870CF1073BA7AA7910639D5A@RL2KEXCH1.replacements.com> References: <56FB69819290E143870CF1073BA7AA7910639D5A@RL2KEXCH1.replacements.com> Message-ID: <005201c5f915$9a943200$01fea8c0@itss.auckland.ac.nz> >We have several Windows 2000 File and Print clusters. >We would like to monitor the disk space on each of the >cluster drives and have it go into warning and critical >states as the space gets low. The problem we have is >that the drives are only owned by one of the cluster >nodes at a time and therefore can only be checked against >one of the nodes. Has anyone implmented something similiar >to this or does anyone have any thoughts on how to make >this possible? We are using Nagios 1.2. We do this by checking the hosts individually, and the cluster drive via the cluster virtual IP. Thus, we define 3 hosts, the two real ones, and the third 'virtual'. The cluster drive is checked on the virtual, the other things like CPU are checked on the real. If you don't have a virtual IP, then maybe define two non-alerting services to check the drive on each machine. One of them will always give a 'critical drive not found' error. Then, use check_summary (from nagiosexchange.org) to define a summary service of the MINIMUM of the two statuses -- '@min( //host1/drive //host2/drive )'. This will then return a warn or critical when the real one goes out of OK state. We use something similar to this for our other cluster monitoring. Steve ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From s.shipway at auckland.ac.nz Sun Dec 4 22:01:38 2005 From: s.shipway at auckland.ac.nz (Steve Shipway) Date: Mon, 5 Dec 2005 10:01:38 +1300 Subject: recurrent downtime In-Reply-To: References: Message-ID: <005301c5f915$eae7dae0$01fea8c0@itss.auckland.ac.nz> >I've tried to configure nagios with recurrent downtime, in the >configuration >I need to declare that everyday from 6pm to 8pm, one server will be >unavailable. > >I got one srcript sched_downtime.tar.gz without get a real result yet. It's a bit unclear as to what you've already tried, and what problems you are having making it work. There are several downtime schedulers available on nagiosexchange (including one I wrote :) which usually consist of a single script to run via cron and another CGI for configuring. It shouldn't be hard to get them working, but noone here will be able to help unless you state exactly what problems you're having. Steve ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From s.shipway at auckland.ac.nz Sun Dec 4 22:21:57 2005 From: s.shipway at auckland.ac.nz (Steve Shipway) Date: Mon, 5 Dec 2005 10:21:57 +1300 Subject: trouble with check_mysql and USER macros In-Reply-To: <72a19efd0511301823r135b0166ra98503096df8e153@mail.gmail.com> References: <72a19efd0511301823r135b0166ra98503096df8e153@mail.gmail.com> Message-ID: <005801c5f918$c1dbdc70$01fea8c0@itss.auckland.ac.nz> > command_line $USER1$/check_mysql -H $ARG1$ -d $ARG2$ -u $ARG3$ -p $ARG4$ ... > check_command check_mysql!192.168.160.2!mydbname!mydbusername!$USER4$ > >and setting $USER4$ in resource.cfg, I get error > >Access denied for user 'mydbusername'@'192.168.160.1' (using >password: YES) I believe that you may not be able to use $USERx$ variables in host/service definitions, although I may be wrong. It is a bit irritating that (in Nagios 1.2) certain variables are available in some places and not others - for example, not being able to pass parameters to eventhandlers. This is buried in the manual somewhere, I think. In this case, you may be better off defining a separate check command, check_myssql_def (for example), that only takes 3 parameters and is defined like command_line $USER1$/check_mysql -H $ARG1$ -d "$ARG2$" -u "$ARG3$" -p "$USER4$" Note that it is good practice to have the quotes around strings, just in case of embedded metacharacters (especially in the case of passwords). HTH Steve ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From aleksey.tsalolikhin at gmail.com Sun Dec 4 23:24:51 2005 From: aleksey.tsalolikhin at gmail.com (Aleksey Tsalolikhin) Date: Sun, 4 Dec 2005 14:24:51 -0800 Subject: trouble with check_mysql and USER macros In-Reply-To: <005801c5f918$c1dbdc70$01fea8c0@itss.auckland.ac.nz> References: <72a19efd0511301823r135b0166ra98503096df8e153@mail.gmail.com> <005801c5f918$c1dbdc70$01fea8c0@itss.auckland.ac.nz> Message-ID: <72a19efd0512041424k1039a5etdf922bd5fb4c438e@mail.gmail.com> On 12/4/05, Steve Shipway wrote: > > you may be better off defining a separate check command, > check_myssql_def (for example), that only takes 3 parameters ... > Note that it is good practice to have the quotes around strings, just in > case of embedded metacharacters (especially in the case of passwords). Acknowledged, thanks! This does help. Best, Aleksey ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Sun Dec 4 23:24:51 2005 From: marc at ena.com (Marc Powell) Date: Sun, 4 Dec 2005 16:24:51 -0600 Subject: trouble with check_mysql and USER macros Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Steve Shipway > Sent: Sunday, December 04, 2005 3:22 PM > To: 'Aleksey Tsalolikhin'; nagios-users at lists.sourceforge.net > Subject: RE: [Nagios-users] trouble with check_mysql and USER macros > > > command_line $USER1$/check_mysql -H $ARG1$ -d $ARG2$ -u $ARG3$ > -p $ARG4$ > ... > > check_command > check_mysql!192.168.160.2!mydbname!mydbusername!$USER4$ > > > >and setting $USER4$ in resource.cfg, I get error > > > >Access denied for user 'mydbusername'@'192.168.160.1' (using > >password: YES) > > I believe that you may not be able to use $USERx$ variables in > host/service > definitions, although I may be wrong. It is a bit irritating that (in > Nagios 1.2) certain variables are available in some places and not others > - > for example, not being able to pass parameters to eventhandlers. This is > buried in the manual somewhere, I think. > That is correct, macros can only be used in command definitions and can not be used in other object types or passed from other object types (excepting the $ARGn$ macros). I don't know what parameters you were wanting to pass to event handlers but with 2.x it now appears possible to pass almost all macros, including $ARGn$ and $USERn$ macros to event handlers. http://nagios.sourceforge.net/docs/2_0/macros.html "One of the features available in Nagios is the ability to use macros in command defintions. Immediately prior to the execution of a command, Nagios will replace all macros in the command with their corresponding values. This allows you to define a few generic commands to handle all your needs." That page goes on to detail specifically when each macro is available for use. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From tdondich at gmail.com Mon Dec 5 00:53:55 2005 From: tdondich at gmail.com (Taylor Dondich) Date: Sun, 4 Dec 2005 15:53:55 -0800 Subject: LISA Conference, Nagios Event Broker Documentation, my talks Message-ID: <3d2eb72a0512041553t71331e93i960a39b2a7ab5e5e@mail.gmail.com> Hi all, just letting anyone know, if they're going to LISA, I'll be there with my company and we'll be doing two talks. One regarding Nagios 2.0, and one regarding our company. During the Nagios 2.0 talks, I'll be discussing the Nagios 2.0 Event Broker and will be providing documentation regarding this feature. This is the first of documentation releases I'll be providing on using the event broker interface to extend nagios to provide further functionality. If anyone's going to LISA, come find me. I'll be talking with Groundwork Open Source Solutions, or you can find our booth. Thanks! Taylor ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jhmartin at toger.us Mon Dec 5 03:30:42 2005 From: jhmartin at toger.us (Jason Martin) Date: Sun, 4 Dec 2005 21:30:42 -0500 Subject: Cluster Drive Questions In-Reply-To: <43922AD8.5090005@whoever.org> References: <56FB69819290E143870CF1073BA7AA7910639D5A@RL2KEXCH1.replacements.com> <43922AD8.5090005@whoever.org> Message-ID: <20051205023042.GA27507@mal.members.linode.com> On Sun, Dec 04, 2005 at 09:31:36AM +1000, Az wrote: > >we have is that the drives are only owned by one of the cluster nodes > >at a time and therefore can only be checked against one of the nodes. You could configure a 'service' that checks for the existence of the drive, then a service dependency between the two such that the drive is only checked when it is present. -Jason Martin -- Never draw fire, it irritates everyone around you This message is PGP/MIME signed. -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 211 bytes Desc: not available URL: From jhmartin at toger.us Mon Dec 5 03:33:53 2005 From: jhmartin at toger.us (Jason Martin) Date: Sun, 4 Dec 2005 21:33:53 -0500 Subject: Using NRPE to call remote event_handlers In-Reply-To: References: <932A04DB269BFF4F886C2ACD54C62CAC02C6AFFE@uftwfexchsvr.uftmaster.org> Message-ID: <20051205023353.GB27507@mal.members.linode.com> On Sat, Dec 03, 2005 at 09:20:44PM +0100, Hugo van der Kooij wrote: > > incorrectly. I have access to all my systems, however I would like nagios > > to automatically attempt to restart a service in the event it stops. > > What do you do if a service stops? Write a script for NRPE that performs the restart and invoke it as a event handler. You may have to use passwordless SUDO to give it the correct permissions. I suggest thinking about using tcp wrappers or iptables, as anyone with a NRPE client could invoke the restart script. -Jason MArtin > Write a monitor script and put it in a crontab entry. Run it say every 15 > minutes > > Hugo. > -- Never draw fire, it irritates everyone around you This message is PGP/MIME signed. -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 211 bytes Desc: not available URL: From morten.bekkelund at ergo.no Mon Dec 5 10:33:32 2005 From: morten.bekkelund at ergo.no (Morten Bekkelund) Date: Mon, 5 Dec 2005 10:33:32 +0100 Subject: processing passive host check results Message-ID: <8C6D7826E62F404D94BB0197258F548BBC8E0D@100NOOSLMSG004.common.alpharoot.net> Hi everyone! First of all I apology if this issue has been raised earlier, but I've done some googling and haven't found the answer to our problem yet. I'm having a little issue now and then with our distributed nagios solution. Seems sometimes the leefnode is having trouble (or so it seems) sending "OK" check results to the topnode after a host has been down and gotten back up. It might be a problem with the topnode not processing the "OK" check. The leefnode sends an OK status to the topnode and I can find a "ping OK" in the topnode logfile while the web-pages still only showing the host as down (yet it's services as UP). This only happens like 5-10 times pr week on 3200 hosts so it's a fairly limited problem. Anyone seen anything similar? I'm looking for pointers to documentation that helps me solve the problem or maybe some sort of a workaround. Thanks in advance. Morten Bekkelund Nagios-admin, ErgoGroup Sourcing ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From m.borsani at it.net Mon Dec 5 10:44:08 2005 From: m.borsani at it.net (Marco Borsani) Date: Mon, 5 Dec 2005 10:44:08 +0100 Subject: R: Send SMS alert via mobile In-Reply-To: <4391CFE1.1090907@telecomitalia.sm> References: <4391CFE1.1090907@telecomitalia.sm> Message-ID: <008801c5f980$70cc8970$0900d40a@intranet.it.net> Smstools is very nice, but I cat use it on my HP-UX server (I can't compile on it). Does anyone use something else on HP-UX? Regards Marco -}-----Messaggio originale----- -}Da: nagios-users-admin at lists.sourceforge.net -}[mailto:nagios-users-admin at lists.sourceforge.net] Per conto -}di Andrea Gabellini -}Inviato: sabato 3 dicembre 2005 18.03 -}A: Vladimir Potapov -}Cc: nagios-users -}Oggetto: Re: [Nagios-users] Send SMS alert via mobile -} -}http://smstools.meinemullemaus.de/ -} -}Vladimir Potapov wrote: -}> Hello everyone! -}> -}> I'm using nagios for monitoring some servers and want to receive -}> nagios notifications via SMS. I know that some software can -}send SMS -}> via internet, but sometimes SMS comes with delay or never comes -}> because internet channel in down. It's not a good solution for me. -}> A more good solution - send SMS via mobile phone connected to -}> monitoring server. This can do by SCMxx, but it support -}very old phone -}> models, which are removed from manufacture. -}> -}> Can anyone recommend command line tool for Linux/*BSD, -}which can send -}> SMS from mobile phone and support more newer mobile phones? -}> -}> -}> -}> ------------------------------------------------------- -}> This SF.net email is sponsored by: Splunk Inc. Do you grep -}through log -}> files for problems? Stop! Download the new AJAX search -}engine that -}> makes searching your log files as easy as surfing the web. -} DOWNLOAD SPLUNK! -}> http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click -}> _______________________________________________ -}> Nagios-users mailing list -}> Nagios-users at lists.sourceforge.net -}> https://lists.sourceforge.net/lists/listinfo/nagios-users -}> ::: Please include Nagios version, plugin version (-v) and -}OS when reporting any issue. -}> ::: Messages without supporting info will risk being sent -}to /dev/null -}> -}> -} -}-- -} -}--------------------------------------- -}Hard work has a future payoff. Laziness pays off now. -}--------------------------------------- -}Ing. Andrea Gabellini -}Email: andrea.gabellini at telecomitalia.sm -}Tel: 0549 886111 (Italy) -}Tel. +378 0549 886111 (International) -} -}Telecom Italia San Marino S.p.A. -}Strada degli Angariari, 3 -}47891 Rovereta -}Repubblic of San Marino -} -}http://www.omniway.sm http://www.telecomitalia.sm -} -} -}------------------------------------------------------- -}This SF.net email is sponsored by: Splunk Inc. Do you grep -}through log files for problems? Stop! Download the new AJAX -}search engine that makes searching your log files as easy as -}surfing the web. DOWNLOAD SPLUNK! -}http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click -}_______________________________________________ -}Nagios-users mailing list -}Nagios-users at lists.sourceforge.net -}https://lists.sourceforge.net/lists/listinfo/nagios-users -}::: Please include Nagios version, plugin version (-v) and OS -}when reporting any issue. -}::: Messages without supporting info will risk being sent to /dev/null -} ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From qeldroma at freenet.de Mon Dec 5 11:23:07 2005 From: qeldroma at freenet.de (qeldroma) Date: Mon, 5 Dec 2005 11:23:07 +0100 Subject: Probs with nagios-service/status Message-ID: <200512051123.07721.qeldroma@freenet.de> "It appears as though Nagios is not running, so commands are temporarily unavailable..." ..is the message. I know that this has been discussed severall times...Unfortunately i read many of them and didn't solve my problem. Ok, the facts: I installed nagios 1.2 with MySQL-db. As i can see on the perfdata, logging of them works. Nagios-plugins are 1.4.2. If i try "check_nagios_db.pl" as root or nagios, it works! What do you need else to help me? Can someone help me out step by step? I soon need to have the management-options in nagios that are provided by this feature... Kind regards, Qeldroma ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From fernando.shayani at bsb.politec.com.br Mon Dec 5 13:01:56 2005 From: fernando.shayani at bsb.politec.com.br (Fernando Shayani) Date: Mon, 5 Dec 2005 09:01:56 -0300 Subject: Nagios HANGS scheduling info Message-ID: <1964B929D8FF4D4DB06854A75D0FD3730CABC44A@bsbserv028.bsb.politec.com.br> Well, I upgraded to b6, enabled the Orphaned Service and still hangs... Is there any other clue? Fernando Shayani fernando.shayani at bsb.politec.com.br (61) 3038-6951 POLITEC - Bras?lia - DF _____ From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Fernando Shayani Sent: quinta-feira, 1 de dezembro de 2005 09:25 To: Eli Stair; nagios-users at lists.sourceforge.net Subject: RE: [Nagios-users] Nagios HANGS scheduling info Nagios STATD isn't running. I will enable Orphaned Service and check it out again. I will also upgrade to Beta 6 Thanks Fernando Shayani fernando.shayani at bsb.politec.com.br (61) 3038-6951 POLITEC - Bras?lia - DF _____ From: Eli Stair [mailto:estair at ilm.com] Sent: ter?a-feira, 29 de novembro de 2005 14:57 To: Fernando Shayani; nagios-users at lists.sourceforge.net Subject: RE: [Nagios-users] Nagios HANGS scheduling info Try enabling the orphaned service checks if you haven't already. /eli -----Original Message----- From: nagios-users-admin at lists.sourceforge.net on behalf of Fernando Shayani Sent: Tue 11/29/2005 6:17 AM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] Nagios HANGS scheduling info A lot of times (1 or 2 times a week!) Nagios stops checking the services. Then I go to SCHEDULING QUEUE and noticed that are tests scheduled to check but it didn't run. Tests that usually runs fine, but sometimes, different tests doesn't run. Then I have to RESTART nagios to fix it. Does anyone could help me? Thanks Fernando Shayani fernando.shayani at bsb.politec.com.br (61) 3038-6951 POLITEC -------------- next part -------------- An HTML attachment was scrubbed... URL: From radhiilu at yahoo.com Mon Dec 5 12:46:58 2005 From: radhiilu at yahoo.com (Radhika) Date: Mon, 5 Dec 2005 03:46:58 -0800 (PST) Subject: adding new plugin to existing nagios Message-ID: <20051205114658.24621.qmail@web60723.mail.yahoo.com> Hi, I want to add new plug in for monitoring cisco memory from this link http://www.nagiosexchange.org/SNMP.51.0.html?&tx_netnagext_pi1[p_view]=52&tx_netnagext_pi1[page]=20%3A10 i have downloaded the plugin and i have copied in /usr/local/nagios/libexec/ folder and now i need to define the command and services defined from the following link http://www.manubulon.com/nagios/index_commands.html#mem is it correct or i need to do anyother thing. Please give me the full steps to add a new plugin for my nagios Thanks for your time --------------------------------- Yahoo! Personals Single? There's someone we'd like you to meet. Lots of someones, actually. Try Yahoo! Personals -------------- next part -------------- An HTML attachment was scrubbed... URL: From mfatihakbulut at gmail.com Mon Dec 5 14:44:25 2005 From: mfatihakbulut at gmail.com (Mehmet Fatih AKBULUT) Date: Mon, 5 Dec 2005 15:44:25 +0200 Subject: 2 error to be solved. Message-ID: hi all. i just need 2 explanations :p these are my questions : 1. Error : Could not bind to DHCP socket (68!) check your priviliges! 2. Error : check_ldaps please specify the ldap base all permissions set days ago. where am i doing wrong ? is my services or command wrong ? why am i getting the errors above ? any ideas ? Regards. Bye -------------- next part -------------- An HTML attachment was scrubbed... URL: From marc at ena.com Mon Dec 5 15:14:44 2005 From: marc at ena.com (Marc Powell) Date: Mon, 5 Dec 2005 08:14:44 -0600 Subject: Probs with nagios-service/status Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of qeldroma > Sent: Monday, December 05, 2005 4:23 AM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] Probs with nagios-service/status > > "It appears as though Nagios is not running, so commands are temporarily > unavailable..." > ..is the message. > > I know that this has been discussed severall times...Unfortunately i read > many > of them and didn't solve my problem. > > Ok, the facts: I installed nagios 1.2 with MySQL-db. As i can see on the > perfdata, logging of them works. Nagios-plugins are 1.4.2. > > If i try "check_nagios_db.pl" as root or nagios, it works! > What do you need else to help me? What is your nagios_check_command in cgi.cfg? When you ran your test, did you use the exact command line as specified there? Also, when you see the error in the CGI, there is a link to follow for more information. What does that page say is the reason it thinks nagios isn't running? -- Marc ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From richard.gliebe at fhv.at Mon Dec 5 15:26:52 2005 From: richard.gliebe at fhv.at (Richard Gliebe) Date: Mon, 05 Dec 2005 15:26:52 +0100 Subject: 2 error to be solved. In-Reply-To: References: Message-ID: <1133792812.888.6.camel@glr-nb.dh.uclv.net> On Mon, 2005-12-05 at 15:44 +0200, Mehmet Fatih AKBULUT wrote: > hi all. > i just need 2 explanations :p > these are my questions : > > 1. Error : Could not bind to DHCP socket (68!) check your priviliges! check your priviliges ;-) -rws--x--- 1 root nagios 23972 Aug 12 09:27 ./check_dhcp > 2. Error : check_ldaps please specify the ldap base ./check_ldaps -H $HOSTADDRESS$ -b ldap_base -p Port_number -w warning\ -c critical > > all permissions set days ago. > where am i doing wrong ? hope this helps Regards Richard ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From soumya.nambiar at satconet.com Mon Dec 5 15:48:17 2005 From: soumya.nambiar at satconet.com (Soumya Nambiar) Date: Mon, 5 Dec 2005 17:48:17 +0300 Subject: SMS alerts with clickatell Message-ID: <003301c5f9aa$ee6d6370$0702a8c0@stcl.com> Hi, Has anyone used clickatell as such. When i try to send sms from Nagios it is showing me an authentication failed error. Any inputs??? Soumya -------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 1384 bytes Desc: not available URL: From qeldroma at freenet.de Mon Dec 5 15:44:59 2005 From: qeldroma at freenet.de (qeldroma) Date: Mon, 5 Dec 2005 15:44:59 +0100 Subject: Probs with nagios-service/status In-Reply-To: References: Message-ID: <200512051544.59675.qeldroma@freenet.de> > What is your nagios_check_command in cgi.cfg? When you ran your test, > did you use the exact command line as specified there? nagios_check_command=/usr/bin/perl /usr/nagios/libexec/check_nagios_db.pl and YES, it works from console as root and "su nagios".. > Also, when you see the error in the CGI, there is a link to follow for more > information. What does that page say is the reason it thinks nagios > isn't running? ---------------------------------------------------------------------------------------------- Process Status: UNKNOWN Check Command Output: Nagios check command did not return any output ---------------------------------------------------------------------------------------------- But that's the point: It worked as nagios (the user of the process)..?! Kind regards, Qeldroma ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From tim.wolak at gmail.com Mon Dec 5 16:02:01 2005 From: tim.wolak at gmail.com (Tim Wolak) Date: Mon, 05 Dec 2005 09:02:01 -0600 Subject: NRPE Error sending query to host Message-ID: <43945669.6070004@gmail.com> Hi all, This is the first time I have run into this as most of mt hosts are linux/Unix. We have some 2K3 windows severs and I have NRPE up and running and Nagios is doing ping checks just fine, however when I run check_nrpe from the Nagios server it gets CHECK_NRPE: Error sending query to host.... has anyone seen this before and know a fix? Thanks, Tim ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From cguimont at rocq.qc.ca Mon Dec 5 16:33:48 2005 From: cguimont at rocq.qc.ca (Charles Guimont) Date: Mon, 5 Dec 2005 10:33:48 -0500 Subject: Internal server error Message-ID: <212A156D-86FF-4303-9B55-866CB3959BD2@rocq.qc.ca> Hello, I have installed nagios. Since I installeed it, when I try to access the cgi using the web, I get an error of: Internal Server Error The server encountered an internal error or misconfiguration and was unable to complete your request. Please contact the server administrator, webmaster at rocq.qc.ca and inform them of the time the error occurred, and anything you might have done that may have caused the error. More information about this error may be available in the server error log. Additionally, a 404 Not Found error was encountered while trying to use an ErrorDocument to handle the request. Apache/1.3.34 Server at www.rocq.qc.ca Port 80 I tryed to search the web to find some solutions without success. Is there something I could to to make it work? I tryed to recompile many times Thanks, Charles -------------- next part -------------- An HTML attachment was scrubbed... URL: From cguimont at rocq.qc.ca Mon Dec 5 16:47:30 2005 From: cguimont at rocq.qc.ca (Charles Guimont) Date: Mon, 5 Dec 2005 10:47:30 -0500 Subject: Internal server error In-Reply-To: <43945EB1.6030706@norwich.edu> References: <212A156D-86FF-4303-9B55-866CB3959BD2@rocq.qc.ca> <43945EB1.6030706@norwich.edu> Message-ID: <19A0B84D-1AED-44DF-B94B-5133D6CC752D@rocq.qc.ca> Hello, I forgot to say what was the error log in apache: [Mon Dec 5 10:46:05 2005] [error] [client ***] Premature end of script headers: /usr/local/nagios/sbin/status.cgi Charles On 05-12-05, at 10:37, Jamie Pratt wrote: > Hi. Did you modify your httpd.conf file like it says to below, and > then restart apache? I would check your apache access and error > logs as well for more clues... > > http://nagios.sourceforge.net/docs/2_0/installweb.html > > regards, > jamie > > -- > > > Jamie Pratt > Unix Systems Administrator > Norwich University - Information Technology > http://www.norwich.edu > 1-802-485-2532 > > > > Charles Guimont wrote: >> Hello, >> I have installed nagios. Since I installeed it, when I try to >> access the cgi using the web, I get an error of: >> *Internal Server Error* >> The server encountered an internal error or misconfiguration and >> was unable to complete your request. >> Please contact the server administrator, webmaster at rocq.qc.ca >> and inform them of the time the >> error occurred, and anything you might have done that may have >> caused the error. >> More information about this error may be available in the server >> error log. >> Additionally, a 404 Not Found error was encountered while trying >> to use an ErrorDocument to handle the request. >> /Apache/1.3.34 Server at www.rocq.qc.ca >> Port 80/ >> I tryed to search the web to find some solutions without success. >> Is there something I could to to make it work? >> I tryed to recompile many times >> Thanks, >> Charles > > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Mon Dec 5 16:47:22 2005 From: marc at ena.com (Marc Powell) Date: Mon, 5 Dec 2005 09:47:22 -0600 Subject: Internal server error Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Charles Guimont > Sent: Monday, December 05, 2005 9:34 AM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] Internal server error > > Hello, > I have installed nagios. Since I installeed it, when I try to access the > cgi using the web, I get an error of: > > > Internal Server Error > > The server encountered an internal error or misconfiguration and was > unable to complete your request. > Please contact the server administrator, webmaster at rocq.qc.ca and inform > them of the time the error occurred, and anything you might have done that > may have caused the error. > More information about this error may be available in the server error > log. > > Additionally, a 404 Not Found error was encountered while trying to use an > ErrorDocument to handle the request. > > > > > Apache/1.3.34 Server at www.rocq.qc.ca Port 80 > > > I tryed to search the web to find some solutions without success. > Is there something I could to to make it work? Check your apache error_log. It should detail the exact reason for the Internal Server Error. There are many reasons why you might see that kind of error, some related to nagios some not so we'd just be guessing. -- Marc ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Mon Dec 5 16:53:39 2005 From: marc at ena.com (Marc Powell) Date: Mon, 5 Dec 2005 09:53:39 -0600 Subject: Probs with nagios-service/status Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of qeldroma > Sent: Monday, December 05, 2005 8:45 AM > To: nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] Probs with nagios-service/status > > > What is your nagios_check_command in cgi.cfg? When you ran your test, > > did you use the exact command line as specified there? > nagios_check_command=/usr/bin/perl /usr/nagios/libexec/check_nagios_db.pl > and YES, it works from console as root and "su nagios".. > > > Also, when you see the error in the CGI, there is a link to follow for > more > > information. What does that page say is the reason it thinks nagios > > isn't running? > ------------------------------------------------------------------------ -- > -------------------- > Process Status: > UNKNOWN > Check Command Output: Nagios check command did not return any output > ------------------------------------------------------------------------ -- > -------------------- > > But that's the point: It worked as nagios (the user of the process)..?! My mistake for not remembering this initially... nagios isn't the user running the check_nagios_db.pl process but rather the user that your web server is running as. I would verify that your web server process has permissions to run that script in that directory (normal apache(?) directives and/or SELinux if it's enabled) and is actually able to run it. -- Marc ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Mon Dec 5 17:02:09 2005 From: marc at ena.com (Marc Powell) Date: Mon, 5 Dec 2005 10:02:09 -0600 Subject: Internal server error Message-ID: Try this (preferably as the apache user) -- $ bash $ REQEUST_METHOD=GET $ export REQUEST_METHOD $ /usr/local/nagios/sbin/status.cgi It looks like there may be some library missing or not where it's supposed to be. Was nagios compiled on this machine? You can use 'ldd /usr/local/nagios/sbin/status.cgi' and verify that the libraries exist where they're expected. Additionally, if you have SELinux enabled, make sure that's not causing any problems. -- Marc > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Charles Guimont > Sent: Monday, December 05, 2005 9:48 AM > To: jpratt at norwich.edu; nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] Internal server error > > Hello, I forgot to say what was the error log in apache: > > > [Mon Dec 5 10:46:05 2005] [error] [client ***] Premature end of > script headers: /usr/local/nagios/sbin/status.cgi > > > Charles > > On 05-12-05, at 10:37, Jamie Pratt wrote: > > > Hi. Did you modify your httpd.conf file like it says to below, and > > then restart apache? I would check your apache access and error > > logs as well for more clues... > > > > http://nagios.sourceforge.net/docs/2_0/installweb.html > > > > regards, > > jamie > > > > -- > > > > > > Jamie Pratt > > Unix Systems Administrator > > Norwich University - Information Technology > > http://www.norwich.edu > > 1-802-485-2532 > > > > > > > > Charles Guimont wrote: > >> Hello, > >> I have installed nagios. Since I installeed it, when I try to > >> access the cgi using the web, I get an error of: > >> *Internal Server Error* > >> The server encountered an internal error or misconfiguration and > >> was unable to complete your request. > >> Please contact the server administrator, webmaster at rocq.qc.ca > >> and inform them of the time the > >> error occurred, and anything you might have done that may have > >> caused the error. > >> More information about this error may be available in the server > >> error log. > >> Additionally, a 404 Not Found error was encountered while trying > >> to use an ErrorDocument to handle the request. > >> /Apache/1.3.34 Server at www.rocq.qc.ca > >> Port 80/ > >> I tryed to search the web to find some solutions without success. > >> Is there something I could to to make it work? > >> I tryed to recompile many times > >> Thanks, > >> Charles > > > > > > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log > files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From cguimont at rocq.qc.ca Mon Dec 5 17:18:30 2005 From: cguimont at rocq.qc.ca (Charles Guimont) Date: Mon, 5 Dec 2005 11:18:30 -0500 Subject: Internal server error In-Reply-To: References: Message-ID: This is the error message I get:

Error: Could not read object configuration data!

Here are some things you should check in order to resolve this error:

  1. Verify configuration options using the -v command-line option to check for errors.
  2. Check the Nagios log file for messages relating to startup or status data errors.
  3. Make sure you've compiled the main program and the CGIs to use the same object data storage options (i.e. default text file or template-based file).
First, running the command nagios -v nagios.cfg return no errors Startup makes no errors: [1133799437] Nagios 2.0b6 starting... (PID=907) [1133799437] LOG VERSION: 2.0 [1133799437] Finished daemonizing... (New PID=908) How can I make sure of this?? Make sure you've compiled the main program and the CGIs to use the same object data storage options (i.e. default text file or template- based file). Thanks Charles On 05-12-05, at 11:02, Marc Powell wrote: > Try this (preferably as the apache user) -- > > $ bash > $ REQEUST_METHOD=GET > $ export REQUEST_METHOD > $ /usr/local/nagios/sbin/status.cgi > > It looks like there may be some library missing or not where it's > supposed to be. Was nagios compiled on this machine? You can use 'ldd > /usr/local/nagios/sbin/status.cgi' and verify that the libraries exist > where they're expected. Additionally, if you have SELinux enabled, > make > sure that's not causing any problems. > > -- > Marc > > >> -----Original Message----- >> From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- >> admin at lists.sourceforge.net] On Behalf Of Charles Guimont >> Sent: Monday, December 05, 2005 9:48 AM >> To: jpratt at norwich.edu; nagios-users at lists.sourceforge.net >> Subject: Re: [Nagios-users] Internal server error >> >> Hello, I forgot to say what was the error log in apache: >> >> >> [Mon Dec 5 10:46:05 2005] [error] [client ***] Premature end of >> script headers: /usr/local/nagios/sbin/status.cgi >> >> >> Charles >> >> On 05-12-05, at 10:37, Jamie Pratt wrote: >> >>> Hi. Did you modify your httpd.conf file like it says to below, and >>> then restart apache? I would check your apache access and error >>> logs as well for more clues... >>> >>> http://nagios.sourceforge.net/docs/2_0/installweb.html >>> >>> regards, >>> jamie >>> >>> -- >>> >>> >>> Jamie Pratt >>> Unix Systems Administrator >>> Norwich University - Information Technology >>> http://www.norwich.edu >>> 1-802-485-2532 >>> >>> >>> >>> Charles Guimont wrote: >>>> Hello, >>>> I have installed nagios. Since I installeed it, when I try to >>>> access the cgi using the web, I get an error of: >>>> *Internal Server Error* >>>> The server encountered an internal error or misconfiguration and >>>> was unable to complete your request. >>>> Please contact the server administrator, webmaster at rocq.qc.ca >>>> and inform them of the time the >>>> error occurred, and anything you might have done that may have >>>> caused the error. >>>> More information about this error may be available in the server >>>> error log. >>>> Additionally, a 404 Not Found error was encountered while trying >>>> to use an ErrorDocument to handle the request. >>>> /Apache/1.3.34 Server at www.rocq.qc.ca >>>> Port 80/ >>>> I tryed to search the web to find some solutions without success. >>>> Is there something I could to to make it work? >>>> I tryed to recompile many times >>>> Thanks, >>>> Charles >>> >>> >> >> >> >> ------------------------------------------------------- >> This SF.net email is sponsored by: Splunk Inc. Do you grep through >> log >> files >> for problems? Stop! Download the new AJAX search engine that makes >> searching your log files as easy as surfing the web. DOWNLOAD > SPLUNK! >> http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when >> reporting any issue. >> ::: Messages without supporting info will risk being sent to /dev/ >> null > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through > log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD > SPLUNK! > http://ads.osdn.com/?ad_idv37&alloc_id865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Mark.Pearson at uftwf.org Mon Dec 5 17:55:26 2005 From: Mark.Pearson at uftwf.org (Mark Pearson) Date: Mon, 5 Dec 2005 11:55:26 -0500 Subject: Using NRPE to call remote event_handlers Message-ID: <932A04DB269BFF4F886C2ACD54C62CAC02C6B033@uftwfexchsvr.uftmaster.org> Thanks Jason. This is precisely what I am doing but for some reason when I execute the check_nrpe script on the main Nagios system it does not execute the corresponding script on the remote box. Understanding that the check_nrpe script accepts one line of output from the script that is executed on the remote system, that one line I receive is part of the normal output of the script running on the remote. Thus telling me that the script is running yet the service is not restarting. Just curious though, when it comes to using nagios' event_handlers to say, restart httpd, I noticed in the sample cfgs that the event_handlers seem all to be running locally. Should I have the full blown nagios running on remote machine? Currently, I only have the nrpe, plugins and my event_handlers I created. Sorry for all the noobie questions, but this is just not making logical sense. Thanks everyone for your help. -Mark -----Original Message----- From: Jason Martin [mailto:jhmartin at toger.us] Sent: Sunday, December 04, 2005 9:34 PM To: nagios-users at lists.sourceforge.net Subject: Re: [Nagios-users] Using NRPE to call remote event_handlers On Sat, Dec 03, 2005 at 09:20:44PM +0100, Hugo van der Kooij wrote: > > incorrectly. I have access to all my systems, however I would like nagios > > to automatically attempt to restart a service in the event it stops. > > What do you do if a service stops? Write a script for NRPE that performs the restart and invoke it as a event handler. You may have to use passwordless SUDO to give it the correct permissions. I suggest thinking about using tcp wrappers or iptables, as anyone with a NRPE client could invoke the restart script. -Jason MArtin > Write a monitor script and put it in a crontab entry. Run it say every 15 > minutes > > Hugo. > -- Never draw fire, it irritates everyone around you This message is PGP/MIME signed. ******************************************************************************* The views, opinions, and judgments expressed in this message are solely those of the author. The message contents have not been reviewed or approved by the UFT Welfare Fund. ******************************************************************************* ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ron.gage at altiris.com Mon Dec 5 18:08:14 2005 From: ron.gage at altiris.com (Ron Gage) Date: Mon, 5 Dec 2005 12:08:14 -0500 Subject: Service notifications not firing - nothing in logs. Message-ID: Greetings: I am having a bit of a problem with service notifications. It seems that they are simply refusing to trigger at all. Host notifications are working just fine, but nothing for the service notifications - not even in the notification logs. This is with 2.0b6 on apache 1.3.33 (Slackware 10.1). The system is detecting service outages, but does not fire off (or even try to fire off) any notifications. HELP! Should I post relevent excerpts from my config files? RON GAGE Network Administrator Wise Solutions, Inc. T > +1 734 456 2202 M > +1 248 343 2431 www.altiris.com Security. Compliance. Patch management. IT service management. Altiris solves your most pressing IT issues. www.altiris.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From claudio at sefaz.mt.gov.br Mon Dec 5 20:43:00 2005 From: claudio at sefaz.mt.gov.br (Claudio Elauterio) Date: Mon, 05 Dec 2005 15:43:00 -0400 Subject: new look for nagios Message-ID: <43949844.60103@sefaz.mt.gov.br> New look for nagios... -- _________________________ Cl?udio Elaut?rio Analista de T.I ICQ: 77872215 MSN: celauter at yahoo.com.br Fone: 550656172339 Celular: 06599570277 O homem ? feito para a luta, n?o para o repouso. (Ralph Emerson) -------------- next part -------------- A non-text attachment was scrubbed... Name: nagiosnewlookprod.png Type: image/png Size: 119407 bytes Desc: not available URL: From jperrin at gmail.com Mon Dec 5 18:53:02 2005 From: jperrin at gmail.com (Jim Perrin) Date: Mon, 5 Dec 2005 12:53:02 -0500 Subject: new look for nagios In-Reply-To: <43949844.60103@sefaz.mt.gov.br> References: <43949844.60103@sefaz.mt.gov.br> Message-ID: <302ce8b50512050953v417992acubd54841062591aea@mail.gmail.com> On 12/5/05, Claudio Elauterio wrote: > New look for nagios... Very nice! Is this on nagios-exchange, and there an english version? -- Jim Perrin System Architect - UIT Ft Gordon & US Army Signal Center ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From msugano at uolinc.com Mon Dec 5 19:33:27 2005 From: msugano at uolinc.com (Marcel Mitsuto Fucatu Sugano) Date: Mon, 05 Dec 2005 16:33:27 -0200 Subject: Nagios HANGS scheduling info In-Reply-To: <1964B929D8FF4D4DB06854A75D0FD3730CABC44A@bsbserv028.bsb.politec.com.br> References: <1964B929D8FF4D4DB06854A75D0FD3730CABC44A@bsbserv028.bsb.politec.com.br> Message-ID: <1133807608.17732.12.camel@localhost> On Mon, 2005-12-05 at 09:01 -0300, Fernando Shayani wrote: > Well, I upgraded to b6, enabled the Orphaned Service and still > hangs... > > Is there any other clue? > > Fernando Shayani > fernando.shayani at bsb.politec.com.br > (61) 3038-6951 > POLITEC - Bras?lia - DF Have you followed the instructions available at: http://nagios.sourceforge.net/docs/2_0/configmain.html ? Read that and check the following configs: Inter-check sleep time Service inter-check delay method Maximum service check spread Service interleave factor Maximum concurrent service checks Service reaper frequency Host inter-check delay method Maximum host check spread Timing interval length Auto-rescheduling option Auto-rescheduling interval Auto-rescheduling window Agressive host checking option Which are related to the scheduling options, and might be the source of your problem. Have you messed up those variables? Whatever the case, post the values of this configuration variables. HTH, -- Marcel Mitsuto Fucatu Sugano Universo Online S.A. -- http://www.uol.com.br ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From enediel at hotmail.com Mon Dec 5 19:58:38 2005 From: enediel at hotmail.com (enediel gonzalez) Date: Mon, 05 Dec 2005 18:58:38 +0000 Subject: recurrent downtime In-Reply-To: <005301c5f915$eae7dae0$01fea8c0@itss.auckland.ac.nz> References: <005301c5f915$eae7dae0$01fea8c0@itss.auckland.ac.nz> Message-ID: Thanks Steve for your answer When I sent my first message I was trying to use the project sched_downtime.tar.gz avalilable on nagiosexchange's site today I was trying to use your project, everything is find until I run downtime_job.pl my($CMDFILE) = "/usr/local/nagios/var/rw/nagios.cmd"; # Nagios CMD file my($CFGFILE) = "/usr/local/nagios/etc/schedule.cfg"; # configuration file my($FREQUENCY) = 1440; # how many minutes between runs. 1440=24hrs my($DEBUG) = 0; #executing the command date Mon Dec 5 13:50:39 EST 2005 using your web page I configured properly schedule.cfg, define schedule { schedule_type host host_name enediel user admin comment test time 15:00 duration 30 days_of_week Mon } running downtime_job.pl I don't receive any error message and visiting after the nagios page to check the Scheduled Host Downtime declarations I don't see anything there. I checked this flag on the nagios configuration's file check_external_commands=1 today in the morning I saw that launching your script I obtained the necessary nagios configuration, using the nagios interface I deleted the scheduled downtime, but for some reason, later on my tests it didn't happen anymore. ?Could it be for the nagios version? I'm using Nagios 2.0b3 Thanks in advance for any answer Best regards Enediel Linux user 398956 Debian GNU/Linux >From: "Steve Shipway" >To: "'enediel gonzalez'" >, >Subject: RE: [Nagios-users] recurrent downtime >Date: Mon, 5 Dec 2005 10:01:38 +1300 > > >I've tried to configure nagios with recurrent downtime, in the > >configuration > >I need to declare that everyday from 6pm to 8pm, one server will be > >unavailable. > > > >I got one srcript sched_downtime.tar.gz without get a real result yet. > >It's a bit unclear as to what you've already tried, and what problems you >are having making it work. > >There are several downtime schedulers available on nagiosexchange >(including >one I wrote :) which usually consist of a single script to run via cron and >another CGI for configuring. It shouldn't be hard to get them working, but >noone here will be able to help unless you state exactly what problems >you're having. > >Steve > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Kyle_Narum at eLoyalty.com Mon Dec 5 20:09:05 2005 From: Kyle_Narum at eLoyalty.com (Narum, Kyle) Date: Mon, 5 Dec 2005 13:09:05 -0600 Subject: Unexpected Service Escalation Behavior Message-ID: <037594FB5AFC354FA38653EB3FF3242106E73A24@elhqex01.eloyaltyco.com> All, I'm running into an interesting behavior when using Service Escalation rules to control alerts for errors of type UNKNOWN. Basically, for certain Service Checks, we don't want our Help Desk to get notified if the check returns UNKNOWN, however we do want them to be notified of any CRITICAL or WARNING. So I've setup an escalation rule for that service which specifies only the "w,c,r" options. This works fine for sending out the initial UNKNOWN alert. (ie it only goes to a secondary email address which acts as a general log for all alerts.) However, when the Service Check recovers, both of our contact groups get notified. (Both the contact group that received the UNKNOWN error, as well as the contact group that did not receive the error...thereby receiving an "OK/RECOVERY" message, when they never received a "UNKNOWN" alert message.) Here's detail on the configuration I have: Service A defined as: -------------------- define service { service_description Service A check_period 24x7 max_check_attempts 3 normal_check_interval 5 retry_check_interval 1 active_checks_enabled 1 passive_checks_enabled 1 parallelize_check 1 obsess_over_service 1 check_freshness 1 notifications_enabled 1 notification_options w,c,r,u,f notification_interval 5 notification_period 24x7 event_handler_enabled 1 flap_detection_enabled 1 process_perf_data 1 retain_status_information 1 retain_nonstatus_information 1 contact_groups Contact Group A, Contact Group B } Service Escalation for Service A defined as: ------------------------------------------- define serviceescalation{ service_description Service A host Host A contact_groups Contact Group B first_notification 1 last_notification 0 notification_interval 1440 escalation_period 24x7 escalation_options w,c,r } Expected Behavior when Service A goes into an UNKNOWN state and then goes back to an OK state * Contact Group A gets notified notified of the UNNOWN alert * Contact Group A gets notified of the OK/RECOVERY Actual Behavior when Service A goes into an UNKNOWN state and then goes back to an OK state * Contact Group A gets notified of the UNKNOWN alert * Contact Group A AND Contact Group B both get notified of the OK/RECOVERY I've read thru all the documentation I can find regarding the use of escalation rules, and I'm fairly sure this isn't the intended behavior. Has anyone encountered a similar issue, or am I missing something in my configuration? System: Nagios 2.04b / OS Fedora Core 4 Thanks, Kyle ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From b00mer at gmx.net Mon Dec 5 20:31:56 2005 From: b00mer at gmx.net (Hendrik Baecker) Date: Mon, 05 Dec 2005 20:31:56 +0100 Subject: adding new plugin to existing nagios In-Reply-To: <20051205114658.24621.qmail@web60723.mail.yahoo.com> References: <20051205114658.24621.qmail@web60723.mail.yahoo.com> Message-ID: <439495AC.7020509@gmx.net> Radhika schrieb: > Hi, > > I want to add new plug in for monitoring cisco memory from this link > > http://www.nagiosexchange.org/SNMP.51.0.html?&tx_netnagext_pi1[p_view]=52&tx_netnagext_pi1[page]=20%3A10 > > > i have downloaded the plugin and i have copied in > /usr/local/nagios/libexec/ folder and now i need to define the command > and services defined from the following link > > http://www.manubulon.com/nagios/index_commands.html#mem > > is it correct or i need to do anyother thing. > > Please give me the full steps to add a new plugin for my nagios > > Thanks for your time > > ------------------------------------------------------------------------ > *Yahoo! Personals* > Single? There's someone we'd like you to meet. > Lots of someones, actually. Try Yahoo! Personals > Hello Radhika, here is your wished full step for your personal Enviroment: open your commands.cfg File with an editor of your choice, you can take vi, vim, joe, pico or something similar after you done this, go to the end of the file and make sure, that you're in the edit mode to add a few lines to your file. Now please do not copy and paste from your second link but ensure that you type every single letter as it from the second link like this: define command { command_name check_snmp_mem command_line $USER1$/check_snmp_mem.pl -H $HOSTADDRESS$ -C $ARG1$ -w $ARG2$ -c $ARG3$ $ARG4$ # comment Community, warn, crit, options (-I cisco) } After you have done this go to the fourth line of your new text and change exactly the string "" (without the "") with the snmp readonly community string which you are using on your cisco device. (Look in your config for lines like "snmp-server community SOMETHING" <<<< this should be your Community String. Now, save the file back to disk, run your nagios with the sanity check like: "/usr/local/nagios/bin/nagios -v /usr/local/nagios/etc/nagios.cfg". In the output of this command be sure that it tolds you something like "0 errors", warnings are less critical, but there should NOT be any error!!! If you got an error, read the Nagios Documentation and try to catch your fault. If there are NO ERROR just restart your nagios and see what's happend... Oh! Nothing happens? Realy? Just go with an editor to your service configuration File and add a service, like you do as usual, read the nagios documentation if you are asking yourself what a service is.... Some hints: define service { name Cisco_memory service_description Cisco memory check_command check_snmp_mem!!95!99!-I register 0 } So, end with kidding. But I am asking myself why you are in the thought, that other peoples are knowing your enviroment such your snmp community and so on. This link (http://www.manubulon.com/nagios/snmp_mem.html) there is an example output from the plugin with -h (<<< stands for HELP) with the instructions what data the plugin needs to run. For everything else like "How to define services and check commands..", let me tell you. If your eyes are not lazy enough to read the mailing lists posts and if you have enough time to wait until someone else find enough time to reply to your post... just read the nagios documentation! Like: http://nagios.sourceforge.net/docs/2_0/xodtemplate.html http://nagios.sourceforge.net/docs/2_0/ Or for other nagios versions like 2.x: http://www.nagios.org/docs/ -------------- next part -------------- An HTML attachment was scrubbed... URL: From marc at ena.com Mon Dec 5 20:57:17 2005 From: marc at ena.com (Marc Powell) Date: Mon, 5 Dec 2005 13:57:17 -0600 Subject: new look for nagios Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Jim Perrin > Sent: Monday, December 05, 2005 11:53 AM > To: Claudio Elauterio > Cc: nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] new look for nagios > > On 12/5/05, Claudio Elauterio wrote: > > New look for nagios... > > Very nice! > > Is this on nagios-exchange, and there an english version? I don't see that he did anything really special. I only see three minor changes that were made: changed the Nagios image at the top of side.html, maybe edited the stylesheet for the same page to change the background color and changed index.html to point to a custom page in the 'side' frame. Those changes are easily accomplished with basic knowledge of HTML. Is there something more that I'm missing? I'm not deriding his work but encouraging you that the changes are minor and easily accomplished. -- Marc ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jperrin at gmail.com Mon Dec 5 21:01:56 2005 From: jperrin at gmail.com (Jim Perrin) Date: Mon, 5 Dec 2005 15:01:56 -0500 Subject: new look for nagios In-Reply-To: References: Message-ID: <302ce8b50512051201p68c06f29p7c6adc9af31ede03@mail.gmail.com> On 12/5/05, Marc Powell wrote: > > > > -----Original Message----- > > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > > admin at lists.sourceforge.net] On Behalf Of Jim Perrin > > Sent: Monday, December 05, 2005 11:53 AM > > To: Claudio Elauterio > > Cc: nagios-users at lists.sourceforge.net > > Subject: Re: [Nagios-users] new look for nagios > > > > On 12/5/05, Claudio Elauterio wrote: > > > New look for nagios... > > > > Very nice! > > > > Is this on nagios-exchange, and there an english version? > > I don't see that he did anything really special. I only see three minor > changes that were made: changed the Nagios image at the top of > side.html, maybe edited the stylesheet for the same page to change the > background color and changed index.html to point to a custom page in the > 'side' frame. Those changes are easily accomplished with basic knowledge > of HTML. Is there something more that I'm missing? I'm not deriding his > work but encouraging you that the changes are minor and easily > accomplished. > > -- > Marc Eh, I've been in meetings all morning. At this point something shiny and new is enough to get me excited, at least initially. -- Jim Perrin System Architect - UIT Ft Gordon & US Army Signal Center ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Mon Dec 5 21:03:26 2005 From: marc at ena.com (Marc Powell) Date: Mon, 5 Dec 2005 14:03:26 -0600 Subject: Internal server error Message-ID: > -----Original Message----- > From: Charles Guimont [mailto:cguimont at rocq.qc.ca] > Sent: Monday, December 05, 2005 10:19 AM > To: Marc Powell; nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] Internal server error > > This is the error message I get: > >

Error: Could not read object > configuration data!

>

> Here are some things you should check in order to resolve this error: >

>

>

    >
  1. Verify configuration options using the -v command-line > option to check for errors. >
  2. Check the Nagios log file for messages relating to startup or > status data errors. >
  3. Make sure you've compiled the main program and the CGIs to use > the same object data storage options (i.e. default text file or > template-based file). >
What user did you perform this test as? If apache, it would appear that permission on the directories or files are such that it can't read the config files. > > First, running the command nagios -v nagios.cfg return no errors > > Startup makes no errors: > > [1133799437] Nagios 2.0b6 starting... (PID=907) > [1133799437] LOG VERSION: 2.0 > [1133799437] Finished daemonizing... (New PID=908) > > > How can I make sure of this?? > Make sure you've compiled the main program and the CGIs to use the > same object data storage options (i.e. default text file or template- > based file). This error doesn't make much sense any more with 2.x since all object storage options are now template-based. There may be some circumstance where that might be pertinent but I'll bet you would know what you were doing to make it happen. -- Marc ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From teng at dataway.com Mon Dec 5 21:03:53 2005 From: teng at dataway.com (Tedman Eng) Date: Mon, 5 Dec 2005 12:03:53 -0800 Subject: Using NRPE to call remote event_handlers Message-ID: <37ED92F9890FAF4BB947613C66FF8B1A08BB2C5D@dw-mail.dataway.com> What user are you testing the event handler as? Most scripts in /etc/init.d require root persmission to restart (or some component of the script may). On the remote box, su to the user that the nrpe daemon executes as, then test the event handler and let us know what happens. -----Original Message----- From: Mark Pearson [mailto:Mark.Pearson at uftwf.org] Sent: Monday, December 05, 2005 8:55 AM To: 'Jason Martin'; 'nagios-users at lists.sourceforge.net' Subject: RE: [Nagios-users] Using NRPE to call remote event_handlers Thanks Jason. This is precisely what I am doing but for some reason when I execute the check_nrpe script on the main Nagios system it does not execute the corresponding script on the remote box. Understanding that the check_nrpe script accepts one line of output from the script that is executed on the remote system, that one line I receive is part of the normal output of the script running on the remote. Thus telling me that the script is running yet the service is not restarting. Just curious though, when it comes to using nagios' event_handlers to say, restart httpd, I noticed in the sample cfgs that the event_handlers seem all to be running locally. Should I have the full blown nagios running on remote machine? Currently, I only have the nrpe, plugins and my event_handlers I created. Sorry for all the noobie questions, but this is just not making logical sense. Thanks everyone for your help. -Mark -----Original Message----- From: Jason Martin [mailto:jhmartin at toger.us] Sent: Sunday, December 04, 2005 9:34 PM To: nagios-users at lists.sourceforge.net Subject: Re: [Nagios-users] Using NRPE to call remote event_handlers On Sat, Dec 03, 2005 at 09:20:44PM +0100, Hugo van der Kooij wrote: > > incorrectly. I have access to all my systems, however I would like nagios > > to automatically attempt to restart a service in the event it stops. > > What do you do if a service stops? Write a script for NRPE that performs the restart and invoke it as a event handler. You may have to use passwordless SUDO to give it the correct permissions. I suggest thinking about using tcp wrappers or iptables, as anyone with a NRPE client could invoke the restart script. -Jason MArtin > Write a monitor script and put it in a crontab entry. Run it say every 15 > minutes > > Hugo. > -- Never draw fire, it irritates everyone around you This message is PGP/MIME signed. **************************************************************************** *** The views, opinions, and judgments expressed in this message are solely those of the author. The message contents have not been reviewed or approved by the UFT Welfare Fund. **************************************************************************** *** ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From cmoody at qualcomm.com Mon Dec 5 14:23:19 2005 From: cmoody at qualcomm.com (Chris Moody) Date: Mon, 05 Dec 2005 05:23:19 -0800 Subject: Limiting Views of Hosts - over 3k hosts In-Reply-To: <001701c5f6d6$edeb9240$0d7810ac@wgbh.org> References: <001701c5f6d6$edeb9240$0d7810ac@wgbh.org> Message-ID: <43943F47.8070907@qualcomm.com> On a similar note(but starting a new thread), does anyone have experience (with a modified cgi or whatnot) of removing the non-visible hosts from view and re-drawing the statumap to only display the visible hosts...but at the appropriate scale. (perhaps I should explain in more detail) I'm stuck in a situation where I need to display over 3000 hosts, but broken up into 4 groups. The trouble is that with that many systems, even if I "pad" the statusmap with parents to expand the diameter of the map, there are too many systems and they overlap. Even when I split them into groups and only have group views, the main map doesn't exclude the non-visable hosts when it draws...so the systems still overlap, but with blank space in between the clusters. I have modified the CGI to increase the default diameter of the map, but it's still a bit crowded. I'm open to whatever creative means (groups or not) to get the map to scale appropriately. I need the hosts displayed for visual dependency tracking. I can provide screenshots if necessary. Anyone have any ideas. Cheers, -Chris Todd Barbera wrote: > > Yes. Set them up as a contact and only include them on the hosts you > want them to see. Here's an example of one of my host configuration files: > > # 'acsdb' host definition > define host{ > use sunservers > host_name acsdb > alias acsdb > address 192.168.40.40 > contact_groups sys-admins,localonline > } > > 'sys-admins' is setup for everything. 'localonline' only gets a subset > of hosts that those folks are concerned with. > > Todd > > > ----- Original Message ----- > *From:* Joseph Hardeman > *To:* nagios-users at lists.sourceforge.net > > *Sent:* Thursday, December 01, 2005 7:18 PM > *Subject:* [Nagios-users] Limiting Views of Hosts > > Hi Everyone, > > I am wondering if there is a way to allow people who login to only > view a specific set of hosts. This way I can have one Nagios system > checking multiple clients but the clients can only see thier hosts > and thier host results. I am running the newest version 2.0b6. > > Thanks in advance. > > Joseph ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From teng at dataway.com Mon Dec 5 21:31:34 2005 From: teng at dataway.com (Tedman Eng) Date: Mon, 5 Dec 2005 12:31:34 -0800 Subject: SMS alerts with clickatell Message-ID: <37ED92F9890FAF4BB947613C66FF8B1A08BB2C5E@dw-mail.dataway.com> Soumya Nambiar wrote: > Hi, > Has anyone used clickatell as such. I'm sure someone has used clickatell. Maybe not as such. >When i try to send sms from Nagios it is showing me an authentication failed error. Any inputs??? > > Soumya What sms software are you using? 'Authentication failure' implies internet-based sms messaging. sms-clinet for unix does not list 'clickatell' as a supported provider. http://www.smsclient.org/providers.php Check your sms software's docs, check the clickatell Developer API, call clickatell support. Troubleshhot sms issues between a) sms software b) sms provider. When you have solved sending sms from commandline successfully, and then if you still have trouble getting it to work from Nagios try emailing the list. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Mark.Pearson at uftwf.org Mon Dec 5 21:32:37 2005 From: Mark.Pearson at uftwf.org (Mark Pearson) Date: Mon, 5 Dec 2005 15:32:37 -0500 Subject: Using NRPE to call remote event_handlers Message-ID: <932A04DB269BFF4F886C2ACD54C62CAC02C6B051@uftwfexchsvr.uftmaster.org> The problem is fixed now, even though I don't understand why completely. To answer the question below, I was using sudo for the command in the /etc/init.d and the script would work, when executed on the remote machine when I was logged in as the nagios user. The problem was executing the event handler using the check_nrpe command. To fix the problem I changed service restart part of the event handler script from: sudo /etc/rc.d/init.d/krang restart to: sudo /etc/rc.d/init.d/krang restart > /dev/null This is purely my opinion now, but I think that because the "krang restart" returns multiple lines of output that something goes wrong and the event handler script is killed by the check_nrpe script. Thus by sending the output to "Null" the "krang restart" script can finish while the check_nrpe script completes. At least that is my opinion. Now I can change my other scripts. Next, doing the same with HeartBeat and failing clusters over. Oh Joy! :-0 Thanks everyone!!! -Mark -----Original Message----- From: Tedman Eng [mailto:teng at dataway.com] Sent: Monday, December 05, 2005 3:04 PM To: 'Mark Pearson' Cc: 'nagios-users at lists.sourceforge.net' Subject: RE: [Nagios-users] Using NRPE to call remote event_handlers What user are you testing the event handler as? Most scripts in /etc/init.d require root persmission to restart (or some component of the script may). On the remote box, su to the user that the nrpe daemon executes as, then test the event handler and let us know what happens. -----Original Message----- From: Mark Pearson [mailto:Mark.Pearson at uftwf.org] Sent: Monday, December 05, 2005 8:55 AM To: 'Jason Martin'; 'nagios-users at lists.sourceforge.net' Subject: RE: [Nagios-users] Using NRPE to call remote event_handlers Thanks Jason. This is precisely what I am doing but for some reason when I execute the check_nrpe script on the main Nagios system it does not execute the corresponding script on the remote box. Understanding that the check_nrpe script accepts one line of output from the script that is executed on the remote system, that one line I receive is part of the normal output of the script running on the remote. Thus telling me that the script is running yet the service is not restarting. Just curious though, when it comes to using nagios' event_handlers to say, restart httpd, I noticed in the sample cfgs that the event_handlers seem all to be running locally. Should I have the full blown nagios running on remote machine? Currently, I only have the nrpe, plugins and my event_handlers I created. Sorry for all the noobie questions, but this is just not making logical sense. Thanks everyone for your help. -Mark -----Original Message----- From: Jason Martin [mailto:jhmartin at toger.us] Sent: Sunday, December 04, 2005 9:34 PM To: nagios-users at lists.sourceforge.net Subject: Re: [Nagios-users] Using NRPE to call remote event_handlers On Sat, Dec 03, 2005 at 09:20:44PM +0100, Hugo van der Kooij wrote: > > incorrectly. I have access to all my systems, however I would like nagios > > to automatically attempt to restart a service in the event it stops. > > What do you do if a service stops? Write a script for NRPE that performs the restart and invoke it as a event handler. You may have to use passwordless SUDO to give it the correct permissions. I suggest thinking about using tcp wrappers or iptables, as anyone with a NRPE client could invoke the restart script. -Jason MArtin > Write a monitor script and put it in a crontab entry. Run it say every 15 > minutes > > Hugo. > -- Never draw fire, it irritates everyone around you This message is PGP/MIME signed. **************************************************************************** *** The views, opinions, and judgments expressed in this message are solely those of the author. The message contents have not been reviewed or approved by the UFT Welfare Fund. **************************************************************************** *** ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ******************************************************************************* The views, opinions, and judgments expressed in this message are solely those of the author. The message contents have not been reviewed or approved by the UFT Welfare Fund. ******************************************************************************* ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Mark.Pearson at uftwf.org Mon Dec 5 21:36:57 2005 From: Mark.Pearson at uftwf.org (Mark Pearson) Date: Mon, 5 Dec 2005 15:36:57 -0500 Subject: new look for nagios Message-ID: <932A04DB269BFF4F886C2ACD54C62CAC02C6B052@uftwfexchsvr.uftmaster.org> This is one of my favorites. Found it on nagios-exchange.org -----Original Message----- From: Marc Powell [mailto:marc at ena.com] Sent: Monday, December 05, 2005 2:57 PM To: nagios-users at lists.sourceforge.net Subject: RE: [Nagios-users] new look for nagios > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Jim Perrin > Sent: Monday, December 05, 2005 11:53 AM > To: Claudio Elauterio > Cc: nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] new look for nagios > > On 12/5/05, Claudio Elauterio wrote: > > New look for nagios... > > Very nice! > > Is this on nagios-exchange, and there an english version? I don't see that he did anything really special. I only see three minor changes that were made: changed the Nagios image at the top of side.html, maybe edited the stylesheet for the same page to change the background color and changed index.html to point to a custom page in the 'side' frame. Those changes are easily accomplished with basic knowledge of HTML. Is there something more that I'm missing? I'm not deriding his work but encouraging you that the changes are minor and easily accomplished. -- Marc ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ******************************************************************************* The views, opinions, and judgments expressed in this message are solely those of the author. The message contents have not been reviewed or approved by the UFT Welfare Fund. ******************************************************************************* -------------- next part -------------- A non-text attachment was scrubbed... Name: nuvolo-Nagios-skin.JPG Type: image/jpeg Size: 141559 bytes Desc: not available URL: From s.shipway at auckland.ac.nz Mon Dec 5 21:32:01 2005 From: s.shipway at auckland.ac.nz (Steve Shipway) Date: Tue, 6 Dec 2005 09:32:01 +1300 Subject: Using NRPE to call remote event_handlers In-Reply-To: <932A04DB269BFF4F886C2ACD54C62CAC02C6B033@uftwfexchsvr.uftmaster.org> References: <932A04DB269BFF4F886C2ACD54C62CAC02C6B033@uftwfexchsvr.uftmaster.org> Message-ID: <001c01c5f9da$f2bff3e0$01fea8c0@itss.auckland.ac.nz> >Understanding that the check_nrpe script accepts one line of >output from the script that is executed on the remote system, >that one line I receive is part of the normal output of the >script running on the remote. Thus telling me that the script >is running yet the service is not restarting. This sort of thing definitely works, as we have done it here successfully (under windows). However I am now moving to using Win2k's own service autorestart as it is faster and more reliable. Maybe you have increased security levels, and the script is not running with the authority to restart a service? >Just curious though, when it comes to using nagios' >event_handlers to say, restart httpd, I noticed in the sample >cfgs that the event_handlers seem all to be running locally. >Should I have the full blown nagios running on remote machine? No, NRPE will run a plugin on its own. You may need to use something like sudo to wrap the plugin, though, in order to give the root rights required to start a service on a unix box. HTH Steve ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From CWaters at jeld-wen.com Mon Dec 5 21:36:38 2005 From: CWaters at jeld-wen.com (CWaters at jeld-wen.com) Date: Mon, 5 Dec 2005 12:36:38 -0800 Subject: [BULK SPAM] Re: new look for nagios Message-ID: <83C21E03494FF147B56468B50551782140B53A@jwmail01.domain.jw.local> If you want a good one look at nuvola style at nagiosexchange: http://www.nagiosexchange.org/Image_Packs.75.0.html?&tx_netnagext_pi1[p_view ]=252 Adds dhtml menus etc. Been using it for a while now and it seems to be the best one if you are just looking for a facelift to nagios. It's not perfect but works pretty well. Chris Waters JELD-WEN, Inc. Network Services Group =================== -----Original Message----- From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Jim Perrin Sent: Monday, December 05, 2005 12:02 PM To: Marc Powell Cc: nagios-users at lists.sourceforge.net Subject: [BULK SPAM] Re: [Nagios-users] new look for nagios Importance: Low On 12/5/05, Marc Powell wrote: > > > > -----Original Message----- > > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > > admin at lists.sourceforge.net] On Behalf Of Jim Perrin > > Sent: Monday, December 05, 2005 11:53 AM > > To: Claudio Elauterio > > Cc: nagios-users at lists.sourceforge.net > > Subject: Re: [Nagios-users] new look for nagios > > > > On 12/5/05, Claudio Elauterio wrote: > > > New look for nagios... > > > > Very nice! > > > > Is this on nagios-exchange, and there an english version? > > I don't see that he did anything really special. I only see three > minor changes that were made: changed the Nagios image at the top of > side.html, maybe edited the stylesheet for the same page to change the > background color and changed index.html to point to a custom page in > the 'side' frame. Those changes are easily accomplished with basic > knowledge of HTML. Is there something more that I'm missing? I'm not > deriding his work but encouraging you that the changes are minor and > easily accomplished. > > -- > Marc Eh, I've been in meetings all morning. At this point something shiny and new is enough to get me excited, at least initially. -- Jim Perrin System Architect - UIT Ft Gordon & US Army Signal Center ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From cameron.matheson at gmail.com Mon Dec 5 21:54:59 2005 From: cameron.matheson at gmail.com (Cam) Date: Mon, 5 Dec 2005 13:54:59 -0700 Subject: contactgroup{ }, and contactgroups # bug? Message-ID: <7e2f9af90512051254j2eeeb94cxf0c7369a1b596334@mail.gmail.com> Hi, So I'm writing some scripts to help me maintain my nagios configuration (actually to help other people be able to add new hosts, change some configuration parameters, etc). Anyway, I'm having a problem right now. In the case of hosts/hostgroups, this works fine: define hostgroup{ houstgroup_name kiosks alias kiosks } define host{ name host_template hostgroups kiosks ... } define host{ host_name foobar use host_template ... } #foobar will be put in the hostgroup kiosks however, doing an equivalent configuration w/ contacts/contactgroups doesn't work (i'm using nagios 2.0b4): define contactgroup{ contactgroup_name kiosk-folks alias Kiosk People } define contact{ name kiosk_contact contactgroups kiosk-folks ... register 0 } define contact{ use kiosk_contact contact_name jdoe alias John Doe ... } # DOESN'T ADD TO KIOSK-FOLKS When try to verify my nagios configuration this is the error i get: Error: Contactgroup has no members (config file '/usr/local/opt/nagios/etc/contactgroups.cfg', starting on line 25) I don't understand why it works for the one, but not for the other. Is this a bug? Thanks, Cameron Matheson ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jpratt at norwich.edu Mon Dec 5 22:31:07 2005 From: jpratt at norwich.edu (Jamie Pratt) Date: Mon, 05 Dec 2005 16:31:07 -0500 Subject: NRPE logging to syslog Message-ID: <4394B19B.8020207@norwich.edu> Hi. I have debugging set to 0 in nrpe.cfg, yet my /var/log/messages and /var/log/secure files are filling up fast with this kind of stuff: Dec 1 03:20:16 gentoo1 xinetd[5777]: START: nrpe pid=27547 from=192.168.10.111 Dec 1 03:20:16 gentoo1 xinetd[5777]: START: nrpe pid=27549 from=192.168.10.111 Dec 1 03:20:29 gentoo1 xinetd[5777]: START: nrpe pid=27558 from=192.168.10.111 Dec 1 03:20:29 gentoo1 xinetd[5777]: START: nrpe pid=27559 from=192.168.10.111 Dec 1 03:20:30 gentoo1 xinetd[5777]: START: nrpe pid=27564 from=192.168.10.111 Dec 1 03:20:30 gentoo1 xinetd[5777]: START: nrpe pid=27565 from=192.168.10.111 Dec 1 03:23:16 gentoo1 xinetd[5777]: START: nrpe pid=27585 from=192.168.10.111 Dec 1 03:23:16 gentoo1 xinetd[5777]: START: nrpe pid=27586 from=192.168.10.111 Dec 1 03:23:16 gentoo1 xinetd[5777]: START: nrpe pid=27587 from=192.168.10.111 Dec 1 03:23:29 gentoo1 xinetd[5777]: START: nrpe pid=27598 from=192.168.10.111 Dec 1 03:23:29 gentoo1 xinetd[5777]: START: nrpe pid=27599 from=192.168.10.111 Dec 1 03:23:30 gentoo1 xinetd[5777]: START: nrpe pid=27604 from=192.168.10.111 Dec 1 03:23:30 gentoo1 xinetd[5777]: START: nrpe pid=27605 from=192.168.10.111 Dec 1 03:26:14 gentoo1 xinetd[5777]: START: nrpe pid=27626 from=192.168.10.111 Dec 1 03:26:14 gentoo1 xinetd[5777]: START: nrpe pid=27628 from=192.168.10.111 Dec 1 03:26:14 gentoo1 xinetd[5777]: START: nrpe pid=27629 from=192.168.10.111 Dec 1 03:26:27 gentoo1 xinetd[5777]: START: nrpe pid=27640 from=192.168.10.111 Dec 1 03:26:27 gentoo1 xinetd[5777]: START: nrpe pid=27641 from=192.168.10.111 Dec 1 03:26:28 gentoo1 xinetd[5777]: START: nrpe pid=27646 from=192.168.10.111 ----------- Any way to shut this off completely so logs stop filling up? Thanks... regards, Jamie ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From tim at infostream.com.au Mon Dec 5 22:57:21 2005 From: tim at infostream.com.au (Tim O'Hare) Date: Tue, 06 Dec 2005 08:57:21 +1100 Subject: NRPE logging to syslog In-Reply-To: <4394B19B.8020207@norwich.edu> References: <4394B19B.8020207@norwich.edu> Message-ID: <4394B7C1.1020000@infostream.com.au> Hi Jamie, Check your xinetd configuration. Either /etc/xinetd.conf or /etc/xinetd.d/*. Looks like you have logging switched on in there. cheers, Tim Jamie Pratt wrote: > Hi. I have debugging set to 0 in nrpe.cfg, yet my /var/log/messages and > /var/log/secure files are filling up fast with this kind of stuff: > > Dec 1 03:20:16 gentoo1 xinetd[5777]: START: nrpe pid=27547 > from=192.168.10.111 > Dec 1 03:20:16 gentoo1 xinetd[5777]: START: nrpe pid=27549 > from=192.168.10.111 > Dec 1 03:20:29 gentoo1 xinetd[5777]: START: nrpe pid=27558 > from=192.168.10.111 > Dec 1 03:20:29 gentoo1 xinetd[5777]: START: nrpe pid=27559 > from=192.168.10.111 > Dec 1 03:20:30 gentoo1 xinetd[5777]: START: nrpe pid=27564 > from=192.168.10.111 > Dec 1 03:20:30 gentoo1 xinetd[5777]: START: nrpe pid=27565 > from=192.168.10.111 > Dec 1 03:23:16 gentoo1 xinetd[5777]: START: nrpe pid=27585 > from=192.168.10.111 > Dec 1 03:23:16 gentoo1 xinetd[5777]: START: nrpe pid=27586 > from=192.168.10.111 > Dec 1 03:23:16 gentoo1 xinetd[5777]: START: nrpe pid=27587 > from=192.168.10.111 > Dec 1 03:23:29 gentoo1 xinetd[5777]: START: nrpe pid=27598 > from=192.168.10.111 > Dec 1 03:23:29 gentoo1 xinetd[5777]: START: nrpe pid=27599 > from=192.168.10.111 > Dec 1 03:23:30 gentoo1 xinetd[5777]: START: nrpe pid=27604 > from=192.168.10.111 > Dec 1 03:23:30 gentoo1 xinetd[5777]: START: nrpe pid=27605 > from=192.168.10.111 > Dec 1 03:26:14 gentoo1 xinetd[5777]: START: nrpe pid=27626 > from=192.168.10.111 > Dec 1 03:26:14 gentoo1 xinetd[5777]: START: nrpe pid=27628 > from=192.168.10.111 > Dec 1 03:26:14 gentoo1 xinetd[5777]: START: nrpe pid=27629 > from=192.168.10.111 > Dec 1 03:26:27 gentoo1 xinetd[5777]: START: nrpe pid=27640 > from=192.168.10.111 > Dec 1 03:26:27 gentoo1 xinetd[5777]: START: nrpe pid=27641 > from=192.168.10.111 > Dec 1 03:26:28 gentoo1 xinetd[5777]: START: nrpe pid=27646 > from=192.168.10.111 > > > ----------- > > > Any way to shut this off completely so logs stop filling up? > > Thanks... > > regards, > Jamie > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log > files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. ::: Messages without supporting info will risk > being sent to /dev/null ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Mon Dec 5 23:02:19 2005 From: marc at ena.com (Marc Powell) Date: Mon, 5 Dec 2005 16:02:19 -0600 Subject: NRPE logging to syslog Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Jamie Pratt > Sent: Monday, December 05, 2005 3:31 PM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] NRPE logging to syslog > > Hi. I have debugging set to 0 in nrpe.cfg, yet my /var/log/messages and > /var/log/secure files are filling up fast with this kind of stuff: > > Dec 1 03:20:16 gentoo1 xinetd[5777]: START: nrpe pid=27547 > from=192.168.10.111 > Dec 1 03:20:16 gentoo1 xinetd[5777]: START: nrpe pid=27549 > from=192.168.10.111 [chop] > Any way to shut this off completely so logs stop filling up? This is xinetd logging, not nrpe. Do you have anything like log_on_success = PID HOST in your nrpe xinetd definition or the defaults? -- Marc ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From blakekrone at gmail.com Mon Dec 5 23:07:42 2005 From: blakekrone at gmail.com (Blake Krone) Date: Mon, 5 Dec 2005 15:07:42 -0700 Subject: Huge Service Latencies Message-ID: Service Check Execution Time: 0.10 / 10.04 / 2.175 sec Service Check Latency: 21.25 / 483.37 / 224.970 sec Host Check Execution Time: 0.06 / 10.70 / 4.307 sec Host Check Latency: 0.00 / 529.15 / 5.137 sec # Active Host / Service Checks: 103 / 236 # Passive Host / Service Checks: 0 / 1 Machine is a P3 733mhz with 128mb ram running 2.6.11-gentoo-r3 kernel and Nagios 2.03b Any ideas as to why it has such huge latencies? Some snippets from nagios.cfg service_inter_check_delay_method=s host_inter_check_delay_method=s service_interleave_factor=s max_concurrent_checks=0 service_reaper_frequency=10 Any help is appreciated! -------------- next part -------------- An HTML attachment was scrubbed... URL: From hvdkooij at vanderkooij.org Mon Dec 5 23:14:13 2005 From: hvdkooij at vanderkooij.org (Hugo van der Kooij) Date: Mon, 5 Dec 2005 23:14:13 +0100 (CET) Subject: contactgroup{ }, and contactgroups # bug? In-Reply-To: <7e2f9af90512051254j2eeeb94cxf0c7369a1b596334@mail.gmail.com> References: <7e2f9af90512051254j2eeeb94cxf0c7369a1b596334@mail.gmail.com> Message-ID: On Mon, 5 Dec 2005, Cam wrote: > however, doing an equivalent configuration w/ contacts/contactgroups > doesn't work (i'm using nagios 2.0b4): Read the documentation: define contactgroup{ contactgroup_name contactgroup_name alias alias members members } All lines are in red and therefor required. Read your online version on: nagios/docs/xodtemplate.html#contactgroup > define contactgroup{ > contactgroup_name kiosk-folks > alias Kiosk People > } No members here! Hugo. -- I hate duplicates. Just reply to the relevant mailinglist. hvdkooij at vanderkooij.org http://hvdkooij.xs4all.nl/ Don't meddle in the affairs of magicians, for they are subtle and quick to anger. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From hvdkooij at vanderkooij.org Mon Dec 5 23:19:29 2005 From: hvdkooij at vanderkooij.org (Hugo van der Kooij) Date: Mon, 5 Dec 2005 23:19:29 +0100 (CET) Subject: Huge Service Latencies In-Reply-To: References: Message-ID: On Mon, 5 Dec 2005, Blake Krone wrote: > Service Check Execution Time: 0.10 / 10.04 / 2.175 sec > Service Check Latency: 21.25 / 483.37 / 224.970 sec > Host Check Execution Time: 0.06 / 10.70 / 4.307 sec > Host Check Latency: 0.00 / 529.15 / 5.137 sec > # Active Host / Service Checks: 103 / 236 > # Passive Host / Service Checks: 0 / 1 > > Machine is a P3 733mhz with 128mb ram running 2.6.11-gentoo-r3 kernel and > Nagios 2.03b A bare website with 128MB is not too generous. Add nagios to it and it becomes cramped. And you must run CGI scripts, check scripts and the lot. I would say you're running out of RAM. You can use tools like top, vmstat, free, .... to check your machine. Hugo. -- I hate duplicates. Just reply to the relevant mailinglist. hvdkooij at vanderkooij.org http://hvdkooij.xs4all.nl/ Don't meddle in the affairs of magicians, for they are subtle and quick to anger. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From estair at ilm.com Mon Dec 5 23:31:46 2005 From: estair at ilm.com (Eli Stair) Date: Mon, 05 Dec 2005 14:31:46 -0800 Subject: Huge Service Latencies In-Reply-To: References: Message-ID: <4394BFD2.1050609@ilm.com> Just for comparison, I'm having a very similar issue. Currently it's better, but latency last week was around the same area. I'm running with the perlcache, most checks are quick SNMP gets. System is a dual Opteron 246, 4GB RAM and currently doing nothing but this. Check Execution Time: 0.02 sec 30.02 sec 6.020 sec Check Latency: 0.09 sec 79.02 sec 69.122 sec Percent State Change: 0.00% 15.86% 0.03% /eli PS - I'm moving the SNMP checks to Cacti, and pulling them with check_rrd_data.pl due to the huge delay (and load) in the perl wrapper to snmpwalk. Hugo van der Kooij wrote: > On Mon, 5 Dec 2005, Blake Krone wrote: > > >>Service Check Execution Time: 0.10 / 10.04 / 2.175 sec >>Service Check Latency: 21.25 / 483.37 / 224.970 sec >>Host Check Execution Time: 0.06 / 10.70 / 4.307 sec >>Host Check Latency: 0.00 / 529.15 / 5.137 sec >># Active Host / Service Checks: 103 / 236 >># Passive Host / Service Checks: 0 / 1 >> >>Machine is a P3 733mhz with 128mb ram running 2.6.11-gentoo-r3 kernel and >>Nagios 2.03b > > > A bare website with 128MB is not too generous. Add nagios to it and it > becomes cramped. And you must run CGI scripts, check scripts and the lot. > I would say you're running out of RAM. > > You can use tools like top, vmstat, free, .... to check your machine. > > Hugo. > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From cguimont at rocq.qc.ca Mon Dec 5 23:48:35 2005 From: cguimont at rocq.qc.ca (Charles Guimont) Date: Mon, 05 Dec 2005 17:48:35 -0500 Subject: Internal server error In-Reply-To: References: Message-ID: <0A990BA2-9104-4DBA-B4AB-EE9E22B778AC@rocq.qc.ca> I permormed the test as root. Any other ideas? Charles On 05-12-05, at 15:03, Marc Powell wrote: > > >> -----Original Message----- >> From: Charles Guimont [mailto:cguimont at rocq.qc.ca] >> Sent: Monday, December 05, 2005 10:19 AM >> To: Marc Powell; nagios-users at lists.sourceforge.net >> Subject: Re: [Nagios-users] Internal server error >> >> This is the error message I get: >> >>

Error: Could not read object >> configuration data!

>>

>> Here are some things you should check in order to resolve this error: >>

>>

>>

    >>
  1. Verify configuration options using the -v command-line >> option to check for errors. >>
  2. Check the Nagios log file for messages relating to startup or >> status data errors. >>
  3. Make sure you've compiled the main program and the CGIs to use >> the same object data storage options (i.e. default text file or >> template-based file). >>
> > What user did you perform this test as? If apache, it would appear > that > permission on the directories or files are such that it can't read the > config files. > >> >> First, running the command nagios -v nagios.cfg return no errors >> >> Startup makes no errors: >> >> [1133799437] Nagios 2.0b6 starting... (PID=907) >> [1133799437] LOG VERSION: 2.0 >> [1133799437] Finished daemonizing... (New PID=908) >> >> >> How can I make sure of this?? >> Make sure you've compiled the main program and the CGIs to use the >> same object data storage options (i.e. default text file or template- >> based file). > > This error doesn't make much sense any more with 2.x since all object > storage options are now template-based. There may be some circumstance > where that might be pertinent but I'll bet you would know what you > were > doing to make it happen. > > -- > Marc > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through > log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD > SPLUNK! > http://ads.osdn.com/?ad_idv37&alloc_id865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Ludwig.Pummer at Copart.Com Tue Dec 6 00:03:00 2005 From: Ludwig.Pummer at Copart.Com (Ludwig Pummer) Date: Mon, 5 Dec 2005 15:03:00 -0800 Subject: Huge Service Latencies Message-ID: I had large latencies caused by (1) hosts being down and (2) a host check that took way too long. Nagios will suspend all service checks while it performs a host check. In my case, the Nagios config I inherited made 10 attempts at 30 seconds each attempt before declaring a host to be down, so if one host went down, the host check took 300 seconds. I typically have 15 hosts down out of 475. I changed my host checks to 3 attempts at 6 seconds each, and latencies went way down. -- Ludwig Pummer System Administrator Copart Auto Auctions ________________________________ From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Blake Krone Sent: Monday, December 05, 2005 2:08 PM To: Nagios Users Subject: [Nagios-users] Huge Service Latencies Service Check Execution Time: 0.10 / 10.04 / 2.175 sec Service Check Latency: 21.25 / 483.37 / 224.970 sec Host Check Execution Time: 0.06 / 10.70 / 4.307 sec Host Check Latency: 0.00 / 529.15 / 5.137 sec # Active Host / Service Checks: 103 / 236 # Passive Host / Service Checks: 0 / 1 Machine is a P3 733mhz with 128mb ram running 2.6.11-gentoo-r3 kernel and Nagios 2.03b Any ideas as to why it has such huge latencies? -------------- next part -------------- An HTML attachment was scrubbed... URL: From estair at ilm.com Tue Dec 6 00:07:38 2005 From: estair at ilm.com (Eli Stair) Date: Mon, 05 Dec 2005 15:07:38 -0800 Subject: Huge Service Latencies In-Reply-To: References: Message-ID: <4394C83A.8000109@ilm.com> I had a suspicion that the cause might be large numbers of hosts that go down. Until I get around to them the host checks obviously continue... perhaps that's what's causing the general slowdown in all of nagios. Makes me wish again host checks were parallelized. /eli Ludwig Pummer wrote: > I had large latencies caused by (1) hosts being down and (2) a host > check that took way too long. Nagios will suspend all service checks > while it performs a host check. In my case, the Nagios config I > inherited made 10 attempts at 30 seconds each attempt before declaring a > host to be down, so if one host went down, the host check took 300 > seconds. I typically have 15 hosts down out of 475. I changed my host > checks to 3 attempts at 6 seconds each, and latencies went way down. > > > -- > Ludwig Pummer > System Administrator > Copart Auto Auctions > > > > ------------------------------------------------------------------------ > *From:* nagios-users-admin at lists.sourceforge.net > [mailto:nagios-users-admin at lists.sourceforge.net] *On Behalf Of > *Blake Krone > *Sent:* Monday, December 05, 2005 2:08 PM > *To:* Nagios Users > *Subject:* [Nagios-users] Huge Service Latencies > > Service Check Execution Time: 0.10 / 10.04 / 2.175 sec > Service Check Latency: 21.25 / 483.37 / 224.970 sec > Host Check Execution Time: 0.06 / 10.70 / 4.307 sec > Host Check Latency: 0.00 / 529.15 / 5.137 sec > # Active Host / Service Checks: 103 / 236 > # Passive Host / Service Checks: 0 / 1 > > Machine is a P3 733mhz with 128mb ram running 2.6.11-gentoo-r3 > kernel and Nagios 2.03b > > > Any ideas as to why it has such huge latencies? > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From blakekrone at gmail.com Tue Dec 6 00:14:11 2005 From: blakekrone at gmail.com (Blake Krone) Date: Mon, 5 Dec 2005 16:14:11 -0700 Subject: Huge Service Latencies In-Reply-To: References: Message-ID: I have active host checks set to 0 so that they won't actually check unless a service fails on the host. How do you specify a recheck in seconds? I thought minutes were all that was allowed. -Blake On 12/5/05, Ludwig Pummer wrote: > > I had large latencies caused by (1) hosts being down and (2) a host check > that took way too long. Nagios will suspend all service checks while it > performs a host check. In my case, the Nagios config I inherited made 10 > attempts at 30 seconds each attempt before declaring a host to be down, so > if one host went down, the host check took 300 seconds. I typically have 15 > hosts down out of 475. I changed my host checks to 3 attempts at 6 seconds > each, and latencies went way down. > > > -- > Ludwig Pummer > System Administrator > Copart Auto Auctions > > > > ------------------------------ > *From:* nagios-users-admin at lists.sourceforge.net [mailto: > nagios-users-admin at lists.sourceforge.net] *On Behalf Of *Blake Krone > *Sent:* Monday, December 05, 2005 2:08 PM > *To:* Nagios Users > *Subject:* [Nagios-users] Huge Service Latencies > > Service Check Execution Time: 0.10 / 10.04 / 2.175 sec > Service Check Latency: 21.25 / 483.37 / 224.970 sec > Host Check Execution Time: 0.06 / 10.70 / 4.307 sec > Host Check Latency: 0.00 / 529.15 / 5.137 sec > # Active Host / Service Checks: 103 / 236 > # Passive Host / Service Checks: 0 / 1 > > Machine is a P3 733mhz with 128mb ram running 2.6.11-gentoo-r3 kernel and > Nagios 2.03b > > > Any ideas as to why it has such huge latencies? > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From Ludwig.Pummer at Copart.Com Tue Dec 6 00:25:32 2005 From: Ludwig.Pummer at Copart.Com (Ludwig Pummer) Date: Mon, 5 Dec 2005 15:25:32 -0800 Subject: Huge Service Latencies Message-ID: I should point out I'm running Nagios 1.3, which will only perform host checks if a service check fails. I use check_ping and the 30 seconds and 6 seconds numbers were what I passed to check_ping in my checkcommands.cfg (although it's probable check_ping wasn't waiting the full 30 seconds, it certainly did wait longer than 6 seconds). It was in the host template definition that I set max_check_attempts. If you just want to see whether host checks are the source of your latency, try temporarily replacing your host check command with a shell script which does "exit 0". You will of course have to suffer through more service notifications due to downed hosts. -- Ludwig Pummer System Administrator Copart Auto Auctions ________________________________ From: Blake Krone [mailto:blakekrone at gmail.com] Sent: Monday, December 05, 2005 3:14 PM To: Ludwig Pummer Cc: Nagios Users Subject: Re: [Nagios-users] Huge Service Latencies I have active host checks set to 0 so that they won't actually check unless a service fails on the host. How do you specify a recheck in seconds? I thought minutes were all that was allowed. -Blake On 12/5/05, Ludwig Pummer wrote: I had large latencies caused by (1) hosts being down and (2) a host check that took way too long. Nagios will suspend all service checks while it performs a host check. In my case, the Nagios config I inherited made 10 attempts at 30 seconds each attempt before declaring a host to be down, so if one host went down, the host check took 300 seconds. I typically have 15 hosts down out of 475. I changed my host checks to 3 attempts at 6 seconds each, and latencies went way down. -- Ludwig Pummer System Administrator Copart Auto Auctions ________________________________ From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Blake Krone Sent: Monday, December 05, 2005 2:08 PM To: Nagios Users Subject: [Nagios-users] Huge Service Latencies Service Check Execution Time: 0.10 / 10.04 / 2.175 sec Service Check Latency: 21.25 / 483.37 / 224.970 sec Host Check Execution Time: 0.06 / 10.70 / 4.307 sec Host Check Latency: 0.00 / 529.15 / 5.137 sec # Active Host / Service Checks: 103 / 236 # Passive Host / Service Checks: 0 / 1 Machine is a P3 733mhz with 128mb ram running 2.6.11-gentoo-r3 kernel and Nagios 2.03b Any ideas as to why it has such huge latencies? -------------- next part -------------- An HTML attachment was scrubbed... URL: From b00mer at gmx.net Tue Dec 6 00:30:07 2005 From: b00mer at gmx.net (Hendrik Baecker) Date: Tue, 06 Dec 2005 00:30:07 +0100 Subject: Huge Service Latencies In-Reply-To: <4394C83A.8000109@ilm.com> References: <4394C83A.8000109@ilm.com> Message-ID: <4394CD7F.9000700@gmx.net> Eli Stair schrieb: > > I had a suspicion that the cause might be large numbers of hosts that > go down. Until I get around to them the host checks obviously > continue... perhaps that's what's causing the general slowdown in all > of nagios. > Think about why a host check is executed... not only when a host is really down... Every time when a service check returns a non-OK state nagios will do a host check. On the other side might it be, that you are using dist-monitoring with nsca? We have had very high latency (600seconds up to somthing around three hours) when using nsca on each single service_check result. Just my 2 cents > Makes me wish again host checks were parallelized. > > /eli > > Ludwig Pummer wrote: > >> I had large latencies caused by (1) hosts being down and (2) a host >> check that took way too long. Nagios will suspend all service checks >> while it performs a host check. In my case, the Nagios config I >> inherited made 10 attempts at 30 seconds each attempt before >> declaring a host to be down, so if one host went down, the host check >> took 300 seconds. I typically have 15 hosts down out of 475. I >> changed my host checks to 3 attempts at 6 seconds each, and latencies >> went way down. >> >> >> -- >> Ludwig Pummer >> System Administrator >> Copart Auto Auctions >> >> >> >> >> ------------------------------------------------------------------------ >> *From:* nagios-users-admin at lists.sourceforge.net >> [mailto:nagios-users-admin at lists.sourceforge.net] *On Behalf Of >> *Blake Krone >> *Sent:* Monday, December 05, 2005 2:08 PM >> *To:* Nagios Users >> *Subject:* [Nagios-users] Huge Service Latencies >> >> Service Check Execution Time: 0.10 / 10.04 / 2.175 sec >> Service Check Latency: 21.25 / 483.37 / 224.970 sec >> Host Check Execution Time: 0.06 / 10.70 / 4.307 sec >> Host Check Latency: 0.00 / 529.15 / 5.137 sec >> # Active Host / Service Checks: 103 / 236 >> # Passive Host / Service Checks: 0 / 1 >> >> Machine is a P3 733mhz with 128mb ram running 2.6.11-gentoo-r3 >> kernel and Nagios 2.03b >> >> >> Any ideas as to why it has such huge latencies? >> > > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log > files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. ::: Messages without supporting info will risk > being sent to /dev/null > > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From estair at ilm.com Tue Dec 6 00:56:09 2005 From: estair at ilm.com (Eli Stair) Date: Mon, 05 Dec 2005 15:56:09 -0800 Subject: Huge Service Latencies In-Reply-To: References: Message-ID: <4394D399.50408@ilm.com> Good point. I don't know how to implement host-check-only-on-servicecheck-fail on 2.0. I don't see in the config that it is supported, from what I read execute_host_checks is a global do or donot hostcheck declarative. Is this correct? /eli Ludwig Pummer wrote: > I should point out I'm running Nagios 1.3, which will only perform host > checks if a service check fails. I use check_ping and the 30 seconds and > 6 seconds numbers were what I passed to check_ping in my > checkcommands.cfg (although it's probable check_ping wasn't waiting the > full 30 seconds, it certainly did wait longer than 6 seconds). It was in > the host template definition that I set max_check_attempts. > > If you just want to see whether host checks are the source of your > latency, try temporarily replacing your host check command with a shell > script which does "exit 0". You will of course have to suffer through > more service notifications due to downed hosts. > > -- > Ludwig Pummer > System Administrator > Copart Auto Auctions > > > > ------------------------------------------------------------------------ > *From:* Blake Krone [mailto:blakekrone at gmail.com] > *Sent:* Monday, December 05, 2005 3:14 PM > *To:* Ludwig Pummer > *Cc:* Nagios Users > *Subject:* Re: [Nagios-users] Huge Service Latencies > > I have active host checks set to 0 so that they won't actually check > unless a service fails on the host. How do you specify a recheck in > seconds? I thought minutes were all that was allowed. > > -Blake > > On 12/5/05, *Ludwig Pummer* > wrote: > > I had large latencies caused by (1) hosts being down and (2) a > host check that took way too long. Nagios will suspend all > service checks while it performs a host check. In my case, the > Nagios config I inherited made 10 attempts at 30 seconds each > attempt before declaring a host to be down, so if one host went > down, the host check took 300 seconds. I typically have 15 hosts > down out of 475. I changed my host checks to 3 attempts at 6 > seconds each, and latencies went way down. > > > -- > Ludwig Pummer > System Administrator > Copart Auto Auctions > > > > ------------------------------------------------------------------------ > *From:* nagios-users-admin at lists.sourceforge.net > > [mailto:nagios-users-admin at lists.sourceforge.net > ] *On > Behalf Of *Blake Krone > *Sent:* Monday, December 05, 2005 2:08 PM > *To:* Nagios Users > *Subject:* [Nagios-users] Huge Service Latencies > > Service Check Execution Time: 0.10 / 10.04 / 2.175 sec > Service Check Latency: 21.25 / 483.37 / 224.970 sec > Host Check Execution Time: 0.06 / 10.70 / 4.307 sec > Host Check Latency: 0.00 / 529.15 / 5.137 sec > # Active Host / Service Checks: 103 / 236 > # Passive Host / Service Checks: 0 / 1 > > Machine is a P3 733mhz with 128mb ram running > 2.6.11-gentoo-r3 kernel and Nagios 2.03b > > > Any ideas as to why it has such huge latencies? > > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From cameron.matheson at gmail.com Tue Dec 6 01:02:11 2005 From: cameron.matheson at gmail.com (Cam) Date: Mon, 5 Dec 2005 17:02:11 -0700 Subject: contactgroup{ }, and contactgroups # bug? In-Reply-To: References: <7e2f9af90512051254j2eeeb94cxf0c7369a1b596334@mail.gmail.com> Message-ID: <7e2f9af90512051602y21145f4fq99b5ed451facbb61@mail.gmail.com> On 12/5/05, Hugo van der Kooij wrote: > Read the documentation: > > define contactgroup{ > contactgroup_name contactgroup_name > alias alias > members members > } > > All lines are in red and therefor required. Read your online version on: > nagios/docs/xodtemplate.html#contactgroup > > > define contactgroup{ > > contactgroup_name kiosk-folks > > alias Kiosk People > > } > > No members here! That's exactly my point. There shouldn't have to be. It's listed as required in the 'hostgroups' definition too, but it doesn't actually require it (because if you add members on a per-host basis you shouldn't have to define them in the group definition also). I'm wondering why the inconsistency, and is that a bug? Cameron Matheson ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jhmartin at toger.us Tue Dec 6 02:06:18 2005 From: jhmartin at toger.us (Jason Martin) Date: Mon, 5 Dec 2005 20:06:18 -0500 Subject: Using NRPE to call remote event_handlers In-Reply-To: <932A04DB269BFF4F886C2ACD54C62CAC02C6B033@uftwfexchsvr.uftmaster.org> References: <932A04DB269BFF4F886C2ACD54C62CAC02C6B033@uftwfexchsvr.uftmaster.org> Message-ID: <20051206010618.GA22204@mal.members.linode.com> On Mon, Dec 05, 2005 at 11:55:26AM -0500, Mark Pearson wrote: > restart httpd, I noticed in the sample cfgs that the event_handlers seem all > to be running locally. Should I have the full blown nagios running on > remote machine? Currently, I only have the nrpe, plugins and my This is not necessary, just replace the part where it invokes the restart command with the appropriate NRPE call. -Jason Martin -- Press Ctrl-Alt-Del to continue... This message is PGP/MIME signed. -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 211 bytes Desc: not available URL: From cguimont at rocq.qc.ca Tue Dec 6 05:35:06 2005 From: cguimont at rocq.qc.ca (Charles Guimont) Date: Mon, 05 Dec 2005 23:35:06 -0500 Subject: Internal server error In-Reply-To: References: Message-ID: <3F1C094F-5AEC-40AF-94FB-A7E50A1F23C4@rocq.qc.ca> Hellol, I tryed > $ bash > $ REQEUST_METHOD=GET > $ export REQUEST_METHOD > $ /usr/local/nagios/sbin/status.cgi Again and it worked as root. What could be causing the problem now? Thanks, Charles On 05-12-05, at 11:02, Marc Powell wrote: > Try this (preferably as the apache user) -- > > $ bash > $ REQEUST_METHOD=GET > $ export REQUEST_METHOD > $ /usr/local/nagios/sbin/status.cgi > > It looks like there may be some library missing or not where it's > supposed to be. Was nagios compiled on this machine? You can use 'ldd > /usr/local/nagios/sbin/status.cgi' and verify that the libraries exist > where they're expected. Additionally, if you have SELinux enabled, > make > sure that's not causing any problems. > > -- > Marc > > >> -----Original Message----- >> From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- >> admin at lists.sourceforge.net] On Behalf Of Charles Guimont >> Sent: Monday, December 05, 2005 9:48 AM >> To: jpratt at norwich.edu; nagios-users at lists.sourceforge.net >> Subject: Re: [Nagios-users] Internal server error >> >> Hello, I forgot to say what was the error log in apache: >> >> >> [Mon Dec 5 10:46:05 2005] [error] [client ***] Premature end of >> script headers: /usr/local/nagios/sbin/status.cgi >> >> >> Charles >> >> On 05-12-05, at 10:37, Jamie Pratt wrote: >> >>> Hi. Did you modify your httpd.conf file like it says to below, and >>> then restart apache? I would check your apache access and error >>> logs as well for more clues... >>> >>> http://nagios.sourceforge.net/docs/2_0/installweb.html >>> >>> regards, >>> jamie >>> >>> -- >>> >>> >>> Jamie Pratt >>> Unix Systems Administrator >>> Norwich University - Information Technology >>> http://www.norwich.edu >>> 1-802-485-2532 >>> >>> >>> >>> Charles Guimont wrote: >>>> Hello, >>>> I have installed nagios. Since I installeed it, when I try to >>>> access the cgi using the web, I get an error of: >>>> *Internal Server Error* >>>> The server encountered an internal error or misconfiguration and >>>> was unable to complete your request. >>>> Please contact the server administrator, webmaster at rocq.qc.ca >>>> and inform them of the time the >>>> error occurred, and anything you might have done that may have >>>> caused the error. >>>> More information about this error may be available in the server >>>> error log. >>>> Additionally, a 404 Not Found error was encountered while trying >>>> to use an ErrorDocument to handle the request. >>>> /Apache/1.3.34 Server at www.rocq.qc.ca >>>> Port 80/ >>>> I tryed to search the web to find some solutions without success. >>>> Is there something I could to to make it work? >>>> I tryed to recompile many times >>>> Thanks, >>>> Charles >>> >>> >> >> >> >> ------------------------------------------------------- >> This SF.net email is sponsored by: Splunk Inc. Do you grep through >> log >> files >> for problems? Stop! Download the new AJAX search engine that makes >> searching your log files as easy as surfing the web. DOWNLOAD > SPLUNK! >> http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when >> reporting any issue. >> ::: Messages without supporting info will risk being sent to /dev/ >> null > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through > log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD > SPLUNK! > http://ads.osdn.com/?ad_idv37&alloc_id865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Stanley.Hopcroft at Dest.gov.au Tue Dec 6 06:09:27 2005 From: Stanley.Hopcroft at Dest.gov.au (Stanley.Hopcroft at Dest.gov.au) Date: Tue, 6 Dec 2005 16:09:27 +1100 Subject: Reporting ideas sought. Message-ID: Dear Folks, I am writing to welcome clues about providing an itemised list of outages and their causes from, 'in some way', Nagios. The Nagios availability report does ineed provide a useful list of outages that can be wrapped and processed to ones hearts content (eg HOST_NAME DOWN UP OUTAGE Albany_DEST_router 05-12-2005 04:10:59 05-12-2005 08:42:29 4h 31m 30s Albany_Optus_router_PE_in 05-12-2005 04:10:59 05-12-2005 08:42:29 4h 31m 30s Lismore_DEST_router 05-12-2005 16:11:30 05-12-2005 20:01:40 3h 50m 10s Lismore_Optus_router_PE_i 05-12-2005 16:11:30 05-12-2005 20:01:40 3h 50m 10s Kempsey_DEST_router 05-12-2005 13:16:39 05-12-2005 13:22:49 6m 10s Kempsey_Optus_router_PE_i 05-12-2005 13:16:39 05-12-2005 13:22:49 6m 10s Broken_Hill_Optus_router_ 05-12-2005 01:54:17 05-12-2005 01:57:27 3m 10s Broken_Hill_DEST_router 05-12-2005 01:56:07 05-12-2005 01:57:27 1m 20s ) but Nagios has AFAIK, no means of capuring event related data and associating it with an outage event to produce something like HOST_NAME DOWN UP OUTAGE CAUSE COMMENT Albany_DEST_router 05-12-2005 04:10:59 05-12-2005 08:42:29 4h 31m 30s 1 BDR -> down, provider Albany_Optus_router_PE_in 05-12-2005 04:10:59 05-12-2005 08:42:29 4h 31m 30s 1 BDR -> down, provider Lismore_DEST_router 05-12-2005 16:11:30 05-12-2005 20:01:40 3h 50m 10s 2 router restart by power-on Lismore_Optus_router_PE_i 05-12-2005 16:11:30 05-12-2005 20:01:40 3h 50m 10s 2 power failure Kempsey_DEST_router 05-12-2005 13:16:39 05-12-2005 13:22:49 6m 10s 1 BDR -> down, provider Kempsey_Optus_router_PE_i 05-12-2005 13:16:39 05-12-2005 13:22:49 6m 10s 1 BDR -> down, provider Broken_Hill_Optus_router_ 05-12-2005 01:54:17 05-12-2005 01:57:27 3m 10s 5 dismiss Broken_Hill_DEST_router 05-12-2005 01:56:07 05-12-2005 01:57:27 1m 20s 5 dismiss In this case, cause is a coded value that classifies the fault and the comment is free form text. The best I can think of to create something like this is to 1 Append the outages to a file - possibly by having an event handler run the code that extracts the outage from the availability CGI - or better still all the data for an outage is prob provided by macros - for the host or service and appending that to a file. 2 Have an admin edit the file and add the values when they become known. The guts of the problem is Nagios does the right thing by automatically changing the state of monitored entity; there is no opportuntity to 'officially' close the 'fault' by collecting user-input and associating it with an outage. Looked at another way, outages don't really exist as first class objects (with their own methods and data). All comments are very welcome, Yours sincerely. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From hvdkooij at vanderkooij.org Tue Dec 6 07:14:26 2005 From: hvdkooij at vanderkooij.org (Hugo van der Kooij) Date: Tue, 6 Dec 2005 07:14:26 +0100 (CET) Subject: contactgroup{ }, and contactgroups # bug? In-Reply-To: <7e2f9af90512051602y21145f4fq99b5ed451facbb61@mail.gmail.com> References: <7e2f9af90512051254j2eeeb94cxf0c7369a1b596334@mail.gmail.com> <7e2f9af90512051602y21145f4fq99b5ed451facbb61@mail.gmail.com> Message-ID: On Mon, 5 Dec 2005, Cam wrote: > That's exactly my point. There shouldn't have to be. It's listed as > required in the 'hostgroups' definition too, but it doesn't actually > require it (because if you add members on a per-host basis you > shouldn't have to define them in the group definition also). I'm > wondering why the inconsistency, and is that a bug? If it is not required for the hostgroup then THAT is the bug. Unfortunatly I can not find a bug tracking tool on the Nagios site. Hugo. -- I hate duplicates. Just reply to the relevant mailinglist. hvdkooij at vanderkooij.org http://hvdkooij.xs4all.nl/ Don't meddle in the affairs of magicians, for they are subtle and quick to anger. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From hvdkooij at vanderkooij.org Tue Dec 6 07:16:09 2005 From: hvdkooij at vanderkooij.org (Hugo van der Kooij) Date: Tue, 6 Dec 2005 07:16:09 +0100 (CET) Subject: Internal server error In-Reply-To: <3F1C094F-5AEC-40AF-94FB-A7E50A1F23C4@rocq.qc.ca> References: <3F1C094F-5AEC-40AF-94FB-A7E50A1F23C4@rocq.qc.ca> Message-ID: On Mon, 5 Dec 2005, Charles Guimont wrote: > Hellol, > I tryed > > $ bash > > $ REQEUST_METHOD=GET > > $ export REQUEST_METHOD > > $ /usr/local/nagios/sbin/status.cgi > > Again and it worked as root. What could be causing the problem now? Does it work as apache? (Or whoeven runs your webserver.) Hugo. -- I hate duplicates. Just reply to the relevant mailinglist. hvdkooij at vanderkooij.org http://hvdkooij.xs4all.nl/ Don't meddle in the affairs of magicians, for they are subtle and quick to anger. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From bipsen-sender-25cac0 at andebakken.dk Tue Dec 6 12:50:39 2005 From: bipsen-sender-25cac0 at andebakken.dk (Brian Ipsen) Date: Tue, 6 Dec 2005 12:50:39 +0100 Subject: Problem with nagios-plugin built RPM Message-ID: <1133869840.13016.TMDA@worf.andebakken.dk> Hi, I'm trying to build an RPM with the nagios plugins... Basically, what I do, is: # perl -MCPAN -e 'install Net::SNMP' # /usr/sbin/groupadd nagios # /usr/sbin/useradd -r -d /var/log/nagios -s /bin/sh -c "nagios" -g nagios nagios # cd /usr/src/whitebox/SOURCES # wget http://surfnet.dl.sourceforge.net/sourceforge/nagiosplug/nagios-plugins-1.4. 2.tar.gz # tar xzpf nagios-plugins-1.4.2.tar.gz *.spec # mv -f nagios-plugins-1.4.2/nagios-plugins.spec ../SPECS/ # rm -Rf nagios-plugins-1.4.2 # cd ../SPECS # chown 0.0 nagios-plugins.spec # rpmbuild -ba nagios-plugins.spec # rpm -Uvh ../RPMS/i386/nagios-plugins-1.4.2-1.i386.rpm But this gives me: error: Failed dependencies: perl(Net::SNMP) is needed by nagios-plugins-1.4-2.1.el3.rf.i386 I then check: # perl -MCPAN -e 'install Net::SNMP' CPAN: Storable loaded ok Going to read /root/.cpan/Metadata Database was generated on Tue, 06 Dec 2005 01:01:00 GMT Net::SNMP is up to date. Why is it complaining about Net::SNMP ?? I also tried: # rpm -qa | grep net-snmp net-snmp-libs-5.1.2-11 net-snmp-devel-5.1.2-11 net-snmp-perl-5.1.2-11 net-snmp-utils-5.1.2-11 net-snmp-5.1.2-11 Where is the problem ?? Regards /Brian ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From frank.bourdeau at dga.defense.gouv.fr Tue Dec 6 13:24:20 2005 From: frank.bourdeau at dga.defense.gouv.fr (frank.bourdeau at dga.defense.gouv.fr) Date: Tue, 6 Dec 2005 13:24:20 +0100 Subject: =?ISO-8859-1?Q?R=E9f=2E_=3A_[Nagios-users]_Problem_with_nagios-plugin?= =?ISO-8859-1?Q?_built_RPM?= In-Reply-To: References: Message-ID: <200512061205.jB6C5sMg028501@maila.dga.defense.gouv.fr> Hi, You must intsall the package perl-SNMP from your distro or you must rebuild your nagios-plugins package, because you have installed net::SNMP perl script. Best regards ---- Frank BOURDEAU tel: 0241936560 fax: 0241936681 frank.bourdeau at dga.defense.gouv.fr Brian Ipsen Envoy? par : nagios-users-admin at lists.sourceforge.net 06/12/2005 12:50 Veuillez r?pondre ? Brian Ipsen Pour : cc : Objet : [Nagios-users] Problem with nagios-plugin built RPM Hi, I'm trying to build an RPM with the nagios plugins... Basically, what I do, is: # perl -MCPAN -e 'install Net::SNMP' # /usr/sbin/groupadd nagios # /usr/sbin/useradd -r -d /var/log/nagios -s /bin/sh -c "nagios" -g nagios nagios # cd /usr/src/whitebox/SOURCES # wget http://surfnet.dl.sourceforge.net/sourceforge/nagiosplug/nagios-plugins-1.4 . 2.tar.gz # tar xzpf nagios-plugins-1.4.2.tar.gz *.spec # mv -f nagios-plugins-1.4.2/nagios-plugins.spec ../SPECS/ # rm -Rf nagios-plugins-1.4.2 # cd ../SPECS # chown 0.0 nagios-plugins.spec # rpmbuild -ba nagios-plugins.spec # rpm -Uvh ../RPMS/i386/nagios-plugins-1.4.2-1.i386.rpm But this gives me: error: Failed dependencies: perl(Net::SNMP) is needed by nagios-plugins-1.4-2.1.el3.rf.i386 I then check: # perl -MCPAN -e 'install Net::SNMP' CPAN: Storable loaded ok Going to read /root/.cpan/Metadata Database was generated on Tue, 06 Dec 2005 01:01:00 GMT Net::SNMP is up to date. Why is it complaining about Net::SNMP ?? I also tried: # rpm -qa | grep net-snmp net-snmp-libs-5.1.2-11 net-snmp-devel-5.1.2-11 net-snmp-perl-5.1.2-11 net-snmp-utils-5.1.2-11 net-snmp-5.1.2-11 Where is the problem ?? Regards /Brian ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -------------- next part -------------- An HTML attachment was scrubbed... URL: From cguimont at rocq.qc.ca Tue Dec 6 14:10:28 2005 From: cguimont at rocq.qc.ca (Charles Guimont) Date: Tue, 06 Dec 2005 08:10:28 -0500 Subject: Internal server error In-Reply-To: References: <3F1C094F-5AEC-40AF-94FB-A7E50A1F23C4@rocq.qc.ca> Message-ID: <953BA938-FE61-4FC0-8064-16E089073EEC@rocq.qc.ca> I can't seam to use user nobody, I get an error of, user is not available Charles On 05-12-06, at 01:16, Hugo van der Kooij wrote: > On Mon, 5 Dec 2005, Charles Guimont wrote: > >> Hellol, >> I tryed >>> $ bash >>> $ REQEUST_METHOD=GET >>> $ export REQUEST_METHOD >>> $ /usr/local/nagios/sbin/status.cgi >> >> Again and it worked as root. What could be causing the problem now? > > Does it work as apache? (Or whoeven runs your webserver.) > > Hugo. > > -- > I hate duplicates. Just reply to the relevant mailinglist. > hvdkooij at vanderkooij.org http://hvdkooij.xs4all.nl/ > Don't meddle in the affairs of magicians, > for they are subtle and quick to anger. > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through > log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD > SPLUNK! > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From bipsen-sender-25cac0 at andebakken.dk Tue Dec 6 15:37:02 2005 From: bipsen-sender-25cac0 at andebakken.dk (Brian Ipsen) Date: Tue, 6 Dec 2005 15:37:02 +0100 Subject: Problem with nagios-plugin built RPM In-Reply-To: <200512061205.jB6C5sMg028501@maila.dga.defense.gouv.fr> References: <200512061205.jB6C5sMg028501@maila.dga.defense.gouv.fr> Message-ID: <1133879823.14570.TMDA@worf.andebakken.dk> Hi, I've installed the following SNMP packages: net-snmp-libs-5.1.2-11 net-snmp-5.1.2-11 net-snmp-utils-5.1.2-11 net-snmp-devel-5.1.2-11 net-snmp-perl-5.1.2-11 And these packages containing perl in the name: perl-5.8.5-12.1 perl-XML-Parser-2.34-5 perl-XML-SAX-0.12-7 perl-Crypt-SSLeay-0.51-5 perl-Filter-1.30-6 perl-HTML-Tagset-3.03-30 perl-libwww-perl-5.79-5 perl-libxml-perl-0.07-30 perl-XML-Encoding-1.01-26 perl-Convert-ASN1-0.18-3.WB1 perl-XML-Grove-0.46alpha-27 perl-LDAP-0.31-5.WB1 perl-XML-LibXML-Common-0.13-7 perl-URI-1.30-4 perl-HTML-Parser-3.35-6 perl-XML-Dumper-0.71-2 perl-XML-NamespaceSupport-1.08-6 perl-XML-Twig-3.13-6 perl-XML-LibXML-1.58-1 net-snmp-perl-5.1.2-11 During ./configure, I get: checking for snmpget... /usr/bin/snmpget checking for snmpgetnext... /usr/bin/snmpgetnext configure: WARNING: Tried /usr/bin/perl - install Net::SNMP perl module if you want to use the perl snmp plugins So it still can't find Net::SNMP I wonder why (no other perl+snmp packages are available for WhiteBox Enterprise Linux 4.0)..... /Brian ________________________________ From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of frank.bourdeau at dga.defense.gouv.fr Sent: 6. december 2005 13:24 To: Brian Ipsen Cc: nagios-users at lists.sourceforge.net Subject: [Nagios-users] R?f. : [Nagios-users] Problem with nagios-plugin built RPM Hi, You must intsall the package perl-SNMP from your distro or you must rebuild your nagios-plugins package, because you have installed net::SNMP perl script. Best regards ---- Frank BOURDEAU tel: 0241936560 fax: 0241936681 frank.bourdeau at dga.defense.gouv.fr Brian Ipsen Envoy? par : nagios-users-admin at lists.sourceforge.net 06/12/2005 12:50 Veuillez r?pondre ? Brian Ipsen Pour : cc : Objet : [Nagios-users] Problem with nagios-plugin built RPM Hi, I'm trying to build an RPM with the nagios plugins... Basically, what I do, is: # perl -MCPAN -e 'install Net::SNMP' # /usr/sbin/groupadd nagios # /usr/sbin/useradd -r -d /var/log/nagios -s /bin/sh -c "nagios" -g nagios nagios # cd /usr/src/whitebox/SOURCES # wget http://surfnet.dl.sourceforge.net/sourceforge/nagiosplug/nagios-plugins-1.4. 2.tar.gz # tar xzpf nagios-plugins-1.4.2.tar.gz *.spec # mv -f nagios-plugins-1.4.2/nagios-plugins.spec ../SPECS/ # rm -Rf nagios-plugins-1.4.2 # cd ../SPECS # chown 0.0 nagios-plugins.spec # rpmbuild -ba nagios-plugins.spec # rpm -Uvh ../RPMS/i386/nagios-plugins-1.4.2-1.i386.rpm But this gives me: error: Failed dependencies: perl(Net::SNMP) is needed by nagios-plugins-1.4-2.1.el3.rf.i386 I then check: # perl -MCPAN -e 'install Net::SNMP' CPAN: Storable loaded ok Going to read /root/.cpan/Metadata Database was generated on Tue, 06 Dec 2005 01:01:00 GMT Net::SNMP is up to date. Why is it complaining about Net::SNMP ?? I also tried: # rpm -qa | grep net-snmp net-snmp-libs-5.1.2-11 net-snmp-devel-5.1.2-11 net-snmp-perl-5.1.2-11 net-snmp-utils-5.1.2-11 net-snmp-5.1.2-11 Where is the problem ?? Regards /Brian ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ron.gage at altiris.com Tue Dec 6 15:51:13 2005 From: ron.gage at altiris.com (Ron Gage) Date: Tue, 6 Dec 2005 09:51:13 -0500 Subject: Possible BUG: No Service Notifications - HELP!!! Message-ID: Hi: I have a fairly small Nagios installation (25 hosts, 50 services). I can not get any service notifications to fire at all. None. I get host notifications without any problems, but absolutely no service notifications. Nothing in the notification logs either. The service outage is being properly recorded in the logs, but no notification is firing. I have verified that service notifications are turned on in the config files and that the mail command is properly defined. I have also tried (via the CGI interface) turning notifications off and on for various services and hosts - no difference. The part that really bugs me is that there is nothing in the notification logs when a service problem occurs. Can anyone offer me some guidance on where to look for this? RON GAGE Network Administrator Wise Solutions, Inc. T > +1 734 456 2202 M > +1 248 343 2431 www.altiris.com Security. Compliance. Patch management. IT service management. Altiris solves your most pressing IT issues. www.altiris.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From sghosh at sghosh.org Tue Dec 6 16:29:49 2005 From: sghosh at sghosh.org (Subhendu Ghosh) Date: Tue, 6 Dec 2005 10:29:49 -0500 (EST) Subject: Problem with nagios-plugin built RPM In-Reply-To: <1133879823.14570.TMDA@worf.andebakken.dk> References: <1133879823.14570.TMDA@worf.andebakken.dk> Message-ID: Not all linux distributions include the perl Net::SNMP package. You can download it from CPAN http://search.cpan.org/dist/Net-SNMP/ or if you use DAG's repository http://dag.wieers.com/packages/perl-Net-SNMP/ On Tue, 6 Dec 2005, Brian Ipsen wrote: > Hi, > > I've installed the following SNMP packages: > > net-snmp-libs-5.1.2-11 > net-snmp-5.1.2-11 > net-snmp-utils-5.1.2-11 > net-snmp-devel-5.1.2-11 > net-snmp-perl-5.1.2-11 > > And these packages containing perl in the name: > > perl-5.8.5-12.1 > perl-XML-Parser-2.34-5 > perl-XML-SAX-0.12-7 > perl-Crypt-SSLeay-0.51-5 > perl-Filter-1.30-6 > perl-HTML-Tagset-3.03-30 > perl-libwww-perl-5.79-5 > perl-libxml-perl-0.07-30 > perl-XML-Encoding-1.01-26 > perl-Convert-ASN1-0.18-3.WB1 > perl-XML-Grove-0.46alpha-27 > perl-LDAP-0.31-5.WB1 > perl-XML-LibXML-Common-0.13-7 > perl-URI-1.30-4 > perl-HTML-Parser-3.35-6 > perl-XML-Dumper-0.71-2 > perl-XML-NamespaceSupport-1.08-6 > perl-XML-Twig-3.13-6 > perl-XML-LibXML-1.58-1 > net-snmp-perl-5.1.2-11 > > During ./configure, I get: > > checking for snmpget... /usr/bin/snmpget > checking for snmpgetnext... /usr/bin/snmpgetnext > configure: WARNING: Tried /usr/bin/perl - install Net::SNMP perl module if > you want to use the perl snmp plugins > > So it still can't find Net::SNMP > > I wonder why (no other perl+snmp packages are available for WhiteBox > Enterprise Linux 4.0)..... > > /Brian > > ________________________________ > > From: nagios-users-admin at lists.sourceforge.net > [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of > frank.bourdeau at dga.defense.gouv.fr > Sent: 6. december 2005 13:24 > To: Brian Ipsen > Cc: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] R?f. : [Nagios-users] Problem with > nagios-plugin built RPM > > > > Hi, > > You must intsall the package perl-SNMP from your distro or you must > rebuild your nagios-plugins package, because you have installed net::SNMP > perl script. > > Best regards > > ---- > Frank BOURDEAU > tel: 0241936560 fax: 0241936681 > frank.bourdeau at dga.defense.gouv.fr > > > > Brian Ipsen > Envoy? par : nagios-users-admin at lists.sourceforge.net > > 06/12/2005 12:50 > Veuillez r?pondre ? Brian Ipsen > > > Pour : > cc : > Objet : [Nagios-users] Problem with nagios-plugin > built RPM > > > > Hi, > > I'm trying to build an RPM with the nagios plugins... > > Basically, what I do, is: > > # perl -MCPAN -e 'install Net::SNMP' > # /usr/sbin/groupadd nagios > # /usr/sbin/useradd -r -d /var/log/nagios -s /bin/sh -c "nagios" -g > nagios > nagios > # cd /usr/src/whitebox/SOURCES > # wget > > http://surfnet.dl.sourceforge.net/sourceforge/nagiosplug/nagios-plugins-1.4. > 2.tar.gz > # tar xzpf nagios-plugins-1.4.2.tar.gz *.spec > # mv -f nagios-plugins-1.4.2/nagios-plugins.spec ../SPECS/ > # rm -Rf nagios-plugins-1.4.2 > # cd ../SPECS > # chown 0.0 nagios-plugins.spec > # rpmbuild -ba nagios-plugins.spec > # rpm -Uvh ../RPMS/i386/nagios-plugins-1.4.2-1.i386.rpm > > But this gives me: > > error: Failed dependencies: > perl(Net::SNMP) is needed by > nagios-plugins-1.4-2.1.el3.rf.i386 > > I then check: > > # perl -MCPAN -e 'install Net::SNMP' > CPAN: Storable loaded ok > Going to read /root/.cpan/Metadata > Database was generated on Tue, 06 Dec 2005 01:01:00 GMT > Net::SNMP is up to date. > > Why is it complaining about Net::SNMP ?? > > I also tried: > > # rpm -qa | grep net-snmp > > net-snmp-libs-5.1.2-11 > net-snmp-devel-5.1.2-11 > net-snmp-perl-5.1.2-11 > net-snmp-utils-5.1.2-11 > net-snmp-5.1.2-11 > > Where is the problem ?? > > Regards > > /Brian > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through > log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD > SPLUNK! > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to > /dev/null > > > > > > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_idv37&alloc_id865&op?k > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- From jperrin at gmail.com Tue Dec 6 17:24:12 2005 From: jperrin at gmail.com (Jim Perrin) Date: Tue, 6 Dec 2005 11:24:12 -0500 Subject: Problem with nagios-plugin built RPM In-Reply-To: <1133869840.13016.TMDA@worf.andebakken.dk> References: <1133869840.13016.TMDA@worf.andebakken.dk> Message-ID: <302ce8b50512060824y68fcd1fay948e1db8bf345067@mail.gmail.com> > # perl -MCPAN -e 'install Net::SNMP' > # /usr/sbin/groupadd nagios > # /usr/sbin/useradd -r -d /var/log/nagios -s /bin/sh -c "nagios" -g nagios > nagios > # cd /usr/src/whitebox/SOURCES > # wget > http://surfnet.dl.sourceforge.net/sourceforge/nagiosplug/nagios-plugins-1.4. > 2.tar.gz > # tar xzpf nagios-plugins-1.4.2.tar.gz *.spec > # mv -f nagios-plugins-1.4.2/nagios-plugins.spec ../SPECS/ > # rm -Rf nagios-plugins-1.4.2 > # cd ../SPECS > # chown 0.0 nagios-plugins.spec > # rpmbuild -ba nagios-plugins.spec > # rpm -Uvh ../RPMS/i386/nagios-plugins-1.4.2-1.i386.rpm > > But this gives me: > > error: Failed dependencies: > perl(Net::SNMP) is needed by nagios-plugins-1.4-2.1.el3.rf.i386 > You've stepped outside of RPM in how you installed the perl dependencies. RPM has no way of knowing that they're on your system. That's why you're getting the error. Either install via source, or install via rpm. Don't mix and match, as it'll only screw up your system. Since you're using an rpm based distro, it's best to stick with using rpm to install things, and using the repositories someone else was nice enough to post already. > I then check: > > # perl -MCPAN -e 'install Net::SNMP' > CPAN: Storable loaded ok > Going to read /root/.cpan/Metadata > Database was generated on Tue, 06 Dec 2005 01:01:00 GMT > Net::SNMP is up to date. > > Why is it complaining about Net::SNMP ?? > > I also tried: > > # rpm -qa | grep net-snmp > > net-snmp-libs-5.1.2-11 > net-snmp-devel-5.1.2-11 > net-snmp-perl-5.1.2-11 > net-snmp-utils-5.1.2-11 > net-snmp-5.1.2-11 > > Where is the problem ?? > > Regards > > /Brian > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Jim Perrin System Architect - UIT Ft Gordon & US Army Signal Center ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jpratt at norwich.edu Tue Dec 6 17:37:09 2005 From: jpratt at norwich.edu (Jamie Pratt) Date: Tue, 06 Dec 2005 11:37:09 -0500 Subject: NRPE logging to syslog In-Reply-To: References: Message-ID: <4395BE35.7000603@norwich.edu> Thanks guys... However, i'm still trying to get rid of these, and can't seem to get it right?... Dec 6 11:28:07 mail4 nrpe[23417]: INFO: SSL/TLS initialized. All network traffic will be encrypted. my /etc/xinetd.d/nrpe file is: service nrpe { disable = no flags = REUSE socket_type = stream wait = no user = nagios server = /usr/sbin/nrpe server_args = -c /usr/local/nagios/etc/nrpe.cfg --inetd log_on_failure += USERID log_type = SYSLOG authpriv err only_from = 192.168.10.111 } and my xinetd.conf is: defaults { instances = 60 log_type = SYSLOG authpriv err # log_on_success = HOST PID log_on_failure = HOST cps = 25 30 } (I tried upping the syslog level to authpriv.err to try and get theese INFO messages to go away, but still no luck..? crit,warning,, same deal.. ?) thanks again, jamie Marc Powell wrote: > >>-----Original Message----- >>From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- >>admin at lists.sourceforge.net] On Behalf Of Jamie Pratt >>Sent: Monday, December 05, 2005 3:31 PM >>To: nagios-users at lists.sourceforge.net >>Subject: [Nagios-users] NRPE logging to syslog >> >>Hi. I have debugging set to 0 in nrpe.cfg, yet my /var/log/messages > > and > >>/var/log/secure files are filling up fast with this kind of stuff: >> >>Dec 1 03:20:16 gentoo1 xinetd[5777]: START: nrpe pid=27547 >>from=192.168.10.111 >>Dec 1 03:20:16 gentoo1 xinetd[5777]: START: nrpe pid=27549 >>from=192.168.10.111 > > > [chop] > > >>Any way to shut this off completely so logs stop filling up? > > > This is xinetd logging, not nrpe. Do you have anything like > > log_on_success = PID HOST > > in your nrpe xinetd definition or the defaults? > > -- > Marc > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_idv37&alloc_id865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From pharrison at mainstream-tech.com Tue Dec 6 17:45:28 2005 From: pharrison at mainstream-tech.com (Paul Harrison) Date: Tue, 06 Dec 2005 10:45:28 -0600 Subject: Trying to limit number of service notifications Message-ID: <1133887528.16704.12.camel@pharrison.mainstream-tech.com> Hello all I am needing to limit the number of service notifications that I receive. My understanding is that I must user escalations. I have included the entry in my escalations.cfg. Can anyone point out what I am doing wrong? My host escalations work perfectly. After 3 notifications I get 2 pages and get no more notifications until the recovery. Thank you in advance. Paul Harrison # Serviceescalation definition define serviceescalation{ host_name * service_description * first_notification 3 last_notification 5 contact_groups admins notification_interval 1 } -------------- next part -------------- An HTML attachment was scrubbed... URL: From fernando.shayani at bsb.politec.com.br Tue Dec 6 18:53:20 2005 From: fernando.shayani at bsb.politec.com.br (Fernando Shayani) Date: Tue, 6 Dec 2005 14:53:20 -0300 Subject: Nagios HANGS scheduling info Message-ID: <1964B929D8FF4D4DB06854A75D0FD3730CABCEC7@bsbserv028.bsb.politec.com.br> Well, my configuration is: Inter-check sleep time (sleep_time=0.25) Service inter-check delay method (service_inter_check_delay_method=s) Maximum service check spread (max_service_check_spread=2) Service interleave factor (service_interleave_factor=s) Maximum concurrent service checks (max_concurrent_checks=0) Service reaper frequency (service_reaper_frequency=5) Host inter-check delay method (host_inter_check_delay_method=s) Maximum host check spread (max_host_check_spread=2) Timing interval length (interval_length=60) Agressive host checking option (use_aggressive_host_checking=0) The following options are not set. Auto-rescheduling option Auto-rescheduling interval Auto-rescheduling window I will read the configuration and recheck it all. I also got the SYSLOG line right after que problem. I hope this could help you help me. Dec 6 06:33:24 bsbserv007 nagios: Warning: The check of service 'CPU LOAD' on host 'BSBSERV017' could not be performed due to a fork() error. The check will be rescheduled. Thanks for the help. Fernando Shayani fernando.shayani at bsb.politec.com.br (61) 3038-6951 POLITEC - Bras?lia - DF -----Original Message----- From: Marcel Mitsuto Fucatu Sugano [mailto:msugano at uolinc.com] Sent: segunda-feira, 5 de dezembro de 2005 15:33 To: Fernando Shayani Cc: Eli Stair; nagios-users at lists.sourceforge.net Subject: RE: [Nagios-users] Nagios HANGS scheduling info On Mon, 2005-12-05 at 09:01 -0300, Fernando Shayani wrote: > Well, I upgraded to b6, enabled the Orphaned Service and still > hangs... > > Is there any other clue? > > Fernando Shayani > fernando.shayani at bsb.politec.com.br > (61) 3038-6951 > POLITEC - Bras?lia - DF Have you followed the instructions available at: http://nagios.sourceforge.net/docs/2_0/configmain.html ? Read that and check the following configs: Inter-check sleep time Service inter-check delay method Maximum service check spread Service interleave factor Maximum concurrent service checks Service reaper frequency Host inter-check delay method Maximum host check spread Timing interval length Auto-rescheduling option Auto-rescheduling interval Auto-rescheduling window Agressive host checking option Which are related to the scheduling options, and might be the source of your problem. Have you messed up those variables? Whatever the case, post the values of this configuration variables. HTH, -- Marcel Mitsuto Fucatu Sugano Universo Online S.A. -- http://www.uol.com.br ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Richard.Luys at vanderlet.nl Tue Dec 6 18:02:43 2005 From: Richard.Luys at vanderlet.nl (Richard.Luys at vanderlet.nl) Date: Tue, 6 Dec 2005 18:02:43 +0100 Subject: Configuration modified, not shown in GUI, except for 'View Config'??? Message-ID: <9BC12CEA9D0D27409DB7599F44FDD3574EAA4B@vdlsv00029.vanderlet.nl> Hi All, It's been a while since I posted anything on this list, since I am awfully busy migrating a very messy config on our Nagios server to a new one. But while doing so I ran into a very strange problem. Our situation is as follows: Nagios 1.3 running on a SLES9 64-bit server Configuration is in /usr/local/nagios/etc, and in several configdirectories underneath the etc-directory (specified with the cfg_dir directive in nagios.cfg). Here is what I did: I edited a file called services_web.cfg in /usr/local/nagios/etc/tno. This directory is specified with the user_dir directive in nagios.cfg The modification was very simple: I added 'notifications_enabled 0' to the tempate in this file. I then reloaded the nagios config (/etc/init.d/nagios reload) In the GUI this modification is NOT shown; ie there are no 'notifications disabled' symbols next to the service descriptions, and the 'Service State Information' screen still says 'Service Notifications enabled'. But, when I go to 'View Config' - Services, and then lookup this service, it tells me Notifications enabled is Off!!! When I add the 'notifications_enabled 0' directive to a service_description instead of the template, same situation as above occurs. When I modify the service_description of a service in the same file, and reload Nagios, then both the service_description AND the notifications_enabled setting from the template are correctly shown in the GUI. Another workaround I found is this: comment out the respective cfg_dir line in the nagios.cfg. Reload Nagios, refresh the GUI, then uncomment the line again, reload Nagios and refresh the GUI. Then all notifications_disabled symbols are there, and everything is in order... Not sure if this happened to anyone before, but I thought this might be interesting to know. Please advise whether this should be reported as a bug! Rgrds, Richard Luys The Netherlands This e-mail message and its attachments are subject to the disclaimer published at the following website of VANDERLET: http://www.vanderlet.nl/disclaimer ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jeffwilliams05 at gmail.com Tue Dec 6 19:15:06 2005 From: jeffwilliams05 at gmail.com (Jeff Williams) Date: Tue, 6 Dec 2005 12:15:06 -0600 Subject: Trying to limit number of service notifications In-Reply-To: <1133887528.16704.12.camel@pharrison.mainstream-tech.com> References: <1133887528.16704.12.camel@pharrison.mainstream-tech.com> Message-ID: <17e351010512061015y540b8054jdec370380cce33c@mail.gmail.com> I had a similar question a while back and was told that this can be accomplished by specifying a notification_interval of 0 for your last desired notification. Prior to being told that, I thought that specifying last_notification would be enough, but it was not. In your case something like this would probably work: define serviceescalation{ host_name * service_description * first_notification 3 last_notification 4 contact_groups admins notification_interval 1 } define serviceescalation{ host_name * service_description * first_notification 5 last_notification 5 contact_groups admins notification_interval 0 } Hope this works for you. Jeff On 12/6/05, Paul Harrison wrote: > Hello all > > I am needing to limit the number of service notifications that I receive. > My understanding is that I must user escalations. I have included the > entry in my escalations.cfg. > Can anyone point out what I am doing wrong? My host escalations work > perfectly. After 3 notifications I get 2 pages and get no more > notifications until the recovery. > > Thank you in advance. > > Paul Harrison > > # Serviceescalation definition > define serviceescalation{ > host_name * > service_description * > first_notification 3 > last_notification 5 > contact_groups admins > notification_interval 1 > } > > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mramos at co.sapo.pt Tue Dec 6 19:27:57 2005 From: mramos at co.sapo.pt (Marco Ramos) Date: Tue, 06 Dec 2005 18:27:57 +0000 Subject: Possible BUG: No Service Notifications - HELP!!! In-Reply-To: References: Message-ID: <1133893677.3799.36.camel@localhost.localdomain> Do you have service escalations defined in your escalations.cfg? Do you have notifications_enabled set to 1 in the services you want? How does your services.cfg look like for the services you want to be notified? regards, Marco Ramos On Tue, 2005-12-06 at 09:51 -0500, Ron Gage wrote: > Hi: > > > > I have a fairly small Nagios installation (25 hosts, 50 services). > > > > I can not get any service notifications to fire at all. None. > > > > I get host notifications without any problems, but absolutely no > service notifications. Nothing in the notification logs either. The > service outage is being properly recorded in the logs, but no > notification is firing. > > > > I have verified that service notifications are turned on in the config > files and that the mail command is properly defined. > > > > I have also tried (via the CGI interface) turning notifications off > and on for various services and hosts ? no difference. > > > > The part that really bugs me is that there is nothing in the > notification logs when a service problem occurs. > > > > Can anyone offer me some guidance on where to look for this? > > > > RON GAGE > Network Administrator > Wise Solutions, Inc. > T > +1 734 456 2202 > M > +1 248 343 2431 > www.altiris.com > > Security. Compliance. Patch management. IT service management. > Altiris solves your most pressing IT issues. > www.altiris.com > > > > > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Thomas.Zimmer at oppenheim.de Tue Dec 6 19:32:55 2005 From: Thomas.Zimmer at oppenheim.de (Thomas.Zimmer at oppenheim.de) Date: Tue, 6 Dec 2005 19:32:55 +0100 Subject: AW: Trying to limit number of service notification s Message-ID: <9A834A8DCEC68648B20701E1CDE10925F8493B@OKWPMXS01.de.sal-opp.net> To differ betwenn message 1-3 as email and 4/5 as pages, you?d have to define different contact (one with email-address and another with pager-address), build to different contactgroups, and assign the escalations Paul described to these different groups. i think the escalations should be: define serviceescalation{ host_name * service_description * first_notification 1 last_notification 3 contact_groups email-group notification_interval 1 } define serviceescalation{ host_name * service_description * first_notification 4 last_notification 5 contact_groups pager-group notification_interval 1 } Greetz,Thomas Zimmer Produktservice & Betrieb Betrieb & Support Sal. Oppenheim jr. & Cie., Frankfurt a. Main Internet: http://www.oppenheim.de E-Mail: thomas.zimmer at oppenheim.de -----Urspr?ngliche Nachricht----- Von: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] Im Auftrag von Paul Harrison Gesendet: Dienstag, 6. Dezember 2005 17:45 An: nagios-users at lists.sourceforge.net Betreff: [Nagios-users] Trying to limit number of service notifications Hello all I am needing to limit the number of service notifications that I receive. My understanding is that I must user escalations. I have included the entry in my escalations.cfg. Can anyone point out what I am doing wrong? My host escalations work perfectly. After 3 notifications I get 2 pages and get no more notifications until the recovery. Thank you in advance. Paul Harrison # Serviceescalation definition define serviceescalation{ host_name * service_description * first_notification 3 last_notification 5 contact_groups admins notification_interval 1 } -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: Zimmer, Thomas (Produktservices und Betrieb).vcf Type: application/octet-stream Size: 254 bytes Desc: not available URL: From mramos at co.sapo.pt Tue Dec 6 19:42:48 2005 From: mramos at co.sapo.pt (Marco Ramos) Date: Tue, 06 Dec 2005 18:42:48 +0000 Subject: Nagios HANGS scheduling info In-Reply-To: <1964B929D8FF4D4DB06854A75D0FD3730CABCEC7@bsbserv028.bsb.politec.com.br> References: <1964B929D8FF4D4DB06854A75D0FD3730CABCEC7@bsbserv028.bsb.politec.com.br> Message-ID: <1133894570.3799.39.camel@localhost.localdomain> Try to tune your service_reaper_frequency and max_concurrent_checks values. Take a look at http://nagios.org/faqs/viewfaq.php?faq_id=115. Had the same problem a while ago and managed to solve it tunning this two options. regards, Marco Ramos On Tue, 2005-12-06 at 14:53 -0300, Fernando Shayani wrote: > Well, my configuration is: > > Inter-check sleep time (sleep_time=0.25) > Service inter-check delay method (service_inter_check_delay_method=s) > Maximum service check spread (max_service_check_spread=2) > Service interleave factor (service_interleave_factor=s) > Maximum concurrent service checks (max_concurrent_checks=0) > Service reaper frequency (service_reaper_frequency=5) > Host inter-check delay method (host_inter_check_delay_method=s) > Maximum host check spread (max_host_check_spread=2) > Timing interval length (interval_length=60) > Agressive host checking option (use_aggressive_host_checking=0) > > The following options are not set. > Auto-rescheduling option > Auto-rescheduling interval > Auto-rescheduling window > > > I will read the configuration and recheck it all. > > I also got the SYSLOG line right after que problem. I hope this could help you help me. > > Dec 6 06:33:24 bsbserv007 nagios: Warning: The check of service 'CPU LOAD' on host 'BSBSERV017' could not be performed due to a fork() error. The check will be rescheduled. > > > Thanks for the help. > > > Fernando Shayani > fernando.shayani at bsb.politec.com.br > (61) 3038-6951 > POLITEC - Bras?lia - DF > > -----Original Message----- > From: Marcel Mitsuto Fucatu Sugano [mailto:msugano at uolinc.com] > Sent: segunda-feira, 5 de dezembro de 2005 15:33 > To: Fernando Shayani > Cc: Eli Stair; nagios-users at lists.sourceforge.net > Subject: RE: [Nagios-users] Nagios HANGS scheduling info > > On Mon, 2005-12-05 at 09:01 -0300, Fernando Shayani wrote: > > Well, I upgraded to b6, enabled the Orphaned Service and still > > hangs... > > > > Is there any other clue? > > > > Fernando Shayani > > fernando.shayani at bsb.politec.com.br > > (61) 3038-6951 > > POLITEC - Bras?lia - DF > > Have you followed the instructions available at: > http://nagios.sourceforge.net/docs/2_0/configmain.html ? > > Read that and check the following configs: > Inter-check sleep time > Service inter-check delay method > Maximum service check spread > Service interleave factor > Maximum concurrent service checks > Service reaper frequency > Host inter-check delay method > Maximum host check spread > Timing interval length > Auto-rescheduling option > Auto-rescheduling interval > Auto-rescheduling window > > Agressive host checking option > > Which are related to the scheduling options, and might be the source of your problem. Have you messed up those variables? Whatever the case, post the values of this configuration variables. > > HTH, > -- > Marcel Mitsuto Fucatu Sugano Universo Online S.A. -- http://www.uol.com.br > > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_idv37&alloc_id865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From hvdkooij at vanderkooij.org Tue Dec 6 19:53:46 2005 From: hvdkooij at vanderkooij.org (Hugo van der Kooij) Date: Tue, 6 Dec 2005 19:53:46 +0100 (CET) Subject: Internal server error In-Reply-To: <953BA938-FE61-4FC0-8064-16E089073EEC@rocq.qc.ca> References: <3F1C094F-5AEC-40AF-94FB-A7E50A1F23C4@rocq.qc.ca> <953BA938-FE61-4FC0-8064-16E089073EEC@rocq.qc.ca> Message-ID: On Tue, 6 Dec 2005, Charles Guimont wrote: > I can't seam to use user nobody, I get an error of, user is not > available If you run apache as nobody thn you are in for a heap of trouble with Nagios. You do not want to give too much access to nobody. After all it IS nobody! Hugo. -- I hate duplicates. Just reply to the relevant mailinglist. hvdkooij at vanderkooij.org http://hvdkooij.xs4all.nl/ Don't meddle in the affairs of magicians, for they are subtle and quick to anger. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From cguimont at rocq.qc.ca Tue Dec 6 20:14:52 2005 From: cguimont at rocq.qc.ca (Charles Guimont) Date: Tue, 6 Dec 2005 14:14:52 -0500 Subject: Internal server error In-Reply-To: References: <3F1C094F-5AEC-40AF-94FB-A7E50A1F23C4@rocq.qc.ca> <953BA938-FE61-4FC0-8064-16E089073EEC@rocq.qc.ca> Message-ID: Yes, but it doesn't solve my problem. Many trying to change the apache username! Charles On 05-12-06, at 13:53, Hugo van der Kooij wrote: > On Tue, 6 Dec 2005, Charles Guimont wrote: > >> I can't seam to use user nobody, I get an error of, user is not >> available > > If you run apache as nobody thn you are in for a heap of trouble with > Nagios. You do not want to give too much access to nobody. After > all it IS > nobody! > > Hugo. > > -- > I hate duplicates. Just reply to the relevant mailinglist. > hvdkooij at vanderkooij.org http://hvdkooij.xs4all.nl/ > Don't meddle in the affairs of magicians, > for they are subtle and quick to anger. > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through > log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD > SPLUNK! > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ron.gage at altiris.com Tue Dec 6 20:29:24 2005 From: ron.gage at altiris.com (Ron Gage) Date: Tue, 6 Dec 2005 14:29:24 -0500 Subject: BUG Confirmed: No Service Notifications Message-ID: Well, I got the problem fixed. Thanks to everyone who provided helpful hints. The problem here was that there was no notification_options in the service definitions in the config file. The bug here is that notification_options is required in service definitions - yet the config passed the syntax check without incident. Either the syntax check is wrong for not noticing that a required statement was missing or the documentation is wrong for showing that the statement is required. RON GAGE Network Administrator Wise Solutions, Inc. T > +1 734 456 2202 M > +1 248 343 2431 www.altiris.com Security. Compliance. Patch management. IT service management. Altiris solves your most pressing IT issues. www.altiris.com ________________________________ From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Ron Gage Sent: Tuesday, December 06, 2005 9:51 AM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] Possible BUG: No Service Notifications - HELP!!! Hi: I have a fairly small Nagios installation (25 hosts, 50 services). I can not get any service notifications to fire at all. None. I get host notifications without any problems, but absolutely no service notifications. Nothing in the notification logs either. The service outage is being properly recorded in the logs, but no notification is firing. I have verified that service notifications are turned on in the config files and that the mail command is properly defined. I have also tried (via the CGI interface) turning notifications off and on for various services and hosts - no difference. The part that really bugs me is that there is nothing in the notification logs when a service problem occurs. Can anyone offer me some guidance on where to look for this? RON GAGE Network Administrator Wise Solutions, Inc. T > +1 734 456 2202 M > +1 248 343 2431 www.altiris.com Security. Compliance. Patch management. IT service management. Altiris solves your most pressing IT issues. www.altiris.com -------------- next part -------------- An HTML attachment was scrubbed... URL: From p.mayers at imperial.ac.uk Tue Dec 6 21:02:36 2005 From: p.mayers at imperial.ac.uk (Phil Mayers) Date: Tue, 06 Dec 2005 20:02:36 +0000 Subject: empty hostgroups forbidden in 2.0b6 Message-ID: <4395EE5C.6010701@imperial.ac.uk> moving from 2.0b3 "[root at monhost nagios]# /usr/local/nagios/bin/nagios -v /usr/local/nagios/etc/nagios.cfg -h Nagios 2.0b6 Copyright (c) 1999-2005 Ethan Galstad (http://www.nagios.org) Last Modified: 11-30-2005 License: GPL Reading configuration data... Error: Hostgroup has no members (config file '/usr/local/nagios/etc/dynamic/ic.cfg', starting on line 161) """ To which I'd reply - so what? This is particularly annoying for us. We (re)build the nagios configuration every 5 minutes (and HUP if diff says so) from an extensive SQL database containing routers, switches, hosts, networks, routing topology and so on. The hostgroups in question are per-building and per-wiring-centre. They're just extracted as a list and added to the config file. If I have to only add ones which actually contain stuff, I'll have to keep a list of what was touched whilst building the other devices and then append them - this means each time I add a new device type to the export script, I have to remember to include the code. Annoying. Is there a practical reason for this change? Or would a patch/revert be accepted? ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From hvdkooij at vanderkooij.org Tue Dec 6 21:06:56 2005 From: hvdkooij at vanderkooij.org (Hugo van der Kooij) Date: Tue, 6 Dec 2005 21:06:56 +0100 (CET) Subject: empty hostgroups forbidden in 2.0b6 In-Reply-To: <4395EE5C.6010701@imperial.ac.uk> References: <4395EE5C.6010701@imperial.ac.uk> Message-ID: On Tue, 6 Dec 2005, Phil Mayers wrote: > moving from 2.0b3 > > "[root at monhost nagios]# /usr/local/nagios/bin/nagios -v > /usr/local/nagios/etc/nagios.cfg -h > > Nagios 2.0b6 > Copyright (c) 1999-2005 Ethan Galstad (http://www.nagios.org) > Last Modified: 11-30-2005 > License: GPL > > Reading configuration data... > > Error: Hostgroup has no members (config file > '/usr/local/nagios/etc/dynamic/ic.cfg', starting on line 161) > > """ > > To which I'd reply - so what? It was documented as a REQUIRED field from the start of v2.0 so you just got away because there was a bug untill now. Hugo. -- I hate duplicates. Just reply to the relevant mailinglist. hvdkooij at vanderkooij.org http://hvdkooij.xs4all.nl/ Don't meddle in the affairs of magicians, for they are subtle and quick to anger. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sloane at ku.edu Tue Dec 6 21:43:21 2005 From: sloane at ku.edu (Sloane, Robert Raymond) Date: Tue, 6 Dec 2005 14:43:21 -0600 Subject: empty hostgroups forbidden in 2.0b6 Message-ID: <9632C3DAB675864EB0A9B7724D85DB7901660DF1@MAILBOXSEVEN.home.ku.edu> > It was documented as a REQUIRED field from the start of v2.0 > so you just got away because there was a bug untill now. I would argue that there is absolutly no reason to make it required now that hostgroups can be specified in the host definition, and that there are good reasons to NOT make it required. Otherwise hosts will either end up being listed twice, once in the hostgroups members and once in the host definition, or you won't be able to list them in the host definition, a significant advantage of version 2. -- Bob Sloane, University of Kansas Computer Center, Lawrence, KS, 66045 Email:sloane at ku.edu http://www.ku.edu/home/sloane Phone:(785)864-0444 ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From p.mayers at imperial.ac.uk Tue Dec 6 21:50:47 2005 From: p.mayers at imperial.ac.uk (Phil Mayers) Date: Tue, 06 Dec 2005 20:50:47 +0000 Subject: empty hostgroups forbidden in 2.0b6 In-Reply-To: References: <4395EE5C.6010701@imperial.ac.uk> Message-ID: <4395F9A7.4090504@imperial.ac.uk> Hugo van der Kooij wrote: > On Tue, 6 Dec 2005, Phil Mayers wrote: > >> moving from 2.0b3 >> >> "[root at monhost nagios]# /usr/local/nagios/bin/nagios -v >> /usr/local/nagios/etc/nagios.cfg -h >> >> Nagios 2.0b6 >> Copyright (c) 1999-2005 Ethan Galstad (http://www.nagios.org) >> Last Modified: 11-30-2005 >> License: GPL >> >> Reading configuration data... >> >> Error: Hostgroup has no members (config file >> '/usr/local/nagios/etc/dynamic/ic.cfg', starting on line 161) >> >> """ >> >> To which I'd reply - so what? > > It was documented as a REQUIRED field from the start of v2.0 so you just > got away because there was a bug untill now. The docs say: """members: This is a list of the short names of hosts that should be included in this group. Multiple host names should be separated by commas. This directive may be used as an alternative to (or in addition to) the hostgroups directive in host definitions.""" ...so presumably you could skip "members" and set "hostgroups" on a host and it would all be ok - and of course the "hostgroups" attribute of a "host" is not required at all. A more accurate bit of text would be """all hostgroups must have 1 or more members, either via the "member" attribute of the group, or the "hostgroups" attribute of one or more hosts""" However, I still think it's unhelpful to mandate members, and makes it considerably more burdensome to build configs automatically *especially* if you want to use include directories to merge an automatically generated config and a static "extra specials" config (which may want to refer to e.g. a building/wiring centre group in the dynamic config) So, is there a specific architectural reason why a hostgroup must have members? ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From cdrice at pobox.com Tue Dec 6 22:40:48 2005 From: cdrice at pobox.com (Charles Dee Rice) Date: Tue, 6 Dec 2005 13:40:48 -0800 (PST) Subject: Nagios log file format v1 to v2? In-Reply-To: <20051206041109.C0B551371D@sc8-sf-spam2.sourceforge.net> References: <20051206041109.C0B551371D@sc8-sf-spam2.sourceforge.net> Message-ID: <20051206214048.86667.qmail@web53902.mail.yahoo.com> I upgraded from nagios-1 to nagios-2 several months ago, and would like to migrate my old log files to be usable for availability reporting (avail.cgi, etc.). The simple step of moving the old files into the archive directory didn't quite work out for me, though... :/ Whenever I try attempt to generate any reports using date ranges which fall in the older v-1 logs, avail.cgi segfaults. Any ideas? How have others handled migrating historical host and service data from v1 to v2? Thanks - Chuck ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From drussell at mpifix.com Tue Dec 6 23:28:27 2005 From: drussell at mpifix.com (Danny Russell) Date: Tue, 6 Dec 2005 15:28:27 -0700 Subject: Customer 3rd Party Plug-in Dev - Not passing back ; instead getting back : Message-ID: <028B223BB24E5443A9784BBE1CC5EBE076979C@Lisa.mpifix.com> Has anyone noticed this peculiarity? I have a custom plug-in that outputs - Uptime=3330;LastInstall=9-12-2005;LastUpdate=10-12-2005 When I look at the detail page it says the same output, but all ; (semi colons) have been converted into : (colons) Is this standard behavior? Are semi-colons an allow char to have in an output string? I am not trying to use this to gather performance data, just that is the output of the plug-in. -------------- next part -------------- An HTML attachment was scrubbed... URL: From RNathan at baxglobal.com Wed Dec 7 00:08:41 2005 From: RNathan at baxglobal.com (Ranga Nathan) Date: Tue, 6 Dec 2005 15:08:41 -0800 Subject: SuSE9 RPM for nagios 2x? Message-ID: I would like to try the latest nagios beta build but I do not have an RPM for SuSE 9. If someone has an RPM for SuSE9 for intel, please send me a link / copy. Yes, I could build from tar.gz distro but I would like to maintain some consistency and the option to uninstall if needed. Thanks __________________________________________ Ranga Nathan / CSG Systems Programmer - Specialist; Technical Services; BAX Global Inc. Irvine-California Tel: 714-442-7591 Fax: 714-442-2840 -------------- next part -------------- An HTML attachment was scrubbed... URL: From teng at dataway.com Wed Dec 7 00:16:59 2005 From: teng at dataway.com (Tedman Eng) Date: Tue, 6 Dec 2005 15:16:59 -0800 Subject: Configuration modified, not shown in GUI, exce pt for 'View Config'??? Message-ID: <37ED92F9890FAF4BB947613C66FF8B1A08BB2C68@dw-mail.dataway.com> I believe the problem you are expeiencing is due to retention. A quick way is to delete or rename the entire status file (status.sav) A less quick way is delete all lines in the status file (status.sav)that correspond to the host. There will be HOST: and SERVICE: entries, make sure to yank all of them. NOTE: If the host changes state while you're doing either of these methods, you'll get messed up reports for that host. > -----Original Message----- > From: Richard.Luys at vanderlet.nl [mailto:Richard.Luys at vanderlet.nl] > Sent: Tuesday, December 06, 2005 9:03 AM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] Configuration modified, not shown in > GUI, except > for 'View Config'??? > > > Hi All, > > It's been a while since I posted anything on this list, since I am > awfully busy migrating a very messy config on our Nagios > server to a new > one. But while doing so I ran into a very strange problem. > > Our situation is as follows: > Nagios 1.3 running on a SLES9 64-bit server > Configuration is in /usr/local/nagios/etc, and in several > configdirectories underneath the etc-directory (specified with the > cfg_dir directive in nagios.cfg). > > Here is what I did: > I edited a file called services_web.cfg in /usr/local/nagios/etc/tno. > This directory is specified with the user_dir directive in nagios.cfg > The modification was very simple: I added 'notifications_enabled > 0' to the tempate in this file. > I then reloaded the nagios config (/etc/init.d/nagios reload) > In the GUI this modification is NOT shown; ie there are no > 'notifications disabled' symbols next to the service descriptions, and > the 'Service State Information' screen still says 'Service > Notifications > enabled'. > > But, when I go to 'View Config' - Services, and then lookup this > service, it tells me Notifications enabled is Off!!! > > When I add the 'notifications_enabled 0' directive to a > service_description instead of the template, same situation as above > occurs. When I modify the service_description of a service in the same > file, and reload Nagios, then both the service_description AND the > notifications_enabled setting from the template are correctly shown in > the GUI. > > Another workaround I found is this: comment out the respective cfg_dir > line in the nagios.cfg. Reload Nagios, refresh the GUI, then uncomment > the line again, reload Nagios and refresh the GUI. Then all > notifications_disabled symbols are there, and everything is > in order... > > Not sure if this happened to anyone before, but I thought > this might be > interesting to know. > > Please advise whether this should be reported as a bug! > > Rgrds, > Richard Luys > The Netherlands > > This e-mail message and its attachments are subject to the > disclaimer published at the following website of VANDERLET: http://www.vanderlet.nl/disclaimer ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From teng at dataway.com Wed Dec 7 00:32:16 2005 From: teng at dataway.com (Tedman Eng) Date: Tue, 6 Dec 2005 15:32:16 -0800 Subject: empty hostgroups forbidden in 2.0b6 Message-ID: <37ED92F9890FAF4BB947613C66FF8B1A08BB2C69@dw-mail.dataway.com> I'd agree with the original poster. I see many administrative uses of allowing empty groups (and groupless objects). Perhaps rather than an Error, it could be changed to Warning? This would provide feedback for those troubleshooting or learning to configure nagios, but would allow a wider degree of freedom in configuration. > -----Original Message----- > From: Phil Mayers [mailto:p.mayers at imperial.ac.uk] > Sent: Tuesday, December 06, 2005 12:51 PM > To: Nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] empty hostgroups forbidden in 2.0b6 > > > Hugo van der Kooij wrote: > > On Tue, 6 Dec 2005, Phil Mayers wrote: > > > >> moving from 2.0b3 > >> > >> "[root at monhost nagios]# /usr/local/nagios/bin/nagios -v > >> /usr/local/nagios/etc/nagios.cfg -h > >> > >> Nagios 2.0b6 > >> Copyright (c) 1999-2005 Ethan Galstad (http://www.nagios.org) > >> Last Modified: 11-30-2005 > >> License: GPL > >> > >> Reading configuration data... > >> > >> Error: Hostgroup has no members (config file > >> '/usr/local/nagios/etc/dynamic/ic.cfg', starting on line 161) > >> > >> """ > >> > >> To which I'd reply - so what? > > > > It was documented as a REQUIRED field from the start of > v2.0 so you just > > got away because there was a bug untill now. > > The docs say: > > """members: This is a list of the short names of hosts that > should be > included in this group. Multiple host names should be separated by > commas. This directive may be used as an alternative to (or > in addition > to) the hostgroups directive in host definitions.""" > > ...so presumably you could skip "members" and set > "hostgroups" on a host > and it would all be ok - and of course the "hostgroups" > attribute of a > "host" is not required at all. > > A more accurate bit of text would be """all hostgroups must have 1 or > more members, either via the "member" attribute of the group, or the > "hostgroups" attribute of one or more hosts""" > > However, I still think it's unhelpful to mandate members, and > makes it > considerably more burdensome to build configs automatically > *especially* > if you want to use include directories to merge an automatically > generated config and a static "extra specials" config (which > may want to > refer to e.g. a building/wiring centre group in the dynamic config) > > So, is there a specific architectural reason why a hostgroup > must have > members? > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep > through log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. > DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS > when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From hvdkooij at vanderkooij.org Wed Dec 7 07:37:19 2005 From: hvdkooij at vanderkooij.org (Hugo van der Kooij) Date: Wed, 7 Dec 2005 07:37:19 +0100 (CET) Subject: SuSE9 RPM for nagios 2x? In-Reply-To: References: Message-ID: On Tue, 6 Dec 2005, Ranga Nathan wrote: > I would like to try the latest nagios beta build but I do not have an RPM > for SuSE 9. If someone has an RPM for SuSE9 for intel, please send me a > link / copy. > > Yes, I could build from tar.gz distro but I would like to maintain some > consistency and the option to uninstall if needed. The tar file comes with it's own spec file. So it should be peanuts to build your own with: rpmbuild -ta .......tar.gz Hugo. -- I hate duplicates. Just reply to the relevant mailinglist. hvdkooij at vanderkooij.org http://hvdkooij.xs4all.nl/ Don't meddle in the affairs of magicians, for they are subtle and quick to anger. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From grover1711 at gmail.com Wed Dec 7 08:56:18 2005 From: grover1711 at gmail.com (ankush grover) Date: Wed, 7 Dec 2005 13:26:18 +0530 Subject: not able to get the status of some services from the remote machine even with net-snmp running Message-ID: <5f638b360512062356v2c1d6566tc258e2f17f8cd4bd@mail.gmail.com> hey friends, I am not an expert in Nagios.I have configured Nagios to monitor few hosts in my Network. There is one FC3 server on which few services like (mysql,dns,apache,postfix) etc are running on that server.But through nagios I am not able to get the information about the disk & swap status from the remote machine even though snmp & snmptrapd services are running on this server. services.cfg define service{ host_name mail service_description swapspacechecking check_command check_swap max_check_attempts 5 normal_check_interval 5 retry_check_interval 3 check_period nonworkhours notification_interval 30 notification_period nonworkhours notification_options w,c,r contact_groups linux-admins } define service{ host_name mail service_description disk_checking_mail check_command check_local_disk_mail max_check_attempts 5 normal_check_interval 5 retry_check_interval 3 check_period nonworkhours notification_interval 30 notification_period nonworkhours notification_options w,c,r contact_groups linux-admins } checkcommands.cfg file define command{ command_name check_local_disk_mail command_line $USER1$/check_disk -w 10% -c 5% -p /var -p /usr -p /home -p / -p /opt } define command{ command_name check_swap command_line $USER1$/check_swap -w 35% -c 15% } In both the above cases the nagios is showing the status for the local machine on which the nagios is running not for the remote machine. But plugins for apache ,local load ,mysql on mail machine are working fine. Does local_disk and check_swap plugins requires some special configuration. Moreover I would like to monitor postfix(mailq) running on the machine.WhatI have to do to the get status of mailq ? The name of the machine which I am monitoring is mail and the machine name on which nagios is running is cluster1. Please guide me. Thanks & Regards Ankush Grover -------------- next part -------------- An HTML attachment was scrubbed... URL: From Thomas.Zimmer at oppenheim.de Wed Dec 7 09:19:30 2005 From: Thomas.Zimmer at oppenheim.de (Thomas.Zimmer at oppenheim.de) Date: Wed, 7 Dec 2005 09:19:30 +0100 Subject: AW: not able to get the status of some services fr om the remote machine even with net-snmp running Message-ID: <9A834A8DCEC68648B20701E1CDE10925F8493C@OKWPMXS01.de.sal-opp.net> as the check-name "check_LOCAL_disk_mail" says, these two are local check. Many checks are not able to check a host resp. a service remote via a tcp-connection o.sim. To execute such services on a remote machine, you?ll have to use nrpe, nsca or ssh (maybe there are more possib.). The service checks you want to exec. hav to reside on the remote machine. Assuming you have running ssh-services (configuring nrpe is a bit more complicated): # command for executing checks via ssh define command { command_name check_by_ssh command_line $USER1$/check_by_ssh -H $HOSTADDRESS$ -C ".//$ARG1$ $ARG2$" } # the remotecheck define service { host_name mail service_description check_swap_on_mail check_command check_by_ssh!check_local_disk_mail! max_check_attempts 5 normal_check_interval 5 retry_check_interval 3 check_period nonworkhours notification_interval 30 notification_period nonworkhours notification_options w,c,r contact_groups linux-admins } This should work. NRPE function in a similar way; but on the remote host a daemon is running, and the checks that may be executed on the remote host are configured on the remote host. NSCA inverts the direction of getting check results; you?ll have to compile a "small nagios", configure all checks for the machine locally, and then you can pass all results as "passive chak results" via the "send_nsca" plugin to your central nagios-server. I think the nrpe alternative has greater performance as executing checks via ssh, but you lose the ability of fully central configuration. i never used nsca myself. Greetz, Thomas Zimmer Produktservice & Betrieb Betrieb & Support Sal. Oppenheim jr. & Cie., Frankfurt a. Main Internet: http://www.oppenheim.de E-Mail: thomas.zimmer at oppenheim.de -----Urspr?ngliche Nachricht----- Von: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] Im Auftrag von ankush grover Gesendet: Mittwoch, 7. Dezember 2005 08:56 An: nagios-users at lists.sourceforge.net Betreff: [Nagios-users] not able to get the status of some services from the remote machine even with net-snmp running hey friends, I am not an expert in Nagios.I have configured Nagios to monitor few hosts in my Network. There is one FC3 server on which few services like (mysql,dns,apache,postfix) etc are running on that server.But through nagios I am not able to get the information about the disk & swap status from the remote machine even though snmp & snmptrapd services are running on this server. services.cfg define service{ host_name mail service_description swapspacechecking check_command check_swap max_check_attempts 5 normal_check_interval 5 retry_check_interval 3 check_period nonworkhours notification_interval 30 notification_period nonworkhours notification_options w,c,r contact_groups linux-admins } define service{ host_name mail service_description disk_checking_mail check_command check_local_disk_mail max_check_attempts 5 normal_check_interval 5 retry_check_interval 3 check_period nonworkhours notification_interval 30 notification_period nonworkhours notification_options w,c,r contact_groups linux-admins } checkcommands.cfg file define command{ command_name check_local_disk_mail command_line $USER1$/check_disk -w 10% -c 5% -p /var -p /usr -p /home -p / -p /opt } define command{ command_name check_swap command_line $USER1$/check_swap -w 35% -c 15% } In both the above cases the nagios is showing the status for the local machine on which the nagios is running not for the remote machine. But plugins for apache ,local load ,mysql on mail machine are working fine. Does local_disk and check_swap plugins requires some special configuration. Moreover I would like to monitor postfix(mailq) running on the machine.What I have to do to the get status of mailq ? The name of the machine which I am monitoring is mail and the machine name on which nagios is running is cluster1. Please guide me. Thanks & Regards Ankush Grover -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: Zimmer, Thomas (Produktservices und Betrieb).vcf Type: application/octet-stream Size: 254 bytes Desc: not available URL: From grover1711 at gmail.com Wed Dec 7 09:52:51 2005 From: grover1711 at gmail.com (ankush grover) Date: Wed, 7 Dec 2005 14:22:51 +0530 Subject: not able to get the status of some services fr om the remote machine even with net-snmp running In-Reply-To: <9A834A8DCEC68648B20701E1CDE10925F8493C@OKWPMXS01.de.sal-opp.net> References: <9A834A8DCEC68648B20701E1CDE10925F8493C@OKWPMXS01.de.sal-opp.net> Message-ID: <5f638b360512070052i4106f36cq1ee2d4dbae2d50b@mail.gmail.com> On 12/7/05, Thomas.Zimmer at oppenheim.de wrote: > > as the check-name "check_LOCAL_disk_mail" says, these two are local check. > Many checks are not able to check a host resp. a service remote via a > tcp-connection o.sim. > > To execute such services on a remote machine, you?ll have to use nrpe, > nsca or ssh (maybe there are more possib.). The service checks you want to > exec. hav to reside on the remote machine. > > Assuming you have running ssh-services (configuring nrpe is a bit more > complicated): > > # command for executing checks via ssh > define command { > command_name check_by_ssh > command_line $USER1$/check_by_ssh -H > $HOSTADDRESS$ -C ".//$ARG1$ $ARG2$" > } > # the remotecheck > define service { > host_name mail > service_description check_swap_on_mail > check_command check_by_ssh! > check_local_disk_mail! > max_check_attempts 5 > normal_check_interval 5 > retry_check_interval 3 > check_period nonworkhours > notification_interval 30 > notification_period nonworkhours > notification_options w,c,r > contact_groups linux-admins > } > > This should work. NRPE function in a similar way; but on the remote host a > daemon is running, and the checks that may be executed on the remote host > are configured on the remote host. > > NSCA inverts the direction of getting check results; you?ll have to > compile a "small nagios", configure all checks for the machine locally, and > then you can pass all results as "passive chak results" via the "send_nsca" > plugin to your central nagios-server. > > I think the nrpe alternative has greater performance as executing checks > via ssh, but you lose the ability of fully central configuration. i never > used nsca myself. > > Greetz, > > *Thomas Zimmer* > Produktservice & Betrieb > Betrieb & Support > Sal. Oppenheim jr. & Cie., Frankfurt a. Main > hey Mr. Thomas, Thanks for your guidance. Thanks & Regards Ankush Grover -------------- next part -------------- An HTML attachment was scrubbed... URL: From kvaes at vangenechten.com Wed Dec 7 10:58:01 2005 From: kvaes at vangenechten.com (Karim Vaes) Date: Wed, 07 Dec 2005 10:58:01 +0100 Subject: Nagios: cgi interface regurally gives errors (nfs related?) Message-ID: Hello, We'eve setup nagios so that the nagios directory (/usr/local/nagios) is symlinked to a shared directory on a NAS. This share is made by NFS (/etc/fstab => 10.1.1.11:/vol/vol0/shared /nasdata nfs proto=udp,hard,nolock,intr,suid,rsize=32768,wsize=32768). The reason we did this is, so that when the first nagios server fails, the second would be able to take over while having the same configs & status files. Yet along with this migration (from disk to shared storage) we upgraded to the latest release (1.2 -> 2.05b). Now from time to time (every ~10min) we get errors in the CGI, whilest the nagios daemon appaers to be working as it should. Errors variate from "internal server error" to "could not read status file". Anyone an idea how to fix this? Regards Karim -------------- next part -------------- An embedded and charset-unspecified text was scrubbed... Name: InterScan_Disclaimer.txt URL: From sudheer at tgs-solutions.com Wed Dec 7 15:49:10 2005 From: sudheer at tgs-solutions.com (Sudheer Muddappa) Date: Wed, 07 Dec 2005 09:49:10 -0500 Subject: CHecking log files for errors warnings on a remote server In-Reply-To: References: Message-ID: <4396F666.8040904@tgs-solutions.com> Hi guys, Is there any plug-in in Nagios to check the tail of logfile for errors on a remote machine? Thanks, Sudheer > > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From james at jameswhite.org Wed Dec 7 15:49:33 2005 From: james at jameswhite.org (James S. White) Date: Wed, 7 Dec 2005 08:49:33 -0600 (CST) Subject: nagios process count increasing Message-ID: I am running nagios in an distributed environment and when I start nagios on the central server I see an increasing number of nagios processes with the parent process of init. $ ps -ef | grep na[g]ios | awk '{if($3==1){print $0}}' | wc -l 129 $ ps -ef | grep na[g]ios | awk '{if($3==1){print $0}}' | wc -l 141 $ ps -ef | grep na[g]ios | awk '{if($3==1){print $0}}' | wc -l 155 This keeps growing until the server goes down. I just upgraded to 2.0b6 but was seeing this in 2.0b4 as well. The distributed server does not see this. Any ideas? ----------------------------------------------------------------------- James S. White primary/voip: (615) 469-0268 928 Mountain Valley Dr. .O. mobile: (256) 476-2619 Nashville, TN 37209 ..O work: (615) 341-8218 http://www.jameswhite.org OOO work-mobile: (615) 519-5160 james at jameswhite.org fax: (866) 260-5465 ----------------------------------------------------------------------- "Computers are useless. They only give you answers." -- Pablo Picasso ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From morten.bekkelund at ergo.no Wed Dec 7 15:58:20 2005 From: morten.bekkelund at ergo.no (Morten Bekkelund) Date: Wed, 7 Dec 2005 15:58:20 +0100 Subject: nagios process count increasing Message-ID: <8C6D7826E62F404D94BB0197258F548BBC8E36@100NOOSLMSG004.common.alpharoot.net> We've had the same problems. Earlier when we used active checks from the topnode in our distributed system, we had to restart nagios when the number of nagios processes went too high (or the server would crash). We had a script that restarted nagios whenever the number got too high. Lately we reconfigured our solution to using passive host checks and it solved the problem. Seems the topnode gets too busy with active checks. A possibility would be to let the leafnodes run the active checks and then send the results to the topnode via nsca by using performance data. morten -----Original Message----- From: James S. White [mailto:james at jameswhite.org] Sent: 7. desember 2005 15:50 To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] nagios process count increasing I am running nagios in an distributed environment and when I start nagios on the central server I see an increasing number of nagios processes with the parent process of init. $ ps -ef | grep na[g]ios | awk '{if($3==1){print $0}}' | wc -l 129 $ ps -ef | grep na[g]ios | awk '{if($3==1){print $0}}' | wc -l 141 $ ps -ef | grep na[g]ios | awk '{if($3==1){print $0}}' | wc -l 155 This keeps growing until the server goes down. I just upgraded to 2.0b6 but was seeing this in 2.0b4 as well. The distributed server does not see this. Any ideas? ----------------------------------------------------------------------- James S. White primary/voip: (615) 469-0268 928 Mountain Valley Dr. .O. mobile: (256) 476-2619 Nashville, TN 37209 ..O work: (615) 341-8218 http://www.jameswhite.org OOO work-mobile: (615) 519-5160 james at jameswhite.org fax: (866) 260-5465 ----------------------------------------------------------------------- "Computers are useless. They only give you answers." -- Pablo Picasso ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Alethad at unch.unc.edu Wed Dec 7 16:16:11 2005 From: Alethad at unch.unc.edu (Chrietzberg, Aletha) Date: Wed, 7 Dec 2005 10:16:11 -0500 Subject: help on AIX 5.2 Message-ID: Hey guys, Here's what I'm getting when I try to run 'make install' Any ideas? I also don't know why I'm getting what looks to be an extra entry for nagios on the second install-sh line. Or is this OK? ----------------------------------------------------------------- # make install cd ./base && make install .././install-sh -c -m 775 -o nagios -g nagios -d /usr/local/nagios/bin #.././install-sh -c -m 774 -o nagios -g nagios nagios /usr/local/nagios/bin /usr/bin/strip /usr/local/nagios/bin/nagios strip: /usr/local/nagios/bin/nagios -- 0654-405 Specify an XCOFF object file or an archive of XCOFF object files. make: 1254-004 The error code from the last command is 255. Stop. make: 1254-004 The error code from the last command is 2. Stop. ---------------------------------------------------------------------------- ------------ Anyone jump in here. I'm not much of a perl programmer either. Any help with AIX would be greatly appreciated. -----Original Message----- From: Petrucci, Joseph [mailto:Joseph.Petrucci at ddiworld.com] Sent: Monday, November 28, 2005 11:38 AM To: Chrietzberg, Aletha; Kevin Alford; Daniel Jimenez; nagios-users at lists.sourceforge.net Subject: RE: [Nagios-users] RE: nrpe ver. 2 on AIX 5.2 Last time I was in an AIX shop, there was a Linux tools for AIX CD that came with 5.2. load GCC from there then get a source copy of Nagios and try to compile it. you might have to load on a few more libraries but I do not think so. >-----Original Message----- >From: nagios-users-admin at lists.sourceforge.net >[mailto:nagios-users-admin at lists.sourceforge.net]On Behalf Of >Chrietzberg, Aletha >Sent: Monday, November 28, 2005 11:31 AM >To: 'Kevin Alford'; Daniel Jimenez; Chrietzberg, Aletha; >nagios-users at lists.sourceforge.net >Subject: RE: [Nagios-users] RE: nrpe ver. 2 on AIX 5.2 > > >I went to the link you gave but I don't see a discussion on installing >Nagios base on AIX 5.2. I do see the nrpe install but I >thought you had to >have Nagios running somewhere to use this. I am an AIX only shop. > >Or did I misunderstand/misread something in your answer? > > > > > >-----Original Message----- >From: Kevin Alford [mailto:kevin.alford at tmctechnologies.com] >Sent: Monday, November 28, 2005 10:46 AM >To: Daniel Jimenez; Chrietzberg, Aletha; >nagios-users at lists.sourceforge.net >Subject: RE: [Nagios-users] RE: nrpe ver. 2 on AIX 5.2 (Daniel Jimenez) > > >I used the following link to install mine, and it worked fine. I am >running AIX 5.2 ML7. > >http://www.nagiosexchange.org/nagios-users.34.0.html?&tx_mailli >sttofaq_p >i1%5Bmode%5D=1&tx_maillisttofaq_pi1%5BshowUid%5D=12441#childUid12708 > > > > >Kevin D. Alford >Manager, >Network and Facility >TMC Technologies, Inc. > >a Global Science & Technology Company > > > > >-----Original Message----- >From: nagios-users-admin at lists.sourceforge.net >[mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Daniel >Jimenez >Sent: Monday, November 28, 2005 10:45 AM >To: Chrietzberg, Aletha; nagios-users at lists.sourceforge.net >Subject: RE: [Nagios-users] RE: nrpe ver. 2 on AIX 5.2 (Daniel Jimenez) > >I'm actually not installing Nagios, but just NRPE and the plugins. I'm >having problems with the most important plugin for my specific >environment, which is check_disk. I'm sticking to Linux for my Nagios >Core install. > >-----Original Message----- >From: Chrietzberg, Aletha [mailto:Alethad at unch.unc.edu] >Sent: Monday, November 28, 2005 8:27 AM >To: 'nagios-users at lists.sourceforge.net' >Subject: [Nagios-users] RE: nrpe ver. 2 on AIX 5.2 (Daniel Jimenez) > >Daniel, >I have been trying to get nagios installed on AIX 5.2 for some time but >to no avail. Is there anything you can pass along that may help me? >Did you have to install any extra AIX PTFs? I have Perl, gdlib, zlib, >gcc >and libpng installed. But where I'm getting into trouble is in the >running >of make on the main Nagios makefile. One of the errors is in regards >to >gcc warnings on a bunch of undeclared identifiers. > >I have been battling this for a while but can find no one that can help >me. >I pick it up and put it down. I am still running the old Netsaint and >have >been hanging on to that so I can have something. > >I promise not to take up too much of your time. I just want >to get the >base running. I can handle the plugins. >Any help would be greatly appreciated. > > > >Never let the fear of striking out get in your way. > Babe Ruth > > >Aletha D Chrietzberg >ISD - AIX Tech Services >UNC Healthcare >(919) 966-4888 > > > > > >------------------------------------------------------- >This SF.net email is sponsored by: Splunk Inc. Do you grep through log >files for problems? Stop! Download the new AJAX search engine that >makes searching your log files as easy as surfing the web. DOWNLOAD >SPLUNK! >http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click >_______________________________________________ >Nagios-users mailing list >Nagios-users at lists.sourceforge.net >https://lists.sourceforge.net/lists/listinfo/nagios-users >::: Please include Nagios version, plugin version (-v) and OS when >reporting any issue. >::: Messages without supporting info will risk being sent to /dev/null > > > > >------------------------------------------------------- >This SF.net email is sponsored by: Splunk Inc. Do you grep through log >files for problems? Stop! Download the new AJAX search engine that >makes searching your log files as easy as surfing the web. DOWNLOAD >SPLUNK! >http://ads.osdn.com/?ad_idv37&alloc_id865&op=ick >_______________________________________________ >Nagios-users mailing list >Nagios-users at lists.sourceforge.net >https://lists.sourceforge.net/lists/listinfo/nagios-users >::: Please include Nagios version, plugin version (-v) and OS when >reporting any issue. >::: Messages without supporting info will risk being sent to /dev/null > > > > > >------------------------------------------------------- >This SF.net email is sponsored by: Splunk Inc. Do you grep >through log files >for problems? Stop! Download the new AJAX search engine that makes >searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! >http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click >_______________________________________________ >Nagios-users mailing list >Nagios-users at lists.sourceforge.net >https://lists.sourceforge.net/lists/listinfo/nagios-users >::: Please include Nagios version, plugin version (-v) and OS >when reporting any issue. >::: Messages without supporting info will risk being sent to /dev/null > > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From happy at usg.edu Wed Dec 7 17:41:30 2005 From: happy at usg.edu (Mark Plaksin) Date: Wed, 07 Dec 2005 11:41:30 -0500 Subject: Nagios HANGS scheduling info References: <1964B929D8FF4D4DB06854A75D0FD3730CABCEC7@bsbserv028.bsb.politec.com.br> <1133894570.3799.39.camel@localhost.localdomain> Message-ID: <87mzjcdflx.fsf@stone.tss.usg.edu> Marco Ramos writes: > Try to tune your service_reaper_frequency and max_concurrent_checks > values. Take a look at http://nagios.org/faqs/viewfaq.php?faq_id=115. Thanks for the pointer! > Had the same problem a while ago and managed to solve it tunning this > two options. What happens when you have the potential for 1000 checks in a minute? The math in that FAQ says you should set service_reaper_frequency to 0.50 but Nagios (2.04b) only allows integers >= 1. Looking at our "Performance Info" page, most of the time 2-300 were executed in the past minute but sometimes it's nearly 1000. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mscandian at freebsdbrasil.com.br Wed Dec 7 17:01:09 2005 From: mscandian at freebsdbrasil.com.br (=?ISO-8859-1?Q?M=E1rio_S=E9rgio_Candian?=) Date: Wed, 07 Dec 2005 14:01:09 -0200 Subject: Nagios with MySQL Message-ID: <43970745.3020107@freebsdbrasil.com.br> Hi guys, I have one question about Nagios: Can I install Nagios with database sql (MySQL)? I'm usinng a FreeBSD server, and the nagios ports dont say anything about installing Nagios with MySQL. I saw the documentation and dont find nothing. Someone know if it's possible? (nagios 1.x or 2.x). Thanks. -- Mario Sergio Candian - "Dreams as if you'll live forever. Live as if you'll die today" -- James Dean ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mye at bali.to Wed Dec 7 18:15:52 2005 From: mye at bali.to (mye) Date: Thu, 08 Dec 2005 02:15:52 +0900 Subject: Nagios with MySQL In-Reply-To: <43970745.3020107@freebsdbrasil.com.br> References: <43970745.3020107@freebsdbrasil.com.br> Message-ID: <20051208021332.0EB9.MYE@bali.to> Hi, yes its possible you can see nagios installing document or search in http://www.google.com using nagios mysql key words Muhammad On Wed, 07 Dec 2005 14:01:09 -0200 M?io S?gio Candian wrote: > Hi guys, > > I have one question about Nagios: > > Can I install Nagios with database sql (MySQL)? I'm usinng a FreeBSD > server, and the nagios ports dont say anything about installing Nagios > with MySQL. I saw the documentation and dont find nothing. > > Someone know if it's possible? (nagios 1.x or 2.x). > > Thanks. > > -- > Mario Sergio Candian > - > "Dreams as if you'll live forever. Live as if you'll die today" -- James Dean > > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jeffwilliams05 at gmail.com Wed Dec 7 18:31:03 2005 From: jeffwilliams05 at gmail.com (Jeff Williams) Date: Wed, 7 Dec 2005 11:31:03 -0600 Subject: Nagios with MySQL In-Reply-To: <20051208021332.0EB9.MYE@bali.to> References: <43970745.3020107@freebsdbrasil.com.br> <20051208021332.0EB9.MYE@bali.to> Message-ID: <17e351010512070931p7365688dpc7bd69fa6111e0fa@mail.gmail.com> Correct me if I'm wrong, but I don't think it's possible to have native db support in 2.0. This is taken from the What's New in Nagios 2.0 document: "Native DB Support Dropped - Native support for storing various types of data (status, retention, comment, downtime, etc.) in MySQL and PostgreSQL has been dropped. Stop whining. I expect someone will develop an alternative using the new event broker sometime in the near future. Besides, DB support was not well implemented and dropping native DB support will make things easier for newbies to understand (one less thing to figure out)." This is not to say that you cannot find an addon that will do this for you though. Jeff On 12/7/05, mye wrote: > > Hi, > yes its possible > you can see nagios installing document > or search in http://www.google.com using nagios mysql key words > > Muhammad > > On Wed, 07 Dec 2005 14:01:09 -0200 > M?io S?gio Candian wrote: > > > Hi guys, > > > > I have one question about Nagios: > > > > Can I install Nagios with database sql (MySQL)? I'm usinng a FreeBSD > > server, and the nagios ports dont say anything about installing Nagios > > with MySQL. I saw the documentation and dont find nothing. > > > > Someone know if it's possible? (nagios 1.x or 2.x). > > > > Thanks. > > > > -- > > Mario Sergio Candian > > - > > "Dreams as if you'll live forever. Live as if you'll die today" -- James Dean > > > > > > > > ------------------------------------------------------- > > This SF.net email is sponsored by: Splunk Inc. Do you grep through log files > > for problems? Stop! Download the new AJAX search engine that makes > > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > > _______________________________________________ > > Nagios-users mailing list > > Nagios-users at lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > > ::: Messages without supporting info will risk being sent to /dev/null > > > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From hvdkooij at vanderkooij.org Wed Dec 7 18:34:14 2005 From: hvdkooij at vanderkooij.org (Hugo van der Kooij) Date: Wed, 7 Dec 2005 18:34:14 +0100 (CET) Subject: help on AIX 5.2 In-Reply-To: References: Message-ID: On Wed, 7 Dec 2005, Chrietzberg, Aletha wrote: > ----------------------------------------------------------------- > # make install > cd ./base && make install > .././install-sh -c -m 775 -o nagios -g nagios -d > /usr/local/nagios/bin > #.././install-sh -c -m 774 -o nagios -g nagios nagios > /usr/local/nagios/bin I guess Nagios never got installed here as there is a significant difference between these lines. (Like no source file to begin with.) > /usr/bin/strip /usr/local/nagios/bin/nagios > strip: /usr/local/nagios/bin/nagios -- 0654-405 Specify an XCOFF object file > or an archive of > XCOFF object files. > make: 1254-004 The error code from the last command is 255. You can not strip nothing. This is just the detection of a problem which started earlier. Hugo. -- I hate duplicates. Just reply to the relevant mailinglist. hvdkooij at vanderkooij.org http://hvdkooij.xs4all.nl/ Don't meddle in the affairs of magicians, for they are subtle and quick to anger. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ssmith at siu.edu Wed Dec 7 19:16:03 2005 From: ssmith at siu.edu (Scott Smith) Date: Wed, 07 Dec 2005 12:16:03 -0600 Subject: Critical Audible Alarms with Unreachable Hosts Message-ID: <439726E3.5060203@siu.edu> I have been using Nagios for a while (several years, and even when it was NetSaint). Just recently we had our NOC switchover from HP OpenView to Nagios. In doing so we updated all the Hosts from our network into the Nagios config, some 990 plus Hosts, and now over 1000 services. Here is my issue. When we have a building go down, say 5 Switches (Hosts), and we have the Parents Relationship setup properly we see 1 Host Down, and 4 Hosts Unreachable. We only get one Email Notification about the single Host being down, which is what we want. However, the NOC has recently asked use to enable the Sound for Nagios, which we did with the cgi.cfg file. The problem is that when the NOC Acknowledges the root problem, and services from the 1 Host that is down, we keep getting the Critical.wav alert on our webpages. They have to goto ALL the Unreachable devices services, and Acknowledge them as well. In this example isn't to bad, however some of our buildings have over 100 Hosts, and if the main router goes down for that building our NOC would have a nightmare trying to Ack each Host. Ultimately I would think that they should just Ack the Root problem, and then the Unreachable Children (so to speak) would not need Acking. The more interesting part is that Nagios seems to understand this by only sending one Email, but the webpages don't. Any ideas on this one? -- Scott Smith Network Engineering Services Southern Illinois University Carbondale ssmith at siu.edu ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From brueckel at wave-computer.de Wed Dec 7 20:11:29 2005 From: brueckel at wave-computer.de (=?ISO-8859-15?Q?Alexander_Br=FCckel?=) Date: Wed, 07 Dec 2005 20:11:29 +0100 Subject: Second (passive) Nagios-Installation forks processes like hell Message-ID: <439733E1.8070604@wave-computer.de> Hi Guys! We use two servers which are checking different locations and submit their results to a third nagios installation. (all 2.0b4 on SuSE 9.3) This installation is located on the same host as one of the two active ones and was built with the --prefix option. This nagios does nothing but revieving results, no active checks or whatever. We use nsca to submit the results between the server/installations which seems to work pretty good. The problem is, that the passive nagios forks processes all the time without killing all the old ones. So we end up having over 250 processes after one hour. The only way to stop this is to restart the the passive nagios. If we do this it seems like the first process, which he tries to kill pb pid, doesn't exist any more. Any help would be greatly appreciated! Thanks in advance, Alex ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From stuart.beswick at citigroup.com Wed Dec 7 21:32:41 2005 From: stuart.beswick at citigroup.com (Beswick, Stuart) Date: Wed, 7 Dec 2005 15:32:41 -0500 Subject: Event Handler Message-ID: I'm looking to setup an event handler to disable active checks when it first becomes critical. The reason I am doing this is because I a running a check_log, and it will go critical for only one check then go back to good. Anyone know how to do this? I would need the Check Command I would assume. Thanks, Stuart ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From cameron.matheson at gmail.com Wed Dec 7 22:06:10 2005 From: cameron.matheson at gmail.com (Cam) Date: Wed, 7 Dec 2005 14:06:10 -0700 Subject: contactgroup{ }, and contactgroups # bug? In-Reply-To: References: <7e2f9af90512051254j2eeeb94cxf0c7369a1b596334@mail.gmail.com> <7e2f9af90512051602y21145f4fq99b5ed451facbb61@mail.gmail.com> Message-ID: <7e2f9af90512071306g3444cb90l320143f541d95019@mail.gmail.com> On 12/5/05, Hugo van der Kooij wrote: > If it is not required for the hostgroup then THAT is the bug. Unfortunatly > I can not find a bug tracking tool on the Nagios site. That is Absolutely not a bug. It's a documentation bug, if anything. The 'members' line should *not* be required (when members are defined elsewhere). Cameron Matheson ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Alethad at unch.unc.edu Wed Dec 7 22:03:55 2005 From: Alethad at unch.unc.edu (Chrietzberg, Aletha) Date: Wed, 7 Dec 2005 16:03:55 -0500 Subject: Nagios/Plugins on AIX 5.2 - reply Message-ID: Hey Chuck, Yes, I'm still having trouble. That would be awesome if you have any notes. I'll take anything I can get my hands on at this point. I'm trying to install the Nagios and the plugins. Which version of Nagios did you install? 1.2 or 2.0b4? Just send me whatever you can. Thanks. Never let the fear of striking out get in your way. Babe Ruth Aletha D Chrietzberg UNC Healthcare ISD - AIX Tech Services (919) 966-4888 -----Original Message----- From: Charles Dee Rice [mailto:cdrice at pobox.com] Sent: Wednesday, December 07, 2005 3:10 PM To: Alethad at unch.unc.edu Subject: Nagios/Plugins on AIX 5.2 Hi, this is Chuck in Apache Junction, AZ. I saw a recent post you sent to the nagios-users mailing list concerning having some trouble building nagios on AIX 5.2. I have nagios, nagios-plugins, nrpe and ncsa all built on AIX 5.2 ML04 with detailed notes and build logs -- I'd be happy to send them to you if you're still having trouble. Drop me a note or give me a call at 480-235-8069 if you want. - Chuck ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From s.bondarenko at ugtel.ru Wed Dec 7 22:13:30 2005 From: s.bondarenko at ugtel.ru (=?koi8-r?B?4s/OxMHSxc7LzyDzLuEu?=) Date: Thu, 8 Dec 2005 00:13:30 +0300 Subject: Problem connecting to NSClient Message-ID: <10696621404.20051208001330@ugtel.ru> Hello, I have installed the latest Nagios after using the previous version for two years. I have copied the configuration files to ne server where the new version is set up. Nagios starts with no warning and no error messages, but when it tries to check the Win2K mashine (connects to NSClient) it receives the following: NSClient - ERROR:Wrong password I want to mention that the password in Windows Registry matches the password specified in services: check_nt_diskspase!C!90%!98%!123 password: 123 (it is also set up in windows registry) All other services work perfectly for ex: check_tcp!5000 (checking the Sybase SQL server) Please can you explain me what may be the problem or how can I deeply monitor the problem -- Best Wishes, Bondarenko Sergey ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From emanuel.beunder at d-radio.nl Wed Dec 7 23:18:02 2005 From: emanuel.beunder at d-radio.nl (Emanuel Beunder) Date: Wed, 07 Dec 2005 23:18:02 +0100 Subject: delayed critical alerts Message-ID: <43975F9A.8010705@d-radio.nl> Hello, I've been looking a bit trough the archives of the mailinglist, but supprisingly enough, I didn't find the anwer to my question. The company I work for uses Nagios 1.2 for monitoring everything and all, something me and my manager are very proud off (I really love this framework). But I just have an big problem (and some small ones): on some services we have enabled sending a warning by SMS when this service fails. But unfortunatly my cellphone is pretty loud and on one night when things really went wrong, I was woken up 10 times during the night (to sleepy to realize to turn it off), while I really needed the sleep to face the problems the next day. So I disabled notifying during 11pm and 7am on my hostgroup. I also enlarged the time window of the notification intervals, because after our local DNS broke down on a bank holiday (when I was on holiday too) and received hundreds of SMSes.. But now I also don't get any warning anymore when something broke down during the night. And since I am not an early bird, I'd like to know when I really need to go to my work early when the S hits the fan (or to log in by VPN and fix the issue). So basically my question is: how do I delay a notification to a hostgroup until the notification time window for this group begins? (because I do like having 24x7 statistics for my services, I don't like to change the monitor window to the notification window). Thanks in advance for your answers! Cheers, Emanuel -- In a world without walls and fences, who needs Windows or Gates? ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From misc at viceconsulting.co.nz Wed Dec 7 23:26:15 2005 From: misc at viceconsulting.co.nz (misc at viceconsulting.co.nz) Date: Thu, 8 Dec 2005 11:26:15 +1300 (NZDT) Subject: Plugins timing out Message-ID: <40811.127.0.0.1.1133994375.squirrel@www.goldenfields.co.nz> Hi All, On a server with 400 service checks I have about 10 service checks that are timing out. My timeout value is set to 25 seconds. While I am trying to resolve the timeout issues, I am keen to find out how Nagios reacts when waiting for service check results. What happens when Nagios is waiting for the result of a service check? Will Nagios continue to happily schedule service checks even when waiting for a result, or will it await the result of a service check before scheduling another? If it does schedule multiple service checks simultaneously, is there a limit to the maximum number of service checks that Nagios will check at a given time? My average check Latency for this Nagios server (as per "Performance Info") is 132.551 seconds. Would I be correct in assuming that this high latency value could be caused by these few rogue service checks that are timing out? Also, this is a distributed Nagios server which results to the Central Nagios server. I am getting quite a few "Stale Results" from this distributed result. Is it possible for a few rogue service checks which are timining out to throw out the remaining service checks from this distributed Nagios server causing stale results? The stale results being reported by the Central Nagios server are not exclusively the rogue timeout services but also regular checks that don't time out. Any insight into any of this would be much appreciated. Alex. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mfatihakbulut at gmail.com Wed Dec 7 23:57:44 2005 From: mfatihakbulut at gmail.com (Mehmet Fatih AKBULUT) Date: Wed, 7 Dec 2005 17:57:44 -0500 Subject: help please. browser problem. Message-ID: hi all. i use latest versions of mozilla and firefox. but getting the error below: Nagios <!-- This page requires a web browser which supports frames. --> <p> <center> <h2>Nagios</h2> <a href="http://www.nagios.org" alt="Nagios Home">www.nagios.org</a><br> Copyright (c) 1999-2001 Ethan A. Galstad<br> </center> </p> <p> <i>Note: These pages require a browser which supports frames</i> </p> on a different it works fine on firefox and mozilla on even konqueror. but why not works on the one at home ? help please ... -------------- next part -------------- An HTML attachment was scrubbed... URL: From benjamin.rutter at packethosting.net Thu Dec 8 00:58:38 2005 From: benjamin.rutter at packethosting.net (Benjamin Rutter) Date: Wed, 7 Dec 2005 18:58:38 -0500 Subject: Strange contact group behavior in 2.0b3 Message-ID: <006901c5fb8a$2503ecc0$08fefe0a@corp.trm.com> I'm running Nagios v2.0b3 and have a strange problem with contact groups Here is a breakdown of the issues: - I create 2 contacts, one called 'brutter' and another 'tward' - I then create a contact group called 'TRMStaff' and add both users - Any service I set the contact group of 'TRMStaff' to works flawlessly - Any host I set the contact group of 'TRMStaff' to works flawlessly So at this point, all is well...then the strange part - I create a new contact called 'lgroup' - I then create a contact group called 'LG' and add the 'lgroup' user - Any service I set the contact group of 'LG' to works fine AS LONG AS the HOST of that service is set to 'TRMStaff' as the contact group. If I have a host with contact group 'TRMStaff' and a service of that host with contact group 'LG', everything works as it should. As soon as I change the contact group on the host to 'LG', that host disappears from the list of hosts in the GUI and alerts are not generated for the hosts OR the services related to that host. I have tried creating multiple groups with all sorts of different names and assigning different users to no avail.... I need to be able to have the 'LG' contact group assigned to both HOSTS and it's corresponding services. I have checked the syntax up and down a hundred times and everything seems to be on point. Is there anything illegal in the group or contact names that I've chosen? Any suggestions out there??? This is killing me...? ? ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From todd_barbera at wgbh.org Thu Dec 8 01:20:57 2005 From: todd_barbera at wgbh.org (Todd Barbera) Date: Wed, 07 Dec 2005 19:20:57 -0500 Subject: Strange contact group behavior in 2.0b3 References: <006901c5fb8a$2503ecc0$08fefe0a@corp.trm.com> Message-ID: <001001c5fb8d$4346c880$0d7810ac@wgbh.org> Maybe you should try upgrading to 2.0b6. The change log shows one fix for hostgroups, but it didn't sound like an exact fit. At least you'll have the latest fixes even if the problem persists and you'll be able to eliminate the possibility that your issue may have been addressed in a newer release. Todd ----- Original Message ----- From: "Benjamin Rutter" To: Sent: Wednesday, December 07, 2005 6:58 PM Subject: [Nagios-users] Strange contact group behavior in 2.0b3 I'm running Nagios v2.0b3 and have a strange problem with contact groups Here is a breakdown of the issues: - I create 2 contacts, one called 'brutter' and another 'tward' - I then create a contact group called 'TRMStaff' and add both users - Any service I set the contact group of 'TRMStaff' to works flawlessly - Any host I set the contact group of 'TRMStaff' to works flawlessly So at this point, all is well...then the strange part - I create a new contact called 'lgroup' - I then create a contact group called 'LG' and add the 'lgroup' user - Any service I set the contact group of 'LG' to works fine AS LONG AS the HOST of that service is set to 'TRMStaff' as the contact group. If I have a host with contact group 'TRMStaff' and a service of that host with contact group 'LG', everything works as it should. As soon as I change the contact group on the host to 'LG', that host disappears from the list of hosts in the GUI and alerts are not generated for the hosts OR the services related to that host. I have tried creating multiple groups with all sorts of different names and assigning different users to no avail.... I need to be able to have the 'LG' contact group assigned to both HOSTS and it's corresponding services. I have checked the syntax up and down a hundred times and everything seems to be on point. Is there anything illegal in the group or contact names that I've chosen? Any suggestions out there??? This is killing me... ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=ick _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ewh at groupoliver.com Thu Dec 8 02:04:52 2005 From: ewh at groupoliver.com (Eric X. Holzapfel) Date: Wed, 7 Dec 2005 17:04:52 -0800 Subject: No Luck with Apache and Nagios Message-ID: _____________________________________________ Hello List, I am trying to run Nagios 2.0b5 on RH Fedora 4 with apache 2.0.54 I can start nagios ok, and both the -v, -s, and status report ok. The nagios.log file reports startup and monitoring info ok. The problem is with apache. I am completely unable to use apache. If I try to put these lines in my httpd.conf file, apache errors with a warning, that the alias won't work because there is an earlier alias and script alias, etc. ScriptAlias /nagios/cgi-bin/ /usr/local/nagios/sbin/ AllowOverride AuthConfig Options ExecCGI Order allow,deny Allow from all And Alias /nagios/ /usr/local/nagios/share/ Options None AllowOverride AuthConfig Order allow,deny Allow from all Any ideas????? When I try to go to http://localhost/nagios I get page not found, etc Thanks, eric -------------- next part -------------- An HTML attachment was scrubbed... URL: From todd_barbera at wgbh.org Thu Dec 8 03:11:02 2005 From: todd_barbera at wgbh.org (Todd Barbera) Date: Wed, 07 Dec 2005 21:11:02 -0500 Subject: No Luck with Apache and Nagios References: Message-ID: <001c01c5fb9c$a4548680$0d7810ac@wgbh.org> No Luck with Apache and NagiosDid you try compiling Apache from source? If not, give that a shot. You might be tripping up on an older config. Also, you could try the default config files if they are still there. httpd-std.conf would be the file name. Start Apache with just the default config file just to make sure it starts and then add your changes. Todd ----- Original Message ----- From: Eric X. Holzapfel To: Nagios-users at lists.sourceforge.net Sent: Wednesday, December 07, 2005 8:04 PM Subject: [Nagios-users] No Luck with Apache and Nagios _____________________________________________ Hello List, I am trying to run Nagios 2.0b5 on RH Fedora 4 with apache 2.0.54 I can start nagios ok, and both the -v, -s, and status report ok. The nagios.log file reports startup and monitoring info ok. The problem is with apache. I am completely unable to use apache. If I try to put these lines in my httpd.conf file, apache errors with a warning, that the alias won't work because there is an earlier alias and script alias, etc. ScriptAlias /nagios/cgi-bin/ /usr/local/nagios/sbin/ AllowOverride AuthConfig Options ExecCGI Order allow,deny Allow from all And Alias /nagios/ /usr/local/nagios/share/ Options None AllowOverride AuthConfig Order allow,deny Allow from all Any ideas????? When I try to go to http://localhost/nagios I get page not found, etc Thanks, eric -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris at chrisburgess.com.au Thu Dec 8 03:48:19 2005 From: chris at chrisburgess.com.au (Chris Burgess) Date: Thu, 08 Dec 2005 13:48:19 +1100 Subject: No Luck with Apache and Nagios In-Reply-To: References: Message-ID: On 8/12/05 12:04 PM, "Eric X. Holzapfel" wrote: > Hello List, > > I am trying to run Nagios 2.0b5 on RH Fedora 4 with apache 2.0.54 > > I can start nagios ok, and both the ?v, -s, and status report ok. The > nagios.log file reports startup and monitoring info ok. > > The problem is with apache. I am completely unable to use apache. If I try > to put these lines in my httpd.conf file, apache errors with a warning, that > the alias won?t work because there is an earlier alias and script alias, etc. Hi Eric, How did you install Nagios, source or package? You may already have the directives somewhere else installed by your package. If there aren't any duplicates in httpd.conf, have a look in the Apache conf directory (/etc/httpd/conf.d or similar). I don't have access to a Fedora box at the moment to check. If not, what is the exact error message, it should also give you a line number, check what's on that line (and before/after). Cheers, Chris ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From subscribe at aehome.net Thu Dec 8 05:38:59 2005 From: subscribe at aehome.net (Adam Emery) Date: Wed, 7 Dec 2005 22:38:59 -0600 Subject: No Luck with Apache and Nagios In-Reply-To: References: Message-ID: <000401c5fbb1$60a3ae10$0208090a@a1> You might want to look for an "include" statement in your httpd.conf. you probably have a nagios.conf that is already being loaded. -----Original Message----- From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Eric X. Holzapfel Sent: Wednesday, December 07, 2005 7:05 PM To: Nagios-users at lists.sourceforge.net Subject: [Nagios-users] No Luck with Apache and Nagios _____________________________________________ Hello List, I am trying to run Nagios 2.0b5 on RH Fedora 4 with apache 2.0.54 I can start nagios ok, and both the -v, -s, and status report ok. The nagios.log file reports startup and monitoring info ok. The problem is with apache. I am completely unable to use apache. If I try to put these lines in my httpd.conf file, apache errors with a warning, that the alias won't work because there is an earlier alias and script alias, etc. ScriptAlias /nagios/cgi-bin/ /usr/local/nagios/sbin/ AllowOverride AuthConfig Options ExecCGI Order allow,deny Allow from all And Alias /nagios/ /usr/local/nagios/share/ Options None AllowOverride AuthConfig Order allow,deny Allow from all Any ideas????? When I try to go to http://localhost/nagios I get page not found, etc Thanks, eric -------------- next part -------------- An HTML attachment was scrubbed... URL: From Lui.external at infineon.com Thu Dec 8 06:54:40 2005 From: Lui.external at infineon.com (Lui.external at infineon.com) Date: Thu, 8 Dec 2005 13:54:40 +0800 Subject: Problem with server downtime Message-ID: Hi all, I faced some servers' downtime issue in Nagios. My server was down for more than 1 minutes. But the downtime recorded by Nagios is 3 second only. May I know how could this happened? I would like to trigger the host down alert only if the downtime is more than 1 minute, is there any solution for this? In what situation 0 downtime will be detected by Nagios? Thanks in advance! Warmest Regards, Louise Lui Sook Chen IFMY IT External Infineon Technologies Malacca Phone : 06-2325266 (ext : 8979/8920) mailto : lui.external at infineon.com ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From hvdkooij at vanderkooij.org Thu Dec 8 07:28:49 2005 From: hvdkooij at vanderkooij.org (Hugo van der Kooij) Date: Thu, 8 Dec 2005 07:28:49 +0100 (CET) Subject: help please. browser problem. In-Reply-To: References: Message-ID: On Wed, 7 Dec 2005, Mehmet Fatih AKBULUT wrote: > on a different it works fine on firefox and mozilla on even konqueror. > but why not works on the one at home ? If it works on nearly every spot but one then you should look for browser problems. Nothing Nagios can do about is bad browsers. Hugo. PS: That was NOT an error. Just the source of a webpage. -- I hate duplicates. Just reply to the relevant mailinglist. hvdkooij at vanderkooij.org http://hvdkooij.xs4all.nl/ Don't meddle in the affairs of magicians, for they are subtle and quick to anger. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From hvdkooij at vanderkooij.org Thu Dec 8 07:33:27 2005 From: hvdkooij at vanderkooij.org (Hugo van der Kooij) Date: Thu, 8 Dec 2005 07:33:27 +0100 (CET) Subject: No Luck with Apache and Nagios In-Reply-To: References: Message-ID: On Wed, 7 Dec 2005, Eric X. Holzapfel wrote: > The problem is with apache. I am completely unable to use apache. If I > try to put these lines in my httpd.conf file, apache errors with a > warning, that the alias won't work because there is an earlier alias and > script alias, etc. If you use the RPM files you allready have a nagios.conf file in you conf.d directory and you should NOT add them again to your config. Remove all the new lines and restart the server. Then try again. Check you apache config tree for extra files. Hugo. -- I hate duplicates. Just reply to the relevant mailinglist. hvdkooij at vanderkooij.org http://hvdkooij.xs4all.nl/ Don't meddle in the affairs of magicians, for they are subtle and quick to anger. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ccyuen at hkcert.org Thu Dec 8 09:46:09 2005 From: ccyuen at hkcert.org (CC Yuen) Date: Thu, 08 Dec 2005 16:46:09 +0800 Subject: How to config alert send out when service down? Message-ID: <4397F2D1.8040909@hkcert.org> Hi list, As title, I tried to enable the commands as following: # 'notify-by-email' command definition define command{ command_name notify-by-email command_line /usr/bin/printf "%b" "***** Nagios 1.0 *****\n\nNotifica tion Type: $NOTIFICATIONTYPE$\n\nService: $SERVICEDESC$\nHost: $HOSTALIAS$\nAddr ess: $HOSTADDRESS$\nState: $SERVICESTATE$\n\nDate/Time: $DATETIME$\n\nAdditional Info:\n\n$OUTPUT$" | /usr/bin/mail -s "** $NOTIFICATIONTYPE$ alert - $HOSTALIA S$/$SERVICEDESC$ is $SERVICESTATE$ **" $CONTACTEMAIL$ } but it has error when restart. What is the correct step that I do? I can receive alert email when host down. Thanks Chung ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ccyuen at hkcert.org Thu Dec 8 10:09:41 2005 From: ccyuen at hkcert.org (CC Yuen) Date: Thu, 08 Dec 2005 17:09:41 +0800 Subject: How to config alert send out when service down? In-Reply-To: <4397F6DB.6030708@mailtilmeg.com> References: <4397F2D1.8040909@hkcert.org> <4397F6DB.6030708@mailtilmeg.com> Message-ID: <4397F855.1090105@hkcert.org> Hi there, Thank you so much for your fast reply. My contacts.cfg also enable the following lines, what else can I do? I was play around it near 1 month but still can't receive service alert. Thanks again, Chung Nagios ??: > Tou have to define it under the contact you wish to use this for. > > define contact{ > contact_name deskjet-wrk > alias Alias name > service_notification_period 24x7 > host_notification_period 24x7 > service_notification_options w,u,c,r > host_notification_options d,r > service_notification_commands notify-by-email > host_notification_commands host-notify-by-email > email username at domain.com > } > > > CC Yuen wrote: > > >>Hi list, >> >>As title, I tried to enable the commands as following: >> >># 'notify-by-email' command definition >>define command{ >> command_name notify-by-email >> command_line /usr/bin/printf "%b" "***** Nagios 1.0 *****\n\nNotifica >>tion Type: $NOTIFICATIONTYPE$\n\nService: $SERVICEDESC$\nHost: $HOSTALIAS$\nAddr >>ess: $HOSTADDRESS$\nState: $SERVICESTATE$\n\nDate/Time: $DATETIME$\n\nAdditional >>Info:\n\n$OUTPUT$" | /usr/bin/mail -s "** $NOTIFICATIONTYPE$ alert - $HOSTALIA >>S$/$SERVICEDESC$ is $SERVICESTATE$ **" $CONTACTEMAIL$ >> } >> >>but it has error when restart. What is the correct step that I do? >>I can receive alert email when host down. >> >> >>Thanks >>Chung >> >> >>------------------------------------------------------- >>This SF.net email is sponsored by: Splunk Inc. Do you grep through log files >>for problems? Stop! Download the new AJAX search engine that makes >>searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! >>http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click >>_______________________________________________ >>Nagios-users mailing list >>Nagios-users at lists.sourceforge.net >>https://lists.sourceforge.net/lists/listinfo/nagios-users >>::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. >>::: Messages without supporting info will risk being sent to /dev/null >> >> >> > > > > > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at mailtilmeg.com Thu Dec 8 10:16:38 2005 From: nagios at mailtilmeg.com (Nagios) Date: Thu, 08 Dec 2005 10:16:38 +0100 Subject: Problem defining contact groups - Allerts sent to all contacts instead of group Message-ID: <4397F9F6.6060700@mailtilmeg.com> Hi all my first post :) We have Nagios 2.0b4 up and running and no problems there. But i have now added som service checks which should only be reported to certian people. (i.e contact group) I have defined three contact_groups and 5 contacts in my setup The problem is that with this perticular service_check i get reports to all the 5 contacts and not the contact group that i have spesified both under host_settings , service_settings and under contact group settings. Am i missing something that over rides my settings ??? I have attached settings from my files. I have anonymized IP values and names,email's Best regards Gabrielli -------------- next part -------------- An embedded and charset-unspecified text was scrubbed... Name: nagios_config.txt URL: From fernando.shayani at bsb.politec.com.br Thu Dec 8 11:45:25 2005 From: fernando.shayani at bsb.politec.com.br (Fernando Shayani) Date: Thu, 8 Dec 2005 07:45:25 -0300 Subject: RES: Nagios HANGS scheduling info Message-ID: <1964B929D8FF4D4DB06854A75D0FD3730511F4AE@bsbserv028.bsb.politec.com.br> Ok. It continues hanging... Now I changed the REAPER from 5 to 2... Lets see.. Thanks Fernando -----Mensagem original----- De: Marco Ramos [mailto:mramos at co.sapo.pt] Enviada: ter 06-dez-05 16:42 Para: Fernando Shayani Cc: nagios-users at lists.sourceforge.net Assunto: RE: [Nagios-users] Nagios HANGS scheduling info Try to tune your service_reaper_frequency and max_concurrent_checks values. Take a look at http://nagios.org/faqs/viewfaq.php?faq_id=115. Had the same problem a while ago and managed to solve it tunning this two options. regards, Marco Ramos On Tue, 2005-12-06 at 14:53 -0300, Fernando Shayani wrote: > Well, my configuration is: > > Inter-check sleep time (sleep_time=0.25) > Service inter-check delay method (service_inter_check_delay_method=s) > Maximum service check spread (max_service_check_spread=2) > Service interleave factor (service_interleave_factor=s) > Maximum concurrent service checks (max_concurrent_checks=0) > Service reaper frequency (service_reaper_frequency=5) > Host inter-check delay method (host_inter_check_delay_method=s) > Maximum host check spread (max_host_check_spread=2) > Timing interval length (interval_length=60) > Agressive host checking option (use_aggressive_host_checking=0) > > The following options are not set. > Auto-rescheduling option > Auto-rescheduling interval > Auto-rescheduling window > > > I will read the configuration and recheck it all. > > I also got the SYSLOG line right after que problem. I hope this could help you help me. > > Dec 6 06:33:24 bsbserv007 nagios: Warning: The check of service 'CPU LOAD' on host 'BSBSERV017' could not be performed due to a fork() error. The check will be rescheduled. > > > Thanks for the help. > > > Fernando Shayani > fernando.shayani at bsb.politec.com.br > (61) 3038-6951 > POLITEC - Bras?lia - DF > > -----Original Message----- > From: Marcel Mitsuto Fucatu Sugano [mailto:msugano at uolinc.com] > Sent: segunda-feira, 5 de dezembro de 2005 15:33 > To: Fernando Shayani > Cc: Eli Stair; nagios-users at lists.sourceforge.net > Subject: RE: [Nagios-users] Nagios HANGS scheduling info > > On Mon, 2005-12-05 at 09:01 -0300, Fernando Shayani wrote: > > Well, I upgraded to b6, enabled the Orphaned Service and still > > hangs... > > > > Is there any other clue? > > > > Fernando Shayani > > fernando.shayani at bsb.politec.com.br > > (61) 3038-6951 > > POLITEC - Bras?lia - DF > > Have you followed the instructions available at: > http://nagios.sourceforge.net/docs/2_0/configmain.html ? > > Read that and check the following configs: > Inter-check sleep time > Service inter-check delay method > Maximum service check spread > Service interleave factor > Maximum concurrent service checks > Service reaper frequency > Host inter-check delay method > Maximum host check spread > Timing interval length > Auto-rescheduling option > Auto-rescheduling interval > Auto-rescheduling window > > Agressive host checking option > > Which are related to the scheduling options, and might be the source of your problem. Have you messed up those variables? Whatever the case, post the values of this configuration variables. > > HTH, > -- > Marcel Mitsuto Fucatu Sugano Universo Online S.A. -- http://www.uol.com.br > > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_idv37&alloc_id865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mfatihakbulut at gmail.com Thu Dec 8 10:51:37 2005 From: mfatihakbulut at gmail.com (Mehmet Fatih AKBULUT) Date: Thu, 8 Dec 2005 11:51:37 +0200 Subject: help please. browser problem. In-Reply-To: References: Message-ID: Hi Hugo. yes thats not error but i dont why i am getting source of nagios main page ? cos i've 2 computers on one freebsd installed and on the other debian. nagios pages works fine on Freebsd but something wrong with debian. couldnt get any page yet. always getting source codes that says use a browser that supports Frames. Regards. (confused cos even the version of browsers are the same on two different comps.) -------------- next part -------------- An HTML attachment was scrubbed... URL: From billford at billford.com Thu Dec 8 15:24:18 2005 From: billford at billford.com (Bill Mathews) Date: Thu, 08 Dec 2005 09:24:18 -0500 Subject: Nagios and a Microsoft "cluster" Message-ID: <43984212.3070706@billford.com> We are trying to monitor (with Nagios, of course) a Microsft web server set up in a Microsoft "clustered" environment. We have a couple of issues, one is that when try to ping the cluster address we see duplicate ping responses: 64 bytes from www.example.com (1.2.3.4): icmp_seq=3 ttl=116 time=32.5 ms 64 bytes from www.example.com (1.2.3.4): icmp_seq=3 ttl=116 time=32.8 ms (DUP!) The other issue is that routinely (a few times a day) this site shows as down, even though it isn't (confirmed through observation). I am somewhat unfamiliar with MS-Clustering but from the description on their website it's a lot like any other kind of clustering (we monitor a lot of clusters). Anyone every see anything similar? I have some theories but none really make a lot of sense in any cluster I am familiar with. Thanks in advance for any assistance. Bill ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ssmith at siu.edu Thu Dec 8 16:30:49 2005 From: ssmith at siu.edu (Scott Smith) Date: Thu, 08 Dec 2005 09:30:49 -0600 Subject: Critical Audible Alarms with Unreachable Hosts In-Reply-To: <4397A559.1060301@kc.rr.com> References: <439726E3.5060203@siu.edu> <4397A559.1060301@kc.rr.com> Message-ID: <439851A9.5090608@siu.edu> That is a possibility (only enabling the Host Down sounds). We like having the sounds for Critical and Warnings as well. Otherwise the NOC typically will not look at the Nagios webpage. Hence why they stated the Sound was just 'so' important to them. If all the devices are Acked then there are no sounds from the webpage. I was just thinking that it made sense to me if Nagios knows where the Network Outage is, and is smart enough to only send one Notification, then why doesn't it do the same thing on the webpage sounds? It appears to me that the same intelligence isn't transversed to the CGI's. At least that's my opinion at this point, I can't seem to find any solid answer on this question yet. Steven Danz wrote: > Scott Smith wrote: > >> I have been using Nagios for a while (several years, and even when it >> was NetSaint). Just recently we had our NOC switchover from HP >> OpenView to Nagios. In doing so we updated all the Hosts from our >> network into the Nagios config, some 990 plus Hosts, and now over >> 1000 services. >> >> Here is my issue. When we have a building go down, say 5 Switches >> (Hosts), and we have the Parents Relationship setup properly we see 1 >> Host Down, and 4 Hosts Unreachable. We only get one Email >> Notification about the single Host being down, which is what we >> want. However, the NOC has recently asked use to enable the Sound >> for Nagios, which we did with the cgi.cfg file. The problem is that >> when the NOC Acknowledges the root problem, and services from the 1 >> Host that is down, we keep getting the Critical.wav alert on our >> webpages. They have to goto ALL the Unreachable devices services, >> and Acknowledge them as well. >> In this example isn't to bad, however some of our buildings have over >> 100 Hosts, and if the main router goes down for that building our NOC >> would have a nightmare trying to Ack each Host. Ultimately I would >> think that they should just Ack the Root problem, and then the >> Unreachable Children (so to speak) would not need Acking. The more >> interesting part is that Nagios seems to understand this by only >> sending one Email, but the webpages don't. >> >> Any ideas on this one? > > > From what I see, aren't there different sounds for host down, host > unreachable, service critical, service warning, service unknown, and > normal? It would seem that you could just enable the host down sound, > and only get 1 audible alert. Is that what isn't working, or am I > missing the point? > > Is it only making sounds on the page reloads? I wasn't sure if the > sounds were generated if the condition was ack-ed or not. > > Just some thoughts. > > -Steven > -- Scott Smith Network Engineering Services Southern Illinois University Carbondale ssmith at siu.edu ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Alethad at unch.unc.edu Thu Dec 8 16:28:00 2005 From: Alethad at unch.unc.edu (Chrietzberg, Aletha) Date: Thu, 8 Dec 2005 10:28:00 -0500 Subject: Nagios/Plugins on AIX 5.2 - reply Message-ID: Got it. Here's what I'm getting now. ---------------------------------------------------------------------------- ------------------ $ make all cd ./base && make gcc -O2 -I/usr/local/lib -D_ALL_SOURCE -D_ANSI_C_SOURCE -D_POSIX_SOURCE -qmaxmem=16384 -qnoansialias -DUSE_NATIVE_DLOPEN -DNEED_PTHREAD_INIT -q32 -D_LARGE_FILES -qlonglong -I/usr/opt/perl5/lib/5.8.0/aix-thread-multi/CORE -DHAVE_CONFIG_H -DNSCORE -c broker.c gcc: unrecognized option `-qmaxmem=16384' gcc: unrecognized option `-qnoansialias' gcc: unrecognized option `-q32' gcc: unrecognized option `-qlonglong' In file included from ../include/config.h:228, from broker.c:26: /usr/include/pthread.h:556: error: parse error before '*' token /usr/include/pthread.h:559: error: parse error before '*' token /usr/include/pthread.h:563: error: parse error before '*' token /usr/include/pthread.h:566: error: parse error before '*' token /usr/include/pthread.h:569: error: parse error before '*' token /usr/include/pthread.h:572: error: parse error before '*' token /usr/include/pthread.h:575: error: parse error before '*' token /usr/include/pthread.h:578: error: parse error before '*' token /usr/include/pthread.h:581: error: parse error before '*' token /usr/include/pthread.h:585: error: parse error before '*' token /usr/include/pthread.h:588: error: parse error before '*' token In file included from broker.c:27: ../include/common.h:31:1: unterminated #ifdef In file included from ../include/nagios.h:26, from broker.c:29: ../include/common.h:31:1: unterminated #ifdef In file included from ../include/objects.h:31, from ../include/nagios.h:28, from broker.c:29: ../include/common.h:31:1: unterminated #ifdef make: 1254-004 The error code from the last command is 1. Stop. ---------------------------------------------------------------------------- ----------------------- I edited common.h as your notes instruct. I haven't changed anything else in it. Are the gcc errors something I can correct or ignore? Any suggestions on the the pthread header? -----Original Message----- From: Charles Dee Rice [mailto:cdrice at pobox.com] Sent: Wednesday, December 07, 2005 5:41 PM To: Chrietzberg, Aletha Subject: RE: Nagios/Plugins on AIX 5.2 - reply I've attached two files -- install-notes-aix-server.txt install-logs-aix-server.txt I converted them to DOS format (CR+LF), since I didn't know if your mail client is on a PC. The "notes" file is a crib sheet listing the commands and configuration options I used when building the components on AIX and Linux. I've hidden hostnames and internal network info with X's. Note that I have moved around a few things from the standard installation model, so be sure to "un-modify" things before you cut-and-paste anything from here. :) I also change the nagios user and group to "infr", an internal account we use here for non-privlidged sysadmin tasks. All the RPMs I refer to are the IBM-supplied RPMS for AIX52 as part of the Linux Affinity project. Let me know if you don't have the URL and I'll send you the link. We are in the process updating our SSL libs here, so you'll see some notes about things to change for different versions of SSL. Take whatever actions are appropriate for your libraries. Some of the mkitab lines only apply if you are not using tcpwrappers. If you are using tcpwrappers, configure nrpe and ncsu through tcpwrappers. In the log file, you'll see the command lines are truncated because they are a literal cut-and-paste from the xterm windows where I ran the commands, rather than scripts generated by the script command. Let me know if you have questions or are still having trouble! - Chuck --- "Chrietzberg, Aletha" wrote: > Hey Chuck, > Yes, I'm still having trouble. > That would be awesome if you have any notes. I'll take anything I can get > my hands on at this point. I'm trying to install the Nagios and the > plugins. > > Which version of Nagios did you install? 1.2 or 2.0b4? > Just send me whatever you can. > > Thanks. > > Never let the fear of striking out get in your way. > Babe Ruth > > > Aletha D Chrietzberg > UNC Healthcare > ISD - AIX Tech Services > (919) 966-4888 > > > > -----Original Message----- > From: Charles Dee Rice [mailto:cdrice at pobox.com] > Sent: Wednesday, December 07, 2005 3:10 PM > To: Alethad at unch.unc.edu > Subject: Nagios/Plugins on AIX 5.2 > > > > Hi, this is Chuck in Apache Junction, AZ. I saw a recent post you sent to > the nagios-users mailing list concerning having some trouble building > nagios > on AIX 5.2. > > I have nagios, nagios-plugins, nrpe and ncsa all built on AIX 5.2 ML04 with > detailed notes and build logs -- I'd be happy to send them to you if you're > still having trouble. > > Drop me a note or give me a call at 480-235-8069 if you want. > > - Chuck ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rutski89 at gmail.com Thu Dec 8 16:50:39 2005 From: rutski89 at gmail.com (Patrick Rutkowski) Date: Thu, 8 Dec 2005 10:50:39 -0500 Subject: check_disk_smb failing Message-ID: I have both check_http and check_disk_smb in /usr/lib/nagios/plugins. If I try to load a services.cfg with "check_command" set to "check_http", things work fine; however, trying "check_disk_smb" fails with: Error: Service check command 'check_disk_smb' specified in service 'WikiDB/Jason' for host 'jason' not defined anywhere! What should I do? ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From todd_barbera at wgbh.org Thu Dec 8 16:56:00 2005 From: todd_barbera at wgbh.org (Todd Barbera) Date: Thu, 08 Dec 2005 10:56:00 -0500 Subject: check_disk_smb failing In-Reply-To: References: Message-ID: <000601c5fc0f$e2e00b90$620a010a@GRAPEAPE> It needs to be defined in your checkcommands.cfg or misccommands.cfg before you define it in your services.cfg. Todd -----Original Message----- From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Patrick Rutkowski Sent: Thursday, December 08, 2005 10:51 AM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] check_disk_smb failing I have both check_http and check_disk_smb in /usr/lib/nagios/plugins. If I try to load a services.cfg with "check_command" set to "check_http", things work fine; however, trying "check_disk_smb" fails with: Error: Service check command 'check_disk_smb' specified in service 'WikiDB/Jason' for host 'jason' not defined anywhere! What should I do? ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=ick _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Andrew.Laden at tudor.com Thu Dec 8 17:06:14 2005 From: Andrew.Laden at tudor.com (Andrew Laden) Date: Thu, 8 Dec 2005 11:06:14 -0500 Subject: Passive service freshness check syntax Message-ID: <56EAA5BC64E6C34F8C9EE6725D4A2DFA024DE98F@tudor.com> Did the methodology for passive service freshness checks change at some point. I have my config set up as Check_period none passive_checks_enabled 1 Active_checks_enabled 1 Which if I recall was to allow active checks to be forced, as the freshness check would do, not not run on a regualar basis. Yet from the recent docs, it looks like the proper method should be. Check_period 24x7 Passive_checks_enabled 1 Active_checks_enabled 0 Which is it? I am trying to debug why freshness checking is no longer working. (2.0b4 and 2.0b6) -Andrew -------------- next part -------------- An HTML attachment was scrubbed... URL: From Andrew.Laden at tudor.com Thu Dec 8 17:34:58 2005 From: Andrew.Laden at tudor.com (Andrew Laden) Date: Thu, 8 Dec 2005 11:34:58 -0500 Subject: Passive service freshness check syntax Message-ID: <56EAA5BC64E6C34F8C9EE6725D4A2DFA024DE999@tudor.com> As a follow, up, setting it as below caused the freshness checks to start working. However, in the GUI, they are listed as disabled problems, not unhandled problems. Which is an issue. When a freshness check trigger a critical, I want it to be an unhandled problem, not a disabled one. (Which is probably why I was using the original syntax.) So I guess the follow up is did anything change between 1.x and 2.x that would have causes the 1st syntax to stop working? Will check_period now block freshess triggered active checks? _____ From: Andrew Laden [mailto:Andrew.Laden at tudor.com] Sent: Thursday, December 08, 2005 11:06 AM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] Passive service freshness check syntax Did the methodology for passive service freshness checks change at some point. I have my config set up as Check_period none passive_checks_enabled 1 Active_checks_enabled 1 Which if I recall was to allow active checks to be forced, as the freshness check would do, not not run on a regualar basis. Yet from the recent docs, it looks like the proper method should be. Check_period 24x7 Passive_checks_enabled 1 Active_checks_enabled 0 Which is it? I am trying to debug why freshness checking is no longer working. (2.0b4 and 2.0b6) -Andrew -------------- next part -------------- An HTML attachment was scrubbed... URL: From rutski89 at gmail.com Thu Dec 8 20:47:03 2005 From: rutski89 at gmail.com (Patrick Rutkowski) Date: Thu, 8 Dec 2005 14:47:03 -0500 Subject: resources.cfg Message-ID: I'm running nagios 1.3 and using the whole $stuff$ idea with resources.cfg breaks my configs, is it supposed to work on 1.3 anyway, or is this a 2.x specific feature? ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From aleksey.tsalolikhin at gmail.com Thu Dec 8 20:51:51 2005 From: aleksey.tsalolikhin at gmail.com (Aleksey Tsalolikhin) Date: Thu, 8 Dec 2005 11:51:51 -0800 Subject: resources.cfg In-Reply-To: References: Message-ID: <72a19efd0512081151v7e0a7663kd5194a29e2fc7d76@mail.gmail.com> Hi, Patrick. I had some trouble with $USER$ macros too at start, if that's what you are talking about. But I got it working with some help from the list. How does it break your configs? -at On 12/8/05, Patrick Rutkowski wrote: > > I'm running nagios 1.3 and using the whole $stuff$ idea with > resources.cfg breaks my configs, is it supposed to work on 1.3 anyway, > or is this a 2.x specific feature? > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log > files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_idv37&alloc_id865&opclick > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: From ssmith at siu.edu Thu Dec 8 20:59:30 2005 From: ssmith at siu.edu (Scott Smith) Date: Thu, 08 Dec 2005 13:59:30 -0600 Subject: Acknowledge Scripts Message-ID: <439890A2.8010204@siu.edu> I was wondering if anyone would have a script they wrote, or copied from somewhere, to acknowledge multiple Devices/Hosts that are down at one time instead of having to goto each individual Host and Ack. I.e., one building is down without power, and we have 100 Switches/Routers in this building. Instead of our NOC Acking 100 times, it would be nice if they could Ack the Root (Parent) problem and it would Automatically Ack all the Children. Any ideas on this one? If I need to post this in a different List, please advise which one and I'll post it there. Thanks in advance. -- Scott Smith Network Engineering Services Southern Illinois University Carbondale ssmith at siu.edu ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Mark.Pearson at uftwf.org Thu Dec 8 21:04:20 2005 From: Mark.Pearson at uftwf.org (Mark Pearson) Date: Thu, 8 Dec 2005 15:04:20 -0500 Subject: check_ssmtp (check_tcp) SSL Errors Message-ID: <932A04DB269BFF4F886C2ACD54C62CAC02C6B192@uftwfexchsvr.uftmaster.org> Hi everyone, I am up to adding our mail servers to our nagios monitoring list and hit a roadblock with monitoring sendmail. I have tried using the following command at the cmdline: ./check_ssmtp -w 5 -c 10 -H mail -p 25 But I get the following error message: CRITICAL - Cannot make SSL connection 29889:error:140770FC:SSL Routines:SSL23_GET_SERVER_HELLO:unknown protocol:s23_clnt.c:475: I can connect to mail from the Nagios monitor via port 25 and I have also added the host to the sendmail access file. Apparently I am missing something, ideas? BTW, we have our mail servers clustered so we need to monitor this port vs. the actually service since the sendmail service is running on the active node, thus we cannot use check_nrpe. Thoughts anyone? ******************************************************************************* The views, opinions, and judgments expressed in this message are solely those of the author. The message contents have not been reviewed or approved by the UFT Welfare Fund. ******************************************************************************* -------------- next part -------------- An HTML attachment was scrubbed... URL: From aleksey.tsalolikhin at gmail.com Thu Dec 8 21:04:33 2005 From: aleksey.tsalolikhin at gmail.com (Aleksey Tsalolikhin) Date: Thu, 8 Dec 2005 12:04:33 -0800 Subject: resources.cfg In-Reply-To: References: <72a19efd0512081151v7e0a7663kd5194a29e2fc7d76@mail.gmail.com> Message-ID: <72a19efd0512081204y69773c53lc3d999c5e9b9d8c1@mail.gmail.com> On 12/8/05, Patrick Rutkowski wrote: > > If I put $USER1$ where full path is supposed to be in > checkcommands.cfg then the command starts failing and returning wonky > undefined results. If I acutally type out the full path instead of > using the macro, things work again. Sames goes if I make a macro for > passwords. That's the trouble I was having. Do you have resource.cfg defined in nagios.cfg? Make sure it's not commented out. It's marked as optional by the comment describing it, but it's not if you want to use the default config, which uses the USER macros. -------------- next part -------------- An HTML attachment was scrubbed... URL: From todd_barbera at wgbh.org Thu Dec 8 21:09:55 2005 From: todd_barbera at wgbh.org (Todd Barbera) Date: Thu, 08 Dec 2005 15:09:55 -0500 Subject: check_ssmtp (check_tcp) SSL Errors In-Reply-To: <932A04DB269BFF4F886C2ACD54C62CAC02C6B192@uftwfexchsvr.uftmaster.org> References: <932A04DB269BFF4F886C2ACD54C62CAC02C6B192@uftwfexchsvr.uftmaster.org> Message-ID: <003b01c5fc33$5b2cee10$620a010a@GRAPEAPE> Why not use check_smtp instead? I don't think port 25 is an SSL port within sendmail, although I'm no sendmail expert. I use check_smtp in my configuration and it works fine. Todd -----Original Message----- From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Mark Pearson Sent: Thursday, December 08, 2005 3:04 PM To: 'nagios-users at lists.sourceforge.net' Subject: [Nagios-users] check_ssmtp (check_tcp) SSL Errors Hi everyone, I am up to adding our mail servers to our nagios monitoring list and hit a roadblock with monitoring sendmail. I have tried using the following command at the cmdline: ./check_ssmtp -w 5 -c 10 -H mail -p 25 But I get the following error message: CRITICAL - Cannot make SSL connection 29889:error:140770FC:SSL Routines:SSL23_GET_SERVER_HELLO:unknown protocol:s23_clnt.c:475: I can connect to mail from the Nagios monitor via port 25 and I have also added the host to the sendmail access file. Apparently I am missing something, ideas? BTW, we have our mail servers clustered so we need to monitor this port vs. the actually service since the sendmail service is running on the active node, thus we cannot use check_nrpe. Thoughts anyone? **************************************************************************** *** The views, opinions, and judgments expressed in this message are solely those of the author. The message contents have not been reviewed or approved by the UFT Welfare Fund. **************************************************************************** *** -------------- next part -------------- An HTML attachment was scrubbed... URL: From scotjenkins at gmail.com Thu Dec 8 21:14:59 2005 From: scotjenkins at gmail.com (Scot Jenkins) Date: Thu, 8 Dec 2005 14:14:59 -0600 Subject: Acknowledge Scripts In-Reply-To: <439890A2.8010204@siu.edu> References: <439890A2.8010204@siu.edu> Message-ID: On 12/8/05, Scott Smith wrote: > I was wondering if anyone would have a script they wrote, or copied from > somewhere, to acknowledge multiple Devices/Hosts that are down at one > time instead of having to goto each individual Host and Ack. > > I.e., one building is down without power, and we have 100 > Switches/Routers in this building. Instead of our NOC Acking 100 times, > it would be nice if they could Ack the Root (Parent) problem and it > would Automatically Ack all the Children. Is that really necessary? If you use the "parents" option, devices behind the main one that is down will show as unreachable. I think you could then just acknowledge the main device that shows "down". This of course assumes everything in that building is behind one device. If not, you might have to acknowledge each of the main devices, but if you do it right, you shouldn't have to acknowledge _all_ switches/routers, just the few that everything else sits behind. I mapped out my network tree on paper first. Then I check the statusmap.cgi to see that my config matches how I think it should look. scot ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Mark.Pearson at uftwf.org Thu Dec 8 21:15:21 2005 From: Mark.Pearson at uftwf.org (Mark Pearson) Date: Thu, 8 Dec 2005 15:15:21 -0500 Subject: check_ssmtp (check_tcp) SSL Errors Message-ID: <932A04DB269BFF4F886C2ACD54C62CAC02C6B194@uftwfexchsvr.uftmaster.org> Todd, I don't have that plugin in my listing, which I thought was kind of odd but I guess that check_ssmtp = Secure SMTP. Further research has found that the check_ssmtp plugin is the check_tcp plugin yet forces the -S (SSL) check. Which explains a lot. I guess I could just use check_tcp unless someone can tell me where to get check_smtp from, since I thought I downloaded the latest version of the nagios plugins. -Mark _____ From: Todd Barbera [mailto:todd_barbera at wgbh.org] Sent: Thursday, December 08, 2005 3:10 PM To: 'Mark Pearson'; nagios-users at lists.sourceforge.net Subject: RE: [Nagios-users] check_ssmtp (check_tcp) SSL Errors Why not use check_smtp instead? I don't think port 25 is an SSL port within sendmail, although I'm no sendmail expert. I use check_smtp in my configuration and it works fine. Todd -----Original Message----- From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Mark Pearson Sent: Thursday, December 08, 2005 3:04 PM To: 'nagios-users at lists.sourceforge.net' Subject: [Nagios-users] check_ssmtp (check_tcp) SSL Errors Hi everyone, I am up to adding our mail servers to our nagios monitoring list and hit a roadblock with monitoring sendmail. I have tried using the following command at the cmdline: ./check_ssmtp -w 5 -c 10 -H mail -p 25 But I get the following error message: CRITICAL - Cannot make SSL connection 29889:error:140770FC:SSL Routines:SSL23_GET_SERVER_HELLO:unknown protocol:s23_clnt.c:475: I can connect to mail from the Nagios monitor via port 25 and I have also added the host to the sendmail access file. Apparently I am missing something, ideas? BTW, we have our mail servers clustered so we need to monitor this port vs. the actually service since the sendmail service is running on the active node, thus we cannot use check_nrpe. Thoughts anyone? **************************************************************************** *** The views, opinions, and judgments expressed in this message are solely those of the author. The message contents have not been reviewed or approved by the UFT Welfare Fund. **************************************************************************** *** ******************************************************************************* The views, opinions, and judgments expressed in this message are solely those of the author. The message contents have not been reviewed or approved by the UFT Welfare Fund. ******************************************************************************* -------------- next part -------------- An HTML attachment was scrubbed... URL: From project10 at project10.net Thu Dec 8 21:15:45 2005 From: project10 at project10.net (Shawn Gervais) Date: Thu, 08 Dec 2005 14:15:45 -0600 Subject: Nagios 1.3: Services fail to HARD immediately Message-ID: <43989471.9030109@project10.net> Greetings list, On a new installation of Nagios 1.3, I am seeing services go from HARD OK immediately to HARD CRITICAL when they fail. I have max_check_attempts set to 3, and the Nagios web interface does indeed display 'Current attempt: 1/3'. Oddly enough, if I HUP nagios the check attempts values are displayed properly in the CGI. They will eventually revert to 1/3 when the next active check is done, so it seems. Does anyone have any idea why Nagios is only requiring one check to fail before putting my services in HARD states? -Shawn ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ssmith at siu.edu Thu Dec 8 21:23:51 2005 From: ssmith at siu.edu (Scott Smith) Date: Thu, 08 Dec 2005 14:23:51 -0600 Subject: Acknowledge Scripts In-Reply-To: References: <439890A2.8010204@siu.edu> Message-ID: <43989657.201@siu.edu> An HTML attachment was scrubbed... URL: From project10 at project10.net Thu Dec 8 21:20:31 2005 From: project10 at project10.net (Shawn Gervais) Date: Thu, 08 Dec 2005 14:20:31 -0600 Subject: Nagios 1.3: Services fail to HARD immediately In-Reply-To: <43989471.9030109@project10.net> References: <43989471.9030109@project10.net> Message-ID: <4398958F.6080207@project10.net> > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. ::: Messages without supporting info will risk > being sent to /dev/null Linux 2.6.9-22.0.1.EL (CentOS 4.2) Plugins: nagios-plugins 1.4.2 -Shawn ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From todd_barbera at wgbh.org Thu Dec 8 21:24:26 2005 From: todd_barbera at wgbh.org (Todd Barbera) Date: Thu, 08 Dec 2005 15:24:26 -0500 Subject: check_ssmtp (check_tcp) SSL Errors In-Reply-To: <932A04DB269BFF4F886C2ACD54C62CAC02C6B194@uftwfexchsvr.uftmaster.org> References: <932A04DB269BFF4F886C2ACD54C62CAC02C6B194@uftwfexchsvr.uftmaster.org> Message-ID: <004601c5fc35$629a1ae0$620a010a@GRAPEAPE> I have 1.4.1 of the plugins and it was included in the distribution. I checked the REQUIREMENTS file and didn't see any prereqs for getting it to compile and install. I do use check_tcp to verify that our Meeting Maker software is listening on the proper port, so that's probably a valid option. Todd -----Original Message----- From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Mark Pearson Sent: Thursday, December 08, 2005 3:15 PM To: 'Todd Barbera'; 'nagios-users at lists.sourceforge.net' Subject: RE: [Nagios-users] check_ssmtp (check_tcp) SSL Errors Todd, I don't have that plugin in my listing, which I thought was kind of odd but I guess that check_ssmtp = Secure SMTP. Further research has found that the check_ssmtp plugin is the check_tcp plugin yet forces the -S (SSL) check. Which explains a lot. I guess I could just use check_tcp unless someone can tell me where to get check_smtp from, since I thought I downloaded the latest version of the nagios plugins. -Mark _____ From: Todd Barbera [mailto:todd_barbera at wgbh.org] Sent: Thursday, December 08, 2005 3:10 PM To: 'Mark Pearson'; nagios-users at lists.sourceforge.net Subject: RE: [Nagios-users] check_ssmtp (check_tcp) SSL Errors Why not use check_smtp instead? I don't think port 25 is an SSL port within sendmail, although I'm no sendmail expert. I use check_smtp in my configuration and it works fine. Todd -----Original Message----- From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Mark Pearson Sent: Thursday, December 08, 2005 3:04 PM To: 'nagios-users at lists.sourceforge.net' Subject: [Nagios-users] check_ssmtp (check_tcp) SSL Errors Hi everyone, I am up to adding our mail servers to our nagios monitoring list and hit a roadblock with monitoring sendmail. I have tried using the following command at the cmdline: ./check_ssmtp -w 5 -c 10 -H mail -p 25 But I get the following error message: CRITICAL - Cannot make SSL connection 29889:error:140770FC:SSL Routines:SSL23_GET_SERVER_HELLO:unknown protocol:s23_clnt.c:475: I can connect to mail from the Nagios monitor via port 25 and I have also added the host to the sendmail access file. Apparently I am missing something, ideas? BTW, we have our mail servers clustered so we need to monitor this port vs. the actually service since the sendmail service is running on the active node, thus we cannot use check_nrpe. Thoughts anyone? **************************************************************************** *** The views, opinions, and judgments expressed in this message are solely those of the author. The message contents have not been reviewed or approved by the UFT Welfare Fund. **************************************************************************** *** **************************************************************************** *** The views, opinions, and judgments expressed in this message are solely those of the author. The message contents have not been reviewed or approved by the UFT Welfare Fund. **************************************************************************** *** -------------- next part -------------- An HTML attachment was scrubbed... URL: From jrattray at pnhs.purdue.edu Thu Dec 8 21:25:40 2005 From: jrattray at pnhs.purdue.edu (Jeff Rattray) Date: Thu, 08 Dec 2005 15:25:40 -0500 Subject: using LDAP-based user authentication Message-ID: <43985074.30456.48EDDA6@localhost> Greetings, everyone. We (Purdue College of Pharmacy, Nursing and Health Sciences) have been using nagios for half of a year now, and it has been really handy. Currently, we are trying to change the way nagios authenticates users. Purdue has implemented a new security policy that says that all passwords, whatever they are for, must be changed every 30 days. Leaving aside the difficulties about changing, say, BIOS passwords in the student labs, we are trying to reduce the number of usernames and passwords that we have to make it easier to keep up with the password-changing requirement. I have successfully configured mod_authz_ldap to handle authentication on our nagios server, and I can log into the nagios pages. Unfortunately, instead of returning a simple username like 'jrattray', the CGIs see my username as 'uid=jrattray,ou=people,o=pnhs.purdue.edu' . I cannot authorize myself to see any information, because when I put that string into cgi.cfg, nagios breaks it up at the commas into 3 different users. (The equals signs may be confusing it also.) Here are my questions: (1) Does anyone know how to escape this username so that it works in cgi.cfg? I have tried entering it as a quoted string, putting slashes in front of the commas, etc. (2) Failing that, has anyone had any experience using nagios with mod_auth_ldap? I used mod_authz_ldap (a third-party product) only because it came with Red Hat EL, but I don't want to bother compiling the official apache module if it does exactly the same thing. (3) Failing THAT, how does one submit this to the wish list for nagios? Thanks, Jeff Rattray -- Jeff Rattray, Ph.D. Manager, Web-based Instructional Technology Purdue University College of Pharmacy, Nursing and Health Sciences ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From teng at dataway.com Thu Dec 8 21:27:24 2005 From: teng at dataway.com (Tedman Eng) Date: Thu, 8 Dec 2005 12:27:24 -0800 Subject: Acknowledge Scripts Message-ID: <37ED92F9890FAF4BB947613C66FF8B1A08BB2C6C@dw-mail.dataway.com> You could add a parent to those hosts in order to group them. Then the 100 powered-off hosts would be marked unreachable and you'd only have to acknowledge that one parent host. The 'parent' host does not have to be the real network-parent. Just some reliable host that will help indicate the power status of the building. (an access point, one of the switches, etc.) We use the same concept to isolate large groups of devices that we need to 'toggle off' for one reason or another. > -----Original Message----- > From: Scott Smith [mailto:ssmith at siu.edu] > Sent: Thursday, December 08, 2005 12:00 PM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] Acknowledge Scripts > > > I was wondering if anyone would have a script they wrote, or > copied from > somewhere, to acknowledge multiple Devices/Hosts that are down at one > time instead of having to goto each individual Host and Ack. > > I.e., one building is down without power, and we have 100 > Switches/Routers in this building. Instead of our NOC Acking > 100 times, > it would be nice if they could Ack the Root (Parent) problem and it > would Automatically Ack all the Children. > > Any ideas on this one? If I need to post this in a different List, > please advise which one and I'll post it there. > Thanks in advance. > -- > Scott Smith > Network Engineering Services > Southern Illinois University Carbondale > ssmith at siu.edu > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep > through log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. > DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS > when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jeremy.ashton at bell.ca Thu Dec 8 21:39:29 2005 From: jeremy.ashton at bell.ca (jeremy.ashton at bell.ca) Date: Thu, 8 Dec 2005 15:39:29 -0500 Subject: Host Undetermined Status Message-ID: <4D0A3E3121A0504EAEF0FBA7B9576C26053DE240@toroondc914.bell.corp.bce.ca> We have been testing out Nagios 1.3 in a distributed monitoring fashion. Everything seems to be working fine with the exception of the reports listing the devices in a undetermined status. This is the case whether the host is actively checking or receiving passive check results. Below listed is a chunk of our Availability report for some of our actively monitored hosts. Hostgroup 'CustomerB' Host State Breakdowns: Host % Time Up % Time Down % Time Unreachable % Time Undetermined hostA 0.000% (0.000%) 0.000% (0.000%) 0.000% (0.000%) 100.000% hostB 0.000% (0.000%) 0.000% (0.000%) 0.000% (0.000%) 100.000% hostC 0.000% (0.000%) 0.000% (0.000%) 0.000% (0.000%) 100.000% hostD 0.000% (0.000%) 0.000% (0.000%) 0.000% (0.000%) 100.000% hostE 0.000% (0.000%) 0.000% (0.000%) 0.000% (0.000%) 100.000% hostF 0.000% (0.000%) 0.000% (0.000%) 0.000% (0.000%) 100.000% hostG 0.000% (0.000%) 0.000% (0.000%) 0.000% (0.000%) 100.000% hostH 0.000% (0.000%) 0.000% (0.000%) 0.000% (0.000%) 100.000% hostI 0.000% (0.000%) 0.000% (0.000%) 0.000% (0.000%) 100.000% hostJ 0.000% (0.000%) 0.000% (0.000%) 0.000% (0.000%) 100.000% hostK 0.000% (0.000%) 0.000% (0.000%) 0.000% (0.000%) 100.000% Even though the hosts are all showing in an up status on the main page... Any ideas on what I can tweak to make this work? Thanks in advance, Jeremy ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From estair at ilm.com Thu Dec 8 21:41:49 2005 From: estair at ilm.com (Eli Stair) Date: Thu, 08 Dec 2005 12:41:49 -0800 Subject: 2.0b6, bug in "notifications_enabled" handling (sends notifications when set to 0 Message-ID: <43989A8D.7070109@ilm.com> Can someone confirm my belief on this is proper? I used this same config several weeks ago ( just changed membership to a different hostgroup ) and I was not receiving any alerts on this hostgroup then. Since moving to 2.0b6 (from b4) I'm changing hostgroup membership back to this "non-alerting" group. I am still not receiving host down alerts, but I've started getting service alerts for hosts in it. The config is below. After changing 'notification_options' to "n" the alerts stopped altogether. Previously, it was set to "d,u" but 'notification_options' seemingly overrode it. /eli define host{ name linux-node-lowkey contact_groups cluster-admins check_command check-host-alive checks_enabled 1 check_interval 0 max_check_attempts 5 notification_interval 480 notification_period none notification_options n notifications_enabled 0 ; Host notifications are enabled event_handler_enabled 1 ; Host event handler is enabled flap_detection_enabled 0 ; Flap detection is enabled process_perf_data 0 ; Process performance data retain_status_information 0 ; Retain status information across program restarts retain_nonstatus_information 1 ; Retain non-status information across program restarts register 0 ; DONT REGISTER THIS DEFINITION - ITS NOT A REAL HOST, JUST A TEMPLATE! } System: RHEL42 x86_64 ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ssmith at siu.edu Thu Dec 8 21:50:55 2005 From: ssmith at siu.edu (Scott Smith) Date: Thu, 08 Dec 2005 14:50:55 -0600 Subject: Acknowledge Scripts In-Reply-To: <37ED92F9890FAF4BB947613C66FF8B1A08BB2C6C@dw-mail.dataway.com> References: <37ED92F9890FAF4BB947613C66FF8B1A08BB2C6C@dw-mail.dataway.com> Message-ID: <43989CAF.3050007@siu.edu> An HTML attachment was scrubbed... URL: From aleksey.tsalolikhin at gmail.com Thu Dec 8 21:52:54 2005 From: aleksey.tsalolikhin at gmail.com (Aleksey Tsalolikhin) Date: Thu, 8 Dec 2005 12:52:54 -0800 Subject: resources.cfg In-Reply-To: References: <72a19efd0512081151v7e0a7663kd5194a29e2fc7d76@mail.gmail.com> <72a19efd0512081204y69773c53lc3d999c5e9b9d8c1@mail.gmail.com> Message-ID: <72a19efd0512081252h2d74c47eo7e97e6d1dad771ce@mail.gmail.com> > > On 12/8/05, Patrick Rutkowski wrote: > > > If I put $USER1$ where full path is supposed to be in > > > checkcommands.cfg then the command starts failing and returning wonky > > > undefined results. If I acutally type out the full path instead of > > > using the macro, things work again. Sames goes if I make a macro for > > > passwords. > On 12/8/05, Aleksey Tsalolikhin wrote: > That's the trouble I was having. Do you have resource.cfg defined in > nagios.cfg? Make sure it's not commented out. It's marked as optional by > the comment describing it, but it's not if you want to use the default > config, which uses the USER macros. On 12/8/05, Patrick Rutkowski wrote: > The resources file is and was uncommented in nagios.cfg Ok. Please put your reply below mine as it makes it easier to follow the thread. It may help if you give us more information about your config, per How to Ask Questions the Smart Way http://www.catb.org/~esr/faqs/smart-questions.html#beprecise Specifically: Describe the symptoms of your problem or bug carefully and clearly. (how does the command fail? is there an error message?) Describe the environment in which it occurs (machine, OS, application, whatever). Provide your vendor's distribution and release level (e.g.: "Fedora Core 2", "Slackware 9.1", etc.). Describe the research you did to try and understand the problem before you asked the question. Describe the diagnostic steps you took to try and pin down the problem yourself before you asked the question. I am a Nagios newbie myself and doing the above when I posted my question helped make solving it fast and easy. HTH, -at ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From CWaters at jeld-wen.com Thu Dec 8 21:54:59 2005 From: CWaters at jeld-wen.com (CWaters at jeld-wen.com) Date: Thu, 8 Dec 2005 12:54:59 -0800 Subject: [BULK SPAM] using LDAP-based user authenticat ion Message-ID: <83C21E03494FF147B56468B50551782145F0F1@jwmail01.domain.jw.local> I am using authz_ldap and don't seem to have that problem. Can you show the config for your module? What is your LDAP store??? AD,OpenLDAP...I am authenticating my users against an M$ Active Directory. Works pretty well. I would include your .htaccess code as well. That way we can compare them to known working configs. Chris Waters WAN/LAN Technician JELD-WEN, Inc. Network Services Group =================== -----Original Message----- From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Jeff Rattray Sent: Thursday, December 08, 2005 12:26 PM To: nagios-users at lists.sourceforge.net Subject: [BULK SPAM] [Nagios-users] using LDAP-based user authentication Importance: Low Greetings, everyone. We (Purdue College of Pharmacy, Nursing and Health Sciences) have been using nagios for half of a year now, and it has been really handy. Currently, we are trying to change the way nagios authenticates users. Purdue has implemented a new security policy that says that all passwords, whatever they are for, must be changed every 30 days. Leaving aside the difficulties about changing, say, BIOS passwords in the student labs, we are trying to reduce the number of usernames and passwords that we have to make it easier to keep up with the password-changing requirement. I have successfully configured mod_authz_ldap to handle authentication on our nagios server, and I can log into the nagios pages. Unfortunately, instead of returning a simple username like 'jrattray', the CGIs see my username as 'uid=jrattray,ou=people,o=pnhs.purdue.edu' . I cannot authorize myself to see any information, because when I put that string into cgi.cfg, nagios breaks it up at the commas into 3 different users. (The equals signs may be confusing it also.) Here are my questions: (1) Does anyone know how to escape this username so that it works in cgi.cfg? I have tried entering it as a quoted string, putting slashes in front of the commas, etc. (2) Failing that, has anyone had any experience using nagios with mod_auth_ldap? I used mod_authz_ldap (a third-party product) only because it came with Red Hat EL, but I don't want to bother compiling the official apache module if it does exactly the same thing. (3) Failing THAT, how does one submit this to the wish list for nagios? Thanks, Jeff Rattray -- Jeff Rattray, Ph.D. Manager, Web-based Instructional Technology Purdue University College of Pharmacy, Nursing and Health Sciences ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From estair at ilm.com Thu Dec 8 22:01:30 2005 From: estair at ilm.com (Eli Stair) Date: Thu, 08 Dec 2005 13:01:30 -0800 Subject: [BULK SPAM] using LDAP-based user authenticat ion In-Reply-To: <83C21E03494FF147B56468B50551782145F0F1@jwmail01.domain.jw.local> References: <83C21E03494FF147B56468B50551782145F0F1@jwmail01.domain.jw.local> Message-ID: <43989F2A.6040108@ilm.com> I'm switching over to using auth_kerb from AD (just because I've done that, not the authz_ldap). I'm curious if you both have the username being populated properly in the "Author" portion of the CGI's when you're adding comments/ ack'ing events. I still haven't gotten any suggestions from the list on where the problem lies. Cheers, /eli CWaters at jeld-wen.com wrote: > I am using authz_ldap and don't seem to have that problem. Can you show the > config for your module? What is your LDAP store??? AD,OpenLDAP...I am > authenticating my users against an M$ Active Directory. Works pretty well. > I would include your .htaccess code as well. That way we can compare them > to known working configs. > > > Chris Waters > WAN/LAN Technician > JELD-WEN, Inc. > Network Services Group > =================== > > > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net > [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Jeff Rattray > Sent: Thursday, December 08, 2005 12:26 PM > To: nagios-users at lists.sourceforge.net > Subject: [BULK SPAM] [Nagios-users] using LDAP-based user authentication > Importance: Low > > Greetings, everyone. We (Purdue College of Pharmacy, Nursing and Health > Sciences) have been using nagios for half of a year now, and it has been > really handy. Currently, we are trying to change the way nagios > authenticates users. > Purdue has implemented a new security policy that says that all passwords, > whatever they are for, must be changed every 30 days. Leaving aside the > difficulties about changing, say, BIOS passwords in the student labs, we are > trying to reduce the number of usernames and passwords that we have to make > it easier to keep up with the password-changing requirement. > > I have successfully configured mod_authz_ldap to handle authentication on > our nagios server, and I can log into the nagios pages. Unfortunately, > instead of returning a simple username like 'jrattray', the CGIs see my > username as 'uid=jrattray,ou=people,o=pnhs.purdue.edu' . I cannot authorize > myself to see any information, because when I put that string into cgi.cfg, > nagios breaks it up at the commas into 3 different users. (The equals signs > may be confusing it > also.) > > Here are my questions: > > (1) Does anyone know how to escape this username so that it works in > cgi.cfg? > I have tried entering it as a quoted string, putting slashes in front of the > commas, etc. > > (2) Failing that, has anyone had any experience using nagios with > mod_auth_ldap? I used mod_authz_ldap (a third-party product) only because > it came with Red Hat EL, but I don't want to bother compiling the official > apache module if it does exactly the same thing. > > (3) Failing THAT, how does one submit this to the wish list for nagios? > > Thanks, > Jeff Rattray > -- > Jeff Rattray, Ph.D. > Manager, Web-based Instructional Technology Purdue University College of > Pharmacy, Nursing and Health Sciences > > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From aleksey.tsalolikhin at gmail.com Thu Dec 8 22:20:17 2005 From: aleksey.tsalolikhin at gmail.com (Aleksey Tsalolikhin) Date: Thu, 8 Dec 2005 13:20:17 -0800 Subject: resources.cfg In-Reply-To: References: <72a19efd0512081151v7e0a7663kd5194a29e2fc7d76@mail.gmail.com> <72a19efd0512081204y69773c53lc3d999c5e9b9d8c1@mail.gmail.com> <72a19efd0512081252h2d74c47eo7e97e6d1dad771ce@mail.gmail.com> Message-ID: <72a19efd0512081320h315c69dp1ce403e855bbaea0@mail.gmail.com> On 12/8/05, Patrick Rutkowski wrote: > It's fixed, I didn't know that the only alloweable user macros were > $USER1$ - $USER32$; thanks for you help ;) Well done! And you are welcome. Aleksey ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From estair at ilm.com Thu Dec 8 22:27:06 2005 From: estair at ilm.com (Eli Stair) Date: Thu, 08 Dec 2005 13:27:06 -0800 Subject: 2.0b5 initial host/service checks delayed after start (not present in 2.0b3) In-Reply-To: <20051129215903.92906.qmail@web31905.mail.mud.yahoo.com> References: <20051129215903.92906.qmail@web31905.mail.mud.yahoo.com> Message-ID: <4398A52A.3050707@ilm.com> FYI, in the time it's taking to wait for nagios to start polling anything after starting it up I decided to look at what it's doing... This would explain why it starts up and sits around not consuming any cycles but not polling. Sleep left in the code? These entries in the log each come afer a few minutes (119 and 175 seconds apart) each.. This is running on 2.0b6, x86_64 arch, compiled from source with perlcache. /eli ###FILE: nagios.log: [1134076786] Finished daemonizing... (New PID=11914) [1134076905] service_result_worker_thread(): poll(): EINTR (impossible) [1134077080] service_result_worker_thread(): poll(): EINTR (impossible) ### GDB info: Attaching to program: /usr/local/nagios/bin/nagios, process 11914 Reading symbols from /usr/lib64/perl5/5.8.5/x86_64-linux-thread-multi/CORE/libperl.so...(no debugging symbols found)...done. Loaded symbols for /usr/lib64/perl5/5.8.5/x86_64-linux-thread-multi/CORE/libperl.so Reading symbols from /lib64/libnsl.so.1...(no debugging symbols found)...done. Loaded symbols for /lib64/libnsl.so.1 Reading symbols from /lib64/libdl.so.2...(no debugging symbols found)...done. Loaded symbols for /lib64/libdl.so.2 Reading symbols from /lib64/tls/libm.so.6...(no debugging symbols found)...done. Loaded symbols for /lib64/tls/libm.so.6 Reading symbols from /lib64/libcrypt.so.1...(no debugging symbols found)...done. Loaded symbols for /lib64/libcrypt.so.1 Reading symbols from /lib64/libutil.so.1...(no debugging symbols found)...done. Loaded symbols for /lib64/libutil.so.1 Reading symbols from /lib64/tls/libpthread.so.0... (no debugging symbols found)...done. [Thread debugging using libthread_db enabled] [New Thread 182894164416 (LWP 11914)] [New Thread 1094719840 (LWP 11917)] [New Thread 1084229984 (LWP 11915)] Loaded symbols for /lib64/tls/libpthread.so.0 Reading symbols from /lib64/tls/libc.so.6...(no debugging symbols found)...done. Loaded symbols for /lib64/tls/libc.so.6 Reading symbols from /usr/lib64/libltdl.so.3...(no debugging symbols found)...done. Loaded symbols for /usr/lib64/libltdl.so.3 Reading symbols from /lib64/ld-linux-x86-64.so.2...(no debugging symbols found)...done. Loaded symbols for /lib64/ld-linux-x86-64.so.2 0x000000364700b9c5 in __nanosleep_nocancel () from /lib64/tls/libpthread.so.0 (gdb) where #0 0x000000364700b9c5 in __nanosleep_nocancel () from /lib64/tls/libpthread.so.0 #1 0x00000000004209aa in event_execution_loop () #2 0x000000000040efa0 in main () (gdb) info registers rax 0xfffffffffffffdfc -516 rbx 0x861bb0 8788912 rcx 0xffffffffffffffff -1 rdx 0x2 2 rsi 0x0 0 rdi 0x7fbffff450 548682069072 rbp 0x0 0x0 rsp 0x7fbffff410 0x7fbffff410 r8 0x0 0 r9 0x2e8a 11914 r10 0x7fbffff301 548682068737 r11 0x202 514 r12 0x7fbffff450 548682069072 r13 0xffffffff 4294967295 r14 0xffffffff 4294967295 r15 0x7fbffffa08 548682070536 rip 0x364700b9c5 0x364700b9c5 <__nanosleep_nocancel+60> eflags 0x202 514 cs 0x33 51 ss 0x2b 43 ds 0x0 0 es 0x0 0 fs 0x0 0 gs 0x0 0 Fred wrote: > I do the same thing with check_icmp except that I use sudo and create > a simple sudo entry like (see the CHECK_ICMP): > > Cmnd_Alias CHECKALLSSHKEYS = /opt/hptc/nagios/libexec/check_keys # > HP-HPTC-KeySync > Cmnd_Alias CHECKSYSLOGALERTS = > /opt/hptc/nagios/libexec/check_syslogalerts # HP-HPTC-SysLog > Cmnd_Alias CHECKSFS = /opt/hptc/nagios/libexec/check_sfs # HP-HPTC-SysLog > Cmnd_Alias CHECKLSF = /opt/hptc/nagios/libexec/check_lsf # HP-HPTC-CheckLSF > Cmnd_Alias CHECKICMP = /opt/hptc/nagios/libexec/check_icmp # > HP-HPTC-CheckICMP > nagios ALL = NOPASSWD: > CHECKALLSSHKEYS,CHECKSYSLOGALERTS,CHECKSFS,CHECKLSF,CHECKICMP # > HP-HPTC-Nagios > > I just built the 2.0b5 and hope to give it a try in the next few days on a > 700+ node system ... I am hoping that this *solves* the delay problem > that existed in the previous releases. > > -FredC > > > */Eli Stair /* wrote: > > > I'm running a fresh build of 2.0b5 on x86_64. After an initial start of > nagios, it can take up to 10 minutes for the first host or service > checks to begin. There is no CPU load by the nagios process during this > time. I have over 1000 hosts to check, and have reduced the max > host/service check spread in order to ensure that it is not "evening" > out the time. > > This problem is NOT occuring on a 2.0b3 build, with the same exact > configuration. > > After the checks DO start, it can take hours to finish. I've changed > the user to root so that I can have the host check be check_icmp -t > 1 -p > 1. > > Unfortunately, even with this situation, having anywhere between 4 and > 64 hosts go down can make the "monitoring" aspect effectively useless. > > Any suggestions on the problem of startup lag? > Any ways to further speed up the host check runs, aside from using > check_icmp? > > Thanks, > > /eli > > ### inline nagios.cfg: > > > [root at monitor02 etc]# cat nagios.cfg | egrep -v "^#|^$" > log_file=/var/log/nagios/nagios.log > cfg_file=/usr/local/nagios/etc/checkcommands.cfg > cfg_file=/usr/local/nagios/etc/misccommands.cfg > cfg_dir=/usr/local/nagios/etc/config > cfg_file=/usr/local/nagios/etc/timeperiods.cfg > cfg_file=/usr/local/nagios/etc/contacts.cfg > cfg_file=/usr/local/nagios/etc/contactgroups.cfg > cfg_file=/usr/local/nagios/etc/hosts.cfg > cfg_file=/usr/local/nagios/etc/hostgroups.cfg > cfg_file=/usr/local/nagios/etc/customcommands.cfg > cfg_file=/usr/local/nagios/etc/services.cfg > object_cache_file=/usr/local/nagios/var/objects.cache > resource_file=/usr/local/nagios/etc/resource.cfg > status_file=/usr/local/nagios/var/status.dat > nagios_user=root > nagios_group=root > check_external_commands=1 > command_check_interval=-1 > command_file=/usr/local/nagios/var/rw/nagios.cmd > comment_file=/usr/local/nagios/var/comments.dat > downtime_file=/usr/local/nagios/var/downtime.dat > lock_file=/usr/local/nagios/var/nagios.lock > temp _file=/usr/local/nagios/var/nagios.tmp > event_broker_options=-1 > log_rotation_method=d > log_archive_path=/var/log/nagios/archives > use_syslog=1 > log_notifications=1 > log_service_retries=1 > log_host_retries=1 > log_event_handlers=1 > log_initial_states=0 > log_external_commands=1 > log_passive_checks=1 > service_inter_check_delay_method=s > max_service_check_spread=15 > service_interleave_factor=s > host_inter_check_delay_method=s > max_host_check_spread=10 > max_concurrent_checks=0 > service_reaper_frequency=15 > auto_reschedule_checks=0 > auto_rescheduling_interval=30 > auto_rescheduling_window=180 > sleep_time=0.25 > service_check_timeout=60 > host_check_timeout=30 > event_handler_timeout=30 > notification_timeout=30 > ocsp_timeout=5 > perfdata_timeout=5 > retain_state_information=1 > state_retention_file=/usr/local/nagios/var/retention.dat > retention_update_interval=0 > use_retained_program_state=1 > use_retained_scheduling_info=0 > interv al_length=60 > use_aggressive_host_checking=0 > execute_service_checks=1 > accept_passive_service_checks=0 > execute_host_checks=1 > accept_passive_host_checks=1 > enable_notifications=1 > enable_event_handlers=1 > process_performance_data=0 > obsess_over_services=0 > check_for_orphaned_services=0 > check_service_freshness=1 > service_freshness_check_interval=60 > check_host_freshness=1 > host_freshness_check_interval=60 > aggregate_status_updates=1 > status_update_interval=15 > enable_flap_detection=0 > low_service_flap_threshold=5.0 > high_service_flap_threshold=20.0 > low_host_flap_threshold=5.0 > high_host_flap_threshold=20.0 > date_format=iso8601 > illegal_object_name_chars=`~!$%^&*|'"<>?,()= > illegal_macro_output_chars=`~$&|'"<> > use_regexp_matching=0 > use_true_regexp_matching=0 > admin_email=nagios > admin_pager=pagenagios > daemon_dumps_core=0 > > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through > log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > > > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From CWaters at jeld-wen.com Thu Dec 8 23:06:46 2005 From: CWaters at jeld-wen.com (CWaters at jeld-wen.com) Date: Thu, 8 Dec 2005 14:06:46 -0800 Subject: [BULK SPAM] using LDAP-based user authenticat ion Message-ID: <83C21E03494FF147B56468B50551782145F1B6@jwmail01.domain.jw.local> Yes, mine are populating correctly with the short username. >From the info at the top of a page: Last Updated: Thu Dec 8 14:02:46 PST 2005 Updated every 60 seconds Nagios(r) - www.nagios.org Logged in as cwaters And when an authenticated user acks an alert, it records the same data. What was your orignal issue? Does it allow you to authenticate but not retain the value for the user? Chris Waters WAN/LAN Technician JELD-WEN, Inc. Network Services Group =================== -----Original Message----- From: Eli Stair [mailto:estair at ilm.com] Sent: Thursday, December 08, 2005 1:02 PM To: Chris Waters Cc: nagios-users at lists.sourceforge.net Subject: Re: [BULK SPAM] [Nagios-users] using LDAP-based user authenticat ion Importance: Low I'm switching over to using auth_kerb from AD (just because I've done that, not the authz_ldap). I'm curious if you both have the username being populated properly in the "Author" portion of the CGI's when you're adding comments/ ack'ing events. I still haven't gotten any suggestions from the list on where the problem lies. Cheers, /eli CWaters at jeld-wen.com wrote: > I am using authz_ldap and don't seem to have that problem. Can you > show the config for your module? What is your LDAP store??? > AD,OpenLDAP...I am authenticating my users against an M$ Active Directory. Works pretty well. > I would include your .htaccess code as well. That way we can compare > them to known working configs. > > > Chris Waters > WAN/LAN Technician > JELD-WEN, Inc. > Network Services Group > =================== > > > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net > [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Jeff > Rattray > Sent: Thursday, December 08, 2005 12:26 PM > To: nagios-users at lists.sourceforge.net > Subject: [BULK SPAM] [Nagios-users] using LDAP-based user > authentication > Importance: Low > > Greetings, everyone. We (Purdue College of Pharmacy, Nursing and > Health > Sciences) have been using nagios for half of a year now, and it has > been really handy. Currently, we are trying to change the way nagios > authenticates users. > Purdue has implemented a new security policy that says that all > passwords, whatever they are for, must be changed every 30 days. > Leaving aside the difficulties about changing, say, BIOS passwords in > the student labs, we are trying to reduce the number of usernames and > passwords that we have to make it easier to keep up with the password-changing requirement. > > I have successfully configured mod_authz_ldap to handle authentication > on our nagios server, and I can log into the nagios pages. > Unfortunately, instead of returning a simple username like 'jrattray', > the CGIs see my username as 'uid=jrattray,ou=people,o=pnhs.purdue.edu' > . I cannot authorize myself to see any information, because when I > put that string into cgi.cfg, nagios breaks it up at the commas into 3 > different users. (The equals signs may be confusing it > also.) > > Here are my questions: > > (1) Does anyone know how to escape this username so that it works in > cgi.cfg? > I have tried entering it as a quoted string, putting slashes in front > of the commas, etc. > > (2) Failing that, has anyone had any experience using nagios with > mod_auth_ldap? I used mod_authz_ldap (a third-party product) only > because it came with Red Hat EL, but I don't want to bother compiling > the official apache module if it does exactly the same thing. > > (3) Failing THAT, how does one submit this to the wish list for nagios? > > Thanks, > Jeff Rattray > -- > Jeff Rattray, Ph.D. > Manager, Web-based Instructional Technology Purdue University College > of Pharmacy, Nursing and Health Sciences > > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log > files for problems? Stop! Download the new AJAX search engine that > makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log > files for problems? Stop! Download the new AJAX search engine that > makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From s.shipway at auckland.ac.nz Thu Dec 8 23:17:39 2005 From: s.shipway at auckland.ac.nz (Steve Shipway) Date: Fri, 9 Dec 2005 11:17:39 +1300 Subject: using LDAP-based user authentication In-Reply-To: <14A270C0E2F4124AB0B7320BDBEA949F0AB3B540@vxchange.vcr.auckland.ac.nz> References: <14A270C0E2F4124AB0B7320BDBEA949F0AB3B540@vxchange.vcr.auckland.ac.nz> Message-ID: <003f01c5fc45$3421cae0$8ea8d882@438LG01sshi052> > Purdue has implemented a new security policy that says that > all passwords, whatever they are for, must be changed every > 30 days. Ouch. Sounds like a knee-jerk response. Be prepared for less security as people find they have to write down these constantly change passwords... but anyway, back to the real issue. > (2) Failing that, has anyone had any experience using nagios > with mod_auth_ldap? I used mod_authz_ldap (a third-party > product) only because it came with Red Hat EL, but I don't > want to bother compiling the official apache module if it > does exactly the same thing. We are running Nagios with LDAP here fine. We use mod_auth_ldap (in Apache2), which is much better (I have no experience with authz_ldap, but I tried the Muquit one before and it works OK). This returns the username as whatever we define the uid parameter to be. We only need to put this: AuthLDAPUrl "ldap://ldap.auckland.ac.nz:389/ou=People,o=The University of Auckland,c=NZ?uid" into our Apache httpd.conf and it will latch onto the correct server, use ssl, and return the uid which we can then use within Nagios. To make the Muquit mod_auth_ldap work, you need more configuration, like: LDAP_Server ldap.auckland.ac.nz LDAP_Port 389 Base_DN "ou=People,o=The University of Auckland,c=NZ" since it will default to using the 'uid' parameter. For authz_ldap, I had to use this: AuthzLDAPServer ldap.auckland.ac.nz:389 AuthzLDAPUserBase "ou=People,o=The University of Auckland,c=NZ" AuthzLDAPUserKey uid AuthzLDAPUserScope base AuthzLDAPGroupBase "ou=Groups,o=The University of Auckland,c=NZ" AuthzLDAPGroupKey cn AuthzLDAPGroupScope base but it didn't work too well - and besides, we have standardised on the mod_auth_ldap now. In all these cases, Apache reported just the uid attribute as the username, so we only had to define users with that in Nagios. No need for all the LDAP ou= stuff. Hope this helps, Steve ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From tykeal at bardicgrove.org Thu Dec 8 23:49:39 2005 From: tykeal at bardicgrove.org (Andrew Grimberg) Date: Thu, 08 Dec 2005 14:49:39 -0800 Subject: using LDAP-based user authentication In-Reply-To: <003f01c5fc45$3421cae0$8ea8d882@438LG01sshi052> References: <003f01c5fc45$3421cae0$8ea8d882@438LG01sshi052> Message-ID: <1134082179.14540.5.camel@localhost.localdomain> On Fri, 2005-12-09 at 11:17 +1300, Steve Shipway wrote: > We are running Nagios with LDAP here fine. We use mod_auth_ldap (in > Apache2), which is much better (I have no experience with authz_ldap, but I > tried the Muquit one before and it works OK). This returns the username as > whatever we define the uid parameter to be. We only need to put this: > > AuthLDAPUrl "ldap://ldap.auckland.ac.nz:389/ou=People,o=The University of > Auckland,c=NZ?uid" > > into our Apache httpd.conf and it will latch onto the correct server, use > ssl, and return the uid which we can then use within Nagios. > > To make the Muquit mod_auth_ldap work, you need more configuration, like: > > LDAP_Server ldap.auckland.ac.nz > LDAP_Port 389 > Base_DN "ou=People,o=The University of Auckland,c=NZ" > > since it will default to using the 'uid' parameter. > > For authz_ldap, I had to use this: > > AuthzLDAPServer ldap.auckland.ac.nz:389 > AuthzLDAPUserBase "ou=People,o=The University of Auckland,c=NZ" > AuthzLDAPUserKey uid > AuthzLDAPUserScope base > AuthzLDAPGroupBase "ou=Groups,o=The University of Auckland,c=NZ" > AuthzLDAPGroupKey cn > AuthzLDAPGroupScope base > > but it didn't work too well - and besides, we have standardised on the > mod_auth_ldap now. > > In all these cases, Apache reported just the uid attribute as the username, > so we only had to define users with that in Nagios. No need for all the > LDAP ou= stuff. Sounds similar to what we're doing. We went a step further in our requirements. Folks that are accessing Nagios must be part of a particular group to authenticate as well as be defined in Nagios to see anything. Note, this is using mod_auth_ldap ScriptAlias /nagios/cgi-bin /usr/lib/nagios/cgi/ Options ExecCGI order deny,allow deny from all AuthType Basic AuthName "nagios" AuthLDAPUrl "ldap://ldap/ou=Users,ou=DoIT,o=Seattle?cn" AuthLDAPGroupAttributeIsDN on AuthLDAPGroupAttribute member require group cn=Nagios,ou=Users,ou=DoIT,o=Seattle satisfy any -Andy- ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From estair at ilm.com Fri Dec 9 06:14:36 2005 From: estair at ilm.com (Eli Stair) Date: Thu, 08 Dec 2005 21:14:36 -0800 Subject: eventhandlers running when a dependent service dependency is not satisfied Message-ID: <439912BC.5020000@ilm.com> I'm not entirely sure I am configuring this properly to achieve my goal, so I'll state this shortly and then give the details below. The question comes down to this: Should a failed service check for a dependent trigger a check of its parent before continuing? If this is not the case, or default, is there _ANY_ way to implement this? I want to avoid at all costs having an every-minute check of the parent processes on many thousand hosts just to keep from having the child process checks and event handlers going hay-wire. I want a dependency chain like this: SSH -- SNMP --\ - Ganglia - NTP I believe I have this set up so that a service check for SNMP is dependent on the SSH service running. In turn, the service checks for other processes that use SNMP are dependent on SNMP running. My intent is that service checks for NTP,etc will not be attempted if its parent SNMP process is not in an OK state (as I have an event handler that will restart SNMP if it is dead). If the parent SNMP _IS_ running, then the child process checks (Ganglia, NTP, etc) will be checked and if dead their own event handler will activate. The problem is that in this case, if I kill off SNMP the child process checks STILL execute and return a CRITICAL. As a result, nagios fires off the event handler for all these checks which results in an SSH out to the nodes in question and restarting a bunch of services that are probably still running. It SHOULD NOT schedule the child checks and thus not run their event handlers until AFTER a new parent check has returned executed and returned successfully, correct? I've included a dependency example below, and a snip from the nagios log showing it sequentially hammering out checks of all the child processes at the same time it already knows the parent is dead. My apologies for the lengthy post, but I believe I've covered this from every angle and posted enough info up front to make it easily parseable. Thanks for any help in this, even if it's just a statement that I'm wrong, and I have to do this a different way. Cheers, /eli ################################################### ### snip of this host/group definition include: define host{ use linux-node-production host_name HOSTNAME1 address IP } define servicedependency{ host_name HOSTNAME1 service_description SSH dependent_host_name HOSTNAME1 dependent_service_description SNMP execution_failure_criteria w,p,u,c notification_failure_criteria w,p,u,c inherits_parent 1 } define servicedependency{ host_name HOSTNAME1 service_description SNMP dependent_host_name HOSTNAME1 dependent_service_description SNMP--* execution_failure_criteria w,p,u,c notification_failure_criteria w,p,u,c inherits_parent 1 } define service{ use generic-service hostgroup_name HOSTGROUP1 service_description SNMP check_command SNMPCHECKCOMMAND event_handler restart-by-ssh!/etc/init.d/snmpd!restart normal_check_interval 30 } define service{ use generic-service hostgroup_name HOSTGROUP1 service_description SNMP-- NTP running check_command SNMPCHECKCOMMAND event_handler restart-by-ssh!/etc/init.d/xntpd!restart normal_check_interval 240 } ################################################### [1134102595] SERVICE ALERT: HOSTNAME1001;SNMP-- cron running;CRITICAL;SOFT;1;No process matching cron found : CRITICAL [1134102595] SERVICE EVENT HANDLER: HOSTNAME1001;SNMP-- cron running;CRITICAL;SOFT;1;restart-by-ssh!/etc/init.d/cron!restart [1134102655] SERVICE ALERT: HOSTNAME1001;SNMP-- cron running;CRITICAL;SOFT;2;No process matching cron found : CRITICAL [1134102655] SERVICE EVENT HANDLER: HOSTNAME1001;SNMP-- cron running;CRITICAL;SOFT;2;restart-by-ssh!/etc/init.d/cron!restart [1134102715] SERVICE ALERT: HOSTNAME1001;SNMP-- cron running;CRITICAL;SOFT;3;No process matching cron found : CRITICAL [1134102715] SERVICE EVENT HANDLER: HOSTNAME1001;SNMP-- cron running;CRITICAL;SOFT;3;restart-by-ssh!/etc/init.d/cron!restart [1134102775] SERVICE ALERT: HOSTNAME1001;SNMP-- cron running;OK;SOFT;4;(No output returned from plugin) [1134102775] SERVICE EVENT HANDLER: HOSTNAME1001;SNMP-- cron running;OK;SOFT;4;restart-by-ssh!/etc/init.d/cron!restart [1134104099] EXTERNAL COMMAND: SCHEDULE_FORCED_SVC_CHECK;HOSTNAME1001;SNMP-- Ganglia running;1134104073 [1134104476] SERVICE ALERT: HOSTNAME1001;SNMP-- Ganglia running;UNKNOWN;SOFT;1;ERROR: Process name table : No response from remote host '10.65.29.1'. [1134104476] SERVICE EVENT HANDLER: HOSTNAME1001;SNMP-- Ganglia running;UNKNOWN;SOFT;1;restart-by-ssh!/etc/init.d/gmond!restart ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From prashant at mac-india.com Fri Dec 9 20:48:25 2005 From: prashant at mac-india.com (Prashant Nadgir) Date: Sat, 10 Dec 2005 01:18:25 +0530 Subject: 2-0b5 authentication Message-ID: <00b401c5fcf9$84d3daa0$441010c1@prashant> i tried to access http:./. /nagios browser is prompting for user name and passwd although i enter correct user name and passwd,i am getting "authentication failed or bad passwd " I have created .htaccess in /usr/local/nagios/sbin directory. I have also put the same file in /usr/local/nagios/share folder htaccess file. AuthName "Nagios Access" AuthType Basic AuthUserFile /usr/local/nagios/etc/htpasswd.users require valid-user htpasswd /usr/local/nagios/etc/htpasswd.users nagiosadmin thanks in advance Prashant -------------- next part -------------- An HTML attachment was scrubbed... URL: From aleksey.tsalolikhin at gmail.com Fri Dec 9 08:54:04 2005 From: aleksey.tsalolikhin at gmail.com (Aleksey Tsalolikhin) Date: Thu, 8 Dec 2005 23:54:04 -0800 Subject: 2-0b5 authentication In-Reply-To: <00b401c5fcf9$84d3daa0$441010c1@prashant> References: <00b401c5fcf9$84d3daa0$441010c1@prashant> Message-ID: <72a19efd0512082354v5adcb64aha30bf415a1f47caf@mail.gmail.com> On 12/9/05, Prashant Nadgir wrote: > i tried to access http:./. /nagios browser is prompting for user name and > passwd > although i enter correct user name and passwd,i am getting "authentication > failed or bad passwd " > > > I have created .htaccess in /usr/local/nagios/sbin directory. > I have also put the same file in /usr/local/nagios/share folder > > htaccess file.AuthName "Nagios Access" AuthType Basic AuthUserFile > /usr/local/nagios/etc/htpasswd.users require valid-user > > htpasswd /usr/local/nagios/etc/htpasswd.users nagiosadmin > > thanks in advance > Prashant > Do you have AllowOverride enabled in your Web server config? Check out http://httpd.apache.org/docs/2.0/howto/auth.html Aleksey ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From hvdkooij at vanderkooij.org Fri Dec 9 09:03:52 2005 From: hvdkooij at vanderkooij.org (Hugo van der Kooij) Date: Fri, 9 Dec 2005 09:03:52 +0100 (CET) Subject: eventhandlers running when a dependent service dependency is not satisfied In-Reply-To: <439912BC.5020000@ilm.com> References: <439912BC.5020000@ilm.com> Message-ID: On Thu, 8 Dec 2005, Eli Stair wrote: > SSH -- SNMP --\ > - Ganglia > - NTP > define servicedependency{ > host_name HOSTNAME1 > service_description SSH > dependent_host_name HOSTNAME1 > dependent_service_description SNMP > execution_failure_criteria w,p,u,c > notification_failure_criteria w,p,u,c > inherits_parent 1 > } Here you asume that if SSH fails because it depends on another service it will also fail SNMP I have not seen SSH being dependent on anything however in your description. How a non exsiting parent will impact services is unknown to me. > define servicedependency{ > host_name HOSTNAME1 > service_description SNMP > dependent_host_name HOSTNAME1 > dependent_service_description SNMP--* > execution_failure_criteria w,p,u,c > notification_failure_criteria w,p,u,c > inherits_parent 1 > } Is this a real wildcard? Are they supported? I would expect: define servicedependency{ host_name HOSTNAME1 service_description SNMP dependent_host_name HOSTNAME1 dependent_service_description NTP execution_failure_criteria w,p,u,c notification_failure_criteria w,p,u,c inherits_parent 1 } define servicedependency{ host_name HOSTNAME1 service_description SNMP dependent_host_name HOSTNAME1 dependent_service_description Ganglia execution_failure_criteria w,p,u,c notification_failure_criteria w,p,u,c inherits_parent 1 } Hugo. -- I hate duplicates. Just reply to the relevant mailinglist. hvdkooij at vanderkooij.org http://hvdkooij.xs4all.nl/ Don't meddle in the affairs of magicians, for they are subtle and quick to anger. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rouilj at cs.umb.edu Fri Dec 9 09:52:01 2005 From: rouilj at cs.umb.edu (John P. Rouillard) Date: Fri, 09 Dec 2005 03:52:01 -0500 Subject: eventhandlers running when a dependent service dependency is not satisfied In-Reply-To: Your message of "Thu, 08 Dec 2005 21:14:36 PST." <439912BC.5020000@ilm.com> References: <439912BC.5020000@ilm.com> Message-ID: <200512090852.jB98q1jd008592@mx1.cs.umb.edu> Hi Eli: You didn't say what version of nagios you are running so I'll assume 2.0. In message <439912BC.5020000 at ilm.com>, Eli Stair writes: >The question comes down to this: > > Should a failed service check for a dependent trigger a check of its >parent before continuing? IIRC from the code it does not force a check of the parent service. I can see arguments for and against forcing a poll of the parent. Also the documentation: http://nagios.sourceforge.net/docs/2_0/dependencies.html in the "How Service Dependencies Are Tested" section, says: Nagios gets the current status of the service that is being depended upon. not nagios repolls the service being depended upon. A footnote says: by default, Nagios will use the most current hard state of the service(s) that is/are being depended upon an option in the config file will allow it to use the current soft state instead. I use the soft state of the service being depended upon myself. >If this is not the case, or default, is there _ANY_ way to implement this? Sort of. The event handler for the child can send a SCHEDULE_FORCED_SVC_CHECK external command for the parent specifying the current time in seconds. See http://www.nagios.org/developerinfo/externalcommands/commandinfo.php?command_id=129 for details. The command will be acted upon immediately since nagios reads the external command file after an event handler runs. Use this to force an update of the current service status for the parent. Parse through the objects.cache (probably in /var/log/nagios/objects.cache) file for the expanded servicedependency objects to find the service dependencies that match your host/service. I set my nagios options so that: max_check_attempts(dependent)*retry_check_interval(dependent) > normal_check_interval(parent) This way the parent service will be checked at least once during the soft error interval of the dependent service. >I want to avoid at all costs having an every-minute check of the parent >processes on many thousand hosts just to keep from having the child >process checks and event handlers going hay-wire. You need to use the max_check_attempts to provide a buffer in which the parent service will be checked. You can have your event handler submit an external command on the first soft error and try to fix the problem on a subsequent soft, or hard error. You don't have any of those directives in your sample config. >I want a dependency chain like this: > > SSH -- SNMP --\ > - Ganglia > - NTP Just a note, I wouldn't have ssh in the dependency chain unless you are accessing snmp over ssh (e.g. running check_snmp via check_by_ssh). I can't tell if that is the case or not. Just because your event handler runs over ssh doesn't add it to the dependency chain IMO. If ssh is down, it means none of the other services will be checked and you won't recognize them as down. >I believe I have this set up so that a service check for SNMP is >dependent on the SSH service running. Did you verify in the web interface or object.cache? >In turn, the service checks for >other processes that use SNMP are dependent on SNMP running. My intent >is that service checks for NTP,etc will not be attempted if its parent >SNMP process is not in an OK state (as I have an event handler that will >restart SNMP if it is dead). If the parent SNMP _IS_ running, then the >child process checks (Ganglia, NTP, etc) will be checked and if dead >their own event handler will activate. It looks like the config is ok on that score with one possible exception noted below. >The problem is that in this case, if I kill off SNMP the child process >checks STILL execute and return a CRITICAL. As a result, nagios fires >off the event handler for all these checks which results in an SSH out >to the nodes in question and restarting a bunch of services that are >probably still running. It SHOULD NOT schedule the child checks and >thus not run their event handlers until AFTER a new parent check has >returned executed and returned successfully, correct? Nope, nagios doesn't re-run the parent or parents. If you are in a soft failure mode, you can write your event handler to wait until you are in a hard failure mode. >I've included a dependency example below, and a snip from the nagios log >showing it sequentially hammering out checks of all the child processes >at the same time it already knows the parent is dead. >[...] >################################################### >### snip of this host/group definition include: >define host{ > use linux-node-production > host_name HOSTNAME1 > address IP >} > >define servicedependency{ > host_name HOSTNAME1 > service_description SSH > dependent_host_name HOSTNAME1 > dependent_service_description SNMP > execution_failure_criteria w,p,u,c > notification_failure_criteria w,p,u,c > inherits_parent 1 >} > >define servicedependency{ > host_name HOSTNAME1 > service_description SNMP > dependent_host_name HOSTNAME1 > dependent_service_description SNMP--* Not sure if SNMP--* does what you think (and I hope) it does. Have you looked at the view config web page and verified that nagios is seeing the appropriate service dependencies? -- rouilj John Rouillard =========================================================================== My employers don't acknowledge my existence much less my opinions. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mls at skayser.de Fri Dec 9 10:38:19 2005 From: mls at skayser.de (Sebastian Kayser) Date: Fri, 9 Dec 2005 10:38:19 +0100 Subject: 2-0b5 authentication In-Reply-To: <00b401c5fcf9$84d3daa0$441010c1@prashant> References: <00b401c5fcf9$84d3daa0$441010c1@prashant> Message-ID: <20051209093819.GA2497@skayser.homeip.net> * Prashant Nadgir wrote: > i tried to access http:./. /nagios browser is prompting for user name > and passwd > > although i enter correct user name and passwd,i am getting > "authentication failed or bad passwd " > > I have created .htaccess in /usr/local/nagios/sbin directory. > I have also put the same file in /usr/local/nagios/share folder > > htaccess file. > > AuthName "Nagios Access" > AuthType Basic > AuthUserFile /usr/local/nagios/etc/htpasswd.users > require valid-user > > htpasswd /usr/local/nagios/etc/htpasswd.users nagiosadmin Have a look at your apache error log. It should tell you, why apache doesn't authenticate you. - sebastian ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From guillaume.renard at gmail.com Fri Dec 9 10:52:02 2005 From: guillaume.renard at gmail.com (Guillaume RENARD) Date: Fri, 9 Dec 2005 10:52:02 +0100 Subject: CPu consumption too high Message-ID: <762a4f900512090152mac25671w5fc78d0717d834b1@mail.gmail.com> Hello All, I use Nagios 2.0b5 and I have only 210 services. The problem is that my server is running out of ressource 10:42:05 up 423 days, 19:36, 3 users, load average: 8.18, 7.29, 6.54 But nagios don't use so much ressource : 10:42:35 up 423 days, 19:37, 3 users, load average: 8.63, 7.47, 6.62 94 processes: 85 sleeping, 9 running, 0 zombie, 0 stopped CPU0 states: 0.0% user 99.4% system 0.0% nice 0.0% iowait 0.1%idle CPU1 states: 2.1% user 97.4% system 0.0% nice 0.0% iowait 0.0%idle Mem: 1547392k av, 1536372k used, 11020k free, 0k shrd, 496k buff 440520k actv, 424k in_d, 13256k in_c Swap: 2040244k av, 1496k used, 2038748k free 570160k cached PID USER PRI NI SIZE RSS SHARE STAT %CPU %MEM TIME CPU COMMAND 22562 nagios 25 0 1312 1312 784 R 23.9 0.0 0:04 1 nagios 22570 nagios 25 0 464 464 400 D 23.5 0.0 0:03 0 check_snmp 22569 nagios 25 0 444 444 388 R 22.4 0.0 0:03 1 urlize 22568 nagios 25 0 468 468 400 R 18.9 0.0 0:05 0 check_snmp 22577 nagios 25 0 528 528 468 R 15.9 0.0 0:00 0 check_ping 22565 nagios 25 0 484 484 416 S 4.2 0.0 0:04 1 urlize 21191 nagios 15 0 1268 1268 740 S 0.0 0.0 0:15 0 nagios 22390 nagios 24 0 1312 1312 784 S 0.0 0.0 0:00 0 nagios 22391 nagios 21 0 1288 1288 976 S 0.0 0.0 0:06 1 apan.sh 22558 nagios 25 0 2344 2344 1400 S 0.0 0.1 0:02 0 check_snmp2 22560 nagios 24 0 1312 1312 784 S 0.0 0.0 0:00 0 nagios 22564 nagios 24 0 1312 1312 784 S 0.0 0.0 0:00 0 nagios 22567 nagios 25 0 1312 1312 784 S 0.0 0.0 0:00 1 nagios I am using apan with nagios but it shouldn't take so many ressource. Does someone have an idea please ? -- Guillaume RENARD http://glmrenard.free.fr -------------- next part -------------- An HTML attachment was scrubbed... URL: From nagios at mailtilmeg.com Fri Dec 9 11:49:52 2005 From: nagios at mailtilmeg.com (Nagios) Date: Fri, 09 Dec 2005 11:49:52 +0100 Subject: Problem defining contact groups - Allerts sent to all contacts instead of group Message-ID: <43996150.9090401@mailtilmeg.com> -------------- next part -------------- An embedded message was scrubbed... From: Nagios Subject: [Nagios-users] Problem defining contact groups - Allerts sent to all contacts instead of group Date: Thu, 08 Dec 2005 10:16:38 +0100 Size: 9505 URL: From todd_barbera at wgbh.org Fri Dec 9 13:40:05 2005 From: todd_barbera at wgbh.org (Todd Barbera) Date: Fri, 09 Dec 2005 07:40:05 -0500 Subject: CPu consumption too high References: <762a4f900512090152mac25671w5fc78d0717d834b1@mail.gmail.com> Message-ID: <004701c5fcbd$af40b880$0d7810ac@wgbh.org> Given that your CPU's are showing 97 and 99 percent usage by system resources, it sounds like an OS process is chewing up your CPU. Have you run "top" or another similar utility? sar, iostat, and vmstat are useful tools for helping track down performance issues. I know these are available on Solaris, but I can't speak for other *nix servers. Also, since your server has been running for over a year, there's a good chance you're well behind on your OS patch levels. It could be you've hit some sort of bug within your OS. Todd ----- Original Message ----- From: Guillaume RENARD To: nagios-users at lists.sourceforge.net Sent: Friday, December 09, 2005 4:52 AM Subject: [Nagios-users] CPu consumption too high Hello All, I use Nagios 2.0b5 and I have only 210 services. The problem is that my server is running out of ressource 10:42:05 up 423 days, 19:36, 3 users, load average: 8.18, 7.29, 6.54 But nagios don't use so much ressource : 10:42:35 up 423 days, 19:37, 3 users, load average: 8.63, 7.47, 6.62 94 processes: 85 sleeping, 9 running, 0 zombie, 0 stopped CPU0 states: 0.0% user 99.4% system 0.0% nice 0.0% iowait 0.1% idle CPU1 states: 2.1% user 97.4% system 0.0% nice 0.0% iowait 0.0% idle Mem: 1547392k av, 1536372k used, 11020k free, 0k shrd, 496k buff 440520k actv, 424k in_d, 13256k in_c Swap: 2040244k av, 1496k used, 2038748k free 570160k cached PID USER PRI NI SIZE RSS SHARE STAT %CPU %MEM TIME CPU COMMAND 22562 nagios 25 0 1312 1312 784 R 23.9 0.0 0:04 1 nagios 22570 nagios 25 0 464 464 400 D 23.5 0.0 0:03 0 check_snmp 22569 nagios 25 0 444 444 388 R 22.4 0.0 0:03 1 urlize 22568 nagios 25 0 468 468 400 R 18.9 0.0 0:05 0 check_snmp 22577 nagios 25 0 528 528 468 R 15.9 0.0 0:00 0 check_ping 22565 nagios 25 0 484 484 416 S 4.2 0.0 0:04 1 urlize 21191 nagios 15 0 1268 1268 740 S 0.0 0.0 0:15 0 nagios 22390 nagios 24 0 1312 1312 784 S 0.0 0.0 0:00 0 nagios 22391 nagios 21 0 1288 1288 976 S 0.0 0.0 0:06 1 apan.sh 22558 nagios 25 0 2344 2344 1400 S 0.0 0.1 0:02 0 check_snmp2 22560 nagios 24 0 1312 1312 784 S 0.0 0.0 0:00 0 nagios 22564 nagios 24 0 1312 1312 784 S 0.0 0.0 0:00 0 nagios 22567 nagios 25 0 1312 1312 784 S 0.0 0.0 0:00 1 nagios I am using apan with nagios but it shouldn't take so many ressource. Does someone have an idea please ? -- Guillaume RENARD http://glmrenard.free.fr -------------- next part -------------- An HTML attachment was scrubbed... URL: From tim.wolak at gmail.com Fri Dec 9 14:23:08 2005 From: tim.wolak at gmail.com (Tim Wolak) Date: Fri, 09 Dec 2005 07:23:08 -0600 Subject: NRPE timeouts Message-ID: <4399853C.7080101@gmail.com> Has anyone had check_nrpe for NT time out when checking it from the nagios server? I have a 2K3 server that when I start the service and run a check from the nagios server I get a successfull response, if I run the check right after that it times out..... Any ideas? Tim ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From guillaume.renard at gmail.com Fri Dec 9 14:45:06 2005 From: guillaume.renard at gmail.com (Guillaume RENARD) Date: Fri, 9 Dec 2005 14:45:06 +0100 Subject: CPu consumption too high In-Reply-To: <762a4f900512090540g6d4d0adcn48f9da2b6d92b3c8@mail.gmail.com> References: <762a4f900512090152mac25671w5fc78d0717d834b1@mail.gmail.com> <004701c5fcbd$af40b880$0d7810ac@wgbh.org> <762a4f900512090540g6d4d0adcn48f9da2b6d92b3c8@mail.gmail.com> Message-ID: <762a4f900512090545h71bd4424nc5772211b9852326@mail.gmail.com> Hello, Yes, a samba process have taken all the CPU ! Thanks for your answer ! On 12/9/05, Guillaume RENARD wrote: > > Hello, > > Thanks a lot for your > > On 12/9/05, Todd Barbera wrote: > > > > Given that your CPU's are showing 97 and 99 percent usage by system > > resources, it sounds like an OS process is chewing up your CPU. Have you run > > "top" or another similar utility? sar, iostat, and vmstat are useful tools > > for helping track down performance issues. I know these are available on > > Solaris, but I can't speak for other *nix servers. Also, since your server > > has been running for over a year, there's a good chance you're well behind > > on your OS patch levels. It could be you've hit some sort of bug within your > > OS. > > > > Todd > > > > ----- Original Message ----- > > *From:* Guillaume RENARD > > *To:* nagios-users at lists.sourceforge.net > > *Sent:* Friday, December 09, 2005 4:52 AM > > *Subject:* [Nagios-users] CPu consumption too high > > > > Hello All, > > > > I use Nagios 2.0b5 and I have only 210 services. > > The problem is that my server is running out of ressource > > 10:42:05 up 423 days, 19:36, 3 users, load average: 8.18, 7.29, 6.54 > > > > But nagios don't use so much ressource : > > 10:42:35 up 423 days, 19:37, 3 users, load average: 8.63, 7.47, 6.62 > > 94 processes: 85 sleeping, 9 running, 0 zombie, 0 stopped > > CPU0 states: 0.0% user 99.4% system 0.0% nice 0.0% iowait 0.1%idle > > CPU1 states: 2.1% user 97.4% system 0.0% nice 0.0% iowait 0.0%idle > > Mem: 1547392k av, 1536372k used, 11020k free, 0k shrd, 496k > > buff > > 440520k actv, 424k in_d, 13256k in_c > > Swap: 2040244k av, 1496k used, 2038748k free 570160k > > cached > > > > PID USER PRI NI SIZE RSS SHARE STAT %CPU %MEM TIME CPU > > COMMAND > > 22562 nagios 25 0 1312 1312 784 R 23.9 0.0 0:04 1 > > nagios > > 22570 nagios 25 0 464 464 400 D 23.5 0.0 0:03 0 > > check_snmp > > 22569 nagios 25 0 444 444 388 R 22.4 0.0 0:03 1 urlize > > 22568 nagios 25 0 468 468 400 R 18.9 0.0 0:05 0 > > check_snmp > > 22577 nagios 25 0 528 528 468 R 15.9 0.0 0:00 0 > > check_ping > > 22565 nagios 25 0 484 484 416 S 4.2 0.0 0:04 1 urlize > > 21191 nagios 15 0 1268 1268 740 S 0.0 0.0 0:15 0 nagios > > 22390 nagios 24 0 1312 1312 784 S 0.0 0.0 0:00 0 nagios > > 22391 nagios 21 0 1288 1288 976 S 0.0 0.0 0:06 1 > > apan.sh > > 22558 nagios 25 0 2344 2344 1400 S 0.0 0.1 0:02 0 > > check_snmp2 > > 22560 nagios 24 0 1312 1312 784 S 0.0 0.0 0:00 0 nagios > > 22564 nagios 24 0 1312 1312 784 S 0.0 0.0 0:00 0 nagios > > 22567 nagios 25 0 1312 1312 784 S 0.0 0.0 0:00 1 nagios > > > > I am using apan with nagios but it shouldn't take so many ressource. > > > > > > Does someone have an idea please ? > > > > > > > > > > > > -- > > Guillaume RENARD > > http://glmrenard.free.fr > > > > > > > -- > Guillaume RENARD > http://glmrenard.free.fr > -- Guillaume RENARD http://glmrenard.free.fr -------------- next part -------------- An HTML attachment was scrubbed... URL: From jrattray at pnhs.purdue.edu Fri Dec 9 15:39:19 2005 From: jrattray at pnhs.purdue.edu (Jeff Rattray) Date: Fri, 09 Dec 2005 09:39:19 -0500 Subject: using LDAP-based user authentication Message-ID: <439950C7.17742.87826C0@localhost> On Thu, 8 Dec 2005 at 12:54:59 -0800 CWaters at jeld-wen.com said: > > I am using authz_ldap and don't seem to have that problem. Can you show the > config for your module? What is your LDAP store??? AD,OpenLDAP...I am > authenticating my users against an M$ Active Directory. Works pretty well. > I would include your .htaccess code as well. That way we can compare them > to known working configs. > Sure! Right now I have it all in .htaccess for testing; I realize that this is a bad idea long-term and I can't do caching that way, but unless we get this username issue sorted out I'm going to have to drop it all anyways. We are using a Netscape/iPlanet Directory server version 4.16, which we run within our College. There is also an Openldap server at the University level, but it only accepts SSL v3 ldap connections, and I haven't figure out how to make mod_authz_ldap do that. Contents of my .htaccess file: # .htaccess for LDAP authentication against directory.purdue.edu # 08DE2005 JJR # NOTE - mod_authz_ldap IS NOT THE SAME AS mod_auth_ldap; different syntax etc! AuthzLDAPEngine on AuthzLDAPAuthoritative on AuthzLDAPServer directory.pnhs.purdue.edu AuthzLDAPUserBase "ou=people,o=pnhs.purdue.edu" AuthzLDAPUserKey uid AuthzLDAPUserScope base AuthType basic AuthName "PNHS LDAP Account Access" require valid-user Right now I have put our nagios pages back to the original files-based authentication, and am testing ldap authentication in a different directory with a php script that prints out all of the environment variables. $_SERVER["REMOTE_USER"] is the relevant one, and it is still set to uid=jrattray,ou=people,o=pnhs.purdue.edu . I am open for suggestions. Thanks, Jeff -- Jeff Rattray, Ph.D. Manager, Web-based Instructional Technology Purdue University College of Pharmacy, Nursing and Health Sciences ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From chet.luther at gmail.com Fri Dec 9 16:22:23 2005 From: chet.luther at gmail.com (Chet Luther) Date: Fri, 9 Dec 2005 10:22:23 -0500 Subject: contactgroup{ }, and contactgroups # bug? In-Reply-To: <7e2f9af90512051254j2eeeb94cxf0c7369a1b596334@mail.gmail.com> References: <7e2f9af90512051254j2eeeb94cxf0c7369a1b596334@mail.gmail.com> Message-ID: <366ea3750512090722q19b38744tf81bdb7d8e1faef9@mail.gmail.com> On 12/5/05, Cam wrote: > Hi, > > So I'm writing some scripts to help me maintain my nagios > configuration (actually to help other people be able to add new hosts, > change some configuration parameters, etc). Anyway, I'm having a > problem right now. In the case of hosts/hostgroups, this works fine: > > define hostgroup{ > houstgroup_name kiosks > alias kiosks > } > > define host{ > name host_template > hostgroups kiosks > ... > } > > define host{ > host_name foobar > use host_template > ... > } #foobar will be put in the hostgroup kiosks > > however, doing an equivalent configuration w/ contacts/contactgroups > doesn't work (i'm using nagios 2.0b4): > > define contactgroup{ > contactgroup_name kiosk-folks > alias Kiosk People > } > > define contact{ > name kiosk_contact > contactgroups kiosk-folks > ... > register 0 > } > > define contact{ > use kiosk_contact > contact_name jdoe > alias John Doe > ... > } # DOESN'T ADD TO KIOSK-FOLKS > > When try to verify my nagios configuration this is the error i get: > > Error: Contactgroup has no members (config file > '/usr/local/opt/nagios/etc/contactgroups.cfg', starting on line 25) > > I don't understand why it works for the one, but not for the other. > Is this a bug? This is somewhere between a missing feature and a bug, but it is quite inconsistent. Here's a quick patch to fix it. It is applied against the current CVS tree. diff -urN nagios-orig/xdata/xodtemplate.c nagios/xdata/xodtemplate.c --- nagios-orig/xdata/xodtemplate.c 2005-11-16 03:49:35.000000000 -0500 +++ nagios/xdata/xodtemplate.c 2005-12-09 10:04:17.000000000 -0500 @@ -5874,6 +5874,8 @@ this_contact->contact_name=strdup(template_contact->contact_name); if(this_contact->alias==NULL && template_contact->alias!=NULL) this_contact->alias=strdup(template_contact->alias); + if(this_contact->contactgroups==NULL && template_contact->contactgroups!=NULL) + this_contact->contactgroups=strdup(template_contact->contactgroups); if(this_contact->email==NULL && template_contact->email!=NULL) this_contact->email=strdup(template_contact->email); if(this_contact->pager==NULL && template_contact->pager!=NULL) -- Chet Luther chet.luther at gmail.com ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From chet.luther at gmail.com Fri Dec 9 16:27:40 2005 From: chet.luther at gmail.com (Chet Luther) Date: Fri, 9 Dec 2005 10:27:40 -0500 Subject: contactgroup{ }, and contactgroups # bug? In-Reply-To: <366ea3750512090722q19b38744tf81bdb7d8e1faef9@mail.gmail.com> References: <7e2f9af90512051254j2eeeb94cxf0c7369a1b596334@mail.gmail.com> <366ea3750512090722q19b38744tf81bdb7d8e1faef9@mail.gmail.com> Message-ID: <366ea3750512090727v3b09ea8dsd5181bec111e3383@mail.gmail.com> On 12/9/05, Chet Luther wrote: > This is somewhere between a missing feature and a bug, but it is quite > inconsistent. Here's a quick patch to fix it. It is applied against > the current CVS tree. > > Sorry about that, gmail butchered the formatting on that patch. I'm attaching it instead. -- Chet Luther chet.luther at gmail.com -------------- next part -------------- An embedded and charset-unspecified text was scrubbed... Name: nagios_contact_template_fix.patch.txt URL: From mscandian at freebsdbrasil.com.br Fri Dec 9 17:13:07 2005 From: mscandian at freebsdbrasil.com.br (=?ISO-8859-1?Q?M=E1rio_S=E9rgio_Candian?=) Date: Fri, 09 Dec 2005 14:13:07 -0200 Subject: check_udp problem Message-ID: <4399AD13.6030302@freebsdbrasil.com.br> Hi guys, I'm trying to use the check_udp plugin for the radius service, and when I run the check_udp command I received this: (root at firewall)~# ./check_udp -H 201.x.y.z -p 1645 No data was recieved from host! No response from host on port 1645 Exit 2 (root at firewall)~# ./check_udp -H 201.x.y.z -p 1646 No data was recieved from host! No response from host on port 1646 Exit 2 And if I try the port 53 I received the same message. Someone know if the check_udp plugin is broken? What I can do to fix it? Thanks for all. -- Mario Sergio Candian - "Dreams as if you'll live forever. Live as if you'll die today" -- James Dean ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ed at ebonmists.com Fri Dec 9 19:11:10 2005 From: ed at ebonmists.com (Ed Benckert) Date: Fri, 09 Dec 2005 13:11:10 -0500 Subject: Notifications not sending out Message-ID: <4399C8BE.2010706@ebonmists.com> I'm using a pretty stock Nagios install. I can post parts of my configs if necessary. When I use the default 'host' format and the machine goes down, I get a notification. I cannot, under any circumstance, get a notification from a service to send out. I see nothing in the logs saying it even knew it should send a notification. I have this: define service{ use generic-service ; host_name gateway service_description DNS is_volatile 0 check_period 24x7 max_check_attempts 3 normal_check_interval 5 retry_check_interval 1 contact_groups admins notification_interval 120 notification_period 24x7 notification_options w,u,c,r check_command check_dns } notifications_enabled is 1 in the generic-service The period is 24x7, it should give a warning email, it's sending to admins which is the same group when a machine cannot be pinged... but it doesnt even try... anyone have any idea? ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sghosh at sghosh.org Fri Dec 9 19:41:24 2005 From: sghosh at sghosh.org (Subhendu Ghosh) Date: Fri, 9 Dec 2005 13:41:24 -0500 (EST) Subject: check_udp problem In-Reply-To: <4399AD13.6030302@freebsdbrasil.com.br> References: <4399AD13.6030302@freebsdbrasil.com.br> Message-ID: Most udp protocols expect data in packets and discard packets without payloads. For Radius, try check_radius for port 53: try check_dig or check dns On Fri, 9 Dec 2005, [ISO-8859-1] M?rio S?rgio Candian wrote: > Hi guys, > > I'm trying to use the check_udp plugin for the radius service, and when I run > the check_udp command I received this: > > (root at firewall)~# ./check_udp -H 201.x.y.z -p 1645 > No data was recieved from host! > No response from host on port 1645 > Exit 2 > > (root at firewall)~# ./check_udp -H 201.x.y.z -p 1646 > No data was recieved from host! > No response from host on port 1646 > Exit 2 > > And if I try the port 53 I received the same message. > > Someone know if the check_udp plugin is broken? What I can do to fix it? > > Thanks for all. > > -- From estair at ilm.com Fri Dec 9 20:09:11 2005 From: estair at ilm.com (Eli Stair) Date: Fri, 09 Dec 2005 11:09:11 -0800 Subject: eventhandlers running when a dependent service dependency is not satisfied In-Reply-To: References: <439912BC.5020000@ilm.com> Message-ID: <4399D657.6060107@ilm.com> Thanks for your response Hugo. In this case I have service checks named: SSH # uses check_ssh plugin SNMP # gets a low-latency MIB to ensure SNMPd is answering SNMP-- $DAEMON ## where $DAEMON is one of many processes that need to be running on the host (each with its own service definition). In this case SSH does not have any dependencies, it is to be the 'root' check that all the other services should in the end rely on. The wildcard matches all the process checks over SNMP, and are (intended) to be dependent on the SNMP daemon running properly. The event handlers for these particular services just SSH to the node and restart the service. However to keep this clean I just wanted to make sure that those aren't occuring _unless_ it was confirmed that SNMP and SSH are already running (otherwise the node could be down hard, or it could be responding to a simple death of one of its dependent processes)... The root issue was an assumption on my part that the service_dependencies did something they did not (i.e., re-checking master states when a child failed a check). Now that I realize I need to add a SCHEDULE_FORCED_SVC_CHECK to the first run of the event handlers to force a re-check of the state of its master service, I believe I can pull this off properly. Cheers, /eli Hugo van der Kooij wrote: > On Thu, 8 Dec 2005, Eli Stair wrote: > > >> SSH -- SNMP --\ >> - Ganglia >> - NTP > > >>define servicedependency{ >> host_name HOSTNAME1 >> service_description SSH >> dependent_host_name HOSTNAME1 >> dependent_service_description SNMP >> execution_failure_criteria w,p,u,c >> notification_failure_criteria w,p,u,c >> inherits_parent 1 >>} > > > Here you asume that if SSH fails because it depends on another service it > will also fail SNMP > > I have not seen SSH being dependent on anything however in your > description. How a non exsiting parent will impact services is unknown to > me. > > >>define servicedependency{ >> host_name HOSTNAME1 >> service_description SNMP >> dependent_host_name HOSTNAME1 >> dependent_service_description SNMP--* >> execution_failure_criteria w,p,u,c >> notification_failure_criteria w,p,u,c >> inherits_parent 1 >>} > > > Is this a real wildcard? Are they supported? > > I would expect: > > define servicedependency{ > host_name HOSTNAME1 > service_description SNMP > dependent_host_name HOSTNAME1 > dependent_service_description NTP > execution_failure_criteria w,p,u,c > notification_failure_criteria w,p,u,c > inherits_parent 1 > } > > define servicedependency{ > host_name HOSTNAME1 > service_description SNMP > dependent_host_name HOSTNAME1 > dependent_service_description Ganglia > execution_failure_criteria w,p,u,c > notification_failure_criteria w,p,u,c > inherits_parent 1 > } > > > Hugo. > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From dmourati at cm.math.uiuc.edu Fri Dec 9 20:21:05 2005 From: dmourati at cm.math.uiuc.edu (Demetri Mouratis) Date: Fri, 9 Dec 2005 13:21:05 -0600 (CST) Subject: Stalking option + notification Message-ID: I'm picking up on a thread from a while back regarding stalking options and notification. I have a bit of an odd case from a custom plugin where I've "rolled up" several different bad cases into one global state. The request I'm getting now is to send an email notification when the state first goes bad, and then to send an additional email if the content of the status message changes, i.e. bad for a different reason. In Steve's example below, I want emails at (2), (4), and (5). Should I fix this at the plugin level, tweak some configuration is some crafty way, go with volitile services? Any suggestions welcome and I can provide more detail if necessary. Thanks. From: Steve Shipway RE: Stalking option + notification 2005-11-10 14:52 >Unless the documentation is very wrong, which I am inclined to >doubt is the case, the state stalking isn't contributing to >your notifications, only to the logging of those states. Do >you have is_volatile set for those services? That's another >way to get multiple repeat notifications for non-OK services. Surely, is_volatile will cause a notification *every*time* a critical check is received? I only want a notification if it is in critical *and* the text is different from the last check, which is what state stalking does. Eg, the plugin returns the LAST problem from the final 30 mins of the syslog file. Successive checks return: 1) OK - no problems in last 30min (message 1 appears at the end of syslog) 2) CRITICAL - message 1 (5 mins pass) 3) CRITICAL - message 1 (message 2 appears at the end of syslog) 4) CRITICAL - message 2 (30 mins pass) 5) OK - no problems in last 30min then I want to receive an alert for (2) and (4). Normal checks would alert for (2) only. Volatile checks would alert for (2), (3) and (4). Stalk stalking for critical would alert for (2) and (4). Steve ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From hvdkooij at vanderkooij.org Fri Dec 9 20:23:28 2005 From: hvdkooij at vanderkooij.org (Hugo van der Kooij) Date: Fri, 9 Dec 2005 20:23:28 +0100 (CET) Subject: eventhandlers running when a dependent service dependency is not satisfied In-Reply-To: <4399D657.6060107@ilm.com> References: <439912BC.5020000@ilm.com> <4399D657.6060107@ilm.com> Message-ID: On Fri, 9 Dec 2005, Eli Stair wrote: > SSH # uses check_ssh plugin > SNMP # gets a low-latency MIB to ensure SNMPd is answering > SNMP-- $DAEMON ## where $DAEMON is one of many processes that need to > be running on the host (each with its own service definition). > > In this case SSH does not have any dependencies, it is to be the 'root' > check that all the other services should in the end rely on. So the SNMP dependency on SSH should not consult parent dependencies. I suggest you change that. It should not hurt but I rather be safe then sorry. Hugo. -- I hate duplicates. Just reply to the relevant mailinglist. hvdkooij at vanderkooij.org http://hvdkooij.xs4all.nl/ Don't meddle in the affairs of magicians, for they are subtle and quick to anger. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From michael at halligan.org Fri Dec 9 20:43:39 2005 From: michael at halligan.org (Michael T. Halligan) Date: Fri, 9 Dec 2005 11:43:39 -0800 Subject: Debugging nagios checks? Message-ID: <2CA695A2-BDD1-4935-9B61-2283088B2458@halligan.org> -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 I'm trying to use check_snmp_storage.pl from http://www.manubulon.com/ nagios/snmp_storage.html and am having a bit of a hard time. Nagios simply fails when trying to check, with the extremely helpful error of "(No output!)" .. Nothing useful gets generated in any of the logs (and I have every log option turned on).. When I strace nagios, there's still not much useful. How can I make nagios show me what it's running when it runs a check, and show me the output, so I can debug this issue? I think I'm running into a problem with snmp on my client boxes, but I have no idea. -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.1 (Darwin) iD8DBQFDmd5ywjCqooJyNAMRAsoSAJ9LJhdN5qFYyiUo0yLgYBTP5k+CagCeJjbd l/4YsXHAUJdkgUWEAv3kLf4= =KY95 -----END PGP SIGNATURE----- ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From estair at ilm.com Fri Dec 9 20:48:50 2005 From: estair at ilm.com (Eli Stair) Date: Fri, 09 Dec 2005 11:48:50 -0800 Subject: Debugging nagios checks? In-Reply-To: <2CA695A2-BDD1-4935-9B61-2283088B2458@halligan.org> References: <2CA695A2-BDD1-4935-9B61-2283088B2458@halligan.org> Message-ID: <4399DFA2.5030703@ilm.com> Have you already tested the plugin from the commandline, passing it the args that are sent from Nagios to make sure your version/community string and mountpoint info are properly read and returned? Have you checked to make sure that SNMP is monitoring disk usage? /eli Michael T. Halligan wrote: > -----BEGIN PGP SIGNED MESSAGE----- > Hash: SHA1 > > I'm trying to use check_snmp_storage.pl from http://www.manubulon.com/ > nagios/snmp_storage.html and am having a bit of a hard time. > Nagios simply fails when trying to check, with the extremely helpful > error of "(No output!)" .. Nothing useful gets generated in any of > the logs (and I have every log option turned on).. When I strace > nagios, there's still not much useful. > > How can I make nagios show me what it's running when it runs a check, > and show me the output, so I can debug this issue? I think > I'm running into a problem with snmp on my client boxes, but I have no > idea. > > -----BEGIN PGP SIGNATURE----- > Version: GnuPG v1.4.1 (Darwin) > > iD8DBQFDmd5ywjCqooJyNAMRAsoSAJ9LJhdN5qFYyiUo0yLgYBTP5k+CagCeJjbd > l/4YsXHAUJdkgUWEAv3kLf4= > =KY95 > -----END PGP SIGNATURE----- > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log > files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. ::: Messages without supporting info will risk > being sent to /dev/null > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From hvdkooij at vanderkooij.org Fri Dec 9 20:50:46 2005 From: hvdkooij at vanderkooij.org (Hugo van der Kooij) Date: Fri, 9 Dec 2005 20:50:46 +0100 (CET) Subject: Notifications not sending out In-Reply-To: <4399C8BE.2010706@ebonmists.com> References: <4399C8BE.2010706@ebonmists.com> Message-ID: On Fri, 9 Dec 2005, Ed Benckert wrote: > I cannot, under any circumstance, get a notification from a service to send > out. I see nothing in the logs saying it even knew it should send a > notification. Can you see what is logged exactly for a such an event? Like I have: [1134152645] HOST ALERT: wichita.qi.nl;DOWN;SOFT;1;CRITICAL - Socket timeout after 10 seconds [1134152645] HOST ALERT: wichita.qi.nl;UP;SOFT;2;SMTP OK - 0.308 sec. response time [1134153430] Auto-save of retention data completed successfully. [1134155870] SERVICE ALERT: aragorn;SSH;OK;HARD;4;SSH OK - OpenSSH_4.2 (protocol 2.0) [1134155870] SERVICE NOTIFICATION: hvdkooij;aragorn;SSH;OK;notify-by-email;SSH OK - OpenSSH_4.2 (protocol 2.0) Hugo -- I hate duplicates. Just reply to the relevant mailinglist. hvdkooij at vanderkooij.org http://hvdkooij.xs4all.nl/ Don't meddle in the affairs of magicians, for they are subtle and quick to anger. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From michael at halligan.org Fri Dec 9 20:55:49 2005 From: michael at halligan.org (Michael T. Halligan) Date: Fri, 9 Dec 2005 11:55:49 -0800 Subject: Debugging nagios checks? In-Reply-To: <4399DFA2.5030703@ilm.com> References: <2CA695A2-BDD1-4935-9B61-2283088B2458@halligan.org> <4399DFA2.5030703@ilm.com> Message-ID: <3B7FC993-301F-44BD-BF7A-2044FBEBC8A2@halligan.org> -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Eli, Yes, yes, and yes. From here, I'm somewhat out of good paths for debugging this. What I can't understand, is I don't even see script processes actually running if I keep a constant ps going.. does Nagios-2 no longer run scripts in a visible manner? Michaeel On Dec 9, 2005, at 11:48 AM, Eli Stair wrote: > > Have you already tested the plugin from the commandline, passing it > the args that are sent from Nagios to make sure your version/ > community string and mountpoint info are properly read and returned? > > Have you checked to make sure that SNMP is monitoring disk usage? > > /eli > > Michael T. Halligan wrote: >> -----BEGIN PGP SIGNED MESSAGE----- >> Hash: SHA1 >> I'm trying to use check_snmp_storage.pl from http:// >> www.manubulon.com/ nagios/snmp_storage.html and am having a bit of >> a hard time. >> Nagios simply fails when trying to check, with the extremely >> helpful error of "(No output!)" .. Nothing useful gets generated >> in any of >> the logs (and I have every log option turned on).. When I strace >> nagios, there's still not much useful. >> How can I make nagios show me what it's running when it runs a >> check, and show me the output, so I can debug this issue? I think >> I'm running into a problem with snmp on my client boxes, but I >> have no idea. >> -----BEGIN PGP SIGNATURE----- >> Version: GnuPG v1.4.1 (Darwin) >> iD8DBQFDmd5ywjCqooJyNAMRAsoSAJ9LJhdN5qFYyiUo0yLgYBTP5k+CagCeJjbd >> l/4YsXHAUJdkgUWEAv3kLf4= >> =KY95 >> -----END PGP SIGNATURE----- >> ------------------------------------------------------- >> This SF.net email is sponsored by: Splunk Inc. Do you grep through >> log files >> for problems? Stop! Download the new AJAX search engine that makes >> searching your log files as easy as surfing the web. DOWNLOAD >> SPLUNK! >> http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when >> reporting any issue. ::: Messages without supporting info will >> risk being sent to /dev/null > > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through > log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD > SPLUNK! > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. ::: Messages without supporting info will risk > being sent to /dev/null -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.1 (Darwin) iD8DBQFDmeFJwjCqooJyNAMRAuBrAKC4mwCyldVwnhGiGhjPMvs/RMhsbwCcCgpj uYeR7aShALcHnLoCBtHPUTc= =SvA5 -----END PGP SIGNATURE----- ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From hvdkooij at vanderkooij.org Fri Dec 9 21:01:08 2005 From: hvdkooij at vanderkooij.org (Hugo van der Kooij) Date: Fri, 9 Dec 2005 21:01:08 +0100 (CET) Subject: Debugging nagios checks? In-Reply-To: <4399DFA2.5030703@ilm.com> References: <2CA695A2-BDD1-4935-9B61-2283088B2458@halligan.org> <4399DFA2.5030703@ilm.com> Message-ID: On Fri, 9 Dec 2005, Eli Stair wrote: > Have you already tested the plugin from the commandline, passing it the > args that are sent from Nagios to make sure your version/community > string and mountpoint info are properly read and returned? > > Have you checked to make sure that SNMP is monitoring disk usage? My plan of attack on an existing situation would be: 1. Use snmpwalk to obtain all SNMP data from the host and verify it is presenting disk info. 2. Trace from Service definition to Check command to the actual plugin how the test is done. 3. Retry the plugin manually with the information obtained from the previous two steps. When building one new I would: 1. Use snmpwalk to get all the details. 2. Use the plugin manually to get the proper results. 3. Write a check command 4. Write a service definition Actually this morning I added the check_snmp_int.pl script this way to define a host named internet based on the actual state of the DSL link of my router. Just be carefull of typos. I spend almost an hour before I noticed $ARG2$ is not $ARGS2$ and noticed it just about when I did write a full description for the mailinglist of my problem. Hugo. -- I hate duplicates. Just reply to the relevant mailinglist. hvdkooij at vanderkooij.org http://hvdkooij.xs4all.nl/ Don't meddle in the affairs of magicians, for they are subtle and quick to anger. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From hvdkooij at vanderkooij.org Fri Dec 9 21:04:46 2005 From: hvdkooij at vanderkooij.org (Hugo van der Kooij) Date: Fri, 9 Dec 2005 21:04:46 +0100 (CET) Subject: Debugging nagios checks? In-Reply-To: <3B7FC993-301F-44BD-BF7A-2044FBEBC8A2@halligan.org> References: <2CA695A2-BDD1-4935-9B61-2283088B2458@halligan.org> <4399DFA2.5030703@ilm.com> <3B7FC993-301F-44BD-BF7A-2044FBEBC8A2@halligan.org> Message-ID: On Fri, 9 Dec 2005, Michael T. Halligan wrote: > > Yes, yes, and yes. From here, I'm somewhat out of good paths for > debugging this. What I can't understand, is I don't even see script > processes actually running if I keep a constant ps going.. does > Nagios-2 no longer run scripts in a visible manner? It may just be too fast for you to see. I started top. Choose to check only processes of user nagios and did increase the update frequency to 0.5 seconds Then you can see the checks being sparked of. Hugo. -- I hate duplicates. Just reply to the relevant mailinglist. hvdkooij at vanderkooij.org http://hvdkooij.xs4all.nl/ Don't meddle in the affairs of magicians, for they are subtle and quick to anger. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jrojas at shopzilla.com Fri Dec 9 22:02:03 2005 From: jrojas at shopzilla.com (Jason Rojas) Date: Fri, 09 Dec 2005 13:02:03 -0800 Subject: Debugging nagios checks? In-Reply-To: References: <2CA695A2-BDD1-4935-9B61-2283088B2458@halligan.org> <4399DFA2.5030703@ilm.com> <3B7FC993-301F-44BD-BF7A-2044FBEBC8A2@halligan.org> Message-ID: <4399F0CB.6020709@shopzilla.com> An HTML attachment was scrubbed... URL: From estair at ilm.com Fri Dec 9 22:57:04 2005 From: estair at ilm.com (Eli Stair) Date: Fri, 09 Dec 2005 13:57:04 -0800 Subject: eventhandlers running when a dependent service dependency is not satisfied In-Reply-To: <200512090852.jB98q1jd008592@mx1.cs.umb.edu> References: <200512090852.jB98q1jd008592@mx1.cs.umb.edu> Message-ID: <4399FDB0.4080000@ilm.com> Thanks a million for pointing out the 'SCHEDULE_FORCED_SVC_CHECK', I'm now rewriting and testing the event handlers to take care of this. If only there were a macro/variable of the master service... looking for a lightweight way to determine the to pass to the macro that is the direct parent of the check that just failed. WRT the SSH/SNMP dependency issue, I have a feeling that I'm missing something here altogether, or didn't include enough info in my initial report, as both you and Hugo mentioned a possible issue with this. To be clear, I'm doing this only so that if a dependent service IS down (Ganglia) and SNMP has been shown to be up (after 'SCHEDULE_FORCED_SVC_CHECK',) I need to (or want to) make sure that SSH is running before attempting to connect. There are enough failure modes that occur causing SSH to die at the same time as other services that I want to avoid a bunch of high-latency/timeout/CPU event handlers running if they are bound to fail. Thanks for the accurate pointer to that macro, Cheers, /eli Here's the output of view config showing that it is configured the way I think... just not sure if that is something I don't want to do :) Host Service Host Service Dependency Type Dependency Failure Options deathstar1001 SNMP-- Ganglia running deathstar1001 SNMP Notification Warning, Unknown, Critical, Pending deathstar1001 SNMP-- Ganglia running deathstar1001 SNMP Check Execution Warning, Unknown, Critical, Pending deathstar1001 SNMP-- NTP running deathstar1001 SNMP Notification Warning, Unknown, Critical, Pending deathstar1001 SNMP-- NTP running deathstar1001 SNMP Check Execution Warning, Unknown, Critical, Pending deathstar1001 SNMP-- cron running deathstar1001 SNMP Notification Warning, Unknown, Critical, Pending deathstar1001 SNMP-- cron running deathstar1001 SNMP Check Execution Warning, Unknown, Critical, Pending deathstar1001 SNMP-- automounter running 4 instances deathstar1001 SNMP Notification Warning, Unknown, Critical, Pending deathstar1001 SNMP-- automounter running 4 instances deathstar1001 SNMP Check Execution Warning, Unknown, Critical, Pending deathstar1001 SNMP-- load -lt 4 deathstar1001 SNMP Notification Warning, Unknown, Critical, Pending deathstar1001 SNMP-- load -lt 4 deathstar1001 SNMP Check Execution Warning, Unknown, Critical, Pending deathstar1001 SNMP deathstar1001 SSH Notification Warning, Unknown, Critical, Pending deathstar1001 SNMP deathstar1001 SSH Check Execution Warning, Unknown, Critical, Pending John P. Rouillard wrote: > Hi Eli: > > You didn't say what version of nagios you are running so I'll assume > 2.0. > > In message <439912BC.5020000 at ilm.com>, > Eli Stair writes: > >>The question comes down to this: >> >> Should a failed service check for a dependent trigger a check of its >>parent before continuing? > > > IIRC from the code it does not force a check of the parent service. I > can see arguments for and against forcing a poll of the parent. Also > the documentation: > > http://nagios.sourceforge.net/docs/2_0/dependencies.html > > in the "How Service Dependencies Are Tested" section, says: > > Nagios gets the current status of the service that is being depended upon. > > not nagios repolls the service being depended upon. A footnote > says: > > by default, Nagios will use the most current hard state of the > service(s) that is/are being depended upon > > an option in the config file will allow it to use the current soft > state instead. I use the soft state of the service being depended upon > myself. > > >>If this is not the case, or default, is there _ANY_ way to implement this? > > > Sort of. The event handler for the child can send a > SCHEDULE_FORCED_SVC_CHECK external command for the parent specifying > the current time in seconds. See > > http://www.nagios.org/developerinfo/externalcommands/commandinfo.php?command_id=129 > > for details. The command will be acted upon immediately since nagios > reads the external command file after an event handler runs. Use this > to force an update of the current service status for the parent. Parse > through the objects.cache (probably in /var/log/nagios/objects.cache) > file for the expanded servicedependency objects to find the service > dependencies that match your host/service. > > I set my nagios options so that: > > max_check_attempts(dependent)*retry_check_interval(dependent) > > normal_check_interval(parent) > > This way the parent service will be checked at least once during the > soft error interval of the dependent service. > > >>I want to avoid at all costs having an every-minute check of the parent >>processes on many thousand hosts just to keep from having the child >>process checks and event handlers going hay-wire. > > > You need to use the max_check_attempts to provide a buffer in which > the parent service will be checked. You can have your event handler > submit an external command on the first soft error and try to fix the > problem on a subsequent soft, or hard error. You don't have any of > those directives in your sample config. > > >>I want a dependency chain like this: >> >> SSH -- SNMP --\ >> - Ganglia >> - NTP > > > Just a note, I wouldn't have ssh in the dependency chain unless you > are accessing snmp over ssh (e.g. running check_snmp via > check_by_ssh). I can't tell if that is the case or not. Just because > your event handler runs over ssh doesn't add it to the dependency > chain IMO. If ssh is down, it means none of the other services will be > checked and you won't recognize them as down. > > >>I believe I have this set up so that a service check for SNMP is >>dependent on the SSH service running. > > > Did you verify in the web interface or object.cache? > > >>In turn, the service checks for >>other processes that use SNMP are dependent on SNMP running. My intent >>is that service checks for NTP,etc will not be attempted if its parent >>SNMP process is not in an OK state (as I have an event handler that will >>restart SNMP if it is dead). If the parent SNMP _IS_ running, then the >>child process checks (Ganglia, NTP, etc) will be checked and if dead >>their own event handler will activate. > > > It looks like the config is ok on that score with one possible > exception noted below. > > >>The problem is that in this case, if I kill off SNMP the child process >>checks STILL execute and return a CRITICAL. As a result, nagios fires >>off the event handler for all these checks which results in an SSH out >>to the nodes in question and restarting a bunch of services that are >>probably still running. It SHOULD NOT schedule the child checks and >>thus not run their event handlers until AFTER a new parent check has >>returned executed and returned successfully, correct? > > > Nope, nagios doesn't re-run the parent or parents. If you are in a > soft failure mode, you can write your event handler to wait until you > are in a hard failure mode. > > >>I've included a dependency example below, and a snip from the nagios log >>showing it sequentially hammering out checks of all the child processes >>at the same time it already knows the parent is dead. >>[...] >>################################################### >>### snip of this host/group definition include: >>define host{ >> use linux-node-production >> host_name HOSTNAME1 >> address IP >>} >> >>define servicedependency{ >> host_name HOSTNAME1 >> service_description SSH >> dependent_host_name HOSTNAME1 >> dependent_service_description SNMP >> execution_failure_criteria w,p,u,c >> notification_failure_criteria w,p,u,c >> inherits_parent 1 >>} >> >>define servicedependency{ >> host_name HOSTNAME1 >> service_description SNMP >> dependent_host_name HOSTNAME1 >> dependent_service_description SNMP--* > > > Not sure if SNMP--* does what you think (and I hope) it does. Have you > looked at the view config web page and verified that nagios is seeing > the appropriate service dependencies? > > -- rouilj > John Rouillard > =========================================================================== > My employers don't acknowledge my existence much less my opinions. > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rouilj at cs.umb.edu Fri Dec 9 23:14:41 2005 From: rouilj at cs.umb.edu (John P. Rouillard) Date: Fri, 09 Dec 2005 17:14:41 -0500 Subject: eventhandlers running when a dependent service dependency is not satisfied In-Reply-To: Your message of "Fri, 09 Dec 2005 13:57:04 PST." <4399FDB0.4080000@ilm.com> References: <4399FDB0.4080000@ilm.com> Message-ID: <200512092214.jB9MEgJA017304@mx1.cs.umb.edu> In message <4399FDB0.4080000 at ilm.com>, Eli Stair writes: >Thanks a million for pointing out the 'SCHEDULE_FORCED_SVC_CHECK', I'm >now rewriting and testing the event handlers to take care of this. If >only there were a macro/variable of the master service... looking for a >lightweight way to determine the to pass to the >macro that is the direct parent of the check that just failed. One problem is that there can be multiple parents. It's a many to many relationship. Parsing it from the objects cache is pain, but it works. You could run a script that inverts the object.cache file for faster lookup. Then your plugin runs the script which updates the cache only if it is older than the object.cache then you query the inverted cache file. >WRT the SSH/SNMP dependency issue, I have a feeling that I'm missing >something here altogether, or didn't include enough info in my initial >report, as both you and Hugo mentioned a possible issue with this. > >To be clear, I'm doing this only so that if a dependent service IS down >(Ganglia) and SNMP has been shown to be up (after >'SCHEDULE_FORCED_SVC_CHECK',) I need to (or want to) make sure that SSH >is running before attempting to connect. There are enough failure modes >that occur causing SSH to die at the same time as other services that I >want to avoid a bunch of high-latency/timeout/CPU event handlers running >if they are bound to fail. SSH isn't required to do the monitoring. Its required for the response. I would just handle the error in the event handler and submit an apropriate passive response. Make the service have no valid polling time and be volatile reporting only on wanrin and critical. This will make errors in the event handler be reported. -- rouilj John Rouillard =========================================================================== My employers don't acknowledge my existence much less my opinions. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From michael at halligan.org Sat Dec 10 01:31:34 2005 From: michael at halligan.org (Michael T. Halligan) Date: Fri, 9 Dec 2005 16:31:34 -0800 Subject: Debugging nagios checks? In-Reply-To: <4399F0CB.6020709@shopzilla.com> References: <2CA695A2-BDD1-4935-9B61-2283088B2458@halligan.org> <4399DFA2.5030703@ilm.com> <3B7FC993-301F-44BD-BF7A-2044FBEBC8A2@halligan.org> <4399F0CB.6020709@shopzilla.com> Message-ID: -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Jason, Pretty much what I ended up doing, except I just tee'd a copy of the output to a file. Michael On Dec 9, 2005, at 1:02 PM, Jason Rojas wrote: > what about using /bin/echo to echo the command to check the syntax, > then run the plugin by hand from command line to see if there are > any issues? > > -Jason > Hugo van der Kooij wrote: >> On Fri, 9 Dec 2005, Michael T. Halligan wrote: >> >> >>> Yes, yes, and yes. From here, I'm somewhat out of good paths for >>> debugging this. What I can't understand, is I don't even see script >>> processes actually running if I keep a constant ps going.. does >>> Nagios-2 no longer run scripts in a visible manner? >>> >> It may just be too fast for you to see. >> >> I started top. Choose to check only processes of user nagios and did >> increase the update frequency to 0.5 seconds >> >> Then you can see the checks being sparked of. >> >> Hugo. >> >> > ------------------------------------------------------- This SF.net > email is sponsored by: Splunk Inc. Do you grep through log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD > SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ Nagios-users > mailing list Nagios-users at lists.sourceforge.net https:// > lists.sourceforge.net/lists/listinfo/nagios-users ::: Please > include Nagios version, plugin version (-v) and OS when reporting > any issue. ::: Messages without supporting info will risk being > sent to /dev/null -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.1 (Darwin) iD8DBQFDmiHtwjCqooJyNAMRAk4tAJ0YjkIoc1B3xl/VGFBKfPFD7pa3YQCgsvck K9q7+oolq6d/8o08APV01nw= =Mevl -----END PGP SIGNATURE----- ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From michael at halligan.org Sat Dec 10 01:34:21 2005 From: michael at halligan.org (Michael T. Halligan) Date: Fri, 9 Dec 2005 16:34:21 -0800 Subject: false "Host UP" notifications Message-ID: <87A74EC1-2CF6-439A-9CEA-009854B09559@halligan.org> -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 This problem seemed to go away when I switched from 2.0b4 -> 2.0b6, but it's rearing it's head again. I'm wondering if this is some type of a flapping issue. I've tried tunning on & off aggressive host & service checking. As far as I can recall, it's only happening on host checks, not service checks. The notifications look like this: - ---------------------------------------------------------------------- ***** Nagios ***** Notification Type: PROBLEM Host: hostname State: UP Address: IP.ADDR Info: PING OK - Packet loss = 0%, RTA = 0.19 ms Date/Time: Fri Dec 9 16:29:24 PST 2005 - ---------------------------------------------------------------------- -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.1 (Darwin) iD8DBQFDmiKNwjCqooJyNAMRAnT8AKCo4PrftCoZMC9xjQ/OyFo/p1UGyACfZxeI VNRodL/JAQNRypQvhsWU68Y= =T6zy -----END PGP SIGNATURE----- ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From chet.luther at gmail.com Sat Dec 10 03:29:46 2005 From: chet.luther at gmail.com (Chet Luther) Date: Fri, 9 Dec 2005 21:29:46 -0500 Subject: Acknowledge Scripts In-Reply-To: <439890A2.8010204@siu.edu> References: <439890A2.8010204@siu.edu> Message-ID: <366ea3750512091829v4edd07cdlfe5b55c6d2c8c0b3@mail.gmail.com> On 12/8/05, Scott Smith wrote: > I was wondering if anyone would have a script they wrote, or copied from > somewhere, to acknowledge multiple Devices/Hosts that are down at one > time instead of having to goto each individual Host and Ack. > > I.e., one building is down without power, and we have 100 > Switches/Routers in this building. Instead of our NOC Acking 100 times, > it would be nice if they could Ack the Root (Parent) problem and it > would Automatically Ack all the Children. > > Any ideas on this one? If I need to post this in a different List, > please advise which one and I'll post it there. > Thanks in advance. I'd make sure that you have left the "host_unreachable_sound=" in your cgi.cfg commented out. I'm looking at the 2.0b4 code that I run, and it seems to support this properly. -- Chet Luther chet.luther at gmail.com ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From hvdkooij at vanderkooij.org Sat Dec 10 06:54:34 2005 From: hvdkooij at vanderkooij.org (Hugo van der Kooij) Date: Sat, 10 Dec 2005 06:54:34 +0100 (CET) Subject: false "Host UP" notifications In-Reply-To: <87A74EC1-2CF6-439A-9CEA-009854B09559@halligan.org> References: <87A74EC1-2CF6-439A-9CEA-009854B09559@halligan.org> Message-ID: On Fri, 9 Dec 2005, Michael T. Halligan wrote: > --[PinePGP]--------------------------------------------------[begin]-- > This problem seemed to go away when I switched from 2.0b4 -> 2.0b6, > but it's rearing it's head again. I'm wondering > if this is some type of a flapping issue. I've tried tunning on & off > aggressive host & service checking. As far as > I can recall, it's only happening on host checks, not service checks. How is your host defined exactly? What do you see reported in the log file? Hugo. -- I hate duplicates. Just reply to the relevant mailinglist. hvdkooij at vanderkooij.org http://hvdkooij.xs4all.nl/ Don't meddle in the affairs of magicians, for they are subtle and quick to anger. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From james at jameswhite.org Sat Dec 10 16:14:59 2005 From: james at jameswhite.org (James S. White) Date: Sat, 10 Dec 2005 09:14:59 -0600 (CST) Subject: using LDAP-based user authentication In-Reply-To: <43985074.30456.48EDDA6@localhost> References: <43985074.30456.48EDDA6@localhost> Message-ID: We implemented a similar solution using Apache::AuthCookie and it shows the user as just the userid, not the whole DN, It might be worth it to look at Apache::AuthCookie. Just my $.02. ----------------------------------------------------------------------- James S. White primary/voip: (615) 469-0268 928 Mountain Valley Dr. .O. mobile: (256) 476-2619 Nashville, TN 37209 ..O work: (615) 341-8218 http://www.jameswhite.org OOO work-mobile: (615) 519-5160 james at jameswhite.org fax: (866) 260-5465 ----------------------------------------------------------------------- "Computers are useless. They only give you answers." -- Pablo Picasso On Thu, 8 Dec 2005, Jeff Rattray wrote: > Greetings, everyone. We (Purdue College of Pharmacy, Nursing and Health > Sciences) have been using nagios for half of a year now, and it has been really > handy. Currently, we are trying to change the way nagios authenticates users. > Purdue has implemented a new security policy that says that all passwords, > whatever they are for, must be changed every 30 days. Leaving aside the > difficulties about changing, say, BIOS passwords in the student labs, we are > trying to reduce the number of usernames and passwords that we have to make it > easier to keep up with the password-changing requirement. > > I have successfully configured mod_authz_ldap to handle authentication on our > nagios server, and I can log into the nagios pages. Unfortunately, instead of > returning a simple username like 'jrattray', the CGIs see my username as > 'uid=jrattray,ou=people,o=pnhs.purdue.edu' . I cannot authorize myself to see > any information, because when I put that string into cgi.cfg, nagios breaks it > up at the commas into 3 different users. (The equals signs may be confusing it > also.) > > Here are my questions: > > (1) Does anyone know how to escape this username so that it works in cgi.cfg? > I have tried entering it as a quoted string, putting slashes in front of the > commas, etc. > > (2) Failing that, has anyone had any experience using nagios with > mod_auth_ldap? I used mod_authz_ldap (a third-party product) only because it > came with Red Hat EL, but I don't want to bother compiling the official apache > module if it does exactly the same thing. > > (3) Failing THAT, how does one submit this to the wish list for nagios? > > Thanks, > Jeff Rattray > -- > Jeff Rattray, Ph.D. > Manager, Web-based Instructional Technology > Purdue University College of Pharmacy, Nursing and Health Sciences > > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jrattray at pnhs.purdue.edu Sat Dec 10 20:40:55 2005 From: jrattray at pnhs.purdue.edu (Jeff Rattray) Date: Sat, 10 Dec 2005 14:40:55 -0500 Subject: using LDAP-based user authentication In-Reply-To: References: <43985074.30456.48EDDA6@localhost> Message-ID: <439AE8F7.27179.4C8C9E6@localhost> On 10 Dec 2005 at 9:14, James S. White wrote: > > We implemented a similar solution using Apache::AuthCookie and it shows > the user as just the userid, not the whole DN, It might be worth it to > look at Apache::AuthCookie. > > > > > I have successfully configured mod_authz_ldap to handle authentication on our > > nagios server, and I can log into the nagios pages. Unfortunately, instead of > > returning a simple username like 'jrattray', the CGIs see my username as > > 'uid=jrattray,ou=people,o=pnhs.purdue.edu' . I cannot authorize myself to see > > any information, because when I put that string into cgi.cfg, nagios breaks it > > up at the commas into 3 different users. (The equals signs may be confusing it > > also.) > > > James - That's worth looking into. The docs for Apache::AuthCookie say you have to implement two functions (authen_cred() and authen_ses_key()), and a login page. Presumably you have written perl scripts to do this for your LDAP store; would you be willing to send me a copy of them to crib off of? thanks, Jeff -- Jeff Rattray, Ph.D. Manager, Web-based Instructional Technology Purdue University College of Pharmacy, Nursing and Health Sciences ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From michael at halligan.org Sat Dec 10 21:31:32 2005 From: michael at halligan.org (Michael T. Halligan) Date: Sat, 10 Dec 2005 12:31:32 -0800 Subject: false "Host UP" notifications In-Reply-To: References: <87A74EC1-2CF6-439A-9CEA-009854B09559@halligan.org> Message-ID: -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Hugo, # Host Definition define host { host_name HOST alias HOST address IPADDR use HOSTTEMPLATE contact_groups noc } #Host Template # host_templates HOSTTEMPLATE define host { name HOSTTEMPLATE process_perf_data 1 retain_status_information 1 flap_detection_enabled 0 retain_nonstatus_information 0 active_checks_enabled 1 passive_checks_enabled 0 check_period 24x7 obsess_over_host 1 check_freshness 1 check_command check-host-alive max_check_attempts 3 event_handler_enabled 0 notifications_enabled 1 notification_interval 120 notification_period 24x7 notification_options d,u,r contact_groups noc register 0 } On Dec 9, 2005, at 9:54 PM, Hugo van der Kooij wrote: > On Fri, 9 Dec 2005, Michael T. Halligan wrote: > >> --[PinePGP]-------------------------------------------------- >> [begin]-- >> This problem seemed to go away when I switched from 2.0b4 -> 2.0b6, >> but it's rearing it's head again. I'm wondering >> if this is some type of a flapping issue. I've tried tunning on & off >> aggressive host & service checking. As far as >> I can recall, it's only happening on host checks, not service checks. > > How is your host defined exactly? What do you see reported in the log > file? > > Hugo. > > -- > I hate duplicates. Just reply to the relevant mailinglist. > hvdkooij at vanderkooij.org http://hvdkooij.xs4all.nl/ > Don't meddle in the affairs of magicians, > for they are subtle and quick to anger. > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through > log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD > SPLUNK! > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.1 (Darwin) iD8DBQFDmzsowjCqooJyNAMRAth0AKC57ZuPMLrVc7zfibJiMgfLra+uTACgwMfR qHB7ZGjSScr2sMvotwG6SCM= =dBtt -----END PGP SIGNATURE----- ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From hvdkooij at vanderkooij.org Sat Dec 10 23:37:01 2005 From: hvdkooij at vanderkooij.org (Hugo van der Kooij) Date: Sat, 10 Dec 2005 23:37:01 +0100 (CET) Subject: false "Host UP" notifications In-Reply-To: References: <87A74EC1-2CF6-439A-9CEA-009854B09559@halligan.org> Message-ID: On Sat, 10 Dec 2005, Michael T. Halligan wrote: > # Host Definition > # host_templates HOSTTEMPLATE .... Looks ok. But what do the logs tell you? And what if you move options from the template to the host? Hugo. -- I hate duplicates. Just reply to the relevant mailinglist. hvdkooij at vanderkooij.org http://hvdkooij.xs4all.nl/ Don't meddle in the affairs of magicians, for they are subtle and quick to anger. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From michael at halligan.org Sun Dec 11 01:38:46 2005 From: michael at halligan.org (Michael T. Halligan) Date: Sat, 10 Dec 2005 16:38:46 -0800 Subject: false "Host UP" notifications In-Reply-To: References: <87A74EC1-2CF6-439A-9CEA-009854B09559@halligan.org> Message-ID: <5277CCCF-2114-41FA-8051-FB45E2244BB7@halligan.org> -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Hugo, Nothing out of the ordinary in the logs really.. Just a bunch of host up hard messages, but without any corresponding host down messages. Does this have something to do with freshness testing, maybe? Hmm, I just realized that my subject line is a little bit misleading. By saying that the notifications I'm getting are false host-up notifications, I'm not saying that the host is down. The host has been up for a couple of weeks, yet nagios just keeps reminding me that the host is up. On Dec 10, 2005, at 2:37 PM, Hugo van der Kooij wrote: > On Sat, 10 Dec 2005, Michael T. Halligan wrote: > >> # Host Definition >> # host_templates HOSTTEMPLATE > > .... > > Looks ok. But what do the logs tell you? And what if you move > options from > the template to the host? > > Hugo. > > -- > I hate duplicates. Just reply to the relevant mailinglist. > hvdkooij at vanderkooij.org http://hvdkooij.xs4all.nl/ > Don't meddle in the affairs of magicians, > for they are subtle and quick to anger. > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through > log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD > SPLUNK! > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.1 (Darwin) iD8DBQFDm3UawjCqooJyNAMRAq1JAKCMOzoaVX6KXFWPie7k9gB7lBGpQACfduQl wm3+hCY3Nr7bLogXvTvIPxA= =/8GI -----END PGP SIGNATURE----- ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From hvdkooij at vanderkooij.org Sun Dec 11 16:17:01 2005 From: hvdkooij at vanderkooij.org (Hugo van der Kooij) Date: Sun, 11 Dec 2005 16:17:01 +0100 (CET) Subject: false "Host UP" notifications In-Reply-To: <5277CCCF-2114-41FA-8051-FB45E2244BB7@halligan.org> References: <87A74EC1-2CF6-439A-9CEA-009854B09559@halligan.org> <5277CCCF-2114-41FA-8051-FB45E2244BB7@halligan.org> Message-ID: On Sat, 10 Dec 2005, Michael T. Halligan wrote: > Nothing out of the ordinary in the logs really.. > Just a bunch of host up hard messages, but without any corresponding > host down messages. > > Does this have something to do with freshness testing, maybe? Disable it if you run active checks on your hosts. Hugo. -- I hate duplicates. Just reply to the relevant mailinglist. hvdkooij at vanderkooij.org http://hvdkooij.xs4all.nl/ Don't meddle in the affairs of magicians, for they are subtle and quick to anger. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Lui.external at infineon.com Mon Dec 12 02:52:37 2005 From: Lui.external at infineon.com (Lui.external at infineon.com) Date: Mon, 12 Dec 2005 09:52:37 +0800 Subject: How to set host_check_alive command's time out to 2 minutes? Message-ID: Hi, Is it possible for us to set the threshold for host_check_alive command time out to 2 minutes? We will considered the server is really down after 2 minutes. Thanks. Warmest Regards, Louise ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From hkaelber at math.uni-bielefeld.de Mon Dec 12 12:57:51 2005 From: hkaelber at math.uni-bielefeld.de (Holger Kaelberer) Date: Mon, 12 Dec 2005 12:57:51 +0100 Subject: delayed notifications Message-ID: <20051212115747.GB21841@math.uni-bielefeld.de> Hi, I am having problems with (host- and service-) notifications not being send out when they should. Concrete example: Host crashed on friday: /--- Dec 9 21:45:38 cnode4 nagios: HOST ALERT: nectar;DOWN;SOFT;1;FPING CRITICAL - nectar (loss=100.000000% ) Dec 9 21:45:38 cnode4 nagios: HOST ALERT: nectar;DOWN;SOFT;2;FPING CRITICAL - nectar (loss=100.000000% ) Dec 9 21:45:39 cnode4 nagios: HOST ALERT: nectar;DOWN;SOFT;3;FPING CRITICAL - nectar (loss=100.000000% ) Dec 9 21:45:39 cnode4 nagios: HOST ALERT: nectar;DOWN;SOFT;4;FPING CRITICAL - nectar (loss=100.000000% ) Dec 9 21:45:40 cnode4 nagios: HOST ALERT: nectar;DOWN;HARD;5;FPING CRITICAL - nectar (loss=100.000000% ) ... Dec 10 00:00:18 cnode4 nagios: CURRENT HOST STATE: nectar;DOWN;HARD;5;FPING CRITICAL - nectar (loss=100.000000% ) ... Dec 11 00:00:19 cnode4 nagios: CURRENT HOST STATE: nectar;DOWN;HARD;5;FPING CRITICAL - nectar (loss=100.000000% ) ---/ Notifications on monday morning: /--- Dec 12 07:00:29 cnode4 nagios: HOST NOTIFICATION: nagios;nectar;DOWN;host-notify-by-email;FPING CRITICAL - nectar (loss=100.000000% ) ---/ I just upgraded to 2.0b5 (from 1.2) out of global config: /--- log_host_retries=1 retain_state_information=1 enable_notifications=1 use_retained_program_state=1 use_retained_scheduling_info=0 ---/ host-config for the example: /--- define host{ host_name nectar alias nectar event_handler_enabled 0 flap_detection_enabled 1 process_perf_data 1 retain_status_information 1 retain_nonstatus_information 0 check_command check-host-alive max_check_attempts 5 notifications_enabled 1 notification_interval 0 notification_period 24x7 notification_options d,u contact_groups admpers } ---/ Any ideas how to further trace the problem? I already had the same problem when enbling the global enable_notifications and restarting nagios; that maybe has another reason. Thanks, Holger. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From fernando.shayani at bsb.politec.com.br Mon Dec 12 14:54:05 2005 From: fernando.shayani at bsb.politec.com.br (Fernando Shayani) Date: Mon, 12 Dec 2005 10:54:05 -0300 Subject: Nagios HANGS scheduling info Message-ID: <1964B929D8FF4D4DB06854A75D0FD3730CC8F2C7@bsbserv028.bsb.politec.com.br> Well, it's STILL hangs... Here is my configuration: log_file=/usr/local/nagios/var/nagios.log temp_file=/usr/local/nagios/var/nagios.tmp status_file=/usr/local/nagios/var/status.dat aggregate_status_updates=1 status_update_interval=3 nagios_user=nagios nagios_group=nagios enable_notifications=1 execute_service_checks=1 accept_passive_service_checks=1 enable_event_handlers=1 log_rotation_method=d log_archive_path=/usr/local/nagios/var/archives check_external_commands=1 command_check_interval=-1 command_file=/usr/local/nagios/var/rw/nagios.cmd downtime_file=/usr/local/nagios/var/downtime.dat comment_file=/usr/local/nagios/var/comments.dat lock_file=/usr/local/nagios/var/nagios.lock retain_state_information=1 state_retention_file=/usr/local/nagios/var/retention.dat retention_update_interval=360 use_retained_program_state=1 use_syslog=1 log_notifications=1 log_service_retries=0 log_host_retries=0 log_event_handlers=1 log_initial_states=0 log_external_commands=0 log_passive_checks=0 sleep_time=1 service_interleave_factor=s max_concurrent_checks=0 service_reaper_frequency=2 interval_length=60 use_aggressive_host_checking=0 enable_flap_detection=1 low_service_flap_threshold=20 high_service_flap_threshold=80 low_host_flap_threshold=20 high_host_flap_threshold=80 soft_state_dependencies=0 service_check_timeout=25 host_check_timeout=10 event_handler_timeout=30 notification_timeout=15 ocsp_timeout=60 perfdata_timeout=60 obsess_over_services=0 process_performance_data=0 check_for_orphaned_services=1 check_service_freshness=0 freshness_check_interval=60 date_format=euro illegal_object_name_chars=`~!$%&*|\\\'\\\"<>?,()= illegal_macro_output_chars=`~$&|\\\'\\\"<> admin_email=fernando.shayani at bsb.politec.com.br service_inter_check_delay_method=s max_service_check_spread=15 host_inter_check_delay_method=s max_host_check_spread=15 auto_reschedule_checks=1 auto_rescheduling_interval=30 auto_rescheduling_window=180 And here is my STATS: CURRENT STATUS DATA ---------------------------------------------------- Status File: /usr/local/nagios/var/status.dat Status File Age: 0d 0h 0m 3s Status File Version: 2.0b6 Program Running Time: 0d 2h 47m 39s Total Services: 314 Services Checked: 314 Services Scheduled: 313 Active Service Checks: 314 Passive Service Checks: 0 Total Service State Change: 0.000 / 12.110 / 0.299 % Active Service Latency: 0.003 / 358.274 / 16.984 % Active Service Execution Time: 0.036 / 25.014 / 2.207 sec Active Service State Change: 0.000 / 12.110 / 0.299 % Active Services Last 1/5/15/60 min: 48 / 182 / 267 / 277 Passive Service State Change: 0.000 / 0.000 / 0.000 % Passive Services Last 1/5/15/60 min: 0 / 0 / 0 / 0 Services Ok/Warn/Unk/Crit: 304 / 3 / 4 / 3 Services Flapping: 0 Services In Downtime: 0 Total Hosts: 129 Hosts Checked: 129 Hosts Scheduled: 1 Active Host Checks: 129 Passive Host Checks: 0 Total Host State Change: 0.000 / 10.260 / 0.291 % Active Host Latency: 0.000 / 0.176 / 0.001 % Active Host Execution Time: 0.000 / 5.973 / 2.012 sec Active Host State Change: 0.000 / 10.260 / 0.291 % Active Hosts Last 1/5/15/60 min: 1 / 2 / 3 / 6 Passive Host State Change: 0.000 / 0.000 / 0.000 % Passive Hosts Last 1/5/15/60 min: 0 / 0 / 0 / 0 Hosts Up/Down/Unreach: 128 / 1 / 0 Hosts Flapping: 0 Hosts In Downtime: 0 Please... Help... Fernando Shayani fernando.shayani at bsb.politec.com.br (61) 3038-6951 POLITEC - Bras?lia - DF -----Original Message----- From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Fernando Shayani Sent: quinta-feira, 8 de dezembro de 2005 07:45 To: Marco Ramos Cc: nagios-users at lists.sourceforge.net Subject: RES: [Nagios-users] Nagios HANGS scheduling info Ok. It continues hanging... Now I changed the REAPER from 5 to 2... Lets see.. Thanks Fernando -----Mensagem original----- De: Marco Ramos [mailto:mramos at co.sapo.pt] Enviada: ter 06-dez-05 16:42 Para: Fernando Shayani Cc: nagios-users at lists.sourceforge.net Assunto: RE: [Nagios-users] Nagios HANGS scheduling info Try to tune your service_reaper_frequency and max_concurrent_checks values. Take a look at http://nagios.org/faqs/viewfaq.php?faq_id=115. Had the same problem a while ago and managed to solve it tunning this two options. regards, Marco Ramos On Tue, 2005-12-06 at 14:53 -0300, Fernando Shayani wrote: > Well, my configuration is: > > Inter-check sleep time (sleep_time=0.25) > Service inter-check delay method (service_inter_check_delay_method=s) > Maximum service check spread (max_service_check_spread=2) > Service interleave factor (service_interleave_factor=s) > Maximum concurrent service checks (max_concurrent_checks=0) > Service reaper frequency (service_reaper_frequency=5) > Host inter-check delay method (host_inter_check_delay_method=s) > Maximum host check spread (max_host_check_spread=2) > Timing interval length (interval_length=60) > Agressive host checking option (use_aggressive_host_checking=0) > > The following options are not set. > Auto-rescheduling option > Auto-rescheduling interval > Auto-rescheduling window > > > I will read the configuration and recheck it all. > > I also got the SYSLOG line right after que problem. I hope this could help you help me. > > Dec 6 06:33:24 bsbserv007 nagios: Warning: The check of service 'CPU LOAD' on host 'BSBSERV017' could not be performed due to a fork() error. The check will be rescheduled. > > > Thanks for the help. > > > Fernando Shayani > fernando.shayani at bsb.politec.com.br > (61) 3038-6951 > POLITEC - Bras?lia - DF > > -----Original Message----- > From: Marcel Mitsuto Fucatu Sugano [mailto:msugano at uolinc.com] > Sent: segunda-feira, 5 de dezembro de 2005 15:33 > To: Fernando Shayani > Cc: Eli Stair; nagios-users at lists.sourceforge.net > Subject: RE: [Nagios-users] Nagios HANGS scheduling info > > On Mon, 2005-12-05 at 09:01 -0300, Fernando Shayani wrote: > > Well, I upgraded to b6, enabled the Orphaned Service and still > > hangs... > > > > Is there any other clue? > > > > Fernando Shayani > > fernando.shayani at bsb.politec.com.br > > (61) 3038-6951 > > POLITEC - Bras?lia - DF > > Have you followed the instructions available at: > http://nagios.sourceforge.net/docs/2_0/configmain.html ? > > Read that and check the following configs: > Inter-check sleep time > Service inter-check delay method > Maximum service check spread > Service interleave factor > Maximum concurrent service checks > Service reaper frequency > Host inter-check delay method > Maximum host check spread > Timing interval length > Auto-rescheduling option > Auto-rescheduling interval > Auto-rescheduling window > > Agressive host checking option > > Which are related to the scheduling options, and might be the source of your problem. Have you messed up those variables? Whatever the case, post the values of this configuration variables. > > HTH, > -- > Marcel Mitsuto Fucatu Sugano Universo Online S.A. -- http://www.uol.com.br > > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_idv37&alloc_id865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=ick _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From hkaelber at math.uni-bielefeld.de Mon Dec 12 15:36:54 2005 From: hkaelber at math.uni-bielefeld.de (Holger Kaelberer) Date: Mon, 12 Dec 2005 15:36:54 +0100 Subject: delayed notifications In-Reply-To: <20051212115747.GB21841@math.uni-bielefeld.de> References: <20051212115747.GB21841@math.uni-bielefeld.de> Message-ID: <20051212143654.GH21841@math.uni-bielefeld.de> I don't like answering my own questions ... but the problem ist solved: contact-timeperiod was workhours, now is "24x7". Thanks to Joerg Linge! Holger Kaelberer wrote on Mon Dec 12, 2005 um 12:57:50PM: > > Hi, > > I am having problems with (host- and service-) notifications not being send > out when they should. Concrete example: > > Host crashed on friday: > /--- > Dec 9 21:45:38 cnode4 nagios: HOST ALERT: nectar;DOWN;SOFT;1;FPING CRITICAL > - nectar (loss=100.000000% ) > Dec 9 21:45:38 cnode4 nagios: HOST ALERT: nectar;DOWN;SOFT;2;FPING CRITICAL > - nectar (loss=100.000000% ) > Dec 9 21:45:39 cnode4 nagios: HOST ALERT: nectar;DOWN;SOFT;3;FPING CRITICAL > - nectar (loss=100.000000% ) > Dec 9 21:45:39 cnode4 nagios: HOST ALERT: nectar;DOWN;SOFT;4;FPING CRITICAL > - nectar (loss=100.000000% ) > Dec 9 21:45:40 cnode4 nagios: HOST ALERT: nectar;DOWN;HARD;5;FPING CRITICAL > - nectar (loss=100.000000% ) > > ... > > Dec 10 00:00:18 cnode4 nagios: CURRENT HOST STATE: nectar;DOWN;HARD;5;FPING > CRITICAL - nectar (loss=100.000000% ) > > ... > > Dec 11 00:00:19 cnode4 nagios: CURRENT HOST STATE: nectar;DOWN;HARD;5;FPING > CRITICAL - nectar (loss=100.000000% ) > ---/ > > Notifications on monday morning: > /--- > Dec 12 07:00:29 cnode4 nagios: HOST NOTIFICATION: > nagios;nectar;DOWN;host-notify-by-email;FPING CRITICAL - nectar > (loss=100.000000% ) > ---/ > > I just upgraded to 2.0b5 (from 1.2) > > out of global config: > /--- > log_host_retries=1 > retain_state_information=1 > enable_notifications=1 > use_retained_program_state=1 > use_retained_scheduling_info=0 > ---/ > > host-config for the example: > /--- > define host{ > host_name nectar > alias nectar > event_handler_enabled 0 > flap_detection_enabled 1 > process_perf_data 1 > retain_status_information 1 > retain_nonstatus_information 0 > check_command check-host-alive > max_check_attempts 5 > notifications_enabled 1 > notification_interval 0 > notification_period 24x7 > notification_options d,u > contact_groups admpers > } > ---/ > > Any ideas how to further trace the problem? > > I already had the same problem when enbling the global enable_notifications and restarting nagios; that maybe has another reason. > > Thanks, > Holger. > -- Holger Kaelberer Computerlabor Mathematik, U5-138 Universitaet Bielefeld Tel.: (0521) 106-3861 ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From emartine at bsd.uchicago.edu Mon Dec 12 16:05:41 2005 From: emartine at bsd.uchicago.edu (Eduardo Martinez) Date: Mon, 12 Dec 2005 09:05:41 -0600 Subject: Nagios Notification Format In-Reply-To: <20051212041506.DA7FF88F88@sc8-sf-spam1.sourceforge.net> References: <20051212041506.DA7FF88F88@sc8-sf-spam1.sourceforge.net> Message-ID: <200512121505.jBCF5h33016221@union-station.bsd.uchicago.edu> This is not really a problem but a curiosity. Maybe I missed something in the documentation but I could not find where to edit this. I want to add a few lines to the notification style that is sent out by Nagios. Example notification that I would like to modify. ######################################## ***** Nagios ***** Notification Type: RECOVERY Service: My Service Host: My Host Address: IP Address State: OK Date/Time: Mon Dec 12 08:33:03 CST 2005 Additional Info: All processes are running ######################################## I know the notification type, service, host, address, state, and date are filled in and generated by the configuration you give it and the Nagios plugins. But how is the entire notification generated? Any help would be greatly appreciated. Thank you very much, -Ed This email is intended only for the use of the individual or entity to which it is addressed and may contain information that is privileged and confidential. If the reader of this email message is not the intended recipient, you are hereby notified that any dissemination, distribution, or copying of this communication is prohibited. If you have received this email in error, please notify the sender and destroy/delete all copies of the transmittal. Thank you. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From bjd at compudentsystems.com Mon Dec 12 16:05:56 2005 From: bjd at compudentsystems.com (Brian J. Dent) Date: Mon, 12 Dec 2005 07:05:56 -0800 Subject: Command line argument for Windows Message-ID: <003901c5ff2d$8e02da50$1904a8c0@AUGUSTINE> I'm testing a script where I pass an IP address to a Windows command/batch file. The IP is separated by the ! as shown below. In the definition of the command, I define -p $ARG1$. And finally on the host I'm trying to catch the argument. Either the command definition or the actual argument in Windows is not right. Any comments? >From the definition file: define service{ . check_command check_nrpe_nt_remote_arg!192.168.4.8 } >From the checkcommands.cfg: define command{ command_name check_nrpe_nt_remote_arg command_line /usr/local/nagios/libexec/check_nrpe -H $HOSTADDRESS$ -c nrpe_nt_check_remote_arg -p $ARG1$ } >From the host's NRPE.cfg file: command[nrpe_nt_check_remote_arg]=c:\nrpe_nt\ping_remote.cmd $arg1$ Regards, bjd ~Brian J. Dent CompuDent Systems v (831) 649-2575, c (831) 224-2875, f (831) 649-0948 bjd at compudentsystems.com http://www.compudentsystems.com NOTE: This e-mail is confidential and is intended only for the recipient(s) listed. Unauthorized use or disclosure of this e-mail or any of the information in it is strictly prohibited. If you are not a listed recipient or someone authorized to receive e-mail on behalf of a listed recipient, please reply to the sender that the e-mail was misdirected and delete the e-mail. Thank you. -------------- next part -------------- An HTML attachment was scrubbed... URL: From jeffwilliams05 at gmail.com Mon Dec 12 16:22:34 2005 From: jeffwilliams05 at gmail.com (Jeff Williams) Date: Mon, 12 Dec 2005 09:22:34 -0600 Subject: possible bug: escalations don't work if state changes Message-ID: <17e351010512120722t629d9c6fj69dffae607b8b84d@mail.gmail.com> I am using Nagios 2.0b6, but also experienced this issue in 2.0b4. Nagios escalations do not seem to work when the state changes after a maximum notifications level has been reached. For example, if a service hits the warning level, then escalates, and then reaches a stop level for notifications (5 for example). This is assuming that the administrator does not acknowledge any of the notifications (not likely, but happened to us recently). Then after a period of time, the service moves to the critical level. Nagios, in my opinion, should then start paging the contact groups specified in the service definition. After this, it should escalate when necessary. However, this is not the case. Nagios pages the contacts in the service definition (oncall,ulsysadmins in the example below) indefinitely until acknowledged or an OK state is reached. Despite the fact that the administrator SHOULD have acknowledged the pages, I feel that this is a bug. However, I may have something set incorrectly in the configs. I have included the corresponding config files below. define service{ use generic-service host_name test-server service_description /MYSQL is_volatile 0 check_period 24x7 retry_check_interval 1 contact_groups oncall,ulsysadmins notification_interval 5 notification_period 24x7 notification_options w,u,c,r check_command check_nrpe!check_disk7 } define host{ use generic-host host_name test-server alias test-server address 192.168.1.1 parents switch-1 check_command check-host-alive check_interval 4 notification_interval 0 notification_period 24x7 notification_options d,r contact_groups oncall,ulsysadmins } define hostescalation{ host_name test-server last_notification 2 notification_interval 5 contact_groups oncall } define hostescalation{ host_name test-server first_notification 3 last_notification 4 notification_interval 5 contact_groups oncall,backup } define hostescalation{ host_name test-server first_notification 5 last_notification 5 notification_interval 0 contact_groups oncall,backup } define serviceescalation{ host_name test-server service_description /MYSQL first_notification 2 last_notification 2 notification_interval 5 contact_groups oncall } define serviceescalation{ host_name test-server service_description /MYSQL first_notification 3 last_notification 4 notification_interval 5 contact_groups oncall,backup } define serviceescalation{ host_name test-server service_description /MYSQL first_notification 5 last_notification 5 notification_interval 0 contact_groups oncall,backup } Please let me know if I need to include more information. Thanks in advance. Jeff ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mramos at co.sapo.pt Mon Dec 12 16:34:33 2005 From: mramos at co.sapo.pt (Marco Ramos) Date: Mon, 12 Dec 2005 15:34:33 +0000 Subject: Nagios HANGS scheduling info In-Reply-To: <1964B929D8FF4D4DB06854A75D0FD3730CC8F2C7@bsbserv028.bsb.politec.com.br> References: <1964B929D8FF4D4DB06854A75D0FD3730CC8F2C7@bsbserv028.bsb.politec.com.br> Message-ID: <1134401674.5866.22.camel@localhost.localdomain> The problem should be your max_concurrent_checks set to 0. Run "nagios - s nagios.cfg" and set the max_concurrent_check to the value suggested. This should fix it. Best regards, Marco Ramos On Mon, 2005-12-12 at 10:54 -0300, Fernando Shayani wrote: > Well, it's STILL hangs... Here is my configuration: > > log_file=/usr/local/nagios/var/nagios.log > temp_file=/usr/local/nagios/var/nagios.tmp > status_file=/usr/local/nagios/var/status.dat > aggregate_status_updates=1 > status_update_interval=3 > nagios_user=nagios > nagios_group=nagios > enable_notifications=1 > execute_service_checks=1 > accept_passive_service_checks=1 > enable_event_handlers=1 > log_rotation_method=d > log_archive_path=/usr/local/nagios/var/archives > check_external_commands=1 > command_check_interval=-1 > command_file=/usr/local/nagios/var/rw/nagios.cmd > downtime_file=/usr/local/nagios/var/downtime.dat > comment_file=/usr/local/nagios/var/comments.dat > lock_file=/usr/local/nagios/var/nagios.lock > retain_state_information=1 > state_retention_file=/usr/local/nagios/var/retention.dat > retention_update_interval=360 > use_retained_program_state=1 > use_syslog=1 > log_notifications=1 > log_service_retries=0 > log_host_retries=0 > log_event_handlers=1 > log_initial_states=0 > log_external_commands=0 > log_passive_checks=0 > sleep_time=1 > service_interleave_factor=s > max_concurrent_checks=0 > service_reaper_frequency=2 > interval_length=60 > use_aggressive_host_checking=0 > enable_flap_detection=1 > low_service_flap_threshold=20 > high_service_flap_threshold=80 > low_host_flap_threshold=20 > high_host_flap_threshold=80 > soft_state_dependencies=0 > service_check_timeout=25 > host_check_timeout=10 > event_handler_timeout=30 > notification_timeout=15 > ocsp_timeout=60 > perfdata_timeout=60 > obsess_over_services=0 > process_performance_data=0 > check_for_orphaned_services=1 > check_service_freshness=0 > freshness_check_interval=60 > date_format=euro > illegal_object_name_chars=`~!$%&*|\\\'\\\"<>?,()= > illegal_macro_output_chars=`~$&|\\\'\\\"<> > admin_email=fernando.shayani at bsb.politec.com.br > service_inter_check_delay_method=s > max_service_check_spread=15 > host_inter_check_delay_method=s > max_host_check_spread=15 > auto_reschedule_checks=1 > auto_rescheduling_interval=30 > auto_rescheduling_window=180 > > > And here is my STATS: > > CURRENT STATUS DATA > ---------------------------------------------------- > Status File: /usr/local/nagios/var/status.dat > Status File Age: 0d 0h 0m 3s > Status File Version: 2.0b6 > > Program Running Time: 0d 2h 47m 39s > > Total Services: 314 > Services Checked: 314 > Services Scheduled: 313 > Active Service Checks: 314 > Passive Service Checks: 0 > Total Service State Change: 0.000 / 12.110 / 0.299 % > Active Service Latency: 0.003 / 358.274 / 16.984 % > Active Service Execution Time: 0.036 / 25.014 / 2.207 sec > Active Service State Change: 0.000 / 12.110 / 0.299 % > Active Services Last 1/5/15/60 min: 48 / 182 / 267 / 277 > Passive Service State Change: 0.000 / 0.000 / 0.000 % > Passive Services Last 1/5/15/60 min: 0 / 0 / 0 / 0 > Services Ok/Warn/Unk/Crit: 304 / 3 / 4 / 3 > Services Flapping: 0 > Services In Downtime: 0 > > Total Hosts: 129 > Hosts Checked: 129 > Hosts Scheduled: 1 > Active Host Checks: 129 > Passive Host Checks: 0 > Total Host State Change: 0.000 / 10.260 / 0.291 % > Active Host Latency: 0.000 / 0.176 / 0.001 % > Active Host Execution Time: 0.000 / 5.973 / 2.012 sec > Active Host State Change: 0.000 / 10.260 / 0.291 % > Active Hosts Last 1/5/15/60 min: 1 / 2 / 3 / 6 > Passive Host State Change: 0.000 / 0.000 / 0.000 % > Passive Hosts Last 1/5/15/60 min: 0 / 0 / 0 / 0 > Hosts Up/Down/Unreach: 128 / 1 / 0 > Hosts Flapping: 0 > Hosts In Downtime: 0 > > > Please... Help... > > > Fernando Shayani > fernando.shayani at bsb.politec.com.br > (61) 3038-6951 > POLITEC - Bras?lia - DF > > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Fernando Shayani > Sent: quinta-feira, 8 de dezembro de 2005 07:45 > To: Marco Ramos > Cc: nagios-users at lists.sourceforge.net > Subject: RES: [Nagios-users] Nagios HANGS scheduling info > > Ok. It continues hanging... > > Now I changed the REAPER from 5 to 2... Lets see.. > Thanks > Fernando > > -----Mensagem original----- > De: Marco Ramos [mailto:mramos at co.sapo.pt] > Enviada: ter 06-dez-05 16:42 > Para: Fernando Shayani > Cc: nagios-users at lists.sourceforge.net > Assunto: RE: [Nagios-users] Nagios HANGS scheduling info > > > > > Try to tune your service_reaper_frequency and max_concurrent_checks > values. Take a look at http://nagios.org/faqs/viewfaq.php?faq_id=115. > > Had the same problem a while ago and managed to solve it tunning this > two options. > > regards, > Marco Ramos > > On Tue, 2005-12-06 at 14:53 -0300, Fernando Shayani wrote: > > Well, my configuration is: > > > > Inter-check sleep time (sleep_time=0.25) > > Service inter-check delay method (service_inter_check_delay_method=s) > > Maximum service check spread (max_service_check_spread=2) > > Service interleave factor (service_interleave_factor=s) > > Maximum concurrent service checks (max_concurrent_checks=0) > > Service reaper frequency (service_reaper_frequency=5) > > Host inter-check delay method (host_inter_check_delay_method=s) > > Maximum host check spread (max_host_check_spread=2) > > Timing interval length (interval_length=60) > > Agressive host checking option (use_aggressive_host_checking=0) > > > > The following options are not set. > > Auto-rescheduling option > > Auto-rescheduling interval > > Auto-rescheduling window > > > > > > I will read the configuration and recheck it all. > > > > I also got the SYSLOG line right after que problem. I hope this could help you help me. > > > > Dec 6 06:33:24 bsbserv007 nagios: Warning: The check of service 'CPU LOAD' on host 'BSBSERV017' could not be performed due to a fork() error. The check will be rescheduled. > > > > > > Thanks for the help. > > > > > > Fernando Shayani > > fernando.shayani at bsb.politec.com.br > > (61) 3038-6951 > > POLITEC - Bras?lia - DF > > > > -----Original Message----- > > From: Marcel Mitsuto Fucatu Sugano [mailto:msugano at uolinc.com] > > Sent: segunda-feira, 5 de dezembro de 2005 15:33 > > To: Fernando Shayani > > Cc: Eli Stair; nagios-users at lists.sourceforge.net > > Subject: RE: [Nagios-users] Nagios HANGS scheduling info > > > > On Mon, 2005-12-05 at 09:01 -0300, Fernando Shayani wrote: > > > Well, I upgraded to b6, enabled the Orphaned Service and still > > > hangs... > > > > > > Is there any other clue? > > > > > > Fernando Shayani > > > fernando.shayani at bsb.politec.com.br > > > (61) 3038-6951 > > > POLITEC - Bras?lia - DF > > > > Have you followed the instructions available at: > > http://nagios.sourceforge.net/docs/2_0/configmain.html ? > > > > Read that and check the following configs: > > Inter-check sleep time > > Service inter-check delay method > > Maximum service check spread > > Service interleave factor > > Maximum concurrent service checks > > Service reaper frequency > > Host inter-check delay method > > Maximum host check spread > > Timing interval length > > Auto-rescheduling option > > Auto-rescheduling interval > > Auto-rescheduling window > > > > Agressive host checking option > > > > Which are related to the scheduling options, and might be the source of your problem. Have you messed up those variables? Whatever the case, post the values of this configuration variables. > > > > HTH, > > -- > > Marcel Mitsuto Fucatu Sugano Universo Online S.A. -- http://www.uol.com.br > > > > > > > > ------------------------------------------------------- > > This SF.net email is sponsored by: Splunk Inc. Do you grep through log files > > for problems? Stop! Download the new AJAX search engine that makes > > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > > http://ads.osdn.com/?ad_idv37&alloc_id865&op=click > > _______________________________________________ > > Nagios-users mailing list > > Nagios-users at lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > > ::: Messages without supporting info will risk being sent to /dev/null > > > > > > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_idv37&alloc_id865&op=ick > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_idv37&alloc_id865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From fernando.shayani at bsb.politec.com.br Mon Dec 12 17:54:20 2005 From: fernando.shayani at bsb.politec.com.br (Fernando Shayani) Date: Mon, 12 Dec 2005 13:54:20 -0300 Subject: Nagios HANGS scheduling info Message-ID: <1964B929D8FF4D4DB06854A75D0FD3730CC8F479@bsbserv028.bsb.politec.com.br> Well, PERFORMANCE SUGGESTIONS ----------------------- I have no suggestions - things look okay. That's what it says... But I will put 5 to my max_concurrent_check... Thanks for the tip. Fernando Shayani fernando.shayani at bsb.politec.com.br (61) 3038-6951 POLITEC - Bras?lia - DF -----Original Message----- From: Marco Ramos [mailto:mramos at co.sapo.pt] Sent: segunda-feira, 12 de dezembro de 2005 12:35 To: Fernando Shayani Cc: nagios-users at lists.sourceforge.net Subject: RE: [Nagios-users] Nagios HANGS scheduling info The problem should be your max_concurrent_checks set to 0. Run "nagios - s nagios.cfg" and set the max_concurrent_check to the value suggested. This should fix it. Best regards, Marco Ramos On Mon, 2005-12-12 at 10:54 -0300, Fernando Shayani wrote: > Well, it's STILL hangs... Here is my configuration: > > log_file=/usr/local/nagios/var/nagios.log > temp_file=/usr/local/nagios/var/nagios.tmp > status_file=/usr/local/nagios/var/status.dat > aggregate_status_updates=1 > status_update_interval=3 > nagios_user=nagios > nagios_group=nagios > enable_notifications=1 > execute_service_checks=1 > accept_passive_service_checks=1 > enable_event_handlers=1 > log_rotation_method=d > log_archive_path=/usr/local/nagios/var/archives > check_external_commands=1 > command_check_interval=-1 > command_file=/usr/local/nagios/var/rw/nagios.cmd > downtime_file=/usr/local/nagios/var/downtime.dat > comment_file=/usr/local/nagios/var/comments.dat > lock_file=/usr/local/nagios/var/nagios.lock > retain_state_information=1 > state_retention_file=/usr/local/nagios/var/retention.dat > retention_update_interval=360 > use_retained_program_state=1 > use_syslog=1 > log_notifications=1 > log_service_retries=0 > log_host_retries=0 > log_event_handlers=1 > log_initial_states=0 > log_external_commands=0 > log_passive_checks=0 > sleep_time=1 > service_interleave_factor=s > max_concurrent_checks=0 > service_reaper_frequency=2 > interval_length=60 > use_aggressive_host_checking=0 > enable_flap_detection=1 > low_service_flap_threshold=20 > high_service_flap_threshold=80 > low_host_flap_threshold=20 > high_host_flap_threshold=80 > soft_state_dependencies=0 > service_check_timeout=25 > host_check_timeout=10 > event_handler_timeout=30 > notification_timeout=15 > ocsp_timeout=60 > perfdata_timeout=60 > obsess_over_services=0 > process_performance_data=0 > check_for_orphaned_services=1 > check_service_freshness=0 > freshness_check_interval=60 > date_format=euro > illegal_object_name_chars=`~!$%&*|\\\'\\\"<>?,()= > illegal_macro_output_chars=`~$&|\\\'\\\"<> > admin_email=fernando.shayani at bsb.politec.com.br > service_inter_check_delay_method=s > max_service_check_spread=15 > host_inter_check_delay_method=s > max_host_check_spread=15 > auto_reschedule_checks=1 > auto_rescheduling_interval=30 > auto_rescheduling_window=180 > > > And here is my STATS: > > CURRENT STATUS DATA > ---------------------------------------------------- > Status File: /usr/local/nagios/var/status.dat > Status File Age: 0d 0h 0m 3s > Status File Version: 2.0b6 > > Program Running Time: 0d 2h 47m 39s > > Total Services: 314 > Services Checked: 314 > Services Scheduled: 313 > Active Service Checks: 314 > Passive Service Checks: 0 > Total Service State Change: 0.000 / 12.110 / 0.299 % > Active Service Latency: 0.003 / 358.274 / 16.984 % > Active Service Execution Time: 0.036 / 25.014 / 2.207 sec > Active Service State Change: 0.000 / 12.110 / 0.299 % > Active Services Last 1/5/15/60 min: 48 / 182 / 267 / 277 > Passive Service State Change: 0.000 / 0.000 / 0.000 % > Passive Services Last 1/5/15/60 min: 0 / 0 / 0 / 0 > Services Ok/Warn/Unk/Crit: 304 / 3 / 4 / 3 > Services Flapping: 0 > Services In Downtime: 0 > > Total Hosts: 129 > Hosts Checked: 129 > Hosts Scheduled: 1 > Active Host Checks: 129 > Passive Host Checks: 0 > Total Host State Change: 0.000 / 10.260 / 0.291 % > Active Host Latency: 0.000 / 0.176 / 0.001 % > Active Host Execution Time: 0.000 / 5.973 / 2.012 sec > Active Host State Change: 0.000 / 10.260 / 0.291 % > Active Hosts Last 1/5/15/60 min: 1 / 2 / 3 / 6 > Passive Host State Change: 0.000 / 0.000 / 0.000 % > Passive Hosts Last 1/5/15/60 min: 0 / 0 / 0 / 0 > Hosts Up/Down/Unreach: 128 / 1 / 0 > Hosts Flapping: 0 > Hosts In Downtime: 0 > > > Please... Help... > > > Fernando Shayani > fernando.shayani at bsb.politec.com.br > (61) 3038-6951 > POLITEC - Bras?lia - DF > > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net > [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of > Fernando Shayani > Sent: quinta-feira, 8 de dezembro de 2005 07:45 > To: Marco Ramos > Cc: nagios-users at lists.sourceforge.net > Subject: RES: [Nagios-users] Nagios HANGS scheduling info > > Ok. It continues hanging... > > Now I changed the REAPER from 5 to 2... Lets see.. > Thanks > Fernando > > -----Mensagem original----- > De: Marco Ramos [mailto:mramos at co.sapo.pt] > Enviada: ter 06-dez-05 16:42 > Para: Fernando Shayani > Cc: nagios-users at lists.sourceforge.net > Assunto: RE: [Nagios-users] Nagios HANGS scheduling info > > > > > Try to tune your service_reaper_frequency and max_concurrent_checks > values. Take a look at http://nagios.org/faqs/viewfaq.php?faq_id=115. > > Had the same problem a while ago and managed to solve it tunning this > two options. > > regards, > Marco Ramos > > On Tue, 2005-12-06 at 14:53 -0300, Fernando Shayani wrote: > > Well, my configuration is: > > > > Inter-check sleep time (sleep_time=0.25) > > Service inter-check delay method (service_inter_check_delay_method=s) > > Maximum service check spread (max_service_check_spread=2) > > Service interleave factor (service_interleave_factor=s) > > Maximum concurrent service checks (max_concurrent_checks=0) > > Service reaper frequency (service_reaper_frequency=5) > > Host inter-check delay method (host_inter_check_delay_method=s) > > Maximum host check spread (max_host_check_spread=2) > > Timing interval length (interval_length=60) > > Agressive host checking option (use_aggressive_host_checking=0) > > > > The following options are not set. > > Auto-rescheduling option > > Auto-rescheduling interval > > Auto-rescheduling window > > > > > > I will read the configuration and recheck it all. > > > > I also got the SYSLOG line right after que problem. I hope this could help you help me. > > > > Dec 6 06:33:24 bsbserv007 nagios: Warning: The check of service 'CPU LOAD' on host 'BSBSERV017' could not be performed due to a fork() error. The check will be rescheduled. > > > > > > Thanks for the help. > > > > > > Fernando Shayani > > fernando.shayani at bsb.politec.com.br > > (61) 3038-6951 > > POLITEC - Bras?lia - DF > > > > -----Original Message----- > > From: Marcel Mitsuto Fucatu Sugano [mailto:msugano at uolinc.com] > > Sent: segunda-feira, 5 de dezembro de 2005 15:33 > > To: Fernando Shayani > > Cc: Eli Stair; nagios-users at lists.sourceforge.net > > Subject: RE: [Nagios-users] Nagios HANGS scheduling info > > > > On Mon, 2005-12-05 at 09:01 -0300, Fernando Shayani wrote: > > > Well, I upgraded to b6, enabled the Orphaned Service and still > > > hangs... > > > > > > Is there any other clue? > > > > > > Fernando Shayani > > > fernando.shayani at bsb.politec.com.br > > > (61) 3038-6951 > > > POLITEC - Bras?lia - DF > > > > Have you followed the instructions available at: > > http://nagios.sourceforge.net/docs/2_0/configmain.html ? > > > > Read that and check the following configs: > > Inter-check sleep time > > Service inter-check delay method > > Maximum service check spread > > Service interleave factor > > Maximum concurrent service checks > > Service reaper frequency > > Host inter-check delay method > > Maximum host check spread > > Timing interval length > > Auto-rescheduling option > > Auto-rescheduling interval > > Auto-rescheduling window > > > > Agressive host checking option > > > > Which are related to the scheduling options, and might be the source of your problem. Have you messed up those variables? Whatever the case, post the values of this configuration variables. > > > > HTH, > > -- > > Marcel Mitsuto Fucatu Sugano Universo Online S.A. -- http://www.uol.com.br > > > > > > > > ------------------------------------------------------- > > This SF.net email is sponsored by: Splunk Inc. Do you grep through log files > > for problems? Stop! Download the new AJAX search engine that makes > > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > > http://ads.osdn.com/?ad_idv37&alloc_id865&op=click > > _______________________________________________ > > Nagios-users mailing list > > Nagios-users at lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > > ::: Messages without supporting info will risk being sent to /dev/null > > > > > > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_idv37&alloc_id865&op=ick > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log > files for problems? Stop! Download the new AJAX search engine that > makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_idv37&alloc_id865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From srunschke at abit.de Mon Dec 12 17:01:05 2005 From: srunschke at abit.de (srunschke at abit.de) Date: Mon, 12 Dec 2005 17:01:05 +0100 Subject: Antwort: possible bug: escalations don't work if state changes In-Reply-To: <17e351010512120722t629d9c6fj69dffae607b8b84d@mail.gmail.com> References: <17e351010512120722t629d9c6fj69dffae607b8b84d@mail.gmail.com> Message-ID: nagios-users-admin at lists.sourceforge.net schrieb am 12.12.2005 16:22:34: > I am using Nagios 2.0b6, but also experienced this issue in 2.0b4. > Nagios escalations do not seem to work when the state changes after a > maximum notifications level has been reached. For example, if a You are mislead. Your escalations are only defined up to notification 5, at notification 6 they end and Nagios reverts to the base definition of the service. > define hostescalation{ > host_name test-server > first_notification 5 > last_notification 5 > notification_interval 0 > contact_groups oncall,backup > } > > define serviceescalation{ > host_name test-server > service_description /MYSQL > first_notification 5 > last_notification 5 > notification_interval 0 > contact_groups oncall,backup > } > > Please let me know if I need to include more information. Thanks in advance. Changing last_notification to 0 in those cases should produce your desired effect if I did understand you correctly. regards sash -------------------------------------------------- Sascha Runschke Netzwerk Administration IT-Services ABIT AG Robert-Bosch-Str. 1 40668 Meerbusch Tel.:+49 (0) 2150.9153.226 Mobil:+49 (0) 173.5419665 mailto:SRunschke at abit.de http://www.abit.net http://www.abit-epos.net --------------------------------- Sicherheitshinweis zur E-Mail Kommunikation / Security note regarding email communication: http://www.abit.net/sicherheitshinweis.html ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jamesd at jml.net Mon Dec 12 18:18:32 2005 From: jamesd at jml.net (James Davis) Date: Mon, 12 Dec 2005 17:18:32 +0000 Subject: Nagios querying services no longer in configuration Message-ID: <439DB0E8.2000403@jml.net> I'm running the Debian nagios package, nagios-common 1.3-cvs.20050402-8. I've removed some hosts and services that were recently removed from our network from my Nagios configuration and restarted nagios. However, I'm still getting e-mail alerts that the service is down. Have I forgotten to do something fairly basic here? James -- http://www.freecharity.org.uk/ - Free hosting for charities http://jamesd.ukgeeks.co.uk/ ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jeffwilliams05 at gmail.com Mon Dec 12 18:21:20 2005 From: jeffwilliams05 at gmail.com (Jeff Williams) Date: Mon, 12 Dec 2005 11:21:20 -0600 Subject: possible bug: escalations don't work if state changes In-Reply-To: References: <17e351010512120722t629d9c6fj69dffae607b8b84d@mail.gmail.com> Message-ID: <17e351010512120921v49c2cd6cle8a87d5d7b012cc1@mail.gmail.com> On 12/12/05, srunschke at abit.de wrote: > nagios-users-admin at lists.sourceforge.net schrieb am 12.12.2005 16:22:34: > > > I am using Nagios 2.0b6, but also experienced this issue in 2.0b4. > > Nagios escalations do not seem to work when the state changes after a > > maximum notifications level has been reached. For example, if a > > You are mislead. Your escalations are only defined up to > notification 5, at notification 6 they end and Nagios reverts > to the base definition of the service. > > > define hostescalation{ > > host_name test-server > > first_notification 5 > > last_notification 5 > > notification_interval 0 > > contact_groups oncall,backup > > } > > > > define serviceescalation{ > > host_name test-server > > service_description /MYSQL > > first_notification 5 > > last_notification 5 > > notification_interval 0 > > contact_groups oncall,backup > > } > > > > Please let me know if I need to include more information. Thanks in > advance. > > Changing last_notification to 0 in those cases should > produce your desired effect if I did understand you > correctly. > So I would need to have last_notification be 0 and notification_interval be 0? Or just last_notification? Is there a better way for me to do the service escalation so I do not have 3 separate entries? I do know that you can have similar services from multiple hosts in the same service escalation, but I am just wondering if I can compress the configs a little more. Thanks for the quick response. Jeff ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From tor-arne.rysstad at skatteetaten.no Mon Dec 12 18:18:11 2005 From: tor-arne.rysstad at skatteetaten.no (Rysstad, Tor Arne) Date: Mon, 12 Dec 2005 18:18:11 +0100 Subject: Hurray for Nagios!!! Message-ID: <466366D5570F7D468217856CDA142AFA0B287C@X0PV051.skead.no> Hi ! I'd just like to say Hi to everybody.. I have been a read-only member of this list for a while, but I'd really like to show my gratitude to all the developers of Nagios, the Nagios plugins and to everybody that answers questions on this list!! I am running Nagios on a computer that I put together from parts picked out of the waste bin... Pentium III, 497.89 MHz (System Bogomips 983.04), running Fedora 2. I monitor 61 hosts, 536 services on it... On the same machine I run Cacti, IPPlan and a very low volume Mambo CMS site... This works beautifully!! Here is my uptime: 18:00:19 up 140 days, 3 min, 2 users, load average: 1.92, 1.87, 2.12 Nagios has been a great assistance for me every day, first of all making troubleshooting simpler and quicker, but it also makes it simple for my users to simply check Nagios to see that everything is up and working. Sweet! :-) Thanks a lot, guys!!!! Regards from Oslo, Norway Tor Arne Rysstad ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sghosh at sghosh.org Mon Dec 12 20:03:29 2005 From: sghosh at sghosh.org (Subhendu Ghosh) Date: Mon, 12 Dec 2005 14:03:29 -0500 (EST) Subject: Nagios Notification Format In-Reply-To: <200512121505.jBCF5h33016221@union-station.bsd.uchicago.edu> References: <200512121505.jBCF5h33016221@union-station.bsd.uchicago.edu> Message-ID: look for the notify-by-* commands in misccommands.cfg printf * | mail On Mon, 12 Dec 2005, Eduardo Martinez wrote: > This is not really a problem but a curiosity. > Maybe I missed something in the documentation but I could not find where to > edit this. I want to add a few lines to the notification style that is sent > out by Nagios. Example notification that I would like to modify. > > ######################################## > ***** Nagios ***** > > Notification Type: RECOVERY > > Service: My Service > Host: My Host > Address: IP Address > State: OK > > Date/Time: Mon Dec 12 08:33:03 CST 2005 > > Additional Info: > > All processes are running > ######################################## > > I know the notification type, service, host, address, state, and date are > filled in and generated by the configuration you give it and the Nagios > plugins. > > But how is the entire notification generated? > Any help would be greatly appreciated. > Thank you very much, > > -Ed > > > > This email is intended only for the use of the individual or entity to which it is addressed and may contain information that is privileged and confidential. If the reader of this email message is not the intended recipient, you are hereby notified that any dissemination, distribution, or copying of this communication is prohibited. If you have received this email in error, please notify the sender and destroy/delete all copies of the transmittal. Thank you. > > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sghosh at sghosh.org Mon Dec 12 20:06:35 2005 From: sghosh at sghosh.org (Subhendu Ghosh) Date: Mon, 12 Dec 2005 14:06:35 -0500 (EST) Subject: How to set host_check_alive command's time out to 2 minutes? In-Reply-To: References: Message-ID: On Mon, 12 Dec 2005 Lui.external at infineon.com wrote: > Hi, > > Is it possible for us to set the threshold for host_check_alive command > time out to 2 minutes? > We will considered the server is really down after 2 minutes. > > Thanks. > > > Warmest Regards, > Louise > 2 minutes is a long time for a ping response. If you really want the 2 minutes, consider your retry-interval and max retry setting rather than the command timeout. -- -sg ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From cameron.matheson at gmail.com Mon Dec 12 20:05:15 2005 From: cameron.matheson at gmail.com (Cam) Date: Mon, 12 Dec 2005 12:05:15 -0700 Subject: contactgroup{ }, and contactgroups # bug? In-Reply-To: <366ea3750512090727v3b09ea8dsd5181bec111e3383@mail.gmail.com> References: <7e2f9af90512051254j2eeeb94cxf0c7369a1b596334@mail.gmail.com> <366ea3750512090722q19b38744tf81bdb7d8e1faef9@mail.gmail.com> <366ea3750512090727v3b09ea8dsd5181bec111e3383@mail.gmail.com> Message-ID: <7e2f9af90512121105s4cfcebd1h74230e788be00523@mail.gmail.com> Chet, Thanks a million! I applied the patch to 2.06b and it works just as expected now. Hope that one makes it into upstream. Thanks, Cameron Matheson On 12/9/05, Chet Luther wrote: > On 12/9/05, Chet Luther wrote: > > This is somewhere between a missing feature and a bug, but it is quite > > inconsistent. Here's a quick patch to fix it. It is applied against > > the current CVS tree. > > > > > > Sorry about that, gmail butchered the formatting on that patch. I'm > attaching it instead. > > -- > Chet Luther > chet.luther at gmail.com > > > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From hvdkooij at vanderkooij.org Mon Dec 12 20:32:36 2005 From: hvdkooij at vanderkooij.org (Hugo van der Kooij) Date: Mon, 12 Dec 2005 20:32:36 +0100 (CET) Subject: trap scripting Message-ID: Right, I think I will give a go at writing my own scripts to send out UP and DOWN notifications based on the SNMP traps I get from my router. If I understood the manual (nagios/docs/extcommands.html) correctly all I have to do is that I just have to write a single line to the fifo like: PROCESS_HOST_CHECK_RESULT ;; So if I have a host definition like: define host{ use generic-host host_name Internet alias Internet address 192.168.1.254 parents anfalas check_command check_snmp_int!public!Virtual-Access2 max_check_attempts 3 check_interval 3s ; Check every second on retries notification_interval 120 notification_period 24x7 notification_options d,r contact_groups admins } Which uses SNMP to do active polling like: define command{ command_name check_snmp_int command_line $USER1$/check_snmp_int.pl -H $HOSTADDRESS$ -C $ARG1$ -n $ARG2$ } I could send a message like: PROCESS_HOST_CHECK_RESULT Internet;DOWN;TRAP: PPP LCP Closed And the host would be considered down as it will read this as a passive check result. Or did I misread anything? Basically I want to stop polling as soon as I get the link down trap from the router to all hosts not local on my network. At present the SNMP trap is just send per email to me like: Date: Mon, 12 Dec 2005 18:33:54 +0100 From: root at gandalf.hugo.vanderkooij.org To: hugo at vanderkooij.org Subject: trap received from 192.168.1.254: Host: 192.168.1.254 (anfalas.hugo.vanderkooij.org) IF-MIB::ifIndex.25 25 IF-MIB::ifAdminStatus.25 up IF-MIB::ifOperStatus.25 down IF-MIB::ifDescr.25 Virtual-Access2 IF-MIB::ifType.25 ppp SNMPv2-SMI::enterprises.9.2.2.1.1.20.25 "PPP LCP Closed" But I'm sure I can fetch this in a nice script instead and use it to update Nagios once I know what to send to Nagios. Hugo. -- I hate duplicates. Just reply to the relevant mailinglist. hvdkooij at vanderkooij.org http://hvdkooij.xs4all.nl/ Don't meddle in the affairs of magicians, for they are subtle and quick to anger. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From al at its-lehmann.de Mon Dec 12 20:48:35 2005 From: al at its-lehmann.de (Arno Lehmann) Date: Mon, 12 Dec 2005 20:48:35 +0100 Subject: Nagios Notification Format In-Reply-To: <200512121505.jBCF5h33016221@union-station.bsd.uchicago.edu> References: <200512121505.jBCF5h33016221@union-station.bsd.uchicago.edu> Message-ID: <439DD413.6090808@its-lehmann.de> Hello, Eduardo Martinez schrieb: > This is not really a problem but a curiosity. > Maybe I missed something in the documentation but I could not find where to > edit this. I want to add a few lines to the notification style that is sent > out by Nagios. ... > But how is the entire notification generated? Usually, the commands that create the notifications rely on the printf command. Just look up how your notification commands are defined - they are just commands as all the check commands. You will - probably - find something like 'printf "something" | mail -s Nagios $some_address$' in there. Then, understand Nagios' macros (described in the manual) and how the command printf works (man printf on the shell should help). Arno > Any help would be greatly appreciated. > Thank you very much, > > -Ed > > > > This email is intended only for the use of the individual or entity to which it is addressed and may contain information that is privileged and confidential. If the reader of this email message is not the intended recipient, you are hereby notified that any dissemination, distribution, or copying of this communication is prohibited. If you have received this email in error, please notify the sender and destroy/delete all copies of the transmittal. Thank you. > > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- IT-Service Lehmann al at its-lehmann.de Arno Lehmann http://www.its-lehmann.de ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From al at its-lehmann.de Mon Dec 12 20:50:47 2005 From: al at its-lehmann.de (Arno Lehmann) Date: Mon, 12 Dec 2005 20:50:47 +0100 Subject: Nagios querying services no longer in configuration In-Reply-To: <439DB0E8.2000403@jml.net> References: <439DB0E8.2000403@jml.net> Message-ID: <439DD497.7010506@its-lehmann.de> Hello, James Davis schrieb: > I'm running the Debian nagios package, nagios-common > 1.3-cvs.20050402-8. > > I've removed some hosts and services that were recently removed from our > network from my Nagios configuration and restarted nagios. However, I'm > still getting e-mail alerts that the service is down. > > Have I forgotten to do something fairly basic here? Retention? I could imagine that, even after a restart, Nagios still has all the information concerning hosts in its retention file. Have the hosts vanished from the web frontend? If it's a question of retention file contents, you can edit the file in question (best done while Nagios doesnt run...) and restart Nagios afterwards. Arno > James > -- IT-Service Lehmann al at its-lehmann.de Arno Lehmann http://www.its-lehmann.de ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mathias at nilings.se Mon Dec 12 20:50:52 2005 From: mathias at nilings.se (Mathias Sundman) Date: Mon, 12 Dec 2005 20:50:52 +0100 (CET) Subject: How to set host_check_alive command's time out to 2 minutes? In-Reply-To: References: Message-ID: On Mon, 12 Dec 2005, Subhendu Ghosh wrote: > On Mon, 12 Dec 2005 Lui.external at infineon.com wrote: > > > > Is it possible for us to set the threshold for host_check_alive > > command time out to 2 minutes? We will considered the server is really > > down after 2 minutes. > > 2 minutes is a long time for a ping response. > > If you really want the 2 minutes, consider your retry-interval and max > retry setting rather than the command timeout. For a service check you can use retry_check_interval and max_check_attempts, but as soon as a service check fails a host check will be done and the number of host_checks defined to run is executed in one series without any delay, so there is no real good way of making nagios "hold" for 2 minutes before considering it down. You could of cource make the host_check_alive script hold for two minutes before returning if the host is not responding, but that is probably not such a good idea as it will stall all other checks during that time. You may however use escalations to suppress one or more notifications, to add a delay before a notification is accually sent even if the host is considered down immediatly when it doesn't respond to a number of pings. I've also written a patch that adds a new host option, first_notification_delay that can be used to specify the time we want Nagios to wait before sending any notifications for a given host. You can find it in the last month archive of the nagios-devel list if you're interested. //Mathias ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From al at its-lehmann.de Mon Dec 12 21:26:14 2005 From: al at its-lehmann.de (Arno Lehmann) Date: Mon, 12 Dec 2005 21:26:14 +0100 Subject: Nagios querying services no longer in configuration In-Reply-To: <439DD554.10300@jml.net> References: <439DB0E8.2000403@jml.net> <439DD497.7010506@its-lehmann.de> <439DD554.10300@jml.net> Message-ID: <439DDCE6.4090100@its-lehmann.de> Hello, please reply to the list... James Davis schrieb: > Arno Lehmann wrote: > > >>I could imagine that, even after a restart, Nagios still has all the >>information concerning hosts in its retention file. Have the hosts >>vanished from the web frontend? > > > Yes they have. > > >>If it's a question of retention file contents, you can edit the file in >>question (best done while Nagios doesnt run...) and restart Nagios >>afterwards. > > > Where do I find the retention file? Basically, wherever you put it yourself :-) It's a setting in the main configuration file. Arno > James > > -- IT-Service Lehmann al at its-lehmann.de Arno Lehmann http://www.its-lehmann.de ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jamesd at jml.net Mon Dec 12 22:18:53 2005 From: jamesd at jml.net (James Davis) Date: Mon, 12 Dec 2005 21:18:53 +0000 Subject: Nagios querying services no longer in configuration In-Reply-To: <439DDCE6.4090100@its-lehmann.de> References: <439DB0E8.2000403@jml.net> <439DD497.7010506@its-lehmann.de> <439DD554.10300@jml.net> <439DDCE6.4090100@its-lehmann.de> Message-ID: <439DE93D.1030705@jml.net> Arno Lehmann wrote: > please reply to the list... Woops sorry, didn't look before sending. > Basically, wherever you put it yourself :-) > > It's a setting in the main configuration file. It wasn't in there but following on from that I checked the other files referenced in the configuration file and found it in /var/log/nagios/status.log which I then deleted the entry from. I started nagios up and I've yet to receive any new alerts. Does this happen if you remove a host from the configuration whilst there's an alert outstanding? Thanks, James -- http://www.freecharity.org.uk/ - Free hosting for charities http://jamesd.ukgeeks.co.uk/ ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From awfief at gmail.com Mon Dec 12 23:43:20 2005 From: awfief at gmail.com (sheeri kritzer) Date: Mon, 12 Dec 2005 17:43:20 -0500 Subject: Nagios Performance Data shows checks aren't being completed Message-ID: <24cb9b4b0512121443n3d7b0d0fi70f06368a94e84ef@mail.gmail.com> Hi all, My nagios 2.0 installation shows the following under performance information. There are 99 service checks, and I can't imagine it takes more than an hour to complete all 99. We've had problems where nagios hasn't found and notified us of problems. The load on the box is tiny. nagios -s has no suggestions. What did I do wrong? uptime 17:38:38 up 81 days, 9:05, 4 users, load average: 0.00, 0.00, 0.00 Nagios is running, and has been for a while: ps -ef | grep nagios nagios 11160 1 0 Nov14 ? 00:12:32 /usr/bin/nagios -d /etc/nagios/nagios.cfg nagios 22947 1 0 Nov20 ? 00:00:00 nrpe -c /etc/nagios/nrpe.cfg -d Performance Info: Program-Wide Performance Information Active Service Checks: Time Frame Checks Completed <= 1 minute: 1 (1.0%) <= 5 minutes: 58 (58.6%) <= 15 minutes: 60 (60.6%) <= 1 hour: 60 (60.6%) Since program start: 99 (100.0%) Metric Min. Max. Average Check Execution Time: 0.01 sec 8.71 sec 1.286 sec Check Latency: 0.01 sec 1.03 sec 0.488 sec Percent State Change: 0.00% 0.00% 0.00% Passive Service Checks: Time Frame Checks Completed <= 1 minute: 0 (0.0%) <= 5 minutes: 0 (0.0%) <= 15 minutes: 0 (0.0%) <= 1 hour: 0 (0.0%) Since program start: 0 (0.0%) Metric Min. Max. Average Percent State Change: 0.00% 0.00% 0.00% Active Host Checks: Time Frame Checks Completed <= 1 minute: 0 (0.0%) <= 5 minutes: 0 (0.0%) <= 15 minutes: 0 (0.0%) <= 1 hour: 0 (0.0%) Since program start: 19 (76.0%) Metric Min. Max. Average Check Execution Time: 3.01 sec 4.01 sec 3.972 sec Check Latency: 0.00 sec 0.00 sec 0.000 sec Percent State Change: 0.00% 0.00% 0.00% Passive Host Checks: Time Frame Checks Completed <= 1 minute: 0 (0.0%) <= 5 minutes: 0 (0.0%) <= 15 minutes: 0 (0.0%) <= 1 hour: 0 (0.0%) Since program start: 0 (0.0%) Metric Min. Max. Average Percent State Change: 0.00% 0.00% 0.00% ---------------------------------------------------------------------------------------------------------------------------- Nagios 2.0b4 Copyright (c) 1999-2005 Ethan Galstad (http://www.nagios.org) Last Modified: 08-02-2005 License: GPL Projected scheduling information for host and service checks is listed below. This information assumes that you are going to start running Nagios with your current config files. HOST SCHEDULING INFORMATION --------------------------- Total hosts: 25 Total scheduled hosts: 0 Host inter-check delay method: SMART Average host check interval: 0.00 sec Host inter-check delay: 0.00 sec Max host check spread: 30 min First scheduled check: N/A Last scheduled check: N/A SERVICE SCHEDULING INFORMATION ------------------------------- Total services: 99 Total scheduled services: 99 Service inter-check delay method: SMART Average service check interval: 300.00 sec Inter-check delay: 3.03 sec Interleave factor method: SMART Average services per host: 3.96 Service interleave factor: 4 Max service check spread: 30 min First scheduled check: Mon Dec 12 17:39:51 2005 Last scheduled check: Mon Dec 12 17:44:47 2005 CHECK PROCESSING INFORMATION ---------------------------- Service check reaper interval: 10 sec Max concurrent service checks: Unlimited PERFORMANCE SUGGESTIONS ----------------------- I have no suggestions - things look okay. --------------------------------------------------------------------------------------------------------------------------------- grep -v ^# /etc/nagios/nagios.cfg | grep -v ^$ Nagios.cfg params: log_file=/var/log/nagios/nagios.log cfg_file=/etc/nagios/checkcommands.cfg cfg_file=/etc/nagios/misccommands.cfg cfg_file=/etc/nagios/contactgroups.cfg cfg_file=/etc/nagios/contacts.cfg cfg_file=/etc/nagios/dependencies.cfg cfg_file=/etc/nagios/escalations.cfg cfg_file=/etc/nagios/hostgroups.cfg cfg_file=/etc/nagios/hosts.cfg cfg_file=/etc/nagios/services.cfg cfg_file=/etc/nagios/timeperiods.cfg object_cache_file=/var/log/nagios/objects.cache resource_file=/etc/nagios/resource.cfg status_file=/var/log/nagios/status.dat nagios_user=nagios nagios_group=nagios check_external_commands=1 command_check_interval=-1 command_file=/var/log/nagios/rw/nagios.cmd comment_file=/var/log/nagios/comments.dat downtime_file=/var/log/nagios/downtime.dat lock_file=/var/run/nagios.pid temp_file=/var/log/nagios/nagios.tmp event_broker_options=-1 log_rotation_method=d log_archive_path=/var/log/nagios/archives use_syslog=1 log_notifications=1 log_service_retries=1 log_host_retries=1 log_event_handlers=1 log_initial_states=0 log_external_commands=1 log_passive_checks=1 service_inter_check_delay_method=s max_service_check_spread=30 service_interleave_factor=s host_inter_check_delay_method=s max_host_check_spread=30 max_concurrent_checks=0 service_reaper_frequency=10 auto_reschedule_checks=0 auto_rescheduling_interval=30 auto_rescheduling_window=180 sleep_time=0.25 service_check_timeout=60 host_check_timeout=30 event_handler_timeout=30 notification_timeout=30 ocsp_timeout=5 perfdata_timeout=5 retain_state_information=1 state_retention_file=/var/log/nagios/retention.dat retention_update_interval=60 use_retained_program_state=1 use_retained_scheduling_info=0 interval_length=60 use_aggressive_host_checking=0 execute_service_checks=1 accept_passive_service_checks=1 execute_host_checks=1 accept_passive_host_checks=1 enable_notifications=1 enable_event_handlers=1 process_performance_data=0 obsess_over_services=0 check_for_orphaned_services=0 check_service_freshness=1 service_freshness_check_interval=60 check_host_freshness=0 host_freshness_check_interval=60 aggregate_status_updates=1 status_update_interval=15 enable_flap_detection=0 low_service_flap_threshold=5.0 high_service_flap_threshold=20.0 low_host_flap_threshold=5.0 high_host_flap_threshold=20.0 date_format=us p1_file=/usr/bin/p1.pl illegal_object_name_chars=`~!$%^&*|'"<>?,()= illegal_macro_output_chars=`~$&|'"<> use_regexp_matching=0 use_true_regexp_matching=0 admin_email=nagios admin_pager=pagenagios daemon_dumps_core=0 Any help is much appreciated. Thank you, Sheeri Kritzer ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From estair at ilm.com Mon Dec 12 23:47:39 2005 From: estair at ilm.com (Eli Stair) Date: Mon, 12 Dec 2005 14:47:39 -0800 Subject: Nagios Performance Data shows checks aren't being completed In-Reply-To: <24cb9b4b0512121443n3d7b0d0fi70f06368a94e84ef@mail.gmail.com> References: <24cb9b4b0512121443n3d7b0d0fi70f06368a94e84ef@mail.gmail.com> Message-ID: <439DFE0B.8020600@ilm.com> Corroboration here, I actually have a mail I'm compiling also on the same issue. 2.0b6 I've got orphaned service checks enabled, unlimited parallel service checks, etc. If I force a host/svc check through the CGI's or the command file direct they get executed right away... the scheduler just is losing them. /eli sheeri kritzer wrote: > Hi all, > > My nagios 2.0 installation shows the following under performance > information. There are 99 service checks, and I can't imagine it > takes more than an hour to complete all 99. We've had problems where > nagios hasn't found and notified us of problems. The load on the box > is tiny. nagios -s has no suggestions. What did I do wrong? > > uptime > 17:38:38 up 81 days, 9:05, 4 users, load average: 0.00, 0.00, 0.00 > > Nagios is running, and has been for a while: > > ps -ef | grep nagios > nagios 11160 1 0 Nov14 ? 00:12:32 /usr/bin/nagios -d > /etc/nagios/nagios.cfg > nagios 22947 1 0 Nov20 ? 00:00:00 nrpe -c /etc/nagios/nrpe.cfg -d > > Performance Info: > > Program-Wide Performance Information > Active Service Checks: > > Time Frame Checks Completed > <= 1 minute: 1 (1.0%) > <= 5 minutes: 58 (58.6%) > <= 15 minutes: 60 (60.6%) > <= 1 hour: 60 (60.6%) > Since program start: 99 (100.0%) > > Metric Min. Max. Average > Check Execution Time: 0.01 sec 8.71 sec 1.286 sec > Check Latency: 0.01 sec 1.03 sec 0.488 sec > Percent State Change: 0.00% 0.00% 0.00% > Passive Service Checks: > > Time Frame Checks Completed > <= 1 minute: 0 (0.0%) > <= 5 minutes: 0 (0.0%) > <= 15 minutes: 0 (0.0%) > <= 1 hour: 0 (0.0%) > Since program start: 0 (0.0%) > > Metric Min. Max. Average > Percent State Change: 0.00% 0.00% 0.00% > Active Host Checks: > > Time Frame Checks Completed > <= 1 minute: 0 (0.0%) > <= 5 minutes: 0 (0.0%) > <= 15 minutes: 0 (0.0%) > <= 1 hour: 0 (0.0%) > Since program start: 19 (76.0%) > > Metric Min. Max. Average > Check Execution Time: 3.01 sec 4.01 sec 3.972 sec > Check Latency: 0.00 sec 0.00 sec 0.000 sec > Percent State Change: 0.00% 0.00% 0.00% > Passive Host Checks: > > Time Frame Checks Completed > <= 1 minute: 0 (0.0%) > <= 5 minutes: 0 (0.0%) > <= 15 minutes: 0 (0.0%) > <= 1 hour: 0 (0.0%) > Since program start: 0 (0.0%) > > Metric Min. Max. Average > Percent State Change: 0.00% 0.00% 0.00% > > ---------------------------------------------------------------------------------------------------------------------------- > > Nagios 2.0b4 > Copyright (c) 1999-2005 Ethan Galstad (http://www.nagios.org) > Last Modified: 08-02-2005 > License: GPL > > Projected scheduling information for host and service > checks is listed below. This information assumes that > you are going to start running Nagios with your current > config files. > > HOST SCHEDULING INFORMATION > --------------------------- > Total hosts: 25 > Total scheduled hosts: 0 > Host inter-check delay method: SMART > Average host check interval: 0.00 sec > Host inter-check delay: 0.00 sec > Max host check spread: 30 min > First scheduled check: N/A > Last scheduled check: N/A > > > SERVICE SCHEDULING INFORMATION > ------------------------------- > Total services: 99 > Total scheduled services: 99 > Service inter-check delay method: SMART > Average service check interval: 300.00 sec > Inter-check delay: 3.03 sec > Interleave factor method: SMART > Average services per host: 3.96 > Service interleave factor: 4 > Max service check spread: 30 min > First scheduled check: Mon Dec 12 17:39:51 2005 > Last scheduled check: Mon Dec 12 17:44:47 2005 > > > CHECK PROCESSING INFORMATION > ---------------------------- > Service check reaper interval: 10 sec > Max concurrent service checks: Unlimited > > > PERFORMANCE SUGGESTIONS > ----------------------- > I have no suggestions - things look okay. > > > --------------------------------------------------------------------------------------------------------------------------------- > > grep -v ^# /etc/nagios/nagios.cfg | grep -v ^$ > Nagios.cfg params: > > log_file=/var/log/nagios/nagios.log > cfg_file=/etc/nagios/checkcommands.cfg > cfg_file=/etc/nagios/misccommands.cfg > cfg_file=/etc/nagios/contactgroups.cfg > cfg_file=/etc/nagios/contacts.cfg > cfg_file=/etc/nagios/dependencies.cfg > cfg_file=/etc/nagios/escalations.cfg > cfg_file=/etc/nagios/hostgroups.cfg > cfg_file=/etc/nagios/hosts.cfg > cfg_file=/etc/nagios/services.cfg > cfg_file=/etc/nagios/timeperiods.cfg > object_cache_file=/var/log/nagios/objects.cache > resource_file=/etc/nagios/resource.cfg > status_file=/var/log/nagios/status.dat > nagios_user=nagios > nagios_group=nagios > check_external_commands=1 > command_check_interval=-1 > command_file=/var/log/nagios/rw/nagios.cmd > comment_file=/var/log/nagios/comments.dat > downtime_file=/var/log/nagios/downtime.dat > lock_file=/var/run/nagios.pid > temp_file=/var/log/nagios/nagios.tmp > event_broker_options=-1 > log_rotation_method=d > log_archive_path=/var/log/nagios/archives > use_syslog=1 > log_notifications=1 > log_service_retries=1 > log_host_retries=1 > log_event_handlers=1 > log_initial_states=0 > log_external_commands=1 > log_passive_checks=1 > service_inter_check_delay_method=s > max_service_check_spread=30 > service_interleave_factor=s > host_inter_check_delay_method=s > max_host_check_spread=30 > max_concurrent_checks=0 > service_reaper_frequency=10 > auto_reschedule_checks=0 > auto_rescheduling_interval=30 > auto_rescheduling_window=180 > sleep_time=0.25 > service_check_timeout=60 > host_check_timeout=30 > event_handler_timeout=30 > notification_timeout=30 > ocsp_timeout=5 > perfdata_timeout=5 > retain_state_information=1 > state_retention_file=/var/log/nagios/retention.dat > retention_update_interval=60 > use_retained_program_state=1 > use_retained_scheduling_info=0 > interval_length=60 > use_aggressive_host_checking=0 > execute_service_checks=1 > accept_passive_service_checks=1 > execute_host_checks=1 > accept_passive_host_checks=1 > enable_notifications=1 > enable_event_handlers=1 > process_performance_data=0 > obsess_over_services=0 > check_for_orphaned_services=0 > check_service_freshness=1 > service_freshness_check_interval=60 > check_host_freshness=0 > host_freshness_check_interval=60 > aggregate_status_updates=1 > status_update_interval=15 > enable_flap_detection=0 > low_service_flap_threshold=5.0 > high_service_flap_threshold=20.0 > low_host_flap_threshold=5.0 > high_host_flap_threshold=20.0 > date_format=us > p1_file=/usr/bin/p1.pl > illegal_object_name_chars=`~!$%^&*|'"<>?,()= > illegal_macro_output_chars=`~$&|'"<> > use_regexp_matching=0 > use_true_regexp_matching=0 > admin_email=nagios > admin_pager=pagenagios > daemon_dumps_core=0 > > Any help is much appreciated. > > Thank you, > > Sheeri Kritzer > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_idv37&alloc_id865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mls at skayser.de Tue Dec 13 09:21:07 2005 From: mls at skayser.de (Sebastian Kayser) Date: Tue, 13 Dec 2005 09:21:07 +0100 Subject: trap scripting In-Reply-To: References: Message-ID: <20051213082107.GA2162@skayser.homeip.net> * Hugo van der Kooij wrote: > I think I will give a go at writing my own scripts to send out UP and DOWN > notifications based on the SNMP traps I get from my router. > > If I understood the manual (nagios/docs/extcommands.html) correctly all I > have to do is that I just have to write a single line to the fifo like: > > PROCESS_HOST_CHECK_RESULT ;; like to add about the trap> > > [...] > > I could send a message like: > > PROCESS_HOST_CHECK_RESULT Internet;DOWN;TRAP: PPP LCP Closed > > And the host would be considered down as it will read this as a passive > check result. > > Or did I misread anything? Have a look at "External Command Detail: PROCESS_HOST_CHECK_RESULT" http://shorl.com/dagolemistuty You are missing the timestamp, a semicolon and i don't know wether DOWN as a status is accepted (i use the numeric status codes 012). So rather it should be [] PROCESS_HOST_CHECK_RESULT;Internet;1;TRAP: PPP LCP Closed - sebastian ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mfatihakbulut at gmail.com Tue Dec 13 13:08:32 2005 From: mfatihakbulut at gmail.com (Mehmet Fatih AKBULUT) Date: Tue, 13 Dec 2005 14:08:32 +0200 Subject: sms notification problem ? Message-ID: hi all. i've 17 service definitions defined for various hosts and hostgroups since now. and i want to send sms notifications when a service goes down. but the problem is TWO people want to get notifications when a service goes DOWN and comes BACK. not many sms' in 5-10 or 60 minutes. just ONE when service out, and the SECOND when service IN again. how do i have to define nofity-by-sms service definition to achieve this goal ?[just one when out and one when in again] help me please ? Regards, Bye -------------- next part -------------- An HTML attachment was scrubbed... URL: From pete at devilincarnate.eclipse.co.uk Tue Dec 13 17:22:32 2005 From: pete at devilincarnate.eclipse.co.uk (Pete Clarke) Date: Tue, 13 Dec 2005 16:22:32 -0000 (GMT) Subject: check-nfs issue Message-ID: <1341.217.34.62.180.1134490952.squirrel@www.theclarkefamily.org.uk> Hi there, I have a slight problem with a strange error coming from check-nfs. I get mailed the following warning: ***** Nagios ***** Notification Type: PROBLEM Service: NFS Host: Jumpstart/TFTP (boot) server Address: 192.168.0.34 State: CRITICAL Date/Time: Tue Dec 13 16:13:11 GMT 2005 Additional Info: CRITICAL: RPC program nfs version 2 version 3 udp is not running However, if I simulate the check (i.e. with check_rpc) I get the following: shansi:/usr/lib/nagios/plugins# ./check_rpc -H stepney nfs -v /usr/bin/rpcinfo -u stepney 100003 2>&1 | program 100003 version 2 ready and waiting 1: version 2 program 100003 version 3 ready and waiting 1: version 2 version 3 OK: RPC program nfs version 2 version 3 udp running Stepney is a Solaris 9 box. Any ideas why this would happen? I use the same script to check for nfs services on 2 other Linux servers and they return correctly.. Cheers, Pete. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rhall at healthvision.com Tue Dec 13 17:59:25 2005 From: rhall at healthvision.com (Rusty Hall) Date: Tue, 13 Dec 2005 17:59:25 +0100 (CET) Subject: nagios and snmptt Message-ID: <20051213165925.7B72A4F415D@desire.netways.de> I have nagios 1.2 up and running (and has been for awhile) on a fedora core2 server. I recently added the snmptt for snmp traps. The snmptt appears to be running correctly accordng to the logs being generated. The trap also apperas in the nagios.log file. The question and or questions i have is I never get an alert an the passive check aalways says pending. enclosed are the service check and an entry from the logs passive check define service{ use generic-service ; Name of service template to use host_name device_name service_description TRAP is_volatile 1 check_command check-host-alive check_period 24x7 max_check_attempts 1 normal_check_interval 1 retry_check_interval 1 passive_checks_enabled 1 check_period none notification_interval 31536000 # equals a year notification_period 24x7 notification_options w,u,c,r notifications_enabled 1 contact_groups linux-admins } snmptt log entry Wed Dec 7 21:10:41 2005 .iso.org.dod.internet.snmpV2.snmpModules.snmpMIB. snmpMIBObjects.snmpTraps.coldStart Normal "Status Events" system_name - A coldStart trap signifies that the SNMP entity, Wed Dec 7 15:38:00 2005 .iso.org.dod.internet.private.enterprises.3224.0. 500 Normal "Status Events" system_ip - This trap indicates that VPN tunnel status has occured. 41 2005-12-07 15:38:00 system-critical-00 041: vpn "St John Tunnel" is down. entry from nagios.log [1133792753] EXTERNAL COMMAND: PROCESS_SERVICE_CHECK_RESULT;system_name;T RAP;0;Device reinitialized (coldStart) [1133792757] Warning: Message queue contained results for service 'TRAP' on host 'system_name'. The service could not be found! The pieces to make this work seem to be in place but im just missing something. If im amissing any thing please let me know ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From f1216 at yahoo.com Tue Dec 13 18:02:35 2005 From: f1216 at yahoo.com (Fred) Date: Tue, 13 Dec 2005 09:02:35 -0800 (PST) Subject: New Duplicate definition warnings in 2.0b6 Message-ID: <20051213170235.88769.qmail@web31909.mail.mud.yahoo.com> As of 2.0b6 I started seeing lots of duplicate definition warnings such as: Warning: Duplicate definition found for service 'loadave' (config file '/opt/hptc/nagios/etc/nj12n16.cfg', starting on line 632) Warning: Duplicate definition found for service 'loadave' (config file '/opt/hptc/nagios/etc/nj12n16.cfg', starting on line 1031) Warning: Duplicate definition found for service 'loadave' (config file '/opt/hptc/nagios/etc/nj12n16.cfg', starting on line 1430) Warning: Duplicate definition found for service 'loadave' (config file '/opt/hptc/nagios/etc/nj12n16.cfg', starting on line 1829) Warning: Duplicate definition found for service 'loadave' (config file '/opt/hptc/nagios/etc/nj12n16.cfg', starting on line 2574) These files are autogenerated by a script. There are indeed service definitions at these locations that are nearly identical except for the node name. (I know I should use hostgroups but I haven't gotten there yet). # Service definition define service{ use nrpe ; Name of service template to use host_name nj12n11 name loadave service_description Load Average is_volatile 0 check_period 24x7 max_check_attempts 3 normal_check_interval 60 contact_groups admins active_checks_enabled 0 passive_checks_enabled 1 # check_load_average is not called here since we are passive # Called from check_metrics check_command check_dummy!1!"load average data is stale" register 1 } # Service definition define service{ use nrpe ; Name of service template to use host_name nj12n12 name loadave service_description Load Average is_volatile 0 check_period 24x7 max_check_attempts 3 normal_check_interval 60 contact_groups admins active_checks_enabled 0 passive_checks_enabled 1 # check_load_average is not called here since we are passive # Called from check_metrics check_command check_dummy!1!"load average data is stale" register 1 } Is this really a problem or is it harmless? Thanx -FredC -------------- next part -------------- An HTML attachment was scrubbed... URL: From pete at devilincarnate.eclipse.co.uk Tue Dec 13 18:08:44 2005 From: pete at devilincarnate.eclipse.co.uk (Pete Clarke) Date: Tue, 13 Dec 2005 17:08:44 -0000 (GMT) Subject: check-nfs issue In-Reply-To: References: <1341.217.34.62.180.1134490952.squirrel@www.theclarkefamily.org.uk> Message-ID: <1378.217.34.62.180.1134493724.squirrel@www.theclarkefamily.org.uk> >> Any ideas why this would happen? I use the same script to check for nfs >> services on 2 other Linux servers and they return correctly.. >> > > Is "stepney" == 192.168.0.34 ? > Yep. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From hunterp at sfnewmexican.com Tue Dec 13 19:27:15 2005 From: hunterp at sfnewmexican.com (Hunter Peress) Date: Tue, 13 Dec 2005 11:27:15 -0700 Subject: All my hosts are not updating Message-ID: <3C0EBA58-B533-4D6C-8406-ABABED9965F3@sfnewmexican.com> I think I have conflicting information in Nagios...that the host info screen for a host is reporting that the host has not been checked in 3days, but the scheduling queue says something different. Can you folks help me shed some light on this: The host is named io. In the host detail , it tells me that the host has not been updated all weekend: Host Status: UP Status Information: (Host assumed to be up) Last Status Check: 12-08-2005 17:31:49 Status Data Age: 3d 18h 28m 25s Last State Change: 12-08-2005 17:21:11 Current State Duration: 3d 18h 39m 3s Last Host Notification: N/A But, in the Scheduling Queue screen, it tells me what i think is conflicting data about this host: Host Service Last Check Next Check Active Checks io PING 12-12-2005 12:02:43 12-12-2005 12:03:43 ENABLED To finish, here are some relevant conf files: The conf of the PING service: define service{ use generic-service ; Name of service template to use host_name io service_description PING is_volatile 0 check_period 24x7 max_check_attempts 1 normal_check_interval 1 retry_check_interval 1 contact_groups sfnmweb notification_interval 4 notification_period 24x7 notification_options c,r check_command check_ping!100.0,20%!500.0,60% } And here is the conf of the io host: define host{ use generic-host ; Name of host template to use host_name io alias IO address 10.2.1.10 check_command check-host-alive max_check_attempts 5 notification_interval 15 notification_period 24x7 notification_options d,u,r register 1; } ------------------------------------------------------------------------ Hunter Peress hperess at sfnewmexican.com Web Programmer The Santa Fe New Mexican, Inc. -------------- next part -------------- An HTML attachment was scrubbed... URL: From mls at skayser.de Tue Dec 13 20:22:22 2005 From: mls at skayser.de (Sebastian Kayser) Date: Tue, 13 Dec 2005 20:22:22 +0100 Subject: nagios and snmptt In-Reply-To: <20051213165925.7B72A4F415D@desire.netways.de> References: <20051213165925.7B72A4F415D@desire.netways.de> Message-ID: <20051213192222.GA5902@skayser.homeip.net> * Rusty Hall wrote: > I have nagios 1.2 up and running (and has been for awhile) on a > fedora core2 server. I recently added the snmptt for snmp traps. > The snmptt appears to be running correctly accordng to the logs > being generated. The trap also apperas in the nagios.log file. > The question and or questions i have is I never get an alert an > the passive check aalways says pending. enclosed are the service > check and an entry from the logs The pending state is ok, as you have set the check_period to none, which i suppose contains no timeslots. The service will never be actively checked. > passive check > > define service{ > use generic-service ; Name of service template to use > host_name device_name > service_description TRAP > is_volatile 1 > check_command check-host-alive > check_period 24x7 > max_check_attempts 1 > normal_check_interval 1 > retry_check_interval 1 > passive_checks_enabled 1 > check_period none > notification_interval 31536000 # equals a year > notification_period 24x7 > notification_options w,u,c,r > notifications_enabled 1 > contact_groups linux-admins > } > snmptt log entry Notice the mentioned check_period above? You configured two of them, get rid of one of them (it's confusing). > Wed Dec 7 21:10:41 2005 .iso.org.dod.internet.snmpV2.snmpModules.snmpMIB. > snmpMIBObjects.snmpTraps.coldStart Normal "Status Events" system_name > - A coldStart trap signifies that the SNMP entity, > > Wed Dec 7 15:38:00 2005 .iso.org.dod.internet.private.enterprises.3224.0. > 500 Normal "Status Events" system_ip - This trap indicates that > VPN tunnel status has occured. 41 2005-12-07 15:38:00 system-critical-00 > 041: vpn "St John Tunnel" is down. > > entry from nagios.log > > [1133792753] EXTERNAL COMMAND: PROCESS_SERVICE_CHECK_RESULT;system_name;T > RAP;0;Device reinitialized (coldStart) > > [1133792757] Warning: Message queue contained results for service > 'TRAP' on host 'system_name'. The service could not be found! Nagios receives an external check result and tries to match it to one of it's configured services. Services are uniquely identified by the associated host and the service name. Your configuration is for host "device_name" and service "TRAP", the external check result is for host "system_name" and service "TRAP". No match here (and that's what Nagios complains about in it's warning message). - sebastian ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From btaylor at modulusvideo.com Tue Dec 13 20:25:16 2005 From: btaylor at modulusvideo.com (Bryant Taylor) Date: Tue, 13 Dec 2005 11:25:16 -0800 Subject: contacts.cfg file errors when running pre-flight Message-ID: <3AAB61FCAC5E9D418FA7C897ABF2C815711692@exchange.MODULUSVIDEO.local> Hope someone can help. I have configured all of my .cfg files correctly from the manual, but for some reason, when I run nagios -v nagios.cfg I get the following on this file. Not sure why, have I haven't found anything online or on the nagios website that refers to this. Any ideas and or resolutions would be great Thank you, Bryant Nagios 1.2 Copyright (c) 1999-2004 Ethan Galstad (nagios at nagios.org) Last Modified: 02-02-2004 License: GPL Reading configuration data... Error: Could not add object property in file '/etc/nagios/contacts.cfg' on line 16. ***> One or more problems was encountered while processing the config files... Check your configuration file(s) to ensure that they contain valid directives and data defintions. If you are upgrading from a previous version of Nagios, you should be aware that some variables/definitions may have been removed or modified in this version. Make sure to read the HTML documentation on the main and host config files, as well as the 'Whats New' section to find out what has changed. Here is my current contacts.cfg file. ######################################################################## ####### define contact { contact_name Bryant alias Bryant Taylor contactgroups admins host_notification_period 24x7 service_notification_period 24x7 host_notification_options d,u,r service_notification_options w,u,c,r host_notification_commands host-notify-by-email service_notification_commands notify-by-email email btaylor at modulusvideo.local } ######################################################################## ####### # # Contact configuration file # # END OF FILE # -------------- next part -------------- An HTML attachment was scrubbed... URL: From mls at skayser.de Tue Dec 13 20:40:53 2005 From: mls at skayser.de (Sebastian Kayser) Date: Tue, 13 Dec 2005 20:40:53 +0100 Subject: contacts.cfg file errors when running pre-flight In-Reply-To: <3AAB61FCAC5E9D418FA7C897ABF2C815711692@exchange.MODULUSVIDEO.local> References: <3AAB61FCAC5E9D418FA7C897ABF2C815711692@exchange.MODULUSVIDEO.local> Message-ID: <20051213194053.GA6186@skayser.homeip.net> * Bryant Taylor wrote: > I have configured all of my .cfg files correctly from the manual, but > for some reason, when I run nagios -v nagios.cfg I get the following > on this file. Not sure why, have I haven't found anything online or on > the nagios website that refers to this. > > Here is my current contacts.cfg file. > > define contact { > contact_name Bryant > alias Bryant Taylor > contactgroups admins > host_notification_period 24x7 > service_notification_period 24x7 > host_notification_options d,u,r > service_notification_options w,u,c,r > host_notification_commands host-notify-by-email > service_notification_commands notify-by-email > email btaylor at modulusvideo.local > } The contactgroups membership has to be defined via the directive "members" of a contactgroups object, not via "contactgroups" in a contacts object. - sebastian ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rhall at healthvision.com Tue Dec 13 20:57:53 2005 From: rhall at healthvision.com (Rusty Hall) Date: Tue, 13 Dec 2005 20:57:53 +0100 (CET) Subject: nagios and snmptt In-Reply-To: <20051213192222.GA5902@skayser.homeip.net> References: <20051213192222.GA5902@skayser.homeip.net> Message-ID: <20051213195753.92C944F415D@desire.netways.de> Hi Sebastian in the servce service check and the trap do match I just cleaned the names up but they do match in the config and trap file - Rusty Hall (rdhall1) ----------------------- This thread is located in the archive at this URL: http://www.nagiosexchange.org/nagios-users.34.0.html?&tx_maillisttofaq_pi 1[showUid]=13626 ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From btaylor at modulusvideo.com Tue Dec 13 21:02:10 2005 From: btaylor at modulusvideo.com (Bryant Taylor) Date: Tue, 13 Dec 2005 12:02:10 -0800 Subject: contacts.cfg file errors when running pre-flight Message-ID: <3AAB61FCAC5E9D418FA7C897ABF2C8157116A2@exchange.MODULUSVIDEO.local> Ok so the line "contactgroups admins" should read "members admins"? Correct? -----Original Message----- From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Sebastian Kayser Sent: Tuesday, December 13, 2005 11:41 AM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] Re: contacts.cfg file errors when running pre-flight * Bryant Taylor wrote: > I have configured all of my .cfg files correctly from the manual, but > for some reason, when I run nagios -v nagios.cfg I get the following > on this file. Not sure why, have I haven't found anything online or on > the nagios website that refers to this. > > Here is my current contacts.cfg file. > > define contact { > contact_name Bryant > alias Bryant Taylor > contactgroups admins > host_notification_period 24x7 > service_notification_period 24x7 > host_notification_options d,u,r > service_notification_options w,u,c,r > host_notification_commands host-notify-by-email > service_notification_commands notify-by-email > email btaylor at modulusvideo.local > } The contactgroups membership has to be defined via the directive "members" of a contactgroups object, not via "contactgroups" in a contacts object. - sebastian ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mls at skayser.de Tue Dec 13 21:11:27 2005 From: mls at skayser.de (Sebastian Kayser) Date: Tue, 13 Dec 2005 21:11:27 +0100 Subject: contacts.cfg file errors when running pre-flight In-Reply-To: <3AAB61FCAC5E9D418FA7C897ABF2C8157116A2@exchange.MODULUSVIDEO.local> References: <3AAB61FCAC5E9D418FA7C897ABF2C8157116A2@exchange.MODULUSVIDEO.local> Message-ID: <20051213201127.GA6522@skayser.homeip.net> 1) No need for PM, i read the list 2) Please read http://learn.to/quote * Bryant Taylor wrote: > Ok so the line "contactgroups admins" should read "members > admins"? No, delete the line and add Bryant to the list of members in the definition of your contactgroup admins. - sebastian ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From a.foxcroft at qut.edu.au Tue Dec 13 23:11:45 2005 From: a.foxcroft at qut.edu.au (Anthony Foxcroft) Date: Wed, 14 Dec 2005 08:11:45 +1000 Subject: =?WINDOWS-1252?Q?Nagios_Query_=96=A0Request_Timeout=3F?= Message-ID: <1F33F397-B092-474D-998A-BA7FB98CD021@qut.edu.au> Hi I'm a relatively inexperienced Nagios user. I've got Nagios checking some Hosts and Services on our Mac and PC Servers. However I have a small problem/need which I hope the list can shed some light on. Nagios has the facility to monitor a URL and send an alert if the URL returns a certain response code (e.g. 500). Obviously this requires Nagios to send a request to the server, wait for a response, and then report a failure based on the response code. This works fine for sites that are either running (success response code) or not running (fail response code). However, what we would like to do is to be able to send alerts for sites that are running but are running very slowly. In this case it would take a long time for Nagios to receive a response from the server. Is it possible to configure Nagios to send an alert if it does not get any response from a URL within a given timeframe (e.g. 45 seconds, 1 minute etc)? I imagine this would require some kind of "request timeout" setting. How would I go about setting this up? Kind regards Anthony ???????????????????????????????????????????????????????????????????????? ????????????? Anthony J. Foxcroft Computer Support Officer Queensland University of Technology | Publications Unit K Block | 130 Victoria Park Road | Kelvin Grove | Brisbane | Queensland 4059 | Australia Telephone: 07 3864 3152 | Facsimile: 07 3864 3573 | Pager: 07 3309 1462 | eMail: a.foxcroft at qut.edu.au ________________________________________________________________________ _____________ Experience is a wonderful thing. It enables you to recognise a mistake when you make it again. NOTICE: This email contains information intended for the addressee only. The inclusion of this email address on any mailing list without specific permission is a breach of Anti-Spam legislation. -------------- next part -------------- An HTML attachment was scrubbed... URL: From al at its-lehmann.de Tue Dec 13 23:16:23 2005 From: al at its-lehmann.de (Arno Lehmann) Date: Tue, 13 Dec 2005 23:16:23 +0100 Subject: Nagios querying services no longer in configuration In-Reply-To: <439DE93D.1030705@jml.net> References: <439DB0E8.2000403@jml.net> <439DD497.7010506@its-lehmann.de> <439DD554.10300@jml.net> <439DDCE6.4090100@its-lehmann.de> <439DE93D.1030705@jml.net> Message-ID: <439F4837.4090404@its-lehmann.de> Hello, James Davis schrieb: > Arno Lehmann wrote: ... >>Basically, wherever you put it yourself :-) >> >>It's a setting in the main configuration file. > > > It wasn't in there but following on from that I checked the other files > referenced in the configuration file and found it in > /var/log/nagios/status.log which I then deleted the entry from. I > started nagios up and I've yet to receive any new alerts. > > Does this happen if you remove a host from the configuration whilst > there's an alert outstanding? I don't know, because I never had such a situation, but I guess that, depending on your setup, all the important state information is saved, and that obviously includes state, notification number and interval and last one sent. And, most probably, lots of other interesting stuff, too. So, in short, I'd assume that it does. A simple work around would be to manually inject a passive result that indicates a good state, and then reload Nagios with the modified configuration. Arno > Thanks, > > James > -- IT-Service Lehmann al at its-lehmann.de Arno Lehmann http://www.its-lehmann.de ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From msugano at uolinc.com Tue Dec 13 23:51:14 2005 From: msugano at uolinc.com (Marcel Mitsuto Fucatu Sugano) Date: Tue, 13 Dec 2005 20:51:14 -0200 Subject: Nagios Query =?windows-1251?Q?=96=A0Request?= Timeout? In-Reply-To: <1F33F397-B092-474D-998A-BA7FB98CD021@qut.edu.au> References: <1F33F397-B092-474D-998A-BA7FB98CD021@qut.edu.au> Message-ID: <1134514274.9025.6.camel@localhost> Hello Auzzy fellow... On Wed, 2005-12-14 at 08:11 +1000, Anthony Foxcroft wrote: > Is it possible to configure Nagios to send an alert if it does not get > any response from a URL within a given timeframe (e.g. 45 seconds, 1 > minute etc)? I imagine this would require some kind of "request > timeout" setting. Look for help on the plugins. Try `check_http --help`, from there, you should be able to learn the flags of this plugin. Precisely the -c flag, which is the time threshold for a critical response of a loaded webserver. HTH, > -- Marcel Mitsuto Fucatu Sugano Universo Online S.A. -- http://www.uol.com.br ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From teng at dataway.com Wed Dec 14 01:26:54 2005 From: teng at dataway.com (Tedman Eng) Date: Tue, 13 Dec 2005 16:26:54 -0800 Subject: All my hosts are not updating Message-ID: <37ED92F9890FAF4BB947613C66FF8B1A08BB2C75@dw-mail.dataway.com> You are looking at two different status info's. Hosts are not checked unless one of it's services returns a non-Ok state. If a host is up for 108 days without and service incidents, the host check age will be 108 days old. -----Original Message----- From: Hunter Peress [mailto:hunterp at sfnewmexican.com] Sent: Tuesday, December 13, 2005 10:27 AM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] All my hosts are not updating I think I have conflicting information in Nagios...that the host info screen for a host is reporting that the host has not been checked in 3days, but the scheduling queue says something different. Can you folks help me shed some light on this: The host is named io. In the host detail , it tells me that the host has not been updated all weekend: Host Status: UP Status Information: (Host assumed to be up) Last Status Check: 12-08-2005 17:31:49 Status Data Age: 3d 18h 28m 25s Last State Change: 12-08-2005 17:21:11 Current State Duration: 3d 18h 39m 3s Last Host Notification: N/A But, in the Scheduling Queue screen, it tells me what i think is conflicting data about this host: Host Service Last Check Next Check Active Checks io PING 12-12-2005 12:02:43 12-12-2005 12:03:43 ENABLED To finish, here are some relevant conf files: The conf of the PING service: define service{ use generic-service ; Name of service template to use host_name io service_description PING is_volatile 0 check_period 24x7 max_check_attempts 1 normal_check_interval 1 retry_check_interval 1 contact_groups sfnmweb notification_interval 4 notification_period 24x7 notification_options c,r check_command check_ping!100.0,20%!500.0,60% } And here is the conf of the io host: define host{ use generic-host ; Name of host template to use host_name io alias IO address 10.2.1.10 check_command check-host-alive max_check_attempts 5 notification_interval 15 notification_period 24x7 notification_options d,u,r register 1; } ------------------------------------------------------------------------ Hunter Peress hperess at sfnewmexican.com Web Programmer The Santa Fe New Mexican, Inc. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rtparies at gmail.com Wed Dec 14 01:54:28 2005 From: rtparies at gmail.com (Randy Paries) Date: Tue, 13 Dec 2005 18:54:28 -0600 Subject: getting nagios to snmp to my box, Where do i specify the community Message-ID: Hello, I am trying to do a smnp check on one of my boxes. (it is a mediatrix viop gateway) I can query it using the cactii using the community of public so i have set it up as follows:: The ping works find but the snmp fails with a CRITICAL "Connection refused by host" Si i am thinking maybe it is the community cause like mentioned above cactii can query it just fine. Thanks for any suggestions. ========================================================= define host{ use generic-host ; Name of host template to use host_name MediaTrix.RP alias MediaTrix RP Box address 66.xx.xx.205 max_check_attempts 20 notification_interval 60 notification_period 24x7 notification_options d,u,r } define service{ use generic-service host_name MediaTrix.RP service_description SMTP is_volatile 0 check_period 24x7 max_check_attempts 3 normal_check_interval 5 retry_check_interval 1 contact_groups admins notification_interval 240 notification_period 24x7 notification_options w,u,c,r check_command check_smtp } define service{ use generic-service host_name MediaTrix.RP service_description PING is_volatile 0 check_period 24x7 max_check_attempts 3 normal_check_interval 5 retry_check_interval 1 contact_groups admins notification_interval 240 notification_period 24x7 notification_options w,u,c,r check_command check_ping!100.0,20%!500.0,60% } ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From teng at dataway.com Wed Dec 14 01:57:18 2005 From: teng at dataway.com (Tedman Eng) Date: Tue, 13 Dec 2005 16:57:18 -0800 Subject: contacts.cfg file errors when running pre-flig ht Message-ID: <37ED92F9890FAF4BB947613C66FF8B1A08BB2C76@dw-mail.dataway.com> Bryant, A few years ago, I bought a can of refigerated of chocolate chip cookie dough from the store. I spooned the dough onto a cookie sheet and set it into the oven. Twenty minutes later I had warm moist melt-in-your-mouth chocolate chip cookies! There's nothing better than fresh baked cookies! So one day, some months later, I had friends over and decided to swing by the store and pick up another batch of cookie dough. I threw them in the oven and set the timer, 20 minutes. When I opened the oven though, I found that I had burnt little crisps that barely resembled cookies. It turns out, they'd changed the recipe (and the cooking time too). Moral of the story: Make sure you're using the right directions. You are running Nagios 1.2 The docs showing "contactgroups" as a valid directive are from 2.x documentation. -----Original Message----- From: Bryant Taylor [mailto:btaylor at modulusvideo.com] Sent: Tuesday, December 13, 2005 11:25 AM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] contacts.cfg file errors when running pre-flight Hope someone can help. I have configured all of my .cfg files correctly from the manual, but for some reason, when I run nagios -v nagios.cfg I get the following on this file. Not sure why, have I haven't found anything online or on the nagios website that refers to this. Any ideas and or resolutions would be great Thank you, Bryant Nagios 1.2 Copyright (c) 1999-2004 Ethan Galstad (nagios at nagios.org) Last Modified: 02-02-2004 License: GPL Reading configuration data... Error: Could not add object property in file '/etc/nagios/contacts.cfg' on line 16. ***> One or more problems was encountered while processing the config files... Check your configuration file(s) to ensure that they contain valid directives and data defintions. If you are upgrading from a previous version of Nagios, you should be aware that some variables/definitions may have been removed or modified in this version. Make sure to read the HTML documentation on the main and host config files, as well as the 'Whats New' section to find out what has changed. Here is my current contacts.cfg file. ############################################################################ ### define contact { contact_name Bryant alias Bryant Taylor contactgroups admins host_notification_period 24x7 service_notification_period 24x7 host_notification_options d,u,r service_notification_options w,u,c,r host_notification_commands host-notify-by-email service_notification_commands notify-by-email email btaylor at modulusvideo.local } ############################################################################ ### # # Contact configuration file # # END OF FILE # ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ratty at they.org Wed Dec 14 02:24:21 2005 From: ratty at they.org (frank) Date: Tue, 13 Dec 2005 17:24:21 -0800 (PST) Subject: getting nagios to snmp to my box, Where do i specify the community In-Reply-To: References: Message-ID: First you need to decide what kind of check you're attempting. Your mail says "snmp" (and "smnp") but your config says "SMTP". Try actually running an SNMP check and you might get better results. At the very least you'll get rid of those "Connection refused" messages since SNMP is stateless. -f On Tue, 13 Dec 2005, Randy Paries wrote: > Date: Tue, 13 Dec 2005 18:54:28 -0600 > From: Randy Paries > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] getting nagios to snmp to my box, > Where do i specify the community > > Hello, > I am trying to do a smnp check on one of my boxes. > (it is a mediatrix viop gateway) > > I can query it using the cactii using the community of public > > so i have set it up as follows:: > > The ping works find but the snmp fails with a CRITICAL > "Connection refused by host" > > Si i am thinking maybe it is the community cause like mentioned above > cactii can query it just fine. > > Thanks for any suggestions. > ========================================================= > > define host{ > use generic-host ; Name of host > template to use > > host_name MediaTrix.RP > alias MediaTrix RP Box > address 66.xx.xx.205 > max_check_attempts 20 > notification_interval 60 > notification_period 24x7 > notification_options d,u,r > } > > define service{ > use generic-service > host_name MediaTrix.RP > service_description SMTP > is_volatile 0 > check_period 24x7 > max_check_attempts 3 > normal_check_interval 5 > retry_check_interval 1 > contact_groups admins > notification_interval 240 > notification_period 24x7 > notification_options w,u,c,r > check_command check_smtp > } > > define service{ > use generic-service > host_name MediaTrix.RP > service_description PING > is_volatile 0 > check_period 24x7 > max_check_attempts 3 > normal_check_interval 5 > retry_check_interval 1 > contact_groups admins > notification_interval 240 > notification_period 24x7 > notification_options w,u,c,r > check_command check_ping!100.0,20%!500.0,60% > } > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_idv37&alloc_id865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From m.borsani at it.net Wed Dec 14 09:06:18 2005 From: m.borsani at it.net (Marco Borsani) Date: Wed, 14 Dec 2005 09:06:18 +0100 Subject: How to send SMS using more than one device? Message-ID: <003201c60085$43975780$0900d40a@intranet.it.net> Hi all ! I am using a Nagios running on HP-UX 11.00. To send SMS I prepare a script that send a file to one serial port. I know that there are some tools (like smstools) that permit to use more than one devices to send SMS. Unfortunately on my HP-UX I am not able to compile smstools software (but on a linux system it works very well!). What do you use to send SMS using more than one devices? Many thanks! Marco Borsani Unix & Monitoring System Administrator Technical Operation Tel. +39 010 4310115 Fax +39 010 4327454 E-mail: m.borsani at IT.net ITnet S.r.l. - Direzione e Coordinamento di WIND Telecomunicazioni S.p.A. Internet Service Provider Sede legale: Via C.G.Viola, 48 - 00148 Roma Dir. Centrale e Amministrativa: Via Pacinotti, 39 16151 Genova (Italy) http://www.it.net mailto:info at IT.net _______________________________________________________________ Altre sedi ITnet: MILANO tel.: +39 02 30114900 info-milano at IT.net ROMA tel.: +39 06 83116707 info-roma at IT.net _______________________________________________________________ ITnet is associated to CIX (Commercial IP eXchange) and RIPE ITnet is associated to AIIP (Associazione Italiana Internet Providers) ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From al at its-lehmann.de Wed Dec 14 09:05:41 2005 From: al at its-lehmann.de (Arno Lehmann) Date: Wed, 14 Dec 2005 09:05:41 +0100 Subject: getting nagios to snmp to my box, Where do i specify the community In-Reply-To: References: Message-ID: <439FD255.9040503@its-lehmann.de> Hi, Randy Paries schrieb: > Hello, > I am trying to do a smnp check on one of my boxes. > (it is a mediatrix viop gateway) > > I can query it using the cactii using the community of public Here, it's rather early, but I have the impression you set up your checks quite early, too ;-) Above, you want to write snmp, I assume - community strings belong there, usually. > so i have set it up as follows:: > > The ping works find but the snmp fails with a CRITICAL > "Connection refused by host" See below. > Si i am thinking maybe it is the community cause like mentioned above > cactii can query it just fine. > > Thanks for any suggestions. > ========================================================= > > define host{ > use generic-host ; Name of host > template to use > > host_name MediaTrix.RP > alias MediaTrix RP Box > address 66.xx.xx.205 > max_check_attempts 20 > notification_interval 60 > notification_period 24x7 > notification_options d,u,r > } > > define service{ > use generic-service > host_name MediaTrix.RP > service_description SMTP Really really sure? > is_volatile 0 > check_period 24x7 > max_check_attempts 3 > normal_check_interval 5 > retry_check_interval 1 > contact_groups admins > notification_interval 240 > notification_period 24x7 > notification_options w,u,c,r > check_command check_smtp I guess you check for the simple mail transport protocol, not the simple network management one. > } > Arno -- IT-Service Lehmann al at its-lehmann.de Arno Lehmann http://www.its-lehmann.de ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From srunschke at abit.de Wed Dec 14 10:13:00 2005 From: srunschke at abit.de (srunschke at abit.de) Date: Wed, 14 Dec 2005 10:13:00 +0100 Subject: Antwort: sms notification problem ? In-Reply-To: References: Message-ID: nagios-users-admin at lists.sourceforge.net schrieb am 13.12.2005 13:08:32: > i've 17 service definitions defined for various hosts and hostgroups > since now. and i want to send sms notifications when a service goes down. > but the problem is TWO people want to get notifications when a > service goes DOWN and comes BACK. > not many sms' in 5-10 or 60 minutes. just ONE when service out, and > the SECOND when service IN again. > how do i have to define nofity-by-sms service definition to achieve > this goal ?[just one when out and one when in again] Let's say you have 2 Contactgroups: I-WANT-DOWN-AND-UP and I-ONLY-WANT-DOWN define serviceescalation { hostgroup_name HTTP-HOSTGROUP service_description HTTP contact_groups I-WANT-DOWN-AND-UP,I-ONLY-WANT-DOWN first_notification 1 last_notification 1 notification_interval 10 escalation_period daylight_hours escalation_options c } define serviceescalation { hostgroup_name HTTP-HOSTGROUP service_description HTTP contact_groups I-WANT-DOWN-AND-UP first_notification 1 last_notification 0 notification_interval 10 escalation_period daylight_hours escalation_options r } This sends exactly 1 SMS to both contactgroups if the service goes critical. If the host ever recovers, it sends the recovery only to the I-WANT-DOWN-AND-UP group. Note that every critical notification after the first isn't affected by this escalations anymore and therefore the contact defined in the base service gets paged (usually with mail). regards sash -------------------------------------------------- Sascha Runschke Netzwerk Administration IT-Services ABIT AG Robert-Bosch-Str. 1 40668 Meerbusch Tel.:+49 (0) 2150.9153.226 Mobil:+49 (0) 173.5419665 mailto:SRunschke at abit.de http://www.abit.net http://www.abit-epos.net --------------------------------- Sicherheitshinweis zur E-Mail Kommunikation / Security note regarding email communication: http://www.abit.net/sicherheitshinweis.html ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From werner at skolelinux.no Wed Dec 14 11:09:05 2005 From: werner at skolelinux.no (Morten Werner Olsen) Date: Wed, 14 Dec 2005 11:09:05 +0100 Subject: Extended list of status-/errorcodes from webapps Message-ID: <20051214100905.GO31749@debra.uio.no> Hi! This isn't directly related to Nagios, but I assume some of you use Nagios for monitoring webapplications so still kind of related. :) We've recently started to monitor our webservers and some of our webapplications with Nagios. Most webapplications depends on other services as databases, LDAP, NIS, filesystems and so on. Together with the webappdevelopers we are planning to make a status-page in the most important webapps which tests these dependencies, and print the status either in the output or in one of the HTML-headers. My first thought was to extend the Status-header (which normally says '200 OK'). I googled around to see if I could find an already written extention to this standard [1], but didn't find any. There are also other problems with this solution; PHP denies to print "invalid headers" not defined by the standard, and the check_http-plugin for Nagios reports "Critical - invalid header". So after speaking to a few of my colleagues, we agreed that using an own header ("App-status:" or something) might be the best solution. But I would like to know if any others have done anything like this? And have anyone made such a list of status-codes available or even proposed it as a standard? I'm also interested if some of you have other smart solutions to this "problem"? - Werner [1] http://www.ietf.org/rfc/rfc2616.txt ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From werner at skolelinux.no Wed Dec 14 11:17:42 2005 From: werner at skolelinux.no (Morten Werner Olsen) Date: Wed, 14 Dec 2005 11:17:42 +0100 Subject: Extended list of status-/errorcodes from webapps In-Reply-To: <20051214100905.GO31749@debra.uio.no> References: <20051214100905.GO31749@debra.uio.no> Message-ID: <20051214101742.GQ31749@debra.uio.no> On Wed, Dec 14, 2005 at 11:09:05AM +0100, Morten Werner Olsen wrote: > We've recently started to monitor our webservers and some of our > webapplications with Nagios. Most webapplications depends on other > services as databases, LDAP, NIS, filesystems and so on. Together with > the webappdevelopers we are planning to make a status-page in the most > important webapps which tests these dependencies, and print the status > either in the output or in one of the HTML-headers. HTTP-headers, sorry. - Werner ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marcio at ambientebrasil.com.br Wed Dec 14 13:32:42 2005 From: marcio at ambientebrasil.com.br (Marcio Merlone) Date: Wed, 14 Dec 2005 10:32:42 -0200 Subject: Sound notification on nagios server Message-ID: <1134563562.30790.14.camel@localhost.localdomain> Hello all. I searched the web for this but could not find a proper solution. I am trying to make the nagios server (not the web gui) play a sound when there is something wrong. In this effort, I set: misccommands.cfg: define command{ command_name play_siren command_line /sbin/sirene.sh } /sbin/sirene.sh: #!/bin/bash data=`date` echo "$data Making noise" >> /var/log/buzina.log /usr/bin/aplay -q /usr/share/nagios/htdocs/media/critical.wav contacts.cfg: define contact{ contact_name sirene alias Sound card on sever service_notification_period 24x7 host_notification_period 24x7 service_notification_options w,u,c,r host_notification_options d,u,r service_notification_commands play_siren host_notification_commands play_siren email info at domain.tld } contactgroups.cfg: define contactgroup{ contactgroup_name buzina alias Placa de som na cedro members sirene } hosts.cfg: define host{ use generic-host host_name teste alias Just for fun address 10.0.0.124 check_command check-host-alive max_check_attempts 20 notification_interval 60 notification_period 24x7 notification_options d,u,r } services.cfg: define service{ use generic-service hostgroup_name teste service_description PING is_volatile 0 check_period 24x7 max_check_attempts 3 normal_check_interval 5 retry_check_interval 1 contact_groups router-admins,buzina notification_interval 240 notification_period 24x7 notification_options c,r check_command check_ping!100.0,20%!500.0,60% } I have a sound card attached to the server, and issuing this command by hand I can hear the sound. But when the test server is down the command is not executed. The machine is an ubuntu 5.10 running kernel 2.6.12-10-k7, and nagios-mysql 1.3-cvs.20050402-4ubuntu3. Please let me know if you need further information. Thanks in advance for any help or hint. Best regards, -- Marcio Merlone Ambiente Brasil ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Deborah.Martin at Kognitio.com Wed Dec 14 14:15:23 2005 From: Deborah.Martin at Kognitio.com (Deborah Martin) Date: Wed, 14 Dec 2005 13:15:23 -0000 Subject: Sound notification on nagios server Message-ID: Have you enabled sound in cgi.cfg ? -----Original Message----- From: Marcio Merlone [mailto:marcio at ambientebrasil.com.br] Sent: 14 December 2005 12:33 To: nagios-users Subject: [Nagios-users] Sound notification on nagios server Hello all. I searched the web for this but could not find a proper solution. I am trying to make the nagios server (not the web gui) play a sound when there is something wrong. In this effort, I set: misccommands.cfg: define command{ command_name play_siren command_line /sbin/sirene.sh } /sbin/sirene.sh: #!/bin/bash data=`date` echo "$data Making noise" >> /var/log/buzina.log /usr/bin/aplay -q /usr/share/nagios/htdocs/media/critical.wav contacts.cfg: define contact{ contact_name sirene alias Sound card on sever service_notification_period 24x7 host_notification_period 24x7 service_notification_options w,u,c,r host_notification_options d,u,r service_notification_commands play_siren host_notification_commands play_siren email info at domain.tld } contactgroups.cfg: define contactgroup{ contactgroup_name buzina alias Placa de som na cedro members sirene } hosts.cfg: define host{ use generic-host host_name teste alias Just for fun address 10.0.0.124 check_command check-host-alive max_check_attempts 20 notification_interval 60 notification_period 24x7 notification_options d,u,r } services.cfg: define service{ use generic-service hostgroup_name teste service_description PING is_volatile 0 check_period 24x7 max_check_attempts 3 normal_check_interval 5 retry_check_interval 1 contact_groups router-admins,buzina notification_interval 240 notification_period 24x7 notification_options c,r check_command check_ping!100.0,20%!500.0,60% } I have a sound card attached to the server, and issuing this command by hand I can hear the sound. But when the test server is down the command is not executed. The machine is an ubuntu 5.10 running kernel 2.6.12-10-k7, and nagios-mysql 1.3-cvs.20050402-4ubuntu3. Please let me know if you need further information. Thanks in advance for any help or hint. Best regards, -- Marcio Merlone Ambiente Brasil ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ************************************************************************ This email and any files transmitted with it are confidential and intended solely for the use of the individual or entity to whom they are addressed. Any unauthorised distribution or copying is strictly prohibited. Whilst Kognitio Limited takes steps to prevent the transmission of viruses via e-mail, we can not guarantee that any email or attachment is free from computer viruses and you are strongly advised to undertake your own anti-virus precautions. Kognitio grants no warranties regarding performance, use or quality of any e-mail or attachment and undertakes no liability for loss or damage, howsoever caused. *********************************************************************** ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From kevin.alford at tmctechnologies.com Wed Dec 14 14:37:45 2005 From: kevin.alford at tmctechnologies.com (Kevin Alford) Date: Wed, 14 Dec 2005 08:37:45 -0500 Subject: Monitoring a group of processes Message-ID: <49AE1B79542B094CAF5C60333CC8347D010A146E@WVHQMAIL.tmctechnologies.com> I am running nagios 2.0b5 on Fedora core 3. I wanted to know if it is capable of monitoring a group of processes? An example is smbd and nmbd. You need both of them to be running for Samba to be operational. I can use the command /usr/local/nagios/libexec/check_procs -C smbd to see that the smbd processes are running, and I can do the same thing For nmbd, but how do I look at them together? This is just an example. I am using this to monitor the project specific Application we run, but it involves multiple processes. I would like to monitor each process, and report on the application Being up as a whole. Your assistance in this matter is greatly appreciated. Kevin D. Alford ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marcio at ambientebrasil.com.br Wed Dec 14 14:46:31 2005 From: marcio at ambientebrasil.com.br (Marcio Merlone) Date: Wed, 14 Dec 2005 11:46:31 -0200 Subject: Sound notification on nagios server In-Reply-To: References: Message-ID: <1134567992.30790.17.camel@localhost.localdomain> On Qua, 2005-12-14 at 13:15 +0000, Deborah Martin wrote: > Have you enabled sound in cgi.cfg ? Yes, but this regards only for web clients browsing nagios, and it is working fine. BUT, I want the sound to be played on the nagios server, not my workstation. -- Marcio Merlone Ambiente Brasil ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From tjl at topdanmark.dk Wed Dec 14 14:51:32 2005 From: tjl at topdanmark.dk (Thomas Jens Lauritsen) Date: Wed, 14 Dec 2005 14:51:32 +0100 Subject: Vedr.: Monitoring a group of processes Message-ID: Hi Kevin, I have made a shell-plugin that ask for a group of processes. It uses SNMP to get the data which I pipe to tmp-file. Then I reread the file via grep etc. until I have a status which I then tell Nagois about. In the services.cfg it look like this. # Service definition define service{ use generic-service ; Name of service template to use host_name SERVER-XXXX service_description AVIRUSTASKS is_volatile 0 check_period 24x7 max_check_attempts 2 normal_check_interval 15 retry_check_interval 1 contact_groups xxxxxx-admins notification_interval 120 notification_period 24x7 notification_options w,u,c,r check_command check_tasks.sh!public!nantigen.exe!nasched.exe!nnwall.exe!nnwalljp.exe! } As you can see I check for these four processes on a windows box: nantigen.exe ; nasched.exe ; nnwall.exe ; nnwalljp.exe Best reegards Thomas |---------+----------------------------------------> | | "Kevin Alford" | | | | | | Sendt af: | | | nagios-users-admin at lists.sour| | | ceforge.net | | | | | | | | | 14-12-2005 14:37 | | | | |---------+----------------------------------------> >--------------------------------------------------------------------------------------------------------------| | | | Til: "nagios-users" | | cc: | | Vedr.: [Nagios-users] Monitoring a group of processes | >--------------------------------------------------------------------------------------------------------------| I am running nagios 2.0b5 on Fedora core 3. I wanted to know if it is capable of monitoring a group of processes? An example is smbd and nmbd. You need both of them to be running for Samba to be operational. I can use the command /usr/local/nagios/libexec/check_procs -C smbd to see that the smbd processes are running, and I can do the same thing For nmbd, but how do I look at them together? This is just an example. I am using this to monitor the project specific Application we run, but it involves multiple processes. I would like to monitor each process, and report on the application Being up as a whole. Your assistance in this matter is greatly appreciated. Kevin D. Alford ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From luis.cerezo at pgs.com Wed Dec 14 15:02:48 2005 From: luis.cerezo at pgs.com (Luis Cerezo) Date: Wed, 14 Dec 2005 08:02:48 -0600 Subject: solaris check_disk util does not see mounts/disks... Message-ID: <43A02608.2080908@pgs.com> which version of df is it expecting? ... sunfreeware? Solaris 9... bash-2.05$ df -h | grep users /dev/dsk/c3t1d1s3 340G 230G 107G 69% /users bash-2.05$ mount | grep users /users on /dev/dsk/c3t1d1s3 read/write/setuid/intr/largefiles/logging/quota/xattr/onerror=panic/dev=1d8000b on Tue Nov 29 21:00:27 2005 bash-2.05$ ./check_disk -w 10% -c 2% -p /users Disk "/users" not mounted or nonexistant bash-2.05$ ./check_disk -w 10% -c 2% -p /users/ Disk "/users/" not mounted or nonexistant bash-2.05$ ./check_disk -w 10% -c 2% -p "/users" Disk "/users" not mounted or nonexistant bash-2.05$ ./check_disk -w 10% -c 2% -p"/users" Disk "/users" not mounted or nonexistant bash-2.05$ ./check_disk -w 10% -c 2% -p/users Disk "/users" not mounted or nonexistant bash-2.05$ ./check_disk Could not parse arguments Usage: check_disk -w limit -c limit [-p path | -x device] [-t timeout] [-m] [-e] [--verbose] check_disk (-h|--help) check_disk (-V|--version) bash-2.05$ ./check_disk -w 10% -c 2% -p /users --verbose */usr/local/bin/df -Pk /users ==> Disk "/users" not mounted or nonexistant* bash-2.05$ /usr/local/bin/df *bash: /usr/local/bin/df: No such file or directory* bash-2.05$ which df WARNING *** No CM login file - CM is not setup *** /usr/bin/df bash-2.05$ df -Pk *df: unknown option: P* Usage: df [-F FSType] [-abeghklntVv] [-o FSType-specific_options] [directory | block_device | resource] bash-2.05$ -- Luis E. Cerezo 281.509.8397 ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mscandian at freebsdbrasil.com.br Wed Dec 14 14:18:16 2005 From: mscandian at freebsdbrasil.com.br (=?ISO-8859-1?Q?M=E1rio_S=E9rgio_Candian?=) Date: Wed, 14 Dec 2005 11:18:16 -0200 Subject: check_radius config_file Message-ID: <43A01B98.9060306@freebsdbrasil.com.br> Hi list, I'd like to use the check_radius plugin, but I dont understand to use that plugin. Look: (root at firewall)~# ./check_radius -H 201.38.x.y -u radius -p password -P 1645 -t 15 Config file errorExit 3 (root at firewall)~# ./check_radius Could not parse arguments Usage: check_radius -H host -F config_file -u username -p password' [-P port] [-t timeout] [-r retries] [-e expect] check_radius (-h | --help) for detailed help check_radius (-V | --version) for version information Exit 3 What is the config_file? What I can put in the config_file? Regards, -- Mario Sergio Candian - "Dreams as if you'll live forever. Live as if you'll die today" -- James Dean ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mfatihakbulut at gmail.com Wed Dec 14 15:20:35 2005 From: mfatihakbulut at gmail.com (Mehmet Fatih AKBULUT) Date: Wed, 14 Dec 2005 16:20:35 +0200 Subject: what wrong with these ? Message-ID: hi all, my command definition is like this : define command{ command_name host-notify-by-sms command_line /usr/local/bin/gsmsendsms -d /dev/ttyd0 -b 19200 05322623103 "$NOTIFICATIONTYPE$: $HOSTNAME$ is $HOSTSTATE$ ($OUTPUT$)" } [located in misscommands.cfg] and service definition : # 'nagios' contact definition define contact{ contact_name sysadmin alias Sys Admin service_notification_period 24x7 host_notification_period 24x7 service_notification_options w,u,c,r host_notification_options d,u,r service_notification_commands notify-by-email,host-notify-by-sms host_notification_commands host-notify-by-email,host-notify-by-sms email makbulut at boun.edu.tr } i get mail but have problem in getting sms :( not received any sms for hours even many services down ? any idea why above lines not work ? Regads, Bye -------------- next part -------------- An HTML attachment was scrubbed... URL: From assaf at m-wise.com Wed Dec 14 15:37:37 2005 From: assaf at m-wise.com (Assaf Flatto) Date: Wed, 14 Dec 2005 16:37:37 +0200 Subject: what wrong with these ? In-Reply-To: References: Message-ID: <43A02E31.7010806@m-wise.com> from the first glance is look like a problem in the host/service notification configuration , service notification should be defined as : " service_notification_commands notify-by-sms" you are using host_notify-by-sms for services. Assaf > > my command definition is like this : > > define command{ > command_name host-notify-by-sms > command_line /usr/local/bin/gsmsendsms -d /dev/ttyd0 -b 19200 > 05322623103 "$NOTIFICATIONTYPE$: $HOSTNAME$ is $HOSTSTATE$ ($OUTPUT$)" > } > > [located in misscommands.cfg] > > and service definition : > > # 'nagios' contact definition > define contact{ > contact_name sysadmin > alias Sys Admin > service_notification_period 24x7 > host_notification_period 24x7 > service_notification_options w,u,c,r > host_notification_options d,u,r > service_notification_commands notify-by-email,host-notify-by-sms > host_notification_commands host-notify-by-email,host-notify-by-sms > email makbulut at boun.edu.tr > } ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From awfief at gmail.com Wed Dec 14 15:44:33 2005 From: awfief at gmail.com (sheeri kritzer) Date: Wed, 14 Dec 2005 09:44:33 -0500 Subject: Nagios Performance Data shows checks aren't being completed In-Reply-To: <24cb9b4b0512121443n3d7b0d0fi70f06368a94e84ef@mail.gmail.com> References: <24cb9b4b0512121443n3d7b0d0fi70f06368a94e84ef@mail.gmail.com> Message-ID: <24cb9b4b0512140644r4b62ea64k766f4d3cc92c67ac@mail.gmail.com> So, while exploring, since I haven't heard any responses yet, I found that some of my service checks had a "Next Scheduled Active Check" for 11/29/05. Why would there be one in the past? I restarted nagios, and things seem running fine now, but how did my system get into that state? How can I prevent that from happening in the future? My original message had all my server stats. . . . -Sheeri On 12/12/05, sheeri kritzer wrote: > Hi all, > > My nagios 2.0 installation shows the following under performance > information. There are 99 service checks, and I can't imagine it > takes more than an hour to complete all 99. We've had problems where > nagios hasn't found and notified us of problems. The load on the box > is tiny. nagios -s has no suggestions. What did I do wrong? > [cut for length] ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Deborah.Martin at Kognitio.com Wed Dec 14 16:40:23 2005 From: Deborah.Martin at Kognitio.com (Deborah Martin) Date: Wed, 14 Dec 2005 15:40:23 -0000 Subject: Sound notification on nagios server Message-ID: probably a dumb question - but why have the sound on your server and not through a client front-end ? Surely a nagios server would be hosted in a computer room somewhere and the clients would be those monitoring from their desktops ? Or why not have a web browser open on the server then ? -----Original Message----- ************************************************************************ This email and any files transmitted with it are confidential and intended solely for the use of the individual or entity to whom they are addressed. Any unauthorised distribution or copying is strictly prohibited. Whilst Kognitio Limited takes steps to prevent the transmission of viruses via e-mail, we can not guarantee that any email or attachment is free from computer viruses and you are strongly advised to undertake your own anti-virus precautions. Kognitio grants no warranties regarding performance, use or quality of any e-mail or attachment and undertakes no liability for loss or damage, howsoever caused. *********************************************************************** From: Marcio Merlone [mailto:marcio at ambientebrasil.com.br] Sent: 14 December 2005 13:47 To: Deborah Martin Cc: nagios-users Subject: RE: [Nagios-users] Sound notification on nagios server On Qua, 2005-12-14 at 13:15 +0000, Deborah Martin wrote: > Have you enabled sound in cgi.cfg ? Yes, but this regards only for web clients browsing nagios, and it is working fine. BUT, I want the sound to be played on the nagios server, not my workstation. -- Marcio Merlone Ambiente Brasil ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mark_stray at mlc.com.au Wed Dec 14 18:02:13 2005 From: mark_stray at mlc.com.au (mark_stray at mlc.com.au) Date: Thu, 15 Dec 2005 01:02:13 +0800 Subject: Mark Stray/AU3/WealthMgmt is out of the office. Message-ID: <482570D7.005D0B45.00@flx726.flexiplan.com> I will be out of the office starting 14/12/2005 and will not return until 19/12/2005. I will respond yo your message when I return to Work. Any matters of an urgent nature should be directed to Martin Kleinman or the helpdesk. regards, Mark *********************************************************************** This email is sent by or on behalf of the named sender identified above. If: (a) you do not wish to receive any email marketing material from me in the future, please forward the contents of this email with the message 'unsubscribe' in the subject box to unsubscribe at mlc.com.au; or (b)you wish to unsubscribe from all central email marketing lists used by our business,please forward the contents of this email,with the message 'unsubscribe from all central email marketing lists' in the subject box to unsubscribeall at mlc.com.au. The information contained in this e-mail communication may be confidential. You should only read, disclose, re-transmit, copy, distribute, act in reliance on or commercialise the information if you are authorised to do so. If you are not the intended recipient of this e-mail communication, please immediately notify us by e-mail to masterkeycustom_postmaster at mlc.com.au, or reply by e-mail direct to the sender and then destroy any electronic and paper copy of this message. Any views expressed in this e-mail communication are those of the individual sender, except where the sender specifically states them to be the views of a member of the National Australia Bank Group of companies. Any advice contained in this e-mail has been prepared without taking into account your objectives, financial situation or needs. Before acting on any advice in this e-mail, National Australia Bank Limited recommends that you consider whether it is appropriate for your circumstances. If this e-mail contains reference to any financial products, the National recommends you consider the Product Disclosure statement (PDS) or other disclosure document before making any decisions regarding any products. The National Australia Bank Group of companies does not represent, warrant or guarantee that the integrity of this communication has been maintained nor that the communication is free of errors, virus or interference. *********************************************************************** ______________________________________________________________________ This email has been scanned by the MessageLabs Email Security System. For more information please visit http://www.messagelabs.com/email ______________________________________________________________________ ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From estair at ilm.com Wed Dec 14 18:46:53 2005 From: estair at ilm.com (Eli Stair) Date: Wed, 14 Dec 2005 09:46:53 -0800 Subject: Extended list of status-/errorcodes from webapps In-Reply-To: <20051214100905.GO31749@debra.uio.no> References: <20051214100905.GO31749@debra.uio.no> Message-ID: <43A05A8D.2060000@ilm.com> Making changes to the HTTP header to indicate an errorcode is something I haven't thought about before. I've always used tags in the HTML body to indicate the issue... it requires less work, and enables you to be more verbose (and IMO is cleaner than extending the standard for the header...). What need is making you lean more towards this modification rather than just having your status applet/URL output the relavent data in the body for parsing? /eli Morten Werner Olsen wrote: > Hi! > > This isn't directly related to Nagios, but I assume some of you use > Nagios for monitoring webapplications so still kind of related. :) > > We've recently started to monitor our webservers and some of our > webapplications with Nagios. Most webapplications depends on other > services as databases, LDAP, NIS, filesystems and so on. Together with > the webappdevelopers we are planning to make a status-page in the most > important webapps which tests these dependencies, and print the status > either in the output or in one of the HTML-headers. > > My first thought was to extend the Status-header (which normally says > '200 OK'). I googled around to see if I could find an already written > extention to this standard [1], but didn't find any. There are also > other problems with this solution; PHP denies to print "invalid > headers" not defined by the standard, and the check_http-plugin for > Nagios reports "Critical - invalid header". > > So after speaking to a few of my colleagues, we agreed that using an > own header ("App-status:" or something) might be the best > solution. But I would like to know if any others have done anything > like this? And have anyone made such a list of status-codes available > or even proposed it as a standard? > > I'm also interested if some of you have other smart solutions to this > "problem"? > > > - Werner > > [1] http://www.ietf.org/rfc/rfc2616.txt > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mscandian at freebsdbrasil.com.br Wed Dec 14 17:44:23 2005 From: mscandian at freebsdbrasil.com.br (=?ISO-8859-1?Q?M=E1rio_S=E9rgio_Candian?=) Date: Wed, 14 Dec 2005 14:44:23 -0200 Subject: check_radius Message-ID: <43A04BE7.6050706@freebsdbrasil.com.br> Hi list, I'd like to use the check_radius plugin, but I dont understand to use that plugin. Look: (root at firewall)~# ./check_radius -H 201.38.x.y -u radius -p password -P 1645 -t 15 Config file error Exit 3 (root at firewall)~# ./check_radius Could not parse arguments Usage: check_radius -H host -F config_file -u username -p password' [-P port] [-t timeout] [-r retries] [-e expect] check_radius (-h | --help) for detailed help check_radius (-V | --version) for version information Exit 3 What is the config_file? What I can put in the config_file? Regards, -- Atenciosamente, Mario Sergio Candian FreeBSD Brasil LTDA. - "Dreams as if you'll live forever. Live as if you'll die today" -- James Dean ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From holger at CIS.FU-Berlin.DE Wed Dec 14 19:24:59 2005 From: holger at CIS.FU-Berlin.DE (Holger Weiss) Date: Wed, 14 Dec 2005 19:24:59 +0100 Subject: check_radius In-Reply-To: <43A04BE7.6050706@freebsdbrasil.com.br> References: <43A04BE7.6050706@freebsdbrasil.com.br> Message-ID: <20051214182458.GC28716393@CIS.FU-Berlin.DE> * M?rio S?rgio Candian [2005-12-14 14:44]: > (root at firewall)~# ./check_radius -H 201.38.x.y -u radius -p password -P > 1645 -t 15 > Config file error > Exit 3 > > (root at firewall)~# ./check_radius > Could not parse arguments > Usage: > check_radius -H host -F config_file -u username -p > password' [-P port] [-t timeout] [-r retries] [-e expect] > check_radius (-h | --help) for detailed help > check_radius (-V | --version) for version information > Exit 3 > > What is the config_file? What I can put in the config_file? It's the configuration file for the 'radiusclient' library: | $ check_radius -h | grep configuration | [...] A configuration file may also be present. The format of the | configuration file is described in the radiusclient library sources. Please use the 'nagiosplug-help' list if you have further questions regarding the plugin. Holger -- PGP fingerprint: F1F0 9071 8084 A426 DD59 9839 59D3 F3A1 B8B5 D3DE ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marcio at ambientebrasil.com.br Wed Dec 14 20:00:51 2005 From: marcio at ambientebrasil.com.br (Marcio Merlone) Date: Wed, 14 Dec 2005 17:00:51 -0200 Subject: Sound notification on nagios server In-Reply-To: References: Message-ID: <1134586852.30790.25.camel@localhost.localdomain> On Qua, 2005-12-14 at 15:40 +0000, Deborah Martin wrote: > probably a dumb question - but why have the sound on your server and not > through a client front-end ? > Surely a nagios server would be hosted in a computer room somewhere and the > clients would be those monitoring > from their desktops ? Or why not have a web browser open on the server then > ? A question is never dumb ;) The room where my nagios server is hosted is right beside me, at the reach of my ears. But I don't want to keep my workstation always on, and, as a matter of fact, I don't have the required plugins on my browser to play sound - and don't really care for those, don't want them. Since the server will be always on, it seems a good idea (for me) to have the sound notification on it. My support team is indeed very small, just me and another guy, to deal with 4 linux servers and 2 internet connections. Thanks for your reply. -- Marcio Merlone Ambiente Brasil ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From estair at ilm.com Wed Dec 14 20:58:39 2005 From: estair at ilm.com (Eli Stair) Date: Wed, 14 Dec 2005 11:58:39 -0800 Subject: Sound notification on nagios server In-Reply-To: <1134586852.30790.25.camel@localhost.localdomain> References: <1134586852.30790.25.camel@localhost.localdomain> Message-ID: <43A0796F.90708@ilm.com> I've had success with 'echo -en "\007"' (aka: beep). Have an eventhandler run that whenever you want to get annoyed. It's fun to use with a remote ssh type handler when you're watching interns try to find a node in a cluster and you're watching on Netbotz ;) /eli Marcio Merlone wrote: > On Qua, 2005-12-14 at 15:40 +0000, Deborah Martin wrote: > >>probably a dumb question - but why have the sound on your server and not >>through a client front-end ? >>Surely a nagios server would be hosted in a computer room somewhere and the >>clients would be those monitoring >>from their desktops ? Or why not have a web browser open on the server then >>? > > > > A question is never dumb ;) > > The room where my nagios server is hosted is right beside me, at the > reach of my ears. But I don't want to keep my workstation always on, > and, as a matter of fact, I don't have the required plugins on my > browser to play sound - and don't really care for those, don't want > them. > > Since the server will be always on, it seems a good idea (for me) to > have the sound notification on it. My support team is indeed very small, > just me and another guy, to deal with 4 linux servers and 2 internet > connections. > > Thanks for your reply. > > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From bidhata at gmail.com Wed Dec 14 21:01:52 2005 From: bidhata at gmail.com (Krish Paul) Date: Wed, 14 Dec 2005 21:01:52 +0100 (CET) Subject: Multi User NagiOS Message-ID: <20051214200152.83A334F415D@desire.netways.de> Hi guys, Do you know any plugins by which I can create multiple user in NagiOS , means - let admin added 5 servers for user 'A' and 2 servers for user 'B'.When user 'A' logged in using his login and password, he/she can only browse and see his own server, not user 'B''s servers. Do you know any, please let me know. - Krish Paul (bidhata) ----------------------- The mailing list archive is found here: http://www.nagiosexchange.org/nagios-users.34.0.html ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marcio at ambientebrasil.com.br Wed Dec 14 21:12:53 2005 From: marcio at ambientebrasil.com.br (Marcio Merlone) Date: Wed, 14 Dec 2005 18:12:53 -0200 Subject: Sound notification on nagios server In-Reply-To: <43A0796F.90708@ilm.com> References: <1134586852.30790.25.camel@localhost.localdomain> <43A0796F.90708@ilm.com> Message-ID: <1134591174.30790.30.camel@localhost.localdomain> On Qua, 2005-12-14 at 11:58 -0800, Eli Stair wrote: > > I've had success with 'echo -en "\007"' (aka: beep). Have an > eventhandler run that whenever you want to get annoyed. > > It's fun to use with a remote ssh type handler when you're watching > interns try to find a node in a cluster and you're watching on > Netbotz ;) > > I'll give eventhandlers a try, but this seems wrong to me. Eventhandlers should be used as a response to fix what is wrong - in my understanding. It looks to me that host_notification_commands and service_notification_commands are the proper way to make a notification noise... -- Marcio Merlone ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From estair at ilm.com Wed Dec 14 21:17:08 2005 From: estair at ilm.com (Eli Stair) Date: Wed, 14 Dec 2005 12:17:08 -0800 Subject: Sound notification on nagios server In-Reply-To: <1134591174.30790.30.camel@localhost.localdomain> References: <1134586852.30790.25.camel@localhost.localdomain> <43A0796F.90708@ilm.com> <1134591174.30790.30.camel@localhost.localdomain> Message-ID: <43A07DC4.2090002@ilm.com> Agreed, that's what they're for and definitely what I (and surely most people) use them for. But they also work well to "handle" "events". If there are any reasons not to use them for that, or other arbitrary purposes, I'm not aware of them. /eli Marcio Merlone wrote: > On Qua, 2005-12-14 at 11:58 -0800, Eli Stair wrote: > > >>I've had success with 'echo -en "\007"' (aka: beep). Have an >>eventhandler run that whenever you want to get annoyed. >> >>It's fun to use with a remote ssh type handler when you're watching >>interns try to find a node in a cluster and you're watching on >>Netbotz ;) >> >> > > I'll give eventhandlers a try, but this seems wrong to me. Eventhandlers > should be used as a response to fix what is wrong - in my understanding. > It looks to me that host_notification_commands and > service_notification_commands are the proper way to make a notification > noise... > > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mls at skayser.de Wed Dec 14 21:18:36 2005 From: mls at skayser.de (Sebastian Kayser) Date: Wed, 14 Dec 2005 21:18:36 +0100 Subject: Multi User NagiOS In-Reply-To: <20051214200152.83A334F415D@desire.netways.de> References: <20051214200152.83A334F415D@desire.netways.de> Message-ID: <20051214201836.GA6841@skayser.homeip.net> * Krish Paul wrote: > Do you know any plugins by which I can create multiple user in > NagiOS , means - let admin added 5 servers for user 'A' and 2 servers > for user 'B'.When user 'A' logged in using his login and password, > he/she can only browse and see his own server, not user 'B''s servers. This is Nagios' default behaviour if you use CGI authentication. An authenticated user is only allowed to see hosts/services for whom he is configured as a contact (unless you are working with authorized_for_all_* directives in your CGI configuration file). There is a whole section dedicated to this topic in the Nagios documentation Nagios 1.x: http://nagios.sourceforge.net/docs/1_0/cgiauth.html Nagios 2.x: http://nagios.sourceforge.net/docs/2_0/cgiauth.html - sebastian ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From bidhata at gmail.com Wed Dec 14 21:29:35 2005 From: bidhata at gmail.com (Krish Paul) Date: Wed, 14 Dec 2005 21:29:35 +0100 (CET) Subject: Multi User NagiOS In-Reply-To: <20051214201836.GA6841@skayser.homeip.net> References: <20051214201836.GA6841@skayser.homeip.net> Message-ID: <20051214202935.529544F415D@desire.netways.de> Hi Sebastian Thanks a lot for the answer. Going to configure it in guided way. - Krish Paul (bidhata) ----------------------- This thread is located in the archive at this URL: http://www.nagiosexchange.org/nagios-users.34.0.html?&tx_maillisttofaq_pi 1[showUid]=13672 ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From msugano at uolinc.com Wed Dec 14 21:24:31 2005 From: msugano at uolinc.com (Marcel Mitsuto Fucatu Sugano) Date: Wed, 14 Dec 2005 18:24:31 -0200 Subject: Sound notification on nagios server In-Reply-To: <1134567992.30790.17.camel@localhost.localdomain> References: <1134567992.30790.17.camel@localhost.localdomain> Message-ID: <1134591871.16079.26.camel@localhost> On Wed, 2005-12-14 at 11:46 -0200, Marcio Merlone wrote: > Yes, but this regards only for web clients browsing nagios, and it is > working fine. BUT, I want the sound to be played on the nagios server, > not my workstation. Nagios user has permissions on whatever /dev/dsp devices? -- Marcel Mitsuto Fucatu Sugano Universo Online S.A. -- http://www.uol.com.br ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From cmoody at qualcomm.com Wed Dec 14 21:41:57 2005 From: cmoody at qualcomm.com (Chris Moody) Date: Wed, 14 Dec 2005 12:41:57 -0800 Subject: Multi User NagiOS In-Reply-To: <20051214202935.529544F415D@desire.netways.de> References: <20051214201836.GA6841@skayser.homeip.net> <20051214202935.529544F415D@desire.netways.de> Message-ID: <43A08395.9040603@qualcomm.com> Since it sounds like you are also looking for a way to prevent a user from adding/deleting hosts/services/etc from another user's listings...you can check out my nagios Vhost script on nagiosexchange. I have a situation where I run seperate nagios processes for each group. This allows the users in the group to modify their instance of nagios to their heart's content...without even touching the other processes and/or configs. It is quite trivial to chroot users to the nagios directory of their group when they login to the server to modify their setup. If you -need- this kind of functionality as well, I'd recommend the Vhost setup. If you only need to separate the _views_...then the nagios auth configs are the perfect solution and don't require any extra processes. Cheers, -Chris Krish Paul wrote: > Hi Sebastian > > > > Thanks a lot for the answer. Going to configure it in guided way. > > > > > > - Krish Paul (bidhata) > > > > ----------------------- > > This thread is located in the archive at this URL: > > http://www.nagiosexchange.org/nagios-users.34.0.html?&tx_maillisttofaq_pi > 1[showUid]=13672 > > > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_idv37&alloc_id865&op?k > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marcio at ambientebrasil.com.br Wed Dec 14 21:55:24 2005 From: marcio at ambientebrasil.com.br (Marcio Merlone) Date: Wed, 14 Dec 2005 18:55:24 -0200 Subject: Sound notification on nagios server In-Reply-To: <1134591871.16079.26.camel@localhost> References: <1134567992.30790.17.camel@localhost.localdomain> <1134591871.16079.26.camel@localhost> Message-ID: <1134593725.9513.3.camel@localhost.localdomain> On Qua, 2005-12-14 at 18:24 -0200, Marcel Mitsuto Fucatu Sugano wrote: > On Wed, 2005-12-14 at 11:46 -0200, Marcio Merlone wrote: > > Yes, but this regards only for web clients browsing nagios, and it is > > working fine. BUT, I want the sound to be played on the nagios server, > > not my workstation. > > Nagios user has permissions on whatever /dev/dsp devices? It does now :) But still no sound. But while checking the permissions, I've got an improvement. Now the log file /var/log/buzina.log of the script gets populated.... (Recording the script:) root at cedro:/etc/nagios# cat /usr/lib/nagios/plugins/sirene #!/bin/bash data=`date` echo "$data Tocando buzina" >> /var/log/buzina.log /usr/bin/aplay -q /usr/share/nagios/htdocs/media/critical.wav root at cedro:/etc/nagios# Good tip, but there is still something wrong. -- Marcio Merlone Ambiente Brasil ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From s.shipway at auckland.ac.nz Wed Dec 14 22:18:01 2005 From: s.shipway at auckland.ac.nz (Steve Shipway) Date: Thu, 15 Dec 2005 10:18:01 +1300 Subject: Monitoring a group of processes In-Reply-To: <14A270C0E2F4124AB0B7320BDBEA949F0AB3B57F@vxchange.vcr.auckland.ac.nz> References: <14A270C0E2F4124AB0B7320BDBEA949F0AB3B57F@vxchange.vcr.auckland.ac.nz> Message-ID: <001901c600f3$ddc5ab90$245cd882@438LG01sshi052> > An example is smbd and nmbd. You need both of them to be > running for Samba to be operational. I can use the command > /usr/local/nagios/libexec/check_procs -C smbd to see that the > smbd processes are running, and I can do the same thing For > nmbd, but how do I look at them together? You could define two Nagios services using check_procs (one to check for smbd, one to check for nmbd) and then define a third summary service using check_summary (get it from www.nagiosexchange.org ) to give the worst result of the two. Use summary calculation '@max( /host/smbd /host/nmbd )' or similar depending on your host and service names. Steve ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From al at its-lehmann.de Wed Dec 14 22:39:07 2005 From: al at its-lehmann.de (Arno Lehmann) Date: Wed, 14 Dec 2005 22:39:07 +0100 Subject: Sound notification on nagios server In-Reply-To: <1134593725.9513.3.camel@localhost.localdomain> References: <1134567992.30790.17.camel@localhost.localdomain> <1134591871.16079.26.camel@localhost> <1134593725.9513.3.camel@localhost.localdomain> Message-ID: <43A090FB.3010504@its-lehmann.de> Hello, Marcio Merlone schrieb: > On Qua, 2005-12-14 at 18:24 -0200, Marcel Mitsuto Fucatu Sugano wrote: > >>On Wed, 2005-12-14 at 11:46 -0200, Marcio Merlone wrote: >> >>>Yes, but this regards only for web clients browsing nagios, and it is >>>working fine. BUT, I want the sound to be played on the nagios server, >>>not my workstation. >> >>Nagios user has permissions on whatever /dev/dsp devices? > > > It does now :) More general approach: Try running your notification script as the user nagios runs as. Or put it in a wrapper script that uses su or sudo to execute the siren-script as, for example, root. That might give you some clues... I guess it's a permissions problem you encounter. > But still no sound. But while checking the permissions, I've got an > improvement. Now the log file /var/log/buzina.log of the script gets > populated.... > > (Recording the script:) > > root at cedro:/etc/nagios# cat /usr/lib/nagios/plugins/sirene > #!/bin/bash > data=`date` > echo "$data Tocando buzina" >> /var/log/buzina.log > /usr/bin/aplay -q /usr/share/nagios/htdocs/media/critical.wav > root at cedro:/etc/nagios# > > Good tip, but there is still something wrong. Permissions on aplay, the wav file, or anything else? An aplay ... >>/var/log/buzina.log might also show some error messages. Arno -- IT-Service Lehmann al at its-lehmann.de Arno Lehmann http://www.its-lehmann.de ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jason at shakabuku.org Wed Dec 14 22:59:48 2005 From: jason at shakabuku.org (Jason Bodnar) Date: Wed, 14 Dec 2005 15:59:48 -0600 Subject: Using newlines and control characters in config file Message-ID: <20051214215749.M96016@shakabuku.org> I remember once reading there was a special character sequence for using a newline in config files (for sending as a string with check_tcp for example). I cannot find this in the documentation, FAQ or mailing list archives. Could somebody point me to this info. Also, what is the appropriate way to put control characters (such as EOF/EOT) in a config file? Thanks, Jason -- Jason Bodnar jason at shakabuku.org http://www.shakabuku.org UnWired Buyer. The best way to win on eBay. Period. http://www.unwiredbuyer.com ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From richard.gliebe at fhv.at Thu Dec 15 08:47:58 2005 From: richard.gliebe at fhv.at (Richard Gliebe) Date: Thu, 15 Dec 2005 08:47:58 +0100 Subject: alternative text behind links Message-ID: <1134632878.9864.15.camel@glr-nb.dh.uclv.net> Hi, maybe a cracy question. I want to insert/define an alternative text (MouseOver) behind a link in the WebGUI like this: [...] Service Status Details For Host 'x.y.z' . OutputSource OK 12-15-2005 08:35:21 0d 0h 21m 33s 1/3 Output Source OK - 3 . [...] the Column "OutputSource" is a link to the "Service State Information" Now I want to define an explanation (MouseOver Popup) behind this link, what the snmpget output "3" meens for this service, like: INTEGERS: other (1), none (2), normal (3), bypass (4), battery (5), booster (6), reducer (7) Thanks in advance Richard -- Richard Gliebe Fachhochschule Vorarlberg GmbH / University for Applied Science Information Services ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From zyjohn at people.com.cn Thu Dec 15 10:57:40 2005 From: zyjohn at people.com.cn (Zhou Yan) Date: Thu, 15 Dec 2005 17:57:40 +0800 Subject: How to send SMS using more than one device? In-Reply-To: <003201c60085$43975780$0900d40a@intranet.it.net> References: <003201c60085$43975780$0900d40a@intranet.it.net> Message-ID: <43A13E14.5090403@people.com.cn> Hello: I think you can use kannel (http://www.kannel.org), by defining gsm modems as different SMSC, sms can be sent through more than one devices. ZhouYan System Engineer E-mail: zyjohn at people.com.cn Beijing, China Marco Borsani wrote: >Hi all ! > >I am using a Nagios running on HP-UX 11.00. To send SMS I prepare a script >that send a file to one serial port. > >I know that there are some tools (like smstools) that permit to use more >than one devices to send SMS. >Unfortunately on my HP-UX I am not able to compile smstools software (but on >a linux system it works very well!). > >What do you use to send SMS using more than one devices? > >Many thanks! > >Marco Borsani >Unix & Monitoring System Administrator >Technical Operation >Tel. +39 010 4310115 >Fax +39 010 4327454 >E-mail: m.borsani at IT.net > >ITnet S.r.l. - Direzione e Coordinamento di WIND Telecomunicazioni S.p.A. >Internet Service Provider >Sede legale: Via C.G.Viola, 48 - 00148 Roma >Dir. Centrale e Amministrativa: Via Pacinotti, 39 > 16151 Genova (Italy) > >http://www.it.net >mailto:info at IT.net >_______________________________________________________________ >Altre sedi ITnet: >MILANO tel.: +39 02 30114900 info-milano at IT.net >ROMA tel.: +39 06 83116707 info-roma at IT.net >_______________________________________________________________ >ITnet is associated to CIX (Commercial IP eXchange) and RIPE >ITnet is associated to AIIP (Associazione Italiana Internet Providers) > > > > >------------------------------------------------------- >This SF.net email is sponsored by: Splunk Inc. Do you grep through log files >for problems? Stop! Download the new AJAX search engine that makes >searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! >http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click >_______________________________________________ >Nagios-users mailing list >Nagios-users at lists.sourceforge.net >https://lists.sourceforge.net/lists/listinfo/nagios-users >::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. >::: Messages without supporting info will risk being sent to /dev/null > > > > > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rmayhew at mweb.com Thu Dec 15 11:41:45 2005 From: rmayhew at mweb.com (Richard Mayhew) Date: Thu, 15 Dec 2005 12:41:45 +0200 Subject: How to send SMS using more than one device? Message-ID: Try SMS tools... It does it all, and its the simplest. -- Regards Richard -----Original Message----- From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Zhou Yan Sent: 15 December 2005 11:58 AM To: Marco Borsani Cc: 'nagios-users' Subject: Re: [Nagios-users] How to send SMS using more than one device? Hello: I think you can use kannel (http://www.kannel.org), by defining gsm modems as different SMSC, sms can be sent through more than one devices. ZhouYan System Engineer E-mail: zyjohn at people.com.cn Beijing, China Marco Borsani wrote: >Hi all ! > >I am using a Nagios running on HP-UX 11.00. To send SMS I prepare a >script that send a file to one serial port. > >I know that there are some tools (like smstools) that permit to use >more than one devices to send SMS. >Unfortunately on my HP-UX I am not able to compile smstools software >(but on a linux system it works very well!). > >What do you use to send SMS using more than one devices? > >Many thanks! > >Marco Borsani >Unix & Monitoring System Administrator >Technical Operation >Tel. +39 010 4310115 >Fax +39 010 4327454 >E-mail: m.borsani at IT.net > >ITnet S.r.l. - Direzione e Coordinamento di WIND Telecomunicazioni S.p.A. >Internet Service Provider >Sede legale: Via C.G.Viola, 48 - 00148 Roma >Dir. Centrale e Amministrativa: Via Pacinotti, 39 > 16151 Genova (Italy) > >http://www.it.net >mailto:info at IT.net >_______________________________________________________________ >Altre sedi ITnet: >MILANO tel.: +39 02 30114900 info-milano at IT.net >ROMA tel.: +39 06 83116707 info-roma at IT.net >_______________________________________________________________ >ITnet is associated to CIX (Commercial IP eXchange) and RIPE ITnet is >associated to AIIP (Associazione Italiana Internet Providers) > > > > >------------------------------------------------------- >This SF.net email is sponsored by: Splunk Inc. Do you grep through log >files for problems? Stop! Download the new AJAX search engine that >makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! >http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click >_______________________________________________ >Nagios-users mailing list >Nagios-users at lists.sourceforge.net >https://lists.sourceforge.net/lists/listinfo/nagios-users >::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. >::: Messages without supporting info will risk being sent to /dev/null > > > > > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From werner.flamme at ufz.de Thu Dec 15 12:52:13 2005 From: werner.flamme at ufz.de (Werner Flamme) Date: Thu, 15 Dec 2005 12:52:13 +0100 Subject: Monitoring a group of processes In-Reply-To: <49AE1B79542B094CAF5C60333CC8347D010A146E@WVHQMAIL.tmctechnologies.com> References: <49AE1B79542B094CAF5C60333CC8347D010A146E@WVHQMAIL.tmctechnologies.com> Message-ID: <43A158ED.5040402@ufz.de> Kevin Alford schrieb am 14.12.05 14:37: > I am running nagios 2.0b5 on Fedora core 3. I wanted to know if it is > capable of monitoring a group of processes? > An example is smbd and nmbd. You need both of them to be running for > Samba to be operational. I can use the command > /usr/local/nagios/libexec/check_procs -C smbd to see that the smbd > processes are running, and I can do the same thing > For nmbd, but how do I look at them together? This is just an example. > I am using this to monitor the project specific > Application we run, but it involves multiple processes. I would like to > monitor each process, and report on the application > Being up as a whole. Your assistance in this matter is greatly > appreciated. > > > > Kevin D. Alford Kevin, I used check_smb.sh (from the contrib dir in the nagios-plugins package) and modified a bit. Maybe you can do so too. The modified script runs on SuSE Linux Enterprise Server. I had some definitions at the beginning: BINGREP=`which grep` BINPS=`which ps` BINWC=`which wc` # look if your ps supports the syntax - Solaris does not... PSCOMMAND="$BINPS ax" # we need an output with the name of the process, so you # may adapt "ps ax" to what your system understands :-) and then I modified the checks: # # No arguments. Let's kick this pig. # # is smbd running? Count everything with "smbd" inside: SMBDRUN=$($PSCOMMAND | $BINGREP smbd | wc -l) # now we subtract "ps ax": SMBDRUN=$[$SMBDRUN-1] # # Ok, now let's grab a count of nmbd processes. # NMBDRUN=$($PSCOMMAND | $BINGREP nmbd | wc -l) # now we subtract "ps ax" again: NMBDRUN=$[$NMBDRUN-1] # # now for the dismount. # #echo "Total Users:$total_users Total Files:$total_files" echo "SMB-Procs: $SMBDRUN - NMB-Procs: $NMBDRUN" # if any of NMBDRUN or SMBDRUN equals zero, ALLPROC will become zero too: ALLPROC=$[$NMBDRUN*$SMBDRUN] # # let Nagios know that everything is ok - or not... # # either nmbd or smbd is (or both are) not running if [ $ALLPROC -eq 0 ]; then exit $STATE_CRITICAL # maybe you want to see a warning when there is only # 1 smbd and 1 nmbd running? Then de-comment next 2 lines: #elif [ $ALLPROC -lt 2 ]; then # exit $STATE_WARNING # and in any other case I assume everything is OK... else exit $STATE_OK fi So I fairly agree with Cal Evans: "Shamelessly stolen from other Nagios plugins." HTH, Werner -- Werner Flamme, Abt. WKDV UFZ Umweltforschungszentrum Leipzig-Halle GmbH Permoserstr. 15, 04318 Leipzig eMail: werner.flamme at ufz.de, Tel.: (0341) 235-2500 ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marcio at ambientebrasil.com.br Thu Dec 15 14:04:02 2005 From: marcio at ambientebrasil.com.br (Marcio Merlone) Date: Thu, 15 Dec 2005 11:04:02 -0200 Subject: Sound notification on nagios server In-Reply-To: <43A090FB.3010504@its-lehmann.de> References: <1134567992.30790.17.camel@localhost.localdomain> <1134591871.16079.26.camel@localhost> <1134593725.9513.3.camel@localhost.localdomain> <43A090FB.3010504@its-lehmann.de> Message-ID: <1134651843.9513.31.camel@localhost.localdomain> On Qua, 2005-12-14 at 22:39 +0100, Arno Lehmann wrote: > > On Qua, 2005-12-14 at 18:24 -0200, Marcel Mitsuto Fucatu Sugano wrote: > >>On Wed, 2005-12-14 at 11:46 -0200, Marcio Merlone wrote: > >>>Yes, but this regards only for web clients browsing nagios, and it is > >>>working fine. BUT, I want the sound to be played on the nagios server, > >>>not my workstation. > >> > >>Nagios user has permissions on whatever /dev/dsp devices? > More general approach: Try running your notification script as the user > nagios runs as. Or put it in a wrapper script that uses su or sudo to > execute the siren-script as, for example, root. That might give you some > clues... I guess it's a permissions problem you encounter. When Mr. Mitsuto said something about permissions, I checked all possibilities, and made this: root at cedro:/etc/nagios# su - nagios -s /bin/bash nagios at cedro:~ $ /usr/bin/aplay /usr/share/nagios/htdocs/media/critical.wav Playing WAVE '/usr/share/nagios/htdocs/media/critical.wav' : Unsigned 8 bit, Rate 11025 Hz, Mono nagios at cedro:~$ It made a sound! But: root at cedro:/etc/nagios# su - nagios -c "/usr/bin/aplay /usr/share/nagios/htdocs/media/critical.wav" root at cedro:/etc/nagios# This makes nothing. Anyway, it is clear that the nagios user is able to access /dev/dsp, since it played the sound on the first attempt. > Permissions on aplay, the wav file, or anything else? An aplay ... > >>/var/log/buzina.log might also show some error messages. I made that also, but no output to the log from aplay, just the echo "Playing" >> log... Should I give up? :) I refuse to! -- Marcio Merlone Ambiente Brasil ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ktynagios at richard-group.com Thu Dec 15 16:42:13 2005 From: ktynagios at richard-group.com (Kurt Yoder) Date: Thu, 15 Dec 2005 10:42:13 -0500 Subject: max_check_attempts ignores warnings? Message-ID: <5A00A48F-CC61-4CD2-913D-6BFAD37D303B@richard-group.com> Hello I would like my check_attempts and max_check_attempts to behave differently than it is now. Currently, assuming max_check_attempts is 3, swap warning is 10%, and swap critical is 90%, I can have the following situation: swap 5: ok swap 15: warning (check_attempts 1) swap 30: warning (check_attempts 2) swap 45: warning (check_attempts 3: max_check_attempts is reached) swap 95: critical (check_attempts 3: notification sent since max_check_attempts is reached) swap 15: warning (check_attempts 3: max_check_attempts is still fulfilled) swap 95: critical (check_attempts 3: notification sent since max_check_attempts is reached) swap 95: critical (check_attempts 3: notification already sent) swap 95: critical (check_attempts 3: notification already sent) swap 95: critical (check_attempts 3: notification already sent) swap 15: warning (check_attempts 3: max_check_attempts is reached) swap 5: ok I want it to do this instead: swap 5: ok swap 15: warning swap 30: warning swap 45: warning swap 95: critical (check_attempts 1) swap 15: warning swap 95: critical (check_attempts 1) swap 95: critical (check_attempts 2) swap 95: critical (check_attempts 3: notification sent since max_check_attempts is reached) swap 95: critical (check_attempts 3: notification already sent) swap 15: warning swap 5: ok So I want check_attempts to only increment when I'm in critical state, *not* when I'm in warning state. Is it possible for me to configure nagios this way? -- Kurt Yoder ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rutski89 at gmail.com Thu Dec 15 19:49:25 2005 From: rutski89 at gmail.com (Patrick Rutkowski) Date: Thu, 15 Dec 2005 13:49:25 -0500 Subject: Retention File Message-ID: "checks_enabled = 0" is saved in my retention file. I disabled my retention file and set "checks_enabled = 1" on all of my hosts in hosts.cfg. I reload/restart nagios yet it insists on settings "checks_enabled = 0" for all my hosts, and not scheduling pings for them. What should I do? ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rutski89 at gmail.com Thu Dec 15 20:32:51 2005 From: rutski89 at gmail.com (Patrick Rutkowski) Date: Thu, 15 Dec 2005 14:32:51 -0500 Subject: Retention File Message-ID: I'm posting this again because I checked the archives and I think it didn't go though. "checks_enabled = 0" is saved in my retention file. I disabled my retention file and set "checks_enabled = 1" on all of my hosts in hosts.cfg. I reload/restart nagios yet it insists on settings "checks_enabled = 0" for all my hosts, and not scheduling pings for them. What should I do? ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From estair at ilm.com Thu Dec 15 20:35:31 2005 From: estair at ilm.com (Eli Stair) Date: Thu, 15 Dec 2005 11:35:31 -0800 Subject: Bug report/workaround -- (was Re: Nagios Performance Data shows checks aren't being completed) In-Reply-To: <439DFE0B.8020600@ilm.com> References: <24cb9b4b0512121443n3d7b0d0fi70f06368a94e84ef@mail.gmail.com> <439DFE0B.8020600@ilm.com> Message-ID: <43A1C583.6020601@ilm.com> I've been trying to resolve this situation for over a week now without taking drastic changes. 2.0b6, all retention data created new (not continued from older versions), x86_64, perl cache enabled. I've had a worsening problem recently, where my monitoring host (which is controlling 1003 hosts/8543 services/5257 service dependencies) an increasing number of service checks and event handlers were falling through the scheduler. Even after stopping and starting nagios, and doing a forced_host_svc_checks the relavent check/responses during the several-minute execution pause, these were being skipped or not acted upon. Showing the status in 'view config' confirmed that it was set up properly, but events were missed and either not re-scheduled or rescheduled but not executed. The last step I took was to stop nagios a final time last night and zero the state file retention.dat (as well as the objects.cache for good measure, though it wasn't the problem). After starting nagios fresh with no notion of previous states, within one hour (my threshhold for service/host checks) the entire schedule was executed properly, all services that had been in an unhandled 'bad' state for days were checked, and the respective event handlers were run and the situation rectified. I have no idea of the cause of this, whether it will happen again or not, etc. I'll be more than happy to provide more details. I have backups of the config and retention files from several periods during this period. I'd really like to help resolve this, as losing the trending data is not something I want to do again. My only concern with this setup is the "Warning: Size of service_message struct (528 bytes) is > POSIX-guaranteed atomic write size (512 bytes). Service checks results may get lost or mangled!" I get when building 2.0betas on any system I have available, I haven't seen this addressed/resolved in any searches of archives I've done. Cheers, /eli Eli Stair wrote: > > Corroboration here, I actually have a mail I'm compiling also on the > same issue. 2.0b6 > > I've got orphaned service checks enabled, unlimited parallel service > checks, etc. If I force a host/svc check through the CGI's or the > command file direct they get executed right away... the scheduler just > is losing them. > > /eli > > sheeri kritzer wrote: > >> Hi all, >> >> My nagios 2.0 installation shows the following under performance >> information. There are 99 service checks, and I can't imagine it >> takes more than an hour to complete all 99. We've had problems where >> nagios hasn't found and notified us of problems. The load on the box >> is tiny. nagios -s has no suggestions. What did I do wrong? >> >> uptime >> 17:38:38 up 81 days, 9:05, 4 users, load average: 0.00, 0.00, 0.00 >> >> Nagios is running, and has been for a while: >> >> ps -ef | grep nagios >> nagios 11160 1 0 Nov14 ? 00:12:32 /usr/bin/nagios -d >> /etc/nagios/nagios.cfg >> nagios 22947 1 0 Nov20 ? 00:00:00 nrpe -c >> /etc/nagios/nrpe.cfg -d >> >> Performance Info: >> >> Program-Wide Performance Information >> Active Service Checks: >> >> Time Frame Checks Completed >> <= 1 minute: 1 (1.0%) >> <= 5 minutes: 58 (58.6%) >> <= 15 minutes: 60 (60.6%) >> <= 1 hour: 60 (60.6%) >> Since program start: 99 (100.0%) >> >> Metric Min. Max. Average >> Check Execution Time: 0.01 sec 8.71 sec 1.286 sec >> Check Latency: 0.01 sec 1.03 sec 0.488 sec >> Percent State Change: 0.00% 0.00% 0.00% >> Passive Service Checks: >> >> Time Frame Checks Completed >> <= 1 minute: 0 (0.0%) >> <= 5 minutes: 0 (0.0%) >> <= 15 minutes: 0 (0.0%) >> <= 1 hour: 0 (0.0%) >> Since program start: 0 (0.0%) >> >> Metric Min. Max. Average >> Percent State Change: 0.00% 0.00% 0.00% >> Active Host Checks: >> >> Time Frame Checks Completed >> <= 1 minute: 0 (0.0%) >> <= 5 minutes: 0 (0.0%) >> <= 15 minutes: 0 (0.0%) >> <= 1 hour: 0 (0.0%) >> Since program start: 19 (76.0%) >> >> Metric Min. Max. Average >> Check Execution Time: 3.01 sec 4.01 sec 3.972 sec >> Check Latency: 0.00 sec 0.00 sec 0.000 sec >> Percent State Change: 0.00% 0.00% 0.00% >> Passive Host Checks: >> >> Time Frame Checks Completed >> <= 1 minute: 0 (0.0%) >> <= 5 minutes: 0 (0.0%) >> <= 15 minutes: 0 (0.0%) >> <= 1 hour: 0 (0.0%) >> Since program start: 0 (0.0%) >> >> Metric Min. Max. Average >> Percent State Change: 0.00% 0.00% 0.00% >> >> ---------------------------------------------------------------------------------------------------------------------------- >> >> >> Nagios 2.0b4 >> Copyright (c) 1999-2005 Ethan Galstad (http://www.nagios.org) >> Last Modified: 08-02-2005 >> License: GPL >> >> Projected scheduling information for host and service >> checks is listed below. This information assumes that >> you are going to start running Nagios with your current >> config files. >> >> HOST SCHEDULING INFORMATION >> --------------------------- >> Total hosts: 25 >> Total scheduled hosts: 0 >> Host inter-check delay method: SMART >> Average host check interval: 0.00 sec >> Host inter-check delay: 0.00 sec >> Max host check spread: 30 min >> First scheduled check: N/A >> Last scheduled check: N/A >> >> >> SERVICE SCHEDULING INFORMATION >> ------------------------------- >> Total services: 99 >> Total scheduled services: 99 >> Service inter-check delay method: SMART >> Average service check interval: 300.00 sec >> Inter-check delay: 3.03 sec >> Interleave factor method: SMART >> Average services per host: 3.96 >> Service interleave factor: 4 >> Max service check spread: 30 min >> First scheduled check: Mon Dec 12 17:39:51 2005 >> Last scheduled check: Mon Dec 12 17:44:47 2005 >> >> >> CHECK PROCESSING INFORMATION >> ---------------------------- >> Service check reaper interval: 10 sec >> Max concurrent service checks: Unlimited >> >> >> PERFORMANCE SUGGESTIONS >> ----------------------- >> I have no suggestions - things look okay. >> >> >> --------------------------------------------------------------------------------------------------------------------------------- >> >> >> grep -v ^# /etc/nagios/nagios.cfg | grep -v ^$ >> Nagios.cfg params: >> >> log_file=/var/log/nagios/nagios.log >> cfg_file=/etc/nagios/checkcommands.cfg >> cfg_file=/etc/nagios/misccommands.cfg >> cfg_file=/etc/nagios/contactgroups.cfg >> cfg_file=/etc/nagios/contacts.cfg >> cfg_file=/etc/nagios/dependencies.cfg >> cfg_file=/etc/nagios/escalations.cfg >> cfg_file=/etc/nagios/hostgroups.cfg >> cfg_file=/etc/nagios/hosts.cfg >> cfg_file=/etc/nagios/services.cfg >> cfg_file=/etc/nagios/timeperiods.cfg >> object_cache_file=/var/log/nagios/objects.cache >> resource_file=/etc/nagios/resource.cfg >> status_file=/var/log/nagios/status.dat >> nagios_user=nagios >> nagios_group=nagios >> check_external_commands=1 >> command_check_interval=-1 >> command_file=/var/log/nagios/rw/nagios.cmd >> comment_file=/var/log/nagios/comments.dat >> downtime_file=/var/log/nagios/downtime.dat >> lock_file=/var/run/nagios.pid >> temp_file=/var/log/nagios/nagios.tmp >> event_broker_options=-1 >> log_rotation_method=d >> log_archive_path=/var/log/nagios/archives >> use_syslog=1 >> log_notifications=1 >> log_service_retries=1 >> log_host_retries=1 >> log_event_handlers=1 >> log_initial_states=0 >> log_external_commands=1 >> log_passive_checks=1 >> service_inter_check_delay_method=s >> max_service_check_spread=30 >> service_interleave_factor=s >> host_inter_check_delay_method=s >> max_host_check_spread=30 >> max_concurrent_checks=0 >> service_reaper_frequency=10 >> auto_reschedule_checks=0 >> auto_rescheduling_interval=30 >> auto_rescheduling_window=180 >> sleep_time=0.25 >> service_check_timeout=60 >> host_check_timeout=30 >> event_handler_timeout=30 >> notification_timeout=30 >> ocsp_timeout=5 >> perfdata_timeout=5 >> retain_state_information=1 >> state_retention_file=/var/log/nagios/retention.dat >> retention_update_interval=60 >> use_retained_program_state=1 >> use_retained_scheduling_info=0 >> interval_length=60 >> use_aggressive_host_checking=0 >> execute_service_checks=1 >> accept_passive_service_checks=1 >> execute_host_checks=1 >> accept_passive_host_checks=1 >> enable_notifications=1 >> enable_event_handlers=1 >> process_performance_data=0 >> obsess_over_services=0 >> check_for_orphaned_services=0 >> check_service_freshness=1 >> service_freshness_check_interval=60 >> check_host_freshness=0 >> host_freshness_check_interval=60 >> aggregate_status_updates=1 >> status_update_interval=15 >> enable_flap_detection=0 >> low_service_flap_threshold=5.0 >> high_service_flap_threshold=20.0 >> low_host_flap_threshold=5.0 >> high_host_flap_threshold=20.0 >> date_format=us >> p1_file=/usr/bin/p1.pl >> illegal_object_name_chars=`~!$%^&*|'"<>?,()= >> illegal_macro_output_chars=`~$&|'"<> >> use_regexp_matching=0 >> use_true_regexp_matching=0 >> admin_email=nagios >> admin_pager=pagenagios >> daemon_dumps_core=0 >> >> Any help is much appreciated. >> >> Thank you, >> >> Sheeri Kritzer >> >> >> ------------------------------------------------------- >> This SF.net email is sponsored by: Splunk Inc. Do you grep through log >> files >> for problems? Stop! Download the new AJAX search engine that makes >> searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! >> http://ads.osdn.com/?ad_idv37&alloc_id865&op=click >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when >> reporting any issue. ::: Messages without supporting info will risk >> being sent to /dev/null >> > > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log > files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. ::: Messages without supporting info will risk > being sent to /dev/null > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From hvdkooij at vanderkooij.org Thu Dec 15 20:48:45 2005 From: hvdkooij at vanderkooij.org (Hugo van der Kooij) Date: Thu, 15 Dec 2005 20:48:45 +0100 (CET) Subject: Sound notification on nagios server In-Reply-To: <1134651843.9513.31.camel@localhost.localdomain> References: <1134567992.30790.17.camel@localhost.localdomain> <1134591871.16079.26.camel@localhost> <1134593725.9513.3.camel@localhost.localdomain> <43A090FB.3010504@its-lehmann.de> <1134651843.9513.31.camel@localhost.localdomain> Message-ID: On Thu, 15 Dec 2005, Marcio Merlone wrote: > root at cedro:/etc/nagios# su - nagios -s /bin/bash > nagios at cedro:~ > $ /usr/bin/aplay /usr/share/nagios/htdocs/media/critical.wav > Playing WAVE '/usr/share/nagios/htdocs/media/critical.wav' : Unsigned 8 > bit, Rate 11025 Hz, Mono > nagios at cedro:~$ > > It made a sound! But: > > root at cedro:/etc/nagios# su - nagios -c > "/usr/bin/aplay /usr/share/nagios/htdocs/media/critical.wav" > root at cedro:/etc/nagios# > > This makes nothing. Anyway, it is clear that the nagios user is able to > access /dev/dsp, since it played the sound on the first attempt. Frankly I would not be so sure. But why does it not report anything on the type of file? Seems like you need to check that first. Perhaps brush up a bit on unix filesystems and access rights. try to check that first (Hint: ls -l /usr/share/nagios/htdocs/media/critical.wav) Then check de dsp device and see if it is available and not claimed by some other process. Hugo. -- I hate duplicates. Just reply to the relevant mailinglist. hvdkooij at vanderkooij.org http://hvdkooij.xs4all.nl/ Don't meddle in the affairs of magicians, for they are subtle and quick to anger. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From hvdkooij at vanderkooij.org Thu Dec 15 20:51:34 2005 From: hvdkooij at vanderkooij.org (Hugo van der Kooij) Date: Thu, 15 Dec 2005 20:51:34 +0100 (CET) Subject: max_check_attempts ignores warnings? In-Reply-To: <5A00A48F-CC61-4CD2-913D-6BFAD37D303B@richard-group.com> References: <5A00A48F-CC61-4CD2-913D-6BFAD37D303B@richard-group.com> Message-ID: On Thu, 15 Dec 2005, Kurt Yoder wrote: > So I want check_attempts to only increment when I'm in critical > state, *not* when I'm in warning state. Is it possible for me to > configure nagios this way? It seems you only want the Critical events. Then why not remove the Warnings from your notification? Hugo. -- I hate duplicates. Just reply to the relevant mailinglist. hvdkooij at vanderkooij.org http://hvdkooij.xs4all.nl/ Don't meddle in the affairs of magicians, for they are subtle and quick to anger. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From martin.schwarz at mscsoftware.com Thu Dec 15 20:57:12 2005 From: martin.schwarz at mscsoftware.com (Martin Schwarz) Date: Thu, 15 Dec 2005 20:57:12 +0100 Subject: out of office Message-ID: I will be out of the office starting 12/14/2005 and will not return until 01/17/2006. I will have no access to my mail. In urgent cases call me on my mobile ( +491728491717 ) or contact Armin in munich ( -172 ) ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From estair at ilm.com Thu Dec 15 21:21:13 2005 From: estair at ilm.com (Eli Stair) Date: Thu, 15 Dec 2005 12:21:13 -0800 Subject: Retention File In-Reply-To: References: <43A1CB45.8050006@ilm.com> Message-ID: <43A1D039.2020300@ilm.com> Did you zero the file out though? I surmise the root problem came from some level of corruption in reading/writing that file... if not present it will rebuild it from scratch as it schedules checks and gets results. I was having a problem towards the end where one particular hostgroup of ~800 systems was losing the service state after a restart on _most_ of its services but not all (those that weren't being saved always lost state on restart, those that WERE always worked). I was using retain service state (just not scheduling), so it was a partial error for this issue. /eli Patrick Rutkowski wrote: > You say that you set nagios to not use retention files at all, and it > worked. I did the same, but it still doesn't work :-( > > On 12/15/05, Eli Stair wrote: > >>You're one of a bunch of people, myself included, who I think are >>running into the same or a similar bug. See my recent email, I'm >>curious if the same workaround resolves your issue. >> >>/eli >> >>Patrick Rutkowski wrote: >> >>>I'm posting this again because I checked the archives and I think it >>>didn't go though. >>> >>>"checks_enabled = 0" is saved in my retention file. I disabled my >>>retention file and set "checks_enabled = 1" on all of my hosts in >>>hosts.cfg. I reload/restart nagios yet it insists on settings >>>"checks_enabled = 0" for all my hosts, and not scheduling pings for >>>them. What should I do? >>> >>> >>>------------------------------------------------------- >>>This SF.net email is sponsored by: Splunk Inc. Do you grep through log files >>>for problems? Stop! Download the new AJAX search engine that makes >>>searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! >>>http://ads.osdn.com/?ad_idv37&alloc_id865&op=click >>>_______________________________________________ >>>Nagios-users mailing list >>>Nagios-users at lists.sourceforge.net >>>https://lists.sourceforge.net/lists/listinfo/nagios-users >>>::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. >>>::: Messages without supporting info will risk being sent to /dev/null >>> >> >> > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From s.shipway at auckland.ac.nz Thu Dec 15 21:28:54 2005 From: s.shipway at auckland.ac.nz (Steve Shipway) Date: Fri, 16 Dec 2005 09:28:54 +1300 Subject: max_check_attempts ignores warnings? In-Reply-To: <14A270C0E2F4124AB0B7320BDBEA949F0AB3B59D@vxchange.vcr.auckland.ac.nz> References: <14A270C0E2F4124AB0B7320BDBEA949F0AB3B59D@vxchange.vcr.auckland.ac.nz> Message-ID: <004301c601b6$2b003f70$8ea8d882@438LG01sshi052> > I would like my check_attempts and max_check_attempts to behave > differently than it is now. Me too... (quick, better add more text else my net.credibility will fall for making a me-too post :) Basically, the check_attempts only resets to 0 when state is OK. I would also like it to reset to 0 when the state changes, so that the situation you mention can't happen (an immediate alert on Critical when it has been hovering in Warning for a while). However, this would cause a problem of multiple 'Warning' alerts since you would go from hard-warn to soft-critical, and if you did not them move on to hard-critical you'd drop back to soft-warn which would eventually give another warning. It would be nicer (for me) if a state change only occurred after enough check_attempts had registered for the pending state... but I think that would mean a huge code change and philosophy change for Nagios, so it's unlikely to happen :) On the other hand, a service which reaches Critical after spending some time in Warning is unlikely to leave Critical soon, so maybe an immediate alert is what we want, rather than a further wait. Steve ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From teng at dataway.com Fri Dec 16 00:18:28 2005 From: teng at dataway.com (Tedman Eng) Date: Thu, 15 Dec 2005 15:18:28 -0800 Subject: alternative text behind links Message-ID: <37ED92F9890FAF4BB947613C66FF8B1A08BB2C80@dw-mail.dataway.com> Here's how to achieve what you want: HOWTO: Add javascript to a mouseover event associated with a service. For this exercise, I'm using 1.2 (other versions may behave differently, I haven't tried) First, make sure you have extended service information configured. Extended service information allows you to set a link, an image and alt text for any service. Nagios does not check for validity of the text you set for those fields, and we'll take advantage of this behavior. Here's a normal example extinfo.cfg from the docs: -------------------------- define serviceextinfo{ host_name linux2 service_description Log Anomalies icon_image security.png icon_image_alt Security-Related Alerts } -------------------------- Modify it to look like this (notice the mismatched '' single quotes): -------------------------- define serviceextinfo{ host_name linux2 service_description Log Anomalies icon_image security.png icon_image_alt Security-Related Alerts' onmouseover='alert("Vote for Pedro") } -------------------------- What this does is closes the ALT attribute that Nagios adds, inserts our extra attributes, then Nagios adds a single quote which closes off our injected attribute. This example is merely illustrative, so you should probably call a javascript function there instead of a simple alert. The tooltip functions can be defined using the custom CGI headers feature. Hope this helps! Tedman > -----Original Message----- > From: Richard Gliebe [mailto:richard.gliebe at fhv.at] > Sent: Wednesday, December 14, 2005 11:48 PM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] alternative text behind links > > > Hi, > > maybe a cracy question. > > I want to insert/define an alternative text (MouseOver) > behind a link in > the WebGUI like this: > > [...] > Service Status Details For Host 'x.y.z' > . > OutputSource OK 12-15-2005 08:35:21 0d 0h 21m 33s 1/3 Output > Source OK - > 3 > . > [...] > > the Column "OutputSource" is a link to the "Service State Information" > > Now I want to define an explanation (MouseOver Popup) behind > this link, > what the snmpget output "3" meens for this service, like: > > INTEGERS: > other (1), > none (2), > normal (3), > bypass (4), > battery (5), > booster (6), > reducer (7) > > Thanks in advance > Richard > > -- > Richard Gliebe > Fachhochschule Vorarlberg GmbH / University for Applied Science > Information Services > > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep > through log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. > DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS > when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From g.knights at qut.edu.au Fri Dec 16 00:18:44 2005 From: g.knights at qut.edu.au (Guy Knights) Date: Fri, 16 Dec 2005 09:18:44 +1000 Subject: escalation groups? Message-ID: <43A1F9D4.2060307@qut.edu.au> I'm currently configuring an install of Nagios 1.2 to provide server monitoring for my workplace. I have the hosts, hostgroups, contacts, services, etc set up and working fine. I've tested escalations and they're working fine, but I'm trying to work out how I can set up our proposed notification schedule. Below is a brief outline of what we have already: * Multiple hosts in 3 different host groups * Multiple contacts, in 3 different contact groups * About 30 different services, based on two different templates, a medium priority template and a high priority template * Two different time periods, one for standard working hours, and one for weekend on-call hours Is it possible to set up something like a "group" of escalations, so certain services follow one escalation pattern, another set of services follows another escalation pattern, etc? The problem is that because we have so many services already, and the notifications for these services are based on templates, it looks to me like we'd have to set up a huge amount of escalations to accomplish the escalation schedule we're aiming for. Basically, we would just like to be able to choose the set of escalations used, based upon which template the service uses. Is this possible? I apologise if this is confusing to follow, I'm happy to elaborate if people aren't sure what I'm on about! Regards, Guy --------------------- Guy Knights Computer Systems Officer Queensland University of Technology Victoria Park Road Kelvin Grove QLD Ph: (07) 3864 3350 Email: g.knights at qut.edu.au ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From teng at dataway.com Fri Dec 16 00:29:51 2005 From: teng at dataway.com (Tedman Eng) Date: Thu, 15 Dec 2005 15:29:51 -0800 Subject: Retention File Message-ID: <37ED92F9890FAF4BB947613C66FF8B1A08BB2C81@dw-mail.dataway.com> Hosts are not checked unless there's a problem with one of its services. This is by design. If you want to check a host's ping service, you need to add a ping service to that host. A host definition with no services will never get checked. (think of hosts as containers) > -----Original Message----- > From: Patrick Rutkowski [mailto:rutski89 at gmail.com] > Sent: Thursday, December 15, 2005 11:33 AM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] Retention File > > > I'm posting this again because I checked the archives and I think it > didn't go though. > > "checks_enabled = 0" is saved in my retention file. I disabled my > retention file and set "checks_enabled = 1" on all of my hosts in > hosts.cfg. I reload/restart nagios yet it insists on settings > "checks_enabled = 0" for all my hosts, and not scheduling pings for > them. What should I do? > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep > through log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. > DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_idv37&alloc_id865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS > when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From engelenh at gmail.com Fri Dec 16 03:07:58 2005 From: engelenh at gmail.com (Hans Engelen) Date: Fri, 16 Dec 2005 03:07:58 +0100 Subject: Nagios and a Microsoft "cluster" In-Reply-To: <43984212.3070706@billford.com> References: <43984212.3070706@billford.com> Message-ID: <5383c62b0512151807r288aed3ak73bc223e037d8fd@mail.gmail.com> Could you provide a little more information on the issue at hand ? A short description of your setup would help. We are defenitly talking about clustering here right, not load balancing ? The double ping response issue sounds more like a load balancing issue. Cheers, Hans On 12/8/05, Bill Mathews wrote: > > We are trying to monitor (with Nagios, of course) a Microsft web server > set up in a Microsoft "clustered" environment. We have a couple of > issues, one is that when try to ping the cluster address we see > duplicate ping responses: > > 64 bytes from www.example.com (1.2.3.4): icmp_seq=3 ttl=116 time=32.5 ms > 64 bytes from www.example.com (1.2.3.4): icmp_seq=3 ttl=116 time=32.8 ms > (DUP!) > > The other issue is that routinely (a few times a day) this site shows as > down, even though it isn't (confirmed through observation). I am > somewhat unfamiliar with MS-Clustering but from the description on their > website it's a lot like any other kind of clustering (we monitor a lot > of clusters). Anyone every see anything similar? I have some theories > but none really make a lot of sense in any cluster I am familiar with. > Thanks in advance for any assistance. > > Bill > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through log > files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: From billford at billford.com Fri Dec 16 03:21:05 2005 From: billford at billford.com (Bill Mathews) Date: Thu, 15 Dec 2005 21:21:05 -0500 Subject: Nagios and a Microsoft "cluster" Message-ID: <43A22491.2050904@billford.com> Hans Engelen wrote: >Could you provide a little more information on the issue at hand ? >A short description of your setup would help. We are defenitly talking about clustering here right, >not load balancing ? The double ping response issue sounds more like a load balancing issue. Well, it's allegedly a failover cluster. I don't control the Microsoft end of it and I've been told it's a failover. The only details I have is that we're monitoring it across the Internet, along with a few hundred other machines and that site is the only one giving us this trouble. It's somewhat odd. Setup is nagios latest running on Debian stable, not sure what other setup info would help. Thanks, Bill ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From michael at halligan.org Fri Dec 16 03:46:12 2005 From: michael at halligan.org (Michael T. Halligan) Date: Thu, 15 Dec 2005 18:46:12 -0800 Subject: false "Host UP" notifications In-Reply-To: References: <87A74EC1-2CF6-439A-9CEA-009854B09559@halligan.org> <5277CCCF-2114-41FA-8051-FB45E2244BB7@halligan.org> Message-ID: -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Hugo, Thanks, this did the trick! Michael Michael T. Halligan - ------------------------------------- BitPusher, LLC http://www.bitpusher.com/ On Dec 11, 2005, at 7:17 AM, Hugo van der Kooij wrote: > On Sat, 10 Dec 2005, Michael T. Halligan wrote: > >> Nothing out of the ordinary in the logs really.. >> Just a bunch of host up hard messages, but without any corresponding >> host down messages. >> >> Does this have something to do with freshness testing, maybe? > > Disable it if you run active checks on your hosts. > > Hugo. > > -- > I hate duplicates. Just reply to the relevant mailinglist. > hvdkooij at vanderkooij.org http://hvdkooij.xs4all.nl/ > Don't meddle in the affairs of magicians, > for they are subtle and quick to anger. > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do you grep through > log files > for problems? Stop! Download the new AJAX search engine that makes > searching your log files as easy as surfing the web. DOWNLOAD > SPLUNK! > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.1 (Darwin) iD8DBQFDoip5wjCqooJyNAMRAo5RAJ9FaU5sgfB/iN8oKAsjhexv+GHpmQCcDMRB WWDLCTJhC1EkjwbZ2zMrVpA= =xDMx -----END PGP SIGNATURE----- ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From tykeal at bardicgrove.org Fri Dec 16 05:34:35 2005 From: tykeal at bardicgrove.org (Andrew Grimberg) Date: Thu, 15 Dec 2005 20:34:35 -0800 Subject: Nagios and a Microsoft "cluster" In-Reply-To: <43A22491.2050904@billford.com> References: <43A22491.2050904@billford.com> Message-ID: <1134707675.13312.9.camel@tykes-place> On Thu, 2005-12-15 at 21:21 -0500, Bill Mathews wrote: > Hans Engelen wrote: > > >Could you provide a little more information on the issue at hand ? > > >A short description of your setup would help. We are defenitly talking > about clustering here right, >not load balancing ? The double ping > response issue sounds more like a load balancing issue. > > > Well, it's allegedly a failover cluster. I don't control the Microsoft > end of it and I've been told it's a failover. The only details I have is > that we're monitoring it across the Internet, along with a few hundred > other machines and that site is the only one giving us this trouble. > It's somewhat odd. > > Setup is nagios latest running on Debian stable, not sure what other > setup info would help. Strange. I would agree with Hans that it sounds a lot like a MS NLBS (Network Load Balance Services) cluster and not a MSCS (MS Cluster Services). However, since you say your monitoring from across the Internet and not the local network the only way that I know of to get that kind of response would be to an MS NLBS on a VMware ESX or GSX server. Though, it's potentially possible to elicit that kind of response from a even from a non-local machine. They may be running a NLBS "failover" cluster and not a true MSCS. NLBS can run in a full load balance or in an active / passive (aka failover) configuration. As NLBS does some stupid packet magic to get the clustering to work it can have undesired side effects of all nodes in the cluster replying to pings. If you can get your customer to give you more information on the cluster, I bet they will tell you that it is an NLBS cluster in unicast mode. You might suggest to them that they work with their network techs to switch over to multicast mode, that should hopefully help a little with the problem. A better suggestion is for them to get a hardware load balancer instead of using Microsoft's NLBS solution. As we've discovered on our network, it causes a lot of problems on the media layer that kills some network security. -Andy- ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From hvdkooij at vanderkooij.org Fri Dec 16 08:21:55 2005 From: hvdkooij at vanderkooij.org (Hugo van der Kooij) Date: Fri, 16 Dec 2005 08:21:55 +0100 (CET) Subject: Retention File In-Reply-To: <37ED92F9890FAF4BB947613C66FF8B1A08BB2C81@dw-mail.dataway.com> References: <37ED92F9890FAF4BB947613C66FF8B1A08BB2C81@dw-mail.dataway.com> Message-ID: On Thu, 15 Dec 2005, Tedman Eng wrote: > Hosts are not checked unless there's a problem with one of its services. > This is by design. > If you want to check a host's ping service, you need to add a ping service > to that host. A host definition with no services will never get checked. > (think of hosts as containers) I not sure this still holds true for v2.0 I have some hosts without checks: Checking hosts... Warning: Host 'Internet' has no services associated with it! Warning: Host 'InternetV6' has no services associated with it! Warning: Host 'wichita.qi.nl' has no services associated with it! Warning: Host 'www.qi.nl' has no services associated with it! Checked 18 hosts. Still I receive UP and DOWN reports on them. The first two I might claim are dependencies for all non-local hosts and might be triggered by other events. But the last two are leafnodes and nothing depends on them. Hugo. -- I hate duplicates. Just reply to the relevant mailinglist. hvdkooij at vanderkooij.org http://hvdkooij.xs4all.nl/ Don't meddle in the affairs of magicians, for they are subtle and quick to anger. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nkaranuk at gmail.com Fri Dec 16 08:26:03 2005 From: nkaranuk at gmail.com (Karan uk) Date: Fri, 16 Dec 2005 12:56:03 +0530 Subject: JMX notifications->snmptrapd->snmptt->Nagios Configuation Message-ID: <33fd7fe20512152326w2ff9ef3ei2d19c598a57fb0da@mail.gmail.com> Hello, I was looking to configure Nagios so that it can accept SNMP traps(JMX notification thru snmptrapd->snmptt) and send e-mail. As per the doc's i've got snmptrapd writing the nagios.cmd file.Butconfigured Nagios seems to not react to the PROCESS_SERVICE_CHECK_RESULT entries in the command file. The configuration files i'm using are as bellow.Do let me know if i'm missing something in the conf. or the way to make nagios execute nagios.cmdentries. Nagios Version 1.3 nagios.cmd-- 1 PROCESS_SERVICE_CHECK_RESULT :winnt_host:TRAP:1:Trap received Respective conf in nagios.cfg--- check_external_commands=1 command_check_interval=1 command_file=/usr/local/nagios/var/rw/nagios.cmd --- services.cfg-- define service{ use generic-service e template to use host_name winnt_host service_description TRAP is_volatile 0 check_period 24x7 max_check_attempts 3 normal_check_interval 5 retry_check_interval 1 active_checks_enabled 0 passive_checks_enabled 1 contact_groups nt-admins notification_interval 120 notification_period 24x7 notification_options w,u,c,r check_command check-host-alive } ---------- snmptt.ini--- dns_enable = 0 strip_domain = 0 net_snmp_perl_enable = 0 translate_value_oids = 0 translate_enterprise_oid_format = 1 ---------- snmptrapd.conf------- traphandle default /usr/sbin/snmptt ------------- thanks in advance, nkaranuk -------------- next part -------------- An HTML attachment was scrubbed... URL: From Thomas.Zimmer at oppenheim.de Fri Dec 16 08:35:09 2005 From: Thomas.Zimmer at oppenheim.de (Thomas.Zimmer at oppenheim.de) Date: Fri, 16 Dec 2005 08:35:09 +0100 Subject: AW: JMX notifications->snmptrapd->snmptt->Nagios C onfiguation Message-ID: <9A834A8DCEC68648B20701E1CDE10925F84965@OKWPMXS01.de.sal-opp.net> Hi, i think you are not using the right syntax for the nagios.cmd file ("1 PROCESS_SERVICE_CHECK_RESULT :winnt_host:TRAP:1:Trap received") It should be: now=`date +%s` commandfile='/usr/local/nagios/var/rw/nagios.cmd' /bin/printf "[%lu] PROCESS_SERVICE_CHECK_RESULT;host1;service1;0;OK- Everything Looks Great\n" $now > $commandfile See http://www.nagios.org/developerinfo/externalcommands/commandlist.php for all commands. Greetz, Thomas Zimmer Produktservice & Betrieb Betrieb & Support Sal. Oppenheim jr. & Cie., Frankfurt a. Main Telefon: +49 69 7134 0 5192 Internet: http://www.oppenheim.de E-Mail: thomas.zimmer at oppenheim.de -----Urspr?ngliche Nachricht----- Von: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] Im Auftrag von Karan uk Gesendet: Freitag, 16. Dezember 2005 08:26 An: nagios-users at lists.sourceforge.net Betreff: [Nagios-users] JMX notifications->snmptrapd->snmptt->Nagios Configuation Hello, I was looking to configure Nagios so that it can accept SNMP traps(JMX notification thru snmptrapd->snmptt) and send e-mail. As per the doc's i've got snmptrapd writing the nagios.cmd file.But configured Nagios seems to not react to the PROCESS_SERVICE_CHECK_RESULT entries in the command file. The configuration files i'm using are as bellow.Do let me know if i'm missing something in the conf. or the way to make nagios execute nagios.cmd entries. Nagios Version 1.3 nagios.cmd-- 1 PROCESS_SERVICE_CHECK_RESULT :winnt_host:TRAP:1:Trap received Respective conf in nagios.cfg--- check_external_commands=1 command_check_interval=1 command_file=/usr/local/nagios/var/rw/nagios.cmd --- services.cfg-- define service{ use generic-service e template to use host_name winnt_host service_description TRAP is_volatile 0 check_period 24x7 max_check_attempts 3 normal_check_interval 5 retry_check_interval 1 active_checks_enabled 0 passive_checks_enabled 1 contact_groups nt-admins notification_interval 120 notification_period 24x7 notification_options w,u,c,r check_command check-host-alive } ---------- snmptt.ini--- dns_enable = 0 strip_domain = 0 net_snmp_perl_enable = 0 translate_value_oids = 0 translate_enterprise_oid_format = 1 ---------- snmptrapd.conf------- traphandle default /usr/sbin/snmptt ------------- thanks in advance, nkaranuk -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: Zimmer, Thomas (Produktservices und Betrieb).vcf Type: application/octet-stream Size: 254 bytes Desc: not available URL: From soumya.nambiar at satconet.com Fri Dec 16 08:48:23 2005 From: soumya.nambiar at satconet.com (Soumya Nambiar) Date: Fri, 16 Dec 2005 10:48:23 +0300 Subject: JMX notifications->snmptrapd->snmptt->Nagios Configuation In-Reply-To: <33fd7fe20512152326w2ff9ef3ei2d19c598a57fb0da@mail.gmail.com> References: <33fd7fe20512152326w2ff9ef3ei2d19c598a57fb0da@mail.gmail.com> Message-ID: <00bf01c60215$1b997b50$0702a8c0@stcl.com> Hello, The is_volatile should be set to 1. By the way did u convert the mib file u require using snmpttconvertmib.If so that line should be added to ur snmptt.ini file. Soumya Nambiar -----Original Message----- From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net]On Behalf Of Karan uk Sent: Friday, December 16, 2005 10:26 AM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] JMX notifications->snmptrapd->snmptt->Nagios Configuation Hello, I was looking to configure Nagios so that it can accept SNMP traps(JMX notification thru snmptrapd->snmptt) and send e-mail. As per the doc's i've got snmptrapd writing the nagios.cmd file.But configured Nagios seems to not react to the PROCESS_SERVICE_CHECK_RESULT entries in the command file. The configuration files i'm using are as bellow.Do let me know if i'm missing something in the conf. or the way to make nagios execute nagios.cmd entries. Nagios Version 1.3 nagios.cmd-- 1 PROCESS_SERVICE_CHECK_RESULT :winnt_host:TRAP:1:Trap received Respective conf in nagios.cfg--- check_external_commands=1 command_check_interval=1 command_file=/usr/local/nagios/var/rw/nagios.cmd --- services.cfg-- define service{ use generic-service e template to use host_name winnt_host service_description TRAP is_volatile 0 check_period 24x7 max_check_attempts 3 normal_check_interval 5 retry_check_interval 1 active_checks_enabled 0 passive_checks_enabled 1 contact_groups nt-admins notification_interval 120 notification_period 24x7 notification_options w,u,c,r check_command check-host-alive } ---------- snmptt.ini--- dns_enable = 0 strip_domain = 0 net_snmp_perl_enable = 0 translate_value_oids = 0 translate_enterprise_oid_format = 1 ---------- snmptrapd.conf------- traphandle default /usr/sbin/snmptt ------------- thanks in advance, nkaranuk -------------- next part -------------- A non-text attachment was scrubbed... Name: winmail.dat Type: application/ms-tnef Size: 4524 bytes Desc: not available URL: From prashant at mac-india.com Fri Dec 16 09:28:08 2005 From: prashant at mac-india.com (Prashant Nadgir) Date: Fri, 16 Dec 2005 08:28:08 +0000 (UTC) Subject: plugins for windows services Message-ID: <014601c602e4$1b102600$441010c1@prashant> I want to monitor different services / processes in windows 2000 server. please help me thanks in advance Prashant -------------- next part -------------- An HTML attachment was scrubbed... URL: From mls at skayser.de Fri Dec 16 09:53:15 2005 From: mls at skayser.de (Sebastian Kayser) Date: Fri, 16 Dec 2005 09:53:15 +0100 Subject: plugins for windows services In-Reply-To: <014601c602e4$1b102600$441010c1@prashant> References: <014601c602e4$1b102600$441010c1@prashant> Message-ID: <20051216085315.GA11141@skayser.homeip.net> * Prashant Nadgir wrote: > I want to monitor different services / processes in windows 2000 > server. Have you done some research on your own? How about the Nagios FAQ -> Search -> "Windows" http://nagios.org/faqs/ As always there are multiple ways of how to accomplish certain things. Some keywords which should help you getting started: NSClient, NRPE, nagios-wsc - sebastian ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From petri.niemela at epte.fi Fri Dec 16 10:27:58 2005 From: petri.niemela at epte.fi (Petri Niemela) Date: Fri, 16 Dec 2005 11:27:58 +0200 Subject: Host checks Message-ID: <43A2889E.8020108@epte.fi> Good morning, I tried to find answer from mailing list archives but was not able to find an answer. When ever any of the service checks fails nagios will check wether host is alive. This is fine, but my problem is that for now my nagios installation pings host 5 times in less that 30 seconds and then sends "host down" alert. What I would like to get is 10s delay between these pings. I would priciate if anyone could give me a pointer how to fix this. I'm using nagios 2.0B3 on OpenBSD 3.8 box. If you need any other information, just let me know.... Yours, pete ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From billford at billford.com Fri Dec 16 14:16:12 2005 From: billford at billford.com (Bill Mathews) Date: Fri, 16 Dec 2005 08:16:12 -0500 Subject: Nagios and Microsoft "cluster" Message-ID: <43A2BE1C.8070808@billford.com> Andrew Grimberg wrote: >If you can get your customer to give you more information on the >cluster, I bet they will tell you that it is an NLBS cluster in unicast >mode. You might suggest to them that they work with their network techs >to switch over to multicast mode, that should hopefully help a little >with the problem. A better suggestion is for them to get a hardware >load balancer instead of using Microsoft's NLBS solution. As we've >discovered on our network, it causes a lot of problems on the media >layer that kills some network security. Oooooh, I bet you're 100% right. We had a similar issue with firewall clustering on their network. Had to move to unicast mode to get around Cisco's issues with multicast. Thanks for your pointer. Bill ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ktynagios at richard-group.com Fri Dec 16 15:37:41 2005 From: ktynagios at richard-group.com (Kurt Yoder) Date: Fri, 16 Dec 2005 09:37:41 -0500 Subject: max_check_attempts ignores warnings? In-Reply-To: References: <5A00A48F-CC61-4CD2-913D-6BFAD37D303B@richard-group.com> Message-ID: <6E57CF07-C3FF-46C6-B255-CB8EFF99E22F@richard-group.com> On Dec 15, 2005, at 2:51 PM, Hugo van der Kooij wrote: > On Thu, 15 Dec 2005, Kurt Yoder wrote: > > >> So I want check_attempts to only increment when I'm in critical >> state, *not* when I'm in warning state. Is it possible for me to >> configure nagios this way? >> > > It seems you only want the Critical events. Then why not remove the > Warnings from your notification? > I've got notifications at "notification_options c,r", so I'm not sending notifications on warnings. I only wish this were that simple. I have some services that stay in a warning state for an extended time. Then when they go to critical, they immediately send out an alert. This is what I'd like to avoid. Is there no way to do this? -- Kurt Yoder ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ktynagios at richard-group.com Fri Dec 16 15:42:14 2005 From: ktynagios at richard-group.com (Kurt Yoder) Date: Fri, 16 Dec 2005 09:42:14 -0500 Subject: max_check_attempts ignores warnings? In-Reply-To: <004301c601b6$2b003f70$8ea8d882@438LG01sshi052> References: <004301c601b6$2b003f70$8ea8d882@438LG01sshi052> Message-ID: On Dec 15, 2005, at 3:28 PM, Steve Shipway wrote: > > On the other hand, a service which reaches Critical after spending > some time > in Warning is unlikely to leave Critical soon, so maybe an > immediate alert > is what we want, rather than a further wait. > In my case, a further wait is what I want. For example, I might have a particular service in a warning state for a while. I want to leave it as a warning so I can see that it might need attention. However, sometimes the service check might time out; then I get an immediate notification, which is not what I want. I suppose I could patch the source. Are there any resources out there on hacking nagios warnings and notifications? -- Kurt Yoder ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Samba at guidemail.com Fri Dec 16 16:17:23 2005 From: Samba at guidemail.com (Samba) Date: Fri, 16 Dec 2005 09:17:23 -0600 Subject: New Release of NagiosWeb Config on SourceForge Message-ID: For those still looking for a configuration program for Nagios that uses a MySQL backend I have recently had some time to work on a program I put out a year ago. I've made a lot of improvements to it and there are some unique features in NagiosWeb that I think everyone can use. Check out the Changelog for details. https://sourceforge.net/projects/nagiosweb/ Enjoy! Josh Konkol, CCSE CNE MCSE Technical Research Specialist .~. GuideOne Insurance /V\ /( )\ ^^-^^ samba at guidemail.com ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jungleey at yahoo.com Fri Dec 16 16:16:59 2005 From: jungleey at yahoo.com (prashanth guduru) Date: Fri, 16 Dec 2005 07:16:59 -0800 (PST) Subject: plugins for windows services In-Reply-To: <014601c602e4$1b102600$441010c1@prashant> References: <014601c602e4$1b102600$441010c1@prashant> Message-ID: <20051216151659.56760.qmail@web60923.mail.yahoo.com> Try to be very specific abt services you want to monitor. There already may be a plugin available or you could write one for yourself. Did you search at www.nagiosexchange.com? --- Prashant Nadgir wrote: > I want to monitor different services / processes in > windows 2000 server. > please help me > > thanks in advance > > Prashant > __________________________________________________ Do You Yahoo!? Tired of spam? Yahoo! Mail has the best spam protection around http://mail.yahoo.com ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jungleey at yahoo.com Fri Dec 16 16:22:16 2005 From: jungleey at yahoo.com (prashanth guduru) Date: Fri, 16 Dec 2005 07:22:16 -0800 (PST) Subject: Host checks In-Reply-To: <43A2889E.8020108@epte.fi> References: <43A2889E.8020108@epte.fi> Message-ID: <20051216152216.28948.qmail@web60915.mail.yahoo.com> The answers are right in the documentation. Goto http://nagios.sourceforge.net/docs/2_0/xodtemplate.html#host and search for the normal_check_interval and retry_check_interval. --- Petri Niemela wrote: > Good morning, > > I tried to find answer from mailing list archives > but was not able to > find an answer. > > When ever any of the service checks fails nagios > will check wether host > is alive. This is fine, but my problem is that for > now my nagios > installation pings host 5 times in less that 30 > seconds and then sends > "host down" alert. What I would like to get is 10s > delay between these > pings. > > I would priciate if anyone could give me a pointer > how to fix this. > > I'm using nagios 2.0B3 on OpenBSD 3.8 box. > > If you need any other information, just let me > know.... > > Yours, > pete > > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do > you grep through log files > for problems? Stop! Download the new AJAX search > engine that makes > searching your log files as easy as surfing the > web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version > (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being > sent to /dev/null > __________________________________________________ Do You Yahoo!? Tired of spam? Yahoo! Mail has the best spam protection around http://mail.yahoo.com ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Samba at guidemail.com Fri Dec 16 16:35:30 2005 From: Samba at guidemail.com (Samba) Date: Fri, 16 Dec 2005 09:35:30 -0600 Subject: FW: Host checks Message-ID: Actually the answer isn't in the documentation. Normal_check_interval and retry_check_interval only refer to services NOT hosts. I have the same issue when a host check is run. I have the retries set to 3 and it takes 18 seconds for the ping to time-out so that only gives me (3 x 18) = 54 seconds. I would also like to know how to set the retry interval in a HOST to something higher. Thanks, Josh -----Original Message----- From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of prashanth guduru Posted At: Friday, December 16, 2005 9:22 AM Posted To: Samba Conversation: [Nagios-users] Host checks Subject: Re: [Nagios-users] Host checks The answers are right in the documentation. Goto http://nagios.sourceforge.net/docs/2_0/xodtemplate.html#host and search for the normal_check_interval and retry_check_interval. --- Petri Niemela wrote: > Good morning, > > I tried to find answer from mailing list archives > but was not able to > find an answer. > > When ever any of the service checks fails nagios > will check wether host > is alive. This is fine, but my problem is that for > now my nagios > installation pings host 5 times in less that 30 > seconds and then sends > "host down" alert. What I would like to get is 10s > delay between these > pings. > > I would priciate if anyone could give me a pointer > how to fix this. > > I'm using nagios 2.0B3 on OpenBSD 3.8 box. > > If you need any other information, just let me > know.... > > Yours, > pete > > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do > you grep through log files > for problems? Stop! Download the new AJAX search > engine that makes > searching your log files as easy as surfing the > web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version > (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being > sent to /dev/null > __________________________________________________ Do You Yahoo!? Tired of spam? Yahoo! Mail has the best spam protection around http://mail.yahoo.com ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From dravail at cvf.fr Fri Dec 16 16:41:24 2005 From: dravail at cvf.fr (Ravail David) Date: Fri, 16 Dec 2005 16:41:24 +0100 Subject: host_event_handlers block service_event_handlers execution ? Message-ID: <43A2E024.7040308@cvf.fr> Hi, I use event_handlers to store state change into an external DB. When a host's state change from UP to DOWN : the host_event_handlers are executed but not the service_event_handlers (for services associated on this host). And When the host recovery appear : host and service event_handlers are launch. It's normal? -- David RAVAIL dravail at cvf.fr ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rcaldwell at bvh.com Fri Dec 16 16:44:59 2005 From: rcaldwell at bvh.com (rcaldwell at bvh.com) Date: Fri, 16 Dec 2005 09:44:59 -0600 Subject: This item has been released from quarantine. Message-ID: <021401c60257$abe575f0$060111ac@BVH.local> This file, which was attached to the message titled "Re: [Nagios-users] max_check_attempts ignores warnings?" by "ktynagios at richard-group.com" and was quarantined on 12/16/2005 8:38 AM, has been released. NOTE: If Auto-Protect is enabled, then this restored attachment will be rescanned during the restore. If the attachment is still infected, the current virus detection policy will apply to this attachment. -------------- next part -------------- A non-text attachment was scrubbed... Name: Message BodySYQ174cd6b40.txt Type: name="message bodysyq174cd6b40.txt"/octet-stream Size: 1397 bytes Desc: not available URL: From jlin at myvest.com Fri Dec 16 18:19:19 2005 From: jlin at myvest.com (Julie S. Lin) Date: Fri, 16 Dec 2005 09:19:19 -0800 Subject: syslog monitoring In-Reply-To: <20051215041619.404D312B1B@sc8-sf-spam2.sourceforge.net> References: <20051215041619.404D312B1B@sc8-sf-spam2.sourceforge.net> Message-ID: <43A2F717.7040308@myvest.com> Hi All, Does anyone here know if there's a plug in to nagios that will allow monitoring of syslogs and or integrates monitoring done via scripts? I'd like to integrate my monitoring. --jsl ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From hvdkooij at vanderkooij.org Fri Dec 16 18:28:39 2005 From: hvdkooij at vanderkooij.org (Hugo van der Kooij) Date: Fri, 16 Dec 2005 18:28:39 +0100 (CET) Subject: max_check_attempts ignores warnings? In-Reply-To: <6E57CF07-C3FF-46C6-B255-CB8EFF99E22F@richard-group.com> References: <5A00A48F-CC61-4CD2-913D-6BFAD37D303B@richard-group.com> <6E57CF07-C3FF-46C6-B255-CB8EFF99E22F@richard-group.com> Message-ID: On Fri, 16 Dec 2005, Kurt Yoder wrote: > > On Dec 15, 2005, at 2:51 PM, Hugo van der Kooij wrote: > > > On Thu, 15 Dec 2005, Kurt Yoder wrote: > > > >> So I want check_attempts to only increment when I'm in critical > >> state, *not* when I'm in warning state. Is it possible for me to > >> configure nagios this way? > > > > It seems you only want the Critical events. Then why not remove the > > Warnings from your notification? > > I've got notifications at "notification_options c,r", so I'm not > sending notifications on warnings. I only wish this were that simple. > > I have some services that stay in a warning state for an extended > time. Then when they go to critical, they immediately send out an > alert. This is what I'd like to avoid. > > Is there no way to do this? Use escalations for this. Just do not send out the message straigh off but instead let the escaltion take care of that. This suggestion has been handled recently on this mailinglist for a similar request. Hugo. -- I hate duplicates. Just reply to the relevant mailinglist. hvdkooij at vanderkooij.org http://hvdkooij.xs4all.nl/ Don't meddle in the affairs of magicians, for they are subtle and quick to anger. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From drussell at mpifix.com Fri Dec 16 18:29:15 2005 From: drussell at mpifix.com (Danny Russell) Date: Fri, 16 Dec 2005 10:29:15 -0700 Subject: Is this a Valid Time Period? Why wouldn't this service notify the contact groups contacts? Message-ID: <028B223BB24E5443A9784BBE1CC5EBE076A3A5@Lisa.mpifix.com> I am having some services not notifying when I think they should be, do you see anything wrong with this timeperiod or config??? define timeperiod{ timeperiod_name ShopHours alias 4:00AM to 11:30PM MST sunday 04:00-23:30 monday 04:00-23:30 tuesday 04:00-23:30 wednesday 04:00-23:30 thursday 04:00-23:30 friday 04:00-23:30 saturday 04:00-23:30 } This is the service that is going critical, but is never notifying! define service { use SMS Status service_description SMS Status check_command check_nrpe_sms_status host_name Sunrise Chevrolet contact_groups FieldOpsSupport,TechOpsSupport } define service { name SMS Status use --template ARGIS Customer Service check_command check_nrpe_sms_status register 0 contact_groups FieldOpsSupport,TechOpsSupport } define service { name --template ARGIS Customer Service max_check_attempts 5 normal_check_interval 10 retry_check_interval 3 active_checks_enabled 1 check_period 24x7 process_perf_data 1 retain_status_information 1 retain_nonstatus_information 1 notification_interval 30 notification_period ShopHours notifications_enabled 1 register 0 notification_options u,c,r servicegroups ARGIS Customer Services contact_groups FieldOpsSupport,TechOpsSupport } define contact{ contact_name drussell alias Danny Russell host_notification_period 24x7 service_notification_period 24x7 host_notification_options d,u,r service_notification_options u,c,r host_notification_commands host-notify-by-email,host-notify-by-epager service_notification_commands notify-by-email,notify-by-epager email drussell at mpifix.com pager drussell at tmomail.net } define contactgroup{ contactgroup_name TechOpsSupport alias Technical Operations Support Team Contact Group members nagios-admin,drussell,malmond,nherre,mdaley,dlott } -------------- next part -------------- An HTML attachment was scrubbed... URL: From hmann at itgroundwork.com Fri Dec 16 18:57:38 2005 From: hmann at itgroundwork.com (Harper Mann) Date: Fri, 16 Dec 2005 09:57:38 -0800 Subject: syslog monitoring In-Reply-To: <43A2F717.7040308@myvest.com> References: <20051215041619.404D312B1B@sc8-sf-spam2.sourceforge.net> Message-ID: <43A30012.3070705@itgroundwork.com> There is a file in the nagiosplug contrib called check_log2.pl. It makes a temporary file with the byte offset from the last check so it starts the new check at the old offset. For the new lines, it does regular expression matching. [hmann at sirius contrib]$ perl ./check_log2.pl -h check_log2.pl (nagios-plugins 1.4) 1.1 The nagios plugins come with ABSOLUTELY NO WARRANTY. You may redistribute copies of the plugins under the terms of the GNU General Public License. For more information about these matters, see the file named COPYING. Scan arbitrary log files for regular expression matches. Usage: check_log2.pl -l -s -p [-n ] Usage: check_log2.pl [ -v | --version ] Usage: check_log2.pl [ -h | --help ] -l, --logfile= The log file to be scanned -s, --seekfile= The temporary file to store the seek position of the last scan -p, --pattern= The regular expression to scan for in the log file -n, --negpattern= The regular expression to skip in the log file Cheers, - Harper Harper Mann Groundwork Open Source Solutions 510-599-2075 (cell) Julie S. Lin wrote: Hi All, Does anyone here know if there's a plug in to nagios that will allow monitoring of syslogs and or integrates monitoring done via scripts? I'd like to integrate my monitoring. --jsl ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! [1]http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list [2]Nagios-users at lists.sourceforge.net [3]https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null References 1. http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click 2. mailto:Nagios-users at lists.sourceforge.net 3. https://lists.sourceforge.net/lists/listinfo/nagios-users -------------- next part -------------- An HTML attachment was scrubbed... URL: From mls at skayser.de Fri Dec 16 19:06:13 2005 From: mls at skayser.de (Sebastian Kayser) Date: Fri, 16 Dec 2005 19:06:13 +0100 Subject: syslog monitoring In-Reply-To: <43A2F717.7040308@myvest.com> References: <20051215041619.404D312B1B@sc8-sf-spam2.sourceforge.net> <43A2F717.7040308@myvest.com> Message-ID: <20051216180613.GA14292@skayser.homeip.net> * Julie S. Lin wrote: > Does anyone here know if there's a plug in to nagios that will allow > monitoring of syslogs .. You can use syslog-ng in combination with SEC or swatch or something alike to monitor your logfiles and commit service check results (probably warnings or criticals) via the external command file depending on your log messages. You have to make use of passive checks for that. Passive checks and the Nagios external command file: http://nagios.sourceforge.net/docs/1_0/passivechecks.html http://nagios.sourceforge.net/docs/2_0/passivechecks.html Some info on syslog-ng, SEC and swatch: http://www.campin.net/newlogcheck.html > .. and or integrates monitoring done via scripts? I'd like to > integrate my monitoring. What do you mean by scripts? Please elaborate. Checks done by Nagios are usually scripts themselfes. If you mean scripts which are asynchronous in their nature (i.e. not triggered by Nagios, for example backups etc.) you can make use of the external command file, too. If they are run on the same host you can write directly to the external command file, if not you can use NSCA. http://nagios.sourceforge.net/docs/1_0/addons.html http://nagios.sourceforge.net/docs/2_0/addons.html#nsca - sebastian ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From kidd1270 at gmail.com Fri Dec 16 19:08:59 2005 From: kidd1270 at gmail.com (Kidd Chaos) Date: Fri, 16 Dec 2005 13:08:59 -0500 Subject: aggregate escalation notifications Message-ID: <91a354090512161008p47733959t20f7e1ef0b3f8b09@mail.gmail.com> -------------------------- Question: Does anyone know of a plugin to do aggregate escalation notifications? Or a plugin for an email or page with the total count of all problems? Thanks. -Kidd. ---------------------------- -------------- next part -------------- An HTML attachment was scrubbed... URL: From jhmartin at toger.us Fri Dec 16 19:28:10 2005 From: jhmartin at toger.us (Jason Martin) Date: Fri, 16 Dec 2005 13:28:10 -0500 Subject: syslog monitoring In-Reply-To: <43A30012.3070705@itgroundwork.com> References: <43A2F717.7040308@myvest.com> <43A30012.3070705@itgroundwork.com> Message-ID: <20051216182810.GB14076@mal.members.linode.com> On Fri, Dec 16, 2005 at 09:57:38AM -0800, Harper Mann wrote: > > There is a file in the nagiosplug contrib called check_log2.pl. It > makes a temporary file with the byte offset from the last check so it > starts the new check at the old offset. For the new lines, it does > regular expression matching. Note that the primary problem with Nagios logfile monitoring is that it can only generate 1 alarm in a given polling interval, so even though the logfile might indicate multiple problems (cpu overheat, UPS battery dead, SCSI disk failure) you can only get one of those thorugh Nagios. Also, the pattern capabilities of check_log2 are a bit limited for comprehensive syslog monitoring. -Jason Martin -- Experience is a good teacher but her fees are high... This message is PGP/MIME signed. -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 211 bytes Desc: not available URL: From Jeremy.Pavleck at capella.edu Fri Dec 16 21:53:55 2005 From: Jeremy.Pavleck at capella.edu (Pavleck, Jeremy D.) Date: Fri, 16 Dec 2005 14:53:55 -0600 Subject: NRPE 2.0 arguments question Message-ID: <0F1A5F3993566D4883C596CB56E59B9A05E562B1@mspmail02.int.capella.lan> Greetings, If I setup an nrpe.cfg check of something like check_procs -w $ARG1$ -c $ARG2$ -a $ARG3$ $ARG4$ and don't supply an $ARG4$, will it error or simple just carry on with the check? I'm trying to make some overly generic remote checks here, and something like this would be great. Thanks. Jeremy D. Pavleck Network Engineer System Management Direct Line: 612-977-5881 Toll Free: 1-888-CAPELLA ext. 5881 Fax: 612-977-5060 E-mail: jeremy.pavleck at capella.edu Capella University 225 South 6th Street, 9th Floor Minneapolis, MN 55402 www.capella.edu ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From teng at dataway.com Fri Dec 16 22:18:07 2005 From: teng at dataway.com (Tedman Eng) Date: Fri, 16 Dec 2005 13:18:07 -0800 Subject: plugins for windows services Message-ID: <37ED92F9890FAF4BB947613C66FF8B1A08BB2C84@dw-mail.dataway.com> Here is my complete configuration for windows services. Hope this helps you. define service{ name WINDOWS host_name * use generic-service service_description WINDOWS check_command check_windows } I use the check_windows plugin. This excellent plugin will check the host and return critical status if the host is running Windows, OK status otherwise. Sarcasm aside, what have you done in the way of trying to answer your own question? Did you read the FAQ's? Did you look at www.nagiosexchange.org ? Did you try Google? Did you try? -----Original Message----- From: Prashant Nadgir [mailto:prashant at mac-india.com] Sent: Saturday, December 17, 2005 12:30 AM To: nagios-users Subject: [Nagios-users] plugins for windows services I want to monitor different services / processes in windows 2000 server. please help me thanks in advance Prashant ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From petri.niemela at epte.fi Fri Dec 16 23:08:07 2005 From: petri.niemela at epte.fi (Petri Niemela) Date: Sat, 17 Dec 2005 00:08:07 +0200 Subject: plugins for windows services In-Reply-To: <37ED92F9890FAF4BB947613C66FF8B1A08BB2C84@dw-mail.dataway.com> References: <37ED92F9890FAF4BB947613C66FF8B1A08BB2C84@dw-mail.dataway.com> Message-ID: <43A33AC7.7060100@epte.fi> I have found a SNMP based plugin that is able to monitor processes in windows servers. There is only on small problem, if application is started as service you will only see svchost.exe in process list. Is there any way, without installin software in windows bo, to be able to get also list of services? Yours, Pete Tedman Eng wrote: >Here is my complete configuration for windows services. Hope this helps >you. > >define service{ > name WINDOWS > host_name * > use generic-service > service_description WINDOWS > check_command check_windows > } > >I use the check_windows plugin. This excellent plugin will check the host >and return critical status if the host is running Windows, OK status >otherwise. > > >Sarcasm aside, what have you done in the way of trying to answer your own >question? >Did you read the FAQ's? >Did you look at www.nagiosexchange.org ? >Did you try Google? >Did you try? > > >-----Original Message----- >From: Prashant Nadgir [mailto:prashant at mac-india.com] >Sent: Saturday, December 17, 2005 12:30 AM >To: nagios-users >Subject: [Nagios-users] plugins for windows services > > >I want to monitor different services / processes in windows 2000 server. >please help me > >thanks in advance > >Prashant > > >------------------------------------------------------- >This SF.net email is sponsored by: Splunk Inc. Do you grep through log files >for problems? Stop! Download the new AJAX search engine that makes >searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! >http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click >_______________________________________________ >Nagios-users mailing list >Nagios-users at lists.sourceforge.net >https://lists.sourceforge.net/lists/listinfo/nagios-users >::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. >::: Messages without supporting info will risk being sent to /dev/null > > -- ---------------------------------------------------- Petri Niemel? EPTE Oy GSM: +358407000110 FAX: +35895487557 Email: petri.niemela at epte.fi http://epte.fi ---------------------------------------------------- ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jungleey at yahoo.com Fri Dec 16 23:31:49 2005 From: jungleey at yahoo.com (prashanth guduru) Date: Fri, 16 Dec 2005 14:31:49 -0800 (PST) Subject: FW: Host checks In-Reply-To: References: Message-ID: <20051216223149.83742.qmail@web60919.mail.yahoo.com> I am sorry....i overlooked it...i scrolled way over to the bottom where it was actually describing the service definition. Its true it doesnt talk about how you can do that in the documentation. But the documentation does say that it is going to try it until the max_attempts, so i guess it keeps running the command that many times or until the host is up. Im sure you must have thought about this but you can actually do this by writing another simple plugin that wraps around check_ping. --- Samba wrote: > Actually the answer isn't in the documentation. > Normal_check_interval > and retry_check_interval only refer to services NOT > hosts. > > I have the same issue when a host check is run. I > have the retries set > to 3 and it takes 18 seconds for the ping to > time-out so that only gives > me (3 x 18) = 54 seconds. I would also like to know > how to set the > retry interval in a HOST to something higher. > > Thanks, > > Josh > > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net > [mailto:nagios-users-admin at lists.sourceforge.net] On > Behalf Of prashanth > guduru > Posted At: Friday, December 16, 2005 9:22 AM > Posted To: Samba > Conversation: [Nagios-users] Host checks > Subject: Re: [Nagios-users] Host checks > > The answers are right in the documentation. Goto > http://nagios.sourceforge.net/docs/2_0/xodtemplate.html#host > and search for the normal_check_interval and > retry_check_interval. > > > --- Petri Niemela wrote: > > > Good morning, > > > > I tried to find answer from mailing list archives > > but was not able to > > find an answer. > > > > When ever any of the service checks fails nagios > > will check wether host > > is alive. This is fine, but my problem is that for > > now my nagios > > installation pings host 5 times in less that 30 > > seconds and then sends > > "host down" alert. What I would like to get is > 10s > > delay between these > > pings. > > > > I would priciate if anyone could give me a pointer > > how to fix this. > > > > I'm using nagios 2.0B3 on OpenBSD 3.8 box. > > > > If you need any other information, just let me > > know.... > > > > Yours, > > pete > > > > > > > > > ------------------------------------------------------- > > This SF.net email is sponsored by: Splunk Inc. Do > > you grep through log files > > for problems? Stop! Download the new AJAX search > > engine that makes > > searching your log files as easy as surfing the > > web. DOWNLOAD SPLUNK! > > > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > > _______________________________________________ > > Nagios-users mailing list > > Nagios-users at lists.sourceforge.net > > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version > > (-v) and OS when reporting any issue. > > ::: Messages without supporting info will risk > being > > sent to /dev/null > > > > > __________________________________________________ > Do You Yahoo!? > Tired of spam? Yahoo! Mail has the best spam > protection around > http://mail.yahoo.com > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do > you grep through log > files > for problems? Stop! Download the new AJAX search > engine that makes > searching your log files as easy as surfing the > web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version > (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being > sent to /dev/null > > > ------------------------------------------------------- > This SF.net email is sponsored by: Splunk Inc. Do > you grep through log files > for problems? Stop! Download the new AJAX search > engine that makes > searching your log files as easy as surfing the > web. DOWNLOAD SPLUNK! > http://ads.osdn.com/?ad_idv37&alloc_id865&op=click > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version > (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being > sent to /dev/null > __________________________________________________ Do You Yahoo!? Tired of spam? Yahoo! Mail has the best spam protection around http://mail.yahoo.com ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jason at shakabuku.org Sat Dec 17 00:43:27 2005 From: jason at shakabuku.org (Jason Bodnar) Date: Fri, 16 Dec 2005 17:43:27 -0600 Subject: How to use newline in config files Message-ID: <20051216234102.M82828@shakabuku.org> I need to have a newline as an arguement for a check_command. \n doesn't work. ^J does work. Hitting Enter and putting the rest on a separate line causes nagios to choke on the config file. How do you put include a newline in an arguement? -- Jason Bodnar jason at shakabuku.org http://www.shakabuku.org ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sgtphou at fire-eyes.org Sat Dec 17 01:44:34 2005 From: sgtphou at fire-eyes.org (fire-eyes) Date: Fri, 16 Dec 2005 19:44:34 -0500 Subject: Dependencies Message-ID: <43A35F72.4090909@fire-eyes.org> I am using 1.3, and reading about dependencies. http://nagios.sourceforge.net/docs/1_0/dependencies.html I am interested in host dependencies. When I used nagios perhaps over a year ago, I thought it was host dependencies which allowed me to get the status map drawn in such a way where you could see hosts connected "through" other hosts. This made it clear the organization and location in the network. I can't seem to make this happen, though nagios -v doesn't complain about anything at all. Is there any better documentation on this? Being dyslexic doesn't help (I can't keep straight the relation between host_name and dependent_host_name)... ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sgtphou at fire-eyes.org Sat Dec 17 02:40:07 2005 From: sgtphou at fire-eyes.org (fire-eyes) Date: Fri, 16 Dec 2005 20:40:07 -0500 Subject: Dependencies In-Reply-To: <43A35F72.4090909@fire-eyes.org> References: <43A35F72.4090909@fire-eyes.org> Message-ID: <43A36C77.1030205@fire-eyes.org> fire-eyes wrote: > I am using 1.3, and reading about dependencies. > > http://nagios.sourceforge.net/docs/1_0/dependencies.html > > I am interested in host dependencies. When I used nagios perhaps over a > year ago, I thought it was host dependencies which allowed me to get the > status map drawn in such a way where you could see hosts connected > "through" other hosts. This made it clear the organization and location > in the network. > > I can't seem to make this happen, though nagios -v doesn't complain > about anything at all. > > Is there any better documentation on this? Being dyslexic doesn't help > (I can't keep straight the relation between host_name and > dependent_host_name)... Whoops, I was thinking "parents" in the host definitions. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios-user at proy.org Sat Dec 17 10:40:09 2005 From: nagios-user at proy.org (Patrick Proy) Date: Sat, 17 Dec 2005 10:40:09 +0100 Subject: plugins for windows services In-Reply-To: <43A33AC7.7060100@epte.fi> References: <43A33AC7.7060100@epte.fi> Message-ID: <20051217094011.4C1511953BE@mail.proy.org> Hi, Try check_snmp_win.pl (http://www.manubulon.com/nagios/snmp_windows.html) : it monitors windows services by snmp using the Microsoft MIB. Patrick -----Message d'origine----- De : nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] De la part de Petri Niemela Envoy? : vendredi 16 d?cembre 2005 23:08 ? : nagios-users Objet : Re: [Nagios-users] plugins for windows services I have found a SNMP based plugin that is able to monitor processes in windows servers. There is only on small problem, if application is started as service you will only see svchost.exe in process list. Is there any way, without installin software in windows bo, to be able to get also list of services? Yours, Pete Tedman Eng wrote: >Here is my complete configuration for windows services. Hope this >helps you. > >define service{ > name WINDOWS > host_name * > use generic-service > service_description WINDOWS > check_command check_windows > } > >I use the check_windows plugin. This excellent plugin will check the >host and return critical status if the host is running Windows, OK >status otherwise. > > >Sarcasm aside, what have you done in the way of trying to answer your >own question? >Did you read the FAQ's? >Did you look at www.nagiosexchange.org ? >Did you try Google? >Did you try? > > >-----Original Message----- >From: Prashant Nadgir [mailto:prashant at mac-india.com] >Sent: Saturday, December 17, 2005 12:30 AM >To: nagios-users >Subject: [Nagios-users] plugins for windows services > > >I want to monitor different services / processes in windows 2000 server. >please help me > >thanks in advance > >Prashant > > >------------------------------------------------------- >This SF.net email is sponsored by: Splunk Inc. Do you grep through log >files for problems? Stop! Download the new AJAX search engine that >makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! >http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click >_______________________________________________ >Nagios-users mailing list >Nagios-users at lists.sourceforge.net >https://lists.sourceforge.net/lists/listinfo/nagios-users >::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. >::: Messages without supporting info will risk being sent to /dev/null > > -- ---------------------------------------------------- Petri Niemel? EPTE Oy GSM: +358407000110 FAX: +35895487557 Email: petri.niemela at epte.fi http://epte.fi ---------------------------------------------------- ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=ick _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nick at openstream.ch Sun Dec 18 08:22:57 2005 From: nick at openstream.ch (Nick Weisser) Date: Sun, 18 Dec 2005 08:22:57 +0100 Subject: Advice on Plugin Development Message-ID: <43A50E51.1060105@openstream.ch> Hi there, I need a plugin for Nagios that reboots an external server under certain circumstances (no ping response). There is no "direct connection" between the machine that Nagios runs on and the monitored server, so the reboot needs to be initiated via HTTPS with several POST parameters, i.e. a "simple" web form. I've been using Nagios for quite some time now, but I'm not sure if this can be realized with a plugin or what other approaches are possible. Any comments would be greatly appreciated. -- Regards Nick ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From roux.emmanuel at gmail.com Sun Dec 18 09:24:18 2005 From: roux.emmanuel at gmail.com (Emmanuel Roux) Date: Sun, 18 Dec 2005 09:24:18 +0100 Subject: Advice on Plugin Development In-Reply-To: <43A50E51.1060105@openstream.ch> References: <43A50E51.1060105@openstream.ch> Message-ID: <478bd3690512180024s2834df87r@mail.gmail.com> 2005/12/18, Nick Weisser : > > Hi there, > > I need a plugin for Nagios that reboots an external server under certain > circumstances (no ping response). > > There is no "direct connection" between the machine that Nagios runs on > and the monitored server, so the reboot needs to be initiated via HTTPS > with several POST parameters, i.e. a "simple" web form. > > I've been using Nagios for quite some time now, but I'm not sure if this > can be realized with a plugin or what other approaches are possible. > > Any comments would be greatly appreciated. > -- > Regards > Nick Hello, I think you can do it by using event handlers and such a perl script that use libwww-perl module to request a shutdown via a webmin system ... Regards Emmanuel -------------- next part -------------- An HTML attachment was scrubbed... URL: From mls at skayser.de Sun Dec 18 10:08:37 2005 From: mls at skayser.de (Sebastian Kayser) Date: Sun, 18 Dec 2005 10:08:37 +0100 Subject: How to use newline in config files In-Reply-To: <20051216234102.M82828@shakabuku.org> References: <20051216234102.M82828@shakabuku.org> Message-ID: <20051218090837.GA2102@skayser.homeip.net> * Jason Bodnar wrote: > I need to have a newline as an arguement for a check_command. \n doesn't work. > ^J does work. Hitting Enter and putting the rest on a separate line causes > nagios to choke on the config file. How do you put include a newline in an > arguement? You could try to write a wrapper (with bash/perl/...) for your check script which calls the relevant command with a newline. Thereafter you would define a check_command for your wrapper instead of the relevant command itself. But post some details on which command you want to be called on how you have tried it 'til now, maybe there are other ways. - sebastian ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From estair at ilm.com Sun Dec 18 10:38:41 2005 From: estair at ilm.com (Eli Stair) Date: Sun, 18 Dec 2005 01:38:41 -0800 Subject: Advice on Plugin Development Message-ID: <6E56E676C9D6A74EBC980144BC06A17D0613AE@mailbox03.lucas.alllucas.com> If you write your event handler in Perl, then you can just use HTTP::Request POST to handle the transaction that way, it's nice and clean. If you want to do it in shell you can pipe your http transactions through nc. /eli -----Original Message----- From: nagios-users-admin at lists.sourceforge.net on behalf of Nick Weisser Sent: Sat 12/17/2005 11:22 PM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] Advice on Plugin Development Hi there, I need a plugin for Nagios that reboots an external server under certain circumstances (no ping response). There is no "direct connection" between the machine that Nagios runs on and the monitored server, so the reboot needs to be initiated via HTTPS with several POST parameters, i.e. a "simple" web form. I've been using Nagios for quite some time now, but I'm not sure if this can be realized with a plugin or what other approaches are possible. Any comments would be greatly appreciated. -- Regards Nick ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -------------- next part -------------- An HTML attachment was scrubbed... URL: From benny at bennyvision.com Sun Dec 18 15:50:37 2005 From: benny at bennyvision.com (C. Bensend) Date: Sun, 18 Dec 2005 08:50:37 -0600 (CST) Subject: Advice on Plugin Development In-Reply-To: <43A50E51.1060105@openstream.ch> References: <43A50E51.1060105@openstream.ch> Message-ID: <50054.63.227.74.41.1134917437.squirrel@webmail.stinkweasel.net> > I need a plugin for Nagios that reboots an external server under certain > circumstances (no ping response). > > There is no "direct connection" between the machine that Nagios runs on > and the monitored server, so the reboot needs to be initiated via HTTPS > with several POST parameters, i.e. a "simple" web form. OK, am I missing something here? How are you going to be able to do an HTTP POST operation, if the machine isn't reachable via the network? ;) Or do you have something even more wonky going on, like ICMP starting to fail before the machine goes away? -- "NOT WORK SAFE! Extreme animated violence, language, birds, and what appears to be God with a katana." -- SA list ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sgtphou at fire-eyes.org Sun Dec 18 20:00:48 2005 From: sgtphou at fire-eyes.org (fire-eyes) Date: Sun, 18 Dec 2005 14:00:48 -0500 Subject: 1.3 Status Map Message-ID: <43A5B1E0.6010200@fire-eyes.org> On the status map for 1.3, in the circular marked up display, what does a grey background mean? I thought it meant pending. However if I select update every few seconds, it seems to randomly go between green and grey. This is confusing me. ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From amon at pro-linux.org Sun Dec 18 21:04:32 2005 From: amon at pro-linux.org (Werner Amon) Date: Sun, 18 Dec 2005 21:04:32 +0100 Subject: Queston about Notification Message-ID: <43A5C0D0.3070104@pro-linux.org> Hi, we have a timeperiod, in this period the notifications are disabled. For notification we use email and sms (via gnokii). We wish to have in this timeperiod the emailnotification but not the sms. Is this possible? greez from austria Werner PS: Sorry for my english ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From whiterabbit at mand4la.info Sun Dec 18 21:19:07 2005 From: whiterabbit at mand4la.info (Matteo Mancini) Date: Sun, 18 Dec 2005 21:19:07 +0100 Subject: nsca problem Message-ID: <43A5C43B.1020305@mand4la.info> -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Hi list I'm configuring nsca for windows eventlog monitor, but I've got a few problem with nsca I've startd nsca daemon via inetd nsca stream tcp nowait _nagios /usr/local/sbin/nsca -c /etc/nagios/nsca.cfg --inetd # netstat -an -p tcp Active Internet connections (including servers) Proto Recv-Q Send-Q Local Address Foreign Address (state) tcp 0 0 10.0.200.179.80 87.6.47.179.60681 ESTABLISHED tcp 0 0 10.0.200.179.80 87.6.47.179.52152 FIN_WAIT_2 tcp 0 0 10.0.200.179.80 87.6.47.179.63290 FIN_WAIT_2 tcp 0 0 *.5667 *.* LISTEN tcp 0 48 10.0.200.179.22 10.0.200.145.27078 ESTABLISHED tcp 0 0 *.22 *.* LISTEN tcp 0 0 *.80 *.* LISTEN tcp 0 0 127.0.0.1.587 *.* LISTEN tcp 0 0 127.0.0.1.25 *.* LISTEN BTW if I telnet port 5667, I recive a strange message: # telnet localhost 5667 Trying 127.0.0.1... Connected to localhost. Escape character is '^]'. Incorrect command line arguments supplied NSCA - Nagios Service Check Acceptor Copyright (c) 2000-2003 Ethan Galstad (nagios at nagios.org) Version: 2.4 Last Modified: 07-23-2003 License: GPL Encryption Routines: AVAILABLE Usage: -c -c [mode] Options: = Name of config file to use [mode] = Determines how NSCA should run. Valid modes: --inetd = Run as a service under inetd or xinetd --daemon = Run as a standalone multi-process daemon --single = Run as a standalone single-process daemon (default) Notes: This program is designed to accept passive check results from remote hosts that use the send_nsca utility. Can run as a service under inetd or xinetd (read the docs for info on this), or as a standalone daemon. Connection closed by foreign host. # why this message? something goes wron with my setup? thanks Matteo -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.1 (GNU/Linux) iD8DBQFDpcQt/TjXD9LUVswRAgTwAJwKUBvdZAw/7/HeHWQkVXM8hSCCcACcC8ds Gzpmh4GQEgEP/rxD1Io1YpM= =Zbw6 -----END PGP SIGNATURE----- ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sgtphou at fire-eyes.org Sun Dec 18 21:20:10 2005 From: sgtphou at fire-eyes.org (fire-eyes) Date: Sun, 18 Dec 2005 15:20:10 -0500 Subject: 1.3: Cannot ping a host, so how do I tell if it is "up"? Message-ID: <43A5C47A.7090000@fire-eyes.org> I'm using 1.3. When I define a host, the only command defined which I see I can use is check-host-alive. This uses ping to determine if it is "UP". What could I do to determine if it is up, if I can't ping it, since the host drops those? I figure I could fall back to a tcp port, however I tried check_tcp!someport but nagios doesn't like the "!someport" in hosts.cfg. Any ideas? ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From s.shipway at auckland.ac.nz Sun Dec 18 22:17:08 2005 From: s.shipway at auckland.ac.nz (Steve Shipway) Date: Mon, 19 Dec 2005 10:17:08 +1300 Subject: 1.3: Cannot ping a host, so how do I tell if it is "up"? In-Reply-To: <14A270C0E2F4124AB0B7320BDBEA949F0AB3B5D6@vxchange.vcr.auckland.ac.nz> References: <14A270C0E2F4124AB0B7320BDBEA949F0AB3B5D6@vxchange.vcr.auckland.ac.nz> Message-ID: <006a01c60418$67d02be0$0a5cd882@438LG01sshi052> > When I define a host, the only command defined which I see I > can use is check-host-alive. This uses ping to determine if > it is "UP". What could I do to determine if it is up, if I > can't ping it, since the host drops those? I have three different cases for this. Most of my hosts can have a host_check_command of check_ping. Some, behind a firewall, are set to use check_ssh instead (they run SSH so I can use this to verify the host is up). Finally, a couple of really awkward ones I just have a fake 'always up' set (they have special service checks and a host check is not needed). Remember, you can set the host_check_command on a per-host basis if you want. Steve ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From s.shipway at auckland.ac.nz Sun Dec 18 22:25:35 2005 From: s.shipway at auckland.ac.nz (Steve Shipway) Date: Mon, 19 Dec 2005 10:25:35 +1300 Subject: nsca problem In-Reply-To: <14A270C0E2F4124AB0B7320BDBEA949F0AB3B5D5@vxchange.vcr.auckland.ac.nz> References: <14A270C0E2F4124AB0B7320BDBEA949F0AB3B5D5@vxchange.vcr.auckland.ac.nz> Message-ID: <006b01c60419$9650d090$0a5cd882@438LG01sshi052> > I've startd nsca daemon via inetd > nsca stream tcp nowait _nagios /usr/local/sbin/nsca -c > /etc/nagios/nsca.cfg --inetd This syntax is wrong for inetd.conf. Assuming your user is 'nagios', it should be: nsca stream tcp nowait nagios /usr/local/sbin/nsca nsca -c /etc/nagios/nsca.cfg --inetd Note that you specify the path to the server, and then all the arguments *including argv[0]* ! This makes it look a bit odd. See your inetd.conf manual page, and look at the existing examples. Steve ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jperrin at gmail.com Sun Dec 18 22:36:47 2005 From: jperrin at gmail.com (Jim Perrin) Date: Sun, 18 Dec 2005 16:36:47 -0500 Subject: 1.3: Cannot ping a host, so how do I tell if it is "up"? In-Reply-To: <43A5C47A.7090000@fire-eyes.org> References: <43A5C47A.7090000@fire-eyes.org> Message-ID: <302ce8b50512181336k45b9c589q7af4d7e248627af0@mail.gmail.com> On 12/18/05, fire-eyes wrote: > I'm using 1.3. > > When I define a host, the only command defined which I see I can use is > check-host-alive. This uses ping to determine if it is "UP". What could > I do to determine if it is up, if I can't ping it, since the host drops > those? > > I figure I could fall back to a tcp port, however I tried > check_tcp!someport but nagios doesn't like the "!someport" in hosts.cfg. > > Any ideas? > > tcping works quite well and check_tcp works, you're probably just not phrasing the check correctly for it. -- Jim Perrin System Architect - UIT Ft Gordon & US Army Signal Center ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_idv37&alloc_id865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From whiterabbit at mand4la.info Sun Dec 18 22:37:25 2005 From: whiterabbit at mand4la.info (Matteo Mancini) Date: Sun, 18 Dec 2005 22:37:25 +0100 Subject: nsca problem In-Reply-To: <006b01c60419$9650d090$0a5cd882@438LG01sshi052> References: <006b01c60419$9650d090$0a5cd882@438LG01sshi052> Message-ID: <43A5D695.90800@mand4la.info> -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Hi Steve.. > This syntax is wrong for inetd.conf. Assuming your user is 'nagios', it > should be: > > nsca stream tcp nowait nagios /usr/local/sbin/nsca nsca -c > /etc/nagios/nsca.cfg --inetd > > Note that you specify the path to the server, and then all the arguments > *including argv[0]* ! This makes it look a bit odd. See your inetd.conf > manual page, and look at the existing examples. This is the mistake ...now I can see my test eventlog Dec 18 23:32:52 nagios nsca[12451]: SERVICE CHECK -> Host Name: '3KREMOTE', Service Description: 'Test', Return Code: '0', Output: 'This is a test message generated by NagEvLog agent' Dec 18 23:32:52 nagios nsca[12451]: End of connection... thank u much Matteo -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.1 (GNU/Linux) iD8DBQFDpdaV/TjXD9LUVswRAr+kAKCJZ98Fw7jMgONnhTsGPdsAJPWXhQCfY6zF 66MdNmLR0ouTAnoefIihXys= =qM3k -----END PGP SIGNATURE----- ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From estair at ilm.com Sun Dec 18 23:14:23 2005 From: estair at ilm.com (Eli Stair) Date: Sun, 18 Dec 2005 14:14:23 -0800 Subject: Advice on Plugin Development Message-ID: <6E56E676C9D6A74EBC980144BC06A17D0613B1@mailbox03.lucas.alllucas.com> I have absolutely no idea about the actual setup this user has, but my assumption is that they're interfacing with some OOB tool (not the IP of the box itself) i.e. power strip, management processor, STONITH interrupt etc... And as far as the (valid) concern goes of having devices on your network that can power-control hardware with little to no authentication.... well it happens. With any luck this is done over SSL and a user:pass pair is checked before any transactions. I've seen plenty of _bad_ methods, including unauthenticated plaintext HTTP/PUT, raw TCP socket access, and devices that when scanned lock up and power off everything they control :) All pretty OT for the list, sorry. /eli -----Original Message----- From: nagios-users-admin at lists.sourceforge.net on behalf of C. Bensend Sent: Sun 12/18/2005 6:50 AM To: nagios-users at lists.sourceforge.net Subject: Re: [Nagios-users] Advice on Plugin Development > I need a plugin for Nagios that reboots an external server under certain > circumstances (no ping response). > > There is no "direct connection" between the machine that Nagios runs on > and the monitored server, so the reboot needs to be initiated via HTTPS > with several POST parameters, i.e. a "simple" web form. OK, am I missing something here? How are you going to be able to do an HTTP POST operation, if the machine isn't reachable via the network? ;) Or do you have something even more wonky going on, like ICMP starting to fail before the machine goes away? -- "NOT WORK SAFE! Extreme animated violence, language, birds, and what appears to be God with a katana." -- SA list ------------------------------------------------------- This SF.net email is sponsored by: Splunk Inc. Do you grep through log files for problems? Stop! Download the new AJAX search engine that makes searching your log files as easy as surfing the web. DOWNLOAD SPLUNK! http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -------------- next part -------------- An HTML attachment was scrubbed... URL: From g.knights at qut.edu.au Mon Dec 19 05:44:19 2005 From: