From midr at stibo.com Mon Jan 4 09:54:09 2010 From: midr at stibo.com (Mirco Benjamin Drick) Date: Mon, 4 Jan 2010 09:54:09 +0100 Subject: bug in check_logfiles? existing file is reported as not found Message-ID: Hi list, Tried unsuccessfully to find a support forum for check_logfiles, maybe someone here can help me ;-) I have check_logfiles running on a win2k3 machine and it checks for a file using a date pattern (see config snipped below). This morning nagios gave an error that the logfile with todays date (ie 2009-12-30.log) did not exist - but it did, this has been working for some weeks without problems until now. Exchanging the date pattern with the actual file name gave the same result - check_logfiles claims the file did not exist. Trying with any other file (fx 2009-12-29.log or 2009-12-31.log) in the same location gives no problem. Any ideas how to debug further on this? Otherwise it looks like the problem will disappear tomorrow as my created file with tomorrows datepattern works ok. Thanks Mirco Check_logfiles.cfg: @searches = ( { logfile => 'C:\STEP2CIFileMover\logs\$CL_DATE_YYYY$-$CL_DATE_MM$-$CL_DATE_DD$.log', criticalpatterns => 'ERROR', options => 'noprotocol,perfdata,nocase,sticky=28800' }, ); ............................................................ Mirco Drick | Systems Administrator Stibo Systems A/S MASTERING Data Management T +45 89 39 11 11 www.stibosystems.com ............................................................ This e-mail is intended for the addressee only and may contain confidential information. If you are not the intended recipient, you must not copy, distribute or take any action in reliance on it. If this email is sent to you in error, please notify us immediately by telephone or by e-mail. ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From t.h.amundsen at usit.uio.no Mon Jan 4 16:43:01 2010 From: t.h.amundsen at usit.uio.no (Trond Hasle Amundsen) Date: Mon, 04 Jan 2010 16:43:01 +0100 Subject: Check_openmanage In-Reply-To: (Jack Lyons's message of "Thu, 31 Dec 2009 10:26:01 -0500") References: Message-ID: <15tvdfhg9kq.fsf@tux.uio.no> Jack Lyons writes: > I have some older 2650 that through some message with State=UKNOWN when I use > check_openmanage > > See below for output of check_openmanage -d > > Is this a hardware issue that we need to address or is this a system > configuration issue - no fan probes, no temp probes, no volt probs that could > be handled via configuration / change of check_openmanage > > I have added this to the perl code and it works, but I am having problems > compiling check_openmanage.pl on windows. (problems installings and using > PAR::Packer) > > in the $ok_errors section > | No\sfan\sprobes\sfound\son\sthis\ssystem # No battery > probes > | No\stemperature\sprobes\sfound\son\sthis\ssystem # No > battery probes > | No\svoltage\sprobes\sfound\son\sthis\ssystem # No > battery probes > > A) Could someone give me a compiled version of the check_openmanage.pl > that has the $ok_errors section in it. Yes, I could do that for you. But see below first... > B) Can we modify the --only option to include warning+ to include warning > messages and above AND ignore Unknown states? Not sure that I understand what you mean. If used, the --only option specifies exactly one component to check. For example, '--only cpu' would make the plugin only check the CPUs. All other components are ignored. No warnings about e.g. fan probes should then appear. > C) is there another way to prevent to configure the plugin for nagios from > alerting on this output. Yes. You can use the '--check' option to specify that you don't want to check these things. Example: check_openmanage --check fans=0,temp=0,voltage=0 Using the '--check' option as above will prevent check_openmanage to ever running the commands that are failing. [...] > UNKNOWN | Problem running 'omreport chassis fans': Error! No fan probes found > on this system. > UNKNOWN | Problem running 'omreport chassis temps': Error! No temperature > probes found on this system. > UNKNOWN | Problem running 'omreport chassis volts': Error! No voltage probes > found on this system. These are errors from running omreport. They indicate that something is wrong, either with the hardware or with Openmanage. I would try reinstalling Openmanage first, which may help. The 2650 is an old model, but if you still have a valid warranty you should contact Dell support about this problem. These commands should not fail like this. If all else fails, use the '--check' option as described above. Cheers, -- Trond H. Amundsen Center for Information Technology Services, University of Oslo ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From a31modela at hotmail.com Mon Jan 4 16:52:33 2010 From: a31modela at hotmail.com (steve f) Date: Mon, 4 Jan 2010 10:52:33 -0500 Subject: Notification Question Message-ID: Hello & Happy New Year, Is it possible to have Nagios notify me of a service problem once an hour AND tell me how many times it alerted during that hour time frame? For example, if I run a plugin, I don't necessarily want to have a notification every time the threshold was met but after 1 hour, send me a notification that during that hour time period, the threshold was exceeded 10 times? I know that via the notification cfg I can set the time frame for sending a notification but can I keep a running total of the number of alerts for that 1 hour timeframe? Thanks, Steve _________________________________________________________________ Hotmail: Trusted email with powerful SPAM protection. http://clk.atdmt.com/GBL/go/177141665/direct/01/ -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Eliot.Picken at wenaas.co.uk Mon Jan 4 17:04:20 2010 From: Eliot.Picken at wenaas.co.uk (Eliot.Picken at wenaas.co.uk) Date: Mon, 4 Jan 2010 16:04:20 +0000 Subject: AUTO: Eliot Picken is out of the office (returning 06/01/2010) Message-ID: I am out of the office until 06/01/2010. I am currently out of the office. Your email has not been forwarded For urgent issues, please contact Alex Lawrie on +44 (0) 1224 894 000 Best regards Eliot Note: This is an automated response to your message "Re: [Nagios-users] Check_openmanage" sent on 1/4/2010 3:43:01 PM. This is the only notification you will receive while this person is away. ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mitsuto at gmail.com Mon Jan 4 18:16:16 2010 From: mitsuto at gmail.com (Marcel) Date: Mon, 4 Jan 2010 15:16:16 -0200 Subject: Notification Question In-Reply-To: References: Message-ID: <2dfcbd1b1001040916s740928d3q927272b532bfecf2@mail.gmail.com> yeah, just set your normal_check_interval to 6 (minutes if you don't change the interval_length), and max_check_attempts to 10, then, after 60 minutes you would be notified. Or the oposite, maybe setting the check interval to 10 and the number of checks to notify to 6. That way you'll always know that in the last 6 (or 10) checks in the last 60 minutes, you had a threshold verification alert, but notifications are only sent after reaching the max_check_attempts. HTH. On Mon, Jan 4, 2010 at 1:52 PM, steve f wrote: > Hello & Happy New Year, > > Is it possible to have Nagios notify me of a service problem once an hour > AND tell me how many times it alerted during that hour time frame? > > For example, if I run a plugin, I don't necessarily want to have a > notification every time the threshold was met but after 1 hour, send me a > notification that during that hour time period, the threshold was exceeded > 10 times? > > I know that via the notification cfg I can set the time frame for sending a > notification but can I keep a running total of the number of alerts for that > 1 hour timeframe? > > Thanks, > Steve > > ------------------------------ > Hotmail: Trusted email with powerful SPAM protection. Sign up now. > > > ------------------------------------------------------------------------------ > This SF.Net email is sponsored by the Verizon Developer Community > Take advantage of Verizon's best-in-class app development support > A streamlined, 14 day to market process makes app distribution fast and > easy > Join now and get one step closer to millions of Verizon customers > http://p.sf.net/sfu/verizon-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jim at jimavery.me.uk Mon Jan 4 20:38:52 2010 From: jim at jimavery.me.uk (Jim Avery) Date: Mon, 4 Jan 2010 19:38:52 +0000 Subject: Notification Question In-Reply-To: References: Message-ID: <765d77c81001041138v75fbdd22r9249b43efca99ec5@mail.gmail.com> 2010/1/4 steve f : > Hello & Happy New Year, > > Is it possible to have Nagios notify me of a service problem once an hour > AND tell me how many times it alerted during that hour time frame? > > For example, if I run a plugin, I don't necessarily want to have a > notification every time the threshold was met but after 1 hour, send me a > notification that during that hour time period, the threshold was exceeded > 10 times? > > I know that via the notification cfg I can set the time frame for sending a > notification but can I keep a running total of the number of alerts for that > 1 hour timeframe? Out of the box, no I don't think there is a way you can do that. If you use ndoutils, I guess you could write a custom notification command script which gets the information you need by doing a SQL query of the database. > Thanks, I'm not sure you will want to thank me for this advice! The NDO schema can be a right pain. Cheers, Jim ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mlitwin at stubhub.com Tue Jan 5 01:37:20 2010 From: mlitwin at stubhub.com (Litwin, Matthew) Date: Mon, 4 Jan 2010 17:37:20 -0700 Subject: improving the 300 second resolution nagiosgraph Message-ID: <46CF557C-CAF7-45E4-BAA6-8A4266917BC9@stubhub.com> I need some assistance with nagiosgraph, specifically with how it handles RRD data. I am finding that there is a 300 second resolution limitation with nagiosgraph and how it uses rrdtool. I see the 300 second resolution clearly in the graphs themselves (regardless of how much I "zoom") which also correlates to the head of the output of 'rrdtool dump' for any of the RRD files nagiosgrah has created: 0003 300 1262647468 errors GAUGE 60 NaN NaN 0 6.0000000000e+01 0 The problem with this is that I have monitors the run every 60 seconds and the lack of precision is excessively smoothing the graphs to the point of them being useless. My question is two-fold: 1) Where is this "step" period of 300 seconds specified in nagiosgraph? 2) If I were to globally change the "step" period in nagiosgraph from 300 seconds to 60 seconds is there some way that I can keep my existing RRD data or would it become corrupted if I tried to change this? Thanks, Matthew Litwin mlitwin at stubhub.com 415.222.8475 ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lamberte at imec.be Tue Jan 5 08:11:30 2010 From: lamberte at imec.be (Lambert Emmanuel) Date: Tue, 5 Jan 2010 08:11:30 +0100 Subject: nagios always show zero load and no users logged in Message-ID: Hi, I have configured Nagios web interface on a server called ZITA. I have 2 servers that I want to monitor : wsphotonicsA and wsphotonicsB. In the web interface, the status of both servers is shown with all green. If I shutdown one of the servers, this is correctly shown. However, the load of both servers is always shown as zero and Nagios never detects the number of logged in users (it always shows zero, with the exception of 1 user that is sporadically detected). The number of processes is detected correctly. Serverload has been constantly 50% or more during the past 2 weeks, but Nagios doesn't detect it. ------------------------------------------------------------------------ Extract from nagios.log : [1260918000] CURRENT SERVICE STATE: wsphotonicsA;Current Users;OK;HARD;1;USERS O K - 0 users currently logged in [1260918000] CURRENT SERVICE STATE: wsphotonicsA;PING;OK;HARD;1;PING OK - Packet loss = 0%, RTA = 0.10 ms [1260918000] CURRENT SERVICE STATE: wsphotonicsA;Root Partition;OK;HARD;1;DISK O K - free space: / 34103 MB (71% inode=99%): [1260918000] CURRENT SERVICE STATE: wsphotonicsA;SSH;OK;HARD;1;SSH OK - OpenSSH_ 4.3 (protocol 2.0) [1260918000] CURRENT SERVICE STATE: wsphotonicsA;Swap Usage;OK;HARD;1;SWAP OK - 92% free (875 MB out of 956 MB) [1260918000] CURRENT SERVICE STATE: wsphotonicsA;Total Processes;OK;HARD;1;PROCS OK: 21 processes with STATE = RSZDT [1260918000] CURRENT SERVICE STATE: wsphotonicsB;Current Load;OK;HARD;1;OK - loa d average: 0.00, 0.00, 0.00 [1260918000] CURRENT SERVICE STATE: wsphotonicsB;Current Users;OK;HARD;1;USERS O K - 0 users currently logged in [1260918000] CURRENT SERVICE STATE: wsphotonicsB;PING;OK;HARD;1;PING OK - Packet loss = 0%, RTA = 0.11 ms [1260918000] CURRENT SERVICE STATE: wsphotonicsB;Root Partition;OK;HARD;1;DISK O K - free space: / 34103 MB (71% inode=99%): [1260918000] CURRENT SERVICE STATE: wsphotonicsB;SSH;OK;HARD;1;SSH OK - OpenSSH_ 4.3 (protocol 2.0) ------------------------------------------------------------------------------------------------------------ Here is the cfg file that I use to configure the servers : photonics at zita:~$ more /usr/local/nagios/etc/objects/wsphotonics.cfg define hostgroup { hostgroup_name calculation_servers alias CALCULATION SERVERS members wsphotonicsA, wsphotonicsB } define host { use linux-server host_name wsphotonicsA alias wsphotonicsA address 157.193.172.101 hostgroups calculation_servers max_check_attempts 5 check_command check-host-alive contact_groups admins notification_interval 2 notification_period 24x7 notification_options d,u,r } define host { use linux-server host_name wsphotonicsB alias wsphotonicsB address 157.193.172.188 hostgroups calculation_servers check_command check-host-alive max_check_attempts 5 contact_groups admins notification_interval 2 notification_period 24x7 notification_options d,u,r } ############################################################################### ############################################################################### # # SERVICE DEFINITIONS - wsphotonicsA # ############################################################################### ############################################################################### # Define a service to "ping" to wsphotonicsA define service{ use local-service ; Name of service template to use host_name wsphotonicsA service_description PING check_command check_ping!100.0,20%!500.0,60% } # Define a service to check the disk space of the root partition # on the local machine. Warning if < 20% free, critical if # < 10% free space on partition. define service{ use local-service ; Name of service template to use host_name wsphotonicsA service_description Root Partition check_command check_local_disk!20%!10%!/ } # Define a service to check the number of currently logged in # users on the local machine. Warning if > 20 users, critical # if > 50 users. define service{ use local-service ; Name of service template to use host_name wsphotonicsA service_description Current Users check_command check_local_users!20!50 } # Define a service to check the number of currently running procs # on the local machine. Warning if > 250 processes, critical if # > 400 users. define service{ use local-service ; Name of service template to use host_name wsphotonicsA service_description Total Processes check_command check_local_procs!250!400!RSZDT } # Define a service to check the load on the local machine. define service{ use local-service ; Name of service template to use host_name wsphotonicsA service_description Current Load check_command check_local_load!5.0,4.0,3.0!10.0,6.0,4. 0 } # Define a service to check the swap usage the local machine. # Critical if less than 10% of swap is free, warning if less than 20% is free define service{ use local-service ; Name of service template to use host_name wsphotonicsA service_description Swap Usage check_command check_local_swap!20!10 } # Define a service to check SSH on the local machine. define service{ use local-service ; Name of service template to use host_name wsphotonicsA service_description SSH check_command check_ssh notifications_enabled 1 } ############################################################################### ############################################################################### # # SERVICE DEFINITIONS - wsphotonicsB # ############################################################################### ############################################################################### # Define a service to "ping" to wsphotonicsB define service{ use local-service ; Name of service template to use host_name wsphotonicsB service_description PING check_command check_ping!100.0,20%!500.0,60% } # Define a service to check the disk space of the root partition # on the local machine. Warning if < 20% free, critical if # < 10% free space on partition. define service{ use local-service ; Name of service template to use host_name wsphotonicsB service_description Root Partition check_command check_local_disk!20%!10%!/ } # Define a service to check the number of currently logged in # users on the local machine. Warning if > 20 users, critical # if > 50 users. define service{ use local-service ; Name of service template to use host_name wsphotonicsB service_description Current Users check_command check_local_users!20!50 } # Define a service to check the number of currently running procs # on the local machine. Warning if > 250 processes, critical if # > 400 users. define service{ use local-service ; Name of service template to use host_name wsphotonicsB service_description Total Processes check_command check_local_procs!250!400!RSZDT } # Define a service to check the load on the local machine. define service{ use local-service ; Name of service template to use host_name wsphotonicsB service_description Current Load check_command check_local_load!5.0,4.0,3.0!10.0,6.0,4. 0 } # Define a service to check the swap usage the local machine. # Critical if less than 10% of swap is free, warning if less than 20% is free define service{ use local-service ; Name of service template to use host_name wsphotonicsB service_description Swap Usage check_command check_local_swap!20!10 } # Define a service to check SSH on the local machine. define service{ use local-service ; Name of service template to use host_name wsphotonicsB service_description SSH check_command check_ssh notifications_enabled 1 } ------------------------------------------------------------------------------------------------------------------------------------------------------------------ What could cause the fact that Nagios is not detecting serverload and logged in users? :? wbr Emmanuel Lambert ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From dermoth at aei.ca Tue Jan 5 09:37:03 2010 From: dermoth at aei.ca (Thomas Guyot-Sionnest) Date: Tue, 05 Jan 2010 03:37:03 -0500 Subject: nagios always show zero load and no users logged in In-Reply-To: References: Message-ID: <4B42FA2F.8000008@aei.ca> -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 On 05/01/10 02:11 AM, Lambert Emmanuel wrote: > Hi, > > I have configured Nagios web interface on a server called ZITA. > I have 2 servers that I want to monitor : wsphotonicsA and wsphotonicsB. > In the web interface, the status of both servers is shown with all > green. If I shutdown one of the servers, this is correctly shown. > However, the load of both servers is always shown as zero and Nagios > never detects the number of logged in users (it always shows zero, with > the exception of 1 user that is sporadically detected). The number of > processes is detected correctly. > Serverload has been constantly 50% or more during the past 2 weeks, but > Nagios doesn't detect it. All the check_local_* commands that you are running are checking the Nagios server, not the remote server. To check remote servers, you have to install/setup some way to execute checks (NRPE, check_by_ssh, SNMP) and run a check against it. The plugins will have to be installed on the servers themselves. All network-based checks (check_ping, icmp, tcp, http, smtp, ntp, mysql, etc.) can check a remote host which is specified with the -H (or - --hostname) parameter. All other checks runs locally. - -- Thomas -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.6 (GNU/Linux) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org iD8DBQFLQvov6dZ+Kt5BchYRAt2oAJ9x8+r2L0K5wT4448j30USjQ7EMEgCgszru Km5delF9oOuEypAoOhAMZP8= =HBM/ -----END PGP SIGNATURE----- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jashokda at cisco.com Tue Jan 5 10:11:11 2010 From: jashokda at cisco.com (Jatin Davey) Date: Tue, 05 Jan 2010 14:41:11 +0530 Subject: Installing Nagios on CentOS Message-ID: <4B43022F.5040204@cisco.com> Hi Can someone provide some instructions or links on how to install Nagios on CentOS distribution. Thanks Jatin ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Sebastian.Ries at dtnet.de Tue Jan 5 10:29:35 2010 From: Sebastian.Ries at dtnet.de (Sebastian Ries) Date: Tue, 05 Jan 2010 10:29:35 +0100 Subject: Installing Nagios on CentOS In-Reply-To: <4B43022F.5040204@cisco.com> References: <4B43022F.5040204@cisco.com> Message-ID: <1262683775.21193.0.camel@bofh.dtnet.de> Hi > Can someone provide some instructions or links on how to install Nagios > on CentOS distribution. One easy way ist use the EPEL repository https://fedoraproject.org/wiki/EPEL then yum install nagios does the job Regards Sebastian Ries -- ------------------------------------------------------------ DT Netsolution GmbH - Talaeckerstr. 30 - D-70437 Stuttgart Tel: +49-711-849910-36 Fax: +49-711-849910-936 WEB: http://www.dtnet.de/ email: Sebastian.Ries at dtnet.de ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jashokda at cisco.com Tue Jan 5 11:54:31 2010 From: jashokda at cisco.com (Jatin Davey) Date: Tue, 05 Jan 2010 16:24:31 +0530 Subject: Installing Nagios on CentOS In-Reply-To: <1262683775.21193.0.camel@bofh.dtnet.de> References: <4B43022F.5040204@cisco.com> <1262683775.21193.0.camel@bofh.dtnet.de> Message-ID: <4B431A67.30202@cisco.com> Thanks Sebastian. I have one question if i use the command as mentioned. Will it also install all the depadencies that are required for the functioning of a complete Nagios application. I mean will it also make the GUI of Nagios functional ? Thanks Jatin Sebastian Ries wrote: > Hi > > >> Can someone provide some instructions or links on how to install Nagios >> on CentOS distribution. >> > > One easy way ist use the EPEL repository > https://fedoraproject.org/wiki/EPEL > > then yum install nagios does the job > > Regards > Sebastian Ries > > ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Tue Jan 5 12:20:57 2010 From: nagios at flatto.net (Assaf Flatto) Date: Tue, 05 Jan 2010 11:20:57 +0000 Subject: Installing Nagios on CentOS In-Reply-To: <4B431A67.30202@cisco.com> References: <4B43022F.5040204@cisco.com> <1262683775.21193.0.camel@bofh.dtnet.de> <4B431A67.30202@cisco.com> Message-ID: <4B432099.6090804@flatto.net> Jatin Davey wrote: > Thanks Sebastian. > > I have one question if i use the command as mentioned. Will it also > install all the depadencies that are required for the functioning of a > complete Nagios application. I mean will it also make the GUI of Nagios > functional ? > > Thanks > Jatin > > Sebastian Ries wrote: > >> Hi >> >> >> >>> Can someone provide some instructions or links on how to install Nagios >>> on CentOS distribution. >>> >>> >> One easy way ist use the EPEL repository >> https://fedoraproject.org/wiki/EPEL >> >> then yum install nagios does the job >> >> Regards >> Sebastian Ries >> >> The command will install the core nagios capabilities, which include the web gui . you will still need to configure the nagios and make sure the apache service is running and responding , but to answer in short , yes! Assaf ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jashokda at cisco.com Tue Jan 5 12:32:45 2010 From: jashokda at cisco.com (Jatin Davey) Date: Tue, 05 Jan 2010 17:02:45 +0530 Subject: Installing Nagios on CentOS In-Reply-To: <4B432099.6090804@flatto.net> References: <4B43022F.5040204@cisco.com> <1262683775.21193.0.camel@bofh.dtnet.de> <4B431A67.30202@cisco.com> <4B432099.6090804@flatto.net> Message-ID: <4B43235D.5010208@cisco.com> Thanks Assaf. I am planning to install it on Fedora 8 and CentOS 5.4. I ll get help from the community if i run into any issues. Thanks Jatin Assaf Flatto wrote: > Jatin Davey wrote: > >> Thanks Sebastian. >> >> I have one question if i use the command as mentioned. Will it also >> install all the depadencies that are required for the functioning of a >> complete Nagios application. I mean will it also make the GUI of Nagios >> functional ? >> >> Thanks >> Jatin >> >> Sebastian Ries wrote: >> >> >>> Hi >>> >>> >>> >>> >>>> Can someone provide some instructions or links on how to install Nagios >>>> on CentOS distribution. >>>> >>>> >>>> >>> One easy way ist use the EPEL repository >>> https://fedoraproject.org/wiki/EPEL >>> >>> then yum install nagios does the job >>> >>> Regards >>> Sebastian Ries >>> >>> >>> > The command will install the core nagios capabilities, which include the > web gui . > > you will still need to configure the nagios and make sure the apache > service is running and responding , but to answer in short , yes! > > > Assaf > > ------------------------------------------------------------------------------ > This SF.Net email is sponsored by the Verizon Developer Community > Take advantage of Verizon's best-in-class app development support > A streamlined, 14 day to market process makes app distribution fast and easy > Join now and get one step closer to millions of Verizon customers > http://p.sf.net/sfu/verizon-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From s.felici at alpikom.it Tue Jan 5 12:15:06 2010 From: s.felici at alpikom.it (Simone Felici) Date: Tue, 05 Jan 2010 12:15:06 +0100 Subject: Installing Nagios on CentOS In-Reply-To: <4B43022F.5040204@cisco.com> References: <4B43022F.5040204@cisco.com> Message-ID: <4B431F3A.4000006@alpikom.it> Instead using rpm, I prefer to compile it. Using Fedora Quickstart line guide, it works perfectly: http://nagios.sourceforge.net/docs/3_0/quickstart-fedora.html Simon Jatin Davey ha scritto in data 05/01/2010 10.11: > Hi > > Can someone provide some instructions or links on how to install Nagios > on CentOS distribution. > > Thanks > Jatin > ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From eemerson at safecore.com Tue Jan 5 14:40:52 2010 From: eemerson at safecore.com (Eric Emerson) Date: Tue, 5 Jan 2010 08:40:52 -0500 Subject: improving the 300 second resolution nagiosgraph In-Reply-To: <46CF557C-CAF7-45E4-BAA6-8A4266917BC9@stubhub.com> References: <46CF557C-CAF7-45E4-BAA6-8A4266917BC9@stubhub.com> Message-ID: I had the opposite problem a while back (checks which only ran every 30 minutes making the grahps really choppy. You can change the heartbeat value in nagiosgraph.conf which controls the --step for rrdcreate (http://oss.oetiker.ch/rrdtool/doc/rrdcreate.en.html) This will only affect new rrdfiles created so if you want to preserve existing data you are going to need to migrate it and if you do not want this setting applied to all new rrd files you will need to change it back. Take it Easy Eric On Mon, Jan 4, 2010 at 7:37 PM, Litwin, Matthew wrote: > I need some assistance with nagiosgraph, specifically with how it handles > RRD data. > > I am finding that there is a 300 second resolution limitation with > nagiosgraph and how it uses rrdtool. I see the 300 second resolution clearly > in the graphs themselves (regardless of how much I "zoom") which also > correlates to the head of the output of 'rrdtool dump' for any of the RRD > files nagiosgrah has created: > > 0003 > 300 > 1262647468 > > > errors > GAUGE > 60 > NaN > NaN > > > 0 > 6.0000000000e+01 > 0 > > > The problem with this is that I have monitors the run every 60 seconds and > the lack of precision is excessively smoothing the graphs to the point of > them being useless. > > My question is two-fold: > 1) Where is this "step" period of 300 seconds specified in nagiosgraph? > 2) If I were to globally change the "step" period in nagiosgraph from 300 > seconds to 60 seconds is there some way that I can keep my existing RRD data > or would it become corrupted if I tried to change this? > > Thanks, > Matthew Litwin > mlitwin at stubhub.com > 415.222.8475 > > > > ------------------------------------------------------------------------------ > This SF.Net email is sponsored by the Verizon Developer Community > Take advantage of Verizon's best-in-class app development support > A streamlined, 14 day to market process makes app distribution fast and > easy > Join now and get one step closer to millions of Verizon customers > http://p.sf.net/sfu/verizon-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Tue Jan 5 14:51:14 2010 From: marc at ena.com (Marc Powell) Date: Tue, 5 Jan 2010 07:51:14 -0600 Subject: improving the 300 second resolution nagiosgraph In-Reply-To: <46CF557C-CAF7-45E4-BAA6-8A4266917BC9@stubhub.com> References: <46CF557C-CAF7-45E4-BAA6-8A4266917BC9@stubhub.com> Message-ID: <972BDA50-0B77-46AB-AED6-9B0B970FF162@ena.com> On Jan 4, 2010, at 6:37 PM, Litwin, Matthew wrote: > I need some assistance with nagiosgraph, specifically with how it handles RRD data. I don't use nagiosgraph but I have familiarity with RRDTool. > I am finding that there is a 300 second resolution limitation with nagiosgraph and how it uses rrdtool. I see the 300 second resolution clearly in the graphs themselves (regardless of how much I "zoom") which also correlates to the head of the output of 'rrdtool dump' for any of the RRD files nagiosgrah has created. > The problem with this is that I have monitors the run every 60 seconds and the lack of precision is excessively smoothing the graphs to the point of them being useless. > > My question is two-fold: > 1) Where is this "step" period of 300 seconds specified in nagiosgraph? Taking a peek at the nagiosgraph code (particularly etc/ngshared.pm), your problem is that it _isn't_ specified. nagiosgraph doesn't appear to have any code for specifying the step when it asks rrdtool to create the rrd database. Because it isn't passed a step value, rrdtool uses it's default of 300 seconds. > 2) If I were to globally change the "step" period in nagiosgraph from 300 seconds to 60 seconds is there some way that I can keep my existing RRD data or would it become corrupted if I tried to change this? If you were able to change it by adding code to support that, it would not affect the existing rrd files. The step is set at create time only and cannot be changed for an existing rrd file. You'd have to delete your existing files and recreate them. -- Marc ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Tue Jan 5 15:10:00 2010 From: marc at ena.com (Marc Powell) Date: Tue, 5 Jan 2010 08:10:00 -0600 Subject: improving the 300 second resolution nagiosgraph In-Reply-To: References: <46CF557C-CAF7-45E4-BAA6-8A4266917BC9@stubhub.com> Message-ID: <8315662B-8977-41FB-8C76-4F7EB8C7A56C@ena.com> On Jan 5, 2010, at 7:40 AM, Eric Emerson wrote: > I had the opposite problem a while back (checks which only ran every 30 minutes making the grahps really choppy. > You can change the heartbeat value in nagiosgraph.conf which controls the --step for rrdcreate This is very different functionality, but may appear to be be the same effect in your case. It absolutely does not affect the step value. Heartbeat essentially specifies how much time can be missed before a value is considered to be unknown. It's also set per datasource whereas step applies universally to the entire rrd database (which may contain multiple datasources). Think of the steps as a bunch of buckets in a line. You walk along the line placing something in each bucket (value at each step). If you skip one, rrdtool is smart enough to guess what you would have put in it based on what's in the bucket before it and the bucket after it and your consolidation functions (AVERAGE, MAX, MIN, etc). The heartbeat determines how many buckets (steps) you can skip in a row before rrdtool will g ive up on guessing what would have been in them and just assume they were empty. -- Marc ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mlitwin at stubhub.com Tue Jan 5 16:09:23 2010 From: mlitwin at stubhub.com (Litwin, Matthew) Date: Tue, 5 Jan 2010 08:09:23 -0700 Subject: improving the 300 second resolution nagiosgraph In-Reply-To: <8315662B-8977-41FB-8C76-4F7EB8C7A56C@ena.com> References: <46CF557C-CAF7-45E4-BAA6-8A4266917BC9@stubhub.com> <8315662B-8977-41FB-8C76-4F7EB8C7A56C@ena.com> Message-ID: <4C0A265C-B152-40FB-9CD5-83566D644EB8@stubhub.com> Thank you very much. This correlates with the results I am seeing. Normally this behavior is something that is beneficial for any sort of floating point arithmetic but can easily produce confusing results for monitors with (1) integer values and (2) have checks more frequent that the step or "bucket" duration. >From your explanation this is what I can surmise about my current case. A monitor is checked every 60 seconds so there are going to be 5 readings per step that would be averaged. This monitor usually flat at "0" and occasionally blips into "1". Thus, depending on how many readings per step are at "1" and how many are at "0", my RRD data at each step will predictably be 0.2 for 1 reading at "1", 0.4 for 2 readings at "1" and so forth. Do I have that right? That said, it sounds like I might want to make a plea to the author of nagiosgraph to make the step length something that can be configureable, but I might just be hitting the wall of what this tool can actually do. On Jan 5, 2010, at 6:10 AM, Marc Powell wrote: > > On Jan 5, 2010, at 7:40 AM, Eric Emerson wrote: > >> I had the opposite problem a while back (checks which only ran every 30 minutes making the grahps really choppy. >> You can change the heartbeat value in nagiosgraph.conf which controls the --step for rrdcreate > > This is very different functionality, but may appear to be be the same effect in your case. It absolutely does not affect the step value. Heartbeat essentially specifies how much time can be missed before a value is considered to be unknown. It's also set per datasource whereas step applies universally to the entire rrd database (which may contain multiple datasources). Think of the steps as a bunch of buckets in a line. You walk along the line placing something in each bucket (value at each step). If you skip one, rrdtool is smart enough to guess what you would have put in it based on what's in the bucket before it and the bucket after it and your consolidation functions (AVERAGE, MAX, MIN, etc). The heartbeat determines how many buckets (steps) you can skip in a row before rrdtool will give up on guessing what would have been in them and just assume they were empty. > > -- > Marc > > > ------------------------------------------------------------------------------ > This SF.Net email is sponsored by the Verizon Developer Community > Take advantage of Verizon's best-in-class app development support > A streamlined, 14 day to market process makes app distribution fast and easy > Join now and get one step closer to millions of Verizon customers > http://p.sf.net/sfu/verizon-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null Thanks, Matthew Litwin mlitwin at stubhub.com 415.222.8475 ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Tue Jan 5 16:57:19 2010 From: marc at ena.com (Marc Powell) Date: Tue, 5 Jan 2010 09:57:19 -0600 Subject: improving the 300 second resolution nagiosgraph In-Reply-To: <4C0A265C-B152-40FB-9CD5-83566D644EB8@stubhub.com> References: <46CF557C-CAF7-45E4-BAA6-8A4266917BC9@stubhub.com> <8315662B-8977-41FB-8C76-4F7EB8C7A56C@ena.com> <4C0A265C-B152-40FB-9CD5-83566D644EB8@stubhub.com> Message-ID: On Jan 5, 2010, at 9:09 AM, Litwin, Matthew wrote: > Thank you very much. This correlates with the results I am seeing. Normally this behavior is something that is beneficial for any sort of floating point arithmetic but can easily produce confusing results for monitors with (1) integer values and (2) have checks more frequent that the step or "bucket" duration. Integer values are a different situation as well ;) In RRDTool, *all* values are treated as rates of change, even GAUGES. RRDTool also pre-creates all the buckets that the values will fall into at very specific timestamps (multiples of step from the rrd start time or epoch if start isn't specified). If you insert your values at the exact time that corresponds to a bucket, rrdtool uses that value as is and it looks like your value was treated as a integer. If you insert at a time that is slightly before or slightly after a bucket, rrdtool will adjust your value based on the rate of change from the last insert and the exact time of the current insert. It essentially 'fudges' the value to make it fit in the correct bucket as if it were a rate. > From your explanation this is what I can surmise about my current case. A monitor is checked every 60 seconds so there are going to be 5 readings per step that would be averaged. This monitor usually flat at "0" and occasionally blips into "1". Thus, depending on how many readings per step are at "1" and how many are at "0", my RRD data at each step will predictably be 0.2 for 1 reading at "1", 0.4 for 2 readings at "1" and so forth. Do I have that right? Mostly. There's some additional fudging that goes on when you try to insert multiple values between steps, with some values being ignored I believe. I haven't cared enough about it to dig into the very specifics of it personally. > That said, it sounds like I might want to make a plea to the author of nagiosgraph to make the step length something that can be configureable, but I might just be hitting the wall of what this tool can actually do. Yes, they would need to add that support. It will introduce some end-user complexity in cases where checks happen at different intervals but playing around with heartbeat to cover the longest case would probably work. -- Marc ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From yuehung at cht.com.tw Wed Jan 6 04:19:57 2010 From: yuehung at cht.com.tw (=?Big5?B?vEKpqKps?=) Date: Wed, 06 Jan 2010 11:19:57 +0800 Subject: status vs non-status information Message-ID: <4B44015D.9020907@cht.com.tw> Hi there, For information retention, which information will be dealt as "status" and which will be dealt as "non-status"? TIA ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jashokda at cisco.com Wed Jan 6 09:55:22 2010 From: jashokda at cisco.com (Jatin Davey) Date: Wed, 06 Jan 2010 14:25:22 +0530 Subject: Installing Nagios on CentOS In-Reply-To: <4B431F3A.4000006@alpikom.it> References: <4B43022F.5040204@cisco.com> <4B431F3A.4000006@alpikom.it> Message-ID: <4B444FFA.4050201@cisco.com> Thanks all , Everything went fine and i was able to install Nagios on my linux box. Thanks Jatin Simone Felici wrote: > Instead using rpm, I prefer to compile it. > Using Fedora Quickstart line guide, it works perfectly: > > http://nagios.sourceforge.net/docs/3_0/quickstart-fedora.html > > Simon > > Jatin Davey ha scritto in data 05/01/2010 10.11: > >> Hi >> >> Can someone provide some instructions or links on how to install Nagios >> on CentOS distribution. >> >> Thanks >> Jatin >> >> > > ------------------------------------------------------------------------------ > This SF.Net email is sponsored by the Verizon Developer Community > Take advantage of Verizon's best-in-class app development support > A streamlined, 14 day to market process makes app distribution fast and easy > Join now and get one step closer to millions of Verizon customers > http://p.sf.net/sfu/verizon-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jashokda at cisco.com Wed Jan 6 12:11:41 2010 From: jashokda at cisco.com (Jatin Davey) Date: Wed, 06 Jan 2010 16:41:41 +0530 Subject: Not able to get the newly added host on the Nagios UI Message-ID: <4B446FED.5010405@cisco.com> Hi I am a new user of Nagios. I installed the Nagios core , Plugins and the NRPE addon based on the instructions mentioned in the quick start guide. As of the NRPE addon i installed it using the document that was provided with the tar file and everything went fine. Similarly i installed the NRPE plug in and NRPE daemon on the machine that i wanted to monitor. On my Nagios box when i issue the following command: /usr/local/nagios/libexec/check_nrpe -H 192.168.0.1 my remote host responds with the NRPE version correctly. Also i could not see any errors in the cfg file by issuing the command: /usr/local/nagios/bin/nagios -v /usr/local/nagios/etc/nagios.cfg Upon re-starting the Nagios service i could see the following: [root at pc-fedora-1 objects]# service nagios restart Running configuration check...done. Stopping nagios: .done. Starting nagios:su: warning: cannot change directory to /var/spool/nagios: No such file or directory This account is currently not available. done. Upon opening the UI i am still not able to see any hosts added. I have also added the command and input service files needed to contact the remote host. Please help. Thanks Jatin ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jashokda at cisco.com Wed Jan 6 13:53:26 2010 From: jashokda at cisco.com (Jatin Davey) Date: Wed, 06 Jan 2010 18:23:26 +0530 Subject: Not able to get the newly added host on the Nagios UI In-Reply-To: <4B446FED.5010405@cisco.com> References: <4B446FED.5010405@cisco.com> Message-ID: <4B4487C6.7050901@cisco.com> Got it , Added the configuration file location in the main configuration file titled as: nagios.cfg Thanks Jatin Jatin Davey wrote: > Hi > > I am a new user of Nagios. I installed the Nagios core , Plugins and the > NRPE addon based on the instructions mentioned in the quick start guide. > As of the NRPE addon i installed it using the document that was provided > with the tar file and everything went fine. Similarly i installed the > NRPE plug in and NRPE daemon on the machine that i wanted to monitor. > > On my Nagios box when i issue the following command: > > /usr/local/nagios/libexec/check_nrpe -H 192.168.0.1 > > my remote host responds with the NRPE version correctly. Also i could > not see any errors in the cfg file by issuing the command: > > /usr/local/nagios/bin/nagios -v /usr/local/nagios/etc/nagios.cfg > > > Upon re-starting the Nagios service i could see the following: > > [root at pc-fedora-1 objects]# service nagios restart > Running configuration check...done. > Stopping nagios: .done. > Starting nagios:su: warning: cannot change directory to > /var/spool/nagios: No such file or directory > This account is currently not available. > done. > > Upon opening the UI i am still not able to see any hosts added. I have > also added the command and input service files needed to contact the > remote host. > > Please help. > > Thanks > Jatin > > ------------------------------------------------------------------------------ > This SF.Net email is sponsored by the Verizon Developer Community > Take advantage of Verizon's best-in-class app development support > A streamlined, 14 day to market process makes app distribution fast and easy > Join now and get one step closer to millions of Verizon customers > http://p.sf.net/sfu/verizon-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jashokda at cisco.com Wed Jan 6 14:27:17 2010 From: jashokda at cisco.com (Jatin Davey) Date: Wed, 06 Jan 2010 18:57:17 +0530 Subject: Getting parameters and plotting them in terms of % (Percentage) Message-ID: <4B448FB5.4060105@cisco.com> Hi all I am seeing that graphs getting shown for the corresponding service that i define but they are shown in terms of only 4 points , either critical , ok , warning or indeterminate. I need to see the graphs in terms of utilization values such as: CPU -- % used (out of 100%) Memory -- Total memory (Used and free memory in terms of MB or KB etc) Disk Usage -- Used Disk space and free disk space (In terms of MB or GB etc) Network I/O -- Packets sent or received etc Please provide any pointers on how get this using Nagios. Thanks Jatin ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Wed Jan 6 14:49:38 2010 From: marc at ena.com (Marc Powell) Date: Wed, 6 Jan 2010 07:49:38 -0600 Subject: Getting parameters and plotting them in terms of % (Percentage) In-Reply-To: <4B448FB5.4060105@cisco.com> References: <4B448FB5.4060105@cisco.com> Message-ID: <8381699D-B31B-4EFD-8EE9-EA302237F32D@ena.com> On Jan 6, 2010, at 7:27 AM, Jatin Davey wrote: > I need to see the graphs in terms of utilization values such as: > > CPU -- % used (out of 100%) > Memory -- Total memory (Used and free memory in terms of MB or KB etc) > Disk Usage -- Used Disk space and free disk space (In terms of MB or GB etc) > Network I/O -- Packets sent or received etc > > Please provide any pointers on how get this using Nagios. Nagios doesn't support this directly but there are addons that provide this functionality. PNP4Nagios, NagiosGraph, etc. Look for them at http://exchange.nagios.org. -- Marc ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From penguinhead at gmail.com Wed Jan 6 15:10:44 2010 From: penguinhead at gmail.com (Pankaj) Date: Wed, 6 Jan 2010 19:40:44 +0530 Subject: nsca results don't reach nagios Message-ID: <312b9fd1001060610uf07cca8l7ba36c52283fcb60@mail.gmail.com> Hi, I have configured a nagios and nsca server, When I run the service check I get the following message in syslog: >Jan 6 09:06:58 cl-t036-222cl nsca[31340]: Connection from 10.1.233.10 port 48077 >Jan 6 09:06:58 cl-t036-222cl nsca[31340]: Handling the connection... >Jan 6 09:06:58 cl-t036-222cl nsca[31340]: SERVICE CHECK -> Host Name: 'localhost', Service Description: 'TestMessage', Return Code: '1', Output: 'Service in warning state'command_file=/var/lib/nagios3/rw/nagios.cmd >Jan 6 09:06:58 cl-t036-222cl nsca[31340]: End of connection... The results reach the command_file=/var/lib/nagios3/rw/nagios.cmd > cat /var/lib/nagios3/rw/nagios.cmd >[1262786818] PROCESS_SERVICE_CHECK_RESULT;localhost;TestMessage;1;Service in warning state However the results of my passive checks do not reach nagios. The nagios error log files say nothing about the check. What could I be doing wrong? Here are my conf files: >>>>>>>>nagios.cfg cat nagios.cfg | grep passive | grep -v '#' log_passive_checks=1 accept_passive_service_checks=1 accept_passive_host_checks=1 translate_passive_host_checks=0 passive_host_checks_are_soft=0 command_file=/var/lib/nagios3/rw/nagios.cmd >>>>>>>>nsca.cfg cat /etc/nsca.cfg | grep -v '#' pid_file=/var/run/nsca.pid server_port=5667 server_address=10.1.233.52 nsca_user=nagios nsca_group=nogroup debug=1 command_file=/var/lib/nagios3/rw/nagios.cmd alternate_dump_file=/var/run/nagios/nsca.dump aggregate_writes=0 append_to_file=0 max_packet_age=30 decryption_method=1 >>>>>>> localhost_nagios2.cfg define host{ use generic-host ; Name of host template to use host_name localhost alias localhost address 127.0.0.1 } define command{ command_name check_dummy command_line $USER1$/check_dummy $ARG1$ } define service{ use generic-service name passive_service active_checks_enabled 0 passive_checks_enabled 1 # We want only passive checking flap_detection_enabled 0 register 0 # This is a template, not a real service is_volatile 0 check_period 24x7 max_check_attempts 1 normal_check_interval 5 retry_check_interval 1 check_freshness 0 contact_groups admins check_command check_dummy!0 notification_interval 120 notification_period 24x7 notification_options w,u,c,r stalking_options w,c,u } define service{ use passive_service service_description TestMessage host_name localhost } >>>>>>>>>>>>> The service check #!/bin/sh HOST=localhost SERVICE="TestMessage" NSCAHOST=10.1.233.52 echo -e "$HOST\t$SERVICE\t1\tService in warning state" | /usr/sbin/send_nsca -H $NSCAHOST -c /etc/send_nsca.cfg exit 0 -- ( 2b || ! 2b ) -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Wed Jan 6 16:01:49 2010 From: marc at ena.com (Marc Powell) Date: Wed, 6 Jan 2010 09:01:49 -0600 Subject: nsca results don't reach nagios In-Reply-To: <312b9fd1001060610uf07cca8l7ba36c52283fcb60@mail.gmail.com> References: <312b9fd1001060610uf07cca8l7ba36c52283fcb60@mail.gmail.com> Message-ID: On Jan 6, 2010, at 8:10 AM, Pankaj wrote: > > cat /var/lib/nagios3/rw/nagios.cmd > >[1262786818] PROCESS_SERVICE_CHECK_RESULT;localhost;TestMessage;1;Service in warning state By doing this, you've removed it from the pipe and there's no possibility for nagios to see it at all. > However the results of my passive checks do not reach nagios. The nagios error log files say nothing about the check. What could I be doing wrong? > > Here are my conf files: > > >>>>>>>>nagios.cfg > > cat nagios.cfg | grep passive | grep -v '#' > log_passive_checks=1 > accept_passive_service_checks=1 > accept_passive_host_checks=1 > translate_passive_host_checks=0 > passive_host_checks_are_soft=0 > command_file=/var/lib/nagios3/rw/nagios.cmd value of check_external_commands? ls -l /var/lib/nagios3/rw/nagios.cmd? -- Marc ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Wed Jan 6 17:38:55 2010 From: marc at ena.com (Marc Powell) Date: Wed, 6 Jan 2010 10:38:55 -0600 Subject: nsca results don't reach nagios In-Reply-To: <312b9fd1001060750v1f3b9451w74d05220940a2b99@mail.gmail.com> References: <312b9fd1001060610uf07cca8l7ba36c52283fcb60@mail.gmail.com> <312b9fd1001060750v1f3b9451w74d05220940a2b99@mail.gmail.com> Message-ID: <2C31AEAF-9AFC-43E3-A503-EA15634879F4@ena.com> Hi Pankaj. Please always respond on-list so that others learn and benefit from your experience. More below -- On Jan 6, 2010, at 9:50 AM, Pankaj wrote: > On Wed, Jan 6, 2010 at 8:31 PM, Marc Powell wrote: > > On Jan 6, 2010, at 8:10 AM, Pankaj wrote: > > > > cat /var/lib/nagios3/rw/nagios.cmd > > >[1262786818] PROCESS_SERVICE_CHECK_RESULT;localhost;TestMessage;1;Service in warning state > > By doing this, you've removed it from the pipe and there's no possibility for nagios to see it at all. > > This file did not exist ... so I touched it. Does it need to be a named pipe? Yes and must be created by nagios. I suspected this was your problem. Remove the file you've created. > value of check_external_commands? > > log_external_commands=1 That's not what I asked for ;) check_external_commands must be set to 1. I'll bet it's 0. Enabling this causes nagios to create the external command pipe (nagios.cmd) and watch for input there (including passive check results which are external commands). -- Marc ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Wed Jan 6 17:51:57 2010 From: marc at ena.com (Marc Powell) Date: Wed, 6 Jan 2010 10:51:57 -0600 Subject: status vs non-status information In-Reply-To: <4B44015D.9020907@cht.com.tw> References: <4B44015D.9020907@cht.com.tw> Message-ID: <7D579EA1-E031-4891-AE44-8A08B676D291@ena.com> On Jan 5, 2010, at 9:19 PM, ??? wrote: > > Hi there, > > For information retention, which information will be dealt as "status" > and which will be dealt as "non-status"? It's not really documented outside of the code (there are lots of parameters for each). Using xdata/xrddefault.c as a reference (it's pretty easy to read/understand) -- Hosts: Status data - things like check type, current state, last hard state, performance data, plugin output, next check, current attempt, etc (xrddefault.c ~line 1175) Non-Status data - things like acknowledgement status, active/passive checks enabled, check command, check period, etc (xrddefault.c ~line 1263) Services: Pretty much the same as above. Status data starting about line 1446, non-status starting about line 1538 Contacts: Status data - last host/service notification (line 1721) Non-Status data - things like host/service notification periods, host/service notifications enabled, etc (line 1729) Search within xrddefault.c for 'retain_status_information' and 'retain_nonstatus_information and look at those blocks. -- Marc ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From dit.dash at gmail.com Wed Jan 6 18:23:23 2010 From: dit.dash at gmail.com (dave stern - e-mail.pluribus.unum) Date: Wed, 6 Jan 2010 12:23:23 -0500 Subject: nagios output as clickable link Message-ID: <7f62d2421001060923gc0ec9f8o20be0f0d03b3e6ad@mail.gmail.com> This may be stretching nagios beyond its designed intention but... I'm writing a plugin that could produce many many lines of output. Since this obviously wont fit on a status page, what I'd like to do is if the output is not exit-status=0, produce a webpage and in the status information field of the service, have a clickable link to transport you to that page. Has anyone successfully tried this before? TIA ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Wed Jan 6 18:38:35 2010 From: marc at ena.com (Marc Powell) Date: Wed, 6 Jan 2010 11:38:35 -0600 Subject: nagios output as clickable link In-Reply-To: <7f62d2421001060923gc0ec9f8o20be0f0d03b3e6ad@mail.gmail.com> References: <7f62d2421001060923gc0ec9f8o20be0f0d03b3e6ad@mail.gmail.com> Message-ID: On Jan 6, 2010, at 11:23 AM, dave stern - e-mail.pluribus.unum wrote: > This may be stretching nagios beyond its designed intention but... > > I'm writing a plugin that could produce many many lines of output. > Since this obviously wont fit on a status page, what I'd like to do > is if the output is not exit-status=0, produce a webpage and in the > status information field of the service, have a clickable link to transport > you to that page. Has anyone successfully tried this before? Yes, there are people doing it. The following post notes a gotcha... http://www.mail-archive.com/nagios-users at lists.sourceforge.net/msg27845.html (appears to have been a home-brew like yours). More generally -- http://nagiosplugins.org/man/urlize -- Marc ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mitsuto at gmail.com Wed Jan 6 18:47:49 2010 From: mitsuto at gmail.com (Marcel) Date: Wed, 6 Jan 2010 15:47:49 -0200 Subject: DRBD for status.dat, nagios.cmd, objects.cache? Message-ID: <2dfcbd1b1001060947ka78cf7cu76407dbf699f3884@mail.gmail.com> Anyone has any advice about this kind of HA strategy? I'm thinking about using drbd for nagios-HA environment, where nagios.cmd, status.dat and objects.cache would be in drbd partition and master/stand-by scripts for keeping nagios running at all times when master fails and stand-by took over. TIA, Marcel -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Wed Jan 6 19:11:04 2010 From: marc at ena.com (Marc Powell) Date: Wed, 6 Jan 2010 12:11:04 -0600 Subject: DRBD for status.dat, nagios.cmd, objects.cache? In-Reply-To: <2dfcbd1b1001060947ka78cf7cu76407dbf699f3884@mail.gmail.com> References: <2dfcbd1b1001060947ka78cf7cu76407dbf699f3884@mail.gmail.com> Message-ID: On Jan 6, 2010, at 11:47 AM, Marcel wrote: > Anyone has any advice about this kind of HA strategy? > > I'm thinking about using drbd for nagios-HA environment, where nagios.cmd, status.dat and objects.cache would be in drbd partition and master/stand-by scripts for keeping nagios running at all times when master fails and stand-by took over. Just some random thoughts -- I'd be very surprised if DRBD supported named pipes properly (nagios.cmd). I don't see how it can. status.dat and objects.cache are only read by the CGI's. Nagios itself keeps everything it needs to know in memory during runtime. Neither of the files are read by the nagios daemon so are irrelevant for HA. You should be more interested in the retention files as nagios on the failover box can read those on start to get up to speed, so to speak. People are using Nagios + Heartbeat/DRBD successfully; Google says so at least. This seems interesting for example. It doesn't speak about implementation but some gotchas that he experienced -- http://www.drachen-server.de/archives/25-IO-tuning-for-Nagios.html -- Marc ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From benny at bennyvision.com Wed Jan 6 19:31:54 2010 From: benny at bennyvision.com (C. Bensend) Date: Wed, 6 Jan 2010 12:31:54 -0600 Subject: Checking IIS sites when there is a default "outage" page Message-ID: <63365268b9db77d120509239618fa65b.squirrel@webmail.stinkweasel.net> Hey folks, So, I'm running into an interesting scenario here at work, while setting up hosts for the web application guys. I have a number of IIS-hosted web sites that require NTLM authentication, which complicates matters. I found one plugin (check_http_ntlm.pl) that can at least pass the NTLM authentication, but it seems to know nothing of virtual hosts. So, if ANY of the sites are up, it happily answers OK to them all. :( To further complicate matters, they have a default outage page set up, so if a web site is stopped, the request will be passed along to IIS' default page (ie, I never get an error, just a redirect). So... The stock check_http won't work (can't do NTLM), the plugin I found won't work (knows nothing of individual sites), and I have yet to get webinject working with NTLM (pointers/ examples graciously accepted). Any thoughts on how to monitor an IIS site requiring NTLM authentication when a default page is configured for an "out of service" page? Thanks much! Benny -- "It's not all about getting up and putting four slices of kickass in a two slice toaster." -- ark86, on Fazed.net ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Wed Jan 6 19:57:16 2010 From: marc at ena.com (Marc Powell) Date: Wed, 6 Jan 2010 12:57:16 -0600 Subject: Checking IIS sites when there is a default "outage" page In-Reply-To: <63365268b9db77d120509239618fa65b.squirrel@webmail.stinkweasel.net> References: <63365268b9db77d120509239618fa65b.squirrel@webmail.stinkweasel.net> Message-ID: <8FC191CA-2ECA-46EA-B9D7-DC029A76FC7A@ena.com> On Jan 6, 2010, at 12:31 PM, C. Bensend wrote: > Any thoughts on how to monitor an IIS site requiring NTLM > authentication when a default page is configured for an "out of > service" page? I'm not aware of anything out-of-the-box but I do know of one that's close (but haven't used it). I also know that curl supports NTLM so if that's used as the basis of a plugin, you're a good way there. This PHP based plugin appears to have the features of check_http you need, but without the NTLM auth -- http://exchange.nagios.org/directory/Plugins/Websites%2C-Forms-and-Transactions/CURL-check-script/details Since it's PHP and PHP Curl is pretty well documented, I think that it wouldn't be too difficult to add in the authentication bit (two lines in the curl_setopt() section I believe, if hard-coded). The two options I think you'd need are -- http://www.php.net/manual/en/function.curl-setopt.php CURLOPT_USERPWD 'username:password' CURLOPT_HTTPAUTH 'CURLAUTH_NTLM' It may be more complicated than that but it's where I'd start if I were doing it. -- Marc ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From israel at frontierflying.com Thu Jan 7 00:18:03 2010 From: israel at frontierflying.com (Israel Brewster) Date: Wed, 6 Jan 2010 14:18:03 -0900 Subject: Override interval_length for specific service only Message-ID: <75B50ACF-789B-4F69-8E92-6857B5450260@frontierflying.com> I was wondering if there was any way with nagios 3.2 to override the global interval_length for a specific service? I have one service that I would like to check every 30 seconds, but interval_length in 60. I could change the interval_length definition, of course, but then I would have to go back through all my definitions and change the interval values I used there. This may not be too bad, as I make heavy use of templating, but it would be preferable to simply override this for the one service. ----------------------------------------------- Israel Brewster Computer Support Technician II Frontier Flying Service Inc. 5245 Airport Industrial Rd Fairbanks, AK 99709 (907) 450-7250 x293 ----------------------------------------------- -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: Israel Brewster.vcf Type: text/directory Size: 417 bytes Desc: not available URL: -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From benny at bennyvision.com Thu Jan 7 03:24:12 2010 From: benny at bennyvision.com (C. Bensend) Date: Wed, 6 Jan 2010 20:24:12 -0600 Subject: Checking IIS sites when there is a default "outage" page In-Reply-To: <8FC191CA-2ECA-46EA-B9D7-DC029A76FC7A@ena.com> References: <63365268b9db77d120509239618fa65b.squirrel@webmail.stinkweasel.net> <8FC191CA-2ECA-46EA-B9D7-DC029A76FC7A@ena.com> Message-ID: <6f7ab08bdacff73cb645c59ff7a982b0.squirrel@webmail.stinkweasel.net> >> Any thoughts on how to monitor an IIS site requiring NTLM >> authentication when a default page is configured for an "out of >> service" page? > > I'm not aware of anything out-of-the-box but I do know of one that's close > (but haven't used it). I also know that curl supports NTLM so if that's > used as the basis of a plugin, you're a good way there. This PHP based > plugin appears to have the features of check_http you need, but without > the NTLM auth -- > > http://exchange.nagios.org/directory/Plugins/Websites%2C-Forms-and-Transactions/CURL-check-script/details > > Since it's PHP and PHP Curl is pretty well documented, I think that it > wouldn't be too difficult to add in the authentication bit (two lines in > the curl_setopt() section I believe, if hard-coded). The two options I > think you'd need are -- > > http://www.php.net/manual/en/function.curl-setopt.php > > CURLOPT_USERPWD 'username:password' > CURLOPT_HTTPAUTH 'CURLAUTH_NTLM' > > It may be more complicated than that but it's where I'd start if I were > doing it. Hmmmm... Yeah, that might be a good start. If it turns out to be a pain to modify the existing plugins I'm looking at, I'll have to write one I guess. Thanks for the links, Marc, it is much appreciated. (hints/tips/examples/pointers on webinject tests still very much appreciated, too - I have some web apps to log into and test) Benny -- "It's not all about getting up and putting four slices of kickass in a two slice toaster." -- ark86, on Fazed.net ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From penguinhead at gmail.com Thu Jan 7 04:08:43 2010 From: penguinhead at gmail.com (Pankaj) Date: Thu, 7 Jan 2010 08:38:43 +0530 Subject: nsca results don't reach nagios In-Reply-To: <2C31AEAF-9AFC-43E3-A503-EA15634879F4@ena.com> References: <312b9fd1001060610uf07cca8l7ba36c52283fcb60@mail.gmail.com> <312b9fd1001060750v1f3b9451w74d05220940a2b99@mail.gmail.com> <2C31AEAF-9AFC-43E3-A503-EA15634879F4@ena.com> Message-ID: <312b9fd1001061908q3dabdcc9w84305a864210683e@mail.gmail.com> On Wed, Jan 6, 2010 at 10:08 PM, Marc Powell wrote: > Hi Pankaj. Please always respond on-list so that others learn and benefit > from your experience. More below -- > Thanks. > > value of check_external_commands? > > > > log_external_commands=1 > > That's not what I asked for ;) check_external_commands must be set to 1. > I'll bet it's 0. Enabling this causes nagios to create the external command > pipe (nagios.cmd) and watch for input there (including passive check results > which are external commands). > > Yes That's right. removing the file and setting check_external_commands to 1 solved the problem. Thank you very much. Cheers! P. -- ( 2b || ! 2b ) -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From anthony at xmanhosting.com.au Thu Jan 7 05:44:21 2010 From: anthony at xmanhosting.com.au (xmanhosting) Date: Thu, 7 Jan 2010 12:44:21 +0800 Subject: incorporating CoreTemp into Nagios Message-ID: <002701ca8f54$171557e0$9930fea9@rockinghshs.internal> Hello, I have my cpu temp (coretemp) monitored by on my server by the installation of rpm -Uvh http://elrepo.org/elrepo-release-0.1-1.el5.elrepo.noarch.rpm Running 'sensors' from the terminal produces the output: coretemp-isa-0000 Adapter: ISA adapter Core 0: +48?C (high = +100?C) coretemp-isa-0001 Adapter: ISA adapter Core 1: +48?C (high = +100?C) coretemp-isa-0002 Adapter: ISA adapter Core 2: +49?C (high = +100?C) coretemp-isa-0003 Adapter: ISA adapter Core 3: +49?C (high = +100?C) Does anyone have any idea of how to bring the cpu temp from above into Nagios? Or is there a tutorial available that someone knows of, of how to do this? Thanks for any help, Kind regards, Anthony -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From richard.gliebe at fhv.at Thu Jan 7 10:23:24 2010 From: richard.gliebe at fhv.at (Richard Gliebe) Date: Thu, 07 Jan 2010 10:23:24 +0100 Subject: Cisco Aironet 1200 AccessPoints In-Reply-To: <6db4a4200912310802g452f6249t7b8626fbc5529bea@mail.gmail.com> References: <4B3C55D3.1080808@fhv.at> <6db4a4200912310802g452f6249t7b8626fbc5529bea@mail.gmail.com> Message-ID: <4B45A80C.9010909@fhv.at> On 12/31/09 5:02 PM shadih rahman wrote: > Richard, Hi, > you can look at this page and see which mib contains the information you > are looking for. Thanks > > ftp://ftp-sj.cisco.com/pub/mibs/supportlists/c1200/c1200-non-ios-supportlist.html thanks for the mib link. EXAMPLE: OID .1.3.6.1.2.1.2.2.1.16.1 shows me the ifOutOctets from Interface Dot11Radio0 in summary since the last reboot or other resets .... (hope I'm right and you know what I mean ;-)) ..... IF-MIB::ifOutOctets.1 = Counter32: 1384984421 ..... this output is always growing and not a real time statistic. I want to monitor maybe the 5 minute input and output rates from the Dot11Radio0 interface like the 'show interface Dot11Radio0' output. ..... 5 minute input rate 0 bits/sec, 0 packets/sec 5 minute output rate 1236000 bits/sec, 124 packets/sec ..... Or something similar like mrtg outputs (input an output rates in realtime). I can't find the right OID, so does anyone know these OIDs? Or do I have to calculate it by myself? thanks in advance. Richard ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Matthias.Flacke at gmx.de Thu Jan 7 12:37:56 2010 From: Matthias.Flacke at gmx.de (Matthias.Flacke at gmx.de) Date: Thu, 7 Jan 2010 12:37:56 +0100 (MET) Subject: Override interval_length for specific service only Message-ID: <201001071137.o07BbuSg007065@post.webmailer.de> > I was wondering if there was any way with nagios 3.2 to override the global interval_length > for a specific service? I have one service that I would like to check every 30 seconds, but > interval_length in 60. I could change the interval_length definition, of course, but then I > would have to go back through all my definitions and change the interval values I used there. > This may not be too bad, as I make heavy use of templating, but it would be preferable to > simply override this for the one service. In Nagios3 you can specify fraction numbers for the check_interval. So with the standard interval_length of 60 seconds a check_interval of 0.5 would configure a 30 seconds interval for your particular check. -Matthias ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jpratt at norwich.edu Thu Jan 7 14:14:20 2010 From: jpratt at norwich.edu (James Pratt) Date: Thu, 7 Jan 2010 08:14:20 -0500 Subject: incorporating CoreTemp into Nagios In-Reply-To: <002701ca8f54$171557e0$9930fea9@rockinghshs.internal> References: <002701ca8f54$171557e0$9930fea9@rockinghshs.internal> Message-ID: <24B6509E4191AF44B60A24EAA3B4AD493E6968@nuexchg.norwich.edu> Sure, that should not be too difficult - google around for "Nagios wrapper" or nagios-plugins to see some example wrapper/plugin scripts. Here's one that monitors hdtemp I found real quick that should show you how it works easy enough - http://linsec.ca/blog/2008/02/09/hddtemp-wrapper-for-nagios/ cheers, james >> -----Original Message----- >> From: xmanhosting [mailto:anthony at xmanhosting.com.au] >> Sent: Wednesday, January 06, 2010 11:44 PM >> To: nagios-users at lists.sourceforge.net >> Subject: [Nagios-users] incorporating CoreTemp into Nagios >> >> Hello, >> >> I have my cpu temp (coretemp) monitored by on my server by the installation of >> >> rpm -Uvh http://elrepo.org/elrepo-release-0.1-1.el5.elrepo.noarch.rpm >> >> Running 'sensors' from the terminal produces the output: >> >> coretemp-isa-0000 >> Adapter: ISA adapter >> Core 0: +48?C (high = +100?C) >> >> coretemp-isa-0001 >> Adapter: ISA adapter >> Core 1: +48?C (high = +100?C) >> >> coretemp-isa-0002 >> Adapter: ISA adapter >> Core 2: +49?C (high = +100?C) >> >> coretemp-isa-0003 >> Adapter: ISA adapter >> Core 3: +49?C (high = +100?C) >> >> Does anyone have any idea of how to bring the cpu temp from above into Nagios? >> >> Or is there a tutorial available that someone knows of, of how to do this? >> >> Thanks for any help, >> >> Kind regards, >> Anthony ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jashokda at cisco.com Thu Jan 7 14:25:58 2010 From: jashokda at cisco.com (Jatin Davey) Date: Thu, 07 Jan 2010 18:55:58 +0530 Subject: Getting parameters and plotting them in terms of % (Percentage) In-Reply-To: <8381699D-B31B-4EFD-8EE9-EA302237F32D@ena.com> References: <4B448FB5.4060105@cisco.com> <8381699D-B31B-4EFD-8EE9-EA302237F32D@ena.com> Message-ID: <4B45E0E6.3000305@cisco.com> Hi Marc Can you elaborate on what you mean by Nagios does not support it directly. My understanding is that that the check_nrpe process does not have such provisions to get the data in the manner that i have asked for. Please correct me if i am wrong. Also i checked PNP4Nagios , it has support for monitoring on linux boxes , Do we have anything similar for monitoring windows boxes as well from nagios. I checked NRPE and it uses the check_nt process to get the data from windows boxes. Does it show graphs similar to what PNP4nagios does. Thanks for your help. Thanks Jatin Marc Powell wrote: > On Jan 6, 2010, at 7:27 AM, Jatin Davey wrote: > > >> I need to see the graphs in terms of utilization values such as: >> >> CPU -- % used (out of 100%) >> Memory -- Total memory (Used and free memory in terms of MB or KB etc) >> Disk Usage -- Used Disk space and free disk space (In terms of MB or GB etc) >> Network I/O -- Packets sent or received etc >> >> Please provide any pointers on how get this using Nagios. >> > > > Nagios doesn't support this directly but there are addons that provide this functionality. PNP4Nagios, NagiosGraph, etc. Look for them at http://exchange.nagios.org. > > -- > Marc > ------------------------------------------------------------------------------ > This SF.Net email is sponsored by the Verizon Developer Community > Take advantage of Verizon's best-in-class app development support > A streamlined, 14 day to market process makes app distribution fast and easy > Join now and get one step closer to millions of Verizon customers > http://p.sf.net/sfu/verizon-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Thu Jan 7 14:45:24 2010 From: marc at ena.com (Marc Powell) Date: Thu, 7 Jan 2010 07:45:24 -0600 Subject: incorporating CoreTemp into Nagios In-Reply-To: <002701ca8f54$171557e0$9930fea9@rockinghshs.internal> References: <002701ca8f54$171557e0$9930fea9@rockinghshs.internal> Message-ID: On Jan 6, 2010, at 10:44 PM, xmanhosting wrote: > Hello, > > I have my cpu temp (coretemp) monitored by on my server by the installation of > > rpm -Uvh http://elrepo.org/elrepo-release-0.1-1.el5.elrepo.noarch.rpm This is just a repository setup RPM. It doesn't tell us what subsequent package you've installed to provide the 'sensors' command. > Running 'sensors' from the terminal produces the output: > > coretemp-isa-0000 > Adapter: ISA adapter > Core 0: +48?C (high = +100?C) > Does anyone have any idea of how to bring the cpu temp from above into Nagios? If this is from the lm_sensors package, there are a number of already made plugins to work with it (I haven't used any of them)... check_sensors included in the official plugins distribution or http://exchange.nagios.org/index.php?option=com_mtree&task=search&Itemid=74&searchword=lm_sensors or http://exchange.nagios.org/index.php?option=com_mtree&task=search&Itemid=74&searchword=check_sensors or Google -- Marc ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Thu Jan 7 15:04:22 2010 From: marc at ena.com (Marc Powell) Date: Thu, 7 Jan 2010 08:04:22 -0600 Subject: Cisco Aironet 1200 AccessPoints In-Reply-To: <4B45A80C.9010909@fhv.at> References: <4B3C55D3.1080808@fhv.at> <6db4a4200912310802g452f6249t7b8626fbc5529bea@mail.gmail.com> <4B45A80C.9010909@fhv.at> Message-ID: On Jan 7, 2010, at 3:23 AM, Richard Gliebe wrote: > IF-MIB::ifOutOctets.1 = Counter32: 1384984421 > ..... > this output is always growing and not a real time statistic. Sure it's real time. It's total octets sent from reboot until the moment you've read it. ;) > I want to monitor maybe the 5 minute input and output rates from the > Dot11Radio0 interface like the 'show interface Dot11Radio0' output. > > ..... > 5 minute input rate 0 bits/sec, 0 packets/sec > 5 minute output rate 1236000 bits/sec, 124 packets/sec > ..... > > Or something similar like mrtg outputs (input an output rates in realtime). > > I can't find the right OID, so does anyone know these OIDs? There isn't one; it's not exposed by SNMP. ifOutOctets above is all you have to work with. MRTG is performing the calculation itself when the value from ifOutOctets/ifInOctets is retrieved/stored. > Or do I have to calculate it by myself? Yes. You'll want to be cognizant of counter re-sets, counter roll-overs and the fact that the OID index for that specific interface may change on reboot so that you don't get strange results. You'll need to read current value, compare with saved previous value/timestamp, save current value/timestamp, repeat at next run. You could also look for an existing plugin that suits your needs instead. You're not the first to want to do this. look for check_traffic check_snmp_cisco_traffic check_iftraffic.pl Check http://exchange.nagios.org or google for combinations of nagios check snmp, traffic, cisco, interface. -- Marc ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Thu Jan 7 15:22:08 2010 From: marc at ena.com (Marc Powell) Date: Thu, 7 Jan 2010 08:22:08 -0600 Subject: Getting parameters and plotting them in terms of % (Percentage) In-Reply-To: <4B45E0E6.3000305@cisco.com> References: <4B448FB5.4060105@cisco.com> <8381699D-B31B-4EFD-8EE9-EA302237F32D@ena.com> <4B45E0E6.3000305@cisco.com> Message-ID: On Jan 7, 2010, at 7:25 AM, Jatin Davey wrote: > Hi Marc > > Can you elaborate on what you mean by Nagios does not support it directly. Nagios only cares about state: OK, WARNING, CRITICAL, UNKNOWN, UP, DOWN. That's all it looks at. The human-readable plugin output is just for humans, nagios doesn't pay any attention to it. Each plugin may also provide Performance Data that can be passed to an addon for further processing. The performance data varies by plugin but usually contains the information you're wanting to graph. http://nagios.sourceforge.net/docs/3_0/perfdata.html might help clarify... > My understanding is that that the check_nrpe process does not have such provisions to get the data in the manner that i have asked for. Please correct me if i am wrong. check_nrpe supports the passing of performance data... The key is whether the plugin you are executing provides it and whether you have an addon that looks for it. > Also i checked PNP4Nagios , it has support for monitoring on linux boxes , Do we have anything similar for monitoring windows boxes as well from nagios. I checked NRPE and it uses the check_nt process to get the data from windows boxes. Does it show graphs similar to what PNP4nagios does. check_nt does not because it is not a graphing add-on in the same way that check_nrpe isn't. You're confused about what each tool does. check_nrpe and check_nt are just transports. They establish a connection to a remote box, ask the remote box to run a plugin and they then return that output to nagios for further processing. I don't use any of the graphing addons so my knowledge of them is secondhand. I also monitor very few windows boxes so I don't know if the nagios tools on them support providing performance data but I would be surprised if one or more of them didn't. I would also be surprised if one or more of the graphing addons didn't process performance data received from Windows systems. Best I can tell you is to Google, read the docs for the various graphing addons and/or wait for further input that may come from others on this list who use those packages... -- Marc ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Thu Jan 7 17:17:45 2010 From: marc at ena.com (Marc Powell) Date: Thu, 7 Jan 2010 10:17:45 -0600 Subject: [Nagiosplug-help] check_interval threshold In-Reply-To: <32A8F6BFE24F664C882D73D305663B8102911EFA@MLNYC20MB051.amrs.win.ml.com> References: <32A8F6BFE24F664C882D73D305663B8102911EED@MLNYC20MB051.amrs.win.ml.com> <32A8F6BFE24F664C882D73D305663B8102911EFA@MLNYC20MB051.amrs.win.ml.com> Message-ID: Please always respond on-list so that others now, and in the future, learn from your experience. More below... On Jan 7, 2010, at 10:05 AM, Khan, Tony (A&E) wrote: > Thank you so much for your help Marc. > > My Service definition setting looks like this, should I just add ' > interval_length 1' under check_period? interval_length is set in nagios.cfg and affects *all* intervals defined in *all* host{}, service{}, contact{} definitions, etc. See my prior e-mail about this. For those intervals that you do not want to change, you'll need to increase their values relative to how much you decreased the interval_length. interval_length is the basic unit for all time based operations in nagios and is measured in seconds. For example, if interval_length = 60 and you have 'normal_check_interval 1' in a service definition, that service will be checked every 60 seconds (interval_length * normal_check_interval). If you set interval_length = 1 and you have 'normal_check_interval 1', that service will now be checked every 1 second. To check it every 60 seconds, you'd need to change it to 'normal_check_interval 60' If you set interval_length = 1 and you want to check a service every 5 minutes, you'll need to set 'normal_check_interval 300' (5 minutes == 300 seconds) -- Marc ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From pitchfork at ederdrom.de Thu Jan 7 17:58:30 2010 From: pitchfork at ederdrom.de (Joerg Linge) Date: Thu, 07 Jan 2010 17:58:30 +0100 Subject: Getting parameters and plotting them in terms of % (Percentage) In-Reply-To: <4B45E0E6.3000305@cisco.com> References: <4B448FB5.4060105@cisco.com> <8381699D-B31B-4EFD-8EE9-EA302237F32D@ena.com> <4B45E0E6.3000305@cisco.com> Message-ID: <4B4612B6.4040406@ederdrom.de> Am 07.01.10 14:25, schrieb Jatin Davey: > Hi Marc > > Can you elaborate on what you mean by Nagios does not support it > directly. My understanding is that that the check_nrpe process does not > have such provisions to get the data in the manner that i have asked > for. Please correct me if i am wrong. > > Also i checked PNP4Nagios , it has support for monitoring on linux boxes PNP4Nagios can create graphs based on performance data provided by nagios plugins. PNP is not limited to linux hosts. http://docs.pnp4nagios.org/pnp-0.6/about Joerg ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From israel at frontierflying.com Thu Jan 7 18:09:33 2010 From: israel at frontierflying.com (Israel Brewster) Date: Thu, 7 Jan 2010 08:09:33 -0900 Subject: Override interval_length for specific service only In-Reply-To: <201001071137.o07BbuSg007065@post.webmailer.de> References: <201001071137.o07BbuSg007065@post.webmailer.de> Message-ID: <64E4D5AF-EC2E-4F27-B15F-4FE46B3A270B@frontierflying.com> On Jan 7, 2010, at 2:37 AM, Matthias.Flacke at gmx.de wrote: > >> I was wondering if there was any way with nagios 3.2 to override >> the global interval_length >> for a specific service? I have one service that I would like to >> check every 30 seconds, but >> interval_length in 60. I could change the interval_length >> definition, of course, but then I >> would have to go back through all my definitions and change the >> interval values I used there. >> This may not be too bad, as I make heavy use of templating, but it >> would be preferable to >> simply override this for the one service. > > In Nagios3 you can specify fraction numbers for the check_interval. > > So with the standard interval_length of 60 seconds a check_interval > of 0.5 would configure > a 30 seconds interval for your particular check. Perfect, Thanks :-) > > -Matthias ----------------------------------------------- Israel Brewster Computer Support Technician II Frontier Flying Service Inc. 5245 Airport Industrial Rd Fairbanks, AK 99709 (907) 450-7250 x293 ----------------------------------------------- -------------- next part -------------- A non-text attachment was scrubbed... Name: Israel Brewster.vcf Type: text/directory Size: 417 bytes Desc: not available URL: -------------- next part -------------- -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jack1729 at hotmail.com Thu Jan 7 18:08:56 2010 From: jack1729 at hotmail.com (Jack Lyons) Date: Thu, 7 Jan 2010 12:08:56 -0500 Subject: Check_openmanage In-Reply-To: <15tvdfhg9kq.fsf@tux.uio.no> References: , <15tvdfhg9kq.fsf@tux.uio.no> Message-ID: I tried using the blacklisting syntax and that never seemed to work....your command fits my needs perfectly, We have 82 2650's and are trying to phase them out, but 10 of the 18 I installed had problems -makes me wonder if there is something wrong with openamange 5.5 on that version. Since we have those servers under 3rd party warranty so we will pursue with them that open manage is having problems Just another feature request.....I would like to make the host name an html clickable link to the https://servername:1311 Open Manage web interface. I thought about wrapping the open manage check to do it, but seemed kind of kludgy. Thanks Jack > From: t.h.amundsen at usit.uio.no > To: jack1729 at hotmail.com > CC: nagios-users at lists.sourceforge.net > Subject: Re: Check_openmanage > Date: Mon, 4 Jan 2010 16:43:01 +0100 > > Jack Lyons writes: > > > I have some older 2650 that through some message with State=UKNOWN when I use > > check_openmanage > > > > See below for output of check_openmanage -d > > > > Is this a hardware issue that we need to address or is this a system > > configuration issue - no fan probes, no temp probes, no volt probs that could > > be handled via configuration / change of check_openmanage > > > > I have added this to the perl code and it works, but I am having problems > > compiling check_openmanage.pl on windows. (problems installings and using > > PAR::Packer) > > > > in the $ok_errors section > > | No\sfan\sprobes\sfound\son\sthis\ssystem # No battery > > probes > > | No\stemperature\sprobes\sfound\son\sthis\ssystem # No > > battery probes > > | No\svoltage\sprobes\sfound\son\sthis\ssystem # No > > battery probes > > > > > A) Could someone give me a compiled version of the check_openmanage.pl > > that has the $ok_errors section in it. > > Yes, I could do that for you. But see below first... > > > B) Can we modify the --only option to include warning+ to include warning > > messages and above AND ignore Unknown states? > > Not sure that I understand what you mean. If used, the --only option > specifies exactly one component to check. For example, '--only cpu' > would make the plugin only check the CPUs. All other components are > ignored. No warnings about e.g. fan probes should then appear. > > > C) is there another way to prevent to configure the plugin for nagios from > > alerting on this output. > > Yes. You can use the '--check' option to specify that you don't want to > check these things. Example: > > check_openmanage --check fans=0,temp=0,voltage=0 > > Using the '--check' option as above will prevent check_openmanage to > ever running the commands that are failing. > > [...] > > UNKNOWN | Problem running 'omreport chassis fans': Error! No fan probes found > > on this system. > > UNKNOWN | Problem running 'omreport chassis temps': Error! No temperature > > probes found on this system. > > UNKNOWN | Problem running 'omreport chassis volts': Error! No voltage probes > > found on this system. > > These are errors from running omreport. They indicate that something is > wrong, either with the hardware or with Openmanage. I would try > reinstalling Openmanage first, which may help. The 2650 is an old model, > but if you still have a valid warranty you should contact Dell support > about this problem. These commands should not fail like this. If all > else fails, use the '--check' option as described above. > > Cheers, > -- > Trond H. Amundsen > Center for Information Technology Services, University of Oslo -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Thu Jan 7 18:50:55 2010 From: marc at ena.com (Marc Powell) Date: Thu, 7 Jan 2010 11:50:55 -0600 Subject: your old post: RE: Restricting user to monitor single host In-Reply-To: References: Message-ID: Hi Nobuo, Please always send your questions to the list and not to me directly... More below -- On Jan 7, 2010, at 10:45 AM, Matsushita, Nobuo wrote: > Hi Marc, > > I am trying this restriction on Nagios V3.x. According your post on March 8, 2005: > > > I can't seem to find where this little nugget is actually stated in the > > documentation or the FAQ any longer. The quick synopsis is -- > > > > Enable CGI authorization as above. Create a contact for this person. If > > they don't need notifications, set the service_notification_options and > > host_notification_options to 'n'. Add them to the contact_group for the > > host (and possibly as a contact for each service). Create an .htpasswd > > user account using the same username as defined in the contact > > definition. > > When I check the Nagios documentation on contact_group.cfg, the "contactgroup" definition does not have "host". Is this changed on new released? or am I doing a wrong thing? 2005 would have been an old release and I'm not quite sure of the context of this post. It seems to be talking about simple authorization configuration though. In the current release, contactgroups are specified within the host{} definition itself with the 'contact_groups' parameter. If it's just a single person, you don't need a group and could just use the 'contacts' parameter -- http://nagios.sourceforge.net/docs/3_0/objectdefinitions.html#host This documentation may also be useful depending on what you are trying to to -- http://nagios.sourceforge.net/docs/3_0/cgiauth.html -- Marc ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From t.h.amundsen at usit.uio.no Thu Jan 7 19:39:29 2010 From: t.h.amundsen at usit.uio.no (Trond Hasle Amundsen) Date: Thu, 07 Jan 2010 19:39:29 +0100 Subject: Check_openmanage In-Reply-To: (Jack Lyons's message of "Thu, 7 Jan 2010 12:08:56 -0500") References: <15tvdfhg9kq.fsf@tux.uio.no> Message-ID: <15t7hrtdaji.fsf@tux.uio.no> Jack Lyons writes: > Just another feature request.....I would like to make the host name an > html clickable link to the https://servername:1311 Open Manage web > interface. I thought about wrapping the open manage check to do it, > but seemed kind of kludgy. Yes, that could easily be done. But the reason that I haven't implemented something like that is that this is exactly the kind of thing that the action_url is for, like this: define host { host_name myhost.example.org alias myhost address 10.0.0.1 use generic-host hostgroup_name dell-hardware contact_groups spam at example.org action_url https://$HOSTNAME$:1311/ } If you have the DRACs set up with deterministic host names based on the server host name, e.g. 'servername-drac', you can use the DRAC web address instead. I think that the action_url approach is cleaner, as some people (like me) don't have the openmanage web interface set up. You can alternatively use notes_url if you've already used up action_url. PS. I wish that Nagios had a more generic alternative, i.e. something like this: "generic_url
||". That would be really useful. Cheers, -- Trond H. Amundsen Center for Information Technology Services, University of Oslo ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jack1729 at hotmail.com Thu Jan 7 21:39:51 2010 From: jack1729 at hotmail.com (Jack Lyons) Date: Thu, 7 Jan 2010 15:39:51 -0500 Subject: Check_openmanage In-Reply-To: <15t7hrtdaji.fsf@tux.uio.no> References: , <15tvdfhg9kq.fsf@tux.uio.no>, , <15t7hrtdaji.fsf@tux.uio.no> Message-ID: I will give that a try - I am not the nagios adminstrator and wasn't aware that was available.....thanks. Jack > From: t.h.amundsen at usit.uio.no > To: jack1729 at hotmail.com > CC: nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] Check_openmanage > Date: Thu, 7 Jan 2010 19:39:29 +0100 > > Jack Lyons writes: > > > Just another feature request.....I would like to make the host name an > > html clickable link to the https://servername:1311 Open Manage web > > interface. I thought about wrapping the open manage check to do it, > > but seemed kind of kludgy. > > Yes, that could easily be done. But the reason that I haven't > implemented something like that is that this is exactly the kind of > thing that the action_url is for, like this: > > define host { > host_name myhost.example.org > alias myhost > address 10.0.0.1 > use generic-host > hostgroup_name dell-hardware > contact_groups spam at example.org > action_url https://$HOSTNAME$:1311/ > } > > If you have the DRACs set up with deterministic host names based on the > server host name, e.g. 'servername-drac', you can use the DRAC web > address instead. I think that the action_url approach is cleaner, as > some people (like me) don't have the openmanage web interface set > up. You can alternatively use notes_url if you've already used up > action_url. > > PS. I wish that Nagios had a more generic alternative, i.e. something > like this: "generic_url
||". That would be > really useful. > > Cheers, > -- > Trond H. Amundsen > Center for Information Technology Services, University of Oslo -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From benny at bennyvision.com Fri Jan 8 01:12:04 2010 From: benny at bennyvision.com (C. Bensend) Date: Thu, 7 Jan 2010 18:12:04 -0600 Subject: Checking IIS sites when there is a default "outage" page In-Reply-To: <8FC191CA-2ECA-46EA-B9D7-DC029A76FC7A@ena.com> References: <63365268b9db77d120509239618fa65b.squirrel@webmail.stinkweasel.net> <8FC191CA-2ECA-46EA-B9D7-DC029A76FC7A@ena.com> Message-ID: <687c474c2acfcaa2a26153fe910f5020.squirrel@webmail.stinkweasel.net> >> Any thoughts on how to monitor an IIS site requiring NTLM >> authentication when a default page is configured for an "out of >> service" page? > > I'm not aware of anything out-of-the-box but I do know of one that's close > (but haven't used it). I also know that curl supports NTLM so if that's > used as the basis of a plugin, you're a good way there. This PHP based > plugin appears to have the features of check_http you need, but without > the NTLM auth -- > > http://exchange.nagios.org/directory/Plugins/Websites%2C-Forms-and-Transactions/CURL-check-script/details > > Since it's PHP and PHP Curl is pretty well documented, I think that it > wouldn't be too difficult to add in the authentication bit (two lines in > the curl_setopt() section I believe, if hard-coded). The two options I > think you'd need are -- > > http://www.php.net/manual/en/function.curl-setopt.php > > CURLOPT_USERPWD 'username:password' > CURLOPT_HTTPAUTH 'CURLAUTH_NTLM' > > It may be more complicated than that but it's where I'd start if I were > doing it. A quick followup... I decided to take the requirements a little more literally, and build a test that checks to see if their IIS web sites are up and running or if they're stopped/paused/whatever. So, here it is: http://exchange.nagios.org/directory/Plugins/Web-Servers/IIS/check_iis_site/details It's a VBscript that I run via NSClient's NRPE handler, and it checks the status of any IIS web site I tell it to. Released under BSD license, maybe someone else will find it useful. :) Benny -- "It's not all about getting up and putting four slices of kickass in a two slice toaster." -- ark86, on Fazed.net ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From anthony at xmanhosting.com.au Fri Jan 8 03:07:21 2010 From: anthony at xmanhosting.com.au (xmanhosting) Date: Fri, 8 Jan 2010 10:07:21 +0800 Subject: Making Nagios accessible on public internet Message-ID: <002901ca9007$52ac6100$9930fea9@rockinghshs.internal> Hello, I have successfully set up Nagios on my server computer, and can access it locally in a browser by navigating to: http://onlinebackup.xmanhosting.com.au/nagios/ What do I need to do so that my Nagios can be accessed publicly on the internet, rather than just locally to the server? Thanks for any help. Kind regards, Anthony -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From anthony at xmanhosting.com.au Fri Jan 8 03:56:54 2010 From: anthony at xmanhosting.com.au (xmanhosting) Date: Fri, 8 Jan 2010 10:56:54 +0800 Subject: Setting up email notification Message-ID: <002e01ca900e$3ee33520$9930fea9@rockinghshs.internal> Hello, I am trying to set up email notification. In my contacts.cfg file I have define contact{ contact_name nagiosadmin use generic-contact ; Inherit default values from generic-contact template (defined above) alias Nagios Admin ; Full name of user email anthony at xmanhosting.com.au ; <<***** CHANGE THIS TO YOUR EMAIL ADDRESS ****** } and then I have tried to add the contact to a host (switch.cfg): define host{ use generic-switch ; Inherit default values from a template host_name Billion ; The name we're giving to this switch alias Billion 7401 ; A longer name associated with the switch address 192.168.1.254 ; IP address of the switch hostgroups switches ; Host groups this switch is associated with contact nagiosadmin } however when I run: /usr/local/nagios/bin/nagios -v /usr/local/nagios/etc/nagios.cfg I get: Processing object config file '/usr/local/nagios/etc/objects/switch.cfg'... Error: Invalid host object directive 'contact'. Error: Could not add object property in file '/usr/local/nagios/etc/objects/switch.cfg' on line 30. Error processing object config files! Is someone able to assist me to get notifications working? Thanks for any help Kind regards, Anthony -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Fri Jan 8 05:44:18 2010 From: marc at ena.com (Marc Powell) Date: Thu, 7 Jan 2010 22:44:18 -0600 Subject: Making Nagios accessible on public internet In-Reply-To: <002901ca9007$52ac6100$9930fea9@rockinghshs.internal> References: <002901ca9007$52ac6100$9930fea9@rockinghshs.internal> Message-ID: On Jan 7, 2010, at 8:07 PM, xmanhosting wrote: > Hello, > > I have successfully set up Nagios on my server computer, and can access it locally in a browser by navigating to: > > http://onlinebackup.xmanhosting.com.au/nagios/ > > What do I need to do so that my Nagios can be accessed publicly on the internet, rather than just locally to the server? Only you'll know but my guess is that at the least you'll need to lessen the security on your firewall or whatever device you have in front of onlinebackup.xmanhosting.com.au to permit Internet based users. There's nothing nagios-specific that you need to do. All of it is network or host-based restrictions that you've implemented yourself. -- Marc ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Fri Jan 8 05:47:06 2010 From: marc at ena.com (Marc Powell) Date: Thu, 7 Jan 2010 22:47:06 -0600 Subject: Setting up email notification In-Reply-To: <002e01ca900e$3ee33520$9930fea9@rockinghshs.internal> References: <002e01ca900e$3ee33520$9930fea9@rockinghshs.internal> Message-ID: On Jan 7, 2010, at 8:56 PM, xmanhosting wrote: > and then I have tried to add the contact to a host (switch.cfg): > > define host{ > use generic-switch ; Inherit default values from a template > host_name Billion ; The name we're giving to this switch > alias Billion 7401 ; A longer name associated with the switch > address 192.168.1.254 ; IP address of the switch > hostgroups switches ; Host groups this switch is associated with > contact nagiosadmin > } > > however when I run: > > /usr/local/nagios/bin/nagios -v /usr/local/nagios/etc/nagios.cfg > > I get: > Processing object config file '/usr/local/nagios/etc/objects/switch.cfg'... > Error: Invalid host object directive 'contact' It's 'contacts', plural, not 'contact'. -- Marc ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From anthony at xmanhosting.com.au Fri Jan 8 05:57:23 2010 From: anthony at xmanhosting.com.au (xmanhosting) Date: Fri, 8 Jan 2010 12:57:23 +0800 Subject: check_sensors command Message-ID: <001901ca901f$13ade5b0$9930fea9@rockinghshs.internal> Hello, Thanks for the replies Mark and James. How would I find out what package i've installed to provide the 'sensors' command? I have 'sensors' now appearing in my Nagios admin, and it is reporting as 'sensors ok' Below are my command and service definitions. Do you know of any extra parameters I need to add so that it reports the actual cpu temp? # 'check_sensors' command definition define command{ command_name check_sensors command_line $USER1$/check_sensors } define service { use local-service host_name webserver service_description sensors check_command check_sensors } Kind regards, Anthony -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From penguinhead at gmail.com Fri Jan 8 08:30:10 2010 From: penguinhead at gmail.com (Pankaj) Date: Fri, 8 Jan 2010 13:00:10 +0530 Subject: non sticky passive checks Message-ID: <312b9fd1001072330p2c97f234lb7f13206aa07c240@mail.gmail.com> Hi, I've configured passive service checks and am reporting back the results using nsca. I would like to turn the alerts to be critical after 5 minutes if the monitored servers fail to send any results. Currently the check continues to remain in the last state. How can I achieve this? >>>>>>>>>>> service_common.cfg define service{ use generic_service name passive_service active_checks_enabled 0 passive_checks_enabled 1 flap_detection_enabled 0 register 0 is_volatile 0 check_period 24x7 max_check_attempts 1 normal_check_interval 5 retry_check_interval 1 check_freshness 0 contact_groups admins check_command check_dummy!0 notification_interval 120 notification_period 24x7 notification_options w,u,c,r stalking_options w,c,u } define service { use passive_service service_description HW Sensors hostgroup_name hgp_webservers, hgp_db, hgp_search, hgp_image check_command check_dummy active_checks_enabled 0 passive_checks_enabled 1 } define command{ command_name check_dummy command_line $USER1$/check_dummy $ARG1$ } -- ( 2b || ! 2b ) -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Fri Jan 8 10:46:18 2010 From: nagios at flatto.net (Assaf Flatto) Date: Fri, 08 Jan 2010 09:46:18 +0000 Subject: non sticky passive checks In-Reply-To: <312b9fd1001072330p2c97f234lb7f13206aa07c240@mail.gmail.com> References: <312b9fd1001072330p2c97f234lb7f13206aa07c240@mail.gmail.com> Message-ID: <4B46FEEA.8060409@flatto.net> Pankaj wrote: > Hi, > > I've configured passive service checks and am reporting back the > results using nsca. I would like to turn the alerts to be critical > after 5 minutes if the monitored servers fail to send any results. > Currently the check continues to remain in the last state. How can I > achieve this? > > >>>>>>>>>>> service_common.cfg > define service{ > use generic_service > name passive_service > active_checks_enabled 0 > passive_checks_enabled 1 > flap_detection_enabled 0 > register 0 > is_volatile 0 > check_period 24x7 > max_check_attempts 1 > normal_check_interval 5 > retry_check_interval 1 > check_freshness 0 > contact_groups admins > check_command check_dummy!0 > notification_interval 120 > notification_period 24x7 > notification_options w,u,c,r > stalking_options w,c,u > } > > define service { > use passive_service > service_description HW Sensors > hostgroup_name hgp_webservers, > hgp_db, hgp_search, hgp_image > check_command check_dummy > active_checks_enabled 0 > passive_checks_enabled 1 > } > > define command{ > command_name check_dummy > command_line $USER1$/check_dummy $ARG1$ > } > > > -- > ( 2b || ! 2b ) > One way to do this is to add an active check from your central nagios to the remote nagios machine ( standard check_host_alive should do the trick) and if that machine is not responding for the time you define as bad , then trigger an event handler that "expires" the results supplied by that server . Assaf ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jashokda at cisco.com Fri Jan 8 11:52:10 2010 From: jashokda at cisco.com (Jatin Davey) Date: Fri, 08 Jan 2010 16:22:10 +0530 Subject: Getting parameters and plotting them in terms of % (Percentage) In-Reply-To: <4B4612B6.4040406@ederdrom.de> References: <4B448FB5.4060105@cisco.com> <8381699D-B31B-4EFD-8EE9-EA302237F32D@ena.com> <4B45E0E6.3000305@cisco.com> <4B4612B6.4040406@ederdrom.de> Message-ID: <4B470E5A.7040700@cisco.com> Hi all While compiling the PNP4 pacakage i am seeing that one the modules is not installed: RRDs Perl Modules: *** NOT FOUND *** Can someone tell me where to get the modules. Thanks Jatin Joerg Linge wrote: > Am 07.01.10 14:25, schrieb Jatin Davey: > >> Hi Marc >> >> Can you elaborate on what you mean by Nagios does not support it >> directly. My understanding is that that the check_nrpe process does not >> have such provisions to get the data in the manner that i have asked >> for. Please correct me if i am wrong. >> >> Also i checked PNP4Nagios , it has support for monitoring on linux boxes >> > > PNP4Nagios can create graphs based on performance data provided by nagios plugins. > PNP is not limited to linux hosts. > > http://docs.pnp4nagios.org/pnp-0.6/about > > Joerg > > ------------------------------------------------------------------------------ > This SF.Net email is sponsored by the Verizon Developer Community > Take advantage of Verizon's best-in-class app development support > A streamlined, 14 day to market process makes app distribution fast and easy > Join now and get one step closer to millions of Verizon customers > http://p.sf.net/sfu/verizon-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From paul.weaver at bbc.co.uk Fri Jan 8 12:12:45 2010 From: paul.weaver at bbc.co.uk (Paul Weaver) Date: Fri, 8 Jan 2010 11:12:45 -0000 Subject: non sticky passive checks In-Reply-To: <312b9fd1001072330p2c97f234lb7f13206aa07c240@mail.gmail.com> References: <312b9fd1001072330p2c97f234lb7f13206aa07c240@mail.gmail.com> Message-ID: <5C44E813F7481A46980A2F6778680B1A01B423D8@bbcxues16.national.core.bbc.co.uk> We use the following check to ensure that a backup has completed. The backup job calls in success (or fail) at the end, and runs every day (86,400 seconds). The freshness_threshold The backupFailed check command contains the folllowing. Note that normally active checks are disabled, however the check_freshness and freshness_threshold will triger an active check after n time intervals (93,600 * 1second in my case). This is nagios 2, so YMMV define service{ use defaultservice host_name myhost service_description Nightly Backup active_checks_enabled 0 passive_checks_enabled 1 check_freshness 1 freshness_threshold 93600 max_check_attempts 1 check_command backupFailed } define command{ command_name backupFailed command_line $USER1$/reportError 2 "The backup on $HOSTADDRESS$ failed to report in time" } With reportError being a bash script: #!/bin/bash echo $2 exit $1 -- What's with all the carrots? What do they need such good eyesight for anyway? Paul Weaver Systems Development Engineer BBC FM&T BETG TDD SDT Work: 020 8225 8109 Mobile: 07736 481006 Room 1244, BBC Television Centre, Wood Lane, London, W12 7RJ ________________________________ From: Pankaj [mailto:penguinhead at gmail.com] Sent: 08 January 2010 07:30 To: Nagios Mailinglist Subject: [Nagios-users] non sticky passive checks Hi, I've configured passive service checks and am reporting back the results using nsca. I would like to turn the alerts to be critical after 5 minutes if the monitored servers fail to send any results. Currently the check continues to remain in the last state. How can I achieve this? >>>>>>>>>>> service_common.cfg define service{ use generic_service name passive_service active_checks_enabled 0 passive_checks_enabled 1 flap_detection_enabled 0 register 0 is_volatile 0 check_period 24x7 max_check_attempts 1 normal_check_interval 5 retry_check_interval 1 check_freshness 0 contact_groups admins check_command check_dummy!0 notification_interval 120 notification_period 24x7 notification_options w,u,c,r stalking_options w,c,u } define service { use passive_service service_description HW Sensors hostgroup_name hgp_webservers, hgp_db, hgp_search, hgp_image check_command check_dummy active_checks_enabled 0 passive_checks_enabled 1 } define command{ command_name check_dummy command_line $USER1$/check_dummy $ARG1$ } -- ( 2b || ! 2b ) -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From K.Getchell at kingston.ac.uk Fri Jan 8 12:00:09 2010 From: K.Getchell at kingston.ac.uk (Getchell, Kristoffer M) Date: Fri, 8 Jan 2010 11:00:09 +0000 Subject: non sticky passive checks In-Reply-To: <312b9fd1001072330p2c97f234lb7f13206aa07c240@mail.gmail.com> References: <312b9fd1001072330p2c97f234lb7f13206aa07c240@mail.gmail.com> Message-ID: Hi Could you enable freshness checking and set the check command to something that sets the status to critical (for example you could write an external command into the external command queue)? So long as passive results were received within the freshness threshold, the check command won't be executed, so the status you see will be what is reported by the passive check result. If no passive check results are received you will then be notified of a critical situation as the freshness check command would then be executed and return a critical alert. To this end, I would redefine your passive service command as follows: define service { use passive_service service_description HW Sensors hostgroup_name hgp_webservers, hgp_db, hgp_search, hgp_image check_command check_dummy active_checks_enabled 0 passive_checks_enabled 1 check_freshness 1 freshness_threshold 300 check_command issue_critical_alert ; this command is run only if the service results are "stale" } Kind regards, Kris From: Pankaj [mailto:penguinhead at gmail.com] Sent: 08 January 2010 07:30 To: Nagios Mailinglist Subject: [Nagios-users] non sticky passive checks Hi, I've configured passive service checks and am reporting back the results using nsca. I would like to turn the alerts to be critical after 5 minutes if the monitored servers fail to send any results. Currently the check continues to remain in the last state. How can I achieve this? >>>>>>>>>>> service_common.cfg ?define service{ ??????????????? use??????????????????? generic_service ??????????????? name?????????????????? passive_service ??????????????? active_checks_enabled? 0 ??????????????? passive_checks_enabled 1??????????????????????????????? ??????????????? flap_detection_enabled 0 ??????????????? register?????????????? 0??????????????????????????????? ??????????????? is_volatile??????????? 0 ??????????????? check_period?????????? 24x7 ??????????????? max_check_attempts???? 1 ??????????????? normal_check_interval? 5 ??????????????? retry_check_interval?? 1 ??????????????? check_freshness??????? 0 ??????????????? contact_groups???????? admins? ??????????????? check_command????????? check_dummy!0 ??????????????? notification_interval? 120 ??????????????? notification_period??? 24x7 ??????????????? notification_options?? w,u,c,r ??????????????? stalking_options?????? w,c,u } define service { ??????????????? use???????????????????????????? passive_service ??????????????? service_description?????? HW Sensors ??????????????? hostgroup_name????????????????? hgp_webservers, hgp_db, hgp_search, hgp_image ??????????????? check_command?????????????????? check_dummy ??????????????? active_checks_enabled?????????? 0 ??????????????? passive_checks_enabled????????? 1 } define command{ ??????? command_name check_dummy ??????? command_line $USER1$/check_dummy $ARG1$ } -- ( 2b || ! 2b ) This email has been scanned for all viruses by the MessageLabs Email Security System. This email has been scanned for all viruses by the MessageLabs Email Security System. ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From penguinhead at gmail.com Fri Jan 8 13:06:52 2010 From: penguinhead at gmail.com (Pankaj) Date: Fri, 8 Jan 2010 17:36:52 +0530 Subject: non sticky passive checks In-Reply-To: References: <312b9fd1001072330p2c97f234lb7f13206aa07c240@mail.gmail.com> Message-ID: <312b9fd1001080406h50ea6d38nbc0e208b5841edad@mail.gmail.com> Hi, Thanks all. The following works best: define service { .... check_freshness 1 freshness_threshold 300 check_command check_dummy } define command{ command_name check_dummy command_line $USER1$/check_dummy 2 "Stale data: The server has not responded." } On Fri, Jan 8, 2010 at 4:30 PM, Getchell, Kristoffer M < K.Getchell at kingston.ac.uk> wrote: > Hi > > Could you enable freshness checking and set the check command to something > that sets the status to critical (for example you could write an external > command into the external command queue)? So long as passive results were > received within the freshness threshold, the check command won't be > executed, so the status you see will be what is reported by the passive > check result. If no passive check results are received you will then be > notified of a critical situation as the freshness check command would then > be executed and return a critical alert. > > To this end, I would redefine your passive service command as follows: > > define service { > use passive_service > service_description HW Sensors > hostgroup_name hgp_webservers, hgp_db, hgp_search, > hgp_image > check_command check_dummy > active_checks_enabled 0 > passive_checks_enabled 1 > check_freshness 1 > freshness_threshold 300 > check_command issue_critical_alert ; this > command is run only if the service results are "stale" > } > > Kind regards, > > > > > Kris > > > > > > > > From: Pankaj [mailto:penguinhead at gmail.com] > Sent: 08 January 2010 07:30 > To: Nagios Mailinglist > Subject: [Nagios-users] non sticky passive checks > > Hi, > > I've configured passive service checks and am reporting back the results > using nsca. I would like to turn the alerts to be critical after 5 minutes > if the monitored servers fail to send any results. Currently the check > continues to remain in the last state. How can I achieve this? > > >>>>>>>>>>> service_common.cfg > define service{ > use generic_service > name passive_service > active_checks_enabled 0 > passive_checks_enabled 1 > flap_detection_enabled 0 > register 0 > is_volatile 0 > check_period 24x7 > max_check_attempts 1 > normal_check_interval 5 > retry_check_interval 1 > check_freshness 0 > contact_groups admins > check_command check_dummy!0 > notification_interval 120 > notification_period 24x7 > notification_options w,u,c,r > stalking_options w,c,u > } > > define service { > use passive_service > service_description HW Sensors > hostgroup_name hgp_webservers, hgp_db, > hgp_search, hgp_image > check_command check_dummy > active_checks_enabled 0 > passive_checks_enabled 1 > } > > define command{ > command_name check_dummy > command_line $USER1$/check_dummy $ARG1$ > } > > > -- > ( 2b || ! 2b ) > > This email has been scanned for all viruses by the MessageLabs Email > Security System. > > This email has been scanned for all viruses by the MessageLabs Email > Security System. > > > ------------------------------------------------------------------------------ > This SF.Net email is sponsored by the Verizon Developer Community > Take advantage of Verizon's best-in-class app development support > A streamlined, 14 day to market process makes app distribution fast and > easy > Join now and get one step closer to millions of Verizon customers > http://p.sf.net/sfu/verizon-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- ( 2b || ! 2b ) -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jpratt at norwich.edu Fri Jan 8 14:24:23 2010 From: jpratt at norwich.edu (James Pratt) Date: Fri, 8 Jan 2010 08:24:23 -0500 Subject: check_sensors command In-Reply-To: <001901ca901f$13ade5b0$9930fea9@rockinghshs.internal> References: <001901ca901f$13ade5b0$9930fea9@rockinghshs.internal> Message-ID: <24B6509E4191AF44B60A24EAA3B4AD493E6B18@nuexchg.norwich.edu> >> -----Original Message----- >> From: xmanhosting [mailto:anthony at xmanhosting.com.au] >> Sent: Thursday, January 07, 2010 11:57 PM >> To: nagios-users at lists.sourceforge.net >> Subject: [Nagios-users] check_sensors command >> >> Hello, >> >> Thanks for the replies Mark and James. >> >> How would I find out what package i've installed to provide the 'sensors' command? >> >> I have 'sensors' now appearing in my Nagios admin, and it is reporting as 'sensors >> ok' >> >> Below are my command and service definitions. Do you know of any extra >> parameters I need to add so that it reports the actual cpu temp? >> >> # 'check_sensors' command definition >> define command{ >> command_name check_sensors >> command_line $USER1$/check_sensors >> } >> >> define service { >> use local-service >> host_name webserver >> service_description sensors >> check_command check_sensors >> } >> >> Kind regards, >> Anthony >> Depends on your linux distro - if you have RPM, then it would be (replace mysql with your sensors binary path/name) - # rpm -qf /usr/bin/mysql mysql-3.23.52-3 Sorry, but I don't know apt-get, so perhaps someone else can tell you about that one if you are on debian/ubuntu etc. Cheers, jamie ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jlanawalt at gmail.com Fri Jan 8 17:01:46 2010 From: jlanawalt at gmail.com (Jacob Anawalt) Date: Fri, 8 Jan 2010 09:01:46 -0700 Subject: Could not expand members specified in hostgroup (defined in cfg_dir) Message-ID: <6ccd37151001080801p101c8cbdm3c188c5e8d511b57@mail.gmail.com> Hello, I was trying out Nagios 3.0.6-4~lenny2 on Debian 5.0. The nagios3 package in Lenny uses cfg_dir pointing at /etc/nagios3/conf.d. The directory is pre-populated with the following: contacts_nagios2.cfg extinfo_nagios2.cfg generic-host_nagios2.cfg generic-service_nagios2.cfg host-gateway_nagios3.cfg hostgroups_nagios2.cfg localhost_nagios2.cfg services_nagios2.cfg timeperiods_nagios2.cfg I made a new cfg for my gecko server, gecko.cfg, and added it alongside localhost in hostgroups_nagios2.cfg, but when I tried to restart, nagios says: Error: Could not find any host matching 'gecko' (config file '/etc/nagios3/conf.d/hostgroups_nagios2.cfg', starting on line 18) Error: Could not expand members specified in hostgroup (config file '/etc/nagios3/conf.d/hostgroups_nagios2.cfg', starting on line 18) I worked around the issue by removing gecko from the hostgroup members and instead adding a hostgroups entry to the gecko host definition. I believe the object definitions are correct because the gecko entry with an HTTP service is now active. What would prevent me from adding gecko to members of the hostgroup? services_nagios2.cfg: ... define service { hostgroup_name http-servers service_description HTTP check_command check_http use generic-service notification_interval 0 } ... hostgroups_nagios2.cfg: ... define hostgroup { hostgroup_name http-servers alias HTTP servers # works/original members localhost # broken # members localhost,gecko } ... gecko.cfg: define host{ use generic-host host_name gecko.domain.tld alias gecko address nnn.nnn.nnn.nnn # added when hostgroup.members didn't work hostgroups http-servers } localhost_nagios2.cfg: define host{ use generic-host host_name localhost alias localhost address 127.0.0.1 } # a few non-hostgroup service definitions follow ... ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Fri Jan 8 17:53:28 2010 From: marc at ena.com (Marc Powell) Date: Fri, 8 Jan 2010 10:53:28 -0600 Subject: Could not expand members specified in hostgroup (defined in cfg_dir) In-Reply-To: <6ccd37151001080801p101c8cbdm3c188c5e8d511b57@mail.gmail.com> References: <6ccd37151001080801p101c8cbdm3c188c5e8d511b57@mail.gmail.com> Message-ID: <551E8F18-B133-4456-8CA3-2BFB51B5C665@ena.com> On Jan 8, 2010, at 10:01 AM, Jacob Anawalt wrote: > I made a new cfg for my gecko server, gecko.cfg, and added it > alongside localhost in hostgroups_nagios2.cfg, but when I tried to > restart, nagios says: > > Error: Could not find any host matching 'gecko' (config file > '/etc/nagios3/conf.d/hostgroups_nagios2.cfg', starting on line 18) > hostgroups_nagios2.cfg: > ... > define hostgroup { > hostgroup_name http-servers > alias HTTP servers > # works/original > members localhost > # broken > # members localhost,gecko > } members are host_names. This should be 'localhost,gecko.domain.tld'. alias is generally just a more descriptive name for human consumption only. > gecko.cfg: > define host{ > use generic-host > host_name gecko.domain.tld > alias gecko > address nnn.nnn.nnn.nnn > # added when hostgroup.members didn't work > hostgroups http-servers > } Personally I think this is a better, more scalable way of adding it to the hostgroup anyway. -- Marc ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Sat Jan 9 02:02:24 2010 From: patrick.morris at hp.com (patrick.morris at hp.com) Date: Fri, 8 Jan 2010 17:02:24 -0800 Subject: Getting parameters and plotting them in terms of % (Percentage) In-Reply-To: <4B470E5A.7040700@cisco.com> References: <4B448FB5.4060105@cisco.com> <8381699D-B31B-4EFD-8EE9-EA302237F32D@ena.com> <4B45E0E6.3000305@cisco.com> <4B4612B6.4040406@ederdrom.de> <4B470E5A.7040700@cisco.com> Message-ID: <20100109010224.GK17169@bakgwai.americas.hpqcorp.net> Hi Jatin! On Fri, 08 Jan 2010, Jatin Davey wrote: > Hi all > > While compiling the PNP4 pacakage i am seeing that one the modules is > not installed: > > RRDs Perl Modules: *** NOT FOUND *** > > Can someone tell me where to get the modules. It depends what OS and platform you're running it on and how you like to install your Perl modules. http://cpan.perl.org is always a good place to look if you do the compile-from-source thing, but if you didn't know that already I'd suggest using a pre-built package for your OS distribution of choice, whatever that may be. ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gmartin at gmartin.org Sat Jan 9 16:56:22 2010 From: gmartin at gmartin.org (gmartin) Date: Sat, 9 Jan 2010 10:56:22 -0500 Subject: Checking IIS sites when there is a default "outage" page In-Reply-To: <687c474c2acfcaa2a26153fe910f5020.squirrel@webmail.stinkweasel.net> References: <63365268b9db77d120509239618fa65b.squirrel@webmail.stinkweasel.net> <8FC191CA-2ECA-46EA-B9D7-DC029A76FC7A@ena.com> <687c474c2acfcaa2a26153fe910f5020.squirrel@webmail.stinkweasel.net> Message-ID: Benny, you might also take a look at Webinject.pl It allows you to script a series of website interactions and watch for patterns in the response. There is a nagios mode and I believe you can pass credentials. \\Greg On Thu, Jan 7, 2010 at 7:12 PM, C. Bensend wrote: > > >> Any thoughts on how to monitor an IIS site requiring NTLM > >> authentication when a default page is configured for an "out of > >> service" page? > > > > I'm not aware of anything out-of-the-box but I do know of one that's > close > > (but haven't used it). I also know that curl supports NTLM so if that's > > used as the basis of a plugin, you're a good way there. This PHP based > > plugin appears to have the features of check_http you need, but without > > the NTLM auth -- > > > > > http://exchange.nagios.org/directory/Plugins/Websites%2C-Forms-and-Transactions/CURL-check-script/details > > > > Since it's PHP and PHP Curl is pretty well documented, I think that it > > wouldn't be too difficult to add in the authentication bit (two lines in > > the curl_setopt() section I believe, if hard-coded). The two options I > > think you'd need are -- > > > > http://www.php.net/manual/en/function.curl-setopt.php > > > > CURLOPT_USERPWD 'username:password' > > CURLOPT_HTTPAUTH 'CURLAUTH_NTLM' > > > > It may be more complicated than that but it's where I'd start if I were > > doing it. > > A quick followup... > > I decided to take the requirements a little more literally, and > build a test that checks to see if their IIS web sites are > up and running or if they're stopped/paused/whatever. So, here > it is: > > > http://exchange.nagios.org/directory/Plugins/Web-Servers/IIS/check_iis_site/details > > It's a VBscript that I run via NSClient's NRPE handler, and it > checks the status of any IIS web site I tell it to. > > Released under BSD license, maybe someone else will find it > useful. :) > > Benny > > > -- > "It's not all about getting up and putting four slices of kickass > in a two slice toaster." -- ark86, on Fazed.net > > > > > ------------------------------------------------------------------------------ > This SF.Net email is sponsored by the Verizon Developer Community > Take advantage of Verizon's best-in-class app development support > A streamlined, 14 day to market process makes app distribution fast and > easy > Join now and get one step closer to millions of Verizon customers > http://p.sf.net/sfu/verizon-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From benny at bennyvision.com Sat Jan 9 16:59:30 2010 From: benny at bennyvision.com (C. Bensend) Date: Sat, 9 Jan 2010 09:59:30 -0600 Subject: Checking IIS sites when there is a default "outage" page In-Reply-To: References: <63365268b9db77d120509239618fa65b.squirrel@webmail.stinkweasel.net> <8FC191CA-2ECA-46EA-B9D7-DC029A76FC7A@ena.com> <687c474c2acfcaa2a26153fe910f5020.squirrel@webmail.stinkweasel.net> Message-ID: <6db7b7133858334881aeb47f579d5948.squirrel@webmail.stinkweasel.net> > Benny, you might also take a look at Webinject.pl It allows you to script > a > series of website interactions and watch for patterns in the response. > There is a nagios mode and I believe you can pass credentials. In the earlier messages in this thread, I mentioned it. I'm not having a helluva lot of luck with it, it doesn't seem to do what I *think* I'm telling it to. I would LOVE it if you had some examples you could provide. :) I will need to get it working at some point, I just haven't had the time yet... Benny -- "It's not all about getting up and putting four slices of kickass in a two slice toaster." -- ark86, on Fazed.net ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From steinex at nognu.de Sun Jan 10 15:59:25 2010 From: steinex at nognu.de (Frank Steinborn) Date: Sun, 10 Jan 2010 15:59:25 +0100 Subject: $TOTALHOSTSUP$ and probably other macros give wrong values Message-ID: <20100110145925.GA93420@haydn.nognu.de> Hello, I'm trying to use some macros from Nagios 3.2.0 for an IRC bot, like $TOTALSERVICESCRITICAL$, $TOTALSERVICESWARNINGUNHANDLED$ etc. Most *seem* to give the correct value, but $TOTALHOSTSUP$ is obviously wrong. I have 7 hosts in an UP state, only one of these hosts is in "do not actively check"-mode. However, $TOTALHOSTSUP$ returns '2'. I'm not sure why this happens, do I miss something? This happens on FreeBSD 7.2-RELEASE. I would be glad if anyone could clarify that. Thanks, Frank ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From radel at inet.net.nz Sun Jan 10 21:55:46 2010 From: radel at inet.net.nz (Dietrich Radel) Date: Mon, 11 Jan 2010 09:55:46 +1300 Subject: [question] Service State Information -> "Last Update" - should it reset to 0 every 10 seconds? Message-ID: <4B4A3ED2.6000805@inet.net.nz> Hello. Just a quick question. When examining the information for a service in the NAGIOS web interface, I have noticed that the "Last Update" time keeps resetting back to zero every 10 seconds i.e. when it reaches "0d 0h 0m 10s ago". Is this normal behaviour. I was certain that it didn't use to do this. Could I have done something to cause this in my config? Thanks in advance. Regards, D.Radel. ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rmaglasang at infoweapons.com Mon Jan 11 07:36:38 2010 From: rmaglasang at infoweapons.com (Sho) Date: Mon, 11 Jan 2010 14:36:38 +0800 Subject: Making Nagios accessible on public internet In-Reply-To: <002901ca9007$52ac6100$9930fea9@rockinghshs.internal> References: <002901ca9007$52ac6100$9930fea9@rockinghshs.internal> Message-ID: <4B4AC6F6.8090104@infoweapons.com> xmanhosting wrote: > Hello, > > I have successfully set up Nagios on my server computer, and can > access it locally in a browser by navigating to: > > http://onlinebackup.xmanhosting.com.au/nagios/ > > What do I need to do so that my Nagios can be accessed publicly on > the internet, rather than just locally to the server? Configure your server with a public IP address (routable); if its behind a firewall or NAT, you may need to setup a bi-NAT or port-forwarding > > Thanks for any help. > > Kind regards, > Anthony > > ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From EWScott at scotborders.gov.uk Mon Jan 11 11:11:51 2010 From: EWScott at scotborders.gov.uk (Scott, Ewan) Date: Mon, 11 Jan 2010 10:11:51 +0000 Subject: question about running Nagios on Solaris LDom and Zone servers Message-ID: <771645F4ADF2C9449B7E6AE630C720018BC4BD2315@HQ-MAIL-01.scotborders.gov.uk> Solaris query. I am about to put Nagios on both the primary LDom physical servers and all the virtual guest systems which run on them. Similarly I intend to run it on the primary host zone servers and the guest zones running on them. Is anyone aware of any problems - performance issues? - which can result from this blanket approach? Regards Ewan Ewan Scott UNIX Team Systems Support Officer Business & Technology Solutions Scottish Borders Council Council Headquarters Newtown St Boswells Melrose TD6 0SA 01835 824000 ext 5374 ewscott at scotborders.gov.uk http://www.scotborders.gov.uk ********************************************************************** This email and any files transmitted with it are privileged, confidential and subject to copyright. Any unauthorised use or disclosure of any part of this email is prohibited. If you are not the intended recipient please inform the sender immediately; you should then delete the email and remove any copies from your system. The views or opinions expressed in this communication may not necessarily be those of Scottish Borders Council. Please be advised that Scottish Borders Council's incoming and outgoing email is subject to regular monitoring and any email may require to be disclosed by the Council under the provisions of the Freedom of Information (Scotland) Act 2002. ********************************************************************** -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From simon.kainz at tugraz.at Mon Jan 11 13:03:40 2010 From: simon.kainz at tugraz.at (Simon Kainz) Date: Mon, 11 Jan 2010 13:03:40 +0100 Subject: Nested (on demand)macros Message-ID: <4B4B139C.4050501@tugraz.at> Hi list! I#M using Nagios 2.9 and have the following question: I'm trying to save some typing and to prettyfy my config files, so i came up with the following: check_command check-host-adaptive!$SERVICESTATEID:$HOSTNAME$:conn$!$SERVICEATTEMPT:$HOSTNAME$:conn$ This should run check-host-adaptive with the state id (Ok, warn, crit) of the plugin conn on the current host (as defines by $HOSTNAME$), the same holds for the serviceattempts. But Nagios presents me the verbose text "$SERVICESTATEID:$HOSTNAME$:conn$" instead of the values i want to see. Any ideas ? Regards, -- DI Simon Kainz Graz, University of Technology Department Computing Phone: ++43 (0) 316 / 873 6885 -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 260 bytes Desc: OpenPGP digital signature URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Mon Jan 11 14:51:44 2010 From: marc at ena.com (Marc Powell) Date: Mon, 11 Jan 2010 07:51:44 -0600 Subject: Nested (on demand)macros In-Reply-To: <4B4B139C.4050501@tugraz.at> References: <4B4B139C.4050501@tugraz.at> Message-ID: <43FE77E9-BA5D-42C8-A854-D330A1BF689F@ena.com> On Jan 11, 2010, at 6:03 AM, Simon Kainz wrote: > Hi list! > > I#M using Nagios 2.9 and have the following question: > > I'm trying to save some typing and to prettyfy my config files, so i > came up with the following: > > > check_command > check-host-adaptive!$SERVICESTATEID:$HOSTNAME$:conn$!$SERVICEATTEMPT:$HOSTNAME$:conn$ > I don't know if this double substitution will work*, but if it has a chance to, you need to use the macro in the command{} definition command_line, not the host{} definition. See http://nagios.sourceforge.net/docs/3_0/clusters.html for an example. * I don't believe this is going to work because it's not listed as one of the macros that's expected to contain other macros. From the Macros documenation -- Macro Substitution - How Macros Work Before Nagios executes a command, it will replace any macros it finds in the command definition with their corresponding values. This macro substitution occurs for all types of commands that Nagios executes - host and service checks, notifications, event handlers, etc. Certain macros may themselves contain other macros. These include the $HOSTNOTES$, $HOSTNOTESURL$, $HOSTACTIONURL$, $SERVICENOTES$, $SERVICENOTESURL$, and $SERVICEACTIONURL$ macros. -- Marc ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From natxo.asenjo at gmail.com Mon Jan 11 15:00:37 2010 From: natxo.asenjo at gmail.com (Natxo Asenjo) Date: Mon, 11 Jan 2010 15:00:37 +0100 Subject: check_openmanage block esmlog, check the rest of items Message-ID: <90f6e8271001110600t362db767u91e70510ee838389@mail.gmail.com> hi, we have a brand new r710 and I have installed the openmanage software on the server itself. On the nagios server I execute it and it works fine (SNMP). /check_openmanage -H HOST -C public -P 1 -a Controller 1 [SAS 6/iR Integrated] needs attention: Degraded ESM log content: 4 critical, 0 non-critical, 1 ok Is it possible to exclude the esm log from the check? And a second question: the raid controller is 'degraded', but it only is an updated firmware version. I cannot blacklist it with the --blacklist ctrl_fw option: ./check_openmanage -H rdpaysist -C public -P 1 -a -b ctrl_fw=1 Controller 1 [SAS 6/iR Integrated] needs attention: Degraded ESM log content: 4 critical, 0 non-critical, 1 ok Is upgrading the firmware the only option to get rid of this message? (that is not a problem, by the way, but if I can blacklist it ...) TIA -- Groeten, J.Asenjo ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From a31modela at hotmail.com Mon Jan 11 15:07:17 2010 From: a31modela at hotmail.com (steve f) Date: Mon, 11 Jan 2010 09:07:17 -0500 Subject: Nagios Integration with IBM TEC ( Tivoli ) Message-ID: Hello, We have IBM TEC ( Tivoli Enterprise Console) for alot of our current monitoring. Does anyone have any experience / history integrating Nagios with IBM TEC? We use postd messages to get the info from the clients back to TEC. Can Nagios read the postd messages? I guess I could somehow use a passive check to do this correct ?? Would Nagios be considered a viable replacement for TEC? I have not been able to find any Marketing type of info as far as Nagios as a replacement for other monitoring solutions. Thanks, Steve _________________________________________________________________ Hotmail: Powerful Free email with security by Microsoft. http://clk.atdmt.com/GBL/go/196390710/direct/01/ -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From t.h.amundsen at usit.uio.no Mon Jan 11 15:25:41 2010 From: t.h.amundsen at usit.uio.no (Trond Hasle Amundsen) Date: Mon, 11 Jan 2010 15:25:41 +0100 Subject: check_openmanage block esmlog, check the rest of items In-Reply-To: <90f6e8271001110600t362db767u91e70510ee838389@mail.gmail.com> (Natxo Asenjo's message of "Mon, 11 Jan 2010 15:00:37 +0100") References: <90f6e8271001110600t362db767u91e70510ee838389@mail.gmail.com> Message-ID: <15t6378afbu.fsf@tux.uio.no> Natxo Asenjo writes: > we have a brand new r710 and I have installed the openmanage software > on the server itself. On the nagios server I execute it and it works > fine (SNMP). > > /check_openmanage -H HOST -C public -P 1 -a > Controller 1 [SAS 6/iR Integrated] needs attention: Degraded > ESM log content: 4 critical, 0 non-critical, 1 ok > > Is it possible to exclude the esm log from the check? Yes. You have implicitely included it with the '-a' option, which means "check everything, including log content". Drop the '-a' option from the command and you should be fine. > And a second question: the raid controller is 'degraded', but it only > is an updated firmware version. I cannot blacklist it with the > --blacklist ctrl_fw option: > > ./check_openmanage -H rdpaysist -C public -P 1 -a -b ctrl_fw=1 > Controller 1 [SAS 6/iR Integrated] needs attention: Degraded > ESM log content: 4 critical, 0 non-critical, 1 ok > > Is upgrading the firmware the only option to get rid of this message? > (that is not a problem, by the way, but if I can blacklist it ...) Hmm.. When the controller is in a degraded state, the plugin tries to figure out why. In this case it obviously doesn't work. I need some debugging info to get to the bottom of this. Can you email me directly with output from the following commands: * On the monitored server: omreport storage controller * On the Nagios server: snmpwalk -v1 -c public rdpaysist 1.3.6.1.4.1.674.10893 Cheers, -- Trond H. Amundsen Center for Information Technology Services, University of Oslo ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From perldork at webwizarddesign.com Mon Jan 11 15:26:39 2010 From: perldork at webwizarddesign.com (Max) Date: Mon, 11 Jan 2010 09:26:39 -0500 Subject: Nagios Integration with IBM TEC ( Tivoli ) In-Reply-To: References: Message-ID: Snmp traps are the easiest way to integrate Nagios with other tools that use SNMP. With a little leg work you can use SnMPTT to also receive traps from other SNMP sources and submit them toi Nagios as passive checks. We currently use SNMP traps to integrate Nagios with Spectrum, works quite nicely. Max On 1/11/10, steve f wrote: > > Hello, > > We have IBM TEC ( Tivoli Enterprise Console) for alot of our current > monitoring. > > Does anyone have any experience / history integrating Nagios with IBM TEC? > > We use postd messages to get the info from the clients back to TEC. Can > Nagios read the postd messages? I guess I could somehow use a passive check > to do this correct ?? > > Would Nagios be considered a viable replacement for TEC? I have not been > able to find any Marketing type of info as far as Nagios as a replacement > for other monitoring solutions. > > Thanks, > Steve > > _________________________________________________________________ > Hotmail: Powerful Free email with security by Microsoft. > http://clk.atdmt.com/GBL/go/196390710/direct/01/ ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From joe.hartley at retailsolutions.com Mon Jan 11 15:48:53 2010 From: joe.hartley at retailsolutions.com (Joe Hartley) Date: Mon, 11 Jan 2010 06:48:53 -0800 Subject: Checking IIS sites when there is a default "outage" page In-Reply-To: <6f7ab08bdacff73cb645c59ff7a982b0.squirrel@webmail.stinkweasel.net> References: <63365268b9db77d120509239618fa65b.squirrel@webmail.stinkweasel.net><8FC191CA-2ECA-46EA-B9D7-DC029A76FC7A@ena.com> <6f7ab08bdacff73cb645c59ff7a982b0.squirrel@webmail.stinkweasel.net> Message-ID: <580CB599E684764C84C5CCFA64A929B4037D0581@EXVBE011-2.exch011.intermedia.net> From: C. Bensend [mailto:benny at bennyvision.com] > (hints/tips/examples/pointers on webinject tests still very much > appreciated, too - I have some web apps to log into and test) I had been using webinject to test NTLM authentication successfully on both a RedHat 4 and a Fedora 7 based system (the Fedora 7 was actually Aurora Linux for SPARC). When I moved to RedHat/CentOS 5.x, webinject stopped working for me and I have never been able to figure out why. Webinject's last release was 4 years ago, and the forums have not seen a post in almost a year. I'm definitely keeping an eye on this thread to see what results come out of it. -- Joe Hartley | Sr. Linux SysAdmin Retail Solutions, Inc. 40 Sharpe Drive Cranston, RI 02920 joe.hartley at retailsolutions.com +1 401.824.5040 (o) | +1 401.824.5002 (f) ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From israel at frontierflying.com Mon Jan 11 18:45:48 2010 From: israel at frontierflying.com (Israel Brewster) Date: Mon, 11 Jan 2010 08:45:48 -0900 Subject: Dependancy clarification In-Reply-To: References: Message-ID: <1B87925E-BB36-4433-8E64-E3BBA18A8FEB@frontierflying.com> On Jan 8, 2010, at 2:55 PM, gmartin wrote: > Israel, > I believe you are correct. I'll be interested to hear what other > shave to say on the inner workings. In the meantime, can the > problem be solved if the event handler for Service B is written to > restart svc A if it is down? (perhaps it calls the same nagios > check from the command line and acts on the results) Yeah, that should work, at least for my specific situation. Of course, doing so greatly reduces the utility of having the dependancy in the first place, since the situations under which it would be triggered (given nagios restarting service A as soon as it detects it as down) would be somewhat rare, and even when triggered it would no longer be needed, since the service B event handler does its own dependancy checking. The only time the dependancy would apply (assuming our understanding is right) is in the situation where Nagios detects A as down, and then tries to run a check on B before verifying that A is back up. Of course, even then it wouldn't matter, since a) nagios should have restarted service A immediately (so a straight restart of B would be fine), and b) even if nagios didn't, the new event handler for service B would. At which point there is no need of the dependancy at all, since the event handler takes care of the dependancies. Basically, if the dependancy only applies when nagios ALREADY knows service A is down, then the dependancy is basically useless, at least in this situation. Of course, if this is just the way dependancies work, then there may be no other option. Thanks for the feedback. > > \\Greg > > > > On Fri, Jan 8, 2010 at 6:07 PM, Israel Brewster > wrote: > Here's the situation: running nagios 3.2.0, I have two services, > we'll call them A and B. Both have event handlers such that if they > register a hard critical state, Nagios attempts to restart them. > Service B depends on service A, such that when service A goes down, > service B does as well, causing them both to need restarted, with A > needing to be restarted first. I have a servicedependancy set up in > nagios specifying service B's dependancy on service A. > > My understanding is that the way this works is that when nagios goes > to check service B, it first looks at the "current" state (as > defined by the last nagios check) of service A, and, if the > execution_failure_criteria matches (i.e. if service A is down) > nagios does not run the check on service B, thus not running the > event handler to attempt to restart B until A is back up. This is > good. But what happens in the following scenario? > > Service A is scheduled to check every 5 minutes. > 1) Nagios does a normally scheduled check of service A, finding it > to be OK. > 2) One minute later, Service A crashes > 3) One minute after that (three minutes prior to the next regular > check of service A), thanks to nagios staggering checks, Nagios goes > to do a normal check of service B > > Now, to my understanding of this scenario, the check on service B > would run normally, since the last check on A was OK, and nagios > uses cached results for dependancy checks. Since service A is > actually critical, service B will be critical as well. The problem > with this is that Nagios will respond by attempting to restart > service B, which will invariably fail since service A is still down. > Once the next regular check time for service A is reached, Nagios > will detect service A as down and restart it, but service B will > never get restarted successfully, since nagios already tried and > failed. > > Is this correct? If so, what can be done about it? Or is nagios > smart enough to schedule its service checks to avoid this scenario? > It seems that the most logical solution (if possible) would be to > mirror the service/host check logic. That is, when a check of > service B comes back as critical, immediately check service A. If > service A is critical, then don't declare service B to be critical > until service A is OK, at which point B would enter a hard down > state and run the event handler. Alternately, if I could say > something like always check service A immediately before checking > service B to make sure our data is current, that would work as well. > Although I could see it resulting in excessive checking of service > A, which may be less desirable. What do you guys think? > ----------------------------------------------- > Israel Brewster > Computer Support Technician II > Frontier Flying Service Inc. > 5245 Airport Industrial Rd > Fairbanks, AK 99709 > (907) 450-7250 x293 > ----------------------------------------------- > > > > > ------------------------------------------------------------------------------ > This SF.Net email is sponsored by the Verizon Developer Community > Take advantage of Verizon's best-in-class app development support > A streamlined, 14 day to market process makes app distribution fast > and easy > Join now and get one step closer to millions of Verizon customers > http://p.sf.net/sfu/verizon-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ----------------------------------------------- Israel Brewster Computer Support Technician II Frontier Flying Service Inc. 5245 Airport Industrial Rd Fairbanks, AK 99709 (907) 450-7250 x293 ----------------------------------------------- -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: Israel Brewster.vcf Type: text/directory Size: 417 bytes Desc: not available URL: -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Mon Jan 11 19:19:46 2010 From: patrick.morris at hp.com (Morris, Patrick) Date: Mon, 11 Jan 2010 10:19:46 -0800 Subject: [question] Service State Information -> "Last Update" - should it reset to 0 every 10 seconds? In-Reply-To: <4B4A3ED2.6000805@inet.net.nz> References: <4B4A3ED2.6000805@inet.net.nz> Message-ID: <4B4B6BC2.5020301@hp.com> Dietrich Radel wrote: > Just a quick question. When examining the information for a service in > the NAGIOS web interface, I have noticed that the "Last Update" time > keeps resetting back to zero every 10 seconds i.e. when it reaches "0d > 0h 0m 10s ago". Is this normal behaviour. I was certain that it didn't > use to do this. Could I have done something to cause this in my config? > If Nagios is configured to update every 10 seconds, yes, that's perfectly normal. http://nagios.sourceforge.net/docs/3_0/configmain.html#status_update_interval ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Mon Jan 11 19:25:15 2010 From: patrick.morris at hp.com (Morris, Patrick) Date: Mon, 11 Jan 2010 10:25:15 -0800 Subject: question about running Nagios on Solaris LDom and Zone servers In-Reply-To: <771645F4ADF2C9449B7E6AE630C720018BC4BD2315@HQ-MAIL-01.scotborders.gov.uk> References: <771645F4ADF2C9449B7E6AE630C720018BC4BD2315@HQ-MAIL-01.scotborders.gov.uk> Message-ID: <4B4B6D0B.2090606@hp.com> Scott, Ewan wrote: > > I am about to put Nagios on both the primary LDom physical servers and > all the virtual guest systems which run on them. Similarly I intend to > run it on the primary host zone servers and the guest zones running on > them. Is anyone aware of any problems - performance issues? - which > can result from this blanket approach? > Not enough information to say, but generally, yes, there are a lot of issues that *could* result from doing things this way, but you haven't provided enough information to guess whether that'll be an issue for you or not. Nagios can be resource intensive, but it's all dependent on what you're doing with it (how many hosts, how many services, which version you're running, etc.). There have also been a lot of timing issues reported with running Nagios in a VM, though I don't know whether those apply to Solaris guest zones. ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From radel at inet.net.nz Mon Jan 11 20:12:17 2010 From: radel at inet.net.nz (D & E Radel) Date: Tue, 12 Jan 2010 08:12:17 +1300 Subject: [question] Service State Information -> "Last Update" - should it reset to 0 every 10 seconds? In-Reply-To: <4B4B6BC2.5020301@hp.com> References: <4B4A3ED2.6000805@inet.net.nz> <4B4B6BC2.5020301@hp.com> Message-ID: <4B4B7811.6070000@inet.net.nz> Morris, Patrick wrote: >> Just a quick question. When examining the information for a service >> in the NAGIOS web interface, I have noticed that the "Last Update" >> time keeps resetting back to zero every 10 seconds i.e. when it >> reaches "0d 0h 0m 10s ago". Is this normal behaviour. I was certain >> that it didn't use to do this. Could I have done something to cause >> this in my config? >> > > If Nagios is configured to update every 10 seconds, yes, that's > perfectly normal. > http://nagios.sourceforge.net/docs/3_0/configmain.html#status_update_interval > > Brilliant. Thanks for that Patrick. :-) ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From stanb at panix.com Mon Jan 11 23:13:06 2010 From: stanb at panix.com (stan) Date: Mon, 11 Jan 2010 17:13:06 -0500 Subject: NagiSQL created files Message-ID: <20100111221306.GA21239@teddy.fas.com> We have installed NagioSQL in an effort to make it easier for various people to configure Nagios. It seeems to work well, but it has the anoying habit of putting CR.LF at the end of each line in the files it generates. Is there an easy way to stop this short of post procesing all the files it generates? -- A: Because it messes up the order in which people normally read text. Q: Why is top-posting such a bad thing? A: Top-posting. Q: What is the most annoying thing in e-mail? ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From glicerinu at gmail.com Tue Jan 12 06:48:48 2010 From: glicerinu at gmail.com (Marc Aymerich) Date: Tue, 12 Jan 2010 06:48:48 +0100 Subject: avoid notifications during time interval Message-ID: *Hi all!* *From 2:00 to 2:15AM my mail server is stroked by the backup system. Most of the times Nagios gets a Timeout error when the SMTP service is checked during this time period, so Nagios sends me an annoying Critical notification mail alert every night. How can I avoid this 'redundant' Critical Alerts? Is possible to disable the smtp checks during this time? There is a better solution? * *thanks!* *Marc* -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From richard.gliebe at fhv.at Tue Jan 12 08:12:40 2010 From: richard.gliebe at fhv.at (Richard Gliebe) Date: Tue, 12 Jan 2010 08:12:40 +0100 Subject: is_volatile Message-ID: <4B4C20E8.8050503@fhv.at> Hi there, at the moment I'm migrating from nagios Nagios-2.3 to Nagios-3.0.6 with nearly the same config Files. One parameter makes me headaches: "is_volatile". In V2.3 it doesn't shows me this parameter in the webinterface and notification (SMS and Email) messages. But in V3.0.6 does. EXAMPLE from the Webinterface: V2.3 Free Space C:\ OK V3.0.6 Free Space C: is_volatile 0 OK In V3.0.6 it only shows me this output from services when I check the DriveSizes from Windows-Server with check_nrpe. Here is one Service Check from V3.0.6 and V2.6 .... # Service definition define service{ use windows-service host_name service_description Free Space C:\ is_volatile 0 check_period 24x7 max_check_attempts 3 normal_check_interval 3 retry_check_interval 1 contact_groups sys-admins notification_interval 0 notification_period 24x7 notification_options w,u,c,r check_command check_win_nrpe2!CheckDriveSize!ShowAll=long MinWarnFree=30% MinCritFree=20% Drive=C:! .... Memory Checks looks like this. .... # Service definition define service{ use windows-service host_name service_description MEMORY is_volatile 0 check_period 24x7 max_check_attempts 3 normal_check_interval 3 retry_check_interval 1 contact_groups nosms-admins notification_interval 0 notification_period 24x7 notification_options w,u,c,r check_command check_win_nrpe2!checkMem!ShowAll=long MaxWarn=96% MaxCrit=99% type=physical type=page! } .... My first idea was to delete the line "is_volatile 0" But a nagios -v nagios.cfg gives me this Warning. .... Warning: Service 'Free Space C: check_period 24x7' on host '' has no check time period defined! .... any hints are welcome to hide this 'is_volatile' output. thanks in advance Richard ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From natxo.asenjo at gmail.com Tue Jan 12 08:57:02 2010 From: natxo.asenjo at gmail.com (Natxo Asenjo) Date: Tue, 12 Jan 2010 08:57:02 +0100 Subject: check_openmanage block esmlog, check the rest of items In-Reply-To: <15t6378afbu.fsf@tux.uio.no> References: <90f6e8271001110600t362db767u91e70510ee838389@mail.gmail.com> <15t6378afbu.fsf@tux.uio.no> Message-ID: <90f6e8271001112357h25b5a896m942de046695a4238@mail.gmail.com> On Mon, Jan 11, 2010 at 3:25 PM, Trond Hasle Amundsen wrote: >> Is it possible to exclude the esm log from the check? > > Yes. You have implicitely included it with the '-a' option, which means > "check everything, including log content". Drop the '-a' option from the > command and you should be fine. but this will continue to check the chassis, will it not? i mean, I still want to check the fans, memory etc. >> And a second question: the raid controller is 'degraded', but it only >> is an updated firmware version. I cannot blacklist it with the >> --blacklist ctrl_fw option: >> >> ./check_openmanage -H rdpaysist -C public -P 1 -a -b ctrl_fw=1 >> Controller 1 [SAS 6/iR Integrated] needs attention: Degraded >> ESM log content: 4 critical, 0 non-critical, 1 ok >> >> Is upgrading the firmware the only option to get rid of this message? >> (that is not a problem, by the way, but if I can blacklist it ...) > > Hmm.. When the controller is in a degraded state, the plugin tries to > figure out why. In this case it obviously doesn't work. I need some > debugging info to get to the bottom of this. Can you email me directly > with output from the following commands: > > ?* On the monitored server: > ? ? ?omreport storage controller C:\>omreport storage controller List of Controllers in the system Controllers ID : 0 Status : Ok Name : PERC 6/E Adapter Slot ID : PCI Slot 3 State : Ready Firmware Version : 6.2.0-0013 Minimum Required Firmware Version : Not Applicable Driver Version : 2.24.00.64 Minimum Required Driver Version : Not Applicable Storport Driver Version : 5.2.3790.3959 Minimum Required Storport Driver Version : 5.2.3790.4173 Number of Connectors : 2 Rebuild Rate : 30% BGI Rate : 30% Check Consistency Rate : 30% Reconstruct Rate : 30% Alarm State : Not Applicable Cluster Mode : Not Applicable SCSI Initiator ID : Not Applicable Cache Memory Size : 256 MB Patrol Read Mode : Auto Patrol Read State : Stopped Patrol Read Rate : 30% Patrol Read Iterations : 0 Abort check consistency on error : Disabled Allow Revertible Hot Spare and Replace Member : Enabled Auto replace member on predictive failure : Disabled Load balance : Auto Security Capable : Not Applicable Security Key Present : Not Applicable Redundant Path view : Not Applicable ID : 1 Status : Non-Critical Name : SAS 6/iR Integrated Slot ID : Embedded State : Degraded Firmware Version : 00.25.47.00.06.22.03.00 Minimum Required Firmware Version : Not Applicable Driver Version : 1.28.03.01 Minimum Required Driver Version : Not Applicable Storport Driver Version : 5.2.3790.3959 Minimum Required Storport Driver Version : 5.2.3790.4173 Number of Connectors : 2 Rebuild Rate : Not Applicable BGI Rate : Not Applicable Check Consistency Rate : Not Applicable Reconstruct Rate : Not Applicable Alarm State : Not Applicable Cluster Mode : Not Applicable SCSI Initiator ID : Not Applicable Cache Memory Size : Not Applicable Patrol Read Mode : Not Applicable Patrol Read State : Not Applicable Patrol Read Rate : Not Applicable Patrol Read Iterations : Not Applicable Abort check consistency on error : Not Applicable Allow Revertible Hot Spare and Replace Member : Not Applicable Auto replace member on predictive failure : Not Applicable Load balance : Not Applicable Security Capable : Not Applicable Security Key Present : Not Applicable Redundant Path view : Not Applicable > > ?* On the Nagios server: > ? ? ?snmpwalk -v1 -c public rdpaysist 1.3.6.1.4.1.674.10893 SNMPv2-SMI::enterprises.674.10893.1.20.1.0 = STRING: "3.2.0" SNMPv2-SMI::enterprises.674.10893.1.20.2.0 = INTEGER: 2 SNMPv2-SMI::enterprises.674.10893.1.20.3.0 = STRING: "Dell Inc." SNMPv2-SMI::enterprises.674.10893.1.20.4.0 = STRING: "Server Administrator (Storage Management)" SNMPv2-SMI::enterprises.674.10893.1.20.5.0 = STRING: "Configuration and monitoring of disk storage devices." SNMPv2-SMI::enterprises.674.10893.1.20.100.1.0 = STRING: "Server Administrator (Storage Management)" SNMPv2-SMI::enterprises.674.10893.1.20.100.2.0 = STRING: "Configuration and monitoring of disk storage devices." SNMPv2-SMI::enterprises.674.10893.1.20.100.3.0 = STRING: "Dell Inc." SNMPv2-SMI::enterprises.674.10893.1.20.110.3.0 = INTEGER: 1263282653 SNMPv2-SMI::enterprises.674.10893.1.20.110.4.0 = INTEGER: 5 SNMPv2-SMI::enterprises.674.10893.1.20.110.5.0 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.110.6.0 = INTEGER: 300 SNMPv2-SMI::enterprises.674.10893.1.20.110.11.0 = STRING: "3.2" SNMPv2-SMI::enterprises.674.10893.1.20.110.12.0 = STRING: "https://172.20.4.96:1311" SNMPv2-SMI::enterprises.674.10893.1.20.110.13.0 = INTEGER: 4 SNMPv2-SMI::enterprises.674.10893.1.20.110.14.0 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.110.15.0 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.1.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.1.2 = INTEGER: 2 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.2.1 = STRING: "PERC 6/E Adapter" SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.2.2 = STRING: "SAS 6/iR Integrated" SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.3.1 = STRING: "DELL" SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.3.2 = STRING: "DELL" SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.4.1 = INTEGER: 6 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.4.2 = INTEGER: 6 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.5.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.5.2 = INTEGER: 6 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.7.1 = INTEGER: 30 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.8.1 = STRING: "6.2.0-0013" SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.8.2 = STRING: "00.25.47.00.06.22.03.00" SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.9.1 = INTEGER: 256 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.10.1 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.11.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.11.2 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.12.1 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.12.2 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.37.1 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.37.2 = INTEGER: 4 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.38.1 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.38.2 = INTEGER: 4 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.39.1 = STRING: "\\0" SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.39.2 = STRING: "\\1" SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.40.1 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.40.2 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.41.1 = STRING: "2.24.00.64" SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.41.2 = STRING: "1.28.03.01" SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.42.1 = STRING: "3" SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.42.2 = STRING: "embedded" SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.43.1 = INTEGER: 99 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.43.2 = INTEGER: 99 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.47.1 = INTEGER: 2 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.47.2 = INTEGER: 2 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.48.1 = INTEGER: 30 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.49.1 = INTEGER: 30 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.50.1 = INTEGER: 30 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.51.1 = INTEGER: 30 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.52.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.53.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.54.1 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.55.1 = STRING: "5.2.3790.3959" SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.55.2 = STRING: "5.2.3790.3959" SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.56.1 = STRING: "5.2.3790.4173" SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.56.2 = STRING: "5.2.3790.4173" SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.57.1 = INTEGER: 99 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.57.2 = INTEGER: 99 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.58.1 = INTEGER: 99 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.58.2 = INTEGER: 99 SNMPv2-SMI::enterprises.674.10893.1.20.130.2.1.1.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.2.1.1.2 = INTEGER: 2 SNMPv2-SMI::enterprises.674.10893.1.20.130.2.1.1.3 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.2.1.1.4 = INTEGER: 4 SNMPv2-SMI::enterprises.674.10893.1.20.130.2.1.2.1 = STRING: "Connector 0" SNMPv2-SMI::enterprises.674.10893.1.20.130.2.1.2.2 = STRING: "Connector 1" SNMPv2-SMI::enterprises.674.10893.1.20.130.2.1.2.3 = STRING: "Connector 0" SNMPv2-SMI::enterprises.674.10893.1.20.130.2.1.2.4 = STRING: "Connector 1" SNMPv2-SMI::enterprises.674.10893.1.20.130.2.1.3.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.2.1.3.2 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.2.1.3.3 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.2.1.3.4 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.2.1.7.1 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.2.1.7.2 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.2.1.7.3 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.2.1.7.4 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.2.1.8.1 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.2.1.8.2 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.2.1.8.3 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.2.1.8.4 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.2.1.9.1 = STRING: "\\0\\0" SNMPv2-SMI::enterprises.674.10893.1.20.130.2.1.9.2 = STRING: "\\0\\1" SNMPv2-SMI::enterprises.674.10893.1.20.130.2.1.9.3 = STRING: "\\1\\0" SNMPv2-SMI::enterprises.674.10893.1.20.130.2.1.9.4 = STRING: "\\1\\1" SNMPv2-SMI::enterprises.674.10893.1.20.130.2.1.11.1 = INTEGER: 8 SNMPv2-SMI::enterprises.674.10893.1.20.130.2.1.11.2 = INTEGER: 8 SNMPv2-SMI::enterprises.674.10893.1.20.130.2.1.11.3 = INTEGER: 8 SNMPv2-SMI::enterprises.674.10893.1.20.130.2.1.11.4 = INTEGER: 8 SNMPv2-SMI::enterprises.674.10893.1.20.130.3.1.1.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.3.1.2.1 = STRING: "Backplane" SNMPv2-SMI::enterprises.674.10893.1.20.130.3.1.3.1 = STRING: "DELL" SNMPv2-SMI::enterprises.674.10893.1.20.130.3.1.4.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.3.1.8.1 = STRING: "9AB01ZA" SNMPv2-SMI::enterprises.674.10893.1.20.130.3.1.12.1 = STRING: "BACKPLANE " SNMPv2-SMI::enterprises.674.10893.1.20.130.3.1.16.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.3.1.19.1 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.3.1.23.1 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.3.1.24.1 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.3.1.25.1 = STRING: "\\1\\0\\0" SNMPv2-SMI::enterprises.674.10893.1.20.130.3.1.26.1 = STRING: "1.07" SNMPv2-SMI::enterprises.674.10893.1.20.130.3.1.30.1 = STRING: "5102408081177C00" SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.1.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.1.2 = INTEGER: 2 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.2.1 = STRING: "Physical Disk 0:0:0" SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.2.2 = STRING: "Physical Disk 0:0:1" SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.3.1 = STRING: "DELL(tm)" SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.3.2 = STRING: "DELL(tm)" SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.4.1 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.4.2 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.6.1 = STRING: "ST973452SS " SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.6.2 = STRING: "ST973452SS " SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.7.1 = STRING: "3TA0PKTV " SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.7.2 = STRING: "3TA0DP4W " SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.8.1 = STRING: "HT04" SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.8.2 = STRING: "HT03" SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.9.1 = STRING: "0" SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.9.2 = STRING: "0" SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.10.1 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.10.2 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.11.1 = INTEGER: 69376 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.11.2 = INTEGER: 69376 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.12.1 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.12.2 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.13.1 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.13.2 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.14.1 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.14.2 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.15.1 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.15.2 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.16.1 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.16.2 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.17.1 = INTEGER: 69376 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.17.2 = INTEGER: 69376 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.18.1 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.18.2 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.19.1 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.19.2 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.20.1 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.20.2 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.21.1 = INTEGER: 8 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.21.2 = INTEGER: 8 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.22.1 = INTEGER: 5 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.22.2 = INTEGER: 5 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.23.1 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.23.2 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.24.1 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.24.2 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.26.1 = STRING: "\\1\\0\\0\\0" SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.26.2 = STRING: "\\1\\0\\0\\1" SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.27.1 = STRING: "SG0G108N125319AQ02SEA01 " SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.27.2 = STRING: "SG0G108N1253198O00LHA00 " SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.28.1 = STRING: "5000C500177CA68D" SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.28.2 = STRING: "5000C5000BEC1D95" SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.29.1 = INTEGER: 3072 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.29.2 = INTEGER: 3072 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.30.1 = INTEGER: 3072 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.30.2 = INTEGER: 3072 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.31.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.31.2 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.32.1 = STRING: "02" SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.32.2 = STRING: "02" SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.33.1 = STRING: "44" SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.33.2 = STRING: "35" SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.34.1 = STRING: "2009" SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.34.2 = STRING: "2009" SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.35.1 = INTEGER: 2 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.35.2 = INTEGER: 2 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.36.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.36.2 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.40.1 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.40.2 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.41.1 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.41.2 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.5.1.1.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.5.1.1.2 = INTEGER: 2 SNMPv2-SMI::enterprises.674.10893.1.20.130.5.1.2.1 = STRING: "Physical Disk 0:0:0" SNMPv2-SMI::enterprises.674.10893.1.20.130.5.1.2.2 = STRING: "Physical Disk 0:0:1" SNMPv2-SMI::enterprises.674.10893.1.20.130.5.1.3.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.5.1.3.2 = INTEGER: 2 SNMPv2-SMI::enterprises.674.10893.1.20.130.5.1.4.1 = STRING: "Backplane" SNMPv2-SMI::enterprises.674.10893.1.20.130.5.1.4.2 = STRING: "Backplane" SNMPv2-SMI::enterprises.674.10893.1.20.130.5.1.5.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.5.1.5.2 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.5.1.6.1 = STRING: "SAS 6/iR Integrated" SNMPv2-SMI::enterprises.674.10893.1.20.130.5.1.6.2 = STRING: "SAS 6/iR Integrated" SNMPv2-SMI::enterprises.674.10893.1.20.130.5.1.7.1 = INTEGER: 2 SNMPv2-SMI::enterprises.674.10893.1.20.130.5.1.7.2 = INTEGER: 2 SNMPv2-SMI::enterprises.674.10893.1.20.130.15.1.1.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.15.1.2.1 = STRING: "Battery 0" SNMPv2-SMI::enterprises.674.10893.1.20.130.15.1.3.1 = STRING: "DELL" SNMPv2-SMI::enterprises.674.10893.1.20.130.15.1.4.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.15.1.5.1 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.15.1.6.1 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.15.1.9.1 = STRING: "\\0\\0" SNMPv2-SMI::enterprises.674.10893.1.20.130.15.1.10.1 = INTEGER: 2 SNMPv2-SMI::enterprises.674.10893.1.20.130.15.1.11.1 = INTEGER: 1426 SNMPv2-SMI::enterprises.674.10893.1.20.130.15.1.12.1 = INTEGER: 16 SNMPv2-SMI::enterprises.674.10893.1.20.130.15.1.13.1 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.15.1.14.1 = INTEGER: 168 SNMPv2-SMI::enterprises.674.10893.1.20.130.16.1.1.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.16.1.2.1 = STRING: "Battery 0" SNMPv2-SMI::enterprises.674.10893.1.20.130.16.1.3.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.16.1.4.1 = STRING: "PERC 6/E Adapter" SNMPv2-SMI::enterprises.674.10893.1.20.130.16.1.5.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.1.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.2.1 = STRING: "Virtual Disk 0" SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.3.1 = STRING: "Windows Disk 0" SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.4.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.6.1 = INTEGER: 69376 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.7.1 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.12.1 = INTEGER: 99 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.13.1 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.17.1 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.19.1 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.20.1 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.21.1 = STRING: "\\1\\0" SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.22.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.23.1 = INTEGER: 2 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.24.1 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.1.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.1.2 = INTEGER: 2 SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.2.1 = STRING: "Physical Disk 0:0:0" SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.2.2 = STRING: "Physical Disk 0:0:1" SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.3.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.3.2 = INTEGER: 2 SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.4.1 = STRING: "Virtual Disk 0" SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.4.2 = STRING: "Virtual Disk 0" SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.5.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.5.2 = INTEGER: 1 TIA, HTH. Natxo ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From natxo.asenjo at gmail.com Tue Jan 12 09:03:38 2010 From: natxo.asenjo at gmail.com (Natxo Asenjo) Date: Tue, 12 Jan 2010 09:03:38 +0100 Subject: check_openmanage block esmlog, check the rest of items In-Reply-To: <90f6e8271001112357h25b5a896m942de046695a4238@mail.gmail.com> References: <90f6e8271001110600t362db767u91e70510ee838389@mail.gmail.com> <15t6378afbu.fsf@tux.uio.no> <90f6e8271001112357h25b5a896m942de046695a4238@mail.gmail.com> Message-ID: <90f6e8271001120003ocd7c144p4fb2f96356094eb7@mail.gmail.com> On Tue, Jan 12, 2010 at 8:57 AM, Natxo Asenjo wrote: > On Mon, Jan 11, 2010 at 3:25 PM, Trond Hasle Amundsen > wrote: >>> Is it possible to exclude the esm log from the check? >> >> Yes. You have implicitely included it with the '-a' option, which means >> "check everything, including log content". Drop the '-a' option from the >> command and you should be fine. > > but this will continue to check the chassis, will it not? i mean, I > still want to check the fans, memory etc. yes, it will, I just checked it. Sorry for the noise. natxo ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From richard.gliebe at fhv.at Tue Jan 12 09:52:09 2010 From: richard.gliebe at fhv.at (Richard Gliebe) Date: Tue, 12 Jan 2010 09:52:09 +0100 Subject: is_volatile - SOLUTION In-Reply-To: <4B4C20E8.8050503@fhv.at> References: <4B4C20E8.8050503@fhv.at> Message-ID: <4B4C3839.6070005@fhv.at> On 1/12/10 8:12 AM Richard Gliebe wrote: > Hi there, Me onesmore, > > at the moment I'm migrating from nagios Nagios-2.3 to Nagios-3.0.6 with > nearly the same config Files. > > One parameter makes me headaches: "is_volatile". > > In V2.3 it doesn't shows me this parameter in the webinterface and > notification (SMS and Email) messages. But in V3.0.6 does. > > EXAMPLE from the Webinterface: > V2.3 > Free Space C:\ OK > > V3.0.6 > Free Space C: is_volatile 0 OK > > In V3.0.6 it only shows me this output from services when I check the > DriveSizes from Windows-Server with check_nrpe. > > Here is one Service Check from V3.0.6 and V2.6 > .... > # Service definition > define service{ > use windows-service > > host_name > service_description Free Space C:\ > is_volatile 0 > check_period 24x7 > max_check_attempts 3 > normal_check_interval 3 > retry_check_interval 1 > contact_groups sys-admins > notification_interval 0 > notification_period 24x7 > notification_options w,u,c,r > check_command > check_win_nrpe2!CheckDriveSize!ShowAll=long MinWarnFree=30% > MinCritFree=20% Drive=C:! > .... > > Memory Checks looks like this. > .... > # Service definition > define service{ > use windows-service > > host_name > service_description MEMORY > is_volatile 0 > check_period 24x7 > max_check_attempts 3 > normal_check_interval 3 > retry_check_interval 1 > contact_groups nosms-admins > notification_interval 0 > notification_period 24x7 > notification_options w,u,c,r > check_command > check_win_nrpe2!checkMem!ShowAll=long MaxWarn=96% MaxCrit=99% > type=physical type=page! > } > .... > > > My first idea was to delete the line "is_volatile 0" > But a nagios -v nagios.cfg gives me this Warning. > .... > Warning: Service 'Free Space C: check_period > 24x7' on host '' has no check time period defined! > .... > > any hints are welcome to hide this 'is_volatile' output. the problem was the leading backslash in the service_description ... service_description Free Space C:\ ... removing this backslash and it works as it should. Regards Richard ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From glicerinu at gmail.com Tue Jan 12 11:05:30 2010 From: glicerinu at gmail.com (Marc Aymerich) Date: Tue, 12 Jan 2010 11:05:30 +0100 Subject: avoid notifications during time interval In-Reply-To: References: Message-ID: On Tue, Jan 12, 2010 at 6:48 AM, Marc Aymerich wrote: > *Hi all!* > > *From 2:00 to 2:15AM my mail server is stroked by the backup system. Most > of the times Nagios gets a Timeout error when the SMTP service is checked > during this time period, so Nagios sends me an annoying Critical > notification mail alert every night. How can I avoid this 'redundant' > Critical Alerts? Is possible to disable the smtp checks during this time? > There is a better solution? * > > *thanks!* > > *Marc* > > I found the answer here: http://nagios.sourceforge.net/docs/3_0/oncallrotation.html -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From clair at practeo.ch Tue Jan 12 12:14:50 2010 From: clair at practeo.ch (Clair Johnson) Date: Tue, 12 Jan 2010 11:14:50 +0000 Subject: Greetings all, CGI access problem Message-ID: <0CC98CB467A5E044A9C403E9688943C7043262@exchange2010-vm.office.practeo.ch> Hi all, This is my first post to the list. I've search the archive but didn't find clues to my actual problem. It's since two years that I'm working with nagios, from the version 2.5 and I very happy and grateful for all the work people have done to this project. I've developed my own graphical interface to manage nagios core and my authentication. Everything is working great up to last week where I worked to migrate from version 2.9 to version 3.2. The only problem comes from the CGIs that I use, the statusmap.cgi and avail.cgi. Since I take care by myself of the authentication I don't want the nagios authentication, but when I call the statusmap or avail, I get a dialog box asking me a login and password. I've check the cgi config file and change the use_authentication to 0 but that doesn't change anything, when I call the CGI in my browser I'm still asked for credentials. My apache configuration hasn't changed and was working in nagios 2.9. Do you have ideas how to solve my problem, that is to use nagios CGIs without any authentication ? many thanks for your help Kind regards Clair Johnson -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From EWScott at scotborders.gov.uk Tue Jan 12 11:41:07 2010 From: EWScott at scotborders.gov.uk (Scott, Ewan) Date: Tue, 12 Jan 2010 10:41:07 +0000 Subject: question about running Nagios on Solaris LDom and Zone servers In-Reply-To: <4B4B6D0B.2090606@hp.com> References: <771645F4ADF2C9449B7E6AE630C720018BC4BD2315@HQ-MAIL-01.scotborders.gov.uk> <4B4B6D0B.2090606@hp.com> Message-ID: <771645F4ADF2C9449B7E6AE630C720018BC4BD2317@HQ-MAIL-01.scotborders.gov.uk> Thanks for this. I have not decided on exactly what I'll monitor as I'm still installing the clients and making sure the the basic infrastucture works. However in outline: 1. I intend to use nrpe for the checks. 2. I would envisage 5-6 basic checks on the host primary LDom servers (4) and primary Zone servers (2). 3. I would expect to run up to 10 nrpe checks on each of the guest LDoms and Zones (25 systems). I would see that as a small system with lightweight checking but would appreciate comments if anyone thinks otherwise. Further down the line I'd like to add in 80+ Windows VM systems at which point it would become a very differnet beast. Currently I'm running the Nagios server on Ubuntu on an old Dell PowerEdge 1850 but had thought of moving it over to a VMware virtual box as it grows. From what you are saying you think I'd run into problems with Nagios on a VM like this and I'd be better keeping it on an - adequately powered - standalone physical box? Regards Ewan Version info: Nagios core: 3.2 Nagios plugins: 1.4.14 Nagios nrpe: 2.12 -----Original Message----- From: Morris, Patrick [mailto:patrick.morris at hp.com] Sent: 11 January 2010 18:25 To: Scott, Ewan Cc: Nagios Mailinglist Subject: Re: [Nagios-users] question about running Nagios on Solaris LDom and Zone servers Scott, Ewan wrote: > > I am about to put Nagios on both the primary LDom physical servers and > all the virtual guest systems which run on them. Similarly I intend to > run it on the primary host zone servers and the guest zones running on > them. Is anyone aware of any problems - performance issues? - which > can result from this blanket approach? > Not enough information to say, but generally, yes, there are a lot of issues that *could* result from doing things this way, but you haven't provided enough information to guess whether that'll be an issue for you or not. Nagios can be resource intensive, but it's all dependent on what you're doing with it (how many hosts, how many services, which version you're running, etc.). There have also been a lot of timing issues reported with running Nagios in a VM, though I don't know whether those apply to Solaris guest zones. ********************************************************************** This email and any files transmitted with it are privileged, confidential and subject to copyright. Any unauthorised use or disclosure of any part of this email is prohibited. If you are not the intended recipient please inform the sender immediately; you should then delete the email and remove any copies from your system. The views or opinions expressed in this communication may not necessarily be those of Scottish Borders Council. Please be advised that Scottish Borders Council's incoming and outgoing email is subject to regular monitoring and any email may require to be disclosed by the Council under the provisions of the Freedom of Information (Scotland) Act 2002. ********************************************************************** ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Tue Jan 12 14:48:16 2010 From: marc at ena.com (Marc Powell) Date: Tue, 12 Jan 2010 07:48:16 -0600 Subject: Greetings all, CGI access problem In-Reply-To: <0CC98CB467A5E044A9C403E9688943C7043262@exchange2010-vm.office.practeo.ch> References: <0CC98CB467A5E044A9C403E9688943C7043262@exchange2010-vm.office.practeo.ch> Message-ID: <29BB8F81-2B88-47B3-A312-5D4EFC6FBAE9@ena.com> On Jan 12, 2010, at 5:14 AM, Clair Johnson wrote: > I've check the cgi config file and change the use_authentication to 0 but that doesn't change anything, when I call the CGI in my browser I'm still asked for credentials. Nagios doesn't present the authentication dialog, your web server does. This parameter only controls whether nagios pays attention to the username it gets from your web server. > My apache configuration hasn't changed and was working in nagios 2.9. > > Do you have ideas how to solve my problem, that is to use nagios CGIs without any authentication ? You have htaccess enabled in your web server, either in it's config file (httpd.conf if using apache) or via a .htaccess file in the nagios sbin directory. If in httpd.conf, remove things like AuthType, AuthUserFile, etc related to nagios. If you find a .htaccess file, remove it or rename it. If you need more specific information, detail the steps you took during your upgrade, what OS you are using, what web server you are using and any guides you might have used during installation. -- Marc ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From t.h.amundsen at usit.uio.no Tue Jan 12 15:13:05 2010 From: t.h.amundsen at usit.uio.no (Trond Hasle Amundsen) Date: Tue, 12 Jan 2010 15:13:05 +0100 Subject: check_openmanage block esmlog, check the rest of items In-Reply-To: <90f6e8271001112357h25b5a896m942de046695a4238@mail.gmail.com> (Natxo Asenjo's message of "Tue, 12 Jan 2010 08:57:02 +0100") References: <90f6e8271001110600t362db767u91e70510ee838389@mail.gmail.com> <15t6378afbu.fsf@tux.uio.no> <90f6e8271001112357h25b5a896m942de046695a4238@mail.gmail.com> Message-ID: <15t63775s3y.fsf@tux.uio.no> Natxo Asenjo writes: > C:\>omreport storage controller > List of Controllers in the system [...] > Storport Driver Version : 5.2.3790.3959 > Minimum Required Storport Driver Version : 5.2.3790.4173 This is your problem: the storport driver is out of date. I have never seen this before, and I don't know what "Storport" is. The official Dell OMSA 6.2.0 documentation doesn't mention it at all as far as I can see. > SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.55.1 = STRING: "5.2.3790.3959" > SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.55.2 = STRING: "5.2.3790.3959" > SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.56.1 = STRING: "5.2.3790.4173" > SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.56.2 = STRING: "5.2.3790.4173" Here are the same values obtained via SNMP. The downloadable MIBs from Dell does not contain these SNMP values, and the OMSA 6.2.0 SNMP guide doesn't mention them either. It is completely undocumented! Anyway, even though I don't know exactly what this "Storport" thing is, it is clearly some driver that is out of date. It is fairly simple to add support for this in check_openmanage, and I have uploaded a beta version here: http://folk.uio.no/trondham/tmp/ This beta version will correctly report the storport driver being out of date, and I have added a new blacklist keyword 'ctrl_stdr' that disables these storport-driver-is-out-of-date messages. Please give the beta version a try (with and without the new blacklisting keyword) at your convenience, and let me know how the testing goes. Cheers, -- Trond H. Amundsen Center for Information Technology Services, University of Oslo ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rperezm at uci.cu Tue Jan 12 16:57:37 2010 From: rperezm at uci.cu (ReynierPM) Date: Tue, 12 Jan 2010 10:57:37 -0500 Subject: What this message means Message-ID: <4B4C9BF1.4000208@uci.cu> Hi every: Today I get this message from one of my checked hosts: "Notifications for this host are being suppressed because it was detected as having been flapping between different states (24.2% change > 20.0% threshold). When the host state stabilizes and the flapping stops, notifications will be re-enabled." What this means? What I need to check on this host? -- Cheers and thanks in advance Ing. Reynier P?rez Mira ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jpratt at norwich.edu Tue Jan 12 17:27:38 2010 From: jpratt at norwich.edu (James Pratt) Date: Tue, 12 Jan 2010 11:27:38 -0500 Subject: What this message means In-Reply-To: <4B4C9BF1.4000208@uci.cu> References: <4B4C9BF1.4000208@uci.cu> Message-ID: <24B6509E4191AF44B60A24EAA3B4AD493E6FDE@nuexchg.norwich.edu> Hello Reynier, Have a look at: http://nagios.sourceforge.net/docs/3_0/flapping.html Hth - Jamie >> -----Original Message----- >> From: ReynierPM [mailto:rperezm at uci.cu] >> Sent: Tuesday, January 12, 2010 10:58 AM >> To: nagios-users at lists.sourceforge.net >> Subject: [Nagios-users] What this message means >> >> Hi every: >> Today I get this message from one of my checked hosts: "Notifications >> for this host are being suppressed because it was detected as having >> been flapping between different states (24.2% change > 20.0% threshold). >> When the host state stabilizes and the flapping stops, notifications >> will be re-enabled." >> >> What this means? What I need to check on this host? >> -- >> Cheers and thanks in advance >> Ing. Reynier P?rez Mira >> >> ------------------------------------------------------------------------------ >> This SF.Net email is sponsored by the Verizon Developer Community >> Take advantage of Verizon's best-in-class app development support >> A streamlined, 14 day to market process makes app distribution fast and easy >> Join now and get one step closer to millions of Verizon customers >> http://p.sf.net/sfu/verizon-dev2dev >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when reporting any >> issue. >> ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From maxhetrick at verizon.net Tue Jan 12 17:35:07 2010 From: maxhetrick at verizon.net (Max Hetrick) Date: Tue, 12 Jan 2010 11:35:07 -0500 Subject: What this message means In-Reply-To: <4B4C9BF1.4000208@uci.cu> References: <4B4C9BF1.4000208@uci.cu> Message-ID: <4B4CA4BB.7000705@verizon.net> ReynierPM wrote: > Hi every: > Today I get this message from one of my checked hosts: "Notifications > for this host are being suppressed because it was detected as having > been flapping between different states (24.2% change > 20.0% threshold). > When the host state stabilizes and the flapping stops, notifications > will be re-enabled." > > What this means? What I need to check on this host? Flapping is some built in logic to Nagios that prevents a barrage of notifications of hosts/services up and down messages. http://nagios.sourceforge.net/docs/3_0/flapping.html So if a host keeps coming up, and then down, and then up, and then down, notfications are suppressed to prevent a load of messages. You can disable this feature if you want globally by setting in the nagios.cfg file. enable_flap_detection={0,1} A lot of times, if a host is flapping, it might not be the actual host that has a problem, perhaps it's a switch, or some other piece of equipment in between. It does have its uses. Regards, Max ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mlitwin at stubhub.com Tue Jan 12 20:00:31 2010 From: mlitwin at stubhub.com (Litwin, Matthew) Date: Tue, 12 Jan 2010 12:00:31 -0700 Subject: question about running Nagios on Solaris LDom and Zone servers In-Reply-To: <771645F4ADF2C9449B7E6AE630C720018BC4BD2317@HQ-MAIL-01.scotborders.gov.uk> References: <771645F4ADF2C9449B7E6AE630C720018BC4BD2315@HQ-MAIL-01.scotborders.gov.uk> <4B4B6D0B.2090606@hp.com> <771645F4ADF2C9449B7E6AE630C720018BC4BD2317@HQ-MAIL-01.scotborders.gov.uk> Message-ID: I am looking at the same situation myself and have decided that I am going to have to run NRPE on each zone lest I rewrite all my plugins. Now I am deciding which checks should run on the LDom and which on the zones. It seems that disk checks are are good first choice to run on the LDom only, especially since there are already plugins on Nagios Exchange to do this (check_zpools.sh and such). I am trying to figure out what would be the best way to just use the LDom for CPU and load checking, but I don't see any plugins that would be able to give zone specific results. Aside from performance concerns, the main concern I am having is about how Nagios runs the checks and therefore how it organizes it. The whole LDom model breaks the Nagios host model of organazing things by host and in order to preserve this you pretty much have to make a new set of entries in command.cfg that can map your host names to zone names so that while you are running the commands against the LDom Nagios organizes the resultant data by the proper virtual host name and not just dump everything under the LDom host. This will be some work and will add yet another layer of complexity to an already complex system. I am not sure if any of this will be helpful, but it sounds like are trying to do the same thing. Here are some other resources that I stumbled upon so far which should be helpful as well: https://s23.org/wiki/Nagios/checks/solaris_zones http://stig.prod.dbs.melbourneit.com/ On Jan 12, 2010, at 2:41 AM, Scott, Ewan wrote: > Thanks for this. I have not decided on exactly what I'll monitor as I'm still installing the clients and making sure the the basic infrastucture works. However in outline: > > 1. I intend to use nrpe for the checks. > 2. I would envisage 5-6 basic checks on the host primary LDom servers (4) and primary Zone servers (2). > 3. I would expect to run up to 10 nrpe checks on each of the guest LDoms and Zones (25 systems). > > I would see that as a small system with lightweight checking but would appreciate comments if anyone thinks otherwise. > > Further down the line I'd like to add in 80+ Windows VM systems at which point it would become a very differnet beast. Currently I'm running the Nagios server on Ubuntu on an old Dell PowerEdge 1850 but had thought of moving it over to a VMware virtual box as it grows. From what you are saying you think I'd run into problems with Nagios on a VM like this and I'd be better keeping it on an - adequately powered - standalone physical box? > > Regards > Ewan > > Version info: > Nagios core: 3.2 > Nagios plugins: 1.4.14 > Nagios nrpe: 2.12 > > > > > > > > -----Original Message----- > From: Morris, Patrick [mailto:patrick.morris at hp.com] > Sent: 11 January 2010 18:25 > To: Scott, Ewan > Cc: Nagios Mailinglist > Subject: Re: [Nagios-users] question about running Nagios on Solaris LDom and Zone servers > > Scott, Ewan wrote: >> >> I am about to put Nagios on both the primary LDom physical servers and >> all the virtual guest systems which run on them. Similarly I intend to >> run it on the primary host zone servers and the guest zones running on >> them. Is anyone aware of any problems - performance issues? - which >> can result from this blanket approach? >> > > Not enough information to say, but generally, yes, there are a lot of > issues that *could* result from doing things this way, but you haven't > provided enough information to guess whether that'll be an issue for you > or not. > > Nagios can be resource intensive, but it's all dependent on what you're > doing with it (how many hosts, how many services, which version you're > running, etc.). There have also been a lot of timing issues reported > with running Nagios in a VM, though I don't know whether those apply to > Solaris guest zones. > > ********************************************************************** > This email and any files transmitted with it are privileged, confidential and subject to copyright. Any unauthorised use or disclosure of any part of this email is prohibited. If you are not the intended recipient please inform the sender immediately; you should then delete the email and remove any copies from your system. > The views or opinions expressed in this communication may not necessarily be those of Scottish Borders Council. > Please be advised that Scottish Borders Council's incoming and outgoing email is subject to regular monitoring and any email may require to be disclosed by the Council under the provisions of the Freedom of Information (Scotland) Act 2002. > > ********************************************************************** > > > ------------------------------------------------------------------------------ > This SF.Net email is sponsored by the Verizon Developer Community > Take advantage of Verizon's best-in-class app development support > A streamlined, 14 day to market process makes app distribution fast and easy > Join now and get one step closer to millions of Verizon customers > http://p.sf.net/sfu/verizon-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null Thanks, Matthew Litwin mlitwin at stubhub.com 415.222.8475 ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From EWScott at scotborders.gov.uk Wed Jan 13 10:57:21 2010 From: EWScott at scotborders.gov.uk (Scott, Ewan) Date: Wed, 13 Jan 2010 09:57:21 +0000 Subject: question about running Nagios on Solaris LDom and Zone servers In-Reply-To: References: <771645F4ADF2C9449B7E6AE630C720018BC4BD2315@HQ-MAIL-01.scotborders.gov.uk> <4B4B6D0B.2090606@hp.com> <771645F4ADF2C9449B7E6AE630C720018BC4BD2317@HQ-MAIL-01.scotborders.gov.uk> Message-ID: <771645F4ADF2C9449B7E6AE630C720018BC4BD231E@HQ-MAIL-01.scotborders.gov.uk> Matthew Thanks for this. I am going to have to give consideration to exactly the same things you already have, so what you've written will be helpful. Regards Ewan -----Original Message----- From: Litwin, Matthew [mailto:mlitwin at stubhub.com] Sent: 12 January 2010 19:01 To: Scott, Ewan Cc: Morris, Patrick; Nagios Mailinglist Subject: Re: [Nagios-users] question about running Nagios on Solaris LDom and Zone servers I am looking at the same situation myself and have decided that I am going to have to run NRPE on each zone lest I rewrite all my plugins. Now I am deciding which checks should run on the LDom and which on the zones. It seems that disk checks are are good first choice to run on the LDom only, especially since there are already plugins on Nagios Exchange to do this (check_zpools.sh and such). I am trying to figure out what would be the best way to just use the LDom for CPU and load checking, but I don't see any plugins that would be able to give zone specific results. Aside from performance concerns, the main concern I am having is about how Nagios runs the checks and therefore how it organizes it. The whole LDom model breaks the Nagios host model of organazing things by host and in order to preserve this you pretty much have to make a new set of entries in command.cfg that can map your host names to zone names so that while you are running the commands against the LDom Nagios organizes the resultant data by the proper virtual host name and not just dump everything under the LDom host. This will be some work and will add yet another layer of complexity to an already complex system. I am not sure if any of this will be helpful, but it sounds like are trying to do the same thing. Here are some other resources that I stumbled upon so far which should be helpful as well: https://s23.org/wiki/Nagios/checks/solaris_zones http://stig.prod.dbs.melbourneit.com/ On Jan 12, 2010, at 2:41 AM, Scott, Ewan wrote: > Thanks for this. I have not decided on exactly what I'll monitor as I'm still installing the clients and making sure the the basic infrastucture works. However in outline: > > 1. I intend to use nrpe for the checks. > 2. I would envisage 5-6 basic checks on the host primary LDom servers (4) and primary Zone servers (2). > 3. I would expect to run up to 10 nrpe checks on each of the guest LDoms and Zones (25 systems). > > I would see that as a small system with lightweight checking but would appreciate comments if anyone thinks otherwise. > > Further down the line I'd like to add in 80+ Windows VM systems at which point it would become a very differnet beast. Currently I'm running the Nagios server on Ubuntu on an old Dell PowerEdge 1850 but had thought of moving it over to a VMware virtual box as it grows. From what you are saying you think I'd run into problems with Nagios on a VM like this and I'd be better keeping it on an - adequately powered - standalone physical box? > > Regards > Ewan > > Version info: > Nagios core: 3.2 > Nagios plugins: 1.4.14 > Nagios nrpe: 2.12 > > > > > > > > -----Original Message----- > From: Morris, Patrick [mailto:patrick.morris at hp.com] > Sent: 11 January 2010 18:25 > To: Scott, Ewan > Cc: Nagios Mailinglist > Subject: Re: [Nagios-users] question about running Nagios on Solaris LDom and Zone servers > > Scott, Ewan wrote: >> >> I am about to put Nagios on both the primary LDom physical servers and >> all the virtual guest systems which run on them. Similarly I intend to >> run it on the primary host zone servers and the guest zones running on >> them. Is anyone aware of any problems - performance issues? - which >> can result from this blanket approach? >> > > Not enough information to say, but generally, yes, there are a lot of > issues that *could* result from doing things this way, but you haven't > provided enough information to guess whether that'll be an issue for you > or not. > > Nagios can be resource intensive, but it's all dependent on what you're > doing with it (how many hosts, how many services, which version you're > running, etc.). There have also been a lot of timing issues reported > with running Nagios in a VM, though I don't know whether those apply to > Solaris guest zones. > > ********************************************************************** > This email and any files transmitted with it are privileged, confidential and subject to copyright. Any unauthorised use or disclosure of any part of this email is prohibited. If you are not the intended recipient please inform the sender immediately; you should then delete the email and remove any copies from your system. > The views or opinions expressed in this communication may not necessarily be those of Scottish Borders Council. > Please be advised that Scottish Borders Council's incoming and outgoing email is subject to regular monitoring and any email may require to be disclosed by the Council under the provisions of the Freedom of Information (Scotland) Act 2002. > > ********************************************************************** > > > ------------------------------------------------------------------------------ > This SF.Net email is sponsored by the Verizon Developer Community > Take advantage of Verizon's best-in-class app development support > A streamlined, 14 day to market process makes app distribution fast and easy > Join now and get one step closer to millions of Verizon customers > http://p.sf.net/sfu/verizon-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null Thanks, Matthew Litwin mlitwin at stubhub.com 415.222.8475 ********************************************************************** This email and any files transmitted with it are privileged, confidential and subject to copyright. Any unauthorised use or disclosure of any part of this email is prohibited. If you are not the intended recipient please inform the sender immediately; you should then delete the email and remove any copies from your system. The views or opinions expressed in this communication may not necessarily be those of Scottish Borders Council. Please be advised that Scottish Borders Council's incoming and outgoing email is subject to regular monitoring and any email may require to be disclosed by the Council under the provisions of the Freedom of Information (Scotland) Act 2002. ********************************************************************** ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From clair at practeo.ch Wed Jan 13 11:20:30 2010 From: clair at practeo.ch (Clair Johnson) Date: Wed, 13 Jan 2010 10:20:30 +0000 Subject: Greetings all, CGI access problem In-Reply-To: <29BB8F81-2B88-47B3-A312-5D4EFC6FBAE9@ena.com> References: <0CC98CB467A5E044A9C403E9688943C7043262@exchange2010-vm.office.practeo.ch> <29BB8F81-2B88-47B3-A312-5D4EFC6FBAE9@ena.com> Message-ID: <0CC98CB467A5E044A9C403E9688943C704AE46@exchange2010-vm.office.practeo.ch> Thanks all for the help. I've found what my problem was. The installation of the new version created a sample conf file in apache2 conf.d folder witch overridden my original web server configuration. I've deleted this sample and everything went back to normal. Many thanks to everybody Kind regards Clair Johnson -----Message d'origine----- De?: Marc Powell [mailto:marc at ena.com] Envoy??: mardi 12 janvier 2010 14:48 ??: Nagios Mailinglist Objet?: Re: [Nagios-users] Greetings all, CGI access problem On Jan 12, 2010, at 5:14 AM, Clair Johnson wrote: > I've check the cgi config file and change the use_authentication to 0 but that doesn't change anything, when I call the CGI in my browser I'm still asked for credentials. Nagios doesn't present the authentication dialog, your web server does. This parameter only controls whether nagios pays attention to the username it gets from your web server. > My apache configuration hasn't changed and was working in nagios 2.9. > > Do you have ideas how to solve my problem, that is to use nagios CGIs without any authentication ? You have htaccess enabled in your web server, either in it's config file (httpd.conf if using apache) or via a .htaccess file in the nagios sbin directory. If in httpd.conf, remove things like AuthType, AuthUserFile, etc related to nagios. If you find a .htaccess file, remove it or rename it. If you need more specific information, detail the steps you took during your upgrade, what OS you are using, what web server you are using and any guides you might have used during installation. -- Marc ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From natxo.asenjo at gmail.com Wed Jan 13 12:27:45 2010 From: natxo.asenjo at gmail.com (Natxo Asenjo) Date: Wed, 13 Jan 2010 12:27:45 +0100 Subject: check_openmanage block esmlog, check the rest of items In-Reply-To: <15t63775s3y.fsf@tux.uio.no> References: <90f6e8271001110600t362db767u91e70510ee838389@mail.gmail.com> <15t6378afbu.fsf@tux.uio.no> <90f6e8271001112357h25b5a896m942de046695a4238@mail.gmail.com> <15t63775s3y.fsf@tux.uio.no> Message-ID: <90f6e8271001130327h13bc498r7dd940884ef0432d@mail.gmail.com> On Tue, Jan 12, 2010 at 3:13 PM, Trond Hasle Amundsen wrote: > Natxo Asenjo writes: > >> C:\>omreport storage controller >> List of Controllers in the system > [...] >> Storport Driver Version ? ? ? ? ? ? ? ? ? ? ? : 5.2.3790.3959 >> Minimum Required Storport Driver Version ? ? ?: 5.2.3790.4173 > > This is your problem: the storport driver is out of date. I have never > seen this before, and I don't know what "Storport" is. The official Dell > OMSA 6.2.0 documentation doesn't mention it at all as far as I can see. ok. I will update those. >> SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.55.1 = STRING: "5.2.3790.3959" >> SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.55.2 = STRING: "5.2.3790.3959" >> SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.56.1 = STRING: "5.2.3790.4173" >> SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.56.2 = STRING: "5.2.3790.4173" > > Here are the same values obtained via SNMP. The downloadable MIBs from > Dell does not contain these SNMP values, and the OMSA 6.2.0 SNMP guide > doesn't mention them either. It is completely undocumented! > > Anyway, even though I don't know exactly what this "Storport" thing is, > it is clearly some driver that is out of date. It is fairly simple to > add support for this in check_openmanage, and I have uploaded a beta > version here: > > ?http://folk.uio.no/trondham/tmp/ > > This beta version will correctly report the storport driver being out of > date, and I have added a new blacklist keyword 'ctrl_stdr' that disables > these storport-driver-is-out-of-date messages. great stuff! They work exactly as you say they should. Thanks! > Please give the beta version a try (with and without the new > blacklisting keyword) at your convenience, and let me know how the > testing goes. I will finally upgrade the storport drivers in the controllers, but for the time being this blacklist option is very handy. In an aside note, it is great to see this plugin so actively maintained. Awesome. Again, thank you. Natxo ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gabriel at impactteachers.com Wed Jan 13 16:37:52 2010 From: gabriel at impactteachers.com (Mr Gabriel) Date: Wed, 13 Jan 2010 15:37:52 +0000 Subject: Three tier alert system, and how to invoke an external script? Message-ID: <4B4DE8D0.20405@impactteachers.com> Is it possible to do the following; There are five people, three groups of people who should be notified in case of any issues. the three groups of people each have three ways of being notified. Group One -> Help Desk Team Group Two -> IT Support Team Group Three -> Management First notification is email, the second is text message, and the third one is by phone. Email is already configured. We have a perl script that can be run to send messages, and also a perl script that can connect to our asterisk boxes to make calls. On the first alert, notify group one by email, and ignore the second and third group. If no acknowledgment is recieved, run perl script for SMS with group one by, and group two notify by email If again no acknowledgment is received, run script for phone calls, and script for SMS/email to group three. I would like to know if such a setup is possible, and any hints and tips as to where I should be looking to figure it out. Before I forget, how can I invoke an external script, which is stored on the same server as nagios? Thank you -- Mr Gabriel ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From maxhetrick at verizon.net Wed Jan 13 16:53:08 2010 From: maxhetrick at verizon.net (Max Hetrick) Date: Wed, 13 Jan 2010 10:53:08 -0500 Subject: Three tier alert system, and how to invoke an external script? In-Reply-To: <4B4DE8D0.20405@impactteachers.com> References: <4B4DE8D0.20405@impactteachers.com> Message-ID: <4B4DEC64.3070904@verizon.net> Mr Gabriel wrote: > Is it possible to do the following; > > There are five people, three groups of people who should be notified in > case of any issues. the three groups of people each have three ways of > being notified. > > Group One -> Help Desk Team > Group Two -> IT Support Team > Group Three -> Management > > First notification is email, the second is text message, and the third > one is by phone. Email is already configured. We have a perl script that > can be run to send messages, and also a perl script that can connect to > our asterisk boxes to make calls. > > On the first alert, notify group one by email, and ignore the second and > third group. > If no acknowledgment is recieved, run perl script for SMS with group one > by, and group two notify by email > If again no acknowledgment is received, run script for phone calls, and > script for SMS/email to group three. > > > I would like to know if such a setup is possible, and any hints and tips > as to where I should be looking to figure it out. Before I forget, how > can I invoke an external script, which is stored on the same server as > nagios? You should be able to do all that you want to do by using escalations. http://nagios.sourceforge.net/docs/3_0/escalations.html I'm not certain how to invoke external commands using escalations, but you can notify the first to your e-mail, wait however many minutes for the second notification and have it go to group 2, and then wait another x amount of minutes, then notify group 3. Escalations should be the ticket for what you want. Regards, Max ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Wed Jan 13 16:56:50 2010 From: marc at ena.com (Marc Powell) Date: Wed, 13 Jan 2010 09:56:50 -0600 Subject: Three tier alert system, and how to invoke an external script? In-Reply-To: <4B4DE8D0.20405@impactteachers.com> References: <4B4DE8D0.20405@impactteachers.com> Message-ID: <3DC628A9-9968-4689-8A0F-17A09ADF28A1@ena.com> On Jan 13, 2010, at 9:37 AM, Mr Gabriel wrote: > On the first alert, notify group one by email, and ignore the second and > third group. > If no acknowledgment is recieved, run perl script for SMS with group one > by, and group two notify by email > If again no acknowledgment is received, run script for phone calls, and > script for SMS/email to group three. > > > I would like to know if such a setup is possible, and any hints and tips > as to where I should be looking to figure it out. I don't use them but it seems like a pretty standard escalation configuration - http://nagios.sourceforge.net/docs/3_0/escalations.html > Before I forget, how can I invoke an external script, which is stored on the same server as > nagios? Depends on how you intend to call them but it seems to me that you just need to make it the notification command for the members of the groups it should be used for. -- Marc ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From t.h.amundsen at usit.uio.no Wed Jan 13 17:13:58 2010 From: t.h.amundsen at usit.uio.no (Trond Hasle Amundsen) Date: Wed, 13 Jan 2010 17:13:58 +0100 Subject: check_openmanage block esmlog, check the rest of items In-Reply-To: <90f6e8271001130327h13bc498r7dd940884ef0432d@mail.gmail.com> (Natxo Asenjo's message of "Wed, 13 Jan 2010 12:27:45 +0100") References: <90f6e8271001110600t362db767u91e70510ee838389@mail.gmail.com> <15t6378afbu.fsf@tux.uio.no> <90f6e8271001112357h25b5a896m942de046695a4238@mail.gmail.com> <15t63775s3y.fsf@tux.uio.no> <90f6e8271001130327h13bc498r7dd940884ef0432d@mail.gmail.com> Message-ID: <15twrzmyoc9.fsf@tux.uio.no> Natxo Asenjo writes: >> This beta version will correctly report the storport driver being out of >> date, and I have added a new blacklist keyword 'ctrl_stdr' that disables >> these storport-driver-is-out-of-date messages. > > great stuff! They work exactly as you say they should. Thanks! Good, thanks for testing :) I have released a new version of check_openmanage, with these and other improvements. It is available from the plugin homepage: http://folk.uio.no/trondham/software/check_openmanage.html as well as NagiosExchange and MonitoringExchange. Happy monitoring, -- Trond H. Amundsen Center for Information Technology Services, University of Oslo ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mlitwin at stubhub.com Wed Jan 13 17:20:08 2010 From: mlitwin at stubhub.com (Litwin, Matthew) Date: Wed, 13 Jan 2010 09:20:08 -0700 Subject: question about running Nagios on Solaris LDom and Zone servers In-Reply-To: <771645F4ADF2C9449B7E6AE630C720018BC4BD231E@HQ-MAIL-01.scotborders.gov.uk> References: <771645F4ADF2C9449B7E6AE630C720018BC4BD2315@HQ-MAIL-01.scotborders.gov.uk> <4B4B6D0B.2090606@hp.com> <771645F4ADF2C9449B7E6AE630C720018BC4BD2317@HQ-MAIL-01.scotborders.gov.uk> <771645F4ADF2C9449B7E6AE630C720018BC4BD231E@HQ-MAIL-01.scotborders.gov.uk> Message-ID: I just pacakged up NRPE and it conveniently installs on all the zones. Today I get it going on SMF. Sent from my iPhone On Jan 13, 2010, at 2:01 AM, "Scott, Ewan" wrote: > Matthew > Thanks for this. I am going to have to give consideration to exactly > the same things you already have, so what you've written will be > helpful. > Regards > Ewan > > -----Original Message----- > From: Litwin, Matthew [mailto:mlitwin at stubhub.com] > Sent: 12 January 2010 19:01 > To: Scott, Ewan > Cc: Morris, Patrick; Nagios Mailinglist > Subject: Re: [Nagios-users] question about running Nagios on Solaris > LDom and Zone servers > > I am looking at the same situation myself and have decided that I am > going to have to run NRPE on each zone lest I rewrite all my > plugins. Now I am deciding which checks should run on the LDom and > which on the zones. It seems that disk checks are are good first > choice to run on the LDom only, especially since there are already > plugins on Nagios Exchange to do this (check_zpools.sh and such). I > am trying to figure out what would be the best way to just use the > LDom for CPU and load checking, but I don't see any plugins that > would be able to give zone specific results. > > Aside from performance concerns, the main concern I am having is > about how Nagios runs the checks and therefore how it organizes it. > The whole LDom model breaks the Nagios host model of organazing > things by host and in order to preserve this you pretty much have to > make a new set of entries in command.cfg that can map your host > names to zone names so that while you are running the commands > against the LDom Nagios organizes the resultant data by the proper > virtual host name and not just dump everything under the LDom host. > This will be some work and will add yet another layer of complexity > to an already complex system. > > I am not sure if any of this will be helpful, but it sounds like are > trying to do the same thing. > > Here are some other resources that I stumbled upon so far which > should be helpful as well: > https://s23.org/wiki/Nagios/checks/solaris_zones > http://stig.prod.dbs.melbourneit.com/ > > On Jan 12, 2010, at 2:41 AM, Scott, Ewan wrote: > >> Thanks for this. I have not decided on exactly what I'll monitor as >> I'm still installing the clients and making sure the the basic >> infrastucture works. However in outline: >> >> 1. I intend to use nrpe for the checks. >> 2. I would envisage 5-6 basic checks on the host primary LDom >> servers (4) and primary Zone servers (2). >> 3. I would expect to run up to 10 nrpe checks on each of the guest >> LDoms and Zones (25 systems). >> >> I would see that as a small system with lightweight checking but >> would appreciate comments if anyone thinks otherwise. >> >> Further down the line I'd like to add in 80+ Windows VM systems at >> which point it would become a very differnet beast. Currently I'm >> running the Nagios server on Ubuntu on an old Dell PowerEdge 1850 >> but had thought of moving it over to a VMware virtual box as it >> grows. From what you are saying you think I'd run into problems >> with Nagios on a VM like this and I'd be better keeping it on an - >> adequately powered - standalone physical box? >> >> Regards >> Ewan >> >> Version info: >> Nagios core: 3.2 >> Nagios plugins: 1.4.14 >> Nagios nrpe: 2.12 >> >> >> >> >> >> >> >> -----Original Message----- >> From: Morris, Patrick [mailto:patrick.morris at hp.com] >> Sent: 11 January 2010 18:25 >> To: Scott, Ewan >> Cc: Nagios Mailinglist >> Subject: Re: [Nagios-users] question about running Nagios on >> Solaris LDom and Zone servers >> >> Scott, Ewan wrote: >>> >>> I am about to put Nagios on both the primary LDom physical servers >>> and >>> all the virtual guest systems which run on them. Similarly I >>> intend to >>> run it on the primary host zone servers and the guest zones >>> running on >>> them. Is anyone aware of any problems - performance issues? - which >>> can result from this blanket approach? >>> >> >> Not enough information to say, but generally, yes, there are a lot of >> issues that *could* result from doing things this way, but you >> haven't >> provided enough information to guess whether that'll be an issue >> for you >> or not. >> >> Nagios can be resource intensive, but it's all dependent on what >> you're >> doing with it (how many hosts, how many services, which version >> you're >> running, etc.). There have also been a lot of timing issues reported >> with running Nagios in a VM, though I don't know whether those >> apply to >> Solaris guest zones. >> >> *** >> ******************************************************************* >> This email and any files transmitted with it are privileged, >> confidential and subject to copyright. Any unauthorised use or >> disclosure of any part of this email is prohibited. If you are not >> the intended recipient please inform the sender immediately; you >> should then delete the email and remove any copies from your system. >> The views or opinions expressed in this communication may not >> necessarily be those of Scottish Borders Council. >> Please be advised that Scottish Borders Council's incoming and >> outgoing email is subject to regular monitoring and any email may >> require to be disclosed by the Council under the provisions of the >> Freedom of Information (Scotland) Act 2002. >> >> *** >> ******************************************************************* >> >> >> --- >> --- >> --- >> --------------------------------------------------------------------- >> This SF.Net email is sponsored by the Verizon Developer Community >> Take advantage of Verizon's best-in-class app development support >> A streamlined, 14 day to market process makes app distribution fast >> and easy >> Join now and get one step closer to millions of Verizon customers >> http://p.sf.net/sfu/verizon-dev2dev >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when >> reporting any issue. >> ::: Messages without supporting info will risk being sent to /dev/ >> null > > Thanks, > Matthew Litwin > mlitwin at stubhub.com > 415.222.8475 > > > ********************************************************************** > This email and any files transmitted with it are privileged, > confidential and subject to copyright. Any unauthorised use or > disclosure of any part of this email is prohibited. If you are not > the intended recipient please inform the sender immediately; you > should then delete the email and remove any copies from your system. > The views or opinions expressed in this communication may not > necessarily be those of Scottish Borders Council. > Please be advised that Scottish Borders Council's incoming and > outgoing email is subject to regular monitoring and any email may > require to be disclosed by the Council under the provisions of the > Freedom of Information (Scotland) Act 2002. > > ********************************************************************** > ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gianluca.varenni at gmail.com Wed Jan 13 22:57:56 2010 From: gianluca.varenni at gmail.com (Gianluca Varenni) Date: Wed, 13 Jan 2010 13:57:56 -0800 Subject: Warnings/errors with check_openmanage plugin Message-ID: Hi all. check_openmanage keeps spitting a number of warnings and errors when interrogating a DELL T610 server running OMSA 6.2.0 Here is the some output from v 3.5.3 root at tinman:/home/nagios/check_openmanage-3.5.3# ./check_openmanage -H odyssey -d Use of uninitialized value in hash element at ./check_openmanage line 1474. Use of uninitialized value in hash element at ./check_openmanage line 1475. Use of uninitialized value in hash element at ./check_openmanage line 1476. Use of uninitialized value in division (/) at ./check_openmanage line 1477. Use of uninitialized value $nexus in substitution (s///) at ./check_openmanage line 899. Use of uninitialized value $nexus in substitution (s///) at ./check_openmanage line 900. Use of uninitialized value $ctrl in substitution (s///) at ./check_openmanage line 1481. Use of uninitialized value $state in string eq at ./check_openmanage line 1503. Use of uninitialized value $status in string ne at ./check_openmanage line 1503. Use of uninitialized value $id in sprintf at ./check_openmanage line 1510. Use of uninitialized value $layout in sprintf at ./check_openmanage line 1510. Use of uninitialized value $ctrl in sprintf at ./check_openmanage line 1510. Use of uninitialized value $state in sprintf at ./check_openmanage line 1510. Use of uninitialized value $status in hash element at ./check_openmanage line 1512. Use of uninitialized value within @_ in list assignment at ./check_openmanage line 422. Use of uninitialized value in hash element at ./check_openmanage line 1474. Use of uninitialized value in hash element at ./check_openmanage line 1475. Use of uninitialized value in hash element at ./check_openmanage line 1476. Use of uninitialized value in division (/) at ./check_openmanage line 1477. Use of uninitialized value $nexus in substitution (s///) at ./check_openmanage line 899. Use of uninitialized value $nexus in substitution (s///) at ./check_openmanage line 900. Use of uninitialized value $ctrl in substitution (s///) at ./check_openmanage line 1481. Use of uninitialized value $state in string eq at ./check_openmanage line 1503. Use of uninitialized value $status in string ne at ./check_openmanage line 1503. Use of uninitialized value $id in sprintf at ./check_openmanage line 1510. Use of uninitialized value $layout in sprintf at ./check_openmanage line 1510. Use of uninitialized value $ctrl in sprintf at ./check_openmanage line 1510. Use of uninitialized value $state in sprintf at ./check_openmanage line 1510. Use of uninitialized value $status in hash element at ./check_openmanage line 1512. Use of uninitialized value within @_ in list assignment at ./check_openmanage line 422. System: PowerEdge T610 ServiceTag: XXXXXXX OMSA version: 6.2.0 BIOS/date: 1.2.6 07/17/2009 Plugin version: 3.5.3 ----------------------------------------------------------------------------- Storage Components ============================================================================= STATE | ID | MESSAGE TEXT ---------+----------+-------------------------------------------------------- OK | 0 | Controller 0 [PERC 6/i Integrated] is Ready OK | 0:0:0:0 | Physical Disk 0:0:0 [159GB] on ctrl 0 is Online OK | 0:0:0:1 | Physical Disk 0:0:1 [159GB] on ctrl 0 is Online OK | 0:0:0:2 | Physical Disk 0:0:2 [750GB] on ctrl 0 is Online OK | 0:0:0:3 | Physical Disk 0:0:3 [750GB] on ctrl 0 is Online OK | 0:1:0:4 | Physical Disk 1:0:4 [750GB] on ctrl 0 is Online OK | 0:1:0:5 | Physical Disk 1:0:5 [750GB] on ctrl 0 is Online OK | 0:1:0:6 | Physical Disk 1:0:6 [750GB] on ctrl 0 is Online OK | 0:1:0:7 | Physical Disk 1:0:7 [750GB] on ctrl 0 is Ready OK | 0:0 | Logical drive 0 'Windows Disk 0' [RAID-1, 148.50 GB] on ctrl 0 is Ready Use of uninitialized value $level in hash element at ./check_openmanage line 3886. Use of uninitialized value within %reverse_exitcode in length at ./check_openmanage line 3886. Use of uninitialized value $level in hash element at ./check_openmanage line 3886. Use of uninitialized value within %reverse_exitcode in concatenation (.) or string at ./check_openmanage line 3886. | | Logical drive 0 '' [, 0.00 GB] on ctrl 0 needs attention: Use of uninitialized value $level in hash element at ./check_openmanage line 3888. Use of uninitialized value within %reverse_exitcode in hash element at ./check_openmanage line 3888. Use of uninitialized value $level in hash element at ./check_openmanage line 3886. Use of uninitialized value within %reverse_exitcode in length at ./check_openmanage line 3886. Use of uninitialized value $level in hash element at ./check_openmanage line 3886. Use of uninitialized value within %reverse_exitcode in concatenation (.) or string at ./check_openmanage line 3886. | | Logical drive 0 '' [, 0.00 GB] on ctrl 0 needs attention: Use of uninitialized value $level in hash element at ./check_openmanage line 3888. Use of uninitialized value within %reverse_exitcode in hash element at ./check_openmanage line 3888. OK | 0:1 | Logical drive 1 'Windows Disk 1' [RAID-5, 2792.50 GB] on ctrl 0 is Ready OK | 0:0 | Cache battery 0 in controller 0 is Ready OK | 0:0 | Connector 0 [SAS] on controller 0 is Ready OK | 0:1 | Connector 1 [SAS] on controller 0 is Ready OK | 0:0:0 | Enclosure 0:0:0 [Backplane] on controller 0 is Ready OK | 0:1:0 | Enclosure 0:1:0 [Backplane] on controller 0 is Ready ----------------------------------------------------------------------------- Chassis Components ============================================================================= STATE | ID | MESSAGE TEXT ---------+------+------------------------------------------------------------ OK | 1 | Memory module 1 [DIMM_A2, 2048 MB] is Ok OK | 2 | Memory module 2 [DIMM_A3, 2048 MB] is Ok OK | 1 | Chassis fan 1 [System Board FAN 3 RPM]: 2160 OK | 2 | Chassis fan 2 [System Board FAN 4 RPM]: 2160 OK | 0 | Power Supply 0 [AC]: Presence detected OK | 1 | Power Supply 1 [AC]: Presence detected OK | 0 | Temperature Probe 0 [System Board Ambient Temp] reads 22 C (min=8/3, max=42/47) OK | 0 | Processor 0 [Intel Xeon E5504 2.00GHz] is Present OK | 0 | Voltage sensor 0 [CPU1 VCORE] is Good OK | 1 | Voltage sensor 1 [CPU1 0.75 VTT PG] is Good OK | 2 | Voltage sensor 2 [System Board 1.5V PG] is Good OK | 3 | Voltage sensor 3 [System Board 1.8V PG] is Good OK | 4 | Voltage sensor 4 [System Board 3.3V PG] is Good OK | 5 | Voltage sensor 5 [System Board 5V PG] is Good OK | 6 | Voltage sensor 6 [CPU1 MEM PG] is Good OK | 7 | Voltage sensor 7 [CPU1 VTT PG] is Good OK | 8 | Voltage sensor 8 [System Board 0.9V PG] is Good OK | 9 | Voltage sensor 9 [CPU1 1.8 PLL PG] is Good OK | 10 | Voltage sensor 10 [System Board 8.0V PG] is Good OK | 11 | Voltage sensor 11 [System Board 1.1V PG] is Good OK | 12 | Voltage sensor 12 [System Board 1.0V LOM PG] is Good OK | 13 | Voltage sensor 13 [System Board 1.0V AUX PG] is Good OK | 14 | Voltage sensor 14 [System Board 1.05V PG] is Good OK | 15 | Voltage sensor 15 [PS 2 Voltage 2] is 114.000 V OK | 0 | Battery probe 0 [System Board CMOS Battery] is Presence Detected OK | 0 | Amperage probe 0 [PS 2 Current 2] reads 0.8 A OK | 1 | Amperage probe 1 [System Board System Level] reads 105 W OK | 0 | Chassis intrusion 0 detection: Ok (Not Breached) ----------------------------------------------------------------------------- Other messages ============================================================================= STATE | MESSAGE TEXT ---------+------------------------------------------------------------------- OK | ESM log health is Ok (less than 80% full) root at tinman:/home/nagios/check_openmanage-3.5.3# And here is the output from v 3.5.4 root at tinman:/home/nagios/check_openmanage-3.5.4# ./check_openmanage -H odyssey -d System: PowerEdge T610 ServiceTag: XXXXXXX OMSA version: 6.2.0 BIOS/date: 1.2.6 07/17/2009 Plugin version: 3.5.4 ----------------------------------------------------------------------------- Storage Components ============================================================================= STATE | ID | MESSAGE TEXT ---------+----------+-------------------------------------------------------- OK | 0 | Controller 0 [PERC 6/i Integrated] is Ready OK | 0:0:0:0 | Physical Disk 0:0:0 [159GB] on ctrl 0 is Online OK | 0:0:0:1 | Physical Disk 0:0:1 [159GB] on ctrl 0 is Online OK | 0:0:0:2 | Physical Disk 0:0:2 [750GB] on ctrl 0 is Online OK | 0:0:0:3 | Physical Disk 0:0:3 [750GB] on ctrl 0 is Online OK | 0:1:0:4 | Physical Disk 1:0:4 [750GB] on ctrl 0 is Online OK | 0:1:0:5 | Physical Disk 1:0:5 [750GB] on ctrl 0 is Online OK | 0:1:0:6 | Physical Disk 1:0:6 [750GB] on ctrl 0 is Online OK | 0:1:0:7 | Physical Disk 1:0:7 [750GB] on ctrl 0 is Ready OK | 0:0 | Logical drive 0 'Windows Disk 0' [RAID-1, 148.50 GB] on ctrl 0 is Ready | | Logical drive 0 '' [, 0.00 GB] on ctrl 0 needs attention: | | Logical drive 0 '' [, 0.00 GB] on ctrl 0 needs attention: OK | 0:1 | Logical drive 1 'Windows Disk 1' [RAID-5, 2792.50 GB] on ctrl 0 is Ready OK | 0:0 | Cache battery 0 in controller 0 is Ready OK | 0:0 | Connector 0 [SAS] on controller 0 is Ready OK | 0:1 | Connector 1 [SAS] on controller 0 is Ready OK | 0:0:0 | Enclosure 0:0:0 [Backplane] on controller 0 is Ready OK | 0:1:0 | Enclosure 0:1:0 [Backplane] on controller 0 is Ready ----------------------------------------------------------------------------- Chassis Components ============================================================================= STATE | ID | MESSAGE TEXT ---------+------+------------------------------------------------------------ OK | 1 | Memory module 1 [DIMM_A2, 2048 MB] is Ok OK | 2 | Memory module 2 [DIMM_A3, 2048 MB] is Ok OK | 1 | Chassis fan 1 [System Board FAN 3 RPM]: 2160 OK | 2 | Chassis fan 2 [System Board FAN 4 RPM]: 2160 OK | 0 | Power Supply 0 [AC]: Presence detected OK | 1 | Power Supply 1 [AC]: Presence detected OK | 0 | Temperature Probe 0 [System Board Ambient Temp] reads 22 C (min=8/3, max=42/47) OK | 0 | Processor 0 [Intel Xeon E5504 2.00GHz] is Present OK | 0 | Voltage sensor 0 [CPU1 VCORE] is Good OK | 1 | Voltage sensor 1 [CPU1 0.75 VTT PG] is Good OK | 2 | Voltage sensor 2 [System Board 1.5V PG] is Good OK | 3 | Voltage sensor 3 [System Board 1.8V PG] is Good OK | 4 | Voltage sensor 4 [System Board 3.3V PG] is Good OK | 5 | Voltage sensor 5 [System Board 5V PG] is Good OK | 6 | Voltage sensor 6 [CPU1 MEM PG] is Good OK | 7 | Voltage sensor 7 [CPU1 VTT PG] is Good OK | 8 | Voltage sensor 8 [System Board 0.9V PG] is Good OK | 9 | Voltage sensor 9 [CPU1 1.8 PLL PG] is Good OK | 10 | Voltage sensor 10 [System Board 8.0V PG] is Good OK | 11 | Voltage sensor 11 [System Board 1.1V PG] is Good OK | 12 | Voltage sensor 12 [System Board 1.0V LOM PG] is Good OK | 13 | Voltage sensor 13 [System Board 1.0V AUX PG] is Good OK | 14 | Voltage sensor 14 [System Board 1.05V PG] is Good OK | 15 | Voltage sensor 15 [PS 2 Voltage 2] is 114.000 V OK | 0 | Battery probe 0 [System Board CMOS Battery] is Presence Detected OK | 0 | Amperage probe 0 [PS 2 Current 2] reads 0.8 A OK | 1 | Amperage probe 1 [System Board System Level] reads 105 W OK | 0 | Chassis intrusion 0 detection: Ok (Not Breached) ----------------------------------------------------------------------------- Other messages ============================================================================= STATE | MESSAGE TEXT ---------+------------------------------------------------------------------- OK | ESM log health is Ok (less than 80% full) INTERNAL ERROR: Use of uninitialized value in hash element at ./check_openmanage line 1497. INTERNAL ERROR: Use of uninitialized value in hash element at ./check_openmanage line 1498. INTERNAL ERROR: Use of uninitialized value in hash element at ./check_openmanage line 1499. INTERNAL ERROR: Use of uninitialized value in division (/) at ./check_openmanage line 1500. INTERNAL ERROR: Use of uninitialized value $nexus in substitution (s///) at ./check_openmanage line 900. INTERNAL ERROR: Use of uninitialized value $nexus in substitution (s///) at ./check_openmanage line 901. INTERNAL ERROR: Use of uninitialized value $ctrl in substitution (s///) at ./check_openmanage line 1504. INTERNAL ERROR: Use of uninitialized value $state in string eq at ./check_openmanage line 1526. INTERNAL ERROR: Use of uninitialized value $status in string ne at ./check_openmanage line 1526. INTERNAL ERROR: Use of uninitialized value $id in sprintf at ./check_openmanage line 1533. INTERNAL ERROR: Use of uninitialized value $layout in sprintf at ./check_openmanage line 1533. INTERNAL ERROR: Use of uninitialized value $ctrl in sprintf at ./check_openmanage line 1533. INTERNAL ERROR: Use of uninitialized value $state in sprintf at ./check_openmanage line 1533. INTERNAL ERROR: Use of uninitialized value $status in hash element at ./check_openmanage line 1535. INTERNAL ERROR: Use of uninitialized value within @_ in list assignment at ./check_openmanage line 423. INTERNAL ERROR: Use of uninitialized value in hash element at ./check_openmanage line 1497. INTERNAL ERROR: Use of uninitialized value in hash element at ./check_openmanage line 1498. INTERNAL ERROR: Use of uninitialized value in hash element at ./check_openmanage line 1499. INTERNAL ERROR: Use of uninitialized value in division (/) at ./check_openmanage line 1500. INTERNAL ERROR: Use of uninitialized value $nexus in substitution (s///) at ./check_openmanage line 900. INTERNAL ERROR: Use of uninitialized value $nexus in substitution (s///) at ./check_openmanage line 901. INTERNAL ERROR: Use of uninitialized value $ctrl in substitution (s///) at ./check_openmanage line 1504. INTERNAL ERROR: Use of uninitialized value $state in string eq at ./check_openmanage line 1526. INTERNAL ERROR: Use of uninitialized value $status in string ne at ./check_openmanage line 1526. INTERNAL ERROR: Use of uninitialized value $id in sprintf at ./check_openmanage line 1533. INTERNAL ERROR: Use of uninitialized value $layout in sprintf at ./check_openmanage line 1533. INTERNAL ERROR: Use of uninitialized value $ctrl in sprintf at ./check_openmanage line 1533. INTERNAL ERROR: Use of uninitialized value $state in sprintf at ./check_openmanage line 1533. INTERNAL ERROR: Use of uninitialized value $status in hash element at ./check_openmanage line 1535. INTERNAL ERROR: Use of uninitialized value within @_ in list assignment at ./check_openmanage line 423. INTERNAL ERROR: Use of uninitialized value $level in hash element at ./check_openmanage line 3925. INTERNAL ERROR: Use of uninitialized value within %reverse_exitcode in length at ./check_openmanage line 3925. INTERNAL ERROR: Use of uninitialized value $level in hash element at ./check_openmanage line 3925. INTERNAL ERROR: Use of uninitialized value within %reverse_exitcode in concatenation (.) or string at ./check_openmanage line 3925. INTERNAL ERROR: Use of uninitialized value $level in hash element at ./check_openmanage line 3927. INTERNAL ERROR: Use of uninitialized value within %reverse_exitcode in hash element at ./check_openmanage line 3927. INTERNAL ERROR: Use of uninitialized value $level in hash element at ./check_openmanage line 3925. INTERNAL ERROR: Use of uninitialized value within %reverse_exitcode in length at ./check_openmanage line 3925. INTERNAL ERROR: Use of uninitialized value $level in hash element at ./check_openmanage line 3925. INTERNAL ERROR: Use of uninitialized value within %reverse_exitcode in concatenation (.) or string at ./check_openmanage line 3925. INTERNAL ERROR: Use of uninitialized value $level in hash element at ./check_openmanage line 3927. INTERNAL ERROR: Use of uninitialized value within %reverse_exitcode in hash element at ./check_openmanage line 3927.root at tinman:/home/nagios/check_openmanage-3.5.4# Any ideas? Have a nice day GV ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From t.h.amundsen at usit.uio.no Thu Jan 14 00:00:43 2010 From: t.h.amundsen at usit.uio.no (Trond Hasle Amundsen) Date: Thu, 14 Jan 2010 00:00:43 +0100 Subject: Warnings/errors with check_openmanage plugin In-Reply-To: (Gianluca Varenni's message of "Wed, 13 Jan 2010 13:57:56 -0800") References: Message-ID: <15tska9zk2s.fsf@tux.uio.no> "Gianluca Varenni" writes: > check_openmanage keeps spitting a number of warnings and errors when > interrogating a DELL T610 server running OMSA 6.2.0 Hmm.. All those errors seem related to virtual disks. Can you send the output from these commands: 1. On the monitored server: omreport storage vdisk controller=0 2. On the Nagios server: snmpwalk -v2c -c public odyssey 1.3.6.1.4.1.674.10893.1.20.140 The undefined values that perl complains about, should not be undefined. The commands mentioned above would be helpful for debugging this problem. Cheers, -- Trond H. Amundsen Center for Information Technology Services, University of Oslo ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gianluca.varenni at gmail.com Thu Jan 14 00:20:36 2010 From: gianluca.varenni at gmail.com (Gianluca Varenni) Date: Wed, 13 Jan 2010 15:20:36 -0800 Subject: Warnings/errors with check_openmanage plugin In-Reply-To: <15tska9zk2s.fsf@tux.uio.no> References: <15tska9zk2s.fsf@tux.uio.no> Message-ID: <5748BF419D664EF984BDBEE32111E1EB@NELSON3> Here they are -------------- List of Virtual Disks on Controller PERC 6/i Integrated (Embedded) Controller PERC 6/i Integrated (Embedded) ID : 0 Status : Ok Name : Virtual Disk 0 State : Ready HotSpare Policy violated : Not Assigned Virtual Disk Bad Blocks : Not Applicable Secured : Not Applicable Progress : Not Applicable Layout : RAID-1 Size : 148.50 GB (159450660864 bytes) Device Name : Windows Disk 0 Bus Protocol : SATA Media : HDD Read Policy : No Read Ahead Write Policy : Write Back Cache Policy : Not Applicable Stripe Element Size : 64 KB Disk Cache Policy : Enabled ID : 1 Status : Ok Name : DATA State : Ready HotSpare Policy violated : Not Assigned Virtual Disk Bad Blocks : Not Applicable Secured : Not Applicable Progress : Not Applicable Layout : RAID-5 Size : 2,792.50 GB (2998424043520 bytes) Device Name : Windows Disk 1 Bus Protocol : SATA Media : HDD Read Policy : No Read Ahead Write Policy : Write Back Cache Policy : Not Applicable Stripe Element Size : 64 KB Disk Cache Policy : Enabled ------------------ root at tinman:~# snmpwalk -v2c -c public odyssey 1.3.6.1.4.1.674.10893.1.20.140 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.1.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.1.4 = INTEGER: 4 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.2.1 = STRING: "Virtual Disk 0" SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.2.4 = STRING: "DATA" SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.3.1 = STRING: "Windows Disk 0" SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.3.4 = STRING: "Windows Disk 1" SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.4.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.4.4 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.6.1 = INTEGER: 152064 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.6.4 = INTEGER: 2859520 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.7.1 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.7.4 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.10.1 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.10.4 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.11.1 = INTEGER: 5 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.11.4 = INTEGER: 5 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.12.1 = INTEGER: 99 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.12.4 = INTEGER: 99 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.13.1 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.13.4 = INTEGER: 7 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.14.1 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.14.4 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.15.1 = INTEGER: 65536 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.15.4 = INTEGER: 65536 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.17.1 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.17.4 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.19.1 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.19.4 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.20.1 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.20.4 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.21.1 = STRING: "\\0\\0" SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.21.4 = STRING: "\\0\\1" SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.22.1 = INTEGER: 2 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.22.4 = INTEGER: 2 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.23.1 = INTEGER: 2 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.23.4 = INTEGER: 2 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.24.1 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.24.4 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.1.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.1.2 = INTEGER: 2 SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.1.15 = INTEGER: 15 SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.1.16 = INTEGER: 16 SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.1.17 = INTEGER: 17 SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.1.18 = INTEGER: 18 SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.1.19 = INTEGER: 19 SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.1.20 = INTEGER: 20 SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.2.1 = STRING: "Physical Disk 0:0:0" SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.2.2 = STRING: "Physical Disk 0:0:1" SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.2.15 = STRING: "Physical Disk 0:0:2" SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.2.16 = STRING: "Physical Disk 0:0:3" SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.2.17 = STRING: "Physical Disk 1:0:4" SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.2.18 = STRING: "Physical Disk 1:0:5" SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.2.19 = STRING: "Physical Disk 1:0:6" SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.2.20 = STRING: "Physical Disk 1:0:7" SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.3.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.3.2 = INTEGER: 2 SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.3.15 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.3.16 = INTEGER: 4 SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.3.17 = INTEGER: 5 SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.3.18 = INTEGER: 6 SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.3.19 = INTEGER: 7 SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.3.20 = INTEGER: 8 SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.4.1 = STRING: "Virtual Disk 0" SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.4.2 = STRING: "Virtual Disk 0" SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.4.15 = STRING: "DATA" SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.4.16 = STRING: "DATA" SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.4.17 = STRING: "DATA" SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.4.18 = STRING: "DATA" SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.4.19 = STRING: "DATA" SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.4.20 = STRING: "DATA" SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.5.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.5.2 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.5.15 = INTEGER: 4 SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.5.16 = INTEGER: 4 SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.5.17 = INTEGER: 4 SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.5.18 = INTEGER: 4 SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.5.19 = INTEGER: 4 SNMPv2-SMI::enterprises.674.10893.1.20.140.3.1.5.20 = INTEGER: 4 root at tinman:~# Have a nice day GV -------------------------------------------------- From: "Trond Hasle Amundsen" Sent: Wednesday, January 13, 2010 3:00 PM To: "Gianluca Varenni" Cc: Subject: Re: [Nagios-users] Warnings/errors with check_openmanage plugin > "Gianluca Varenni" writes: > >> check_openmanage keeps spitting a number of warnings and errors when >> interrogating a DELL T610 server running OMSA 6.2.0 > > Hmm.. All those errors seem related to virtual disks. Can you send the > output from these commands: > > 1. On the monitored server: > omreport storage vdisk controller=0 > > 2. On the Nagios server: > snmpwalk -v2c -c public odyssey 1.3.6.1.4.1.674.10893.1.20.140 > > The undefined values that perl complains about, should not be > undefined. The commands mentioned above would be helpful for debugging > this problem. > > Cheers, > -- > Trond H. Amundsen > Center for Information Technology Services, University of Oslo ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From kelseysmith at caseyfamilyservices.org Thu Jan 14 02:11:53 2010 From: kelseysmith at caseyfamilyservices.org (kelseysmith at caseyfamilyservices.org) Date: Wed, 13 Jan 2010 20:11:53 -0500 Subject: AUTO: Kelsey Smith/CFS01/CFS is out of the office. (returning 01/14/2010) Message-ID: I am out of the office until 01/14/2010. I'm out of the office today for jury duty. I'll respond to any email I receive tomorrow at the latest depending on whether I am selected for a case. Thank you. Note: This is an automated response to your message "Nagios-users Digest, Vol 44, Issue 6" sent on 1/13/2010 4:58:14 PM. This is the only notification you will receive while this person is away. ------------------------------------------------------ Confirmed virus-free by mymssp.net see www.anchortechnologies.com for more information. ------------------------------------------------------ -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gabriel at impactteachers.com Thu Jan 14 10:41:06 2010 From: gabriel at impactteachers.com (Mr Gabriel) Date: Thu, 14 Jan 2010 09:41:06 +0000 Subject: Monitor Axis Cameras -- Howto? Message-ID: <4B4EE6B2.2020802@impactteachers.com> Dear All, I would like to know if it is possible to monitor Axis based security cameras via Nagios? ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From t.h.amundsen at usit.uio.no Thu Jan 14 12:12:41 2010 From: t.h.amundsen at usit.uio.no (Trond Hasle Amundsen) Date: Thu, 14 Jan 2010 12:12:41 +0100 Subject: Warnings/errors with check_openmanage plugin In-Reply-To: <5748BF419D664EF984BDBEE32111E1EB@NELSON3> (Gianluca Varenni's message of "Wed, 13 Jan 2010 15:20:36 -0800") References: <15tska9zk2s.fsf@tux.uio.no> <5748BF419D664EF984BDBEE32111E1EB@NELSON3> Message-ID: <15t3a29x7me.fsf@tux.uio.no> "Gianluca Varenni" writes: > Here they are > > -------------- > List of Virtual Disks on Controller PERC 6/i Integrated (Embedded) > > Controller PERC 6/i Integrated (Embedded) > ID : 0 > Status : Ok > Name : Virtual Disk 0 > State : Ready > HotSpare Policy violated : Not Assigned > Virtual Disk Bad Blocks : Not Applicable > Secured : Not Applicable > Progress : Not Applicable > Layout : RAID-1 > Size : 148.50 GB (159450660864 bytes) > Device Name : Windows Disk 0 > Bus Protocol : SATA > Media : HDD > Read Policy : No Read Ahead > Write Policy : Write Back > Cache Policy : Not Applicable > Stripe Element Size : 64 KB > Disk Cache Policy : Enabled > > ID : 1 > Status : Ok > Name : DATA > State : Ready > HotSpare Policy violated : Not Assigned > Virtual Disk Bad Blocks : Not Applicable > Secured : Not Applicable > Progress : Not Applicable > Layout : RAID-5 > Size : 2,792.50 GB (2998424043520 bytes) > Device Name : Windows Disk 1 > Bus Protocol : SATA > Media : HDD > Read Policy : No Read Ahead > Write Policy : Write Back > Cache Policy : Not Applicable > Stripe Element Size : 64 KB > Disk Cache Policy : Enabled Ok, this looks perfectly normal. > root at tinman:~# snmpwalk -v2c -c public odyssey > 1.3.6.1.4.1.674.10893.1.20.140 > SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.1.1 = INTEGER: 1 > SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.1.4 = INTEGER: 4 > SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.2.1 = STRING: "Virtual Disk > 0" > SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.2.4 = STRING: "DATA" > SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.3.1 = STRING: "Windows Disk > 0" > SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.3.4 = STRING: "Windows Disk > 1" [...] The results from snmpwalk also looks OK, except for the fact that the last number jumps from 1 to 4. Where is 2 and 3? The plugin assumes that these indexes in the OIDs are sequential, which is clearly wrong. Though this situation is rare (never seen it before) the plugin should deal and the fact that it doesn't is a bug. I have a modified version of the plugin available here: http://folk.uio.no/trondham/tmp/ Can you give it a try to see if this one performs better? I'm curious about the cause of this problem. Have you perhaps created a couple of virtual disks that were later deleted? It would be interesting to see if a restart of OMSA, reboot or powercycle resets the indexes, but please help debug the plugin before attempting this :) Cheers, -- Trond H. Amundsen Center for Information Technology Services, University of Oslo ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lecb79 at gmail.com Thu Jan 14 14:33:53 2010 From: lecb79 at gmail.com (Eduardo Barreto) Date: Thu, 14 Jan 2010 11:33:53 -0200 Subject: Checking DHCP Message-ID: Hi All, Does anybody knows or have a clue on how to check a dhcp server, I've tried direct no success and then I'm trying to monitor it through nrpe. I'll appreciate any help Thanks in advance Luiz Eduardo -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Thu Jan 14 15:02:49 2010 From: marc at ena.com (Marc Powell) Date: Thu, 14 Jan 2010 08:02:49 -0600 Subject: Monitor Axis Cameras -- Howto? In-Reply-To: <4B4EE6B2.2020802@impactteachers.com> References: <4B4EE6B2.2020802@impactteachers.com> Message-ID: On Jan 14, 2010, at 3:41 AM, Mr Gabriel wrote: > Dear All, > > I would like to know if it is possible to monitor Axis based security > cameras via Nagios? What are you wanting to monitor on them? It's been a long time since I've used one but the ones I'm familiar with (similar to the current Axis 210 network camera) had an IP, so you could use check_ping to ping it and it had a web server so you could use check_http to make sure that was answering. You could probably come up with more sophisticated monitoring of the web interface with webinject but I'm not sure what additional value would be gained. -- Marc ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Thu Jan 14 15:12:52 2010 From: marc at ena.com (Marc Powell) Date: Thu, 14 Jan 2010 08:12:52 -0600 Subject: Checking DHCP In-Reply-To: References: Message-ID: <96D3C579-A4A8-4C58-9D8A-44AF19191993@ena.com> On Jan 14, 2010, at 7:33 AM, Eduardo Barreto wrote: > Hi All, > > Does anybody knows or have a clue on how to check a dhcp server, I've tried direct no success and then I'm trying to monitor it through nrpe. > I'll appreciate any help There's a check_dhcp plugin in the standard plugins to verify that a DHCP server his handing out leases. You need to run it from a host on the LAN that your DHCP server answers on. In most cases you cannot run this plugin on the DHCP server itself as they are most often configured *not* to give answers to themselves. If you're limited to running something on the DHCP server itself, you'll likely only be able to look to see if the DHCP process/service is running. How that's done depends on the OS of the server and how you can get to it from the Nagios machine. -- Marc ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Charles.Breite at altertrading.com Thu Jan 14 15:36:48 2010 From: Charles.Breite at altertrading.com (Charles Breite) Date: Thu, 14 Jan 2010 08:36:48 -0600 Subject: Monitor Axis Cameras -- Howto? In-Reply-To: References: <4B4EE6B2.2020802@impactteachers.com> Message-ID: <5D490E0402B4D14F836B5C4436D5949ACB3065@VMEXCHANGE2.alterscrap.com> We have hundreds deployed and we only use ping to monitor them. -----Original Message----- From: Marc Powell [mailto:marc at ena.com] Sent: Thursday, January 14, 2010 8:03 AM To: Nagios Mailinglist Subject: Re: [Nagios-users] Monitor Axis Cameras -- Howto? On Jan 14, 2010, at 3:41 AM, Mr Gabriel wrote: > Dear All, > > I would like to know if it is possible to monitor Axis based security > cameras via Nagios? What are you wanting to monitor on them? It's been a long time since I've used one but the ones I'm familiar with (similar to the current Axis 210 network camera) had an IP, so you could use check_ping to ping it and it had a web server so you could use check_http to make sure that was answering. You could probably come up with more sophisticated monitoring of the web interface with webinject but I'm not sure what additional value would be gained. -- Marc ------------------------------------------------------------------------ ------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From cosmin.neagu at omnilogic.ro Thu Jan 14 16:44:50 2010 From: cosmin.neagu at omnilogic.ro (Cosmin Neagu) Date: Thu, 14 Jan 2010 17:44:50 +0200 Subject: Beginer questions about check_http Message-ID: <4B4F3BF2.9030901@omnilogic.ro> Hello everybody, I'm using check_http to monitor a wab server where we have more than 3 web address. Like: www.example1.com www.example2.com www.example3.com...are all on the same server. Is there a way to check the status for each site? I'm a little confused from reding the help of check_http, i have tried like: libexec/check_http -H 194.81.149.12 -p 80 --url=/www.example1.com libexec/check_http -H 194.81.149.12 -p 80 -P www.example1.com libexec/check_http -H 194.81.149.12 -p 80 -s www.example1.com But the output is not what i'm expecting. Can someone tell me if this is even posible? I think it should be something like this: cosmin at cacti-1:/usr/local/nagios> libexec/check_http -H 194.81.149.12 -p 80 *-u www.example1.com* HTTP WARNING: HTTP/1.1 400 Bad Request But it gives me bad request, and i don't know why. PS: of course that all the testing is done with the real web site and the correct ip address, not exemple1.com. -- Cosmin Neagu NOC Team Leader Str. I. G. Duca nr 36 Otopeni, Judetul Ilfov, 075100 Romania Tel: 021 303 3159 / 0732 669 193 www.omnilogic.ro -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From johannes at dagemark.com Thu Jan 14 16:25:52 2010 From: johannes at dagemark.com (Johannes Dagemark) Date: Thu, 14 Jan 2010 16:25:52 +0100 Subject: Monitor Axis Cameras -- Howto? In-Reply-To: <5D490E0402B4D14F836B5C4436D5949ACB3065@VMEXCHANGE2.alterscrap.com> References: <4B4EE6B2.2020802@impactteachers.com> <5D490E0402B4D14F836B5C4436D5949ACB3065@VMEXCHANGE2.alterscrap.com> Message-ID: <4B4F3780.9000305@dagemark.com> Some cameras has the possibility to detect motion and upload a snapshot over ftp. I once configured a camera to do this and then checked for newly uploaded files. Fairly simple and also pretty cool. With a custom plugin or by using urlize you could even get a link to a mugshot of the intruder attached to your email alert :) Cheers Johannes Dagemark On 2010-01-14 15:36, Charles Breite wrote: > We have hundreds deployed and we only use ping to monitor them. > > -----Original Message----- > From: Marc Powell [mailto:marc at ena.com] > Sent: Thursday, January 14, 2010 8:03 AM > To: Nagios Mailinglist > Subject: Re: [Nagios-users] Monitor Axis Cameras -- Howto? > > > On Jan 14, 2010, at 3:41 AM, Mr Gabriel wrote: > > >> Dear All, >> >> I would like to know if it is possible to monitor Axis based security >> cameras via Nagios? >> > What are you wanting to monitor on them? It's been a long time since > I've used one but the ones I'm familiar with (similar to the current > Axis 210 network camera) had an IP, so you could use check_ping to ping > it and it had a web server so you could use check_http to make sure that > was answering. You could probably come up with more sophisticated > monitoring of the web interface with webinject but I'm not sure what > additional value would be gained. > > -- > Marc > ------------------------------------------------------------------------ > ------ > Throughout its 18-year history, RSA Conference consistently attracts the > world's best and brightest in the field, creating opportunities for > Conference > attendees to learn about information security's most important issues > through > interactions with peers, luminaries and emerging and established > companies. > http://p.sf.net/sfu/rsaconf-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > ------------------------------------------------------------------------------ > Throughout its 18-year history, RSA Conference consistently attracts the > world's best and brightest in the field, creating opportunities for Conference > attendees to learn about information security's most important issues through > interactions with peers, luminaries and emerging and established companies. > http://p.sf.net/sfu/rsaconf-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Mvh Johannes Dagemark JD Consulting (op5 service partner) Phone: +46733709024 Email: johannes at dagemark.com ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Thu Jan 14 17:54:01 2010 From: nagios at flatto.net (Assaf Flatto) Date: Thu, 14 Jan 2010 16:54:01 +0000 Subject: Beginer questions about check_http In-Reply-To: <4B4F3BF2.9030901@omnilogic.ro> References: <4B4F3BF2.9030901@omnilogic.ro> Message-ID: <4B4F4C29.2050100@flatto.net> Cosmin Neagu wrote: > Hello everybody, > > I'm using check_http to monitor a wab server where we have more than 3 > web address. Like: > www.example1.com > www.example2.com > www.example3.com...are all on the same server. > > > Is there a way to check the status for each site? I'm a little > confused from reding the help of check_http, i have tried like: > libexec/check_http -H 194.81.149.12 -p 80 --url=/www.example1.com > libexec/check_http -H 194.81.149.12 -p 80 -P www.example1.com > libexec/check_http -H 194.81.149.12 -p 80 -s www.example1.com > But the output is not what i'm expecting. > > Can someone tell me if this is even posible? > > I think it should be something like this: > cosmin at cacti-1:/usr/local/nagios> libexec/check_http -H 194.81.149.12 > -p 80 *-u www.example1.com* > HTTP WARNING: HTTP/1.1 400 Bad Request > > But it gives me bad request, and i don't know why. > PS: of course that all the testing is done with the real web site and > the correct ip address, not exemple1.com. > The best way to do that is to query the websites with the FQDN in the host address : libexec/check_http -H www.example.com -p 80 that will resolve and send the query to the server with the URL and will trigger the vhost reply on the target host . Assaf ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gianluca.varenni at gmail.com Thu Jan 14 18:28:06 2010 From: gianluca.varenni at gmail.com (Gianluca Varenni) Date: Thu, 14 Jan 2010 09:28:06 -0800 Subject: Warnings/errors with check_openmanage plugin In-Reply-To: <15t3a29x7me.fsf@tux.uio.no> References: <15tska9zk2s.fsf@tux.uio.no><5748BF419D664EF984BDBEE32111E1EB@NELSON3> <15t3a29x7me.fsf@tux.uio.no> Message-ID: <2FB477CB80334499913431B28A0FFE16@NELSON3> -------------------------------------------------- From: "Trond Hasle Amundsen" Sent: Thursday, January 14, 2010 3:12 AM To: "Gianluca Varenni" Cc: Subject: Re: [Nagios-users] Warnings/errors with check_openmanage plugin ..... > > The results from snmpwalk also looks OK, except for the fact that the > last number jumps from 1 to 4. Where is 2 and 3? The plugin assumes that > these indexes in the OIDs are sequential, which is clearly wrong. Though > this situation is rare (never seen it before) the plugin should deal and > the fact that it doesn't is a bug. > > I have a modified version of the plugin available here: > > http://folk.uio.no/trondham/tmp/ > > Can you give it a try to see if this one performs better? This new version works ok. Thanks! > > I'm curious about the cause of this problem. Have you perhaps created a > couple of virtual disks that were later deleted? Yes, definitely. I'm still playing with the configuration and testing of this server, and I've deleted and recreated some virtual disks directly from OMSA. > It would be interesting > to see if a restart of OMSA, reboot or powercycle resets the indexes, > but please help debug the plugin before attempting this :) I tried restarting the SNMP service on the server, and now I get a more usual output from snmpwalk SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.1.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.1.2 = INTEGER: 2 SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.2.1 = STRING: "Virtual Disk 0" SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.2.2 = STRING: "DATA" SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.3.1 = STRING: "Windows Disk 0" SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.3.2 = STRING: "Windows Disk 1" Have a nice day GV > > Cheers, > -- > Trond H. Amundsen > Center for Information Technology Services, University of Oslo ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gianluca.varenni at gmail.com Thu Jan 14 19:30:49 2010 From: gianluca.varenni at gmail.com (Gianluca Varenni) Date: Thu, 14 Jan 2010 10:30:49 -0800 Subject: Warnings/errors with check_openmanage plugin In-Reply-To: <15t3a29x7me.fsf@tux.uio.no> References: <15tska9zk2s.fsf@tux.uio.no><5748BF419D664EF984BDBEE32111E1EB@NELSON3> <15t3a29x7me.fsf@tux.uio.no> Message-ID: <5FF760C88D854D76A18FB48021D45CF0@NELSON3> I spoke too soon. The new build works ok on my T610 server, but it generates an internal error on a 750 server root at tinman:/home/nagios/plugins# ./check_openmanage -H homer -t 180 -d System: PowerEdge 750 ServiceTag: XXXXXXX OMSA version: 5.0.0 BIOS/date: XXXXXXXXXXXXXX Plugin version: 3.5.5-beta1 ----------------------------------------------------------------------------- Storage Components ============================================================================= STATE | ID | MESSAGE TEXT ---------+----------+-------------------------------------------------------- OK | 0 | Controller 0 [CERC SATA 1.5/6ch] is Ready OK | 0:0:0 | Array Disk 0:0 [250GB] on ctrl -1 is Online OK | 0:0:1 | Array Disk 0:1 [250GB] on ctrl -1 is Online OK | 0:0 | Logical drive 0 'Windows Disk 0' [RAID-1, 232.80 GB] on ctrl 0 is Ready OK | 0:0 | Connector 0 [SATA] on controller 0 is Ready ----------------------------------------------------------------------------- Chassis Components ============================================================================= STATE | ID | MESSAGE TEXT ---------+------+------------------------------------------------------------ OK | 1 | Memory module 1 [DIMM1_A, 512 MB] is Ok OK | 2 | Memory module 2 [DIMM2_A, 512 MB] is Ok OK | 3 | Memory module 3 [DIMM1_B, 512 MB] is Ok OK | 4 | Memory module 4 [DIMM2_B, 512 MB] is Ok OK | 1 | Chassis fan 1 [System Blower]: 3479 OK | 2 | Chassis fan 2 [System Memory Fan]: 8881 OK | 3 | Chassis fan 3 [System PCI Fan]: 6250 OK | 0 | Temperature Probe 0 [System Temp] reads 24 C (min=10/5, max=45/50) OK | 1 | Temperature Probe 1 [CPU Temp] reads 35 C (min=10/5, max=80/85) OK | 0 | Processor 0 [Intel Pentium 4 3.00GHz] is Enabled OK | 0 | Voltage sensor 0 [CPU Voltage] is 1.310 V OK | 1 | Voltage sensor 1 [Battery Voltage] is 3.136 V OK | 2 | Voltage sensor 2 [2.6V MB] is 2.597 V OK | 3 | Voltage sensor 3 [VTT Voltage] is 1.298 V OK | 4 | Voltage sensor 4 [3.3V MB] is 3.356 V OK | 5 | Voltage sensor 5 [3.3V AUX] is 3.373 V OK | 6 | Voltage sensor 6 [5V MB] is 4.992 V OK | 7 | Voltage sensor 7 [12V MB] is 12.250 V OK | 8 | Voltage sensor 8 [5V AUX] is 5.031 V OK | 9 | Voltage sensor 9 [2.5V VGA] is 2.503 V OK | 10 | Voltage sensor 10 [1.5V MB] is 1.485 V OK | 11 | Voltage sensor 11 [1.2V LAN1] is 1.225 V OK | 12 | Voltage sensor 12 [1.2V LAN2] is 1.205 V OK | 0 | Chassis intrusion 0 detection: Ok (Not Breached) ----------------------------------------------------------------------------- Other messages ============================================================================= STATE | MESSAGE TEXT ---------+------------------------------------------------------------------- OK | ESM log health is Ok (less than 80% full) INTERNAL ERROR: Use of uninitialized value in join or string at ./check_openmanage line 1345. INTERNAL ERROR: Use of uninitialized value in join or string at ./check_openmanage line 1345. Let me know if you need any snmpwalk output. Have a nice day GV -------------------------------------------------- From: "Trond Hasle Amundsen" Sent: Thursday, January 14, 2010 3:12 AM To: "Gianluca Varenni" Cc: Subject: Re: [Nagios-users] Warnings/errors with check_openmanage plugin > "Gianluca Varenni" writes: > >> Here they are >> >> -------------- >> List of Virtual Disks on Controller PERC 6/i Integrated (Embedded) >> >> Controller PERC 6/i Integrated (Embedded) >> ID : 0 >> Status : Ok >> Name : Virtual Disk 0 >> State : Ready >> HotSpare Policy violated : Not Assigned >> Virtual Disk Bad Blocks : Not Applicable >> Secured : Not Applicable >> Progress : Not Applicable >> Layout : RAID-1 >> Size : 148.50 GB (159450660864 bytes) >> Device Name : Windows Disk 0 >> Bus Protocol : SATA >> Media : HDD >> Read Policy : No Read Ahead >> Write Policy : Write Back >> Cache Policy : Not Applicable >> Stripe Element Size : 64 KB >> Disk Cache Policy : Enabled >> >> ID : 1 >> Status : Ok >> Name : DATA >> State : Ready >> HotSpare Policy violated : Not Assigned >> Virtual Disk Bad Blocks : Not Applicable >> Secured : Not Applicable >> Progress : Not Applicable >> Layout : RAID-5 >> Size : 2,792.50 GB (2998424043520 bytes) >> Device Name : Windows Disk 1 >> Bus Protocol : SATA >> Media : HDD >> Read Policy : No Read Ahead >> Write Policy : Write Back >> Cache Policy : Not Applicable >> Stripe Element Size : 64 KB >> Disk Cache Policy : Enabled > > Ok, this looks perfectly normal. > >> root at tinman:~# snmpwalk -v2c -c public odyssey >> 1.3.6.1.4.1.674.10893.1.20.140 >> SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.1.1 = INTEGER: 1 >> SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.1.4 = INTEGER: 4 >> SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.2.1 = STRING: "Virtual >> Disk >> 0" >> SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.2.4 = STRING: "DATA" >> SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.3.1 = STRING: "Windows >> Disk >> 0" >> SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.3.4 = STRING: "Windows >> Disk >> 1" > [...] > > The results from snmpwalk also looks OK, except for the fact that the > last number jumps from 1 to 4. Where is 2 and 3? The plugin assumes that > these indexes in the OIDs are sequential, which is clearly wrong. Though > this situation is rare (never seen it before) the plugin should deal and > the fact that it doesn't is a bug. > > I have a modified version of the plugin available here: > > http://folk.uio.no/trondham/tmp/ > > Can you give it a try to see if this one performs better? > > I'm curious about the cause of this problem. Have you perhaps created a > couple of virtual disks that were later deleted? It would be interesting > to see if a restart of OMSA, reboot or powercycle resets the indexes, > but please help debug the plugin before attempting this :) > > Cheers, > -- > Trond H. Amundsen > Center for Information Technology Services, University of Oslo ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mike-nagios at 5dninja.net Thu Jan 14 22:56:38 2010 From: mike-nagios at 5dninja.net (Mike Lindsey) Date: Thu, 14 Jan 2010 13:56:38 -0800 Subject: nagios blocking on notifications? In-Reply-To: References: Message-ID: <4B4F9316.5080904@5dninja.net> I've got a high volume site. Everything seems to keep up reasonably well, unless there are a good number of state changes. Once services start changing state, and notifications start getting sent out, nagios falls behind. Did some digging in the logs and it looks like while a batch of notifications are being sent out, it's rate limiting to about one per five seconds. Also, from the first notification for a service to the last notification for that service, nothing else is written to the logs. Since a typical notification goes out to 15+ people, that's over a minute with no service check handling. Is there something going on under the hood that I'm not aware of (like, is it just not doing the log writing, but still doing the passive service check handling, and there's something else causing my latency?) Is that delay configurable? I don't see anything in the docs for that. I've even set my notification script to just call and background a secondary script, to try and see if it wasn't a delay in the notification script, but that seemed not to do anything at all. Should I be forking the notification script instead? Here's a log snippet: [1263505735] EXTERNAL COMMAND: PROCESS_SERVICE_CHECK_RESULT;;System Check;0;OK load mem ntp swap cfengine disk| [1263505735] EXTERNAL COMMAND: PROCESS_SERVICE_CHECK_RESULT;;System Check;0;OK load mem ntp swap cfengine disk| [1263505735] EXTERNAL COMMAND: PROCESS_SERVICE_CHECK_RESULT;;System Check;1;WARNING [swap utilization 25%] [/data/ at 77% (inodes 0%)]| [1263505735] PASSIVE SERVICE CHECK: ;check_mtime-redlist.txt;0;OK - redlist.txt 102 seconds old [1263505735] PASSIVE SERVICE CHECK: ;pre_queuedepth;2;CRITICAL - pre_queuedepth status: 2159 > 500 [1263505735] SERVICE NOTIFICATION: ;;pre_queuedepth;CRITICAL;notify-by-email;CRITICAL - pre_queuedepth status: 2159 500 [1263505741] SERVICE NOTIFICATION: ;;pre_queuedepth;CRITICAL;notify-by-email;CRITICAL - pre_queuedepth status: 2159 500 The SERVICE NOTIFICATION entries keep rolling in every 5-6 seconds for the next minute+, then it goes back to it's usual happy speed. Is this an artifact of the way it logs, or is the whole system choking while it sends email? I've searched the list archives and not found anything on this. -- Mike Lindsey ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mike-nagios at 5dninja.net Fri Jan 15 03:22:31 2010 From: mike-nagios at 5dninja.net (Mike Lindsey) Date: Thu, 14 Jan 2010 18:22:31 -0800 Subject: nagios blocking on notifications? In-Reply-To: <4B4F9316.5080904@5dninja.net> References: <4B4F9316.5080904@5dninja.net> Message-ID: <4B4FD167.90001@5dninja.net> Turns out nagios doesn't fork before handling notifications, and also waits for the children of any notification commands to exit, so forking inside my notification script won't help. I took the part of the script that was taking 5-6 seconds to complete and added in a cache mechanism, which changed the 90+ second notification cycle, to a 6-8 second notification cycle. Might be overkill, but I've also wrapped some fork() logic around the service_notification() call inside handle_async_service_check_result().. Compiles and runs, I'll stress test it tonight and see how it does with real load, tomorrow. Also, if there's a better way to do this, I'm all ears. Mike Lindsey wrote: > I've got a high volume site. Everything seems to keep up reasonably > well, unless there are a good number of state changes. Once services > start changing state, and notifications start getting sent out, nagios > falls behind. > > Did some digging in the logs and it looks like while a batch of > notifications are being sent out, it's rate limiting to about one per > five seconds. Also, from the first notification for a service to the > last notification for that service, nothing else is written to the logs. > > Since a typical notification goes out to 15+ people, that's over a > minute with no service check handling. > > Is there something going on under the hood that I'm not aware of (like, > is it just not doing the log writing, but still doing the passive > service check handling, and there's something else causing my latency?) > > Is that delay configurable? I don't see anything in the docs for that. > > I've even set my notification script to just call and background a > secondary script, to try and see if it wasn't a delay in the > notification script, but that seemed not to do anything at all. Should > I be forking the notification script instead? > > Here's a log snippet: > [1263505735] EXTERNAL COMMAND: > PROCESS_SERVICE_CHECK_RESULT;;System Check;0;OK load mem ntp > swap cfengine disk| > [1263505735] EXTERNAL COMMAND: > PROCESS_SERVICE_CHECK_RESULT;;System Check;0;OK load mem ntp > swap cfengine disk| > [1263505735] EXTERNAL COMMAND: > PROCESS_SERVICE_CHECK_RESULT;;System Check;1;WARNING [swap > utilization 25%] [/data/ at 77% (inodes 0%)]| > [1263505735] PASSIVE SERVICE CHECK: > ;check_mtime-redlist.txt;0;OK - redlist.txt 102 seconds old > [1263505735] PASSIVE SERVICE CHECK: ;pre_queuedepth;2;CRITICAL > - pre_queuedepth status: 2159 > 500 > > [1263505735] SERVICE NOTIFICATION: > ;;pre_queuedepth;CRITICAL;notify-by-email;CRITICAL - > pre_queuedepth status: 2159 500 > [1263505741] SERVICE NOTIFICATION: > ;;pre_queuedepth;CRITICAL;notify-by-email;CRITICAL - > pre_queuedepth status: 2159 500 > > > The SERVICE NOTIFICATION entries keep rolling in every 5-6 seconds for > the next minute+, then it goes back to it's usual happy speed. > > Is this an artifact of the way it logs, or is the whole system choking > while it sends email? I've searched the list archives and not found > anything on this. > -- Mike Lindsey ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gabriel at impactteachers.com Fri Jan 15 11:03:00 2010 From: gabriel at impactteachers.com (Mr Gabriel) Date: Fri, 15 Jan 2010 10:03:00 +0000 Subject: Monitor Axis Cameras -- Howto? In-Reply-To: References: <4B4EE6B2.2020802@impactteachers.com> Message-ID: <4B503D54.2080800@impactteachers.com> On 14/01/2010 14:02, Marc Powell wrote: > On Jan 14, 2010, at 3:41 AM, Mr Gabriel wrote: > > >> Dear All, >> >> I would like to know if it is possible to monitor Axis based security >> cameras via Nagios? >> > What are you wanting to monitor on them? It's been a long time since I've used one but the ones I'm familiar with (similar to the current Axis 210 network camera) had an IP, so you could use check_ping to ping it and it had a web server so you could use check_http to make sure that was answering. You could probably come up with more sophisticated monitoring of the web interface with webinject but I'm not sure what additional value would be gained. > > -- > Marc > Thank you for your reply. I would like to monitor if they have detected any motion. For example, if two have detected motion in the last 5 minutes after hours, we would like to receive an alert to this effect. ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From t.h.amundsen at usit.uio.no Fri Jan 15 11:05:14 2010 From: t.h.amundsen at usit.uio.no (Trond Hasle Amundsen) Date: Fri, 15 Jan 2010 11:05:14 +0100 Subject: Warnings/errors with check_openmanage plugin In-Reply-To: <2FB477CB80334499913431B28A0FFE16@NELSON3> (Gianluca Varenni's message of "Thu, 14 Jan 2010 09:28:06 -0800") References: <15tska9zk2s.fsf@tux.uio.no> <5748BF419D664EF984BDBEE32111E1EB@NELSON3> <15t3a29x7me.fsf@tux.uio.no> <2FB477CB80334499913431B28A0FFE16@NELSON3> Message-ID: <15taawfptt1.fsf@tux.uio.no> "Gianluca Varenni" writes: >> The results from snmpwalk also looks OK, except for the fact that the >> last number jumps from 1 to 4. Where is 2 and 3? The plugin assumes that >> these indexes in the OIDs are sequential, which is clearly wrong. Though >> this situation is rare (never seen it before) the plugin should deal and >> the fact that it doesn't is a bug. >> >> I have a modified version of the plugin available here: >> >> http://folk.uio.no/trondham/tmp/ >> >> Can you give it a try to see if this one performs better? > > This new version works ok. Thanks! Great :) Thanks for testing and for the bug report. >> I'm curious about the cause of this problem. Have you perhaps created a >> couple of virtual disks that were later deleted? > > Yes, definitely. I'm still playing with the configuration and testing of > this server, and I've deleted and recreated some virtual disks directly from > OMSA. > >> It would be interesting >> to see if a restart of OMSA, reboot or powercycle resets the indexes, >> but please help debug the plugin before attempting this :) > > I tried restarting the SNMP service on the server, and now I get a more > usual output from snmpwalk > > SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.1.1 = INTEGER: 1 > SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.1.2 = INTEGER: 2 > SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.2.1 = STRING: "Virtual Disk > 0" > SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.2.2 = STRING: "DATA" > SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.3.1 = STRING: "Windows Disk > 0" > SNMPv2-SMI::enterprises.674.10893.1.20.140.1.1.3.2 = STRING: "Windows Disk > 1" Rather surprising that all it took was an SNMP restart. But good to know anyway :) Cheers, -- Trond Hasle Amundsen Gruppe for basis systemdrift (BSD), SAPP, USIT Tel. +47 22840058 (office) ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gabriel at impactteachers.com Fri Jan 15 11:10:12 2010 From: gabriel at impactteachers.com (Mr Gabriel) Date: Fri, 15 Jan 2010 10:10:12 +0000 Subject: Monitor Axis Cameras -- Howto? In-Reply-To: <4B4F3780.9000305@dagemark.com> References: <4B4EE6B2.2020802@impactteachers.com> <5D490E0402B4D14F836B5C4436D5949ACB3065@VMEXCHANGE2.alterscrap.com> <4B4F3780.9000305@dagemark.com> Message-ID: <4B503F04.80203@impactteachers.com> > On 2010-01-14 15:36, Charles Breite wrote: > >> We have hundreds deployed and we only use ping to monitor them. >> >> -----Original Message----- >> From: Marc Powell [mailto:marc at ena.com] >> Sent: Thursday, January 14, 2010 8:03 AM >> To: Nagios Mailinglist >> Subject: Re: [Nagios-users] Monitor Axis Cameras -- Howto? >> >> >> On Jan 14, 2010, at 3:41 AM, Mr Gabriel wrote: >> >> >> >>> Dear All, >>> >>> I would like to know if it is possible to monitor Axis based security >>> cameras via Nagios? >>> >>> >> On 14/01/2010 15:25, Johannes Dagemark wrote: >>> Some cameras has the possibility to detect motion and upload a snapshot >>> over ftp. I once configured a camera to do this and then checked for >>> newly uploaded files. Fairly simple and also pretty cool. >>> >>> With a custom plugin or by using urlize you could even get a link to a >>> mugshot of the intruder attached to your email alert :) > Cheers > Johannes Dagemark This is very similar to what I would like to achieve. The cameras have that feature built in already, they can FTP in the case of motion detected after certain hours, but I only wish to react if there are more than two cameras active within a short time period. It seems a custom plugin is the way to go here ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From t.h.amundsen at usit.uio.no Fri Jan 15 11:10:47 2010 From: t.h.amundsen at usit.uio.no (Trond Hasle Amundsen) Date: Fri, 15 Jan 2010 11:10:47 +0100 Subject: Warnings/errors with check_openmanage plugin In-Reply-To: <5FF760C88D854D76A18FB48021D45CF0@NELSON3> (Gianluca Varenni's message of "Thu, 14 Jan 2010 10:30:49 -0800") References: <15tska9zk2s.fsf@tux.uio.no> <5748BF419D664EF984BDBEE32111E1EB@NELSON3> <15t3a29x7me.fsf@tux.uio.no> <5FF760C88D854D76A18FB48021D45CF0@NELSON3> Message-ID: <15t6373ptjs.fsf@tux.uio.no> "Gianluca Varenni" writes: > I spoke too soon. The new build works ok on my T610 server, but it generates > an internal error on a 750 server > > root at tinman:/home/nagios/plugins# ./check_openmanage -H homer -t 180 -d > System: PowerEdge 750 > ServiceTag: XXXXXXX OMSA version: 5.0.0 > BIOS/date: XXXXXXXXXXXXXX Plugin version: 3.5.5-beta1 You're using OMSA 5.0.0 on that server, which is very old. The plugin is designed to work with OMSA 5.3 and later. Can you try upgrading OMSA to version 5.5.0? Cheers, -- Trond H. Amundsen Center for Information Technology Services, University of Oslo ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nigel.leach at uk.bnpparibas.com Fri Jan 15 11:38:53 2010 From: nigel.leach at uk.bnpparibas.com (nigel.leach at uk.bnpparibas.com) Date: Fri, 15 Jan 2010 10:38:53 +0000 Subject: NRPE Parameter Problems Message-ID: I'm having problem with NRPE parameters while testing a new Nagios installation. Header Node is running RHEL AS4 U6, and Nagios v3.2.0 Client is running RHEL AS4 U4, and NRPE v2.12 (compiled with command arguments, and with dont_blame_nrpe=1) Standard hardcoded nrpe disk checks work fine, for example; Service definition on Header Node - check_nrpe!check_home Relates to the following in nrpe.cfg on the Client - command[check_home]=/usr/local/nagios/libexec/check_disk -w 20% -c 10% -p /home and returns - DISK OK - free space: /home 1878 MB (98% inode=99%);| /home=34MB;1612;1813;0;2015 Using variables, I can get the correct results from the command line; Service definition on Header Node - check_nrpe!check_adisk!20%!10%!/opt Relates to the following in nrpe.cfg on the Client - command[check_adisk]=/usr/local/nagios/libexec/check_disk -w $ARG1$ -c $ARG2$ -p $ARG3$ and I can enter this on the Header Node - #../../libexec/check_nrpe -H -c check_adisk -a 20 10 /opt DISK OK - free space: /opt 432 MB (45% inode=97%);| /opt=524MB;987;997;0;1007 However, this does not work within Nagios GUI, all the parameters passed are blank. I see this in syslog on the Client - Added command[check_adisk]=/usr/local/nagios/libexec/check_disk-w $ARG1$ -c $ARG2$ -p $ARG3$ Handling the connection... Host is asking for command \'check_adisk\' to be run... >>>>>Running command: /usr/local/nagios/libexec/check_disk -w -c -p Command completed with return code 3 and output: Unknown argument Usage: check_disk -w limit -c limit [-W limit] [-K limit] {-p path | -x device} [-C] [-E] [-e] [-g group ] [-k] [-l] [-M] [-m] [-R path ] [-r path ] [-t timeout] [-u unit] [-v] [-X type] Return Code: 3, Output: Unknown argument Usage: check_disk -w limit -c limit [-W limit] [-K limit] {-p path | -x device} [-C] [-E] [-e] [-g group ] [-k] [-l] [-M] [-m] [-R path ] [-r path ] [-t timeout] [-u unit] [-v] [-X type] I'm conscious that I'm probably doing something very trivial wrong, but can't get my head around why this should work on the command line, and not via the GUI. Any help much appreciated. Regards Nigel Nigel Leach Grid Engineer | Front Office Risk Systems | BNP Paribas -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ___________________________________________________________ This communication is confidential, may be privileged and is meant only for the intended recipient. If you are not the intended recipient, please notify the sender by reply and delete the message from your system. Any unauthorised dissemination, distribution or copying hereof is prohibited. BNP Paribas Trust Corporation UK Limited, BNP Paribas UK Limited, BNP Paribas Commodity Futures Limited, BNP Paribas Asset Management UK Limited and Investment Fund Services Limited are authorised and regulated by the Financial Services Authority. BNP Paribas London Branch and BNP Paribas Wealth Management London Branch are authorised by the CECEI and supervised by the Commission Bancaire. BNP Paribas London Branch is authorised and subject to limited regulation by the Financial Services Authority. Details about the extent of our authorisation and regulation by the Financial Services Authority are available from us on request. BNP Paribas is also a member of the London Stock Exchange. BNP Paribas Wealth Management London Branch is subject to limited regulation by the Financial Services Authority. Details about the extent of our authorisation and regulation by the Financial Services Authority are available from us on request. BNP Paribas Securities Services London Branch is authorised by the CECEI and supervised by the AMF, and subject to limited regulation by the Financial Services Authority. Details on the extent of our regulation by the Financial Services Authority are available from us on request. BNP Paribas Securities Services is also a member of the London Stock Exchange. BNP Paribas Trust Corporation UK Limited is registered in England and Wales (registered no. 4042668) at registered office 55 Moorgate, London EC2R 6PA. BNP Paribas UK Limited is registered in England and Wales (registered no. 1488108) at registered office 10 Harewood Avenue, London NW1 6AA. BNP Paribas Commodity Futures Limited is registered in England and Wales (registered no. 2391477) at registered office 10 Harewood Avenue, London NW1 6AA. BNP Paribas Asset Management UK Limited is registered in England and Wales (registered no. 2474627) at registered office 10 Harewood Avenue, London NW1 6AA. Investment Fund Services Limited is registered in England and Wales (registered no. 6110770) at registered office 55 Moorgate, London EC2R 6PA. BNP Paribas London Branch is registered in England and Wales (registered no. FC13447) at registered office 10 Harewood Avenue, London NW1 6AA. BNP Paribas Wealth Management London Branch is registered in England and Wales (registered no. FC023926) at registered office 10 Harewood Avenue, London NW1 6AA. BNP Paribas Securities Services London Branch is registered in England and Wales (registered no. BR006393) at registered office 55 Moorgate, London, EC2R 6PA. BNP Paribas Lease Group Plc is registered in England and Wales (registered no. 2341989) at registered office Northern Cross, Basing View, Basingstoke, Hampshire RG21 4HL. ____________________________________________________________ BNP Paribas is proud to support the Royal Academy of Arts in their presentation of ?Wild Thing: Epstein, Gaudier-Brzeska, Gill? in the Sackler Wing of Galleries from 24 October 2009 to 24 January 2010. -------------- next part -------------- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From shacky83 at gmail.com Fri Jan 15 11:44:52 2010 From: shacky83 at gmail.com (shacky) Date: Fri, 15 Jan 2010 11:44:52 +0100 Subject: Checking the time taken by a webpage to load Message-ID: <7fedbc911001150244h44688754lf71fdbc33d326b39@mail.gmail.com> Hi. I'm thinking about a way to monitor the loading time of a webpage with Nagios. Do you have any idea or any hint? Thank you very much! Bye. ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Fri Jan 15 11:44:25 2010 From: nagios at flatto.net (Assaf Flatto) Date: Fri, 15 Jan 2010 10:44:25 +0000 Subject: NRPE Parameter Problems In-Reply-To: References: Message-ID: <4B504709.4070907@flatto.net> nigel.leach at uk.bnpparibas.com wrote: > > > I'm having problem with NRPE parameters while testing a new Nagios > installation. > > Header Node is running RHEL AS4 U6, and Nagios v3.2.0 > > Client is running RHEL AS4 U4, and NRPE v2.12 (compiled with command > arguments, and with dont_blame_nrpe=1) > > Standard hardcoded nrpe disk checks work fine, for example; > > Service definition on Header Node - > > check_nrpe!check_home > > Relates to the following in nrpe.cfg on the Client - > > command[check_home]=/usr/local/nagios/libexec/check_disk -w > 20% -c 10% -p /home > > and returns - > > DISK OK - free space: /home 1878 MB (98% inode=99%);| > /home=34MB;1612;1813;0;2015 > > Using variables, I can get the correct results from the command line; > > Service definition on Header Node - > > check_nrpe!check_adisk!20%!10%!/opt > > Relates to the following in nrpe.cfg on the Client - > > command[check_adisk]=/usr/local/nagios/libexec/check_disk -w > $ARG1$ -c $ARG2$ -p $ARG3$ > > and I can enter this on the Header Node - > > #../../libexec/check_nrpe -H -c check_adisk -a 20 10 > /opt > DISK OK - free space: /opt 432 MB (45% inode=97%);| > /opt=524MB;987;997;0;1007 > > However, this does not work within Nagios GUI, all the parameters > passed are blank. I see this in syslog on the Client - > > Added > command[check_adisk]=/usr/local/nagios/libexec/check_disk-w $ARG1$ -c > $ARG2$ -p $ARG3$ > Handling the connection... > Host is asking for command \'check_adisk\' to be run... > >>>>>Running command: /usr/local/nagios/libexec/check_disk -w > -c -p > Command completed with return code 3 and output: Unknown > argument Usage: check_disk -w limit -c limit [-W limit] [-K limit] {-p > path | -x device} [-C] [-E] [-e] [-g group ] [-k] [-l] [-M] [-m] [-R > path ] [-r path ] [-t timeout] [-u unit] [-v] [-X type] > Return Code: 3, Output: Unknown argument Usage: check_disk -w > limit -c limit [-W limit] [-K limit] {-p path | -x device} [-C] [-E] > [-e] [-g group ] [-k] [-l] [-M] [-m] [-R path ] [-r path ] [-t > timeout] [-u unit] [-v] [-X type] > > I'm conscious that I'm probably doing something very trivial wrong, > but can't get my head around why this should work on the command line, > and not via the GUI. > > Any help much appreciated. > > Regards > Nigel > > Nigel Leach > Grid Engineer | Front Office Risk Systems | BNP Paribas > ______________________________________________ > I'd recommend to enable the nagios debug log and see how nagios is extrapolating the command for execution . Once you do that and see how it is sent you can make sure to fix the command/service to provide the required results. the debug file definitions are at the bottom of the nagios.cfg file. Assaf ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Fri Jan 15 11:50:31 2010 From: nagios at flatto.net (Assaf Flatto) Date: Fri, 15 Jan 2010 10:50:31 +0000 Subject: Checking the time taken by a webpage to load In-Reply-To: <7fedbc911001150244h44688754lf71fdbc33d326b39@mail.gmail.com> References: <7fedbc911001150244h44688754lf71fdbc33d326b39@mail.gmail.com> Message-ID: <4B504877.3070904@flatto.net> shacky wrote: > Hi. > > I'm thinking about a way to monitor the loading time of a webpage with Nagios. > > Do you have any idea or any hint? > > Thank you very much! > Bye. > > Do you mean the amount of time the http test is getting when requesting a page ? ./check_http -H www.google.com HTTP OK: HTTP/1.1 302 Found - 832 bytes in* 0.069 second response time |*time=0.069112s;;;0.000000 size=832B;;;0 i think this is covered in the check_http plugin -w, --warning=DOUBLE Response time to result in warning status (seconds) -c, --critical=DOUBLE Response time to result in critical status (seconds) -t, --timeout=INTEGER Seconds before connection times out (default: 10) Assaf ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From binbin.b.wang at hotmail.com Fri Jan 15 12:02:23 2010 From: binbin.b.wang at hotmail.com (Binbin Wang) Date: Fri, 15 Jan 2010 19:02:23 +0800 Subject: Nagios Capacity Message-ID: Hi all I have one question about nagios capacity, According to my investigation, nagios server's capacity is about 3000 items. For example, 1000 hosts are defined on nagios servers, and every host offer 3 services monitoring, so total 3000 items are on one nagios server. If item quantity is beyond 3000, nagios performance will dramatically decline. Could someone give me the offical quantity about nagios capacity? thanks a lot. Wang Binbin _________________________________________________________________ Windows Live: Make it easier for your friends to see what you?re up to on Facebook. http://www.microsoft.com/middleeast/windows/windowslive/see-it-in-action/social-network-basics.aspx?ocid=PID23461::T:WLMTAGL:ON:WL:en-xm:SI_SB_2:092009 -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From cosmin.neagu at omnilogic.ro Fri Jan 15 12:18:34 2010 From: cosmin.neagu at omnilogic.ro (Cosmin Neagu) Date: Fri, 15 Jan 2010 13:18:34 +0200 Subject: Beginer questions about check_http In-Reply-To: <4B4F4C29.2050100@flatto.net> References: <4B4F3BF2.9030901@omnilogic.ro> <4B4F4C29.2050100@flatto.net> Message-ID: <4B504F0A.7080102@omnilogic.ro> Thanks allot, it make sens that way :) Cosmin Neagu NOC Team Leader Str. I. G. Duca nr 36 Otopeni, Judetul Ilfov, 075100 Romania Tel: 021 303 3159 / 0732 669 193 www.omnilogic.ro Assaf Flatto wrote: > Cosmin Neagu wrote: > >> Hello everybody, >> >> I'm using check_http to monitor a wab server where we have more than 3 >> web address. Like: >> www.example1.com >> www.example2.com >> www.example3.com...are all on the same server. >> >> >> Is there a way to check the status for each site? I'm a little >> confused from reding the help of check_http, i have tried like: >> libexec/check_http -H 194.81.149.12 -p 80 --url=/www.example1.com >> libexec/check_http -H 194.81.149.12 -p 80 -P www.example1.com >> libexec/check_http -H 194.81.149.12 -p 80 -s www.example1.com >> But the output is not what i'm expecting. >> >> Can someone tell me if this is even posible? >> >> I think it should be something like this: >> cosmin at cacti-1:/usr/local/nagios> libexec/check_http -H 194.81.149.12 >> -p 80 *-u www.example1.com* >> HTTP WARNING: HTTP/1.1 400 Bad Request >> >> But it gives me bad request, and i don't know why. >> PS: of course that all the testing is done with the real web site and >> the correct ip address, not exemple1.com. >> >> > The best way to do that is to query the websites with the FQDN in the > host address : > > libexec/check_http -H www.example.com -p 80 > > that will resolve and send the query to the server with the URL and will > trigger the vhost reply on the target host . > > Assaf > > > > > ------------------------------------------------------------------------------ > Throughout its 18-year history, RSA Conference consistently attracts the > world's best and brightest in the field, creating opportunities for Conference > attendees to learn about information security's most important issues through > interactions with peers, luminaries and emerging and established companies. > http://p.sf.net/sfu/rsaconf-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Fri Jan 15 15:12:08 2010 From: marc at ena.com (Marc Powell) Date: Fri, 15 Jan 2010 08:12:08 -0600 Subject: NRPE Parameter Problems In-Reply-To: <4B504709.4070907@flatto.net> References: <4B504709.4070907@flatto.net> Message-ID: hmm. I don't see the original that Assaf is reponding to but.... comment inline below for the OP -- > nigel.leach at uk.bnpparibas.com wrote: >> Using variables, I can get the correct results from the command line; >> >> Service definition on Header Node - >> >> check_nrpe!check_adisk!20%!10%!/opt >> >> Relates to the following in nrpe.cfg on the Client - >> >> command[check_adisk]=/usr/local/nagios/libexec/check_disk -w >> $ARG1$ -c $ARG2$ -p $ARG3$ But is the check_nrpe command{} definition actually configured to pass the $ARG2$, $ARG3$ and $ARG4$ macros from the service definition above to the remote host? It doesn't look like it is based on your results below. Post your check_nrpe command definition if you are unsure. >> However, this does not work within Nagios GUI, all the parameters >> passed are blank. I see this in syslog on the Client - >> >> Added >> command[check_adisk]=/usr/local/nagios/libexec/check_disk-w $ARG1$ -c >> $ARG2$ -p $ARG3$ >> Handling the connection... >> Host is asking for command \'check_adisk\' to be run... >>>>>>> Running command: /usr/local/nagios/libexec/check_disk -w >> -c -p >> Command completed with return code 3 and output: Unknown >> argument Usage: check_disk -w limit -c limit [-W limit] [-K limit] {-p >> path | -x device} [-C] [-E] [-e] [-g group ] [-k] [-l] [-M] [-m] [-R >> path ] [-r path ] [-t timeout] [-u unit] [-v] [-X type] >> Return Code: 3, Output: Unknown argument Usage: check_disk -w >> limit -c limit [-W limit] [-K limit] {-p path | -x device} [-C] [-E] >> [-e] [-g group ] [-k] [-l] [-M] [-m] [-R path ] [-r path ] [-t >> timeout] [-u unit] [-v] [-X type] -- Marc ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Fri Jan 15 15:15:57 2010 From: nagios at flatto.net (Assaf Flatto) Date: Fri, 15 Jan 2010 14:15:57 +0000 Subject: Parent /child issues Message-ID: <4B50789D.8050900@flatto.net> Hello I have come up with an issue I have not encountered in previous versions of nagios ( the issue is on 3.20 from source on SLES 10.2 64 bits ). I am trying to map a parent child relationships that match my physical network ( host connected to switch), and I am getting this error SWITCH is not a valid parent for host 'HOST'! Both HOST and SWITCH are on the same logical network ( 10.0.8.x ) , and from the manual it implies that since they are on the same subnet the definition i with to present is no longer valid in nagios (although it used to be in older versions). Is there a way for me to get the setting I need ? Thanks Assaf ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Fri Jan 15 15:23:42 2010 From: marc at ena.com (Marc Powell) Date: Fri, 15 Jan 2010 08:23:42 -0600 Subject: Nagios Capacity In-Reply-To: References: Message-ID: <968ECDD4-A8C6-4112-8884-875895BF47FC@ena.com> On Jan 15, 2010, at 5:02 AM, Binbin Wang wrote: > Hi all > > I have one question about nagios capacity, According to my investigation, nagios server's capacity is about 3000 items. For example, 1000 hosts are defined on nagios servers, and every host offer 3 services monitoring, so total 3000 items are on one nagios server. If item quantity is beyond 3000, nagios performance will dramatically decline. Could someone give me the offical quantity about nagios capacity? thanks a lot. There's nothing official or even unofficial as there are too many variable involved. Some of them are the type of hardware you're running it on, the types of checks you are doing, the plugins you use and how frequently you check. There's no formula for determining max capacity. It's an art more than a science. The best you can expect is reports about what others have been able to do. It's been discussed before here but it can be difficult to search for... 3000 is conservative and an easy number on modern hardware with mixed checks running every 5 minutes. I have non-dedicated machines doing that easily on older hardware. I expect that I can easily double, triple or more those checks. There are people who have reported running 10,000+ checks on a single machine (or even more than that). -- Marc ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gabriel at impactteachers.com Fri Jan 15 15:32:26 2010 From: gabriel at impactteachers.com (Mr Gabriel) Date: Fri, 15 Jan 2010 14:32:26 +0000 Subject: Nagios Alert emails -- Can we mod them? Message-ID: <4B507C7A.8000105@impactteachers.com> On my nagios alerts, can I include the web URL of the nagios instance they came from? That way, from within the email notification, I can click a link straight to either the nagios home page, or the nagios acknowledgment page :) ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Fri Jan 15 15:34:06 2010 From: marc at ena.com (Marc Powell) Date: Fri, 15 Jan 2010 08:34:06 -0600 Subject: Parent /child issues In-Reply-To: <4B50789D.8050900@flatto.net> References: <4B50789D.8050900@flatto.net> Message-ID: <0D92BA61-16EA-4B87-97A7-30858F8D89C1@ena.com> On Jan 15, 2010, at 8:15 AM, Assaf Flatto wrote: > Hello > I have come up with an issue I have not encountered in previous versions > of nagios ( the issue is on 3.20 from source on SLES 10.2 64 bits ). > > I am trying to map a parent child relationships that match my physical > network ( host connected to switch), and I am getting this error SWITCH > is not a valid parent for host 'HOST'! hm. The code bit that generates this error is -- /* check all parent parent host */ for(temp_hostsmember=temp_host->parent_hosts;temp_hostsmember!=NULL;temp_hostsmember=temp_hostsmember->next){ if((temp_host2=find_host(temp_hostsmember->host_name))==NULL){ logit(NSLOG_VERIFICATION_ERROR,TRUE,"Error: '%s' is not a valid parent for host '%s'!",temp_hostsmember->host_name,temp_host->name); I read this to indicate that the parents name you've specified (SWITCH) doesn't have a valid host{} definition (i.e. doesn't exist). I could very easily be reading this wrong though. > Both HOST and SWITCH are on the same logical network ( 10.0.8.x ) , and > from the manual it implies that since they are on the same subnet the > definition i with to present is no longer valid in nagios (although it > used to be in older versions). Not sure what you're reading here (reference?) but Nagios has no understanding of addressing or subnets. It wouldn't know or care if they were on the same subnet. It must be something else. -- Marc ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Fri Jan 15 15:41:05 2010 From: nagios at flatto.net (Assaf Flatto) Date: Fri, 15 Jan 2010 14:41:05 +0000 Subject: Parent /child issues In-Reply-To: <0D92BA61-16EA-4B87-97A7-30858F8D89C1@ena.com> References: <4B50789D.8050900@flatto.net> <0D92BA61-16EA-4B87-97A7-30858F8D89C1@ena.com> Message-ID: <4B507E81.9080303@flatto.net> Marc Powell wrote: > On Jan 15, 2010, at 8:15 AM, Assaf Flatto wrote: > > >> Hello >> I have come up with an issue I have not encountered in previous versions >> of nagios ( the issue is on 3.20 from source on SLES 10.2 64 bits ). >> >> I am trying to map a parent child relationships that match my physical >> network ( host connected to switch), and I am getting this error SWITCH >> is not a valid parent for host 'HOST'! >> > > hm. The code bit that generates this error is -- > > /* check all parent parent host */ > for(temp_hostsmember=temp_host->parent_hosts;temp_hostsmember!=NULL;temp_hostsmember=temp_hostsmember->next){ > > if((temp_host2=find_host(temp_hostsmember->host_name))==NULL){ > logit(NSLOG_VERIFICATION_ERROR,TRUE,"Error: '%s' is not a valid parent for host '%s'!",temp_hostsmember->host_name,temp_host->name); > > > I read this to indicate that the parents name you've specified (SWITCH) doesn't have a valid host{} definition (i.e. doesn't exist). I could very easily be reading this wrong though. > > >> Both HOST and SWITCH are on the same logical network ( 10.0.8.x ) , and >> from the manual it implies that since they are on the same subnet the >> definition i with to present is no longer valid in nagios (although it >> used to be in older versions). >> > > Not sure what you're reading here (reference?) but Nagios has no understanding of addressing or subnets. It wouldn't know or care if they were on the same subnet. It must be something else. > > -- > Marc Sorry , I was an idiot . I move the directory that contained my network objects to a new location and forgot to update the nagios.cfg . Sorry for this . Assaf ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From binbin.b.wang at hotmail.com Fri Jan 15 15:44:11 2010 From: binbin.b.wang at hotmail.com (Binbin Wang) Date: Fri, 15 Jan 2010 22:44:11 +0800 Subject: Nagios Capacity In-Reply-To: <968ECDD4-A8C6-4112-8884-875895BF47FC@ena.com> References: , <968ECDD4-A8C6-4112-8884-875895BF47FC@ena.com> Message-ID: Thank you for your reply, I would like to give you some detail of our monitoring environment. I configure nagnios on an Virtual Machine based on 2x4Core CPU, 24G Memory physical machine, which runs 4 VMs. Service quantity is 3814 and host quantity is 406. The service check latency is unstable, always changed. Does any configuration error cause that problem? or Virtual Machine capacity result it? Thanks a lot. Monitoring Performance Service Check Execution Time: 0.03 / 71.65 / 0.582 sec Service Check Latency: 0.00 / 7.60 / 1.140 sec Host Check Execution Time: 0.00 / 0.00 / 0.000 sec Host Check Latency: 0.00 / 0.00 / 0.000 sec # Active Host / Service Checks: 406 / 3814 # Passive Host / Service Checks: 0 / 0 Monitoring Performance Service Check Execution Time: 0.00 / 71.65 / 0.604 sec Service Check Latency: 0.19 / 178.28 / 86.398 sec Host Check Execution Time: 0.00 / 0.00 / 0.000 sec Host Check Latency: 0.00 / 0.00 / 0.000 sec # Active Host / Service Checks: 406 / 3814 # Passive Host / Service Checks: 0 / 0 > From: marc at ena.com > Date: Fri, 15 Jan 2010 08:23:42 -0600 > To: nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] Nagios Capacity > > > On Jan 15, 2010, at 5:02 AM, Binbin Wang wrote: > > > Hi all > > > > I have one question about nagios capacity, According to my investigation, nagios server's capacity is about 3000 items. For example, 1000 hosts are defined on nagios servers, and every host offer 3 services monitoring, so total 3000 items are on one nagios server. If item quantity is beyond 3000, nagios performance will dramatically decline. Could someone give me the offical quantity about nagios capacity? thanks a lot. > > > There's nothing official or even unofficial as there are too many variable involved. Some of them are the type of hardware you're running it on, the types of checks you are doing, the plugins you use and how frequently you check. There's no formula for determining max capacity. It's an art more than a science. The best you can expect is reports about what others have been able to do. It's been discussed before here but it can be difficult to search for... > > 3000 is conservative and an easy number on modern hardware with mixed checks running every 5 minutes. I have non-dedicated machines doing that easily on older hardware. I expect that I can easily double, triple or more those checks. There are people who have reported running 10,000+ checks on a single machine (or even more than that). > > -- > Marc > > > ------------------------------------------------------------------------------ > Throughout its 18-year history, RSA Conference consistently attracts the > world's best and brightest in the field, creating opportunities for Conference > attendees to learn about information security's most important issues through > interactions with peers, luminaries and emerging and established companies. > http://p.sf.net/sfu/rsaconf-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null _________________________________________________________________ Windows Live: Keep your friends up to date with what you do online. http://www.microsoft.com/middleeast/windows/windowslive/see-it-in-action/social-network-basics.aspx?ocid=PID23461::T:WLMTAGL:ON:WL:en-xm:SI_SB_1:092010 -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nigel.leach at uk.bnpparibas.com Fri Jan 15 15:49:54 2010 From: nigel.leach at uk.bnpparibas.com (NIgel Leach) Date: Fri, 15 Jan 2010 14:49:54 +0000 (UTC) Subject: NRPE Parameter Problems References: <4B504709.4070907@flatto.net> Message-ID: Marc Powell ena.com> writes: > > hmm. I don't see the original that Assaf is reponding to but.... comment inline below for the OP -- > > > nigel.leach uk.bnpparibas.com wrote: > > >> Using variables, I can get the correct results from the command line; > >> > >> command[check_adisk]=/usr/local/nagios/libexec/check_disk -w > >> $ARG1$ -c $ARG2$ -p $ARG3$ > > But is the check_nrpe command{} definition actually configured to pass the $ARG2$, $ARG3$ and $ARG4$ > macros from the service definition above to the remote host? It doesn't look like it is based on your > results below. Post your check_nrpe command definition if you are unsure. > > >> However, this does not work within Nagios GUI, all the parameters > >> passed are blank. I see this in syslog on the Client - > > -- > Marc > many thanks Marc, that was exactly my problem. # diff commands.cfg commands.cfg.orig 23c23 < command_line $USER1$/check_nrpe -H $HOSTADDRESS$ -c $ARG1$ -a $ARG2$ $ARG3$... --- > command_line $USER1$/check_nrpe -H $HOSTADDRESS$ -c $ARG1$ this was my first post to the list, so it was very encouraging to get such a quick, and useful, reply. cheers, Nigel ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Fri Jan 15 16:15:26 2010 From: marc at ena.com (Marc Powell) Date: Fri, 15 Jan 2010 09:15:26 -0600 Subject: Nagios Alert emails -- Can we mod them? In-Reply-To: <4B507C7A.8000105@impactteachers.com> References: <4B507C7A.8000105@impactteachers.com> Message-ID: On Jan 15, 2010, at 8:32 AM, Mr Gabriel wrote: > On my nagios alerts, can I include the web URL of the nagios instance > they came from? That way, from within the email notification, I can > click a link straight to either the nagios home page, or the nagios > acknowledgment page :) The alerts are 100% customizable. Just look at the command{} definition for your notification commands. It should be fairly obvious what needs to be changed as they're just typical shell commands... -- Marc ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Fri Jan 15 16:32:04 2010 From: marc at ena.com (Marc Powell) Date: Fri, 15 Jan 2010 09:32:04 -0600 Subject: Nagios Capacity In-Reply-To: References: , <968ECDD4-A8C6-4112-8884-875895BF47FC@ena.com> Message-ID: On Jan 15, 2010, at 8:44 AM, Binbin Wang wrote: > Thank you for your reply, I would like to give you some detail of our monitoring environment. I configure nagnios on an Virtual Machine based on 2x4Core CPU, 24G Memory physical machine, which runs 4 VMs. Service quantity is 3814 and host quantity is 406. The service check latency is unstable, always changed. Does any configuration error cause that problem? or Virtual Machine capacity result it? > Thanks a lot. I can't speak to the problems that might occur in a virtual environment as I don't have any experience there but I expect that clock issues might lead to strange timing issues within nagios, especially if the host machine is heavily loaded.... Other than that, I'd suggest you look at Tips 2-7 of http://nagios.sourceforge.net/docs/3_0/tuning.html as a start. -- Marc ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Fri Jan 15 16:35:20 2010 From: marc at ena.com (Marc Powell) Date: Fri, 15 Jan 2010 09:35:20 -0600 Subject: NRPE Parameter Problems In-Reply-To: References: <4B504709.4070907@flatto.net> Message-ID: <212A1E7B-EB4D-4DB5-A6E4-133F54A45084@ena.com> On Jan 15, 2010, at 8:49 AM, NIgel Leach wrote: > this was my first post to the list, so it was very encouraging to get such a > quick, and useful, reply. Thank *you* for taking the time to understand and work with such a vague-ish answer... -- Marc ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From andrew.ford at wachovia.com Fri Jan 15 16:54:11 2010 From: andrew.ford at wachovia.com (Ford, Andy) Date: Fri, 15 Jan 2010 10:54:11 -0500 Subject: send_nsca 64 bit problem Message-ID: <323336996ADE534FB38549B446AC5C1A42C4DC9DF0@MEIS1MSX005001.noam.msds.wachovia.net> I'm trying to set up a collector and the nsca daemon on the central server is rejecting my packets: nsca[10902]: Dropping packet with invalid CRC32 - possibly due to client using wrong password or crypto algorithm? Obviously, I've triple-checked the password and algorithm settings, so that's not the problem. I'm suspecting that since the collector is running a 64 bit version of SLES (10.2) and the central server is 32 bit SLES (10.2), that somehow the CRC is being generated differently between the two. Another bit of evidence pointing to the suspected mismatch is that the collector in question can do send_nsca successfully to another 64 bit system. Anyone else run into this or can think of a workaround to try? Andy Ford Network Security Compliance & Automation, Wells Fargo & Co. Office: 314-955-6647 Cell (preferred): 314-600-7025 andrew.ford at wachovia.com -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From harald.boehmecke at bertelsmann.de Fri Jan 15 16:19:55 2010 From: harald.boehmecke at bertelsmann.de (Harald =?ISO-8859-1?Q?B=F6hmecke?=) Date: Fri, 15 Jan 2010 16:19:55 +0100 Subject: Squid SARG Reports Monitoring Message-ID: <1263568795.5500.36.camel@BDI12718> Hi, I need to monitor the Squid Reports that SARG creates. Basically, I need to know if new reports were created each day. I have no idea where I should start... any ideas? Regards, Harald -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From chris at westnet.com Fri Jan 15 17:15:52 2010 From: chris at westnet.com (Christopher X. Candreva) Date: Fri, 15 Jan 2010 11:15:52 -0500 (EST) Subject: How to define ATM circuit with PVCs Message-ID: I am using Nagios to monitor an ATM circuit that has multiple PVCs, some with a network behind them. Initially, I set up our router as a host, the ATM interface as a service, and the PVCs as services dependent on the ATM interface. The problem with this is that there are hosts on the other end of those PVCs, and there is no way to make those 'hosts' dependent on a service -- in this case the ATM line itself. What I ended up doing was making the ATM circuit it's own host, so that if that circuit is down, I don't get notified of everything on the other end of it that is down. Is this "correct", or is there a better way to do it ? ========================================================== Chris Candreva -- chris at westnet.com -- (914) 948-3162 WestNet Internet Services of Westchester http://www.westnet.com/ ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From akpgeek at gmail.com Fri Jan 15 17:42:49 2010 From: akpgeek at gmail.com (akp geek) Date: Fri, 15 Jan 2010 11:42:49 -0500 Subject: NRPE - issue Message-ID: <2024a9fb1001150842w5db2bf92q1190e6de1871a863@mail.gmail.com> Dear all - I have installed nagios and also nrpe to monitor services on a different server. it is working fine. Some times I get emails that CHECK_NRPE: Received 0 bytes from daemon. Check the remote server logs for error messages. and immediately after some time, I will get the recovery message also. Can you please help me why I am getting messages like this? Regards -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gabriel at impactteachers.com Fri Jan 15 18:11:57 2010 From: gabriel at impactteachers.com (Mr Gabriel) Date: Fri, 15 Jan 2010 17:11:57 +0000 Subject: NRPE - issue In-Reply-To: <2024a9fb1001150842w5db2bf92q1190e6de1871a863@mail.gmail.com> References: <2024a9fb1001150842w5db2bf92q1190e6de1871a863@mail.gmail.com> Message-ID: <4B50A1DD.8000904@impactteachers.com> On 15/01/2010 16:42, akp geek wrote: > Dear all - > I have installed nagios and also nrpe to monitor services > on a different server. it is working fine. Some times I get emails that > CHECK_NRPE: Received 0 bytes from daemon. Check the remote server logs for error messages. and immediately after some time, I will get the recovery message also. Can you please help me why I am getting messages like this? > > Regards > What type of load do you have on this server being monitored? And also, check your SSL options, make sure nrpe and nagios both are compiled with ssl. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From richard.gliebe at fhv.at Fri Jan 15 18:39:31 2010 From: richard.gliebe at fhv.at (Richard Gliebe) Date: Fri, 15 Jan 2010 18:39:31 +0100 Subject: Cisco Aironet 1200 AccessPoints - SOLUTION In-Reply-To: References: <4B3C55D3.1080808@fhv.at> <6db4a4200912310802g452f6249t7b8626fbc5529bea@mail.gmail.com> <4B45A80C.9010909@fhv.at> Message-ID: <4B50A853.8040406@fhv.at> On 1/7/10 3:04 PM Marc Powell wrote: Hi all, > Yes. You'll want to be cognizant of counter re-sets, > counter roll-overs and the fact that the OID index for > that specific interface may change on reboot so that > you don't get strange results. You'll need to read current value > compare with saved previous value/timestamp, save current > value/timestamp, repeat at next run. You could also look > for an existing plugin that suits your needs instead. > You're not the first to want to do this. look for OK, I wrote my own scripts the check the In- and Output traffics from the CISCO interfaces rad0 and fa0 and User connections to the Cisco Aironet 1200 APs. Traffic crosschecks and graphs will be done by MRTG ;-) If someone needs the scripts, just send an email to me. Thanks for your hints. Richard ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gianluca.varenni at gmail.com Sat Jan 16 01:20:30 2010 From: gianluca.varenni at gmail.com (Gianluca Varenni) Date: Fri, 15 Jan 2010 16:20:30 -0800 Subject: Warnings/errors with check_openmanage plugin In-Reply-To: <15t6373ptjs.fsf@tux.uio.no> References: <15tska9zk2s.fsf@tux.uio.no><5748BF419D664EF984BDBEE32111E1EB@NELSON3><15t3a29x7me.fsf@tux.uio.no><5FF760C88D854D76A18FB48021D45CF0@NELSON3> <15t6373ptjs.fsf@tux.uio.no> Message-ID: <4332B0BEFA314B68BF2B300299D11039@NELSON3> I updated it to 6.2, now I get this root at tinman:/home/nagios/plugins# ./check_openmanage -H homer -d System: PowerEdge 750 ServiceTag: XXXXXXX OMSA version: 6.2.0 BIOS/date: A04 02/16/2005 Plugin version: 3.5.5-beta1 ----------------------------------------------------------------------------- Storage Components ============================================================================= STATE | ID | MESSAGE TEXT ---------+----------+-------------------------------------------------------- WARNING | 0 | Controller 0 [CERC SATA 1.5/6ch]: Driver '4.1.0.7010' is out of date WARNING | 0 | Controller 0 [CERC SATA 1.5/6ch]: Storport driver '5.2.3790.3959' is out of date OK | 0 | Controller 0 [CERC SATA 1.5/6ch] is Degraded OK | 0:0:0 | Physical Disk 0:0 [250GB] on ctrl -1 is Online OK | 0:0:1 | Physical Disk 0:1 [250GB] on ctrl -1 is Online OK | 0:0 | Logical drive 0 'Windows Disk 0' [RAID-1, 232.80 GB] on ctrl 0 is Ready OK | 0:0 | Connector 0 [SATA] on controller 0 is Ready ----------------------------------------------------------------------------- Chassis Components ============================================================================= STATE | ID | MESSAGE TEXT ---------+------+------------------------------------------------------------ OK | 1 | Memory module 1 [DIMM1_A, 512 MB] is Ok OK | 2 | Memory module 2 [DIMM2_A, 512 MB] is Ok OK | 3 | Memory module 3 [DIMM1_B, 512 MB] is Ok OK | 4 | Memory module 4 [DIMM2_B, 512 MB] is Ok OK | 1 | Chassis fan 1 [System Blower]: 3479 OK | 2 | Chassis fan 2 [System Memory Fan]: 8881 OK | 3 | Chassis fan 3 [System PCI Fan]: 6308 OK | 0 | Temperature Probe 0 [System Temp] reads 24 C (min=10/5, max=45/50) OK | 1 | Temperature Probe 1 [CPU Temp] reads 35 C (min=10/5, max=80/85) OK | 0 | Processor 0 [Intel Pentium 4 3.00GHz] is Enabled OK | 0 | Voltage sensor 0 [CPU Voltage] is 1.345 V OK | 1 | Voltage sensor 1 [Battery Voltage] is 3.136 V OK | 2 | Voltage sensor 2 [2.6V MB] is 2.597 V OK | 3 | Voltage sensor 3 [VTT Voltage] is 1.298 V OK | 4 | Voltage sensor 4 [3.3V MB] is 3.356 V OK | 5 | Voltage sensor 5 [3.3V AUX] is 3.373 V OK | 6 | Voltage sensor 6 [5V MB] is 4.992 V OK | 7 | Voltage sensor 7 [12V MB] is 12.250 V OK | 8 | Voltage sensor 8 [5V AUX] is 5.031 V OK | 9 | Voltage sensor 9 [2.5V VGA] is 2.503 V OK | 10 | Voltage sensor 10 [1.5V MB] is 1.485 V OK | 11 | Voltage sensor 11 [1.2V LAN1] is 1.225 V OK | 12 | Voltage sensor 12 [1.2V LAN2] is 1.216 V OK | 0 | Chassis intrusion 0 detection: Ok (Not Breached) ----------------------------------------------------------------------------- Other messages ============================================================================= STATE | MESSAGE TEXT ---------+------------------------------------------------------------------- OK | ESM log health is Ok (less than 80% full) INTERNAL ERROR: Use of uninitialized value in join or string at ./check_openmanage line 1345. INTERNAL ERROR: Use of uninitialized value in join or string at ./check_openmanage line 1345. Have a nice day GV -------------------------------------------------- From: "Trond Hasle Amundsen" Sent: Friday, January 15, 2010 2:10 AM To: "Gianluca Varenni" Cc: Subject: Re: [Nagios-users] Warnings/errors with check_openmanage plugin > "Gianluca Varenni" writes: > >> I spoke too soon. The new build works ok on my T610 server, but it >> generates >> an internal error on a 750 server >> >> root at tinman:/home/nagios/plugins# ./check_openmanage -H homer -t 180 -d >> System: PowerEdge 750 >> ServiceTag: XXXXXXX OMSA version: 5.0.0 >> BIOS/date: XXXXXXXXXXXXXX Plugin version: 3.5.5-beta1 > > You're using OMSA 5.0.0 on that server, which is very old. The plugin is > designed to work with OMSA 5.3 and later. Can you try upgrading OMSA to > version 5.5.0? > > Cheers, > -- > Trond H. Amundsen > Center for Information Technology Services, University of Oslo ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rmaglasang at infoweapons.com Sat Jan 16 07:34:03 2010 From: rmaglasang at infoweapons.com (Sho) Date: Sat, 16 Jan 2010 14:34:03 +0800 Subject: Checking DHCP In-Reply-To: References: Message-ID: <4B515DDB.9050209@infoweapons.com> The check_dhcp plugin should do the job. Though its lacks some monitoring functionality, it can however tell if the DHCP server if up or not. Here's a snippet of my dhcp service config: # dhcp server define service{ use generic-service host_name dhcps.test.com service_description DHCPv4 servicegroups IPv4Services check_command check_dhcp! -i rl0 notifications_enabled 1 normal_check_interval 5 max_check_attempts 2 } # 'check_dhcp' command definition define command{ command_name check_dhcp command_line $USER1$/check_dhcp $ARG1$ } rl0 is the nagios's outgoing interface. If you know the IP address of the DHCP server, just specify it like check_dhcp! -i rl0 -s 10.3.3.190 -u Eduardo Barreto wrote: > Hi All, > > Does anybody knows or have a clue on how to check a dhcp server, I've > tried direct no success and then I'm trying to monitor it through nrpe. > I'll appreciate any help > > Thanks in advance > > > Luiz Eduardo ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Sascha.Runschke at gfkl.com Sat Jan 16 14:12:23 2010 From: Sascha.Runschke at gfkl.com (Sascha.Runschke at gfkl.com) Date: Sat, 16 Jan 2010 14:12:23 +0100 Subject: Antwort: send_nsca 64 bit problem In-Reply-To: <323336996ADE534FB38549B446AC5C1A42C4DC9DF0@MEIS1MSX005001.noam.msds.wachovia.net> References: <323336996ADE534FB38549B446AC5C1A42C4DC9DF0@MEIS1MSX005001.noam.msds.wachovia.net> Message-ID: > nsca[10902]: Dropping packet with invalid CRC32 - possibly due to > client using wrong password or crypto algorithm? > > Obviously, I?ve triple-checked the password and algorithm settings, > so that?s not the problem. > > I?m suspecting that since the collector is running a 64 bit version > of SLES (10.2) and the central server is 32 bit SLES (10.2), that > somehow the CRC is being generated differently between the two. > > Another bit of evidence pointing to the suspected mismatch is that > the collector in question can do send_nsca successfully to another > 64 bit system. > > Anyone else run into this or can think of a workaround to try? I am pretty sure there's no general problem with a 64 bit send_nsca and a 32 bit nsca-daemon, as I am using this setup massively. Most of our Oracle servers are RHEL5 64bit and our Nagios is a RHEL5 32bit. So it must be either a misconfiguration on your side, or a SLES specific problem. Even though you triple checked it, check it again - and I'd suppose to just start the integration of that particular server from scratch again. Maybe you made such an obvious mistake that even looking 10 times over it would not reveal it to you. Human beings tend to overlook obvious mistakes they made themselves - our brain is tricking us there ;) I'd say the top mistake of all the "I checked it ten times and the config is correct!"-problems I heard of is: your send_nsca is using another config then you think it does. Therefor you always check the wrong config and the real config has a mistake. Just a hint. Else I can't think of much else, as this setup is running here without problems. Last question though: are you using a 32bit precompiled send_nsca on the 64bit system or did you compile it natively on that system for 64bit? S GFKL Financial Services AG Vorstand: J?rgen Baltes, Dr. Tom Haverkamp Vorsitzender des Aufsichtsrats: Wilhelm Plumpe Sitz: Limbecker Platz 1, 45127 Essen, Amtsgericht Essen, HRB 13522 -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From dermoth at aei.ca Mon Jan 18 02:11:07 2010 From: dermoth at aei.ca (Thomas Guyot-Sionnest) Date: Sun, 17 Jan 2010 20:11:07 -0500 Subject: Checking the time taken by a webpage to load In-Reply-To: <4B504877.3070904@flatto.net> References: <7fedbc911001150244h44688754lf71fdbc33d326b39@mail.gmail.com> <4B504877.3070904@flatto.net> Message-ID: <4B53B52B.30907@aei.ca> -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 On 15/01/10 05:50 AM, Assaf Flatto wrote: > shacky wrote: >> Hi. >> >> I'm thinking about a way to monitor the loading time of a webpage with Nagios. >> >> Do you have any idea or any hint? >> >> Thank you very much! >> Bye. >> >> > > Do you mean the amount of time the http test is getting when requesting > a page ? > ./check_http -H www.google.com > HTTP OK: HTTP/1.1 302 Found - 832 bytes in* 0.069 second response time > |*time=0.069112s;;;0.000000 size=832B;;;0 > > i think this is covered in the check_http plugin In the case above you're monitoring only a redirect, which doesn't really means much. you can use the --onredirect switch (see check_http - --help) to follow redirects. Even when following redirects, it calculates only the time taken to load the main page, no resources (css, js, images, iframes) will be loaded. If you need something more complete you should have a look at WebInject, although I have no personal experience with it. http://www.webinject.org/ Even WebInject does not run JavaScript... I'm not aware of any free solution that does it right now, so results may be very different from reality on pages that rely a lot on them (Ajax, "Web 2.0"...) - -- Thomas -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.6 (GNU/Linux) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org iD8DBQFLU7Ur6dZ+Kt5BchYRAo6/AKCw+bxSbdsBtODsMzvNYyN7JAklAQCgqlV+ BVKihfWxFM/uDlB3fMjZ5U0= =cd3P -----END PGP SIGNATURE----- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagiosusers at edcint.co.nz Mon Jan 18 10:20:11 2010 From: nagiosusers at edcint.co.nz (Matthew Jurgens) Date: Mon, 18 Jan 2010 20:20:11 +1100 Subject: Squid SARG Reports Monitoring In-Reply-To: <1263568795.5500.36.camel@BDI12718> References: <1263568795.5500.36.camel@BDI12718> Message-ID: <4B5427CB.2090309@edcint.co.nz> Several ways to do it. Some I can think of right now are 1) check the sarg web page has changed each day, you might even try and get tricky and check for the specific date 2) Create a script to run the sarg job each day and log a passive result to Nagios with the result I personally like 2) since its closer to checking the actual source ie the sarg run, rather checking the web page that is created by sarg Read about passive checks: http://nagios.sourceforge.net/docs/3_0/passivechecks.html On 16/01/2010 2:19 AM, Harald B?hmecke wrote: > Hi, > > I need to monitor the Squid Reports > that SARG creates. Basically, I need to know if new reports were > created each day. > > I have no idea where I should start... any ideas? > > Regards, > > Harald > > > ------------------------------------------------------------------------------ > Throughout its 18-year history, RSA Conference consistently attracts the > world's best and brightest in the field, creating opportunities for Conference > attendees to learn about information security's most important issues through > interactions with peers, luminaries and emerging and established companies. > http://p.sf.net/sfu/rsaconf-dev2dev > > > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null -- Smartmon System Monitoring www.smartmon.com.au -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gabriel at impactteachers.com Mon Jan 18 10:53:43 2010 From: gabriel at impactteachers.com (Mr Gabriel) Date: Mon, 18 Jan 2010 09:53:43 +0000 Subject: Checking 64bit host from 32bit server Message-ID: <4B542FA7.1070802@impactteachers.com> When I run --# ./check_nrpe -H 64.bit.host I get back -- CHECK_NRPE: Error - Could not complete SSL handshake. When I do --# ./check_nrpe -H 64.bit.host -n I get -- CHECK_NRPE: Error receiving data from daemon. I'm checking from a 32 bit host, I shouldn't be able to have both problems? ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gabriel at impactteachers.com Mon Jan 18 11:23:45 2010 From: gabriel at impactteachers.com (Mr Gabriel) Date: Mon, 18 Jan 2010 10:23:45 +0000 Subject: Checking 64bit host from 32bit server SOLVED In-Reply-To: <4B542FA7.1070802@impactteachers.com> References: <4B542FA7.1070802@impactteachers.com> Message-ID: <4B5436B1.80204@impactteachers.com> On 18/01/2010 09:53, Mr Gabriel wrote: > When I run > > --# ./check_nrpe -H 64.bit.host > > I get back -- CHECK_NRPE: Error - Could not complete SSL handshake. When > I do > > --# ./check_nrpe -H 64.bit.host -n > > I get -- CHECK_NRPE: Error receiving data from daemon. I'm checking > from a 32 bit host, I shouldn't be able to have both problems? > > ------------------------------------------------------------------------------ > Throughout its 18-year history, RSA Conference consistently attracts the > world's best and brightest in the field, creating opportunities for Conference > attendees to learn about information security's most important issues through > interactions with peers, luminaries and emerging and established companies. > http://p.sf.net/sfu/rsaconf-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > Okay, I found the problem, editing the wrong configuration file for NRPE on the clients! *my mistake* ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From t.h.amundsen at usit.uio.no Mon Jan 18 11:48:30 2010 From: t.h.amundsen at usit.uio.no (Trond Hasle Amundsen) Date: Mon, 18 Jan 2010 11:48:30 +0100 Subject: Warnings/errors with check_openmanage plugin In-Reply-To: <4332B0BEFA314B68BF2B300299D11039@NELSON3> (Gianluca Varenni's message of "Fri, 15 Jan 2010 16:20:30 -0800") References: <15tska9zk2s.fsf@tux.uio.no> <5748BF419D664EF984BDBEE32111E1EB@NELSON3> <15t3a29x7me.fsf@tux.uio.no> <5FF760C88D854D76A18FB48021D45CF0@NELSON3> <15t6373ptjs.fsf@tux.uio.no> <4332B0BEFA314B68BF2B300299D11039@NELSON3> Message-ID: <15thbqjit8h.fsf@tux.uio.no> "Gianluca Varenni" writes: > I updated it to 6.2, now I get this > OK | 0:0:0 | Physical Disk 0:0 [250GB] on ctrl -1 is Online > OK | 0:0:1 | Physical Disk 0:1 [250GB] on ctrl -1 is Online This is interesting. Controller number shouldn't be -1. > INTERNAL ERROR: Use of uninitialized value in join or string at > ./check_openmanage line 1345. > INTERNAL ERROR: Use of uninitialized value in join or string at > ./check_openmanage line 1345. Ok, so the error persists with updated OMSA. To debug this further, I need the output from: * On the monitored host: omreport storage controller omreport storage pdisk controller=0 * On the Nagios server: snmpwalk -v2c -c 1.3.6.1.4.1.674.10893.1.20.130 Cheers, -- Trond H. Amundsen Center for Information Technology Services, University of Oslo ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gabriel at impactteachers.com Mon Jan 18 14:08:45 2010 From: gabriel at impactteachers.com (Mr Gabriel) Date: Mon, 18 Jan 2010 13:08:45 +0000 Subject: check_hpjd issues Message-ID: <4B545D5D.1030909@impactteachers.com> I'm getting the following error for running # ./check_hpjd -H printer1.domain.com -C public Error in packet () SNMP is enabled, although I do recall seeing snmp protocol 1 being used by this particular printer. Would this cause a problem? ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jpratt at norwich.edu Mon Jan 18 15:14:44 2010 From: jpratt at norwich.edu (James Pratt) Date: Mon, 18 Jan 2010 09:14:44 -0500 Subject: check_hpjd issues In-Reply-To: <4B545D5D.1030909@impactteachers.com> References: <4B545D5D.1030909@impactteachers.com> Message-ID: <24B6509E4191AF44B60A24EAA3B4AD49425EEE@nuexchg.norwich.edu> >> -----Original Message----- >> From: Mr Gabriel [mailto:gabriel at impactteachers.com] >> Sent: Monday, January 18, 2010 8:09 AM >> To: nagios-users at lists.sourceforge.net >> Subject: [Nagios-users] check_hpjd issues >> >> I'm getting the following error for running >> >> # ./check_hpjd -H printer1.domain.com -C public >> Error in packet () >> >> SNMP is enabled, although I do recall seeing snmp protocol 1 being used >> by this particular printer. Would this cause a problem? >> Hi - I'm not sure if that will be a problem but you can always test/troubleshoot this further by using snmpwalk on the command line, which may give you more information as to what the issue really is ... http://www.google.com/search?q=snmpwalk+examples hth regards, Jamie ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gianluca.varenni at gmail.com Mon Jan 18 17:46:18 2010 From: gianluca.varenni at gmail.com (Gianluca Varenni) Date: Mon, 18 Jan 2010 08:46:18 -0800 Subject: Warnings/errors with check_openmanage plugin In-Reply-To: <15thbqjit8h.fsf@tux.uio.no> References: <15tska9zk2s.fsf@tux.uio.no><5748BF419D664EF984BDBEE32111E1EB@NELSON3><15t3a29x7me.fsf@tux.uio.no><5FF760C88D854D76A18FB48021D45CF0@NELSON3><15t6373ptjs.fsf@tux.uio.no><4332B0BEFA314B68BF2B300299D11039@NELSON3> <15thbqjit8h.fsf@tux.uio.no> Message-ID: <7BAB810F306440B987E34C82FA699072@NELSON3> Here they are (embedded below) Have a nice day GV -------------------------------------------------- From: "Trond Hasle Amundsen" Sent: Monday, January 18, 2010 2:48 AM To: "Gianluca Varenni" Cc: Subject: Re: [Nagios-users] Warnings/errors with check_openmanage plugin > "Gianluca Varenni" writes: > >> I updated it to 6.2, now I get this >> OK | 0:0:0 | Physical Disk 0:0 [250GB] on ctrl -1 is Online >> OK | 0:0:1 | Physical Disk 0:1 [250GB] on ctrl -1 is Online > > This is interesting. Controller number shouldn't be -1. > >> INTERNAL ERROR: Use of uninitialized value in join or string at >> ./check_openmanage line 1345. >> INTERNAL ERROR: Use of uninitialized value in join or string at >> ./check_openmanage line 1345. > > Ok, so the error persists with updated OMSA. To debug this further, I > need the output from: > > * On the monitored host: > omreport storage controller Controller CERC SATA 1.5/6ch (Slot 2) Controllers ID : 0 Status : Non-Critical Name : CERC SATA 1.5/6ch Slot ID : PCI Slot 2 State : Degraded Firmware Version : 4.1-0 (Bld 7419) Minimum Required Firmware Version : Not Applicable Driver Version : 4.1.0.7010 Minimum Required Driver Version : 4.1.1.7040 Storport Driver Version : 5.2.3790.3959 Minimum Required Storport Driver Version : 5.2.3790.4173 Number of Connectors : 1 Rebuild Rate : Not Applicable BGI Rate : Not Applicable Check Consistency Rate : Not Applicable Reconstruct Rate : Not Applicable Alarm State : Enabled Cluster Mode : Not Applicable SCSI Initiator ID : Not Applicable Cache Memory Size : 64 MB Patrol Read Mode : Not Applicable Patrol Read State : Not Applicable Patrol Read Rate : Not Applicable Patrol Read Iterations : Not Applicable Abort check consistency on error : Not Applicable Allow Revertible Hot Spare and Replace Member : Not Applicable Auto replace member on predictive failure : Not Applicable Load balance : Not Applicable Security Capable : Not Applicable Security Key Present : Not Applicable Redundant Path view : Not Applicable > omreport storage pdisk controller=0 List of Physical Disks on Controller CERC SATA 1.5/6ch (Slot 2) Controller CERC SATA 1.5/6ch (Slot 2) ID : 0:0 Status : Ok Name : Physical Disk 0:0 State : Online Failure Predicted : No Certified : Not Applicable Encryption Capable : No Secured : Not Applicable Progress : Not Applicable Bus Protocol : SATA Media : HDD Capacity : 232.80 GB (249965248512 bytes) Used RAID Disk Space : 232.80 GB (249965248512 bytes) Available RAID Disk Space : 0.00 GB (0 bytes) Hot Spare : No Vendor ID : Maxtor Product ID : 7Y250M0 Revision : YAR5 Serial No. : Y66FN7ME Part Number : Not Available Negotiated Speed : Not Available Capable Speed : Not Available Manufacture Day : Not Available Manufacture Week : Not Available Manufacture Year : Not Available SAS Address : Not Available ID : 0:1 Status : Ok Name : Physical Disk 0:1 State : Online Failure Predicted : No Certified : Not Applicable Encryption Capable : No Secured : Not Applicable Progress : Not Applicable Bus Protocol : SATA Media : HDD Capacity : 232.80 GB (249965248512 bytes) Used RAID Disk Space : 232.80 GB (249965248512 bytes) Available RAID Disk Space : 0.00 GB (0 bytes) Hot Spare : No Vendor ID : Maxtor Product ID : 7Y250M0 Revision : YAR5 Serial No. : Y66FQLJE Part Number : Not Available Negotiated Speed : Not Available Capable Speed : Not Available Manufacture Day : Not Available Manufacture Week : Not Available Manufacture Year : Not Available SAS Address : Not Available > > * On the Nagios server: > snmpwalk -v2c -c > 1.3.6.1.4.1.674.10893.1.20.130 gianlucav at tinman:~$ snmpwalk -v2c -c public homer 1.3.6.1.4.1.674.10893.1.20.130 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.1.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.2.1 = STRING: "CERC SATA 1.5/6ch" SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.3.1 = STRING: "DELL" SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.4.1 = INTEGER: 5 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.5.1 = INTEGER: 6 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.8.1 = STRING: "4.1-0 (Bld 7419)" SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.9.1 = INTEGER: 64 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.10.1 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.11.1 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.12.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.15.1 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.16.1 = INTEGER: 48 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.37.1 = INTEGER: 4 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.38.1 = INTEGER: 4 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.39.1 = STRING: "\\0" SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.40.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.41.1 = STRING: "4.1.0.7010" SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.42.1 = STRING: "2" SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.43.1 = INTEGER: 99 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.45.1 = STRING: "4.1.1.7040" SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.47.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.55.1 = STRING: "5.2.3790.3959" SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.56.1 = STRING: "5.2.3790.4173" SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.57.1 = INTEGER: 99 SNMPv2-SMI::enterprises.674.10893.1.20.130.1.1.58.1 = INTEGER: 99 SNMPv2-SMI::enterprises.674.10893.1.20.130.2.1.1.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.2.1.2.1 = STRING: "Connector 0" SNMPv2-SMI::enterprises.674.10893.1.20.130.2.1.3.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.2.1.5.1 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.2.1.7.1 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.2.1.8.1 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.2.1.9.1 = STRING: "\\0\\0" SNMPv2-SMI::enterprises.674.10893.1.20.130.2.1.11.1 = INTEGER: 7 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.1.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.1.2 = INTEGER: 2 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.2.1 = STRING: "Physical Disk 0:0" SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.2.2 = STRING: "Physical Disk 0:1" SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.3.1 = STRING: "Maxtor " SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.3.2 = STRING: "Maxtor " SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.4.1 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.4.2 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.6.1 = STRING: "7Y250M0 " SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.6.2 = STRING: "7Y250M0 " SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.7.1 = STRING: "Y66FN7ME " SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.7.2 = STRING: "Y66FQLJE " SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.8.1 = STRING: "YAR5" SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.8.2 = STRING: "YAR5" SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.10.1 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.10.2 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.11.1 = INTEGER: 238385 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.11.2 = INTEGER: 238385 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.12.1 = INTEGER: 458752 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.12.2 = INTEGER: 458752 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.13.1 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.13.2 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.14.1 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.14.2 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.15.1 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.15.2 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.16.1 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.16.2 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.17.1 = INTEGER: 238385 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.17.2 = INTEGER: 238385 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.18.1 = INTEGER: 458752 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.18.2 = INTEGER: 458752 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.19.1 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.19.2 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.20.1 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.20.2 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.21.1 = INTEGER: 7 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.21.2 = INTEGER: 7 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.22.1 = INTEGER: 5 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.22.2 = INTEGER: 5 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.23.1 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.23.2 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.24.1 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.24.2 = INTEGER: 3 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.26.1 = STRING: "\\0\\0\\0" SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.26.2 = STRING: "\\0\\0\\1" SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.31.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.31.2 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.35.1 = INTEGER: 2 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.35.2 = INTEGER: 2 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.36.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.36.2 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.40.1 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.40.2 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.41.1 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.4.1.41.2 = INTEGER: 0 SNMPv2-SMI::enterprises.674.10893.1.20.130.6.1.1.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.6.1.1.2 = INTEGER: 2 SNMPv2-SMI::enterprises.674.10893.1.20.130.6.1.2.1 = STRING: "Physical Disk 0:0" SNMPv2-SMI::enterprises.674.10893.1.20.130.6.1.2.2 = STRING: "Physical Disk 0:1" SNMPv2-SMI::enterprises.674.10893.1.20.130.6.1.3.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.6.1.3.2 = INTEGER: 2 SNMPv2-SMI::enterprises.674.10893.1.20.130.6.1.4.1 = STRING: "Connector 0" SNMPv2-SMI::enterprises.674.10893.1.20.130.6.1.4.2 = STRING: "Connector 0" SNMPv2-SMI::enterprises.674.10893.1.20.130.6.1.5.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.6.1.5.2 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.6.1.6.1 = STRING: "CERC SATA 1.5/6ch" SNMPv2-SMI::enterprises.674.10893.1.20.130.6.1.6.2 = STRING: "CERC SATA 1.5/6ch" SNMPv2-SMI::enterprises.674.10893.1.20.130.6.1.7.1 = INTEGER: 1 SNMPv2-SMI::enterprises.674.10893.1.20.130.6.1.7.2 = INTEGER: 1 gianlucav at tinman:~$ > > Cheers, > -- > Trond H. Amundsen > Center for Information Technology Services, University of Oslo ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From juki.emma at gmail.com Tue Jan 19 08:37:48 2010 From: juki.emma at gmail.com (Juki) Date: Tue, 19 Jan 2010 10:37:48 +0300 Subject: Multiple NRPE Processes Message-ID: <7545d7d21001182337t7b5e0118q4507a77c959e3a4d@mail.gmail.com> Hi all, I'm running Solaris 10 on the client that is being monitored. However, when I run the command netstat -a | grep nrpe, I see multiple instances of the NRPE daemon running as shown below; nagios at pms # netstat -a | grep nrpe *.nrpe *.* 0 0 49152 0 LISTEN pms.nrpe h1de4.n2 56147 5888 0 50540 0 TIME_WAIT pms.nrpe h1de4.n2 56148 5888 0 50540 0 TIME_WAIT pms.nrpe h1de4.n2 56149 5888 0 50540 0 TIME_WAIT I have tried to *kill* them (using kill command) but that doesn't seem to work because they just don't go away - weird right? :) I would like to kill all the pms.nrpe processes.. Anyone have another way of achieving this? Thanks, Juki -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From komodo at uvt.cz Tue Jan 19 10:59:43 2010 From: komodo at uvt.cz (komodo) Date: Tue, 19 Jan 2010 10:59:43 +0100 Subject: first_notification_delay problem Message-ID: <201001191059.43554.komodo@uvt.cz> Hi all Can someone pls explane to me how first_notification_delay works ? I setup one host with one service for testing. I setup first_notification_delay for host to 5 and for service (http) to 10 so i expected that i get nitification after 10 minutes when i stop http service. But i always get first notification after 5 minutes, no matter what i set in config file. Then i changed delay for host to 3 because i thought that this is the problem, but after this change and nagios restart i got notification after 5 minutes again. That's very strange to me. I don't know what to check and how make it work. Is there any option that should i check ? Thank you Martin ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Tue Jan 19 14:37:08 2010 From: marc at ena.com (Marc Powell) Date: Tue, 19 Jan 2010 07:37:08 -0600 Subject: Multiple NRPE Processes In-Reply-To: <7545d7d21001182337t7b5e0118q4507a77c959e3a4d@mail.gmail.com> References: <7545d7d21001182337t7b5e0118q4507a77c959e3a4d@mail.gmail.com> Message-ID: On Jan 19, 2010, at 1:37 AM, Juki wrote: > nagios at pms # netstat -a | grep nrpe > *.nrpe *.* 0 0 49152 0 LISTEN > pms.nrpe h1de4.n2 56147 5888 0 50540 0 TIME_WAIT > pms.nrpe h1de4.n2 56148 5888 0 50540 0 TIME_WAIT > pms.nrpe h1de4.n2 56149 5888 0 50540 0 TIME_WAIT > > > I have tried to *kill* them (using kill command) but that doesn't seem to work because they just don't go away - weird right? :) I would like to kill all the pms.nrpe processes.. There's nothing to kill and these entries are appropriate... Welcome to the world of TCP and a greater understanding of how it all _really_ works. http://www.developerweb.net/forum/showthread.php?t=2941 How long are they sticking around? What OS are you using? -- Marc ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From richard.hallgren at ibiz-solutions.se Tue Jan 19 15:25:17 2010 From: richard.hallgren at ibiz-solutions.se (Richard Hallgren) Date: Tue, 19 Jan 2010 15:25:17 +0100 Subject: Filter status.cgi view in 1.3 Message-ID: Hi, We're running version 1.3 of Nagios. I'd like to have the status.cgi view to only show a couple of the 1000+ host we have monitored. I know I can add the name of the host as a parameter so it becomes something like "status.cgi?host=MyHostName" but can I also use this technique to filter out a number of hosts? Other ideas to get a page with just a few of all the hosts? Richard -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Tue Jan 19 16:13:13 2010 From: marc at ena.com (Marc Powell) Date: Tue, 19 Jan 2010 09:13:13 -0600 Subject: Filter status.cgi view in 1.3 In-Reply-To: References: Message-ID: On Jan 19, 2010, at 8:25 AM, Richard Hallgren wrote: > Hi, > > We?re running version 1.3 of Nagios. Much of the rest of the world is running 3.x. I strongly encourage you to upgrade sooner rather than later. The conversion from 1.x to 2.x+ requires configuration changes and tribal memory of what exactly needs to be done, how best to do it and ability to find third party conversion scripts gets less and less likely the longer you wait. Additionally, less and less people on this list have actual experience with 1.x... You're already somewhere in the neighborhood of 5 years out-of-date. > I?d like to have the status.cgi view to only show a couple of the 1000+ host we have monitored. I know I can add the name of the host as a parameter so it becomes something like ?status.cgi?host=MyHostName? but can I also use this technique to filter out a number of hosts? No. > > Other ideas to get a page with just a few of all the hosts? You can do this with authentication enabled. Create groups of the limited view hosts, assign unique contacts to them, create auth usernames that match those contacts and log in with them. By default nagios will only show hosts and services that an authenticated user is a contact for. -- Marc ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From juki.emma at gmail.com Tue Jan 19 16:22:48 2010 From: juki.emma at gmail.com (Juki) Date: Tue, 19 Jan 2010 18:22:48 +0300 Subject: Multiple NRPE Processes In-Reply-To: References: <7545d7d21001182337t7b5e0118q4507a77c959e3a4d@mail.gmail.com> Message-ID: <7545d7d21001190722m11f7632erfd25d234549f53@mail.gmail.com> Hi Marc, 2010/1/19 Marc Powell > > There's nothing to kill and these entries are appropriate... Welcome to the > world of TCP and a greater understanding of how it all _really_ works. > > http://www.developerweb.net/forum/showthread.php?t=2941 > I will have a look at this > > How long are they sticking around? What OS are you using? > I'm running Solaris 10. They are more less sticking around indefinitely. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From t.h.amundsen at usit.uio.no Tue Jan 19 16:50:47 2010 From: t.h.amundsen at usit.uio.no (Trond Hasle Amundsen) Date: Tue, 19 Jan 2010 16:50:47 +0100 Subject: Warnings/errors with check_openmanage plugin In-Reply-To: <7BAB810F306440B987E34C82FA699072@NELSON3> (Gianluca Varenni's message of "Mon, 18 Jan 2010 08:46:18 -0800") References: <15tska9zk2s.fsf@tux.uio.no> <5748BF419D664EF984BDBEE32111E1EB@NELSON3> <15t3a29x7me.fsf@tux.uio.no> <5FF760C88D854D76A18FB48021D45CF0@NELSON3> <15t6373ptjs.fsf@tux.uio.no> <4332B0BEFA314B68BF2B300299D11039@NELSON3> <15thbqjit8h.fsf@tux.uio.no> <7BAB810F306440B987E34C82FA699072@NELSON3> Message-ID: <15tska2f608.fsf@tux.uio.no> "Gianluca Varenni" writes: > Here they are (embedded below) Thanks. This really puzzled me, because snmpwalk returned all the output needed, while my plugin didn't seem to fetch the needed values. Fortunately, I was able to reproduce this on a local 2650, which makes debugging much easier. It turns out that when using Net::SNMP::get_entries() to fetch the values, there will sometimes be a whitespace in the OID field of the data returned, while Net::SNMP::get_table() is OK. I really want to use get_entries(), which is significantly faster on servers with lots of physical disks. I have patched check_openmanage so that any excess whitespace is removed from the OID field before it is analyzed, so get_entries() can still be used. A beta version is available (check_openmanage-3.5.5-beta4) here: http://folk.uio.no/trondham/tmp/ This beta also addresses the controller identification for physical drives, where the controller id was reported as '-1'. On older hardware such as the 2650 and 750, the controller to which a physical disk is attached is not available as a connection to the enclosure, because no enclosure exists. The plugin will now probe for this information via the channel connection as well, which provides this info. This only implies that I probe an additional OID for this information. Please try this beta version and report your results. Cheers, -- Trond H. Amundsen Center for Information Technology Services, University of Oslo ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Steven.Battaille at ap2.be Tue Jan 19 17:15:09 2010 From: Steven.Battaille at ap2.be (Steven Battaille) Date: Tue, 19 Jan 2010 17:15:09 +0100 Subject: Notification question Message-ID: <16A9FB339EF0024C80FD6D1910268C7F01DAB7B1@apps18.ap2.local> I'm having a question about the notifications that are being sent by nagios. I found out that if you put the notification interval at 0 you only get a mail when the service goes into the type you defined. I'll explain my question with an example. We monitor the disk space of several servers. The threshold for the warning and critical are set to 90 and 95. As soon as the service goes into warning because the threshold is at 90% we get an email with a warning status (so far so good) If the threshold increases to 91% we don't get an email anymore because the state doesn't change. (it's still warning). We now want Nagios to send an email that as soon as the threshold value has changed we get an email. (from 90 to 91) We don't want to get an email as long as the threshold stays at 90%(we already got this by mail) Does somebody knows a solution for this? Steven Battaille -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gabriel at impactteachers.com Tue Jan 19 17:39:40 2010 From: gabriel at impactteachers.com (Mr Gabriel) Date: Tue, 19 Jan 2010 16:39:40 +0000 Subject: Variables in .cfg files -- Howto? Message-ID: <4B55E04C.7030607@impactteachers.com> How can I use variables in my .cfg files that are only used in that file, and not across my entire install? ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Tue Jan 19 17:41:32 2010 From: marc at ena.com (Marc Powell) Date: Tue, 19 Jan 2010 10:41:32 -0600 Subject: Multiple NRPE Processes In-Reply-To: <7545d7d21001190722m11f7632erfd25d234549f53@mail.gmail.com> References: <7545d7d21001182337t7b5e0118q4507a77c959e3a4d@mail.gmail.com> <7545d7d21001190722m11f7632erfd25d234549f53@mail.gmail.com> Message-ID: <4FB2C132-9682-43D6-9185-20920AE5225F@ena.com> On Jan 19, 2010, at 9:22 AM, Juki wrote: > How long are they sticking around? What OS are you using? > > I'm running Solaris 10. They are more less sticking around indefinitely. 60 seconds seems to be the default/recommended setting on Solaris but can be as high as 10 minutes if the OS thinks it needs to be (http://docs.sun.com/app/docs/doc/817-0404/chapter4-51?a=view). I wouldn't recommend tuning this unless you know exactly what you are doing and why; strangeness may result. Perhaps you've tuned it already and this is the resulting strangeness... You're going to have better luck understanding this by asking about it on a Solaris support list as what you're asking about is a general TCP stack question, not specific to Nagios. -- Marc ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Tue Jan 19 17:45:43 2010 From: patrick.morris at hp.com (Morris, Patrick) Date: Tue, 19 Jan 2010 08:45:43 -0800 Subject: Variables in .cfg files -- Howto? In-Reply-To: <4B55E04C.7030607@impactteachers.com> References: <4B55E04C.7030607@impactteachers.com> Message-ID: <4B55E1B7.3040209@hp.com> Mr Gabriel wrote: > How can I use variables in my .cfg files that are only used in that > file, and not across my entire install? > You don't really. Nagios doesn't keep track of what lives in which file; it just loads everything up and parses the whole thing afterward. If you really need per-file variables, you'll probably have to do it before loading them into Nagios with some sort of external pre-parser, but whether that'll work for you or not is hard to say since it's not clear what you're trying to accomplish. ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Tue Jan 19 17:48:59 2010 From: patrick.morris at hp.com (Morris, Patrick) Date: Tue, 19 Jan 2010 08:48:59 -0800 Subject: Notification question In-Reply-To: <16A9FB339EF0024C80FD6D1910268C7F01DAB7B1@apps18.ap2.local> References: <16A9FB339EF0024C80FD6D1910268C7F01DAB7B1@apps18.ap2.local> Message-ID: <4B55E27B.5010300@hp.com> Steven Battaille wrote: > > I?m having a question about the notifications that are being sent by > nagios. > > > > I found out that if you put the notification interval at 0 you only > get a mail when the service goes into the type you defined. > > > > I?ll explain my question with an example. > > > > We monitor the disk space of several servers. The > threshold for the warning and critical are set to 90 and 95. > > > > As soon as the service goes into warning because the > threshold is at 90% we get an email with a warning status (so far so good) > > > > If the threshold increases to 91% we don?t get an > email anymore because the state doesn?t change. (it?s still warning). > > > > We now want Nagios to send an email that as soon as the threshold > value has changed we get an email. (from 90 to 91) > > > > We don?t want to get an email as long as the threshold stays at 90%(we > already got this by mail) > > > > Does somebody knows a solution for this? > > > You'll probably want to *not* set your notification interval to zero, and come up with a custom notification command to do what you want (i.e., check if that value has changed, and don't send anything if it hasn't). Naguios generally doesn't care about plugin output; it just looks at the result code. No result code change means, as far as Nagios is concerned, nothing has changed. One warning's no different from any other as far as the notification logic goes. ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Ken.McKinlay at curtisswright.com Tue Jan 19 17:46:41 2010 From: Ken.McKinlay at curtisswright.com (McKinlay, Ken) Date: Tue, 19 Jan 2010 11:46:41 -0500 Subject: check_openmanage 3.5.4 status bug Message-ID: Greetings, The status hash that is used to translate the voltageProbeStatus appears to be incorrect. Instead of %snmp_status, the hash used should be %snmp_probestatus. This is at line 3054 in check_openmanage v3.5.4. The corrected line reads: $status = $snmp_probestatus{$out->{voltageProbeStatus}}; I discovered this issue since one of my Dells has an almost dead battery but the check returned " OOPS! Something is wrong with this server, but I don't know what. The global system health status is CRITICAL, but every component check is OK. This may be a bug in the Nagios plugin, please file a bug report." -- Ken McKinlay, CISSP, GCIA, GAWN, GSNA, GPEN Network Security Curtiss Wright Controls Embedded Computing _______________________________________________________________________ This e-mail and any files transmitted with it are proprietary and intended solely for the use of the individual or entity to whom they are addressed. If you have reason to believe that you have received this e-mail in error, please notify the sender and destroy this email and any attached files. Please note that any views or opinions presented in this e-mail are solely those of the author and do not necessarily represent those of the Curtiss-Wright Corporation or any of its subsidiaries. Documents attached hereto may contain technology subject to government export regulations. Recipient is solely responsible for ensuring that any re-export, transfer or disclosure of this information is in accordance with applicable government export regulations. The recipient should check this e-mail and any attachments for the presence of viruses. Curtiss-Wright Corporation and its subsidiaries accept no liability for any damage caused by any virus transmitted by this e-mail. ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From t.h.amundsen at usit.uio.no Tue Jan 19 18:18:49 2010 From: t.h.amundsen at usit.uio.no (Trond Hasle Amundsen) Date: Tue, 19 Jan 2010 18:18:49 +0100 Subject: check_openmanage 3.5.4 status bug In-Reply-To: (Ken McKinlay's message of "Tue, 19 Jan 2010 11:46:41 -0500") References: Message-ID: <15tk4vef1xi.fsf@tux.uio.no> "McKinlay, Ken" writes: > The status hash that is used to translate the voltageProbeStatus appears > to be incorrect. Instead of %snmp_status, the hash used should be > %snmp_probestatus. This is at line 3054 in check_openmanage v3.5.4. The > corrected line reads: > > $status = $snmp_probestatus{$out->{voltageProbeStatus}}; > > I discovered this issue since one of my Dells has an almost dead battery > but the check returned " OOPS! Something is wrong with this server, but > I don't know what. The global system health status is CRITICAL, but > every component check is OK. This may be a bug in the Nagios plugin, > please file a bug report." Hi Ken, Thanks for the bug report and the patch. I guess that this was never discovered before, since voltage errors occur rarely. Also good to know that the little failsafe oops works as intended :) Your patch will be included in the next release, and a patched beta version is available here: http://folk.uio.no/trondham/tmp/check_openmanage-3.5.5-beta6 Cheers, -- Trond H. Amundsen Center for Information Technology Services, University of Oslo ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Tue Jan 19 18:18:25 2010 From: marc at ena.com (Marc Powell) Date: Tue, 19 Jan 2010 11:18:25 -0600 Subject: Notification question In-Reply-To: <16A9FB339EF0024C80FD6D1910268C7F01DAB7B1@apps18.ap2.local> References: <16A9FB339EF0024C80FD6D1910268C7F01DAB7B1@apps18.ap2.local> Message-ID: <39A726E5-C7CA-482C-9C48-CD7E18E3BFB5@ena.com> On Jan 19, 2010, at 10:15 AM, Steven Battaille wrote: > If the threshold increases to 91% we don?t get an email anymore because the state doesn?t change. (it?s still warning). > > We now want Nagios to send an email that as soon as the threshold value has changed we get an email. (from 90 to 91) > > We don?t want to get an email as long as the threshold stays at 90%(we already got this by mail) > > Does somebody knows a solution for this? Nagios doesn't work like this. It does not know or care that it changed from 90 to 91 so it can't do anything with it. This speaks to the core of how nagios sees the world. When nagios runs a plugin to check something, it will receive up to 3 things back from the plugin - - a numeric exit code that corresponds to the current state (OK, WARNING, CRITICAL, UNKNOWN) - some text comprised of [|] Nagios only really cares about the state, everything else is just fluff. State changes are the only things that trigger notifications*, unless you have 'is_volatile' set. If the plugin returns WARNING for 90 and WARNING for 91, nagios only sees WARNING for both so the state hasn't changed and a new notification isn't necessary. If you set is_volatile, nagios will send a notification every time the state is not OK (90, 91, 90, 90, 90 would all generate notifications.) The part that you're interested in doing something with is in the . Nagios does nothing with this. It can be any text in any format, depending on what the plugin thinks is useful to you and what you've asked it to check. Since there's no possible way to know and parse an effectively infinite variety of plugin output content and format, nagios does none of it. It just makes it available in the GUI and as a macro for you to do whatever you want to with it on your own. Typically it's just sent to the notification command for inclusion in the notification. That being said, what you could do is to add some logic to your notification command for this case. Your notification command can be any script at all. The default is just a simple mail command but it could be something very complex. If you write your own, it would need to do things like keep track of the prior state and only send a notification of your criteria were met. -- Marc *outside of re-notifications due to notification_interval or escalations... ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gianluca.varenni at gmail.com Tue Jan 19 19:08:16 2010 From: gianluca.varenni at gmail.com (Gianluca Varenni) Date: Tue, 19 Jan 2010 10:08:16 -0800 Subject: Warnings/errors with check_openmanage plugin In-Reply-To: <15tska2f608.fsf@tux.uio.no> References: <15tska9zk2s.fsf@tux.uio.no><5748BF419D664EF984BDBEE32111E1EB@NELSON3><15t3a29x7me.fsf@tux.uio.no><5FF760C88D854D76A18FB48021D45CF0@NELSON3><15t6373ptjs.fsf@tux.uio.no><4332B0BEFA314B68BF2B300299D11039@NELSON3><15thbqjit8h.fsf@tux.uio.no><7BAB810F306440B987E34C82FA699072@NELSON3> <15tska2f608.fsf@tux.uio.no> Message-ID: This version seems to work ok now. Thanks! Have a nice day GV -------------------------------------------------- From: "Trond Hasle Amundsen" Sent: Tuesday, January 19, 2010 7:50 AM To: "Gianluca Varenni" Cc: Subject: Re: [Nagios-users] Warnings/errors with check_openmanage plugin > "Gianluca Varenni" writes: > >> Here they are (embedded below) > > Thanks. This really puzzled me, because snmpwalk returned all the output > needed, while my plugin didn't seem to fetch the needed values. > > Fortunately, I was able to reproduce this on a local 2650, which makes > debugging much easier. It turns out that when using > Net::SNMP::get_entries() to fetch the values, there will sometimes be a > whitespace in the OID field of the data returned, while > Net::SNMP::get_table() is OK. I really want to use get_entries(), which > is significantly faster on servers with lots of physical disks. > > I have patched check_openmanage so that any excess whitespace is removed > from the OID field before it is analyzed, so get_entries() can still be > used. A beta version is available (check_openmanage-3.5.5-beta4) here: > > http://folk.uio.no/trondham/tmp/ > > This beta also addresses the controller identification for physical > drives, where the controller id was reported as '-1'. On older hardware > such as the 2650 and 750, the controller to which a physical disk is > attached is not available as a connection to the enclosure, because no > enclosure exists. The plugin will now probe for this information via the > channel connection as well, which provides this info. This only implies > that I probe an additional OID for this information. > > Please try this beta version and report your results. > > Cheers, > -- > Trond H. Amundsen > Center for Information Technology Services, University of Oslo ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Tue Jan 19 19:18:32 2010 From: marc at ena.com (Marc Powell) Date: Tue, 19 Jan 2010 12:18:32 -0600 Subject: Variables in .cfg files -- Howto? In-Reply-To: <4B55E04C.7030607@impactteachers.com> References: <4B55E04C.7030607@impactteachers.com> Message-ID: <375A8F3D-9DFF-45B8-AD6C-13F47895365A@ena.com> On Jan 19, 2010, at 10:39 AM, Mr Gabriel wrote: > How can I use variables in my .cfg files that are only used in that > file, and not across my entire install? Your question doesn't make any sense. Can you elaborate by example? -- Marc ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From komodo at uvt.cz Tue Jan 19 18:57:37 2010 From: komodo at uvt.cz (komodo) Date: Tue, 19 Jan 2010 18:57:37 +0100 Subject: first_notification_delay problem In-Reply-To: <201001191059.43554.komodo@uvt.cz> References: <201001191059.43554.komodo@uvt.cz> Message-ID: <201001191857.37170.komodo@uvt.cz> Hi As addition to my first post about notification delay. I made some further investigation and found out some another strange behaviour (for me) and couldn't find any info in docs. I have first_nitification_delay for host se to 0 and i have tried change this setting for service and here are the results first column is value that i have set and second is real time till notification wa sent. 5 = 0 (immediately) 10 = 5 15 = 10 25 = 20 12 = 5 14 = 10 45 = 30 Can plesae explain someone to me why is it like this ? I don't understand, if it has some dependency from another settings or what. Thank you very much. Martin On Tuesday 19 January 2010 10:59:43 komodo wrote: > Hi all > > Can someone pls explane to me how first_notification_delay works ? > I setup one host with one service for testing. I setup > first_notification_delay for host to 5 and for service (http) to 10 so i > expected that i get nitification after 10 minutes when i stop http > service. But i always get first notification after 5 minutes, no matter > what i set in config file. > > Then i changed delay for host to 3 because i thought that this is the > problem, but after this change and nagios restart i got notification after > 5 minutes again. That's very strange to me. > > I don't know what to check and how make it work. Is there any option that > should i check ? > > Thank you > > Martin > > --------------------------------------------------------------------------- > --- Throughout its 18-year history, RSA Conference consistently attracts > the world's best and brightest in the field, creating opportunities for > Conference attendees to learn about information security's most important > issues through interactions with peers, luminaries and emerging and > established companies. http://p.sf.net/sfu/rsaconf-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when > ::: reporting any issue. Messages without supporting info will risk being > ::: sent to /dev/null > ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gael.cheron at free.fr Tue Jan 19 19:42:52 2010 From: gael.cheron at free.fr (Gael Cheron) Date: Tue, 19 Jan 2010 19:42:52 +0100 Subject: check_cluster and service hardstate In-Reply-To: <8480e60f1001191039s72a61fa9g897bc6b805311429@mail.gmail.com> References: <8480e60f1001191039s72a61fa9g897bc6b805311429@mail.gmail.com> Message-ID: <8480e60f1001191042p131bdbfdgc2c21c028d38eb4@mail.gmail.com> Hi all, In the doc dealing with the service cluster with the check_cluster plugin, the example uses $SERVICESTATEID:host:Service$ But I'd like to use the hardstate. As far as I know, there is no variable giving the service hardstate. I had a look at the list http://nagios.sourceforge.net/docs/2_0/macros.html Do somebody know if there is a way to do this ? Thanks a lot. Regards, Ga?l. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Ken.McKinlay at curtisswright.com Tue Jan 19 19:27:07 2010 From: Ken.McKinlay at curtisswright.com (McKinlay, Ken) Date: Tue, 19 Jan 2010 13:27:07 -0500 Subject: check_openmanage 3.5.5-beta6 snmp_detect_blade bug In-Reply-To: <15tk4vef1xi.fsf@tux.uio.no> References: <15tk4vef1xi.fsf@tux.uio.no> Message-ID: Trond, Other little bug for your next release. Using check_openmanage 3.5.5-beta6 on a server loaded with OMSA 5.1.0 (a different box this time), in the snmp_detect_blade function it returned: "INTERNAL ERROR: Use of uninitialized value in string eq at ./check_openmanage-3.5.5-beta6 line 599." Looking at the line and then doing my own SNMP query, that OID is missing in OMSA 5.1.0. However, by changing line 599 to first make sure a result has been set then the uninitialized value error is bypassed in the if statement: if ( $result->{$DellBaseBoardType} && $result->{$DellBaseBoardType} eq '3') { Ken > -----Original Message----- > From: Trond Hasle Amundsen [mailto:t.h.amundsen at usit.uio.no] > Sent: Tuesday, January 19, 2010 12:19 PM > To: McKinlay, Ken > Cc: nagios-users at lists.sourceforge.net > Subject: Re: check_openmanage 3.5.4 status bug > > "McKinlay, Ken" writes: > > > The status hash that is used to translate the voltageProbeStatus > appears > > to be incorrect. Instead of %snmp_status, the hash used should be > > %snmp_probestatus. This is at line 3054 in check_openmanage v3.5.4. > The > > corrected line reads: > > > > $status = $snmp_probestatus{$out->{voltageProbeStatus}}; > > > > I discovered this issue since one of my Dells has an almost dead > battery > > but the check returned " OOPS! Something is wrong with this server, > but > > I don't know what. The global system health status is CRITICAL, but > > every component check is OK. This may be a bug in the Nagios plugin, > > please file a bug report." > > Hi Ken, > > Thanks for the bug report and the patch. I guess that this was never > discovered before, since voltage errors occur rarely. Also good to know > that the little failsafe oops works as intended :) > > Your patch will be included in the next release, and a patched beta > version is available here: > > http://folk.uio.no/trondham/tmp/check_openmanage-3.5.5-beta6 > > Cheers, > -- > Trond H. Amundsen > Center for Information Technology Services, University of Oslo _______________________________________________________________________ This e-mail and any files transmitted with it are proprietary and intended solely for the use of the individual or entity to whom they are addressed. If you have reason to believe that you have received this e-mail in error, please notify the sender and destroy this email and any attached files. Please note that any views or opinions presented in this e-mail are solely those of the author and do not necessarily represent those of the Curtiss-Wright Corporation or any of its subsidiaries. Documents attached hereto may contain technology subject to government export regulations. Recipient is solely responsible for ensuring that any re-export, transfer or disclosure of this information is in accordance with applicable government export regulations. The recipient should check this e-mail and any attachments for the presence of viruses. Curtiss-Wright Corporation and its subsidiaries accept no liability for any damage caused by any virus transmitted by this e-mail. ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Tue Jan 19 20:07:06 2010 From: marc at ena.com (Marc Powell) Date: Tue, 19 Jan 2010 13:07:06 -0600 Subject: first_notification_delay problem In-Reply-To: <201001191857.37170.komodo@uvt.cz> References: <201001191059.43554.komodo@uvt.cz> <201001191857.37170.komodo@uvt.cz> Message-ID: On Jan 19, 2010, at 11:57 AM, komodo wrote: > Hi > > As addition to my first post about notification delay. > > I made some further investigation and found out some another strange behaviour > (for me) and couldn't find any info in docs. > > I have first_nitification_delay for host se to 0 and i have tried change this > setting for service and here are the results I have no experience with this feature and don't have time to dig into it right now but based on your unreliable behavior I'd suggest you make sure you don't have multiple copies of nagios running at the same time. If you've been stopping and starting a lot that may be the case. -- Marc ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Tue Jan 19 20:38:27 2010 From: marc at ena.com (Marc Powell) Date: Tue, 19 Jan 2010 13:38:27 -0600 Subject: check_cluster and service hardstate In-Reply-To: <8480e60f1001191042p131bdbfdgc2c21c028d38eb4@mail.gmail.com> References: <8480e60f1001191039s72a61fa9g897bc6b805311429@mail.gmail.com> <8480e60f1001191042p131bdbfdgc2c21c028d38eb4@mail.gmail.com> Message-ID: On Jan 19, 2010, at 12:42 PM, Gael Cheron wrote: > Hi all, > > In the doc dealing with the service cluster with the check_cluster plugin, the example uses $SERVICESTATEID:host:Service$ > > But I'd like to use the hardstate. As far as I know, there is no variable giving the service hardstate. I had a look at the list http://nagios.sourceforge.net/docs/2_0/macros.html > > Do somebody know if there is a way to do this ? Having such a macro wouldn't make much sense, IMHO. It seems you might not understand the difference between HARD and SOFT states or I don't understand the problem you're trying to solve? What would it's value be while the service was in a SOFT state, presumably when you'd be interested in using it? OK? WARNING? CRITICAL? $LASTSERVICESTATEID$ might be close to what you're thinking about but I believe that it updates after every check, regardless of HARD/SOFT status. I think you're wanting check_cluster to do something only if the things it's watching are in a HARD state. My suggestion would be to pass $SERVICESTATETYPE:host:Service$ for each cluster element to check_cluster and teach check_cluster to read that and to do what you need based on your business logic. Hope that helps... -- Marc ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From komodo at uvt.cz Tue Jan 19 20:52:26 2010 From: komodo at uvt.cz (komodo) Date: Tue, 19 Jan 2010 20:52:26 +0100 Subject: first_notification_delay problem Message-ID: <201001192052.26520.komodo@uvt.cz> > I have no experience with this feature and don't have time to dig into it > right now but based on your unreliable behavior I'd suggest you make sure > you don't have multiple copies of nagios running at the same time. If > you've been stopping and starting a lot that may be the case. > > -- > Marc Thanks for the answer, but that is not a problem. There is only one nagios process. Martin ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From t.h.amundsen at usit.uio.no Wed Jan 20 10:35:43 2010 From: t.h.amundsen at usit.uio.no (Trond Hasle Amundsen) Date: Wed, 20 Jan 2010 10:35:43 +0100 Subject: check_openmanage 3.5.5-beta6 snmp_detect_blade bug In-Reply-To: (Ken McKinlay's message of "Tue, 19 Jan 2010 13:27:07 -0500") References: <15tk4vef1xi.fsf@tux.uio.no> Message-ID: <15tfx61f79s.fsf@tux.uio.no> "McKinlay, Ken" writes: > Trond, > > Other little bug for your next release. Using check_openmanage > 3.5.5-beta6 on a server loaded with OMSA 5.1.0 (a different box this > time), in the snmp_detect_blade function it returned: "INTERNAL ERROR: > Use of uninitialized value in string eq at > ./check_openmanage-3.5.5-beta6 line 599." > > Looking at the line and then doing my own SNMP query, that OID is > missing in OMSA 5.1.0. However, by changing line 599 to first make sure > a result has been set then the uninitialized value error is bypassed in > the if statement: > > if ( $result->{$DellBaseBoardType} && $result->{$DellBaseBoardType} eq > '3') { Thank you, the patch is applied. Note that check_openmanage is not designed to work with really old OMSA versions (5.2 and earlier). This is more of a problem when checking locally, since omreport commands are different. I generally won't add support for old OMSA if it has a noticeable speed or complexity impact, but that is not the case here. Besides, checking that the value exists is good practice anyway :) An updated version is available here: http://folk.uio.no/trondham/tmp/check_openmanage-3.5.5-beta7 If you confirm that this beta works for you, and I don't get any more bug reports in the next few days, this will eventually become 3.5.5. Cheers, -- Trond H. Amundsen Center for Information Technology Services, University of Oslo ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gael.cheron at free.fr Wed Jan 20 11:00:05 2010 From: gael.cheron at free.fr (Gael Cheron) Date: Wed, 20 Jan 2010 11:00:05 +0100 Subject: check_cluster and service hardstate In-Reply-To: References: <8480e60f1001191039s72a61fa9g897bc6b805311429@mail.gmail.com> <8480e60f1001191042p131bdbfdgc2c21c028d38eb4@mail.gmail.com> Message-ID: <8480e60f1001200200p2b0cb25et6578583bcc60358e@mail.gmail.com> It's a special business logic : distributed nagios, nagios business process, no notification and watching nagvis 24/7. Thank you for your suggestion about SERVICESTATETYPE, I was also thinking about it. thanks for your answer Marc. regards, Gael. 2010/1/19 Marc Powell > > On Jan 19, 2010, at 12:42 PM, Gael Cheron wrote: > > > Hi all, > > > > In the doc dealing with the service cluster with the check_cluster > plugin, the example uses $SERVICESTATEID:host:Service$ > > > > But I'd like to use the hardstate. As far as I know, there is no variable > giving the service hardstate. I had a look at the list > http://nagios.sourceforge.net/docs/2_0/macros.html > > > > Do somebody know if there is a way to do this ? > > Having such a macro wouldn't make much sense, IMHO. It seems you might not > understand the difference between HARD and SOFT states or I don't understand > the problem you're trying to solve? What would it's value be while the > service was in a SOFT state, presumably when you'd be interested in using > it? OK? WARNING? CRITICAL? $LASTSERVICESTATEID$ might be close to what > you're thinking about but I believe that it updates after every check, > regardless of HARD/SOFT status. > > I think you're wanting check_cluster to do something only if the things > it's watching are in a HARD state. My suggestion would be to pass > $SERVICESTATETYPE:host:Service$ for each cluster element to check_cluster > and teach check_cluster to read that and to do what you need based on your > business logic. > > Hope that helps... > > -- > Marc > > > > ------------------------------------------------------------------------------ > Throughout its 18-year history, RSA Conference consistently attracts the > world's best and brightest in the field, creating opportunities for > Conference > attendees to learn about information security's most important issues > through > interactions with peers, luminaries and emerging and established companies. > http://p.sf.net/sfu/rsaconf-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From subs at arahmantech.com Wed Jan 20 17:04:19 2010 From: subs at arahmantech.com (Athiqur Rahman) Date: Wed, 20 Jan 2010 16:04:19 +0000 Subject: display_name directive Message-ID: <4B572983.4000101@arahmantech.com> Hi, according to the docs, if display_name directive is used when creating a service, then it will override the service_description directive when displaying on the web interface. This does not seem to be the case, i always get the service_descrition being displayed. Below is the service i created define service{ use generic-service host_name router service_description TCP hit on 179 display_name BGP check_command BGP } Seems pretty straight forward, don't know why it isn't working! ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Wed Jan 20 17:34:19 2010 From: patrick.morris at hp.com (Morris, Patrick) Date: Wed, 20 Jan 2010 08:34:19 -0800 Subject: display_name directive In-Reply-To: <4B572983.4000101@arahmantech.com> References: <4B572983.4000101@arahmantech.com> Message-ID: <4B57308B.5010607@hp.com> Athiqur Rahman wrote: > Hi, > > according to the docs, if display_name directive is used when creating a > service, then it will override the service_description directive when > displaying on the web interface. This does not seem to be the case, i > always get the service_descrition being displayed. Below is the service > i created > > define service{ > use generic-service > host_name router > service_description TCP hit on 179 > display_name BGP > check_command BGP > } > > Seems pretty straight forward, don't know why it isn't working! You didn't read into the docs far enough. The next sentence after the one you paraphrased is: "Note: The current CGIs do not use this option, although future versions of the web interface will." ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Wed Jan 20 17:35:50 2010 From: marc at ena.com (Marc Powell) Date: Wed, 20 Jan 2010 10:35:50 -0600 Subject: display_name directive In-Reply-To: <4B572983.4000101@arahmantech.com> References: <4B572983.4000101@arahmantech.com> Message-ID: <71DE470D-A1C8-4E8B-8F7D-F11B93F6529C@ena.com> On Jan 20, 2010, at 10:04 AM, Athiqur Rahman wrote: > Hi, > > according to the docs, if display_name directive is used when creating a > service, then it will override the service_description directive when > displaying on the web interface. Please reference this in the docs. > Seems pretty straight forward, don't know why it isn't working! It's not supposed to from the documentation I'm reading... http://nagios.sourceforge.net/docs/3_0/objectdefinitions.html#service display_name: This directive is used to define an alternate name that should be displayed in the web interface for this service. If not specified, this defaults to the value you specify for the service_description directive. ***Note: The current CGIs do not use this option, although future versions of the web interface will.*** emphasis mine. A grep of the source for 3.2.0 confirms that none of the cgi's reference that variable. -- Marc ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Ken.McKinlay at curtisswright.com Wed Jan 20 17:16:58 2010 From: Ken.McKinlay at curtisswright.com (McKinlay, Ken) Date: Wed, 20 Jan 2010 11:16:58 -0500 Subject: check_openmanage 3.5.5-beta6 snmp_detect_blade bug In-Reply-To: <15tfx61f79s.fsf@tux.uio.no> References: <15tk4vef1xi.fsf@tux.uio.no> <15tfx61f79s.fsf@tux.uio.no> Message-ID: Works fine. Thanks once again. Ken > -----Original Message----- > From: Trond Hasle Amundsen [mailto:t.h.amundsen at usit.uio.no] > Sent: Wednesday, January 20, 2010 4:36 AM > To: McKinlay, Ken > Cc: nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] check_openmanage 3.5.5-beta6 > snmp_detect_blade bug > > "McKinlay, Ken" writes: > > > Trond, > > > > Other little bug for your next release. Using check_openmanage > > 3.5.5-beta6 on a server loaded with OMSA 5.1.0 (a different box this > > time), in the snmp_detect_blade function it returned: "INTERNAL > ERROR: > > Use of uninitialized value in string eq at > > ./check_openmanage-3.5.5-beta6 line 599." > > > > Looking at the line and then doing my own SNMP query, that OID is > > missing in OMSA 5.1.0. However, by changing line 599 to first make > sure > > a result has been set then the uninitialized value error is bypassed > in > > the if statement: > > > > if ( $result->{$DellBaseBoardType} && $result->{$DellBaseBoardType} > eq > > '3') { > > Thank you, the patch is applied. Note that check_openmanage is not > designed to work with really old OMSA versions (5.2 and earlier). This > is more of a problem when checking locally, since omreport commands are > different. I generally won't add support for old OMSA if it has a > noticeable speed or complexity impact, but that is not the case here. > Besides, checking that the value exists is good practice anyway :) > > An updated version is available here: > > http://folk.uio.no/trondham/tmp/check_openmanage-3.5.5-beta7 > > If you confirm that this beta works for you, and I don't get any more > bug reports in the next few days, this will eventually become 3.5.5. > > Cheers, > -- > Trond H. Amundsen > Center for Information Technology Services, University of Oslo _______________________________________________________________________ This e-mail and any files transmitted with it are proprietary and intended solely for the use of the individual or entity to whom they are addressed. If you have reason to believe that you have received this e-mail in error, please notify the sender and destroy this email and any attached files. Please note that any views or opinions presented in this e-mail are solely those of the author and do not necessarily represent those of the Curtiss-Wright Corporation or any of its subsidiaries. Documents attached hereto may contain technology subject to government export regulations. Recipient is solely responsible for ensuring that any re-export, transfer or disclosure of this information is in accordance with applicable government export regulations. The recipient should check this e-mail and any attachments for the presence of viruses. Curtiss-Wright Corporation and its subsidiaries accept no liability for any damage caused by any virus transmitted by this e-mail. ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From subs at arahmantech.com Wed Jan 20 17:59:31 2010 From: subs at arahmantech.com (Athiqur Rahman) Date: Wed, 20 Jan 2010 16:59:31 +0000 Subject: display_name directive In-Reply-To: <71DE470D-A1C8-4E8B-8F7D-F11B93F6529C@ena.com> References: <4B572983.4000101@arahmantech.com> <71DE470D-A1C8-4E8B-8F7D-F11B93F6529C@ena.com> Message-ID: <4B573673.7090906@arahmantech.com> Marc Powell wrote: > On Jan 20, 2010, at 10:04 AM, Athiqur Rahman wrote: > > >> Hi, >> >> according to the docs, if display_name directive is used when creating a >> service, then it will override the service_description directive when >> displaying on the web interface. >> > > Please reference this in the docs. > > >> Seems pretty straight forward, don't know why it isn't working! >> > > It's not supposed to from the documentation I'm reading... > > http://nagios.sourceforge.net/docs/3_0/objectdefinitions.html#service > > display_name: This directive is used to define an alternate name that should be displayed in the web interface for this service. If not specified, this defaults to the value you specify for the service_description directive. ***Note: The current CGIs do not use this option, although future versions of the web interface will.*** > > emphasis mine. > > A grep of the source for 3.2.0 confirms that none of the cgi's reference that variable. > > -- > Marc > > > > ------------------------------------------------------------------------------ > Throughout its 18-year history, RSA Conference consistently attracts the > world's best and brightest in the field, creating opportunities for Conference > attendees to learn about information security's most important issues through > interactions with peers, luminaries and emerging and established companies. > http://p.sf.net/sfu/rsaconf-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > Seems like I jumped the gun there. Sorry ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gabriel at impactteachers.com Wed Jan 20 23:49:47 2010 From: gabriel at impactteachers.com (Mr Gabriel) Date: Wed, 20 Jan 2010 22:49:47 +0000 Subject: Check NRPE working with two servers, and not a third? Message-ID: <4B57888B.4040906@impactteachers.com> On my nagios server I get C:\ Drive Space;UNKNOWN;HARD;3;CHECK_NRPE: Received 0 bytes from daemon. Check the remote server logs for error messages. On my windows box I get the error messages NRPE_NT: Client request was invalid, bailing out... Which would suggest based on my research that the problem is misconfiguration, yet the same setup works with other servers already, so i know the server is capable of communicating with a windows setup. What could cause this to be the case? ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rperezm at uci.cu Thu Jan 21 04:46:09 2010 From: rperezm at uci.cu (ReynierPM) Date: Wed, 20 Jan 2010 22:46:09 -0500 Subject: Where to change the email address for notifications Message-ID: <4B57CE01.5000209@uci.cu> Hi every: I need to change the email address that Nagios uses to send emails notifications, wich file(s) I need to look for? -- Cheers Ing. Reynier P?rez Mira ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Thu Jan 21 05:07:51 2010 From: patrick.morris at hp.com (Morris, Patrick) Date: Wed, 20 Jan 2010 20:07:51 -0800 Subject: Where to change the email address for notifications In-Reply-To: <4B57CE01.5000209@uci.cu> References: <4B57CE01.5000209@uci.cu> Message-ID: <4B57D317.8080904@hp.com> ReynierPM wrote: > Hi every: > I need to change the email address that Nagios uses to send emails > notifications, wich file(s) I need to look for? > That depends what you're using to send mail. It may be in your notification command, but more likely it's dictated your mail configuration (which will, of course, depend which MTA you use). ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Thu Jan 21 05:10:22 2010 From: patrick.morris at hp.com (Morris, Patrick) Date: Wed, 20 Jan 2010 20:10:22 -0800 Subject: Check NRPE working with two servers, and not a third? In-Reply-To: <4B57888B.4040906@impactteachers.com> References: <4B57888B.4040906@impactteachers.com> Message-ID: <4B57D3AE.1050504@hp.com> Mr Gabriel wrote: > On my nagios server I get > > C:\ Drive Space;UNKNOWN;HARD;3;CHECK_NRPE: Received 0 bytes from daemon. > Check the remote server logs for error messages. > > On my windows box I get the error messages > > NRPE_NT: Client request was invalid, bailing out... > > Which would suggest based on my research that the problem is > misconfiguration, yet the same setup works with other servers already, > so i know the server is capable of communicating with a windows setup. > > What could cause this to be the case? You're probably right: it's most likely a configuration issue. Beyond that, the total lack of any configuration information in your question makes it unlikely someone could tell you what that misconfiguration might be. ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mlitwin at stubhub.com Thu Jan 21 07:04:22 2010 From: mlitwin at stubhub.com (Litwin, Matthew) Date: Wed, 20 Jan 2010 23:04:22 -0700 Subject: Check NRPE working with two servers, and not a third? In-Reply-To: <4B57D3AE.1050504@hp.com> References: <4B57888B.4040906@impactteachers.com> <4B57D3AE.1050504@hp.com> Message-ID: <41D61362-823C-439A-953B-5AB9E7322621@stubhub.com> That is the log entry in nsclient.log file? Try diffing the config file with that from a working config file. You might have a typo in there. Did you try running the check_nrpe plugin on the nagios server against that server to see if it responds with a valid response to see if this is a problem with NRPE on your client or something just with just the check disk command on nagios or the check disk DLL or config on the client. On Jan 20, 2010, at 8:10 PM, Morris, Patrick wrote: > Mr Gabriel wrote: >> On my nagios server I get >> >> C:\ Drive Space;UNKNOWN;HARD;3;CHECK_NRPE: Received 0 bytes from daemon. >> Check the remote server logs for error messages. >> >> On my windows box I get the error messages >> >> NRPE_NT: Client request was invalid, bailing out... >> >> Which would suggest based on my research that the problem is >> misconfiguration, yet the same setup works with other servers already, >> so i know the server is capable of communicating with a windows setup. >> >> What could cause this to be the case? > > > You're probably right: it's most likely a configuration issue. > > Beyond that, the total lack of any configuration information in your > question makes it unlikely someone could tell you what that > misconfiguration might be. > > ------------------------------------------------------------------------------ > Throughout its 18-year history, RSA Conference consistently attracts the > world's best and brightest in the field, creating opportunities for Conference > attendees to learn about information security's most important issues through > interactions with peers, luminaries and emerging and established companies. > http://p.sf.net/sfu/rsaconf-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null Thanks, Matthew Litwin mlitwin at stubhub.com 415.222.8475 ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mlitwin at stubhub.com Thu Jan 21 07:06:54 2010 From: mlitwin at stubhub.com (Litwin, Matthew) Date: Wed, 20 Jan 2010 23:06:54 -0700 Subject: Where to change the email address for notifications In-Reply-To: <4B57D317.8080904@hp.com> References: <4B57CE01.5000209@uci.cu> <4B57D317.8080904@hp.com> Message-ID: <8B775EBF-349A-447F-9586-8A7AD59017AC@stubhub.com> Under nagios.cfg you can change the nagios admin email, but really you should be using contacts.cfg for configuring notifications. If you just want one notification email for every monitor, which is what it sounds like what you want, you can define the contact_email in templates.cfg and it will be used in whatever uses the respective template. On Jan 20, 2010, at 8:07 PM, Morris, Patrick wrote: > ReynierPM wrote: >> Hi every: >> I need to change the email address that Nagios uses to send emails >> notifications, wich file(s) I need to look for? >> > > That depends what you're using to send mail. It may be in your > notification command, but more likely it's dictated your mail > configuration (which will, of course, depend which MTA you use). > > ------------------------------------------------------------------------------ > Throughout its 18-year history, RSA Conference consistently attracts the > world's best and brightest in the field, creating opportunities for Conference > attendees to learn about information security's most important issues through > interactions with peers, luminaries and emerging and established companies. > http://p.sf.net/sfu/rsaconf-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null Thanks, Matthew Litwin mlitwin at stubhub.com 415.222.8475 ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mlitwin at stubhub.com Thu Jan 21 07:18:06 2010 From: mlitwin at stubhub.com (Litwin, Matthew) Date: Wed, 20 Jan 2010 23:18:06 -0700 Subject: Multiple NRPE Processes In-Reply-To: <4FB2C132-9682-43D6-9185-20920AE5225F@ena.com> References: <7545d7d21001182337t7b5e0118q4507a77c959e3a4d@mail.gmail.com> <7545d7d21001190722m11f7632erfd25d234549f53@mail.gmail.com> <4FB2C132-9682-43D6-9185-20920AE5225F@ena.com> Message-ID: <144C9CDA-9665-486E-827B-2FC50201DE93@stubhub.com> In solaris it is critical that you run nrpe on solaris in daemon mode. You need to launch it with like so: /usr/local/nagios/bin/nrpe -c /usr/local/nagios/etc/nrpe.cfg -d Running it like this you should be able to kill it without a problem as it will maintain the pid. Make sure in nrpe.cfg that the nrpe user can write its pid where specified. If you look in syslog you should see details about how it starts. On Jan 19, 2010, at 8:41 AM, Marc Powell wrote: > > On Jan 19, 2010, at 9:22 AM, Juki wrote: > >> How long are they sticking around? What OS are you using? >> >> I'm running Solaris 10. They are more less sticking around indefinitely. > > 60 seconds seems to be the default/recommended setting on Solaris but can be as high as 10 minutes if the OS thinks it needs to be (http://docs.sun.com/app/docs/doc/817-0404/chapter4-51?a=view). I wouldn't recommend tuning this unless you know exactly what you are doing and why; strangeness may result. Perhaps you've tuned it already and this is the resulting strangeness... > > You're going to have better luck understanding this by asking about it on a Solaris support list as what you're asking about is a general TCP stack question, not specific to Nagios. > > -- > Marc > > > ------------------------------------------------------------------------------ > Throughout its 18-year history, RSA Conference consistently attracts the > world's best and brightest in the field, creating opportunities for Conference > attendees to learn about information security's most important issues through > interactions with peers, luminaries and emerging and established companies. > http://p.sf.net/sfu/rsaconf-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null Thanks, Matthew Litwin mlitwin at stubhub.com 415.222.8475 ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From liyasmacosx at gmail.com Thu Jan 21 08:42:18 2010 From: liyasmacosx at gmail.com (liyas m) Date: Thu, 21 Jan 2010 15:42:18 +0800 Subject: removing and installing nagios Message-ID: <131162e41001202342r7685b163o6ce85e8743595608@mail.gmail.com> DEar all, I have installed nagios 3.2 on my server for student project and everything worked fine until i run apt-get nagios-text. After that the version appeared on the server is Nagios 1.3 not 3.2 which is very strange. I tried to reinstalled but still the old version appear although the engine shown by nagios -c is 3.2 Nagios Core 3.2.0 Copyright (c) 2009 Nagios Core Development Team and Community Contributors Copyright (c) 1999-2009 Ethan Galstad Last Modified: 08-12-2009 License: GPL How do I remove the old version ? Thank you for your help. Regards, Liyas -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gabriel at impactteachers.com Thu Jan 21 08:46:04 2010 From: gabriel at impactteachers.com (Mr Gabriel) Date: Thu, 21 Jan 2010 07:46:04 +0000 Subject: Check NRPE working with two servers, and not a third? In-Reply-To: <41D61362-823C-439A-953B-5AB9E7322621@stubhub.com> References: <4B57888B.4040906@impactteachers.com> <4B57D3AE.1050504@hp.com> <41D61362-823C-439A-953B-5AB9E7322621@stubhub.com> Message-ID: <4B58063C.5020701@impactteachers.com> On 21/01/2010 06:04, Litwin, Matthew wrote: > That is the log entry in nsclient.log file? Try diffing the config file with that from a working config file. You might have a typo in there. > > Did you try running the check_nrpe plugin on the nagios server against that server to see if it responds with a valid response to see if this is a problem with NRPE on your client or something just with just the check disk command on nagios or the check disk DLL or config on the client. > > On Jan 20, 2010, at 8:10 PM, Morris, Patrick wrote: > > >> Mr Gabriel wrote: >> >>> On my nagios server I get >>> >>> C:\ Drive Space;UNKNOWN;HARD;3;CHECK_NRPE: Received 0 bytes from daemon. >>> Check the remote server logs for error messages. >>> >>> On my windows box I get the error messages >>> >>> NRPE_NT: Client request was invalid, bailing out... >>> >>> Which would suggest based on my research that the problem is >>> misconfiguration, yet the same setup works with other servers already, >>> so i know the server is capable of communicating with a windows setup. >>> >>> What could cause this to be the case? >>> >> >> You're probably right: it's most likely a configuration issue. >> >> Beyond that, the total lack of any configuration information in your >> question makes it unlikely someone could tell you what that >> misconfiguration might be. >> >> >> I didn't consider running a diff on the config files. I'll do that as soon as I get into the office. Also, when I check the plugin from my nagios server, check_nrpe -H new.server.com works. ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From cristoir at gmail.com Thu Jan 21 10:01:37 2010 From: cristoir at gmail.com (Christopher McAtackney) Date: Thu, 21 Jan 2010 09:01:37 +0000 Subject: Multiple NRPE Processes In-Reply-To: <144C9CDA-9665-486E-827B-2FC50201DE93@stubhub.com> References: <7545d7d21001182337t7b5e0118q4507a77c959e3a4d@mail.gmail.com> <7545d7d21001190722m11f7632erfd25d234549f53@mail.gmail.com> <4FB2C132-9682-43D6-9185-20920AE5225F@ena.com> <144C9CDA-9665-486E-827B-2FC50201DE93@stubhub.com> Message-ID: Can you elaborate Matthew, why is it critical to run NRPE in daemon mode under Solaris? I'm just curious as to any issues that may result from not doing so. Cheers, Chris ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Thu Jan 21 11:17:42 2010 From: nagios at flatto.net (Assaf Flatto) Date: Thu, 21 Jan 2010 10:17:42 +0000 Subject: removing and installing nagios In-Reply-To: <131162e41001202342r7685b163o6ce85e8743595608@mail.gmail.com> References: <131162e41001202342r7685b163o6ce85e8743595608@mail.gmail.com> Message-ID: <4B5829C6.7000904@flatto.net> liyas m wrote: > DEar all, > > I have installed nagios 3.2 on my server for student project and > everything worked fine until i run apt-get nagios-text. After that the > version appeared on the server is Nagios 1.3 not 3.2 which is very > strange. I tried to reinstalled but still the old version appear > although the engine shown by nagios -c is 3.2 > > Nagios Core 3.2.0 > Copyright (c) 2009 Nagios Core Development Team and Community > Contributors > Copyright (c) 1999-2009 Ethan Galstad > Last Modified: 08-12-2009 > License: GPL > > How do I remove the old version ? Thank you for your help. > > Regards, > Liyas if you installed with apt - you can try apt-get purge nagios-text . that will remove the package installed . As for returning to the source installed software - you might have to rerun the configure / make process to make sure the newer files are installed and used . or you can use the blunt force method and rm every file related to nagios - but i wouldn't recommend it , only as a last resort . Assaf ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Thu Jan 21 12:09:56 2010 From: nagios at flatto.net (Assaf Flatto) Date: Thu, 21 Jan 2010 11:09:56 +0000 Subject: Nagiosgraph failing to generate graphs Message-ID: <4B583604.7030308@flatto.net> Hello I am having an issue with nagiosgraph , i installed nagios 3.2 and trying to add nagiosgraph-1.3.3 for the graphs , but for some reason the graphs are not being generated . The RRD files are being created but not populated with data , meaning I can see the graph page but no graphs. from the nagiosgraph.log (set to debug 5 ) Thu Jan 21 10:52:56 2010 insert.pl warn perfdata not recognized: servicedescr: output: perfdata: from the nagios.debug : [1264071672.381929] [016.2] [pid=7107] Found a check result (#14) to handle... [1264071672.381946] [016.1] [pid=7107] Handling check result for service 'Disk Space /boot' on host 'gbc1-db-27'... [1264071672.381960] [001.0] [pid=7107] handle_async_service_check_result() [1264071672.381972] [016.0] [pid=7107] ** Handling check result for service 'Disk Space /boot' on host 'gbc1-db-27'... [1264071672.381985] [016.1] [pid=7107] HOST: gbc1-db-27, SERVICE: Disk Space /boot, CHECK TYPE: Active, OPTIONS: 0, SCHEDULED: Yes, RESCHEDULE: Yes, EXITED OK: Yes, RETURN CODE: 0, OUTPUT: DISK OK - free space: /boot 57 MB (83% inode=100%);| /boot=12MB;54;61;99; 68\n [1264071672.382016] [016.2] [pid=7107] Parsing check output... [1264071672.382030] [016.2] [pid=7107] Short Output: DISK OK - free space: /boot 57 MB (83% inode=100%): [1264071672.382043] [016.2] [pid=7107] Long Output: NULL [1264071672.382055] [016.2] [pid=7107] Perf Data: /boot=12MB;54;61;99;68 [1264071672.382067] [016.2] [pid=7107] ST: HARD CA: 1 MA: 4 CS: 0 LS: 0 LHS: 0 [1264071672.382081] [016.1] [pid=7107] Service is OK. [1264071672.382092] [016.1] [pid=7107] Service did not change state. [1264071672.382110] [016.1] [pid=7107] Rescheduling next check of service at Thu Jan 21 11:03:03 2010 [1264071672.382123] [001.0] [pid=7107] get_next_valid_time() [1264071672.382134] [001.0] [pid=7107] check_time_against_period() [1264071672.382164] [001.0] [pid=7107] schedule_service_check() [1264071672.382182] [016.0] [pid=7107] Scheduling a non-forced, active check of service 'Disk Space /boot' on host 'gbc1-db-27' @ Thu Jan 21 11:03:03 2010 [1264071672.382245] [016.2] [pid=7107] Scheduling new service check event. [1264071672.382259] [001.0] [pid=7107] reschedule_event() [1264071672.382270] [001.0] [pid=7107] add_event() [1264071672.382288] [064.1] [pid=7107] Making callbacks (type 8)... [1264071672.382318] [064.2] [pid=7107] Callback #1 (type 8) return code = 0 [1264071672.382332] [064.1] [pid=7107] Making callbacks (type 20)... [1264071672.382378] [064.2] [pid=7107] Callback #1 (type 20) return code = 0 [1264071672.382394] [064.1] [pid=7107] Making callbacks (type 13)... [1264071672.382428] [064.2] [pid=7107] Callback #1 (type 13) return code = 0 [1264071672.382442] [064.1] [pid=7107] Making callbacks (type 20)... [1264071672.382486] [064.2] [pid=7107] Callback #1 (type 20) return code = 0 [1264071672.382500] [001.0] [pid=7107] check_for_service_flapping() [1264071672.382512] [016.1] [pid=7107] Checking service 'Disk Space /boot' on host 'gbc1-db-27' for flapping... [1264071672.382525] [016.2] [pid=7107] LFT=5.00, HFT=20.00, CPC=0.00, PSC=0.00% [1264071672.382574] [016.1] [pid=7107] Service is not flapping (0.00% state change). [1264071672.382589] [001.0] [pid=7107] check_for_host_flapping() [1264071672.382601] [016.1] [pid=7107] Checking host 'gbc1-db-27' for flapping... [1264071672.382613] [016.2] [pid=7107] LFT=5.00, HFT=20.00, CPC=0.00, PSC=0.00% [1264071672.382629] [016.1] [pid=7107] Host is not flapping (0.00% state change). [1264071672.382642] [001.0] [pid=7107] run_service_performance_data_command() [1264071672.382654] [001.0] [pid=7107] update_service_performance_data_file() [1264071672.382675] [016.2] [pid=7107] Raw service performance data file output: $LASTSERVICECHECK$||$HOSTNAME$||$SERVICEDESC$||$SERVICEOUTPUT$||$SERVICEPERFDATA$ [1264071672.382689] [001.0] [pid=7107] process_macros() [1264071672.382701] [2048.1] [pid=7107] **** BEGIN MACRO PROCESSING *********** [1264071672.382712] [2048.1] [pid=7107] Processing: '$LASTSERVICECHECK$||$HOSTNAME$||$SERVICEDESC$||$SERVICEOUTPUT$||$SERVICEPERFDATA$' [1264071672.382725] [2048.2] [pid=7107] Processing part: '' [1264071672.382737] [2048.2] [pid=7107] Not currently in macro. Running output (0): '' [1264071672.382749] [2048.2] [pid=7107] Processing part: 'LASTSERVICECHECK' [1264071672.382763] [2048.2] [pid=7107] macro_x[13] (LASTSERVICECHECK) match. [1264071672.382778] [2048.2] [pid=7107] Processed 'LASTSERVICECHECK', Clean Options: 0, Free: 1 [1264071672.382790] [2048.2] [pid=7107] Processed 'LASTSERVICECHECK', Clean Options: 0, Free: 1 [1264071672.382803] [2048.2] [pid=7107] Cleaning options: global=0, local=0, effective=0 [1264071672.382816] [2048.2] [pid=7107] Uncleaned macro. Running output (10): '1264071663' [1264071672.382828] [2048.2] [pid=7107] Just finished macro. Running output (10): '1264071663' [1264071672.382840] [2048.2] [pid=7107] Processing part: '||' [1264071672.382853] [2048.2] [pid=7107] Not currently in macro. Running output (12): '1264071663||' [1264071672.382865] [2048.2] [pid=7107] Processing part: 'HOSTNAME' [1264071672.382878] [2048.2] [pid=7107] macro_x[0] (HOSTNAME) match. [1264071672.382891] [2048.2] [pid=7107] Processed 'HOSTNAME', Clean Options: 0, Free: 1 [1264071672.382903] [2048.2] [pid=7107] Processed 'HOSTNAME', Clean Options: 0, Free: 1 [1264071672.382916] [2048.2] [pid=7107] Cleaning options: global=0, local=0, effective=0 [1264071672.382929] [2048.2] [pid=7107] Uncleaned macro. Running output (22): '1264071663||gbc1-db-27' [1264071672.382941] [2048.2] [pid=7107] Just finished macro. Running output (22): '1264071663||gbc1-db-27' [1264071672.382953] [2048.2] [pid=7107] Processing part: '||' [1264071672.382966] [2048.2] [pid=7107] Not currently in macro. Running output (24): '1264071663||gbc1-db-27||' [1264071672.382978] [2048.2] [pid=7107] Processing part: 'SERVICEDESC' [1264071672.382998] [2048.2] [pid=7107] macro_x[3] (SERVICEDESC) match. [1264071672.383011] [2048.2] [pid=7107] Processed 'SERVICEDESC', Clean Options: 0, Free: 1 [1264071672.383024] [2048.2] [pid=7107] Processed 'SERVICEDESC', Clean Options: 0, Free: 1 [1264071672.383037] [2048.2] [pid=7107] Cleaning options: global=0, local=0, effective=0 [1264071672.383050] [2048.2] [pid=7107] Uncleaned macro. Running output (40): '1264071663||gbc1-db-27||Disk Space /boot' [1264071672.383062] [2048.2] [pid=7107] Just finished macro. Running output (40): '1264071663||gbc1-db-27||Disk Space /boot' [1264071672.383074] [2048.2] [pid=7107] Processing part: '||' [1264071672.383086] [2048.2] [pid=7107] Not currently in macro. Running output (42): '1264071663||gbc1-db-27||Disk Space /boot||' [1264071672.383099] [2048.2] [pid=7107] Processing part: 'SERVICEOUTPUT' [1264071672.383114] [2048.2] [pid=7107] macro_x[17] (SERVICEOUTPUT) match. [1264071672.383127] [2048.2] [pid=7107] New clean options: 3 [1264071672.383139] [2048.2] [pid=7107] Processed 'SERVICEOUTPUT', Clean Options: 3, Free: 1 [1264071672.383152] [2048.2] [pid=7107] Processed 'SERVICEOUTPUT', Clean Options: 3, Free: 1 [1264071672.383164] [2048.2] [pid=7107] Cleaning options: global=0, local=3, effective=3 [1264071672.383210] [2048.2] [pid=7107] Cleaned macro. Running output (93): '1264071663||gbc1-db-27||Disk Space /boot||DISK OK - free space: /boot 57 MB (83% inode=100%):' [1264071672.383223] [2048.2] [pid=7107] Just finished macro. Running output (93): '1264071663||gbc1-db-27||Disk Space /boot||DISK OK - free space: /boot 57 MB (83% inode=100%):' [1264071672.383367] [2048.2] [pid=7107] Processing part: '' [1264071672.383380] [2048.2] [pid=7107] Not currently in macro. Running output (117): '1264071663||gbc1-db-27||Disk Space /boot||DISK OK - free space: /boot 57 MB (83% inode=100%):||/boot=12MB;54;61;99;68' [1264071672.383393] [2048.1] [pid=7107] Done. Final output: '1264071663||gbc1-db-27||Disk Space /boot||DISK OK - free space: /boot 57 MB (83% inode=100%):||/boot=12MB;54;61;99;68' [1264071672.383406] [2048.1] [pid=7107] **** END MACRO PROCESSING ************* [1264071672.383417] [016.2] [pid=7107] Processed service performance data file output: 1264071663||gbc1-db-27||Disk Space /boot||DISK OK - free space: /boot 57 MB (83% inode=100%):||/boot=12MB;54;61;99;68 [1264071672.383458] [016.1] [pid=7107] Deleted check result file '/var/spool/nagios/c9l3LFV' then the content of the rrd file :( last entries ) NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN Looking for the entry in the /var/spool/nagios/perfdata.log , i cant find one . Anyone has any ideas as to what am i missing ? Thanks Assaf ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From eemerson at safecore.com Thu Jan 21 13:35:58 2010 From: eemerson at safecore.com (Eric Emerson) Date: Thu, 21 Jan 2010 07:35:58 -0500 Subject: Nagiosgraph failing to generate graphs In-Reply-To: <4B583604.7030308@flatto.net> References: <4B583604.7030308@flatto.net> Message-ID: Is that the only entry in nagiosgraph.log? On Thu, Jan 21, 2010 at 6:09 AM, Assaf Flatto wrote: > Hello > > I am having an issue with nagiosgraph , i installed nagios 3.2 and > trying to add nagiosgraph-1.3.3 > for the graphs , but for some reason the graphs are not being generated . > > The RRD files are being created but not populated with data , meaning I > can see the graph page but no graphs. > > from the nagiosgraph.log (set to debug 5 ) > Thu Jan 21 10:52:56 2010 insert.pl warn perfdata not recognized: > servicedescr: > output: > perfdata: > > from the nagios.debug : > > [1264071672.381929] [016.2] [pid=7107] Found a check result (#14) to > handle... > [1264071672.381946] [016.1] [pid=7107] Handling check result for service > 'Disk Space /boot' on host 'gbc1-db-27'... > [1264071672.381960] [001.0] [pid=7107] handle_async_service_check_result() > [1264071672.381972] [016.0] [pid=7107] ** Handling check result for > service 'Disk Space /boot' on host 'gbc1-db-27'... > [1264071672.381985] [016.1] [pid=7107] HOST: gbc1-db-27, SERVICE: Disk > Space /boot, CHECK TYPE: Active, OPTIONS: 0, SCHEDULED: Yes, RESCHEDULE: > Yes, EXITED OK: Yes, RETURN CODE: 0, OUTPUT: DISK OK - free space: /boot > 57 MB (83% inode=100%);| /boot=12MB;54;61;99; > 68\n > [1264071672.382016] [016.2] [pid=7107] Parsing check output... > [1264071672.382030] [016.2] [pid=7107] Short Output: DISK OK - free > space: /boot 57 MB (83% inode=100%): > [1264071672.382043] [016.2] [pid=7107] Long Output: NULL > [1264071672.382055] [016.2] [pid=7107] Perf Data: /boot=12MB;54;61;99;68 > [1264071672.382067] [016.2] [pid=7107] ST: HARD CA: 1 MA: 4 CS: 0 > LS: 0 LHS: 0 > [1264071672.382081] [016.1] [pid=7107] Service is OK. > [1264071672.382092] [016.1] [pid=7107] Service did not change state. > [1264071672.382110] [016.1] [pid=7107] Rescheduling next check of > service at Thu Jan 21 11:03:03 2010 > [1264071672.382123] [001.0] [pid=7107] get_next_valid_time() > [1264071672.382134] [001.0] [pid=7107] check_time_against_period() > [1264071672.382164] [001.0] [pid=7107] schedule_service_check() > [1264071672.382182] [016.0] [pid=7107] Scheduling a non-forced, active > check of service 'Disk Space /boot' on host 'gbc1-db-27' @ Thu Jan 21 > 11:03:03 2010 > [1264071672.382245] [016.2] [pid=7107] Scheduling new service check event. > [1264071672.382259] [001.0] [pid=7107] reschedule_event() > [1264071672.382270] [001.0] [pid=7107] add_event() > [1264071672.382288] [064.1] [pid=7107] Making callbacks (type 8)... > [1264071672.382318] [064.2] [pid=7107] Callback #1 (type 8) return code = 0 > [1264071672.382332] [064.1] [pid=7107] Making callbacks (type 20)... > [1264071672.382378] [064.2] [pid=7107] Callback #1 (type 20) return code = > 0 > [1264071672.382394] [064.1] [pid=7107] Making callbacks (type 13)... > [1264071672.382428] [064.2] [pid=7107] Callback #1 (type 13) return code = > 0 > [1264071672.382442] [064.1] [pid=7107] Making callbacks (type 20)... > [1264071672.382486] [064.2] [pid=7107] Callback #1 (type 20) return code = > 0 > [1264071672.382500] [001.0] [pid=7107] check_for_service_flapping() > [1264071672.382512] [016.1] [pid=7107] Checking service 'Disk Space > /boot' on host 'gbc1-db-27' for flapping... > [1264071672.382525] [016.2] [pid=7107] LFT=5.00, HFT=20.00, CPC=0.00, > PSC=0.00% > [1264071672.382574] [016.1] [pid=7107] Service is not flapping (0.00% > state change). > [1264071672.382589] [001.0] [pid=7107] check_for_host_flapping() > [1264071672.382601] [016.1] [pid=7107] Checking host 'gbc1-db-27' for > flapping... > [1264071672.382613] [016.2] [pid=7107] LFT=5.00, HFT=20.00, CPC=0.00, > PSC=0.00% > [1264071672.382629] [016.1] [pid=7107] Host is not flapping (0.00% state > change). > [1264071672.382642] [001.0] [pid=7107] > run_service_performance_data_command() > [1264071672.382654] [001.0] [pid=7107] > update_service_performance_data_file() > [1264071672.382675] [016.2] [pid=7107] Raw service performance data file > output: > > $LASTSERVICECHECK$||$HOSTNAME$||$SERVICEDESC$||$SERVICEOUTPUT$||$SERVICEPERFDATA$ > [1264071672.382689] [001.0] [pid=7107] process_macros() > [1264071672.382701] [2048.1] [pid=7107] **** BEGIN MACRO PROCESSING > *********** > [1264071672.382712] [2048.1] [pid=7107] Processing: > > '$LASTSERVICECHECK$||$HOSTNAME$||$SERVICEDESC$||$SERVICEOUTPUT$||$SERVICEPERFDATA$' > [1264071672.382725] [2048.2] [pid=7107] Processing part: '' > [1264071672.382737] [2048.2] [pid=7107] Not currently in macro. > Running output (0): '' > [1264071672.382749] [2048.2] [pid=7107] Processing part: > 'LASTSERVICECHECK' > [1264071672.382763] [2048.2] [pid=7107] macro_x[13] (LASTSERVICECHECK) > match. > [1264071672.382778] [2048.2] [pid=7107] Processed 'LASTSERVICECHECK', > Clean Options: 0, Free: 1 > [1264071672.382790] [2048.2] [pid=7107] Processed 'LASTSERVICECHECK', > Clean Options: 0, Free: 1 > [1264071672.382803] [2048.2] [pid=7107] Cleaning options: global=0, > local=0, effective=0 > [1264071672.382816] [2048.2] [pid=7107] Uncleaned macro. Running > output (10): '1264071663' > [1264071672.382828] [2048.2] [pid=7107] Just finished macro. Running > output (10): '1264071663' > [1264071672.382840] [2048.2] [pid=7107] Processing part: '||' > [1264071672.382853] [2048.2] [pid=7107] Not currently in macro. > Running output (12): '1264071663||' > [1264071672.382865] [2048.2] [pid=7107] Processing part: 'HOSTNAME' > [1264071672.382878] [2048.2] [pid=7107] macro_x[0] (HOSTNAME) match. > [1264071672.382891] [2048.2] [pid=7107] Processed 'HOSTNAME', Clean > Options: 0, Free: 1 > [1264071672.382903] [2048.2] [pid=7107] Processed 'HOSTNAME', Clean > Options: 0, Free: 1 > [1264071672.382916] [2048.2] [pid=7107] Cleaning options: global=0, > local=0, effective=0 > [1264071672.382929] [2048.2] [pid=7107] Uncleaned macro. Running > output (22): '1264071663||gbc1-db-27' > [1264071672.382941] [2048.2] [pid=7107] Just finished macro. Running > output (22): '1264071663||gbc1-db-27' > [1264071672.382953] [2048.2] [pid=7107] Processing part: '||' > [1264071672.382966] [2048.2] [pid=7107] Not currently in macro. > Running output (24): '1264071663||gbc1-db-27||' > [1264071672.382978] [2048.2] [pid=7107] Processing part: 'SERVICEDESC' > [1264071672.382998] [2048.2] [pid=7107] macro_x[3] (SERVICEDESC) match. > [1264071672.383011] [2048.2] [pid=7107] Processed 'SERVICEDESC', Clean > Options: 0, Free: 1 > [1264071672.383024] [2048.2] [pid=7107] Processed 'SERVICEDESC', Clean > Options: 0, Free: 1 > [1264071672.383037] [2048.2] [pid=7107] Cleaning options: global=0, > local=0, effective=0 > [1264071672.383050] [2048.2] [pid=7107] Uncleaned macro. Running > output (40): '1264071663||gbc1-db-27||Disk Space /boot' > [1264071672.383062] [2048.2] [pid=7107] Just finished macro. Running > output (40): '1264071663||gbc1-db-27||Disk Space /boot' > [1264071672.383074] [2048.2] [pid=7107] Processing part: '||' > [1264071672.383086] [2048.2] [pid=7107] Not currently in macro. > Running output (42): '1264071663||gbc1-db-27||Disk Space /boot||' > [1264071672.383099] [2048.2] [pid=7107] Processing part: 'SERVICEOUTPUT' > [1264071672.383114] [2048.2] [pid=7107] macro_x[17] (SERVICEOUTPUT) > match. > [1264071672.383127] [2048.2] [pid=7107] New clean options: 3 > [1264071672.383139] [2048.2] [pid=7107] Processed 'SERVICEOUTPUT', > Clean Options: 3, Free: 1 > [1264071672.383152] [2048.2] [pid=7107] Processed 'SERVICEOUTPUT', > Clean Options: 3, Free: 1 > [1264071672.383164] [2048.2] [pid=7107] Cleaning options: global=0, > local=3, effective=3 > [1264071672.383210] [2048.2] [pid=7107] Cleaned macro. Running output > (93): '1264071663||gbc1-db-27||Disk Space /boot||DISK OK - free space: > /boot 57 MB (83% inode=100%):' > [1264071672.383223] [2048.2] [pid=7107] Just finished macro. Running > output (93): '1264071663||gbc1-db-27||Disk Space /boot||DISK OK - free > space: /boot 57 MB (83% inode=100%):' > [1264071672.383367] [2048.2] [pid=7107] Processing part: '' > [1264071672.383380] [2048.2] [pid=7107] Not currently in macro. > Running output (117): '1264071663||gbc1-db-27||Disk Space /boot||DISK OK > - free space: /boot 57 MB (83% inode=100%):||/boot=12MB;54;61;99;68' > [1264071672.383393] [2048.1] [pid=7107] Done. Final output: > '1264071663||gbc1-db-27||Disk Space /boot||DISK OK - free space: /boot > 57 MB (83% inode=100%):||/boot=12MB;54;61;99;68' > [1264071672.383406] [2048.1] [pid=7107] **** END MACRO PROCESSING > ************* > [1264071672.383417] [016.2] [pid=7107] Processed service performance > data file output: 1264071663||gbc1-db-27||Disk Space /boot||DISK OK - > free space: /boot 57 MB (83% inode=100%):||/boot=12MB;54;61;99;68 > [1264071672.383458] [016.1] [pid=7107] Deleted check result file > '/var/spool/nagios/c9l3LFV' > > then the content of the rrd file :( last entries ) > > > NaN NaN NaN NaN NaN NaN > > > NaN NaN NaN NaN NaN NaN > > > NaN NaN NaN NaN NaN NaN > > > NaN NaN NaN NaN NaN NaN > > > NaN NaN NaN NaN NaN NaN > > > NaN NaN NaN NaN NaN NaN > > > NaN NaN NaN NaN NaN NaN > > > NaN NaN NaN NaN NaN NaN > > > NaN NaN NaN NaN NaN NaN > > > Looking for the entry in the /var/spool/nagios/perfdata.log , i cant > find one . > > > Anyone has any ideas as to what am i missing ? > > Thanks > > Assaf > > > ------------------------------------------------------------------------------ > Throughout its 18-year history, RSA Conference consistently attracts the > world's best and brightest in the field, creating opportunities for > Conference > attendees to learn about information security's most important issues > through > interactions with peers, luminaries and emerging and established companies. > http://p.sf.net/sfu/rsaconf-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Thu Jan 21 14:02:37 2010 From: marc at ena.com (Marc Powell) Date: Thu, 21 Jan 2010 07:02:37 -0600 Subject: Multiple NRPE Processes In-Reply-To: <144C9CDA-9665-486E-827B-2FC50201DE93@stubhub.com> References: <7545d7d21001182337t7b5e0118q4507a77c959e3a4d@mail.gmail.com> <7545d7d21001190722m11f7632erfd25d234549f53@mail.gmail.com> <4FB2C132-9682-43D6-9185-20920AE5225F@ena.com> <144C9CDA-9665-486E-827B-2FC50201DE93@stubhub.com> Message-ID: <4CB8AEC2-AE58-48BA-9A11-CC919622DD22@ena.com> On Jan 21, 2010, at 12:18 AM, Litwin, Matthew wrote: > In solaris it is critical that you run nrpe on solaris in daemon mode. That's a strong statement. Why? I don't recall hearing of problems running it out of inetd and doing so seems to be fairly common. > Running it like this you should be able to kill it without a problem as it will maintain the pid. Make sure in nrpe.cfg that the nrpe user can write its pid where specified. If you look in syslog you should see details about how it starts. The OP's apparent 'problem' is not that he wants to kill child processes hanging out in the process table but rather socket connections that he sees using netstat that the OS is keeping in a TIME_WAIT state. These would be normal except for 'hanging around indefinitely'. Pending further developments from the OP, this would appear to be an OS TCP stack issue since NRPE is not involved in removing those. -- Marc ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Thu Jan 21 14:27:51 2010 From: nagios at flatto.net (Assaf Flatto) Date: Thu, 21 Jan 2010 13:27:51 +0000 Subject: Nagiosgraph failing to generate graphs In-Reply-To: References: <4B583604.7030308@flatto.net> Message-ID: <4B585657.4000901@flatto.net> Eric Emerson wrote: > Is that the only entry in nagiosgraph.log? > It was when i first posted , hoiwever now there is a new line in the log : Thu Jan 21 11:08:39 2010 show.cgi critical define '/boot' in ngshared.pm (in %Ctrans) and report it, please Assaf ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From a31modela at hotmail.com Thu Jan 21 14:40:29 2010 From: a31modela at hotmail.com (steve f) Date: Thu, 21 Jan 2010 08:40:29 -0500 Subject: Nagios Plugins & glibc Message-ID: I have been building a sandbox to set up Nagios 3.2.0 and so far everything has been fine. I am using the nagios-plugins-1.4.13 with Nagios 3. A large number of the clients we need to monitor ( about 1200 ) are running SuSE SLES-8 (i386) VERSION = 8.1 I was trying to set up one of these servers to run a check_disk via check_by_ssh and was having a problem. I tried to run the check_disk locally on one of these test servers and got the message that the plugin needed glibc 2.4 ./check_disk: /lib/tls/libc.so.6: version `GLIBC_2.4' not found (required by ./check_disk) We are running glibc-2.2.5-235 I copied the plugin from the plugin directory on my host server which is running glibc 2.4. Is my problem that I copied a plugin that was compiled on a server running glibc 2.4? Can I run an older set of plugings on the servers with the older glibc? Anyone know the nagios-plugin version # ? Can I continue to keep my host server running 3.2 and have clients running older versions of the plugins? Thanks, Steve _________________________________________________________________ Your E-mail and More On-the-Go. Get Windows Live Hotmail Free. http://clk.atdmt.com/GBL/go/196390709/direct/01/ -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From natxo.asenjo at gmail.com Thu Jan 21 14:42:12 2010 From: natxo.asenjo at gmail.com (Natxo Asenjo) Date: Thu, 21 Jan 2010 14:42:12 +0100 Subject: Checking 64bit host from 32bit server In-Reply-To: <4B542FA7.1070802@impactteachers.com> References: <4B542FA7.1070802@impactteachers.com> Message-ID: <90f6e8271001210542v1bfd5171wcb2f1d4880c4a8fa@mail.gmail.com> On Mon, Jan 18, 2010 at 10:53 AM, Mr Gabriel wrote: > When I run > > --# ?./check_nrpe -H 64.bit.host > > I get back -- CHECK_NRPE: Error - Could not complete SSL handshake. When > I do > > --# ./check_nrpe -H 64.bit.host -n i check 64bits machine all the time from our virtual 32 bit nagios installation and it just works. natxo ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Thu Jan 21 15:11:31 2010 From: marc at ena.com (Marc Powell) Date: Thu, 21 Jan 2010 08:11:31 -0600 Subject: Nagios Plugins & glibc In-Reply-To: References: Message-ID: <61127526-38A5-4AEC-A6EA-07B9CA69B899@ena.com> On Jan 21, 2010, at 7:40 AM, steve f wrote: > We are running glibc-2.2.5-235 > > I copied the plugin from the plugin directory on my host server which is running glibc 2.4. > > Is my problem that I copied a plugin that was compiled on a server running glibc 2.4? Yes. > Can I run an older set of plugings on the servers with the older glibc? Yes, or you can compile the newer plugins on the destination server so that they use glibc 2.2.5. > Anyone know the nagios-plugin version # ? Version number doesn't matter. Any will do if you compile them on that machine. What matters is what dependencies a pre-compiled *package* you've chosen to use has. If you're depending on finding an pre-existing package, you'll need to search the repositories available for your OS for either an older or newer version of the plugins that expects 2.2.5. > > Can I continue to keep my host server running 3.2 and have clients running older versions of the plugins? Yes, plugin version is not tied to nagios version. -- Marc ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From komodo at uvt.cz Thu Jan 21 15:39:56 2010 From: komodo at uvt.cz (komodo) Date: Thu, 21 Jan 2010 15:39:56 +0100 Subject: Checking 64bit host from 32bit server In-Reply-To: <90f6e8271001210542v1bfd5171wcb2f1d4880c4a8fa@mail.gmail.com> References: <4B542FA7.1070802@impactteachers.com> <90f6e8271001210542v1bfd5171wcb2f1d4880c4a8fa@mail.gmail.com> Message-ID: <201001211539.56318.komodo@uvt.cz> Do you have same version of nrpe on both sides ? I had this problem too, and i need to upgrade nrpe on one machine. Regards Martin On Thursday 21 January 2010 14:42:12 Natxo Asenjo wrote: > On Mon, Jan 18, 2010 at 10:53 AM, Mr Gabriel wrote: > > When I run > > > > --# ./check_nrpe -H 64.bit.host > > > > I get back -- CHECK_NRPE: Error - Could not complete SSL handshake. When > > I do > > > > --# ./check_nrpe -H 64.bit.host -n > > i check 64bits machine all the time from our virtual 32 bit nagios > installation and it just works. > > natxo > > --------------------------------------------------------------------------- > --- Throughout its 18-year history, RSA Conference consistently attracts > the world's best and brightest in the field, creating opportunities for > Conference attendees to learn about information security's most important > issues through interactions with peers, luminaries and emerging and > established companies. http://p.sf.net/sfu/rsaconf-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when > ::: reporting any issue. Messages without supporting info will risk being > ::: sent to /dev/null > ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From juki.emma at gmail.com Fri Jan 22 06:00:05 2010 From: juki.emma at gmail.com (Juki) Date: Fri, 22 Jan 2010 08:00:05 +0300 Subject: Multiple NRPE Processes In-Reply-To: <4CB8AEC2-AE58-48BA-9A11-CC919622DD22@ena.com> References: <7545d7d21001182337t7b5e0118q4507a77c959e3a4d@mail.gmail.com> <7545d7d21001190722m11f7632erfd25d234549f53@mail.gmail.com> <4FB2C132-9682-43D6-9185-20920AE5225F@ena.com> <144C9CDA-9665-486E-827B-2FC50201DE93@stubhub.com> <4CB8AEC2-AE58-48BA-9A11-CC919622DD22@ena.com> Message-ID: <7545d7d21001212100u75e3e800kabeec746ac542bc5@mail.gmail.com> Hi all, I have noticed that when I turn off monitoring on the client machine, all the other socket connections disappear and I'm only left with; nagios at pms # netstat -a | grep nrpe *.nrpe *.* 0 0 49152 0 LISTEN As soon as I enable monitoring for the client, the socket connections that the OS is keeping in a TIME_WAIT state return. But like Marc advised, I'll take this up on another list since NRPE is not *directly* involved. Thanks all! -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From skymen4 at hotmail.com Fri Jan 22 08:48:31 2010 From: skymen4 at hotmail.com (Renaud Vanderhagen) Date: Fri, 22 Jan 2010 08:48:31 +0100 Subject: Plugins for jvm memory Message-ID: Hello all ! I'm new in Nagios world and I want to monitor the jvm memory of tomcat with NRPE... I want nagios warn me if the memory used is over 80% or 90%. Do you know a plugins for that ? Thank you a lot ! _________________________________________________________________ Nouveau Windows 7 : Simplifiez votre quotidien. Trouvez le PC qui vous convient. http://windows.microsoft.com/shop -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Fri Jan 22 10:47:27 2010 From: nagios at flatto.net (Assaf Flatto) Date: Fri, 22 Jan 2010 09:47:27 +0000 Subject: Plugins for jvm memory In-Reply-To: References: Message-ID: <4B59742F.2070506@flatto.net> Renaud Vanderhagen wrote: > > Hello all ! > > I'm new in Nagios world and I want to monitor the jvm memory of tomcat > with NRPE... > I want nagios warn me if the memory used is over 80% or 90%. > Do you know a plugins for that ? > > Thank you a lot ! > Take a look here , maybe one of those will give you the functionality you need. http://www.monitoringexchange.org/inventory/Check-Plugins/Software/Java Assaf ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Fri Jan 22 12:47:40 2010 From: nagios at flatto.net (Assaf Flatto) Date: Fri, 22 Jan 2010 11:47:40 +0000 Subject: check_http and xml Message-ID: <4B59905C.1080904@flatto.net> Hello I am trying to verify the working of a web service which output is a xml content . I thought that using the check_http -s or -e will help me do the verifications by searching for a string in the xml however when trying the test , the result comes back as a 404 ( classified info changed to XXX ). ./check_http -H some.service.con -s "xml" -u "xml.aspx?account_code=XXXXXXX&license_code=XXXXXXXXXXXXXXX&action=lookup&postcode=XXXXXX" HTTP CRITICAL: HTTP/1.1 400 Bad Request - string not found - 163 bytes in 0.013 second response time |time=0.013409s;;;0.000000 size=163B;;;0 the output of curling (curl -D - $URL ) the page gives this : Date: Fri, 22 Jan 2010 11:35:24 GMT Server: Microsoft-IIS/6.0 X-Powered-By: ASP.NET X-AspNet-Version: 2.0.50727 NextAction: NextType: Prompt: ResultType: Cache-Control: private Content-Type: text/xml Content-Length: 356 Does anyone have ideas on how to get xml content check ? I looked at the nagiosexchange but did not find anything ( could be i was looking in the wrong place/context) . Thanks Assaf ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From natxo.asenjo at gmail.com Fri Jan 22 13:08:59 2010 From: natxo.asenjo at gmail.com (Natxo Asenjo) Date: Fri, 22 Jan 2010 13:08:59 +0100 Subject: Checking 64bit host from 32bit server In-Reply-To: <201001211539.56318.komodo@uvt.cz> References: <4B542FA7.1070802@impactteachers.com> <90f6e8271001210542v1bfd5171wcb2f1d4880c4a8fa@mail.gmail.com> <201001211539.56318.komodo@uvt.cz> Message-ID: <90f6e8271001220408t2cf06706q718287ff2efc3b57@mail.gmail.com> On Thu, Jan 21, 2010 at 3:39 PM, komodo wrote: > Do you have same version of nrpe on both sides ? > > I had this problem too, and i need to upgrade nrpe on one machine. I run routinely a somewhat old version of nrpe on debian stable servers withouth any problem at all either. I use the debian package on them. The nagios server is always up to date, its plugins also. The OP should start looking at his/her logs on the nrpe server to see what is going on. -- natxo ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mario at rimann.org Fri Jan 22 13:35:48 2010 From: mario at rimann.org (Mario Rimann) Date: Fri, 22 Jan 2010 13:35:48 +0100 Subject: =?utf-8?q?Problem_with_check=5Fby=5Fssh?= Message-ID: <751507e643dfd893c56f3afc13e24b46@roosta.rimann.org> Hi there We're experiencing an issue on a new setup where check_by_ssh should run a script on a foreign host with two parameters and handle the response. Executing the script on the remote server works - but the parameters don't get transmitted to the remote server. Afther fiddling around with the service and the command definition, I started to run the command on the shell to see what happens: If I run it locally as root, all works out: [root at monitoring ~]# /path/to/libexec/check_by_ssh -l root -H xxx.xxx.xxx.xxx -v -C "/root/bin/checkSomeThing.sh 300 500" Running the same stuff as the nagios user fails (script get's called, but doesn't get the parameters) [nagios at monitoring ~]$ /path/to/libexec/check_by_ssh -l root -H xxx.xxx.xxx.xxx -v -C "/root/bin/checkSomeThing.sh 300 500" I already checked /etc/passwd - but besides the user's names, user-id, group-id and the home directory path all looks the same - both have /bin/bash as their logon shell. Any hint? Cheers, Mario ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Fri Jan 22 14:30:11 2010 From: marc at ena.com (Marc Powell) Date: Fri, 22 Jan 2010 07:30:11 -0600 Subject: check_http and xml In-Reply-To: <4B59905C.1080904@flatto.net> References: <4B59905C.1080904@flatto.net> Message-ID: <3D0DAB62-B1DC-43DA-8BF7-933C2245F75F@ena.com> On Jan 22, 2010, at 5:47 AM, Assaf Flatto wrote: > Hello > > I am trying to verify the working of a web service which output is a xml > content . > > I thought that using the check_http -s or -e will help me do the > verifications by searching for a string in the xml however when trying > the test , the result comes back as a 404 ( classified info changed to > XXX ). If you add a '-v' flag to your check_http test below,or check your server logs, you'll see why you're getting the 400 back. > ./check_http -H some.service.con -s "xml" -u > "xml.aspx?account_code=XXXXXXX&license_code=XXXXXXXXXXXXXXX&action=lookup&postcode=XXXXXX" I'm quite sure this should be "/xml.aspx?account_code..." -- Marc ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mario at rimann.org Fri Jan 22 15:24:32 2010 From: mario at rimann.org (Mario Rimann) Date: Fri, 22 Jan 2010 15:24:32 +0100 Subject: =?utf-8?q?Problem_with_check=5Fby=5Fssh_SOLVED?= In-Reply-To: <751507e643dfd893c56f3afc13e24b46@roosta.rimann.org> References: <751507e643dfd893c56f3afc13e24b46@roosta.rimann.org> Message-ID: <06d6379e207a014c120180bd607296a1@roosta.rimann.org> On Fri, 22 Jan 2010 13:35:48 +0100, Mario Rimann wrote: > Running the same stuff as the nagios user fails (script get's called, but > doesn't get the parameters) > [nagios at monitoring ~]$ /path/to/libexec/check_by_ssh -l root -H > xxx.xxx.xxx.xxx -v -C "/root/bin/checkSomeThing.sh 300 500" Problem solved: The key of the nagios user that was put into the authorized_keys file on the remote server was restricted to a forced command - which obviously cut of the parameters. *damned* Cheers, Mario ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From standalone.sysadmin at gmail.com Fri Jan 22 15:27:24 2010 From: standalone.sysadmin at gmail.com (Matt Simmons) Date: Fri, 22 Jan 2010 09:27:24 -0500 Subject: Problem with check_by_ssh In-Reply-To: <751507e643dfd893c56f3afc13e24b46@roosta.rimann.org> References: <751507e643dfd893c56f3afc13e24b46@roosta.rimann.org> Message-ID: <5bcb62b61001220627y4f44e5e2i3d46d845838f58b1@mail.gmail.com> Are you able to ssh to the remote machine as the remote user? I've been bitten by not having the remote host in my nagios user's known_hosts file, and my scripts get prompted, and for some reason that stops me from obtaining statistics ;-) --Matt On Fri, Jan 22, 2010 at 7:35 AM, Mario Rimann wrote: > > Hi there > > We're experiencing an issue on a new setup where check_by_ssh should run a > script on a foreign host with two parameters and handle the response. > Executing the script on the remote server works - but the parameters don't > get transmitted to the remote server. Afther fiddling around with the > service and the command definition, I started to run the command on the > shell to see what happens: > > If I run it locally as root, all works out: > [root at monitoring ~]# /path/to/libexec/check_by_ssh -l root -H > xxx.xxx.xxx.xxx -v -C "/root/bin/checkSomeThing.sh 300 500" > > Running the same stuff as the nagios user fails (script get's called, but > doesn't get the parameters) > [nagios at monitoring ~]$ ?/path/to/libexec/check_by_ssh -l root -H > xxx.xxx.xxx.xxx -v -C "/root/bin/checkSomeThing.sh 300 500" > > I already checked /etc/passwd - but besides the user's names, user-id, > group-id and the home directory path all looks the same - both have > /bin/bash as their logon shell. > > Any hint? > > Cheers, > Mario > > ------------------------------------------------------------------------------ > Throughout its 18-year history, RSA Conference consistently attracts the > world's best and brightest in the field, creating opportunities for Conference > attendees to learn about information security's most important issues through > interactions with peers, luminaries and emerging and established companies. > http://p.sf.net/sfu/rsaconf-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- LITTLE GIRL: But which cookie will you eat FIRST? COOKIE MONSTER: Me think you have misconception of cookie-eating process. ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gmartin at gmartin.org Fri Jan 22 15:28:05 2010 From: gmartin at gmartin.org (gmartin) Date: Fri, 22 Jan 2010 09:28:05 -0500 Subject: check_http and xml In-Reply-To: <4B59905C.1080904@flatto.net> References: <4B59905C.1080904@flatto.net> Message-ID: If you look at the content of the returned data it appears your call is failing for a bad license key. Is that what you are tresting for? \\Greg On Fri, Jan 22, 2010 at 6:47 AM, Assaf Flatto wrote: > Hello > > I am trying to verify the working of a web service which output is a xml > content . > > I thought that using the check_http -s or -e will help me do the > verifications by searching for a string in the xml however when trying > the test , the result comes back as a 404 ( classified info changed to > XXX ). > > ./check_http -H some.service.con -s "xml" -u > > "xml.aspx?account_code=XXXXXXX&license_code=XXXXXXXXXXXXXXX&action=lookup&postcode=XXXXXX" > > HTTP CRITICAL: HTTP/1.1 400 Bad Request - string not found - 163 bytes > in 0.013 second response time |time=0.013409s;;;0.000000 size=163B;;;0 > > the output of curling (curl -D - $URL ) the page gives this : > > Date: Fri, 22 Jan 2010 11:35:24 GMT > Server: Microsoft-IIS/6.0 > X-Powered-By: ASP.NET > X-AspNet-Version: 2.0.50727 > NextAction: > NextType: > Prompt: > ResultType: > Cache-Control: private > Content-Type: text/xml > Content-Length: 356 > > > Duration="0.000s"> > > > > > > > > > > Does anyone have ideas on how to get xml content check ? > I looked at the nagiosexchange but did not find anything ( could be i > was looking in the wrong place/context) . > > Thanks > > Assaf > > > ------------------------------------------------------------------------------ > Throughout its 18-year history, RSA Conference consistently attracts the > world's best and brightest in the field, creating opportunities for > Conference > attendees to learn about information security's most important issues > through > interactions with peers, luminaries and emerging and established companies. > http://p.sf.net/sfu/rsaconf-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Fri Jan 22 16:05:20 2010 From: nagios at flatto.net (Assaf Flatto) Date: Fri, 22 Jan 2010 15:05:20 +0000 Subject: check_http and xml In-Reply-To: <3D0DAB62-B1DC-43DA-8BF7-933C2245F75F@ena.com> References: <4B59905C.1080904@flatto.net> <3D0DAB62-B1DC-43DA-8BF7-933C2245F75F@ena.com> Message-ID: <4B59BEB0.1040609@flatto.net> Marc Powell wrote: > On Jan 22, 2010, at 5:47 AM, Assaf Flatto wrote: > > >> Hello >> >> I am trying to verify the working of a web service which output is a xml >> content . >> >> I thought that using the check_http -s or -e will help me do the >> verifications by searching for a string in the xml however when trying >> the test , the result comes back as a 404 ( classified info changed to >> XXX ). >> > > If you add a '-v' flag to your check_http test below,or check your server logs, you'll see why you're getting the 400 back. > > >> ./check_http -H some.service.con -s "xml" -u >> "xml.aspx?account_code=XXXXXXX&license_code=XXXXXXXXXXXXXXX&action=lookup&postcode=XXXXXX" >> > > I'm quite sure this should be "/xml.aspx?account_code..." > > -- > Marc > Yes , you are write Marc , that did sort out my issue. Thanks Assaf ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From robert.wolfe at robertwolfe.org Fri Jan 22 13:38:38 2010 From: robert.wolfe at robertwolfe.org (Wolfe, Robert) Date: Fri, 22 Jan 2010 07:38:38 -0500 Subject: Plugins for jvm memory In-Reply-To: References: Message-ID: <863B3EE13473404BB5FA7516A1E31EF70EB99E9CF5@exchange-01.wolfe.local> Renaud, have you tried giving Nagios Exchange a search for this? ________________________________ From: Renaud Vanderhagen [skymen4 at hotmail.com] Sent: Friday, January 22, 2010 2:48 AM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] Plugins for jvm memory Hello all ! I'm new in Nagios world and I want to monitor the jvm memory of tomcat with NRPE... I want nagios warn me if the memory used is over 80% or 90%. Do you know a plugins for that ? Thank you a lot ! ________________________________ Windows 7 : Simplifiez votre quotidien. Trouvez le PC qui vous convient. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From tinzauro at ha-solutions.net Fri Jan 22 17:37:19 2010 From: tinzauro at ha-solutions.net (Terry L. Inzauro) Date: Fri, 22 Jan 2010 10:37:19 -0600 Subject: Plugins for jvm memory In-Reply-To: <863B3EE13473404BB5FA7516A1E31EF70EB99E9CF5@exchange-01.wolfe.local> References: <863B3EE13473404BB5FA7516A1E31EF70EB99E9CF5@exchange-01.wolfe.local> Message-ID: <4B59D43F.50606@ha-solutions.net> On 01/22/2010 06:38 AM, Wolfe, Robert wrote: > Renaud, have you tried giving Nagios Exchange a search for this? > > ------------------------------------------------------------------------ > *From:* Renaud Vanderhagen [skymen4 at hotmail.com] > *Sent:* Friday, January 22, 2010 2:48 AM > *To:* nagios-users at lists.sourceforge.net > *Subject:* [Nagios-users] Plugins for jvm memory > > > Hello all ! > > I'm new in Nagios world and I want to monitor the jvm memory of tomcat > with NRPE... > I want nagios warn me if the memory used is over 80% or 90%. > Do you know a plugins for that ? > > Thank you a lot ! > check_procs will accomplish what you desire: http://nagiosplugins.org/man/check_procs kind regards, Terry Inzauro http://ha-solutions.net ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From kurt.buff at gmail.com Fri Jan 22 20:03:49 2010 From: kurt.buff at gmail.com (Kurt Buff) Date: Fri, 22 Jan 2010 11:03:49 -0800 Subject: n00b trying to set up on freebsd Message-ID: I'm trying to install nagios on freebsd, and can't get to the point of getting lynx on the host to talk to http://192.168.8.20/nagios - I get a 403 "You don't have permission to access /nagios/ on this server." I'm beating my brains out on this one, and don't know what I'm doing wrong. I'm sure it's something really simple I'm overlooking, because just browsing http://192.168.8.20 gets a response. Unfortunately there isn't a quick install guide for freebsd, so I'm having to do some translation of the docs. Any help much appreciated. More info: I've created /usr/local/etc/nagios/htpasswd.users, as noted in the stanzas below. I've tried www:www and root:wheel and www:nagios as permissions on /usr/local/www/nagios, with the same result. I've added the following stanzas to the end of httpd.conf: ----------begin httpd.conf snippet---------- Include etc/apache22/Includes/*.conf ScriptAlias /nagios/cgi-bin "/usr/local/www/nagios/cgi-bin" Options ExecCGI AllowOverride None Order allow,deny Allow from all AuthName "Nagios Access" AuthType Basic AuthUserFile /usr/local/etc/nagios/htpasswd.users Require valid-user Alias /nagios "/usr/local/www/nagios" Options None AllowOverride None Order allow,deny Allow from all AuthName "Nagios Access" AuthType Basic AuthUserFile /usr/local/etc/nagios/htpasswd.users Require valid-user ----------end httpd.conf snippet---------- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From liyasmacosx at gmail.com Fri Jan 22 21:39:48 2010 From: liyasmacosx at gmail.com (liyas m) Date: Sat, 23 Jan 2010 04:39:48 +0800 Subject: removing and installing nagios In-Reply-To: <4B5829C6.7000904@flatto.net> References: <131162e41001202342r7685b163o6ce85e8743595608@mail.gmail.com> <4B5829C6.7000904@flatto.net> Message-ID: <131162e41001221239m4d5341d7gfa19e8971e968028@mail.gmail.com> i use apt-get remove nagios-text and apt-get remove --purge nagios-text but still does not work the old version 1.3 is still there. On Thu, Jan 21, 2010 at 6:17 PM, Assaf Flatto wrote: > liyas m wrote: > > DEar all, > > > > I have installed nagios 3.2 on my server for student project and > > everything worked fine until i run apt-get nagios-text. After that the > > version appeared on the server is Nagios 1.3 not 3.2 which is very > > strange. I tried to reinstalled but still the old version appear > > although the engine shown by nagios -c is 3.2 > > > > Nagios Core 3.2.0 > > Copyright (c) 2009 Nagios Core Development Team and Community > > Contributors > > Copyright (c) 1999-2009 Ethan Galstad > > Last Modified: 08-12-2009 > > License: GPL > > > > How do I remove the old version ? Thank you for your help. > > > > Regards, > > Liyas > if you installed with apt - you can try apt-get purge nagios-text . > that will remove the package installed . > As for returning to the source installed software - you might have to > rerun the configure / make process to make sure the newer files are > installed and used . > > or you can use the blunt force method and rm every file related to > nagios - but i wouldn't recommend it , only as a last resort . > > Assaf > > > ------------------------------------------------------------------------------ > Throughout its 18-year history, RSA Conference consistently attracts the > world's best and brightest in the field, creating opportunities for > Conference > attendees to learn about information security's most important issues > through > interactions with peers, luminaries and emerging and established companies. > http://p.sf.net/sfu/rsaconf-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From tinzauro at ha-solutions.net Fri Jan 22 22:53:13 2010 From: tinzauro at ha-solutions.net (Terry L. Inzauro) Date: Fri, 22 Jan 2010 15:53:13 -0600 Subject: removing and installing nagios In-Reply-To: <131162e41001221239m4d5341d7gfa19e8971e968028@mail.gmail.com> References: <131162e41001202342r7685b163o6ce85e8743595608@mail.gmail.com> <4B5829C6.7000904@flatto.net> <131162e41001221239m4d5341d7gfa19e8971e968028@mail.gmail.com> Message-ID: <4B5A1E49.90105@ha-solutions.net> On 01/22/2010 02:39 PM, liyas m wrote: > i use apt-get remove nagios-text and apt-get remove --purge nagios-text > but still does not work > the old version 1.3 is still there. > > > On Thu, Jan 21, 2010 at 6:17 PM, Assaf Flatto > wrote: > > liyas m wrote: > > DEar all, > > > > I have installed nagios 3.2 on my server for student project and > > everything worked fine until i run apt-get nagios-text. After > that the > > version appeared on the server is Nagios 1.3 not 3.2 which is very > > strange. I tried to reinstalled but still the old version appear > > although the engine shown by nagios -c is 3.2 > > > > Nagios Core 3.2.0 > > Copyright (c) 2009 Nagios Core Development Team and Community > > Contributors > > Copyright (c) 1999-2009 Ethan Galstad > > Last Modified: 08-12-2009 > > License: GPL > > > > How do I remove the old version ? Thank you for your help. > > > > Regards, > > Liyas > if you installed with apt - you can try apt-get purge nagios-text . > that will remove the package installed . > As for returning to the source installed software - you might have to > rerun the configure / make process to make sure the newer files are > installed and used . > > or you can use the blunt force method and rm every file related to > nagios - but i wouldn't recommend it , only as a last resort . > > Assaf > If, by helping you, are we enabling you to cheat? If the answer is yes, you had better stop reading this and figure out your issue on your own (by reading the apt docs...hint...hint) What OS/version is this? ----------------------------- If the OS is Debian Lenny (This may/may not work in Ubuntu): ----------------------------- Fist find which nagios packages are installed. Install apt-show-versions. I recommend using aptitude has it has greater package dependency handling abilities and saftey guards. [15:27:28 root at jenna:~]# aptitude install apt-show-versions Second, invoke apt-show-versions and parse the output using grep. [15:29:28 root at jenna:~]# apt-show-versions | grep nagios nagios-images/lenny uptodate 0.4 nagios-nrpe-plugin/lenny uptodate 2.12-1 nagios-plugins/lenny uptodate 1.4.12-5 nagios-plugins-basic/lenny uptodate 1.4.12-5 nagios-plugins-standard/lenny uptodate 1.4.12-5 nagios3/lenny uptodate 3.0.6-4~lenny2 nagios3-common/lenny uptodate 3.0.6-4~lenny2 nagios3-doc/lenny uptodate 3.0.6-4~lenny2 [15:29:33 root at jenna:~]# The first string is the package name to use as an argument to `aptitude remove --purge`. I recommend you start from scratch and remove ALL nagios packages to help quell confusion. If I wanted to only remove the nagios core package(s) (and purge configuration files) I would invoke aptitude like this: [15:29:33 root at jenna:~]# aptitude remove --purge nagios3 nagios3-common If I wanted to save the configuration files, I would omit the --purge switch. If, you are like me and are short tempered, you can invoke aptitude with more than one task. Such as the following. [15:29:33 root at jenna:~]# aptitude remove --purge nagios3 nagios3-common nagios2+ This command will remove (and purge) nagios3 and nagios3-common. Then it will install nagios2 thuns accomplishing a downgrade. Obviously, this command isn't what you want, but I decided to show it for informational purposes. hope this helps. Terry http://ha-solutions.net -- /* Name: Terry Inzauro Locale: Omaha Office Addr0: 222 South 15th Addr1: Suite 3B Addr2: Omaha, NE 68102 Office: 402.884.7967x103 Fax: 402.884.7970 Mobile: 402.212.1450 Email: tinzauro at ha-solutions.net Web1: http://ha-solutions.net Web2: http://storsolgroup.com GNNUPG: http://pgp.mit.edu:11371/pks/lookup?op=get&search=0xE55FA44C */ ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lambert at lambertfam.org Fri Jan 22 23:13:19 2010 From: lambert at lambertfam.org (Scott Lambert) Date: Fri, 22 Jan 2010 16:13:19 -0600 Subject: n00b trying to set up on freebsd In-Reply-To: References: Message-ID: <20100122221319.GI89235@sysmon.tcworks.net> On Fri, Jan 22, 2010 at 11:03:49AM -0800, Kurt Buff wrote: > I'm trying to install nagios on freebsd, and can't get to the point of > getting lynx on the host to talk to http://192.168.8.20/nagios - I get > a 403 "You don't have permission to access /nagios/ on this server." > > I'm beating my brains out on this one, and don't know what I'm doing > wrong. I'm sure it's something really simple I'm overlooking, because > just browsing http://192.168.8.20 gets a response. Unfortunately there > isn't a quick install guide for freebsd, so I'm having to do some > translation of the docs. > > Any help much appreciated. Rather than guessing, you might want to take a look at what Apache thinks is wrong: Access the page and then: sudo tail /var/log/httpd-error.log It should pretty much tell you what is wrong. Is lynx asking you for a username and password? Is there some reason you didn't follow exactly the instructions suggested by the pkg-message for the nagios port with regard to the Apache configuration? I don't see anything particularly wrong with your config, but it may be needlessly over complex. I believe in getting things working according how the port maintainer suggests. The port maintainers tend to do a good job of handing you a good basic configuration which works. Once that works, I make the changes needed for my site one step at a time, testing each change individually. -- Scott Lambert KC5MLE Unix SysAdmin lambert at lambertfam.org ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Eliot.Picken at wenaas.co.uk Sat Jan 23 00:06:20 2010 From: Eliot.Picken at wenaas.co.uk (Eliot.Picken at wenaas.co.uk) Date: Fri, 22 Jan 2010 23:06:20 +0000 Subject: AUTO: Eliot Picken is out of the office (returning 01/02/2010) Message-ID: I am out of the office until 01/02/2010. I am currently out of the office on annual leave. Your email has not been forwarded For urgent issues, please contact Alex Lawrie on +44 (0) 1224 894 000 Best regards Eliot Note: This is an automated response to your message "Re: [Nagios-users] n00b trying to set up on freebsd" sent on 1/22/2010 10:13:19 PM. This is the only notification you will receive while this person is away. ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mike-nagios at 5dninja.net Sat Jan 23 00:13:06 2010 From: mike-nagios at 5dninja.net (Mike Lindsey) Date: Fri, 22 Jan 2010 15:13:06 -0800 Subject: Overloaded master In-Reply-To: <751507e643dfd893c56f3afc13e24b46@roosta.rimann.org> References: <751507e643dfd893c56f3afc13e24b46@roosta.rimann.org> Message-ID: <4B5A3102.8050903@5dninja.net> What kind of options does one have, if your master nagios server is getting overloaded? I have half a dozen slaves doing polling, submitting passive check results back via send_nsca. The master does no active polling, just event processing, notifications, and web ui. Under normal circumstances, it works alright. But after a restart it can take up to half an hour before the master catches up; and if there are a lot of events, the act of sending out notifications can cause it to fall behind. I'm pre-caching my object file, I'm skipping circular dependency checks, and I've gotten a notification cycle down to 9 seconds. I tried modifying nagios to fork before notifications, but that failed pretty spectacularly; so that 9 seconds is a time where 900 or so passive check submissions block until the notifications are done. Are there any options for running a dual-master setup, or other ways to spread the load across multiple machines? Has anyone patched nsca to submit check results into the checkresults directory, instead of via the nagios.cmd pipe? What kind of improvement can one expect from that? Any other advice? -- Mike Lindsey ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From kurt.buff at gmail.com Sat Jan 23 00:53:55 2010 From: kurt.buff at gmail.com (Kurt Buff) Date: Fri, 22 Jan 2010 15:53:55 -0800 Subject: n00b trying to set up on freebsd In-Reply-To: <20100122221319.GI89235@sysmon.tcworks.net> References: <20100122221319.GI89235@sysmon.tcworks.net> Message-ID: On Fri, Jan 22, 2010 at 14:13, Scott Lambert wrote: > On Fri, Jan 22, 2010 at 11:03:49AM -0800, Kurt Buff wrote: >> I'm trying to install nagios on freebsd, and can't get to the point of >> getting lynx on the host to talk to http://192.168.8.20/nagios - I get >> a 403 "You don't have permission to access /nagios/ on this server." >> >> I'm beating my brains out on this one, and don't know what I'm doing >> wrong. I'm sure it's something really simple I'm overlooking, because >> just browsing http://192.168.8.20 gets a response. Unfortunately there >> isn't a quick install guide for freebsd, so I'm having to do some >> translation of the docs. >> >> Any help much appreciated. > > Rather than guessing, you might want to take a look at what Apache > thinks is wrong: > > Access the page and then: > > sudo tail /var/log/httpd-error.log > > It should pretty much tell you what is wrong. > > Is lynx asking you for a username and password? > > Is there some reason you didn't follow exactly the instructions > suggested by the pkg-message for the nagios port with regard to the > Apache configuration? ?I don't see anything particularly wrong with your > config, but it may be needlessly over complex. > > I believe in getting things working according how the port maintainer > suggests. The port maintainers tend to do a good job of handing you a > good basic configuration which works. > > Once that works, I make the changes needed for my site one step at a > time, testing each change individually. > > -- > Scott Lambert ? ? ? ? ? ? ? ? ? ?KC5MLE ? ? ? ? ? ? ? ? ? ? ? Unix SysAdmin > lambert at lambertfam.org Just so you know, the only things I modifed in the default httpd.conf were the following parameters: Listen is now 'Listen 192.168.8.20:80' ServerAdmin is now 'ServerAdmin ifteam at example.com ServerName is now 'ServerName loki.example.com:80' The page does indeed ask for auth when configured with the stanzas I listed. I've been examining /var/log/httpd-error.log, and found the following, with the configuration I first listed: [client 192.168.8.20] Directory index forbidden by Options directive: /usr/local/www/nagios/ Following your suggestion, I have removed those stanzas, and replaced them with the ones I found in /usr/ports/net-mgmt/nagios/work/pkg-message, and the results are slightly different. Namely, I get a 404 message saying The requested URL /nagios was not found on this server. and /var/log/httpd-error.log says [error] [client 192.168.8.20] File does not exist: /usr/local/www/apache22/data/nagios If I then add Alias /nagios /usr/local/www/nagios/ to the end of httpd.conf, I again get an auth prompt in lynx, and again I get the error in httpd-error.log: [client 192.168.8.20] Directory index forbidden by Options directive: /usr/local/www/nagios/ So, I'm led to believe that I'm making a bonehead error in httpd.conf, but I can't figure it out. Kurt ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From kurt.buff at gmail.com Sat Jan 23 01:17:26 2010 From: kurt.buff at gmail.com (Kurt Buff) Date: Fri, 22 Jan 2010 16:17:26 -0800 Subject: n00b trying to set up on freebsd In-Reply-To: <20100122221319.GI89235@sysmon.tcworks.net> References: <20100122221319.GI89235@sysmon.tcworks.net> Message-ID: On Fri, Jan 22, 2010 at 14:13, Scott Lambert wrote: > On Fri, Jan 22, 2010 at 11:03:49AM -0800, Kurt Buff wrote: >> I'm trying to install nagios on freebsd, and can't get to the point of >> getting lynx on the host to talk to http://192.168.8.20/nagios - I get >> a 403 "You don't have permission to access /nagios/ on this server." >> >> I'm beating my brains out on this one, and don't know what I'm doing >> wrong. I'm sure it's something really simple I'm overlooking, because >> just browsing http://192.168.8.20 gets a response. Unfortunately there >> isn't a quick install guide for freebsd, so I'm having to do some >> translation of the docs. >> >> Any help much appreciated. > > Rather than guessing, you might want to take a look at what Apache > thinks is wrong: > > Access the page and then: > > sudo tail /var/log/httpd-error.log > > It should pretty much tell you what is wrong. > > Is lynx asking you for a username and password? > > Is there some reason you didn't follow exactly the instructions > suggested by the pkg-message for the nagios port with regard to the > Apache configuration? ?I don't see anything particularly wrong with your > config, but it may be needlessly over complex. > > I believe in getting things working according how the port maintainer > suggests. The port maintainers tend to do a good job of handing you a > good basic configuration which works. > > Once that works, I make the changes needed for my site one step at a > time, testing each change individually. > > -- > Scott Lambert ? ? ? ? ? ? ? ? ? ?KC5MLE ? ? ? ? ? ? ? ? ? ? ? Unix SysAdmin > lambert at lambertfam.org So, after doing a little RTFM, I found that adding Allow from 192.168.8.0/24 Allow from 192.168.24.0/24 to the Directory stanza gets me the directory listing when browsing http://loki.example.com - but not the web interface I expect. So, I've made progress, but I'm not there yet. Obviously, I've got to learn the barest fundamentals of apache. Help still appreciated. Kurt ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From kurt.buff at gmail.com Sat Jan 23 01:19:58 2010 From: kurt.buff at gmail.com (Kurt Buff) Date: Fri, 22 Jan 2010 16:19:58 -0800 Subject: n00b trying to set up on freebsd In-Reply-To: References: <20100122221319.GI89235@sysmon.tcworks.net> Message-ID: On Fri, Jan 22, 2010 at 16:17, Kurt Buff wrote: > On Fri, Jan 22, 2010 at 14:13, Scott Lambert wrote: >> On Fri, Jan 22, 2010 at 11:03:49AM -0800, Kurt Buff wrote: >>> I'm trying to install nagios on freebsd, and can't get to the point of >>> getting lynx on the host to talk to http://192.168.8.20/nagios - I get >>> a 403 "You don't have permission to access /nagios/ on this server." >>> >>> I'm beating my brains out on this one, and don't know what I'm doing >>> wrong. I'm sure it's something really simple I'm overlooking, because >>> just browsing http://192.168.8.20 gets a response. Unfortunately there >>> isn't a quick install guide for freebsd, so I'm having to do some >>> translation of the docs. >>> >>> Any help much appreciated. >> >> Rather than guessing, you might want to take a look at what Apache >> thinks is wrong: >> >> Access the page and then: >> >> sudo tail /var/log/httpd-error.log >> >> It should pretty much tell you what is wrong. >> >> Is lynx asking you for a username and password? >> >> Is there some reason you didn't follow exactly the instructions >> suggested by the pkg-message for the nagios port with regard to the >> Apache configuration? ?I don't see anything particularly wrong with your >> config, but it may be needlessly over complex. >> >> I believe in getting things working according how the port maintainer >> suggests. The port maintainers tend to do a good job of handing you a >> good basic configuration which works. >> >> Once that works, I make the changes needed for my site one step at a >> time, testing each change individually. >> >> -- >> Scott Lambert ? ? ? ? ? ? ? ? ? ?KC5MLE ? ? ? ? ? ? ? ? ? ? ? Unix SysAdmin >> lambert at lambertfam.org > > So, after doing a little RTFM, I found that adding > > ? ? Allow from 192.168.8.0/24 > ? ? Allow from 192.168.24.0/24 > > to the Directory stanza gets me the directory listing when browsing > http://loki.example.com - but not the web interface I expect. So, I've > made progress, but I'm not there yet. > > Obviously, I've got to learn the barest fundamentals of apache. > > Help still appreciated. > > Kurt Apologies for replying to myself... Forgot to mention: When I get the directory listing, there is no auth taking place - I don't get asked for a username or password. Kurt ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lambert at lambertfam.org Sat Jan 23 05:03:29 2010 From: lambert at lambertfam.org (Scott Lambert) Date: Fri, 22 Jan 2010 22:03:29 -0600 Subject: n00b trying to set up on freebsd In-Reply-To: References: <20100122221319.GI89235@sysmon.tcworks.net> Message-ID: <20100123040329.GJ89235@sysmon.tcworks.net> On Fri, Jan 22, 2010 at 04:19:58PM -0800, Kurt Buff wrote: > Forgot to mention: When I get the directory listing, there is no auth > taking place - I don't get asked for a username or password. You don't get asked for a password due to the Allow of your IP address. What does the directory list? Is there an index.something file? If so, you probably don't have the index.something listed in your DirectoryIndexes statement. You may need to install PHP and configure Apache to use it. -- Scott Lambert KC5MLE Unix SysAdmin lambert at lambertfam.org ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From liyasmacosx at gmail.com Sat Jan 23 07:39:08 2010 From: liyasmacosx at gmail.com (liyas m) Date: Sat, 23 Jan 2010 14:39:08 +0800 Subject: removing and installing nagios In-Reply-To: <4B5A1D13.50409@ha-solutions.net> References: <131162e41001202342r7685b163o6ce85e8743595608@mail.gmail.com> <4B5829C6.7000904@flatto.net> <131162e41001221239m4d5341d7gfa19e8971e968028@mail.gmail.com> <4B5A1D13.50409@ha-solutions.net> Message-ID: <131162e41001222239m51d38b45g9ff3b6b2a6e3e139@mail.gmail.com> On Sat, Jan 23, 2010 at 5:48 AM, Terry L. Inzauro wrote: > On 01/22/2010 02:39 PM, liyas m wrote: > > i use apt-get remove nagios-text and apt-get remove --purge nagios-text > > but still does not work > > the old version 1.3 is still there. > > > > > > On Thu, Jan 21, 2010 at 6:17 PM, Assaf Flatto > > wrote: > > > > liyas m wrote: > > > DEar all, > > > > > > I have installed nagios 3.2 on my server for student project and > > > everything worked fine until i run apt-get nagios-text. After > > that the > > > version appeared on the server is Nagios 1.3 not 3.2 which is very > > > strange. I tried to reinstalled but still the old version appear > > > although the engine shown by nagios -c is 3.2 > > > > > > Nagios Core 3.2.0 > > > Copyright (c) 2009 Nagios Core Development Team and Community > > > Contributors > > > Copyright (c) 1999-2009 Ethan Galstad > > > Last Modified: 08-12-2009 > > > License: GPL > > > > > > How do I remove the old version ? Thank you for your help. > > > > > > Regards, > > > Liyas > > if you installed with apt - you can try apt-get purge nagios-text . > > that will remove the package installed . > > As for returning to the source installed software - you might have > to > > rerun the configure / make process to make sure the newer files are > > installed and used . > > > > or you can use the blunt force method and rm every file related to > > nagios - but i wouldn't recommend it , only as a last resort . > > > > Assaf > > > > If, by helping you, are we enabling you to cheat? If the answer is yes, you > had better stop reading this and figure out your > issue on your own (by reading the apt docs...hint...hint) > > > What OS/version is this? > > ----------------------------- > If the OS is Debian Lenny (This may/may not work in Ubuntu): > ----------------------------- > > Fist find which nagios packages are installed. Install apt-show-versions. > I recommend using aptitude has it has greater > package dependency handling abilities and saftey guards. > > > [15:27:28 root at jenna:~]# aptitude install apt-show-versions > > > > Second, invoke apt-show-versions and parse the output using grep. > > [15:29:28 root at jenna:~]# apt-show-versions | grep nagios > nagios-images/lenny uptodate 0.4 > nagios-nrpe-plugin/lenny uptodate 2.12-1 > nagios-plugins/lenny uptodate 1.4.12-5 > nagios-plugins-basic/lenny uptodate 1.4.12-5 > nagios-plugins-standard/lenny uptodate 1.4.12-5 > nagios3/lenny uptodate 3.0.6-4~lenny2 > nagios3-common/lenny uptodate 3.0.6-4~lenny2 > nagios3-doc/lenny uptodate 3.0.6-4~lenny2 > [15:29:33 root at jenna:~]# > > root at ubuntu:~# apt-show-versions | grep nagios nagios-plugins-standard/dapper uptodate 1.4.2-5ubuntu3.1 nagios-plugins-basic/dapper uptodate 1.4.2-5ubuntu3.1 nagios-common/dapper uptodate 2:1.3-cvs.20050402-8ubuntu8 nagios-plugins/dapper uptodate 1.4.2-5ubuntu3.1 > > The first string is the package name to use as an argument to `aptitude > remove --purge`. I recommend you start from scratch > and remove ALL nagios packages to help quell confusion. > > If I wanted to only remove the nagios core package(s) (and purge > configuration files) I would invoke aptitude like this: > > [15:29:33 root at jenna:~]# aptitude remove --purge nagios3 nagios3-common > Couldn't find any package whose name or description matched "nagios3" Couldn't find any package whose name or description matched "nagios3-common" . . 0 packages upgraded, 0 newly installed, 0 to remove and 324 not upgraded. Need to get 0B of archives. After unpacking 0B will be used. > > If I wanted to save the configuration files, I would omit the --purge > switch. > > If, you are like me and are short tempered, you can invoke aptitude with > more than one task. Such as the following. > > [15:29:33 root at jenna:~]# aptitude remove --purge nagios3 nagios3-common > nagios2+ > > This command will remove (and purge) nagios3 and nagios3-common. Then it > will install nagios2 thuns accomplishing a > downgrade. Obviously, this command isn't what you want, but I decided to > show it for informational purposes. > > > > hope this helps. > > > Terry > http://ha-solutions.net > > > > > > > > > > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From liyasmacosx at gmail.com Sat Jan 23 07:40:40 2010 From: liyasmacosx at gmail.com (liyas m) Date: Sat, 23 Jan 2010 14:40:40 +0800 Subject: removing and installing nagios In-Reply-To: <33C7BE4CE80D504E8907FDC31F8A083047BA35A440@iexchange.I-EVOLVE.com> References: <131162e41001202342r7685b163o6ce85e8743595608@mail.gmail.com> <4B5829C6.7000904@flatto.net> <131162e41001221239m4d5341d7gfa19e8971e968028@mail.gmail.com> <33C7BE4CE80D504E8907FDC31F8A083047BA35A440@iexchange.I-EVOLVE.com> Message-ID: <131162e41001222240w786a7eaev682ab27939310abf@mail.gmail.com> got it from here http://nagios.org/ On Sat, Jan 23, 2010 at 4:52 AM, Robert Wolfe wrote: > Did you install the Nagios software from source or from packages? > > > > *From:* liyas m [mailto:liyasmacosx at gmail.com] > *Sent:* Friday, January 22, 2010 3:40 PM > *To:* nagios-users at lists.sourceforge.net > *Subject:* Re: [Nagios-users] removing and installing nagios > > > > i use apt-get remove nagios-text and apt-get remove --purge nagios-text > but still does not work > the old version 1.3 is still there. > > On Thu, Jan 21, 2010 at 6:17 PM, Assaf Flatto wrote: > > liyas m wrote: > > DEar all, > > > > I have installed nagios 3.2 on my server for student project and > > everything worked fine until i run apt-get nagios-text. After that the > > version appeared on the server is Nagios 1.3 not 3.2 which is very > > strange. I tried to reinstalled but still the old version appear > > although the engine shown by nagios -c is 3.2 > > > > Nagios Core 3.2.0 > > Copyright (c) 2009 Nagios Core Development Team and Community > > Contributors > > Copyright (c) 1999-2009 Ethan Galstad > > Last Modified: 08-12-2009 > > License: GPL > > > > How do I remove the old version ? Thank you for your help. > > > > Regards, > > Liyas > > if you installed with apt - you can try apt-get purge nagios-text . > that will remove the package installed . > As for returning to the source installed software - you might have to > rerun the configure / make process to make sure the newer files are > installed and used . > > or you can use the blunt force method and rm every file related to > nagios - but i wouldn't recommend it , only as a last resort . > > > Assaf > > > ------------------------------------------------------------------------------ > Throughout its 18-year history, RSA Conference consistently attracts the > world's best and brightest in the field, creating opportunities for > Conference > attendees to learn about information security's most important issues > through > interactions with peers, luminaries and emerging and established companies. > http://p.sf.net/sfu/rsaconf-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From kurt.buff at gmail.com Sat Jan 23 23:38:02 2010 From: kurt.buff at gmail.com (Kurt Buff) Date: Sat, 23 Jan 2010 14:38:02 -0800 Subject: n00b trying to set up on freebsd In-Reply-To: <20100123040329.GJ89235@sysmon.tcworks.net> References: <20100122221319.GI89235@sysmon.tcworks.net> <20100123040329.GJ89235@sysmon.tcworks.net> Message-ID: On Fri, Jan 22, 2010 at 20:03, Scott Lambert wrote: > On Fri, Jan 22, 2010 at 04:19:58PM -0800, Kurt Buff wrote: >> Forgot to mention: When I get the directory listing, there is no auth >> taking place - I don't get asked for a username or password. > > You don't get asked for a password due to the Allow of your IP address. > > What does the directory list? ?Is there an index.something file? > > If so, you probably don't have the index.something listed in your > DirectoryIndexes statement. ?You may need to install PHP and configure > Apache to use it. > > -- > Scott Lambert ? ? ? ? ? ? ? ? ? ?KC5MLE ? ? ? ? ? ? ? ? ? ? ? Unix SysAdmin > lambert at lambertfam.org Thank you for the clue. That was exactly correct. This link gave me the help I needed: http://www.freebsdmadeeasy.com/tutorials/web-server/install-php-5-for-web-hosting.php Now I can proceed. One round of your choice of adult beverage is due upon our meeting. Kurt ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From justinp at norchemlab.com Sun Jan 24 19:41:50 2010 From: justinp at norchemlab.com (Justin Pryzby) Date: Sun, 24 Jan 2010 11:41:50 -0700 Subject: best expression of a timeperiod with same hours every day Message-ID: <20100124184150.GA19389@norchemlab.com> Is there a shorter way of expressing this? We have several cases where we need this, with different intervals. I was hoping there was a way of writing it in fewer than 10 lines each. define timeperiod { sunday 04:00-21:00 monday 04:00-21:00 tuesday 04:00-21:00 wednesday 04:00-21:00 thursday 04:00-21:00 friday 04:00-21:00 saturday 04:00-21:00 } Thanks, Justin ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gabriel at impactteachers.com Sun Jan 24 23:48:36 2010 From: gabriel at impactteachers.com (Mr Gabriel) Date: Sun, 24 Jan 2010 22:48:36 +0000 Subject: How to run a perl script instead of sending emails? Message-ID: <4B5CCE44.5090402@impactteachers.com> Dear All, I have a perl script that connects to my asterisk instance, and calls who ever is on shift to log in, and take care of whatever broke. I would like to initiate this script whenever there is a critical alert that has gone off more than once - how can I best achieve this? I would like these to run instead of an email when out of hours, and when a critical alert has been left unattended to for some time ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From standalone.sysadmin at gmail.com Mon Jan 25 00:50:20 2010 From: standalone.sysadmin at gmail.com (Matt Simmons) Date: Sun, 24 Jan 2010 18:50:20 -0500 Subject: How to run a perl script instead of sending emails? In-Reply-To: <4B5CCE44.5090402@impactteachers.com> References: <4B5CCE44.5090402@impactteachers.com> Message-ID: <5bcb62b61001241550q8d177f9h8d097d763cdcf052@mail.gmail.com> Would you be willing to share this script? I asked a question a while back on serverfault (http://serverfault.com/questions/21121/nagios-alerts-by-telephone) and this could be useful to a lot of people (assuming I'm not missing a general solution that's out there and just not known by me). --Matt On Sun, Jan 24, 2010 at 5:48 PM, Mr Gabriel wrote: > Dear All, > > I have a perl script that connects to my asterisk instance, and calls > who ever is on shift to log in, and take care of whatever broke. I would > like to initiate this script whenever there is a critical alert that has > gone off more than once - how can I best achieve this? > > I would like these to run instead of an email when out of hours, and > when a critical alert has been left unattended to for some time > > ------------------------------------------------------------------------------ > Throughout its 18-year history, RSA Conference consistently attracts the > world's best and brightest in the field, creating opportunities for Conference > attendees to learn about information security's most important issues through > interactions with peers, luminaries and emerging and established companies. > http://p.sf.net/sfu/rsaconf-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- LITTLE GIRL: But which cookie will you eat FIRST? COOKIE MONSTER: Me think you have misconception of cookie-eating process. ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From fizah_decoco at yahoo.com Mon Jan 25 01:29:35 2010 From: fizah_decoco at yahoo.com (fizah zali) Date: Sun, 24 Jan 2010 16:29:35 -0800 (PST) Subject: what should i do? Message-ID: <595436.60648.qm@web45008.mail.sp1.yahoo.com> hello, Initially, i have successfully installed nagios 3.2 version but?after the installation of?nagios-text, the nagios 3.2 version turn to nagios 1.3 version. What should i do to get back the nagios 3.2 version? ______________________________________________________________________________________________________________________ Get more social and better connected today. Add more friends to Yahoo! Messenger now! http://malaysia.messenger.yahoo.com/invite/ -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From stuart.browne at ausregistry.com.au Mon Jan 25 00:21:55 2010 From: stuart.browne at ausregistry.com.au (Stuart Browne) Date: Mon, 25 Jan 2010 10:21:55 +1100 Subject: How to run a perl script instead of sending emails? In-Reply-To: <4B5CCE44.5090402@impactteachers.com> References: <4B5CCE44.5090402@impactteachers.com> Message-ID: <8CEF048B9EC83748B1517DC64EA130FB3E224CE13F@off-win2003-01.ausregistrygroup.local> > -----Original Message----- > From: Mr Gabriel [mailto:gabriel at impactteachers.com] > Sent: Monday, 25 January 2010 09:49 > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] How to run a perl script instead of sending > emails? > > Dear All, > > I have a perl script that connects to my asterisk instance, and calls > who ever is on shift to log in, and take care of whatever broke. I > would > like to initiate this script whenever there is a critical alert that > has > gone off more than once - how can I best achieve this? > > I would like these to run instead of an email when out of hours, and > when a critical alert has been left unattended to for some time What a cool idea.. Fairly simple. Use service escalation to change the contact for 2nd notification onward, and set the contact's notification 'command' to be the perl routine to poke asterisk. define serviceescalation{ host_name nt-3 service_description Processor Load first_notification 2 last_notification 2 notification_interval 30 contacts +thephone escalation_options c } define contact{ contact_name jdoe alias John Doe host_notifications_enabled 1 service_notifications_enabled 1 service_notification_period 24x7 host_notification_period 24x7 service_notification_options c,u host_notification_options d,u,r service_notification_commands notify-by-phone host_notification_commands host-notify-by-phone email jdoe at localhost.localdomain pager 555-5555 at pagergateway.localhost.localdomain address1 xxxxx.xyyy at icq.com address2 555-555-5555 } define command{ command_name notify-by-phone command_line /usr/local/bin/service-phone.pl } define command{ command_name host-notify-by-phone command_line /usr/local/bin/host-phone.pl } Examples pulled from the documentation., modified slightly. Stuart ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From tinzauro at ha-solutions.net Mon Jan 25 05:07:06 2010 From: tinzauro at ha-solutions.net (Terry L. Inzauro) Date: Sun, 24 Jan 2010 22:07:06 -0600 Subject: what should i do? In-Reply-To: <595436.60648.qm@web45008.mail.sp1.yahoo.com> References: <595436.60648.qm@web45008.mail.sp1.yahoo.com> Message-ID: <4B5D18EA.7030908@ha-solutions.net> On 01/24/2010 06:29 PM, fizah zali wrote: > hello, > > Initially, i have successfully installed nagios 3.2 version but after > the installation of nagios-text, the nagios 3.2 version turn to nagios > 1.3 version. > What should i do to get back the nagios 3.2 version? > Odd, someone asked this very questio on the 22nd. You would get better responses if you actually asked good questions or gave the list participants one iota of information to work with. Eric Steven Raymond made a document to assist with the art of asking questions: http://catb.org/~esr/faqs/smart-questions.html Please read, then repost. kind regards, Terry Inzauro ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From liyasmacosx at gmail.com Mon Jan 25 08:48:03 2010 From: liyasmacosx at gmail.com (liyas m) Date: Mon, 25 Jan 2010 15:48:03 +0800 Subject: removing and installing nagios In-Reply-To: <131162e41001222240w786a7eaev682ab27939310abf@mail.gmail.com> References: <131162e41001202342r7685b163o6ce85e8743595608@mail.gmail.com> <4B5829C6.7000904@flatto.net> <131162e41001221239m4d5341d7gfa19e8971e968028@mail.gmail.com> <33C7BE4CE80D504E8907FDC31F8A083047BA35A440@iexchange.I-EVOLVE.com> <131162e41001222240w786a7eaev682ab27939310abf@mail.gmail.com> Message-ID: <131162e41001242348lec1c262l5636eacafd127885@mail.gmail.com> got it this is the command to remove your old nagios from yr ws sudo apt-get remove nagios* --purge or sudo apt-get remove nagios1* --purge ( to remove nagios version 1) TQ On Sat, Jan 23, 2010 at 2:40 PM, liyas m wrote: > got it from here > http://nagios.org/ > > > On Sat, Jan 23, 2010 at 4:52 AM, Robert Wolfe wrote: > >> Did you install the Nagios software from source or from packages? >> >> >> >> *From:* liyas m [mailto:liyasmacosx at gmail.com] >> *Sent:* Friday, January 22, 2010 3:40 PM >> *To:* nagios-users at lists.sourceforge.net >> *Subject:* Re: [Nagios-users] removing and installing nagios >> >> >> >> i use apt-get remove nagios-text and apt-get remove --purge nagios-text >> but still does not work >> the old version 1.3 is still there. >> >> On Thu, Jan 21, 2010 at 6:17 PM, Assaf Flatto wrote: >> >> liyas m wrote: >> > DEar all, >> > >> > I have installed nagios 3.2 on my server for student project and >> > everything worked fine until i run apt-get nagios-text. After that the >> > version appeared on the server is Nagios 1.3 not 3.2 which is very >> > strange. I tried to reinstalled but still the old version appear >> > although the engine shown by nagios -c is 3.2 >> > >> > Nagios Core 3.2.0 >> > Copyright (c) 2009 Nagios Core Development Team and Community >> > Contributors >> > Copyright (c) 1999-2009 Ethan Galstad >> > Last Modified: 08-12-2009 >> > License: GPL >> > >> > How do I remove the old version ? Thank you for your help. >> > >> > Regards, >> > Liyas >> >> if you installed with apt - you can try apt-get purge nagios-text . >> that will remove the package installed . >> As for returning to the source installed software - you might have to >> rerun the configure / make process to make sure the newer files are >> installed and used . >> >> or you can use the blunt force method and rm every file related to >> nagios - but i wouldn't recommend it , only as a last resort . >> >> >> Assaf >> >> >> ------------------------------------------------------------------------------ >> Throughout its 18-year history, RSA Conference consistently attracts the >> world's best and brightest in the field, creating opportunities for >> Conference >> attendees to learn about information security's most important issues >> through >> interactions with peers, luminaries and emerging and established >> companies. >> http://p.sf.net/sfu/rsaconf-dev2dev >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when >> reporting any issue. >> ::: Messages without supporting info will risk being sent to /dev/null >> >> >> > > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From martinm at op5.org Mon Jan 25 08:33:03 2010 From: martinm at op5.org (Martin Melin) Date: Mon, 25 Jan 2010 08:33:03 +0100 Subject: best expression of a timeperiod with same hours every day In-Reply-To: <20100124184150.GA19389@norchemlab.com> References: <20100124184150.GA19389@norchemlab.com> Message-ID: <5f0bcb7c1001242333x779c5857p34df03c2a6bebb3f@mail.gmail.com> There is no way to avoid having one line per weekday in the config, but that doesn't mean that you have to write the definitions out by hand :-) A quick shellscript to echo each weekday with an argument and you should be good to go. Here you go: #!/bin/sh echo "sunday $1" echo "monday $1" echo "tuesday $1" echo "wednesday $1" echo "thursday $1" echo "friday $1" echo "saturday $1" Add the above to a dir in your path, then when you need to add a timeperiod definition you can use ":r ! timeperiod.sh 04:00-21:00" in vi, or something similar in your editor of choice. On Sun, Jan 24, 2010 at 7:41 PM, Justin Pryzby wrote: > Is there a shorter way of expressing this? We have several cases > where we need this, with different intervals. I was hoping there was > a way of writing it in fewer than 10 lines each. > > define timeperiod { > sunday 04:00-21:00 > monday 04:00-21:00 > tuesday 04:00-21:00 > wednesday 04:00-21:00 > thursday 04:00-21:00 > friday 04:00-21:00 > saturday 04:00-21:00 > } > > Thanks, > Justin > > > ------------------------------------------------------------------------------ > Throughout its 18-year history, RSA Conference consistently attracts the > world's best and brightest in the field, creating opportunities for > Conference > attendees to learn about information security's most important issues > through > interactions with peers, luminaries and emerging and established companies. > http://p.sf.net/sfu/rsaconf-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Martin Melin ____________________________ op5 AB http://www.op5.com http://www.op5.org/ http://www.op5.com/op5/products/network-monitor/nagios/ -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From binbin.b.wang at hotmail.com Mon Jan 25 09:53:56 2010 From: binbin.b.wang at hotmail.com (Binbin Wang) Date: Mon, 25 Jan 2010 16:53:56 +0800 Subject: How to stop sending notification after 3 times? Message-ID: Hi all I make some troubles in nagios setting. My customer just only receive notification only 3 times, and stop receiving it even through that warning still exists. Anyone know how to set up nagios ? thanks in advance. Binbin -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From martinm at op5.org Mon Jan 25 11:17:32 2010 From: martinm at op5.org (Martin Melin) Date: Mon, 25 Jan 2010 11:17:32 +0100 Subject: How to stop sending notification after 3 times? In-Reply-To: References: Message-ID: <5f0bcb7c1001250217o3a1acf8foc48d4abf26c1f485@mail.gmail.com> This sounds like a service/host escalation causing the problem. If you don't know what that is and haven't set it up, my money is on a spam filter at the customer kicking in. Take a look in notifications.cgi ("Notifications" in the web interface menu) and check if Nagios thinks it has sent out more notifications or not. 2010/1/25 Binbin Wang > Hi all > > I make some troubles in nagios setting. My customer just only receive > notification only 3 times, and stop receiving it even through that warning > still exists. Anyone know how to set up nagios ? thanks in advance. > > Binbin > > > ------------------------------------------------------------------------------ > Throughout its 18-year history, RSA Conference consistently attracts the > world's best and brightest in the field, creating opportunities for > Conference > attendees to learn about information security's most important issues > through > interactions with peers, luminaries and emerging and established companies. > http://p.sf.net/sfu/rsaconf-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Martin Melin ____________________________ op5 AB http://www.op5.com http://www.op5.org/ http://www.op5.com/op5/products/network-monitor/nagios/ -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From martinm at op5.org Mon Jan 25 11:31:43 2010 From: martinm at op5.org (Martin Melin) Date: Mon, 25 Jan 2010 11:31:43 +0100 Subject: Overloaded master In-Reply-To: <4B5A3102.8050903@5dninja.net> References: <751507e643dfd893c56f3afc13e24b46@roosta.rimann.org> <4B5A3102.8050903@5dninja.net> Message-ID: <5f0bcb7c1001250231t4df5c87bjecdee8c70e322ae0@mail.gmail.com> What kind of notifications are you doing and how many are you sending out? Why does a notification cycle take 9 seconds to complete? On Sat, Jan 23, 2010 at 12:13 AM, Mike Lindsey wrote: > What kind of options does one have, if your master nagios server is > getting overloaded? > > I have half a dozen slaves doing polling, submitting passive check > results back via send_nsca. The master does no active polling, just > event processing, notifications, and web ui. > > Under normal circumstances, it works alright. But after a restart it > can take up to half an hour before the master catches up; and if there > are a lot of events, the act of sending out notifications can cause it > to fall behind. > > I'm pre-caching my object file, I'm skipping circular dependency checks, > and I've gotten a notification cycle down to 9 seconds. I tried > modifying nagios to fork before notifications, but that failed pretty > spectacularly; so that 9 seconds is a time where 900 or so passive check > submissions block until the notifications are done. > > Are there any options for running a dual-master setup, or other ways to > spread the load across multiple machines? > > Has anyone patched nsca to submit check results into the checkresults > directory, instead of via the nagios.cmd pipe? What kind of improvement > can one expect from that? > > Any other advice? > > -- > Mike Lindsey > > > ------------------------------------------------------------------------------ > Throughout its 18-year history, RSA Conference consistently attracts the > world's best and brightest in the field, creating opportunities for > Conference > attendees to learn about information security's most important issues > through > interactions with peers, luminaries and emerging and established companies. > http://p.sf.net/sfu/rsaconf-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Martin Melin ____________________________ op5 AB http://www.op5.com http://www.op5.org/ http://www.op5.com/op5/products/network-monitor/nagios/ -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ml at nicole-haehnel.de Mon Jan 25 12:32:31 2010 From: ml at nicole-haehnel.de (Nicole H=?UTF-8?B?w6Q=?=hnel) Date: Mon, 25 Jan 2010 12:32:31 +0100 (CET) Subject: =?utf-8?q?Problems_using_check_=5F_yum_on_sles11?= Message-ID: <20100125113231.4937018068C68@dd18808.kasserver.com> Hi, I'm trying to use check_yum 0.7.1 (Hari Sekhon) on sles 11 with yum 3.2.25. "yum check-update" works on the commandline. Check_yum says: YUM WARNING: Cannot find summary line in yum output. Please make sure you have upgraded to the latest version of this plugin. If the problem persists, please contact the author for a fix Any experiences? Thanks! Nicole ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From kyleodonnell at gmail.com Mon Jan 25 14:54:25 2010 From: kyleodonnell at gmail.com (Kyle O'Donnell) Date: Mon, 25 Jan 2010 08:54:25 -0500 Subject: How to stop sending notification after 3 times? In-Reply-To: References: Message-ID: <2274b9c31001250554o3f324a3btec24d294bd5ed4a1@mail.gmail.com> you could also pass the macro $NOTIFICATIONNUMBER$ to your notification command, which does an if on the value. [ $NOTIFICATIONNUMBER$ -le 3 ] && echo blah blah blah $HOSTNAME$ $SERVICE... |mail On 1/25/10, Binbin Wang wrote: > Hi all > > I make some troubles in nagios setting. My customer just only receive > notification only 3 times, and stop receiving it even through that warning > still exists. Anyone know how to set up nagios ? thanks in advance. > > Binbin ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From martinm at op5.org Mon Jan 25 15:24:58 2010 From: martinm at op5.org (Martin Melin) Date: Mon, 25 Jan 2010 15:24:58 +0100 Subject: Problems using check _ yum on sles11 In-Reply-To: <20100125113231.4937018068C68@dd18808.kasserver.com> References: <20100125113231.4937018068C68@dd18808.kasserver.com> Message-ID: <5f0bcb7c1001250624r6c6655dbr68144330a92f8c20@mail.gmail.com> I think "check_yum -vvv" will give some useful information on what might be going wrong. I tested 0.7.1 and it seemed to work fine for me. Although I personally use http://git.op5.org/git/?p=nagios/op5plugins.git;a=blob_plain;f=check_yum_update.pl;hb=HEAD:-) On Mon, Jan 25, 2010 at 12:32 PM, Nicole H?hnel wrote: > Hi, > > I'm trying to use check_yum 0.7.1 (Hari Sekhon) on sles 11 with yum 3.2.25. > "yum check-update" works on the commandline. > > Check_yum says: > > YUM WARNING: Cannot find summary line in yum output. Please make sure you > have upgraded to the latest version of this plugin. If the problem persists, > please contact the author for a fix > > Any experiences? > > Thanks! > > Nicole > > > > > > > ------------------------------------------------------------------------------ > Throughout its 18-year history, RSA Conference consistently attracts the > world's best and brightest in the field, creating opportunities for > Conference > attendees to learn about information security's most important issues > through > interactions with peers, luminaries and emerging and established companies. > http://p.sf.net/sfu/rsaconf-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Martin Melin ____________________________ op5 AB http://www.op5.com http://www.op5.org/ http://www.op5.com/op5/products/network-monitor/nagios/ -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From robert.wolfe at robertwolfe.org Mon Jan 25 16:47:44 2010 From: robert.wolfe at robertwolfe.org (Wolfe, Robert) Date: Mon, 25 Jan 2010 10:47:44 -0500 Subject: Checking Asterisk Call Availability Message-ID: <863B3EE13473404BB5FA7516A1E31EF71235A00725@exchange-01.wolfe.local> Hi all! Was just wondering if there was a plugin for Nagios (latest version) that works with Asterisk to make an outbound call in Asterisk ad return a value back to Nagios to see if the call was placed successfully or not? ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From martinm at op5.org Mon Jan 25 16:54:28 2010 From: martinm at op5.org (Martin Melin) Date: Mon, 25 Jan 2010 16:54:28 +0100 Subject: Checking Asterisk Call Availability In-Reply-To: <863B3EE13473404BB5FA7516A1E31EF71235A00725@exchange-01.wolfe.local> References: <863B3EE13473404BB5FA7516A1E31EF71235A00725@exchange-01.wolfe.local> Message-ID: <5f0bcb7c1001250754u7f648642m89f7f3d4476f3af8@mail.gmail.com> I don't know of a plugin that does this specifically, and couldn't find it with a quick search. However it should be simple to do if you use Asterisk call files, where you simply put a file into /var/spool/asterisk/outgoing and then watch if that file ends up in /failed or /completed (I'm not sure if the directories are named exactly that). On Mon, Jan 25, 2010 at 4:47 PM, Wolfe, Robert wrote: > Hi all! > > Was just wondering if there was a plugin for Nagios (latest version) that > works with Asterisk to make an outbound call in Asterisk ad return a value > back to Nagios to see if the call was placed successfully or not? > > > ------------------------------------------------------------------------------ > Throughout its 18-year history, RSA Conference consistently attracts the > world's best and brightest in the field, creating opportunities for > Conference > attendees to learn about information security's most important issues > through > interactions with peers, luminaries and emerging and established companies. > http://p.sf.net/sfu/rsaconf-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Martin Melin ____________________________ op5 AB http://www.op5.com http://www.op5.org/ http://www.op5.com/op5/products/network-monitor/nagios/ -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From asraikhn at gmail.com Mon Jan 25 17:21:41 2010 From: asraikhn at gmail.com (Asrai khn) Date: Mon, 25 Jan 2010 21:21:41 +0500 Subject: Defining service dependency Message-ID: <5f0f8dba1001250821s31fc4d0r84e8bbe55e2aa382@mail.gmail.com> Hi list, I want to define service dependency, i have read the relevant nagios docs but I am not getting how to define one. we are running nagios 3.2.x (pre build binary on centos 5.4) Eg, I want to define service dependency for all services that been checked with nrpe, ie if nrpe not running on remote host then nagios shouldn't send notifications for these services instead just for nrpe. Few days back our monitoring host send lot of notifications coz of nrpe wasn't running on remote hosts, in this case I want nagios only alert for 'nrpe' not all the services that being checked with nrpe. 1. So first I have define a service to check nrpe on remote hosts. # Command def for checking nrpe on remote hosts. define command{ command_name nrpe_service command_line $USER1$/check_nrpe -H $HOSTADDRESS$ } 2. define a service for each hostgroup. # service def. define service{ use check-interval-service service_description nrpe check_command nrpe_service hostgroup_name Sol-Servers } So far so good, I have several hosts (linux, solaris, hp) all of them have the following standard service definition for checking remote hosts service via check_nrpe define service{ use local-service host_name BB-2,NB-1 service_description SCC Procs check_command check_nrpe!check_scc_procs } define service{ use local-service host_name BB-2 service_description SCC Partition check_command nrpe_disk!check_disk4 } we have 5-10 nrpe check for each hosts which are scattered in different hostgroups. Now how to define a dependency so that if 'nrpe' service fail on remote hosts nagios shouldn't send alerts for services whom check via nrpe. Any configuration example will be greatly appreciated. Due to binary version of nagios installed it will not possible for me to go for any patching solution. Thanks. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Mon Jan 25 17:29:52 2010 From: nagios at flatto.net (Assaf Flatto) Date: Mon, 25 Jan 2010 16:29:52 +0000 Subject: Defining service dependency In-Reply-To: <5f0f8dba1001250821s31fc4d0r84e8bbe55e2aa382@mail.gmail.com> References: <5f0f8dba1001250821s31fc4d0r84e8bbe55e2aa382@mail.gmail.com> Message-ID: <4B5DC700.3000009@flatto.net> Asrai khn wrote: > Hi list, > > I want to define service dependency, i have read the relevant nagios > docs but I am not getting how to define one. we are running nagios > 3.2.x (pre build binary on centos 5.4) > Eg, I want to define service dependency for all services that been > checked with nrpe, ie if nrpe not running on remote host then nagios > shouldn't send notifications for these services instead just for nrpe. > > Few days back our monitoring host send lot of notifications coz of > nrpe wasn't running on remote hosts, in this case I want nagios only > alert for 'nrpe' not all the services that being checked with nrpe. > > 1. So first I have define a service to check nrpe on remote hosts. > > # Command def for checking nrpe on remote hosts. > define command{ > command_name nrpe_service > command_line $USER1$/check_nrpe -H $HOSTADDRESS$ > } > > > 2. define a service for each hostgroup. > > # service def. > define service{ > use check-interval-service > service_description nrpe > check_command nrpe_service > hostgroup_name Sol-Servers > } > > So far so good, I have several hosts (linux, solaris, hp) all of them > have the following standard service definition for checking remote > hosts service via check_nrpe > > define service{ > use local-service > host_name BB-2,NB-1 > service_description SCC Procs > check_command check_nrpe!check_scc_procs > } > define service{ > use local-service > host_name BB-2 > service_description SCC Partition > check_command nrpe_disk!check_disk4 > } > > we have 5-10 nrpe check for each hosts which are scattered in > different hostgroups. > > Now how to define a dependency so that if 'nrpe' service fail on > remote hosts nagios shouldn't send alerts for services whom check via > nrpe. > > Any configuration example will be greatly appreciated. > > Due to binary version of nagios installed it will not possible for me > to go for any patching solution. You got so far and stopped near the finish line !! define servicedependency{ host_name BB-2,NB-1 service nrpe dependent_host BB-2,NB-1 dendent_service SCC Partition execution_failure_criteria u,c,p notification_failure_criteria u,c,p } this is of the top of my head - you will need to make sure the directives are according to the nagios definitions (I'm sure i got at least one wrong) , but that is the way to move on. Have fun. ------------------------------------------------------------------------------ Throughout its 18-year history, RSA Conference consistently attracts the world's best and brightest in the field, creating opportunities for Conference attendees to learn about information security's most important issues through interactions with peers, luminaries and emerging and established companies. http://p.sf.net/sfu/rsaconf-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From kurt.buff at gmail.com Mon Jan 25 20:51:51 2010 From: kurt.buff at gmail.com (Kurt Buff) Date: Mon, 25 Jan 2010 11:51:51 -0800 Subject: n00b trying to set up on freebsd In-Reply-To: <20100123040329.GJ89235@sysmon.tcworks.net> References: <20100122221319.GI89235@sysmon.tcworks.net> <20100123040329.GJ89235@sysmon.tcworks.net> Message-ID: On Fri, Jan 22, 2010 at 20:03, Scott Lambert wrote: > On Fri, Jan 22, 2010 at 04:19:58PM -0800, Kurt Buff wrote: >> Forgot to mention: When I get the directory listing, there is no auth >> taking place - I don't get asked for a username or password. > > You don't get asked for a password due to the Allow of your IP address. > > What does the directory list? ?Is there an index.something file? > > If so, you probably don't have the index.something listed in your > DirectoryIndexes statement. ?You may need to install PHP and configure > Apache to use it. Once more into the breach... I've got the php issue solved, so the site is showing up just fine. Using lynx on localhost, I get prompted for a login to nagios and get to the home page and subsidiary pages just fine, but from FF on a remote box I get no login prompt, and the home page just comes up, and it shows as me being logged in as nagiosadmin for any of the pages I then select. While this works, it's a bit insecure. I'd like to set it up so that 1) we get prompted for login and 2) only logins from authorized subnets are allowed, though if I can only have one, I'll take a login prompt. Below is the snippet of httpd.conf for nagios. I was terribly unsuccessful trying to 'Deny from all' and then specify, for instance, 'Allow from 192.168.24.0/24'. My google-fu is weak, and other's expertise is desired. If I need to provide any further information, I'll be happy to provide it. Kurt ---------- begin snippet of httpd.conf---------- Options None AllowOverride None Order deny,allow Allow from all AuthName "Nagios Access" AuthType Basic AuthUserFile /usr/local/etc/nagios/htpasswd.users Require valid-user php_flag engine on php_admin_value open_basedir /usr/local/www/nagios/:/var/spool/nagios/ Options ExecCGI AllowOverride None Order allow,deny Allow from all AuthName "Nagios Access" AuthType Basic AuthUserFile /usr/local/etc/nagios/htpasswd.users Require valid-user ---------- end snippet of httpd.conf---------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nccomp at gmail.com Mon Jan 25 21:14:42 2010 From: nccomp at gmail.com (Andrew Davis) Date: Mon, 25 Jan 2010 15:14:42 -0500 Subject: Virtual Machines - define as parent or as host dependency... Message-ID: <4B5DFBB2.6060708@gmail.com> I'm trying to figure out the best way to do this, yet keep things as simple as possible. Say I have a server called Saturn running VMWare. I'm monitoring this server with Nagios. I also have three VM's on Saturn: Jupiter, Mars, and Pluto I want to suppress all host and service alerts on Jupiter, Mars, & Pluto if the host Saturn is down (unreachable). I do NOT want to suppress host or service alerts from Jupiter, Mars, and Pluto if the VMWare processes (services) are down on Saturn. Basically, if my VM server is completely unreachable, don't bother me about its client VM's. Am I better off doing this with a host dependency? Something like: *define hostdependency { host_name Host B dependent_host_name Host C notification_failure_criteria d,u } * Or am I better off defining Saturn as the parent of the VM's in the host config? Something like: *define host { host_name jupiter use VMs alias jupiter address 172.26.251.60 parents saturn, tpdmzsw1 }* I've successfully used the "parents" directive to define network topology, so I would think this would work. What might be the risks of defining both? -- A. Davis Email: nccomp at gmail.com "There is no limit to what a man can accomplish if he doesn't care who gets the credit." - Ronald Reagan -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lambert at lambertfam.org Mon Jan 25 22:08:48 2010 From: lambert at lambertfam.org (Scott Lambert) Date: Mon, 25 Jan 2010 15:08:48 -0600 Subject: n00b trying to set up on freebsd In-Reply-To: References: <20100122221319.GI89235@sysmon.tcworks.net> <20100123040329.GJ89235@sysmon.tcworks.net> Message-ID: <20100125210848.GK89235@sysmon.tcworks.net> On Mon, Jan 25, 2010 at 11:51:51AM -0800, Kurt Buff wrote: > On Fri, Jan 22, 2010 at 20:03, Scott Lambert wrote: > > On Fri, Jan 22, 2010 at 04:19:58PM -0800, Kurt Buff wrote: > >> Forgot to mention: When I get the directory listing, there is no auth > >> taking place - I don't get asked for a username or password. > > > > You don't get asked for a password due to the Allow of your IP address. > > > > What does the directory list? ?Is there an index.something file? > > > > If so, you probably don't have the index.something listed in your > > DirectoryIndexes statement. ?You may need to install PHP and configure > > Apache to use it. > > Once more into the breach... > > I've got the php issue solved, so the site is showing up just fine. > > Using lynx on localhost, I get prompted for a login to nagios and get > to the home page and subsidiary pages just fine, but from FF on a > remote box I get no login prompt, and the home page just comes up, and > it shows as me being logged in as nagiosadmin for any of the pages I > then select. While this works, it's a bit insecure. > > I'd like to set it up so that 1) we get prompted for login and 2) only > logins from authorized subnets are allowed, though if I can only have > one, I'll take a login prompt. Below is the snippet of httpd.conf for > nagios. I was terribly unsuccessful trying to 'Deny from all' and then > specify, for instance, 'Allow from 192.168.24.0/24'. You don't have any access allow based on IP. That makes me think that you probably haven't exitted Firefox since you logged in as nagiosadmin while testing. With HTTP Basic Authentication, there is no "Log Out" there is only, "forget you know the password" for the web browser. Firefox remembers the username and password until you exit and launch it again. You may also need to check to see if Firefox has memorized the username and password in it's master authentication database. That will persist across restarts. -- Scott Lambert KC5MLE Unix SysAdmin lambert at lambertfam.org ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Mon Jan 25 22:18:20 2010 From: marc at ena.com (Marc Powell) Date: Mon, 25 Jan 2010 15:18:20 -0600 Subject: n00b trying to set up on freebsd In-Reply-To: References: <20100122221319.GI89235@sysmon.tcworks.net> <20100123040329.GJ89235@sysmon.tcworks.net> Message-ID: On Jan 25, 2010, at 1:51 PM, Kurt Buff wrote: > Using lynx on localhost, I get prompted for a login to nagios and get > to the home page and subsidiary pages just fine, but from FF on a > remote box I get no login prompt, and the home page just comes up, and > it shows as me being logged in as nagiosadmin for any of the pages I > then select. While this works, it's a bit insecure. Interesting... I'd have to say that the httpd.conf snippet below is not what's being used or is incomplete. It seems that there is some configuration that references source IP's that would lead to this different behavior (prompted v.s. unprompted). The 'nagiosadmin' bit is something else entirely I bet, but can't entirely be explained by the httpd.conf snippets below. > I'd like to set it up so that 1) we get prompted for login and 2) only > logins from authorized subnets are allowed, though if I can only have > one, I'll take a login prompt. Both are achievable, I expect. > Below is the snippet of httpd.conf for > nagios. I was terribly unsuccessful trying to 'Deny from all' and then > specify, for instance, 'Allow from 192.168.24.0/24'. > > My google-fu is weak, and other's expertise is desired. If I need to > provide any further information, I'll be happy to provide it. > > Kurt > > > ---------- begin snippet of httpd.conf---------- > > Options None > AllowOverride None > Order deny,allow > Allow from all > AuthName "Nagios Access" > AuthType Basic > AuthUserFile /usr/local/etc/nagios/htpasswd.users > Require valid-user > php_flag engine on > php_admin_value open_basedir /usr/local/www/nagios/:/var/spool/nagios/ > Change this to -- Options None AllowOverride None Order deny,allow Deny from all Allow from 192.168.24.0/24 # or 192.168.24. AuthName "Nagios Access" AuthType Basic AuthUserFile /usr/local/etc/nagios/htpasswd.users Require valid-user Satisfy all php_flag engine on php_admin_value open_basedir /usr/local/www/nagios/:/var/spool/nagios/ > > Options ExecCGI > AllowOverride None > Order allow,deny > Allow from all > AuthName "Nagios Access" > AuthType Basic > AuthUserFile /usr/local/etc/nagios/htpasswd.users > Require valid-user > If you're using anything like a normal install, this one will never match, and it's the only one that really matters. Do you really have the directory '/usr/local/www/nagios/cgi-bin' on your server and it contains the *.cgi files? I expect not because of the nagiosadmin bit above. That implies that the CGI's are not being passed a username from your web server because there is no htaccess policy that applies to the directory that contains the CGI's. I expect that this should really be '/usr/local/www/nagios/sbin'. Adjust the following as necessary -- Options ExecCGI AllowOverride None Order deny,allow Deny from all Allow from 192.168.24.0/24 # or 192.168.24. AuthName "Nagios Access" AuthType Basic AuthUserFile /usr/local/etc/nagios/htpasswd.users Require valid-user Satisfy all If this doesn't work, what errors do you see in error_log? -- Marc ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From kurt.buff at gmail.com Mon Jan 25 22:40:20 2010 From: kurt.buff at gmail.com (Kurt Buff) Date: Mon, 25 Jan 2010 13:40:20 -0800 Subject: n00b trying to set up on freebsd In-Reply-To: <20100125210848.GK89235@sysmon.tcworks.net> References: <20100122221319.GI89235@sysmon.tcworks.net> <20100123040329.GJ89235@sysmon.tcworks.net> <20100125210848.GK89235@sysmon.tcworks.net> Message-ID: On Mon, Jan 25, 2010 at 13:08, Scott Lambert wrote: > On Mon, Jan 25, 2010 at 11:51:51AM -0800, Kurt Buff wrote: >> On Fri, Jan 22, 2010 at 20:03, Scott Lambert wrote: >> > On Fri, Jan 22, 2010 at 04:19:58PM -0800, Kurt Buff wrote: >> >> Forgot to mention: When I get the directory listing, there is no auth >> >> taking place - I don't get asked for a username or password. >> > >> > You don't get asked for a password due to the Allow of your IP address. >> > >> > What does the directory list? ?Is there an index.something file? >> > >> > If so, you probably don't have the index.something listed in your >> > DirectoryIndexes statement. ?You may need to install PHP and configure >> > Apache to use it. >> >> Once more into the breach... >> >> I've got the php issue solved, so the site is showing up just fine. >> >> Using lynx on localhost, I get prompted for a login to nagios and get >> to the home page and subsidiary pages just fine, but from FF on a >> remote box I get no login prompt, and the home page just comes up, and >> it shows as me being logged in as nagiosadmin for any of the pages I >> then select. While this works, it's a bit insecure. >> >> I'd like to set it up so that 1) we get prompted for login and 2) only >> logins from authorized subnets are allowed, though if I can only have >> one, I'll take a login prompt. Below is the snippet of httpd.conf for >> nagios. I was terribly unsuccessful trying to 'Deny from all' and then >> specify, for instance, 'Allow from 192.168.24.0/24'. > > You don't have any access allow based on IP. ?That makes me think that > you probably haven't exitted Firefox since you logged in as nagiosadmin > while testing. ?With HTTP Basic Authentication, there is no "Log Out" > there is only, "forget you know the password" for the web browser. > Firefox remembers the username and password until you exit and launch it > again. > > You may also need to check to see if Firefox has memorized the username > and password in it's master authentication database. ?That will persist > across restarts. > > -- > Scott Lambert ? ? ? ? ? ? ? ? ? ?KC5MLE ? ? ? ? ? ? ? ? ? ? ? Unix SysAdmin > lambert at lambertfam.org So, yes, restarting FF on my box does then cause the auth prompt. However, if I change httpd.conf to the following, I still get the auth prompt, but then get denied for anything but the home page - I've also verified with IE, which I don't much use any more. And, just for completeness, the nagios installation is at 192.168.8.20/24 ----------end httpd.conf snippet---------- ScriptAlias /nagios/cgi-bin /usr/local/www/nagios/cgi-bin/ Alias /nagios /usr/local/www/nagios/ AddType application/x-httpd-php .php AddType application/x-httpd-php-source .phps Options None AllowOverride None Order deny,allow # Allow from all Deny from all Allow from 127.0.0.1 Allow from 192.168.8.0/24 Allow from 192.168.24.0/24 AuthName "Nagios Access" AuthType Basic AuthUserFile /usr/local/etc/nagios/htpasswd.users Require valid-user php_flag engine on php_admin_value open_basedir /usr/local/www/nagios/:/var/spool/nagios/ Options ExecCGI AllowOverride None Order allow,deny # Allow from all Deny from all Allow from 127.0.0.1 Allow from 192.168.8.0/24 Allow from 192.168.24.0/24 AuthName "Nagios Access" AuthType Basic AuthUserFile /usr/local/etc/nagios/htpasswd.users Require valid-user ----------end httpd.conf snippet---------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From kurt.buff at gmail.com Mon Jan 25 22:47:43 2010 From: kurt.buff at gmail.com (Kurt Buff) Date: Mon, 25 Jan 2010 13:47:43 -0800 Subject: n00b trying to set up on freebsd In-Reply-To: References: <20100122221319.GI89235@sysmon.tcworks.net> <20100123040329.GJ89235@sysmon.tcworks.net> Message-ID: On Mon, Jan 25, 2010 at 13:18, Marc Powell wrote: > > On Jan 25, 2010, at 1:51 PM, Kurt Buff wrote: >> >> ?Options ExecCGI >> ?AllowOverride None >> ?Order allow,deny >> ?Allow from all >> ?AuthName "Nagios Access" >> ?AuthType Basic >> ?AuthUserFile /usr/local/etc/nagios/htpasswd.users >> ?Require valid-user >> > > If you're using anything like a normal install, this one will never match, and it's > the only one that really matters. Do you really have the directory > '/usr/local/www/nagios/cgi-bin' on your server and it contains the *.cgi files? Yes, '/usr/local/www/nagios/cgi-bin' does exist and contain .cgi files. I didn't put it there, so I expect that's where the FreeBSD port installs it there. > I expect not because of the nagiosadmin bit above. That implies that the CGI's are > not being passed a username from your web server because there is no htaccess > policy that applies to the directory that contains the CGI's. I expect that this should > really be '/usr/local/www/nagios/sbin'. Adjust the following as necessary -- > > > ?Options ExecCGI > ?AllowOverride None > ?Order deny,allow > ?Deny from all > ?Allow from 192.168.24.0/24 # or 192.168.24. > ?AuthName "Nagios Access" > ?AuthType Basic > ?AuthUserFile /usr/local/etc/nagios/htpasswd.users > ?Require valid-user > ?Satisfy all > > > If this doesn't work, what errors do you see in error_log? > > -- > Marc 'Satisfy all' - that's new to me. Back in a second... Nope - that didn't work. See also my last reply to Scott Lambert. Kurt ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rperezm at uci.cu Mon Jan 25 23:03:12 2010 From: rperezm at uci.cu (ReynierPM) Date: Mon, 25 Jan 2010 17:03:12 -0500 Subject: Command execution failed Message-ID: <4B5E1520.60706@uci.cu> Hi every: Some time ago I wrote a message to this list for get help in how to check Bacula over Nagios. As bacula nagios plugins use check_host_by_ssh command I configure shared key between client (monitoring server, where nagios reside) and server(bacula - server to check) and it works because I can login from Nagios Server to Bacula Server without enter any password. Now Nagios is reporting this error: Remote command execution failed: Permission denied, please try again. How I can fix this now? -- Saludos ReynierPM ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Mon Jan 25 23:16:26 2010 From: marc at ena.com (Marc Powell) Date: Mon, 25 Jan 2010 16:16:26 -0600 Subject: n00b trying to set up on freebsd In-Reply-To: References: <20100122221319.GI89235@sysmon.tcworks.net> <20100123040329.GJ89235@sysmon.tcworks.net> <20100125210848.GK89235@sysmon.tcworks.net> Message-ID: <1FA5D2DF-E3C9-4ED3-85A5-94E69EE42507@ena.com> On Jan 25, 2010, at 3:40 PM, Kurt Buff wrote: > On Mon, Jan 25, 2010 at 13:08, Scott Lambert wrote: >> On Mon, Jan 25, 2010 at 11:51:51AM -0800, Kurt Buff wrote: >>> On Fri, Jan 22, 2010 at 20:03, Scott Lambert wrote: >>>> On Fri, Jan 22, 2010 at 04:19:58PM -0800, Kurt Buff wrote: >>>>> Forgot to mention: When I get the directory listing, there is no auth >>>>> taking place - I don't get asked for a username or password. >>>> >>>> You don't get asked for a password due to the Allow of your IP address. >>>> >>>> What does the directory list? Is there an index.something file? >>>> >>>> If so, you probably don't have the index.something listed in your >>>> DirectoryIndexes statement. You may need to install PHP and configure >>>> Apache to use it. >>> >>> Once more into the breach... >>> >>> I've got the php issue solved, so the site is showing up just fine. >>> >>> Using lynx on localhost, I get prompted for a login to nagios and get >>> to the home page and subsidiary pages just fine, but from FF on a >>> remote box I get no login prompt, and the home page just comes up, and >>> it shows as me being logged in as nagiosadmin for any of the pages I >>> then select. While this works, it's a bit insecure. >>> >>> I'd like to set it up so that 1) we get prompted for login and 2) only >>> logins from authorized subnets are allowed, though if I can only have >>> one, I'll take a login prompt. Below is the snippet of httpd.conf for >>> nagios. I was terribly unsuccessful trying to 'Deny from all' and then >>> specify, for instance, 'Allow from 192.168.24.0/24'. >> >> You don't have any access allow based on IP. That makes me think that >> you probably haven't exitted Firefox since you logged in as nagiosadmin >> while testing. With HTTP Basic Authentication, there is no "Log Out" >> there is only, "forget you know the password" for the web browser. >> Firefox remembers the username and password until you exit and launch it >> again. >> >> You may also need to check to see if Firefox has memorized the username >> and password in it's master authentication database. That will persist >> across restarts. >> >> -- >> Scott Lambert KC5MLE Unix SysAdmin >> lambert at lambertfam.org > > So, yes, restarting FF on my box does then cause the auth prompt. > However, if I change httpd.conf to the following, I still get the auth > prompt, but then get denied for anything but the home page - What version of apache and what does error_log have to say about it? /usr/local/www/nagios/cgi-bin is the physical directory on your server that contains the *.cgi files? > > Options ExecCGI > AllowOverride None > Order allow,deny You didn't change the Order directive here. > # Allow from all > Deny from all > Allow from 127.0.0.1 > Allow from 192.168.8.0/24 > Allow from 192.168.24.0/24 > AuthName "Nagios Access" > AuthType Basic > AuthUserFile /usr/local/etc/nagios/htpasswd.users > Require valid-user > I've just successfully tested with the following config for the cgi directory. I was blocked if I came from anywhere besides 172.27.2.0/24 and I was prompted for username/password if I did. I've left everything intact as I tested it -- Options ExecCGI AllowOverride None Order deny,allow Deny from all Allow from 172.27.2.0/24 AuthName "Nagios Access" AuthType Basic AuthUserFile /usr/local/nagios/etc/htpasswd.users Require valid-user Things you need to change should only be - Allow from [your block] (multiple Allow from's...) AuthUserFile [/path/to/htpasswd.users] Once you've got this part working correctly, verify that you have 'use_authentication=1' in cgi.cfg. -- Marc ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gevery at gmail.com Mon Jan 25 23:27:28 2010 From: gevery at gmail.com (Gary Every) Date: Mon, 25 Jan 2010 15:27:28 -0700 Subject: Command execution failed In-Reply-To: <4B5E1520.60706@uci.cu> References: <4B5E1520.60706@uci.cu> Message-ID: Looks like either the execute permissions are improperly set on the script you're trying to run - via ssh - on the remote (not nagios) server, or the user that you've set up shared keys with doesn't have permission to run the command or script. The problem with shared keys and check-by-ssh is that either you've got to set up your scripts to be able to be run by a non-priveleged user, or you've got to share the root users keys . . .. g.; On Mon, Jan 25, 2010 at 3:03 PM, ReynierPM wrote: > Hi every: > Some time ago I wrote a message to this list for get help in how to > check Bacula over Nagios. As bacula nagios plugins use check_host_by_ssh > command I configure shared key between client (monitoring server, where > nagios reside) and server(bacula - server to check) and it works because > I can login from Nagios Server to Bacula Server without enter any > password. Now Nagios is reporting this error: > > Remote command execution failed: Permission denied, please try again. > > How I can fix this now? > -- > Saludos > ReynierPM > > > ------------------------------------------------------------------------------ > The Planet: dedicated and managed hosting, cloud storage, colocation > Stay online with enterprise data centers and the best network in the > business > Choose flexible plans and management services without long-term contracts > Personal 24x7 support from experience hosting pros just a phone call away. > http://p.sf.net/sfu/theplanet-com > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Gary Every "Pay it Forward!" -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Mon Jan 25 23:33:45 2010 From: marc at ena.com (Marc Powell) Date: Mon, 25 Jan 2010 16:33:45 -0600 Subject: Command execution failed In-Reply-To: References: <4B5E1520.60706@uci.cu> Message-ID: On Jan 25, 2010, at 4:27 PM, Gary Every wrote: > The problem with shared keys and check-by-ssh is that either you've got to set up your scripts to be able to be run by a non-priveleged user, or you've got to share the root users keys . . .. or better, use sudo... -- Marc ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From kurt.buff at gmail.com Tue Jan 26 00:03:08 2010 From: kurt.buff at gmail.com (Kurt Buff) Date: Mon, 25 Jan 2010 15:03:08 -0800 Subject: n00b trying to set up on freebsd In-Reply-To: <1FA5D2DF-E3C9-4ED3-85A5-94E69EE42507@ena.com> References: <20100122221319.GI89235@sysmon.tcworks.net> <20100123040329.GJ89235@sysmon.tcworks.net> <20100125210848.GK89235@sysmon.tcworks.net> <1FA5D2DF-E3C9-4ED3-85A5-94E69EE42507@ena.com> Message-ID: On Mon, Jan 25, 2010 at 14:16, Marc Powell wrote: > > On Jan 25, 2010, at 3:40 PM, Kurt Buff wrote: > >> On Mon, Jan 25, 2010 at 13:08, Scott Lambert wrote: >>> On Mon, Jan 25, 2010 at 11:51:51AM -0800, Kurt Buff wrote: >> So, yes, restarting FF on my box does then cause the auth prompt. >> However, if I change httpd.conf to the following, I still get the auth >> prompt, but then get denied for anything but the home page - > > What version of apache and what does error_log have to say about it? apache-2.2.14_5 192.168.24.63 - - [25/Jan/2010:14:48:54 -0800] "GET /nagios HTTP/1.1" 401 401 "-" "Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 5.1; Trident/4.0; .NET CLR 1.1.4322; .NET CLR 2.0.50727; .NET CLR 3.0.04506.30; .NET CLR 3.0.04506.648; .NET CLR 3.0.4506.2152; .NET CLR 3.5.30729)" 192.168.24.63 - nagiosadmin [25/Jan/2010:14:49:00 -0800] "GET /nagios HTTP/1.1" 301 238 "-" "Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 5.1; Trident/4.0; .NET CLR 1.1.4322; .NET CLR 2.0.50727; .NET CLR 3.0.04506.30; .NET CLR 3.0.04506.648; .NET CLR 3.0.4506.2152; .NET CLR 3.5.30729)" 192.168.24.63 - nagiosadmin [25/Jan/2010:14:49:00 -0800] "GET /nagios/ HTTP/1.1" 200 718 "-" "Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 5.1; Trident/4.0; .NET CLR 1.1.4322; .NET CLR 2.0.50727; .NET CLR 3.0.04506.30; .NET CLR 3.0.04506.648; .NET CLR 3.0.4506.2152; .NET CLR 3.5.30729)" 192.168.24.63 - nagiosadmin [25/Jan/2010:14:49:00 -0800] "GET /nagios/side.php HTTP/1.1" 200 4309 "http://loki.zetron.com/nagios/" "Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 5.1; Trident/4.0; .NET CLR 1.1.4322; .NET CLR 2.0.50727; .NET CLR 3.0.04506.30; .NET CLR 3.0.04506.648; .NET CLR 3.0.4506.2152; .NET CLR 3.5.30729)" 192.168.24.63 - nagiosadmin [25/Jan/2010:14:49:00 -0800] "GET /nagios/stylesheets/common.css HTTP/1.1" 304 - "http://loki.zetron.com/nagios/side.php" "Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 5.1; Trident/4.0; .NET CLR 1.1.4322; .NET CLR 2.0.50727; .NET CLR 3.0.04506.30; .NET CLR 3.0.04506.648; .NET CLR 3.0.4506.2152; .NET CLR 3.5.30729)" 192.168.24.63 - nagiosadmin [25/Jan/2010:14:49:00 -0800] "GET /nagios/images/sblogo.png HTTP/1.1" 304 - "http://loki.zetron.com/nagios/side.php" "Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 5.1; Trident/4.0; .NET CLR 1.1.4322; .NET CLR 2.0.50727; .NET CLR 3.0.04506.30; .NET CLR 3.0.04506.648; .NET CLR 3.0.4506.2152; .NET CLR 3.5.30729)" 192.168.24.63 - nagiosadmin [25/Jan/2010:14:49:00 -0800] "GET /nagios/main.php HTTP/1.1" 200 2382 "http://loki.zetron.com/nagios/" "Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 5.1; Trident/4.0; .NET CLR 1.1.4322; .NET CLR 2.0.50727; .NET CLR 3.0.04506.30; .NET CLR 3.0.04506.648; .NET CLR 3.0.4506.2152; .NET CLR 3.5.30729)" 192.168.24.63 - nagiosadmin [25/Jan/2010:14:49:00 -0800] "GET /nagios/images/greendot.gif HTTP/1.1" 304 - "http://loki.zetron.com/nagios/side.php" "Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 5.1; Trident/4.0; .NET CLR 1.1.4322; .NET CLR 2.0.50727; .NET CLR 3.0.04506.30; .NET CLR 3.0.04506.648; .NET CLR 3.0.4506.2152; .NET CLR 3.5.30729)" 192.168.24.63 - nagiosadmin [25/Jan/2010:14:49:00 -0800] "GET /nagios/images/NagiosEnterprises-whitebg-112x46.png HTTP/1.1" 304 - "http://loki.zetron.com/nagios/main.php" "Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 5.1; Trident/4.0; .NET CLR 1.1.4322; .NET CLR 2.0.50727; .NET CLR 3.0.04506.30; .NET CLR 3.0.04506.648; .NET CLR 3.0.4506.2152; .NET CLR 3.5.30729)" 192.168.24.63 - nagiosadmin [25/Jan/2010:14:49:00 -0800] "GET /nagios/images/sflogo.png HTTP/1.1" 304 - "http://loki.zetron.com/nagios/main.php" "Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 5.1; Trident/4.0; .NET CLR 1.1.4322; .NET CLR 2.0.50727; .NET CLR 3.0.04506.30; .NET CLR 3.0.04506.648; .NET CLR 3.0.4506.2152; .NET CLR 3.5.30729)" 192.168.24.63 - nagiosadmin [25/Jan/2010:14:49:00 -0800] "GET /nagios/images/logofullsize.png HTTP/1.1" 304 - "http://loki.zetron.com/nagios/main.php" "Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 5.1; Trident/4.0; .NET CLR 1.1.4322; .NET CLR 2.0.50727; .NET CLR 3.0.04506.30; .NET CLR 3.0.04506.648; .NET CLR 3.0.4506.2152; .NET CLR 3.5.30729)" 192.168.24.63 - nagiosadmin [25/Jan/2010:14:49:00 -0800] "GET /nagios/images/weblogo1.png HTTP/1.1" 304 - "http://loki.zetron.com/nagios/main.php" "Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 5.1; Trident/4.0; .NET CLR 1.1.4322; .NET CLR 2.0.50727; .NET CLR 3.0.04506.30; .NET CLR 3.0.04506.648; .NET CLR 3.0.4506.2152; .NET CLR 3.5.30729)" 192.168.24.63 - nagiosadmin [25/Jan/2010:14:49:02 -0800] "GET /nagios/images/orangedot.gif HTTP/1.1" 304 - "http://loki.zetron.com/nagios/side.php" "Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 5.1; Trident/4.0; .NET CLR 1.1.4322; .NET CLR 2.0.50727; .NET CLR 3.0.04506.30; .NET CLR 3.0.04506.648; .NET CLR 3.0.4506.2152; .NET CLR 3.5.30729)" 192.168.24.63 - - [25/Jan/2010:14:49:13 -0800] "GET /nagios/cgi-bin/status.cgi?servicegroup=all&style=summary HTTP/1.1" 403 227 "http://loki.zetron.com/nagios/side.php" "Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 5.1; Trident/4.0; .NET CLR 1.1.4322; .NET CLR 2.0.50727; .NET CLR 3.0.04506.30; .NET CLR 3.0.04506.648; .NET CLR 3.0.4506.2152; .NET CLR 3.5.30729)" > /usr/local/www/nagios/cgi-bin is the physical directory on your server that contains the *.cgi files? Yes, so it seems: loki/usr/local/www 14:47 2010-01-25 root 0# ll nagios/cgi-bin total 3466 -rwxr-xr-x 1 nagios nagios 250420 Dec 8 08:12 avail.cgi -rwxr-xr-x 1 nagios nagios 246552 Dec 8 08:12 cmd.cgi -rwxr-xr-x 1 nagios nagios 211060 Dec 8 08:12 config.cgi -rwxr-xr-x 1 nagios nagios 265460 Dec 8 08:12 extinfo.cgi -rwxr-xr-x 1 nagios nagios 217652 Dec 8 08:12 histogram.cgi -rwxr-xr-x 1 nagios nagios 194804 Dec 8 08:12 history.cgi -rwxr-xr-x 1 nagios nagios 190804 Dec 8 08:12 notifications.cgi -rwxr-xr-x 1 nagios nagios 184244 Dec 8 08:12 outages.cgi -rwxr-xr-x 1 nagios nagios 189236 Dec 8 08:12 showlog.cgi -rwxr-xr-x 1 nagios nagios 257044 Dec 8 08:12 status.cgi -rwxr-xr-x 1 nagios nagios 212276 Dec 8 08:12 statusmap.cgi -rwxr-xr-x 1 nagios nagios 204724 Dec 8 08:12 statuswml.cgi -rwxr-xr-x 1 nagios nagios 213428 Dec 8 08:12 summary.cgi -rwxr-xr-x 1 nagios nagios 207380 Dec 8 08:12 tac.cgi -rwxr-xr-x 1 nagios nagios 223668 Dec 8 08:12 trends.cgi >> >> ?Options ExecCGI >> ?AllowOverride None >> ?Order allow,deny > > You didn't change the Order directive here. That seems to have fixed it. Wow. > Things you need to change should only be - > ? ? ? ? > ? ? ? ?Allow from [your block] (multiple Allow from's...) > ? ? ? ?AuthUserFile [/path/to/htpasswd.users] > > Once you've got this part working correctly, verify that you have 'use_authentication=1' in cgi.cfg. That was set - I didn't set it, so I'm guessing it was set as part of the default installation. Thanks - that's two people I now owe on this list. Kurt ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rperezm at uci.cu Tue Jan 26 00:10:19 2010 From: rperezm at uci.cu (ReynierPM) Date: Mon, 25 Jan 2010 18:10:19 -0500 Subject: Command execution failed In-Reply-To: References: <4B5E1520.60706@uci.cu> Message-ID: <4B5E24DB.70908@uci.cu> On 1/25/2010 5:33 PM, Marc Powell wrote: > or better, use sudo... Any guide to do this with "sudo"? I not know how to -- Cheers ReynierPM ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rperezm at uci.cu Tue Jan 26 00:12:48 2010 From: rperezm at uci.cu (ReynierPM) Date: Mon, 25 Jan 2010 18:12:48 -0500 Subject: Command execution failed In-Reply-To: References: <4B5E1520.60706@uci.cu> Message-ID: <4B5E2570.2060503@uci.cu> On 1/25/2010 5:27 PM, Gary Every wrote: > Looks like either the execute permissions are improperly set on the > script you're trying to run - via ssh - on the remote (not nagios) > server, or the user that you've set up shared keys with doesn't have > permission to run the command or script. It have permissions: -rwxr-x--- 1 nagios nagios 5020 2009-09-22 00:01 check_bacula.pl* Any other idea? > The problem with shared keys and check-by-ssh is that either you've got > to set up your scripts to be able to be run by a non-priveleged user, or > you've got to share the root users keys . . .. Is this secure? -- Cheers ReynierPM ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mike-nagios at 5dninja.net Tue Jan 26 02:02:26 2010 From: mike-nagios at 5dninja.net (Mike Lindsey) Date: Mon, 25 Jan 2010 17:02:26 -0800 Subject: Overloaded master In-Reply-To: <5f0bcb7c1001250231t4df5c87bjecdee8c70e322ae0@mail.gmail.com> References: <751507e643dfd893c56f3afc13e24b46@roosta.rimann.org> <4B5A3102.8050903@5dninja.net> <5f0bcb7c1001250231t4df5c87bjecdee8c70e322ae0@mail.gmail.com> Message-ID: <4B5E3F22.2090703@5dninja.net> A typical first tier notification goes to 20 people. One of those will be a pager, and is very simple. The rest are fairly complex. Notifications include a link to existing and recent tickets in our ticketing system (this also allows me to not send a ticket opening notification if a ticket already exists).. I populate the notification with links to cacti graphs, links to wiki documentation for the event as well as fire off a secondary notification handler that adds in additional information based on the host, service, and state. The first notification of the cycles does all the heavy lifting and takes about 6 seconds. The other 19 finish relatively quickly. I've been thinking of building a notification server - so I could have separate and discrete notification escalations for different service states - which would also let me fire off one notification with just the contents of $ENV{NAGIOS_*}.. Perhaps that's my best option? Martin Melin wrote: > What kind of notifications are you doing and how many are you sending > out? Why does a notification cycle take 9 seconds to complete? > > On Sat, Jan 23, 2010 at 12:13 AM, Mike Lindsey > wrote: > > What kind of options does one have, if your master nagios server is > getting overloaded? > > I have half a dozen slaves doing polling, submitting passive check > results back via send_nsca. The master does no active polling, just > event processing, notifications, and web ui. > > Under normal circumstances, it works alright. But after a restart it > can take up to half an hour before the master catches up; and if there > are a lot of events, the act of sending out notifications can cause it > to fall behind. > > I'm pre-caching my object file, I'm skipping circular dependency checks, > and I've gotten a notification cycle down to 9 seconds. I tried > modifying nagios to fork before notifications, but that failed pretty > spectacularly; so that 9 seconds is a time where 900 or so passive check > submissions block until the notifications are done. > > Are there any options for running a dual-master setup, or other ways to > spread the load across multiple machines? > > Has anyone patched nsca to submit check results into the checkresults > directory, instead of via the nagios.cmd pipe? What kind of improvement > can one expect from that? > > Any other advice? -- Mike Lindsey ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Tue Jan 26 06:01:11 2010 From: marc at ena.com (Marc Powell) Date: Mon, 25 Jan 2010 23:01:11 -0600 Subject: Command execution failed In-Reply-To: <4B5E2570.2060503@uci.cu> References: <4B5E1520.60706@uci.cu> <4B5E2570.2060503@uci.cu> Message-ID: <3AA81FC7-A5E7-45C4-8C98-315E3C8097FD@ena.com> On Jan 25, 2010, at 5:12 PM, ReynierPM wrote: > It have permissions: > -rwxr-x--- 1 nagios nagios 5020 2009-09-22 00:01 check_bacula.pl* > Any other idea? Can you run check_bacula.pl as the nagios user on the remote machine? If not, read any documentation you received with the plugin to see what other changes you need to make. I'm guessing that other programs it calls are not executable by the nagios user. -- Marc ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From asraikhn at gmail.com Tue Jan 26 06:33:43 2010 From: asraikhn at gmail.com (Asrai khn) Date: Tue, 26 Jan 2010 10:33:43 +0500 Subject: Defining service dependency In-Reply-To: <4B5DC700.3000009@flatto.net> References: <5f0f8dba1001250821s31fc4d0r84e8bbe55e2aa382@mail.gmail.com> <4B5DC700.3000009@flatto.net> Message-ID: <5f0f8dba1001252133j33441ef0h51354407b12ea2ee@mail.gmail.com> On Mon, Jan 25, 2010 at 9:29 PM, Assaf Flatto wrote: > Asrai khn wrote: > > Hi list, > > > > I want to define service dependency, i have read the relevant nagios > > docs but I am not getting how to define one. we are running nagios > > 3.2.x (pre build binary on centos 5.4) > > Eg, I want to define service dependency for all services that been > > checked with nrpe, ie if nrpe not running on remote host then nagios > > shouldn't send notifications for these services instead just for nrpe. > > > > Few days back our monitoring host send lot of notifications coz of > > nrpe wasn't running on remote hosts, in this case I want nagios only > > alert for 'nrpe' not all the services that being checked with nrpe. > > > > 1. So first I have define a service to check nrpe on remote hosts. > > > > # Command def for checking nrpe on remote hosts. > > define command{ > > command_name nrpe_service > > command_line $USER1$/check_nrpe -H $HOSTADDRESS$ > > } > > > > > > 2. define a service for each hostgroup. > > > > # service def. > > define service{ > > use check-interval-service > > service_description nrpe > > check_command nrpe_service > > hostgroup_name Sol-Servers > > } > > > > So far so good, I have several hosts (linux, solaris, hp) all of them > > have the following standard service definition for checking remote > > hosts service via check_nrpe > > > > define service{ > > use local-service > > host_name BB-2,NB-1 > > service_description SCC Procs > > check_command check_nrpe!check_scc_procs > > } > > define service{ > > use local-service > > host_name BB-2 > > service_description SCC Partition > > check_command nrpe_disk!check_disk4 > > } > > > > we have 5-10 nrpe check for each hosts which are scattered in > > different hostgroups. > > > > Now how to define a dependency so that if 'nrpe' service fail on > > remote hosts nagios shouldn't send alerts for services whom check via > > nrpe. > > > > Any configuration example will be greatly appreciated. > > > > Due to binary version of nagios installed it will not possible for me > > to go for any patching solution. > > You got so far and stopped near the finish line !! > define servicedependency{ > host_name BB-2,NB-1 > service nrpe > dependent_host BB-2,NB-1 > dendent_service SCC Partition > execution_failure_criteria u,c,p > notification_failure_criteria u,c,p > } > > this is of the top of my head - you will need to make sure the > directives are according to the nagios definitions (I'm sure i got at > least one wrong) , but that is the way to move on. > > Thanks Assaf, I ended up with the following working version, define servicedependency{ dependent_host_name Sol-1 dependent_service_description Root Partition,Var Partition,Data Partition,Current Load,FTP-BB host_name Sol-1 service_description nrpe execution_failure_criteria w,u,c notification_failure_criteria w,u,c } Later I'll switch my dependency configuration to something simple by using "Time-saving Tricks" ---> http://nagios.sourceforge.net/docs/3_0/objecttricks.html#same_host_dependency Tell me one thing, is it ok to get ... Checking service dependencies... Checked 10 service dependencies. For the 5 dependent_service_description above? Thanks. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ml at nicole-haehnel.de Tue Jan 26 07:58:24 2010 From: ml at nicole-haehnel.de (Nicole H=?UTF-8?B?w6Q=?=hnel) Date: Tue, 26 Jan 2010 07:58:24 +0100 (CET) Subject: =?utf-8?q?=5B_Nagios-users_=5D_Problems_using_chec?= =?utf-8?q?k_=5F_yum_on_sles11?= Message-ID: <20100126065825.115051804CDE5@dd18808.kasserver.com> Yes, "check_yum -vvv" helped. Summary line should be "No packages needed, for security, d+ available" but I get "No packages needed for security; d+ packages available" Thanks! Nicole Am 25.01.2010 15:24, schrieb Martin Melin: > I think "check_yum -vvv" will give some useful information on what might be going wrong. I tested 0.7.1 and it seemed to work fine for me. > > Although I personally use http://git.op5.org/git/?p=nagios/op5plugins.git;a=blob_plain;f=check_yum_update.pl;hb=HEAD :-) > > On Mon, Jan 25, 2010 at 12:32 PM, Nicole H?hnel wrote: > > Hi, > > I'm trying to use check_yum 0.7.1 (Hari Sekhon) on sles 11 with yum 3.2.25. > "yum check-update" works on the commandline. > > Check_yum says: > > YUM WARNING: Cannot find summary line in yum output. Please make sure you have upgraded to the latest version of this plugin. If the problem persists, please contact the author for a fix > > Any experiences? > > Thanks! > > Nicole > > > > > > ------------------------------------------------------------------------------ > Throughout its 18-year history, RSA Conference consistently attracts the > world's best and brightest in the field, creating opportunities for Conference > attendees to learn about information security's most important issues through > interactions with peers, luminaries and emerging and established companies. > http://p.sf.net/sfu/rsaconf-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > > > > -- > Martin Melin > ____________________________ > op5 AB > http://www.op5.com > > http://www.op5.org/ > http://www.op5.com/op5/products/network-monitor/nagios/ ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From binbin.b.wang at hotmail.com Tue Jan 26 02:36:55 2010 From: binbin.b.wang at hotmail.com (Wang Binbin) Date: Tue, 26 Jan 2010 09:36:55 +0800 Subject: How to stop sending notification after 3 times? In-Reply-To: <2274b9c31001250554o3f324a3btec24d294bd5ed4a1@mail.gmail.com> References: <2274b9c31001250554o3f324a3btec24d294bd5ed4a1@mail.gmail.com> Message-ID: Kyle Much appreciate your reply, and I am a rookie in nagios. Could you give me much detail about how to use macro $NOTIFICATIONNUMBERS$? How can I add the "if" judge in command definition? Millions of thanks. define command{ command_name notify-by-email command_line /usr/bin/printf "%b" "***** Nagios 2.9 *****\n\nNotification Type: $NOTIFICATIONTYPE$\n\nService: $SERVICEDESC$\nHost: $HOSTALIAS$\nAddress: $HOSTADDRESS$\nState: $SERVICESTATE$\n\nDate/Time: $LONGDATETIME$\n\nAdditional Info:\n\n$SERVICEOUTPUT$" | /usr/local/bin/sendEmail -f nagios at test.com -t $CONTACTEMAIL$ -s mail.test.com -u "** $NOTIFICATIONTYPE$ alert - $HOSTALIAS$/$SERVICEDESC$ is $SERVICESTATE$ **" -xu nagios } -------------------------------------------------- From: "Kyle O'Donnell" Sent: Monday, January 25, 2010 9:54 PM To: "Binbin Wang" Cc: Subject: Re: [Nagios-users] How to stop sending notification after 3 times? > you could also pass the macro $NOTIFICATIONNUMBER$ to your > notification command, which does an if on the value. > > [ $NOTIFICATIONNUMBER$ -le 3 ] && echo blah blah blah $HOSTNAME$ > $SERVICE... |mail > > On 1/25/10, Binbin Wang wrote: >> Hi all >> >> I make some troubles in nagios setting. My customer just only receive >> notification only 3 times, and stop receiving it even through that warning >> still exists. Anyone know how to set up nagios ? thanks in advance. >> >> Binbin > ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From martinm at op5.org Tue Jan 26 08:59:05 2010 From: martinm at op5.org (Martin Melin) Date: Tue, 26 Jan 2010 08:59:05 +0100 Subject: Virtual Machines - define as parent or as host dependency... In-Reply-To: <4B5DFBB2.6060708@gmail.com> References: <4B5DFBB2.6060708@gmail.com> Message-ID: <5f0bcb7c1001252359x485bf1b4pc0107485c0707823@mail.gmail.com> I think you should be fine with setting the VMWare host to be the parent of your guests. It's the simplest configuration, it meets your goals (host down suppresses guests, but host's services do not affect guests' notifications) and it has the added benefit of looking good in the Status Map :-) IMHO using parents for VMWare hosts/guests is within the intended use of the directive, even though the docs only talk about network equipment, because your guests are as directly dependent upon their host as a physical host is of its switch. On Mon, Jan 25, 2010 at 9:14 PM, Andrew Davis wrote: > I'm trying to figure out the best way to do this, yet keep things as > simple as possible. > > Say I have a server called Saturn running VMWare. I'm monitoring this > server with Nagios. > I also have three VM's on Saturn: Jupiter, Mars, and Pluto > > I want to suppress all host and service alerts on Jupiter, Mars, & Pluto if > the host Saturn is down (unreachable). I do NOT want to suppress host or > service alerts from Jupiter, Mars, and Pluto if the VMWare processes > (services) are down on Saturn. Basically, if my VM server is completely > unreachable, don't bother me about its client VM's. > > Am I better off doing this with a host dependency? Something like: > > *define hostdependency { > host_name Host B > dependent_host_name Host C > notification_failure_criteria d,u > }* > > Or am I better off defining Saturn as the parent of the VM's in the host > config? Something like: > > *define host { > host_name jupiter > use VMs > alias jupiter > address 172.26.251.60 > parents saturn, tpdmzsw1 > }* > > I've successfully used the "parents" directive to define network topology, > so I would think this would work. What might be the risks of defining both? > > -- > > > A. Davis > Email: nccomp at gmail.com > > "There is no limit to what a man can accomplish > if he doesn't care who gets the credit." - Ronald Reagan > > > > ------------------------------------------------------------------------------ > The Planet: dedicated and managed hosting, cloud storage, colocation > Stay online with enterprise data centers and the best network in the > business > Choose flexible plans and management services without long-term contracts > Personal 24x7 support from experience hosting pros just a phone call away. > http://p.sf.net/sfu/theplanet-com > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Martin Melin ____________________________ op5 AB http://www.op5.com http://www.op5.org/ http://www.op5.com/op5/products/network-monitor/nagios/ -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From s.shipway at auckland.ac.nz Tue Jan 26 09:03:23 2010 From: s.shipway at auckland.ac.nz (Steve Shipway) Date: Tue, 26 Jan 2010 21:03:23 +1300 Subject: Virtual Machines - define as parent or as host dependency... In-Reply-To: <4B5DFBB2.6060708@gmail.com> References: <4B5DFBB2.6060708@gmail.com> Message-ID: <6B587E8C999646469B54486AF21958460CE66C8670@UXCHANGE7-1.UoA.auckland.ac.nz> This is the way we do it, with Parents (not host dependencies). First we create a virtual object for the VMWare farm. This has a status of UP if any of the farm servers are up (using check_summary). This virtual 'host' has several services, using the v0.9 check_vmware, relating to the farm's alarms, storage volumes, etc. These services have service dependencies on the VirtualCentre service running on the Virtual Centre host. The Farm object has ALL of the ESX Servers as Parents. All the VMs in the farm have the Farm object as a parent. Some of them also use check_esx3 to alert on Alarms, CPU, and Memory usage within VMWare. This might seem a bit complex if you've only the one server, but as soon as oyu have multiple servers in the farm, and use DRS, you have to use a farm object for parents/dependencies. It might make more sense for these relationships to be host dependencies rather than parents i nmost cases, but we have a SAN mirrored environment to a seocnd ESX farm so that the VMs can be brought up ther ein the event of a complete farm outage, hence the use of Parents rather than dependencies. If you have VSphere4 (ESX4.0) with a SNMP-enabled Cisco virtual switch in the farm, you could probably make the virtual switch the parent device rather than having to use a farm object. The VMWare monitoring plugin we're using is v0.9 of check_vmware, from here: http://www.steveshipway.org/forum/viewtopic.php?f=28&t=1648 check_summary is available from nagiosexchange.org (as is check_esx3 which is the forerunner of check_vmware) Steve ________________________________ From: Andrew Davis [nccomp at gmail.com] Sent: Tuesday, 26 January 2010 9:14 a.m. To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] Virtual Machines - define as parent or as host dependency... I'm trying to figure out the best way to do this, yet keep things as simple as possible. Say I have a server called Saturn running VMWare. I'm monitoring this server with Nagios. I also have three VM's on Saturn: Jupiter, Mars, and Pluto I want to suppress all host and service alerts on Jupiter, Mars, & Pluto if the host Saturn is down (unreachable). I do NOT want to suppress host or service alerts from Jupiter, Mars, and Pluto if the VMWare processes (services) are down on Saturn. Basically, if my VM server is completely unreachable, don't bother me about its client VM's. Am I better off doing this with a host dependency? Something like: define hostdependency { host_name Host B dependent_host_name Host C notification_failure_criteria d,u } Or am I better off defining Saturn as the parent of the VM's in the host config? Something like: define host { host_name jupiter use VMs alias jupiter address 172.26.251.60 parents saturn, tpdmzsw1 } I've successfully used the "parents" directive to define network topology, so I would think this would work. What might be the risks of defining both? -- A. Davis Email: nccomp at gmail.com "There is no limit to what a man can accomplish if he doesn't care who gets the credit." - Ronald Reagan -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From martinm at op5.org Tue Jan 26 09:12:30 2010 From: martinm at op5.org (Martin Melin) Date: Tue, 26 Jan 2010 09:12:30 +0100 Subject: Overloaded master In-Reply-To: <4B5E3F22.2090703@5dninja.net> References: <751507e643dfd893c56f3afc13e24b46@roosta.rimann.org> <4B5A3102.8050903@5dninja.net> <5f0bcb7c1001250231t4df5c87bjecdee8c70e322ae0@mail.gmail.com> <4B5E3F22.2090703@5dninja.net> Message-ID: <5f0bcb7c1001260012qf675e94m6de072cd85bdab44@mail.gmail.com> I would stop doing the time-consuming ticket integration (or whatever is taking up the time) from within the notification command itself. I don't think you need to build a separate notification server - after all, notification logic is one of the things Nagios does best - but I do think you could build a quick program to do the heavy lifting for notifications, per Nagios' instructions. Without knowing what kind of info you need, I would probably have the Nagios notification command write to a spool directory, then send a signal to a separate program to have it read the spool directory and create/search for tickets etc. On Tue, Jan 26, 2010 at 2:02 AM, Mike Lindsey wrote: > A typical first tier notification goes to 20 people. One of those will be > a pager, and is very simple. > > The rest are fairly complex. > > Notifications include a link to existing and recent tickets in our > ticketing system (this also allows me to not send a ticket opening > notification if a ticket already exists).. I populate the notification with > links to cacti graphs, links to wiki documentation for the event as well as > fire off a secondary notification handler that adds in additional > information based on the host, service, and state. > > The first notification of the cycles does all the heavy lifting and takes > about 6 seconds. The other 19 finish relatively quickly. > > I've been thinking of building a notification server - so I could have > separate and discrete notification escalations for different service states > - which would also let me fire off one notification with just the contents > of $ENV{NAGIOS_*}.. Perhaps that's my best option? > > Martin Melin wrote: > >> What kind of notifications are you doing and how many are you sending out? >> Why does a notification cycle take 9 seconds to complete? >> >> On Sat, Jan 23, 2010 at 12:13 AM, Mike Lindsey > mike-nagios at 5dninja.net>> wrote: >> >> What kind of options does one have, if your master nagios server is >> getting overloaded? >> >> I have half a dozen slaves doing polling, submitting passive check >> results back via send_nsca. The master does no active polling, just >> event processing, notifications, and web ui. >> >> Under normal circumstances, it works alright. But after a restart it >> can take up to half an hour before the master catches up; and if there >> are a lot of events, the act of sending out notifications can cause it >> to fall behind. >> >> I'm pre-caching my object file, I'm skipping circular dependency >> checks, >> and I've gotten a notification cycle down to 9 seconds. I tried >> modifying nagios to fork before notifications, but that failed pretty >> spectacularly; so that 9 seconds is a time where 900 or so passive >> check >> submissions block until the notifications are done. >> >> Are there any options for running a dual-master setup, or other ways to >> spread the load across multiple machines? >> >> Has anyone patched nsca to submit check results into the checkresults >> directory, instead of via the nagios.cmd pipe? What kind of >> improvement >> can one expect from that? >> >> Any other advice? >> > > > -- > Mike Lindsey > -- Martin Melin ____________________________ op5 AB http://www.op5.com http://www.op5.org/ http://www.op5.com/op5/products/network-monitor/nagios/ -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From asraikhn at gmail.com Tue Jan 26 09:36:21 2010 From: asraikhn at gmail.com (Asrai khn) Date: Tue, 26 Jan 2010 13:36:21 +0500 Subject: Defining service dependency In-Reply-To: <5f0f8dba1001252133j33441ef0h51354407b12ea2ee@mail.gmail.com> References: <5f0f8dba1001250821s31fc4d0r84e8bbe55e2aa382@mail.gmail.com> <4B5DC700.3000009@flatto.net> <5f0f8dba1001252133j33441ef0h51354407b12ea2ee@mail.gmail.com> Message-ID: <5f0f8dba1001260036m7d471e88r5438d8b59026b9c2@mail.gmail.com> On Tue, Jan 26, 2010 at 10:33 AM, Asrai khn wrote: > > > Later I'll switch my dependency configuration to something simple by using > "Time-saving Tricks" ---> > http://nagios.sourceforge.net/docs/3_0/objecttricks.html#same_host_dependency > > Tell me one thing, is it ok to get ... > > Checking service dependencies... > Checked 10 service dependencies. > > For the 5 dependent_service_description above? > > > > I have just configured service dependency using "*All Services In Multiple Servicegroups" *nagios docs. 1. First of all i have created (4) servicegroups for services which using nrpe for checking remote hosts including a servicegroup for service 'nrpe' on 8 different hosts The other 3 servicegroups contains total 95 services 2. Then defined service dependency using following config define servicedependency{ dependent_servicegroup_name Sol-CPU-Load,Sol-Partitions,Sol-Processes servicegroup_name Sol-nrpe execution_failure_criteria w,u,c notification_failure_criteria w,u,c } Everything is working just fine I have tested it by stopping nrpe on one of the server and nagios only send notification for master process 'nrpe' by suppressing all all other checks and notification for the host. However I am confusing with the nagios -v check its giving me too much counts for service dependency and i wonders if this have any effect on monitoring host (nagios) performance, working etc. *sudo nagios -v /etc/nagios/nagios.cfg Checking service dependencies... Checked 1530 service dependencies.* 1530 service dependencies..... and this also include the previous 10 for another server, but I wonders what this counts mean? I'll greatly appreciate if someone shed some light on it. Thanks. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mark.elsen at gmail.com Tue Jan 26 11:18:48 2010 From: mark.elsen at gmail.com (Mark Elsen) Date: Tue, 26 Jan 2010 11:18:48 +0100 Subject: Using Unix groupnames in contact definitions or cgi authorization schemes Message-ID: Nagios 3.x ---------------- - I was wondering wether one can use UNIX groupnames in contact defintions , or in cgi auth. directives such as 'authorized_for_all_service_commands=..." ; instead of having to use individual usernames ? I need to make a simple distinction between a couple of divisions, regarding nagios access and server monitoring in our company. Using individual usernames would be cumbersome w.r.t the simple allowed views I want to use in the NAGIOS (console). Tx, M. ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mesutgl at iem.gov.tr Tue Jan 26 11:25:02 2010 From: mesutgl at iem.gov.tr (=?iso-8859-9?Q?Mesut_G=DCLNAZ?=) Date: Tue, 26 Jan 2010 12:25:02 +0200 Subject: popen timeout received, but no child process Message-ID: <010b01ca9e71$d1dd3a30$7597ae90$@gov.tr> [1264500750] SERVICE ALERT: CCC151;PING;CRITICAL;SOFT;2;CRITICAL - popen timeout received, but no child process ? am getting this error. ? googled and read a few forum but there is no solutions for me. ? use FreeBSD 8.0 amd64 last pid: 24484; load averages: 0.05, 0.16, 0.21 up 0+04:37:59 12:21:44 281 processes: 1 running, 280 sleeping CPU: 14.9% user, 0.0% nice, 11.9% system, 0.0% interrupt, 73.2% idle Mem: 80M Active, 54M Inact, 111M Wired, 53M Buf, 739M Free Swap: 1500M Total, 1500M Free What can i do to solve this? Thanks.. I have a system like this. Checked 1231 hosts. Checking host groups... Checked 49 host groups. Checking service groups... Checked 0 service groups. Checking contacts... Checked 44 contacts. Checking contact groups... Checked 44 contact groups. Checking service escalations... Checked 0 service escalations. Checking service dependencies... Checked 0 service dependencies. Checking host escalations... Checked 0 host escalations. Checking host dependencies... Checked 0 host dependencies. Checking commands... Checked 24 commands. Checking time periods... Checked 2 time periods. Checking for circular paths between hosts... Checking for circular host and service dependencies... Checking global event handlers... Checking obsessive compulsive processor commands... Checking misc settings... Total Warnings: 288 Total Errors: 0 All of my hosy definitions are like that: #'CCC151' host definition define host{ use generic-host host_name CCC151 alias CCC151 address 172.20.55.55 parents CCC_15 # check_command check-host-alive max_check_attempts 10 notification_interval 120 notification_period 24x7 notification_options d,u,r contact_groups CCC1_admin, CCC15_admin,CCC151-admin } And allof my service definitions are like: define service{ use generic-service host_name CCC151,CCC152,CCC153. etc. service_description PING is_volatile 0 check_period 24x7 max_check_attempts 3 normal_check_interval 5 retry_check_interval 1 contact_groups CCCs notification_interval 120 notification_period 24x7 notification_options c,r check_command check-host-alive } -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jpratt at norwich.edu Tue Jan 26 13:58:10 2010 From: jpratt at norwich.edu (James Pratt) Date: Tue, 26 Jan 2010 07:58:10 -0500 Subject: Virtual Machines - define as parent or ashost dependency... In-Reply-To: <6B587E8C999646469B54486AF21958460CE66C8670@UXCHANGE7-1.UoA.auckland.ac.nz> References: <4B5DFBB2.6060708@gmail.com> <6B587E8C999646469B54486AF21958460CE66C8670@UXCHANGE7-1.UoA.auckland.ac.nz> Message-ID: <24B6509E4191AF44B60A24EAA3B4AD4945F0D4@nuexchg.norwich.edu> >> -----Original Message----- >> From: Steve Shipway [mailto:s.shipway at auckland.ac.nz] >> Sent: Tuesday, January 26, 2010 3:03 AM >> To: nccomp at gmail.com; nagios-users at lists.sourceforge.net >> Subject: Re: [Nagios-users] Virtual Machines - define as parent or ashost >> dependency... >> >> This is the way we do it, with Parents (not host dependencies). >> Hi Steve, been following this with great interest. Care to share how you do host dep's too?, this is all very helpful, thank you! :) >> First we create a virtual object for the VMWare farm. This has a status of UP if any >> of the farm servers are up (using check_summary). This virtual 'host' has several >> services, using the v0.9 check_vmware, relating to the farm's alarms, storage >> volumes, etc. These services have service dependencies on the VirtualCentre service >> running on the Virtual Centre host. >> >> The Farm object has ALL of the ESX Servers as Parents. So ultimately, the "Farm object" definition would be: the vcenter server (with esx hosts as parents?), combined with a service check on the vcenter service? Not familiar with check_summary, sorry, that's next up to google. ;) >> >> All the VMs in the farm have the Farm object as a parent. Some of them also use >> check_esx3 to alert on Alarms, CPU, and Memory usage within VMWare. >> >> This might seem a bit complex if you've only the one server, but as soon as oyu >> have multiple servers in the farm, and use DRS, you have to use a farm object for >> parents/dependencies. >> Yes, I feel that pain well. How can/do you keep track of what ESX hosts are parents to what vm's when DRS is in fully-automated mode, that seems to be the key... ? >> It might make more sense for these relationships to be host dependencies rather >> than parents i nmost cases, but we have a SAN mirrored environment to a seocnd >> ESX farm so that the VMs can be brought up ther ein the event of a complete farm >> outage, hence the use of Parents rather than dependencies. >> >> If you have VSphere4 (ESX4.0) with a SNMP-enabled Cisco virtual switch in the farm, >> you could probably make the virtual switch the parent device rather than having to >> use a farm object. >> >> The VMWare monitoring plugin we're using is v0.9 of check_vmware, from here: >> http://www.steveshipway.org/forum/viewtopic.php?f=28&t=1648 >> >> check_summary is available from nagiosexchange.org (as is check_esx3 which is the >> forerunner of check_vmware) >> >> Steve Excellent - off to have a look. We use a combination of things at the moment, including check_esx3 and also use python/WBEM & snmp on HP hardware. Thanks much! James ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nigel.leach at uk.bnpparibas.com Tue Jan 26 13:58:51 2010 From: nigel.leach at uk.bnpparibas.com (nigel.leach at uk.bnpparibas.com) Date: Tue, 26 Jan 2010 12:58:51 +0000 Subject: check_ifoperstatus Query Message-ID: I maybe expecting something that is not designed, but I was hoping to exclude checking of my loopback interface #snmpwalk -v1 -c public localhost ifType IF-MIB::ifType.1 = INTEGER: softwareLoopback(24) IF-MIB::ifType.2 = INTEGER: ethernetCsmacd(6) IF-MIB::ifType.3 = INTEGER: ethernetCsmacd(6) # ./check_ifoperstatus -H localhost -T 6 OK: Interface eth0 (index 2) is up. Should I not be seeing two interfaces in the check_ifoperstatus output ? I have check_ifoperstatus v1.4.14 (nagios-plugins 1.4.14) on AS4 U6. Regards Nigel Nigel Leach Grid Engineer | Front Office Risk Systems | BNP Paribas ___________________________________________________________ This communication is confidential, may be privileged and is meant only for the intended recipient. If you are not the intended recipient, please notify the sender by reply and delete the message from your system. Any unauthorised dissemination, distribution or copying hereof is prohibited. BNP Paribas Trust Corporation UK Limited, BNP Paribas UK Limited, BNP Paribas Commodity Futures Limited, BNP Paribas Asset Management UK Limited and Investment Fund Services Limited are authorised and regulated by the Financial Services Authority. BNP Paribas London Branch and BNP Paribas Wealth Management London Branch are authorised by the CECEI and supervised by the Commission Bancaire. BNP Paribas London Branch is authorised and subject to limited regulation by the Financial Services Authority. Details about the extent of our authorisation and regulation by the Financial Services Authority are available from us on request. BNP Paribas is also a member of the London Stock Exchange. BNP Paribas Wealth Management London Branch is subject to limited regulation by the Financial Services Authority. Details about the extent of our authorisation and regulation by the Financial Services Authority are available from us on request. BNP Paribas Securities Services London Branch is authorised by the CECEI and supervised by the AMF, and subject to limited regulation by the Financial Services Authority. Details on the extent of our regulation by the Financial Services Authority are available from us on request. BNP Paribas Securities Services is also a member of the London Stock Exchange. BNP Paribas Trust Corporation UK Limited is registered in England and Wales (registered no. 4042668) at registered office 55 Moorgate, London EC2R 6PA. BNP Paribas UK Limited is registered in England and Wales (registered no. 1488108) at registered office 10 Harewood Avenue, London NW1 6AA. BNP Paribas Commodity Futures Limited is registered in England and Wales (registered no. 2391477) at registered office 10 Harewood Avenue, London NW1 6AA. BNP Paribas Asset Management UK Limited is registered in England and Wales (registered no. 2474627) at registered office 10 Harewood Avenue, London NW1 6AA. Investment Fund Services Limited is registered in England and Wales (registered no. 6110770) at registered office 55 Moorgate, London EC2R 6PA. BNP Paribas London Branch is registered in England and Wales (registered no. FC13447) at registered office 10 Harewood Avenue, London NW1 6AA. BNP Paribas Wealth Management London Branch is registered in England and Wales (registered no. FC023926) at registered office 10 Harewood Avenue, London NW1 6AA. BNP Paribas Securities Services London Branch is registered in England and Wales (registered no. BR006393) at registered office 55 Moorgate, London, EC2R 6PA. BNP Paribas Lease Group Plc is registered in England and Wales (registered no. 2341989) at registered office Northern Cross, Basing View, Basingstoke, Hampshire RG21 4HL. ____________________________________________________________ -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Tue Jan 26 14:43:01 2010 From: marc at ena.com (Marc Powell) Date: Tue, 26 Jan 2010 07:43:01 -0600 Subject: Using Unix groupnames in contact definitions or cgi authorization schemes In-Reply-To: References: Message-ID: <268C4614-8A2B-4F29-B5DB-175CAB1C043E@ena.com> On Jan 26, 2010, at 4:18 AM, Mark Elsen wrote: > Nagios 3.x > ---------------- > > - I was wondering wether one can use UNIX groupnames in contact > defintions , or in cgi auth. directives > such as 'authorized_for_all_service_commands=..." ; instead of having > to use individual usernames ? There's no Apache auth module that works like that, that I know of. All nagios does is look at the web server environment variable 'REMOTE_USER' to determine the username of the logged in user. If you can find (or write) an Apache auth module that sets REMOTE_USER to what you need then it will work. The difficult part is that you want to change REMOTE_USER from the username passed during login to something else after authentication. I'm not sure that Apache permits/supports that. > I need to make a simple distinction between a couple of divisions, > regarding nagios access and > server monitoring in our company. Using individual usernames would be > cumbersome w.r.t the simple > allowed views I want to use in the NAGIOS (console). You could create contacts and associated .htpasswd entries that are role accounts and provide the divisions with the single username/password that is theirs. That's worked well for our different NOCs. -- Marc ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nccomp at gmail.com Tue Jan 26 15:38:57 2010 From: nccomp at gmail.com (Andrew Davis) Date: Tue, 26 Jan 2010 09:38:57 -0500 Subject: Virtual Machines - define as parent or as host dependency... In-Reply-To: <6B587E8C999646469B54486AF21958460CE66C8670@UXCHANGE7-1.UoA.auckland.ac.nz> References: <4B5DFBB2.6060708@gmail.com> <6B587E8C999646469B54486AF21958460CE66C8670@UXCHANGE7-1.UoA.auckland.ac.nz> Message-ID: <4B5EFE81.5070304@gmail.com> Thanks much for the info. Curious... you mention ESX and VSphere. Do you know if your check_vmware test also works against VMWare Server 2.x (free edition). We are running ESX internally, but have a single external server running VMWare Server 2.x with three VMs in it. Hence why I ask... A. Davis Email: nccomp at gmail.com "There is no limit to what a man can accomplish if he doesn't care who gets the credit." - Ronald Reagan On 1/26/10 3:03 AM, Steve Shipway wrote: > This is the way we do it, with Parents (not host dependencies). > First we create a virtual object for the VMWare farm. This has a > status of UP if any of the farm servers are up (using check_summary). > This virtual 'host' has several services, using the v0.9 check_vmware, > relating to the farm's alarms, storage volumes, etc. These services > have service dependencies on the VirtualCentre service running on the > Virtual Centre host. > The Farm object has ALL of the ESX Servers as Parents. > All the VMs in the farm have the Farm object as a parent. Some of > them also use check_esx3 to alert on Alarms, CPU, and Memory usage > within VMWare. > This might seem a bit complex if you've only the one server, but as > soon as oyu have multiple servers in the farm, and use DRS, you have > to use a farm object for parents/dependencies. > It might make more sense for these relationships to be host > dependencies rather than parents i nmost cases, but we have a SAN > mirrored environment to a seocnd ESX farm so that the VMs can be > brought up ther ein the event of a complete farm outage, hence the use > of Parents rather than dependencies. > If you have VSphere4 (ESX4.0) with a SNMP-enabled Cisco virtual switch > in the farm, you could probably make the virtual switch the parent > device rather than having to use a farm object. > The VMWare monitoring plugin we're using is v0.9 of check_vmware, from > here: http://www.steveshipway.org/forum/viewtopic.php?f=28&t=1648 > > check_summary is available from nagiosexchange.org (as is check_esx3 > which is the forerunner of check_vmware) > Steve > ------------------------------------------------------------------------ > *From:* Andrew Davis [nccomp at gmail.com] > *Sent:* Tuesday, 26 January 2010 9:14 a.m. > *To:* nagios-users at lists.sourceforge.net > *Subject:* [Nagios-users] Virtual Machines - define as parent or as > host dependency... > > I'm trying to figure out the best way to do this, yet keep things as > simple as possible. > > Say I have a server called Saturn running VMWare. I'm monitoring this > server with Nagios. > I also have three VM's on Saturn: Jupiter, Mars, and Pluto > > I want to suppress all host and service alerts on Jupiter, Mars, & > Pluto if the host Saturn is down (unreachable). I do NOT want to > suppress host or service alerts from Jupiter, Mars, and Pluto if the > VMWare processes (services) are down on Saturn. Basically, if my VM > server is completely unreachable, don't bother me about its client VM's. > > Am I better off doing this with a host dependency? Something like: > > *define hostdependency { > host_name Host B > dependent_host_name Host C > notification_failure_criteria d,u > } > * > Or am I better off defining Saturn as the parent of the VM's in the > host config? Something like: > > *define host { > host_name jupiter > use VMs > alias jupiter > address 172.26.251.60 > parents saturn, tpdmzsw1 > }* > I've successfully used the "parents" directive to define network > topology, so I would think this would work. What might be the risks of > defining both? > -- > > > A. Davis > Email:nccomp at gmail.com > > "There is no limit to what a man can accomplish > if he doesn't care who gets the credit." - Ronald Reagan > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Tue Jan 26 17:23:04 2010 From: patrick.morris at hp.com (Morris, Patrick) Date: Tue, 26 Jan 2010 08:23:04 -0800 Subject: Using Unix groupnames in contact definitions or cgi authorization schemes In-Reply-To: References: Message-ID: <4B5F16E8.8020102@hp.com> Mark Elsen wrote: > - I was wondering wether one can use UNIX groupnames in contact > defintions , or in cgi auth. directives > such as 'authorized_for_all_service_commands=..." ; instead of having > to use individual usernames ? > > I need to make a simple distinction between a couple of divisions, > regarding nagios access and > server monitoring in our company. Using individual usernames would be > cumbersome w.r.t the simple > allowed views I want to use in the NAGIOS (console). The only way to pull this off, really, is to define contactgroups in Nagios that match your Unix groups. Since Nagios does not deal with Unix users and groups directly, you'd need something like a script that generated Nagios configs from your account information. In our case here, that's relatively trivial. A simple script can go out and pull a group and its members from LDAP and reformat that information into a Nagios config. I suppose it wouldn't be much harder to do it with local accounts, if we had a script to parse the group information from the passwd and group files. So I guess the short answer is no, Nagios doesn't know how to deal with Unix account info directly, but that doesn't mean you can't do what you're trying to do. ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From casey at shobe.info Tue Jan 26 16:44:15 2010 From: casey at shobe.info (Casey Allen Shobe) Date: Tue, 26 Jan 2010 10:44:15 -0500 Subject: Unexplainable disk space usage in nagios/var/spool/checkresults Message-ID: Hi! I have Nagios installed on a 1GB partition, with everything installed under a nagios directory on that partition. I did not expect Nagios to start consuming a large amount of space, and now that it is, I cannot determine why or in what files, which is baffling. I've found the biggest culprit is the var/spool/checkresults directory within the Nagios installation. a `du` command on the directory shows that it is using 287MB. However, when I `ls -la` within that directory, I can't see any file using more than a single kilobyte, and there aren't that many files. However, an `ls -ld` on the directory itself shows that the directory itself is using 286MB. That doesn't make sense to me, and so I'm looking for input! I wonder if the following approach would help correct things (but I'd like to understand why they're in this state first): Stop nagios, move the checkresults directory to checkresults.old, make a new checkresults directory, copy all the old files into it, remove the empty .old directory, start nagios. The filesystem is JFS, if that makes any difference. Here is raw output of what I'm seeing: root at indhlcwww1:/mnt/data/share/nagios-3.2.0/var/spool/checkresults # du -h 287M . root at indhlcwww1:/mnt/data/share/nagios-3.2.0/var/spool/checkresults # ls -laSrh total 287M -rw------- 1 nagios nagios 0 2010-01-26 15:23 cZar4eF.ok -rw------- 1 nagios nagios 0 2010-01-04 19:15 cy8clT1.ok -rw------- 1 nagios nagios 0 2010-01-26 15:23 cXaj1gN.ok -rw------- 1 nagios nagios 0 2010-01-26 15:23 cvWmN4R.ok -rw------- 1 nagios nagios 0 2010-01-26 15:23 cVvx4M8.ok -rw------- 1 nagios nagios 0 2010-01-26 15:23 cvtm0Hd.ok -rw------- 1 nagios nagios 0 2009-12-06 01:27 cvqSGDK -rw------- 1 nagios nagios 0 2010-01-26 15:23 cv6pJ1B.ok -rw------- 1 nagios nagios 0 2010-01-26 15:23 ctwW4SD.ok -rw------- 1 nagios nagios 0 2010-01-26 15:23 cTpIKAl.ok -rw------- 1 nagios nagios 0 2010-01-26 15:23 cTm3Vzw.ok -rw------- 1 nagios nagios 0 2010-01-26 15:23 ctCCihr.ok -rw------- 1 nagios nagios 0 2010-01-26 15:23 csRsM4M.ok -rw------- 1 nagios nagios 0 2010-01-26 15:23 cryC1Dq.ok -rw------- 1 nagios nagios 0 2010-01-26 15:23 cRR4Ubm.ok -rw------- 1 nagios nagios 0 2009-11-27 23:24 cqqacdj -rw------- 1 nagios nagios 0 2010-01-26 15:23 cq75lkN.ok -rw------- 1 nagios nagios 0 2010-01-26 15:23 cnxxq4z.ok -rw------- 1 nagios nagios 0 2010-01-26 15:23 cnDr6c0.ok -rw------- 1 nagios nagios 0 2010-01-26 15:23 cmxnujo.ok -rw------- 1 nagios nagios 0 2010-01-26 15:23 cMPxHgo.ok -rw------- 1 nagios nagios 0 2010-01-26 15:23 cMf26lf.ok -rw------- 1 nagios nagios 0 2010-01-26 15:23 cLksJml.ok -rw------- 1 nagios nagios 0 2010-01-26 15:23 ckVnJcE.ok -rw------- 1 nagios nagios 0 2010-01-26 15:23 cjjgyfh.ok -rw------- 1 nagios nagios 0 2010-01-26 15:23 cjCc8ov.ok -rw------- 1 nagios nagios 0 2010-01-26 15:23 chokVz4.ok -rw------- 1 nagios nagios 0 2010-01-26 15:23 cHN3b3M.ok -rw------- 1 nagios nagios 0 2010-01-26 15:23 cHlsWuh.ok -rw------- 1 nagios nagios 0 2009-12-11 19:19 checkyTpLfU -rw------- 1 nagios nagios 0 2009-11-05 04:12 checkXbP1al -rw------- 1 nagios nagios 0 2009-11-04 12:53 checkWx0m6z -rw------- 1 nagios nagios 0 2009-11-05 04:12 checkwpqhVE -rw------- 1 nagios nagios 0 2009-12-29 17:23 checkvMraRW -rw------- 1 nagios nagios 0 2009-12-11 19:20 checkqWrl04 -rw------- 1 nagios nagios 0 2009-11-04 12:53 checkQq7MX0 -rw------- 1 nagios nagios 0 2009-12-29 17:23 checknzRtjN -rw------- 1 nagios nagios 0 2009-11-05 04:12 checkM6htUZ -rw------- 1 nagios nagios 0 2009-11-27 23:24 checkhVbOzY -rw------- 1 nagios nagios 0 2009-11-17 12:43 checkEVDugp -rw------- 1 nagios nagios 0 2009-11-04 12:53 checkd4A1MS -rw------- 1 nagios nagios 0 2009-11-05 04:12 checkBXX71Z -rw------- 1 nagios nagios 0 2009-11-27 23:24 checkBLCU9i -rw------- 1 nagios nagios 0 2009-11-04 12:53 checkAaA6Qr -rw------- 1 nagios nagios 0 2009-12-06 01:27 check8avYJJ -rw------- 1 nagios nagios 0 2009-12-29 17:23 cGZFyHr -rw------- 1 nagios nagios 0 2010-01-26 15:23 cgBr4Y1.ok -rw------- 1 nagios nagios 0 2010-01-26 15:23 cFxFlm3.ok -rw------- 1 nagios nagios 0 2010-01-26 15:23 cD2T5eV.ok -rw------- 1 nagios nagios 0 2010-01-26 15:23 caw2Ihv.ok -rw------- 1 nagios nagios 0 2010-01-26 15:23 c9iVtCP.ok -rw------- 1 nagios nagios 0 2010-01-26 15:23 c7QYuXc.ok -rw------- 1 nagios nagios 0 2010-01-26 15:23 c75znoT.ok -rw------- 1 nagios nagios 0 2010-01-26 15:23 c6NV5yW.ok -rw------- 1 nagios nagios 0 2010-01-26 15:23 c59UhZL.ok -rw------- 1 nagios nagios 0 2010-01-26 15:23 c1m1KIr.ok drwxr-xr-x 3 root root 8 2009-10-20 01:44 ../ -rw------- 1 nagios nagios 249 2009-11-17 15:03 check47CLdy -rw------- 1 nagios nagios 252 2009-11-17 15:03 checkAE7iMM -rw------- 1 nagios nagios 277 2009-11-17 15:03 checkaNpheC -rw------- 1 nagios nagios 278 2010-01-26 15:23 checkWDzCTc -rw------- 1 nagios nagios 279 2010-01-26 15:23 checkpLyZS3 -rw------- 1 nagios nagios 280 2010-01-26 15:23 checkVSgE8U -rw------- 1 nagios nagios 281 2010-01-26 15:23 check6txgYX -rw------- 1 nagios nagios 285 2009-11-17 15:03 checkQXTDa5 -rw------- 1 nagios nagios 394 2010-01-26 15:23 cD2T5eV -rw------- 1 nagios nagios 395 2010-01-26 15:23 cgBr4Y1 -rw------- 1 nagios nagios 397 2010-01-26 15:23 cZar4eF -rw------- 1 nagios nagios 397 2010-01-26 15:23 ctwW4SD -rw------- 1 nagios nagios 401 2010-01-26 15:23 ctCCihr -rw------- 1 nagios nagios 401 2010-01-26 15:23 cjCc8ov -rw------- 1 nagios nagios 403 2010-01-26 15:23 cHlsWuh -rw------- 1 nagios nagios 404 2010-01-26 15:23 cRR4Ubm -rw------- 1 nagios nagios 408 2009-12-01 08:38 cV0kCF9 -rw------- 1 nagios nagios 409 2010-01-26 15:23 cTpIKAl -rw------- 1 nagios nagios 409 2010-01-26 15:23 cq75lkN -rw------- 1 nagios nagios 412 2010-01-26 15:23 cTm3Vzw -rw------- 1 nagios nagios 412 2009-12-11 19:20 ch4XVu5 -rw------- 1 nagios nagios 413 2010-01-26 15:23 c1m1KIr -rw------- 1 nagios nagios 416 2010-01-26 15:23 c9iVtCP -rw------- 1 nagios nagios 417 2010-01-26 15:23 chokVz4 -rw------- 1 nagios nagios 418 2010-01-26 15:23 cVvx4M8 -rw------- 1 nagios nagios 418 2010-01-26 15:23 cnxxq4z -rw------- 1 nagios nagios 418 2010-01-26 15:23 cLksJml -rw------- 1 nagios nagios 427 2010-01-26 15:23 cXaj1gN -rw------- 1 nagios nagios 433 2010-01-26 15:23 ckVnJcE -rw------- 1 nagios nagios 435 2010-01-26 15:23 cnDr6c0 -rw------- 1 nagios nagios 437 2010-01-26 15:23 cvWmN4R -rw------- 1 nagios nagios 437 2010-01-26 15:23 c6NV5yW -rw------- 1 nagios nagios 438 2010-01-26 15:23 cMPxHgo -rw------- 1 nagios nagios 440 2010-01-26 15:23 cMf26lf -rw------- 1 nagios nagios 442 2010-01-26 15:23 cmxnujo -rw------- 1 nagios nagios 442 2010-01-26 15:23 c59UhZL -rw------- 1 nagios nagios 445 2009-11-05 04:12 cdarn2Z -rw------- 1 nagios nagios 447 2010-01-26 15:23 csRsM4M -rw------- 1 nagios nagios 465 2010-01-26 15:23 cjjgyfh -rw------- 1 nagios nagios 467 2010-01-26 15:23 cryC1Dq -rw------- 1 nagios nagios 473 2010-01-26 15:23 cHN3b3M -rw------- 1 nagios nagios 474 2010-01-26 15:23 c7QYuXc -rw------- 1 nagios nagios 498 2010-01-26 15:23 cv6pJ1B -rw------- 1 nagios nagios 499 2010-01-26 15:23 c75znoT -rw------- 1 nagios nagios 504 2010-01-26 15:23 cFxFlm3 -rw------- 1 nagios nagios 506 2010-01-26 15:23 cvtm0Hd -rw------- 1 nagios nagios 507 2010-01-26 15:23 caw2Ihv drwxrwxr-x 2 nagios nagios 286M 2010-01-26 15:23 ./ root at indhlcwww1:/mnt/data/share/nagios-3.2.0/var/spool/checkresults # ls -ldh drwxrwxr-x 2 nagios nagios 286M 2010-01-26 15:24 ./ Thank you in advance for any advice! -- Casey Allen Shobe casey at shobe.info -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rperezm at uci.cu Tue Jan 26 17:44:22 2010 From: rperezm at uci.cu (ReynierPM) Date: Tue, 26 Jan 2010 11:44:22 -0500 Subject: Command execution failed In-Reply-To: <3AA81FC7-A5E7-45C4-8C98-315E3C8097FD@ena.com> References: <4B5E1520.60706@uci.cu> <4B5E2570.2060503@uci.cu> <3AA81FC7-A5E7-45C4-8C98-315E3C8097FD@ena.com> Message-ID: <4B5F1BE6.2040003@uci.cu> On 1/26/2010 12:01 AM, Marc Powell wrote: > Can you run check_bacula.pl as the nagios user on the remote machine? > If not, read any documentation you received with the plugin to see > what other changes you need to make. I'm guessing that other programs > it calls are not executable by the nagios user. I can run the command but get this message after: Can't locate DBI.pm in @INC (@INC contains: /etc/perl /usr/local/lib/perl/5.10.0 /usr/local/share/perl/5.10.0 /usr/lib/perl5 /usr/share/perl5 /usr/lib/perl/5.10 /usr/share/perl/5.10 /usr/local/lib/site_perl .) at /usr/local/nagios/libexec/check_bacula.pl line 39. BEGIN failed--compilation aborted at /usr/local/nagios/libexec/check_bacula.pl line 39. Maybe I need to install some dependencies but anyway I think that previous message error (Remote command execution failed: Permission denied, please try again) has nothing to do. -- Cheers ReynierPM ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From casey at shobe.info Tue Jan 26 18:29:18 2010 From: casey at shobe.info (Casey Allen Shobe) Date: Tue, 26 Jan 2010 12:29:18 -0500 Subject: Unexplainable disk space usage in nagios/var/spool/checkresults In-Reply-To: <863B3EE13473404BB5FA7516A1E31EF71235A00729@exchange-01.wolfe.local> References: <863B3EE13473404BB5FA7516A1E31EF71235A00729@exchange-01.wolfe.local> Message-ID: 2010/1/26 Wolfe, Robert > When you do an ?ls ?lash? on this directory, what do you see? > I already included that in my last post. The only difference was that I added -Sr to do a reverse sort by size (largest items at the bottom), and -F is included in my alias for ls, so '/' is shown at the end of directory names. However, this actually seems to be an issue with JFS and high turnover within a directory. http://www.mail-archive.com/jfs-discussion at lists.sourceforge.net/msg00748.HTML I've forwarded this along to the JFS list already. I don't suppose there's any way to make Nagios not use so many constantly changing files in this directory is there? # ls -laSrh > total 287M > -rw------- 1 nagios nagios 0 2010-01-26 15:23 cZar4eF.ok > -rw------- 1 nagios nagios 0 2010-01-04 19:15 cy8clT1.ok > -rw------- 1 nagios nagios 0 2010-01-26 15:23 cXaj1gN.ok > -rw------- 1 nagios nagios 0 2010-01-26 15:23 cvWmN4R.ok > -rw------- 1 nagios nagios 0 2010-01-26 15:23 cVvx4M8.ok > -rw------- 1 nagios nagios 0 2010-01-26 15:23 cvtm0Hd.ok > -rw------- 1 nagios nagios 0 2009-12-06 01:27 cvqSGDK > -rw------- 1 nagios nagios 0 2010-01-26 15:23 cv6pJ1B.ok > -rw------- 1 nagios nagios 0 2010-01-26 15:23 ctwW4SD.ok > -rw------- 1 nagios nagios 0 2010-01-26 15:23 cTpIKAl.ok > -rw------- 1 nagios nagios 0 2010-01-26 15:23 cTm3Vzw.ok > -rw------- 1 nagios nagios 0 2010-01-26 15:23 ctCCihr.ok > -rw------- 1 nagios nagios 0 2010-01-26 15:23 csRsM4M.ok > -rw------- 1 nagios nagios 0 2010-01-26 15:23 cryC1Dq.ok > -rw------- 1 nagios nagios 0 2010-01-26 15:23 cRR4Ubm.ok > -rw------- 1 nagios nagios 0 2009-11-27 23:24 cqqacdj > -rw------- 1 nagios nagios 0 2010-01-26 15:23 cq75lkN.ok > -rw------- 1 nagios nagios 0 2010-01-26 15:23 cnxxq4z.ok > -rw------- 1 nagios nagios 0 2010-01-26 15:23 cnDr6c0.ok > -rw------- 1 nagios nagios 0 2010-01-26 15:23 cmxnujo.ok > -rw------- 1 nagios nagios 0 2010-01-26 15:23 cMPxHgo.ok > -rw------- 1 nagios nagios 0 2010-01-26 15:23 cMf26lf.ok > -rw------- 1 nagios nagios 0 2010-01-26 15:23 cLksJml.ok > -rw------- 1 nagios nagios 0 2010-01-26 15:23 ckVnJcE.ok > -rw------- 1 nagios nagios 0 2010-01-26 15:23 cjjgyfh.ok > -rw------- 1 nagios nagios 0 2010-01-26 15:23 cjCc8ov.ok > -rw------- 1 nagios nagios 0 2010-01-26 15:23 chokVz4.ok > -rw------- 1 nagios nagios 0 2010-01-26 15:23 cHN3b3M.ok > -rw------- 1 nagios nagios 0 2010-01-26 15:23 cHlsWuh.ok > -rw------- 1 nagios nagios 0 2009-12-11 19:19 checkyTpLfU > -rw------- 1 nagios nagios 0 2009-11-05 04:12 checkXbP1al > -rw------- 1 nagios nagios 0 2009-11-04 12:53 checkWx0m6z > -rw------- 1 nagios nagios 0 2009-11-05 04:12 checkwpqhVE > -rw------- 1 nagios nagios 0 2009-12-29 17:23 checkvMraRW > -rw------- 1 nagios nagios 0 2009-12-11 19:20 checkqWrl04 > -rw------- 1 nagios nagios 0 2009-11-04 12:53 checkQq7MX0 > -rw------- 1 nagios nagios 0 2009-12-29 17:23 checknzRtjN > -rw------- 1 nagios nagios 0 2009-11-05 04:12 checkM6htUZ > -rw------- 1 nagios nagios 0 2009-11-27 23:24 checkhVbOzY > -rw------- 1 nagios nagios 0 2009-11-17 12:43 checkEVDugp > -rw------- 1 nagios nagios 0 2009-11-04 12:53 checkd4A1MS > -rw------- 1 nagios nagios 0 2009-11-05 04:12 checkBXX71Z > -rw------- 1 nagios nagios 0 2009-11-27 23:24 checkBLCU9i > -rw------- 1 nagios nagios 0 2009-11-04 12:53 checkAaA6Qr > -rw------- 1 nagios nagios 0 2009-12-06 01:27 check8avYJJ > -rw------- 1 nagios nagios 0 2009-12-29 17:23 cGZFyHr > -rw------- 1 nagios nagios 0 2010-01-26 15:23 cgBr4Y1.ok > -rw------- 1 nagios nagios 0 2010-01-26 15:23 cFxFlm3.ok > -rw------- 1 nagios nagios 0 2010-01-26 15:23 cD2T5eV.ok > -rw------- 1 nagios nagios 0 2010-01-26 15:23 caw2Ihv.ok > -rw------- 1 nagios nagios 0 2010-01-26 15:23 c9iVtCP.ok > -rw------- 1 nagios nagios 0 2010-01-26 15:23 c7QYuXc.ok > -rw------- 1 nagios nagios 0 2010-01-26 15:23 c75znoT.ok > -rw------- 1 nagios nagios 0 2010-01-26 15:23 c6NV5yW.ok > -rw------- 1 nagios nagios 0 2010-01-26 15:23 c59UhZL.ok > -rw------- 1 nagios nagios 0 2010-01-26 15:23 c1m1KIr.ok > drwxr-xr-x 3 root root 8 2009-10-20 01:44 ../ > -rw------- 1 nagios nagios 249 2009-11-17 15:03 check47CLdy > -rw------- 1 nagios nagios 252 2009-11-17 15:03 checkAE7iMM > -rw------- 1 nagios nagios 277 2009-11-17 15:03 checkaNpheC > -rw------- 1 nagios nagios 278 2010-01-26 15:23 checkWDzCTc > -rw------- 1 nagios nagios 279 2010-01-26 15:23 checkpLyZS3 > -rw------- 1 nagios nagios 280 2010-01-26 15:23 checkVSgE8U > -rw------- 1 nagios nagios 281 2010-01-26 15:23 check6txgYX > -rw------- 1 nagios nagios 285 2009-11-17 15:03 checkQXTDa5 > -rw------- 1 nagios nagios 394 2010-01-26 15:23 cD2T5eV > -rw------- 1 nagios nagios 395 2010-01-26 15:23 cgBr4Y1 > -rw------- 1 nagios nagios 397 2010-01-26 15:23 cZar4eF > -rw------- 1 nagios nagios 397 2010-01-26 15:23 ctwW4SD > -rw------- 1 nagios nagios 401 2010-01-26 15:23 ctCCihr > -rw------- 1 nagios nagios 401 2010-01-26 15:23 cjCc8ov > -rw------- 1 nagios nagios 403 2010-01-26 15:23 cHlsWuh > -rw------- 1 nagios nagios 404 2010-01-26 15:23 cRR4Ubm > -rw------- 1 nagios nagios 408 2009-12-01 08:38 cV0kCF9 > -rw------- 1 nagios nagios 409 2010-01-26 15:23 cTpIKAl > -rw------- 1 nagios nagios 409 2010-01-26 15:23 cq75lkN > -rw------- 1 nagios nagios 412 2010-01-26 15:23 cTm3Vzw > -rw------- 1 nagios nagios 412 2009-12-11 19:20 ch4XVu5 > -rw------- 1 nagios nagios 413 2010-01-26 15:23 c1m1KIr > -rw------- 1 nagios nagios 416 2010-01-26 15:23 c9iVtCP > -rw------- 1 nagios nagios 417 2010-01-26 15:23 chokVz4 > -rw------- 1 nagios nagios 418 2010-01-26 15:23 cVvx4M8 > -rw------- 1 nagios nagios 418 2010-01-26 15:23 cnxxq4z > -rw------- 1 nagios nagios 418 2010-01-26 15:23 cLksJml > -rw------- 1 nagios nagios 427 2010-01-26 15:23 cXaj1gN > -rw------- 1 nagios nagios 433 2010-01-26 15:23 ckVnJcE > -rw------- 1 nagios nagios 435 2010-01-26 15:23 cnDr6c0 > -rw------- 1 nagios nagios 437 2010-01-26 15:23 cvWmN4R > -rw------- 1 nagios nagios 437 2010-01-26 15:23 c6NV5yW > -rw------- 1 nagios nagios 438 2010-01-26 15:23 cMPxHgo > -rw------- 1 nagios nagios 440 2010-01-26 15:23 cMf26lf > -rw------- 1 nagios nagios 442 2010-01-26 15:23 cmxnujo > -rw------- 1 nagios nagios 442 2010-01-26 15:23 c59UhZL > -rw------- 1 nagios nagios 445 2009-11-05 04:12 cdarn2Z > -rw------- 1 nagios nagios 447 2010-01-26 15:23 csRsM4M > -rw------- 1 nagios nagios 465 2010-01-26 15:23 cjjgyfh > -rw------- 1 nagios nagios 467 2010-01-26 15:23 cryC1Dq > -rw------- 1 nagios nagios 473 2010-01-26 15:23 cHN3b3M > -rw------- 1 nagios nagios 474 2010-01-26 15:23 c7QYuXc > -rw------- 1 nagios nagios 498 2010-01-26 15:23 cv6pJ1B > -rw------- 1 nagios nagios 499 2010-01-26 15:23 c75znoT > -rw------- 1 nagios nagios 504 2010-01-26 15:23 cFxFlm3 > -rw------- 1 nagios nagios 506 2010-01-26 15:23 cvtm0Hd > -rw------- 1 nagios nagios 507 2010-01-26 15:23 caw2Ihv > drwxrwxr-x 2 nagios nagios 286M 2010-01-26 15:23 ./ > Cheers, -- Casey Allen Shobe casey at shobe.info -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Tue Jan 26 18:38:56 2010 From: marc at ena.com (Marc Powell) Date: Tue, 26 Jan 2010 11:38:56 -0600 Subject: Unexplainable disk space usage in nagios/var/spool/checkresults In-Reply-To: References: Message-ID: <5E69D47C-1170-4DC0-B118-0F3538C55EAB@ena.com> On Jan 26, 2010, at 9:44 AM, Casey Allen Shobe wrote: > Hi! > > I have Nagios installed on a 1GB partition, with everything installed under a nagios directory on that partition. I did not expect Nagios to start consuming a large amount of space, and now that it is, I cannot determine why or in what files, which is baffling. > > I've found the biggest culprit is the var/spool/checkresults directory within the Nagios installation. a `du` command on the directory shows that it is using 287MB. However, when I `ls -la` within that directory, I can't see any file using more than a single kilobyte, and there aren't that many files. However, an `ls -ld` on the directory itself shows that the directory itself is using 286MB. That doesn't make sense to me, and so I'm looking for input! My gut instinct is that this is jfs doing something unexpected, in which case I can't offer much advice. The first thing I'd verify though is that nagios isn't actually still holding a bunch of files open in that directory, even after they've been removed. The disk space would still be seen as used even though you can't see the files any longer. Use 'lsof | grep checkresults' and see if you have any output for files that can't be seen in the directory any longer. I expect that you would see thousands or more if that is the case. > I wonder if the following approach would help correct things (but I'd like to understand why they're in this state first): Stop nagios, move the checkresults directory to checkresults.old, make a new checkresults directory, copy all the old files into it, remove the empty .old directory, start nagios. That would certainly work in either case... -- marc ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From michelleavery8 at gmail.com Tue Jan 26 19:54:41 2010 From: michelleavery8 at gmail.com (Michelle Avery) Date: Tue, 26 Jan 2010 13:54:41 -0500 Subject: CGI authentication problems Message-ID: I originally set up my nagios environment with an administrative user named nagiosadmin and basic authentication. At that time everything worked fine. I tried to set up Active Directory authentication, but that caused numerous problems, including the fact that my Active Directory users couldn't do things like re-schedule service checks. At the time, I thought this was an AD issue. I have since decided to abandon AD authentication altogether, although I'm now using Digest authentication (per the official documentation) instead of Basic. Now, however, I'm still finding that I can't re-schedule service checks (or schedule downtime) as any user other than nagiosadmin. I have created passwords for both users in my .digest_pw file. My cgi.cfg file shows: physical_html_path=/usr/share/nagios3/htdocs url_html_path=/nagios3 use_authentication=1 default_user_name= authorized_for_system_information=michellea,nagiosadmin authorized_for_system_commands=michellea,nagiosadmin authorized_for_configuration_information=michellea,nagiosadmin authorized_for_all_hosts=michellea,nagiosadmin authorized_for_all_host_commands=michellea,nagiosadmin authorized_for_all_services=michellea,nagiosadmin authorized_for_all_service_commands=michellea,nagiosadmin default_statusmap_layout=5 default_statuswrl_layout=4 refresh_rate=90 ping_syntax=/bin/ping -n -U -c 5 $HOSTADDRESS$ main_config_file=/etc/nagios3/nagios.cfg The user michellea is also set as a contact for all of the hosts. When I log in to the web interface, it shows "Logged in as *michellea*." I can see all information for all hosts and services, but I still can't issue commands. What am I missing? Michelle -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From michelleavery8 at gmail.com Tue Jan 26 20:34:32 2010 From: michelleavery8 at gmail.com (Michelle Avery) Date: Tue, 26 Jan 2010 14:34:32 -0500 Subject: CGI authentication problems In-Reply-To: <5bcb62b61001261105r55e06320sfdc50cd8f9f9677d@mail.gmail.com> References: <5bcb62b61001261105r55e06320sfdc50cd8f9f9677d@mail.gmail.com> Message-ID: I do use Likewise-Open for authentication to the system, but I was trying to make the Apache authentication completely independent. For some reason, until you asked me that, it didn't even occur that there could be a conflict between the two. I just tried adding a completely different user (michellea is the active directory user name), and that works perfectly fine. Apparently the two of them were conflicting. Thank you! On Tue, Jan 26, 2010 at 2:05 PM, Matt Simmons wrote: > You aren't using Likewise-Open for your AD authentication, are you? > > --Matt > > > On Tue, Jan 26, 2010 at 1:54 PM, Michelle Avery > wrote: > > I originally set up my nagios environment with an administrative user > named > > nagiosadmin and basic authentication. At that time everything worked > fine. > > I tried to set up Active Directory authentication, but that caused > numerous > > problems, including the fact that my Active Directory users couldn't do > > things like re-schedule service checks. At the time, I thought this was > an > > AD issue. I have since decided to abandon AD authentication altogether, > > although I'm now using Digest authentication (per the official > > documentation) instead of Basic. Now, however, I'm still finding that I > > can't re-schedule service checks (or schedule downtime) as any user other > > than nagiosadmin. I have created passwords for both users in my > .digest_pw > > file. My cgi.cfg file shows: > > > > physical_html_path=/usr/share/nagios3/htdocs > > url_html_path=/nagios3 > > use_authentication=1 > > default_user_name= > > authorized_for_system_information=michellea,nagiosadmin > > authorized_for_system_commands=michellea,nagiosadmin > > authorized_for_configuration_information=michellea,nagiosadmin > > authorized_for_all_hosts=michellea,nagiosadmin > > authorized_for_all_host_commands=michellea,nagiosadmin > > authorized_for_all_services=michellea,nagiosadmin > > authorized_for_all_service_commands=michellea,nagiosadmin > > default_statusmap_layout=5 > > default_statuswrl_layout=4 > > refresh_rate=90 > > ping_syntax=/bin/ping -n -U -c 5 $HOSTADDRESS$ > > main_config_file=/etc/nagios3/nagios.cfg > > > > The user michellea is also set as a contact for all of the hosts. When I > log > > in to the web interface, it shows "Logged in as michellea." I can see > all > > information for all hosts and services, but I still can't issue commands. > > What am I missing? > > > > Michelle > > > > > ------------------------------------------------------------------------------ > > The Planet: dedicated and managed hosting, cloud storage, colocation > > Stay online with enterprise data centers and the best network in the > > business > > Choose flexible plans and management services without long-term contracts > > Personal 24x7 support from experience hosting pros just a phone call > away. > > http://p.sf.net/sfu/theplanet-com > > _______________________________________________ > > Nagios-users mailing list > > Nagios-users at lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when > reporting > > any issue. > > ::: Messages without supporting info will risk being sent to /dev/null > > > > > > -- > > LITTLE GIRL: But which cookie will you eat FIRST? > COOKIE MONSTER: Me think you have misconception of cookie-eating process. > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Tue Jan 26 20:41:26 2010 From: marc at ena.com (Marc Powell) Date: Tue, 26 Jan 2010 13:41:26 -0600 Subject: CGI authentication problems In-Reply-To: References: Message-ID: <3F6B0FB7-FB1A-4426-8D52-76C144086A37@ena.com> On Jan 26, 2010, at 12:54 PM, Michelle Avery wrote: > The user michellea is also set as a contact for all of the hosts. When I log in to the web interface, it shows "Logged in as michellea." I can see all information for all hosts and services, but I still can't issue commands. What am I missing? What's the error that you see? Did you enable external commands in nagios.cfg and specify a valid path for the command pipe? Does it exist and is it writable by the user your web server runs as? -- Marc ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From asraikhn at gmail.com Wed Jan 27 07:19:56 2010 From: asraikhn at gmail.com (Asrai khn) Date: Wed, 27 Jan 2010 11:19:56 +0500 Subject: Defining service dependency In-Reply-To: <5f0f8dba1001260036m7d471e88r5438d8b59026b9c2@mail.gmail.com> References: <5f0f8dba1001250821s31fc4d0r84e8bbe55e2aa382@mail.gmail.com> <4B5DC700.3000009@flatto.net> <5f0f8dba1001252133j33441ef0h51354407b12ea2ee@mail.gmail.com> <5f0f8dba1001260036m7d471e88r5438d8b59026b9c2@mail.gmail.com> Message-ID: <5f0f8dba1001262219x54f6a96fj19d40b7864b7d748@mail.gmail.com> Anyone please? I wonders why 1500+ dependencies? Thanks in advance. On Tue, Jan 26, 2010 at 1:36 PM, Asrai khn wrote: > > > On Tue, Jan 26, 2010 at 10:33 AM, Asrai khn wrote: > >> >> >> Later I'll switch my dependency configuration to something simple by using >> "Time-saving Tricks" ---> >> http://nagios.sourceforge.net/docs/3_0/objecttricks.html#same_host_dependency >> >> Tell me one thing, is it ok to get ... >> >> Checking service dependencies... >> Checked 10 service dependencies. >> >> For the 5 dependent_service_description above? >> >> >> >> I have just configured service dependency using "*All Services In > Multiple Servicegroups" *nagios docs. > > 1. First of all i have created (4) servicegroups for services which using > nrpe for checking remote hosts including a servicegroup for service 'nrpe' > on 8 different hosts > > The other 3 servicegroups contains total 95 services > > 2. Then defined service dependency using following config > > define servicedependency{ > dependent_servicegroup_name > Sol-CPU-Load,Sol-Partitions,Sol-Processes > servicegroup_name Sol-nrpe > > execution_failure_criteria w,u,c > notification_failure_criteria w,u,c > } > > Everything is working just fine I have tested it by stopping nrpe on one of > the server and nagios only send notification for master process 'nrpe' by > suppressing all all other checks and notification for the host. > > However I am confusing with the nagios -v check its giving me too much > counts for service dependency and i wonders if this have any effect on > monitoring host (nagios) performance, working etc. > > *sudo nagios -v /etc/nagios/nagios.cfg > > Checking service dependencies... > Checked 1530 service dependencies.* > > 1530 service dependencies..... and this also include the previous 10 for > another server, but I wonders what this counts mean? > > I'll greatly appreciate if someone shed some light on it. > > Thanks. > > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From dermoth at aei.ca Wed Jan 27 07:40:48 2010 From: dermoth at aei.ca (Thomas Guyot-Sionnest) Date: Wed, 27 Jan 2010 01:40:48 -0500 Subject: Unexplainable disk space usage in nagios/var/spool/checkresults In-Reply-To: References: <863B3EE13473404BB5FA7516A1E31EF71235A00729@exchange-01.wolfe.local> Message-ID: <4B5FDFF0.4070300@aei.ca> -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 On 26/01/10 12:29 PM, Casey Allen Shobe wrote: > 2010/1/26 Wolfe, Robert > > > When you do an ?ls ?lash? on this directory, what do you see? > > > I already included that in my last post. The only difference was that I > added -Sr to do a reverse sort by size (largest items at the bottom), > and -F is included in my alias for ls, so '/' is shown at the end of > directory names. > > However, this actually seems to be an issue with JFS and high turnover > within a directory. > http://www.mail-archive.com/jfs-discussion at lists.sourceforge.net/msg00748.HTML I would highly recommend using tmpfs for the checkresults directory. This is especially important on logging file systems like JFS, XFS, Ext3+ and ReiserFS since every meta-data operation will cause a log write+flush (at the very least) which will cause a constant stream of unneeded IO. 1. Add this line to fstab: checkresults /usr/local/nagios/var/checkresults tmpfs uid=1001,gid=103,mode=0755 0 0 (in one line, nevermind auto line-warp... make sure the uid/gid are that of the nagios user). 2. Make sure the dir exists (it should if nagios uses it!) and that nagios is stopped. Delete any stale file and/or re-create the directory if you need to free space used by it (i.e. Jfs bug), then mount it (it should also get mounted at boot time before Nagios starts since it's in the fstab): # mount -a 3. Start nagios - it should now write all files in a memory fs (tmpfs). Note: you can also add the size=nbytes mount option... it limits the amount of RAM that can be used by the tmpfs, however unless you hit a nasty bug it shouldn't be a problem (plus you can monitor it with check_disk...) Note 2: I also use one for temp/status.dat files... - -- Thomas -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.6 (GNU/Linux) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org iD8DBQFLX9/w6dZ+Kt5BchYRAkgXAKDrkRWtY2O1sRg2eD8h1FbkWlvMNgCgmtq2 fFZKTDXXrNaPgVf6CLZlD7g= =WfLm -----END PGP SIGNATURE----- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From dermoth at aei.ca Wed Jan 27 07:50:36 2010 From: dermoth at aei.ca (Thomas Guyot-Sionnest) Date: Wed, 27 Jan 2010 01:50:36 -0500 Subject: Unexplainable disk space usage in nagios/var/spool/checkresults In-Reply-To: <4B5FDFF0.4070300@aei.ca> References: <863B3EE13473404BB5FA7516A1E31EF71235A00729@exchange-01.wolfe.local> <4B5FDFF0.4070300@aei.ca> Message-ID: <4B5FE23C.2020008@aei.ca> -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 On 27/01/10 01:40 AM, Thomas Guyot-Sionnest wrote: > This is especially important on logging file systems like JFS, XFS, I meant "journalled" file systems. I need sleep... :) - -- Thomas -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.6 (GNU/Linux) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org iD8DBQFLX+I86dZ+Kt5BchYRAj9MAJ4xeVasEGU0+IIaInK+eDHGIwE/xACgvPS0 DE9kkm4+IBWe9GSNpb1g9/M= =BXu8 -----END PGP SIGNATURE----- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From loesprite at gmail.com Wed Jan 27 08:30:06 2010 From: loesprite at gmail.com (Sunny Soung) Date: Wed, 27 Jan 2010 15:30:06 +0800 Subject: Need to monitor Nagios Service Automatically Message-ID: Hi everyone, I'm using Nagios to monitor some servers. All the things are good. But these days I'm thinking about one question - What if Nagios went down?? If my Nagios went down, I would never get any notification. That would be really bad. *So is there a good solution to monitor the Nagios service??* Advise will be appreciated. Thanks and regards, Sunny, -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From martinm at op5.org Wed Jan 27 08:54:22 2010 From: martinm at op5.org (Martin Melin) Date: Wed, 27 Jan 2010 08:54:22 +0100 Subject: Need to monitor Nagios Service Automatically In-Reply-To: References: Message-ID: <5f0bcb7c1001262354g1e953961l5421dab364cd1b56@mail.gmail.com> On Wed, Jan 27, 2010 at 8:30 AM, Sunny Soung wrote: > Hi everyone, > > I'm using Nagios to monitor some servers. All the things are good. But > these days I'm thinking about one question - What if Nagios went down?? > If my Nagios went down, I would never get any notification. That would be > really bad. > I agree, that is a very bad thing :-) > *So is there a good solution to monitor the Nagios service??* > It depends on how much effort you want to spend and how critical Nagios is for your organization. If you just want to know if Nagios goes down, you can simply set up a second machine to monitor your Nagios host and notify you when it goes down or stops working. However this means that your other machines are unmonitored for the time it takes you to get your Nagios machine back online - and in case of hardware failure that could take a while. Most people for whom Nagios is a critical tool implement some sort of redundancy instead, where Nagios continues to monitor your network even if an individual Nagios host goes down. There is a good overview of your options in the documentation: http://nagios.sourceforge.net/docs/3_0/redundancy.html Also, the smartass answer to "is there a good solution to monitor the Nagios service" is http://nagiosplugins.org/man/check_nagios > Advise will be appreciated. > > Thanks and regards, > Sunny, > > > ------------------------------------------------------------------------------ > The Planet: dedicated and managed hosting, cloud storage, colocation > Stay online with enterprise data centers and the best network in the > business > Choose flexible plans and management services without long-term contracts > Personal 24x7 support from experience hosting pros just a phone call away. > http://p.sf.net/sfu/theplanet-com > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Martin Melin ____________________________ op5 AB http://www.op5.com http://www.op5.org/ http://www.op5.com/op5/products/network-monitor/nagios/ -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From asraikhn at gmail.com Wed Jan 27 10:39:17 2010 From: asraikhn at gmail.com (Asrai khn) Date: Wed, 27 Jan 2010 14:39:17 +0500 Subject: Need to monitor Nagios Service Automatically In-Reply-To: <5f0bcb7c1001262354g1e953961l5421dab364cd1b56@mail.gmail.com> References: <5f0bcb7c1001262354g1e953961l5421dab364cd1b56@mail.gmail.com> Message-ID: <5f0f8dba1001270139m409b5813q470b4dd45d8030@mail.gmail.com> On Wed, Jan 27, 2010 at 12:54 PM, Martin Melin wrote: > On Wed, Jan 27, 2010 at 8:30 AM, Sunny Soung wrote: > >> Hi everyone, >> >> I'm using Nagios to monitor some servers. All the things are good. But >> these days I'm thinking about one question - What if Nagios went down?? >> If my Nagios went down, I would never get any notification. That would be >> really bad. >> > > I agree, that is a very bad thing :-) > > >> *So is there a good solution to monitor the Nagios service??* >> > > It depends on how much effort you want to spend and how critical Nagios is > for your organization. If you just want to know if Nagios goes down, you can > simply set up a second machine to monitor your Nagios host and notify you > when it goes down or stops working. However this means that your other > machines are unmonitored for the time it takes you to get your Nagios > machine back online - and in case of hardware failure that could take a > while. > > Most people for whom Nagios is a critical tool implement some sort of > redundancy instead, where Nagios continues to monitor your network even if > an individual Nagios host goes down. There is a good overview of your > options in the documentation: > http://nagios.sourceforge.net/docs/3_0/redundancy.html > > Also, the smartass answer to "is there a good solution to monitor the > Nagios service" is http://nagiosplugins.org/man/check_nagios > > In check_nagios example they have given nagios.log but I thinks it should be 'status.dat', for example in my case when nagios is installed from binary package.. /usr/lib/nagios/plugins/check_nagios -e 5 -F /var/nagios/status.dat -C /usr/bin/nagios NAGIOS OK: 1 process, status log updated 9 seconds ago I am interesting in implementing this check, any other example will be greatly appreciated. Thanks. > Advise will be appreciated. >> >> Thanks and regards, >> Sunny, >> >> >> ------------------------------------------------------------------------------ >> The Planet: dedicated and managed hosting, cloud storage, colocation >> Stay online with enterprise data centers and the best network in the >> business >> Choose flexible plans and management services without long-term contracts >> Personal 24x7 support from experience hosting pros just a phone call away. >> http://p.sf.net/sfu/theplanet-com >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when >> reporting any issue. >> ::: Messages without supporting info will risk being sent to /dev/null >> > > > > -- > Martin Melin > ____________________________ > op5 AB > http://www.op5.com > > http://www.op5.org/ > http://www.op5.com/op5/products/network-monitor/nagios/ > > > ------------------------------------------------------------------------------ > The Planet: dedicated and managed hosting, cloud storage, colocation > Stay online with enterprise data centers and the best network in the > business > Choose flexible plans and management services without long-term contracts > Personal 24x7 support from experience hosting pros just a phone call away. > http://p.sf.net/sfu/theplanet-com > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From martinm at op5.org Wed Jan 27 11:03:21 2010 From: martinm at op5.org (Martin Melin) Date: Wed, 27 Jan 2010 11:03:21 +0100 Subject: Need to monitor Nagios Service Automatically In-Reply-To: <5f0f8dba1001270139m409b5813q470b4dd45d8030@mail.gmail.com> References: <5f0bcb7c1001262354g1e953961l5421dab364cd1b56@mail.gmail.com> <5f0f8dba1001270139m409b5813q470b4dd45d8030@mail.gmail.com> Message-ID: <5f0bcb7c1001270203g639db321r397397a9593aa233@mail.gmail.com> On Wed, Jan 27, 2010 at 10:39 AM, Asrai khn wrote: > > > On Wed, Jan 27, 2010 at 12:54 PM, Martin Melin wrote: > >> On Wed, Jan 27, 2010 at 8:30 AM, Sunny Soung wrote: >> >>> Hi everyone, >>> >>> I'm using Nagios to monitor some servers. All the things are good. But >>> these days I'm thinking about one question - What if Nagios went down?? >>> If my Nagios went down, I would never get any notification. That would be >>> really bad. >>> >> >> I agree, that is a very bad thing :-) >> >> >>> *So is there a good solution to monitor the Nagios service??* >>> >> >> It depends on how much effort you want to spend and how critical Nagios is >> for your organization. If you just want to know if Nagios goes down, you can >> simply set up a second machine to monitor your Nagios host and notify you >> when it goes down or stops working. However this means that your other >> machines are unmonitored for the time it takes you to get your Nagios >> machine back online - and in case of hardware failure that could take a >> while. >> >> Most people for whom Nagios is a critical tool implement some sort of >> redundancy instead, where Nagios continues to monitor your network even if >> an individual Nagios host goes down. There is a good overview of your >> options in the documentation: >> http://nagios.sourceforge.net/docs/3_0/redundancy.html >> >> Also, the smartass answer to "is there a good solution to monitor the >> Nagios service" is http://nagiosplugins.org/man/check_nagios >> >> > > > In check_nagios example they have given nagios.log but I thinks it should > be 'status.dat', for example in my case when nagios is installed from binary > package.. > > /usr/lib/nagios/plugins/check_nagios -e 5 -F /var/nagios/status.dat -C > /usr/bin/nagios > NAGIOS OK: 1 process, status log updated 9 seconds ago > > I am interesting in implementing this check, any other example will be > greatly appreciated. > check_nagios is not very complicated. Per the manpage, it wants to be pointed to status.log, not nagios.log or status.dat. The reason for that is simply that it checks that the last entry in the log is within N minutes of the current time (set by -e 5 to 5 minutes). It also checks that a Nagios process is present in ps. What kind of examples are you looking for? > > Thanks. > > > >> Advise will be appreciated. >>> >>> Thanks and regards, >>> Sunny, >>> >>> >>> ------------------------------------------------------------------------------ >>> The Planet: dedicated and managed hosting, cloud storage, colocation >>> Stay online with enterprise data centers and the best network in the >>> business >>> Choose flexible plans and management services without long-term contracts >>> Personal 24x7 support from experience hosting pros just a phone call >>> away. >>> http://p.sf.net/sfu/theplanet-com >>> _______________________________________________ >>> Nagios-users mailing list >>> Nagios-users at lists.sourceforge.net >>> https://lists.sourceforge.net/lists/listinfo/nagios-users >>> ::: Please include Nagios version, plugin version (-v) and OS when >>> reporting any issue. >>> ::: Messages without supporting info will risk being sent to /dev/null >>> >> >> >> >> -- >> Martin Melin >> ____________________________ >> op5 AB >> http://www.op5.com >> >> http://www.op5.org/ >> http://www.op5.com/op5/products/network-monitor/nagios/ >> >> >> ------------------------------------------------------------------------------ >> The Planet: dedicated and managed hosting, cloud storage, colocation >> Stay online with enterprise data centers and the best network in the >> business >> Choose flexible plans and management services without long-term contracts >> Personal 24x7 support from experience hosting pros just a phone call away. >> http://p.sf.net/sfu/theplanet-com >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when >> reporting any issue. >> ::: Messages without supporting info will risk being sent to /dev/null >> > > > > ------------------------------------------------------------------------------ > The Planet: dedicated and managed hosting, cloud storage, colocation > Stay online with enterprise data centers and the best network in the > business > Choose flexible plans and management services without long-term contracts > Personal 24x7 support from experience hosting pros just a phone call away. > http://p.sf.net/sfu/theplanet-com > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Martin Melin ____________________________ op5 AB http://www.op5.com http://www.op5.org/ http://www.op5.com/op5/products/network-monitor/nagios/ -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From asraikhn at gmail.com Wed Jan 27 11:20:22 2010 From: asraikhn at gmail.com (Asrai khn) Date: Wed, 27 Jan 2010 15:20:22 +0500 Subject: Need to monitor Nagios Service Automatically In-Reply-To: <5f0bcb7c1001270203g639db321r397397a9593aa233@mail.gmail.com> References: <5f0bcb7c1001262354g1e953961l5421dab364cd1b56@mail.gmail.com> <5f0f8dba1001270139m409b5813q470b4dd45d8030@mail.gmail.com> <5f0bcb7c1001270203g639db321r397397a9593aa233@mail.gmail.com> Message-ID: <5f0f8dba1001270220p704edfbbiaed64011c2229b50@mail.gmail.com> On Wed, Jan 27, 2010 at 3:03 PM, Martin Melin wrote: > > On Wed, Jan 27, 2010 at 10:39 AM, Asrai khn wrote: > >> >> >> On Wed, Jan 27, 2010 at 12:54 PM, Martin Melin wrote: >> >>> On Wed, Jan 27, 2010 at 8:30 AM, Sunny Soung wrote: >>> >>>> Hi everyone, >>>> >>>> I'm using Nagios to monitor some servers. All the things are good. But >>>> these days I'm thinking about one question - What if Nagios went down?? >>>> If my Nagios went down, I would never get any notification. That would >>>> be really bad. >>>> >>> >>> I agree, that is a very bad thing :-) >>> >>> >>>> *So is there a good solution to monitor the Nagios service??* >>>> >>> >>> It depends on how much effort you want to spend and how critical Nagios >>> is for your organization. If you just want to know if Nagios goes down, you >>> can simply set up a second machine to monitor your Nagios host and notify >>> you when it goes down or stops working. However this means that your other >>> machines are unmonitored for the time it takes you to get your Nagios >>> machine back online - and in case of hardware failure that could take a >>> while. >>> >>> Most people for whom Nagios is a critical tool implement some sort of >>> redundancy instead, where Nagios continues to monitor your network even if >>> an individual Nagios host goes down. There is a good overview of your >>> options in the documentation: >>> http://nagios.sourceforge.net/docs/3_0/redundancy.html >>> >>> Also, the smartass answer to "is there a good solution to monitor the >>> Nagios service" is http://nagiosplugins.org/man/check_nagios >>> >>> >> >> >> In check_nagios example they have given nagios.log but I thinks it should >> be 'status.dat', for example in my case when nagios is installed from binary >> package.. >> >> /usr/lib/nagios/plugins/check_nagios -e 5 -F /var/nagios/status.dat -C >> /usr/bin/nagios >> NAGIOS OK: 1 process, status log updated 9 seconds ago >> >> I am interesting in implementing this check, any other example will be >> greatly appreciated. >> > > check_nagios is not very complicated. Per the manpage, it wants to be > pointed to status.log, not nagios.log or status.dat. The reason for that is > simply that it checks that the last entry in the log is within N minutes of > the current time (set by -e 5 to 5 minutes). It also checks that a Nagios > process is present in ps. What kind of examples are you looking for? > > Thanks for the explanation. I am looking for usage examples, and I thinks the one given in the check_nagios manpage is fine. /usr/lib/nagios/plugins/check_nagios -e 5 -F /var/nagios/status.dat -C /usr/bin/nagios > >> Thanks. >> >> >> >>> Advise will be appreciated. >>>> >>>> Thanks and regards, >>>> Sunny, >>>> >>>> >>>> ------------------------------------------------------------------------------ >>>> The Planet: dedicated and managed hosting, cloud storage, colocation >>>> Stay online with enterprise data centers and the best network in the >>>> business >>>> Choose flexible plans and management services without long-term >>>> contracts >>>> Personal 24x7 support from experience hosting pros just a phone call >>>> away. >>>> http://p.sf.net/sfu/theplanet-com >>>> _______________________________________________ >>>> Nagios-users mailing list >>>> Nagios-users at lists.sourceforge.net >>>> https://lists.sourceforge.net/lists/listinfo/nagios-users >>>> ::: Please include Nagios version, plugin version (-v) and OS when >>>> reporting any issue. >>>> ::: Messages without supporting info will risk being sent to /dev/null >>>> >>> >>> >>> >>> -- >>> Martin Melin >>> ____________________________ >>> op5 AB >>> http://www.op5.com >>> >>> http://www.op5.org/ >>> http://www.op5.com/op5/products/network-monitor/nagios/ >>> >>> >>> ------------------------------------------------------------------------------ >>> The Planet: dedicated and managed hosting, cloud storage, colocation >>> Stay online with enterprise data centers and the best network in the >>> business >>> Choose flexible plans and management services without long-term contracts >>> Personal 24x7 support from experience hosting pros just a phone call >>> away. >>> http://p.sf.net/sfu/theplanet-com >>> _______________________________________________ >>> Nagios-users mailing list >>> Nagios-users at lists.sourceforge.net >>> https://lists.sourceforge.net/lists/listinfo/nagios-users >>> ::: Please include Nagios version, plugin version (-v) and OS when >>> reporting any issue. >>> ::: Messages without supporting info will risk being sent to /dev/null >>> >> >> >> >> ------------------------------------------------------------------------------ >> The Planet: dedicated and managed hosting, cloud storage, colocation >> Stay online with enterprise data centers and the best network in the >> business >> Choose flexible plans and management services without long-term contracts >> Personal 24x7 support from experience hosting pros just a phone call away. >> http://p.sf.net/sfu/theplanet-com >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when >> reporting any issue. >> ::: Messages without supporting info will risk being sent to /dev/null >> > > > > -- > Martin Melin > ____________________________ > op5 AB > http://www.op5.com > > http://www.op5.org/ > http://www.op5.com/op5/products/network-monitor/nagios/ > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From casey at shobe.info Wed Jan 27 14:22:54 2010 From: casey at shobe.info (Casey Allen Shobe) Date: Wed, 27 Jan 2010 08:22:54 -0500 Subject: Unexplainable disk space usage in nagios/var/spool/checkresults In-Reply-To: <4B5FDFF0.4070300@aei.ca> References: <863B3EE13473404BB5FA7516A1E31EF71235A00729@exchange-01.wolfe.local> <4B5FDFF0.4070300@aei.ca> Message-ID: On Jan 27, 2010, at 1:40 AM, Thomas Guyot-Sionnest wrote: > I would highly recommend using tmpfs for the checkresults directory. > This is especially important on logging file systems like JFS, XFS, > Ext3+ and ReiserFS since every meta-data operation will cause a log > write+flush (at the very least) which will cause a constant stream of > unneeded IO. Thanks for the advice - this sounds perfectly practical, as long as those files are not important to persist across nagios restarts. The JFS partition is on top of DRBD, so if I move to tmpfs, the contents will disappear when a failover happens - that should be okay I take it? Cheers, -- Casey Allen Shobe casey at shobe.info ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From doepain at gmail.com Wed Jan 27 20:55:30 2010 From: doepain at gmail.com (dOE) Date: Wed, 27 Jan 2010 14:55:30 -0500 Subject: Help with check_jmx Message-ID: I am trying to use the "check_jmx but every time I execute the command below ./check_jmx -U service:jmx:rmi:///jndi/rmi://HOSTNAME:6100/jmxrmi -O com.ocs.framework.schedule:type=BackgroundJobMonitor -A SchedulerStatus I know what I know because we use jManage which is another open source utility to probe JMX with.. My guess is this "check_jmx" can only return integers, and is not able to return word strings. sudo ./check_jmx -U service:jmx:rmi:///jndi/rmi://ocstestptcweb22:6100/jmxrmi -O com.ocs.framework.schedule:type=BackgroundJobMonitor -A SchedulerStatus -K 0 -I 0 -J 0 -vvvv JMX CRITICAL For input string: "INACTIVE" connecting to com.ocs.framework.schedule:type=BackgroundJobMonitor by URL service:jmx:rmi:///jndi/rmi://HOSTNAME:6100/jmxrmijava.lang.NumberFormatException: For input string: "*INACTIVE*" at java.lang.NumberFormatException.forInputString(NumberFormatException.java:48) at java.lang.Long.parseLong(Long.java:410) at java.lang.Long.parseLong(Long.java:468) at org.nagios.JMXQuery.parseData(JMXQuery.java:212) at org.nagios.JMXQuery.execute(JMXQuery.java:191) at org.nagios.JMXQuery.main(JMXQuery.java:76) The check is for background jobs running on the application server. These background jobs can be set to either ACTIVE or INACTIVE. What is strange to me is that it is getting the INACTIVE from the object (com.app.framework.schedule:type=BackgroundJobMonitor) without me specifying, but how do I specify ACTIVE , so that I can get the status of active background jobs that are running. I understand if this question is beyond the scope of this mailing-list. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From AHKAPLAN at PARTNERS.ORG Thu Jan 28 00:13:29 2010 From: AHKAPLAN at PARTNERS.ORG (Kaplan, Andrew H.) Date: Wed, 27 Jan 2010 18:13:29 -0500 Subject: hosts showing up as down even though they are not being monitored Message-ID: Hi there -- We are running Nagios 3.1.2 on one of our servers, and the following anomaly is occurring: Several of our clients have been taken offline, during which time they were detected as being down. I have gone through the motions of commenting out references to them in the services file located in the /usr/local/nagios/etc/objects directory. Even though the clients are no longer being monitored, they are still showing up as being down. What other steps do I need to take in order to correct this problem? Thanks. The information in this e-mail is intended only for the person to whom it is addressed. If you believe this e-mail was sent to you in error and the e-mail contains patient information, please contact the Partners Compliance HelpLine at http://www.partners.org/complianceline . If the e-mail was sent to you in error but does not contain patient information, please contact the sender and properly dispose of the e-mail. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rperezm at uci.cu Thu Jan 28 00:33:53 2010 From: rperezm at uci.cu (ReynierPM) Date: Wed, 27 Jan 2010 18:33:53 -0500 Subject: How to accomplish this task Message-ID: <4B60CD61.2070006@uci.cu> Hi every: I have 6 Blade Servers with ESX vSphere and in top of this structure I have 78 more virtualized servers. Exists any way to check Blade Servers processor temperature, ram status, and all this stuff related to hardware? -- Cheers and thx in advance ReynierPM ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From standalone.sysadmin at gmail.com Thu Jan 28 00:50:58 2010 From: standalone.sysadmin at gmail.com (Matt Simmons) Date: Wed, 27 Jan 2010 18:50:58 -0500 Subject: How to accomplish this task In-Reply-To: <4B60CD61.2070006@uci.cu> References: <4B60CD61.2070006@uci.cu> Message-ID: <5bcb62b61001271550g1773076l4f396f5529c53a22@mail.gmail.com> What manufacturer / model of blades do you have? There are sometimes additions to ESX(i) that can be implemented (such as Dell's openmanage) that provide interfaces to monitor hardware. On Wed, Jan 27, 2010 at 6:33 PM, ReynierPM wrote: > Hi every: > I have 6 Blade Servers with ESX vSphere and in top of this structure I > have 78 more virtualized servers. Exists any way to check Blade Servers > processor temperature, ram status, and all this stuff related to hardware? > -- > Cheers and thx in advance > ReynierPM > > ------------------------------------------------------------------------------ > The Planet: dedicated and managed hosting, cloud storage, colocation > Stay online with enterprise data centers and the best network in the business > Choose flexible plans and management services without long-term contracts > Personal 24x7 support from experience hosting pros just a phone call away. > http://p.sf.net/sfu/theplanet-com > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- LITTLE GIRL: But which cookie will you eat FIRST? COOKIE MONSTER: Me think you have misconception of cookie-eating process. ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rperezm at uci.cu Thu Jan 28 00:57:17 2010 From: rperezm at uci.cu (ReynierPM) Date: Wed, 27 Jan 2010 18:57:17 -0500 Subject: How to accomplish this task In-Reply-To: <5bcb62b61001271550g1773076l4f396f5529c53a22@mail.gmail.com> References: <4B60CD61.2070006@uci.cu> <5bcb62b61001271550g1773076l4f396f5529c53a22@mail.gmail.com> Message-ID: <4B60D2DD.4030309@uci.cu> On 1/27/2010 6:50 PM, Matt Simmons wrote: > What manufacturer / model of blades do you have? There are sometimes > additions to ESX(i) that can be implemented (such as Dell's > openmanage) that provide interfaces to monitor hardware. > HP BladeSystems BL460G5 -- Cheers ReynierPM ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From dermoth at aei.ca Thu Jan 28 00:58:36 2010 From: dermoth at aei.ca (Thomas Guyot-Sionnest) Date: Wed, 27 Jan 2010 18:58:36 -0500 Subject: Unexplainable disk space usage in nagios/var/spool/checkresults In-Reply-To: References: <863B3EE13473404BB5FA7516A1E31EF71235A00729@exchange-01.wolfe.local> <4B5FDFF0.4070300@aei.ca> Message-ID: <4B60D32C.3050008@aei.ca> -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 On 27/01/10 08:22 AM, Casey Allen Shobe wrote: > On Jan 27, 2010, at 1:40 AM, Thomas Guyot-Sionnest wrote: >> I would highly recommend using tmpfs for the checkresults directory. >> This is especially important on logging file systems like JFS, XFS, >> Ext3+ and ReiserFS since every meta-data operation will cause a log >> write+flush (at the very least) which will cause a constant stream of >> unneeded IO. > > > Thanks for the advice - this sounds perfectly practical, as long as > those files are not important to persist across nagios restarts. The > JFS partition is on top of DRBD, so if I move to tmpfs, the contents > will disappear when a failover happens - that should be okay I take it? Absolutely. checkresults, temp files and status.dat don't need to stick around. In normal circumstances there should not be any file left behind in these directories after a clean shutdown. This will also save you some useless network traffic if you can avoid DRDB replication :) - -- Thomas -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.6 (GNU/Linux) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org iD8DBQFLYNMs6dZ+Kt5BchYRAiTfAJ4hLynuqHNndi8pFPm7GHKDrkao1gCgtm5o ov6rKYEZf6KBufZmkhcBYz0= =kbdH -----END PGP SIGNATURE----- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From AHKAPLAN at PARTNERS.ORG Thu Jan 28 01:58:38 2010 From: AHKAPLAN at PARTNERS.ORG (Kaplan, Andrew H.) Date: Wed, 27 Jan 2010 19:58:38 -0500 Subject: hosts showing up as down even though they are not being monitored In-Reply-To: References: Message-ID: Hi there -- I went through the steps you mentioned in your e-mail, and the problem continues to persist. The preflight check did mention the clients in question did not have any services associated with them. Aside from those warnings, the Nagios application did not have any complaints or problems starting up. ________________________________ From: Gary Every [mailto:gevery at gmail.com] Sent: Wednesday, January 27, 2010 6:52 PM To: Kaplan, Andrew H. Subject: Re: [Nagios-users] hosts showing up as down even though they are not being monitored You still need to restart nagios. I'm assuming you're running it on *nix of some kind. These are linux commands below: Check your config first tho via: /path/to/nagios/bindir/nagios -v /path/to/nagios.cfg If that checks out without errors, then restart nagios /path/to/nagios/[STARTUPSCRIPT|nagios] restart On Wed, Jan 27, 2010 at 4:13 PM, Kaplan, Andrew H. wrote: Hi there -- We are running Nagios 3.1.2 on one of our servers, and the following anomaly is occurring: Several of our clients have been taken offline, during which time they were detected as being down. I have gone through the motions of commenting out references to them in the services file located in the /usr/local/nagios/etc/objects directory. Even though the clients are no longer being monitored, they are still showing up as being down. What other steps do I need to take in order to correct this problem? Thanks. The information in this e-mail is intended only for the person to whom it is addressed. If you believe this e-mail was sent to you in error and the e-mail contains patient information, please contact the Partners Compliance HelpLine at http://www.partners.org/complianceline . If the e-mail was sent to you in error but does not contain patient information, please contact the sender and properly dispose of the e-mail. ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Gary Every "Pay it Forward!" -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From AHKAPLAN at PARTNERS.ORG Thu Jan 28 02:06:21 2010 From: AHKAPLAN at PARTNERS.ORG (Kaplan, Andrew H.) Date: Wed, 27 Jan 2010 20:06:21 -0500 Subject: hosts showing up as down even though they are not being monitored In-Reply-To: References: Message-ID: Hi there -- Just for clarification, are you referring to the hosts.cfg, hostextinfo.cfg, hostgroups.cfg file, or all of the above? Thanks. ________________________________ From: L.Q. Marshall [mailto:qmarshall at iNetSpace.net] Sent: Wednesday, January 27, 2010 8:04 PM To: 'nagios-users at lists.sourceforge.net' Subject: RE: [Nagios-users] hosts showing up as down even though they are not being monitored I have gone through the motions of commenting out references to them in the services file located in the /usr/local/nagios/etc/objects directory. Even though the clients are no longer being monitored, they are still showing up as being down. ________________________________ restart (reload?) nagios service. Also verify that you have removed the host entries. - LQ The information in this e-mail is intended only for the person to whom it is addressed. If you believe this e-mail was sent to you in error and the e-mail contains patient information, please contact the Partners Compliance HelpLine at http://www.partners.org/complianceline . If the e-mail was sent to you in error but does not contain patient information, please contact the sender and properly dispose of the e-mail. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jpratt at norwich.edu Thu Jan 28 03:03:46 2010 From: jpratt at norwich.edu (James Pratt) Date: Wed, 27 Jan 2010 21:03:46 -0500 Subject: How to accomplish this task In-Reply-To: <4B60CD61.2070006@uci.cu> References: <4B60CD61.2070006@uci.cu> Message-ID: <24B6509E4191AF44B60A24EAA3B4AD4945F735@nuexchg.norwich.edu> Yes, there are different ways lately, snmp vs WBEM and python,... I believe both require esx not esxi (not sure about the latter, no service console so not sure how you'd install this?) and I believe both require the "HP Management Agents" - you should be able to find them here for your 460c g5 blades, and version of ESX: http://h20000.www2.hp.com/bizsupport/TechSupport/DriverDownload.jsp?prod NameId=3808911&lang=en&cc=us&prodTypeId=3709945&prodSeriesId=3808910&tas kId=135 cheers, jamie >> -----Original Message----- >> From: ReynierPM [mailto:rperezm at uci.cu] >> Sent: Wednesday, January 27, 2010 6:34 PM >> To: nagios-users at lists.sourceforge.net >> Subject: [Nagios-users] How to accomplish this task >> >> Hi every: >> I have 6 Blade Servers with ESX vSphere and in top of this structure I >> have 78 more virtualized servers. Exists any way to check Blade Servers >> processor temperature, ram status, and all this stuff related to hardware? >> -- >> Cheers and thx in advance >> ReynierPM >> >> ------------------------------------------------------------------------ ------ >> The Planet: dedicated and managed hosting, cloud storage, colocation >> Stay online with enterprise data centers and the best network in the business >> Choose flexible plans and management services without long-term contracts >> Personal 24x7 support from experience hosting pros just a phone call away. >> http://p.sf.net/sfu/theplanet-com >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when reporting any >> issue. >> ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jkamens at Advent.COM Thu Jan 28 03:26:20 2010 From: jkamens at Advent.COM (Jonathan Kamens) Date: Wed, 27 Jan 2010 21:26:20 -0500 Subject: Nagios 30 seconds initial delay In-Reply-To: <1595932556.20091216005707@gmail.com> References: <1595932556.20091216005707@gmail.com> Message-ID: <4B60F5CC.20906@advent.com> Brandino Andreas wrote: > Hi all, > I am using Nagios 3.2.0 (I just upgrade from early 3.0 releases) > Every time I restart nagios I face a 30 seconds delay before the > various pages appear for first time (not before starting checks, 30 > seconds before displaying pages!!!) > You want my performance patches, attached. The first of these, nagios-3.2.0-status-cgi-speed.patch, has been vetted extensively by Ton Voon and will be in the next Nagios release (note: for those of you who saw this patch when it came over the nagios-devel mailing list last month, note that the attached version has some bug fixes in it). The second, nagios-3.2.0-startup-speed.patch, I just wrote tonight, so it hasn't had as much burn-in time, but I'm reasonably confident that it is correct. These two patches decrease startup time and CGI response time dramatically. Enjoy! Jonathan Kamens -- Jonathan Kamens Operations Manager Advent Tamale RMS 201 South Street, Suite 300, Boston, MA 02111 Phone: +1 617 261 0264 ext. 133 | Mobile : +1 617 417 8989 | Fax: + 1 617 812 0330 jkamens at advent.com | www.advent.com -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: tamale.jpg Type: image/jpeg Size: 6255 bytes Desc: not available URL: -------------- next part -------------- An embedded and charset-unspecified text was scrubbed... Name: nagios-3.2.0-status-cgi-speed.patch URL: -------------- next part -------------- An embedded and charset-unspecified text was scrubbed... Name: nagios-3.2.0-startup-speed.patch URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Bernard.Barbosa at behringer.com Thu Jan 28 06:52:47 2010 From: Bernard.Barbosa at behringer.com (Barbosa, Bernard) Date: Thu, 28 Jan 2010 13:52:47 +0800 Subject: Help Support Message-ID: Dear Support, We are using Nagios version 2.9 for monitoring our servers. I just met an issue on creating an availability report for one service of one server. Below are the details: Under Select Report Options I put: Report Period = 31 Days Report Time Period = some_customized time which exclude the scheduled downtime Assume Initial States = Yes Assume State Retention = Yes Assume State During Program Downtime = Yes Include Soft State = No First Assumed Service State = Current State Backtracked Archives = 4 After Clicking the create availability this screen below shows up: Can you please explain as to why is this happening? I put 31 days on report period but it shows that value on unscheduled critical time. Please help. Thank you so much. Kind Regards, BARBOSA, Bernard Jnr Administrator, System/Network BEHRINGER Holdings (Pte) Ltd IP Phone: 60651 ext 1245 Tel: +63 2 7505401 ext 1245 Fax: +63 2 7505384 Email: INFOServSystPH at behringer.com Web: www.behringer.com :-) Build Teamwork :-) Take Ownership :-) Don't Waste Resources :-) Clean Workplace = Clean Mind :-) Respect Guidelines and Policies :-) Improve Yourself and Help Others :-) Don't Forget to Smile and Say Thank You This email is intended exclusively for the addressee(s) named above and may contain privileged and confidential information. If you are not (among) the intended recipient(s), you may not copy, utilize or distribute any of the information contained herein. If you have received this email in error, please notify us immediately via return email and delete the original from your mailbox. Thank you. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image001.jpg Type: image/jpeg Size: 57844 bytes Desc: image001.jpg URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From martinm at op5.org Thu Jan 28 09:03:00 2010 From: martinm at op5.org (Martin Melin) Date: Thu, 28 Jan 2010 09:03:00 +0100 Subject: hosts showing up as down even though they are not being monitored In-Reply-To: References: Message-ID: <5f0bcb7c1001280003n577885fmf36514ed744c6c92@mail.gmail.com> On Thu, Jan 28, 2010 at 1:58 AM, Kaplan, Andrew H. wrote: > Hi there -- > > I went through the steps you mentioned in your e-mail, and the problem > continues to persist. > > The preflight check did mention the clients in question did not have any > services associated > with them. Aside from those warnings, the Nagios application did not have > any complaints or > problems starting up. > You've commented out your service definitions for these hosts, but still have host definitions left in. These hosts are being checked with a check_command (most probably check-host-alive in a default Nagios), and since the machines are offline, they show up in Nagios as down. Go to your $NAGIOSHOME/etc directory and do a "grep -R machinename *" where machinename is the host you want to remove. It'll show you all the places where you mention that host, which will let you know which file you still need to edit. Then do the verify+reload dance again :-) > > > > ------------------------------ > *From:* Gary Every [mailto:gevery at gmail.com] > *Sent:* Wednesday, January 27, 2010 6:52 PM > *To:* Kaplan, Andrew H. > *Subject:* Re: [Nagios-users] hosts showing up as down even though they > are not being monitored > > You still need to restart nagios. > I'm assuming you're running it on *nix of some kind. These are linux > commands below: > > Check your config first tho via: > /path/to/nagios/bindir/nagios -v /path/to/nagios.cfg > > If that checks out without errors, then restart nagios > > /path/to/nagios/[STARTUPSCRIPT|nagios] restart > > > > On Wed, Jan 27, 2010 at 4:13 PM, Kaplan, Andrew H. wrote: > >> Hi there -- >> >> We are running Nagios 3.1.2 on one of our servers, and the following >> anomaly is occurring: >> >> Several of our clients have been taken offline, during which time they >> were detected as being down. >> I have gone through the motions of commenting out references to them in >> the services file located in >> the /usr/local/nagios/etc/objects directory. Even though the clients are >> no longer being monitored, >> they are still showing up as being down. >> >> What other steps do I need to take in order to correct this problem? >> Thanks. >> >> The information in this e-mail is intended only for the person to whom it is >> addressed. If you believe this e-mail was sent to you in error and the e-mail >> contains patient information, please contact the Partners Compliance HelpLine athttp://www.partners.org/complianceline . If the e-mail was sent to you in error >> but does not contain patient information, please contact the sender and properly >> dispose of the e-mail. >> >> >> >> ------------------------------------------------------------------------------ >> >> The Planet: dedicated and managed hosting, cloud storage, colocation >> Stay online with enterprise data centers and the best network in the >> business >> Choose flexible plans and management services without long-term contracts >> Personal 24x7 support from experience hosting pros just a phone call away. >> http://p.sf.net/sfu/theplanet-com >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when >> reporting any issue. >> ::: Messages without supporting info will risk being sent to /dev/null >> > > > > -- > Gary Every > "Pay it Forward!" > > > > ------------------------------------------------------------------------------ > The Planet: dedicated and managed hosting, cloud storage, colocation > Stay online with enterprise data centers and the best network in the > business > Choose flexible plans and management services without long-term contracts > Personal 24x7 support from experience hosting pros just a phone call away. > http://p.sf.net/sfu/theplanet-com > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Martin Melin ____________________________ op5 AB http://www.op5.com http://www.op5.org/ http://www.op5.com/op5/products/network-monitor/nagios/ -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From martinm at op5.org Thu Jan 28 09:26:31 2010 From: martinm at op5.org (Martin Melin) Date: Thu, 28 Jan 2010 09:26:31 +0100 Subject: Help Support In-Reply-To: References: Message-ID: <5f0bcb7c1001280026k1f49b3c0la699511be98ff46f@mail.gmail.com> On Thu, Jan 28, 2010 at 6:52 AM, Barbosa, Bernard < Bernard.Barbosa at behringer.com> wrote: > Dear Support, > This is not Support, this is people volunteering their time to help you out. > > > We are using Nagios version 2.9 for monitoring our servers. I just met an > issue on creating an availability report for one service of one server. > Below are the details: > > > > Under Select Report Options I put: > > Report Period = 31 Days > > Report Time Period = some_customized time which exclude the scheduled > downtime > > Assume Initial States = Yes > > Assume State Retention = Yes > > Assume State During Program Downtime = Yes > > Include Soft State = No > > First Assumed Service State = Current State > > Backtracked Archives = 4 > > > > After Clicking the create availability this screen below shows up: > > > > Can you please explain as to why is this happening? I put 31 days on report > period but it shows that value on unscheduled critical time. Please help. > Thank you so much. > It looks like this is an old bug in Nagios 2.x. I found another person running into the same issue back in 2006: http://www.mail-archive.com/nagios-users at lists.sourceforge.net/msg03272.html Basically it looks like there is a rounding bug of some kind when creating the report. Since I don't have any 2.x systems left I can't test it for you, but you might get around it by playing around with the report settings. However I think you should be planning to get up to 3.x - it has working availability reports and lots of other good stuff :-) > > > > > > > Kind Regards, > > > > BARBOSA, Bernard > > Jnr Administrator, System/Network > > BEHRINGER Holdings (Pte) Ltd > > IP Phone: 60651 ext 1245 > > Tel: +63 2 7505401 ext 1245 > > Fax: +63 2 7505384 > > Email: INFOServSystPH at behringer.com > > Web: www.behringer.com > > > J Build Teamwork J Take Ownership J Don?t Waste Resources J Clean > Workplace = Clean Mind J Respect Guidelines and Policies J Improve > Yourself and Help Others J Don?t Forget to Smile and Say Thank You > > > This email is intended exclusively for the addressee(s) named above and may > contain privileged and confidential information. If you are not (among) the > intended recipient(s), you may not copy, utilize or distribute any of the > information contained herein. If you have received this email in error, > please notify us immediately via return email and delete the original from > your mailbox. Thank you. > > > > > ------------------------------------------------------------------------------ > The Planet: dedicated and managed hosting, cloud storage, colocation > Stay online with enterprise data centers and the best network in the > business > Choose flexible plans and management services without long-term contracts > Personal 24x7 support from experience hosting pros just a phone call away. > http://p.sf.net/sfu/theplanet-com > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Martin Melin ____________________________ op5 AB http://www.op5.com http://www.op5.org/ http://www.op5.com/op5/products/network-monitor/nagios/ -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image001.jpg Type: image/jpeg Size: 57844 bytes Desc: not available URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Bernard.Barbosa at behringer.com Thu Jan 28 09:33:58 2010 From: Bernard.Barbosa at behringer.com (Barbosa, Bernard) Date: Thu, 28 Jan 2010 16:33:58 +0800 Subject: Help Support Message-ID: Dear Martin, Thank you so much for the info and Sorry for the "support" word I used :p Kind Regards, BARBOSA, Bernard Jnr Administrator, System/Network BEHRINGER Holdings (Pte) Ltd IP Phone: 60651 ext 1245 Tel: +63 2 7505401 ext 1245 Fax: +63 2 7505384 Email: INFOServSystPH at behringer.com Web: www.behringer.com :-) Build Teamwork :-) Take Ownership :-) Don't Waste Resources :-) Clean Workplace = Clean Mind :-) Respect Guidelines and Policies :-) Improve Yourself and Help Others :-) Don't Forget to Smile and Say Thank You This email is intended exclusively for the addressee(s) named above and may contain privileged and confidential information. If you are not (among) the intended recipient(s), you may not copy, utilize or distribute any of the information contained herein. If you have received this email in error, please notify us immediately via return email and delete the original from your mailbox. Thank you. ________________________________ From: Martin Melin [mailto:martinm at op5.org] Sent: Thursday, January 28, 2010 4:27 PM To: Barbosa, Bernard Cc: nagios-users-request at lists.sourceforge.net; nagios-users at lists.sourceforge.net Subject: Re: [Nagios-users] Help Support On Thu, Jan 28, 2010 at 6:52 AM, Barbosa, Bernard < Bernard.Barbosa at behringer.com> wrote: Dear Support, This is not Support, this is people volunteering their time to help you out. We are using Nagios version 2.9 for monitoring our servers. I just met an issue on creating an availability report for one service of one server. Below are the details: Under Select Report Options I put: Report Period = 31 Days Report Time Period = some_customized time which exclude the scheduled downtime Assume Initial States = Yes Assume State Retention = Yes Assume State During Program Downtime = Yes Include Soft State = No First Assumed Service State = Current State Backtracked Archives = 4 After Clicking the create availability this screen below shows up: Can you please explain as to why is this happening? I put 31 days on report period but it shows that value on unscheduled critical time. Please help. Thank you so much. It looks like this is an old bug in Nagios 2.x. I found another person running into the same issue back in 2006: http://www.mail-archive.com/nagios-users at lists.sourceforge.net/msg03272. html Basically it looks like there is a rounding bug of some kind when creating the report. Since I don't have any 2.x systems left I can't test it for you, but you might get around it by playing around with the report settings. However I think you should be planning to get up to 3.x - it has working availability reports and lots of other good stuff :-) Kind Regards, BARBOSA, Bernard Jnr Administrator, System/Network BEHRINGER Holdings (Pte) Ltd IP Phone: 60651 ext 1245 Tel: +63 2 7505401 ext 1245 Fax: +63 2 7505384 Email: INFOServSystPH at behringer.com Web: www.behringer.com :-) Build Teamwork :-) Take Ownership :-) Don't Waste Resources :-) Clean Workplace = Clean Mind :-) Respect Guidelines and Policies :-) Improve Yourself and Help Others :-) Don't Forget to Smile and Say Thank You This email is intended exclusively for the addressee(s) named above and may contain privileged and confidential information. If you are not (among) the intended recipient(s), you may not copy, utilize or distribute any of the information contained herein. If you have received this email in error, please notify us immediately via return email and delete the original from your mailbox. Thank you. ------------------------------------------------------------------------ ------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Martin Melin ____________________________ op5 AB http://www.op5.com http://www.op5.org/ http://www.op5.com/op5/products/network-monitor/nagios/ -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image001.jpg Type: image/jpeg Size: 57844 bytes Desc: image001.jpg URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ton.voon at opsera.com Thu Jan 28 10:01:50 2010 From: ton.voon at opsera.com (Ton Voon) Date: Thu, 28 Jan 2010 09:01:50 +0000 Subject: Nagios 30 seconds initial delay In-Reply-To: <1786557943.20091217101212@gmail.com> References: <1595932556.20091216005707@gmail.com> <1786557943.20091217101212@gmail.com> Message-ID: <465F6A08-52A6-46C5-BBD0-EA75BAEA51DA@opsera.com> On 17 Dec 2009, at 08:12, Brandino Andreas wrote: > I have much less hosts and services (for the moment). > After deleting "retention.dat" I still face the same delay... > When did this slow down occur? I can see how Jonathan's patches speed up the system but these look like long-term bottlenecks, not a 3.0 -> 3.2 migration. I'm interested to know when the slowdown occurred and if it is the side effect of something else. There was a patch applied sometime in 3.1 which improved the circular parents lookup, but I don't know much else (Andreas applied it but didn't update the Changelog). Some information from a "slow startup system" and a "fast startup system": * how do you define the startup time? (time from nagios start to status.dat appearing?) * which version of Nagios? * size of system (number of hosts, services, comments, downtimes) * broker modules you use * nagios -s nagios.cfg - what does the timings tell you? Ton ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Thu Jan 28 15:08:24 2010 From: ae at op5.se (Andreas Ericsson) Date: Thu, 28 Jan 2010 15:08:24 +0100 Subject: Nagios 30 seconds initial delay In-Reply-To: <465F6A08-52A6-46C5-BBD0-EA75BAEA51DA@opsera.com> References: <1595932556.20091216005707@gmail.com> <1786557943.20091217101212@gmail.com> <465F6A08-52A6-46C5-BBD0-EA75BAEA51DA@opsera.com> Message-ID: <4B619A58.5010608@op5.se> On 01/28/2010 10:01 AM, Ton Voon wrote: > > On 17 Dec 2009, at 08:12, Brandino Andreas wrote: > >> I have much less hosts and services (for the moment). >> After deleting "retention.dat" I still face the same delay... >> > > When did this slow down occur? I can see how Jonathan's patches speed > up the system but these look like long-term bottlenecks, not a 3.0 -> > 3.2 migration. I'm interested to know when the slowdown occurred and > if it is the side effect of something else. > > There was a patch applied sometime in 3.1 which improved the circular > parents lookup, but I don't know much else (Andreas applied it but > didn't update the Changelog). > I don't do changelogs :p The info is in the commit history though. I think you're referring to the following patch, sent in by Jean Gab?s: commit a6e06d8de24ffcb4a8c341a60098042dc3284756 Author: Andreas Ericsson Date: Sun May 17 12:54:28 2009 +0000 Revamp hosts' circular parent/child detection With this patch, all hosts are checked at most twice when determining circular host/parent relationship chains. Previously, all hosts and all their parents were always checked once for each host, which caused for the non-linear scaling function represtented as O((n*n)+depth_of_n). For some timing comparisons: A configuration with 151109 nodes is used. The 150300 interesting nodes are in chains 300 levels deep, with 501 hosts in each level. Patched: 6.28user 0.19system 0:06.58elapsed 98%CPU (0avgtext+0avgdata 0maxresident)k 0inputs+0outputs (0major+65723minor)pagefaults 0swaps Unpatched: (Ctrl-C'd out as it wasn't done after nearly an hour) 3221.77user 0.31system 53:56.51elapsed 99%CPU (0text+0data 0max)k 0inputs+0outputs (0major+67480minor)pagefaults 0swaps Without even completing the timing comparison, the patch provides a speedup of more than 51300%. On a smaller and more real-worldy configuration of roughly 15000 hosts, we get these timings: Patched: 0.71user 0.02system 0:00.75elapsed 98%CPU (0avgtext+0avgdata 0maxresident)k 0inputs+0outputs (0major+8219minor)pagefaults 0swaps Unpatched: 213.77user 0.02system 3:34.34elapsed 99%CPU (0text+0data 0max)k 0inputs+0outputs (0major+8280minor)pagefaults 0swaps A huge improvement indeed, and one that's necessary for Nagios to be usable in truly huge networks. Signed-off-by: Andreas Ericsson -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace. ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From casey at shobe.info Thu Jan 28 16:11:58 2010 From: casey at shobe.info (Casey Allen Shobe) Date: Thu, 28 Jan 2010 10:11:58 -0500 Subject: CGI timezone versus Nagios timezone Message-ID: Hi, I run a server with a timezone setting of UTC for the system - I do not want to change that to local time. I have tried restarting the nagios server using a TZ variable of 'EST5EDT', and when I do that, I see in the log that Nagios recognizes the local time in EST, however, the CGI's still show all times in UTC. Personally, I don't really care what the server thinks the time zone is - I just want local times to show up in the CGI interface. I did not see any setting in cgi.cfg to do this - how can I go about changing the time zone displayed in the CGI interface without changing the serverwide time zone? Cheers, -- Casey Allen Shobe casey at shobe.info -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jerryapr7 at gmail.com Thu Jan 28 16:59:51 2010 From: jerryapr7 at gmail.com (Jerry Joy) Date: Thu, 28 Jan 2010 21:29:51 +0530 Subject: Ports used in Nagios infrastructure Message-ID: <9f3a8a031001280759yf889067y75149d789495ee3f@mail.gmail.com> Hi, I'm trying to find out all the ports used by Nagios infrastructure components in a distributed architecture. The idea is to identify the firewall changes required for the implementation. As per documentation and information on web I notice the following ports. Please let me know if I missed anything Port 80 for Console Access on Primary Nagios server (including web server) Port 5667 for NSCA on Primary Nagios server Port 12489 on Windows clients for NSClient++ Port 5666 on Linux clients for NRPE daemon Is there any other port on the Primary Nagios server which is listening for reports from clients? Also, is there any port opening required for any port on the distributed Nagios server which sends reports to the Primary Nagios server Thanks, Jerry -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Thu Jan 28 17:13:30 2010 From: marc at ena.com (Marc Powell) Date: Thu, 28 Jan 2010 10:13:30 -0600 Subject: CGI timezone versus Nagios timezone In-Reply-To: References: Message-ID: <3CCD65CE-EFC5-4915-9853-33CF58407B5D@ena.com> On Jan 28, 2010, at 9:11 AM, Casey Allen Shobe wrote: > Personally, I don't really care what the server thinks the time zone is - I just want local times to show up in the CGI interface. I did not see any setting in cgi.cfg to do this - how can I go about changing the time zone displayed in the CGI interface without changing the serverwide time zone? >From nagios.cfg -- # TIMEZONE OFFSET # This option is used to override the default timezone that this # instance of Nagios runs in. If not specified, Nagios will use # the system configured timezone. # # NOTE: In order to display the correct timezone in the CGIs, you # will also need to alter the Apache directives for the CGI path # to include your timezone. Example: # # # SetEnv TZ "Australia/Brisbane" # ... # #use_timezone=US/Mountain #use_timezone=Australia/Brisbane -- Marc ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Matsushita.Nobuo at tchden.org Thu Jan 28 16:47:56 2010 From: Matsushita.Nobuo at tchden.org (Matsushita, Nobuo) Date: Thu, 28 Jan 2010 08:47:56 -0700 Subject: a little confused Message-ID: Hi, I was checking the "Object Definitions" in the documentation for the 3.x. Under the "Service Definition" section, I don't see "normal_check_interval". But under the "Monitoring Network Printers", I see "normal_check_interval". I found that "retry_check_interval" is not defined in the "Service Definition", but used in the printers' section. Can someone help me to clear up on these? Thanks in advance! Nobuo ------------------------------------------------------------------------------ CONFIDENTIALITY NOTICE: This e-mail is confidential, may be legally privileged, and for the intended recipient only. Access, disclosure, copying, forwarding and distribution by any means is strictly prohibited. If received in error, do not read but delete and e-mail confirmation to the sender. ============================================================================== -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From andrew.ford at wachovia.com Thu Jan 28 18:12:08 2010 From: andrew.ford at wachovia.com (Ford, Andy) Date: Thu, 28 Jan 2010 12:12:08 -0500 Subject: Ports used in Nagios infrastructure In-Reply-To: <9f3a8a031001280759yf889067y75149d789495ee3f@mail.gmail.com> References: <9f3a8a031001280759yf889067y75149d789495ee3f@mail.gmail.com> Message-ID: <323336996ADE534FB38549B446AC5C1A4407569CE8@MEIS1MSX005001.noam.msds.wachovia.net> Depending on your setup, you might want to give yourself some way to send configs to the distributed collectors like ssh/scp. Some folks also like to run all web servers on SSL, so you might want to make sure port 443 is open too. Andy Ford Network Security Compliance & Automation, Wells Fargo & Co. 314-600-7025 andrew.ford at wachovia.com From: Jerry Joy [mailto:jerryapr7 at gmail.com] Sent: Thursday, January 28, 2010 10:00 AM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] Ports used in Nagios infrastructure Hi, I'm trying to find out all the ports used by Nagios infrastructure components in a distributed architecture. The idea is to identify the firewall changes required for the implementation. As per documentation and information on web I notice the following ports. Please let me know if I missed anything Port 80 for Console Access on Primary Nagios server (including web server) Port 5667 for NSCA on Primary Nagios server Port 12489 on Windows clients for NSClient++ Port 5666 on Linux clients for NRPE daemon Is there any other port on the Primary Nagios server which is listening for reports from clients? Also, is there any port opening required for any port on the distributed Nagios server which sends reports to the Primary Nagios server Thanks, Jerry -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Thu Jan 28 18:21:59 2010 From: marc at ena.com (Marc Powell) Date: Thu, 28 Jan 2010 11:21:59 -0600 Subject: a little confused In-Reply-To: References: Message-ID: On Jan 28, 2010, at 9:47 AM, Matsushita, Nobuo wrote: > Hi, > > I was checking the "Object Definitions" in the documentation for the 3.x. > > Under the "Service Definition" section, I don't see "normal_check_interval". But under the "Monitoring Network Printers", I see "normal_check_interval". I found that "retry_check_interval" is not defined in the "Service Definition", but used in the printers' section. > > Can someone help me to clear up on these? Both are interchangeable (for now)- check_interval == normal_check_interval retry_interval == retry_check_interval The longer format is being phased out and it seems that page of the documentation was not updated. -- Marc ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Thu Jan 28 18:20:27 2010 From: patrick.morris at hp.com (Morris, Patrick) Date: Thu, 28 Jan 2010 09:20:27 -0800 Subject: Ports used in Nagios infrastructure In-Reply-To: <9f3a8a031001280759yf889067y75149d789495ee3f@mail.gmail.com> References: <9f3a8a031001280759yf889067y75149d789495ee3f@mail.gmail.com> Message-ID: <4B61C75B.2090503@hp.com> Jerry Joy wrote: > I'm trying to find out all the ports used by Nagios infrastructure > components in a distributed architecture. The idea is to identify the > firewall changes required for the implementation. As per documentation > and information on web I notice the following ports. Please let me > know if I missed anything > > > Port 80 for Console Access on Primary Nagios server (including web server) > Port 5667 for NSCA on Primary Nagios server > Port 12489 on Windows clients for NSClient++ > Port 5666 on Linux clients for NRPE daemon > > Is there any other port on the Primary Nagios server which is > listening for reports from clients? > Also, is there any port opening required for any port on the > distributed Nagios server which sends reports to the Primary Nagios server Strictly speaking, Nagios does not open *any* network ports. If your setup includes NSCA, NRPE, and NSCLient++ then yes, you need to make sure their ports are open, but that's dependent on your implementation and whether you've installed those add-ons. The same goes for any other services external to Nagios itself (like the front-end web-server, and other network-enabled external components) you may use in your particular setup. I've noted you haven't mentioned any access from the Nagios server to other hosts, which will be necessary if you actually want the Nagios server to be able to check anything on other hosts, but again, that's dependent on your configuration and maybe outbound access from the Nagios machine isn't an issue in your environment. In any case, what ports you need opened to your Nagios server is completely defined by your configuration. There are multiple ways to do a distributed setup, and the ports you'll need will be defined by how you choose to do that. ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From casey at shobe.info Fri Jan 29 04:57:56 2010 From: casey at shobe.info (Casey Allen Shobe) Date: Thu, 28 Jan 2010 22:57:56 -0500 Subject: CGI timezone versus Nagios timezone In-Reply-To: <3CCD65CE-EFC5-4915-9853-33CF58407B5D@ena.com> References: <3CCD65CE-EFC5-4915-9853-33CF58407B5D@ena.com> Message-ID: On Jan 28, 2010, at 11:13 AM, Marc Powell wrote: > # NOTE: In order to display the correct timezone in the CGIs, you > # will also need to alter the Apache directives for the CGI path > # to include your timezone. Example: > # > # > # SetEnv TZ "Australia/Brisbane" > # ... > # Ahh, thanks. I removed all the comments when I set things up so I could more easily maintain the files - should have grepped through the stock files before mailing. In case anybody's wondering how to do the same under nginx, one just needs to add the following line to the relevant section of their conf: fastcgi_param TZ EST5EDT; Cheers, -- Casey Allen Shobe casey at shobe.info ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From casey at shobe.info Fri Jan 29 05:18:49 2010 From: casey at shobe.info (Casey Allen Shobe) Date: Thu, 28 Jan 2010 23:18:49 -0500 Subject: Checking DHCP In-Reply-To: <96D3C579-A4A8-4C58-9D8A-44AF19191993@ena.com> References: <96D3C579-A4A8-4C58-9D8A-44AF19191993@ena.com> Message-ID: <54E7F65F-634A-4966-9B10-F1A58D892599@shobe.info> On Jan 14, 2010, at 9:12 AM, Marc Powell wrote: > There's a check_dhcp plugin in the standard plugins to verify that a > DHCP server his handing out leases. You need to run it from a host > on the LAN that your DHCP server answers on. In most cases you > cannot run this plugin on the DHCP server itself as they are most > often configured *not* to give answers to themselves. If you're > limited to running something on the DHCP server itself, you'll > likely only be able to look to see if the DHCP process/service is > running. How that's done depends on the OS of the server and how you > can get to it from the Nagios machine. If I wanted to configure this to appear in a host named "network" that represents broadcast network services not tied to a particular host, what would I need to set check_host_command to so that nothing actually runs to check the "host"? Or where else would this be appropriate? Cheers, -- Casey Allen Shobe casey at shobe.info ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Fri Jan 29 05:54:28 2010 From: marc at ena.com (Marc Powell) Date: Thu, 28 Jan 2010 22:54:28 -0600 Subject: Checking DHCP In-Reply-To: <54E7F65F-634A-4966-9B10-F1A58D892599@shobe.info> References: <96D3C579-A4A8-4C58-9D8A-44AF19191993@ena.com> <54E7F65F-634A-4966-9B10-F1A58D892599@shobe.info> Message-ID: On Jan 28, 2010, at 10:18 PM, Casey Allen Shobe wrote: > If I wanted to configure this to appear in a host named "network" that represents broadcast network services not tied to a particular host, what would I need to set check_host_command to so that nothing actually runs to check the "host"? Or where else would this be appropriate? check command check_dummy or any valid command with a check_period of none and/or checks_enabled to false. -- Marc ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Fri Jan 29 11:03:33 2010 From: nagios at flatto.net (Assaf Flatto) Date: Fri, 29 Jan 2010 10:03:33 +0000 Subject: status.cgi parameters Message-ID: <4B62B275.5070608@flatto.net> Hello I am trying to find the service and host status values for setting specific views from the status cgi. I looked in the nagios documentation but was unable to find it . I know it is the the Wolfgang book ( ordered - still waiting for it ) , but I can not wait till it arrives . Does any one knows the parameters ? or can point me to an online page that lists them ? Thanks Assaf ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From pitchfork at ederdrom.de Fri Jan 29 11:47:30 2010 From: pitchfork at ederdrom.de (Joerg Linge) Date: Fri, 29 Jan 2010 11:47:30 +0100 Subject: status.cgi parameters In-Reply-To: <4B62B275.5070608@flatto.net> References: <4B62B275.5070608@flatto.net> Message-ID: <4B62BCC2.5080506@ederdrom.de> Am 29.01.10 11:03, schrieb Assaf Flatto: > Hello > > I am trying to find the service and host status values for setting > specific views from the status cgi. > > I looked in the nagios documentation but was unable to find it . > > I know it is the the Wolfgang book ( ordered - still waiting for it ) , > but I can not wait till it arrives . > > Does any one knows the parameters ? or can point me to an online page > that lists them ? http://nagios.cvs.sourceforge.net/viewvc/nagios/nagios/include/cgiutils.h.in?revision=1.29&view=markup Line 276 ff Joerg ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Fri Jan 29 11:52:49 2010 From: nagios at flatto.net (Assaf Flatto) Date: Fri, 29 Jan 2010 10:52:49 +0000 Subject: status.cgi parameters In-Reply-To: <4B62BCC2.5080506@ederdrom.de> References: <4B62B275.5070608@flatto.net> <4B62BCC2.5080506@ederdrom.de> Message-ID: <4B62BE01.6060603@flatto.net> Joerg Linge wrote: > Am 29.01.10 11:03, schrieb Assaf Flatto: > >> Hello >> >> I am trying to find the service and host status values for setting >> specific views from the status cgi. >> >> I looked in the nagios documentation but was unable to find it . >> >> I know it is the the Wolfgang book ( ordered - still waiting for it ) , >> but I can not wait till it arrives . >> >> Does any one knows the parameters ? or can point me to an online page >> that lists them ? >> > > http://nagios.cvs.sourceforge.net/viewvc/nagios/nagios/include/cgiutils.h.in?revision=1.29&view=markup > > Line 276 ff > > Joerg > Thank you very much. Assaf ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From howard.allison at pva.sozvers.at Fri Jan 29 09:51:34 2010 From: howard.allison at pva.sozvers.at (Howard Allison) Date: Fri, 29 Jan 2010 09:51:34 +0100 Subject: Service Escalation In-Reply-To: <4A0B1EA1.5000305@physics.ucsd.edu> References: <4A0B1EA1.5000305@physics.ucsd.edu> Message-ID: Hi group, I want to do something fairly simple, but I can't seem to make it happen... When a service goes into critical status, I want to send one, and only one e-mail notification.If the service is still down after say 15 minutes, I want to send one, and only one SMS. Is this possible? Unless I'm missing something it seems that I will have to set the service notification interval to something other than 0 to allow escalations... and that would mean lots of notifications.. Thanks in advance, Howard -------------------------- VERTRAULICHKEIT: Diese Nachricht ist ausschlie?lich f?r denjenigen bestimmt, an den sie adressiert ist und kann vertrauliche Informationen enthalten. Falls Sie nicht der Empf?nger dieser Nachricht sind, weisen wir Sie darauf hin, dass die unberechtigte Weitergabe oder Verwendung sowie das unberechtigte Verteilen oder Kopieren dieser Nachricht strikt untersagt sind. Falls Sie diese Nachricht irrt?mlich erhalten haben, vernichten Sie sie bitte sofort. CONFIDENTIALITY: This message is intended only for the use of the individuality or entity to which it is addressed and may contain information that is privileged, confidential and exempt from disclosure. If you are not the intended recipient you are notified that any dissemination, distribution, use or copying of this communication is strictly prohibited. If you received this message in error, please immediately destroy this message. -------------------------- -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From martinm at op5.org Fri Jan 29 12:33:24 2010 From: martinm at op5.org (Martin Melin) Date: Fri, 29 Jan 2010 12:33:24 +0100 Subject: Service Escalation In-Reply-To: References: <4A0B1EA1.5000305@physics.ucsd.edu> Message-ID: <5f0bcb7c1001290333q56c4411ds3b4d72c3b63a886b@mail.gmail.com> You want to have two escalations, one for the email and one for the SMS. They have first_notification,last_notification set to 1,1 and 2,2 respectively. Then set the contact for your service to be some contact that has notification_period set to none. This will mean that the ordinary contact for the service will never be notified (because the timeperiod never matches) but your escalations will kick in and send out 1 email and 1 SMS per problem. On Fri, Jan 29, 2010 at 9:51 AM, Howard Allison < howard.allison at pva.sozvers.at> wrote: > Hi group, I want to do something fairly simple, but I can't seem to make it > happen... > When a service goes into critical status, I want to send one, and only one > e-mail notification.If the service is still down after say 15 minutes, I > want to send one, and only one SMS. Is this possible? Unless I'm missing > something it seems that I will have to set the service notification interval > to something other than 0 to allow escalations... and that would mean lots > of notifications.. > Thanks in advance, > Howard > > > -------------------------- > VERTRAULICHKEIT: Diese Nachricht ist ausschlie?lich f?r denjenigen > bestimmt, an > den sie adressiert ist und kann vertrauliche Informationen enthalten. Falls > Sie nicht > der Empf?nger dieser Nachricht sind, weisen wir Sie darauf hin, dass die > unberechtigte Weitergabe oder Verwendung sowie das unberechtigte Verteilen > oder > Kopieren dieser Nachricht strikt untersagt sind. Falls Sie diese Nachricht > irrt?mlich > erhalten haben, vernichten Sie sie bitte sofort. > > CONFIDENTIALITY: This message is intended only for the use of the > individuality or > entity to which it is addressed and may contain information that is > privileged, > confidential and exempt from disclosure. If you are not the intended > recipient you are > notified that any dissemination, distribution, use or copying of this > communication is > strictly prohibited. If you received this message in error, please > immediately destroy > this message. > -------------------------- > > > ------------------------------------------------------------------------------ > The Planet: dedicated and managed hosting, cloud storage, colocation > Stay online with enterprise data centers and the best network in the > business > Choose flexible plans and management services without long-term contracts > Personal 24x7 support from experience hosting pros just a phone call away. > http://p.sf.net/sfu/theplanet-com > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Martin Melin ____________________________ op5 AB http://www.op5.com http://www.op5.org/ http://www.op5.com/op5/products/network-monitor/nagios/ -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From howard.allison at pva.sozvers.at Fri Jan 29 12:57:22 2010 From: howard.allison at pva.sozvers.at (Howard Allison) Date: Fri, 29 Jan 2010 12:57:22 +0100 Subject: Antwort: Re: Service Escalation In-Reply-To: <5f0bcb7c1001290333q56c4411ds3b4d72c3b63a886b@mail.gmail.com> References: <4A0B1EA1.5000305@physics.ucsd.edu> <5f0bcb7c1001290333q56c4411ds3b4d72c3b63a886b@mail.gmail.com> Message-ID: Thanks Martin, I was afraid that might be the only way... Mit freundlichen Gr??en Howard ALLISON Martin Melin wrote on 29.01.2010 12:33:24: > Martin Melin > 29.01.2010 12:39 > > An > > Howard Allison > > Kopie > > nagios-users at lists.sourceforge.net > > Thema > > Re: [Nagios-users] Service Escalation > > You want to have two escalations, one for the email and one for the > SMS. They have first_notification,last_notification set to 1,1 and > 2,2 respectively. > > Then set the contact for your service to be some contact that has > notification_period set to none. > > This will mean that the ordinary contact for the service will never > be notified (because the timeperiod never matches) but your > escalations will kick in and send out 1 email and 1 SMS per problem. > On Fri, Jan 29, 2010 at 9:51 AM, Howard Allison > wrote: > Hi group, I want to do something fairly simple, but I can't seem to > make it happen... > When a service goes into critical status, I want to send one, and > only one e-mail notification.If the service is still down after say > 15 minutes, I want to send one, and only one SMS. Is this possible? > Unless I'm missing something it seems that I will have to set the > service notification interval to something other than 0 to allow > escalations... and that would mean lots of notifications.. > Thanks in advance, > Howard > > > -------------------------- > > > ------------------------------------------------------------------------------ > The Planet: dedicated and managed hosting, cloud storage, colocation > Stay online with enterprise data centers and the best network in the business > Choose flexible plans and management services without long-term contracts > Personal 24x7 support from experience hosting pros just a phone call away. > http://p.sf.net/sfu/theplanet-com > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > > > -- > Martin Melin > ____________________________ > op5 AB > http://www.op5.com > > http://www.op5.org/ > http://www.op5.com/op5/products/network-monitor/nagios/ -------------------------- VERTRAULICHKEIT: Diese Nachricht ist ausschlie?lich f?r denjenigen bestimmt, an den sie adressiert ist und kann vertrauliche Informationen enthalten. Falls Sie nicht der Empf?nger dieser Nachricht sind, weisen wir Sie darauf hin, dass die unberechtigte Weitergabe oder Verwendung sowie das unberechtigte Verteilen oder Kopieren dieser Nachricht strikt untersagt sind. Falls Sie diese Nachricht irrt?mlich erhalten haben, vernichten Sie sie bitte sofort. CONFIDENTIALITY: This message is intended only for the use of the individuality or entity to which it is addressed and may contain information that is privileged, confidential and exempt from disclosure. If you are not the intended recipient you are notified that any dissemination, distribution, use or copying of this communication is strictly prohibited. If you received this message in error, please immediately destroy this message. -------------------------- -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Chris.Menard at Aspect.com Fri Jan 29 15:01:21 2010 From: Chris.Menard at Aspect.com (Menard, Chris) Date: Fri, 29 Jan 2010 09:01:21 -0500 Subject: event handlers and Schedule Downtime In-Reply-To: References: <96D3C579-A4A8-4C58-9D8A-44AF19191993@ena.com> <54E7F65F-634A-4966-9B10-F1A58D892599@shobe.info> Message-ID: I have a scheduled host downtime setup for Sunday. When I start to take services down, will event handlers still trigger and still try to restart what I want down? What is the easiest way to temporarily suspend event handlers? Thanks, Chris (Nagios 3.1.2 in Solaris) ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From martinm at op5.org Fri Jan 29 15:33:32 2010 From: martinm at op5.org (Martin Melin) Date: Fri, 29 Jan 2010 15:33:32 +0100 Subject: event handlers and Schedule Downtime In-Reply-To: References: <96D3C579-A4A8-4C58-9D8A-44AF19191993@ena.com> <54E7F65F-634A-4966-9B10-F1A58D892599@shobe.info> Message-ID: <5f0bcb7c1001290633m3c94d541m3593a810020fe605@mail.gmail.com> On Fri, Jan 29, 2010 at 3:01 PM, Menard, Chris wrote: > I have a scheduled host downtime setup for Sunday. When I start to take > services down, will event handlers still trigger and still try to restart > what I want down? > Yes, if your event handlers are not written to consider scheduled downtime. > > What is the easiest way to temporarily suspend event handlers? > Globally, enable_event_handlers nagios.cfg. Per service/host: event_handler_enabled in the definition. However I would recommend you to use the $HOSTDOWNTIME$ and $SERVICEDOWNTIME$ macros in your event handler definitions. If these macros are non-zero, the service/host is in scheduled downtime. This will allow you to have your event handlers decide how to proceed - maybe you'd like a different action to be taken when there's scheduled downtime. > Thanks, > Chris > > (Nagios 3.1.2 in Solaris) > > > ------------------------------------------------------------------------------ > The Planet: dedicated and managed hosting, cloud storage, colocation > Stay online with enterprise data centers and the best network in the > business > Choose flexible plans and management services without long-term contracts > Personal 24x7 support from experience hosting pros just a phone call away. > http://p.sf.net/sfu/theplanet-com > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Martin Melin ____________________________ op5 AB http://www.op5.com http://www.op5.org/ http://www.op5.com/op5/products/network-monitor/nagios/ -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Chris.Menard at Aspect.com Fri Jan 29 15:36:21 2010 From: Chris.Menard at Aspect.com (Menard, Chris) Date: Fri, 29 Jan 2010 09:36:21 -0500 Subject: event handlers and Schedule Downtime In-Reply-To: <5f0bcb7c1001290633m3c94d541m3593a810020fe605@mail.gmail.com> References: <96D3C579-A4A8-4C58-9D8A-44AF19191993@ena.com> <54E7F65F-634A-4966-9B10-F1A58D892599@shobe.info> <5f0bcb7c1001290633m3c94d541m3593a810020fe605@mail.gmail.com> Message-ID: Perfect, thanks. I think I may for, for this weekend 'Disable Active Checks' on the service group. Then get to updating my event handlers. From: Martin Melin [mailto:martinm at op5.org] Sent: Friday, January 29, 2010 9:34 AM To: Menard, Chris Cc: Nagios Mailinglist Subject: Re: [Nagios-users] event handlers and Schedule Downtime On Fri, Jan 29, 2010 at 3:01 PM, Menard, Chris > wrote: I have a scheduled host downtime setup for Sunday. When I start to take services down, will event handlers still trigger and still try to restart what I want down? Yes, if your event handlers are not written to consider scheduled downtime. What is the easiest way to temporarily suspend event handlers? Globally, enable_event_handlers nagios.cfg. Per service/host: event_handler_enabled in the definition. However I would recommend you to use the $HOSTDOWNTIME$ and $SERVICEDOWNTIME$ macros in your event handler definitions. If these macros are non-zero, the service/host is in scheduled downtime. This will allow you to have your event handlers decide how to proceed - maybe you'd like a different action to be taken when there's scheduled downtime. Thanks, Chris (Nagios 3.1.2 in Solaris) ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -- Martin Melin ____________________________ op5 AB http://www.op5.com http://www.op5.org/ http://www.op5.com/op5/products/network-monitor/nagios/ -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From yu.watanabe at jp.fujitsu.com Fri Jan 29 13:14:26 2010 From: yu.watanabe at jp.fujitsu.com (Yu Watanabe) Date: Fri, 29 Jan 2010 21:14:26 +0900 Subject: About Host Check scheduling Message-ID: <201001291214.AA01490@S2007337.jp.fujitsu.com> Hello all I have a question regarding to Host Check scheduling. I am using Nagios 3.0.6. And I see that usually host check uses on demand check for its scheduling. When the Host Check turns into non UP stautus, this scheduling seems to be flexible. Is there any rules in Nagios for the check scheduling for retry? Like if it is "Service Time Out" , nagios schedules its check with in a minute , etc. Any advice would be useful! Thank you Yu Watanabe ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From binbin.b.wang at hotmail.com Fri Jan 29 15:50:32 2010 From: binbin.b.wang at hotmail.com (Wang Binbin) Date: Fri, 29 Jan 2010 22:50:32 +0800 Subject: Confused by SW CPU load Message-ID: Hi Group I encounter a problem of Switch CPU load, could anyone help me? I use nagios to monitor SW port status through SNMP. When getting SNMP value from SW, the CPU load will surge to 96%, which case ping time out, then host alert will send out, the only method I can think of is to increase normal check interval and retry interval. As these interval increasing, frequency of notification decreases, but it still exists. Has anyone encounter the same problem? Could we have other methods? thanks in advance. Wang Binbin -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mirko.stefanelli at gmail.com Fri Jan 29 16:08:55 2010 From: mirko.stefanelli at gmail.com (mirko stefanelli) Date: Fri, 29 Jan 2010 16:08:55 +0100 Subject: problem to use check_by_ssh Message-ID: <9eaab8b81001290708l7b863eafh60ebd50c47a56b8d@mail.gmail.com> I to all, I have successfull configure nagios user on remote machine and share keys pairs. I have also configure service on cfg file on nagios for the remote machine. But i always got this error from nagios server: (Return code of 127 is out of bounds - plugin may be missing) But if I execute the same commando from CLI by usign user nagios it's work. Can someone help me? Regards, Mirko. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Matsushita.Nobuo at tchden.org Fri Jan 29 16:17:53 2010 From: Matsushita.Nobuo at tchden.org (Matsushita, Nobuo) Date: Fri, 29 Jan 2010 08:17:53 -0700 Subject: a little confused In-Reply-To: References: Message-ID: Thanks, Marc! Sorry for a little late reply since I am only subscribing the Digest version. Nobuo >> Hi, > >> I was checking the "Object Definitions" in the documentation for the 3.x. >> >> Under the "Service Definition" section, I don't see "normal_check_interval". But under the >"Monitoring Network Printers", I see "normal_check_interval". I found that >"retry_check_interval" is not defined in the "Service Definition", but used in the printers' >section. >> >> Can someone help me to clear up on these? >Both are interchangeable (for now)- >check_interval == normal_check_interval >retry_interval == retry_check_interval >The longer format is being phased out and it seems that page of the documentation was not >updated. >-- >Marc ------------------------------------------------------------------------------ CONFIDENTIALITY NOTICE: This e-mail is confidential, may be legally privileged, and for the intended recipient only. Access, disclosure, copying, forwarding and distribution by any means is strictly prohibited. If received in error, do not read but delete and e-mail confirmation to the sender. ============================================================================== ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From flyinvap at orange.fr Fri Jan 29 16:40:09 2010 From: flyinvap at orange.fr (Flyinvap) Date: Fri, 29 Jan 2010 16:40:09 +0100 Subject: problem to use check_by_ssh In-Reply-To: <9eaab8b81001290708l7b863eafh60ebd50c47a56b8d@mail.gmail.com> References: <9eaab8b81001290708l7b863eafh60ebd50c47a56b8d@mail.gmail.com> Message-ID: <20100129164009.05cfd80a@orange.fr> Le Fri, 29 Jan 2010 16:08:55 +0100, mirko stefanelli a ?crit : > (Return code of 127 is out of bounds - plugin may be missing) Could you post check command and service configuration ? -- Flyinvap ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mirko.stefanelli at gmail.com Fri Jan 29 16:47:34 2010 From: mirko.stefanelli at gmail.com (mirko stefanelli) Date: Fri, 29 Jan 2010 16:47:34 +0100 Subject: SOLVED: problem to use check_by_ssh Message-ID: <9eaab8b81001290747p62b8c33av82b3f228feb477f2@mail.gmail.com> Sorry, I have found problem, problem is in command.cfg. I have missing path for the plugins... :-( Regards, Mirko. ---------- Forwarded message ---------- From: mirko stefanelli Date: 2010/1/29 Subject: problem to use check_by_ssh To: nagios-users at lists.sourceforge.net I to all, I have successfull configure nagios user on remote machine and share keys pairs. I have also configure service on cfg file on nagios for the remote machine. But i always got this error from nagios server: (Return code of 127 is out of bounds - plugin may be missing) But if I execute the same commando from CLI by usign user nagios it's work. Can someone help me? Regards, Mirko. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Fri Jan 29 17:27:11 2010 From: marc at ena.com (Marc Powell) Date: Fri, 29 Jan 2010 10:27:11 -0600 Subject: About Host Check scheduling In-Reply-To: <201001291214.AA01490@S2007337.jp.fujitsu.com> References: <201001291214.AA01490@S2007337.jp.fujitsu.com> Message-ID: <01F8024A-90FA-4584-9E47-B60863206631@ena.com> On Jan 29, 2010, at 6:14 AM, Yu Watanabe wrote: > Hello all > > I have a question regarding to Host Check scheduling. > > I am using Nagios 3.0.6. And I see that usually host check uses > on demand check for its scheduling. > > When the Host Check turns into non UP stautus, this scheduling seems to be flexible. > > Is there any rules in Nagios for the check scheduling for retry? > > Like if it is "Service Time Out" , nagios schedules its check with in a minute , etc. retry_interval is used no matter what the reason. The only differentiation is OK v.s. non-OK status. -- Marc ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Fri Jan 29 17:29:33 2010 From: marc at ena.com (Marc Powell) Date: Fri, 29 Jan 2010 10:29:33 -0600 Subject: problem to use check_by_ssh In-Reply-To: <20100129164009.05cfd80a@orange.fr> References: <9eaab8b81001290708l7b863eafh60ebd50c47a56b8d@mail.gmail.com> <20100129164009.05cfd80a@orange.fr> Message-ID: <158B74E6-9DD0-4153-933B-9D2CF1A5165A@ena.com> On Jan 29, 2010, at 9:40 AM, Flyinvap wrote: > Le Fri, 29 Jan 2010 16:08:55 +0100, > mirko stefanelli a ?crit : > >> (Return code of 127 is out of bounds - plugin may be missing) > > Could you post check command and service configuration ? And example test runs... -- Marc ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rick.mangus+nagios at gmail.com Fri Jan 29 18:01:32 2010 From: rick.mangus+nagios at gmail.com (Rick Mangus) Date: Fri, 29 Jan 2010 11:01:32 -0600 Subject: Nagios 3.0.5 problem Message-ID: <77c454bb1001290901s5248b437yba4c50994e8bf649@mail.gmail.com> Hello, all. Forgive me, I am new to the list, and have only begun working with nagios recently. I have searched this list and googled furiously with little result, so must cease my lurking and present my problem to you. I will begin with the problem: Sometime after midnight every night, my nagios server starts to have trouble processing service checks. I don't know the cause, and cannot find a solution. I can describe the symptoms in detail and hope we can diagnose it. The web interface shows the last service check came in at 02:28:34 (EST). I know that around 4:15 every morning, xinetd starts refusing connections to nsca due to high load (max_load is 18), and that eventually I will have 32000+ nsca connections using up all available PIDs leading to an inability to fork new processes, effectively killing the machine. While all this happens, the nagios.log appears to periodically stall, making no new entries for 15 minutes at a time, and then flush 15000 in the space of a single second. Also, it seems the checkresults directory is empty most of the time, but sometimes pops up to 2045 files (it's on a ramdisk with 2048 inodes) and not a single one gets deleted in a time period I have been patient enough to observe. The periods in which the nagios log is going nowhere are accompanied by nagios taking 100% of 2 CPUs. One thread appears to poll() approximately every 25 usecs, and another is inscrutable, with mprotect() the only strace-visible syscall. All the nsca processes have a blocking write() they are waiting on. When the log is showing new entries, there are still no updates made to the services, and it seems that that is what is filling up checkresults. I admit I have not checked to find the order of the log and checkresults processes, though I assumed they would operate in the opposite order of what this appears to show. I know this behavior has been ongoing for at least 1 month. I have disabled all cron jobs that I feared might be interfering. I will answer any and all questions to the best of my ability, and hope someone here can shed some light on the situation. --Rick -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null