From jjk_saji at yahoo.com Sat Oct 1 14:36:22 2005 From: jjk_saji at yahoo.com (John Joseph) Date: Sat, 1 Oct 2005 13:36:22 +0100 (BST) Subject: SOLVED "check_process " giving errror "System call sent warnings to stderr Message-ID: <20051001123622.74464.qmail@web34815.mail.mud.yahoo.com> Hi This is with regard to my previous mail, in which I had posted about getting error, which executing check_procs with nrpe I did configure , plugins once again , now I am able to get the result of check_procs using nrpe Thanks Joseph John ****************************************************** Hi I am successful in configuring , nrpe up to some extend , I am stuck with the ?check_procs ? , All other check commands such as ?check_users?, check_load?, check_disk? are giving the correct result but for ?check_procs? , ?check_zombie_procs? ?check_total_procs? I get the following error ?System call sent warnings to stderr?? [root at john libexec]# ./check_nrpe -H 192.168.20.19 -c check_zombie_procs System call sent warnings to stderr [root at john libexec]# ./check_nrpe -H 192.168.20.19 -c check_total_procs System call sent warnings to stderr [root at john libexec]# ./check_nrpe -H 192.168.20.19 -c check_procs System call sent warnings to stderr [root at john libexec]# ./check_nrpe -H 192.168.20.19 -c check_procs_saji System call sent warnings to stderr My nrpe.cfg of 192.168.20.19 has command[check_diske2]=/usr/local/nagios/libexec/check_disk -w 20 -c 10 -p /dev/hde2 command[check_zombie_procs]=/usr/local/nagios/libexec/check_procs -w 5 -c 10 -s Z command[check_total_procs]=/usr/local/nagios/libexec/check_procs -w 150 -c 200 command[check_procs]=/usr/local/nagios/libexec/check_procs -w 150 -c 200 command[check_procs_saji]=/usr/local/nagios/libexec/check_procs I would like to request for guidance , why check_procs is giving this error and how to over come it to get the correct result Thanks Joseph John ___________________________________________________________ Yahoo! Messenger - NEW crystal clear PC to PC calling worldwide with voicemail http://uk.messenger.yahoo.com ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From barbereau at gmail.com Sun Oct 2 20:19:05 2005 From: barbereau at gmail.com (=?ISO-8859-1?Q?S=E9bastien_Barbereau?=) Date: Sun, 2 Oct 2005 20:19:05 +0200 Subject: Unable to check disk utilization on Windows Servers In-Reply-To: <1EE2544AB03AD34BAA60E9AA8AF1A8FE6C18E3@EPI-EXB2.epidirect.epi> References: <1EE2544AB03AD34BAA60E9AA8AF1A8FE6C18E3@EPI-EXB2.epidirect.epi> Message-ID: <4e0e33ee0510021119jb28f708s7ce41232fa0b9df7@mail.gmail.com> Hi, what does nagios "verify" command switch report: # /usr/sbin/nagios -v /etc/nagios/nagios.cfg ? On 9/30/05, Pavel Santos wrote: > > Hello List, > > I'm having a weird problem trying to check disk utilization on Windows > Servers. Whenever I add the scrip to my commads.cfg file nagios stops > working (services will not restart). Below is what I'm adding to my commands > and services.cfg files. > > > > define command{ > command_name check_nt_disk > command_line $USER1$/check_nt -H $HOSTADDRESS$ -p 1248 -v USEDDISKSPACE -l > $ARG1$ -w $ARG2$ -c $ARG3$ > } > > > define service{ > host_name host > service_description C: Drive Disk Space > is_volatile 0 > check_period 24x7 > max_check_attempts 3 > normal_check_interval 5 > retry_check_interval 1 > contact_groups diskcheck > notification_interval 10 > notification_period 24x7 > notification_options w,u,c,r > check_command check_nt_disk!C!85%!90% > } > > > Thanks in advance, > > > > Pavel > > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, discussions, > and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: From master at virtuosonetsolutions.com Sun Oct 2 20:58:10 2005 From: master at virtuosonetsolutions.com (Master Virtuoso) Date: Sun, 02 Oct 2005 14:58:10 -0400 Subject: Problem with Nagios web interface / install Message-ID: <0INQ00EB0Y171J00@mta2.srv.hcvlny.cv.net> I am getting this error in web interface and command prompt root at server3 [/usr/local/nagios/bin]# ./nagios -v nagios.cfg Nagios 2.0b4 Copyright (c) 1999-2005 Ethan Galstad (http://www.nagios.org) Last Modified: 08-02-2005 License: GPL Reading configuration data... Error: Command 'check-host-alive' has already been defined Error: Could not register command (config file '/usr/local/nagios/etc/minimal.cfg', starting on line 76) ***> One or more problems was encountered while processing the config files... Check your configuration file(s) to ensure that they contain valid directives and data defintions. If you are upgrading from a previous version of Nagios, you should be aware that some variables/definitions may have been removed or modified in this version. Make sure to read the HTML documentation regarding the config files, as well as the 'Whats New' section to find out what has changed. ***This is the last portion of my httpd.conf file (restarted httpd 3 times after adding this to the httpd.conf*** ScriptAlias /nagios/cgi-bin /usr/local/nagios/sbin AllowOverride AuthConfig Options ExecCGI Order allow,deny Allow from all Alias /nagios /usr/local/nagios/share Options None AllowOverride AuthConfig Order allow,deny Allow from all AllowOverride AuthConfig order allow,deny allow from all Options ExecCGI AllowOverride AuthConfig order allow,deny allow from all ***This is the default Nagios.cfg file as well (only top portion to show default settings*** log_file=/usr/local/nagios/var/nagios.log cfg_file=/usr/local/nagios/etc/checkcommands.cfg cfg_file=/usr/local/nagios/etc/misccommands.cfg cfg_file=/usr/local/nagios/etc/minimal.cfg object_cache_file=/usr/local/nagios/var/objects.cache resource_file=/usr/local/nagios/etc/resource.cfg status_file=/usr/local/nagios/var/status.dat nagios_user=nagios nagios_group=nagios check_external_commands=0 command_check_interval=-1 command_file=/usr/local/nagios/var/rw/nagios.cmd Why am I getting all these errors above? I also get 404 error in Nagios web interface when I click any link My specs: PHP 4.4.0 Apache 1.33x Mysql 4.1 Latest stable Nagios APF firewall BFD intrusion protection LSM / SIM ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jhmartin at toger.us Sun Oct 2 21:13:20 2005 From: jhmartin at toger.us (Jason Martin) Date: Sun, 2 Oct 2005 12:13:20 -0700 Subject: Problem with Nagios web interface / install In-Reply-To: <0INQ00EB0Y171J00@mta2.srv.hcvlny.cv.net> References: <0INQ00EB0Y171J00@mta2.srv.hcvlny.cv.net> Message-ID: <20051002191320.GA13782@zippy.toger.us> On Sun, Oct 02, 2005 at 02:58:10PM -0400, Master Virtuoso wrote: > Error: Command 'check-host-alive' has already been defined > Error: Could not register command (config file > '/usr/local/nagios/etc/minimal.cfg', starting on line 76) Check your nagios.cfg. Are you importing any files other than minimal.cfg? Does the command named check-host-alive appear move than once in minimal.cfg? -Jason Martin -- Nitrate: Lower than the day rate. This message is PGP/MIME signed. -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 211 bytes Desc: not available URL: From chris at aidworld.org Sun Oct 2 23:46:51 2005 From: chris at aidworld.org (Chris Wilson) Date: Sun, 02 Oct 2005 22:46:51 +0100 Subject: more details about the service status interfaces In-Reply-To: References: Message-ID: <1128289611.11001.0.camel@localhost> Hi Enediel, > Refreshing every 5 seconds the web page for the service status, sometimes > only a few hosts appear, sometimes everything look fine ( all host and > services), and sometimes some services take the unknow status, I don't > understand why it's happening when nagios check the services every 3 > minutes. Sounds like you still have multiple Nagios processes running. Kill them all and start a new one. Cheers, Chris. -- (aidworld) chris wilson | chief engineer (chris at aidworld.org) ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From james.peel at altinity.com Sun Oct 2 13:34:46 2005 From: james.peel at altinity.com (James Peel) Date: Sun, 2 Oct 2005 12:34:46 +0100 Subject: refreshing too fast In-Reply-To: <20051002031122.85221881D5@sc8-sf-spam1.sourceforge.net> References: <20051002031122.85221881D5@sc8-sf-spam1.sourceforge.net> Message-ID: Hi > I installed nagios over debian 3.1, is working, but when I refres the page > of Status Grid For All Host Groups, sometimes eveything seem to be fine > (green), and a few seconds after, some services take the unknown status. > where I can control this effect? I suspect there are multiple instances of Nagios running so you are getting inconsistent results. Recommend shutting down Nagios and then using pgrep / pkill to ensure no Nagios processes are still running. Have found this problem sometimes occurs when you restart Nagios via the standard init script: /etc/init.d/nagios restart. James ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Sat Oct 1 19:55:07 2005 From: marc at ena.com (Marc Powell) Date: Sat, 1 Oct 2005 12:55:07 -0500 Subject: refreshing too fast Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of enediel gonzalez > Sent: Friday, September 30, 2005 1:22 PM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] refreshing too fast > > > Hello > > I installed nagios over debian 3.1, is working, but when I refres the page > of Status Grid For All Host Groups, sometimes eveything seem to be fine > (green), and a few seconds after, some services take the unknown status. You almost certainly have multiple nagios daemons running at the same time. Stop nagios, verify that they're all dead and restart nagios. -- marc ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jjk_saji at yahoo.com Sat Oct 1 10:23:54 2005 From: jjk_saji at yahoo.com (John Joseph) Date: Sat, 1 Oct 2005 09:23:54 +0100 (BST) Subject: "check_process " giving errror "System call sent warnings to stderr" Message-ID: <20051001082354.84413.qmail@web34814.mail.mud.yahoo.com> Hi I am successful in configuring , nrpe up to some extend , I am stuck with the ?check_procs ? , All other check commands such as ?check_users?, check_load?, check_disk? are giving the correct result but for ?check_procs? , ?check_zombie_procs? ?check_total_procs? I get the following error ?System call sent warnings to stderr?? [root at john libexec]# ./check_nrpe -H 192.168.20.19 -c check_zombie_procs System call sent warnings to stderr [root at john libexec]# ./check_nrpe -H 192.168.20.19 -c check_total_procs System call sent warnings to stderr [root at john libexec]# ./check_nrpe -H 192.168.20.19 -c check_procs System call sent warnings to stderr [root at john libexec]# ./check_nrpe -H 192.168.20.19 -c check_procs_saji System call sent warnings to stderr My nrpe.cfg of 192.168.20.19 has command[check_diske2]=/usr/local/nagios/libexec/check_disk -w 20 -c 10 -p /dev/hde2 command[check_zombie_procs]=/usr/local/nagios/libexec/check_procs -w 5 -c 10 -s Z command[check_total_procs]=/usr/local/nagios/libexec/check_procs -w 150 -c 200 command[check_procs]=/usr/local/nagios/libexec/check_procs -w 150 -c 200 command[check_procs_saji]=/usr/local/nagios/libexec/check_procs I would like to request for guidance , why check_procs is giving this error and how to over come it to get the correct result Thanks Joseph John ___________________________________________________________ To help you stay safe and secure online, we've developed the all new Yahoo! Security Centre. http://uk.security.yahoo.com ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From chris at aidworld.org Sat Oct 1 17:27:15 2005 From: chris at aidworld.org (Chris Wilson) Date: Sat, 01 Oct 2005 16:27:15 +0100 Subject: refreshing too fast In-Reply-To: References: Message-ID: <1128180432.1699.0.camel@localhost> Hi Enediel, > I installed nagios over debian 3.1, is working, but when I refres the page > of Status Grid For All Host Groups, sometimes eveything seem to be fine > (green), and a few seconds after, some services take the unknown status. It sounds like you have multiple copies of Nagios running. Try shutting down all Nagios processes, then start a fresh one. Cheers, Chris. -- (aidworld) chris wilson | chief engineer (chris at aidworld.org) ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From james at lovedthanlost.net Sat Oct 1 13:37:36 2005 From: james at lovedthanlost.net (James Turnbull) Date: Sat, 01 Oct 2005 21:37:36 +1000 Subject: NRPE - daemon versus xinetd - pros and cons In-Reply-To: <20050929230559.52674.qmail@web30506.mail.mud.yahoo.com> References: <20050929230559.52674.qmail@web30506.mail.mud.yahoo.com> Message-ID: <433E7500.80308@lovedthanlost.net> Richard DeWath wrote: > I am interested in hearing from anyone who has insight > on to whether running the daemon is more efficient > than xinetd for nagios (I mean real difference not > minor overhead differences); Is there more security > with xinetd and wrappers in this case? Anyone who has > tried both, I would be interested in your feedback. I > have been using xinted/inetd without real problems > (some nrpe timeouts), but there are people who want to > use the daemon option. I can do this, but hate to > Personally I find inetd/xinetd adds considerable overhead to daemons. I run NRPE in daemon mode and use iptables or pf rules to lock down the incoming and outgoing ports and IP addresses. IMHO - inetd/xinetd/tcpwrappers are outmoded and really only useful for very dumb daemons - r-tools, finger, ftp - which really you shouldn't be running anyways - their clear-text implementations make for lousy security. Regards James Turnbull -- James Turnbull --- Author of Hardening Linux, Apress (http://www.amazon.com/exec/obidos/tg/detail/-/1590594444/) --- PGP Key (http://pgp.mit.edu:11371/pks/lookup?op=get&search=0x0C42DF40) ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From james.peel at altinity.com Sun Oct 2 13:37:42 2005 From: james.peel at altinity.com (James Peel) Date: Sun, 2 Oct 2005 12:37:42 +0100 Subject: good working GSM terminal to use with nagios? Message-ID: Hi Roy > I'm looking for a good GSM terminal to use with Nagios to send alerts to GSM phones. > Connectivity should be serial/RS-232 and I don't need too much apart from stability. We use Fargo Maestro 100 GSM modems for SMS messaging and have been impressed so far. They take a standard AT command set and are very reliable. http://www.fargotelecom.com/gb/products/meastro_modems.htm If you contact Fargo Telecom they'll put you in touch with a local reseller. We use RF Solutions: http://www.rfsolutions.co.uk/ If you are interested we have some code designed to allow Nagios to send SMS messages via these devices. Regards, James Peel http://www.altinity.com/ T: +44 (0)870 787 9243 F: +44 (0)845 280 1725 Skype: james_altinity ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From quanah at stanford.edu Sat Oct 1 06:26:43 2005 From: quanah at stanford.edu (Quanah Gibson-Mount) Date: Fri, 30 Sep 2005 21:26:43 -0700 Subject: Adapters for Nagios (BMC, MOM) Message-ID: Does anyone have a BMC->Nagios adapter (allows Nagios to receive alerts from BMC patrol) or a MOM->Nagios adapter (allows Nagios to receive alerts from MOM)? Thanks, Quanah -- Quanah Gibson-Mount Principal Software Developer ITSS/Shared Services Stanford University GnuPG Public Key: http://www.stanford.edu/~quanah/pgp.html "These censorship operations against schools and libraries are stronger than ever in the present religio-political climate. They often focus on fantasy and sf books, which foster that deadly enemy to bigotry and blind faith, the imagination." -- Ursula K. Le Guin ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mitsuto at gmail.com Sun Oct 2 22:56:11 2005 From: mitsuto at gmail.com (Marcel Sugano) Date: Sun, 2 Oct 2005 17:56:11 -0300 Subject: Problem with Nagios web interface / install In-Reply-To: <20051002191320.GA13782@zippy.toger.us> References: <0INQ00EB0Y171J00@mta2.srv.hcvlny.cv.net> <20051002191320.GA13782@zippy.toger.us> Message-ID: On 10/2/05, Jason Martin wrote: > On Sun, Oct 02, 2005 at 02:58:10PM -0400, Master Virtuoso wrote: > > Error: Command 'check-host-alive' has already been defined > > Error: Could not register command (config file > > '/usr/local/nagios/etc/minimal.cfg', starting on line 76) > Check your nagios.cfg. Are you importing any files other than > minimal.cfg? Does the command named check-host-alive appear move > than once in minimal.cfg? > > -Jason Martin > -- > Nitrate: Lower than the day rate. > This message is PGP/MIME signed. > > > The default conf-files that come whitin the source are a little misconfigured. You should comment the entries that refer to command definition on minimal.cfg, or stop loading the command.cfg||checkcommands.cfg||misccommands.cfg on nagios.cfg. -- Marcel Mitsuto Fucatu Sugano mitsuto at gmail.com ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jhmartin at toger.us Mon Oct 3 04:38:49 2005 From: jhmartin at toger.us (Jason Martin) Date: Sun, 2 Oct 2005 19:38:49 -0700 Subject: Adapters for Nagios (BMC, MOM) In-Reply-To: References: Message-ID: <20051003023848.GC13782@zippy.toger.us> On Fri, Sep 30, 2005 at 09:26:43PM -0700, Quanah Gibson-Mount wrote: > Does anyone have a BMC->Nagios adapter (allows Nagios to receive alerts > from BMC patrol) or a MOM->Nagios adapter (allows Nagios to receive alerts > from MOM)? What do you mean by 'receive alerts'? Nagios is a service monitor, not an event management tool. I -Jason Martin -- Chernobyl used Windows This message is PGP/MIME signed. -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 211 bytes Desc: not available URL: From jawale_prashant at extenprise.net Mon Oct 3 17:10:41 2005 From: jawale_prashant at extenprise.net (Prashant Jawale) Date: 03 Oct 2005 10:10:41 -0500 Subject: How to configure nagios2.0b4 with nagios-db..... Message-ID: <1128352240.5448.5.camel@localhost.localdomain> Hello to all board, I want to configure nagios with postgresql such that nagios stores it's check results and runtime information in postgresql. I am using "nagios2.0b4" and additional module "nagios-db". I really can't figure out how to use nagios-db to get work done. Please does anyone knows how to use nagios-db with nagios. May be any tutorial,any site, any help.... Thanks in advance for all help, it's really urgent... -Prash -- This message has been scanned for viruses and dangerous content by Extenprise mailscanner, and is believed to be clean. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mohamed.azizi at belgacom.be Mon Oct 3 10:20:19 2005 From: mohamed.azizi at belgacom.be (mohamed.azizi at belgacom.be) Date: Mon, 3 Oct 2005 10:20:19 +0200 Subject: CHECK NRPE time out after 50 seconds Message-ID: <5F3043372274524C967EB597009D2A0E0241C393@AE0008.BGC.NET> Dear colleagues , I am using nagios 1.2 on linux server to check process on both servers which runs on linux. When we have lot of load on the servers , Nagios receives check nrpe time out after 50 seconds. After 3 times , this service is put on critical status .Normally , when a check is on CRITICAL , Nagios launchs a scripts to restart the system and we have a lot of restart on each servers due to the timeout of checknrpe. How can I avoid this kind of problem.I have increased the timer od NRPE from 30 seconds to 50 seconds but without succes. Can we configure nagios to have only warning state for a checknrpe timeout Thanks for your support Mohamed -------------- next part -------------- An HTML attachment was scrubbed... URL: From ton.voon at altinity.com Mon Oct 3 11:06:17 2005 From: ton.voon at altinity.com (Ton Voon) Date: Mon, 3 Oct 2005 10:06:17 +0100 Subject: Problem with check_oracle In-Reply-To: <5383c62b0509300729h739b7d64k1fa6c2df339e7a61@mail.gmail.com> References: <44f056cc05092909022728d156@mail.gmail.com> <5383c62b0509300611l1abe7f96x91822552cad14afb@mail.gmail.com> <5383c62b0509300612q2660033dlb21cd6f99fd8613a@mail.gmail.com> <433D4536.5030101@op5.se> <5383c62b0509300729h739b7d64k1fa6c2df339e7a61@mail.gmail.com> Message-ID: <7AA0D62F-603E-4E5D-87E9-0E8544C35009@altinity.com> On 30 Sep 2005, at 15:29, Hans Engelen wrote: > > > On 9/30/05, Andreas Ericsson wrote: > It's the LANG environment variable. NLS_LANG has no special meaning. > > Nagios uses the library wrapper popen() to execute other programs (for > now, anyways). popen()'ed children inherit the environment from the > parent (but doesn't read any of the profile or rc-files; sh -c is > called > in to execute the program in question). At least on Linux and *BSD, > although I can't imagine those systems going through the extra trouble > of copying the environment data unless it was in POSIX. > > That sorta fits in then with what was going on. Depending on the > client version you have installed (and I took the version 10 client > of course, grumble) those variables have to be set though or your > check_nagios goes horribly wrong. I think I even went sofar as to > try setting them from withing check_nagios itself at one point > before getting to the sqlclient part but ... > > Is there any ... ehm ... approved solution for that ? > I don't have access to an Oracle system to try, but when I used to use Oracle 8 clients, configuring the oratab file (to point to the ORACLE_HOME for a particular sid or * for wildcards) and configuring the tnsnames.ora file (in $ORACLE_HOME/network/tnsnames.ora) was usually sufficient for the check_oracle script to find the Oracle executable. I didn't use the --tns option. We found that the --login option was better - this told you information about why a database was not contactable, including if TNS was not running on a remote server. Since this works over the network, you can setup your Nagios server to run check_oracle and it will use the Oracle executables on your Nagios server to connect to the remote database, so it is not necessary to do the check via nrpe. Ton http://www.altinity.com T: +44 (0)870 787 9243 F: +44 (0)845 280 1725 Skype: tonvoon -------------- next part -------------- An HTML attachment was scrubbed... URL: From moshesharon at gmail.com Mon Oct 3 11:18:03 2005 From: moshesharon at gmail.com (Moshe Sharon) Date: Mon, 3 Oct 2005 09:18:03 +0000 (UTC) Subject: Adapters for Nagios (BMC, MOM) References: Message-ID: Hello altough nagios is not an event console it can become one. im not very familier with BMC / MOM software but we did integration with other software. there are severl ways to do that. 1. "write" directly to PIPE file the event you want using scripts that determines what severity to notify. 2. use nsca alaram a trigger with BMC / MOM (there is nsca version for windows) and notify the event. 3. use REL send email to nagios. 4 you can also send SNMP TRAPS from those monitor software and the list can go on. just rembmer to read docs about volatile hope it helped Moshe Sharon ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jose.munoz at proyectosbiz.com Mon Oct 3 17:09:33 2005 From: jose.munoz at proyectosbiz.com (Jose Orlando Muņoz B) Date: Mon, 03 Oct 2005 10:09:33 -0500 Subject: NRPE, over old version of HPUX (10.01) Message-ID: Hi all, I'm looking some plugig to monitoring my OLD servers, I read all about NRPE and this is the solution I need. However I can't get it functioning over my Hpux V10.01, I tried to install the precompiled version "nrpe-1.8-B.10.20" but i get this error ./nrpe --help /usr/lib/dld.sl: Invalid version for shared library: /usr/lib/libc.1 /usr/lib/dld.sl: Exec format error Abort(coredump) I try to compiled the 1.8 source and I get a lot of errors. I have problem even installing Perl the only perl i get installed is v5.002. I have 20 of this servers and is imposible to monitor manualy. Thanks a lot for any advisor Jose. -- ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ladams at cloudmark.com Mon Oct 3 17:54:33 2005 From: ladams at cloudmark.com (Lori Adams) Date: Mon, 3 Oct 2005 08:54:33 -0700 Subject: CHECK NRPE time out after 50 seconds Message-ID: I wrote a small script to pipe the state and output to. If the output said "CHECK_NRPE: Socket timeout after 50 seconds", then exit(1). In the checkcommand is where you set up the pipe. define checkcommand { command_name check_nrpe_ignore command_line /your/path/to/check_nrpe -H $HOSTADDRESS -c nrpe_command -a (if you have args) | /your/path/to/nrpe/ignore/script $SERVICESTATE$ } The other option is to turn up your nrpe timeout, but that's not a very good solution, as it just means it will take longer to timeout. -Lori ________________________________ From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of mohamed.azizi at belgacom.be Sent: Monday, October 03, 2005 1:20 AM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] CHECK NRPE time out after 50 seconds Dear colleagues , I am using nagios 1.2 on linux server to check process on both servers which runs on linux. When we have lot of load on the servers , Nagios receives check nrpe time out after 50 seconds. After 3 times , this service is put on critical status .Normally , when a check is on CRITICAL , Nagios launchs a scripts to restart the system and we have a lot of restart on each servers due to the timeout of checknrpe. How can I avoid this kind of problem.I have increased the timer od NRPE from 30 seconds to 50 seconds but without succes. Can we configure nagios to have only warning state for a checknrpe timeout Thanks for your support Mohamed -------------- next part -------------- An HTML attachment was scrubbed... URL: From hamid.forough at amd.com Mon Oct 3 18:03:43 2005 From: hamid.forough at amd.com (Forough, Hamid) Date: Mon, 3 Oct 2005 11:03:43 -0500 Subject: timing Message-ID: <360C39FD92313F48A74A60471FA38E87013559AC@SAUSEXMB1.amd.com> Hello There. I am a new user to the nagios version 2.0b4 running on a linux As 2.1. I have defined the services, hosts and timeperiods. My problem is as I am monitoring the servers, I immediately shutdown one of my servers hoping the status turns from GREEN to RED. It does not do that unless I re-start the web server. What am I doing wrong? Thanks, Hamid Advanced Micro Devices W: 512.602.4704 C: 512.971-5345 E: hamid.forough at amd.com -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image001.gif Type: image/gif Size: 2669 bytes Desc: image001.gif URL: From hamid.forough at amd.com Mon Oct 3 18:32:47 2005 From: hamid.forough at amd.com (Forough, Hamid) Date: Mon, 3 Oct 2005 11:32:47 -0500 Subject: FW: timing .. PLEASE HELP Message-ID: <360C39FD92313F48A74A60471FA38E87013559AD@SAUSEXMB1.amd.com> ________________________________ From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Forough, Hamid Sent: Monday, October 03, 2005 11:04 AM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] timing Hello There. I am a new user to the nagios version 2.0b4 running on a linux As 2.1. I have defined the services, hosts and timeperiods. My problem is as I am monitoring the servers, I immediately shutdown one of my servers hoping the status turns from GREEN to RED. It does not do that unless I re-start the web server. What am I doing wrong? Thanks, Hamid Advanced Micro Devices W: 512.602.4704 C: 512.971-5345 E: hamid.forough at amd.com -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image001.gif Type: image/gif Size: 2669 bytes Desc: image001.gif URL: From hamid.forough at amd.com Mon Oct 3 19:41:13 2005 From: hamid.forough at amd.com (Forough, Hamid) Date: Mon, 3 Oct 2005 12:41:13 -0500 Subject: timing .. PLEASE HELP Message-ID: <360C39FD92313F48A74A60471FA38E87013559AE@SAUSEXMB1.amd.com> Why am I getting this kicked back to myself again ________________________________ From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Forough, Hamid Sent: Monday, October 03, 2005 11:33 AM To: nagios-users at lists.sourceforge.net Subject: FW: [Nagios-users] timing .. PLEASE HELP ________________________________ From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Forough, Hamid Sent: Monday, October 03, 2005 11:04 AM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] timing Hello There. I am a new user to the nagios version 2.0b4 running on a linux As 2.1. I have defined the services, hosts and timeperiods. My problem is as I am monitoring the servers, I immediately shutdown one of my servers hoping the status turns from GREEN to RED. It does not do that unless I re-start the web server. What am I doing wrong? Thanks, Hamid Advanced Micro Devices W: 512.602.4704 C: 512.971-5345 E: hamid.forough at amd.com -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image001.gif Type: image/gif Size: 2669 bytes Desc: image001.gif URL: From al at its-lehmann.de Mon Oct 3 20:17:24 2005 From: al at its-lehmann.de (Arno Lehmann) Date: Mon, 03 Oct 2005 20:17:24 +0200 Subject: timing .. PLEASE HELP In-Reply-To: <360C39FD92313F48A74A60471FA38E87013559AE@SAUSEXMB1.amd.com> References: <360C39FD92313F48A74A60471FA38E87013559AE@SAUSEXMB1.amd.com> Message-ID: <434175B4.1000804@its-lehmann.de> On 03.10.2005 19:41, Forough, Hamid wrote: > > > Why am I getting this kicked back to myself again Because this is a mailing list and you are subscribed to it? > * From: * nagios-users-admin at lists.sourceforge.net > [mailto:nagios-users-admin at lists.sourceforge.net] *On Behalf Of > *Forough, Hamid > *Sent:* Monday, October 03, 2005 11:33 AM > *To:* nagios-users at lists.sourceforge.net > *Subject:* FW: [Nagios-users] timing .. PLEASE HELP In fact I assumed you needed help... no reason to shout. > > * From: * nagios-users-admin at lists.sourceforge.net > [mailto:nagios-users-admin at lists.sourceforge.net] *On Behalf Of > *Forough, Hamid > *Sent:* Monday, October 03, 2005 11:04 AM > *To:* nagios-users at lists.sourceforge.net > *Subject:* [Nagios-users] timing > > > > Hello There. I am a new user to the nagios version 2.0b4 running on a > linux As 2.1. I have defined the services, hosts and timeperiods. My > problem is as I am monitoring the servers, I immediately shutdown one of > my servers hoping the status turns from GREEN to RED. It does not do > that unless I re-start the web server. What am I doing wrong? What check_intervals etc. have you set? Depending on your configuration, it might take a while for nagios to consider a host down. Probably you have more than one instance of nagios running. Kill all nagios processes, wait until they are all gone, and start nagios using the start script again. See what happens. If this doesn't solve the problem, it might be better if you gave us the relevant parts of your configuration. Arno -- IT-Service Lehmann al at its-lehmann.de Arno Lehmann http://www.its-lehmann.de ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From quanah at stanford.edu Mon Oct 3 20:26:34 2005 From: quanah at stanford.edu (Quanah Gibson-Mount) Date: Mon, 03 Oct 2005 11:26:34 -0700 Subject: Adapters for Nagios (BMC, MOM) In-Reply-To: References: Message-ID: <3497196F89CDD24B4D15D6B7@cadabra-dsl.stanford.edu> --On Monday, October 03, 2005 9:18 AM +0000 Moshe Sharon wrote: > Hello > > altough nagios is not an event console it can become one. im not very > familier with BMC / MOM software but we did integration with other > software. there are severl ways to do that. 1. "write" directly to PIPE > file the event you want using scripts that determines what severity to > notify. 2. use nsca alaram a trigger with BMC / MOM (there is nsca > version for windows) and notify the event. 3. use REL send email to > nagios. 4 you can also send SNMP TRAPS from those monitor software and > the list can go on. just rembmer to read docs about volatile > > hope it helped Thanks... This doesn't really help much, because I'm already quite aware about how to make Nagios react to events on other systems. I was really more curious as to whether or not anyone had set up a system that forwards data from MOM/BMC to Nagios using passive checks. --Quanah -- Quanah Gibson-Mount Principal Software Developer ITSS/Shared Services Stanford University GnuPG Public Key: http://www.stanford.edu/~quanah/pgp.html "These censorship operations against schools and libraries are stronger than ever in the present religio-political climate. They often focus on fantasy and sf books, which foster that deadly enemy to bigotry and blind faith, the imagination." -- Ursula K. Le Guin ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From davea at support.kcm.org Mon Oct 3 21:08:19 2005 From: davea at support.kcm.org (Dave Augustus) Date: Mon, 03 Oct 2005 14:08:19 -0500 Subject: Anyone Using QPage? In-Reply-To: <6.1.1.1.2.20050930115312.0313dc18@pop.nam.slb.com> References: <6.1.1.1.2.20050930115312.0313dc18@pop.nam.slb.com> Message-ID: <1128366499.1581.1.camel@kcm40202> I use this one on Centos4(Redhat RHEL4). #!/bin/sh # # Startup for QuickPage # # chkconfig: 345 85 15 # description: QuickPage is a client/server software package that # enables you to send messages to an alphanumeric pager. The client # accepts a message from the user and forwards it to a server using # SNPP. The server uses a modem to transmit the message to the # recipient's paging service using the TAP protocol (also known as the IXO protocol). # processname: qpage # config: /etc/qpage.cf # Source function library. . /etc/rc.d/init.d/functions case "$1" in start) echo -n "Starting QuickPage daemon" /usr/bin/screen -d -m /usr/local/bin/qpage -d -q10 # /usr/local/bin/qpage -d -q10 if [ $? = 0 ] then echo_success else echo_failure fi echo ;; stop) echo -n "Stopping QuickPage daemon" killproc qpage if [ $? = 0 ] then echo_success else echo_failure fi echo ;; restart) $0 stop $0 start ;; *) echo "Usage: /etc/init.d/qpage { start | stop | restart }" ;; esac exit 0 On Fri, 2005-09-30 at 11:55 -0500, Doug Veldhuisen wrote: > > Joe: > If your talking about QuickPage then all you need to do is in > your /etc/rc.d/rc.local file > just add a line like this: /usr/local/bin/qpage -q 7 > > the -q 7 was to search the queue every 7 seconds, qpage -h gives all > the options. > > > Doug > > > > At 11:44 AM 9/30/2005, Joe Regular wrote: > > I've just installed QPage but cannot figure out how to get it to > > load upon system startup as a deamon. Does anyone have a startup > > script that they can send me? > > > > Thank you, > > Joe > > > > > > > > ------------------------------------------------------- > > This SF.Net email is sponsored by: > > Power Architecture Resource Center: Free content, downloads, > > discussions, > > and more. http://solutions.newsforge.com/ibmarch.tmpl > > _______________________________________________ > > Nagios-users mailing list > > Nagios-users at lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when > > reporting any issue. > > ::: Messages without supporting info will risk being sent > > to /dev/null -- Dave Augustus ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From enediel at hotmail.com Mon Oct 3 22:20:26 2005 From: enediel at hotmail.com (enediel gonzalez) Date: Mon, 03 Oct 2005 20:20:26 +0000 Subject: installing nagios from debian's package on debian 3.1(sarge) stable Message-ID: Hello I've a box with nagios up and running installed on /usr/local/nagios I'm preparing a second box, with debian sarge stable version, I installed nagios from the packages included on debian, but now nagios was installed on /etc/nagios. The documentation references to /usr/local/nagios, and mention a set of subdirectories that should exist, nothing except the stylesheets subdirectory appear on /etc/nagios after this new installation Looks like I need a little more informacion to pass transparently the configuration I've on the first box to the new box Thanks in advance for any help Enediel Linux user 398956 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From john at stilen.com Mon Oct 3 23:50:21 2005 From: john at stilen.com (John Stile) Date: Mon, 03 Oct 2005 14:50:21 -0700 Subject: installing nagios from debian's package on debian 3.1(sarge) stable In-Reply-To: References: Message-ID: <1128376221.17093.36.camel@localhost> On Mon, 2005-10-03 at 20:20 +0000, enediel gonzalez wrote: > Hello > > I've a box with nagios up and running installed on /usr/local/nagios > > I'm preparing a second box, with debian sarge stable version, I installed > nagios from the packages included on debian, but now nagios was installed on > /etc/nagios. > > The documentation references to /usr/local/nagios, and mention a set of > subdirectories that should exist, nothing except the stylesheets > subdirectory appear on /etc/nagios after this new installation > > Looks like I need a little more informacion to pass transparently the > configuration I've on the first box to the new box > > Thanks in advance for any help > Enediel > Linux user 398956 > What is your question? I have setup nagios 1.3 on Debian 3.1 with deb packages, and while I haven't figured out nagiosgraphs, Nagios with mysql works. Hopefully these notes will be helpful to you: # Install packages aptitude install nagios-mysql \ nagios-common \ nagios-plugins \ nagios-statd-client \ nagios-statd-server \ nagios-stat-client \ iputils-ping \ nsca \ nagios-nrpe-plugin \ nagios-nrpe-server \ nagios-nrpe-doc \ qstat fping libnet-snmp-perl libradius1 # Location of some files: Apache conf: /etc/apache2/conf.d/nagios Apache pass: /etc/nagios/htpasswd.users Nagios cgi-bin: /usr/lib/cgi-bin/nagios Nagios htdocs: /usr/share/nagios/htdocs Nagios logs: /var/log/nagios/nagios.log NSCA: /etc/send_nsca.cfg /etc/nsca.cfg NRPE Server: /etc/nagios/nrpe.cfg /etc/nagios/nrpe_local.cfg # Setup passwords sudo htpasswd -c /etc/nagios/htpasswd.users nagiosadmin # Fix perms on nagios-plugins chown -R nagios:www-data /etc/nagios-plugins # Databases setup mysqladmin -u root -p create nagios zcat /usr/share/doc/nagios-mysql/create_mysql.gz | mysql -u root -p nagios echo > mysql nagios -uroot -p ?,()= illegal_macro_output_chars=`~$&|'"<> admin_email=nagios admin_pager=pagenagios # Add your hosts cp hosts.cfg hosts.cfg.orig vi hosts.cfg # 'one_host.stilen.com' host definition define host{ use generic-host host_name one_host.stilen.com alias one_host address 192.169.50.195 check_command check-host-alive max_check_attempts 10 notification_interval 480 notification_period 24x7 notification_options d,u,r } # Add host to hostgroup. vi hostgroups.cfg # Default gateway host group definition define hostgroup{ hostgroup_name gateways alias Routers and Gateways contact_groups linux-admins members gw } # 'linux-boxes' host group definition define hostgroup{ hostgroup_name linux-boxes alias Linux Servers contact_groups linux-admins members one_host.stilen.com } # Crate a service to monitor for the host cp services.cfg services.cfg.orig vi services.cfg Change all contact_groups to linux-admins Basically one definition section per service per host. i.e. # Service definition define service{ use generic-service host_name one_host.stilen.com service_description PING is_volatile 0 check_period 24x7 max_check_attempts 3 normal_check_interval 5 retry_check_interval 1 contact_groups linux-admins notification_interval 240 notification_period 24x7 notification_options c,r check_command check_ping!100.0,20%!500.0,60% } Create an escalation cp escalations.cfg escalations.cfg.orig vi escalations.cfg # Serviceescalation definition define serviceescalation{ host_name one_host.stilen.com service_description PING first_notification 2 last_notification 6 contact_groups linux-admins notification_interval 0 } # Add a user to contact group cp contactgroups.cfg contactgroups.cfg.orig vi contactgroups.cfg # 'linux-admins' contact group definition define contactgroup{ contactgroup_name linux-admins alias Linux Administrators members johns } cp contacts.cfg contacts.cfg.orig vi contacts.cfg # 'johns' contact definition define contact{ contact_name johns alias Nagios Admin service_notification_period 24x7 host_notification_period 24x7 service_notification_options w,u,c,r host_notification_options d,u,r service_notification_commands notify-by-email host_notification_commands host-notify-by-email,host-notify-by-epager email joe at shmo.com pager pagenagios-admin at localhost } # Allow Nagios to see it's onw process info cp /usr/share/doc/nagios-common/check_nagios_db /etc/nagios/check_nagios_db chmod +x /etc/nagios/check_nagios_db # Configure nagios to use mysql for status info vi cgi.cfg Comment out: #nagios_check_command=/usr/lib/nagios/plugins/check_nagios /var/log/nagios/status.log 5 'nagios' Uncomment: nagios_check_command=/etc/nagios/check_nagios_db # Restart the daemon /etc/init.d/nagios restart ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From circlefun at hotmail.com Tue Oct 4 01:01:38 2005 From: circlefun at hotmail.com (circle fun) Date: Mon, 03 Oct 2005 16:01:38 -0700 Subject: problem - sticky acknowledgement through external command Message-ID: Hi everyone, I wanted to use external command to make a sticky acknowledgement for a service problem. Here is the command format I used: [1128150002] ACKNOWLEDGE_SVC_PROBLEM;;;1;1;1;; I can acknowledge the problem successfully. However, once the service's state went from CRITICAL to WARNING, the acknowledgement got removed. When the service's state went from WARNING back to CRITICAL a few minutes later. I got another alert. I don't want to have to re-acknowlege the problem everytime it changes state. In other words, I want a sticky acknowledgement. Anyone have any advice for me. Here is the environment I have OS: fedora 4 Nagios's version: 2.0b4 what else do I need to put here? _________________________________________________________________ Express yourself instantly with MSN Messenger! Download today - it's FREE! http://messenger.msn.click-url.com/go/onm00200471ave/direct/01/ ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From john at stilen.com Tue Oct 4 02:23:48 2005 From: john at stilen.com (John Stile) Date: Mon, 03 Oct 2005 17:23:48 -0700 Subject: help configuring nagios-mysql with nagiosgraph Message-ID: <1128385428.17094.77.camel@localhost> Hays anyone configured nagios 1.3 with mysql and nagiosgraph-0.5? Nagiosgraph is not creating rrd tables on my debian system. Is there a trick config if nagiosgraph us used with mysql enabled nagios? The rrd directory is world writable, but no databases are created. The nagiosgraph log shows 'perfdata not recognized' Fri Sep 30 18:06:09 2005 INSERT debug: Config logfile:/var/log/nagiosgraph/nagiosgraph.log Fri Sep 30 18:06:09 2005 INSERT debug: Config rrddir:/usr/share/nagios/nagiosgraph/rrd Fri Sep 30 18:06:09 2005 INSERT debug: Config mapfile:/usr/share/nagios/nagiosgraph/map Fri Sep 30 18:06:09 2005 INSERT debug: Config rrdtool:/usr/bin/rrdtool Fri Sep 30 18:06:09 2005 INSERT debug: Config colorscheme:1 Fri Sep 30 18:06:09 2005 INSERT debug: Config heartbeat:600 Fri Sep 30 18:06:09 2005 INSERT debug: perfdata: Fri Sep 30 18:06:09 2005 INSERT info: Input lastcheck: Fri Sep 30 18:06:09 2005 INSERT info: Input hostname: Fri Sep 30 18:06:09 2005 INSERT info: Input perfdata: Fri Sep 30 18:06:09 2005 INSERT info: Input servicedescr: Fri Sep 30 18:06:09 2005 INSERT info: Input output: Fri Sep 30 18:06:09 2005 INSERT warn: perfdata not recognized Fri Sep 30 18:06:21 2005 INSERT debug: Config logfile:/var/log/nagiosgraph/nagiosgraph.log Fri Sep 30 18:06:21 2005 INSERT debug: Config rrddir:/usr/share/nagios/nagiosgraph/rrd Fri Sep 30 18:06:21 2005 INSERT debug: Config mapfile:/usr/share/nagios/nagiosgraph/map Fri Sep 30 18:06:21 2005 INSERT debug: Config rrdtool:/usr/bin/rrdtool Fri Sep 30 18:06:21 2005 INSERT debug: Config colorscheme:1 Fri Sep 30 18:06:21 2005 INSERT debug: Config heartbeat:600 Fri Sep 30 18:06:21 2005 INSERT debug: perfdata: Fri Sep 30 18:06:21 2005 INSERT info: Input lastcheck: Fri Sep 30 18:06:21 2005 INSERT info: Input hostname: Fri Sep 30 18:06:21 2005 INSERT info: Input perfdata: Fri Sep 30 18:06:21 2005 INSERT info: Input servicedescr: Fri Sep 30 18:06:21 2005 INSERT info: Input output: Fri Sep 30 18:06:21 2005 INSERT warn: perfdata not recognized Fri Sep 30 18:09:00 2005 CGI debug: Config logfile:/var/log/nagiosgraph/nagiosgraph.log Fri Sep 30 18:09:00 2005 CGI debug: Config rrddir:/usr/share/nagios/nagiosgraph/rrd Fri Sep 30 18:09:00 2005 CGI debug: Config mapfile:/usr/share/nagios/nagiosgraph/map Fri Sep 30 18:09:00 2005 CGI debug: Config rrdtool:/usr/bin/rrdtool Fri Sep 30 18:09:00 2005 CGI debug: Config colorscheme:1 Fri Sep 30 18:09:00 2005 CGI debug: Config heartbeat:600 Fri Sep 30 18:09:00 2005 CGI debug: Config logfile:/var/log/nagiosgraph/nagiosgraph.log Fri Sep 30 18:09:00 2005 CGI debug: Config rrddir:/usr/share/nagios/nagiosgraph/rrd Fri Sep 30 18:09:00 2005 CGI debug: Config mapfile:/usr/share/nagios/nagiosgraph/map Fri Sep 30 18:09:00 2005 CGI debug: Config rrdtool:/usr/bin/rrdtool Fri Sep 30 18:09:00 2005 CGI debug: Config colorscheme:1 Fri Sep 30 18:09:00 2005 CGI debug: Config heartbeat:600 Fri Sep 30 18:09:00 2005 CGI debug: @db= Fri Sep 30 18:09:00 2005 CGI info: Listing host_home%2Estilen%2Ecom_PING db files in /usr/share/nagios/nagiosgraph/rrd: Fri Sep 30 18:09:00 2005 CGI info: System /usr/bin/rrdtool graph - -a PNG --start -118800 Fri Sep 30 18:09:00 2005 CGI info: System returncode 0 message My nagiosgraph.conf debug = 5 logfile = /var/log/nagiosgraph/nagiosgraph.log rrddir = /usr/share/nagios/nagiosgraph/rrd mapfile = /usr/share/nagios/nagiosgraph/map rrdtool = /usr/bin/rrdtool colorscheme = 1 heartbeat = 600 This was my insert statement put an icon in the services column on the nagios web interface: mysql> insert into serviceextinfo \ set host_name="bugzilla.msli.com", service_description="PING", notes_url="/nagiosgraph/show.cgi?host=$HOSTNAME$&service=$SERVICEDESC$", icon_image="graph.png", icon_image_alt="View Graph"; I installed with debian nagios packages. This is what my nagios setup looks like (nagios -m) Nagios 1.3 Copyright (c) 1999-2004 Ethan Galstad (nagios at nagios.org) Last Modified: 10-24-2004 License: GPL External Data I/O ----------------- Object Data: TEMPLATE Status Data: DATABASE (MySQL) Retention Data: DATABASE (MySQL) Comment Data: DATABASE (MySQL) Downtime Data: DATABASE (MySQL) Performance Data: FILE Options ------- Thank you, John ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From caldwell at gwu.edu Tue Oct 4 03:31:08 2005 From: caldwell at gwu.edu (Chris Caldwell) Date: Mon, 3 Oct 2005 21:31:08 -0400 Subject: NSCA Documentation Message-ID: <2992D1D9-3BCC-4225-A979-0C7A4697E151@gwu.edu> Hopefully a simple question, Could someone please point me to some documentation on running NSCA on a remote host? I have built and installed NSCA with no problem and my remote host can talk to the master, but I am not sure where to go next. Can the normal nagios plugins run locally on the remote host with their output piped to NSCA? There is no documentation for NSCA with the module or on the NagiosExchange site. -- Chris Caldwell Systems Engineer, Enterprise Systems The George Washington University caldwell @ gwu . edu | +1 202.994.4674 (w) | +1 202.409.0878 (c) PGP key ID: 0x0A0EC46C "Formal education can rarely improve the character of a scoundrel." - Derek Bok, Harvard University -------------- next part -------------- A non-text attachment was scrubbed... Name: PGP.sig Type: application/pgp-signature Size: 186 bytes Desc: This is a digitally signed message part URL: From dmourati at cm.math.uiuc.edu Tue Oct 4 03:46:38 2005 From: dmourati at cm.math.uiuc.edu (Demetri Mouratis) Date: Mon, 3 Oct 2005 20:46:38 -0500 (CDT) Subject: NSCA Documentation In-Reply-To: <2992D1D9-3BCC-4225-A979-0C7A4697E151@gwu.edu> References: <2992D1D9-3BCC-4225-A979-0C7A4697E151@gwu.edu> Message-ID: On Mon, 3 Oct 2005, Chris Caldwell wrote: > Hopefully a simple question, > > Could someone please point me to some documentation on running NSCA on a > remote host? I have built and installed NSCA with no problem and my remote > host can talk to the master, but I am not sure where to go next. Can the > normal nagios plugins run locally on the remote host with their output piped > to NSCA? There is no documentation for NSCA with the module or on the > NagiosExchange site. Chris, Check this out: v2.0 http://nagios.sourceforge.net/docs/2_0/distributed.html v1.2 http://nagios.sourceforge.net/docs/1_0/distributed.html (Note, I'm not sure whether the v2.0 docs are any different, just playing it safe. I'm still on v1.2) The gist of it is NSCA runs on the Central Nagios server and you use send_nsca to transmit data from Distributed Nagios over to Central. Hope that helps. --------------------------------------------------------------------- Demetri Mouratis dmourati at linfactory.com ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From pla at softflare.com Tue Oct 4 04:33:31 2005 From: pla at softflare.com (Paul L. Allen) Date: Tue, 04 Oct 2005 03:33:31 +0100 Subject: NSCA Documentation In-Reply-To: <2992D1D9-3BCC-4225-A979-0C7A4697E151@gwu.edu> References: <2992D1D9-3BCC-4225-A979-0C7A4697E151@gwu.edu> Message-ID: <20051004023331.9184.qmail@mullet.softflare.net> Chris Caldwell writes: > Could someone please point me to some documentation on running NSCA on a > remote host? Yep, it's in the Nagios docs. Click on the link in the left-hand menu and eventually you will find all you need without even having to resort to Google. > Can the normal nagios plugins run locally on the remote host with their > output piped to NSCA? This is how it is meant to work. You have Nagios (with or without the CGIs, with or without notifications, this is your choice) running on the remote host and submitting passive check details. > There is no documentation for NSCA with the module or on the > NagiosExchange site. There is a LOT of documentation about NSCA on the main NAGIOS site. It may not be easy to find or where you might expect it, but it's there. In my opinion the documentation could be redesigned to be of more use to those (who are, I seem to observe, more common than others) trying to use NSCA but that's my opinion and I managed to find out how to use NSCA. What you want is (mostly) covered in redundant monitoring, failover monitoring and stuff like that. Places you'd never look unles you read the whole docs. Which you really, really ought to do. Dunno about you, but I figure that any time I use FREE software there is likely to be a cost in expenditure of my time figuring out how to use it. If you want the dancing installation wizard that asks you all the questions you might possibly ask (unless you are remotely technical) then stick with Microsucks. If you want to use products that can be made to do what you want to do, rather than what Bill Gates says you have to do, then RTFM. -- Paul Allen Softflare Support ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Ralph.Grothe at itdz-berlin.de Tue Oct 4 10:37:53 2005 From: Ralph.Grothe at itdz-berlin.de (Ralph.Grothe at itdz-berlin.de) Date: Tue, 4 Oct 2005 10:37:53 +0200 Subject: NRPE, over old version of HPUX (10.01) Message-ID: Hi Jose, because I do have to monitor quite a few HP-UX boxes, some of which I'd consider vintage stuff (i.e. 10.20 where HP have discontinued support for), I ran into similar difficulties when preparing grounds for NRPE on these particular machines. However I did manage to find a site where one could download prebuilt binaries of gcc and binutils as well as Perl. With this foundation laid I was able to build and install NRPE. But even there you won't find binaries for release versions earlier than 10.20. I have no idea if the 10.20 builds will be of any use to you, but you could give it a try. http://mirrors.develooper.com/hpux/ > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net > [mailto:nagios-users-admin at lists.sourceforge.net]On Behalf Of Jose > Orlando Muqoz B > Sent: Monday, October 03, 2005 5:10 PM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] NRPE, over old version of HPUX (10.01) > > > > Hi all, > > I'm looking some plugig to monitoring my OLD servers, I read > all about > NRPE and this is the solution I need. > > However I can't get it functioning over my Hpux V10.01, I tried to > install the precompiled version "nrpe-1.8-B.10.20" but i get > this error > > ./nrpe --help > /usr/lib/dld.sl: Invalid version for shared library: /usr/lib/libc.1 > /usr/lib/dld.sl: Exec format error > Abort(coredump) > > I try to compiled the 1.8 source and I get a lot of errors. > > I have problem even installing Perl the only perl i get installed is > v5.002. > > I have 20 of this servers and is imposible to monitor manualy. > > Thanks a lot for any advisor > > Jose. > > > -- > > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, > discussions, > and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS > when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From robmossrm at aol.com Tue Oct 4 12:40:45 2005 From: robmossrm at aol.com (Rob Moss) Date: Tue, 04 Oct 2005 11:40:45 +0100 Subject: NRPE, over old version of HPUX (10.01) In-Reply-To: References: Message-ID: <43425C2D.50006@aol.com> Jose Orlando Mu?oz B wrote: >Hi all, > >I'm looking some plugig to monitoring my OLD servers, I read all about >NRPE and this is the solution I need. > >However I can't get it functioning over my Hpux V10.01, I tried to >install the precompiled version "nrpe-1.8-B.10.20" but i get this error > >./nrpe --help >/usr/lib/dld.sl: Invalid version for shared library: /usr/lib/libc.1 >/usr/lib/dld.sl: Exec format error >Abort(coredump) > >I try to compiled the 1.8 source and I get a lot of errors. > >I have problem even installing Perl the only perl i get installed is >v5.002. > >I have 20 of this servers and is imposible to monitor manualy. > >Thanks a lot for any advisor > >Jose. > > Okay, the problem is to do with the versions, you are trying to run HPUX 10.20 binaries (with newer libraries) on an older HPUX 10.01 system. That is why you are getting warnings about invalid versions and exec format errors.. I recommend that you find a decent HPUX 10.01 server (maybe an L class?) and set up a build server. :Load on GCC/CC and appropriate libraries and compile NRPE for your version of HPUX.. God help you if your fastest box is a G or F class server.. my coldolences.. HPUX 10.01 is around 10 years old, and the G and F class servers aren't much newer.. While I respect the fact that you need to maintain them, the operating system is no longer maintained even by HP, and the rest of the world has moved on to HPUX 10.20 and 11.0. http://www.hp.com/softwarereleases/releases-media2/discon/5971-0265.htm There are newer versions of NRPE, version 2.0 and 2.2.. Try them Cheers rob. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From enediel at hotmail.com Tue Oct 4 14:49:19 2005 From: enediel at hotmail.com (enediel gonzalez) Date: Tue, 04 Oct 2005 12:49:19 +0000 Subject: installing nagios from debian's package ondebian 3.1(sarge) s In-Reply-To: <1128376221.17093.36.camel@localhost> References: <1128376221.17093.36.camel@localhost> Message-ID: Thanks Jonh for your answer, with all of this information I'll try to finish the migration. Linux user 398956 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ravikmrs at yahoo.com Tue Oct 4 15:17:22 2005 From: ravikmrs at yahoo.com (Ravi Kumar) Date: Tue, 4 Oct 2005 06:17:22 -0700 (PDT) Subject: http check Message-ID: <20051004131723.7136.qmail@web53903.mail.yahoo.com> I have been seeing following error during htttp_8080 monitoring. HTTP CRITICAL: HTTP/1.1 500 Internal Server Error plesae help! thanks --------------------------------- Yahoo! for Good Click here to donate to the Hurricane Katrina relief effort. -------------- next part -------------- An HTML attachment was scrubbed... URL: From robmossrm at aol.com Tue Oct 4 15:22:11 2005 From: robmossrm at aol.com (Rob Moss) Date: Tue, 04 Oct 2005 14:22:11 +0100 Subject: http check In-Reply-To: <20051004131723.7136.qmail@web53903.mail.yahoo.com> References: <20051004131723.7136.qmail@web53903.mail.yahoo.com> Message-ID: <43428203.3000106@aol.com> Ravi Kumar wrote: > I have been seeing following error during htttp_8080 monitoring. > HTTP CRITICAL: HTTP/1.1 500 Internal Server Error > plesae help! > thanks Nagios is doing a good job of alerting you that your webserver is broken. Go fix your webserver, it's broken. Read the error log file. rob. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From tholmes at mcaschool.net Tue Oct 4 16:07:11 2005 From: tholmes at mcaschool.net (Timothy A. Holmes) Date: Tue, 4 Oct 2005 10:07:11 -0400 Subject: New Nagios Install Message-ID: <17CD9CE4C0FA574A8B29EF02D49B385D2FFB@srvexch-01.mcaschool.local> Good Morning Everyone: I have a new computer (P4 2.0Ghz) that I am getting ready to install Nagios on. In checking out the Nagios page, I am wondering which version is the best for use. I see that version 1.x is tagged as stable, but I also notice that version 2 beta 4 has been up for several months. Im not willing to fight a buggy beta, but if beta 4 is stable and nearing release, I am willing to give it a go. If not, I will just go with version 1.2 and wait till the final release. I will be monitoring 15 servers and 8 switches as it stands now Also, any problems or roadblocks that I need to be aware of with the RPM installs? I prefer to avoid compiling from source if I can, as I always seem to have BAD luck doing anything from source Thanks for any and all advice Timothy A. Holmes IT Manager / Network Admin / Web Master / Computer Teacher Medina Christian Academy A Higher Standard... Jeremiah 33:3 Jeremiah 29:11 Esther 4:14 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rodrigofortes at veloxmail.com.br Tue Oct 4 16:14:27 2005 From: rodrigofortes at veloxmail.com.br (Rodrigo Fortes) Date: Tue, 4 Oct 2005 11:14:27 -0300 Subject: NRPE to new Beta Version References: <43425C2D.50006@aol.com> Message-ID: <000801c5c8ed$f05788c0$1b3fa492@rodrigocisi> Hi, How can I use NRPE at new Beta version? Thanks Rodrigo Fortes ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From brianmas at highstream.net Tue Oct 4 16:15:39 2005 From: brianmas at highstream.net (brianmas at highstream.net) Date: Tue, 4 Oct 2005 10:15:39 -0400 Subject: parents/dependencies confusion Message-ID: <1128435339.43428e8bb20a0@webmail.highstream.net> I'll give a quick explanation of my set up: I have 1 nagios machine in the office here, I monitor all services on different subnets, sometimes the DSL connection here at the office goes down at which point I am flooded with notifications. I have looked into dependencies and parent relations but I can't quite figure out how to suppress all warnings when the network connection dies. I've checked the FAQ pages but I can't figure out what to do based on them. Examples using config files would be most helpful. thank you. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From enediel at hotmail.com Tue Oct 4 16:43:24 2005 From: enediel at hotmail.com (enediel gonzalez) Date: Tue, 04 Oct 2005 14:43:24 +0000 Subject: different nagios versions for migration Message-ID: Hello: Installing nagios from packages on debian sarge, I found out that the stable version of debian includes nagios 1.3, but the box that is up and running besides me has nagios 2.0b3 I need to migrate one box into another Some directives into the configuration files look like they were included on the version 2.0, so I have incompatible cofiguration files for the version 1.3 ?upgrading the executable and some libraries could be enough? Thanks in advance for the answer Linux user 398956 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From robmossrm at aol.com Tue Oct 4 16:46:45 2005 From: robmossrm at aol.com (Rob Moss) Date: Tue, 04 Oct 2005 15:46:45 +0100 Subject: NRPE to new Beta Version In-Reply-To: <000801c5c8ed$f05788c0$1b3fa492@rodrigocisi> References: <43425C2D.50006@aol.com> <000801c5c8ed$f05788c0$1b3fa492@rodrigocisi> Message-ID: <434295D5.5090406@aol.com> Rodrigo Fortes wrote: > Hi, > How can I use NRPE at new Beta version? > > Thanks > Rodrigo Fortes You can try Andreas's website http://oss.op5.se/nagios/ The latest verison of NRPE is here: http://oss.op5.se/nagios/nrpe-2.2.tar.gz rob. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From eford at nexusenergy.com Tue Oct 4 16:49:13 2005 From: eford at nexusenergy.com (Edward Ford) Date: Tue, 4 Oct 2005 10:49:13 -0400 Subject: New Nagios Install Message-ID: <200510041049569.SM03020@ford> I am not up to stuff with my Linux Redhat 9 and did try to use the RPM's figuring it would be easier. The RPM's turned out to not work for me at all. I actually had better luck compiling, even though I had a pretty good learning curve to overcome this. Eddie -----Original Message----- From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Timothy A. Holmes Sent: Tuesday, October 04, 2005 10:07 AM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] New Nagios Install Good Morning Everyone: I have a new computer (P4 2.0Ghz) that I am getting ready to install Nagios on. In checking out the Nagios page, I am wondering which version is the best for use. I see that version 1.x is tagged as stable, but I also notice that version 2 beta 4 has been up for several months. Im not willing to fight a buggy beta, but if beta 4 is stable and nearing release, I am willing to give it a go. If not, I will just go with version 1.2 and wait till the final release. I will be monitoring 15 servers and 8 switches as it stands now Also, any problems or roadblocks that I need to be aware of with the RPM installs? I prefer to avoid compiling from source if I can, as I always seem to have BAD luck doing anything from source Thanks for any and all advice Timothy A. Holmes IT Manager / Network Admin / Web Master / Computer Teacher Medina Christian Academy A Higher Standard... Jeremiah 33:3 Jeremiah 29:11 Esther 4:14 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at gridshield.net Tue Oct 4 17:01:13 2005 From: marc at gridshield.net (Marc DeTrano) Date: Tue, 04 Oct 2005 09:01:13 -0600 Subject: New Nagios Install In-Reply-To: <17CD9CE4C0FA574A8B29EF02D49B385D2FFB@srvexch-01.mcaschool.local> References: <17CD9CE4C0FA574A8B29EF02D49B385D2FFB@srvexch-01.mcaschool.local> Message-ID: <43429939.9070306@gridshield.net> I have been running Nagios 2 beta on quite a few systems now and I am very happy with it. Mostly run on Mandiva LE 2005 systems, from RPM, and I am guessing the RedHat / Fedora / CentOS worlds work just as well. I find that the additional features of Nagios 2 (much easier and more powerful configuration) are well worth the "risk" of using the beta. Marc DeTrano Gridshield, S.A: Timothy A. Holmes wrote: >Good Morning Everyone: > >I have a new computer (P4 2.0Ghz) that I am getting ready to install >Nagios on. In checking out the Nagios page, I am wondering which >version is the best for use. I see that version 1.x is tagged as >stable, but I also notice that version 2 beta 4 has been up for several >months. Im not willing to fight a buggy beta, but if beta 4 is stable >and nearing release, I am willing to give it a go. If not, I will just >go with version 1.2 and wait till the final release. I will be >monitoring 15 servers and 8 switches as it stands now > >Also, any problems or roadblocks that I need to be aware of with the RPM >installs? I prefer to avoid compiling from source if I can, as I always >seem to have BAD luck doing anything from source > > >Thanks for any and all advice > >Timothy A. Holmes >IT Manager / Network Admin / Web Master / Computer Teacher > >Medina Christian Academy >A Higher Standard... > >Jeremiah 33:3 >Jeremiah 29:11 >Esther 4:14 > > > >------------------------------------------------------- >This SF.Net email is sponsored by: >Power Architecture Resource Center: Free content, downloads, discussions, >and more. http://solutions.newsforge.com/ibmarch.tmpl >_______________________________________________ >Nagios-users mailing list >Nagios-users at lists.sourceforge.net >https://lists.sourceforge.net/lists/listinfo/nagios-users >::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. >::: Messages without supporting info will risk being sent to /dev/null > > ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From james.mohr at elaxy.com Tue Oct 4 17:42:38 2005 From: james.mohr at elaxy.com (Mohr James) Date: Tue, 4 Oct 2005 17:42:38 +0200 Subject: Nagios not recognizing service via send_nsca Message-ID: Hi All! I want to create a service on the Nagios server (2.0b3) that accepts message via ncsa and sends an SMS. In this way we have tested the chain through nagios to sending a message. I have two services in service.cfg defined like this: define service ( use generic-service ; Name of service template to use host_name nts-mgm-18 service_description SendXMS is_volatile 1 active_checks_enabled 0 check_command check-host-alive } define service ( use generic-service ; Name of service template to use host_name nts-mgm-17 service_description SendXMS is_volatile 1 active_checks_enabled 0 check_command check-host-alive } The generic service is defined like this: # Generic service definition template define service{ name generic-service ; The 'name' of this service template, referenced in other service definitions parallelize_check 1 ; Active service checks should be parallelized (disabling this can lead to major performance problems) obsess_over_service 1 ; We should obsess over this service (if necessary) event_handler_enabled 1 ; Service event handler is enabled flap_detection_enabled 1 ; Flap detection is enabled process_perf_data 1 ; Process performance data retain_status_information 1 ; Retain status information across program restarts retain_nonstatus_information 1 ; Retain non-status information across program restarts register 0 ; DONT REGISTER THIS DEFINITION - ITS NOT A REAL SERVICE, JUST A TEMPLATE! is_volatile 0 check_period 24x7 active_checks_enabled 1 passive_checks_enabled 1 max_check_attempts 3 normal_check_interval 5 retry_check_interval 1 contact_groups datacenter notification_interval 120 notification_period 24x7 notification_options c notifications_enabled 0 } After I added the services, I restarted nagios and nsca. When I am on nts-mgm-18, I can run send_nsca, it sends the message and I see the text in the Nagios GUI. I also see this in the /var/log/messages on the Nagios server: Oct 4 17:26:03 nts-mgm-17 nsca[20268]: Connection from 10.2.28.61 port 38666 Oct 4 17:26:03 nts-mgm-17 nsca[20268]: Host address checks out ok Oct 4 17:26:03 nts-mgm-17 nsca[20268]: Handling the connection... Oct 4 17:26:03 nts-mgm-17 nsca[20268]: SERVICE CHECK -> Host Name: 'nts-mgm-18', Service Description: 'SendXMS', Return Code: '0', Output: 'TEST TEST TEST' Oct 4 17:26:03 nts-mgm-17 nsca[20268]: End of connection... Oct 4 17:26:03 nts-mgm-17 nagios: EXTERNAL COMMAND: PROCESS_SERVICE_CHECK_RESULT;nts-mgm-18;SendXMS;0;TEST TEST TEST However, when I run the command from nts-mgm-17 (the Nagios server), in the /var/log/messages all I see is this: Oct 4 17:27:23 nts-mgm-17 nsca[20268]: Connection from 10.2.28.48 port 35050 Oct 4 17:27:23 nts-mgm-17 nsca[20268]: Host address checks out ok Oct 4 17:27:23 nts-mgm-17 nsca[20268]: Handling the connection... Oct 4 17:27:23 nts-mgm-17 nsca[20268]: End of connection... Apparently it does not recognize SendXMS as a valid service. I have ensured that the local machine ist listed in nsca.cfg file, so I am fairly sure it is not simply ignoring this machine. I have looked though the nsca.c to find a way to be more verbose, but I didn't find anything useful. I am at a lost to see where the difference is. Does it have something to do with the fact that this is the local machine? I would appreciate any info at all. Regards, Jim Mohr ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From scotjenkins at gmail.com Tue Oct 4 17:57:09 2005 From: scotjenkins at gmail.com (Scot Jenkins) Date: Tue, 4 Oct 2005 10:57:09 -0500 Subject: parents/dependencies confusion In-Reply-To: <1128435339.43428e8bb20a0@webmail.highstream.net> References: <1128435339.43428e8bb20a0@webmail.highstream.net> Message-ID: On 10/4/05, brianmas at highstream.net wrote: > I'll give a quick explanation of my set up: > > I have 1 nagios machine in the office here, I monitor all services on different > subnets, sometimes the DSL connection here at the office goes down at which > point I am flooded with notifications. > I have looked into dependencies and parent relations but I can't quite figure > out how to suppress all warnings when the network connection dies. I've checked > the FAQ pages but I can't figure out what to do based on them. Examples using > config files would be most helpful. > > thank you. This is a good place to start: http://nagios.sourceforge.net/docs/2_0/networkreachability.html Basically you want to diagram out the paths to each host/service and set parents on each host/service that is not directly on the same network as the nagios monitoring host. The documentation explains it pretty well. Scot ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From James.P.Fuentes at boeing.com Tue Oct 4 18:31:56 2005 From: James.P.Fuentes at boeing.com (EXT-Fuentes, James P) Date: Tue, 4 Oct 2005 09:31:56 -0700 Subject: New Nagios Install Message-ID: <626FC7C6A97381468FB872072AB5DDC85CE0C4@XCH-SW-42.sw.nos.boeing.com> I had been using Nagios 2.0b3 with no problems and just upgraded to b4 a couple of weeks ago. So far things have been working just fine. Compiling is not a big deal. Just make sure you've got all prerequisites on your system. You'll see those requirements in the READMEs. -Jim ________________________________ From: Timothy A. Holmes [mailto:tholmes at mcaschool.net] Sent: Tue 10/4/2005 7:07 AM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] New Nagios Install Good Morning Everyone: I have a new computer (P4 2.0Ghz) that I am getting ready to install Nagios on. In checking out the Nagios page, I am wondering which version is the best for use. I see that version 1.x is tagged as stable, but I also notice that version 2 beta 4 has been up for several months. Im not willing to fight a buggy beta, but if beta 4 is stable and nearing release, I am willing to give it a go. If not, I will just go with version 1.2 and wait till the final release. I will be monitoring 15 servers and 8 switches as it stands now Also, any problems or roadblocks that I need to be aware of with the RPM installs? I prefer to avoid compiling from source if I can, as I always seem to have BAD luck doing anything from source Thanks for any and all advice Timothy A. Holmes IT Manager / Network Admin / Web Master / Computer Teacher Medina Christian Academy A Higher Standard... Jeremiah 33:3 Jeremiah 29:11 Esther 4:14 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sergei.e.haramundanis at bankofamerica.com Tue Oct 4 20:04:42 2005 From: sergei.e.haramundanis at bankofamerica.com (Haramundanis, Sergei E) Date: Tue, 04 Oct 2005 14:04:42 -0400 Subject: nagios 1.2 UI on Solaris 9 unable to detect process is running Message-ID: <18D9E0AC97C5D348966B274C84EB5ABC0106B48F@ex2k.bankofamerica.com> When using the Nagios UI (w/Apache 2.0.54 running as nobody) to view Service Detail it always give the "Monitoring process may not be running" warning. This also causes process commands to not be listed/available, and the process status is always UNKNOWN. The service details for all hosts appear ok so I know it is not an issue generally executing the .cgis and the nagios process is definitely running: nagios 6725 1 0 15:24:27 ? 0:11 /opt/nagios/bin/nagios -d /opt/nagios/etc/nagios.cfg I've set userids nagios and nobody to be in the same group (nagiocmd) and set permissions on all the files and directories (and subdirectories) in /opt/nagios to "rwxrwxr-x 1 nagios nagiocmd". Executing ./bin/nagios -v ./etc/nagios.cfg returns 0 warnings 0 errors. Executing the nagios check command from the command line (as userid nagios) and via a perl script from my web server's cgi-bin directories (so it runs as nobody) also works fine and returns: Nagios ok: located 1 process, status log updated 14 seconds ago Here is the nagios check command I am using: from the command line/perl script: /opt/nagios/plugins/libexec/check_nagios /opt/nagios/var/status.log 5 '/opt/nagios/bin/nagios' 2>&1 from cgi.cfg: nagios_check_command=/opt/nagios/plugins/libexec/check_nagios /opt/nagios/var/status.log 5 '/opt/nagios/bin/nagios' I have not been able to find any logs which indicate any errors. I have also tried executing extinfo.cgi from the command line but I don't know what the QUERY_STRING is supposed to be for the POST parameters. Can anyone give me any pointers to figure out why nagios process detection and process commands do not work? This is happening on two similar installations. If you need more configuration information, please let me know. Any help is greatly appreciated. Thanks. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From enediel at hotmail.com Tue Oct 4 21:13:42 2005 From: enediel at hotmail.com (enediel gonzalez) Date: Tue, 04 Oct 2005 19:13:42 +0000 Subject: installing nagios from debian's packageondebian 3.1(sarge) s Message-ID: Thanks John. The testing version of debian sarge includes nagios 1.3 as debian stable, so I don't have other choice except install nagios 2.x from tar.gz, That's what I'm doing right now. Linux user 398956 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From enediel at hotmail.com Tue Oct 4 21:22:15 2005 From: enediel at hotmail.com (enediel gonzalez) Date: Tue, 04 Oct 2005 19:22:15 +0000 Subject: http://nagios.sourceforge.net/docs/2_0/installing.html error? Message-ID: inside of this page I can read /usr/sbin/groupadd nagcmd Next, add the users that your web server and Nagios run as to the newly created group with the following commands (I'll assume apache and nagios are the respective users): /usr/sbin/usermod -G nagcmd apache /usr/sbin/usermod -G nagcmd nagios but executing make install-commandmode I received the following message /usr/bin/install: invalid group `cmdgroup' make: *** [install-commandmode] Error 1 ?It looks like the documentation and the installation don't consider the same group name? Thanks in advance for the answer Enediel Linux user 398956 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From scotjenkins at gmail.com Wed Oct 5 00:59:12 2005 From: scotjenkins at gmail.com (Scot Jenkins) Date: Tue, 4 Oct 2005 17:59:12 -0500 Subject: parents/dependencies confusion In-Reply-To: <1128452154.4342d03a19f9d@webmail.highstream.net> References: <1128435339.43428e8bb20a0@webmail.highstream.net> <1128452154.4342d03a19f9d@webmail.highstream.net> Message-ID: On 10/4/05, brianmas at highstream.net wrote: > > ok what I have done is set up a hosts.cfg entries for the default gateway of the > 3 networks and set the 'parents' directive for all hosts in each respective > networks. I guess I wasn't thinking about monitoring machines that are not > under our control (all our servers are colo'd). > I find looking at the "status map" cgi also helps to see if you have your "parents" set up correctly. I have multiple NIC's on my firewall and set each NIC up as it's own host to more accurately reflect the network and then use the (nic) host as a parent for things beyond it. So I have something like this: Assume nagios runs on the firewall box itself. nagios process ---- fw_inside_nic --- inside_host_a | | | +----fw_dmz_nic --- dmz_host_a | +---- fw_outside_nic --- dsl_router ----- isp_upstream_link ---- outside_host_a Then in my config: inside_host_a's parents == fw_inside_nic dmz_host_a's parents == fw_dmz_nic outside_host_a's parents == isp_upstream_link isp_upstream_link's parents == dsl_router dsl_router's parents == fw_outside_nic This seems to work well. If I were to unplug my dsl router, the dsl_router host would show down and the isp_upstream_link and outside_host_a will show unreachable. HTH, Scot ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From pubaddr2 at davyandbeth.com Wed Oct 5 04:26:46 2005 From: pubaddr2 at davyandbeth.com (Davy Durham) Date: Tue, 04 Oct 2005 21:26:46 -0500 Subject: Check for thrashing Message-ID: <434339E6.5030800@davyandbeth.com> Hi.. Does anyone know of a plugin that can check if a machine is "thrashing" by which I mean it continues to swap and swap.. because either there is too little ram, or too many processes running on the box. I would think a vmstat command could tell you that, though it would need a few seconds to determine. And I'm not sure what numbers would indicate what a bad amount would be. Thanks, Davy ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From guy at extragalactic.net Wed Oct 5 05:13:56 2005 From: guy at extragalactic.net (Guy B. Purcell) Date: Tue, 4 Oct 2005 20:13:56 -0700 Subject: Check for thrashing In-Reply-To: <434339E6.5030800@davyandbeth.com> References: <434339E6.5030800@davyandbeth.com> Message-ID: On Oct 4, 2005, at 19:26, Davy Durham wrote: > Does anyone know of a plugin that can check if a machine is > "thrashing" by which I mean it continues to swap and swap.. because > either there is too little ram, or too many processes running on > the box. I think such a thing would be unnecessary, really: if a box is thrashing (in the canonical sense of that word--something happens that causes it to have to work harder, which slows everything down so jobs pile up, causing it to have to work even harder, and so on in a catch-22 cycle), then the various services on it you're otherwise monitoring will time out because there aren't enough resources of one kind or another to handle the monitor requests. > I would think a vmstat command could tell you that, though it > would need a few seconds to determine. And I'm not sure what > numbers would indicate what a bad amount would be. The actual implementation of such a thing would need to be carefully crafted to work across multiple OSes. IMHO, it's not worth the effort: if a box truly is thrashing, you should know all about it from existing monitors. -Guy ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From tim at infostream.com.au Wed Oct 5 05:22:37 2005 From: tim at infostream.com.au (Timothy O'Hare) Date: Wed, 05 Oct 2005 13:22:37 +1000 Subject: Check for thrashing In-Reply-To: References: <434339E6.5030800@davyandbeth.com> Message-ID: <434346FD.8010901@infostream.com.au> You could monitor the CPU load and swap space usage? Guy B. Purcell wrote: > > On Oct 4, 2005, at 19:26, Davy Durham wrote: > >> Does anyone know of a plugin that can check if a machine is >> "thrashing" by which I mean it continues to swap and swap.. because >> either there is too little ram, or too many processes running on the >> box. > > > I think such a thing would be unnecessary, really: if a box is > thrashing (in the canonical sense of that word--something happens that > causes it to have to work harder, which slows everything down so jobs > pile up, causing it to have to work even harder, and so on in a > catch-22 cycle), then the various services on it you're otherwise > monitoring will time out because there aren't enough resources of one > kind or another to handle the monitor requests. > >> I would think a vmstat command could tell you that, though it would >> need a few seconds to determine. And I'm not sure what numbers would >> indicate what a bad amount would be. > > > The actual implementation of such a thing would need to be carefully > crafted to work across multiple OSes. IMHO, it's not worth the > effort: if a box truly is thrashing, you should know all about it from > existing monitors. > > -Guy > > > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, discussions, > and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. ::: Messages without supporting info will risk > being sent to /dev/null ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From drw at adnc.com Wed Oct 5 06:40:09 2005 From: drw at adnc.com (Dan Wilson) Date: Tue, 04 Oct 2005 21:40:09 -0700 Subject: High CPU utilization at random times Message-ID: <43435929.304@adnc.com> I've been looking into a problem for quite some time now and have come up stumped. Every time I think I know what the problem is I turn out to be wrong. Sorry, this is LONG but has lots of detail, hopefully all the detail you guys need to make a diagnosis or point me in the right direction :-) PROBLEM: Randomly, and for no good reason, the CPU usage on this machine will go up to anywhere from .7 to 1.5!?!?!?!?!? HARDWARE: PIII 677 384MB ram Software RAID 1 with IDE(all partitions except swap, yes, I boot from it too... I already took crap for booting from software raid, but it works fine, really) extra drive for swap and nightly "snapshots" of /usr/local/ and /etc and a few other things. SOFTWARE: Mandrake linux 10.1(last updates 45 days ago) Nagios 1.2 (no perl interpreter, with perl cache) Plugins 1.3.1 Optional/custom plugins... check_icmp instead of check_ping custom check_ink script/plugin - this plugin is written in perl and uses the netsnmp module for perl. This isn't the problem either, stopped all service checks that used it for a few hours, the problem was still there.... FYI: This script checks supply levels in network printers, I could have used the check_snmp plugin for this but that was too messy(i tried!). This way the out put is cleaner(ex. Levels OK - C-34% Y-75% M-12% K-90%) and there is only one check per printer instead of one for each supply :-) [my programming skills suck, really, they do. You have to specify the type of printer which has to be put in the script so if can correctly read the supplies... I should have written it to "explore" the printer to see what kind of supplies it had and what could be checked so it would in theory work with any printer... but it works the way it is, and I couldn't figure out how to get everything to work... I'm learning and will some day get it to work the way I want????] check_smart - checks HDD SMART values... not the trouble either, it was added recently after a HDD went bad and the box crashed 2 nights in a row(the extra drive was bad and failed during the "snapshot") The follwing were the latest stable versions as of about Feb-2005 Apache MRTG NetSNMP PERL PHP MySQL THINGS I HAVE DONE/LOOKED AT TO TRY AND FIX THIS ISSUE: Recompiled the kernel... no change, went back to the standard kernel. Restarted like a MS machine... uptime makes no difference, pleanty of memory availble(150+MB) all the time Nagios - stopped the service, no issue, start the service and let it run a while, the problem appears... I recompiled(twice), adjusted a few options, no luck with the issue though nagios ran a tiny faster, maybe 1-2%, not worth the wait to recompile IMHO MRTG - checking interface on 2 routers, it is using RRD and the MRTG-RRD.CGI fast cgi script so the load from this every 5 minutes isn't even worth mentioning. Tried removing access from users to stop MRTG-RRD.CGI from generating graphs on demand. I even tried stopping MRTG and lost 4 hours of data but still had the problem. Apache - stopped the service, problem still continues. PERL - recompiled and removed a few options that the documentation said could cause trouble, no change. Even ran Nagios without any perl scripts/plugins, problem still there. PHP - nothing is using this at the moment... was only installed for testing a Nagios config utility with a web interface... MySQL - not being used, makes no difference if it is running or not. I only run X while downloading updates, otherwise it stays off and I just SSH in. MORE INFO: At first I only noticed it when I would SSH in and look at the load because it took 15+seconds to log in. I though it was SSH to I started having Nagios check the CPU load, I can look from time to time and catch it up nice and high. It is NOT logs being rotated, excessive swaping, bad hardware(second machine it's happened on), too many people accessing the box, too many services/hosts down.(I'm checking about 90 hosts and 180+ services, after I delete the retention data and start Nagios fresh everything is checked and fine in 2 minutes or less.). It's not to the point where the box is unusable, it clears up in a minute or two(always, every time, and that makes it hard to track down). It is NOT(at least not that I can tell) Nagios making excessive retries on problems, it happens when there are no problem and I have the max retries set to 3 for all but a few things. Timeouts are 10 seconds or less on all but one check. I'm not using obssesive checks, processing perf data or anything like that. When I first installed nagios 2 years ago I tinkered with getting it to respond faster, I set the time period to 15 seconds(default is 60?) so I could get a few things running every 15 or 30 seconds... works great and with little increased overhead.... I just have to remember that 1 minute is now 4 and not 1... ;-) Nagios responds like a champ now, forced checks don't take a minute or longer... 20 seconds at the longest. I HATE WAITING! LOL Any ideas? Or should I just live with it until I upgrade to 2.0? I'll be moving to faster hardware then anyway, dual PIII 700 with 2GB ram and hardware RAID1... It's not much but it is better :-) ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From pubaddr2 at davyandbeth.com Wed Oct 5 07:16:14 2005 From: pubaddr2 at davyandbeth.com (Davy Durham) Date: Wed, 05 Oct 2005 00:16:14 -0500 Subject: Check for thrashing In-Reply-To: <434346FD.8010901@infostream.com.au> References: <434339E6.5030800@davyandbeth.com> <434346FD.8010901@infostream.com.au> Message-ID: <4343619E.9020608@davyandbeth.com> Timothy O'Hare wrote: > You could monitor the CPU load and swap space usage? > Yeah, but my question was what values from vmstat would mean that it's thrashing.. a plugin would seem simple to write to check that > Guy B. Purcell wrote: > >> >> On Oct 4, 2005, at 19:26, Davy Durham wrote: >> >>> Does anyone know of a plugin that can check if a machine is >>> "thrashing" by which I mean it continues to swap and swap.. because >>> either there is too little ram, or too many processes running on >>> the box. >> >> >> >> I think such a thing would be unnecessary, really: if a box is >> thrashing (in the canonical sense of that word--something happens >> that causes it to have to work harder, which slows everything down >> so jobs pile up, causing it to have to work even harder, and so on >> in a catch-22 cycle), then the various services on it you're >> otherwise monitoring will time out because there aren't enough >> resources of one kind or another to handle the monitor requests. >> >>> I would think a vmstat command could tell you that, though it >>> would need a few seconds to determine. And I'm not sure what >>> numbers would indicate what a bad amount would be. >> >> >> >> The actual implementation of such a thing would need to be carefully >> crafted to work across multiple OSes. IMHO, it's not worth the >> effort: if a box truly is thrashing, you should know all about it >> from existing monitors. >> >> -Guy >> >> >> >> >> ------------------------------------------------------- >> This SF.Net email is sponsored by: >> Power Architecture Resource Center: Free content, downloads, >> discussions, >> and more. http://solutions.newsforge.com/ibmarch.tmpl >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when >> reporting any issue. ::: Messages without supporting info will risk >> being sent to /dev/null > > > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, discussions, > and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. ::: Messages without supporting info will risk > being sent to /dev/null ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From td3201 at gmail.com Wed Oct 5 07:49:26 2005 From: td3201 at gmail.com (Terry) Date: Wed, 5 Oct 2005 00:49:26 -0500 Subject: Check for thrashing In-Reply-To: <4343619E.9020608@davyandbeth.com> References: <434339E6.5030800@davyandbeth.com> <434346FD.8010901@infostream.com.au> <4343619E.9020608@davyandbeth.com> Message-ID: <8ee061010510042249u4ed415cm989f6de66be104d3@mail.gmail.com> Sar tools might be more useful for you. On 10/5/05, Davy Durham wrote: > Timothy O'Hare wrote: > > > You could monitor the CPU load and swap space usage? > > > Yeah, but my question was what values from vmstat would mean that it's > thrashing.. a plugin would seem simple to write to check that > > > Guy B. Purcell wrote: > > > >> > >> On Oct 4, 2005, at 19:26, Davy Durham wrote: > >> > >>> Does anyone know of a plugin that can check if a machine is > >>> "thrashing" by which I mean it continues to swap and swap.. because > >>> either there is too little ram, or too many processes running on > >>> the box. > >> > >> > >> > >> I think such a thing would be unnecessary, really: if a box is > >> thrashing (in the canonical sense of that word--something happens > >> that causes it to have to work harder, which slows everything down > >> so jobs pile up, causing it to have to work even harder, and so on > >> in a catch-22 cycle), then the various services on it you're > >> otherwise monitoring will time out because there aren't enough > >> resources of one kind or another to handle the monitor requests. > >> > >>> I would think a vmstat command could tell you that, though it > >>> would need a few seconds to determine. And I'm not sure what > >>> numbers would indicate what a bad amount would be. > >> > >> > >> > >> The actual implementation of such a thing would need to be carefully > >> crafted to work across multiple OSes. IMHO, it's not worth the > >> effort: if a box truly is thrashing, you should know all about it > >> from existing monitors. > >> > >> -Guy > >> > >> > >> > >> > >> ------------------------------------------------------- > >> This SF.Net email is sponsored by: > >> Power Architecture Resource Center: Free content, downloads, > >> discussions, > >> and more. http://solutions.newsforge.com/ibmarch.tmpl > >> _______________________________________________ > >> Nagios-users mailing list > >> Nagios-users at lists.sourceforge.net > >> https://lists.sourceforge.net/lists/listinfo/nagios-users > >> ::: Please include Nagios version, plugin version (-v) and OS when > >> reporting any issue. ::: Messages without supporting info will risk > >> being sent to /dev/null > > > > > > > > > > ------------------------------------------------------- > > This SF.Net email is sponsored by: > > Power Architecture Resource Center: Free content, downloads, discussions, > > and more. http://solutions.newsforge.com/ibmarch.tmpl > > _______________________________________________ > > Nagios-users mailing list > > Nagios-users at lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when > > reporting any issue. ::: Messages without supporting info will risk > > being sent to /dev/null > > > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, discussions, > and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From james.mohr at elaxy.com Wed Oct 5 08:48:50 2005 From: james.mohr at elaxy.com (Mohr James) Date: Wed, 5 Oct 2005 08:48:50 +0200 Subject: AW: Nagios not recognizing service via send_nsca Message-ID: > -----Urspr?ngliche Nachricht----- >> Von: Lori Adams [mailto:ladams at cloudmark.com] >> >> I think it has to do with you trying to use nsca on the >> nagios server. >> There's no reason to do that. Just do a local call instead >> with active checks instead of passive checks. >> >> -lori Hi Lori! Thanks for your response. Well, in my mind, there *is* a reason for doing it. Maybe you or someone else can correct me, but as far as I can tell, Nagios cannot do an active check at a specific time. If we wanted a check, for example, at 08.00 and 18.00, the best we could do is set the check to run every 12 hours and start Nagios close 18.00 (or 08.00). We want a regular scheduled message sent through Nagios. This tells us that Nagios has received and processed the message and was able to sent it corretly. So, we were planning to set up a cron job that runs twice a day. This sends a message to a volatile service on the Nagios server, so that it always sends a notification (i.e. SMS). We want to test the nsca server. In princple we could test it from another machine and thus we would also test the actual network connection. However, part of the question at this point is why it is not working as I expect. Is there really a problem with nsca on the nagios server? Regards, Jim Mohr >> >> > -----Original Message----- >> > From: nagios-users-admin at lists.sourceforge.net >> [mailto:nagios-users- >> > admin at lists.sourceforge.net] On Behalf Of Mohr James >> > Sent: Tuesday, October 04, 2005 8:43 AM >> > To: Nagios-users at lists.sourceforge.net >> > Subject: [Nagios-users] Nagios not recognizing service via >> send_nsca >> > >> > Hi All! >> > >> > I want to create a service on the Nagios server (2.0b3) >> that accepts >> > message via ncsa and sends an SMS. In this way we have tested the >> chain >> > through nagios to sending a message. I have two services in >> service.cfg >> > defined like this: >> > >> > define service ( >> > use generic-service >> ; Name >> > of service template to use >> > host_name nts-mgm-18 >> > service_description SendXMS >> > is_volatile 1 >> > active_checks_enabled 0 >> > check_command check-host-alive >> > } >> > >> > define service ( >> > use generic-service >> ; Name >> > of service template to use >> > host_name nts-mgm-17 >> > service_description SendXMS >> > is_volatile 1 >> > active_checks_enabled 0 >> > check_command check-host-alive >> > } >> > The generic service is defined like this: >> > >> > # Generic service definition template >> > define service{ >> > name generic-service ; >> The 'name' >> of >> > this service template, referenced in other service definitions >> > parallelize_check 1 ; Active service >> checks >> > should be parallelized (disabling this can lead to major >> performance >> > problems) >> > obsess_over_service 1 ; We should obsess >> over >> > this service (if necessary) >> > event_handler_enabled 1 ; Service event >> handler >> > is enabled >> > flap_detection_enabled 1 ; Flap detection is >> > enabled >> > process_perf_data 1 ; Process >> performance >> > data >> > retain_status_information 1 ; Retain status >> > information across program restarts >> > retain_nonstatus_information 1 ; Retain non-status >> > information across program restarts >> > >> > register 0 ; DONT >> REGISTER THIS >> > DEFINITION - ITS NOT A REAL SERVICE, JUST A TEMPLATE! >> > is_volatile 0 >> > check_period 24x7 >> > active_checks_enabled 1 >> > passive_checks_enabled 1 >> > max_check_attempts 3 >> > normal_check_interval 5 >> > retry_check_interval 1 >> > contact_groups datacenter >> > notification_interval 120 >> > notification_period 24x7 >> > notification_options c >> > notifications_enabled 0 >> > } >> > >> > After I added the services, I restarted nagios and nsca. >> When I am on >> > nts-mgm-18, I can run send_nsca, it sends the message and I see the >> text >> > in the Nagios GUI. I also see this in the /var/log/messages on the >> > Nagios server: >> > >> > Oct 4 17:26:03 nts-mgm-17 nsca[20268]: Connection from 10.2.28.61 >> port >> > 38666 >> > Oct 4 17:26:03 nts-mgm-17 nsca[20268]: Host address >> checks out ok Oct >> > 4 17:26:03 nts-mgm-17 nsca[20268]: Handling the connection... >> > Oct 4 17:26:03 nts-mgm-17 nsca[20268]: SERVICE CHECK -> Host Name: >> > 'nts-mgm-18', Service Description: 'SendXMS', Return Code: '0', >> Output: >> > 'TEST TEST TEST' >> > Oct 4 17:26:03 nts-mgm-17 nsca[20268]: End of connection... >> > Oct 4 17:26:03 nts-mgm-17 nagios: EXTERNAL COMMAND: >> > PROCESS_SERVICE_CHECK_RESULT;nts-mgm-18;SendXMS;0;TEST TEST TEST >> > >> > However, when I run the command from nts-mgm-17 (the >> Nagios server), >> in >> > the /var/log/messages all I see is this: >> > >> > Oct 4 17:27:23 nts-mgm-17 nsca[20268]: Connection from 10.2.28.48 >> port >> > 35050 >> > Oct 4 17:27:23 nts-mgm-17 nsca[20268]: Host address >> checks out ok Oct >> > 4 17:27:23 nts-mgm-17 nsca[20268]: Handling the connection... >> > Oct 4 17:27:23 nts-mgm-17 nsca[20268]: End of connection... >> > >> > Apparently it does not recognize SendXMS as a valid >> service. I have >> > ensured that the local machine ist listed in nsca.cfg >> file, so I am >> > fairly sure it is not simply ignoring this machine. I have looked >> though >> > the nsca.c to find a way to be more verbose, but I didn't find >> anything >> > useful. >> > >> > I am at a lost to see where the difference is. Does it >> have something >> to >> > do with the fact that this is the local machine? I would appreciate >> any >> > info at all. >> > >> > Regards, >> > >> > Jim Mohr >> > >> > >> > ------------------------------------------------------- >> > This SF.Net email is sponsored by: >> > Power Architecture Resource Center: Free content, downloads, >> discussions, >> > and more. http://solutions.newsforge.com/ibmarch.tmpl >> > _______________________________________________ >> > Nagios-users mailing list >> > Nagios-users at lists.sourceforge.net >> > https://lists.sourceforge.net/lists/listinfo/nagios-users >> > ::: Please include Nagios version, plugin version (-v) and OS when >> > reporting any issue. >> > ::: Messages without supporting info will risk being sent >> to /dev/null >> ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Philipp.Sand at sycor.de Wed Oct 5 10:41:21 2005 From: Philipp.Sand at sycor.de (Sand Philipp) Date: Wed, 5 Oct 2005 10:41:21 +0200 Subject: check_freshness not working... Message-ID: <6943DE9E9934CD42BBE4A20E0E28E10E01EA0130@SYGOMAIL.GOE.AD.SYCOR.DE> Hi, I'm trying to check the freshness of my passive checks in Nagios. If my distributed Nagios has any Problems delivering the checks to my central Nagios, I want to know :) I'm using Nagios 2.0b3 I configured the following: In Nagios.cfg: check_service_freshness=1 service_freshness_check_interval=60 Passive Check: check_command check_warning active_checks_enabled 0 passive_checks_enabled 1 check_freshness 1 freshness_threshold 500 check_period 24x7 retry_check_interval 2 max_check_attempts 3 the checkcommand: define command{ command_name check_warning command_line $USER1$/check_dummy 3 "$HOSTNAME$ check not up to date" } Any suggestions? Thanks, Philipp _____________________________ Philipp Sand OC-CC-TEC-SYS SYCOR GmbH Heinrich-von-Stephan-Stra?e 1-5 D - 37073 G?ttingen Telefon +49 (0) 551 - 490 - 0 Telefax +49 (0) 551 - 490 - 232468 philipp.sand at sycor.de www.sycor.de ------------------------------------------------ ? ************************************************ "Ferner Osten - neue Maerkte - vertraute Technik" unter diesem Motto praesentiert sich sycor auf der FAKUMA Messe vom 18.-22. Oktober in Friedrichshafen sycor - IT-Kompetenz fuer die Kunststoffbranche! Besuchen Sie uns! Zwischen Halle A5 und A6 - Stand 6511 Weitere Infos unter www.sycor.de ************************************************ Diese E-Mail ist vertraulich und kann darueber hinaus persoenliche Informationen beinhalten. Wenn Sie nicht der bestimmungsgemaesse Empfaenger sind, loeschen Sie bitte die E-Mail und deren Anhaenge sofort und benachrichtigen Sie uns darueber. Die Firma sycor willigt in keine Vertraege oder vertragliche Verpflichtungen ein oder uebermittelt rechtsverbindliche Angebote, die in Form von E-Mail versandt werden, sofern dies nicht ausdruecklich in schriftlicher Form zwischen den Parteien vereinbart wurde. This e-mail is confidential and may contain personal and/or privileged information. If you are not the intended recipient please delete this e-mail and all attachments immediately and inform us. The company sycor does not agree with contracts or contract obligations sent by e-mail, neither do we transmit legally binding offers by e-mail, unless this is not expressly agreed upon between the parties and documented in written form. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From chris at aidworld.org Wed Oct 5 11:21:46 2005 From: chris at aidworld.org (Chris Wilson) Date: Wed, 05 Oct 2005 10:21:46 +0100 Subject: High CPU utilization at random times In-Reply-To: <43435929.304@adnc.com> References: <43435929.304@adnc.com> Message-ID: <1128504106.22545.1.camel@localhost> Hi Dan, On Wed, 2005-10-05 at 05:40, Dan Wilson wrote: > Nagios - stopped the service, no issue, start the service and let it run > a while, the problem appears... I recompiled(twice), adjusted a few > options, no luck with the issue though nagios ran a tiny faster, maybe > 1-2%, not worth the wait to recompile IMHO Just to check I understood correctly, stopping Nagios is the ONLY thing that you've found so far that makes the problem go away? Try removing services from your Nagios configuration in batches, to narrow down which one(s) are causing the problem. Cheers, Chris. -- (aidworld) chris wilson | chief engineer (chris at aidworld.org) ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lee.wilson at vmc.com Wed Oct 5 13:52:55 2005 From: lee.wilson at vmc.com (Lee Wilson) Date: Wed, 5 Oct 2005 12:52:55 +0100 Subject: Host Notification Problem Message-ID: Hi, I've got a bit of a problem with Nagios 1.2 on Debian, in that HOST DOWN notifications are not being sent, the recovery notifications come though okay. My Host configuration is as follows:- define host { use uk-default-host host_name ukhost01 alias UKHOST01 address 192.168.1.23 parents uk-upper-isa-hub,uk-upper-hp-2650 notification_period workhours notification_options d,u,r notifications_enabled 1 stalking_options o,d,u max_check_attempts 5 check_command check-host-alive retain_status_information 1 retain_nonstatus_information 1 flap_detection_enabled 0 event_handler_enabled 0 } The contacts are setup to receive all notifications as from the following template(work hours is the builtin 0900-1700):- define contact { register 0 name tpl-std-contact alias TEMPLATE Standard Contact service_notification_period workhours host_notification_period workhours service_notification_options w,u,c,r host_notification_options d,u,r service_notification_commands notify-by-email host_notification_commands host-notify-by-email } All hosts also have a ping service setup, which doesn't send any notifications out either if the host is down. If the host is up but the ping response is very slow then the appopriate warning/critical notification does go out. I'm sure that I'm just missing something simple, but cannot figure it out. Thanks in advance for any help. Regards Lee -------------- next part -------------- An HTML attachment was scrubbed... URL: From scotjenkins at gmail.com Wed Oct 5 17:39:06 2005 From: scotjenkins at gmail.com (Scot Jenkins) Date: Wed, 5 Oct 2005 10:39:06 -0500 Subject: High CPU utilization at random times In-Reply-To: <43435929.304@adnc.com> References: <43435929.304@adnc.com> Message-ID: On 10/4/05, Dan Wilson wrote: > I've been looking into a problem for quite some time now and have come > up stumped. Every time I think I know what the problem is I turn out to > be wrong. > > Sorry, this is LONG but has lots of detail, hopefully all the detail you > guys need to make a diagnosis or point me in the right direction :-) > > PROBLEM: > Randomly, and for no good reason, the CPU usage on this machine will go > up to anywhere from .7 to 1.5!?!?!?!?!? A load average of 1.5 is not really all that high, especially if it's not sustained for any lengthy period of time. I've seen a heavily loaded Linux mail server running with a load average of about 30, and a FreeBSD system with a load average of over 100 (Apache went nuts spawning CGI scripts). Also keep in mind the load average is report for the last 1, 5, and 15 minutes; man uptime(1) for details. In which field are you seeing the 1.5 load average? Are you tracking disk IO some where? sar and iostat (part of the sysstat package) are good tools for this task. You might want to track and compare disk and CPU to see if they're related. Since you're running software RAID it could be that disk IO is causing the CPU spike. Check the Nagios "trends" CGI output and compare that with the output from other tools: vmstat, top (real-time), sar, iostat (real-time and historical) to get a feel for what is normal for your system. Scot ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From brianmas at highstream.net Wed Oct 5 18:20:19 2005 From: brianmas at highstream.net (brianmas at highstream.net) Date: Wed, 5 Oct 2005 12:20:19 -0400 Subject: parents/dependencies confusion In-Reply-To: References: <1128435339.43428e8bb20a0@webmail.highstream.net> <1128452154.4342d03a19f9d@webmail.highstream.net> Message-ID: <1128529219.4343fd43e2fea@webmail.highstream.net> Quoting Scot Jenkins : > On 10/4/05, brianmas at highstream.net wrote: > > > > ok what I have done is set up a hosts.cfg entries for the default gateway > of the > > 3 networks and set the 'parents' directive for all hosts in each respective > > networks. I guess I wasn't thinking about monitoring machines that are not > > under our control (all our servers are colo'd). > > > > I find looking at the "status map" cgi also helps to see if you have > your "parents" set up correctly. > > I have multiple NIC's on my firewall and set each NIC up as it's own > host to more accurately reflect the network and then use the (nic) > host as a parent for things beyond it. So I have something like this: > > Assume nagios runs on the firewall box itself. > > nagios process ---- fw_inside_nic --- inside_host_a > | | > | +----fw_dmz_nic --- dmz_host_a > | > +---- fw_outside_nic --- dsl_router ----- isp_upstream_link ---- > outside_host_a > > Then in my config: > > inside_host_a's parents == fw_inside_nic > dmz_host_a's parents == fw_dmz_nic > > outside_host_a's parents == isp_upstream_link > isp_upstream_link's parents == dsl_router > dsl_router's parents == fw_outside_nic > > This seems to work well. If I were to unplug my dsl router, the > dsl_router host would show down and the isp_upstream_link and > outside_host_a will show unreachable. yes, I understand, if we had a single point of access on the network such as a router or firewall I would have known what to do, but all these machines have 1 interface with a firewall on each one. So I set up the default gateways for each network as a host instead. The Status Map does show what I'd expect: nagios ---> gateway/parent machine ---> fans out to a bunch of servers. One thing though the "Status Information" for 2 of the 3 gateways are in "Pending" state. They are pingable ... little confused on that. It's been 20 hours. hm adding a service to them (ping, kinda redundant) and they now show as OK. Known issue? Still using 1.2 btw. > HTH, > > Scot ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Jeremy.Pavleck at capella.edu Wed Oct 5 19:17:56 2005 From: Jeremy.Pavleck at capella.edu (Pavleck, Jeremy D.) Date: Wed, 5 Oct 2005 12:17:56 -0500 Subject: Customize TAC.cgi Message-ID: <0F1A5F3993566D4883C596CB56E59B9A054FCDF4@mspmail02.int.capella.lan> Hi Guys, Did a search and didn't find any information regarding this, so I'll shoot it out here - is there a way to customize tac.cgi? More specifically I'm looking to at least change the color of the background to better suit my company's color scheme. Everything else is setup that way inside of Nagios, so hitting tac.cgi and it's black background is somewhat of an eye sore. Sorry if this has been asked before, but I wasn't able to find exactly what I was looking for! Jeremy D. Pavleck Network Engineer System Management Capella University 225 South 6th Street, 9th Floor Minneapolis, MN 55402 www.capella.edu ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From andrew at profitability.net Wed Oct 5 19:23:33 2005 From: andrew at profitability.net (Andrew Cruse) Date: Wed, 5 Oct 2005 13:23:33 -0400 Subject: parents/dependencies confusion In-Reply-To: <1128529219.4343fd43e2fea@webmail.highstream.net> References: <1128529219.4343fd43e2fea@webmail.highstream.net> Message-ID: nagios-users-admin at lists.sourceforge.net wrote: > yes, I understand, if we had a single point of access on the > network such as a router or firewall I would have known what > to do, but all these machines have 1 interface with a > firewall on each one. So I set up the default gateways for > each network as a host instead. > > The Status Map does show what I'd expect: nagios ---> > gateway/parent machine > ---> fans out to a bunch of servers. One thing though the "Status > Information" for 2 of the 3 gateways are in "Pending" state. They are > pingable ... little confused on that. It's been 20 hours. hm > adding a service to them (ping, kinda > redundant) and they now show as OK. Known issue? Still using 1.2 btw. Nagios *will* allow you to configure more than one parent for a host. Seems to work fine like that for me. Andrew ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From holger at CIS.FU-Berlin.DE Wed Oct 5 19:46:54 2005 From: holger at CIS.FU-Berlin.DE (Holger Weiss) Date: Wed, 5 Oct 2005 19:46:54 +0200 Subject: Customize TAC.cgi In-Reply-To: <0F1A5F3993566D4883C596CB56E59B9A054FCDF4@mspmail02.int.capella.lan> References: <0F1A5F3993566D4883C596CB56E59B9A054FCDF4@mspmail02.int.capella.lan> Message-ID: <20051005174654.GF9666329@CIS.FU-Berlin.DE> * "Pavleck, Jeremy D." [2005-10-05 12:17]: > Did a search and didn't find any information regarding this, so I'll > shoot it out here - is there a way to customize tac.cgi? More > specifically I'm looking to at least change the color of the background > to better suit my company's color scheme. Everything else is setup that > way inside of Nagios, so hitting tac.cgi and it's black background is > somewhat of an eye sore. You could edit share/stylesheets/tac.css, the background color is set within the ".tac" definition. Holger -- PGP fingerprint: F1F0 9071 8084 A426 DD59 9839 59D3 F3A1 B8B5 D3DE ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rejaine at bhz.jamef.com.br Wed Oct 5 20:15:11 2005 From: rejaine at bhz.jamef.com.br (Rejaine Monteiro) Date: Wed, 05 Oct 2005 15:15:11 -0300 Subject: macros and event_handlers Message-ID: <1128536110.24860.2.camel@acesup2.bhz.jamef> Hi, Is possible to use macros in event_handlers commands on services definitions? Like this: define service{ use generic-service host_name host1 service_description Test max_check_attempts 4 normal_check_interval 1 retry_check_interval 1 event_handler service-restarts!httpd check_command check_http } Thanks! ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From guy at extragalactic.net Wed Oct 5 20:32:03 2005 From: guy at extragalactic.net (Guy B. Purcell) Date: Wed, 5 Oct 2005 11:32:03 -0700 Subject: parents/dependencies confusion In-Reply-To: <1128529219.4343fd43e2fea@webmail.highstream.net> References: <1128435339.43428e8bb20a0@webmail.highstream.net> <1128452154.4342d03a19f9d@webmail.highstream.net> <1128529219.4343fd43e2fea@webmail.highstream.net> Message-ID: <792B56A3-6567-423F-B4F7-632683C9292B@extragalactic.net> On Oct 5, 2005, at 09:20, brianmas at highstream.net wrote: > The Status Map does show what I'd expect: nagios ---> gateway/ > parent machine > ---> fans out to a bunch of servers. One thing though the "Status > Information" > for 2 of the 3 gateways are in "Pending" state. They are > pingable ... little > confused on that. It's been 20 hours. hm adding a service to them > (ping, kinda > redundant) and they now show as OK. Known issue? Still using 1.2 btw. Well, yes, it's a known issue in the sense that it's the way Nagios works: it doesn't assume initial states. When you add a new service, it's in the Pending state until the service check returns a different state. If, as in your case, you add a new host with _no_ associated services, then Nagios will keep the host in the Pending state. If you add a service, Nagios will then track the host's state. I've often wondered whether I consider it a bug or not that Nagios doesn't just force one initial active host check for new hosts. I'm still on the fence, because I need to run some tests. It all comes down to what happens if such a service-devoid host (let's say it's a router) goes down and is the parent of other hosts: in that case, active checks of the child hosts should ensue (and fail), followed by an active check of the router (which also fails), and an active check of the router's parent (which we'll assume succeeds)--all of which should yield one DOWN alert for the router, and possibly some UNREACHABLE alerts for its children (I forget--been a while since I've read the docs). If Nagios does that, and marks the host as back UP when it comes back up & the hosts & services beyond it are "visible" once again, then I'd consider the lack of an initial host check a quirk of Nagios, rather than a bug; otherwise, I'd consider it a bug. -Guy ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Jeremy.Pavleck at capella.edu Wed Oct 5 20:59:47 2005 From: Jeremy.Pavleck at capella.edu (Pavleck, Jeremy D.) Date: Wed, 5 Oct 2005 13:59:47 -0500 Subject: Customize TAC.cgi Message-ID: <0F1A5F3993566D4883C596CB56E59B9A054FCE18@mspmail02.int.capella.lan> Ah! Thank you, didn't even notice that! Jeremy D. Pavleck Network Engineer System Management Capella University 225 South 6th Street, 9th Floor Minneapolis, MN 55402 www.capella.edu -----Original Message----- From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Holger Weiss Sent: Wednesday, October 05, 2005 12:47 PM To: Nagios Users Subject: Re: [Nagios-users] Customize TAC.cgi * "Pavleck, Jeremy D." [2005-10-05 12:17]: > Did a search and didn't find any information regarding this, so I'll > shoot it out here - is there a way to customize tac.cgi? More > specifically I'm looking to at least change the color of the > background to better suit my company's color scheme. Everything else > is setup that way inside of Nagios, so hitting tac.cgi and it's black > background is somewhat of an eye sore. You could edit share/stylesheets/tac.css, the background color is set within the ".tac" definition. Holger -- PGP fingerprint: F1F0 9071 8084 A426 DD59 9839 59D3 F3A1 B8B5 D3DE ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From smadim2 at grads.ece.mcmaster.ca Wed Oct 5 21:07:27 2005 From: smadim2 at grads.ece.mcmaster.ca (M.N.A.Smadi) Date: Wed, 05 Oct 2005 15:07:27 -0400 Subject: problem with service notification Message-ID: <4344246F.7000806@grads.ece.mcmaster.ca> Hi; I get notifications if the command associated with host exits with a critical code but if the commands associated with the services exit with a critical code i dont get notifications. I am not exactly sure what i am missing. thanks moe smadi I am using the following config object: ############################################################################### # # TIME PERIODS # ############################################################################### # This defines a timeperiod where all times are valid for checks, # notifications, etc. The classic "24x7" support nightmare. :-) define timeperiod{ timeperiod_name 24x7 alias 24 Hours A Day, 7 Days A Week sunday 00:00-24:00 monday 00:00-24:00 tuesday 00:00-24:00 wednesday 00:00-24:00 thursday 00:00-24:00 friday 00:00-24:00 saturday 00:00-24:00 } ############################################################################### # # CONTACTS # ############################################################################### define contact{ contact_name nagios alias Nagios Admin service_notification_period 24x7 host_notification_period 24x7 service_notification_options w,u,c,r host_notification_options d,r service_notification_commands notify-by-email host_notification_commands host-notify-by-email email whatever at whatever.com } ############################################################################### # # CONTACT GROUPS # ############################################################################### define contactgroup{ contactgroup_name admins alias Nagios Administrators members nagios } ############################################################################### # # COMMANDS # ############################################################################### define command{ command_name check_ping command_line /usr/local/nagios/libexec/check_ping -H $HOSTADDRESS$ -w $ARG1$ -c $ARG2$ } define command{ command_name check_sipast command_line /usr/local/nagios/libexec/check_sipast -r $HOSTADDRESS$ -u $ARG1$ -p $ARG2$ } define command{ command_name notify-by-email command_line /usr/bin/printf "%b" "***** Nagios *****\n\nNotification Type: $NOTIFICATIONTYPE$\n\nService: $SER VICEDESC$\nHost: $HOSTALIAS$\nAddress: $HOSTADDRESS$\nState: $SERVICESTATE$\n\nDate/Time: $LONGDATETIME$\n\nAdditional Info :\n\n$OUTPUT$" | /bin/mail -s "** $NOTIFICATIONTYPE$ alert - $HOSTALIAS$/$SERVICEDESC$ is $SERVICESTATE$ **" $CONTACTEMAIL$ } define command{ command_name host-notify-by-email command_line /usr/bin/printf "%b" "***** Nagios *****\n\nNotification Type: $NOTIFICATIONTYPE$\nHost: $HOSTNAME $\nState: $HOSTSTATE$\nAddress: $HOSTADDRESS$\nInfo: $OUTPUT$\n\nDate/Time: $LONGDATETIME$\n" | /bin/mail -s "Host $HOSTSTA TE$ alert for $HOSTNAME$!" $CONTACTEMAIL$ } ############################################################################## # SERVICES # ############################################################################### # # service to "ping" the local machine define service{ host_name xyz.com service_description PING is_volatile 0 check_period 24x7 max_check_attempts 4 normal_check_interval 5 retry_check_interval 1 contact_groups admins notification_interval 960 notification_period 24x7 check_command check_ping!100.0,20%!500.0,60% } define service{ host_name xyz.com service_description SIP_ONLINE is_volatile 0 check_period 24x7 max_check_attempts 4 normal_check_interval 5 retry_check_interval 1 contact_groups admins notification_interval 960 notification_period 24x7 check_command check_sipast!nagios!hi } ############################################################################## # # HOST GROUPS # ############################################################################### define hostgroup{ hostgroup_name my_testing_group alias my_testing_group members zia-sv-008 } ############################################################################### # # HOSTS # ############################################################################### # Since this is a simple configuration file, we only monitor one host - the # local host (this machine). define host{ host_name xyz.com alias zxy address X.X.X.X check_command check_sipast!nagios!hi max_check_attempts 1 notification_interval 120 notification_period 24x7 notification_options d,r contact_groups admins } # EOF ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Kyle_Narum at eLoyalty.com Wed Oct 5 21:17:36 2005 From: Kyle_Narum at eLoyalty.com (Narum, Kyle) Date: Wed, 5 Oct 2005 14:17:36 -0500 Subject: problem with service notification Message-ID: <037594FB5AFC354FA38653EB3FF3242105F0E1A2@elhqex01.eloyaltyco.com> You have notification_options defined for your hosts, but not your services. If you add at least one notification_option to your service(s), you should receive the notifications. -----Original Message----- From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of M.N.A.Smadi Sent: Wednesday, October 05, 2005 2:07 PM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] problem with service notification Hi; I get notifications if the command associated with host exits with a critical code but if the commands associated with the services exit with a critical code i dont get notifications. I am not exactly sure what i am missing. thanks moe smadi I am using the following config object: ######################################################################## ####### # # TIME PERIODS # ######################################################################## ####### # This defines a timeperiod where all times are valid for checks, # notifications, etc. The classic "24x7" support nightmare. :-) define timeperiod{ timeperiod_name 24x7 alias 24 Hours A Day, 7 Days A Week sunday 00:00-24:00 monday 00:00-24:00 tuesday 00:00-24:00 wednesday 00:00-24:00 thursday 00:00-24:00 friday 00:00-24:00 saturday 00:00-24:00 } ######################################################################## ####### # # CONTACTS # ######################################################################## ####### define contact{ contact_name nagios alias Nagios Admin service_notification_period 24x7 host_notification_period 24x7 service_notification_options w,u,c,r host_notification_options d,r service_notification_commands notify-by-email host_notification_commands host-notify-by-email email whatever at whatever.com } ######################################################################## ####### # # CONTACT GROUPS # ######################################################################## ####### define contactgroup{ contactgroup_name admins alias Nagios Administrators members nagios } ######################################################################## ####### # # COMMANDS # ######################################################################## ####### define command{ command_name check_ping command_line /usr/local/nagios/libexec/check_ping -H $HOSTADDRESS$ -w $ARG1$ -c $ARG2$ } define command{ command_name check_sipast command_line /usr/local/nagios/libexec/check_sipast -r $HOSTADDRESS$ -u $ARG1$ -p $ARG2$ } define command{ command_name notify-by-email command_line /usr/bin/printf "%b" "***** Nagios *****\n\nNotification Type: $NOTIFICATIONTYPE$\n\nService: $SER VICEDESC$\nHost: $HOSTALIAS$\nAddress: $HOSTADDRESS$\nState: $SERVICESTATE$\n\nDate/Time: $LONGDATETIME$\n\nAdditional Info :\n\n$OUTPUT$" | /bin/mail -s "** $NOTIFICATIONTYPE$ alert - $HOSTALIAS$/$SERVICEDESC$ is $SERVICESTATE$ **" $CONTACTEMAIL$ } define command{ command_name host-notify-by-email command_line /usr/bin/printf "%b" "***** Nagios *****\n\nNotification Type: $NOTIFICATIONTYPE$\nHost: $HOSTNAME $\nState: $HOSTSTATE$\nAddress: $HOSTADDRESS$\nInfo: $OUTPUT$\n\nDate/Time: $LONGDATETIME$\n" | /bin/mail -s "Host $HOSTSTA TE$ alert for $HOSTNAME$!" $CONTACTEMAIL$ } ######################################################################## ###### # SERVICES # ######################################################################## ####### # # service to "ping" the local machine define service{ host_name xyz.com service_description PING is_volatile 0 check_period 24x7 max_check_attempts 4 normal_check_interval 5 retry_check_interval 1 contact_groups admins notification_interval 960 notification_period 24x7 check_command check_ping!100.0,20%!500.0,60% } define service{ host_name xyz.com service_description SIP_ONLINE is_volatile 0 check_period 24x7 max_check_attempts 4 normal_check_interval 5 retry_check_interval 1 contact_groups admins notification_interval 960 notification_period 24x7 check_command check_sipast!nagios!hi } ######################################################################## ###### # # HOST GROUPS # ######################################################################## ####### define hostgroup{ hostgroup_name my_testing_group alias my_testing_group members zia-sv-008 } ######################################################################## ####### # # HOSTS # ######################################################################## ####### # Since this is a simple configuration file, we only monitor one host - the # local host (this machine). define host{ host_name xyz.com alias zxy address X.X.X.X check_command check_sipast!nagios!hi max_check_attempts 1 notification_interval 120 notification_period 24x7 notification_options d,r contact_groups admins } # EOF ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Gaston_Perez at itginc.com Wed Oct 5 21:50:01 2005 From: Gaston_Perez at itginc.com (Gaston_Perez at itginc.com) Date: Wed, 5 Oct 2005 15:50:01 -0400 Subject: WMI Query Message-ID: Hi All I am pretty new to Linux / Nagios and I am running Nagios 1.2, on a Redhat 9 server. I have Nagios plugins 1.42 installed on my server and on my windows 2003 servers I have nc_net 2.28. I am trying to run wmi queries to check the physical hard drive to report back to me if one fails. Here is what I have in my checkcommands # 'WMI Harddrive check' command definition define command{ command_name check_nt_wmihdd command_line $USER1$check_nt -H $HOSTADDRESS$ -v WMICHECK -l "cimv2/win32_logicaldrive" } I have also tried this command_line /usr/local/nagios/libexec/check_nt -H $HOSTADDRESS$ -v WMICHECK -l "cimv2^select name,freespace,deviceid,status from win32_DiskDrive" } And this command_line $USER$/check_nt -H $HOSTADDRESS$ -v WMICHECK -l "cimv2^select DeviceID,Status from win32_DiskDrive" } Here is my services command define service{ use check-nt-service register 1 host_name server1 service_description WMI HDD check_command check_nt_wmihdd max_check_attempts 3 normal_check_interval 5 retry_check_interval 3 check_period 24x7 notification_interval 120 notification_period workhours notification_options w,c,r contact_groups prodsupp } Response I have seen from the server is (Return code of 127 is out of bounds - plugin may be missing) Nagios Log displays this [1128541333] Warning: Return code of 127 for check of service 'WMI HDD' on host 'rad-db01' was out of bounds. Make sure the plugin you're trying to run actually exists. When I ran the first check command statement above this is the output to the Nagios log rad-db01;WMI HDD;CRITICAL;HARD;3;(Return code of 127 is out of bounds - plugin may be missing) Now I checked my nc_net plugin and it is the version (2.28) that can read wmi checks. I also verified that the Nagios plugins I am using (1.42)can understand wmi checks. Any help at all would be greatly appreciated, Thanks in Advance -+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+- This message is for the named person's use only. This communication is for informational purposes only and has been obtained from sources believed to be reliable, but it is not necessarily complete and its accuracy cannot be guaranteed. It is not intended as an offer or solicitation for the purchase or sale of any financial instrument or as an official confirmation of any transaction. Moreover, this material should not be construed to contain any recommendation regarding, or opinion concerning, any security. It may contain confidential, proprietary or legally privileged information. No confidentiality or privilege is waived or lost by any mistransmission. If you receive this message in error, please immediately delete it and all copies of it from your system, destroy any hard copies of it and notify the sender. You must not, directly or indirectly, use, disclose, distribute, print, or copy any part of this message if you are not the intended recipient. Any views expressed in this message are those of the individual sender, except where the message states otherwise and the sender is authorized to state them to be the views of any such entity. ITG Inc. and/or its affiliates reserves the right to monitor and archive all electronic communications through its network. ITG Inc. Member NASD, SIPC -+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+- -------------- next part -------------- An HTML attachment was scrubbed... URL: From smadim2 at grads.ece.mcmaster.ca Wed Oct 5 21:50:49 2005 From: smadim2 at grads.ece.mcmaster.ca (M.N.A.Smadi) Date: Wed, 05 Oct 2005 15:50:49 -0400 Subject: problem with service notification In-Reply-To: <037594FB5AFC354FA38653EB3FF3242105F0E1A2@elhqex01.eloyaltyco.com> References: <037594FB5AFC354FA38653EB3FF3242105F0E1A2@elhqex01.eloyaltyco.com> Message-ID: <43442E99.7020201@grads.ece.mcmaster.ca> This thing is driving me nuts. for the SIP_ONLINE service i want to have a very aggressive checking, so every minute do check and if 1 fails send notification. I think i got things right since on the web interface i see the service as critical but I am not getting notifications even if restart nagios or do whatever. This is the relevent config file: ############################################################################## # SERVICES # ############################################################################### # # service to "ping" the local machine define service{ host_name zya.com service_description PING is_volatile 0 check_period 24x7 max_check_attempts 4 normal_check_interval 5 retry_check_interval 1 contact_groups admins notification_interval 960 notification_period 24x7 notification_options w,u,c,r check_command check_ping!100.0,20%!500.0,60% } define service{ host_name zya.com service_description SIP_ONLINE is_volatile 0 check_period 24x7 max_check_attempts 1 normal_check_interval 5 retry_check_interval 1 contact_groups admins notification_interval 1 notification_period 24x7 notification_options w,u,c,r check_command check_sipast!nagios!GrobPadeg } ############################################################################## # # HOST GROUPS # ############################################################################### define hostgroup{ hostgroup_name my_testing_group alias my_testing_group members zya.com } ############################################################################### # # HOSTS # ############################################################################### # Since this is a simple configuration file, we only monitor one host - the # local host (this machine). define host{ host_name zya.com alias zya.com address X.Y.Z.Z check_command check_sipast!nagios!GrobPadeg max_check_attempts 1 notification_interval 120 notification_period 24x7 notification_options d,r contact_groups admins } # EOF Narum, Kyle wrote: >You have notification_options defined for your hosts, but not your >services. If you add at least one notification_option to your >service(s), you should receive the notifications. > >-----Original Message----- >From: nagios-users-admin at lists.sourceforge.net >[mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of >M.N.A.Smadi >Sent: Wednesday, October 05, 2005 2:07 PM >To: nagios-users at lists.sourceforge.net >Subject: [Nagios-users] problem with service notification > >Hi; >I get notifications if the command associated with host exits with a >critical code but if the commands associated with the services exit with >a critical code i dont get notifications. I am not exactly sure what i >am missing. > >thanks >moe smadi > >I am using the following config object: >######################################################################## >####### ># ># TIME PERIODS ># >######################################################################## >####### ># This defines a timeperiod where all times are valid for checks, # >notifications, etc. The classic "24x7" support nightmare. :-) > >define timeperiod{ > timeperiod_name 24x7 > alias 24 Hours A Day, 7 Days A Week > sunday 00:00-24:00 > monday 00:00-24:00 > tuesday 00:00-24:00 > wednesday 00:00-24:00 > thursday 00:00-24:00 > friday 00:00-24:00 > saturday 00:00-24:00 > } > >######################################################################## >####### ># ># CONTACTS ># >######################################################################## >####### >define contact{ > contact_name nagios > alias Nagios Admin > service_notification_period 24x7 > host_notification_period 24x7 > service_notification_options w,u,c,r > host_notification_options d,r > service_notification_commands notify-by-email > host_notification_commands host-notify-by-email > email whatever at whatever.com > } > > >######################################################################## >####### ># ># CONTACT GROUPS ># >######################################################################## >####### >define contactgroup{ > contactgroup_name admins > alias Nagios Administrators > members nagios > } > > >######################################################################## >####### ># ># COMMANDS ># >######################################################################## >####### >define command{ > command_name check_ping > command_line /usr/local/nagios/libexec/check_ping -H >$HOSTADDRESS$ -w $ARG1$ -c $ARG2$ >} > >define command{ > command_name check_sipast > command_line /usr/local/nagios/libexec/check_sipast -r >$HOSTADDRESS$ -u $ARG1$ -p $ARG2$ >} > > >define command{ > command_name notify-by-email > command_line /usr/bin/printf "%b" "***** Nagios >*****\n\nNotification Type: $NOTIFICATIONTYPE$\n\nService: $SER >VICEDESC$\nHost: $HOSTALIAS$\nAddress: $HOSTADDRESS$\nState: >$SERVICESTATE$\n\nDate/Time: $LONGDATETIME$\n\nAdditional Info >:\n\n$OUTPUT$" | /bin/mail -s "** $NOTIFICATIONTYPE$ alert - >$HOSTALIAS$/$SERVICEDESC$ is $SERVICESTATE$ **" $CONTACTEMAIL$ > } > >define command{ > command_name host-notify-by-email > command_line /usr/bin/printf "%b" "***** Nagios >*****\n\nNotification Type: $NOTIFICATIONTYPE$\nHost: $HOSTNAME >$\nState: $HOSTSTATE$\nAddress: $HOSTADDRESS$\nInfo: >$OUTPUT$\n\nDate/Time: $LONGDATETIME$\n" | /bin/mail -s "Host $HOSTSTA >TE$ alert for $HOSTNAME$!" $CONTACTEMAIL$ > } > > >######################################################################## >###### ># SERVICES ># >######################################################################## >####### ># ># service to "ping" the local machine > >define service{ > host_name xyz.com > service_description PING > is_volatile 0 > check_period 24x7 > max_check_attempts 4 > normal_check_interval 5 > retry_check_interval 1 > contact_groups admins > notification_interval 960 > notification_period 24x7 > check_command check_ping!100.0,20%!500.0,60% > } > >define service{ > host_name xyz.com > service_description SIP_ONLINE > is_volatile 0 > check_period 24x7 > max_check_attempts 4 > normal_check_interval 5 > retry_check_interval 1 > contact_groups admins > notification_interval 960 > notification_period 24x7 > check_command check_sipast!nagios!hi > } > > > >######################################################################## >###### ># ># HOST GROUPS ># >######################################################################## >####### > > >define hostgroup{ > hostgroup_name my_testing_group > alias my_testing_group > members zia-sv-008 > } > >######################################################################## >####### ># ># HOSTS ># >######################################################################## >####### > ># Since this is a simple configuration file, we only monitor one host - >the # local host (this machine). > >define host{ > host_name xyz.com > alias zxy > address X.X.X.X > check_command check_sipast!nagios!hi > max_check_attempts 1 > notification_interval 120 > notification_period 24x7 > notification_options d,r > contact_groups admins > } ># EOF > > > >------------------------------------------------------- >This SF.Net email is sponsored by: >Power Architecture Resource Center: Free content, downloads, >discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl >_______________________________________________ >Nagios-users mailing list >Nagios-users at lists.sourceforge.net >https://lists.sourceforge.net/lists/listinfo/nagios-users >::: Please include Nagios version, plugin version (-v) and OS when >reporting any issue. >::: Messages without supporting info will risk being sent to /dev/null > > >------------------------------------------------------- >This SF.Net email is sponsored by: >Power Architecture Resource Center: Free content, downloads, discussions, >and more. http://solutions.newsforge.com/ibmarch.tmpl >_______________________________________________ >Nagios-users mailing list >Nagios-users at lists.sourceforge.net >https://lists.sourceforge.net/lists/listinfo/nagios-users >::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. >::: Messages without supporting info will risk being sent to /dev/null > > > ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From stevelml1 at scee.sony.co.uk Wed Oct 5 22:42:42 2005 From: stevelml1 at scee.sony.co.uk (Steve Loughran) Date: Wed, 05 Oct 2005 21:42:42 +0100 Subject: Notification on end of downtime? Message-ID: <43443AC2.4070808@scee.sony.co.uk> Hi all Long time Nagios user, almost zero problems, but every now and then I come across a problem I cant quite work out. I have a box that is offline, and although I know for sure it will come back online soon, I have no idea when (days/weeks... unsure). Is there some way of marking a down host as being in a scheduled downtime, with no defined time period, and only started notifying me again once the host been seen as up again? Many thanks for this, it is greatly appreciated. Steve ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From fruiz at netsuite.com Wed Oct 5 22:47:46 2005 From: fruiz at netsuite.com (Ruiz, Francisco) Date: Wed, 5 Oct 2005 13:47:46 -0700 Subject: Service is not scheduled to be checked... Message-ID: <2BF1D777D914174DB2F6874540E3B6C70503EC64@corpmail1.svale.netledger.com> Hi Gurus, I just added a new service check. I see the output of my check in the Nagios log. However, the web interface gives the message "Service is not scheduled to be checked..." Anyone got any ideas? If the service wasn't scheduled to be checked, my assumption is that I should not see data for the check in the Nagios log. Thanks in advance... -------------- next part -------------- An HTML attachment was scrubbed... URL: From guy at extragalactic.net Thu Oct 6 00:17:05 2005 From: guy at extragalactic.net (Guy B. Purcell) Date: Wed, 5 Oct 2005 15:17:05 -0700 Subject: High CPU utilization at random times In-Reply-To: <43435929.304@adnc.com> References: <43435929.304@adnc.com> Message-ID: <0445A3A5-14CA-4B4D-809D-5FDA8F4FF3DE@extragalactic.net> On Oct 4, 2005, at 21:40, Dan Wilson wrote: > I've been looking into a problem for quite some time now and have > come up stumped. Every time I think I know what the problem is I > turn out to be wrong. > > PROBLEM: > Randomly, and for no good reason, the CPU usage on this machine > will go up to anywhere from .7 to 1.5!?!?!?!?!? I'm assuming these are load average numbers, not CPU utilization percentages or something else. (This problem seems a tad off-topic for this list, since it really doesn't seem to be related to Nagios other than that Nagios is reporting seemingly unusual load. Have you asked a Linux UG for suggestions?) > HARDWARE: > PIII 677 > 384MB ram > Software RAID 1 with IDE(all partitions except swap, yes, I boot > from it too... I already took crap for booting from software raid, > but it works fine, really) > extra drive for swap and nightly "snapshots" of /usr/local/ and / > etc and a few other things. I don't see a problem at all (at least it wouldn't be on a Solaris box; not sure what the load avg. numbers under Mandrake mean): on a box that's doing software RAID & running the Nagios server, you should expect to see some load, on average; and I wouldn't worry about loads up to twice the number of CPUs in the box for brief periods (again, at least not running Solaris, where "load average" means the number of processes in the run queue--including those on CPU, as well as those hanging out waiting on some I/O to complete). However, if this box truly is doing nothing and you still see high loads--especially for prolonged periods--perhaps there is a problem. Try shutting down Nagios & any other daemons you don't need (eg. sendmail) for a while and checking the 'sar' logs for load bumps. If you still see load when there shouldn't be any, you may have been hacked (although by someone not very competent if s/he allowed load from their hidden activities to show). -Guy ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From john at stilen.com Thu Oct 6 01:07:11 2005 From: john at stilen.com (John Stile) Date: Wed, 05 Oct 2005 16:07:11 -0700 Subject: help configuring nagios-mysql with nagiosgraph In-Reply-To: References: <1128385428.17094.77.camel@localhost> Message-ID: <1128553631.15324.46.camel@localhost> On Tue, 2005-10-04 at 10:45 +1000, John Stevens wrote: > perfdata not recognised, usually means that the performance data > passed to nagiosgraph did not match any of the map file's definitions. > Read up on perl regexps and look at the map file. It is pretty well > documented. Then look at your output > > On 10/4/05, John Stile wrote: > The rrd directory is world writable, but no databases are > created. > > The nagiosgraph log shows 'perfdata not recognized' > Fri Sep 30 18:06:21 2005 INSERT info: Input lastcheck: > Fri Sep 30 18:06:21 2005 INSERT info: Input hostname: > Fri Sep 30 18:06:21 2005 INSERT info: Input perfdata: > Fri Sep 30 18:06:21 2005 INSERT info: Input > servicedescr: > Fri Sep 30 18:06:21 2005 INSERT info: Input output: > Fri Sep 30 18:06:21 2005 INSERT warn: perfdata not > recognized > This shows me that nagiosgraph is not receiving any information. > There is no data for lastcheck, hostname, perfdata, servicedescr, or > output. In otherwords, at best, an empty line. What is your > nagios.cfg file like? > You should have lines like this: > host_perfdata_command=process-host-perfdata > service_perfdata_command=process-service-perfdata > process_performance_data=1 > in nagios.cfg, and > # 'process-service-perfdata' command definition > define command{ > command_name process-service-perfdata > command_line /usr/local/bin/exec_perf "$HOSTNAME$" > "$SERVICEDESC$" "$SERVICESTATE$" "$OUTPUT$" "$PERFDATA$" "$LASTCHECK$" > } > in misccommands.cfg. Sorry, I am not familiar with MySQL nagios > backend, but unless you have these set up, and you know what kind of > data is being passed to your process-service-data command, and the map > file recognises your performance data and how to handle it, > nagiosgraph will not produce rrds. It is the process-service-perfdata > command that creates the rrds and updates them with new data. > > Regards > Watching the logs, I never see an INSERT statement. The logged INSERT > statement in my original email was because I ran insert.pl from the > console with no options, so that is why all the data is missing. I'm stick stuck. What seems insane usually is due to a type-o, but I really need some help on where to look next. >From the ground up: 1. Compile flags for the Debian-3.1 package nagios-common (nagios v1.3): ./configure --prefix=/usr \ --mandir=/usr/share/man \ --bindir=/usr/sbin \ --sbindir=/usr/lib/cgi-bin/nagios \ --datadir=/usr/share/nagios/htdocs \ --sysconfdir=/etc/nagios \ --infodir=/usr/share/info \ --libexecdir=/usr/lib/nagios/plugins \ --localstatedir=/var/log/nagios \ --with-ping-command="/bin/ping -n %s -c %d" \ --with-mail=/usr/bin/mail --with-perlcache \ --with-nagios-user=nagios \ --with-nagios-grp=nagios \ --with-template-objects \ --build=$(DEB_BUILD_GNU_TYPE) \ --host=$(DEB_HOST_GNU_TYPE) \ --with-pgsql-xdata \ --with-mysql-xdata \ --with-template-extinfo 2. In /etc/nagios/nagios.conf: process_performance_data=1 service_perfdata_command=process-service-perfdata 3. In /etc/nagios/misccommands.cfg define command { command_name process-service-perfdata command_line /usr/share/nagios/nagiosgraph/insert.pl "$HOSTNAME$" "$LASTCHECK$||$HOSTNAME$||$SERVICEDESC$||$OUTPUT$||$PERFDATA$" >> /var/log/nagios/host-perfdata.out } 4. Permissions on log files and rrd directory are 777, owned by nagios daemon user and apache daemon group. 5. Nothing goes into /var/log/nagios/host-perfdata.out, no rrd databases are created. So what does that tell me? It appears to mean that perfdata is never sent to process-service-perfdata command? So do I load a bullet in the gun at this point? Is the problem that --with-template-extinfo was used and it must be --with-default-extinfo? Are they are mutually exclusive? ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From msugano at uolinc.com Thu Oct 6 01:32:37 2005 From: msugano at uolinc.com (Marcel Mitsuto Fucatu Sugano) Date: Wed, 05 Oct 2005 20:32:37 -0300 Subject: WEB-Interface performance Message-ID: <1128555158.11266.32.camel@localhost> Hi nagios-user list, I don't know how to begin this question, because i can't imagine how much use of the nagios web-interface is made by the people who read this list. But here we use nagios to actively check something around 10k services now, and up to 2300 hosts. Lately we upgrade our monitoring pool of machines, setting up a distributed framework to agregate all warnings at one unique webserver. So far, this new framework is doing its job, but sometimes, we get around 15 people connected to the nagios web-interface, and the status.cgi is taking too much time to load. So here is my question: "Is there any ./configure options, or any set of CFLAGS to improve performance of the cgis?" Here's a snipet from top: Tasks: 135 total, 18 running, 117 sleeping, 0 stopped, 0 zombie Cpu(s): 86.8% us, 12.7% sy, 0.0% ni, 0.2% id, 0.0% wa, 0.2% hi, 0.2% si Mem: 2074356k total, 1450956k used, 623400k free, 170980k buffers Swap: 2104472k total, 0k used, 2104472k free, 1041400k cached PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND 8509 nagios 19 0 7244 6096 424 R 34.0 0.3 0:01.17 status.cgi 8508 nagios 19 0 14868 12m 8508 R 24.5 0.6 0:01.09 status.cgi 8687 nagios 18 0 12756 7104 4600 R 17.5 0.3 0:00.53 status.cgi 8690 nagios 18 0 12756 7016 4544 R 17.2 0.3 0:00.52 status.cgi 8506 nagios 19 0 14472 11m 7772 R 16.2 0.6 0:01.04 status.cgi 8027 nagios 24 0 22952 20m 11m R 12.2 1.0 0:02.93 status.cgi 8115 nagios 21 0 22956 15m 6816 R 10.6 0.8 0:02.21 status.cgi 8078 nagios 22 0 10412 9348 540 R 10.2 0.5 0:03.30 status.cgi 8103 nagios 22 0 10412 9336 528 R 10.2 0.5 0:03.27 status.cgi 8046 nagios 21 0 10416 9340 524 R 7.6 0.5 0:03.06 status.cgi 7995 nagios 22 0 22956 17m 9420 R 1.3 0.9 0:02.52 status.cgi 15374 nagios 15 0 39780 21m 908 S 1.0 1.0 1:48.06 nagios 15382 nagios 16 0 1672 648 540 S 1.0 0.0 0:10.55 nsca 8072 nagios 20 0 22948 13m 4844 R 1.0 0.7 0:01.91 status.cgi 23767 nagios 20 0 223m 8516 2172 S 0.7 0.4 0:00.52 httpd 23769 nagios 20 0 224m 8272 2172 S 0.3 0.4 0:00.52 httpd 8151 msugano 16 0 2040 1136 828 R 0.3 0.1 0:00.05 top As you can see, lots of instances of the cgis around, consuming about 90% of CPU time. The problem we are experiencing here, it's that we used to monitoring nagios service, by checking a regexp at the tac.cgi, and the thresholds are tight, 6 seconds to warning, 8 seconds to critical and 10seconds to timeout. We've never experienced critical levels of this check, but after putting this interface to agregate all alarms, and having 15~20 people hanged onto nagios interface to see whats happening with the services they operate, we are dealing with high levels of response time from cgis. Finally, the machine that's serving the interface is getting passive messages from the active monitoring agents and is a Pentium4 HT-SMP processor, with 2GB memory, SATA HDD, running SuSE9.3 with kernel 2.6.11-8-SMP. -- Marcel Mitsuto -------------- next part -------------- An HTML attachment was scrubbed... URL: From p_santos at psantos.no-ip.org Thu Oct 6 02:22:47 2005 From: p_santos at psantos.no-ip.org (Pavel Santos) Date: Wed, 5 Oct 2005 20:22:47 -0400 Subject: Nagios CGI's not displaying Message-ID: <2C377FE90617434CA42DAC0A4AF052F0E396@exchange.home.net> Hello list, I have installed Nagios 2.0b4 and running on FC4. I can view the Nagios home page and view the documentation but cannot get into any of the CGI pages. I am getting two different error messages. The first error message is that of Forbidden access. This is simply telling me that I can't access a specific directory on the server. The other error is that of page NOT FOUND. This is telling me that the path is not on the server. I followed the nagios instructions for Setting Up The Web Interface and added the following to my httpd.conf file: ScriptAlias /nagios/cgi-bin /usr/local/nagios/sbin AllowOverride AuthConfig Options ExecCGI Order allow,deny Allow from all Alias /nagios /usr/local/nagios/share Options None AllowOverride AuthConfig Order allow,deny Allow from all Everyting is working find on the back-end (getting proper notifications for my hosts etc) Thanks in advanced. Pavel -------------- next part -------------- An HTML attachment was scrubbed... URL: From yonienjcn at gmail.com Thu Oct 6 04:18:30 2005 From: yonienjcn at gmail.com (Tao Yaoning) Date: Wed, 5 Oct 2005 22:18:30 -0400 Subject: about nagiosQL installation and configuration Message-ID: <8ee7548c0510051918h44acdc0ep16bce081c4084d3@mail.gmail.com> Hi, Nagios-users I just download the nagios 2.0b4 rpm and install it. I download nagiosQL also, and extract it under /var/www/html/nagiosQL. I configure nagiosQL according readme.txt include: create database, create configuration directories and setting the properties of the directory and nagios configuration files. I configure the httpd for nagiosQL like this: Alias /nagiosQL "/var/www/html/nagiosQL" AllowOverride All I restart the httpd service, but my Firefox display nothing, no nagiosQL page, no error page, only blank. I don't have any sense about it. I try to fix my httpd configuration, but it still doesn't work properly. Anybody can tell what's wrong with my configuration and how to fix it. Thanks a lot Tao -------------- next part -------------- An HTML attachment was scrubbed... URL: From aaronhcarr at gmail.com Thu Oct 6 04:46:59 2005 From: aaronhcarr at gmail.com (Aaron Carr) Date: Wed, 5 Oct 2005 22:46:59 -0400 Subject: about nagiosQL installation and configuration In-Reply-To: <8ee7548c0510051918h44acdc0ep16bce081c4084d3@mail.gmail.com> References: <8ee7548c0510051918h44acdc0ep16bce081c4084d3@mail.gmail.com> Message-ID: Did you configure the php config file for nagiosQL to connect to the database? Have you checked your httpd error log, or the regular httpd log? Usually those will give some insight to errors like this. That's where I would suggest you start. Aaron On 10/5/05, Tao Yaoning wrote: > > Hi, Nagios-users > > I just download the nagios 2.0b4 rpm and install it. I download nagiosQL > also, and extract it under /var/www/html/nagiosQL. > > I configure nagiosQL according readme.txt include: create database, create > configuration directories and setting the properties of the directory and > nagios configuration files. > > I configure the httpd for nagiosQL like this: > Alias /nagiosQL "/var/www/html/nagiosQL" > > AllowOverride All > > > I restart the httpd service, but my Firefox display nothing, no nagiosQL > page, no error page, only blank. I don't have any sense about it. I try to > fix my httpd configuration, but it still doesn't work properly. Anybody can > tell what's wrong with my configuration and how to fix it. > > Thanks a lot > > Tao > -------------- next part -------------- An HTML attachment was scrubbed... URL: From dido at imperium.ph Thu Oct 6 05:20:11 2005 From: dido at imperium.ph (Rafael 'Dido' Sevilla) Date: Thu, 06 Oct 2005 11:20:11 +0800 Subject: WEB-Interface performance In-Reply-To: <1128555158.11266.32.camel@localhost> References: <1128555158.11266.32.camel@localhost> Message-ID: <434497EB.5090707@imperium.ph> Marcel Mitsuto Fucatu Sugano wrote: > "Is there any ./configure options, or any set of CFLAGS to improve > performance of the cgis?" No. Our distributed Nagios system now has 3500 hosts and more than 30,000 services, and the CGI's are completely useless. Apparently, from the responses I got a few months ago when I asked about it, the web interface has got scalability issues: http://sourceforge.net/mailarchive/message.php?msg_id=11335135 There also appears to be some work on improving the scalability of the web interface; see the other followups. -- True compassion is more than throwing a coin to a beggar. It demands of our humanity that if we live in a society that produces beggars, we are morally commanded to restructure that society. http://stormwyrm.blogspot.com/ ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From richard.gliebe at fhv.at Thu Oct 6 08:04:19 2005 From: richard.gliebe at fhv.at (Richard Gliebe) Date: Thu, 06 Oct 2005 08:04:19 +0200 Subject: Route Interfaces Message-ID: <1128578659.912.28.camel@glr-nb.dh.uclv.net> Hi everyone, We have a little notification *problem* between our Routers. Here is a cut out of our Router Environment: |-------| r1-r2 |-------| |Router1|<--------------->|Router2| |-------| r2-r1 |-------| We are monitoring every interface (r1-r2 / r2-r1) between our Routers (see above). Now, when the interface r1-r2 on our router1 goes down, we get two Notifications, one from the interface "r1-r2" on router1 and from "r2-r1" on router2. But, we only need one Notification (from the faulty router). Is there a way to suppress the other notification ? I've played with the "parents" parameter: # 'r1-r2' host definition define host{ use generic-host host_name r1-r2 alias r1-r2 address xxx.xxx.xxx.xxx parents r1-r2 check_command check-host-alive max_check_attempts 3 notification_interval 0 notification_period 24x7 notification_options d,u,r contact_groups sys-admins } But '# nagios -v ./nagios.cfg' says: [...] Error: There is a circular parent/child path that exists for host 'r1-r2'! [...] We are running Nagios Version 2.0b3 on FreeBSD 5.4-STABLE. Thanks in advance Richard -- Richard Gliebe Fachhochschule Vorarlberg ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lanning at lanning.cc Thu Oct 6 08:15:28 2005 From: lanning at lanning.cc (Robert Hajime Lanning) Date: Wed, 5 Oct 2005 23:15:28 -0700 (PDT) Subject: Route Interfaces In-Reply-To: <1128578659.912.28.camel@glr-nb.dh.uclv.net> References: <1128578659.912.28.camel@glr-nb.dh.uclv.net> Message-ID: <53954.192.168.128.102.1128579328.squirrel@ssl.monsoonwind.com> > # 'r1-r2' host definition > define host{ > use generic-host > > host_name r1-r2 > alias r1-r2 > address xxx.xxx.xxx.xxx > parents r1-r2 > check_command check-host-alive > max_check_attempts 3 > notification_interval 0 > notification_period 24x7 > notification_options d,u,r > contact_groups sys-admins > } The router furthest from the Nagios server should have a "parents" statement with the name of the router closest to the Nagios server. So, something like this: Nagios -> r1-r2 -> r2-r1 -> other network... r2-r1 would have a parent of r1-r2. r1-r2 would have no parent. -- And, did Guloka think the Ulus were too ugly to save? -Centauri ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Wed Oct 5 11:57:50 2005 From: ae at op5.se (Andreas Ericsson) Date: Wed, 05 Oct 2005 11:57:50 +0200 Subject: High CPU utilization at random times In-Reply-To: <43435929.304@adnc.com> References: <43435929.304@adnc.com> Message-ID: <4343A39E.5050403@op5.se> Dan Wilson wrote: > I've been looking into a problem for quite some time now and have come > up stumped. Every time I think I know what the problem is I turn out to > be wrong. > > Sorry, this is LONG but has lots of detail, hopefully all the detail you > guys need to make a diagnosis or point me in the right direction :-) > > PROBLEM: > Randomly, and for no good reason, the CPU usage on this machine will go > up to anywhere from .7 to 1.5!?!?!?!?!? > > HARDWARE: > PIII 677 > 384MB ram > Software RAID 1 with IDE(all partitions except swap, yes, I boot from it > too... I already took crap for booting from software raid, but it works > fine, really) > extra drive for swap and nightly "snapshots" of /usr/local/ and /etc and > a few other things. > > SOFTWARE: > Mandrake linux 10.1(last updates 45 days ago) > Nagios 1.2 (no perl interpreter, with perl cache) I don't think you could have the perl cache without the perl interpreter... > Plugins 1.3.1 > Optional/custom plugins... > check_icmp instead of check_ping Early incantations of check_icmp could end up in an infinite loop if it timed out and entered the finish() function. This ofcourse ups the load no end, until Nagios kills it off with SIGKILL. Try upgrading it from the package at http://oss.op5.se/nagios/op5plugins-2005-09-27.tar.gz AFAIK, this bug was only ever present in a version of check_icmp which specifically wasn't intended for production use, but was tested by a number of friendly helpers (all mentioned in check_icmp.c). > custom check_ink script/plugin - this plugin is written in perl and uses > the netsnmp module for perl. This isn't the problem either, stopped all > service checks that used it for a few hours, the problem was still > there.... FYI: This script checks supply levels in network printers, I > could have used the check_snmp plugin for this but that was too messy(i > tried!). This way the out put is cleaner(ex. Levels OK - C-34% Y-75% > M-12% K-90%) and there is only one check per printer instead of one for > each supply :-) [my programming skills suck, really, they do. You have > to specify the type of printer which has to be put in the script so if > can correctly read the supplies... I should have written it to > "explore" the printer to see what kind of supplies it had and what could > be checked so it would in theory work with any printer... but it works > the way it is, and I couldn't figure out how to get everything to > work... I'm learning and will some day get it to work the way I want????] > check_smart - checks HDD SMART values... not the trouble either, it was > added recently after a HDD went bad and the box crashed 2 nights in a > row(the extra drive was bad and failed during the "snapshot") > > The follwing were the latest stable versions as of about Feb-2005 > Apache > MRTG > NetSNMP > PERL > PHP > MySQL > > > THINGS I HAVE DONE/LOOKED AT TO TRY AND FIX THIS ISSUE: > > Recompiled the kernel... no change, went back to the standard kernel. > > Restarted like a MS machine... uptime makes no difference, pleanty of > memory availble(150+MB) all the time > This seems to indicate an infinite loop problem in some small piece of software then. Believe me, it can eat load *fast*. > Nagios - stopped the service, no issue, start the service and let it run > a while, the problem appears... I recompiled(twice), adjusted a few > options, no luck with the issue though nagios ran a tiny faster, maybe > 1-2%, not worth the wait to recompile IMHO > Did you happen to notice if this coincided with a host going down or in some other way not being able to respond to ping? The host check (or ping service check) output would be something along the lines of "Plugin timed out" if it was down to check_icmp. > MRTG - checking interface on 2 routers, it is using RRD and the > MRTG-RRD.CGI fast cgi script so the load from this every 5 minutes isn't > even worth mentioning. Tried removing access from users to stop > MRTG-RRD.CGI from generating graphs on demand. I even tried stopping > MRTG and lost 4 hours of data but still had the problem. > > Apache - stopped the service, problem still continues. > > PERL - recompiled and removed a few options that the documentation said > could cause trouble, no change. Even ran Nagios without any perl > scripts/plugins, problem still there. > > PHP - nothing is using this at the moment... was only installed for > testing a Nagios config utility with a web interface... > > MySQL - not being used, makes no difference if it is running or not. > > I only run X while downloading updates, otherwise it stays off and I > just SSH in. > > > MORE INFO: > > At first I only noticed it when I would SSH in and look at the load > because it took 15+seconds to log in. I though it was SSH to I started > having Nagios check the CPU load, I can look from time to time and catch > it up nice and high. > > It is NOT logs being rotated, excessive swaping, bad hardware(second > machine it's happened on), too many people accessing the box, too many > services/hosts down.(I'm checking about 90 hosts and 180+ services, > after I delete the retention data and start Nagios fresh everything is > checked and fine in 2 minutes or less.). > > It's not to the point where the box is unusable, it clears up in a > minute or two(always, every time, and that makes it hard to track down). > > It is NOT(at least not that I can tell) Nagios making excessive retries > on problems, it happens when there are no problem and I have the max > retries set to 3 for all but a few things. Timeouts are 10 seconds or > less on all but one check. I'm not using obssesive checks, processing > perf data or anything like that. > > When I first installed nagios 2 years ago I tinkered with getting it to > respond faster, I set the time period to 15 seconds(default is 60?) so I > could get a few things running every 15 or 30 seconds... works great and > with little increased overhead.... I just have to remember that 1 > minute is now 4 and not 1... ;-) Nagios responds like a champ now, > forced checks don't take a minute or longer... 20 seconds at the > longest. I HATE WAITING! LOL > > > > > Any ideas? Or should I just live with it until I upgrade to 2.0? I'll > be moving to faster hardware then anyway, dual PIII 700 with 2GB ram and > hardware RAID1... It's not much but it is better :-) > > > > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, discussions, > and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. ::: Messages without supporting info will risk > being sent to /dev/null > -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From richard.gliebe at fhv.at Thu Oct 6 08:54:52 2005 From: richard.gliebe at fhv.at (Richard Gliebe) Date: Thu, 06 Oct 2005 08:54:52 +0200 Subject: Route Interfaces In-Reply-To: <53954.192.168.128.102.1128579328.squirrel@ssl.monsoonwind.com> References: <1128578659.912.28.camel@glr-nb.dh.uclv.net> <53954.192.168.128.102.1128579328.squirrel@ssl.monsoonwind.com> Message-ID: <1128581692.912.33.camel@glr-nb.dh.uclv.net> On Wed, 2005-10-05 at 23:15 -0700, Robert Hajime Lanning wrote: > > > # 'r1-r2' host definition > > define host{ > > use generic-host > > > > host_name r1-r2 > > alias r1-r2 > > address xxx.xxx.xxx.xxx > > parents r1-r2 > > check_command check-host-alive > > max_check_attempts 3 > > notification_interval 0 > > notification_period 24x7 > > notification_options d,u,r > > contact_groups sys-admins > > } > > The router furthest from the Nagios server should have a "parents" > statement with the name of the router closest to the Nagios server. > > So, something like this: > Nagios -> r1-r2 -> r2-r1 -> other network... > > r2-r1 would have a parent of r1-r2. > r1-r2 would have no parent. So far so good. But, what we want is: when on our router1 the interface r1-r2 goes down, we want only one notification (from the router1), same action from the router2 when the interface r2-r1 goes down. Richard -- Richard Gliebe Fachhochschule Vorarlberg ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lanning at lanning.cc Thu Oct 6 09:32:55 2005 From: lanning at lanning.cc (Robert Hajime Lanning) Date: Thu, 6 Oct 2005 00:32:55 -0700 (PDT) Subject: Route Interfaces In-Reply-To: <1128581692.912.33.camel@glr-nb.dh.uclv.net> References: <1128578659.912.28.camel@glr-nb.dh.uclv.net> <53954.192.168.128.102.1128579328.squirrel@ssl.monsoonwind.com> <1128581692.912.33.camel@glr-nb.dh.uclv.net> Message-ID: <42811.192.168.128.102.1128583975.squirrel@ssl.monsoonwind.com> > when on our router1 the interface r1-r2 goes down, we want only one > notification (from the router1), same action from the router2 when the > interface r2-r1 goes down. If the link between the two is a serial link (like a T1) and you are running something like HDLC, CiscoHDLC or PPP, for framing, then when one side goes down, the other will to. (Line protocol will be down.) Now, I also take it that the Nagios server is parallel to the two routers. So, topology is more like: Nagios | Router A / \ / \ / \ / \ Router r1-r2 <-----> r2-r1 Router The main issue with what you want, is that it is not a parent/child relationship. You want monitoring where one unit is not dependant on the other unit, but you want it combined into one unit. This is very difficult to do, without changing the Nagios code. A work around is to have either: o just one monitor that checks both (some scripting required) or o monitor each seperately with notifications turned off and have a third monitor using check_cluster, from the nagios_plugins contrib directory, that has notifications turned on. -- And, did Guloka think the Ulus were too ugly to save? -Centauri ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mohamed.azizi at belgacom.be Thu Oct 6 09:57:00 2005 From: mohamed.azizi at belgacom.be (mohamed.azizi at belgacom.be) Date: Thu, 6 Oct 2005 09:57:00 +0200 Subject: Command timed out after 60 seconds Message-ID: <5F3043372274524C967EB597009D2A0E02476DCB@AE0008.BGC.NET> Dear Friends , I am seeing in my nagios the following error :NRPE: Command timed out after 60 seconds. Can you tell what is the meaning of this error and how can I change the timer to avoid this error. Thanks in advance for your support Mohamed -------------- next part -------------- An HTML attachment was scrubbed... URL: From moshesharon at gmail.com Thu Oct 6 10:07:23 2005 From: moshesharon at gmail.com (Moshe Sharon) Date: Thu, 6 Oct 2005 08:07:23 +0000 (UTC) Subject: WEB-Interface performance References: <1128555158.11266.32.camel@localhost> Message-ID: Hello the main problem is when the web page tries to read the status information from the log files. we solved this issue in our enviroment with mysql backend which improves the preformance. and the NOC only checks webpage status.cgi? host=all&servicestatustypes=248&sorttype=2&sortoption=3 which list only problems. we also did some custom page with navigation panel which limit the list of hosts per page Moshe Sharon ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Riccardo.GENTILUCCI at skytv.it Thu Oct 6 10:47:05 2005 From: Riccardo.GENTILUCCI at skytv.it (GENTILUCCI Riccardo) Date: Thu, 6 Oct 2005 10:47:05 +0200 Subject: Problem installing nagios-plugins (1.4.1.1): Perl (Net::SNMP) not found but installed! Message-ID: The platform is a FedoraC3 on i386 , full updated. I've installed the required Net::SNNP module but the nagios-plugins.1.4.1.1 don't see this and they don't proceed to install. Can somebody help me please? regards Riccardo Gentilucci IT SKY - Direzione Network & Security Management SKY Italia Via Piranesi, 46 - 20137 Milano Tel: +390270027426 Fax: +390270022248 Mob: +393355342826 Emailto: riccardo.gentilucci at skytv.it Website: http://www.skytv.it -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image001.jpg Type: image/jpeg Size: 1402 bytes Desc: image001.jpg URL: From mohamed.azizi at belgacom.be Thu Oct 6 10:50:19 2005 From: mohamed.azizi at belgacom.be (mohamed.azizi at belgacom.be) Date: Thu, 6 Oct 2005 10:50:19 +0200 Subject: Command timed out after 60 seconds Message-ID: <5F3043372274524C967EB597009D2A0E02476EC1@AE0008.BGC.NET> > Dear Friends , > > I am seeing in my nagios the following error :NRPE: Command timed out > after 60 seconds. > Can you tell what is the meaning of this error and how can I change > the timer to avoid this error. > > Thanks in advance for your support > > Mohamed **** DISCLAIMER **** http://www.belgacom.be/maildisclaimer -------------- next part -------------- An HTML attachment was scrubbed... URL: From ae at op5.se Thu Oct 6 11:03:11 2005 From: ae at op5.se (Andreas Ericsson) Date: Thu, 06 Oct 2005 11:03:11 +0200 Subject: Command timed out after 60 seconds In-Reply-To: <5F3043372274524C967EB597009D2A0E02476EC1@AE0008.BGC.NET> References: <5F3043372274524C967EB597009D2A0E02476EC1@AE0008.BGC.NET> Message-ID: <4344E84F.9000603@op5.se> Posting the same question several times with a couple of minutes interval isn't going to get it answered any faster. In fact, it's gonna get it ignored (at least by me). mohamed.azizi at belgacom.be wrote: > > > > >>Dear Friends , >> >>I am seeing in my nagios the following error :NRPE: Command timed out >>after 60 seconds. >>Can you tell what is the meaning of this error and how can I change >>the timer to avoid this error. >> >>Thanks in advance for your support >> >>Mohamed > > > > **** DISCLAIMER **** > http://www.belgacom.be/maildisclaimer -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Thu Oct 6 11:06:28 2005 From: ae at op5.se (Andreas Ericsson) Date: Thu, 06 Oct 2005 11:06:28 +0200 Subject: Problem installing nagios-plugins (1.4.1.1): Perl (Net::SNMP) not found but installed! In-Reply-To: References: Message-ID: <4344E914.1050004@op5.se> GENTILUCCI Riccardo wrote: > The platform is a FedoraC3 on i386 , full updated. > > > > I've installed the required Net::SNNP module but the > nagios-plugins.1.4.1.1 don't see this and they don't proceed to install. > The configure and make script does nothing at all to locate and/or check the usability of the perl Net::SNMP stuff during the installation. What's probably missing from your system is the net-snmp package from www.net-snmp.org (or www.netsnmp.org, one of those, I think), which is the underlying C sources for everything else SNMP-based. Some of the C plugins requires this package to be installed. On FC3, I think the package is named net-snmp-utils and it should be available from freshrpms or some such. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ankushgrover1711 at gmail.com Thu Oct 6 12:14:58 2005 From: ankushgrover1711 at gmail.com (ankush grover) Date: Thu, 6 Oct 2005 15:44:58 +0530 Subject: I want to monitor Apache,Mysql,Disk Space,Bandwidth on my Clustered Servers configured on FC3 Message-ID: hey friends, Today I downloaded the rpm of nagios and nagios-plugins from the nagios site.I have recently configured Apache & Mysql clustering. Now I want to monitor Apache & Mysql connections,bandwidth consumed by these services ,load of both these services apart from that I also want to monitor the CPU Load,Disk Space etc ... on my clustered servers. What files I do need to edit for my reqiurements.I am using Fedora Core 3. Any useful guide or tutorials . Thanks & Regards Ankush Grover -------------- next part -------------- An HTML attachment was scrubbed... URL: From robmossrm at aol.com Thu Oct 6 12:16:08 2005 From: robmossrm at aol.com (Rob Moss) Date: Thu, 06 Oct 2005 11:16:08 +0100 Subject: WEB-Interface performance In-Reply-To: <1128555158.11266.32.camel@localhost> References: <1128555158.11266.32.camel@localhost> Message-ID: <4344F968.9090701@aol.com> Marcel Mitsuto Fucatu Sugano wrote: > Hi nagios-user list, > > I don't know how to begin this question, because i can't imagine how > much use of the nagios web-interface is made by the people who read > this list. But here we use nagios to actively check something around > 10k services now, and up to 2300 hosts. Lately we upgrade our > monitoring pool of machines, setting up a distributed framework to > agregate all warnings at one unique webserver. So far, this new > framework is doing its job, but sometimes, we get around 15 people > connected to the nagios web-interface, and the status.cgi is taking > too much time to load. So here is my question: > "Is there any ./configure options, or any set of CFLAGS to improve > performance of the cgis?" Here's a snipet from top: > [snip] > Finally, the machine that's serving the interface is getting passive > messages from the active monitoring agents and is a Pentium4 HT-SMP > processor, with 2GB memory, SATA HDD, running SuSE9.3 with kernel > 2.6.11-8-SMP. Yeah, there are some CFLAGS you could be using to optmise your build.. I am assuming that you have a recent version of GCC, and that your P4 HT cpu is shown as having two logical CPU's Try rebuilding with the following command: cd nagios-2.0b4 CC=gcc CFLAGS="-mtune=i686 -O3 -pipe -march=i686 -funroll-loops -ffast-math" \ ./configure --prefix=/usr/local/nagios ...... [rest of nagios configure commands] The main problem is that you have thousands of hosts, and thousands of services to read in every time you run status.cgi. No matter how efficient the program is, reading in and displaying that much data is going to take a while, and running the same program 15 times simultaneously is going to affect your performance as you see here. How many lines is the status.dat file? I only have a few hundred hosts and services, and the file is 23,000 lines or so, half a meg on disk.. I would imagine yours is closer to about 50mb and closer to a million lines. Some alternatives might be updating the Nagios sidebar so that it doesn't display ALL hosts by default, maybe just a smaller hostgroup.. (although i suspect the status.cgi needs to read in the whole file) Or replacing the standard nagios CGI's with something that is more geared towards handling hundreds of thousands of hosts/services... Or perhaps you could have a separate display server, a webserver running the cgi's which reads in the nagios status.dat file over the network from the nagios server, and does all the processing away from the nagios collector.. This would move processing off of the nagios collector.. You could use rsync to keep the two files in sync, keep a duplicate on the display server on a local disk (or a tmpfs memory based filesystem for extra speed)... Hope this helps rob. -------------- next part -------------- An HTML attachment was scrubbed... URL: From Riccardo.GENTILUCCI at skytv.it Thu Oct 6 12:56:03 2005 From: Riccardo.GENTILUCCI at skytv.it (GENTILUCCI Riccardo) Date: Thu, 6 Oct 2005 12:56:03 +0200 Subject: R: Problem installing nagios-plugins (1.4.1.1): Perl (Net::SNMP) not found but installed! Message-ID: Hi Andreas, the package is present and correctly installed and visible (net-snmp-utils-5.2.1.2-FC3.1) Riccardo Gentilucci IT SKY - Direzione Network & Security Management SKY Italia Via Piranesi, 46 - 20137 Milano Tel: +390270027426 Fax: +390270022248 Mob: +393355342826 Emailto: riccardo.gentilucci at skytv.it Website: http://www.skytv.it -----Messaggio originale----- Da: Andreas Ericsson [mailto:ae at op5.se] Inviato: gioved? 6 ottobre 2005 11.06 A: GENTILUCCI Riccardo Cc: nagios-users at lists.sourceforge.net Oggetto: Re: [Nagios-users] Problem installing nagios-plugins (1.4.1.1): Perl (Net::SNMP) not found but installed! GENTILUCCI Riccardo wrote: > The platform is a FedoraC3 on i386 , full updated. > > > > I've installed the required Net::SNNP module but the > nagios-plugins.1.4.1.1 don't see this and they don't proceed to install. > The configure and make script does nothing at all to locate and/or check the usability of the perl Net::SNMP stuff during the installation. What's probably missing from your system is the net-snmp package from www.net-snmp.org (or www.netsnmp.org, one of those, I think), which is the underlying C sources for everything else SNMP-based. Some of the C plugins requires this package to be installed. On FC3, I think the package is named net-snmp-utils and it should be available from freshrpms or some such. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mramos at co.sapo.pt Thu Oct 6 13:00:55 2005 From: mramos at co.sapo.pt (Marco Ramos) Date: Thu, 06 Oct 2005 12:00:55 +0100 Subject: I want to monitor Apache,Mysql,Disk Space,Bandwidth on my Clustered Servers configured on FC3 In-Reply-To: References: Message-ID: <1128596456.3665.20.camel@localhost.localdomain> http://nagios.org/docs/ is a good place to start. On Thu, 2005-10-06 at 15:44 +0530, ankush grover wrote: > hey friends, > > Today I downloaded the rpm of nagios and nagios-plugins from the > nagios site.I have recently configured Apache & Mysql clustering. > Now I want to monitor Apache & Mysql connections,bandwidth consumed by > these services ,load of both these services apart from that I also > want to monitor the CPU Load,Disk Space etc ... on my clustered > servers. > > What files I do need to edit for my reqiurements.I am using Fedora > Core 3. > > Any useful guide or tutorials . > > Thanks & Regards > > Ankush Grover > > ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Ralf-Lists at ralfgross.de Thu Oct 6 13:07:50 2005 From: Ralf-Lists at ralfgross.de (Ralf Gross) Date: Thu, 6 Oct 2005 13:07:50 +0200 (CEST) Subject: R: Problem installing nagios-plugins (1.4.1.1): Perl (Net::SNMP) not found but installed! In-Reply-To: References: Message-ID: <20519.141.113.101.31.1128596870.squirrel@www.stz-softwaretechnik.de> > Hi Andreas, the package is present and correctly installed and visible > (net-snmp-utils-5.2.1.2-FC3.1) What error message do you get? Ralf ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Ralf-Lists at ralfgross.de Thu Oct 6 13:29:47 2005 From: Ralf-Lists at ralfgross.de (Ralf Gross) Date: Thu, 6 Oct 2005 13:29:47 +0200 (CEST) Subject: R: Problem installing nagios-plugins (1.4.1.1): Perl (Net::SNMP) not found but installed! In-Reply-To: References: Message-ID: <41100.141.113.101.31.1128598187.squirrel@www.stz-softwaretechnik.de> > Hi Andreas, the package is present and correctly installed and visible > (net-snmp-utils-5.2.1.2-FC3.1) I think you need the fedora net-snmp-perl package. Ralf ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ankushgrover1711 at gmail.com Thu Oct 6 13:43:31 2005 From: ankushgrover1711 at gmail.com (ankush grover) Date: Thu, 6 Oct 2005 17:13:31 +0530 Subject: R: Problem installing nagios-plugins (1.4.1.1): Perl (Net::SNMP) not found but installed! In-Reply-To: <41100.141.113.101.31.1128598187.squirrel@www.stz-softwaretechnik.de> References: <41100.141.113.101.31.1128598187.squirrel@www.stz-softwaretechnik.de> Message-ID: On 10/6/05, Ralf Gross wrote: > > > Hi Andreas, the package is present and correctly installed and visible > > (net-snmp-utils-5.2.1.2-FC3.1) > > I think you need the fedora net-snmp-perl package. > > Ralf > > > yes , It is perl-Net-SNMP package.It is available at http://dag.wieers.com/packages/perl-Net-SNMP/ Regards Ankush Grover -------------- next part -------------- An HTML attachment was scrubbed... URL: From mlist.nagios at inlan.inmics.fi Thu Oct 6 14:09:35 2005 From: mlist.nagios at inlan.inmics.fi (MiikaT) Date: Thu, 6 Oct 2005 15:09:35 +0300 Subject: Histogram report problem Message-ID: <1128600574.434513ff03535@imp.inmics.fi> I am trying to do a service based histogram using Reporting/alert histogram/report type service/service router-ping. I seem to get the same graph all the time, whenever I change the values. For example, I select period/last week, start/end Date October 1, 2005, statistics brakedown/hour of the day, event to graph/service critical events, state types to graph/hard states, assume state retention/yes, initial states logged/no, ignore repeated states no. When hitting the create report, I get a nice report for the chosen time period. If I change the start/end date to, let say October 3, 2005, and all other values are the same, I get the same report, only the selected time period changes on top of the histogram. Am I doing something wrong, or howcome the results are like this? This is with nagios 1.2 and nagios-plugins-1.4.0alpha1. -Miika ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From robmossrm at aol.com Thu Oct 6 15:45:48 2005 From: robmossrm at aol.com (Rob Moss) Date: Thu, 06 Oct 2005 14:45:48 +0100 Subject: R: Problem installing nagios-plugins (1.4.1.1): Perl (Net::SNMP) not found but installed! In-Reply-To: References: <41100.141.113.101.31.1128598187.squirrel@www.stz-softwaretechnik.de> Message-ID: <43452A8C.3040001@aol.com> ankush grover wrote: > > > On 10/6/05, Ralf Gross > wrote: > > > Hi Andreas, the package is present and correctly installed and > visible > > (net-snmp-utils-5.2.1.2-FC3.1) > > I think you need the fedora net-snmp-perl package. > > Ralf > > > yes , > > > > It is perl-Net-SNMP package.It is available at > > http://dag.wieers.com/packages/perl-Net-SNMP/ > > Regards > > Ankush Grover Or you could learn how to install Perl modules using the standard Perl CPAN installation system: perl -MCPAN -e shell and enter 'install Net::SNMP' or alternatively perl -MCPAN -e 'install Net::SNMP' rob. -------------- next part -------------- An HTML attachment was scrubbed... URL: From nagios_user at s1test1.it Thu Oct 6 15:55:13 2005 From: nagios_user at s1test1.it (nagios_user at s1test1.it) Date: Thu, 6 Oct 2005 15:55:13 +0200 Subject: nagiosQL Message-ID: <20051006155513.nrmxnqmlutwk4k88@webmail.s1test1.it> Goodmorning, I have a little problem to issue. I want use nagiosQL, so I've install it. When I try to insert "cfg_dir=/...." in nagios.cfg nagios don't start and give me an error on configuration file. What is the problem? Can you help me? p.s. sorry for my english... thank you very much. Best regards Rodolfo ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Ralph.Grothe at itdz-berlin.de Thu Oct 6 16:08:13 2005 From: Ralph.Grothe at itdz-berlin.de (Ralph.Grothe at itdz-berlin.de) Date: Thu, 6 Oct 2005 16:08:13 +0200 Subject: Inexplicable service escalation behaviour Message-ID: Dear List Subscribers, although I have already asked how to properly set up an escalation scheme a couple of weeks ago here (sorry, only find time to continue Nagios fumbling every now and then at work, and at home it would be useless due to lacking testing ground (ok, I could emulate a host and network farm by vmware or xen etc., but that's too much fuss)) I desperately need further assistance. I am not getting on with this. I swear that I've read carefully the sections on escalations in the Nagios docs at least thrice by now. And the presented examples in the docs sound very convincing to me (though a bit far-fetched) so that I very well can gather, I suppose, how it should work - in *theory*. My objective seems very trivial to me. I just want Nagios to send a *single* notification by using my "file-service-sc-ticket" (misc)command definition to our trouble ticketing system, but at the same time keep continuing sending out repetetive notifications to the various admin recipients at the common notification intervall (at least the latter is working). The filing of the ticket works great. In fact too great, as it turns out to be a flooding of the service center. Tickets keep being generated at common notification intervals, even for recovery alerts (which I never intended). Also are tickets generated for downed hosts I wouldn't have thought to be able (per my Nagios definitions) to send a ticketing request to the service center. I wonder what the host_name directive in the serviceescalation definition is any use for if tickets are filed for other hosts despite? A hostescalation definition so far doesn't exist yet. I deliberately restricted it to a fumbling host called "fiddle" until I get this trivial task working, whereafter I would of course extend it to all my monitored hosts and services. So this is the only escalation definition so far: $ cat escalations.cfg define serviceescalation { host_name fiddle service_description icmp-host-alive first_notification 3 last_notification 3 notification_interval 0 contact_groups service_center } This is the above service: define service { use generic-service service_description icmp-host-alive hostgroup_name non_fwalled_hosts check_command check-host-alive contact_groups nagiosadmin,service_center } This is the inherited service template: define service { name generic-service is_volatile 0 max_check_attempts 5 normal_check_interval 5 retry_check_interval 3 check_period 24x7 active_checks_enabled 1 passive_checks_enabled 0 parallelize_check 1 obsess_over_service 0 check_freshness 0 event_handler notify-by-email event_handler_enabled 0 flap_detection_enabled 0 process_perf_data 0 retain_status_information 1 retain_nonstatus_information 1 notification_interval 30 notification_period 24x7 notification_options w,u,c,r notifications_enabled 1 contact_groups nagiosadmin register 0 } This is the host definition for fiddle: define host { use generic-host host_name fiddle alias MC/SG Cluster Package FIDDLE address 123.123.123.123 hostgroups non_fwalled_hosts contact_groups nagiosadmin } This is the contact group definition receiving the tickets (i.e. service center) define contactgroup { contactgroup_name service_center alias Service Center TT Filer Accounts members scadmin } And finally this is the contact (inclusive template, but with bogus mail address here): define contact { name generic-contact register 0 contact_name grothe alias Must be overridden contactgroups sazadmin host_notification_period workhrs service_notification_period workhrs host_notification_options d,u,r service_notification_options w,u,c,r host_notification_commands host-notify-by-email service_notification_commands notify-by-email email nagios } define contact { use generic-contact contact_name scadmin alias Service Center TT Filer email scadmin at our.rotten.com host_notification_period 24x7 service_notification_period 24x7 host_notification_commands file-host-sc-ticket service_notification_commands file-service-sc-ticket address1 SC Token address2 Another SC Token } I think I can skip the command definition for "file-service-sc-ticket" here (I surely know by the sheer ticket flood that at least this part is doing its duty as expected) I am absolutely clueless why the service center is receiving those ticket filing requests repetitvely, and even from other hosts of host group "non_fwalled_hosts" when I did in fact specify host fiddle in the service escalation definition (something I would consider a clear disambiguator directive). If I can't get this trivial but important functionality of ticket generation working I will have to dismiss the whole Nagios experience and look out for another tool, which I think would be a very sad thing, given the time spent so far and the positive impressions from the working parts. P.S. I don't know if this is of any importance at all, but these are the releases I run: $ printf "%s\n\n" "$(uname -srv)";/opt/sw/nagios/bin/nagios -V|head -5 AIX 3 4 Nagios 2.0b3 Copyright (c) 1999-2005 Ethan Galstad (www.nagios.org) Last Modified: 04-03-2005 License: GPL Many thanks for your kind notice Ralph ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From yonienjcn at gmail.com Thu Oct 6 16:16:27 2005 From: yonienjcn at gmail.com (Tao Yaoning) Date: Thu, 6 Oct 2005 10:16:27 -0400 Subject: nagiosQL In-Reply-To: <20051006155513.nrmxnqmlutwk4k88@webmail.s1test1.it> References: <20051006155513.nrmxnqmlutwk4k88@webmail.s1test1.it> Message-ID: <8ee7548c0510060716r3fbd256fkf3c23f5b1e2f6a36@mail.gmail.com> Does your directory really exists? I just make my nagiosQL working. It's really great. In fact, you don't need have a running nagions daemon, when you run nagiosQL. Like me, I don't run nagios, in fact I didn't configure anything for my nagios. but my nagiosQL still works fine, and I can configure my nagios configuration file with nagiosQL On 10/6/05, nagios_user at s1test1.it wrote: > > Goodmorning, > I have a little problem to issue. > I want use nagiosQL, so I've install it. > When I try to insert "cfg_dir=/...." in nagios.cfg nagios don't start and > give > me an error on configuration file. > What is the problem? Can you help me? > p.s. sorry for my english... > thank you very much. > Best regards > > Rodolfo > > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, discussions, > and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: From rejaine at bhz.jamef.com.br Thu Oct 6 16:26:03 2005 From: rejaine at bhz.jamef.com.br (Rejaine Monteiro) Date: Thu, 06 Oct 2005 11:26:03 -0300 Subject: I need a tip for page swapping monitoring... Message-ID: <1128608762.6009.14.camel@acesup2.bhz.jamef> Hi, How can I monitoring "page swapping" (not just "swap" or "memory") like in xosview on the "page" option? Any ideas? ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From yonienjcn at gmail.com Thu Oct 6 16:12:52 2005 From: yonienjcn at gmail.com (Tao Yaoning) Date: Thu, 6 Oct 2005 10:12:52 -0400 Subject: about nagiosQL installation and configuration In-Reply-To: References: <8ee7548c0510051918h44acdc0ep16bce081c4084d3@mail.gmail.com> Message-ID: <8ee7548c0510060712g2e28d22ft6b3f4745ed2f653a@mail.gmail.com> you are rtight, I need config the physical path in setting.ini On 10/5/05, Aaron Carr wrote: > > Did you configure the php config file for nagiosQL to connect to the > database? > > Have you checked your httpd error log, or the regular httpd log? Usually > those will give some insight to errors like this. > > That's where I would suggest you start. > > Aaron > -------------- next part -------------- An HTML attachment was scrubbed... URL: From msugano at uolinc.com Thu Oct 6 16:45:11 2005 From: msugano at uolinc.com (Marcel Mitsuto Fucatu Sugano) Date: Thu, 06 Oct 2005 11:45:11 -0300 Subject: WEB-Interface performance In-Reply-To: <4344F968.9090701@aol.com> References: <1128555158.11266.32.camel@localhost> <4344F968.9090701@aol.com> Message-ID: <1128609911.17808.18.camel@localhost> Hi list, Rob, Moshe, Rafael... Thank you all for your time and leading help! ;) On Thu, 2005-10-06 at 11:16 +0100, Rob Moss wrote: > Yeah, there are some CFLAGS you could be using to optmise your build.. > I am assuming that you have a recent version of GCC, and that your P4 > HT cpu is shown as having two logical CPU's > > Try rebuilding with the following command: > cd nagios-2.0b4 > CC=gcc CFLAGS="-mtune=i686 -O3 -pipe -march=i686 -funroll-loops > -ffast-math" \ > ./configure --prefix=/usr/local/nagios ...... [rest of nagios > configure commands] I will try this right now, but, as you stated below, the status.dat file is around 13MB of size and over 500k lines, and I agree with you that recompiling shouldn't improve the performance in a way we could see the status.cgi computing faster. But I will recompile anyway, just for the minimum improve I can get on performance. > > The main problem is that you have thousands of hosts, and thousands of > services to read in every time you run status.cgi. No matter how > efficient the program is, reading in and displaying that much data is > going to take a while, and running the same program 15 times > simultaneously is going to affect your performance as you see here. > How many lines is the status.dat file? I only have a few hundred > hosts and services, and the file is 23,000 lines or so, half a meg on > disk.. I would imagine yours is closer to about 50mb and closer to a > million lines. > > Some alternatives might be updating the Nagios sidebar so that it > doesn't display ALL hosts by default, maybe just a smaller hostgroup.. > (although i suspect the status.cgi needs to read in the whole file) Or > replacing the standard nagios CGI's with something that is more geared > towards handling hundreds of thousands of hosts/services... We are doing that way, people who are watching the service problems page just browse the page with the critical alerts, and sorted by duration of the alarm. Again I think that the status.cgi will read all the file, because response time isn't lower than loading all problems in one page. > > Or perhaps you could have a separate display server, a webserver > running the cgi's which reads in the nagios status.dat file over the > network from the nagios server, and does all the processing away from > the nagios collector.. This would move processing off of the nagios > collector.. You could use rsync to keep the two files in sync, keep a > duplicate on the display server on a local disk (or a tmpfs memory > based filesystem for extra speed)... That is a good idea. I also wondered if it would be possible to setup syslog-ng and send only the information I want to another server, which displays only the critical problems, this way the status.dat should be smaller. Well, there should exists lots of solutions to this kind of problem, and we will be trying something like this soon. > > Hope this helps > rob. Again, thank you, it helped. -- Marcel Mitsuto Fucatu Sugano Universo Online S.A. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Thu Oct 6 17:28:46 2005 From: ae at op5.se (Andreas Ericsson) Date: Thu, 06 Oct 2005 17:28:46 +0200 Subject: Inexplicable service escalation behaviour In-Reply-To: References: Message-ID: <434542AE.9090409@op5.se> Ralph.Grothe at itdz-berlin.de wrote: > Dear List Subscribers, > > although I have already asked how to properly set up an > escalation scheme > a couple of weeks ago here (sorry, only find time to continue > Nagios fumbling > every now and then at work, and at home it would be useless due > to lacking testing ground > (ok, I could emulate a host and network farm by vmware or xen > etc., but that's too much fuss)) > I desperately need further assistance. > > I am not getting on with this. > > I swear that I've read carefully the sections on escalations in > the Nagios docs at least thrice > by now. > And the presented examples in the docs sound very convincing to > me > (though a bit far-fetched) so that I very well can gather, I > suppose, how it should work - in *theory*. > > My objective seems very trivial to me. > > I just want Nagios to send a *single* notification by using my > "file-service-sc-ticket" > (misc)command definition to our trouble ticketing system, > but at the same time keep continuing sending out repetetive > notifications to the > various admin recipients at the common notification intervall > (at least the latter is working). > Create a small script that checks the value of the macro $NOTIFICATIONNUMBER$ (or some such, check the docs for exact name). If it's any higher than 1, don't send to the ticketing system. This is by far the easiest way of doing reverse escalations. OTOH, if you're technical staff aren't doing anythingh about network problems, you have other trouble that (to me) seem far more urgent. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Thu Oct 6 17:35:26 2005 From: ae at op5.se (Andreas Ericsson) Date: Thu, 06 Oct 2005 17:35:26 +0200 Subject: I need a tip for page swapping monitoring... In-Reply-To: <1128608762.6009.14.camel@acesup2.bhz.jamef> References: <1128608762.6009.14.camel@acesup2.bhz.jamef> Message-ID: <4345443E.7090702@op5.se> Rejaine Monteiro wrote: > Hi, > > How can I monitoring "page swapping" (not just "swap" or "memory") like > in xosview on the "page" option? > If by page swapping you mean the amount of times the system hits a page fault (i.e. accesses the data on swap), I have no idea. I further have no idea of how to monitor the amount of time the system swaps pages to disk, although it should be easy enough to at least estimate with a bit of trickery (check swap partition usage with common intervals and see how much it grows/shrinks). I must confess though that unless you're doing some serious voodoo coding involving kernel memory management and cache miss predicitions, I really don't see a use for it, and if you were I've got a feeling you wouldn't be asking these questions. Usually it's enough to monitor the swap usage. When it grows to a certain amount it's time to do something about it, no matter how fast it got there. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From atworkowski at masterfile.com Thu Oct 6 18:51:42 2005 From: atworkowski at masterfile.com (Adam Tworkowski) Date: Thu, 06 Oct 2005 12:51:42 -0400 Subject: voice notification via telephone (vgetty, asterisk, etc.) ? Message-ID: <1128617502.12286.14.camel@localhost.localdomain> Hi, I have searched various Nagios lists (-users, -devel) with the aim of finding out if anyone has successfully implemented Nagios notifications via voice synthesis to a land- or cell-based telephone call. There have been various references to topic in the last year but I cannot find anything that is concrete (links are dead, or threads end and presumably turn into private conversations elsewhere.) To provide further clarity, I am hoping to use a PCI voice modem (real controller,not softmodem) and have selected critical alerts be "forwarded" to a predefined telephone number and play a message like "Nagios is reporting the server X is not reachable". Does anyone have any thoughts on the matter or useful references? Thanks in advance. -Adam ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From christophe.suire at adelux.fr Thu Oct 6 19:23:17 2005 From: christophe.suire at adelux.fr (Christophe SUIRE) Date: Thu, 6 Oct 2005 19:23:17 +0200 Subject: voice notification via telephone (vgetty, asterisk, etc.) ? In-Reply-To: <1128617502.12286.14.camel@localhost.localdomain> References: <1128617502.12286.14.camel@localhost.localdomain> Message-ID: Hi, I have done it for a customer .. In fact after some test with text to voice utility (problems with names of servers), we record some "wav" files, like "the server" - "is down" - "name of a server". I use a tool to convert wav files to modem voice files : wavtopvf When i have to send a notification i group the good pvf files to make a unique file and convert il to rmd I convert this file to rmd file : cat /home/nagios/wav/file1.pvf /home/nagios/wav/file2.pvf 2>/dev/ null | pvfspeed -s 7200 | pvfamp -A 3 | pvftormd Rockwell 2 > /tmp/ $PID-temp.rmd Then with mgetty-voice installed and configured, you can use the script callme.pl to send the file. Bye. -- Christophe Suire #!/usr/bin/perl -w # # $Id: callme.pl,v 1.2 1999/01/15 14:30:44 kas Exp $ # # This script dials a given number and then re-plays the message. # Use "vm shell -S /usr/bin/perl callme.pl number message.rmd" # for calling the "number" and playing the "message.rmd". # # Copyright (c) 1998 Jan "Yenya" Kasprzak . All rights # reserved. This package is free software; you can redistribute it and/or # modify it under the same terms as Perl itself. # use Modem::Vgetty; my $v = new Modem::Vgetty; die "Usage: callme.pl number message.rmd" if $#ARGV != 1; $v->device('DIALUP_LINE'); $v->add_handler('BUSY_TONE', 'finish', sub { $v->stop; exit 0; }); $v->enable_events; $v->dial($ARGV[0]); $v->waitfor('READY'); $v->play_and_wait($ARGV[1]); 1; Le 6 oct. 05 ? 18:51, Adam Tworkowski a ?crit : > Hi, > > I have searched various Nagios lists (-users, -devel) with the aim of > finding out if anyone has successfully implemented Nagios > notifications > via voice synthesis to a land- or cell-based telephone call. > > There have been various references to topic in the last year but I > cannot find anything that is concrete (links are dead, or threads end > and presumably turn into private conversations elsewhere.) > > To provide further clarity, I am hoping to use a PCI voice modem (real > controller,not softmodem) and have selected critical alerts be > "forwarded" to a predefined telephone number and play a message like > "Nagios is reporting the server X is not reachable". > > Does anyone have any thoughts on the matter or useful references? > > Thanks in advance. > > -Adam > > > > > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, > discussions, > and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > > ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From atworkowski at masterfile.com Thu Oct 6 20:17:08 2005 From: atworkowski at masterfile.com (Adam Tworkowski) Date: Thu, 06 Oct 2005 14:17:08 -0400 Subject: voice notification via telephone (vgetty, asterisk, etc.) ? In-Reply-To: References: Message-ID: <1128622628.20040.22.camel@localhost.localdomain> Hi James, There are several reasons: Firstly, it is my manager's preference not to rely on SMS. His reasons are that when he contacted Bell Canada (national carrier) investigating what sort of SLA they could provide for their SMS services, the response was to the effect of [insert chuckle by Simpson's character Dr. Hibbert's here] "God no! We don't even pretend that are SMS services 24x7/365 -- we shut them down all the time for maintenance". Needless to say this didn't leave a warm feeling with my manager. While not that recently, I have on multiple occasions experienced having SMS messages "stay in queue" only to arrive hours later, which of course, is hard to explain to the customer as to why they detected the alert before I did. This was with a different national carrier. Also with SMS, you are relying on a long chain of dependencies including routers, firewalls, switches on our side as well as Internet connection, reliability of carriers servers, etc. You can, of course, use an SMS modem which reduces some of the "hops". A phone call seems to cut out at least 1/2 of the areas of possible failure. That said, telephone and SMS (or pager) used in conjunction for critical alerts would likely provide the best degree of coverage. -Adam On Thu, 2005-10-06 at 18:08 +0100, James Peel wrote: > Hi > > > To provide further clarity, I am hoping to use a PCI voice modem (real > > controller,not softmodem) and have selected critical alerts be > > "forwarded" to a predefined telephone number and play a message like > > "Nagios is reporting the server X is not reachable". > > > Does anyone have any thoughts on the matter or useful references? > > My first thought is that SMS messaging would be much easier to implement. > What made you choose voice rather than text notification? > > Main reason for posting is that I've considered voice messaging in the > past but couldn't really see an advantage unless SMS or pager services > weren't available. > > > Regards, > James > > > -- Regards, Adam Tworkowski, atworkowski at masterfile.com Systems Administrator, Computer Department Masterfile Corporation, www.masterfile.com This e-mail message is privileged, confidential and subject to copyright. Any unauthorized use or disclosure is prohibited. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From john at stilen.com Thu Oct 6 21:25:59 2005 From: john at stilen.com (John Stile) Date: Thu, 06 Oct 2005 12:25:59 -0700 Subject: comple flag --with-template-extinfo Message-ID: <1128626759.11491.7.camel@localhost> Will compiling with --with-template-extinfo result in the same access perfdata as using --with-default-extinfo? I'm not seeing any perfdata sent to the program in the process-service-perfdata definition in misccommands.cfg, so is this config file ignored if I use --with-template-extinfo? ----------------------- A longer story follows: ----------------------- I use debian-3.1 with nagios-1.3 deb packages nagios-common and nagios-mysql, and trying to get perfdata info an rrd. By default the deb package is configure with this incantation: ./configure --prefix=/usr \ --mandir=/usr/share/man \ --bindir=/usr/sbin \ --sbindir=/usr/lib/cgi-bin/nagios \ --datadir=/usr/share/nagios/htdocs \ --sysconfdir=/etc/nagios \ --infodir=/usr/share/info \ --libexecdir=/usr/lib/nagios/plugins \ --localstatedir=/var/log/nagios \ --with-ping-command="/bin/ping -n %s -c %d" \ --with-mail=/usr/bin/mail --with-perlcache \ --with-nagios-user=nagios \ --with-nagios-grp=nagios \ --with-template-objects \ --build=$(DEB_BUILD_GNU_TYPE) \ --host=$(DEB_HOST_GNU_TYPE) --with-pgsql-xdata --with-mysql-xdata --with-template-extinfo This is missing the switch '--with-default-extinfo' but it has a '--with-template-extinfo'. I'm having a hard time finding a doc to tell me the real difference, or how I must change what I am doing to gain access to the perfdata. Do I have to rebuild the package with '--with-default-extinfo'? In /etc/nagios/nagios.conf: process_performance_data=1 service_perfdata_command=process-service-perfdata In /etc/nagios/cgi.bin nagios_check_command=/etc/nagios/check_nagios_db # DG EXTENDED DATA xeddb_host=localhost #xeddb_port=someport xeddb_database=nagios xeddb_username=nagios xeddb_password=nagios In misccommands.cfg define command{ command_name process-service-perfdata command_line /usr/bin/printf "%b" "$LASTCHECK$\t$HOSTNAME$\t$SERVICEDESC$\t$SERVICESTATE$\t$SERVICEATTEMPT$\t$STATETYPE$\t$EXECUTIONTIME$\t$LATENCY$\t$OUTPUT$\t$PERFDATA$" >> /var/log/nagios/service-perfdata.out } 666 permissions on /var/log/nagios/service-perfdata.out But noting ever goes into /var/log/nagios/service-perfdata.out No errors in nagios.log Help! ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rouilj at cs.umb.edu Thu Oct 6 22:53:56 2005 From: rouilj at cs.umb.edu (John P. Rouillard) Date: Thu, 06 Oct 2005 16:53:56 -0400 Subject: Expanding dynamic service macros for URL use? Message-ID: <200510062053.j96KruqC018557@mx1.cs.umb.edu> Hi all: I was looking at using the action_url in the serviceextinfo object to pass data to a CGI that would create a trouble ticket. I set up the URL (split for readability) as: action_url https://some_site.com/roundup/create?host="$HOSTNAME$"& service="$SERVICEDESC$"&state=$SERVICESTATE$& output="$SERVICEOUTPUT$" or output="$SERVICEOUTPUT:NULL:ping$" While the HOSTNAME and SERVICEDESC macros work fine, the SERVICEOUTPUT either in standard or on-demand form along with the SERVICESTATE macro don't expand. I think they can just expand to the contents shown on the service information page. So does anybody else think this would be a good thing to have, or is it just me? I am not sure that it is necessary to URL encode them since the "bad" characters can just be eliminated using illegal_macro_output_chars, so a SERVICEOUTPUTURL isn't needed, just expansion. -- rouilj John Rouillard =========================================================================== My employers don't acknowledge my existence much less my opinions. P.S. I sent it before I rejoined the mailing list and it was apparently rejected by the moderator. In case the report was in error, I apologize for the duplicate. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rouilj at cs.umb.edu Thu Oct 6 22:58:24 2005 From: rouilj at cs.umb.edu (John P. Rouillard) Date: Thu, 06 Oct 2005 16:58:24 -0400 Subject: Verion 2.0b4 how does cgi's nagios_check_command work? Message-ID: <200510062058.j96KwO55019295@mx1.cs.umb.edu> Hi all: I have groveled through the cgi files in nagios 2.0b4 and I can't find any instance of the nagios_check_command actually being executed. Am I just missing it somewhere? cgiutils.c load the variable from the config file extinfo.c checks to see if the variable is empty and produces a nice little warning if it is. but nothing seems to actually execute the command. The reason I ask is that nagios was down and the cgi's all happily reported that it was up. Could this be because the host and service status files were available since the machine crashed? There is a variable nagios_process_state that seems to be set unconditionally to STATE_OK that would be set to the output state of the check_nagios_command. Grepping through the source tree turns up lots of comparisons, but only the two assignments. Assignments cgiutils.c:int nagios_process_state=STATE_OK; cgiutils.c: nagios_process_state=STATE_OK; Use/declarations cgiutils.c: if(nagios_process_state!=STATE_OK) cmd.c:extern int nagios_process_state; extinfo.c:extern int nagios_process_state; extinfo.c: if(nagios_process_state==STATE_OK){ extinfo.c: if(nagios_process_state==STATE_OK){ extinfo.c: else if(nagios_process_state==STATE_WARNING){ extinfo.c: else if(nagios_process_state==STATE_CRITICAL){ extinfo.c: if(nagios_process_state==STATE_OK){ extinfo.c: if(nagios_process_state==STATE_OK){ extinfo.c: if(nagios_process_state==STATE_OK){ extinfo.c: if(nagios_process_state==STATE_OK){ statuswml.c:extern int nagios_process_state; statuswml.c: if(nagios_process_state==STATE_OK) tac.c:extern int nagios_process_state; Also the default nagios_check_command define in the sample cgi.cfg looks for the file: status.dat but that has dumped status info, and the check_nagios command (from the 1.4 plugins) can't understand it. It looks like it should be looking for the log file: nagios.log. When run using the nagios.log file the expected result is returned. So thoughts, quips, comments or questions? -- rouilj John Rouillard =========================================================================== My employers don't acknowledge my existence much less my opinions. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From joeregular at gov.nl.ca Fri Oct 7 01:06:50 2005 From: joeregular at gov.nl.ca (Joe Regular) Date: Thu, 06 Oct 2005 20:36:50 -0230 Subject: Too Many Services! Message-ID: I'm trying to limit the number of hosts and services my HelpDesk can see when they access our Nagios website. So far I've been able to add the HelpDesk account using the htpasswd2 command and making them a member of the contactgroups.cfg. However, now when the HelpDesk logs in the see limited host, which is a good thing, but they can still see all services. How do I limit the number of services that my HelpDesk can see? Joe ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From b.massa at scharr.de Fri Oct 7 06:23:39 2005 From: b.massa at scharr.de (Bernd Massa) Date: Fri, 07 Oct 2005 06:23:39 +0200 Subject: how to make a scheduled downtime persistent Message-ID: <4345F84B.1040808@scharr.de> Hi there, i would like to schedule downtime persistent and not only for one day. Some hosts are not reachable every day at the same time or for the same duration; while full backup for example. Whatever I do, the scheduled downtime is always non-persistent, so I have to schedule everyday again. Any ideas? Yours, Bernd -- ___________________________________________________________________________________________________________________ Bernd Massa IT Systeme und Netze SYNCHRON Gesellschaft f?r Betriebswirtschaftliche Beratung und Informationssysteme mbH Liebknechtstr. 50 70565 Stuttgart ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From amontibello at gmail.com Fri Oct 7 07:44:11 2005 From: amontibello at gmail.com (Anthony Montibello) Date: Fri, 7 Oct 2005 01:44:11 -0400 Subject: WMI Query In-Reply-To: References: Message-ID: Have you tested the command from the command line? before implementing a WMI CHeck in NC_Net it should be tested via the command line, to make sure you get the desired results. Read the help Files. ./check_nt --help=WMICHECK -should be run from command line it returns mutiple lines ./check_nt --help=WMICOUNTER -can use -c and -w against query ./check_nt --help=WMICAT - should be used in nagios instead of WMICHECK and the Readme from NC_Net This command will not work.. the syntax is wrong and it does not define a Query command_line $USER1$check_nt -H $HOSTADDRESS$ -v WMICHECK -l "cimv2/win32_logicaldrive" It should be win32_logicaldisk and it needs to have a select part of the statement like * thus the following should be used instead (all the following are parsed to the same query) -l "cimv2^*^win32_logicaldisk" -l "cimv2^select * from win32_logicaldisk" -l "cimv2&*&win32_logicaldisk" -l "cimv2&select * from win32_logicaldisk" as for the command: command_line /usr/local/nagios/libexec/check_nt -H $HOSTADDRESS$ -v WMICHECK -l "cimv2^select name,freespace,deviceid,status from win32_DiskDrive" win32_DiskDrive does not have Freespace remove it and it should work: -l "cimv2^select name,deviceid,status from win32_diskdrive" As for the command: command_line $USER$/check_nt -H $HOSTADDRESS$ -v WMICHECK -l "cimv2^select DeviceID,Status from win32_DiskDrive" THis ran OK on my system? steps to resolve the issue: Make sure the Check_Nt works from the command line. Make sure this Check_nt is in the libexec directory of Nagios. Check the permissions of check_nt to make sure user Nagios can run it. For the most part Check_nt should always return a 0,1,2,3 so it may be a permission or missing file issue. Also WMICHECK is really meant to be run from the command line because it returns multiple lines and NAgios only resolves the first line of the output. WMICAT should be used instead of WMICHECK in nagios. A example of checking all drives freespace and then returning a critical if any drive has less than 1Gig of space is: (Make sure to exclude all drives that do not implement freespace like floppy and CR-Roms.) -v WMICOUNTER -l "cimv2^select freespace from win32_logicaldisk where name != 'a:' " -c 1000000000 for more information and on WMI query see msdn.com : http://msdn.microsoft.com/library/default.asp?url=/library/en-us/wmisdk/wmi/wql_sql_for_wmi.asp NC_Net has been tested for use mostly with simple select statement, thus all queries should be checked using the command line before implementing in nagios. IF WMICAT and WMICOUNTER do not meed your nagios needs, Then WMICHECK can be used but should be invoked form a wrapper script that will parse the data into a more appropriate format for Nagios. I hope this helps clarifying some of the usage of WMI with NC_Net TOny amontibello at shatterit.com On 10/5/05, Gaston_Perez at itginc.com wrote: > > Hi All I am pretty new to Linux / Nagios and I am running Nagios 1.2, on > a Redhat 9 server. I have Nagios plugins 1.42 installed on my server and > on my windows 2003 servers I have nc_net 2.28. I am trying to run wmi > queries to check the physical hard drive to report back to me if one fails. > Here is what I have in my checkcommands > > # 'WMI Harddrive check' command definition > > define command{ > > command_name check_nt_wmihdd > > command_line $USER1$check_nt -H $HOSTADDRESS$ -v WMICHECK -l > "cimv2/win32_logicaldrive" > > } > > I have also tried this > > command_line /usr/local/nagios/libexec/check_nt -H $HOSTADDRESS$ -v > WMICHECK -l "cimv2^select name,freespace,deviceid,status from > win32_DiskDrive" > > } > > And this > > command_line $USER$/check_nt -H $HOSTADDRESS$ -v WMICHECK -l "cimv2^select > DeviceID,Status from win32_DiskDrive" > > } > > Here is my services command > > define service{ > > use check-nt-service > > register 1 > > host_name server1 > > service_description WMI HDD > > check_command check_nt_wmihdd > > max_check_attempts 3 > > normal_check_interval 5 > > retry_check_interval 3 > > check_period 24x7 > > notification_interval 120 > > notification_period workhours > > notification_options w,c,r > > contact_groups prodsupp > > } > > Response I have seen from the server is > > (Return code of 127 is out of bounds - plugin may be missing) > > Nagios Log displays this > > [1128541333] Warning: Return code of 127 for check of service 'WMI HDD' on > host 'rad-db01' was out of bounds. Make sure the plugin you're trying to run > actually exists. > > When I ran the first check command statement above this is the output to > the Nagios log > > rad-db01;WMI HDD;CRITICAL;HARD;3;(Return code of 127 is out of bounds - > plugin may be missing) > > Now I checked my nc_net plugin and it is the version (2.28) that can read > wmi checks. I also verified that the Nagios plugins I am using (1.42)can > understand wmi checks. Any help at all would be greatly appreciated, Thanks > in Advance > > -+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+- > This message is for the named person's use only. This communication is for > informational purposes only and has been obtained from sources believed to > be reliable, but it is not necessarily complete and its accuracy cannot be > guaranteed. It is not intended as an offer or solicitation for the purchase > or sale of any financial instrument or as an official confirmation of any > transaction. Moreover, this material should not be construed to contain any > recommendation regarding, or opinion concerning, any security. It may > contain confidential, proprietary or legally privileged information. No > confidentiality or privilege is waived or lost by any mistransmission. If > you receive this message in error, please immediately delete it and all > copies of it from your system, destroy any hard copies of it and notify the > sender. You must not, directly or indirectly, use, disclose, distribute, > print, or copy any part of this message if you are not the intended > recipient. Any views expressed in this message are those of the individual > sender, except where the message states otherwise and the sender is > authorized to state them to be the views of any such entity. > > ITG Inc. and/or its affiliates reserves the right to monitor and archive > all electronic communications through its network. > > ITG Inc. Member NASD, SIPC > -+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+-+- > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From amontibello at gmail.com Fri Oct 7 08:04:42 2005 From: amontibello at gmail.com (Anthony Montibello) Date: Fri, 7 Oct 2005 02:04:42 -0400 Subject: how to make a scheduled downtime persistent In-Reply-To: <4345F84B.1040808@scharr.de> References: <4345F84B.1040808@scharr.de> Message-ID: there are several options for this objective, 1) setup Time Periods or notification time periods (Read the Nagios Manual for more info) 2) use one of the downtime scripts that can be found on nagios exchange. http://www.nagiosexchange.org/Downtimes.38.0.html I am usign a slighly modified version of one of the scripts that is run from cron. instead of running the script from cron for each downtime, I run a wrapper script that contains all my downtime for that day and feeds it into the script. hope this helps Tony amontibello at shatterit.com On 10/7/05, Bernd Massa wrote: > > Hi there, > > i would like to schedule downtime persistent and not only for one day. > Some hosts are not reachable every day at the same time or for the same > duration; while full backup for example. Whatever I do, the scheduled > downtime is always non-persistent, so I have to schedule everyday again. > Any ideas? > > Yours, > > Bernd > > -- > > ___________________________________________________________________________________________________________________ > > Bernd Massa > IT Systeme und Netze > SYNCHRON Gesellschaft f?r Betriebswirtschaftliche Beratung und > Informationssysteme mbH > Liebknechtstr. 50 > > 70565 Stuttgart > > > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, discussions, > and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: From Ralph.Grothe at itdz-berlin.de Fri Oct 7 09:43:47 2005 From: Ralph.Grothe at itdz-berlin.de (Ralph.Grothe at itdz-berlin.de) Date: Fri, 7 Oct 2005 09:43:47 +0200 Subject: Inexplicable service escalation behaviour Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net > [mailto:nagios-users-admin at lists.sourceforge.net]On Behalf Of Andreas > Ericsson > Sent: Thursday, October 06, 2005 5:29 PM > To: nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] Inexplicable service escalation behaviour > > > Ralph.Grothe at itdz-berlin.de wrote: > > Dear List Subscribers, > > > > although I have already asked how to properly set up an > > escalation scheme > > a couple of weeks ago here (sorry, only find time to continue > > Nagios fumbling > > every now and then at work, and at home it would be useless due > > to lacking testing ground > > (ok, I could emulate a host and network farm by vmware or xen > > etc., but that's too much fuss)) > > I desperately need further assistance. > > > > I am not getting on with this. > > > > I swear that I've read carefully the sections on escalations in > > the Nagios docs at least thrice > > by now. > > And the presented examples in the docs sound very convincing to > > me > > (though a bit far-fetched) so that I very well can gather, I > > suppose, how it should work - in *theory*. > > > > My objective seems very trivial to me. > > > > I just want Nagios to send a *single* notification by using my > > "file-service-sc-ticket" > > (misc)command definition to our trouble ticketing system, > > but at the same time keep continuing sending out repetetive > > notifications to the > > various admin recipients at the common notification intervall > > (at least the latter is working). > > > > Create a small script that checks the value of the macro > $NOTIFICATIONNUMBER$ (or some such, check the docs for exact > name). If > it's any higher than 1, don't send to the ticketing system. > This is by > far the easiest way of doing reverse escalations. Andreas, many thanks for pointing my nose at the Nagios macros, which indeed lend themselves to being used in scripts (well, I guess that's there main purpose). I have to admit that I totally forgot for a while about them. I will revisit the macro tables in the docs and make a wee wrapper script that evaluates appropriate candidates like $NOTIFICATIONNUMBER$ to decide whether to file a ticket, as you suggested. Btw, will the macros be exported to external commands' environments per default, so that for instance in a Perl script I could refer to them by e.g. $ENV{NOTIFICATIONNUMBERS} ? (never mind, more of a rhetorical question which I can easily answer myself by trial) > > OTOH, if you're technical staff aren't doing anythingh about network > problems, you have other trouble that (to me) seem far more urgent. > > -- > Andreas Ericsson andreas.ericsson at op5.se > OP5 AB www.op5.se > Tel: +46 8-230225 Fax: +46 8-230231 > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, > discussions, > and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS > when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Riccardo.GENTILUCCI at skytv.it Fri Oct 7 10:36:44 2005 From: Riccardo.GENTILUCCI at skytv.it (GENTILUCCI Riccardo) Date: Fri, 7 Oct 2005 10:36:44 +0200 Subject: R: R: Problem installing nagios-plugins (1.4.1.1): Perl (Net::SNMP) not found but installed! Message-ID: I've tried to install the perl-net-snmp package..now this request another perl module perl(Crypt::DES).....and the story goes on!! Thanks again for any further ideas Riccardo Gentilucci IT SKY - Direzione Network & Security Management SKY Italia Via Piranesi, 46 - 20137 Milano Tel: +390270027426 Fax: +390270022248 Mob: +393355342826 Emailto: riccardo.gentilucci at skytv.it Website: http://www.skytv.it ________________________________ Da: ankush grover [mailto:ankushgrover1711 at gmail.com] Inviato: gioved? 6 ottobre 2005 13.44 A: nagios-users at lists.sourceforge.net; GENTILUCCI Riccardo Oggetto: Re: R: [Nagios-users] Problem installing nagios-plugins (1.4.1.1): Perl (Net::SNMP) not found but installed! On 10/6/05, Ralf Gross wrote: > Hi Andreas, the package is present and correctly installed and visible > (net-snmp-utils-5.2.1.2-FC3.1) I think you need the fedora net-snmp-perl package. Ralf yes , It is perl-Net-SNMP package.It is available at http://dag.wieers.com/packages/perl-Net-SNMP/ Regards Ankush Grover -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image001.jpg Type: image/jpeg Size: 1402 bytes Desc: image001.jpg URL: From Riccardo.GENTILUCCI at skytv.it Fri Oct 7 10:39:45 2005 From: Riccardo.GENTILUCCI at skytv.it (GENTILUCCI Riccardo) Date: Fri, 7 Oct 2005 10:39:45 +0200 Subject: R: R: Problem installing nagios-plugins (1.4.1.1): Perl (Net::SNMP) not found but installed! Message-ID: I've forgotten to say you that i've installed the requested perl module...but the perl-net-snmp package doesn't find this and so...I can't install the nagios plugins Regards Riccardo Gentilucci IT SKY - Direzione Network & Security Management SKY Italia Via Piranesi, 46 - 20137 Milano Tel: +390270027426 Fax: +390270022248 Mob: +393355342826 Emailto: riccardo.gentilucci at skytv.it Website: http://www.skytv.it ________________________________ Da: GENTILUCCI Riccardo Inviato: venerd? 7 ottobre 2005 10.37 A: 'ankush grover'; nagios-users at lists.sourceforge.net Oggetto: R: R: [Nagios-users] Problem installing nagios-plugins (1.4.1.1): Perl (Net::SNMP) not found but installed! I've tried to install the perl-net-snmp package..now this request another perl module perl(Crypt::DES).....and the story goes on!! Thanks again for any further ideas Riccardo Gentilucci IT SKY - Direzione Network & Security Management SKY Italia Via Piranesi, 46 - 20137 Milano Tel: +390270027426 Fax: +390270022248 Mob: +393355342826 Emailto: riccardo.gentilucci at skytv.it Website: http://www.skytv.it ________________________________ Da: ankush grover [mailto:ankushgrover1711 at gmail.com] Inviato: gioved? 6 ottobre 2005 13.44 A: nagios-users at lists.sourceforge.net; GENTILUCCI Riccardo Oggetto: Re: R: [Nagios-users] Problem installing nagios-plugins (1.4.1.1): Perl (Net::SNMP) not found but installed! On 10/6/05, Ralf Gross wrote: > Hi Andreas, the package is present and correctly installed and visible > (net-snmp-utils-5.2.1.2-FC3.1) I think you need the fedora net-snmp-perl package. Ralf yes , It is perl-Net-SNMP package.It is available at http://dag.wieers.com/packages/perl-Net-SNMP/ Regards Ankush Grover -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image002.jpg Type: image/jpeg Size: 1402 bytes Desc: image002.jpg URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image003.jpg Type: image/jpeg Size: 1402 bytes Desc: image003.jpg URL: From Philipp.Sand at sycor.de Fri Oct 7 10:57:21 2005 From: Philipp.Sand at sycor.de (Sand Philipp) Date: Fri, 7 Oct 2005 10:57:21 +0200 Subject: AW: check_freshness not working... Message-ID: <6943DE9E9934CD42BBE4A20E0E28E10E01EA0242@SYGOMAIL.GOE.AD.SYCOR.DE> It seems as if I could solve the Problem by myself... the freshness threshold seems to be configured in minutes and not in seconds... > -----Urspr?ngliche Nachricht----- > Von: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] Im Auftrag von Sand Philipp > Gesendet: Mittwoch, 5. Oktober 2005 10:41 > An: nagios-users at lists.sourceforge.net > Betreff: [Nagios-users] check_freshness not working... > > Hi, > > I'm trying to check the freshness of my passive checks in Nagios. If my > distributed Nagios has any Problems delivering the checks to my central > Nagios, I want to know :) > I'm using Nagios 2.0b3 > > I configured the following: > In Nagios.cfg: > check_service_freshness=1 > service_freshness_check_interval=60 > > Passive Check: > check_command check_warning > > active_checks_enabled 0 > passive_checks_enabled 1 > check_freshness 1 > freshness_threshold 500 > check_period 24x7 > retry_check_interval 2 > max_check_attempts 3 > > the checkcommand: > define command{ > command_name check_warning > command_line $USER1$/check_dummy 3 "$HOSTNAME$ check not up to > date" > } > > Any suggestions? > > Thanks, > Philipp > > > > > _____________________________ > > Philipp Sand > OC-CC-TEC-SYS > > SYCOR GmbH > Heinrich-von-Stephan-Stra?e 1-5 > D - 37073 G?ttingen > > Telefon +49 (0) 551 - 490 - 0 > Telefax +49 (0) 551 - 490 - 232468 > > philipp.sand at sycor.de > www.sycor.de > ------------------------------------------------ > > > ************************************************ > > "Ferner Osten - neue Maerkte - vertraute Technik" > > unter diesem Motto praesentiert sich sycor auf der > > FAKUMA Messe vom 18.-22. Oktober in Friedrichshafen > > sycor - IT-Kompetenz fuer die Kunststoffbranche! > > Besuchen Sie uns! Zwischen Halle A5 und A6 - Stand 6511 > > Weitere Infos unter www.sycor.de > > ************************************************ > > Diese E-Mail ist vertraulich und kann darueber hinaus persoenliche > Informationen beinhalten. Wenn Sie nicht der bestimmungsgemaesse > Empfaenger sind, loeschen Sie bitte die E-Mail und deren Anhaenge sofort > und benachrichtigen Sie uns darueber. Die Firma sycor willigt in keine > Vertraege oder vertragliche Verpflichtungen ein oder uebermittelt > rechtsverbindliche Angebote, die in Form von E-Mail versandt werden, > sofern dies nicht ausdruecklich in schriftlicher Form zwischen den > Parteien vereinbart wurde. > > This e-mail is confidential and may contain personal and/or privileged > information. If you are not the intended recipient please delete this e- > mail and all attachments immediately and inform us. The company sycor does > not agree with contracts or contract obligations sent by e-mail, neither > do we transmit legally binding offers by e-mail, unless this is not > expressly agreed upon between the parties and documented in written form. > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, discussions, > and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ************************************************ "Ferner Osten - neue Maerkte - vertraute Technik" unter diesem Motto praesentiert sich sycor auf der FAKUMA Messe vom 18.-22. Oktober in Friedrichshafen sycor - IT-Kompetenz fuer die Kunststoffbranche! Besuchen Sie uns! Zwischen Halle A5 und A6 - Stand 6511 Weitere Infos unter www.sycor.de ************************************************ Diese E-Mail ist vertraulich und kann darueber hinaus persoenliche Informationen beinhalten. Wenn Sie nicht der bestimmungsgemaesse Empfaenger sind, loeschen Sie bitte die E-Mail und deren Anhaenge sofort und benachrichtigen Sie uns darueber. Die Firma sycor willigt in keine Vertraege oder vertragliche Verpflichtungen ein oder uebermittelt rechtsverbindliche Angebote, die in Form von E-Mail versandt werden, sofern dies nicht ausdruecklich in schriftlicher Form zwischen den Parteien vereinbart wurde. This e-mail is confidential and may contain personal and/or privileged information. If you are not the intended recipient please delete this e-mail and all attachments immediately and inform us. The company sycor does not agree with contracts or contract obligations sent by e-mail, neither do we transmit legally binding offers by e-mail, unless this is not expressly agreed upon between the parties and documented in written form. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Ralph.Grothe at itdz-berlin.de Fri Oct 7 11:05:42 2005 From: Ralph.Grothe at itdz-berlin.de (Ralph.Grothe at itdz-berlin.de) Date: Fri, 7 Oct 2005 11:05:42 +0200 Subject: R: Problem installing nagios-plugins (1.4.1.1): Perl (Net::SNMP) not found but installed! Message-ID: > I've tried to install the perl-net-snmp package..now this request another perl module perl(Crypt::DES).....and the story goes on!! It's easiest to resolve common CPAN dependency issues by using the CPAN module for module installation. The first time you invoke the CPAN shell you will be asked a couple of questions like your preferred CPAN mirrors, any proxy settings (if applicable), caching, and follow-up options. Once having gone through this you would simply issue perl -MCPAN -e "install Net::SNMP" and perl would take care to resolve all prerequisites (as far as CPAN modules are concerned). Some modules also require prior installation of libraries (e.g. XML::Parser requires libexpat), which must be carried out separately usually. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Riccardo.GENTILUCCI at skytv.it Fri Oct 7 11:16:16 2005 From: Riccardo.GENTILUCCI at skytv.it (GENTILUCCI Riccardo) Date: Fri, 7 Oct 2005 11:16:16 +0200 Subject: R: R: Problem installing nagios-plugins (1.4.1.1): Perl (Net::SNMP) not found but installed! Message-ID: Dear Ralph, I've made what you've indicated (?'ve installed the package fron CPAN and so its dependencies) but without positive solution... Another guy say me to install the per-net-snmp package and this request me what I've written (the Crypt::DES module) I've tried to startup another system with same fedora fc3 and update it till the last available packages...same story in order to install the nagios plugins... I don't understand if there's some problems in my fedora installations or other (I've install it in two different machines, different hw etc...): this is a big frustration. By the way, thanks for your kind answer. Riccardo Gentilucci IT SKY - Direzione Network & Security Management SKY Italia Via Piranesi, 46 - 20137 Milano Tel: +390270027426 Fax: +390270022248 Mob: +393355342826 Emailto: riccardo.gentilucci at skytv.it Website: http://www.skytv.it -----Messaggio originale----- Da: Ralph.Grothe at itdz-berlin.de [mailto:Ralph.Grothe at itdz-berlin.de] Inviato: venerd? 7 ottobre 2005 11.06 A: GENTILUCCI Riccardo; ankushgrover1711 at gmail.com; nagios-users at lists.sourceforge.net Oggetto: RE: R: [Nagios-users] Problem installing nagios-plugins (1.4.1.1): Perl (Net::SNMP) not found but installed! > I've tried to install the perl-net-snmp package..now this request another perl module perl(Crypt::DES).....and the story goes on!! It's easiest to resolve common CPAN dependency issues by using the CPAN module for module installation. The first time you invoke the CPAN shell you will be asked a couple of questions like your preferred CPAN mirrors, any proxy settings (if applicable), caching, and follow-up options. Once having gone through this you would simply issue perl -MCPAN -e "install Net::SNMP" and perl would take care to resolve all prerequisites (as far as CPAN modules are concerned). Some modules also require prior installation of libraries (e.g. XML::Parser requires libexpat), which must be carried out separately usually. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From smalhotra at dataarmor.net Fri Oct 7 11:20:54 2005 From: smalhotra at dataarmor.net (Sumit Malhotra) Date: Fri, 7 Oct 2005 14:50:54 +0530 Subject: R: Problem installing nagios-plugins (1.4.1.1): Perl (Net::SNMP) not found but installed! In-Reply-To: References: Message-ID: <000001c5cb20$757aa0d0$67c5a8c0@sumit> You have no other option but to live with dependencies. And believe me it's around a set of 5-6 RPMS's you need. You would be better off in resolving which all dependencies from apt-Get / Yum repository of Diagweers. Sumit Malhotra Data Armor -----Original Message----- From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of GENTILUCCI Riccardo Sent: Friday, October 07, 2005 2:46 PM To: Ralph.Grothe at itdz-berlin.de; ankushgrover1711 at gmail.com; nagios-users at lists.sourceforge.net Subject: R: R: [Nagios-users] Problem installing nagios-plugins (1.4.1.1): Perl (Net::SNMP) not found but installed! Dear Ralph, I've made what you've indicated (?'ve installed the package fron CPAN and so its dependencies) but without positive solution... Another guy say me to install the per-net-snmp package and this request me what I've written (the Crypt::DES module) I've tried to startup another system with same fedora fc3 and update it till the last available packages...same story in order to install the nagios plugins... I don't understand if there's some problems in my fedora installations or other (I've install it in two different machines, different hw etc...): this is a big frustration. By the way, thanks for your kind answer. Riccardo Gentilucci IT SKY - Direzione Network & Security Management SKY Italia Via Piranesi, 46 - 20137 Milano Tel: +390270027426 Fax: +390270022248 Mob: +393355342826 Emailto: riccardo.gentilucci at skytv.it Website: http://www.skytv.it -----Messaggio originale----- Da: Ralph.Grothe at itdz-berlin.de [mailto:Ralph.Grothe at itdz-berlin.de] Inviato: venerd? 7 ottobre 2005 11.06 A: GENTILUCCI Riccardo; ankushgrover1711 at gmail.com; nagios-users at lists.sourceforge.net Oggetto: RE: R: [Nagios-users] Problem installing nagios-plugins (1.4.1.1): Perl (Net::SNMP) not found but installed! > I've tried to install the perl-net-snmp package..now this request another perl module perl(Crypt::DES).....and the story goes on!! It's easiest to resolve common CPAN dependency issues by using the CPAN module for module installation. The first time you invoke the CPAN shell you will be asked a couple of questions like your preferred CPAN mirrors, any proxy settings (if applicable), caching, and follow-up options. Once having gone through this you would simply issue perl -MCPAN -e "install Net::SNMP" and perl would take care to resolve all prerequisites (as far as CPAN modules are concerned). Some modules also require prior installation of libraries (e.g. XML::Parser requires libexpat), which must be carried out separately usually. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From robmossrm at aol.com Fri Oct 7 11:34:50 2005 From: robmossrm at aol.com (Rob Moss) Date: Fri, 07 Oct 2005 10:34:50 +0100 Subject: R: Problem installing nagios-plugins (1.4.1.1): Perl (Net::SNMP) not found but installed! In-Reply-To: <000001c5cb20$757aa0d0$67c5a8c0@sumit> References: <000001c5cb20$757aa0d0$67c5a8c0@sumit> Message-ID: <4346413A.6090503@aol.com> Sumit Malhotra wrote: >You have no other option but to live with dependencies. And believe me >it's around a set of 5-6 RPMS's you need. > >You would be better off in resolving which all dependencies from apt-Get >/ Yum repository of Diagweers. > > Guys, 1. Perl has a module installation interface, called 'CPAN' (refer to my earlier post: perl -MCPAN -e shell) 2. The Perl module Crypt::DES is a standard package in the perl CPAN bundle, which you clearly haven't installed 3. Check for further dependencies by reading the perl script (it's text) and looking at the first 30 or so lines where it has 'use' statements to load in additional modules. You can install the CPAN bundle with all the standard packages with the following simple command (as root): perl -MCPAN -e 'install Bundle::CPAN' Learning how to use the tools will make your life easier.. rob. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Ralph.Grothe at itdz-berlin.de Fri Oct 7 11:48:04 2005 From: Ralph.Grothe at itdz-berlin.de (Ralph.Grothe at itdz-berlin.de) Date: Fri, 7 Oct 2005 11:48:04 +0200 Subject: R: Problem installing nagios-plugins (1.4.1.1): Perl (Net::SNMP) not found but installed! Message-ID: Riccardo, sorry, I wan't following your thread. But I guess I know what you mean. The problem seems to be that you followed two different installation routes that don't seem to care much about the other's standards. I suspect that the perl-net-snmp rpm package doesn't really care about your Perl installation but only consults its rpm database to see if any of the rpm prerequisite packages (those that an "rpm -qpR /some/path/perl-net-snmp.some_release.rpm" would list) is already installed, and would reject installation (unless given the --nodeps) altogether. But I cannot say for sure because I install all the Perl modules I would ever require the Perl way (i.e. either via CPAN.pm or manually) instead of using any rpms. This way however, would probably also force you to build the Nagios or any Nagios Exchange software from the sources as well. But it's more likely to work that way than depending on any pre-built packages for any Linux packaging system. You could, if you required a consistent rpm database that also comprises the Nagios SNMP plug-in stuff, try to get so-called srpm package which really contain the source tarball together with a spec file. Install those like usual rpm files and change to /usr/src/redhat (I hope the path is correct, I only have few Linux boxes to monitor and thus am seldomly exposed to RH) I suggest you read "man rpmbuild" for how to build an RH rpm with an adapted spec file from there. Btw, do you really require Crypt::DES for your SNMP management in your shielded LAN? I may be wrong, but I thought encryption was only an option in SNMPv3? > -----Original Message----- > From: GENTILUCCI Riccardo [mailto:Riccardo.GENTILUCCI at skytv.it] > Sent: Friday, October 07, 2005 11:16 AM > To: Ralph.Grothe at itdz-berlin.de; ankushgrover1711 at gmail.com; > nagios-users at lists.sourceforge.net > Subject: R: R: [Nagios-users] Problem installing nagios-plugins > (1.4.1.1): Perl (Net::SNMP) not found but installed! > > > Dear Ralph, I've made what you've indicated (?'ve installed > the package fron CPAN and so its dependencies) but without > positive solution... > > Another guy say me to install the per-net-snmp package and > this request me what I've written (the Crypt::DES module) > > I've tried to startup another system with same fedora fc3 and > update it till the last available packages...same story in > order to install the nagios plugins... > > I don't understand if there's some problems in my fedora > installations or other (I've install it in two different > machines, different hw etc...): this is a big frustration. > > By the way, thanks for your kind answer. > > > Riccardo Gentilucci > IT SKY - Direzione Network & Security Management > > SKY Italia > Via Piranesi, 46 - 20137 Milano > Tel: +390270027426 > Fax: +390270022248 > Mob: +393355342826 > > Emailto: riccardo.gentilucci at skytv.it > > Website: http://www.skytv.it > -----Messaggio originale----- > Da: Ralph.Grothe at itdz-berlin.de [mailto:Ralph.Grothe at itdz-berlin.de] > Inviato: venerd? 7 ottobre 2005 11.06 > A: GENTILUCCI Riccardo; ankushgrover1711 at gmail.com; > nagios-users at lists.sourceforge.net > Oggetto: RE: R: [Nagios-users] Problem installing > nagios-plugins (1.4.1.1): Perl (Net::SNMP) not found but installed! > > > I've tried to install the perl-net-snmp package..now this > request another perl module perl(Crypt::DES).....and the story > goes on!! > > It's easiest to resolve common CPAN dependency issues by using > the CPAN module for module installation. > > The first time you invoke the CPAN shell you will be asked a > couple of questions like your preferred CPAN mirrors, > any proxy settings (if applicable), caching, and follow-up > options. > > Once having gone through this you would simply issue > > perl -MCPAN -e "install Net::SNMP" > > and perl would take care to resolve all prerequisites (as far as > CPAN modules are concerned). > > Some modules also require prior installation of libraries (e.g. > XML::Parser requires libexpat), > which must be carried out separately usually. > ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gabriele_d_g at yahoo.it Fri Oct 7 12:04:32 2005 From: gabriele_d_g at yahoo.it (Gabriele Di Giambelardini) Date: Fri, 7 Oct 2005 12:04:32 +0200 (CEST) Subject: chesk_snmp for disck problem notification Message-ID: <20051007100433.14606.qmail@web26510.mail.ukl.yahoo.com> Hi, I have a check_snmp for the disk space on some server, but when the warning or critical limit is exceeded, nagios send me a meil like, HOST DOWN, end in the body i have that message "Info: check_snmp: Invalid warning threshold: %s", Why not send me a WARNING or CRITICAL message?? --------------------------------- Yahoo! Mail: gratis 1GB per i messaggi, antispam, antivirus, POP3 -------------- next part -------------- An HTML attachment was scrubbed... URL: From Ralph.Grothe at itdz-berlin.de Fri Oct 7 12:46:33 2005 From: Ralph.Grothe at itdz-berlin.de (Ralph.Grothe at itdz-berlin.de) Date: Fri, 7 Oct 2005 12:46:33 +0200 Subject: R: Problem installing nagios-plugins (1.4.1.1): Perl (Net::SNMP) not found but installed! Message-ID: Hi Rob, I've also suggested to him to use perl -MCPAN -e "install Net::SNMP" as this is in accorance with a sound Perl installation. The problem rather seems to be the rpm package builders' ignorance of Perl standards. I experienced a similar nuissance when I was forced to install a Perl module dependable software from an rpm on a SuSE server (a customer of ours has forced upon us, we'd never opt for SuSE due to our bad yast experience) As accustomed I cared for a consistent Perl installation where I exactly new that all required Perl modules definitely were installed. Despite, that stupid rpm package claimed exactly those Perl modules to be missing. The only way out was to install the rpm stuff with the --nodeps flag and disregard to satisfy the rpm database. > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net > [mailto:nagios-users-admin at lists.sourceforge.net]On Behalf Of Rob Moss > Sent: Friday, October 07, 2005 11:35 AM > Cc: nagios-users at lists.sourceforge.net > Subject: Re: R: [Nagios-users] Problem installing nagios-plugins > (1.4.1.1): Perl (Net::SNMP) not found but installed! > > > Sumit Malhotra wrote: > > >You have no other option but to live with dependencies. And > believe me > >it's around a set of 5-6 RPMS's you need. > > > >You would be better off in resolving which all dependencies > from apt-Get > >/ Yum repository of Diagweers. > > > > > > Guys, > > 1. Perl has a module installation interface, called 'CPAN' > (refer to my > earlier post: perl -MCPAN -e shell) > 2. The Perl module Crypt::DES is a standard package in the perl CPAN > bundle, which you clearly haven't installed > 3. Check for further dependencies by reading the perl script > (it's text) > and looking at the first 30 or so lines where it has 'use' > statements to > load in additional modules. > > You can install the CPAN bundle with all the standard > packages with the > following simple command (as root): > > perl -MCPAN -e 'install Bundle::CPAN' > > Learning how to use the tools will make your life easier.. > > rob. > > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, > discussions, > and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS > when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From chris at aidworld.org Fri Oct 7 12:53:44 2005 From: chris at aidworld.org (Chris Wilson) Date: Fri, 07 Oct 2005 11:53:44 +0100 Subject: Possible bug in NSCA In-Reply-To: <4333A9F5.8050506@op5.se> References: <1127432185.32051.12.camel@localhost> <4333A9F5.8050506@op5.se> Message-ID: <1128682424.28692.57.camel@localhost> Hi Andreas, On Fri, 2005-09-23 at 08:08, Andreas Ericsson wrote: > After > new_sd = accept(sock, 0, 0) > you should add > if(new_sd == -1 && errno == EBADF) { > sock = setup_socket(); > } > > Where setup_socket() is an imaginary function that calls socket(), > possibly setsockopt(), bind() and listen(), in that order. > > A cleaner solution is to have nsca exit if it can't obtain the socket, > since there's no real reason to think it should be able to obtain one later. This just happened to me again, and I want to fix it. I would like to propose the attached patch to NSCA. It treats ECHILD errors the same as EWOULDBLOCK and EINTR, i.e. temporary errors. As far as I can see, ECHILD is not documented in the man pages, and there's no reason to believe that it's a permanent error (i.e. that accept() will fail next time). I haven't found it in the kernel source either, but the code for sys_accept is rather convoluted on Linux. It also ensures that for other errors, NSCA exits cleanly, in a similar manner to the block below, rather than going into an infinite loop. Comments most welcome, especially anything that might prevent this from being accepted into the NSCA source. I'm installing this on our production server, I'll let you all know if anything breaks. Cheers, Chris. -- (aidworld) chris wilson | chief engineer (chris at aidworld.org) -------------- next part -------------- A non-text attachment was scrubbed... Name: nsca-noloop.patch Type: text/x-patch Size: 793 bytes Desc: not available URL: From ae at op5.se Fri Oct 7 13:23:37 2005 From: ae at op5.se (Andreas Ericsson) Date: Fri, 07 Oct 2005 13:23:37 +0200 Subject: Verion 2.0b4 how does cgi's nagios_check_command work? In-Reply-To: <200510062058.j96KwO55019295@mx1.cs.umb.edu> References: <200510062058.j96KwO55019295@mx1.cs.umb.edu> Message-ID: <43465AB9.6020304@op5.se> John P. Rouillard wrote: > Hi all: > > I have groveled through the cgi files in nagios 2.0b4 and I can't find > any instance of the nagios_check_command actually being executed. Am I > just missing it somewhere? > The nagios_check_command has been incorporated into the cgi's. That's why the status.dat file has the nagios entry on top. > cgiutils.c load the variable from the config file > extinfo.c checks to see if the variable is empty and produces a nice > little warning if it is. > > but nothing seems to actually execute the command. The reason I ask is > that nagios was down and the cgi's all happily reported that it was > up. Could this be because the host and service status files were > available since the machine crashed? > Yes, that's almost certainly it. There is no really good way of detecting that nagios is actually running unless you're logged in as root. Noone (sane) runs their webserver as root. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Ralph.Grothe at itdz-berlin.de Fri Oct 7 14:03:20 2005 From: Ralph.Grothe at itdz-berlin.de (Ralph.Grothe at itdz-berlin.de) Date: Fri, 7 Oct 2005 14:03:20 +0200 Subject: Probably a trivial permission issue? Message-ID: Hi, while logged in in my Nagios' server web realm with nagiosadmin credentials, who is supposed to have full permission, $ grep nagiosadmin /opt/sw/nagios/etc/cgi.cfg|grep -v ^# authorized_for_all_services=nagiosadmin authorized_for_all_hosts=nagiosadmin authorized_for_all_service_commands=nagiosadmin authorized_for_all_host_commands=nagiosadmin and clicking on the "view notifications for this host" link I get the following: Error: Cannot open log file '/opt/sw/nagios/var/log/nagios.log' for reading! How come? The nagios.log is world readable (even by the underprivileged httpd proc) $ ll /opt/sw/nagios/var/log/nagios.log -rw-rw-r-- 1 nagios nagios 144200 Oct 07 13:54 /opt/sw/nagios/var/log/nagios.log Or is the error message above only malformed, and it should read "cannot acquire shared file lock on ..."? Regards Ralph ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rouilj at cs.umb.edu Fri Oct 7 14:33:36 2005 From: rouilj at cs.umb.edu (John P. Rouillard) Date: Fri, 07 Oct 2005 08:33:36 -0400 Subject: Verion 2.0b4 how does cgi's nagios_check_command work? In-Reply-To: Your message of "Fri, 07 Oct 2005 13:23:37 +0200." <43465AB9.6020304@op5.se> References: <43465AB9.6020304@op5.se> Message-ID: <200510071233.j97CXaEV012599@mx1.cs.umb.edu> In message <43465AB9.6020304 at op5.se>, Andreas Ericsson writes: >John P. Rouillard wrote: >> Hi all: >> >> I have groveled through the cgi files in nagios 2.0b4 and I can't find >> any instance of the nagios_check_command actually being executed. Am I >> just missing it somewhere? >> > >The nagios_check_command has been incorporated into the cgi's. That's >why the status.dat file has the nagios entry on top. So the CGI's report nagios is down if and only if the status.date file is missing? That matches what I see when I shut down nagios properly. If I shut down nagios I see the "Error: Could not read host and service status information!" page in the web interface. >> cgiutils.c load the variable from the config file >> extinfo.c checks to see if the variable is empty and produces a nice >> little warning if it is. >> >> but nothing seems to actually execute the command. The reason I ask is >> that nagios was down and the cgi's all happily reported that it was >> up. Could this be because the host and service status files were >> available since the machine crashed? >Yes, that's almost certainly it. There is no really good way of >detecting that nagios is actually running unless you're logged in as >root. Hmm, I am not sure I follow why you need to be logged in as root. Why not stat the status.log file and check to see if its (mtime) timestamp is less than the setting of: status_update_interval*2 if aggregate_status_updates is enabled? One could also allow a setting "freshness_threshold" in cgi.cfg that is the number of seconds/minutes old the status.dat file is allowed to be if aggregate_status_updates isn't set. > Noone (sane) runs their webserver as root. True enough. -- rouilj John Rouillard =========================================================================== My employers don't acknowledge my existence much less my opinions. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From robmossrm at aol.com Fri Oct 7 14:36:23 2005 From: robmossrm at aol.com (Rob Moss) Date: Fri, 07 Oct 2005 13:36:23 +0100 Subject: Probably a trivial permission issue? In-Reply-To: References: Message-ID: <43466BC7.5060200@aol.com> Ralph.Grothe at itdz-berlin.de wrote: >Hi, > >while logged in in my Nagios' server web realm with nagiosadmin >credentials, >who is supposed to have full permission, > >$ grep nagiosadmin /opt/sw/nagios/etc/cgi.cfg|grep -v ^# >authorized_for_all_services=nagiosadmin >authorized_for_all_hosts=nagiosadmin >authorized_for_all_service_commands=nagiosadmin >authorized_for_all_host_commands=nagiosadmin > >and clicking on the "view notifications for this host" link >I get the following: > >Error: Cannot open log file '/opt/sw/nagios/var/log/nagios.log' >for reading! > > >How come? > > Check all directories from / to ensure that the 'nagios' user has R and X permissions, or at least X permissions to traverse the directories. One of the directories is probably blocking nagios from passing through it. rob. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From pla at softflare.com Fri Oct 7 14:43:51 2005 From: pla at softflare.com (Paul L. Allen) Date: Fri, 07 Oct 2005 13:43:51 +0100 Subject: Spam from Groundwork Open Source Solutions In-Reply-To: <21943250.1128623484789.JavaMail.sfdc@na1-app15> References: <21943250.1128623484789.JavaMail.sfdc@na1-app15> Message-ID: <20051007124351.10426.qmail@mullet.softflare.net> I just received this spam from Groundwork Open Source Solutions. To the best of my knowledge nothing I have ever posted has even implied that I might be remotely interested in them, their product or their services. I don't recall responding to this guy on the list about anything either, but I don't make any special effort to remember the names of people I only respond to once or twice. But even if I had answered one of his questions or even asked him a question, that does not entitle him to spam me with an advert for his product. Maybe if I'd asked about an integrated solution like openview he would be justified in sending me a one-liner saying he had such a product and asking if I'd be interested in more info but not a sales pitch. Of course, if I had asked on the list if there was an equivalent of openview then the correct place to respond would have been to the list so that others could see the answer and find it in the archives. The fact that he mailed me personally indicates that he knows damned well that posting a sales pitch to the list would not be appreciated. Sadly, blocking him from the list won't stop him trawling the archives. However, there is nothing to stop the Nagios documentation listing companies like Groundwork that behave unethically and recommending that people refuse to do business with them. Wayne Dahler writes: > Paul, > After a very successful two years in the US market, we are looking to Europe as our next opportunity. I am hoping that you can point me to the right person within your organization for a brief discussion on IT infrastructure-monitoring solutions. > > I have attached several documents that you can share with the person in your company who would be interested in how open source tools might be used to enhance your monitoring environment. > > In summary: > We have integrated and enhanced 15 of the best-of-breed opensource tools together in IT GroundWork including Jetspeed, Nagios, RRD Tool and NTOP. It provides monitoring coverage for networks, servers, applications, databases and network traffic. > > IT GroundWork delivers the same functionality as HP OpenView or CA Unicenter for about 20-30 percent of the cost. Or if you are currently using tools that you like, but find that you have additional needs, IT GroundWork can augment your monitoring environment while maintaining your investment cost effectively. > > We are not looking to replace the tools you are currently using, rather we help companies like yours use open source tools to supplement their tools, cost-effectively enhancing their monitoring environment. > > Some areas that might be interesting to you are: > * An IT Dashboard - high level overview for executives > * Auto discovery tools - for easy of expansion > * Synthetic transactions which document the exact performance of what an end-user/client experiences, breaking down the transaction by segments - internet, network, apps server, database back-end server, etc. to pin-pointing performance and bottlenecks. > * Web portal for all your tools to roll up to - including single sign-on capability etc. > * Highly graphical interface - broken down by the way you do business (geographic, departmental, functional, or any way you wish). > > If this sounds interesting to you or someone on your team, we provide a brief WEBEX presentation which will give you an opportunity to see a demo of the Groundwork package and its capabilities. You can ask questions about how we might work within you environment and how we would integrate with you existing tools. > > Please give me a call and let me know your level of interest. Thanks for your help and your time. > > > > Best Regards, > > Wayne Dahler > Account Development Mgr. > GroundWork Open Source Solutions > (510) 899-7735-direct > (510) 428.0980-fax > www.itgroundwork.com > wdahler at itgroundwork.com > > -- Paul Allen Softflare Support ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Ralph.Grothe at itdz-berlin.de Fri Oct 7 14:43:10 2005 From: Ralph.Grothe at itdz-berlin.de (Ralph.Grothe at itdz-berlin.de) Date: Fri, 7 Oct 2005 14:43:10 +0200 Subject: Scheduled Downtimes Message-ID: Hi, someone else asked today here about how to handle regular scheduled downtimes. Since their place, like ours, seemed to have well defined intervals for maintenance of different host groups he asked how to avoid always having to click a link and fill in a form within the web interface. Someone replied to him to best define a custom notification period that excludes exactly the period of maintenance. Well, this doesn't look quite feasible for our maintenance intervals. Or at least I cannot see how to cover them by usage of the weekday directives, because e.g. there is one host group that have their maintenance every Wednesday in an even week from 15:00 to 19:00 hrs. while another only requires every Wednesday of the 1. uneven week. Isn't it possible to supply Nagios a custom downtime.dat file whenever such scheduled downtime is due? Having clicked one from the web interface for a test host I found this file in the var directory: $ cat var/downtime.dat ######################################## # NAGIOS DOWNTIME FILE # # THIS FILE IS AUTOMATICALLY GENERATED # BY NAGIOS. DO NOT MODIFY THIS FILE! ######################################## info { created=1128687420 version=2.0b3 } hostdowntime { host_name=terra downtime_id=1 entry_time=1128687420 start_time=1128687265 end_time=1136617200 triggered_by=0 fixed=1 duration=7929935 author=nagiosadmin comment=snooze } Is there a description of the expected format of such a file, lingering somewhere well concealed in the docs (haven't found it yet)? Is a hostgroup_name a valid directive in this context? Can there be repeated hostdowntime {} blocks for other hostgroups in this file to cover different requirements? Will Nagios look automatically in its var dir before issuing notifications, or is there a way to signal it the existence of such a file? Regards Ralph ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From holger at CIS.FU-Berlin.DE Fri Oct 7 14:50:39 2005 From: holger at CIS.FU-Berlin.DE (Holger Weiss) Date: Fri, 7 Oct 2005 14:50:39 +0200 Subject: Inexplicable service escalation behaviour In-Reply-To: References: Message-ID: <20051007125039.GA10219469@CIS.FU-Berlin.DE> * Ralph.Grothe at itdz-berlin.de [2005-10-07 09:43]: > Btw, will the macros be exported to external commands' environments > per default, so that for instance in a Perl script I could refer to > them by e.g. $ENV{NOTIFICATIONNUMBERS} ? (never mind, more of a > rhetorical question which I can easily answer myself by trial) By trial you might miss that they are exported, but with "NAGIOS_" prepended to their names ;-) | Macros as Environment Variables | | Starting with Nagios 2.0, most macros have been made available as | environment variables. This means that scripts that are run from Nagios | (i.e. service and host check commands, notification commands, etc.) can | reference these macros directly as standard environment variables. For | purposes of security and sanity, $USERn$ and "on-demand" host and | service macros are not made available as environment variables. | Environment variables that contain macros are named the same as their | corresponding macro names (listed below), with "NAGIOS_" prepended to | their names. For example, the $HOSTNAME$ macro would be available as an | environment variable named "NAGIOS_HOSTNAME". [ http://nagios.sourceforge.net/docs/2_0/macros.html ] Holger -- PGP fingerprint: F1F0 9071 8084 A426 DD59 9839 59D3 F3A1 B8B5 D3DE ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Riccardo.GENTILUCCI at skytv.it Fri Oct 7 14:52:21 2005 From: Riccardo.GENTILUCCI at skytv.it (GENTILUCCI Riccardo) Date: Fri, 7 Oct 2005 14:52:21 +0200 Subject: R: R: Problem installing nagios-plugins (1.4.1.1): Perl (Net::SNMP) not found but installed! Message-ID: Ralph you're right..the encryption is useful for a snmpv3 environment (not mine)!! I'll follow the way you've briefly described and start from an atomic environment to create the molecola NAGIOS!! I really appreciate the kindness and courtesy of the members of this forum.. i hope I'll be able to be useful too sooner Riccardo Gentilucci IT SKY - Direzione Network & Security Management SKY Italia Via Piranesi, 46 - 20137 Milano Tel: +390270027426 Fax: +390270022248 Mob: +393355342826 Emailto: riccardo.gentilucci at skytv.it Website: http://www.skytv.it -----Messaggio originale----- Da: Ralph.Grothe at itdz-berlin.de [mailto:Ralph.Grothe at itdz-berlin.de] Inviato: venerd? 7 ottobre 2005 11.48 A: GENTILUCCI Riccardo; Ralph.Grothe at itdz-berlin.de; ankushgrover1711 at gmail.com; nagios-users at lists.sourceforge.net Oggetto: RE: R: [Nagios-users] Problem installing nagios-plugins (1.4.1.1): Perl (Net::SNMP) not found but installed! Riccardo, sorry, I wan't following your thread. But I guess I know what you mean. The problem seems to be that you followed two different installation routes that don't seem to care much about the other's standards. I suspect that the perl-net-snmp rpm package doesn't really care about your Perl installation but only consults its rpm database to see if any of the rpm prerequisite packages (those that an "rpm -qpR /some/path/perl-net-snmp.some_release.rpm" would list) is already installed, and would reject installation (unless given the --nodeps) altogether. But I cannot say for sure because I install all the Perl modules I would ever require the Perl way (i.e. either via CPAN.pm or manually) instead of using any rpms. This way however, would probably also force you to build the Nagios or any Nagios Exchange software from the sources as well. But it's more likely to work that way than depending on any pre-built packages for any Linux packaging system. You could, if you required a consistent rpm database that also comprises the Nagios SNMP plug-in stuff, try to get so-called srpm package which really contain the source tarball together with a spec file. Install those like usual rpm files and change to /usr/src/redhat (I hope the path is correct, I only have few Linux boxes to monitor and thus am seldomly exposed to RH) I suggest you read "man rpmbuild" for how to build an RH rpm with an adapted spec file from there. Btw, do you really require Crypt::DES for your SNMP management in your shielded LAN? I may be wrong, but I thought encryption was only an option in SNMPv3? > -----Original Message----- > From: GENTILUCCI Riccardo [mailto:Riccardo.GENTILUCCI at skytv.it] > Sent: Friday, October 07, 2005 11:16 AM > To: Ralph.Grothe at itdz-berlin.de; ankushgrover1711 at gmail.com; > nagios-users at lists.sourceforge.net > Subject: R: R: [Nagios-users] Problem installing nagios-plugins > (1.4.1.1): Perl (Net::SNMP) not found but installed! > > > Dear Ralph, I've made what you've indicated (?'ve installed > the package fron CPAN and so its dependencies) but without > positive solution... > > Another guy say me to install the per-net-snmp package and > this request me what I've written (the Crypt::DES module) > > I've tried to startup another system with same fedora fc3 and > update it till the last available packages...same story in > order to install the nagios plugins... > > I don't understand if there's some problems in my fedora > installations or other (I've install it in two different > machines, different hw etc...): this is a big frustration. > > By the way, thanks for your kind answer. > > > Riccardo Gentilucci > IT SKY - Direzione Network & Security Management > > SKY Italia > Via Piranesi, 46 - 20137 Milano > Tel: +390270027426 > Fax: +390270022248 > Mob: +393355342826 > > Emailto: riccardo.gentilucci at skytv.it > > Website: http://www.skytv.it > -----Messaggio originale----- > Da: Ralph.Grothe at itdz-berlin.de [mailto:Ralph.Grothe at itdz-berlin.de] > Inviato: venerd? 7 ottobre 2005 11.06 > A: GENTILUCCI Riccardo; ankushgrover1711 at gmail.com; > nagios-users at lists.sourceforge.net > Oggetto: RE: R: [Nagios-users] Problem installing > nagios-plugins (1.4.1.1): Perl (Net::SNMP) not found but installed! > > > I've tried to install the perl-net-snmp package..now this > request another perl module perl(Crypt::DES).....and the story > goes on!! > > It's easiest to resolve common CPAN dependency issues by using > the CPAN module for module installation. > > The first time you invoke the CPAN shell you will be asked a > couple of questions like your preferred CPAN mirrors, > any proxy settings (if applicable), caching, and follow-up > options. > > Once having gone through this you would simply issue > > perl -MCPAN -e "install Net::SNMP" > > and perl would take care to resolve all prerequisites (as far as > CPAN modules are concerned). > > Some modules also require prior installation of libraries (e.g. > XML::Parser requires libexpat), > which must be carried out separately usually. > ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Ralph.Grothe at itdz-berlin.de Fri Oct 7 15:11:25 2005 From: Ralph.Grothe at itdz-berlin.de (Ralph.Grothe at itdz-berlin.de) Date: Fri, 7 Oct 2005 15:11:25 +0200 Subject: Inexplicable service escalation behaviour Message-ID: Many thanks Holger, for reminding me of the NAGIOS_ prefix. I must have overread the stanza you cited from the docs (but rediscovered it now). However, semi-consciuosly it must have been in my head because I remembered having read something about environment vars somewhere. > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net > [mailto:nagios-users-admin at lists.sourceforge.net]On Behalf Of Holger > Weiss > Sent: Friday, October 07, 2005 2:51 PM > To: Nagios Users > Subject: Re: [Nagios-users] Inexplicable service escalation behaviour > > > * Ralph.Grothe at itdz-berlin.de [2005-10-07 09:43]: > > Btw, will the macros be exported to external commands' environments > > per default, so that for instance in a Perl script I could refer to > > them by e.g. $ENV{NOTIFICATIONNUMBERS} ? (never mind, more of a > > rhetorical question which I can easily answer myself by trial) > > By trial you might miss that they are exported, but with "NAGIOS_" > prepended to their names ;-) > > | Macros as Environment Variables > | > | Starting with Nagios 2.0, most macros have been made available as > | environment variables. This means that scripts that are run > from Nagios > | (i.e. service and host check commands, notification > commands, etc.) can > | reference these macros directly as standard environment > variables. For > | purposes of security and sanity, $USERn$ and "on-demand" host and > | service macros are not made available as environment variables. > | Environment variables that contain macros are named the > same as their > | corresponding macro names (listed below), with "NAGIOS_" > prepended to > | their names. For example, the $HOSTNAME$ macro would be > available as an > | environment variable named "NAGIOS_HOSTNAME". > > [ http://nagios.sourceforge.net/docs/2_0/macros.html ] > > Holger > > -- > PGP fingerprint: F1F0 9071 8084 A426 DD59 9839 59D3 F3A1 B8B5 D3DE > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, > discussions, > and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS > when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Ralph.Grothe at itdz-berlin.de Fri Oct 7 15:26:45 2005 From: Ralph.Grothe at itdz-berlin.de (Ralph.Grothe at itdz-berlin.de) Date: Fri, 7 Oct 2005 15:26:45 +0200 Subject: Probably a trivial permission issue? Message-ID: > -----Original Message----- > From: Rob Moss [mailto:robmossrm at aol.com] > Sent: Friday, October 07, 2005 2:36 PM > To: Ralph.Grothe at itdz-berlin.de > Cc: nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] Probably a trivial permission issue? > > > Ralph.Grothe at itdz-berlin.de wrote: > > >Hi, > > > >while logged in in my Nagios' server web realm with nagiosadmin > >credentials, > >who is supposed to have full permission, > > > >$ grep nagiosadmin /opt/sw/nagios/etc/cgi.cfg|grep -v ^# > >authorized_for_all_services=nagiosadmin > >authorized_for_all_hosts=nagiosadmin > >authorized_for_all_service_commands=nagiosadmin > >authorized_for_all_host_commands=nagiosadmin > > > >and clicking on the "view notifications for this host" link > >I get the following: > > > >Error: Cannot open log file '/opt/sw/nagios/var/log/nagios.log' > >for reading! > > > > > >How come? > > > > > Check all directories from / to ensure that the 'nagios' > user has R and > X permissions, or at least X permissions to traverse the directories. > > One of the directories is probably blocking nagios from > passing through it. > O, this is so embarrassing!!! :-( Sorry, for bothering you (but sometimes it needs another pair of eyes) $ for d in $(echo /opt/sw/nagios/var/log|tr \/ \\040);do dir="$dir/$d";ls -ld $dir;done drwxr-xr-x 4 root system 512 Jun 21 2004 /opt drwxr-sr-x 53 sys sys 2048 Aug 05 18:28 /opt/sw drwxr-sr-x 10 nagios nagios 512 Sep 20 14:35 /opt/sw/nagios lrwxrwxrwx 1 nagios nagios 15 Jun 20 13:42 /opt/sw/nagios/var -> /var/opt/nagios drwxr----- 2 nagios nagios 512 Oct 02 00:00 /opt/sw/nagios/var/log $ chmod go+rx /opt/sw/nagios/var/log $ ls -ld /opt/sw/nagios/var/log drwxr-xr-x 2 nagios nagios 512 Oct 02 00:00 /opt/sw/nagios/var/log ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lkk at interfree.it Fri Oct 7 15:31:07 2005 From: lkk at interfree.it (lkk at interfree.it) Date: 7 Oct 2005 13:31:07 -0000 Subject: Premature end of script headers statusmap.cgi! Message-ID: <20051007133107.19610.qmail@community22.interfree.it> Hi! I'm a new user of Nagios. I have a problem: Nagios run correctly on my Red Hat 9 Linux Box -------------------------------------------------------------------------------- Server error! Error message: Premature end of script headers: statusmap.cgi Error 500 xxx.xxxx.xxxx.xxxx Fri 07 Oct 2005 03:00:46 PM CEST Apache/2.0.40 (Red Hat Linux) -------------------------------------------------------------------------------- can someone help me? Thanks ------------------------------------------------------------------------- Visita http://domini.interfree.it, il sito di Interfree dove trovare soluzioni semplici e complete che soddisfano le tue esigenze in Internet, ecco due esempi di offerte: - Registrazione Dominio: un dominio con 1 MB di spazio disco + 2 caselle email a soli 18,59 euro - MioDominio: un dominio con 20 MB di spazio disco + 5 caselle email a soli 51,13 euro Vieni a trovarci! Lo Staff di Interfree ------------------------------------------------------------------------- ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Fri Oct 7 15:33:20 2005 From: ae at op5.se (Andreas Ericsson) Date: Fri, 07 Oct 2005 15:33:20 +0200 Subject: Verion 2.0b4 how does cgi's nagios_check_command work? In-Reply-To: <200510071233.j97CXaEV012599@mx1.cs.umb.edu> References: <200510071233.j97CXaEV012599@mx1.cs.umb.edu> Message-ID: <43467920.4070508@op5.se> John P. Rouillard wrote: > In message <43465AB9.6020304 at op5.se>, > Andreas Ericsson writes: > >>John P. Rouillard wrote: >> >>>Hi all: >>> >>>I have groveled through the cgi files in nagios 2.0b4 and I can't find >>>any instance of the nagios_check_command actually being executed. Am I >>>just missing it somewhere? >>> >> >>The nagios_check_command has been incorporated into the cgi's. That's >>why the status.dat file has the nagios entry on top. > > > So the CGI's report nagios is down if and only if the status.date file > is missing? That matches what I see when I shut down nagios properly. > If I shut down nagios I see the "Error: Could not read host and > service status information!" page in the web interface. > > >>> cgiutils.c load the variable from the config file >>> extinfo.c checks to see if the variable is empty and produces a nice >>> little warning if it is. >>> >>>but nothing seems to actually execute the command. The reason I ask is >>>that nagios was down and the cgi's all happily reported that it was >>>up. Could this be because the host and service status files were >>>available since the machine crashed? >> >>Yes, that's almost certainly it. There is no really good way of >>detecting that nagios is actually running unless you're logged in as >>root. > > > Hmm, I am not sure I follow why you need to be logged in as root. Because otherwise you shouldn't have access to reading process information about another users process. > Why > not stat the status.log file and check to see if its (mtime) timestamp > is less than the setting of: > > status_update_interval*2 > > if aggregate_status_updates is enabled? One could also allow a setting > "freshness_threshold" in cgi.cfg that is the number of seconds/minutes > old the status.dat file is allowed to be if aggregate_status_updates > isn't set. > Good idea. Write the code for it and submit a patch. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gargula_fug at informacao.srv.br Fri Oct 7 15:43:10 2005 From: gargula_fug at informacao.srv.br (Cesar Tibau) Date: Fri, 7 Oct 2005 10:43:10 -0300 Subject: Nagios in OpenBSD 3.7 Message-ID: <00fa01c5cb45$0edbe080$8102000a@anarquia> Hi all. somebody already obtained to install the nagios 2.X in OpenBSD 3.7? Already I tried of all the ways, but it always gives to problem in "make all". Grateful for any help, Cesar Tibau -------------- next part -------------- An HTML attachment was scrubbed... URL: From robmossrm at aol.com Fri Oct 7 15:44:41 2005 From: robmossrm at aol.com (Rob Moss) Date: Fri, 07 Oct 2005 14:44:41 +0100 Subject: Premature end of script headers statusmap.cgi! In-Reply-To: <20051007133107.19610.qmail@community22.interfree.it> References: <20051007133107.19610.qmail@community22.interfree.it> Message-ID: <43467BC9.3020208@aol.com> lkk at interfree.it wrote: >Hi! >I'm a new user of Nagios. >I have a problem: > Nagios run correctly on my Red Hat 9 Linux Box > >-------------------------------------------------------------------------------- >Server error! > >Error message: >Premature end of script headers: statusmap.cgi > >Error 500 >xxx.xxxx.xxxx.xxxx >Fri 07 Oct 2005 03:00:46 PM CEST >Apache/2.0.40 (Red Hat Linux) > >-------------------------------------------------------------------------------- >can someone help me? >Thanks > > This error is generated by Apache, probably because you haven't installed the Nagios CGI's with the right permissions, or the cgi.cfg file is not set up correctly. Read the error_log in your apache log directory and go from there, it should help you track down the problem rob ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Fri Oct 7 15:48:55 2005 From: ae at op5.se (Andreas Ericsson) Date: Fri, 07 Oct 2005 15:48:55 +0200 Subject: Nagios in OpenBSD 3.7 In-Reply-To: <00fa01c5cb45$0edbe080$8102000a@anarquia> References: <00fa01c5cb45$0edbe080$8102000a@anarquia> Message-ID: <43467CC7.90005@op5.se> Cesar Tibau wrote: > Hi all. > > somebody already obtained to install the nagios 2.X in OpenBSD 3.7? Already I tried of all the ways, but it always gives to problem in "make all". > What, exactly, are the error messages you get? > Grateful for any help, > Then you should supply some details so someone can. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gargula_fug at informacao.srv.br Fri Oct 7 16:01:14 2005 From: gargula_fug at informacao.srv.br (Cesar Tibau) Date: Fri, 7 Oct 2005 11:01:14 -0300 Subject: Nagios in OpenBSD 3.7 Message-ID: <018d01c5cb47$95000450$8102000a@anarquia> ./configure bash-3.00# make all cd ./base && make gcc -g -O2 -I/usr/local/include -DHAVE_CONFIG_H -DNSCORE -c broker.c In file included from ../include/config.h:114, from broker.c:26: /usr/include/sys/resource.h:56: error: field `ru_utime' has incomplete type /usr/include/sys/resource.h:57: error: field `ru_stime' has incomplete type *** Error code 1 Stop in /usr/src/instalacao/nagios/nagios-2.0b4/base. *** Error code 1 Stop in /usr/src/instalacao/nagios/nagios-2.0b4 (line 58 of Makefile). __________________________________________ Any idea? -------------- next part -------------- An HTML attachment was scrubbed... URL: From kidd1270 at gmail.com Fri Oct 7 15:53:02 2005 From: kidd1270 at gmail.com (Kidd Chaos) Date: Fri, 7 Oct 2005 08:53:02 -0500 Subject: check_fping replacement needed with -t option Message-ID: <91a354090510070653q2e9417ebjd9d9afcdbc060297@mail.gmail.com> Hi kids: Does anyone have a more option version of check_fping besides this one? http://www.nagiosexchange.org/Networking.53.0.html?&tx_netnagext_pi1%5Bp_view%5D=24 Problem: I need the -t (target timeout) option available for check_fping. I've got some 56k's that need about a 10 second timeout on fping. (don't ask). Thanks, Kidd - moving at the speed of government while spending at the speed of government. ---------------------------------------------------------------------------------------------------------------------- -------------- next part -------------- An HTML attachment was scrubbed... URL: From ae at op5.se Fri Oct 7 16:08:14 2005 From: ae at op5.se (Andreas Ericsson) Date: Fri, 07 Oct 2005 16:08:14 +0200 Subject: check_fping replacement needed with -t option In-Reply-To: <91a354090510070653q2e9417ebjd9d9afcdbc060297@mail.gmail.com> References: <91a354090510070653q2e9417ebjd9d9afcdbc060297@mail.gmail.com> Message-ID: <4346814E.7070203@op5.se> Kidd Chaos wrote: > Hi kids: > Does anyone have a more option version of check_fping besides this one? > http://www.nagiosexchange.org/Networking.53.0.html?&tx_netnagext_pi1%5Bp_view%5D=24 > > Problem: I need the -t (target timeout) option available for check_fping. > > I've got some 56k's that need about a 10 second timeout on fping. (don't > ask). > Try check_icmp. It's in the plugin pack at http://oss.op5.se/nagios -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From JoeRegular at gov.nl.ca Fri Oct 7 16:21:09 2005 From: JoeRegular at gov.nl.ca (Joe Regular) Date: Fri, 07 Oct 2005 11:51:09 -0230 Subject: Service Detail Displays Everything Message-ID: I've recently setup a Nagios 2.0b4 system and now am in the process of trying to setup contactgroups. Everything with regards to Host Detail seems to be working such that contacts can only see those hose for which they are part of the contactgroup. However, when viewing the Service Details via the website, users can see all services. Users are also notified of all service related problems. Is there any way to limit what services a user can see and access? Joe ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rouilj at cs.umb.edu Fri Oct 7 16:30:26 2005 From: rouilj at cs.umb.edu (John P. Rouillard) Date: Fri, 07 Oct 2005 10:30:26 -0400 Subject: Verion 2.0b4 how does cgi's nagios_check_command work? In-Reply-To: Your message of "Fri, 07 Oct 2005 15:33:20 +0200." <43467920.4070508@op5.se> References: <43467920.4070508@op5.se> Message-ID: <200510071431.j97EV0tC028112@mx1.cs.umb.edu> In message <43467920.4070508 at op5.se>, Andreas Ericsson writes: >John P. Rouillard wrote: >> Andreas Ericsson writes: >>>John P. Rouillard wrote: >>>>but nothing seems to actually execute the command. The reason I ask is >>>>that nagios was down and the cgi's all happily reported that it was >>>>up. Could this be because the host and service status files were >>>>available since the machine crashed? >>> >>>Yes, that's almost certainly it. There is no really good way of >>>detecting that nagios is actually running unless you're logged in as >>>root. >> >> Hmm, I am not sure I follow why you need to be logged in as root. > >Because otherwise you shouldn't have access to reading process >information about another users process. But an external command called by the cgi that is suid to the nagios user would allow monitoring of the nagios process w/o root privs. But I understand the point. >> Why not stat the status.log file and check to see if its (mtime) timestamp >> is less than the setting of: >> >> status_update_interval*2 >> >> if aggregate_status_updates is enabled? One could also allow a setting >> "freshness_threshold" in cgi.cfg that is the number of seconds/minutes >> old the status.dat file is allowed to be if aggregate_status_updates >> isn't set. >> > >Good idea. Write the code for it and submit a patch. I'll see what I can do. Hopefully somebody with better C skills will beat me to it. -- rouilj John Rouillard =========================================================================== My employers don't acknowledge my existence much less my opinions. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jmwalker at itgroundwork.com Fri Oct 7 17:04:18 2005 From: jmwalker at itgroundwork.com (John Mark Walker) Date: Fri, 7 Oct 2005 08:04:18 -0700 Subject: Spam from Groundwork Open Source Solutions In-Reply-To: <20051007124351.10426.qmail@mullet.softflare.net> References: <21943250.1128623484789.JavaMail.sfdc@na1-app15> <20051007124351.10426.qmail@mullet.softflare.net> Message-ID: <200510070804.18986.jmwalker@itgroundwork.com> Paul, I profusely apologize that this happened. I have escalated the incident to the highest levels and will ensure that this never happens again. I have directed everyone at GroundWork to go through me first before consulting anyone on this list. To the entire list: In the future, if any of you receive inappropriate contact from us, I would like to know as soon as possible. Thanks, John Mark Walker Developer Relations Manager GroundWork On Friday 07 October 2005 05:43 am, Paul L. Allen wrote: > I just received this spam from Groundwork Open Source Solutions. To > the best of my knowledge nothing I have ever posted has even implied > that I might be remotely interested in them, their product or their > services. > > I don't recall responding to this guy on the list about anything > either, but I don't make any special effort to remember the names of > people I only respond to once or twice. But even if I had answered > one of his questions or even asked him a question, that does not > entitle him to spam me with an advert for his product. Maybe if I'd > asked about an integrated solution like openview he would be > justified in sending me a one-liner saying he had such a product and > asking if I'd be interested in more info but not a sales pitch. > > Of course, if I had asked on the list if there was an equivalent of > openview then the correct place to respond would have been to the > list so that others could see the answer and find it in the archives. > The fact that he mailed me personally indicates that he knows damned > well that posting a sales pitch to the list would not be appreciated. > > Sadly, blocking him from the list won't stop him trawling the > archives. However, there is nothing to stop the Nagios documentation > listing companies like Groundwork that behave unethically and > recommending that people refuse to do business with them. > > Wayne Dahler writes: > > Paul, > > After a very successful two years in the US market, we are looking > > to Europe as our next opportunity. I am hoping that you can point > > me to the right person within your organization for a brief > > discussion on IT infrastructure-monitoring solutions. > > > > I have attached several documents that you can share with the > > person in your company who would be interested in how open source > > tools might be used to enhance your monitoring environment. > > > > In summary: > > We have integrated and enhanced 15 of the best-of-breed opensource > > tools together in IT GroundWork including Jetspeed, Nagios, RRD > > Tool and NTOP. It provides monitoring coverage for networks, > > servers, applications, databases and network traffic. > > > > IT GroundWork delivers the same functionality as HP OpenView or CA > > Unicenter for about 20-30 percent of the cost. Or if you are > > currently using tools that you like, but find that you have > > additional needs, IT GroundWork can augment your monitoring > > environment while maintaining your investment cost effectively. > > > > We are not looking to replace the tools you are currently using, > > rather we help companies like yours use open source tools to > > supplement their tools, cost-effectively enhancing their monitoring > > environment. > > > > Some areas that might be interesting to you are: > > * An IT Dashboard - high level overview for executives > > * Auto discovery tools - for easy of expansion > > * Synthetic transactions which document the exact performance of > > what an end-user/client experiences, breaking down the transaction > > by segments - internet, network, apps server, database back-end > > server, etc. to pin-pointing performance and bottlenecks. * Web > > portal for all your tools to roll up to - including single sign-on > > capability etc. * Highly graphical interface - broken down by the > > way you do business (geographic, departmental, functional, or any > > way you wish). > > > > If this sounds interesting to you or someone on your team, we > > provide a brief WEBEX presentation which will give you an > > opportunity to see a demo of the Groundwork package and its > > capabilities. You can ask questions about how we might work within > > you environment and how we would integrate with you existing tools. > > > > Please give me a call and let me know your level of interest. > > Thanks for your help and your time. > > > > > > > > Best Regards, > > > > Wayne Dahler > > Account Development Mgr. > > GroundWork Open Source Solutions > > (510) 899-7735-direct > > (510) 428.0980-fax > > www.itgroundwork.com > > wdahler at itgroundwork.com ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Mark.Law at thomson.com Fri Oct 7 17:10:00 2005 From: Mark.Law at thomson.com (Mark.Law at thomson.com) Date: Fri, 7 Oct 2005 11:10:00 -0400 Subject: Nagios monitors in the EDC Message-ID: Please take a look at the Nagios TEST at the EDC: https://10.226.53.17/nagios/ login is guest/guest. I have the basic stuff up. -Mark -----Original Message----- From: Nelson, Lincoln (TH USA) Sent: Wednesday, October 05, 2005 11:40 AM To: Law, Mark (TSH Center); McManus, Timothy (TH USA) Cc: Croce, James (TH USA); Cubr, Timothy (TSH Center); Zephirin, Pascal (TH USA) Subject: RE: Nagios monitors in the EDC Mark, The nc_net agent has been installed on all of the WIN* boxes and I've attached the request form covering our initial server monitoring needs for the NADC / WCM environment. As soon as we can get the process worked out for these I'll follow with forms for the remaining boxes. Please let me know if you'd like for me to open and include this in a Remedy ticket. Thanks, Lincoln --------------------------------------------------------------- Lincoln Nelson Supervisor, Internet Systems Administration Thomson Ph: 201.358.7924 Fx: 201.722.2660 -----Original Message----- From: Law, Mark (TSH Center) Sent: Thursday, August 18, 2005 2:40 PM To: Nelson, Lincoln (TH USA); McManus, Timothy (TH USA) Cc: Croce, James (TH USA); Cubr, Timothy (TSH Center); Naughton, Darrin (TS Ireland); Zephirin, Pascal (TH USA) Subject: RE: Nagios monitors in the EDC Hi Lincoln, I've attached a couple of documents that might help. The word doc lays out the general requirements, including the agents that need to be installed on Windows and unix machines. The spreadsheet has information you can use as a guide to define the people, groups, machines, machine groups and services that need to be monitored. Don't worry too much about getting it right, just put in as much information as you can and we'll work out the details. For starters, the 10.225.x.x addresses of the machines will let me block out the different product groups you mention below and get them up and 'pinged' in the test system. -Mark -----Original Message----- From: Nelson, Lincoln (TH USA) Sent: Wednesday, August 17, 2005 11:21 AM To: McManus, Timothy (TH USA) Cc: Croce, James (TH USA); Cubr, Timothy (TSH Center); Naughton, Darrin (TS Ireland); Law, Mark (TSH Center); Zephirin, Pascal (TH USA) Subject: RE: Nagios monitors in the EDC The following boxes need to be monitored according to this request, please advise as to next step for us to comply: NADC: -------- CenterWatch: - cwp01w01, cwp01w02, cwp01w03, cwp01qa01, cwp01qa02 CPS: - cpsp03w01, cpsp03d01 BPM: - bpmp03w01, bpmp03w02, bpmp03d01 WebEssentials: - cathidwick PDRHealth and static sites: - cameweb1 GCG: - gccp03w01, gccp03d01 WCM / PDR.net: - thnawpprodw1, thnawpprodw2, thnawproda1, thnawpuada1, thnawpdev1, thnawprodc1, thnawpuatw1, thnawpprew1, Infrastructure - DNS, SiteScope, sendmail - thaaop01, thaaop02, thaaop03 EDC: ----- WCM / PDR.net: - thewpprodw1 CPS: - cpsp01w01, cpsp01d01 BPM: - bpmp01w01, bpmp01w01, bpmp01d01 Infrastructure - DNS, SiteScope, sendmail - thedcop01, thedcop02 Thanks, LN. --------------------------------------------------------------- Lincoln Nelson Supervisor, Internet Systems Administration Thomson Ph: 201.358.7924 Fx: 201.722.2660 -----Original Message----- From: McManus, Timothy (TH USA) Sent: Monday, August 15, 2005 4:41 PM To: Nelson, Lincoln (TH USA); Zephirin, Pascal (TH USA) Cc: Croce, James (TH USA); Cubr, Timothy (TSH Center); Naughton, Darrin (TS Ireland); Law, Mark (TSH Center) Subject: Nagios monitors in the EDC Lincoln/Pascal: All servers in the EDC need to be monitored by Nagios. I have included a document describing the base level and additional levels of monitoring available. The following servers need to be added to Nagios. Please add to this list if I missed anything. BPMP01D01 BPMP01W02 BPMP01W01 CPSP01D01 CPSP01W01 Please let me know when we will be able to monitor these machines. If you have any specific questions regarding Nagios or the monitoring in the EDC, Mark Law should be able to assist you. Moving forward, any new boxes move into the NADC or EDC need to be configured for monitoring before they are shipped. << File: EDC-NADC Nagios-Cacti Monitoring - requirements-services.doc >> -- Tim McManus Business Unit Advocate - Healthcare Thomson Scientific and Healthcare Five Paragon Drive Montvale, New Jersey 07645 201-358-7472 201-722-3861 fax timothy.mcmanus at thomson.com ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From pla at softflare.com Fri Oct 7 17:38:00 2005 From: pla at softflare.com (Paul L. Allen) Date: Fri, 07 Oct 2005 16:38:00 +0100 Subject: Spam from Groundwork Open Source Solutions In-Reply-To: <200510070804.18986.jmwalker@itgroundwork.com> References: <21943250.1128623484789.JavaMail.sfdc@na1-app15> <20051007124351.10426.qmail@mullet.softflare.net> <200510070804.18986.jmwalker@itgroundwork.com> Message-ID: <20051007153800.22347.qmail@mullet.softflare.net> Hi John John Mark Walker writes: > I profusely apologize that this happened. Thanks. Apology accepted. -- Paul Allen Softflare Support ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rejaine at bhz.jamef.com.br Fri Oct 7 18:05:30 2005 From: rejaine at bhz.jamef.com.br (Rejaine Monteiro) Date: Fri, 07 Oct 2005 13:05:30 -0300 Subject: Help with nagios_grapher Message-ID: <1128701130.8699.18.camel@acesup2.bhz.jamef> I have this configuration in ngraph.cfg define ngraph{ service_name MEMORY graph_log_regex mem_free:([0-9]*) graph_value Memory Free graph_units bytes graph_legend 3min avg memory free rrd_plottype STACK rrd_color EACC00 } But, nagios_grapher don't work.. Ngraprh.log reports this: 2005-10-07 12:53:21 PIPE: host1 MEMORY host1 MEMORY mem_free:129368 mem_used:385552 swap_free:43484 swap_used:195512 2005-10-07 12:53:21 VALUES: [host1][MEMORY]:No matching output values found.. I tried to put other values in graph_log_regex, like this: mem_free:([0-9]*) mem_free:([0-9]+) mem_free:([0-9]) mem_free:(\d+) mem_free:(\d*) But all failed... What I'm doing wrong? Sorry by bad english and thanks all! ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From julien.touche at lycos.com Fri Oct 7 18:19:32 2005 From: julien.touche at lycos.com (Julien TOUCHE) Date: Fri, 07 Oct 2005 18:19:32 +0200 Subject: Nagios in OpenBSD 3.7 In-Reply-To: <00fa01c5cb45$0edbe080$8102000a@anarquia> References: <00fa01c5cb45$0edbe080$8102000a@anarquia> Message-ID: <4346A014.8060102@lycos.com> Cesar Tibau wrote on 07/10/2005 15:43: > somebody already obtained to install the nagios 2.X in OpenBSD 3.7? > Already I tried of all the ways, but it always gives to problem in > "make all". why don't you check openbsd faq and ports ... http://www.openbsd.org/cgi-bin/cvsweb/ports/net/nagios/ seems to me 2.0b4 is in 3.8 and current ports. and patchs are provided, you can test them on 3.7 Regards Julien ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From benny at bennyvision.com Fri Oct 7 18:26:51 2005 From: benny at bennyvision.com (C. Bensend) Date: Fri, 7 Oct 2005 11:26:51 -0500 (CDT) Subject: Nagios in OpenBSD 3.7 In-Reply-To: <4346A014.8060102@lycos.com> References: <00fa01c5cb45$0edbe080$8102000a@anarquia> <4346A014.8060102@lycos.com> Message-ID: <3553.134.244.169.17.1128702411.squirrel@webmail.stinkweasel.net> > why don't you check openbsd faq and ports ... > > http://www.openbsd.org/cgi-bin/cvsweb/ports/net/nagios/ > > seems to me 2.0b4 is in 3.8 and current ports. > and patchs are provided, you can test them on 3.7 The patches applied just fine for me - I built 2.0b4 on a 3.7-STABLE machine just a few days ago using them. Benny -- "Young lady, I yelled at you because that paperwork looked like it had been done by a drunk four-year-old." -- Dr. Bob Kelso, "Scrubs" ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Fri Oct 7 19:36:44 2005 From: ae at op5.se (Andreas Ericsson) Date: Fri, 07 Oct 2005 19:36:44 +0200 Subject: Service Detail Displays Everything In-Reply-To: References: Message-ID: <4346B22C.8010706@op5.se> Joe Regular wrote: > I've recently setup a Nagios 2.0b4 system and now am in the process > of trying to setup contactgroups. Everything with regards to Host > Detail seems to be working such that contacts can only see those hose > for which they are part of the contactgroup. However, when viewing > the Service Details via the website, users can see all services. > Users are also notified of all service related problems. > > Is there any way to limit what services a user can see and access? > Yes. Read the docs and the FAQ, or browse the list archives. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Fri Oct 7 19:38:52 2005 From: ae at op5.se (Andreas Ericsson) Date: Fri, 07 Oct 2005 19:38:52 +0200 Subject: Nagios monitors in the EDC In-Reply-To: References: Message-ID: <4346B2AC.3030200@op5.se> Mark.Law at thomson.com wrote: > Please take a look at the Nagios TEST at the EDC: > https://10.226.53.17/nagios/ login is guest/guest. > I expect you didn't intend to send this email to the nagios-users list, so you might want to change those passwords. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From msugano at uolinc.com Fri Oct 7 20:42:40 2005 From: msugano at uolinc.com (Marcel Mitsuto Fucatu Sugano) Date: Fri, 07 Oct 2005 15:42:40 -0300 Subject: [web interface problems] status.dat on tmpfs problem In-Reply-To: <433D24F3.5020708@op5.se> References: <433D24F3.5020708@op5.se> Message-ID: <1128710560.25162.28.camel@localhost> Hi List, Andreas, On Fri, 2005-09-30 at 13:43 +0200, Andreas Ericsson wrote: > Lori Adams wrote: > > I can't seem to get to your site. http://oss.op5.se/nagios. I am > > trying to find the latest package or sources for nrpe on solaris. Would > > your site have the most recent ones? > > > Yes. My ISP is however thoroughly stubborn about their stupidity. I have > dual static IP's at home, and they only seem to be capable of serving > one at the time. I didn't even know about it until people started > complaining about not being able to reach oss.op5.se (obviously, it > works from the inside). > I've tried to place status.dat file on a tmpfs, and the status.cgi begins to crash with "premature ending of cgi script" messages on errorlog. And 500 Internal Server error display. What i should do to fix this situation? I've mounted a tmpfs at /opt/nagios/var/tmpfs and pointed nagios to write the status file at that path. My status.dat file is about 15MB. At the nagios-devel list, I saw a lot of messages claiming for CGI scaling improvements, the same problem I need to manage with the nagios config i'm working. It's distributed monitoring with almost 10k of services been actively monitored and all the status piped to send_nsca to a central nagios server. This config gives me something about 300 alarms, and the status.cgi takes real long to load. TIA, Marcel Mitsuto ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From JoeRegular at gov.nl.ca Fri Oct 7 20:53:45 2005 From: JoeRegular at gov.nl.ca (Joe Regular) Date: Fri, 07 Oct 2005 16:23:45 -0230 Subject: Service Detail Displays Everything Message-ID: Andreas, Thank you for taking the time to respond to my previous post. So far I have read the documentation section titled "Authentication And Authorization In The CGI's". I have completed the section describing how to create the htpasswd.users file (htpasswd -c /usr/local/nagios/etc/htpasswd.users) and I've created a user account for myself (htpasswd /usr/local/nagios/etc/htpasswd.users jregular). I then added my contact information to the contacts.cfg file and added myself to the correct groups in the contactgroups.cfg file. Having completed that, the problem is while I can only view the hosts to which I am a contact, I am able to see all services. As for notifications, I am only notified for problems with regards to my host, however, I am notified for all problems related to all services. My inbox is filling up very fast and any help would be greatly appreciated. Joe >>> Andreas Ericsson 2005-10-07 3:06:44 PM >>> Joe Regular wrote: > I've recently setup a Nagios 2.0b4 system and now am in the process > of trying to setup contactgroups. Everything with regards to Host > Detail seems to be working such that contacts can only see those hose > for which they are part of the contactgroup. However, when viewing > the Service Details via the website, users can see all services. > Users are also notified of all service related problems. > > Is there any way to limit what services a user can see and access? > Yes. Read the docs and the FAQ, or browse the list archives. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From msugano at uolinc.com Sat Oct 8 00:00:46 2005 From: msugano at uolinc.com (Marcel Mitsuto Fucatu Sugano) Date: Fri, 07 Oct 2005 19:00:46 -0300 Subject: WEB-Interface performance In-Reply-To: <4344F968.9090701@aol.com> References: <1128555158.11266.32.camel@localhost> <4344F968.9090701@aol.com> Message-ID: <1128722447.25260.39.camel@localhost> Hello again, On Thu, 2005-10-06 at 11:16 +0100, Rob Moss wrote: > Yeah, there are some CFLAGS you could be using to optmise your build.. > I am assuming that you have a recent version of GCC, and that your P4 > HT cpu is shown as having two logical CPU's > > Try rebuilding with the following command: > > cd nagios-2.0b4 > CC=gcc CFLAGS="-mtune=i686 -O3 -pipe -march=i686 -funroll-loops > -ffast-math" \ > ./configure --prefix=/usr/local/nagios ...... [rest of nagios > configure commands] I've tried this with no luck, having an error message displaying that -ffast-math wasn't a recognized flag, with gcc-3.3.5. This was the second try to handle the cgi's performance. The first thing we tryied, was setting up a tmpfs mountpoint at /opt/nagios/var/tmpfs, and pointed nagios to write the status file there, but again with some errors that was a little weird, the cgi begins to ending prematurely (apache errorlog), and displaying "500 internal server error". This happened to 10% of the check_http running to test it :), and the response time didn't get too much of a performace improvement as well. I don't think this could be Virtual Machine OS's fault, so the problem might be with the status.cgi reading the tmpfs, but i can't tell for sure, as we tried this setup yesterday. Is there anyone here who made it? (status.dat been written in a tmpfs mountpoint?) > > The main problem is that you have thousands of hosts, and thousands of > services to read in every time you run status.cgi. No matter how > efficient the program is, reading in and displaying that much data is > going to take a while, and running the same program 15 times > simultaneously is going to affect your performance as you see here. > How many lines is the status.dat file? I only have a few hundred > hosts and services, and the file is 23,000 lines or so, half a meg on > disk.. I would imagine yours is closer to about 50mb and closer to a > million lines. > Some alternatives might be updating the Nagios sidebar so that it > doesn't display ALL hosts by default, maybe just a smaller hostgroup.. > (although i suspect the status.cgi needs to read in the whole file) Or > replacing the standard nagios CGI's with something that is more geared > towards handling hundreds of thousands of hosts/services... I agree with you, and i've been reading the nagios-devel list, and saw that the CGI is a problem to a lot of people who need to maintain some BIG nagios configuration, over 10k services. I also watch that there is a patch to improve the performance of the CGIs, but I couldn't find it anywhere. > > Or perhaps you could have a separate display server, a webserver > running the cgi's which reads in the nagios status.dat file over the > network from the nagios server, and does all the processing away from > the nagios collector.. This would move processing off of the nagios > collector.. You could use rsync to keep the two files in sync, keep a > duplicate on the display server on a local disk (or a tmpfs memory > based filesystem for extra speed)... We are studying the mysql backend, but my first shot was the tmpfs, and the recompilation with those flags you mentioned. Hope that exists something easier than setup a mysql backend, -- Marcel Mitsuto Fucatu Sugano Universo Online S.A. -------------- next part -------------- An HTML attachment was scrubbed... URL: From john at stilen.com Sat Oct 8 04:22:13 2005 From: john at stilen.com (John Stile) Date: Fri, 07 Oct 2005 19:22:13 -0700 Subject: help configuring nagios-mysql with nagiosgraph In-Reply-To: <1128553631.15324.46.camel@localhost> References: <1128385428.17094.77.camel@localhost> <1128553631.15324.46.camel@localhost> Message-ID: <1128738133.20644.9.camel@localhost> On Wed, 2005-10-05 at 16:07 -0700, John Stile wrote: > On Tue, 2005-10-04 at 10:45 +1000, John Stevens wrote: > > perfdata not recognized, usually means that the performance data > > passed to nagiosgraph did not match any of the map file's definitions. > > Read up on perl regexps and look at the map file. It is pretty well > > documented. Then look at your output > > > > On 10/4/05, John Stile wrote: > > The rrd directory is world writable, but no databases are > > created. > > > > The nagiosgraph log shows 'perfdata not recognized' > > Fri Sep 30 18:06:21 2005 INSERT info: Input lastcheck: > > Fri Sep 30 18:06:21 2005 INSERT info: Input hostname: > > Fri Sep 30 18:06:21 2005 INSERT info: Input perfdata: > > Fri Sep 30 18:06:21 2005 INSERT info: Input > > servicedescr: > > Fri Sep 30 18:06:21 2005 INSERT info: Input output: > > Fri Sep 30 18:06:21 2005 INSERT warn: perfdata not > > recognized > > This shows me that nagiosgraph is not receiving any information. > > There is no data for lastcheck, hostname, perfdata, servicedescr, or > > output. In other words, at best, an empty line. What is your > > nagios.cfg file like? > > You should have lines like this: > > host_perfdata_command=process-host-perfdata > > service_perfdata_command=process-service-perfdata > > process_performance_data=1 > > in nagios.cfg, and > > # 'process-service-perfdata' command definition > > define command{ > > command_name process-service-perfdata > > command_line /usr/local/bin/exec_perf "$HOSTNAME$" > > "$SERVICEDESC$" "$SERVICESTATE$" "$OUTPUT$" "$PERFDATA$" "$LASTCHECK$" > > } > > in misccommands.cfg. Sorry, I am not familiar with MySQL nagios > > backend, but unless you have these set up, and you know what kind of > > data is being passed to your process-service-data command, and the map > > file recognises your performance data and how to handle it, > > nagiosgraph will not produce rrds. It is the process-service-perfdata > > command that creates the rrds and updates them with new data. > > > > Regards > > Watching the logs, I never see an INSERT statement. The logged INSERT > > statement in my original email was because I ran insert.pl from the > > console with no options, so that is why all the data is missing. > > I'm stick stuck. What seems insane usually is due to a type-o, but I really need some > help on where to look next. > > >From the ground up: > 1. Compile flags for the Debian-3.1 package nagios-common (nagios v1.3): > ./configure --prefix=/usr \ > --mandir=/usr/share/man \ > --bindir=/usr/sbin \ > --sbindir=/usr/lib/cgi-bin/nagios \ > --datadir=/usr/share/nagios/htdocs \ > --sysconfdir=/etc/nagios \ > --infodir=/usr/share/info \ > --libexecdir=/usr/lib/nagios/plugins \ > --localstatedir=/var/log/nagios \ > --with-ping-command="/bin/ping -n %s -c %d" \ > --with-mail=/usr/bin/mail --with-perlcache \ > --with-nagios-user=nagios \ > --with-nagios-grp=nagios \ > --with-template-objects \ > --build=$(DEB_BUILD_GNU_TYPE) \ > --host=$(DEB_HOST_GNU_TYPE) \ > --with-pgsql-xdata \ > --with-mysql-xdata \ > --with-template-extinfo > > 2. In /etc/nagios/nagios.conf: > process_performance_data=1 > service_perfdata_command=process-service-perfdata > > 3. In /etc/nagios/misccommands.cfg > define command { > command_name process-service-perfdata > command_line /usr/share/nagios/nagiosgraph/insert.pl "$HOSTNAME$" "$LASTCHECK$||$HOSTNAME$||$SERVICEDESC$||$OUTPUT$||$PERFDATA$" >> /var/log/nagios/host-perfdata.out > } > > 4. Permissions on log files and rrd directory are 777, owned by nagios > daemon user and apache daemon group. > > 5. Nothing goes into /var/log/nagios/host-perfdata.out, no rrd databases > are created. > > So what does that tell me? It appears to mean that perfdata is never > sent to process-service-perfdata command? > > So do I load a bullet in the gun at this point? > Is the problem that --with-template-extinfo was used and it must be > --with-default-extinfo? Are they are mutually exclusive? SOLVED Debian-3.1 package nagios-common (v1.3) was not compiled with support for a nagios.cfg option "service_perfdata_command=" I didn't solve the problem in 'the Debian way' because rebuilding the deb package didn't work for me (due to my level of expertise). I downloaded the source package, and ran configure with Debian-like flags, then make and finally copied the binary to /usr/sbin/nagios. After a restart, my "service_perfdata_command=" directive started working as it should. I wish I know the Debian package maintainer. -------------------- The long version: --------------------- NOTE: found perfparse faq, discussing how Debian package does has perfdata turned off. This FAQ was for an older package version. http://perfparse.sourceforge.net/faq.php See section: Q. I'm running Nagios on Debian Linux, and I can't get Performance Data to work. I've tried everything, what could be my problem? # Default Nagios from Debian before changes: nagios -m # Nagios 1.3 # Copyright (c) 1999-2004 Ethan Galstad (nagios at nagios.org) # Last Modified: 10-24-2004 # License: GPL # # External Data I/O # ----------------- # Object Data: TEMPLATE # Status Data: DATABASE (MySQL) # Retention Data: DATABASE (MySQL) # Comment Data: DATABASE (MySQL) # Downtime Data: DATABASE (MySQL) # Performance Data: FILE # # Options # ------- # Fixing nagios vi /etc/apt/sources.list # deb-src http://ftp.us.debian.org/debian/ sarge main cd /tmp/ apt-get update apt-get source nagios-mysql cd ../nagios-1.3-cvs.20050402/ # Compile flags are listed in debian/rules # Rebuild with proper options # The only new option that matters is --with-default-perfdata ./configure --prefix=/usr/share/nagios \ --prefix=/usr \ --mandir=/usr/share/man \ --bindir=/usr/sbin \ --sbindir=/usr/lib/cgi-bin/nagios \ --datadir=/usr/share/nagios/htdocs \ --libexecdir=/usr/lib/nagios/plugins \ --sysconfdir=/etc/nagios \ --mandir=/usr/share/man \ --infodir=/usr/share/info \ --localstatedir=/var/log/nagios \ --with-ping-command="/bin/ping -n %s -c %d" \ --with-mail=/usr/bin/mail \ --with-perlcache \ --with-cgiurl=/nagios/cgi-bin \ --with-htmurl=/nagios \ --with-nagios-user=nagios \ --with-nagios-grp=nagios \ --with-default-perfdata \ --with-mysql-downtime \ --with-mysql-comments \ --with-mysql-status \ --with-mysql-retention \ --with-mysql-extinfo # backup old nagios mv /usr/sbin/nagios /usr/sbin/nagios.debian # keep a copy of my new nagios around, in-case an update blows it away cp base/nagios /usr/sbin/nagios.john # copy new nagios binary into place cp /usr/sbin/nagios.john /usr/sbin/nagios # Not check what nagios says is running nagios -m # Nagios 1.3 # Copyright (c) 1999-2004 Ethan Galstad (nagios at nagios.org) # Last Modified: 10-24-2004 # License: GPL # # External Data I/O # ----------------- # Object Data: TEMPLATE # Status Data: DATABASE (MySQL) # Retention Data: DATABASE (MySQL) # Comment Data: DATABASE (MySQL) # Downtime Data: DATABASE (MySQL) # Performance Data: DEFAULT # # Options # ------- /etc/init.d/nagios restart # Now we have Data! YA! ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Jason.Ellmers at firstgroup.com Sat Oct 8 04:27:17 2005 From: Jason.Ellmers at firstgroup.com (Jason Ellmers) Date: Sat, 8 Oct 2005 03:27:17 +0100 Subject: Jason Ellmers is out of the office. Message-ID: I will be out of the office starting 07/10/2005 and will not return until 17/10/2005. Please call Glenn Turner on 07901854363 in my absence ********************************************************************** This message is confidential. It may not be disclosed to, or used by, anyone other than the addressee. If you receive this message in error, please advise us immediately. Internet email is not necessarily secure. First does not accept responsibility for changes to any email which occur after the email has been sent. Attachments to this email could contain software viruses which could damage your system. First have checked the attachments for viruses before sending, but you should virus-check them before opening. For more information on our range of services or to book your tickets online, please visit:- http://www.firstgroup.com ********************************************************************** ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Sat Oct 8 10:02:41 2005 From: ae at op5.se (Andreas Ericsson) Date: Sat, 08 Oct 2005 10:02:41 +0200 Subject: [web interface problems] status.dat on tmpfs problem In-Reply-To: <1128710560.25162.28.camel@localhost> References: <433D24F3.5020708@op5.se> <1128710560.25162.28.camel@localhost> Message-ID: <43477D21.4090400@op5.se> Marcel Mitsuto Fucatu Sugano wrote: > Hi List, Andreas, > > On Fri, 2005-09-30 at 13:43 +0200, Andreas Ericsson wrote: > >>Lori Adams wrote: >> >>>I can't seem to get to your site. http://oss.op5.se/nagios. I am >>>trying to find the latest package or sources for nrpe on solaris. Would >>>your site have the most recent ones? >> >> >>Yes. My ISP is however thoroughly stubborn about their stupidity. I have >>dual static IP's at home, and they only seem to be capable of serving >>one at the time. I didn't even know about it until people started >>complaining about not being able to reach oss.op5.se (obviously, it >>works from the inside). >> > > > I've tried to place status.dat file on a tmpfs, and the status.cgi > begins to crash with "premature ending of cgi script" messages on > errorlog. And 500 Internal Server error display. > > What i should do to fix this situation? > I'm not sure, since I don't know what the actual errors are. However, writing the status.dat file to ramdisk won't improve performance much since the kernel will cache the file in memory anyways. If it hasn't got memory to spare for the tmpfs you'll end up using the swap partition which is actually more expensive to fetch from than a continuous block of data on a real partition. > I've mounted a tmpfs at /opt/nagios/var/tmpfs and pointed nagios to > write the status file at that path. My status.dat file is about 15MB. At > the nagios-devel list, I saw a lot of messages claiming for CGI scaling > improvements, the same problem I need to manage with the nagios config > i'm working. It's distributed monitoring with almost 10k of services > been actively monitored and all the status piped to send_nsca to a > central nagios server. This config gives me something about 300 alarms, > and the status.cgi takes real long to load. > > TIA, > Marcel Mitsuto > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, discussions, > and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Sat Oct 8 10:08:16 2005 From: ae at op5.se (Andreas Ericsson) Date: Sat, 08 Oct 2005 10:08:16 +0200 Subject: WEB-Interface performance In-Reply-To: <1128722447.25260.39.camel@localhost> References: <1128555158.11266.32.camel@localhost> <4344F968.9090701@aol.com> <1128722447.25260.39.camel@localhost> Message-ID: <43477E70.1060604@op5.se> Marcel Mitsuto Fucatu Sugano wrote: > Hello again, > > On Thu, 2005-10-06 at 11:16 +0100, Rob Moss wrote: > >>Yeah, there are some CFLAGS you could be using to optmise your build.. >>I am assuming that you have a recent version of GCC, and that your P4 >>HT cpu is shown as having two logical CPU's >> >>Try rebuilding with the following command: >> >>cd nagios-2.0b4 >>CC=gcc CFLAGS="-mtune=i686 -O3 -pipe -march=i686 -funroll-loops >>-ffast-math" \ >> ./configure --prefix=/usr/local/nagios ...... [rest of nagios >>configure commands] > > > I've tried this with no luck, having an error message displaying that > -ffast-math wasn't a recognized flag, with gcc-3.3.5. This was the > second try to handle the cgi's performance. The first thing we tryied, > was setting up a tmpfs mountpoint at /opt/nagios/var/tmpfs, and pointed > nagios to write the status file there, but again with some errors that > was a little weird, the cgi begins to ending prematurely (apache > errorlog), and displaying "500 internal server error". This happened to > 10% of the check_http running to test it :), and the response time > didn't get too much of a performace improvement as well. > > I don't think this could be Virtual Machine OS's fault, so the problem > might be with the status.cgi reading the tmpfs, but i can't tell for > sure, as we tried this setup yesterday. Is there anyone here who made > it? (status.dat been written in a tmpfs mountpoint?) > >>The main problem is that you have thousands of hosts, and thousands of >>services to read in every time you run status.cgi. No matter how >>efficient the program is, reading in and displaying that much data is >>going to take a while, and running the same program 15 times >>simultaneously is going to affect your performance as you see here. >>How many lines is the status.dat file? I only have a few hundred >>hosts and services, and the file is 23,000 lines or so, half a meg on >>disk.. I would imagine yours is closer to about 50mb and closer to a >>million lines. >>Some alternatives might be updating the Nagios sidebar so that it >>doesn't display ALL hosts by default, maybe just a smaller hostgroup.. >>(although i suspect the status.cgi needs to read in the whole file) Or >>replacing the standard nagios CGI's with something that is more geared >>towards handling hundreds of thousands of hosts/services... > > > I agree with you, and i've been reading the nagios-devel list, and saw > that the CGI is a problem to a lot of people who need to maintain some > BIG nagios configuration, over 10k services. I also watch that there is > a patch to improve the performance of the CGIs, but I couldn't find it > anywhere. > Browse the list archives. Search for "binary cgi" and stuff like that. I haven't tested it myself, but according to the author of the patch he noticed a performance increase of factor 30. I had objections to the patch because the code was definitely non-trivial and included quite a bit of black pointer magic. In short, I didn't like it because it would be hard to maintain and wouldn't work too well in certain circumstances. >>Or perhaps you could have a separate display server, a webserver >>running the cgi's which reads in the nagios status.dat file over the >>network from the nagios server, and does all the processing away from >>the nagios collector.. This would move processing off of the nagios >>collector.. You could use rsync to keep the two files in sync, keep a >>duplicate on the display server on a local disk (or a tmpfs memory >>based filesystem for extra speed)... > > > We are studying the mysql backend, but my first shot was the tmpfs, and > the recompilation with those flags you mentioned. Hope that exists > something easier than setup a mysql backend, nagios-db with postgresql is most likely the fastest option available today. We're planning on writing a new gui that will, hopefully, be more complete than the current nagios-db one and which will also use a database backend. I'll let the list know when there's something to download. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Sat Oct 8 10:09:51 2005 From: ae at op5.se (Andreas Ericsson) Date: Sat, 08 Oct 2005 10:09:51 +0200 Subject: Service Detail Displays Everything In-Reply-To: References: Message-ID: <43477ECF.2010508@op5.se> Joe Regular wrote: > Andreas, > > Thank you for taking the time to respond to my previous post. > > So far I have read the documentation section titled "Authentication And Authorization In The CGI's". I have completed the section describing how to create the htpasswd.users file (htpasswd -c /usr/local/nagios/etc/htpasswd.users) and I've created a user account for myself (htpasswd /usr/local/nagios/etc/htpasswd.users jregular). I then added my contact information to the contacts.cfg file and added myself to the correct groups in the contactgroups.cfg file. > > Having completed that, the problem is while I can only view the hosts to which I am a contact, I am able to see all services. As for notifications, I am only notified for problems with regards to my host, however, I am notified for all problems related to all services. > > My inbox is filling up very fast and any help would be greatly appreciated. > Services have their own contact_groups option. It sounds to me like you've only specified it properly for the hosts and have all your contactgroups in a service template or some such. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lkk at interfree.it Sat Oct 8 14:18:53 2005 From: lkk at interfree.it (lkk) Date: Sat, 08 Oct 2005 12:18:53 +0000 Subject: Premature end of script headers statusmap.cgi! In-Reply-To: <43467BC9.3020208@aol.com> References: <20051007133107.19610.qmail@community22.interfree.it> <43467BC9.3020208@aol.com> Message-ID: <1128773933.7165.7.camel@localhost> > This error is generated by Apache, probably because you haven't > installed the Nagios CGI's with the right permissions, or the cgi.cfg > file is not set up correctly. > > Read the error_log in your apache log directory and go from there, it > should help you track down the > > rob > this is the output of the command: ./statusmap.cgi ./statusmap.cgi: error while loadingb shared libraries: libgd.so.2_ cannot open share object file: No such file or directory and now? thancks! ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From barbereau at gmail.com Sat Oct 8 12:41:59 2005 From: barbereau at gmail.com (=?ISO-8859-1?Q?S=E9bastien_Barbereau?=) Date: Sat, 8 Oct 2005 12:41:59 +0200 Subject: WEB-Interface performance In-Reply-To: <43477E70.1060604@op5.se> References: <1128555158.11266.32.camel@localhost> <4344F968.9090701@aol.com> <1128722447.25260.39.camel@localhost> <43477E70.1060604@op5.se> Message-ID: <4e0e33ee0510080341p6f21fc0k63af050b744e72a5@mail.gmail.com> Hi to all, just adding my 2cents here: I agree that html page generation from the CGI is one of the major performance bottlenecks for display in Nagios. This is particularly true when you have 10 users querying the same CGI every 90seconds ... but, why not use some caching mechanism to prevent the repeated cgi execution? You dont't have to modify nagios for this, you could just use apaches mod_cache / mod_proxy or squid for example. Seb.Barbereau On 10/8/05, Andreas Ericsson wrote: > > Marcel Mitsuto Fucatu Sugano wrote: > > Hello again, > > > > On Thu, 2005-10-06 at 11:16 +0100, Rob Moss wrote: > > > >>Yeah, there are some CFLAGS you could be using to optmise your build.. > >>I am assuming that you have a recent version of GCC, and that your P4 > >>HT cpu is shown as having two logical CPU's > >> > >>Try rebuilding with the following command: > >> > >>cd nagios-2.0b4 > >>CC=gcc CFLAGS="-mtune=i686 -O3 -pipe -march=i686 -funroll-loops > >>-ffast-math" \ > >> ./configure --prefix=/usr/local/nagios ...... [rest of nagios > >>configure commands] > > > > > > I've tried this with no luck, having an error message displaying that > > -ffast-math wasn't a recognized flag, with gcc-3.3.5. This was the > > second try to handle the cgi's performance. The first thing we tryied, > > was setting up a tmpfs mountpoint at /opt/nagios/var/tmpfs, and pointed > > nagios to write the status file there, but again with some errors that > > was a little weird, the cgi begins to ending prematurely (apache > > errorlog), and displaying "500 internal server error". This happened to > > 10% of the check_http running to test it :), and the response time > > didn't get too much of a performace improvement as well. > > > > I don't think this could be Virtual Machine OS's fault, so the problem > > might be with the status.cgi reading the tmpfs, but i can't tell for > > sure, as we tried this setup yesterday. Is there anyone here who made > > it? (status.dat been written in a tmpfs mountpoint?) > > > >>The main problem is that you have thousands of hosts, and thousands of > >>services to read in every time you run status.cgi. No matter how > >>efficient the program is, reading in and displaying that much data is > >>going to take a while, and running the same program 15 times > >>simultaneously is going to affect your performance as you see here. > >>How many lines is the status.dat file? I only have a few hundred > >>hosts and services, and the file is 23,000 lines or so, half a meg on > >>disk.. I would imagine yours is closer to about 50mb and closer to a > >>million lines. > >>Some alternatives might be updating the Nagios sidebar so that it > >>doesn't display ALL hosts by default, maybe just a smaller hostgroup.. > >>(although i suspect the status.cgi needs to read in the whole file) Or > >>replacing the standard nagios CGI's with something that is more geared > >>towards handling hundreds of thousands of hosts/services... > > > > > > I agree with you, and i've been reading the nagios-devel list, and saw > > that the CGI is a problem to a lot of people who need to maintain some > > BIG nagios configuration, over 10k services. I also watch that there is > > a patch to improve the performance of the CGIs, but I couldn't find it > > anywhere. > > > > Browse the list archives. Search for "binary cgi" and stuff like that. I > haven't tested it myself, but according to the author of the patch he > noticed a performance increase of factor 30. I had objections to the > patch because the code was definitely non-trivial and included quite a > bit of black pointer magic. In short, I didn't like it because it would > be hard to maintain and wouldn't work too well in certain circumstances. > > >>Or perhaps you could have a separate display server, a webserver > >>running the cgi's which reads in the nagios status.dat file over the > >>network from the nagios server, and does all the processing away from > >>the nagios collector.. This would move processing off of the nagios > >>collector.. You could use rsync to keep the two files in sync, keep a > >>duplicate on the display server on a local disk (or a tmpfs memory > >>based filesystem for extra speed)... > > > > > > We are studying the mysql backend, but my first shot was the tmpfs, and > > the recompilation with those flags you mentioned. Hope that exists > > something easier than setup a mysql backend, > > nagios-db with postgresql is most likely the fastest option available > today. We're planning on writing a new gui that will, hopefully, be more > complete than the current nagios-db one and which will also use a > database backend. > > I'll let the list know when there's something to download. > > -- > Andreas Ericsson andreas.ericsson at op5.se > OP5 AB www.op5.se > Tel: +46 8-230225 Fax: +46 8-230231 > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, discussions, > and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: From al at its-lehmann.de Sat Oct 8 14:37:21 2005 From: al at its-lehmann.de (Arno Lehmann) Date: Sat, 08 Oct 2005 14:37:21 +0200 Subject: Premature end of script headers statusmap.cgi! In-Reply-To: <1128773933.7165.7.camel@localhost> References: <20051007133107.19610.qmail@community22.interfree.it> <43467BC9.3020208@aol.com> <1128773933.7165.7.camel@localhost> Message-ID: <4347BD81.5060801@its-lehmann.de> Hello, On 08.10.2005 14:18, lkk wrote: > > >>This error is generated by Apache, probably because you haven't >>installed the Nagios CGI's with the right permissions, or the cgi.cfg >>file is not set up correctly. >> >>Read the error_log in your apache log directory and go from there, it >>should help you track down the >> >>rob >> > > > this is the output of the command: ./statusmap.cgi > > ./statusmap.cgi: error while loadingb shared libraries: libgd.so.2_ > cannot open share object file: No such file or directory > > and now? Please read the manual, the README file or whatever it is called, and the output of ./configure and make. The list archive has the same information about once every three weeks. As far as I recall, in all four locations you find the explanation that, without usable gd installation, the cgis will be built but the statusmap.cgi will not. There are even suggestions how to resolve that. Arno > > > thancks! > > > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, discussions, > and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- IT-Service Lehmann al at its-lehmann.de Arno Lehmann http://www.its-lehmann.de ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Sat Oct 8 14:38:56 2005 From: ae at op5.se (Andreas Ericsson) Date: Sat, 08 Oct 2005 14:38:56 +0200 Subject: Premature end of script headers statusmap.cgi! In-Reply-To: <1128773933.7165.7.camel@localhost> References: <20051007133107.19610.qmail@community22.interfree.it> <43467BC9.3020208@aol.com> <1128773933.7165.7.camel@localhost> Message-ID: <4347BDE0.2000107@op5.se> lkk wrote: > > >>This error is generated by Apache, probably because you haven't >>installed the Nagios CGI's with the right permissions, or the cgi.cfg >>file is not set up correctly. >> >>Read the error_log in your apache log directory and go from there, it >>should help you track down the >> >>rob >> > > > this is the output of the command: ./statusmap.cgi > > ./statusmap.cgi: error while loadingb shared libraries: libgd.so.2_ > cannot open share object file: No such file or directory > > and now? > man ldd Read it carefully and then just fix the problem. Now I've told you everything you need to know to figure it out for yourself, although I won't think less of you if you dig the answer out of the list archives or the FAQ rather than applying some small amount of brainpower to it. I will however think less of you if you ask more questions that have been answered ad nauseum without even bothering to make an effort of fixing them yourself or at least submit some details of how, exactly, you went about installing nagios and on which kind of hardware, OS and whatnot it's running. Oh, and if you're using gcc 2.96 to compile the thing with, I need to know that too. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From joeregular at gov.nl.ca Sat Oct 8 18:55:22 2005 From: joeregular at gov.nl.ca (Joe Regular) Date: Sat, 08 Oct 2005 14:25:22 -0230 Subject: Service Detail Displays Everything Message-ID: >>> Andreas Ericsson 10/08/05 5:39 AM >>> Joe Regular wrote: > Andreas, > > Thank you for taking the time to respond to my previous post. > > So far I have read the documentation section titled "Authentication And Authorization In The CGI's". I have completed the section describing how to create the htpasswd.users file (htpasswd -c /usr/local/nagios/etc/htpasswd.users) and I've created a user account for myself (htpasswd /usr/local/nagios/etc/htpasswd.users jregular). I then added my contact information to the contacts.cfg file and added myself to the correct groups in the contactgroups.cfg file. > > Having completed that, the problem is while I can only view the hosts to which I am a contact, I am able to see all services. As for notifications, I am only notified for problems with regards to my host, however, I am notified for all problems related to all services. > > My inbox is filling up very fast and any help would be greatly appreciated. > Services have their own contact_groups option. It sounds to me like you've only specified it properly for the hosts and have all your contactgroups in a service template or some such. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 -------------------------------------------- Thanks again Andreas, I've checked the service.cfg file and under the contact_groups section, only those groups responsible for the services are listed. Is there an issue with having multiple groups listed in the contact_groups section by any chance? Joe. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From JoeRegular at gov.nl.ca Sat Oct 8 21:57:27 2005 From: JoeRegular at gov.nl.ca (Joe Regular) Date: Sat, 08 Oct 2005 17:27:27 -0230 Subject: Service Detail Displays Everything Getting Closer Message-ID: After playing around with the config files I've managed to narrow down the problem a little better. I have two contact_groups called NTAdmins and NWAdmins. The NWAdmins can see all Netwere hosts and all Netware services. The NTAdmins can see all NT hosts and all NT services. However, the NTAdmins can also see all NW services. I've checked the contact_groups settings in both the nwservices.cfg and the ntservices.cfg but I don't see anything linking the NTAdmins to the NW services. Joe ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From msugano at uolinc.com Sat Oct 8 23:06:17 2005 From: msugano at uolinc.com (Marcel Mitsuto Fucatu Sugano) Date: Sat, 08 Oct 2005 18:06:17 -0300 Subject: WEB-Interface performance In-Reply-To: <4e0e33ee0510080341p6f21fc0k63af050b744e72a5@mail.gmail.com> References: <1128555158.11266.32.camel@localhost> <4344F968.9090701@aol.com> <1128722447.25260.39.camel@localhost> <43477E70.1060604@op5.se> <4e0e33ee0510080341p6f21fc0k63af050b744e72a5@mail.gmail.com> Message-ID: <1128805578.1154.17.camel@localhost> On Sat, 2005-10-08 at 12:41 +0200, S?bastien Barbereau wrote: > Hi to all, just adding my 2cents here: I agree that html page > generation from the CGI is one of the major performance bottlenecks > for display in Nagios. This is particularly true when you have 10 > users querying the same CGI every 90seconds ... but, why not use some > caching mechanism to prevent the repeated cgi execution? You dont't > have to modify nagios for this, you could just use apaches mod_cache / > mod_proxy or squid for example. > Seb.Barbereau Out apache config is already with mod_cache && mod_proxy. But I am still trying to improve performance, looks like bd backend process is the way to go. Thank you all for your answers. Cheers, -- Marcel Mitsuto Fucatu Sugano Universo Online S.A. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From misc at viceconsulting.co.nz Mon Oct 10 02:58:04 2005 From: misc at viceconsulting.co.nz (misc at viceconsulting.co.nz) Date: Mon, 10 Oct 2005 13:58:04 +1300 (NZDT) Subject: Max number of services that can be monitored ? Message-ID: <49361.127.0.0.1.1128905884.squirrel@www.goldenfields.co.nz> All, I have a Nagios server that is monitoring about 60 hosts / 1000 services, however the CPU is pretty much continually maxed out at 100%, and the average active check latency is around 28 seconds (compared to 0.1 seconds with my other Nagios server which are monitoring < 100 services). I was wondering, with 256MB ram and a 2.8GHz P4, should I be maxing out Nagios at 1000 services? I was thinking this seems a little low, and thought I could probably be monitoring around 10,000 services before worrying about maxing out the machine's resources... Is what's happening with my Nagios server normal, or have I made a blatant configuration error that is slowing everything down? What other hardware / number of services are people using / monitoring without maxing out their server's resources? Also curious is there is any rule of thumbs for determining the max number of services you can schedule comfortably on a given server? I'm runinng Nagios 2.0b4 (embedded Perl DISABLED) on redhat es 3 update 5. Cheers, -Alex. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From dido at imperium.ph Mon Oct 10 03:52:56 2005 From: dido at imperium.ph (Rafael 'Dido' Sevilla) Date: Mon, 10 Oct 2005 09:52:56 +0800 Subject: Max number of services that can be monitored ? In-Reply-To: <49361.127.0.0.1.1128905884.squirrel@www.goldenfields.co.nz> References: <49361.127.0.0.1.1128905884.squirrel@www.goldenfields.co.nz> Message-ID: <4349C978.50700@imperium.ph> misc at viceconsulting.co.nz wrote: > I was wondering, with 256MB ram and a 2.8GHz P4, should I be maxing out > Nagios at 1000 services? I was thinking this seems a little low, and > thought I could probably be monitoring around 10,000 services before > worrying about maxing out the machine's resources... > That depends on what service checks you are using. If it's something complicated, things are a little messy. Different service checks have got different requirements so it's fairly difficult to state a rule of thumb. > What other hardware / number of services are people using / monitoring > without maxing out their server's resources? Now, we're running a distributed Nagios cluster with seven DSes, monitoring a total of some 30,000 services, giving each DS an average of about 4285 services (mostly ping and SNMP interface status). The CPU on these boxes maxes out every so often but later enters a period of quiescence; from an auxiliary Nagios grapher that we use to watch trends in their behavior, it appears to be cyclic (corresponding, no doubt to the check execution/service reaper cycle of Nagios). Each DS is a Xeon 2.8 with 1 GB of RAM though. Maybe it's thrashing you're experiencing, as memory is always maxed out on our boxes, but fortunately, it only very seldom uses swap. -- True compassion is more than throwing a coin to a beggar. It demands of our humanity that if we live in a society that produces beggars, we are morally commanded to restructure that society. http://stormwyrm.blogspot.com/ ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ron at tvnz.co.nz Mon Oct 10 04:35:26 2005 From: ron at tvnz.co.nz (Ron Wilson) Date: Mon, 10 Oct 2005 15:35:26 +1300 Subject: Monitoring Nagios Message-ID: What are people using to monitor Nagios from an external source. I just want something that can page me if Nagios crashes etc. Are there any freeware ping and alert software? ===================================================================== CAUTION: This e-mail and any attachment(s) contains information that is intended to be read only by the named recipient(s). It may contain information that is confidential, proprietary or the subject of legal privilege. This information is not to be used by any other person and/or organisation. If you are not the intended recipient, please advise us immediately and delete this e-mail from your system. Do not use any information contained in it. ================================================================ For more information on the Television New Zealand Group, visit us online at http://www.tvnz.co.nz ================================================================ -------------- next part -------------- An HTML attachment was scrubbed... URL: From RLAdams at AdamsInfoServ.Com Mon Oct 10 06:15:14 2005 From: RLAdams at AdamsInfoServ.Com (Russell Adams) Date: Sun, 9 Oct 2005 23:15:14 -0500 Subject: Monitoring Nagios In-Reply-To: References: Message-ID: <20051010041514.GA7767@keroberos> Make nagios a child of init in inittab, and respawn automatically. Russell On Mon, Oct 10, 2005 at 03:35:26PM +1300, Ron Wilson wrote: > > What are people using to monitor Nagios from an external source. I just > want something that can page me if Nagios crashes etc. Are there any > freeware ping and alert software? > > > > > > ===================================================================== > CAUTION: This e-mail and any attachment(s) contains information that > is intended to be read only by the named recipient(s). It may contain > information that is confidential, proprietary or the subject of legal > privilege. This information is not to be used by any other person > and/or organisation. If you are not the intended recipient, please > advise us immediately and delete this e-mail from your system. Do not > use any information contained in it. > > ================================================================ > For more information on the Television New Zealand Group, visit us > online at http://www.tvnz.co.nz > ================================================================------------------------------------------------------------------ Russell Adams RLAdams at AdamsInfoServ.com PGP Key ID: 0x1160DCB3 http://www.adamsinfoserv.com/ Fingerprint: 1723 D8CA 4280 1EC9 557F 66E8 1154 E018 1160 DCB3 -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 189 bytes Desc: not available URL: From ankushgrover1711 at gmail.com Mon Oct 10 07:46:15 2005 From: ankushgrover1711 at gmail.com (ankush grover) Date: Mon, 10 Oct 2005 11:16:15 +0530 Subject: Does nagios provides such details Message-ID: hey friends, I wanted to know something regarding the working of nagios.Recently I build a cluster for Apache & Mysql(ultramonkey).Now I am looking for a tool which can give me details like this the no.of connections to apache & mysql,load on Apache & Mysql,bandwidth consumed by both Apache & Mysql,Users connected to the Mysql/Apache ,IPAddresses of the clients connected to apache/mysql at the given time means If I refresh the page the nagios should give me the current details. Does nagios gives such kind of details. Thanks & Regards Ankush Grover -------------- next part -------------- An HTML attachment was scrubbed... URL: From maca02 at atlas.cz Mon Oct 10 08:19:27 2005 From: maca02 at atlas.cz (Tomas Macek) Date: Mon, 10 Oct 2005 08:19:27 +0200 (CEST) Subject: HOSTSTATETYPE problem Message-ID: Hi, I have this piece of main.c of my nagios plugin: --------------------------------------------- int main(int argc, char *argv[]) { netsnmp_pdu *response; oid anOID[MAX_OID_LEN]; size_t anOID_len; int i = 0; char ret_pass[9]; int status = STATE_UNKNOWN, outlet = -1; struct stat st; if (argc < 2) { printf("Usage: routlet \n\thost - host to check\n\toutlet - outlet to restart"); return STATE_UNKNOWN; } syslog(LOG_NOTICE, "routlet: '%s' '%s' '%s'", argv[1], argv[2], argv[3]); ... ... ... } --------------------------------------------- and I have this [command] line in my nagios config file /etc/nagios/commands.cfg: -------------------- command[resetOutletSVIt]=$USER1$/routlet 192.168.1.1 5 $HOSTSTATETYPE$ -------------------- And finally - this is output in the /var/log/messages file when the plugin was run: ------------------- routlet: outlet: '213.250.192.60' '5' '$' ------------------- I'm using Nagios version 1.2. Why I got this output, when other variables like $HOSTATTEMPT$ works fine? Any help appreciated!! ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ton.voon at altinity.com Mon Oct 10 10:02:58 2005 From: ton.voon at altinity.com (Ton Voon) Date: Mon, 10 Oct 2005 09:02:58 +0100 Subject: Who is maintaining NRPE? In-Reply-To: <4337BDF7.2010407@op5.se> References: <4333170E.6020709@op5.se> <4337BDF7.2010407@op5.se> Message-ID: Andreas, Sorry for the delay. Please find the nrpe 2.2 patch below. This applies cleanly to your latest sources. Patch is to allow compiles on Tru64 and IRIX. This should also fix tracker bug 1164370: http://sourceforge.net/ tracker/index.php?func=detail&aid=1164370&group_id=110412&atid=656217 Are you actively maintaining this bug list? If not, where are you working from? On 26 Sep 2005, at 10:23, Andreas Ericsson wrote: > Ton Voon wrote: >> If so, I don't think this is in the spirit of openess. Can I >> suggest that you use Sourceforge to host the NRPE code? >> > > You can suggest it, but it won't happen. The sourceforge CVS is > near enough impossible to work with from sweden, due to a 400ms > latency. > > I can set them up for public access on oss.op5.se though. Great. Roughly when will this be? I also note that the Changelog file hasn't been updated since you took over. Could you update those please (especially since we don't have CVS access :) ) Ton http://www.altinity.com T: +44 (0)870 787 9243 F: +44 (0)845 280 1725 Skype: tonvoon ? -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: nrpe2.2.patch Type: application/octet-stream Size: 4649 bytes Desc: not available URL: -------------- next part -------------- An HTML attachment was scrubbed... URL: From sauk0_s at yahoo.es Mon Oct 10 10:24:25 2005 From: sauk0_s at yahoo.es (sauk0) Date: Mon, 10 Oct 2005 10:24:25 +0200 Subject: check_oracle Message-ID: <434A2539.90408@yahoo.es> I have a problem to check_oracle plugin Somebody use this plugin ? , How i monitoring oracle tablespace ? any example ? thank you ______________________________________________ Renovamos el Correo Yahoo! Nuevos servicios, m?s seguridad http://correo.yahoo.es ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jamie.baddeley at vpc.co.nz Mon Oct 10 10:37:36 2005 From: jamie.baddeley at vpc.co.nz (Jamie Baddeley) Date: Mon, 10 Oct 2005 21:37:36 +1300 Subject: Read Only Web Interface User In-Reply-To: <6943DE9E9934CD42BBE4A20E0E28E10E011C9549@SYGOMAIL.GOE.AD.SYCOR.DE> References: <6943DE9E9934CD42BBE4A20E0E28E10E011C9549@SYGOMAIL.GOE.AD.SYCOR.DE> Message-ID: <1128933456.13610.17.camel@munter> was there ever an answer to this, and I'm running into the same problem. Cheers jamie On Wed, 2005-01-12 at 15:52 +0100, Sand Philipp wrote: > Hi, > > Sorry when I'm to blind, but I can't find this in the Documentation... I'm using 2.0a1 with cgi authentication "1". > Now I want to set up an user for an customer, who should view some hosts, do some reports, but who shouldn't access the extinfo.cgi to commit acknowlegements or other service/host commands. > > Until now, I use the contact_groups definition to add users/groups, who have access to view the service/host. Problem is, that those users are automiatically allowed to access the extinfo.cgi when they are contact to the service/host. > > What can I do? > Thanks in advance, > Philipp > > > _____________________________ > > Philipp Sand > OC-CC-TEC-SYS > > SYCOR GmbH > Heinrich-von-Stephan-Stra?e 1-5 > D - 37073 G?ttingen > > Telefon +49 (0) 551 / 490 - 0 > Telefax +49 (0) 551 / 490 - 232468 > > philipp.sand at sycor.de > www.sycor.de > ------------------------------------------------ > > > > ************************************************ > > sycor plastics - die neue Branchenl?sung f?r die Kunststoffindustrie > > www.sycor-plastics.de > > ************************************************ > > > > > Diese E-Mail ist vertraulich und kann dar?ber hinaus pers?nliche Informationen beinhalten. Wenn Sie nicht der bestimmungsgem??e Empf?nger sind, l?schen Sie bitte die E-Mail und deren Anh?nge sofort und benachrichtigen Sie uns dar?ber. Die Firma sycor willigt in keine Vertr?ge oder vertragliche Verpflichtungen ein oder ?bermittelt rechtsverbindliche Angebote, die in Form von E-Mail versandt werden, sofern dies nicht ausdr?cklich in schriftlicher Form zwischen den Parteien vereinbart wurde. > > This e-mail is confidential and may contain personal and/or privileged information. If you are not the intended recipient please delete this e-mail and all attachments immediately and inform us. The company sycor does not agree with contracts or contract obligations sent by e-mail, neither do we transmit legally binding offers by e-mail, unless this is not expressly agreed upon between the parties and documented in written form. > > > > ------------------------------------------------------- > The SF.Net email is sponsored by: Beat the post-holiday blues > Get a FREE limited edition SourceForge.net t-shirt from ThinkGeek. > It's fun and FREE -- well, almost....http://www.thinkgeek.com/sfshirt > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Jamie Baddeley ViewPoint Consulting Ltd ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From robmossrm at aol.com Mon Oct 10 11:41:28 2005 From: robmossrm at aol.com (Rob Moss) Date: Mon, 10 Oct 2005 10:41:28 +0100 Subject: WEB-Interface performance In-Reply-To: <1128722447.25260.39.camel@localhost> References: <1128555158.11266.32.camel@localhost> <4344F968.9090701@aol.com> <1128722447.25260.39.camel@localhost> Message-ID: <434A3748.6060206@aol.com> Marcel Mitsuto Fucatu Sugano wrote: > Hello again, > > On Thu, 2005-10-06 at 11:16 +0100, Rob Moss wrote: > >> Yeah, there are some CFLAGS you could be using to optmise your >> build.. I am assuming that you have a recent version of GCC, and that >> your P4 HT cpu is shown as having two logical CPU's >> >> Try rebuilding with the following command: >> >> cd nagios-2.0b4 >> CC=gcc CFLAGS="-mtune=i686 -O3 -pipe -march=i686 -funroll-loops >> -ffast-math" \ >> ./configure --prefix=/usr/local/nagios ...... [rest of nagios >> configure commands] > > > I've tried this with no luck, having an error message displaying that > -ffast-math wasn't a recognized flag, with gcc-3.3.5. This was the > second try to handle the cgi's performance. The first thing we tryied, > was setting up a tmpfs mountpoint at /opt/nagios/var/tmpfs, and > pointed nagios to write the status file there, but again with some > errors that was a little weird, the cgi begins to ending prematurely > (apache errorlog), and displaying "500 internal server error". This > happened to 10% of the check_http running to test it :), and the > response time didn't get too much of a performace improvement as well. > > I don't think this could be Virtual Machine OS's fault, so the problem > might be with the status.cgi reading the tmpfs, but i can't tell for > sure, as we tried this setup yesterday. Is there anyone here who made > it? (status.dat been written in a tmpfs mountpoint?) The flag -ffast-math is in GCC 3.4.2, but I had assumed it was part of the 3.x version of GCC. You can simply omit that flag, and compiling should work again. (I'm surprised you didn't do that yourself). What you are getting into is not easy, you are off the beaten track and it will be a rocky ride until you get everything right. When you see errors like an Apache 500 error, please look through the mailing list for common solutions before posting about it. When you moved the status.dat file into the tmpfs volume, did you make a symlink from where the file came from (make a symlink from /opt/nagios/var/status.dat -> /opt/nagios/var/tmpfs/status.dat) ? Also, when you talk about the Virtual Machine OS.. are you running the O/S under VMWare? You do realise that Emulating an O/S is going to be slower and less stable, right? Cheers rob. -------------- next part -------------- An HTML attachment was scrubbed... URL: From robmossrm at aol.com Mon Oct 10 11:48:27 2005 From: robmossrm at aol.com (Rob Moss) Date: Mon, 10 Oct 2005 10:48:27 +0100 Subject: Max number of services that can be monitored ? In-Reply-To: <49361.127.0.0.1.1128905884.squirrel@www.goldenfields.co.nz> References: <49361.127.0.0.1.1128905884.squirrel@www.goldenfields.co.nz> Message-ID: <434A38EB.70604@aol.com> misc at viceconsulting.co.nz wrote: >All, > >I have a Nagios server that is monitoring about 60 hosts / 1000 services, >however the CPU is pretty much continually maxed out at 100%, and the >average active check latency is around 28 seconds (compared to 0.1 seconds >with my other Nagios server which are monitoring < 100 services). > >I was wondering, with 256MB ram and a 2.8GHz P4, should I be maxing out >Nagios at 1000 services? I was thinking this seems a little low, and >thought I could probably be monitoring around 10,000 services before >worrying about maxing out the machine's resources... > > You haven't sent in any information on what your server is actually doing, so we can't offer much help there. Amount of ram seems low, how much RAM is free when you're running Nagios? What standard service checks are you running for each server? Are you running any of the Perl plugins (and having to launch Perl each time you do a service check?) >Is what's happening with my Nagios server normal, or have I made a blatant >configuration error that is slowing everything down? > > Lack of information provided, so can't provide any clues. >I'm runinng Nagios 2.0b4 (embedded Perl DISABLED) on redhat es 3 update 5. > > As with the service checks. With the Embeded Perl interpereter disabled, if you run any Perl based checks, you will have to launch Perl each and every time. Cheers rob. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From forums at emat.be Mon Oct 10 12:22:32 2005 From: forums at emat.be (Jelle S.) Date: Mon, 10 Oct 2005 12:22:32 +0200 Subject: Does nagios provides such details In-Reply-To: References: Message-ID: <434A40E8.4010206@emat.be> Hi Ankush, You can use nagios to process the amount of cheescakes left in your fridge, if you write your own plugin for this. Basically, Nagios "just" interprets the output of a plugin which is included with the official nagios plugins, or you write one yourself. What the function of this plugin is, you descide yourself. Check out http://nagiosplug.sourceforge.net/developer-guidelines.html for details. That's the power behind Nagios. ankush grover wrote: > hey friends, > > I wanted to know something regarding the working of nagios.Recently I > build a cluster for Apache & Mysql(ultramonkey).Now I am looking for a > tool which can give me details like this the no.of connections to > apache & mysql,load on Apache & Mysql,bandwidth consumed by both > Apache & Mysql,Users connected to the Mysql/Apache ,IPAddresses of the > clients connected to apache/mysql at the given time means If I refresh > the page the nagios should give me the current details. > > Does nagios gives such kind of details. > > Thanks & Regards > > Ankush Grover > > > > -- > This message has been scanned for viruses and > dangerous content by FrontLine, and is believed to be clean. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From roux.emmanuel at gmail.com Mon Oct 10 12:26:13 2005 From: roux.emmanuel at gmail.com (Emmanuel Roux) Date: Mon, 10 Oct 2005 12:26:13 +0200 Subject: Event unhandled in Cgi's but Acknowledged .. Message-ID: <478bd3690510100326l1c5bbe80w@mail.gmail.com> Hello, Since last Week on my Nagios server (1.2) I have one bug ..I don't change anything apart upgraded my system with last security patches (debian Sarge). when go on my cgi tac.cgi i have got my service or host (previously aknowledged) on critical Acknowledged state and when i refresh sometime it appear on unhandled problem state (i verify the state don't change beetwen refreshs..) ...strange sometime got an idea or already got this pb ? thx Emmanuel ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ankushgrover1711 at gmail.com Mon Oct 10 13:47:07 2005 From: ankushgrover1711 at gmail.com (ankush grover) Date: Mon, 10 Oct 2005 17:17:07 +0530 Subject: getting error while starting the nagios service on FC3 Message-ID: hey friends, I created some files like hostgroups.cfg,hosts.cfg,services.cfg,contact.cfgand contactgroups.cfg.But when I tried to start the nagios service it gave me error.Then I tried to start with this command nagios -v /etc/nagios/nagios.cfg it gave me this error. Nagios 2.0b4 Copyright (c) 1999-2005 Ethan Galstad (http://www.nagios.org) Last Modified: 08-02-2005 License: GPL Reading configuration data... Error: Invalid hostgroup object directive 'contact_groups'. Error: Could not add object property in file '/etc/nagios/hostgroups.cfg' on line 4. ***> One or more problems was encountered while processing the config files... Check your configuration file(s) to ensure that they contain valid directives and data defintions. If you are upgrading from a previous version of Nagios, you should be aware that some variables/definitions may have been removed or modified in this version. Make sure to read the HTML documentation regarding the config files, as well as the 'Whats New' section to find out what has changed. I am using Fedora Core 3 both firewall and SELinux are disabled on my pc. Below are my configured files Services.cfg file define service{ use generic-service host_name cluster1 service_description HTTP is_volatile 0 check_period 24x7 max_check_attempts 3 normal_check_interval 2 retry_check_interval 1 contact_groups linux-admins notification_interval 240 notification_period 24x7 notification_options w,u,c,r check_command check_http } Hosts.cfg file define host{ use generic-host host_name cluster1 alias Apache webserver address 192.168.1.68 check_command check-host-alive max_check_attempts 10 notification_interval 120 notification_period 24x7 notification_options d,u,r,w,c } Hostgroups.cfg file define hostgroup{ hostgroup_name www alias Web Servers contact_groups linux-admins members cluster1 } Contacts.cfg file define contact{ contact_name ankush alias ankush grover service_notification_period 24x7 host_notification_period 24x7 service_notification_options w,u,c,r host_notification_options d,u,r service_notification_commands notify-by-email host_notification_commands host-notify-by-email email ankush174 at gmail.com } Ccontactgroups.cfg file define contactgroup{ contactgroup_name linux-admins alias Linux Admins members ankush } I have added /etc/nagios/checkcommands.cfg to nagios.cfg(cfg_file) and also added /etc/nagios/command-plugins.cfg nagios.cfg(cfg_file) I am very new to nagios. Please help me. Thanks & Regards Ankush Grover -------------- next part -------------- An HTML attachment was scrubbed... URL: From tom at dyndns.com Mon Oct 10 14:31:08 2005 From: tom at dyndns.com (Tom Daly) Date: Mon, 10 Oct 2005 08:31:08 -0400 (EDT) Subject: Installation of Nagios 1.2 on FreeBSD 5.4 Message-ID: Hello, We recently installed Nagios 1.2 from source onto a FreeBSD 5.4 machine. When using the method "make init-install", a script named "nagios" was placed into the machine's /usr/local/etc/rc.d/ directory. We know that this is supposed to be Nagios' init script, however, on FreeBSD, init scripts must have the file extension ".sh" per the FreeBSD handbook. "While every script must meet some minimal requirements, most of the time these requirements are FreeBSD version agnostic. Each script must have a .sh extension appended to the end and every script must be executable by the system. The latter may be achieved by using the chmod command and setting the unique permissions of 755. There should also be, at minimal, an option to start the application and an option to stop the application." While this is easy to correct, I'm wondering if others have come across this, and what the proper channels are to notify the author of this platform-nostic issue. Warm Regards, Tom Daly -- Thomas J. Daly tom at dyndns.com Dynamic Network Services, Inc. http://www.dyndns.com/ ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From robmossrm at aol.com Mon Oct 10 15:26:07 2005 From: robmossrm at aol.com (Rob Moss) Date: Mon, 10 Oct 2005 14:26:07 +0100 Subject: getting error while starting the nagios service on FC3 In-Reply-To: References: Message-ID: <434A6BEF.2000603@aol.com> ankush grover wrote: > I created some files like > hostgroups.cfg,hosts.cfg,services.cfg,contact.cfg and > contactgroups.cfg.But when I tried to start the nagios service it gave > me error.Then I tried to start with this command nagios -v > /etc/nagios/nagios.cfg it gave me this error. > [snip] > Error: Invalid hostgroup object directive 'contact_groups'. > Error: Could not add object property in file > '/etc/nagios/hostgroups.cfg' on line 4. > Did you check the hostgroup object directive 'contact_groups' in your file '/etc/nagios/hostgroups.cfg' on line 4? You seem to be using a "contact_groups" directive in the wrong place. Read this page for the format http://nagios.sourceforge.net/docs/2_0/xodtemplate.html#hostgroup And for good measure, read and bookmark this too http://nagios.sourceforge.net/docs/2_0/xodtemplate.html#contact > I am very new to nagios. That much is certain, however you should make attempts at reading the error message(s), investigating the problem yourself and searching previous list postings before posting to the list. rob ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jhmartin at toger.us Mon Oct 10 16:01:22 2005 From: jhmartin at toger.us (Jason Martin) Date: Mon, 10 Oct 2005 10:01:22 -0400 Subject: Does nagios provides such details In-Reply-To: References: Message-ID: <20051010140122.GA24395@mal.members.linode.com> On Mon, Oct 10, 2005 at 11:16:15AM +0530, ankush grover wrote: > on Apache & Mysql,bandwidth consumed by both Apache & Mysql,Users connected > to the Mysql/Apache ,IPAddresses of the clients connected to apache/mysql at > the given time means If I refresh the page the nagios should give me the > current details. I don't believe the current Nagios plugins for these products return that data, and in either case the data would be as of the last check instead of last-page-load. The data would probably be an average of 7 minutes old. -Jason Martin -- This message is PGP/MIME signed. -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 211 bytes Desc: not available URL: From msugano at uolinc.com Mon Oct 10 16:20:10 2005 From: msugano at uolinc.com (Marcel Mitsuto Fucatu Sugano) Date: Mon, 10 Oct 2005 11:20:10 -0300 Subject: WEB-Interface performance In-Reply-To: <434A3748.6060206@aol.com> References: <1128555158.11266.32.camel@localhost> <4344F968.9090701@aol.com> <1128722447.25260.39.camel@localhost> <434A3748.6060206@aol.com> Message-ID: <1128954010.14495.8.camel@localhost> On Mon, 2005-10-10 at 10:41 +0100, Rob Moss wrote: > Marcel Mitsuto Fucatu Sugano wrote: > > Hello again, > > > > On Thu, 2005-10-06 at 11:16 +0100, Rob Moss wrote: > > > Yeah, there are some CFLAGS you could be using to optmise your > > > build.. I am assuming that you have a recent version of GCC, and > > > that your P4 HT cpu is shown as having two logical CPU's > > > > > > Try rebuilding with the following command: > > > > > > cd nagios-2.0b4 > > > CC=gcc CFLAGS="-mtune=i686 -O3 -pipe -march=i686 -funroll-loops > > > -ffast-math" \ > > > ./configure --prefix=/usr/local/nagios ...... [rest of nagios > > > configure commands] > > > > I've tried this with no luck, having an error message displaying > > that -ffast-math wasn't a recognized flag, with gcc-3.3.5. This was > > the second try to handle the cgi's performance. The first thing we > > tryied, was setting up a tmpfs mountpoint at /opt/nagios/var/tmpfs, > > and pointed nagios to write the status file there, but again with > > some errors that was a little weird, the cgi begins to ending > > prematurely (apache errorlog), and displaying "500 internal server > > error". This happened to 10% of the check_http running to test > > it :), and the response time didn't get too much of a performace > > improvement as well. > > > > I don't think this could be Virtual Machine OS's fault, so the > > problem might be with the status.cgi reading the tmpfs, but i can't > > tell for sure, as we tried this setup yesterday. Is there anyone > > here who made it? (status.dat been written in a tmpfs mountpoint?) > > The flag -ffast-math is in GCC 3.4.2, but I had assumed it was part of > the 3.x version of GCC. You can simply omit that flag, and compiling > should work again. (I'm surprised you didn't do that yourself). I did it, after placing status.dat on a tmpfs partition, and having no performance improvement. > [...] > > When you moved the status.dat file into the tmpfs volume, did you make > a symlink from where the file came from (make a symlink > from /opt/nagios/var/status.dat -> /opt/nagios/var/tmpfs/status.dat) ? No. I told nagios to put the status.dat on the tmpfs, per config. > Also, when you talk about the Virtual Machine OS.. are you running the > O/S under VMWare? You do realise that Emulating an O/S is going to be > slower and less stable, right? Errr... Sorry, that was a typo. I should meant Virtual Memory FS, the kernel task of handlig shared memory (formerly shm fs). > > Cheers > rob. Thanks for your answers, Bob. -- Marcel Mitsuto Fucatu Sugano Universo Online S.A. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jason.payne at iland.com Mon Oct 10 17:06:45 2005 From: jason.payne at iland.com (Jason Payne - iland Internet Solutions) Date: Mon, 10 Oct 2005 10:06:45 -0500 Subject: Monitoring Nagios Message-ID: <7A626FF425518246801C933014AA98F0A3B680@hou-ex01.ihtx.iland.com> >> What are people using to monitor Nagios from an external source. I just >> want something that can page me if Nagios crashes etc. Are there any >> freeware ping and alert software? >Make nagios a child of init in inittab, and respawn automatically. > >Russell Wow, that totally didn't answer his question. I just installed a second copy of nagios on another machine. http://nagios.sourceforge.net/docs/1_0/redundancy.html Hope this helps. -Jason ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Joseph.Petrucci at ddiworld.com Mon Oct 10 17:16:01 2005 From: Joseph.Petrucci at ddiworld.com (Petrucci, Joseph) Date: Mon, 10 Oct 2005 11:16:01 -0400 Subject: Monitoring Nagios Message-ID: or is you do not have another machine that you want to install NAgios on at a few of my clients I simply setup an ssh connection between a UNIX machine and the Nagios server and ran the following shell script on a cron schedule #!/bin/ksh CNT=`/usr/local/ssh nagios.mydomain.com -l nagios "/usr/bin/ps -ef | /usr/bin/grep -c [n]agios" if [ $CNT -lt 1 ] then /usr/bin/echo Nagios process is down | /usr/bin/mail -S "ERROR NAGIOS" mypager at vtext.com fi >-----Original Message----- >From: nagios-users-admin at lists.sourceforge.net >[mailto:nagios-users-admin at lists.sourceforge.net]On Behalf Of Jason >Payne - iland Internet Solutions >Sent: Monday, October 10, 2005 11:07 AM >To: nagios-users at lists.sourceforge.net >Subject: RE: [Nagios-users] Monitoring Nagios > > >>> What are people using to monitor Nagios from an external source. I >just >>> want something that can page me if Nagios crashes etc. Are there any >>> freeware ping and alert software? > >>Make nagios a child of init in inittab, and respawn automatically. >> >>Russell > > >Wow, that totally didn't answer his question. > >I just installed a second copy of nagios on another machine. > >http://nagios.sourceforge.net/docs/1_0/redundancy.html > >Hope this helps. > >-Jason > > >------------------------------------------------------- >This SF.Net email is sponsored by: >Power Architecture Resource Center: Free content, downloads, >discussions, >and more. http://solutions.newsforge.com/ibmarch.tmpl >_______________________________________________ >Nagios-users mailing list >Nagios-users at lists.sourceforge.net >https://lists.sourceforge.net/lists/listinfo/nagios-users >::: Please include Nagios version, plugin version (-v) and OS >when reporting any issue. >::: Messages without supporting info will risk being sent to /dev/null > > ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Joseph.Petrucci at ddiworld.com Mon Oct 10 17:22:34 2005 From: Joseph.Petrucci at ddiworld.com (Petrucci, Joseph) Date: Mon, 10 Oct 2005 11:22:34 -0400 Subject: Monitoring Nagios Message-ID: Sorry I was in a hurry I'll translate the mess I just sent. If you do not have another machine that you want to install Nagios on as I do at a few of my clients. Setup an ssh connection between a UNIX machine and the Nagios server and run the following shell script on a cron schedule. #!/bin/ksh CNT=`/usr/local/ssh nagios.mydomain.com -l nagios "/usr/bin/ps -ef | /usr/bin/grep -c [n]agios" if [ $CNT -lt 1 ] then /usr/bin/echo Nagios process is down | /usr/bin/mail -S "ERROR NAGIOS" mypager at vtext.com fi >-----Original Message----- >From: nagios-users-admin at lists.sourceforge.net >[mailto:nagios-users-admin at lists.sourceforge.net]On Behalf Of Petrucci, >Joseph >Sent: Monday, October 10, 2005 11:16 AM >To: Jason Payne - iland Internet Solutions; >nagios-users at lists.sourceforge.net >Subject: RE: [Nagios-users] Monitoring Nagios > > >or is you do not have another machine that you want to install >NAgios on at a few of my clients I simply setup an ssh >connection between a UNIX machine and the Nagios server and >ran the following shell script on a cron schedule > > > >#!/bin/ksh >CNT=`/usr/local/ssh nagios.mydomain.com -l nagios "/usr/bin/ps >-ef | /usr/bin/grep -c [n]agios" >if [ $CNT -lt 1 ] >then > /usr/bin/echo Nagios process is down | /usr/bin/mail -S >"ERROR NAGIOS" mypager at vtext.com >fi > > > >>-----Original Message----- >>From: nagios-users-admin at lists.sourceforge.net >>[mailto:nagios-users-admin at lists.sourceforge.net]On Behalf Of Jason >>Payne - iland Internet Solutions >>Sent: Monday, October 10, 2005 11:07 AM >>To: nagios-users at lists.sourceforge.net >>Subject: RE: [Nagios-users] Monitoring Nagios >> >> >>>> What are people using to monitor Nagios from an external source. I >>just >>>> want something that can page me if Nagios crashes etc. Are >there any >>>> freeware ping and alert software? >> >>>Make nagios a child of init in inittab, and respawn automatically. >>> >>>Russell >> >> >>Wow, that totally didn't answer his question. >> >>I just installed a second copy of nagios on another machine. >> >>http://nagios.sourceforge.net/docs/1_0/redundancy.html >> >>Hope this helps. >> >>-Jason >> >> >>------------------------------------------------------- >>This SF.Net email is sponsored by: >>Power Architecture Resource Center: Free content, downloads, >>discussions, >>and more. http://solutions.newsforge.com/ibmarch.tmpl >>_______________________________________________ >>Nagios-users mailing list >>Nagios-users at lists.sourceforge.net >>https://lists.sourceforge.net/lists/listinfo/nagios-users >>::: Please include Nagios version, plugin version (-v) and OS >>when reporting any issue. >>::: Messages without supporting info will risk being sent to /dev/null >> >> > > > >------------------------------------------------------- >This SF.Net email is sponsored by: >Power Architecture Resource Center: Free content, downloads, >discussions, >and more. http://solutions.newsforge.com/ibmarch.tmpl >_______________________________________________ >Nagios-users mailing list >Nagios-users at lists.sourceforge.net >https://lists.sourceforge.net/lists/listinfo/nagios-users >::: Please include Nagios version, plugin version (-v) and OS >when reporting any issue. >::: Messages without supporting info will risk being sent to /dev/null > > ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Mon Oct 10 09:56:04 2005 From: ae at op5.se (Andreas Ericsson) Date: Mon, 10 Oct 2005 09:56:04 +0200 Subject: Does nagios provides such details In-Reply-To: References: Message-ID: <434A1E94.1050102@op5.se> ankush grover wrote: > hey friends, > > I wanted to know something regarding the working of nagios.Recently I build > a cluster for Apache & Mysql(ultramonkey).Now I am looking for a tool which > can give me details like this the no.of connections to apache & mysql,load > on Apache & Mysql,bandwidth consumed by both Apache & Mysql,Users connected > to the Mysql/Apache ,IPAddresses of the clients connected to apache/mysql at > the given time means If I refresh the page the nagios should give me the > current details. > > Does nagios gives such kind of details. > No it does not. No tool will give you the current amount of connections to apache, because HTTP is a stateless protocol, meaning any given transaction is executed as a TCP handshake, a data request, the response and the closing of the connection. The number of transactions in progress can be determined through netstat -tan | grep :80 | wc -l provided you run the apache server on port 80. As for mysql, I think you can at least get the numbers with check_mysql. However, you won't get that fresh data with Nagios, as it schedules and executes checks and presents the results of those checks. If you want "I hit refresh so now those numbers must be for exactly now", you'll have to write a cgi-script or PHP page to run those checks for you. It would most likely not be very useful though, so don't expect anyone else to have done it already. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Mon Oct 10 10:09:56 2005 From: ae at op5.se (Andreas Ericsson) Date: Mon, 10 Oct 2005 10:09:56 +0200 Subject: Who is maintaining NRPE? In-Reply-To: References: <4333170E.6020709@op5.se> <4337BDF7.2010407@op5.se> Message-ID: <434A21D4.2080208@op5.se> Ton Voon wrote: > Andreas, > > Sorry for the delay. Please find the nrpe 2.2 patch below. This applies cleanly > to your latest sources. > > Patch is to allow compiles on Tru64 and IRIX. > > This should also fix tracker bug > 1164370: http://sourceforge.net/tracker/index.php?func=detail&aid=1164370&group_id=110412&atid=656217 > > > Are you actively maintaining this bug list? If not, where are you working from? > > On 26 Sep 2005, at 10:23, Andreas Ericsson wrote: > >> Ton Voon wrote: >> > >>> If so, I don't think this is in the spirit of openess. Can I suggest that >>> you use Sourceforge to host the NRPE code? >>> >>> >> >> You can suggest it, but it won't happen. The sourceforge CVS is near enough >> impossible to work with from sweden, due to a 400ms latency. >> >> >> I can set them up for public access on oss.op5.se though. >> > > Great. Roughly when will this be? > "Any day now" ;) I'll have a look at it on thursday when I'm back in civilization again. > I also note that the Changelog file hasn't been updated since you took over. > Could you update those please (especially since we don't have CVS access :) ) > I'll look into it. Nothing much has changed though. check_nrpe is backwards compatible and loglevel is a new configuration option (added due to the noisiness of the backwards compatibility stuff). -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Mon Oct 10 09:57:28 2005 From: ae at op5.se (Andreas Ericsson) Date: Mon, 10 Oct 2005 09:57:28 +0200 Subject: HOSTSTATETYPE problem In-Reply-To: References: Message-ID: <434A1EE8.4050807@op5.se> Tomas Macek wrote: > Hi, I have this piece of main.c of my nagios plugin: > > --------------------------------------------- > int main(int argc, char *argv[]) { > netsnmp_pdu *response; > oid anOID[MAX_OID_LEN]; > size_t anOID_len; > int i = 0; > char ret_pass[9]; > int status = STATE_UNKNOWN, outlet = -1; > struct stat st; > > if (argc < 2) { > printf("Usage: routlet \n\thost - host to > check\n\toutlet - outlet to restart"); > return STATE_UNKNOWN; > } > syslog(LOG_NOTICE, "routlet: '%s' '%s' '%s'", argv[1], argv[2], > argv[3]); > > ... > ... > ... > } > --------------------------------------------- > > and I have this [command] line in my nagios config file > /etc/nagios/commands.cfg: > > -------------------- > command[resetOutletSVIt]=$USER1$/routlet 192.168.1.1 5 $HOSTSTATETYPE$ > -------------------- > > And finally - this is output in the /var/log/messages file when the > plugin was run: > > ------------------- > routlet: outlet: '213.250.192.60' '5' '$' > ------------------- > > I'm using Nagios version 1.2. Why I got this output, when other > variables like $HOSTATTEMPT$ works fine? > > Any help appreciated!! > Read the docs on macros and make sure you get the docs version that corresponds to your nagios version. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Mon Oct 10 17:59:51 2005 From: marc at ena.com (Marc Powell) Date: Mon, 10 Oct 2005 10:59:51 -0500 Subject: Event unhandled in Cgi's but Acknowledged .. Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Emmanuel Roux > Sent: Monday, October 10, 2005 5:26 AM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] Event unhandled in Cgi's but Acknowledged .. > > Hello, > > Since last Week on my Nagios server (1.2) I have one bug ..I don't > change anything apart upgraded my system with last security patches > (debian Sarge). > > when go on my cgi tac.cgi i have got my service or host (previously > aknowledged) on critical Acknowledged state and when i refresh > sometime it appear on unhandled problem state (i verify the state > don't change beetwen refreshs..) ...strange > > sometime got an idea or already got this pb ? Do you happen to have multiple nagios daemons running at the same time? Your symptom is consistent with that scenario. -- Marc ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Mon Oct 10 17:58:28 2005 From: marc at ena.com (Marc Powell) Date: Mon, 10 Oct 2005 10:58:28 -0500 Subject: getting error while starting the nagios service on FC3 Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of ankush grover > Sent: Monday, October 10, 2005 6:47 AM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] getting error while starting the nagios service on > FC3 > > hey friends, > > I created some files like > hostgroups.cfg,hosts.cfg,services.cfg,contact.cfg and > contactgroups.cfg.But when I tried to start the nagios service it gave me > error.Then I tried to start with this command nagios -v > /etc/nagios/nagios.cfg it gave me this error. > > > Nagios 2.0b4 > Copyright (c) 1999-2005 Ethan Galstad (http://www.nagios.org) > Last Modified: 08-02-2005 > License: GPL > > Reading configuration data... > > Error: Invalid hostgroup object directive 'contact_groups'. > Error: Could not add object property in file '/etc/nagios/hostgroups.cfg' > on line 4. > > ***> One or more problems was encountered while processing the config > files... > > Check your configuration file(s) to ensure that they contain valid > directives and data defintions. If you are upgrading from a previous > version of Nagios, you should be aware that some > variables/definitions > may have been removed or modified in this version. Make sure to read > the HTML documentation regarding the config files, as well as the > 'Whats New' section to find out what has changed. > It looks like you're using nagios 1.x config file directives for 2.x that are no longer supported where you are using them, specifically the 'contact_groups' directive. "Make sure to read the HTML documentation regarding the config files, as well as the 'Whats New' section to find out what has changed." >From the What's New section of the documentation -- "Contact group changes - The contact_groups directive has been moved from hostgroup definitions to host definitions. This was done in order to maintain consistency with the way service contacts are specified. Make sure to update your config files!" You'll probably want to review that document for other changes, especially the macro changes. -- Marc ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From td3201 at gmail.com Mon Oct 10 22:12:17 2005 From: td3201 at gmail.com (Terry) Date: Mon, 10 Oct 2005 15:12:17 -0500 Subject: submitting acknowledgement via commandline Message-ID: <1128975137.8354.17.camel@linux.site> I am sure I am overlooking something big. The subject says what I want to do. Do I have to do this via the cgi somehow or is there an external command I can use that I am not seeing. I am using version 1.x. Thanks! ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From boinger at tradingtechnologies.com Mon Oct 10 22:46:26 2005 From: boinger at tradingtechnologies.com (jeff vier) Date: Mon, 10 Oct 2005 15:46:26 -0500 Subject: submitting acknowledgement via commandline In-Reply-To: <1128975137.8354.17.camel@linux.site> References: <1128975137.8354.17.camel@linux.site> Message-ID: <1128977186.4403.5.camel@chi100400> On Mon, 2005-10-10 at 15:12 -0500, Terry wrote: > I am sure I am overlooking something big. The subject says what I want > to do. Do I have to do this via the cgi somehow or is there an external > command I can use that I am not seeing. I am using version 1.x. This has been discussed recently. Check the archives. However, I'm feeling unusually charitable, so I've attached my ack'er script. Feel free to use it or not. :) Usage notes can be gotten by running it without arguments (of course, you shouldn't do that blindly...look at the code to make sure I'm not tricking you into erasing your hard drive and starting a chain letter or something). --jeff -------------- next part -------------- A non-text attachment was scrubbed... Name: submit_acknowledgment.sh Type: application/x-shellscript Size: 814 bytes Desc: not available URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 189 bytes Desc: This is a digitally signed message part URL: From marc at ena.com Mon Oct 10 22:54:33 2005 From: marc at ena.com (Marc Powell) Date: Mon, 10 Oct 2005 15:54:33 -0500 Subject: submitting acknowledgement via commandline Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Terry > Sent: Monday, October 10, 2005 3:12 PM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] submitting acknowledgement via commandline > > I am sure I am overlooking something big. The subject says what I want > to do. Do I have to do this via the cgi somehow or is there an external > command I can use that I am not seeing. I am using version 1.x. In case you don't find this link when searching the archives -- http://www.nagios.org/developerinfo/externalcommands/commandlist.php -- Marc ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rouilj at cs.umb.edu Mon Oct 10 23:09:15 2005 From: rouilj at cs.umb.edu (John P. Rouillard) Date: Mon, 10 Oct 2005 17:09:15 -0400 Subject: Modeling a VPN network in nagios to suppress spurious notifications. Message-ID: <200510102109.j9AL9Fqh021353@mx1.cs.umb.edu> Hello all: I have reached bit of fork in the road in a nagios deployment. I have a physical network that has a VPN running on it across the Internet to multiple sites. A large number of the hosts/services I need to monitor are on the private network only. The legend for the ASCII art network diagrams below. P(h1) - Private interface for host 1 (172.16...) U(h1) - pUblic interface for host 1 (some public IP) P(h1..h3) - Private interfaces for host 1 to host 3. P(s1) - Private traffic on switch 1 U(s1) - public traffic on switch 1 UP(s1) - public and private traffic on switch 1 U(I) - Internet (public) P(I) - Internet (private VPN tunneled traffic) P(v1) - private address for VPN box U(v1) - public address for VPN box U(r1) - public router 1 U(nagios) - public interface for nagios P(nagios) - private interface for nagios Hopefully you can make some sense of the ASCII art. BTW does anybody have a tool to generate ASCII diagrams from some sort of file specification like dot/graphviz? So the question is: what is the most efficient way to represent the dependencies (via parent or host dependencies) to prevent notifications on the VPN when the public network goes down? I was thinking of creating two separate networks. Each would have the parents defined as seen within its topology. So: private: P(h1..h3) <- P(s1) <- P(v1) <- private_internet <- P(v2) <- P(s2) <- P(nagios) v | P(h14..h16) <- P(s3) <- P(v3) -+ with switches and the private VPN addresses going to a "private internet" and creating all the parents as though the private network was a regular network. Also create a similar one for the public traffic portion of the net. public: U(h3..h5) <- U(s1) <- U(r1) <- U(internet) <- U(r2) <- U(s2) <- U(nagios) U(v1) <---+ v v | +-> U(v2) U(v3) <- U(s3) U(h17..h19) <- U(s4) <-+ Then at each VPN point create a host dependency of the private VPN "host" on the public host. The problem is that I don't think that host dependencies will work in the in same way as the parent' directive does when determining failure. Another way to represent these network is to merge the two so that each private VPN host [P(vN)] has its public host [U(vN)] as its parent private+public: U(h3..h5) <- U(s1) <- public_internet <- U(s2) <- U(nagios) | | ^ U(v2) +--<--+ | ^P(v2) <- P(s2) <- P(nagios) U(v1) | ** Note this | | P(v1) | | | P(s1) | | | P(h1..h3) | | U(v3) <- U(s3) | ^----< U(s4) <-- U(h17..h19) P(v3) | P(h14..h16) <- P(s3) However, if I do this then an outage of P(v2) due to failure of the VPN software, makes it look to Nagios like there is still connectivity to the other VPN sites since the routes/parent dependencies for P(nagios) and U(nagios) are identical due to both having U(s2) (or public_internet) as a common child. However in reality you can't reach P anything downstream of P(v2) if P(v2) or P(s2) are down. That's why the parentage graph directions at ** note are pointing away from P(nagios) rather than away from U(nagios). Also to make things a bit more complex, the switches usually have internal IP addresses but are partitioned so that they have both public and private traffic on them. Well thanks for making it this far in the email. Even if you don't have an answer, does what I am asking even make sense to anybody? If you have any questions, it might help me to see things more clearly. If you have struggled with it and decided that nagios isn't up to the task that is good to know as well. The multiple hosts for public and private interfaces bothers me, but until nagios becomes "network aware" in its outage determination for multi-homed hosts, I think this is the only way to do it. Also if anybody thinks I am nuts after reading this email, don't worry about I think I am nuts too but that's a good thing (TM) 8-) . -- rouilj John Rouillard =========================================================================== My employers don't acknowledge my existence much less my opinions. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rouilj at cs.umb.edu Mon Oct 10 23:24:25 2005 From: rouilj at cs.umb.edu (John P. Rouillard) Date: Mon, 10 Oct 2005 17:24:25 -0400 Subject: Setting flapping state via external command Message-ID: <200510102124.j9ALOlCN023320@mx1.cs.umb.edu> Hello all: While I like the concept behind nagios's flap detection, I have to wonder if anybody besides me thought it would be a good idea to be able to use an external command (e.g. SEC the simple event correlator) to set the state of a service or host to flapping and clear it again? This would allow much finer grained control over flapping determination e.g. you could look at the last 22 states or the last 10 states for flap detection rather than at nagios's default of 21. You could also change flap detection depending on time of day, the state of the rest of the network etc. and allow adaptive flap detection. -- rouilj John Rouillard =========================================================================== My employers don't acknowledge my existence much less my opinions. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Tue Oct 11 00:24:06 2005 From: ae at op5.se (Andreas Ericsson) Date: Tue, 11 Oct 2005 00:24:06 +0200 Subject: Modeling a VPN network in nagios to suppress spurious notifications. In-Reply-To: <200510102109.j9AL9Fqh021353@mx1.cs.umb.edu> References: <200510102109.j9AL9Fqh021353@mx1.cs.umb.edu> Message-ID: <434AEA06.1060203@op5.se> John P. Rouillard wrote: > Hello all: > > I have reached bit of fork in the road in a nagios deployment. I have > a physical network that has a VPN running on it across the Internet to > multiple sites. A large number of the hosts/services I need to monitor > are on the private network only. > > The legend for the ASCII art network diagrams below. > > P(h1) - Private interface for host 1 (172.16...) > U(h1) - pUblic interface for host 1 (some public IP) > P(h1..h3) - Private interfaces for host 1 to host 3. > P(s1) - Private traffic on switch 1 > U(s1) - public traffic on switch 1 > UP(s1) - public and private traffic on switch 1 > U(I) - Internet (public) > P(I) - Internet (private VPN tunneled traffic) > P(v1) - private address for VPN box > U(v1) - public address for VPN box > U(r1) - public router 1 > U(nagios) - public interface for nagios > P(nagios) - private interface for nagios > > Hopefully you can make some sense of the ASCII art. BTW does anybody > have a tool to generate ASCII diagrams from some sort of file > specification like dot/graphviz? > > So the question is: what is the most efficient way to represent the > dependencies (via parent or host dependencies) to prevent > notifications on the VPN when the public network goes down? > The whole concept of parents was designed for this very purpose, so you might as well put it to use for it. > I was thinking of creating two separate networks. Each would have the > parents defined as seen within its topology. So: > > private: > P(h1..h3) <- P(s1) <- P(v1) <- private_internet <- P(v2) <- P(s2) <- P(nagios) > v > | > P(h14..h16) <- P(s3) <- P(v3) -+ > > with switches and the private VPN addresses going to a "private > internet" and creating all the parents as though the private network > was a regular network. > > Also create a similar one for the public traffic portion of the net. > > public: > U(h3..h5) <- U(s1) <- U(r1) <- U(internet) <- U(r2) <- U(s2) <- U(nagios) > U(v1) <---+ v v > | +-> U(v2) > U(v3) <- U(s3) > U(h17..h19) <- U(s4) <-+ > > Then at each VPN point create a host dependency of the private VPN > "host" on the public host. The problem is that I don't think that host > dependencies will work in the in same way as the parent' directive > does when determining failure. > > Another way to represent these network is to merge the two so that > each private VPN host [P(vN)] has its public host [U(vN)] as its > parent > > private+public: > > U(h3..h5) <- U(s1) <- public_internet <- U(s2) <- U(nagios) > | | ^ U(v2) > +--<--+ | ^P(v2) <- P(s2) <- P(nagios) > U(v1) | ** Note this > | | > P(v1) | > | | > P(s1) | > | | > P(h1..h3) | > | > U(v3) <- U(s3) > | ^----< U(s4) <-- U(h17..h19) > P(v3) > | > P(h14..h16) <- P(s3) > > > However, if I do this then an outage of P(v2) due to failure of the > VPN software, makes it look to Nagios like there is still connectivity > to the other VPN sites since the routes/parent dependencies for > P(nagios) and U(nagios) are identical due to both having U(s2) (or > public_internet) as a common child. However in reality you can't reach > P anything downstream of P(v2) if P(v2) or P(s2) are down. That's why > the parentage graph directions at ** note are pointing away from > P(nagios) rather than away from U(nagios). > Just follow the packets from the Nagios server to the target. Where they go into the VPN, you monitor both endpoints of the VPN and otherwise just do the same thing, like so; nagios -> vpn-entry -> vpn-exit -> monitored-host It's usually favourable to monitor the far side of the interface so you get a notification if the internal routing is acting up in the device (this generally holds for all devices routing traffic). Never mind the devices that need to be up in between the two vpn-endpoints unless you can do somethnig about it when they act up. It gets tricky if you have several redundant routs or a circular redundancy thing set up, but in your case I don't see that particular problem. > Also to make things a bit more complex, the switches usually have > internal IP addresses but are partitioned so that they have both public > and private traffic on them. > You can add the separate addresses as separate hosts if you like. That might be what makes most sense if you're having a lot of trouble figuring out the route for something or if you'd get weird stuff happening (VLAN routing going twice through the same switch so it's its own parent or grandparent or some such, I've seen all sorts really). > Well thanks for making it this far in the email. > > Even if you don't have an answer, does what I am asking even make > sense to anybody? I'm not sure. I'm just winging it and hoping for the best. :) In general though, just follow the packets from the Nagios server to the target. It gets tricky if you have several redundant routs or a circular redundancy thing set up, but in your case it doesn't seem to be the case. > If you have any questions, it might help me to see > things more clearly. If you have struggled with it and decided that > nagios isn't up to the task that is good to know as well. The multiple > hosts for public and private interfaces bothers me, but until nagios > becomes "network aware" in its outage determination for multi-homed > hosts, I think this is the only way to do it. > > Also if anybody thinks I am nuts after reading this email, don't worry > about I think I am nuts too but that's a good thing (TM) 8-) . > > -- rouilj > John Rouillard -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Tue Oct 11 00:26:46 2005 From: ae at op5.se (Andreas Ericsson) Date: Tue, 11 Oct 2005 00:26:46 +0200 Subject: Setting flapping state via external command In-Reply-To: <200510102124.j9ALOlCN023320@mx1.cs.umb.edu> References: <200510102124.j9ALOlCN023320@mx1.cs.umb.edu> Message-ID: <434AEAA6.7050307@op5.se> John P. Rouillard wrote: > Hello all: > > While I like the concept behind nagios's flap detection, I have to > wonder if anybody besides me thought it would be a good idea to be > able to use an external command (e.g. SEC the simple event correlator) > to set the state of a service or host to flapping and clear it again? > > This would allow much finer grained control over flapping > determination e.g. you could look at the last 22 states or the last 10 > states for flap detection rather than at nagios's default of 21. You > could also change flap detection depending on time of day, the state > of the rest of the network etc. and allow adaptive flap detection. > There is no such mechanism in place today. A NEB-module might be able to help you there though and shouldn't be too hard to write so long as you keep it fairly small and make sure it's stateless (creating threads in modules can wreak some fairly serious havok on nagios due to certain global variables being initialized more than once). -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From misc at viceconsulting.co.nz Tue Oct 11 01:13:00 2005 From: misc at viceconsulting.co.nz (misc at viceconsulting.co.nz) Date: Tue, 11 Oct 2005 12:13:00 +1300 (NZDT) Subject: Max number of services that can be monitored ? Message-ID: <50695.127.0.0.1.1128985980.squirrel@www.goldenfields.co.nz> Hi Rob, > You haven't sent in any information on what your server is actually > doing, so we can't offer much help there. Amount of ram seems low, how > much RAM is free when you're running Nagios? > > What standard service checks are you running for each server? > > Are you running any of the Perl plugins (and having to launch Perl each > time you do a service check?) The Nagios server is a dedicated Nagios server, it is not doing anything beyond Nagios. It is monitoring 60 hosts. Each host has 16 services configured on it. Of those 16, 14 are run on the local host via check_nrpe, the remaining 2 are run on the Nagios server and are check_ssh and check_ntp (check_ntp is a Perl script.) The nagios is also self monitoring itself via check_disk, check_load, check_procs and a couple of shell scripts. Just out of interest, if you have NOT enabled embedded Perl, then am I correct in saying that Nagios will INVOKE a Perl script in the exact same way as it will invoke a C plugin, such as check_nrpe ? ie it will do a fork() and run a given program, be it Perl, Shell, C etc. I have about 160MB of RAM free, and that amount has been free for several days, swap hasn't been touched. Should I remove the NTP check, is it really going to be such an issue given its only 1 of 16 services is using Perl? > >Is what's happening with my Nagios server normal, or have I made a blatant > >configuration error that is slowing everything down? > > > > > Lack of information provided, so can't provide any clues. > > >I'm runinng Nagios 2.0b4 (embedded Perl DISABLED) on redhat es 3 update 5. > > > > > As with the service checks. With the Embeded Perl interpereter > disabled, if you run any Perl based checks, you will have to launch Perl > each and every time. With embedded Perl enabled, I was getting memory leaks which was eventually crashing the Nagios server so I disabled it. If you can provide any further advice that would be much appreciated. Cheers, Alex. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From hartwick at hartwick.com Tue Oct 11 03:07:35 2005 From: hartwick at hartwick.com (Michael J. Hartwick) Date: Mon, 10 Oct 2005 21:07:35 -0400 (EDT) Subject: Backup Connection Monitoring Message-ID: Hi, I have a DSL line with a dial backup. The DSL connection occasionally gets into a strange state. Ideally I would like to have Nagios detect the dial backup connection come up and run a script that can login to the DSL connection and reset the interface. The script should be easy enough to do. Detecting when the dial backup connection comes up is easy, the problem is the connection is normally down. I have the dial backup and the DSL connection as parents to the servers behind them. When I am running on the DSL connection (most of the time) Nagios shows a blocking outage. Does anyone know any way to configure Nagios around this problem? Michael ---------------------------------------------------------------------- Michael J. Hartwick, VE3SLQ hartwick at hartwick.com Hartwick Communications Consulting (519) 396-7719 Kincardine, ON, CA http://www.hartwick.com ---------------------------------------------------------------------- ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rouilj at cs.umb.edu Tue Oct 11 04:16:27 2005 From: rouilj at cs.umb.edu (John P. Rouillard) Date: Mon, 10 Oct 2005 22:16:27 -0400 Subject: Setting flapping state via external command Message-ID: <200510110216.j9B2GReG003936@mx1.cs.umb.edu> Andreas Ericsson wrote: >John P. Rouillard wrote: >> While I like the concept behind nagios's flap detection, I have to >> wonder if anybody besides me thought it would be a good idea to be >> able to use an external command (e.g. SEC the simple event correlator) >> to set the state of a service or host to flapping and clear it again? >> >> This would allow much finer grained control over flapping >> determination e.g. you could look at the last 22 states or the last 10 >> states for flap detection rather than at nagios's default of 21. You >> could also change flap detection depending on time of day, the state >> of the rest of the network etc. and allow adaptive flap detection. > >There is no such mechanism in place today. A NEB-module might be able to >help you there though and shouldn't be too hard to write so long as you >keep it fairly small and make sure it's stateless (creating threads in >modules can wreak some fairly serious havok on nagios due to certain >global variables being initialized more than once). Hmm, I always thought of a NEB module as a way of getting information from the nagios core into other applications. E.G. events/status info into a database, or report the flapping event to an external program. What I want to do is call: set_service_flap() clear_service_flap() and set_host_flap() clear_host_flap() from base/flapping.c when an external program identifies a flapping state. I can see how a NEB module would be useful in feeding the external program with alerts/events. So am I misunderstanding the abilities of the NEB module? -- rouilj John Rouillard =========================================================================== My employers don't acknowledge my existence much less my opinions. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jjk_saji at yahoo.com Tue Oct 11 06:10:43 2005 From: jjk_saji at yahoo.com (John Joseph) Date: Tue, 11 Oct 2005 05:10:43 +0100 (BST) Subject: getting error while starting the nagios service on FC3 In-Reply-To: References: Message-ID: <20051011041043.52137.qmail@web34815.mail.mud.yahoo.com> Hi The message shows that , u need to check ? hostgroups.cfg? , have u defined your contact group and members , please do check it Thanks Joseph John --- ankush grover wrote: > hey friends, > > Error: Invalid hostgroup object directive > 'contact_groups'. > Error: Could not add object property in file > '/etc/nagios/hostgroups.cfg' on > line 4. > > ***> One or more problems was encountered while > processing the config > files... > > > Hostgroups.cfg file > define hostgroup{ > hostgroup_name www > alias Web Servers > contact_groups linux-admins > members cluster1 > } > ___________________________________________________________ To help you stay safe and secure online, we've developed the all new Yahoo! Security Centre. http://uk.security.yahoo.com ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From b00mer at gmx.net Tue Oct 11 07:23:23 2005 From: b00mer at gmx.net (Hendrik Baecker) Date: Tue, 11 Oct 2005 07:23:23 +0200 Subject: Backup Connection Monitoring In-Reply-To: References: Message-ID: <434B4C4B.1070101@gmx.net> Hi Michael, what about this: 1. Configure your ISDN Backup to be state OK when it is down and CRITICAL when it is up, perhaps the nagios plugin "negate" can help you. 2. Create an event handler on your backup line which triggers on the above check and tries to reset your dsl connection. Hendrik Michael J. Hartwick schrieb: > Hi, > > I have a DSL line with a dial backup. The DSL connection occasionally > gets into a strange state. Ideally I would like to have Nagios detect > the dial backup connection come up and run a script that can login to > the DSL connection and reset the interface. The script should be easy > enough to do. Detecting when the dial backup connection comes up is > easy, the problem is the connection is normally down. I have the dial > backup and the DSL connection as parents to the servers behind them. > When I am running on the DSL connection (most of the time) Nagios > shows a blocking outage. > > Does anyone know any way to configure Nagios around this problem? > > Michael > > ---------------------------------------------------------------------- > Michael J. Hartwick, VE3SLQ hartwick at hartwick.com > Hartwick Communications Consulting (519) 396-7719 > Kincardine, ON, CA http://www.hartwick.com > ---------------------------------------------------------------------- > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, discussions, > and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. ::: Messages without supporting info will risk > being sent to /dev/null > > ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From hartwick at hartwick.com Tue Oct 11 07:26:03 2005 From: hartwick at hartwick.com (Michael J. Hartwick) Date: Tue, 11 Oct 2005 01:26:03 -0400 (EDT) Subject: Backup Connection Monitoring In-Reply-To: <434B4C4B.1070101@gmx.net> References: <434B4C4B.1070101@gmx.net> Message-ID: On Tue, 11 Oct 2005 at 07:23 (+0200), Hendrik Baecker wrote: HB> Hi Michael, HB> HB> what about this: HB> HB> 1. Configure your ISDN Backup to be state OK when it is down and CRITICAL HB> when it is up, perhaps the nagios plugin "negate" can help you. Never thought about reversing the logic! THanks HB> 2. Create an event handler on your backup line which triggers on the above HB> check and tries to reset your dsl connection. I have this written, but my logic was backwards. Michael ---------------------------------------------------------------------- Michael J. Hartwick, VE3SLQ hartwick at hartwick.com Hartwick Communications Consulting (519) 396-7719 Kincardine, ON, CA http://www.hartwick.com ---------------------------------------------------------------------- ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From b00mer at gmx.net Tue Oct 11 07:49:49 2005 From: b00mer at gmx.net (Hendrik Baecker) Date: Tue, 11 Oct 2005 07:49:49 +0200 Subject: Max number of services that can be monitored ? In-Reply-To: <50695.127.0.0.1.1128985980.squirrel@www.goldenfields.co.nz> References: <50695.127.0.0.1.1128985980.squirrel@www.goldenfields.co.nz> Message-ID: <434B527D.9070008@gmx.net> Hi all, I would suggest, that Alex should enable the perf-data, to see if there is one host or special service that screws up the latency. We have one Server with double proceccor Xeon 1,0Ghz, 2 Gig of Ram and since Nagios 2.0b3 no latency above 5 seconds. Except there are many hosts done in our network, but this works as designed. Together we are monitoring more than 4000 Services on nearly 1000 Hosts. EMBEDDED Perl ist disabled and in relation to the other posts here we have many perl scripts running as service checks. I don't belive that it helps much when you cancel the check_ntp check. 1. Enable the perf-data. (Tip: Say Nagios it should write both, host and service perf data in one file. So you can see how Nagios works and when the latency screws up) 2. Perhaps, even if the memory is not eaten up by nagios, think about to give your server more RAM. 3. Try to figure out, if your system has sometimes wa-cycles. If every process on your system is waiting for IO, then the ram is NOT eaten up and swapping isn't used to, but even no performance. Please tell us, what service_check interval you have. If your nagios checks nearly every minute it may be hard for: a) your monitoring host, b) the (local) network, c) the (possibly existing) wan network. Are your 60 hosts normaly up and running or do they often falls down? Everytime when nagios is executing a host check, all other service checks are beeing "stopped", because a host check gets a higher check priority then a service check. So if the scheduler has enough to do for service checks and there are some host checks comming between the other service checks are executed later then they normaly were scheduled. Hope that gives some ideas. Hendrik misc at viceconsulting.co.nz schrieb: >Hi Rob, > > > >>You haven't sent in any information on what your server is actually >>doing, so we can't offer much help there. Amount of ram seems low, how >>much RAM is free when you're running Nagios? >> >>What standard service checks are you running for each server? >> >>Are you running any of the Perl plugins (and having to launch Perl each >>time you do a service check?) >> >> > >The Nagios server is a dedicated Nagios server, it is not doing anything >beyond Nagios. It is monitoring 60 hosts. Each host has 16 services >configured on it. Of those 16, 14 are run on the local host via >check_nrpe, the remaining 2 are run on the Nagios server and are check_ssh >and check_ntp (check_ntp is a Perl script.) > >The nagios is also self monitoring itself via check_disk, check_load, >check_procs and a couple of shell scripts. > >Just out of interest, if you have NOT enabled embedded Perl, then am I >correct in saying that Nagios will INVOKE a Perl script in the exact same >way as it will invoke a C plugin, such as check_nrpe ? ie it will do a >fork() and run a given program, be it Perl, Shell, C etc. > >I have about 160MB of RAM free, and that amount has been free for several >days, swap hasn't been touched. > >Should I remove the NTP check, is it really going to be such an issue >given its only 1 of 16 services is using Perl? > > > >>>Is what's happening with my Nagios server normal, or have I made a blatant >>>configuration error that is slowing everything down? >>> >>> >>> >>> >>Lack of information provided, so can't provide any clues. >> >> >> >>>I'm runinng Nagios 2.0b4 (embedded Perl DISABLED) on redhat es 3 update 5. >>> >>> >>> >>> >>As with the service checks. With the Embeded Perl interpereter >>disabled, if you run any Perl based checks, you will have to launch Perl >>each and every time. >> >> > >With embedded Perl enabled, I was getting memory leaks which was >eventually crashing the Nagios server so I disabled it. > >If you can provide any further advice that would be much appreciated. > >Cheers, >Alex. > > >------------------------------------------------------- >This SF.Net email is sponsored by: >Power Architecture Resource Center: Free content, downloads, discussions, >and more. http://solutions.newsforge.com/ibmarch.tmpl >_______________________________________________ >Nagios-users mailing list >Nagios-users at lists.sourceforge.net >https://lists.sourceforge.net/lists/listinfo/nagios-users >::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. >::: Messages without supporting info will risk being sent to /dev/null > > > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From amontibello at gmail.com Tue Oct 11 08:06:43 2005 From: amontibello at gmail.com (Anthony Montibello) Date: Tue, 11 Oct 2005 02:06:43 -0400 Subject: [Nagiosplug-help] Please help with memory check In-Reply-To: <20051008133858.12634.qmail@web52604.mail.yahoo.com> References: <20051008133858.12634.qmail@web52604.mail.yahoo.com> Message-ID: hi Mandy, If you are having an issue with Check_nt try it from the commad line, to make sure it is giving you the desired results. The problem that you may be experiencing could be that NS_Client is using the "Commited Bytes" and "Commit Limit" To determin the % used. It may be confusing because neither check_nt nor the NS_Client documentation was clear on this. test the memory from the command line to experiment with the Thresholds if you are still having issues with them. check the return code result by using the `echo $?` THe results are calculated in the check_nt plugin. expected results: if the Used% result is greater than or equal to the critical value then the result will be a critical. else if the Used% result is greater than or equal to the warning value then the result will be a warning. otherwise the result is OK or an error Also note: with the MEMUSE function of check_nt the -w and -c are optional. you can specify both, one of them or neither depending on your needs. Hope this helps, Tony amontibello at shatterit.com On 10/8/05, Mandy Williams wrote: > > Hi All, > > I posted a little while back with no answers. I am > having issues with the nsclient plugin and checking > memory. more specifically with the thresholds. i > cannot seem to understand what they are really > supposed to alert on. i understand all the rest of > the them, but the memory seems odd. it reports when > it should and is not consistent. could someone PLEASE > tell me what the deal is. if i put it -w 80 -c 90 > should i not be alert when 80% of memory is used on > warning and 90% critical?? i am going mad with > this.... > > TIA, > M > > > > > __________________________________ > Yahoo! Mail - PC Magazine Editors' Choice 2005 > http://mail.yahoo.com > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, discussions, > and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagiosplug-help mailing list > Nagiosplug-help at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagiosplug-help > ::: Please include plugins version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: From maca02 at atlas.cz Tue Oct 11 09:10:27 2005 From: maca02 at atlas.cz (=?iso-8859-2?B?VG9t4bkgTWFjZWs=?=) Date: Tue, 11 Oct 2005 09:10:27 +0200 Subject: HOSTSTATETYPE problem Message-ID: Hi, I'm writing an event handler in C. I have this code: int main(int argc, char *argv[]) { netsnmp_pdu *response; oid anOID[MAX_OID_LEN]; size_t anOID_len; int i = 0; char ret_pass[9]; int status = STATE_UNKNOWN, outlet = -1; struct stat st; if (argc < 2) { printf("Usage: routlet \n\thost - host to check\n\toutlet - outlet to restart"); return STATE_UNKNOWN; } syslog(LOG_NOTICE, "routlet: '%s' '%s' '%s'", argv[1], argv[2], argv[3]); ... } My /etc/nagios/commands.cfg contains this line: command[resetOutletSVIt]=$USER1$/routlet 192.168.1.1 1 $HOSTSTATETYPE$ My /etc/nagios/hosts.cfg contains this: host[ip1]=IP do ven;192.168.1.2;;check-host-alive;5;60;24x7;1;1;0;resetOutletSVIt and the /var/log/messages contains this line, when the event is run: Oct 7 12:21:45 iks routlet: outlet: '192.168.1.1' '1' '$' Why the "$"??? It should contain the text like "SOFT" or "HARD", shlouldn't it? Any help appreciated! ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Bimschas at phase4.de Tue Oct 11 09:28:32 2005 From: Bimschas at phase4.de (Daniel Bimschas) Date: Tue, 11 Oct 2005 09:28:32 +0200 Subject: Notification periods question Message-ID: <2CCEB1B7A93CF444AC4FE3BAB759962CD0CC05@p4-muc-nt5.intra.phase4.de> hi there! i have the following problem and it seems i can't solve it without greater effort. i have a service that is checked 24x7. during "workhours", meaning from 09:00 - 22:00 i want to notify always if the service is warning, critical or unknown (w,u,c,r) and when it recovers. at "non-workhours" (22:00-09:00) i want only to notify if the service is critical or unknown. can i realise that with nagios without writing my own notification plugin and how? thanks for your replies -- Daniel Bimschas Entwicklung Phase 4 Communications GmbH Bayerstra?e 85 a 80335 M?nchen Fon +49.89.30 63 21-52 Fax +49.89.30 63 21-21 ISDN +49.89.30 72 93 33 M?nchen | Z?rich | Berlin www.phase4.de ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From VINAY_SHARMA at advanex.co.jp Tue Oct 11 09:50:31 2005 From: VINAY_SHARMA at advanex.co.jp (VINAY_SHARMA at advanex.co.jp) Date: Tue, 11 Oct 2005 16:50:31 +0900 Subject: voice notification via telephone (vgetty, asterisk, etc.) ? Message-ID: Hi List, I want to enable 3-D Status Map.now when i click on 3D Status map.one file status.swrl will asking to save. how can i see 3D status map?? Many Thanks to all. Thanks & regards ************************************** Vinay Sharma Information Systems and Solutions-Associates Advanex Inc (www.advanex.co.jp) Fon : 813-3822-5863 Fax : 813-5815-7881 Email : vinay_sharma at advanex.co.jp Christophe SUIRE ??: Adam Tworkowski ???: cc: nagios-users at lists.sourceforge.net nagios-users-admin at lists.sour ??: Re: [Nagios-users] voice notification via telephone (vgetty, asterisk, etc.) ceforge.net ? 2005/10/07 02:23 Hi, I have done it for a customer .. In fact after some test with text to voice utility (problems with names of servers), we record some "wav" files, like "the server" - "is down" - "name of a server". I use a tool to convert wav files to modem voice files : wavtopvf When i have to send a notification i group the good pvf files to make a unique file and convert il to rmd I convert this file to rmd file : cat /home/nagios/wav/file1.pvf /home/nagios/wav/file2.pvf 2>/dev/ null | pvfspeed -s 7200 | pvfamp -A 3 | pvftormd Rockwell 2 > /tmp/ $PID-temp.rmd Then with mgetty-voice installed and configured, you can use the script callme.pl to send the file. Bye. -- Christophe Suire #!/usr/bin/perl -w # # $Id: callme.pl,v 1.2 1999/01/15 14:30:44 kas Exp $ # # This script dials a given number and then re-plays the message. # Use "vm shell -S /usr/bin/perl callme.pl number message.rmd" # for calling the "number" and playing the "message.rmd". # # Copyright (c) 1998 Jan "Yenya" Kasprzak . All rights # reserved. This package is free software; you can redistribute it and/or # modify it under the same terms as Perl itself. # use Modem::Vgetty; my $v = new Modem::Vgetty; die "Usage: callme.pl number message.rmd" if $#ARGV != 1; $v->device('DIALUP_LINE'); $v->add_handler('BUSY_TONE', 'finish', sub { $v->stop; exit 0; }); $v->enable_events; $v->dial($ARGV[0]); $v->waitfor('READY'); $v->play_and_wait($ARGV[1]); 1; Le 6 oct. 05 ? 18:51, Adam Tworkowski a ?crit : > Hi, > > I have searched various Nagios lists (-users, -devel) with the aim of > finding out if anyone has successfully implemented Nagios > notifications > via voice synthesis to a land- or cell-based telephone call. > > There have been various references to topic in the last year but I > cannot find anything that is concrete (links are dead, or threads end > and presumably turn into private conversations elsewhere.) > > To provide further clarity, I am hoping to use a PCI voice modem (real > controller,not softmodem) and have selected critical alerts be > "forwarded" to a predefined telephone number and play a message like > "Nagios is reporting the server X is not reachable". > > Does anyone have any thoughts on the matter or useful references? > > Thanks in advance. > > -Adam > > > > > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, > discussions, > and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > > ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From VINAY_SHARMA at advanex.co.jp Tue Oct 11 09:53:28 2005 From: VINAY_SHARMA at advanex.co.jp (VINAY_SHARMA at advanex.co.jp) Date: Tue, 11 Oct 2005 16:53:28 +0900 Subject: Nagios 3D status View.. Message-ID: Hi List, I want to enable 3-D Status Map.now when i click on 3D Status map.one file status.swrl will asking to save. how can i see 3D status map?? Many Thanks to all. Thanks & regards ************************************** Vinay Sharma Information Systems and Solutions-Associates Advanex Inc (www.advanex.co.jp) Fon : 813-3822-5863 Fax : 813-5815-7881 Email : vinay_sharma at advanex.co.jp ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Bimschas at phase4.de Tue Oct 11 09:58:28 2005 From: Bimschas at phase4.de (Daniel Bimschas) Date: Tue, 11 Oct 2005 09:58:28 +0200 Subject: AW: Nagios 3D status View.. Message-ID: <2CCEB1B7A93CF444AC4FE3BAB759962CE18D36@p4-muc-nt5.intra.phase4.de> hi! simply install a vrml-viewer of your choice ;) -----Urspr?ngliche Nachricht----- Von: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net]Im Auftrag von VINAY_SHARMA at advanex.co.jp Gesendet: Dienstag, 11. Oktober 2005 09:53 An: nagios-users at lists.sourceforge.net; nagios-users-admin at lists.sourceforge.net Betreff: [Nagios-users] Nagios 3D status View.. Hi List, I want to enable 3-D Status Map.now when i click on 3D Status map.one file status.swrl will asking to save. how can i see 3D status map?? Many Thanks to all. Thanks & regards ************************************** Vinay Sharma Information Systems and Solutions-Associates Advanex Inc (www.advanex.co.jp) Fon : 813-3822-5863 Fax : 813-5815-7881 Email : vinay_sharma at advanex.co.jp ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From safarj at kancelarskestroje.cz Tue Oct 11 10:01:01 2005 From: safarj at kancelarskestroje.cz (Jakub Safar) Date: Tue, 11 Oct 2005 10:01:01 +0200 Subject: Nagios 3D status View.. In-Reply-To: References: Message-ID: <1129017661.11957.101.camel@jsa> Hello Vinay, 1/ search and you will find: http://www.google.com/search?num=100&hl=en&lr=&safe=off&q=nagios+3d +faq&btnG=Search 2/ RTFM: http://www.nagios.org/faqs/viewfaq.php?faq_id=51 cheers, Jakub On Tue, 2005-10-11 at 16:53 +0900, VINAY_SHARMA at advanex.co.jp wrote: > Hi List, > > I want to enable 3-D Status Map.now when i click on 3D Status > map.one file status.swrl will asking to save. > > how can i see 3D status map?? > > > Many Thanks to all. > > > > Thanks & regards > ************************************** > Vinay Sharma > Information Systems and Solutions-Associates > Advanex Inc (www.advanex.co.jp) > Fon : 813-3822-5863 > Fax : 813-5815-7881 > Email : vinay_sharma at advanex.co.jp > > > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, discussions, > and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jjk_saji at yahoo.com Tue Oct 11 10:01:34 2005 From: jjk_saji at yahoo.com (John Joseph) Date: Tue, 11 Oct 2005 09:01:34 +0100 (BST) Subject: Nagios 3D status View.. In-Reply-To: References: Message-ID: <20051011080134.91467.qmail@web34815.mail.mud.yahoo.com> This links may help u http://www.nagios.org/faqs/viewfaq.php?faq_id=53 http://www.nagios.org/faqs/viewfaq.php?faq_id=51 --- VINAY_SHARMA at advanex.co.jp wrote: > Hi List, > > I want to enable 3-D Status Map.now > when i click on 3D Status > map.one file status.swrl will asking to save. > > how can i see 3D status map?? > > > Many Thanks to all. > > > > Thanks & regards > ************************************** > Vinay Sharma > Information Systems and Solutions-Associates > Advanex Inc (www.advanex.co.jp) > Fon : 813-3822-5863 > Fax : 813-5815-7881 > Email : vinay_sharma at advanex.co.jp > > > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, > downloads, discussions, > and more. > http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version > (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being > sent to /dev/null > ___________________________________________________________ To help you stay safe and secure online, we've developed the all new Yahoo! Security Centre. http://uk.security.yahoo.com ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From VINAY_SHARMA at advanex.co.jp Tue Oct 11 10:14:42 2005 From: VINAY_SHARMA at advanex.co.jp (VINAY_SHARMA at advanex.co.jp) Date: Tue, 11 Oct 2005 17:14:42 +0900 Subject: AW: Nagios 3D status View.. Message-ID: DQpEZWFyIEFsbCwNCg0KICAgICAgICAgICAgVGhhbmsgeW91IHZlcnkgbXVjaC5pdHMgd29ya2lu Zy50aGFua3MgZm9yIGhlbHBpbmcgbWUuDQoNCg0KVGhhbmtzICYgcmVnYXJkcw0KKioqKioqKioq KioqKioqKioqKioqKioqKioqKioqKioqKioqKioNClZpbmF5IFNoYXJtYQ0KSW5mb3JtYXRpb24g U3lzdGVtcyBhbmQgU29sdXRpb25zLUFzc29jaWF0ZXMNCkFkdmFuZXggSW5jICh3d3cuYWR2YW5l eC5jby5qcCkNCkZvbiAgIDogODEzLTM4MjItNTg2Mw0KRmF4ICAgICAgICAgOiA4MTMtNTgxNS03 ODgxDQpFbWFpbCAgICAgICA6IHZpbmF5X3NoYXJtYUBhZHZhbmV4LmNvLmpwDQoNCg0KICAgICAg ICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAg ICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAg ICAgICAgICAgICAgICAgICAgICAgICAgICAgDQogICAgICAgICAgICAgICAgICAgICAgIkRhbmll bCBCaW1zY2hhcyIgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAg ICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAg ICAgICANCiAgICAgICAgICAgICAgICAgICAgICA8Qmltc2NoYXNAcGhhc2U0LmRlPiAgICAgICAg ICAgICAgICAg5a6b5YWIOiAgICAgPFZJTkFZX1NIQVJNQUBhZHZhbmV4LmNvLmpwPiwgPG5hZ2lv cy11c2Vyc0BsaXN0cy5zb3VyY2Vmb3JnZS5uZXQ+LCAgICAgICAgICAgDQogICAgICAgICAgICAg ICAgICAgICAg6YCB5L+h6ICFOiAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICA8bmFnaW9z LXVzZXJzLWFkbWluQGxpc3RzLnNvdXJjZWZvcmdlLm5ldD4gICAgICAgICAgICAgICAgICAgICAg ICAgICAgICAgICAgICAgICAgICAgICANCiAgICAgICAgICAgICAgICAgICAgICBuYWdpb3MtdXNl cnMtYWRtaW5AbGlzdHMuc291ciAgICAgICAgY2M6ICAgICAgICAgICAgICAgICAgICAgICAgICAg ICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAg IA0KICAgICAgICAgICAgICAgICAgICAgIGNlZm9yZ2UubmV0ICAgICAgICAgICAgICAgICAgICAg ICAgICDku7blkI06ICAgICBBVzogW05hZ2lvcy11c2Vyc10gTmFnaW9zIDNEIHN0YXR1cyBWaWV3 Li4gICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICANCiAgICAgICAgICAgICAgICAg ICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAg ICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAg ICAgICAgICAgICAgICAgIA0KICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAg ICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAg ICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgDQogICAg ICAgICAgICAgICAgICAgICAgMjAwNS8xMC8xMSAxNjo1OCAgICAgICAgICAgICAgICAgICAgICAg ICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAg ICAgICAgICAgICAgICAgICAgICAgICAgICAgICANCiAgICAgICAgICAgICAgICAgICAgICAgICAg ICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAg ICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAg ICAgICAgIA0KICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAg ICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAg ICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgICAgDQoNCg0KDQoNCmhpIQ0K DQpzaW1wbHkgaW5zdGFsbCBhIHZybWwtdmlld2VyIG9mIHlvdXIgY2hvaWNlIDspDQoNCg0KDQot LS0tLVVyc3Byw7xuZ2xpY2hlIE5hY2hyaWNodC0tLS0tDQpWb246IG5hZ2lvcy11c2Vycy1hZG1p bkBsaXN0cy5zb3VyY2Vmb3JnZS5uZXQNClttYWlsdG86bmFnaW9zLXVzZXJzLWFkbWluQGxpc3Rz LnNvdXJjZWZvcmdlLm5ldF1JbSBBdWZ0cmFnIHZvbg0KVklOQVlfU0hBUk1BQGFkdmFuZXguY28u anANCkdlc2VuZGV0OiBEaWVuc3RhZywgMTEuIE9rdG9iZXIgMjAwNSAwOTo1Mw0KQW46IG5hZ2lv cy11c2Vyc0BsaXN0cy5zb3VyY2Vmb3JnZS5uZXQ7DQpuYWdpb3MtdXNlcnMtYWRtaW5AbGlzdHMu c291cmNlZm9yZ2UubmV0DQpCZXRyZWZmOiBbTmFnaW9zLXVzZXJzXSBOYWdpb3MgM0Qgc3RhdHVz IFZpZXcuLg0KDQoNCkhpIExpc3QsDQoNCiAgICAgICAgICAgICBJIHdhbnQgdG8gZW5hYmxlIDMt RCBTdGF0dXMgTWFwLm5vdyB3aGVuIGkgY2xpY2sgb24gM0QgU3RhdHVzDQptYXAub25lIGZpbGUg c3RhdHVzLnN3cmwgd2lsbCBhc2tpbmcgdG8gc2F2ZS4NCg0KaG93IGNhbiBpIHNlZSAzRCBzdGF0 dXMgbWFwPz8NCg0KDQpNYW55IFRoYW5rcyB0byBhbGwuDQoNCg0KDQpUaGFua3MgJiByZWdhcmRz DQoqKioqKioqKioqKioqKioqKioqKioqKioqKioqKioqKioqKioqKg0KVmluYXkgU2hhcm1hDQpJ bmZvcm1hdGlvbiBTeXN0ZW1zIGFuZCBTb2x1dGlvbnMtQXNzb2NpYXRlcw0KQWR2YW5leCBJbmMg KHd3dy5hZHZhbmV4LmNvLmpwKQ0KRm9uICAgOiA4MTMtMzgyMi01ODYzDQpGYXggICAgICAgICA6 IDgxMy01ODE1LTc4ODENCkVtYWlsICAgICAgIDogdmluYXlfc2hhcm1hQGFkdmFuZXguY28uanAN Cg0KDQoNCg0KLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0t LS0tLS0tLQ0KVGhpcyBTRi5OZXQgZW1haWwgaXMgc3BvbnNvcmVkIGJ5Og0KUG93ZXIgQXJjaGl0 ZWN0dXJlIFJlc291cmNlIENlbnRlcjogRnJlZSBjb250ZW50LCBkb3dubG9hZHMsIGRpc2N1c3Np b25zLA0KYW5kIG1vcmUuIGh0dHA6Ly9zb2x1dGlvbnMubmV3c2ZvcmdlLmNvbS9pYm1hcmNoLnRt cGwNCl9fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fDQpOYWdp b3MtdXNlcnMgbWFpbGluZyBsaXN0DQpOYWdpb3MtdXNlcnNAbGlzdHMuc291cmNlZm9yZ2UubmV0 DQpodHRwczovL2xpc3RzLnNvdXJjZWZvcmdlLm5ldC9saXN0cy9saXN0aW5mby9uYWdpb3MtdXNl cnMNCjo6OiBQbGVhc2UgaW5jbHVkZSBOYWdpb3MgdmVyc2lvbiwgcGx1Z2luIHZlcnNpb24gKC12 KSBhbmQgT1Mgd2hlbg0KcmVwb3J0aW5nIGFueSBpc3N1ZS4NCjo6OiBNZXNzYWdlcyB3aXRob3V0 IHN1cHBvcnRpbmcgaW5mbyB3aWxsIHJpc2sgYmVpbmcgc2VudCB0byAvZGV2L251bGwNCg0KDQot LS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tLS0tDQpU aGlzIFNGLk5ldCBlbWFpbCBpcyBzcG9uc29yZWQgYnk6DQpQb3dlciBBcmNoaXRlY3R1cmUgUmVz b3VyY2UgQ2VudGVyOiBGcmVlIGNvbnRlbnQsIGRvd25sb2FkcywgZGlzY3Vzc2lvbnMsDQphbmQg bW9yZS4gaHR0cDovL3NvbHV0aW9ucy5uZXdzZm9yZ2UuY29tL2libWFyY2gudG1wbA0KX19fX19f X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX18NCk5hZ2lvcy11c2VycyBt YWlsaW5nIGxpc3QNCk5hZ2lvcy11c2Vyc0BsaXN0cy5zb3VyY2Vmb3JnZS5uZXQNCmh0dHBzOi8v bGlzdHMuc291cmNlZm9yZ2UubmV0L2xpc3RzL2xpc3RpbmZvL25hZ2lvcy11c2Vycw0KOjo6IFBs ZWFzZSBpbmNsdWRlIE5hZ2lvcyB2ZXJzaW9uLCBwbHVnaW4gdmVyc2lvbiAoLXYpIGFuZCBPUyB3 aGVuDQpyZXBvcnRpbmcgYW55IGlzc3VlLg0KOjo6IE1lc3NhZ2VzIHdpdGhvdXQgc3VwcG9ydGlu ZyBpbmZvIHdpbGwgcmlzayBiZWluZyBzZW50IHRvIC9kZXYvbnVsbA0KDQoNCg0K ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From g.vickers at qut.edu.au Tue Oct 11 11:31:12 2005 From: g.vickers at qut.edu.au (Greg Vickers) Date: Tue, 11 Oct 2005 19:31:12 +1000 Subject: Notification periods question In-Reply-To: <2CCEB1B7A93CF444AC4FE3BAB759962CD0CC05@p4-muc-nt5.intra.phase4.de> References: <2CCEB1B7A93CF444AC4FE3BAB759962CD0CC05@p4-muc-nt5.intra.phase4.de> Message-ID: <434B8660.1050503@qut.edu.au> Hi Daniel, Daniel Bimschas wrote: > hi there! > > i have the following problem and it seems i can't solve it > without greater effort. > > i have a service that is checked 24x7. during "workhours", > meaning from 09:00 - 22:00 i want to notify always if the > service is warning, critical or unknown (w,u,c,r) and when > it recovers. > > at "non-workhours" (22:00-09:00) i want only to notify if > the service is critical or unknown. > > can i realise that with nagios without writing my own > notification plugin and how? Here's one way to do it: Notifications during work hours: define contact{ contact_name fredworkhours alias Notifications to Fred during work hours host_notification_period something service_notification_period workhours host_notification_options n service_notification_options w,u,c,r email fred at someserver } Notifications during non-work hours: define contact{ contact_name frednonworkhours alias Notifications to Fred outside of work hours host_notification_period something service_notification_period nonworkhours host_notification_options n service_notification_options w,u email fred at someserver } > thanks for your replies Such a spoon-fed reply is given because I am feeling out of touch with my server atm :) -- Greg Vickers Project Manager, IT Security Information Technology Services Queensland University of Technology L12, 126 Margaret St, Brisbane Phone: (07) 3864 9536 Email: g.vickers at qut.edu.au IT Security web site: http://www.its.qut.edu.au/itsecurity/ CRICOS No. 00213J ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Bimschas at phase4.de Tue Oct 11 11:45:26 2005 From: Bimschas at phase4.de (Daniel Bimschas) Date: Tue, 11 Oct 2005 11:45:26 +0200 Subject: AW: Notification periods question Message-ID: <2CCEB1B7A93CF444AC4FE3BAB759962CD0CC06@p4-muc-nt5.intra.phase4.de> first, thank you for your answer, greg. but unfortunately i've come so far too. the problem now is that nagios reschedules the notifications that occur during non-notification hours for the specific contact. it then sends the notifications when the timeperiod is valid for this contact. with this i will always get notified about all problems that occured at night, in the night AND in the morning. my notifications will also be sent out via SMS, so it would be really stressy to get so many notifications. you also have an idea regarding this? thanks -----Urspr?ngliche Nachricht----- Von: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net]Im Auftrag von Greg Vickers Gesendet: Dienstag, 11. Oktober 2005 11:31 An: nagios-users at lists.sourceforge.net Betreff: Re: [Nagios-users] Notification periods question Hi Daniel, Daniel Bimschas wrote: > hi there! > > i have the following problem and it seems i can't solve it > without greater effort. > > i have a service that is checked 24x7. during "workhours", > meaning from 09:00 - 22:00 i want to notify always if the > service is warning, critical or unknown (w,u,c,r) and when > it recovers. > > at "non-workhours" (22:00-09:00) i want only to notify if > the service is critical or unknown. > > can i realise that with nagios without writing my own > notification plugin and how? Here's one way to do it: Notifications during work hours: define contact{ contact_name fredworkhours alias Notifications to Fred during work hours host_notification_period something service_notification_period workhours host_notification_options n service_notification_options w,u,c,r email fred at someserver } Notifications during non-work hours: define contact{ contact_name frednonworkhours alias Notifications to Fred outside of work hours host_notification_period something service_notification_period nonworkhours host_notification_options n service_notification_options w,u email fred at someserver } > thanks for your replies Such a spoon-fed reply is given because I am feeling out of touch with my server atm :) -- Greg Vickers Project Manager, IT Security Information Technology Services Queensland University of Technology L12, 126 Margaret St, Brisbane Phone: (07) 3864 9536 Email: g.vickers at qut.edu.au IT Security web site: http://www.its.qut.edu.au/itsecurity/ CRICOS No. 00213J ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From b00mer at gmx.net Tue Oct 11 13:43:34 2005 From: b00mer at gmx.net (Hendrik Baecker) Date: Tue, 11 Oct 2005 13:43:34 +0200 Subject: HOSTSTATETYPE problem In-Reply-To: References: Message-ID: <434BA566.4040005@gmx.net> Tom?? Macek schrieb: > Hi, I'm writing an event handler in C. I have this code: > > int main(int argc, char *argv[]) { > netsnmp_pdu *response; > oid anOID[MAX_OID_LEN]; > size_t anOID_len; > int i = 0; > char ret_pass[9]; > int status = STATE_UNKNOWN, outlet = -1; > struct stat st; > > if (argc < 2) { > printf("Usage: routlet \n\thost - host to > check\n\toutlet - outlet to restart"); > return STATE_UNKNOWN; > } > syslog(LOG_NOTICE, "routlet: '%s' '%s' '%s'", argv[1], argv[2], > argv[3]); > > ... > > } > > > My /etc/nagios/commands.cfg contains this line: > > command[resetOutletSVIt]=$USER1$/routlet 192.168.1.1 1 > $HOSTSTATETYPE$ > > My /etc/nagios/hosts.cfg contains this: > host[ip1]=IP do > ven;192.168.1.2;;check-host-alive;5;60;24x7;1;1;0;resetOutletSVIt > > and the /var/log/messages contains this line, when the event is run: > > Oct 7 12:21:45 iks routlet: outlet: '192.168.1.1' '1' '$' > > Why the "$"??? It should contain the text like "SOFT" or "HARD", > shlouldn't it? > > Any help appreciated! > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, discussions, > and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. ::: Messages without supporting info will risk > being sent to /dev/null > > Double-posting won't help you, just read the answer that Marc gaves you.... RTFM: http://nagios.sourceforge.net/docs/1_0/macros.html ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From tom.deblende at dhl.com Tue Oct 11 13:51:20 2005 From: tom.deblende at dhl.com (Tom DE BLENDE (GCC)) Date: Tue, 11 Oct 2005 13:51:20 +0200 Subject: Weekly maintenance Message-ID: <9DD0C1E7667F4A4B86E847ED9B8755CC01272733@PRGDCEX002.prg-dc.dhl.com> Hi All, I want to have a weekly one hour window (what's in a word) of downtime to perform some maintenance on our Windows servers. What would be the most elegant way to do this? By trying to schedule a recurring downtime? Or by using Host directives? Your thoughts? Tom ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From robmossrm at aol.com Tue Oct 11 14:44:13 2005 From: robmossrm at aol.com (Rob Moss) Date: Tue, 11 Oct 2005 13:44:13 +0100 Subject: Max number of services that can be monitored ? In-Reply-To: <50695.127.0.0.1.1128985980.squirrel@www.goldenfields.co.nz> References: <50695.127.0.0.1.1128985980.squirrel@www.goldenfields.co.nz> Message-ID: <434BB39D.5010509@aol.com> misc at viceconsulting.co.nz wrote: >The Nagios server is a dedicated Nagios server, it is not doing anything >beyond Nagios. It is monitoring 60 hosts. Each host has 16 services >configured on it. Of those 16, 14 are run on the local host via >check_nrpe, the remaining 2 are run on the Nagios server and are check_ssh >and check_ntp (check_ntp is a Perl script.) > >The nagios is also self monitoring itself via check_disk, check_load, >check_procs and a couple of shell scripts. > >Just out of interest, if you have NOT enabled embedded Perl, then am I >correct in saying that Nagios will INVOKE a Perl script in the exact same >way as it will invoke a C plugin, such as check_nrpe ? ie it will do a >fork() and run a given program, be it Perl, Shell, C etc. > > Yep correct. Except that when you run the Perl script, it will load the Perl interpereter which in most systems is ~ 5mb and also read in a bunch of modules too. This is (probably) the reason why there's an option to compile in the Perl interpereter, to save all of those overheads. Some of the checks are wrappers around other programs like check_ping. This one runs /usr/bin/ping and evaluates the output. I would recommend replacing that with check_icmp (which works the same) as it's a compiled C program which does all the work itself. What are the standard checks you are doing? There may be alternate plugins to do the same job that are more efficient. Have you installed the nagios-plugins pack? >I have about 160MB of RAM free, and that amount has been free for several >days, swap hasn't been touched. > >Should I remove the NTP check, is it really going to be such an issue >given its only 1 of 16 services is using Perl? > > Lets see.. Every 5 minutes you check 60 hosts, so thats 60 times you invoke the perl interpereter.. even if it's cached, its still lots of extra processing that could be done more elegantly if you compiled in the perl interpereter with the nagios daemon. >With embedded Perl enabled, I was getting memory leaks which was >eventually crashing the Nagios server so I disabled it. > > Which version of perl, and what platform? x86? sparc? Older versions of perl (pre 5.6) had some problems being compiled in to apps such as mod_perl and stuff, but 5.6.1 or above "should" be fine. Memory leaks in Solaris wouldn't surprise me, but any of the Linuxes or BSD's would.. I'm runing Solaris 8 Sparc with Perl 5.6.1 builtin without any known leaks.. Cheers rob. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Bimschas at phase4.de Tue Oct 11 14:58:25 2005 From: Bimschas at phase4.de (Daniel Bimschas) Date: Tue, 11 Oct 2005 14:58:25 +0200 Subject: AW: Notification periods question Message-ID: <2CCEB1B7A93CF444AC4FE3BAB759962CE18D3C@p4-muc-nt5.intra.phase4.de> Hey Jonas, that solutions looks great, will try that later! Thanks a lot! -----Urspr?ngliche Nachricht----- Von: Karsch, Jonas [mailto:j.karsch at radeberger-gruppe.de] Gesendet: Dienstag, 11. Oktober 2005 14:54 An: Daniel Bimschas Betreff: AW: [Nagios-users] Notification periods question Hi, maybe you can do it with hostescalations: define hostescalation{ hostgroup_name Exchangeserver contact_groups exchange-admins first_notification 1 last_notification 0 notification_interval 120 escalation_options w,u,c,r escalation_period workhours } define hostescalation{ hostgroup_name Exchangeserver contact_groups exchange-admins first_notification 1 last_notification 0 notification_interval 120 escalation_options c,u escalation_period nonworkhours } regards Jonas Karsch IT-Service Tel.: +49 - (0)30 - 68992 - 254 -----Urspr?ngliche Nachricht----- Von: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net]Im Auftrag von Daniel Bimschas Gesendet: Di, 11. Oktober 2005 11:45 An: Greg Vickers; nagios-users at lists.sourceforge.net Betreff: AW: [Nagios-users] Notification periods question first, thank you for your answer, greg. but unfortunately i've come so far too. the problem now is that nagios reschedules the notifications that occur during non-notification hours for the specific contact. it then sends the notifications when the timeperiod is valid for this contact. with this i will always get notified about all problems that occured at night, in the night AND in the morning. my notifications will also be sent out via SMS, so it would be really stressy to get so many notifications. you also have an idea regarding this? thanks -----Urspr?ngliche Nachricht----- Von: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net]Im Auftrag von Greg Vickers Gesendet: Dienstag, 11. Oktober 2005 11:31 An: nagios-users at lists.sourceforge.net Betreff: Re: [Nagios-users] Notification periods question Hi Daniel, Daniel Bimschas wrote: > hi there! > > i have the following problem and it seems i can't solve it > without greater effort. > > i have a service that is checked 24x7. during "workhours", > meaning from 09:00 - 22:00 i want to notify always if the > service is warning, critical or unknown (w,u,c,r) and when > it recovers. > > at "non-workhours" (22:00-09:00) i want only to notify if > the service is critical or unknown. > > can i realise that with nagios without writing my own > notification plugin and how? Here's one way to do it: Notifications during work hours: define contact{ contact_name fredworkhours alias Notifications to Fred during work hours host_notification_period something service_notification_period workhours host_notification_options n service_notification_options w,u,c,r email fred at someserver } Notifications during non-work hours: define contact{ contact_name frednonworkhours alias Notifications to Fred outside of work hours host_notification_period something service_notification_period nonworkhours host_notification_options n service_notification_options w,u email fred at someserver } > thanks for your replies Such a spoon-fed reply is given because I am feeling out of touch with my server atm :) -- Greg Vickers Project Manager, IT Security Information Technology Services Queensland University of Technology L12, 126 Margaret St, Brisbane Phone: (07) 3864 9536 Email: g.vickers at qut.edu.au IT Security web site: http://www.its.qut.edu.au/itsecurity/ CRICOS No. 00213J ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Joseph.Petrucci at ddiworld.com Tue Oct 11 15:33:53 2005 From: Joseph.Petrucci at ddiworld.com (Petrucci, Joseph) Date: Tue, 11 Oct 2005 09:33:53 -0400 Subject: Notification periods question Message-ID: set up 2 different contacts Joseph A. Petrucci E Systems DBA ------------------------------------------------ Desk: 412-220-2646 Cell: 412-916-2867 Text message (e-mail) 4129162867 at mobile.att.net e-mail joseph.petrucci at ddiworld.com Personal Cell: 724-462-0443 Personal e-mail: japetrucci at hotmail.com ________________________________ From: nagios-users-admin at lists.sourceforge.net on behalf of Daniel Bimschas Sent: Tue 10/11/2005 3:28 AM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] Notification periods question hi there! i have the following problem and it seems i can't solve it without greater effort. i have a service that is checked 24x7. during "workhours", meaning from 09:00 - 22:00 i want to notify always if the service is warning, critical or unknown (w,u,c,r) and when it recovers. at "non-workhours" (22:00-09:00) i want only to notify if the service is critical or unknown. can i realise that with nagios without writing my own notification plugin and how? thanks for your replies -- Daniel Bimschas Entwicklung Phase 4 Communications GmbH Bayerstra?e 85 a 80335 M?nchen Fon +49.89.30 63 21-52 Fax +49.89.30 63 21-21 ISDN +49.89.30 72 93 33 M?nchen | Z?rich | Berlin www.phase4.de ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -------------- next part -------------- An HTML attachment was scrubbed... URL: From gabriele_d_g at yahoo.it Tue Oct 11 16:13:04 2005 From: gabriele_d_g at yahoo.it (Gabriele Di Giambelardini) Date: Tue, 11 Oct 2005 16:13:04 +0200 (CEST) Subject: check_snmp for disk space notification problem Message-ID: <20051011141304.70643.qmail@web26501.mail.ukl.yahoo.com> hi to all, hi have a problem with nagios notification for check_snmp for disk space. this is my check: $USER1$/check_snmp -H $HOSTADDRESS$ -w $ARG1$ -c $ARG2$ -C public -o $ARG3$. i tried also whith some switch "-l" but not work. when i launch the command i have an "OK" response or "DOWN" , but i received an email whit "HOST UP" or "HOST DOWN" and not warning or critical, and i can't see the space limit.... the email show an erroro with the %s switch some idea?? --------------------------------- Yahoo! Mail: gratis 1GB per i messaggi, antispam, antivirus, POP3 -------------- next part -------------- An HTML attachment was scrubbed... URL: From marc at ena.com Tue Oct 11 16:57:20 2005 From: marc at ena.com (Marc Powell) Date: Tue, 11 Oct 2005 09:57:20 -0500 Subject: Weekly maintenance Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Tom DE BLENDE (GCC) > Sent: Tuesday, October 11, 2005 6:51 AM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] Weekly maintenance > > Hi All, > > I want to have a weekly one hour window (what's in a word) of downtime > to perform some maintenance on our Windows servers. What would be the > most elegant way to do this? By trying to schedule a recurring downtime? > Or by using Host directives? > > Your thoughts? Create a time period definition that excludes your maintenance window (http://nagios.sourceforge.net/docs/1_0/xodtemplate.html#timeperiod) and assign it as the check_period for your services on that host (http://nagios.sourceforge.net/docs/1_0/xodtemplate.html). The services (and hence the hosts) will not be checked at all during that window. If they are still down at the end of the window you'll receive notifications as usual. -- Marc ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at pointpub.net Tue Oct 11 16:59:07 2005 From: nagios at pointpub.net (Sebastien Roy) Date: Tue, 11 Oct 2005 10:59:07 -0400 Subject: Nagios 2.0b4 on AIX 4.3.3 Message-ID: <434BD33B.90700@pointpub.net> Hey guys.... Someone here have compiled Nagios 2.0b4 succesfully on AIX 4.3.3 ? Is it possible to send me the binary? I got some probleme about GD when I compiled nagios on AIX 4.3.3 Thanks S?bastien Roy Administrateur de Syst?mes Senior / Senior System Administrator PointPub Communications. - Laval, Canada (514) 867-1079 Sebastien.Roy at PointPub.NET www.pointpub.net ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From hamid.forough at amd.com Tue Oct 11 17:45:47 2005 From: hamid.forough at amd.com (Forough, Hamid) Date: Tue, 11 Oct 2005 10:45:47 -0500 Subject: PLease help Message-ID: <360C39FD92313F48A74A60471FA38E87013559F0@SAUSEXMB1.amd.com> I need to know how to write a custom plugin that checks results of a stored procedure in MSSQL 2k and returns Yes or NO? I am using 2b4 verision of Nagios Thanks, Hamid Advanced Micro Devices W: 512.602.4704 C: 512.971-5345 E: hamid.forough at amd.com -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image001.gif Type: image/gif Size: 2669 bytes Desc: image001.gif URL: From marc at ena.com Tue Oct 11 17:51:18 2005 From: marc at ena.com (Marc Powell) Date: Tue, 11 Oct 2005 10:51:18 -0500 Subject: PLease help Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Forough, Hamid > Sent: Tuesday, October 11, 2005 10:46 AM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] PLease help > > I need to know how to write a custom plugin that checks results of a > stored procedure in MSSQL 2k and returns Yes or NO? I am using 2b4 > verision of Nagios Presuming you already know how to perform your query from a script this should be all you need -- http://nagiosplug.sourceforge.net/developer-guidelines.html -- Marc ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From hamid.forough at amd.com Tue Oct 11 18:26:38 2005 From: hamid.forough at amd.com (Forough, Hamid) Date: Tue, 11 Oct 2005 11:26:38 -0500 Subject: PLease help Message-ID: <360C39FD92313F48A74A60471FA38E87013559F4@SAUSEXMB1.amd.com> Thanks for that. I have almost completed it except for following: I have a custom shell that checks the existence of a file on linux and sends the correct result. This is the piece of the code that I have question: . . . if test -e "/usr/lib/nagios/plugins/out.txt" then echo "Jobs XXFAILED" rm -f /usr/lib/nagios/plugins/out.txt exit 2 else echo "Jobs XXSUCCEEDED" rm -f /usr/lib/nagios/plugins/out.txt exit 0 fi The problem is when I run this file directly from linux, I get the echo correctly and the files gets deleted. But when ran from Nagios, the echo gets returned BUT IT DOES NOT DELETE THE FILE. Do u know why? -----Original Message----- From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Marc Powell Sent: Tuesday, October 11, 2005 10:51 AM To: nagios-users at lists.sourceforge.net Subject: RE: [Nagios-users] PLease help > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Forough, Hamid > Sent: Tuesday, October 11, 2005 10:46 AM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] PLease help > > I need to know how to write a custom plugin that checks results of a > stored procedure in MSSQL 2k and returns Yes or NO? I am using 2b4 > verision of Nagios Presuming you already know how to perform your query from a script this should be all you need -- http://nagiosplug.sourceforge.net/developer-guidelines.html -- Marc ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From robmossrm at aol.com Tue Oct 11 18:34:50 2005 From: robmossrm at aol.com (Rob Moss) Date: Tue, 11 Oct 2005 17:34:50 +0100 Subject: PLease help In-Reply-To: <360C39FD92313F48A74A60471FA38E87013559F4@SAUSEXMB1.amd.com> References: <360C39FD92313F48A74A60471FA38E87013559F4@SAUSEXMB1.amd.com> Message-ID: <434BE9AA.2080304@aol.com> Forough, Hamid wrote: >The problem is when I run this file directly from linux, I get the echo >correctly and the files gets deleted. But when ran from Nagios, the echo >gets returned BUT IT DOES NOT DELETE THE FILE. Do u know why? > > Probably because you're writing the file as root and the user nagios doesn't have the right permissions to delete the file.. rm -f will silently fail deleting the file. You can evaluate the rm command's error code $? to see if it's 0 (success) or anything else (failed, reason code). Cheers rob ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From hmann at itgroundwork.com Tue Oct 11 18:36:42 2005 From: hmann at itgroundwork.com (Harper Mann) Date: Tue, 11 Oct 2005 09:36:42 -0700 Subject: PLease help In-Reply-To: <360C39FD92313F48A74A60471FA38E87013559F4@SAUSEXMB1.amd.com> References: <360C39FD92313F48A74A60471FA38E87013559F4@SAUSEXMB1.amd.com> Message-ID: <434BEA1A.2070003@itgroundwork.com> Did you try running the script from user "nagios"? It's often a permission problem. Also, you generally want to put the full path to commands in the script so the env is good: /bin/rm /bin/echo Etcetera... Regards, - Harper Harper Mann Groundwork Open Source Solutions 510-599-2075 (cell) Forough, Hamid wrote: Thanks for that. I have almost completed it except for following: I have a custom shell that checks the existence of a file on linux and sends the correct result. This is the piece of the code that I have question: . . . if test -e "/usr/lib/nagios/plugins/out.txt" then echo "Jobs XXFAILED" rm -f /usr/lib/nagios/plugins/out.txt exit 2 else echo "Jobs XXSUCCEEDED" rm -f /usr/lib/nagios/plugins/out.txt exit 0 fi The problem is when I run this file directly from linux, I get the echo correctly and the files gets deleted. But when ran from Nagios, the echo gets returned BUT IT DOES NOT DELETE THE FILE. Do u know why? -----Original Message----- From: [1]nagios-users-admin at lists.sourceforge.net [[2]mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Marc Powell Sent: Tuesday, October 11, 2005 10:51 AM To: [3]nagios-users at lists.sourceforge.net Subject: RE: [Nagios-users] PLease help -----Original Message----- From: [4]nagios-users-admin at lists.sourceforge.net [[5]mailto:nagios-users- [6]admin at lists.sourceforge.net] On Behalf Of Forough, Hamid Sent: Tuesday, October 11, 2005 10:46 AM To: [7]nagios-users at lists.sourceforge.net Subject: [Nagios-users] PLease help I need to know how to write a custom plugin that checks results of a stored procedure in MSSQL 2k and returns Yes or NO? I am using 2b4 verision of Nagios Presuming you already know how to perform your query from a script this should be all you need -- [8]http://nagiosplug.sourceforge.net/developer-guidelines.html -- Marc ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. [9]http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list [10]Nagios-users at lists.sourceforge.net [11]https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. [12]http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list [13]Nagios-users at lists.sourceforge.net [14]https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting an y issue. ::: Messages without supporting info will risk being sent to /dev/null References 1. mailto:nagios-users-admin at lists.sourceforge.net 2. mailto:nagios-users-admin at lists.sourceforge.net 3. mailto:nagios-users at lists.sourceforge.net 4. mailto:nagios-users-admin at lists.sourceforge.net 5. mailto:nagios-users 6. mailto:admin at lists.sourceforge.net 7. mailto:nagios-users at lists.sourceforge.net 8. http://nagiosplug.sourceforge.net/developer-guidelines.html 9. http://solutions.newsforge.com/ibmarch.tmpl 10. mailto:Nagios-users at lists.sourceforge.net 11. https://lists.sourceforge.net/lists/listinfo/nagios-users 12. http://solutions.newsforge.com/ibmarch.tmpl 13. mailto:Nagios-users at lists.sourceforge.net 14. https://lists.sourceforge.net/lists/listinfo/nagios-users -------------- next part -------------- An HTML attachment was scrubbed... URL: From Joseph.Petrucci at ddiworld.com Tue Oct 11 18:57:31 2005 From: Joseph.Petrucci at ddiworld.com (Petrucci, Joseph) Date: Tue, 11 Oct 2005 12:57:31 -0400 Subject: PLease help Message-ID: I do this with Sybase and it should still work this way i MSSQL 2K. I would use a passive check (info on this is in the manual) that reads the errorlog. Then you will have to have your procedure execute a raiserror in the event of failure or any other message you want. Have the plugin look at the error log for specific error numbers that you define in the master.dbo.syserrors table (If there still is one) Joseph A. Petrucci E Systems DBA ------------------------------------------------ Desk: 412-220-2646 Cell: 412-916-2867 Text message (e-mail) 4129162867 at mobile.att.net e-mail joseph.petrucci at ddiworld.com Personal Cell: 724-462-0443 Personal e-mail: japetrucci at hotmail.com ________________________________ From: nagios-users-admin at lists.sourceforge.net on behalf of Forough, Hamid Sent: Tue 10/11/2005 11:45 AM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] PLease help I need to know how to write a custom plugin that checks results of a stored procedure in MSSQL 2k and returns Yes or NO? I am using 2b4 verision of Nagios Thanks, Hamid Advanced Micro Devices W: 512.602.4704 C: 512.971-5345 E: hamid.forough at amd.com -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image001.gif Type: image/gif Size: 2669 bytes Desc: image001.gif URL: From pbc at phhp.ufl.edu Tue Oct 11 19:06:57 2005 From: pbc at phhp.ufl.edu (Philip Chase) Date: Tue, 11 Oct 2005 13:06:57 -0400 Subject: USERn macros in 2.0b4 Message-ID: I am trying to use user-definable macros in Nagios 2.0b4 with no luck. This line: $USER1$=/usr/lib/nagios/plugins produces this error on my preflight check: Error: Unexpected token or statement in file '/etc/nagios/local/resource.cfg' on line 26. Did the syntax for defining user-definable macros change in 2.x? I can't find any text in the 2.x manual that says how to define these macros, but given 2.x's penchant for syntax like define thingie { thingie_name foo thingie_value bar } I bet the "thingie_name=thingie_value" syntax is now completely obsolete. If this is an RTFM can someone give me a URL, because I just can't find it. Philip ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Tue Oct 11 19:17:01 2005 From: marc at ena.com (Marc Powell) Date: Tue, 11 Oct 2005 12:17:01 -0500 Subject: USERn macros in 2.0b4 Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Philip Chase > Sent: Tuesday, October 11, 2005 12:07 PM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] USERn macros in 2.0b4 > > I am trying to use user-definable macros in Nagios 2.0b4 with no luck. > This line: > > $USER1$=/usr/lib/nagios/plugins > > produces this error on my preflight check: > > Error: Unexpected token or statement in file > '/etc/nagios/local/resource.cfg' on line 26. > > Did the syntax for defining user-definable macros change in 2.x? I can't No. Is /etc/nagios/local/resource.cfg defined as a cfg_file (incorrect) or resource_file (correct) in nagios.cfg? http://nagios.sourceforge.net/docs/2_0/configmain.html#resource_file -- Marc ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From pbc at phhp.ufl.edu Tue Oct 11 19:25:08 2005 From: pbc at phhp.ufl.edu (Philip Chase) Date: Tue, 11 Oct 2005 13:25:08 -0400 Subject: USERn macros in 2.0b4 Message-ID: OK, I get it now. That file is special and has a special variable name. I moved it back to /etc/nagios, adjusted /etc/nagios/nagios.conf to say "resource_file=/etc/nagios/resource.cfg" and now everything is happy. Thanks, Philip >>> "Marc Powell" 10/11/05 1:17 PM >>> > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Philip Chase > Sent: Tuesday, October 11, 2005 12:07 PM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] USERn macros in 2.0b4 > > I am trying to use user-definable macros in Nagios 2.0b4 with no luck. > This line: > > $USER1$=/usr/lib/nagios/plugins > > produces this error on my preflight check: > > Error: Unexpected token or statement in file > '/etc/nagios/local/resource.cfg' on line 26. > > Did the syntax for defining user-definable macros change in 2.x? I can't No. Is /etc/nagios/local/resource.cfg defined as a cfg_file (incorrect) or resource_file (correct) in nagios.cfg? http://nagios.sourceforge.net/docs/2_0/configmain.html#resource_file -- Marc ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jzelazny at sandboxdev.com Tue Oct 11 22:46:20 2005 From: jzelazny at sandboxdev.com (Jennifer Zelazny) Date: Tue, 11 Oct 2005 15:46:20 -0500 Subject: Notifications using AIM Message-ID: <978E25C8-3C09-4A0E-9ED7-CCBDC6F336FB@sandboxdev.com> Hello. Has anyone had success using AIM using the script referenced here: http://nagios.org/faqs/viewfaq.php?faq_id=33? I have tried to use the script, but when testing it, I get the following errors: Can't connect to toc.oscar.aol.com:9898! at /Library/Perl/5.8.6/Net/ AIM.pm line 141 Can't connect to AIM server. (in cleanup) No method called "handler" for object. at /usr/ local/nagios/libexec/aol_im.pl line 0 Any thoughts would be appreciated. Thanks, Jen ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Sanjay.Sangavikar at ge.com Wed Oct 12 00:02:10 2005 From: Sanjay.Sangavikar at ge.com (Sangavikar, Sanjay (GE Healthcare, non-ge)) Date: Tue, 11 Oct 2005 17:02:10 -0500 Subject: please email checkcommands.cfg and services.cfg Message-ID: <23583238FE8CF044B2C2484E8306DAEB07487DCC@MKEMLVEM05.e2k.ad.ge.com> Hi, Please some one email your working checkcommands.cfg and services.cfg configuration files. i am having dificulties to understand NRPE command execution on sun solaris server. Thanks in advance. San ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From stalinsk at gmail.com Wed Oct 12 00:52:59 2005 From: stalinsk at gmail.com (Stalin) Date: Tue, 11 Oct 2005 15:52:59 -0700 Subject: Check diskspace on remote machines Message-ID: Hi, Just got nagios (1.2) installed and configured with 1.4.2 plugins. I'd few hosts/service checks configured. All works fine. Now i'd like to do the same for remote machines and wondering if anyone could share the plugin to monitor diskspace on remote machines or any pointers is greatly appreciated. Thanks, Stalin ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From drussell at mpifix.com Wed Oct 12 01:59:55 2005 From: drussell at mpifix.com (Danny Russell) Date: Tue, 11 Oct 2005 17:59:55 -0600 Subject: Check diskspace on remote machines Message-ID: <028B223BB24E5443A9784BBE1CC5EBE05B983F@Lisa.mpifix.com> Look into NRPE, Nagios Remote Plug-in Executor. If you need to check windows box there is a NRPE-NT as well. -----Original Message----- From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Stalin Sent: Tuesday, October 11, 2005 4:53 PM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] Check diskspace on remote machines Hi, Just got nagios (1.2) installed and configured with 1.4.2 plugins. I'd few hosts/service checks configured. All works fine. Now i'd like to do the same for remote machines and wondering if anyone could share the plugin to monitor diskspace on remote machines or any pointers is greatly appreciated. Thanks, Stalin ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From g.vickers at qut.edu.au Wed Oct 12 01:59:35 2005 From: g.vickers at qut.edu.au (Greg Vickers) Date: Wed, 12 Oct 2005 09:59:35 +1000 Subject: Nagios 2.0b4 on AIX 4.3.3 In-Reply-To: <434BD33B.90700@pointpub.net> References: <434BD33B.90700@pointpub.net> Message-ID: <434C51E7.9020002@qut.edu.au> Hello Sebastien, Sebastien Roy wrote: > Hey guys.... > > Someone here have compiled Nagios 2.0b4 succesfully on AIX 4.3.3 ? > Is it possible to send me the binary? I got some probleme about GD when > I compiled nagios on AIX 4.3.3 A MUCH (much, much, much) safer option is to install the GD libraries yourself and compile Nagios successfully on your AIX server. Say I was feeling malicious, I might compile the binary for the AIX platform but include something nasty like a memory leak or something that deletes random files. Accepting binaries compiled by strangers is just like accepting candy from strangers - fraught with danger! -- Greg Vickers Project Manager, IT Security Information Technology Services Queensland University of Technology L12, 126 Margaret St, Brisbane Phone: (07) 3864 9536 Email: g.vickers at qut.edu.au IT Security web site: http://www.its.qut.edu.au/itsecurity/ CRICOS No. 00213J ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From teng at dataway.com Wed Oct 12 02:23:10 2005 From: teng at dataway.com (Tedman Eng) Date: Tue, 11 Oct 2005 17:23:10 -0700 Subject: Check diskspace on remote machines Message-ID: <37ED92F9890FAF4BB947613C66FF8B1A08BB2BDF@dw-mail.dataway.com> FAQ # F0059: How do I monitor services on remote hosts? http://www.nagios.org/faqs/viewfaq.php?faq_id=59&expand=false&showdesc=false As for pointers: 1. SNMP is easiest if you have it installed 2. ssh is most secure if configured properly 3. nrpe is most similar to native nagios configuration directives > -----Original Message----- > From: Stalin [mailto:stalinsk at gmail.com] > Sent: Tuesday, October 11, 2005 3:53 PM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] Check diskspace on remote machines > > > Hi, > > Just got nagios (1.2) installed and configured with 1.4.2 plugins. I'd > few hosts/service checks configured. All works fine. Now i'd like to > do the same for remote machines and wondering if anyone could share > the plugin to monitor diskspace on remote machines or any pointers is > greatly appreciated. > > Thanks, > Stalin > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, > discussions, > and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS > when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From VINAY_SHARMA at advanex.co.jp Wed Oct 12 04:06:45 2005 From: VINAY_SHARMA at advanex.co.jp (VINAY_SHARMA at advanex.co.jp) Date: Wed, 12 Oct 2005 11:06:45 +0900 Subject: Service Flapping... Message-ID: Hi all, i want to know about services flapping?? what means of comment of service fappling..like Service State Information Current Status: ??OK Status Information: PING OK - Packet loss = 0%, RTA = 34.38 ms Current Attempt: 1/3 State Type: HARD Last Check Type: ACTIVE Last Check Time: 10-12-2005 11:00:15 Status Data Age: 0d 0h 1m 39s Next Scheduled Active Check: 10-12-2005 11:05:15 Latency: < 1 second Check Duration: 5 seconds Last State Change: 10-12-2005 10:55:27 Current State Duration: 0d 0h 6m 27s Last Service Notification: N/A Current Notification Number: 0 Is This Service Flapping? ??YES Percent State Change: 28.75% In Scheduled Downtime? ??NO Last Update: 10-12-2005 11:01:47 Entry Time Author Comment Comment ID Persist ent Actions 10-12-2005 10:45:33 (Nagios Process) Notifications for this service are being supressed because it was detected as having been flapping between different states (24.2% change > 20.0% threshold). When the service state stabilizes and the flapping stops, notifications will be re-enabled. 1 No (Embedded image moved to file: pic19264.gif) Thanks & regards ************************************** Vinay Sharma Information Systems and Solutions-Associates Advanex Inc (www.advanex.co.jp) Fon : 813-3822-5863 Fax : 813-5815-7881 Email : vinay_sharma at advanex.co.jp -------------- next part -------------- A non-text attachment was scrubbed... Name: pic19264.gif Type: image/gif Size: 1124 bytes Desc: not available URL: From sghosh at sghosh.org Wed Oct 12 04:57:16 2005 From: sghosh at sghosh.org (Subhendu Ghosh) Date: Tue, 11 Oct 2005 22:57:16 -0400 (EDT) Subject: Service Flapping... In-Reply-To: References: Message-ID: On Wed, 12 Oct 2005 VINAY_SHARMA at advanex.co.jp wrote: > > > Hi all, > i want to know about services flapping?? what means of comment of > service fappling..like > > > Actions > 10-12-2005 10:45:33 > (Nagios Process) > Notifications for > this service are > being supressed > because it was > detected as > having been > flapping between > different states > (24.2% change > > 20.0% threshold). > When the service > state stabilizes > and the flapping > stops, > notifications > will be > re-enabled. > 1 > No 24% state changes have occurred over the last 21 service checks... http://nagios.sourceforge.net/docs/1_0/flapping.html -- ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jjk_saji at yahoo.com Wed Oct 12 07:22:52 2005 From: jjk_saji at yahoo.com (John Joseph) Date: Wed, 12 Oct 2005 06:22:52 +0100 (BST) Subject: check_snmp for disk space notification problem In-Reply-To: <20051011141304.70643.qmail@web26501.mail.ukl.yahoo.com> References: <20051011141304.70643.qmail@web26501.mail.ukl.yahoo.com> Message-ID: <20051012052253.9325.qmail@web34805.mail.mud.yahoo.com> Hi Looking through your command definition , I am little confused why u are passing the community string , it should be something like -C $ARG1$ , not -C community-string , later when u define services , pass the value using ?Check_command_arguments? I am adding my command definition and services definition , please check the notification option in services.cfg Thanks joseph John from "checkcommands.cfg" # 'SnmpDisk' command definition 54 define command{ command_name SnmpDisk command_line $USER1$/check_snmp_storage.pl -H $HOSTADDRESS$ -C $ARG1$ -m $ARG2$ -w $ARG3$ -c $ARG4$ } # 'snmpmodified' command definition 55 define command{ command_name snmpmodified command_line $USER1$/check_snmp_storage.pl -H $HOSTADDRESS$ -C $ARG1$ -m $ARG2$ -w $ARG3$ -c $ARG4$ } ***************************** $USER1$/check_snmp_storage.pl -H $HOSTADDRESS$ -C $ARG1$ -m $ARG2$ -w $ARG3$ -c $ARG4$ ***************************** Service.cfg # 'SNMP Disk /OPT1 Values' service definition 108 # # #service_id 277 define service{ service_description SNMP Disk /OPT1 Values host_name devdb check_command snmpmodified!john!/opt1!80!95 max_check_attempts 20 normal_check_interval 3 retry_check_interval 20 active_checks_enabled 1 check_period 24x7 freshness_threshold 0 low_flap_threshold 0 high_flap_threshold 0 notification_interval 20 notification_period 24x7 notification_options w,u,c,r notifications_enabled 1 contact_groups IT-Support stalking_options o,w,u,c } --- Gabriele Di Giambelardini wrote: > hi to all, > hi have a problem with nagios notification for > check_snmp for disk space. > this is my check: > $USER1$/check_snmp -H $HOSTADDRESS$ -w $ARG1$ -c > $ARG2$ -C public -o $ARG3$. > i tried also whith some switch "-l" but not work. > when i launch the command i have an "OK" response or > "DOWN" , but i received an email whit "HOST UP" or > "HOST DOWN" and not warning or critical, and i can't > see the space limit.... > the email show an erroro with the %s switch > some idea?? > > > > --------------------------------- > Yahoo! Mail: gratis 1GB per i messaggi, antispam, > antivirus, POP3 ___________________________________________________________ To help you stay safe and secure online, we've developed the all new Yahoo! Security Centre. http://uk.security.yahoo.com ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From hamid.forough at amd.com Wed Oct 12 16:18:14 2005 From: hamid.forough at amd.com (Forough, Hamid) Date: Wed, 12 Oct 2005 09:18:14 -0500 Subject: PLease help Message-ID: <360C39FD92313F48A74A60471FA38E87013559FC@SAUSEXMB1.amd.com> I don't understand. Can you elaborate please? ________________________________ From: Petrucci, Joseph [mailto:Joseph.Petrucci at ddiworld.com] Sent: Tuesday, October 11, 2005 11:58 AM To: Forough, Hamid; nagios-users at lists.sourceforge.net Subject: RE: [Nagios-users] PLease help I do this with Sybase and it should still work this way i MSSQL 2K. I would use a passive check (info on this is in the manual) that reads the errorlog. Then you will have to have your procedure execute a raiserror in the event of failure or any other message you want. Have the plugin look at the error log for specific error numbers that you define in the master.dbo.syserrors table (If there still is one) Joseph A. Petrucci E Systems DBA ------------------------------------------------ Desk: 412-220-2646 Cell: 412-916-2867 Text message (e-mail) 4129162867 at mobile.att.net e-mail joseph.petrucci at ddiworld.com Personal Cell: 724-462-0443 Personal e-mail: japetrucci at hotmail.com ________________________________ From: nagios-users-admin at lists.sourceforge.net on behalf of Forough, Hamid Sent: Tue 10/11/2005 11:45 AM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] PLease help I need to know how to write a custom plugin that checks results of a stored procedure in MSSQL 2k and returns Yes or NO? I am using 2b4 verision of Nagios Thanks, Hamid Advanced Micro Devices W: 512.602.4704 C: 512.971-5345 E: hamid.forough at amd.com -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image001.gif Type: image/gif Size: 2669 bytes Desc: image001.gif URL: From Joseph.Petrucci at ddiworld.com Wed Oct 12 16:35:30 2005 From: Joseph.Petrucci at ddiworld.com (Petrucci, Joseph) Date: Wed, 12 Oct 2005 10:35:30 -0400 Subject: PLease help Message-ID: There is a SQL function called raiseerror it will write an error that you definewithin the SQL server to the errorlog. This is in Sybase but I thionk it is still in MSSQL. you can find information on it in the admin manual. I use this functio inside my stored procedures to put the results of the procedure in the errorlog. I then use a passive check in Nagios to read the errorlog (I wrote this in ActivePerl) and look for certain error codes. The passive plugin passes the results to Nagios every 5 minutes and Nagios sends a critical alert if any of the errors are present in the errorlog for the given period. -----Original Message----- From: Forough, Hamid [mailto:hamid.forough at amd.com] Sent: Wednesday, October 12, 2005 10:18 AM To: Petrucci, Joseph; nagios-users at lists.sourceforge.net Subject: RE: [Nagios-users] PLease help I don't understand. Can you elaborate please? _____ From: Petrucci, Joseph [mailto:Joseph.Petrucci at ddiworld.com] Sent: Tuesday, October 11, 2005 11:58 AM To: Forough, Hamid; nagios-users at lists.sourceforge.net Subject: RE: [Nagios-users] PLease help I do this with Sybase and it should still work this way i MSSQL 2K. I would use a passive check (info on this is in the manual) that reads the errorlog. Then you will have to have your procedure execute a raiserror in the event of failure or any other message you want. Have the plugin look at the error log for specific error numbers that you define in the master.dbo.syserrors table (If there still is one) Joseph A. Petrucci E Systems DBA ------------------------------------------------ Desk: 412-220-2646 Cell: 412-916-2867 Text message (e-mail) 4129162867 at mobile.att.net e-mail joseph.petrucci at ddiworld.com Personal Cell: 724-462-0443 Personal e-mail: japetrucci at hotmail.com _____ From: nagios-users-admin at lists.sourceforge.net on behalf of Forough, Hamid Sent: Tue 10/11/2005 11:45 AM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] PLease help I need to know how to write a custom plugin that checks results of a stored procedure in MSSQL 2k and returns Yes or NO? I am using 2b4 verision of Nagios Thanks, Hamid Advanced Micro Devices W: 512.602.4704 C: 512.971-5345 E: hamid.forough at amd.com -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image001.gif Type: image/gif Size: 2669 bytes Desc: image001.gif URL: From b00mer at gmx.net Wed Oct 12 18:48:06 2005 From: b00mer at gmx.net (Hendrik Baecker) Date: Wed, 12 Oct 2005 18:48:06 +0200 Subject: check_snmp for disk space notification problem In-Reply-To: <20051011141304.70643.qmail@web26501.mail.ukl.yahoo.com> References: <20051011141304.70643.qmail@web26501.mail.ukl.yahoo.com> Message-ID: <434D3E46.108@gmx.net> Hi, am I right? When your service is down, your command gives you an CRITICAL, but Nagios self tells you an Host DOWN Alert? I am just guessing, but may it be, that your host just have this single service check? Do you have a normal host-check and are your host checks enabled? Gabriele Di Giambelardini schrieb: > hi to all, > hi have a problem with nagios notification for check_snmp for disk space. > this is my check: > $USER1$/check_snmp -H $HOSTADDRESS$ -w $ARG1$ -c $ARG2$ -C public -o > $ARG3$. > i tried also whith some switch "-l" but not work. > when i launch the command i have an "OK" response or "DOWN" , but i > received an email whit "HOST UP" or "HOST DOWN" and not warning or > critical, and i can't see the space limit.... > the email show an erroro with the %s switch > some idea?? > > ------------------------------------------------------------------------ > *Yahoo! Mail* > : > gratis 1GB per i messaggi, antispam, antivirus, POP3 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ladams at cloudmark.com Wed Oct 12 19:18:17 2005 From: ladams at cloudmark.com (Lori Adams) Date: Wed, 12 Oct 2005 10:18:17 -0700 Subject: nrpe install on solaris 5.8 Message-ID: When attempting to run nrpe-2.2 after installing on a solaris 5.8 box, I receive an error: # /srv/nagios_nrpe/bin/nrpe -c /srv/nagios_nrpe/etc/nrpe.cfg -d ld.so.1: /srv/nagios_nrpe/bin/nrpe: fatal: libssl.so.0.9.6: open failed: No such file or directory Killed Here is the information I've gathered about the machine. And some futile attempts at trying to fix the issue. # uname -a SunOS 5.8 Generic_108528-13 sun4u sparc SUNW,UltraAX-i2 Solaris I have nrpe-2.2 downloaded from Andreas Ericsson's site. Once I started receiving the error, I went to find libssl.so.0.9.6: # find / -name libssl.so.0.9.6 /usr/local/ssl/lib/libssl.so.0.9.6 So I am now running the configure script with these arguments. # ./configure --enable-command-args --with-ssl-lib=/usr/local/ssl/lib/ --with-ssl-inc=/usr/local/ssl/include/ The compile runs with no errors. I then do a "make all" # make all cd ./src/; make ; cd .. make[1]: Entering directory `/export/spare/srv/tmp/nrpe-2.2/src' gcc -g -O2 -I/usr/local/ssl/include/openssl -I/usr/local/ssl/include -DHAVE_CONFIG_H -o nrpe nrpe.c utils.c -L/usr/local/ssl/lib/ -lssl -lcrypt o -lnsl -lsocket gcc -g -O2 -I/usr/local/ssl/include/openssl -I/usr/local/ssl/include -DHAVE_CONFIG_H -o check_nrpe check_nrpe.c utils.c check_utils.c -L/usr/loc al/ssl/lib/ -lssl -lcrypto -lnsl -lsocket make[1]: Leaving directory `/export/spare/srv/tmp/nrpe-2.2/src' I don't see any errors there. I then cp the nrpe binary to the dir that I want to run it in along with the nrpe.cfg file that works on other solaris boxen. I then attempt to run it and get the aforementioned error. # /srv/nagios_nrpe/bin/nrpe -c /srv/nagios_nrpe/etc/nrpe.cfg -d ld.so.1: /srv/nagios_nrpe/bin/nrpe: fatal: libssl.so.0.9.6: open failed: No such file or directory Killed I think that I am missing something fundamental, but I can't quite get my finger on it. Does anyone have any thoughts? Thanks. -Lori -------------- next part -------------- An HTML attachment was scrubbed... URL: From robmossrm at aol.com Wed Oct 12 19:24:31 2005 From: robmossrm at aol.com (Rob Moss) Date: Wed, 12 Oct 2005 18:24:31 +0100 Subject: nrpe install on solaris 5.8 In-Reply-To: References: Message-ID: <434D46CF.5030304@aol.com> Lori Adams wrote: > When attempting to run nrpe-2.2 after installing on a solaris 5.8 box, > I receive an error: > > # /srv/nagios_nrpe/bin/nrpe -c /srv/nagios_nrpe/etc/nrpe.cfg -d > > ld.so.1: /srv/nagios_nrpe/bin/nrpe: fatal: libssl.so.0.9.6: open > failed: No such file or directory > > Killed > You don't have the library path /usr/local/ssl/lib in your LD_LIBRARY_PATH environment variable, or you don't have it configured in the OS level library system using 'crle'. if you do 'ldd /srv/nagios_nrpe/bin/nrpe' it will display libssl.so.0.9.6 as or something similar.. You can do: LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/usr/local/ssl/lib export LD_LIBRARY_PATH Add that to your NRPE startup script, and run that in your shell to make it work now. Cheers rob. -------------- next part -------------- An HTML attachment was scrubbed... URL: From todd_barbera at wgbh.org Wed Oct 12 19:28:47 2005 From: todd_barbera at wgbh.org (Todd Barbera) Date: Wed, 12 Oct 2005 13:28:47 -0400 Subject: nrpe install on solaris 5.8 In-Reply-To: References: Message-ID: <002701c5cf52$67633530$620a010a@GRAPEAPE> Hi Lori, Try setting your LD_LIBRARY_PATH. You can use "crle" to do so. I.e., crle -l /usr/lib:/usr/local/lib:/usr/local/ssl/lib:/usr/ccs/lib Todd -----Original Message----- From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Lori Adams Sent: Wednesday, October 12, 2005 1:18 PM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] nrpe install on solaris 5.8 When attempting to run nrpe-2.2 after installing on a solaris 5.8 box, I receive an error: # /srv/nagios_nrpe/bin/nrpe -c /srv/nagios_nrpe/etc/nrpe.cfg -d ld.so.1: /srv/nagios_nrpe/bin/nrpe: fatal: libssl.so.0.9.6: open failed: No such file or directory Killed Here is the information I've gathered about the machine. And some futile attempts at trying to fix the issue. # uname -a SunOS 5.8 Generic_108528-13 sun4u sparc SUNW,UltraAX-i2 Solaris I have nrpe-2.2 downloaded from Andreas Ericsson's site. Once I started receiving the error, I went to find libssl.so.0.9.6: # find / -name libssl.so.0.9.6 /usr/local/ssl/lib/libssl.so.0.9.6 So I am now running the configure script with these arguments. # ./configure --enable-command-args --with-ssl-lib=/usr/local/ssl/lib/ --with-ssl-inc=/usr/local/ssl/include/ The compile runs with no errors. I then do a "make all" # make all cd ./src/; make ; cd .. make[1]: Entering directory `/export/spare/srv/tmp/nrpe-2.2/src' gcc -g -O2 -I/usr/local/ssl/include/openssl -I/usr/local/ssl/include -DHAVE_CONFIG_H -o nrpe nrpe.c utils.c -L/usr/local/ssl/lib/ -lssl -lcrypt o -lnsl -lsocket gcc -g -O2 -I/usr/local/ssl/include/openssl -I/usr/local/ssl/include -DHAVE_CONFIG_H -o check_nrpe check_nrpe.c utils.c check_utils.c -L/usr/loc al/ssl/lib/ -lssl -lcrypto -lnsl -lsocket make[1]: Leaving directory `/export/spare/srv/tmp/nrpe-2.2/src' I don't see any errors there. I then cp the nrpe binary to the dir that I want to run it in along with the nrpe.cfg file that works on other solaris boxen. I then attempt to run it and get the aforementioned error. # /srv/nagios_nrpe/bin/nrpe -c /srv/nagios_nrpe/etc/nrpe.cfg -d ld.so.1: /srv/nagios_nrpe/bin/nrpe: fatal: libssl.so.0.9.6: open failed: No such file or directory Killed I think that I am missing something fundamental, but I can't quite get my finger on it. Does anyone have any thoughts? Thanks. -Lori -------------- next part -------------- An HTML attachment was scrubbed... URL: From tjn at umn.edu Wed Oct 12 19:48:40 2005 From: tjn at umn.edu (Travis J. Noll) Date: Wed, 12 Oct 2005 12:48:40 -0500 Subject: Checking Traffic via SNMP Message-ID: <434D4C78.3020206@umn.edu> Does anyone know of a plugin I can use to monitor bandwidth utilization of a remote host via snmp? I can use check_snmp to obtain total bytes in either direction, but I'm more interested in some sort of aggregation of bytes/sec. Perhaps an average bytes/sec over an N second interval. Anything like that? I've recently found a check_snmp_if perl script and I'm looking to extend that to poll ifInOctets and ifOutOctets over an interval and averaging. Any thoughts? Much Thanks, -Travis ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gardealuis at gmail.com Wed Oct 12 19:54:48 2005 From: gardealuis at gmail.com (Luis Gardea) Date: Wed, 12 Oct 2005 11:54:48 -0600 Subject: check_wins.pl Message-ID: <44f056cc0510121054o27ef97dfr438a51f2226070b1@mail.gmail.com> Hi Somebody has worked with check_wins.pl, I've tried but allways I get the next message: [root at netids libexec]# ./check_wins.pl -W netdns -D uacj Failed. WINS "netdns" failed to resolve at least one of "dc1 dc2", the domain controller(s) of "uacj". Got "name_query failed to find name dc1#20 name_query failed to find name dc2#20" Any Ideas for that? Thanks. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From aleksey at loehmanns.com Wed Oct 12 19:56:46 2005 From: aleksey at loehmanns.com (Aleksey Domorad) Date: Wed, 12 Oct 2005 13:56:46 -0400 Subject: PLease help Message-ID: Hi all, In the contrib folder there's a check_mssql plug-in. u can tweak it to do whatever you want it to do. I'm working on changing it so it can check free space left in the MsSQL database A. ________________________________ From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Petrucci, Joseph Sent: Wednesday, October 12, 2005 10:36 AM To: Forough, Hamid; nagios-users at lists.sourceforge.net Subject: RE: [Nagios-users] PLease help There is a SQL function called raiseerror it will write an error that you definewithin the SQL server to the errorlog. This is in Sybase but I thionk it is still in MSSQL. you can find information on it in the admin manual. I use this functio inside my stored procedures to put the results of the procedure in the errorlog. I then use a passive check in Nagios to read the errorlog (I wrote this in ActivePerl) and look for certain error codes. The passive plugin passes the results to Nagios every 5 minutes and Nagios sends a critical alert if any of the errors are present in the errorlog for the given period. -----Original Message----- From: Forough, Hamid [mailto:hamid.forough at amd.com] Sent: Wednesday, October 12, 2005 10:18 AM To: Petrucci, Joseph; nagios-users at lists.sourceforge.net Subject: RE: [Nagios-users] PLease help I don't understand. Can you elaborate please? ________________________________ From: Petrucci, Joseph [mailto:Joseph.Petrucci at ddiworld.com] Sent: Tuesday, October 11, 2005 11:58 AM To: Forough, Hamid; nagios-users at lists.sourceforge.net Subject: RE: [Nagios-users] PLease help I do this with Sybase and it should still work this way i MSSQL 2K. I would use a passive check (info on this is in the manual) that reads the errorlog. Then you will have to have your procedure execute a raiserror in the event of failure or any other message you want. Have the plugin look at the error log for specific error numbers that you define in the master.dbo.syserrors table (If there still is one) Joseph A. Petrucci E Systems DBA ------------------------------------------------ Desk: 412-220-2646 Cell: 412-916-2867 Text message (e-mail) 4129162867 at mobile.att.net e-mail joseph.petrucci at ddiworld.com Personal Cell: 724-462-0443 Personal e-mail: japetrucci at hotmail.com ________________________________ From: nagios-users-admin at lists.sourceforge.net on behalf of Forough, Hamid Sent: Tue 10/11/2005 11:45 AM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] PLease help I need to know how to write a custom plugin that checks results of a stored procedure in MSSQL 2k and returns Yes or NO? I am using 2b4 verision of Nagios Thanks, Hamid Advanced Micro Devices W: 512.602.4704 C: 512.971-5345 E: hamid.forough at amd.com -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: image001.gif Type: image/gif Size: 2669 bytes Desc: image001.gif URL: From ladams at cloudmark.com Wed Oct 12 20:07:27 2005 From: ladams at cloudmark.com (Lori Adams) Date: Wed, 12 Oct 2005 11:07:27 -0700 Subject: nrpe install on solaris 5.8 Message-ID: Thank you all for your quick responses. Todd's answer ended up working. First I ran # crle which output: Command line: crle -c /var/ld/ld.config -l /usr/lib:/usr/lib/secure:/usr/local/lib:/usr/local/BerkeleyDB.3.3/lib So I ran that, and added /usr/local/ssl/lib to the end; # crle -c /var/ld/ld.config -l /usr/lib:/usr/lib/secure:/usr/local/lib:/usr/local/BerkeleyDB.3.3/lib:/u sr/local/ssl/lib Recompiled, and it worked wonderfully. Thanks again. -Lori ________________________________ From: Todd Barbera [mailto:todd_barbera at wgbh.org] Sent: Wednesday, October 12, 2005 10:29 AM To: Lori Adams; nagios-users at lists.sourceforge.net Subject: RE: [Nagios-users] nrpe install on solaris 5.8 Hi Lori, Try setting your LD_LIBRARY_PATH. You can use "crle" to do so. I.e., crle -l /usr/lib:/usr/local/lib:/usr/local/ssl/lib:/usr/ccs/lib Todd -----Original Message----- From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Lori Adams Sent: Wednesday, October 12, 2005 1:18 PM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] nrpe install on solaris 5.8 When attempting to run nrpe-2.2 after installing on a solaris 5.8 box, I receive an error: # /srv/nagios_nrpe/bin/nrpe -c /srv/nagios_nrpe/etc/nrpe.cfg -d ld.so.1: /srv/nagios_nrpe/bin/nrpe: fatal: libssl.so.0.9.6: open failed: No such file or directory Killed Here is the information I've gathered about the machine. And some futile attempts at trying to fix the issue. # uname -a SunOS 5.8 Generic_108528-13 sun4u sparc SUNW,UltraAX-i2 Solaris I have nrpe-2.2 downloaded from Andreas Ericsson's site. Once I started receiving the error, I went to find libssl.so.0.9.6: # find / -name libssl.so.0.9.6 /usr/local/ssl/lib/libssl.so.0.9.6 So I am now running the configure script with these arguments. # ./configure --enable-command-args --with-ssl-lib=/usr/local/ssl/lib/ --with-ssl-inc=/usr/local/ssl/include/ The compile runs with no errors. I then do a "make all" # make all cd ./src/; make ; cd .. make[1]: Entering directory `/export/spare/srv/tmp/nrpe-2.2/src' gcc -g -O2 -I/usr/local/ssl/include/openssl -I/usr/local/ssl/include -DHAVE_CONFIG_H -o nrpe nrpe.c utils.c -L/usr/local/ssl/lib/ -lssl -lcrypt o -lnsl -lsocket gcc -g -O2 -I/usr/local/ssl/include/openssl -I/usr/local/ssl/include -DHAVE_CONFIG_H -o check_nrpe check_nrpe.c utils.c check_utils.c -L/usr/loc al/ssl/lib/ -lssl -lcrypto -lnsl -lsocket make[1]: Leaving directory `/export/spare/srv/tmp/nrpe-2.2/src' I don't see any errors there. I then cp the nrpe binary to the dir that I want to run it in along with the nrpe.cfg file that works on other solaris boxen. I then attempt to run it and get the aforementioned error. # /srv/nagios_nrpe/bin/nrpe -c /srv/nagios_nrpe/etc/nrpe.cfg -d ld.so.1: /srv/nagios_nrpe/bin/nrpe: fatal: libssl.so.0.9.6: open failed: No such file or directory Killed I think that I am missing something fundamental, but I can't quite get my finger on it. Does anyone have any thoughts? Thanks. -Lori -------------- next part -------------- An HTML attachment was scrubbed... URL: From rouilj at cs.umb.edu Wed Oct 12 20:18:29 2005 From: rouilj at cs.umb.edu (John P. Rouillard) Date: Wed, 12 Oct 2005 14:18:29 -0400 Subject: Verion 2.0b4 how does cgi's nagios_check_command work? In-Reply-To: Your message of "Fri, 07 Oct 2005 15:33:20 +0200." <43467920.4070508@op5.se> References: <43467920.4070508@op5.se> Message-ID: <200510121818.j9CIITNc019597@mx1.cs.umb.edu> In message <43467920.4070508 at op5.se>, Andreas Ericsson writes: >John P. Rouillard wrote: >> In message <43465AB9.6020304 at op5.se>, >> Andreas Ericsson writes: >>>John P. Rouillard wrote: >>>>The reason I ask is >>>>that nagios was down and the cgi's all happily reported that it was >>>>up. Could this be because the host and service status files were >>>>available since the machine crashed? >>> >>>Yes, that's almost certainly it. There is no really good way of >>>detecting that nagios is actually running unless you're logged in as >>>root. >> Hmm, I am not sure I follow why you need to be logged in as root. >Because otherwise you shouldn't have access to reading process >information about another users process. >> Why not stat the status.log file and check to see if its (mtime) >> timestamp is less than the setting of: >> >> status_update_interval*2 >> >> if aggregate_status_updates is enabled? One could also allow a setting >> "freshness_threshold" in cgi.cfg that is the number of seconds/minutes >> old the status.dat file is allowed to be if aggregate_status_updates >> isn't set. > >Good idea. Write the code for it and submit a patch. Actually not so much a good idea. There is actully a creation datestamp in the status.dat file I was going to use, but I decided to run an experiment first. I have my status_update_interval set to 3 seconds. I used check_fileage to warn me if the file's age was over 3 seconds and ran it in a while loop. It failed often. The longest interval was 139 seconds between updates with a number of periods of 20-30 seconds. My guesses are: nagios only writes the status file when it needs to. nagios was caught up on host checks and didn't write the status file. I have 50 hosts and 100 checks now, but it will be growing a lot. However looking at the status file for a nagos heartbeat isn't reliable without some changes to nagios 2.0. -- rouilj John Rouillard =========================================================================== My employers don't acknowledge my existence much less my opinions. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From al at its-lehmann.de Wed Oct 12 20:23:41 2005 From: al at its-lehmann.de (Arno Lehmann) Date: Wed, 12 Oct 2005 20:23:41 +0200 Subject: Checking Traffic via SNMP In-Reply-To: <434D4C78.3020206@umn.edu> References: <434D4C78.3020206@umn.edu> Message-ID: <434D54AD.8030808@its-lehmann.de> Hello, On 12.10.2005 19:48, Travis J. Noll wrote: > Does anyone know of a plugin I can use to monitor bandwidth utilization > of a remote host via snmp? > > I can use check_snmp to obtain total bytes in either direction, but I'm > more interested in some sort of aggregation of bytes/sec. Perhaps an > average bytes/sec over an N second interval. > > Anything like that? Yes. check_iftraffic.pl. I found it on nagios-exchange.org, I believe. I modified it a bit to better suit my needs, but it's working out of the box. Probably not the most resource-friendly plugin, though - uses perl, some modules, SNMP, and disk space to store data between runs. Arno > I've recently found a check_snmp_if perl script and I'm looking to > extend that to poll ifInOctets and ifOutOctets over an interval and > averaging. Any thoughts? > > Much Thanks, > -Travis > > > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, discussions, > and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. ::: Messages without supporting info will risk > being sent to /dev/null > -- IT-Service Lehmann al at its-lehmann.de Arno Lehmann http://www.its-lehmann.de ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From boinger at tradingtechnologies.com Wed Oct 12 20:26:18 2005 From: boinger at tradingtechnologies.com (jeff vier) Date: Wed, 12 Oct 2005 13:26:18 -0500 Subject: Checking Traffic via SNMP In-Reply-To: <434D4C78.3020206@umn.edu> References: <434D4C78.3020206@umn.edu> Message-ID: <1129141578.4403.43.camel@chi100400> On Wed, 2005-10-12 at 12:48 -0500, Travis J. Noll wrote: > Does anyone know of a plugin I can use to monitor bandwidth utilization > of a remote host via snmp? I collect with MRTG and then wrote a plugin to check the resulting RRD files against the MaxBytes in the mrtg.cfg It works well, and you get the bonus graphs (though I use cacti as the frontend). > I can use check_snmp to obtain total bytes in either direction, but I'm > more interested in some sort of aggregation of bytes/sec. Perhaps an > average bytes/sec over an N second interval. > > Anything like that? > > I've recently found a check_snmp_if perl script and I'm looking to > extend that to poll ifInOctets and ifOutOctets over an interval and > averaging. Any thoughts? -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 189 bytes Desc: This is a digitally signed message part URL: From LUIS.WIEDEMANN at FLHOSP.ORG Wed Oct 12 20:35:52 2005 From: LUIS.WIEDEMANN at FLHOSP.ORG (Wiedemann, Luis) Date: Wed, 12 Oct 2005 14:35:52 -0400 Subject: Track failed Logins? Message-ID: <2F382057E5DDE8479839E5625F94EF0F067D25F3@fh2k127.fhmis.net> Has anyone found a way to track or alert on a certain number of failed logins to a windows FTP server? I'm not even sure it's possible with Nagios. TIA! Luis Wiedemann MIS - Open Systems Florida Hospital -------------- next part -------------- An HTML attachment was scrubbed... URL: From ae at op5.se Wed Oct 12 20:50:26 2005 From: ae at op5.se (Andreas Ericsson) Date: Wed, 12 Oct 2005 20:50:26 +0200 Subject: Verion 2.0b4 how does cgi's nagios_check_command work? In-Reply-To: <200510121818.j9CIITNc019597@mx1.cs.umb.edu> References: <200510121818.j9CIITNc019597@mx1.cs.umb.edu> Message-ID: <434D5AF2.7010504@op5.se> John P. Rouillard wrote: > In message <43467920.4070508 at op5.se>, > Andreas Ericsson writes: > > >>John P. Rouillard wrote: >> >>>In message <43465AB9.6020304 at op5.se>, >>>Andreas Ericsson writes: >>> >>>>John P. Rouillard wrote: > > >>>>>The reason I ask is >>>>>that nagios was down and the cgi's all happily reported that it was >>>>>up. Could this be because the host and service status files were >>>>>available since the machine crashed? >>>> >>>>Yes, that's almost certainly it. There is no really good way of >>>>detecting that nagios is actually running unless you're logged in as >>>>root. >>> >>>Hmm, I am not sure I follow why you need to be logged in as root. >> >>Because otherwise you shouldn't have access to reading process >>information about another users process. >> >>>Why not stat the status.log file and check to see if its (mtime) >>>timestamp is less than the setting of: >>> >>> status_update_interval*2 >>> >>>if aggregate_status_updates is enabled? One could also allow a setting >>>"freshness_threshold" in cgi.cfg that is the number of seconds/minutes >>>old the status.dat file is allowed to be if aggregate_status_updates >>>isn't set. >> >>Good idea. Write the code for it and submit a patch. > > > Actually not so much a good idea. There is actully a creation > datestamp in the status.dat file I was going to use, but I decided to > run an experiment first. I have my status_update_interval set to 3 > seconds. > > I used check_fileage to warn me if the file's age was over 3 seconds > and ran it in a while loop. It failed often. The longest interval was > 139 seconds between updates with a number of periods of 20-30 seconds. > > My guesses are: > > nagios only writes the status file when it needs to. > This is correct. The status_update_interval is never checked, although the status is updated every time a service changes either state or output (or a host, for that matter). > nagios was caught up on host checks and didn't write the status file. > Nopes. It actually (and this is where flat files most obviously fail) writes the status file after each individual hostcheck, so as to make sure the GUI can properly display what attempt it's currently at. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Wed Oct 12 20:54:41 2005 From: ae at op5.se (Andreas Ericsson) Date: Wed, 12 Oct 2005 20:54:41 +0200 Subject: Track failed Logins? In-Reply-To: <2F382057E5DDE8479839E5625F94EF0F067D25F3@fh2k127.fhmis.net> References: <2F382057E5DDE8479839E5625F94EF0F067D25F3@fh2k127.fhmis.net> Message-ID: <434D5BF1.5010004@op5.se> Wiedemann, Luis wrote: > Has anyone found a way to track or alert on a certain number of failed > logins to a windows FTP server? > What server are you talking about? There are about a hundred or so to choose from. > > > I'm not even sure it's possible with Nagios. > It's possible. If the numbers aren't easily accessible through some other means you could write a multiplexing port-forwarding daemon that speaks FTP and have it count the number of unsuccessful logins to be reported whenever some special user logs on or some such. It's just a question of how much time and/or money you want to spend on it. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Tue Oct 11 08:37:45 2005 From: ae at op5.se (Andreas Ericsson) Date: Tue, 11 Oct 2005 08:37:45 +0200 Subject: Setting flapping state via external command In-Reply-To: <200510110216.j9B2GReG003936@mx1.cs.umb.edu> References: <200510110216.j9B2GReG003936@mx1.cs.umb.edu> Message-ID: <434B5DB9.4090304@op5.se> John P. Rouillard wrote: > Andreas Ericsson wrote: > >>John P. Rouillard wrote: >> >>>While I like the concept behind nagios's flap detection, I have to >>>wonder if anybody besides me thought it would be a good idea to be >>>able to use an external command (e.g. SEC the simple event correlator) >>>to set the state of a service or host to flapping and clear it again? >>> >>>This would allow much finer grained control over flapping >>>determination e.g. you could look at the last 22 states or the last 10 >>>states for flap detection rather than at nagios's default of 21. You >>>could also change flap detection depending on time of day, the state >>>of the rest of the network etc. and allow adaptive flap detection. >> >>There is no such mechanism in place today. A NEB-module might be able to >>help you there though and shouldn't be too hard to write so long as you >>keep it fairly small and make sure it's stateless (creating threads in >>modules can wreak some fairly serious havok on nagios due to certain >>global variables being initialized more than once). > > > Hmm, I always thought of a NEB module as a way of getting information > from the nagios core into other applications. E.G. events/status info > into a database, or report the flapping event to an external > program. What I want to do is call: > > set_service_flap() > clear_service_flap() > > and > > set_host_flap() > clear_host_flap() > > from base/flapping.c when an external program identifies a flapping > state. I can see how a NEB module would be useful in feeding the > external program with alerts/events. So am I misunderstanding the > abilities of the NEB module? > Yes. NEB modules are basically like kernel modules. They trigger on events. Support is far from complete yet (anything going into the FIFO that nagios doesn't understand is discarded instead of being passed to modules, modules aren't allowed to try and understand things in the object configuration and so on), but it can do what you want. I suggest using a second named pipe (or udp socket) and then scheduling an event to run every three seconds to poll the fifo/socket for input. It should be about two hours work for a skilled C-programmer. Make sure you use Nagios 2.0b4 to do it though, since the API changed between 2.0b3 and 2.0b4. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From stalinsk at gmail.com Wed Oct 12 22:56:25 2005 From: stalinsk at gmail.com (Stalin) Date: Wed, 12 Oct 2005 13:56:25 -0700 Subject: check_nrpe - Could not complete SSL handshake. Message-ID: Hi All, I'm completely lost on this. I get this error when i try to run commands on remote host via check_nrpe. I've checked FAQ on this error, didn't help much. [nagios at host]$ telnet hostname.com 5666 Trying xx.xxx.xxx.xxx... telnet: Unable to connect to remote host: Connection refused remote server:tail -f /var/log/message Oct 12 15:07:25 remotehost nrpe[7542]: Host xx.xxx.xxx.xxx is not allowed to talk to us! I've included the ip of monitoring server on allowed_ip_list of nrpe.cfg, but still i get this error on the remote server logs file. Is there anything i should try to debug this further. Version : nagios 1.2, plugins 1.4.2, nrpe 2.0 Thanks, Stalin ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios-user at proy.org Wed Oct 12 22:58:14 2005 From: nagios-user at proy.org (Patrick Proy) Date: Wed, 12 Oct 2005 22:58:14 +0200 Subject: Checking Traffic via SNMP In-Reply-To: <434D4C78.3020206@umn.edu> References: <434D4C78.3020206@umn.edu> Message-ID: <20051012205814.AAA5EAB6E4@mail.proy.org> Hi, Try : http://www.manubulon.com/nagios/snmp_int.html Patrick http://sourceforge.net/projects/nagios-snmp/ -----Message d'origine----- De : nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] De la part de Travis J. Noll Envoy? : mercredi 12 octobre 2005 19:49 ? : nagios-users at lists.sourceforge.net Objet : [Nagios-users] Checking Traffic via SNMP Does anyone know of a plugin I can use to monitor bandwidth utilization of a remote host via snmp? I can use check_snmp to obtain total bytes in either direction, but I'm more interested in some sort of aggregation of bytes/sec. Perhaps an average bytes/sec over an N second interval. Anything like that? I've recently found a check_snmp_if perl script and I'm looking to extend that to poll ifInOctets and ifOutOctets over an interval and averaging. Any thoughts? Much Thanks, -Travis ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From misc at viceconsulting.co.nz Wed Oct 12 23:07:41 2005 From: misc at viceconsulting.co.nz (misc at viceconsulting.co.nz) Date: Thu, 13 Oct 2005 10:07:41 +1300 (NZDT) Subject: Max number of services that can be monitored ? Message-ID: <47194.127.0.0.1.1129151261.squirrel@www.goldenfields.co.nz> Hi Hendrick, > I would suggest, that Alex should enable the perf-data, to see if > there is one host or special service that screws up the latency. > We have one Server with double proceccor Xeon 1,0Ghz, 2 Gig of Ram OK, so you're saying there may be one host that is particularly lagged and therefore increasing my average latency? > 1. Enable the perf-data. (Tip: Say Nagios it should write both, host > and service perf data in one file. So you can see how Nagios works > and when the latency screws up) > 2. Perhaps, even if the memory is not eaten up by nagios, think about > to give your server more RAM. > 3. Try to figure out, if your system has sometimes wa-cycles. If > every process on your system is waiting for IO, then the ram is NOT > eaten up and swapping isn't used to, but even no performance. Well I do have perf-data enabled and I am graphing the Nagios server's metrics. Basically latency used to be fine, but I added 55 new hosts then it slowed down. The only change in the graphs is the CPU usage went from ~50% to 90-100%. I'm also graphing CPU IO Wait% and IO Wait is negligible. > Please tell us, what service_check interval you have. If your nagios > checks nearly every minute it may be hard for: a) your monitoring > host, b) the (local) network, c) the (possibly existing) wan network. >From nagios.cfg: interval_length=60 > Are your 60 hosts normaly up and running or do they often falls down? > Everytime when nagios is executing a host check, all other service > checks are beeing "stopped", because a host check gets a higher check > priority then a service check. So if the scheduler has enough to do > for service checks and there are some host checks comming between the > other service checks are executed later then they normaly were > scheduled. The 60 hosts are usually always up.... I may disable host checking though, I don't find its that useful. > Hope that gives some ideas. Yeah... thanks for that info. Gives me a couple of things to think about. -Alex. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From prosolutions at gmx.net Wed Oct 12 23:11:43 2005 From: prosolutions at gmx.net (prosolutions at gmx.net) Date: Wed, 12 Oct 2005 14:11:43 -0700 Subject: parents Message-ID: <20051012211143.GA3170@think.alaya.mine.nu> Reading the docs, trying to make status map look sane, I found mention on the web that the parents directive inside a host stanza can be used to show relationships in the status map. Yet does the parents directive have any other functions besides this? From the docs (http://nagios.sourceforge.net/docs/2_0/xodtemplate.html) the explanation of what parents is/does is vague. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From misc at viceconsulting.co.nz Wed Oct 12 23:21:38 2005 From: misc at viceconsulting.co.nz (misc at viceconsulting.co.nz) Date: Thu, 13 Oct 2005 10:21:38 +1300 (NZDT) Subject: Max number of services that can be monitored ? Message-ID: <47971.127.0.0.1.1129152098.squirrel@www.goldenfields.co.nz> Hi Rob, > What are the standard checks you are doing? There may be alternate > plugins to do the same job that are more efficient. Have you installed > the nagios-plugins pack? Yeah I have installed nagios-plugins on all monitored hosts. They are being invoked via check_nrpe from the Nagios server. The plugins being run via check_nrpe on the hosts are (C binaries unless otherwise specified): check_procs (to see if a specific process is running) check_load shell script wrapped around sar for CPU % usage (system/user/nice/iowait) check_disk check_memory shell_script to determine number of file descriptors in use shell_script wrapped around free -m check_procs --metric=VSZ check_procs (make sure there are not excessive # of procs running) check_swap check_procs (check for zombies) check_users Network checks run from the Nagios server against the host: check_ssh check_ntp (perl) check_tcp (see if nrpe is responding on host) > Lets see.. Every 5 minutes you check 60 hosts, so thats 60 times you > invoke the perl interpereter.. even if it's cached, its still lots of > extra processing that could be done more elegantly if you compiled in > the perl interpereter with the nagios daemon. I agree it could be done more elegantly, I will try disabling NTP check to see if that makes a difference. > >With embedded Perl enabled, I was getting memory leaks which was > >eventually crashing the Nagios server so I disabled it. > > > > > Which version of perl, and what platform? x86? sparc? > > Older versions of perl (pre 5.6) had some problems being compiled in to > apps such as mod_perl and stuff, but 5.6.1 or above "should" be fine. > > Memory leaks in Solaris wouldn't surprise me, but any of the Linuxes or > BSD's would.. I'm runing Solaris 8 Sparc with Perl 5.6.1 builtin > without any known leaks.. perl 5.8.0 on redhat es 3 update 5 x86... My memory was decreasing over time crashing the server, at my end, disabling embedded perl seemed to fix the prob... -Alex. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From yonienjcn at gmail.com Wed Oct 12 18:52:21 2005 From: yonienjcn at gmail.com (Tao Yaoning) Date: Wed, 12 Oct 2005 12:52:21 -0400 Subject: how does nagios call service Message-ID: <8ee7548c0510120952h67d4583aw26e8c38d79b2c866@mail.gmail.com> Hi, all I use nagios 1.x to monitor my network, it works greatly. Now I want to upgrade nagios to 2.0b4. So I just set up a new nagios server with nagios 2.0b4 package, and copy all configuration files from old nagios server to new server and edit all configuration files to make them have correct syntax for nagios 2. Every thing is fine except one problem. I use check_snmp_apcups plugin to monitor my ups. It works very fine in nagios 1.x, but in nagios 2.0b4, it always tell me an error "**ePN failed to compile /usr/lib/nagios/plugins/check_snmp_apcups: "Global symbol "$script" requires explicit package name at (eval 5) line 23,Global symbol "$script_version" requires explicit package name at (eval 5) line 31." I checked the file, it's OK, it looks like this $script = "check_snmp_apcups"; $script_version = "2.1.0"; and these global symbol are useless in the check_snmp_apcups. The script only call them when display the help messages. I configure my UPS_services like /usr/lib/nagios/plugins/check_snmp_apcups -H 192.168.0.10 I can run this command manually and get the correct reslut. but nagios server couldn't. nagios.apache is the owner of the check_snmp_apcups So, anybody can give me some idea to resolve this problem? Thanks -------------- next part -------------- An HTML attachment was scrubbed... URL: From teng at dataway.com Wed Oct 12 23:51:19 2005 From: teng at dataway.com (Tedman Eng) Date: Wed, 12 Oct 2005 14:51:19 -0700 Subject: parents Message-ID: <37ED92F9890FAF4BB947613C66FF8B1A08BB2BE1@dw-mail.dataway.com> The description of the "parents" directive from the page you cited includes this text: Read the "Determining Status and Reachability of Network Hosts" document located here for more information. here = http://nagios.sourceforge.net/docs/2_0/networkreachability.html > -----Original Message----- > From: prosolutions at gmx.net [mailto:prosolutions at gmx.net] > Sent: Wednesday, October 12, 2005 2:12 PM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] parents > > > > Reading the docs, trying to make status map look sane, I found mention > on the web that the parents directive inside a host stanza can be used > to show relationships in the status map. Yet does the > parents directive > have any other functions besides this? From the docs > (http://nagios.sourceforge.net/docs/2_0/xodtemplate.html) the > explanation of what parents is/does is vague. > > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, > discussions, > and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS > when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ladams at cloudmark.com Thu Oct 13 02:38:02 2005 From: ladams at cloudmark.com (Lori Adams) Date: Wed, 12 Oct 2005 17:38:02 -0700 Subject: check_nrpe - Could not complete SSL handshake. Message-ID: It looks like you can't connect to the remote host via 5666. Your telnet failed. Make sure that port is open. -Lori > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Stalin > Sent: Wednesday, October 12, 2005 1:56 PM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] check_nrpe - Could not complete SSL handshake. > > Hi All, > > I'm completely lost on this. I get this error when i try to run > commands on remote host via check_nrpe. I've checked FAQ on this > error, didn't help much. > > [nagios at host]$ telnet hostname.com 5666 > Trying xx.xxx.xxx.xxx... > telnet: Unable to connect to remote host: Connection refused > > remote server:tail -f /var/log/message > Oct 12 15:07:25 remotehost nrpe[7542]: Host xx.xxx.xxx.xxx is not > allowed to talk to us! > > I've included the ip of monitoring server on allowed_ip_list of > nrpe.cfg, but still i get this error on the remote server logs file. > > Is there anything i should try to debug this further. > > Version : nagios 1.2, plugins 1.4.2, nrpe 2.0 > > Thanks, > Stalin > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, discussions, > and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From bench at silentmedia.com Thu Oct 13 02:45:41 2005 From: bench at silentmedia.com (Ben) Date: Wed, 12 Oct 2005 17:45:41 -0700 Subject: How to configure nagios2.0b4 with nagios-db..... In-Reply-To: <1128008467.3265.16.camel@localhost.localdomain> References: <1128008467.3265.16.camel@localhost.localdomain> Message-ID: <210B8C74-F5BD-4308-A2E3-0297D866E293@silentmedia.com> There is a small patch you have to apply to nagios-db. I've attached it below. I am lame and haven't put my priorities into applying it myself. -------------- next part -------------- A non-text attachment was scrubbed... Name: inserter.c-fix-nebregistration.diff Type: application/octet-stream Size: 1744 bytes Desc: not available URL: -------------- next part -------------- On Sep 29, 2005, at 8:41 AM, Prashant Jawale wrote: > Hello to all board, > > I want to configure nagios with postgresql such that nagios > stores it's > check results and runtime information in postgresql. > I am using "nagios2.0b4" and additional module "nagios-db". > I really can't figure out how to use nagios-db to get work > done. Please > does anyone knows how to use nagios-db with nagios. May be any > tutorial, > any site, any help.... > > Thanks in advance for all help, it's really urgent... > > -Prash > > > > > -- > This message has been scanned for viruses and > dangerous content by MailScanner, and is > believed to be clean. > > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, > discussions, > and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > From gwaugh at scu.edu.au Thu Oct 13 03:15:55 2005 From: gwaugh at scu.edu.au (Guy Waugh) Date: Thu, 13 Oct 2005 11:15:55 +1000 Subject: check_nrpe - Could not complete SSL handshake. In-Reply-To: References: Message-ID: <434DB54B.7020401@scu.edu.au> Hi there, A couple of thoughts... Have you put the NRPE service into /etc/services, like so: [root at remote-server /]# grep nrpe /etc/services nrpe 5666/tcp # NRPE for Nagios Also, do you allow connections on the NRPE port to the remote server from the Nagios server in /etc/hosts.allow, like so: [root at remote-server /]# grep nrpe /etc/hosts.allow nrpe: 10.0.0.1 (where 10.0.0.1 is the Nagios host) I run NRPE out of xinetd - this is a facsimile of my /etc/xinetd.d/nrpe file: service nrpe { flags = REUSE socket_type = stream wait = no user = nagios log_on_success += USERID log_on_failure += USERID server = /usr/local/sbin/nrpe server_args = -c /etc/nrpe.cfg --inetd disable = no only_from = 10.0.0.1 } HTH, Guy. Stalin wrote: > Hi All, > > I'm completely lost on this. I get this error when i try to run > commands on remote host via check_nrpe. I've checked FAQ on this > error, didn't help much. > > [nagios at host]$ telnet hostname.com 5666 > Trying xx.xxx.xxx.xxx... > telnet: Unable to connect to remote host: Connection refused > > remote server:tail -f /var/log/message > Oct 12 15:07:25 remotehost nrpe[7542]: Host xx.xxx.xxx.xxx is not > allowed to talk to us! > > I've included the ip of monitoring server on allowed_ip_list of > nrpe.cfg, but still i get this error on the remote server logs file. > > Is there anything i should try to debug this further. > > Version : nagios 1.2, plugins 1.4.2, nrpe 2.0 > > Thanks, > Stalin > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, discussions, > and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From elizar.palad at gmail.com Wed Oct 12 02:04:49 2005 From: elizar.palad at gmail.com (Elizar M. Palad) Date: Wed, 12 Oct 2005 08:04:49 +0800 Subject: Check diskspace on remote machines In-Reply-To: References: Message-ID: Hi Stalin! Really depends on what kind of host you're trying to check.. in my case, for solaris i used the plugin that uses the rsh to connect to the remote machnine. (check_disk_remote) for other unices.. use check_nrpe server side, standard nagios plugin for remote hosts.. and check_local_disk on the remote side.. read the details. for windows, i used the nsclient, check_nt. there's a check for diskspace option there, read the details. If you learn this your way.. you wont forget it! Goodluck! eli On 10/12/05, Stalin wrote: > Hi, > > Just got nagios (1.2) installed and configured with 1.4.2 plugins. I'd > few hosts/service checks configured. All works fine. Now i'd like to > do the same for remote machines and wondering if anyone could share > the plugin to monitor diskspace on remote machines or any pointers is > greatly appreciated. > > Thanks, > Stalin > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, discussions, > and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- ---- Don't Tell Me How Hard You Work.. Show Me How Much You'd Accomplished.. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Thu Oct 13 08:38:15 2005 From: ae at op5.se (Andreas Ericsson) Date: Thu, 13 Oct 2005 08:38:15 +0200 Subject: Max number of services that can be monitored ? In-Reply-To: <47194.127.0.0.1.1129151261.squirrel@www.goldenfields.co.nz> References: <47194.127.0.0.1.1129151261.squirrel@www.goldenfields.co.nz> Message-ID: <434E00D7.4090202@op5.se> misc at viceconsulting.co.nz wrote: > Hi Hendrick, > > >>I would suggest, that Alex should enable the perf-data, to see if >>there is one host or special service that screws up the latency. >>We have one Server with double proceccor Xeon 1,0Ghz, 2 Gig of Ram > > > OK, so you're saying there may be one host that is particularly lagged and > therefore increasing my average latency? > > >>1. Enable the perf-data. (Tip: Say Nagios it should write both, host >>and service perf data in one file. So you can see how Nagios works >>and when the latency screws up) >>2. Perhaps, even if the memory is not eaten up by nagios, think about >>to give your server more RAM. >>3. Try to figure out, if your system has sometimes wa-cycles. If >>every process on your system is waiting for IO, then the ram is NOT >>eaten up and swapping isn't used to, but even no performance. > > > Well I do have perf-data enabled and I am graphing the Nagios server's > metrics. Basically latency used to be fine, but I added 55 new hosts then > it slowed down. The only change in the graphs is the CPU usage went from > ~50% to 90-100%. I'm also graphing CPU IO Wait% and IO Wait is > negligible. > > >>Please tell us, what service_check interval you have. If your nagios >>checks nearly every minute it may be hard for: a) your monitoring >>host, b) the (local) network, c) the (possibly existing) wan network. > > >>From nagios.cfg: > > interval_length=60 > > > >>Are your 60 hosts normaly up and running or do they often falls down? >>Everytime when nagios is executing a host check, all other service >>checks are beeing "stopped", because a host check gets a higher check >>priority then a service check. So if the scheduler has enough to do >>for service checks and there are some host checks comming between the >>other service checks are executed later then they normaly were >>scheduled. > > > The 60 hosts are usually always up.... I may disable host checking though, > I don't find its that useful. > The docs covering scheduled hostchecks tells you explicitly that it is a very bad idea indeed since no other checks are run when a host check is. > >>Hope that gives some ideas. > > > Yeah... thanks for that info. Gives me a couple of things to think about. > > -Alex. > > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, discussions, > and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Thu Oct 13 09:19:44 2005 From: ae at op5.se (Andreas Ericsson) Date: Thu, 13 Oct 2005 09:19:44 +0200 Subject: check_nrpe - Could not complete SSL handshake. In-Reply-To: References: Message-ID: <434E0A90.7020401@op5.se> Stalin wrote: > Hi All, > > I'm completely lost on this. I get this error when i try to run > commands on remote host via check_nrpe. I've checked FAQ on this > error, didn't help much. > > [nagios at host]$ telnet hostname.com 5666 > Trying xx.xxx.xxx.xxx... > telnet: Unable to connect to remote host: Connection refused > > remote server:tail -f /var/log/message > Oct 12 15:07:25 remotehost nrpe[7542]: Host xx.xxx.xxx.xxx is not > allowed to talk to us! > This is weird. Since the connection was refused, indicating that NRPE isn't listening on the port, you really shouldn't have gotten anything at all in the logs. Since you did anyway .... > I've included the ip of monitoring server on allowed_ip_list of > nrpe.cfg, but still i get this error on the remote server logs file. > ... did you restart nrpe after adding the ip to the config? -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mbano at epo.org Thu Oct 13 09:33:36 2005 From: mbano at epo.org (Marco Bano) Date: Thu, 13 Oct 2005 09:33:36 +0200 (CEST) Subject: early return from script and wrong return-code Message-ID: <20051013073336.C0DD74F415C@desire.netways.de> Hi list Hi list Nagios 2.0b4 and 1.2: Checking via nrpe "channel" to a remote host a service (and locally on the same host where nagios run) The remote check script has amf test inside. (tomcat) locally form shell if I run the script no problem. But "inside" nagios the script sound as the return code arrive earlier and wrong. Infact the script takes 5-10 seconds to excute....giving before the finish just a bit of output... but not timeout (try to extend it) The same via check_nrpe run from shell. Miracle via check_by_ssh it works well. Sounds that by_ssh plugin wait that the script finish and gives the real return-code. any hint, any experience on similar problem? thanks in advance to all of you. - Marco Bano (mbano) ----------------------- The mailing list archive is found here: http://www.nagiosexchange.org/nagios-users.34.0.html ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From thorsten.wacker at saarstahl.com Thu Oct 13 09:58:10 2005 From: thorsten.wacker at saarstahl.com (WACKER, Thorsten) Date: Thu, 13 Oct 2005 09:58:10 +0200 Subject: possible bug - Nagios 2.0b4 dont reload config after SIGHUP Message-ID: Hello, I have noticed when I do either a sighup, or /etc/init.d/nagios reload, to reload the configuration, nagios sometimes doesn't reload the configuration-files Version: Nagios 2.0b4 Gru? Thorsten Wacker Saarstahl AG Germany ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ton.voon at altinity.com Thu Oct 13 11:51:48 2005 From: ton.voon at altinity.com (Ton Voon) Date: Thu, 13 Oct 2005 10:51:48 +0100 Subject: nrpe install on solaris 5.8 In-Reply-To: References: Message-ID: <7AF32364-89B7-4F3A-AF03-4B885CF2CB27@altinity.com> On 12 Oct 2005, at 18:18, Lori Adams wrote: > When attempting to run nrpe-2.2 after installing on a solaris 5.8 > box, I receive an error: > > # /srv/nagios_nrpe/bin/nrpe -c /srv/nagios_nrpe/etc/nrpe.cfg -d > > ld.so.1: /srv/nagios_nrpe/bin/nrpe: fatal: libssl.so.0.9.6: open > failed: No such file or directory Looks like you've found one answer to this already. Another way is to fix at compile time: set LD_RUN_PATH to include /usr/local/ssl/lib, then run configure and make. Ton http://www.altinity.com T: +44 (0)870 787 9243 F: +44 (0)845 280 1725 Skype: tonvoon -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris at aidworld.org Thu Oct 13 12:10:34 2005 From: chris at aidworld.org (Chris Wilson) Date: Thu, 13 Oct 2005 11:10:34 +0100 Subject: Possible bug in NSCA In-Reply-To: <1128682424.28692.57.camel@localhost> References: <1127432185.32051.12.camel@localhost> <4333A9F5.8050506@op5.se> <1128682424.28692.57.camel@localhost> Message-ID: <1129198234.4130.20.camel@localhost> Hi all, On Fri, 2005-10-07 at 11:53, Chris Wilson wrote: > I would like to propose the attached patch to NSCA. It treats ECHILD > errors the same as EWOULDBLOCK and EINTR, i.e. temporary errors. [...] > Comments most welcome, especially anything that might prevent this from > being accepted into the NSCA source. I have not heard any comments at all about this patch. I would really appreciate it if those in the know would have a look at it. There definitely seems to be a bug in NSCA - this issue is now happening to me once a week. Cheers, Chris. -- (aidworld) chris wilson | chief engineer (chris at aidworld.org) ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ton.voon at altinity.com Thu Oct 13 12:25:54 2005 From: ton.voon at altinity.com (Ton Voon) Date: Thu, 13 Oct 2005 11:25:54 +0100 Subject: Possible bug in NSCA In-Reply-To: <1129198234.4130.20.camel@localhost> References: <1127432185.32051.12.camel@localhost> <4333A9F5.8050506@op5.se> <1128682424.28692.57.camel@localhost> <1129198234.4130.20.camel@localhost> Message-ID: <3991819A-094C-476F-A7CC-FE6F415FE215@altinity.com> On 13 Oct 2005, at 11:10, Chris Wilson wrote: > Hi all, > > On Fri, 2005-10-07 at 11:53, Chris Wilson wrote: > >> I would like to propose the attached patch to NSCA. It treats ECHILD >> errors the same as EWOULDBLOCK and EINTR, i.e. temporary errors. >> > [...] > >> Comments most welcome, especially anything that might prevent this >> from >> being accepted into the NSCA source. >> > > I have not heard any comments at all about this patch. I would really > appreciate it if those in the know would have a look at it. > > There definitely seems to be a bug in NSCA - this issue is now > happening > to me once a week. > Chris, Which platform are you running on? We're recently fixed a problem on the plugins re: ECHLD status codes for spopen calls on Redhat ES4. Ton http://www.altinity.com T: +44 (0)870 787 9243 F: +44 (0)845 280 1725 Skype: tonvoon -------------- next part -------------- An HTML attachment was scrubbed... URL: From chris at aidworld.org Thu Oct 13 12:31:19 2005 From: chris at aidworld.org (Chris Wilson) Date: Thu, 13 Oct 2005 11:31:19 +0100 Subject: Possible bug in NSCA In-Reply-To: <3991819A-094C-476F-A7CC-FE6F415FE215@altinity.com> References: <1127432185.32051.12.camel@localhost> <4333A9F5.8050506@op5.se> <1128682424.28692.57.camel@localhost> <1129198234.4130.20.camel@localhost> <3991819A-094C-476F-A7CC-FE6F415FE215@altinity.com> Message-ID: <1129199478.4130.57.camel@localhost> Hi Ton, On Thu, 2005-10-13 at 11:25, Ton Voon wrote: > Which platform are you running on? We're recently fixed a problem on > the plugins re: ECHLD status codes for spopen calls on Redhat ES4. Running on Fedora Core 2. I don't know what spopen() does, but this is an error that occurs in accept() (where ECHILD doesn't make sense to me, and is apparently undocumented). Cheers, Chris. -- (aidworld) chris wilson | chief engineer (chris at aidworld.org) ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ton.voon at altinity.com Thu Oct 13 13:06:36 2005 From: ton.voon at altinity.com (Ton Voon) Date: Thu, 13 Oct 2005 12:06:36 +0100 Subject: Possible bug in NSCA In-Reply-To: <1129199478.4130.57.camel@localhost> References: <1127432185.32051.12.camel@localhost> <4333A9F5.8050506@op5.se> <1128682424.28692.57.camel@localhost> <1129198234.4130.20.camel@localhost> <3991819A-094C-476F-A7CC-FE6F415FE215@altinity.com> <1129199478.4130.57.camel@localhost> Message-ID: <4A31453D-BDF0-4ECC-9264-A677227B02F6@altinity.com> On 13 Oct 2005, at 11:31, Chris Wilson wrote: > On Thu, 2005-10-13 at 11:25, Ton Voon wrote: > > >> Which platform are you running on? We're recently fixed a problem on >> the plugins re: ECHLD status codes for spopen calls on Redhat ES4. >> > > Running on Fedora Core 2. I don't know what spopen() does, but this is > an error that occurs in accept() (where ECHILD doesn't make sense > to me, > and is apparently undocumented). > As Fedora is a derivative of Red Hat ES, it could be related (or I could be leading you down a tangential path). The thread was called "check_dns: DNS WARNING - nslookup returned error status" in the nagiosplug-devel mailing list, but here's a summary: Problem was that using a popen call, nslookup and dig were occasionally returning ECHLD statuses on a wait() call and thus check_dns and check_dig were incorrectly returning warning statuses. Couldn't find any other commands causing the problem. We created a test case where if you ran the nslookup command repeatedly, it would fail about 60% of the time (for 10 invocations). You'll find the test in the 1.4.2 release of the plugins. Sasche Runschke has raised this with Red Hat. It has been acknowledged, but I haven't had an update recently. The feeling is that the futex handling in the kernel is responsible, but I don't know if it could be related to your issue. Ton http://www.altinity.com T: +44 (0)870 787 9243 F: +44 (0)845 280 1725 Skype: tonvoon -------------- next part -------------- An HTML attachment was scrubbed... URL: From robmossrm at aol.com Thu Oct 13 14:04:30 2005 From: robmossrm at aol.com (Rob Moss) Date: Thu, 13 Oct 2005 13:04:30 +0100 Subject: how does nagios call service In-Reply-To: <8ee7548c0510120952h67d4583aw26e8c38d79b2c866@mail.gmail.com> References: <8ee7548c0510120952h67d4583aw26e8c38d79b2c866@mail.gmail.com> Message-ID: <434E4D4E.4010305@aol.com> Tao Yaoning wrote: > Hi, all > > I use nagios 1.x to monitor my network, it works greatly. Now I want > to upgrade nagios to 2.0b4. So I just set up a new nagios server with > nagios 2.0b4 package, and copy all configuration files from old nagios > server to new server and edit all configuration files to make them > have correct syntax for nagios 2. > > Every thing is fine except one problem. > > I use check_snmp_apcups plugin to monitor my ups. It works very fine > in nagios 1.x, but in nagios 2.0b4, it always tell me an error "**ePN > failed to compile /usr/lib/nagios/plugins/check_snmp_apcups: "Global > symbol "$script" requires explicit package name at (eval 5) line > 23,Global symbol "$script_version" requires explicit package name at > (eval 5) line 31." I checked the file, it's OK, it looks like this > $script = "check_snmp_apcups"; > $script_version = "2.1.0"; > and these global symbol are useless in the check_snmp_apcups. The > script only call them when display the help messages. Is this your own Perl script? Does this perl script use warnings, and load the 'strict' module? ie: #!/usr/bin/perl -w use strict; ..rest of script.. I suspect that you have not enabled Strict mode in your script, and have not given the -w (warning) flag to perl.. What if you run 'perl -w -c ' does it warn you about the same errors? You should be defining your variables with 'my' or 'our' functions at the beginning of your script. The embedded perl interpereter in Nagios probably is using the good coding practices of using warnings and strict code checking, which is why you are seeing these errors. Review your script, change the lines $script = "check_snmp_apcups"; $script_version = "2.1.0"; to my $script = "check_snmp_apcups"; my $script_version = "2.1.0"; And all subsequent variables, and you shouldn't receive any warnings. The warnings are telling you that you aren't defining your variables. Define the variables and there will be no warnings. Cheers rob -------------- next part -------------- An HTML attachment was scrubbed... URL: From yonienjcn at gmail.com Thu Oct 13 14:11:23 2005 From: yonienjcn at gmail.com (Tao Yaoning) Date: Thu, 13 Oct 2005 08:11:23 -0400 Subject: how does nagios call service In-Reply-To: <434E4D4E.4010305@aol.com> References: <8ee7548c0510120952h67d4583aw26e8c38d79b2c866@mail.gmail.com> <434E4D4E.4010305@aol.com> Message-ID: <8ee7548c0510130511o60878159g93a554a0301b12a2@mail.gmail.com> It works now, Thanks > Is this your own Perl script? > Does this perl script use warnings, and load the 'strict' module? > ie: > > #!/usr/bin/perl *-w* > *use strict;* > ..rest of script.. > > I suspect that you have not enabled Strict mode in your script, and have > not given the -w (warning) flag to perl.. > > What if you run 'perl -w -c ' does it warn you about the same > errors? > > You should be defining your variables with 'my' or 'our' functions at the > beginning of your script. > > The embedded perl interpereter in Nagios probably is using the good coding > practices of using warnings and strict code checking, which is why you are > seeing these errors. > > Review your script, change the lines > > $script = "check_snmp_apcups"; > $script_version = "2.1.0"; > > to > > *my *$script = "check_snmp_apcups"; > *my *$script_version = "2.1.0"; > > And all subsequent variables, and you shouldn't receive any warnings. The > warnings are telling you that you aren't defining your variables. Define the > variables and there will be no warnings. > > Cheers > rob > > -------------- next part -------------- An HTML attachment was scrubbed... URL: From btmanmeh at verizon.net Thu Oct 13 15:39:54 2005 From: btmanmeh at verizon.net (Max) Date: Thu, 13 Oct 2005 09:39:54 -0400 Subject: Check diskspace on remote machines In-Reply-To: <37ED92F9890FAF4BB947613C66FF8B1A08BB2BDF@dw-mail.dataway.com> References: <37ED92F9890FAF4BB947613C66FF8B1A08BB2BDF@dw-mail.dataway.com> Message-ID: <434E63AA.6090400@verizon.net> Tedman Eng wrote: > FAQ # F0059: How do I monitor services on remote hosts? > > http://www.nagios.org/faqs/viewfaq.php?faq_id=59&expand=false&showdesc=false > > > As for pointers: > 1. SNMP is easiest if you have it installed Is it possible using check_snmp, to have the command query the remote machine, return the list of disks, and then automatically have those checked for disk space? Meaning, say I have / /home /var /opt that I would like to monitor. As it stands now, I have a command defined below: define command{ command_name snmp_disk command_line $USER1$/check_snmp -H $HOSTADDRESS$ -C public -o UCD-SNMP-MIB::dskAvail.$ARG1$,UCD-SNMP-MIB::dskPercent.$ARG1$ -w 90:,:90 -c 95:,:95 -u 'kB free (','% used)' -l 'Disk space' -o UCD-SNMP-MIB::dskPath.$ARG1$ This works great and returns "Disk space OK - 2868624 kB free ( 27 % used) /", which is reading the snmpd.conf file where "/" is my first disk to be listed as sysroot. Basically, I want to be able to add a host, tell Nagios to check the host's disks automatically by polling the remote machine, and then return it's information back with all of the disks and space used. The reason for this is because I have a ton of machines with a not so standard disk and partition layout. One machine might have it's usual /home /opt /var, one might have things being mounted and used on different things. Max ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Thu Oct 13 16:19:58 2005 From: ae at op5.se (Andreas Ericsson) Date: Thu, 13 Oct 2005 16:19:58 +0200 Subject: Check diskspace on remote machines In-Reply-To: <434E63AA.6090400@verizon.net> References: <37ED92F9890FAF4BB947613C66FF8B1A08BB2BDF@dw-mail.dataway.com> <434E63AA.6090400@verizon.net> Message-ID: <434E6D0E.8070908@op5.se> Max wrote: > Tedman Eng wrote: > >> FAQ # F0059: How do I monitor services on remote hosts? >> >> http://www.nagios.org/faqs/viewfaq.php?faq_id=59&expand=false&showdesc=false >> >> >> >> As for pointers: >> 1. SNMP is easiest if you have it installed > > > Is it possible using check_snmp, to have the command query the remote > machine, return the list of disks, and then automatically have those > checked for disk space? > Nopes, but you should be able to write either a fairly simple wrapper script to do it automagically on each check, or an autoconfiguration tool that does it for you. sed is your friend (well, it's my friend, but you can borrow it if you like. ;) ) > Meaning, say I have / /home /var /opt that I would like to monitor. As > it stands now, I have a command defined below: > > define command{ > command_name snmp_disk > command_line $USER1$/check_snmp -H $HOSTADDRESS$ -C public -o > UCD-SNMP-MIB::dskAvail.$ARG1$,UCD-SNMP-MIB::dskPercent.$ARG1$ -w 90:,:90 > -c 95:,:95 -u 'kB free (','% used)' -l 'Disk space' -o > UCD-SNMP-MIB::dskPath.$ARG1$ > > This works great and returns "Disk space OK - 2868624 kB free ( 27 % > used) /", which is reading the snmpd.conf file where "/" is my first > disk to be listed as sysroot. > > Basically, I want to be able to add a host, tell Nagios to check the > host's disks automatically by polling the remote machine, and then > return it's information back with all of the disks and space used. The > reason for this is because I have a ton of machines with a not so > standard disk and partition layout. One machine might have it's usual > /home /opt /var, one might have things being mounted and used on > different things. > > Max > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, discussions, > and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. ::: Messages without supporting info will risk > being sent to /dev/null > -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From btmanmeh at verizon.net Thu Oct 13 16:27:56 2005 From: btmanmeh at verizon.net (Max) Date: Thu, 13 Oct 2005 10:27:56 -0400 Subject: Check diskspace on remote machines In-Reply-To: <434E6D0E.8070908@op5.se> References: <37ED92F9890FAF4BB947613C66FF8B1A08BB2BDF@dw-mail.dataway.com> <434E63AA.6090400@verizon.net> <434E6D0E.8070908@op5.se> Message-ID: <434E6EEC.60909@verizon.net> Andreas Ericsson wrote: > Nopes, but you should be able to write either a fairly simple wrapper > script to do it automagically on each check, or an autoconfiguration > tool that does it for you. sed is your friend (well, it's my friend, but > you can borrow it if you like. ;) ) Yeah, that was going to be the next step, but I figured I'd ask in case there was a plugin already to do such a thing. There are so many plugins available that I don't even know about, I figured it was worth a shot in checking...no pun intended. Thanks! Max ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rouilj at cs.umb.edu Thu Oct 13 16:40:27 2005 From: rouilj at cs.umb.edu (John P. Rouillard) Date: Thu, 13 Oct 2005 10:40:27 -0400 Subject: Verion 2.0b4 how does cgi's nagios_check_command work? In-Reply-To: Your message of "Wed, 12 Oct 2005 20:50:26 +0200." <434D5AF2.7010504@op5.se> References: <434D5AF2.7010504@op5.se> Message-ID: <200510131440.j9DEeRZr011398@mx1.cs.umb.edu> In message <434D5AF2.7010504 at op5.se>, Andreas Ericsson writes: >John P. Rouillard wrote: >> In message <43467920.4070508 at op5.se>, >> Andreas Ericsson writes: >>>John P. Rouillard wrote: >>> >>>>In message <43465AB9.6020304 at op5.se>, >>>>Andreas Ericsson writes: >>>> >>>>>John P. Rouillard wrote: >> >> >>>>>>The reason I ask is >>>>>>that nagios was down and the cgi's all happily reported that it was >>>>>>up. Could this be because the host and service status files were >>>>>>available since the machine crashed? >>>>> >>>>>Yes, that's almost certainly it. There is no really good way of >>>>>detecting that nagios is actually running unless you're logged in as >>>>>root. >>>> >>>>Hmm, I am not sure I follow why you need to be logged in as root. >>> >>>Because otherwise you shouldn't have access to reading process >>>information about another users process. >>> >>>>Why not stat the status.log file and check to see if its (mtime) >>>>timestamp is less than the setting of: >>>> >>>> status_update_interval*2 >>>> >>>>if aggregate_status_updates is enabled? One could also allow a setting >>>>"freshness_threshold" in cgi.cfg that is the number of seconds/minutes >>>>old the status.dat file is allowed to be if aggregate_status_updates >>>>isn't set. >>> >>>Good idea. Write the code for it and submit a patch. >> >> Actually not so much a good idea. There is actully a creation >> datestamp in the status.dat file I was going to use, but I decided to >> run an experiment first. I have my status_update_interval set to 3 >> seconds. >> >> I used check_fileage to warn me if the file's age was over 3 seconds >> and ran it in a while loop. It failed often. The longest interval was >> 139 seconds between updates with a number of periods of 20-30 seconds. >> >> My guesses are: nagios only writes the status file when it needs to. > >This is correct. The status_update_interval is never checked, although >the status is updated every time a service changes either state or >output (or a host, for that matter). Ideally nagios would provide a next_check_time in the status.dat, but I wonder if that could be usefully intuited from: min( min(next_check time on services) + service_check_timeout), min(next_check time on hosts) + host_check_timeout) ) Possible problems: on demand host checks (if part of a network is down) could screw up the timing since everything else stops. Just because a service check is scheduled doesn't mean that it is going to run (time period may be wrong etc), but if its determined to be non-runnable the escheduled time for it should cause a re-write of the status.dat file correct? There has to be an easier way of determining if nagios is running doesn't there? -- rouilj John Rouillard =========================================================================== My employers don't acknowledge my existence much less my opinions. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Thu Oct 13 16:54:36 2005 From: ae at op5.se (Andreas Ericsson) Date: Thu, 13 Oct 2005 16:54:36 +0200 Subject: Verion 2.0b4 how does cgi's nagios_check_command work? In-Reply-To: <200510131440.j9DEeRZr011398@mx1.cs.umb.edu> References: <200510131440.j9DEeRZr011398@mx1.cs.umb.edu> Message-ID: <434E752C.3050301@op5.se> John P. Rouillard wrote: > In message <434D5AF2.7010504 at op5.se>, > Andreas Ericsson writes: > > >>John P. Rouillard wrote: >> >>>In message <43467920.4070508 at op5.se>, >>>Andreas Ericsson writes: >>> >>>>John P. Rouillard wrote: >>>> >>>> >>>>>In message <43465AB9.6020304 at op5.se>, >>>>>Andreas Ericsson writes: >>>>> >>>>> >>>>>>John P. Rouillard wrote: >>> >>> >>>>>>>The reason I ask is >>>>>>>that nagios was down and the cgi's all happily reported that it was >>>>>>>up. Could this be because the host and service status files were >>>>>>>available since the machine crashed? >>>>>> >>>>>>Yes, that's almost certainly it. There is no really good way of >>>>>>detecting that nagios is actually running unless you're logged in as >>>>>>root. >>>>> >>>>>Hmm, I am not sure I follow why you need to be logged in as root. >>>> >>>>Because otherwise you shouldn't have access to reading process >>>>information about another users process. >>>> >>>> >>>>>Why not stat the status.log file and check to see if its (mtime) >>>>>timestamp is less than the setting of: >>>>> >>>>> status_update_interval*2 >>>>> >>>>>if aggregate_status_updates is enabled? One could also allow a setting >>>>>"freshness_threshold" in cgi.cfg that is the number of seconds/minutes >>>>>old the status.dat file is allowed to be if aggregate_status_updates >>>>>isn't set. >>>> >>>>Good idea. Write the code for it and submit a patch. >>> >>>Actually not so much a good idea. There is actully a creation >>>datestamp in the status.dat file I was going to use, but I decided to >>>run an experiment first. I have my status_update_interval set to 3 >>>seconds. >>> >>>I used check_fileage to warn me if the file's age was over 3 seconds >>>and ran it in a while loop. It failed often. The longest interval was >>>139 seconds between updates with a number of periods of 20-30 seconds. >>> >>>My guesses are: nagios only writes the status file when it needs to. >> >>This is correct. The status_update_interval is never checked, although >>the status is updated every time a service changes either state or >>output (or a host, for that matter). > > > Ideally nagios would provide a next_check_time in the status.dat, but > I wonder if that could be usefully intuited from: > > min( > min(next_check time on services) + service_check_timeout), > min(next_check time on hosts) + host_check_timeout) > ) > > Possible problems: on demand host checks (if part of a network is > down) could screw up the timing since everything else stops. > > Just because a service check is scheduled doesn't mean that it is > going to run (time period may be wrong etc), but if its determined to > be non-runnable the escheduled time for it should cause a re-write of > the status.dat file correct? > > There has to be an easier way of determining if nagios is running > doesn't there? > Easy isn't the problem. The trick is to get it to work from a different and almost always less privileged user. Perhaps a simple neb-module can touch some file every 10 seconds and if it's 30 seconds old the GUI could then reasonably suspect that nagios has crashed. However, I haven't noticed nagios crashing on a modern system. It used to, with glibc-2.0.35 and linuxthreads-0.7 (which was really buggy). Since upgrading to glibc-2.3.30 (or some such) and linuxthreads-0.10 everything is running smoothly, so this isn't really a problem for me or any of our customers. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From carlospeon at hotmail.com Thu Oct 13 17:18:06 2005 From: carlospeon at hotmail.com (=?iso-8859-1?B?Q2FybG9zIFBl824gQ29zdGE=?=) Date: Thu, 13 Oct 2005 17:18:06 +0200 Subject: nagios_grapher collect script. Message-ID: Hi, I'm testing nagios grapher and I'm having some problems with de collect script. The problem is than nagios dies on my test machine and I'd like to discuss some pice of code with Marius Hein: while (1) { # Open the PIPE and read open( FIFO, $pipe ) || die "Can't open pipe ($pipe): $!"; $fifo_string = ; close( FIFO ); ... } This seemed estrange to me, because closing the fifo causes locks to the fifo write process, Nagios (please tell me if I'm wrong). I try open( FIFO, $pipe ) || die "Can't open pipe ($pipe): $!"; while ($fifo_string = ) { ... } and adding a "close( FIFO );" in the sig_term handler. This seems have solved the problem. I our production machine (more than 400 active checks) I have to modify our performance data handler to allow listen from a fifo, and with the first piece of code I loose preformance data randomly. It's working fine with the second loop. Hope this is useful and thanks for your job, nagios grapher is fantastic for large environments! ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rouilj at cs.umb.edu Thu Oct 13 17:28:21 2005 From: rouilj at cs.umb.edu (John P. Rouillard) Date: Thu, 13 Oct 2005 11:28:21 -0400 Subject: Verion 2.0b4 how does cgi's nagios_check_command work? In-Reply-To: Your message of "Thu, 13 Oct 2005 16:54:36 +0200." <434E752C.3050301@op5.se> References: <434E752C.3050301@op5.se> Message-ID: <200510131528.j9DFSLnH018877@mx1.cs.umb.edu> In message <434E752C.3050301 at op5.se>, Andreas Ericsson writes: >John P. Rouillard wrote: >> In message <434D5AF2.7010504 at op5.se>, >> Andreas Ericsson writes: >> >> >>>John P. Rouillard wrote: >>> >>>>In message <43467920.4070508 at op5.se>, >>>>Andreas Ericsson writes: >>>> >>>>>John P. Rouillard wrote: >>>>> >>>>> >>>>>>In message <43465AB9.6020304 at op5.se>, >>>>>>Andreas Ericsson writes: >>>>>> >>>>>> >>>>>>>John P. Rouillard wrote: >>>> >>>> >>>>>>>>The reason I ask is >>>>>>>>that nagios was down and the cgi's all happily reported that it was >>>>>>>>up. Could this be because the host and service status files were >>>>>>>>available since the machine crashed? >>>>>>> >>>>>>>Yes, that's almost certainly it. There is no really good way of >>>>>>>detecting that nagios is actually running unless you're logged in as >>>>>>>root. >>>>>> >>>>>>Hmm, I am not sure I follow why you need to be logged in as root. >>>>> >>>>>Because otherwise you shouldn't have access to reading process >>>>>information about another users process. >>>>> >>>>> >>>>>>Why not stat the status.log file and check to see if its (mtime) >>>>>timestamp is less than the setting of: >>>>>> >>>>>> status_update_interval*2 >>>>>> >>>>>>if aggregate_status_updates is enabled? One could also allow a setting >>>>>>"freshness_threshold" in cgi.cfg that is the number of seconds/minutes >>>>>>old the status.dat file is allowed to be if aggregate_status_updates >>>>>>isn't set. >>>>> >>>>>Good idea. Write the code for it and submit a patch. >>>> >>>>Actually not so much a good idea. There is actully a creation >>>>datestamp in the status.dat file I was going to use, but I decided to >>>>run an experiment first. I have my status_update_interval set to 3 >>>>seconds. >>>> >>>>I used check_fileage to warn me if the file's age was over 3 seconds >>>>and ran it in a while loop. It failed often. The longest interval was >>>>139 seconds between updates with a number of periods of 20-30 seconds. >>>> >>>>My guesses are: nagios only writes the status file when it needs to. >>> >>>This is correct. The status_update_interval is never checked, although >>>the status is updated every time a service changes either state or >>>output (or a host, for that matter). >> >> >> Ideally nagios would provide a next_check_time in the status.dat, but >> I wonder if that could be usefully intuited from: >> >> min( >> min(next_check time on services) + service_check_timeout), >> min(next_check time on hosts) + host_check_timeout) >> ) >> >> Possible problems: on demand host checks (if part of a network is >> down) could screw up the timing since everything else stops. >> >> Just because a service check is scheduled doesn't mean that it is >> going to run (time period may be wrong etc), but if its determined to >> be non-runnable the escheduled time for it should cause a re-write of >> the status.dat file correct? >> >> There has to be an easier way of determining if nagios is running >> doesn't there? >> > >Easy isn't the problem. The trick is to get it to work from a different >and almost always less privileged user. Perhaps a simple neb-module can >touch some file every 10 seconds and if it's 30 seconds old the GUI >could then reasonably suspect that nagios has crashed. > >However, I haven't noticed nagios crashing on a modern system. It used >to, with glibc-2.0.35 and linuxthreads-0.7 (which was really buggy). >Since upgrading to glibc-2.3.30 (or some such) and linuxthreads-0.10 >everything is running smoothly, so this isn't really a problem for me or >any of our customers. Yeah, but its just bad that the gui will blithely go on even if there is no nagios daemon running because somebody (maliciously?) killed it manually and failed to restart it, or it failed to restart on boot after a crash (maybe bad config files, need to check to see if rc script will delete the status file as well as the command file) etc. At this point I guess I'll just have to live with it. -- rouilj John Rouillard =========================================================================== My employers don't acknowledge my existence much less my opinions. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From agrajag at dragaera.net Thu Oct 13 19:28:07 2005 From: agrajag at dragaera.net (Sean Dilda) Date: Thu, 13 Oct 2005 13:28:07 -0400 Subject: Check diskspace on remote machines In-Reply-To: References: Message-ID: <434E9927.4090603@dragaera.net> Stalin wrote: > Hi, > > Just got nagios (1.2) installed and configured with 1.4.2 plugins. I'd > few hosts/service checks configured. All works fine. Now i'd like to > do the same for remote machines and wondering if anyone could share > the plugin to monitor diskspace on remote machines or any pointers is > greatly appreciated. Take a look at http://sourceforge.net/tracker/index.php?func=detail&aid=1006780&group_id=29880&atid=541465. There's a check_snmp_disk.pl. It connects to the machine via snmp, then works exactly like check_disk, except without having to do ssh or npre. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From todd_barbera at wgbh.org Thu Oct 13 19:53:25 2005 From: todd_barbera at wgbh.org (Todd Barbera) Date: Thu, 13 Oct 2005 13:53:25 -0400 Subject: CHECK_NRPE: Error receiving data from daemon. Message-ID: <000201c5d01f$026f0140$620a010a@GRAPEAPE> Hi, I have Nagios version 2.0b3 installed and running and I am using check_nrpe 2.0 running on Solaris 9 (sparc). I've been using check_nrpe successfully against Solaris and Windows hosts. Today I downloaded check_log2 from Nagiosexchange under the Solaris section as regular check_log was not working properly. If I run check_log2 locally, it works just fine. When I try to run it via check_nrpe from my Nagios host, I receive the above message. This occurs via the command line. (check_nrpe -H 10.1.2.2 -c check_log2 -t 60). Turning on debugging on the client side did not produce any errors in syslog. Other check_nrpe commands work fine on the same two hosts. I know that the code was tested with an earlier version of check_nrpe, but I was hoping to get it working with 2.0. Any ideas? Thanks. Todd Barbera Systems Administrator WGBH Educational Foundation 617 300-3443 WGBH Boston informs, inspires, and entertains millions through public broadcasting, the Web, and educational multimedia, and access services for people with disabilities. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From btmanmeh at verizon.net Thu Oct 13 20:38:13 2005 From: btmanmeh at verizon.net (Max) Date: Thu, 13 Oct 2005 14:38:13 -0400 Subject: Check diskspace on remote machines In-Reply-To: <434E9927.4090603@dragaera.net> References: <434E9927.4090603@dragaera.net> Message-ID: <434EA995.3050200@verizon.net> Sean Dilda wrote: > Take a look at > http://sourceforge.net/tracker/index.php?func=detail&aid=1006780&group_id=29880&atid=541465. > There's a check_snmp_disk.pl. It connects to the machine via snmp, then > works exactly like check_disk, except without having to do ssh or npre. Thank you very much! That was exactly what I was looking for, and also a bonus of remote load and process checking. Excellent. Max ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From davea at support.kcm.org Thu Oct 13 21:14:07 2005 From: davea at support.kcm.org (Dave Augustus) Date: Thu, 13 Oct 2005 14:14:07 -0500 Subject: NRPE- Results from amavis_check.pl Message-ID: <1129230847.10928.15.camel@kcm40202> Hello All, I am trying to check the status of amavisd on a remote host via NRPE. I am using the amavis-check.pl from nagiosexchange.org . When I check locally, I get accurate responses. However, when via NRPE, I always get OK, even though NRPE receives the response of CRITICAL. Here is the local plugin output: ./amavis_check.pl --server localhost --from davea at support.kcm.org --to davea at support.kcm.org --port 10024 CRITICAL - amavisd-new server unreachable Here is the NRPE output: ./check_nrpe -H rs1.kcm.org -c check_amavisd CRITICAL - amavisd-new server unreachable nrpe.cfg contains the command definition: command[check_amavisd]=/usr/lib/nagios/kcm_plugins/amavis_check.pl -- server localhost --from davea at support.kcm.org --to davea at support.kcm.org --port 10024 THE PROBLEM IS: The status that nagios has on this check is ALWAYS OK!!! Any ideas? Running Nagios 2.04b on linux Running NRPE 2.0-3.2 on linux -- Dave Augustus ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From stalinsk at gmail.com Thu Oct 13 21:14:39 2005 From: stalinsk at gmail.com (Stalin) Date: Thu, 13 Oct 2005 12:14:39 -0700 Subject: check_nrpe - Could not complete SSL handshake. In-Reply-To: <434E0A90.7020401@op5.se> References: <434E0A90.7020401@op5.se> Message-ID: To answer to who have replied so far... 5666 port is open for sure. here is the output [nagios at host libexec]$ telnet remotehostname.com 5666 Trying xx.xxx.xxx.xxx... Connected to remotehostname.com (xx.xxx.xxx.xxx). Escape character is '^]'. Connection closed by foreign host. I'm running nrpe as a standalone deamon. so i assume there isn't any need to add entries in hosts.allow. Yes, i restarted nrpe after making changes to nrpe.cfg. Also i'd like give some additional information to this problem. the ip that shows in /var/log/messages file i.e. "Host xx.xxx.xxx.xxx is not allowed to talk to us!" is NOT the ip address from where the request origniated from. As the monitoring machine is behind the firewall, the requesting nagios server ip gets natted at fw and sends an internal address to nrpe deamon. In fact, i added the internal ip to nrpe.cfg still no luck. Regards, Stalin On 10/13/05, Andreas Ericsson wrote: > Stalin wrote: > > Hi All, > > > > I'm completely lost on this. I get this error when i try to run > > commands on remote host via check_nrpe. I've checked FAQ on this > > error, didn't help much. > > > > [nagios at host]$ telnet hostname.com 5666 > > Trying xx.xxx.xxx.xxx... > > telnet: Unable to connect to remote host: Connection refused > > > > remote server:tail -f /var/log/message > > Oct 12 15:07:25 remotehost nrpe[7542]: Host xx.xxx.xxx.xxx is not > > allowed to talk to us! > > > > This is weird. Since the connection was refused, indicating that NRPE > isn't listening on the port, you really shouldn't have gotten anything > at all in the logs. Since you did anyway .... > > > I've included the ip of monitoring server on allowed_ip_list of > > nrpe.cfg, but still i get this error on the remote server logs file. > > > > ... did you restart nrpe after adding the ip to the config? > > -- > Andreas Ericsson andreas.ericsson at op5.se > OP5 AB www.op5.se > Tel: +46 8-230225 Fax: +46 8-230231 > ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From john at stilen.com Thu Oct 13 21:18:52 2005 From: john at stilen.com (John Stile) Date: Thu, 13 Oct 2005 12:18:52 -0700 Subject: check_https error: Cannot retrieve server certificate. Message-ID: <1129231132.21491.30.camel@localhost> What does this error mean: Command: /usr/lib/nagios/plugins/check_http --ssl -H localhost Error: CRITICAL - Cannot retrieve server certificate. I turned on debugging, but nothing was logged (In nrpe.cfg, set 'debug=1'). I verified that an ssl certificate is served to the https client, that is valid, but self signed by me. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Joseph.Petrucci at ddiworld.com Thu Oct 13 21:27:39 2005 From: Joseph.Petrucci at ddiworld.com (Petrucci, Joseph) Date: Thu, 13 Oct 2005 15:27:39 -0400 Subject: check_https error: Cannot retrieve server certificate. Message-ID: The only difference I am seeing in your call and one that I use that works is I also supply the URL. >-----Original Message----- >From: nagios-users-admin at lists.sourceforge.net >[mailto:nagios-users-admin at lists.sourceforge.net]On Behalf Of >John Stile >Sent: Thursday, October 13, 2005 3:19 PM >To: nagios-users at lists.sourceforge.net >Subject: [Nagios-users] check_https error: Cannot retrieve server >certificate. > > >What does this error mean: >Command: > /usr/lib/nagios/plugins/check_http --ssl -H localhost >Error: > CRITICAL - Cannot retrieve server certificate. > >I turned on debugging, but nothing was logged (In nrpe.cfg, set >'debug=1'). > >I verified that an ssl certificate is served to the https client, that >is valid, but self signed by me. > > > > >------------------------------------------------------- >This SF.Net email is sponsored by: >Power Architecture Resource Center: Free content, downloads, >discussions, >and more. http://solutions.newsforge.com/ibmarch.tmpl >_______________________________________________ >Nagios-users mailing list >Nagios-users at lists.sourceforge.net >https://lists.sourceforge.net/lists/listinfo/nagios-users >::: Please include Nagios version, plugin version (-v) and OS >when reporting any issue. >::: Messages without supporting info will risk being sent to /dev/null > > ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From stalinsk at gmail.com Thu Oct 13 22:06:34 2005 From: stalinsk at gmail.com (Stalin) Date: Thu, 13 Oct 2005 13:06:34 -0700 Subject: check_nrpe - Could not complete SSL handshake. In-Reply-To: References: <434E0A90.7020401@op5.se> Message-ID: Nevermind. That was brutal. I got it working. Previously I'd set server_address to bind to localhost (only one interface), After i changed to ip address, restarted nrpe deamon, all started to work fine. dang! Thanks everyone for your inputs. On 10/13/05, Stalin wrote: > To answer to who have replied so far... > > 5666 port is open for sure. here is the output > > [nagios at host libexec]$ telnet remotehostname.com 5666 > Trying xx.xxx.xxx.xxx... > Connected to remotehostname.com (xx.xxx.xxx.xxx). > Escape character is '^]'. > Connection closed by foreign host. > > I'm running nrpe as a standalone deamon. so i assume there isn't any > need to add entries in hosts.allow. > > Yes, i restarted nrpe after making changes to nrpe.cfg. > > Also i'd like give some additional information to this problem. the ip > that shows in /var/log/messages file i.e. "Host xx.xxx.xxx.xxx is not > allowed to talk to us!" is NOT the ip address from where the request > origniated from. As the monitoring machine is behind the firewall, the > requesting nagios server ip gets natted at fw and sends an internal > address to nrpe deamon. In fact, i added the internal ip to nrpe.cfg > still no luck. > > Regards, > Stalin > > > On 10/13/05, Andreas Ericsson wrote: > > Stalin wrote: > > > Hi All, > > > > > > I'm completely lost on this. I get this error when i try to run > > > commands on remote host via check_nrpe. I've checked FAQ on this > > > error, didn't help much. > > > > > > [nagios at host]$ telnet hostname.com 5666 > > > Trying xx.xxx.xxx.xxx... > > > telnet: Unable to connect to remote host: Connection refused > > > > > > remote server:tail -f /var/log/message > > > Oct 12 15:07:25 remotehost nrpe[7542]: Host xx.xxx.xxx.xxx is not > > > allowed to talk to us! > > > > > > > This is weird. Since the connection was refused, indicating that NRPE > > isn't listening on the port, you really shouldn't have gotten anything > > at all in the logs. Since you did anyway .... > > > > > I've included the ip of monitoring server on allowed_ip_list of > > > nrpe.cfg, but still i get this error on the remote server logs file. > > > > > > > ... did you restart nrpe after adding the ip to the config? > > > > -- > > Andreas Ericsson andreas.ericsson at op5.se > > OP5 AB www.op5.se > > Tel: +46 8-230225 Fax: +46 8-230231 > > > ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From john at stilen.com Thu Oct 13 22:33:08 2005 From: john at stilen.com (John Stile) Date: Thu, 13 Oct 2005 13:33:08 -0700 Subject: check_https error: Cannot retrieve server certificate. In-Reply-To: References: Message-ID: <1129235589.21491.47.camel@localhost> On Thu, 2005-10-13 at 15:27 -0400, Petrucci, Joseph wrote: > The only difference I am seeing in your call and one that I use that works is I also supply the URL. > > >-----Original Message----- > >From: nagios-users-admin at lists.sourceforge.net > >[mailto:nagios-users-admin at lists.sourceforge.net]On Behalf Of > >John Stile > >Sent: Thursday, October 13, 2005 3:19 PM > >To: nagios-users at lists.sourceforge.net > >Subject: [Nagios-users] check_https error: Cannot retrieve server > >certificate. > > > > > >What does this error mean: > >Command: > > /usr/lib/nagios/plugins/check_http --ssl -H localhost > >Error: > > CRITICAL - Cannot retrieve server certificate. > > > >I turned on debugging, but nothing was logged (In nrpe.cfg, set > >'debug=1'). > > > >I verified that an ssl certificate is served to the https client, that > >is valid, but self signed by me. I verified the ssl connection to the local host with: openssl s_client -connect localhost:443 Then I tried many versions of the check_http incantation, to no avail: /usr/lib/nagios/plugins/check_http -S -H localhost -p 443 CRITICAL - Cannot retrieve server certificate. /usr/lib/nagios/plugins/check_http -S -H localhost -p 443 CRITICAL - Cannot retrieve server certificate. /usr/lib/nagios/plugins/check_http -S -H localhost -u https://localhost -p 443 CRITICAL - Cannot retrieve server certificate. /usr/lib/nagios/plugins/check_http -S -H localhost -u https://localhost CRITICAL - Cannot retrieve server certificate. Is there anything else I could be doing wrong? On this client, I have installed the following debian 3.1 packages: nagios-common 2:1.3-cvs. 2:1.3-cvs. nagios-nrpe-plugin 2.0-7 2.0-7 nagios-nrpe-server 2.0-7 2.0-7 nagios-plugins 1.4-6 1.4-6 nagios-text 2:1.3-cvs. 2:1.3-cvs. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From admin at damian-flynn.info Fri Oct 14 01:23:32 2005 From: admin at damian-flynn.info (Damian Flynn) Date: Fri, 14 Oct 2005 00:23:32 +0100 Subject: Nagios_Grapher RRD Error Message-ID: <342336F9B0BD7F41A67F1A203B69B58E5556@anubis.enigma.sys> Hi, I have this tools installed and the collector working, the RRD's appear to be created ok, and the rights are ok also, but I continue to get the following errors like "cannot parse CF in 'DEF:RTA=/var/spool/nagios..... I have RRD 1.2 installed, but can not figure out what is my root issue. I get this creation information in my log: 2005-10-12 12:02:48 RRD: rrdtool create /var/spool/nagios/rrd/bal-smtpfw1/356dec7f98ed3cc3ee60ce33288a5119.rrd --step= DS:Loss:GAUGE:600:U:U DS:RTA:GAUGE:600:U:U RRA:AVERAGE:0.5:5:600 RRA:MAX:0.5:5:600 RRA:MIN:0.5:5:600 RRA:AVERAGE:0.5:30:600 RRA:MAX:0.5:30:600 RRA:MIN:0.5:30:600 RRA:AVERAGE:0.5:120:600 RRA:MAX:0.5:120:600 RRA:MIN:0.5:120:600 RRA:AVERAGE:0.5:1440:600 RRA:MAX:0.5:1440:600 RRA:MIN:0.5:1440:600 2005-10-12 12:02:48 RRD: [bal-smtpfw1][PING]:creating '/var/spool/nagios/rrd/bal-smtpfw1/356dec7f98ed3cc3ee60ce33288a5119.rrd' : No such file or directory 2005-10-12 12:02:48 SERVICEEXT: [bal-smtpfw1][PING]:Entry in serviceextinfo not exists, create NEW! 2005-10-12 12:02:48 VALUES: [bal-smtpfw1][PING]: RTA=163.31 Loss=0 2005-10-12 12:02:48 RRD: rrdtool update /var/spool/nagios/rrd/bal-smtpfw1/356dec7f98ed3cc3ee60ce33288a5119.rrd --template=RTA:Loss N:163.31:0 2005-10-12 12:02:48 RRD: [bal-smtpfw1][PING]:opening '/var/spool/nagios/rrd/bal-smtpfw1/356dec7f98ed3cc3ee60ce33288a5119.rrd' : No such file or directory Any help would be great Regards Damian ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at natecarlson.com Fri Oct 14 01:28:32 2005 From: nagios at natecarlson.com (Nate Carlson) Date: Thu, 13 Oct 2005 18:28:32 -0500 (CDT) Subject: check_ping vs. check_icmp? Message-ID: What's the difference between check_ping and check_icmp? Thanks! ------------------------------------------------------------------------ | nate carlson | natecars at natecarlson.com | http://www.natecarlson.com | | depriving some poor village of its idiot since 1981 | ------------------------------------------------------------------------ ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From carlospeon at hotmail.com Fri Oct 14 08:32:33 2005 From: carlospeon at hotmail.com (=?iso-8859-1?B?Q2FybG9zIFBl824gQ29zdGE=?=) Date: Fri, 14 Oct 2005 08:32:33 +0200 Subject: Nagios_Grapher RRD Error In-Reply-To: <342336F9B0BD7F41A67F1A203B69B58E5556@anubis.enigma.sys> References: <342336F9B0BD7F41A67F1A203B69B58E5556@anubis.enigma.sys> Message-ID: It seems /var/spool/nagios/rrd does not exists or nagios does not have write permissions on it. >From: "Damian Flynn" >To: >Subject: [Nagios-users] Nagios_Grapher RRD Error >Date: Fri, 14 Oct 2005 00:23:32 +0100 > >Hi, > >I have this tools installed and the collector working, the RRD's appear >to be created ok, and the rights are ok also, but I continue to get the >following errors like "cannot parse CF in >'DEF:RTA=/var/spool/nagios..... > >I have RRD 1.2 installed, but can not figure out what is my root issue. > >I get this creation information in my log: > >2005-10-12 12:02:48 RRD: rrdtool create >/var/spool/nagios/rrd/bal-smtpfw1/356dec7f98ed3cc3ee60ce33288a5119.rrd >--step= DS:Loss:GAUGE:600:U:U DS:RTA:GAUGE:600:U:U RRA:AVERAGE:0.5:5:600 >RRA:MAX:0.5:5:600 RRA:MIN:0.5:5:600 RRA:AVERAGE:0.5:30:600 >RRA:MAX:0.5:30:600 RRA:MIN:0.5:30:600 RRA:AVERAGE:0.5:120:600 >RRA:MAX:0.5:120:600 RRA:MIN:0.5:120:600 RRA:AVERAGE:0.5:1440:600 >RRA:MAX:0.5:1440:600 RRA:MIN:0.5:1440:600 >2005-10-12 12:02:48 RRD: [bal-smtpfw1][PING]:creating >'/var/spool/nagios/rrd/bal-smtpfw1/356dec7f98ed3cc3ee60ce33288a5119.rrd' >: No such file or directory >2005-10-12 12:02:48 SERVICEEXT: [bal-smtpfw1][PING]:Entry in >serviceextinfo not exists, create NEW! >2005-10-12 12:02:48 VALUES: [bal-smtpfw1][PING]: RTA=163.31 Loss=0 >2005-10-12 12:02:48 RRD: rrdtool update >/var/spool/nagios/rrd/bal-smtpfw1/356dec7f98ed3cc3ee60ce33288a5119.rrd >--template=RTA:Loss N:163.31:0 >2005-10-12 12:02:48 RRD: [bal-smtpfw1][PING]:opening >'/var/spool/nagios/rrd/bal-smtpfw1/356dec7f98ed3cc3ee60ce33288a5119.rrd' >: No such file or directory > > >Any help would be great > >Regards >Damian > > >------------------------------------------------------- >This SF.Net email is sponsored by: >Power Architecture Resource Center: Free content, downloads, discussions, >and more. http://solutions.newsforge.com/ibmarch.tmpl >_______________________________________________ >Nagios-users mailing list >Nagios-users at lists.sourceforge.net >https://lists.sourceforge.net/lists/listinfo/nagios-users >::: Please include Nagios version, plugin version (-v) and OS when >reporting any issue. >::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Fri Oct 14 09:46:07 2005 From: ae at op5.se (Andreas Ericsson) Date: Fri, 14 Oct 2005 09:46:07 +0200 Subject: NRPE- Results from amavis_check.pl In-Reply-To: <1129230847.10928.15.camel@kcm40202> References: <1129230847.10928.15.camel@kcm40202> Message-ID: <434F623F.4050708@op5.se> Dave Augustus wrote: > Hello All, > > I am trying to check the status of amavisd on a remote host via NRPE. > > I am using the amavis-check.pl from nagiosexchange.org . > > When I check locally, I get accurate responses. However, when via NRPE, > I always get OK, even though NRPE receives the response of CRITICAL. > > Here is the local plugin output: > > ./amavis_check.pl --server localhost --from davea at support.kcm.org --to > davea at support.kcm.org --port 10024 > CRITICAL - amavisd-new server unreachable > > > Here is the NRPE output: > ./check_nrpe -H rs1.kcm.org -c check_amavisd > CRITICAL - amavisd-new server unreachable > > nrpe.cfg contains the command definition: > command[check_amavisd]=/usr/lib/nagios/kcm_plugins/amavis_check.pl -- > server localhost --from davea at support.kcm.org --to davea at support.kcm.org > --port 10024 > > THE PROBLEM IS: > The status that nagios has on this check is ALWAYS OK!!! > > Any ideas? > The plugin issues exit(0) somewhere, or silently drops off the end of the script which also causes its exit status to be 0. Neither Nagios nor NRPE cares about what a plugin prints. NRPE only forwards the exit-status of the program, and Nagios uses that and that only to determine the status. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Fri Oct 14 09:54:42 2005 From: ae at op5.se (Andreas Ericsson) Date: Fri, 14 Oct 2005 09:54:42 +0200 Subject: check_ping vs. check_icmp? In-Reply-To: References: Message-ID: <434F6442.8020004@op5.se> Nate Carlson wrote: > What's the difference between check_ping and check_icmp? > check_ping executes the external command ping, while check_icmp does its own fiddling with the ICMP protocol. As a result, check_icmp is faster, smarter and requires less resources to run. check_icmp can also be used in check_host mode (create a symlink check_host -> check_icmp and execute check_host) which runs extremely quickly to determine if a host is up whenever a service check fails. Ordinary check_ping would take 5 seconds to determine that the host is up in an ordinary setup, while check_host usually does the same trick in just about the same amount of time as it takes for a packet to make a round trip to the destination target (usually between 1 and 10 milliseconds on a local network). Considering the fact that service checks aren't executed while host checks are running, the check_host mode of check_icmp is a fairly major improvement in terms of overall Nagios performance. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From shaikh.m.a at gmail.com Fri Oct 14 12:20:09 2005 From: shaikh.m.a at gmail.com (M.Saeed Shaikh) Date: Fri, 14 Oct 2005 15:50:09 +0530 Subject: Insufficient Data Message-ID: <9e66c57d0510140320n4fbdaef0w1e8014d01b3b5502@mail.gmail.com> HI, My nagios is working perefectly for almost 40 servers and monitoring / alerts worlking properly. My problem is that While I am clicking on Trends and make Reports is show almost 55 to 60% Undeetined data Why is it so ? if all is working peoper then it must be 0% -- M.A.Shaikh Linux System Administrator -------------- next part -------------- An HTML attachment was scrubbed... URL: From sudheer at tgs-solutions.com Fri Oct 14 14:32:29 2005 From: sudheer at tgs-solutions.com (Sudheer Muddappa) Date: Fri, 14 Oct 2005 08:32:29 -0400 Subject: Plugin timed out after 10 seconds In-Reply-To: <9e66c57d0510140320n4fbdaef0w1e8014d01b3b5502@mail.gmail.com> References: <9e66c57d0510140320n4fbdaef0w1e8014d01b3b5502@mail.gmail.com> Message-ID: <434FA55D.9090703@tgs-solutions.com> Hi all, Do you know what si the meaning of this? Plug in timed out after 10 seconds !! The server is up and running, But I get this message often. Thanks, Sudheer Muddappa ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Joseph.Petrucci at ddiworld.com Fri Oct 14 14:37:33 2005 From: Joseph.Petrucci at ddiworld.com (Petrucci, Joseph) Date: Fri, 14 Oct 2005 08:37:33 -0400 Subject: Plugin timed out after 10 seconds Message-ID: Your default timeout setting for plugins is 10 seconds. I get this at clients for the following reasons 1.. Firewall (proper ports not opened) 2.. monitor agent (NSClient, etc.... ) service not running on monitored machine 3.. Network problems. I know there are more reasons but these are the 3 I usually run into >-----Original Message----- >From: nagios-users-admin at lists.sourceforge.net >[mailto:nagios-users-admin at lists.sourceforge.net]On Behalf Of Sudheer >Muddappa >Sent: Friday, October 14, 2005 8:32 AM >To: nagios-users at lists.sourceforge.net >Subject: [Nagios-users] Plugin timed out after 10 seconds > > >Hi all, >Do you know what si the meaning of this? > >Plug in timed out after 10 seconds !! > >The server is up and running, But I get this message often. > > >Thanks, > >Sudheer Muddappa > > > > > >------------------------------------------------------- >This SF.Net email is sponsored by: >Power Architecture Resource Center: Free content, downloads, >discussions, >and more. http://solutions.newsforge.com/ibmarch.tmpl >_______________________________________________ >Nagios-users mailing list >Nagios-users at lists.sourceforge.net >https://lists.sourceforge.net/lists/listinfo/nagios-users >::: Please include Nagios version, plugin version (-v) and OS >when reporting any issue. >::: Messages without supporting info will risk being sent to /dev/null > > ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rouilj at cs.umb.edu Fri Oct 14 14:39:53 2005 From: rouilj at cs.umb.edu (John P. Rouillard) Date: Fri, 14 Oct 2005 08:39:53 -0400 Subject: check_ping vs. check_icmp? Unexpected output. In-Reply-To: Your message of "Fri, 14 Oct 2005 09:54:42 +0200." <434F6442.8020004@op5.se> References: <434F6442.8020004@op5.se> Message-ID: <200510141239.j9ECdrtR017616@mx1.cs.umb.edu> In message <434F6442.8020004 at op5.se>, Andreas Ericsson writes: >Nate Carlson wrote: >> What's the difference between check_ping and check_icmp? >> >check_icmp can also be used in check_host mode (create a symlink >check_host -> check_icmp and execute check_host) which runs extremely >quickly to determine if a host is up whenever a service check fails. >Ordinary check_ping would take 5 seconds to determine that the host is >up in an ordinary setup, while check_host usually does the same trick in >just about the same amount of time as it takes for a packet to make a >round trip to the destination target (usually between 1 and 10 >milliseconds on a local network). That is great info. "check_icmp -h" doesn't mention the check_host functioanlity at all. >Considering the fact that service checks aren't executed while host >checks are running, the check_host mode of check_icmp is a fairly major >improvement in terms of overall Nagios performance. I have the check_host link in place and with the 1.4.2 release of the plugins, with cvs id: * $Id: check_icmp.c,v 1.5 2005/02/01 07:33:13 stanleyhopcroft Exp $ I am getting (manually wrapped): /usr/lib/nagios/plugins/check_host www.yahoo.com www.freshmeat.net mode: 1 mode: 1 mode: 1 mode: 1 mode: 1 mode: 1 mode: 1 mode: 1 mode: 1 OK - www.yahoo.com responds to ICMP. Packet 1, rta 25.769ms| \ pkt=1;;0;5 rta=25.769;1000.000;1000.000;; Can I just comment out the code that reports the mode? Also with -v -v specified, the command line above causes a segfault. /usr/lib/nagios/plugins/check_host -v -v www.yahoo.com www.freshmeat.net mode: 1 mode: 1 mode: 1 mode: 1 mode: 1 mode: 1 mode: 1 mode: 1 mode: 1 ttl set to 64 Setting alarm timeout to 10 seconds packets: 5, targets: 9 target_interval: 0.000, pkt_interval 1000.000 crit.rta: 1000.000 max_completion_time: 91000.000 max_completion_time: 91000000 timeout: 10 Timout must be at lest 92 crit = {1000000, 100%}, warn = {1000000, 100%} pkt_interval: 1000000 target_interval: 0 retry_interval: 0 icmp_pkt_size: 64 timeout: 10 Segmentation fault but with 1 -v, it succeeds: /usr/lib/nagios/plugins/check_host -v www.yahoo.com www.freshmeat.net mode: 1 mode: 1 mode: 1 mode: 1 mode: 1 mode: 1 mode: 1 mode: 1 mode: 1 ttl set to 64 Setting alarm timeout to 10 seconds packets: 5, targets: 9 target_interval: 0.000, pkt_interval 1000.000 crit.rta: 1000.000 max_completion_time: 91000.000 max_completion_time: 91000000 timeout: 10 Timout must be at lest 92 crit = {1000000, 100%}, warn = {1000000, 100%} pkt_interval: 1000000 target_interval: 0 retry_interval: 0 icmp_pkt_size: 64 timeout: 10 25.784 ms rtt from 216.109.117.206, outgoing ttl: 64, incoming ttl: 58 OK - www.yahoo.com responds to ICMP. Packet 1, rta 25.784ms| \ pkt=1;;0;5 rta=25.784;1000.000;1000.000;; Also, if I change the host to /usr/lib/nagios/plugins/check_host www.ibm.com I get mode: 1 mode: 1 mode: 1 mode: 1 mode: 1 mode: 1 Segmentation fault not exactly what I was expecting. With -v's I get: /usr/lib/nagios/plugins/check_host -v -v -v www.ibm.com mode: 1 mode: 1 mode: 1 mode: 1 mode: 1 mode: 1 ttl set to 64 Setting alarm timeout to 10 seconds packets: 5, targets: 6 target_interval: 0.000, pkt_interval 1000.000 crit.rta: 1000.000 max_completion_time: 61000.000 max_completion_time: 61000000 timeout: 10 Timout must be at lest 62 icmp_pkt_size = 64 icmp_pkt_size = 64 crit = {1000000, 100%}, warn = {1000000, 100%} pkt_interval: 1000000 target_interval: 0 retry_interval: 0 icmp_pkt_size: 64 timeout: 10 Segmentation fault Note that www.ibm.com resolves but doesn't respond to pings. Also the nagios-plugins Makefile fails to create the links to check_host or check_rta_multi. -- rouilj John Rouillard =========================================================================== My employers don't acknowledge my existence much less my opinions. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From agrajag at dragaera.net Fri Oct 14 15:13:35 2005 From: agrajag at dragaera.net (Sean Dilda) Date: Fri, 14 Oct 2005 09:13:35 -0400 Subject: check_ping vs. check_icmp? In-Reply-To: <434F6442.8020004@op5.se> References: <434F6442.8020004@op5.se> Message-ID: <434FAEFF.3020705@dragaera.net> Andreas Ericsson wrote: > > check_icmp can also be used in check_host mode (create a symlink > check_host -> check_icmp and execute check_host) which runs extremely > quickly to determine if a host is up whenever a service check fails. > Ordinary check_ping would take 5 seconds to determine that the host is > up in an ordinary setup, while check_host usually does the same trick in > just about the same amount of time as it takes for a packet to make a > round trip to the destination target (usually between 1 and 10 > milliseconds on a local network). > > That sounds useful. However, how does it do that without getting a lot of false positives? If the network is just a little loaded, it sounds like it'll report a host as down when its not. For what its worth, its not too difficult to get check_ping to only take 1 second to timeout. However, that's still longer than 1 to 10 milliseconds. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Fri Oct 14 15:22:59 2005 From: ae at op5.se (Andreas Ericsson) Date: Fri, 14 Oct 2005 15:22:59 +0200 Subject: check_ping vs. check_icmp? Unexpected output. In-Reply-To: <200510141239.j9ECdrtR017616@mx1.cs.umb.edu> References: <200510141239.j9ECdrtR017616@mx1.cs.umb.edu> Message-ID: <434FB133.9040100@op5.se> John P. Rouillard wrote: > In message <434F6442.8020004 at op5.se>, > Andreas Ericsson writes: > > >>Nate Carlson wrote: >> >>>What's the difference between check_ping and check_icmp? >>> >> >>check_icmp can also be used in check_host mode (create a symlink >>check_host -> check_icmp and execute check_host) which runs extremely >>quickly to determine if a host is up whenever a service check fails. >>Ordinary check_ping would take 5 seconds to determine that the host is >>up in an ordinary setup, while check_host usually does the same trick in >>just about the same amount of time as it takes for a packet to make a >>round trip to the destination target (usually between 1 and 10 >>milliseconds on a local network). > > > That is great info. "check_icmp -h" doesn't mention the check_host > functioanlity at all. > I know. I'm one of those lazy people who think users should read the code if they want documentation. I'll make a mention of it though. > >>Considering the fact that service checks aren't executed while host >>checks are running, the check_host mode of check_icmp is a fairly major >>improvement in terms of overall Nagios performance. > > > > I have the check_host link in place and with the 1.4.2 release of the > plugins, with cvs id: > > * $Id: check_icmp.c,v 1.5 2005/02/01 07:33:13 stanleyhopcroft Exp $ > This isn't anywhere near the latest version and I believe I've added quite a few patches to it since then. For the latest version you'll need the plugin-pack from http://oss.op5.se/nagios. Quite a lot of bugs has been fixed since february. [ ... cut ... ] > > Also the nagios-plugins Makefile fails to create the links to > check_host or check_rta_multi. > True. The check_rta_multi isn't very tested or documented. The check_host link would most likely be nice to create though. I'll add it to the OP5 version of the plugins as soon as I get around to it. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Fri Oct 14 16:10:01 2005 From: ae at op5.se (Andreas Ericsson) Date: Fri, 14 Oct 2005 16:10:01 +0200 Subject: check_ping vs. check_icmp? In-Reply-To: <434FAEFF.3020705@dragaera.net> References: <434F6442.8020004@op5.se> <434FAEFF.3020705@dragaera.net> Message-ID: <434FBC39.90309@op5.se> Sean Dilda wrote: > Andreas Ericsson wrote: > >> >> check_icmp can also be used in check_host mode (create a symlink >> check_host -> check_icmp and execute check_host) which runs extremely >> quickly to determine if a host is up whenever a service check fails. >> Ordinary check_ping would take 5 seconds to determine that the host is >> up in an ordinary setup, while check_host usually does the same trick >> in just about the same amount of time as it takes for a packet to make >> a round trip to the destination target (usually between 1 and 10 >> milliseconds on a local network). >> >> > > That sounds useful. However, how does it do that without getting a lot > of false positives? The keyword here is *usually*. In pseudo-code, it goes like this; while(sent < packets_to_send) { send_packet(sent++); wait_for_packet(); if(response_is_proper_ICMP_ECHOREPLY) { print_timing_info(); exit(); } } In check_host mode, it'll wait by default 0.2 seconds for each packet before sending a new one until it reaches the maximum completion time, which is calculated by multiplying the number of IP-addresses to ping (if you're using a resolvable hostname with check_host it'll try to ping all the addresses. If any is up it'll return OK), the number of packets to sent and the critical threshold. When all packets are sent and none still has come in, it'll wait the remainder of the max_completion_time or until it gets the signal-based ALARM (by default ten seconds). You'll only get the extremely fast response time when the host is up and responding to ping properly, which is usually the case. Nagios runs a hostcheck immediately whenever a service reports non-ok, so this is really a major benefit. If you're interested in even more details, download the latest plugin-release at oss.op5.se/nagios and run: check_host -vvvv www.microsoft.com check_host -vvvv www.google.com check_host -vvvv www.microsoft.com www.google.com www.microsoft.com doesn't allow pings, while www.google.com does > If the network is just a little loaded, it sounds > like it'll report a host as down when its not. > It won't. Trust me on this. > For what its worth, its not too difficult to get check_ping to only take > 1 second to timeout. However, that's still longer than 1 to 10 > milliseconds. > If you're referring to the -t flag to check_ping, you'll then force it to kill the ping program as it doesn't pass any -i flag to it. This means the ping binary will only send one packet per second and thus can't possibly complete in less than (number_of_packets) seconds unless you change the code and re-compile. Unfortunately, this is beyond most users and in general more hassle than it's worth since check_icmp does it a whole lot faster anyways. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Fri Oct 14 16:10:02 2005 From: ae at op5.se (Andreas Ericsson) Date: Fri, 14 Oct 2005 16:10:02 +0200 Subject: check_ping vs. check_icmp? Unexpected output. In-Reply-To: <434FB133.9040100@op5.se> References: <200510141239.j9ECdrtR017616@mx1.cs.umb.edu> <434FB133.9040100@op5.se> Message-ID: <434FBC3A.8060604@op5.se> Andreas Ericsson wrote: >> >> I have the check_host link in place and with the 1.4.2 release of the >> plugins, with cvs id: >> >> * $Id: check_icmp.c,v 1.5 2005/02/01 07:33:13 stanleyhopcroft Exp $ >> > > This isn't anywhere near the latest version and I believe I've added > quite a few patches to it since then. For the latest version you'll need > the plugin-pack from http://oss.op5.se/nagios. > > Quite a lot of bugs has been fixed since february. > Although this one wasn't, now that I've checked it up (sorry about that). Using check_host mode against a resolvable hostname allocates a host-table that may be too small to hold all targets. Writing to the table works nicely, since it's just shuffling of pointers taken care of on the heap but when it tries to access any of the structs variables it crashes with SIGSEGV. This is read access only, so it's not a security problem, but annoying all the same. http://oss.op5.se/nagios/op5plugins-2005-10-14.tar.gz holds an updated version, which also fixes a less annoying bug in check_hpjd along with some minor stuff in check_users (for instance). -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Fri Oct 14 16:52:31 2005 From: marc at ena.com (Marc Powell) Date: Fri, 14 Oct 2005 09:52:31 -0500 Subject: Insufficient Data Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of M.Saeed Shaikh > Sent: Friday, October 14, 2005 5:20 AM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] Insufficient Data > > HI, > > My nagios is working perefectly for almost 40 servers and monitoring / > alerts worlking properly. > > My problem is that > While I am clicking on Trends and make Reports is show almost 55 to 60% > Undeetined data > > Why is it so ? > if all is working peoper then it must be 0% Without knowing where the undetermined time periods are, I would hazard that Nagios was unable to determine the initial state of the host/service within your specified reporting period. Either set the initial state when creating the report or backtrack through more archives to determine the initial state. http://www.nagios.org/faqs/viewfaq.php?faq_id=136 http://www.nagios.org/faqs/viewfaq.php?faq_id=135&expand=false&showdesc= true -- Marc ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From robmossrm at aol.com Fri Oct 14 17:21:26 2005 From: robmossrm at aol.com (Rob Moss) Date: Fri, 14 Oct 2005 16:21:26 +0100 Subject: Max number of services that can be monitored ? In-Reply-To: <47971.127.0.0.1.1129152098.squirrel@www.goldenfields.co.nz> References: <47971.127.0.0.1.1129152098.squirrel@www.goldenfields.co.nz> Message-ID: <434FCCF6.9080405@aol.com> misc at viceconsulting.co.nz wrote: > <>Hi Rob, >>>With embedded Perl enabled, I was getting memory leaks which was >>>eventually crashing the Nagios server so I disabled it. >>> >>> >>Which version of perl, and what platform? x86? sparc? >> >>Older versions of perl (pre 5.6) had some problems being compiled in to >>apps such as mod_perl and stuff, but 5.6.1 or above "should" be fine. >> >>Memory leaks in Solaris wouldn't surprise me, but any of the Linuxes or >>BSD's would.. I'm runing Solaris 8 Sparc with Perl 5.6.1 builtin >>without any known leaks.. >> >> > >perl 5.8.0 on redhat es 3 update 5 x86... My memory was decreasing over >time crashing the server, at my end, disabling embedded perl seemed to fix >the prob... > > Okay, according to the Perl website, you're running Perl 5.8.0 Pre-Release. Go bleeding edge, nice one Redhat! http://www.perl.com/CPAN/ports/index.html#linux This would probably have quite a few known bugs and possibly memory leaks. I recommend that you update your version of Perl, to 5.8.7 which is the latest stable version (anything above that is experimental). Or you could downgrade to a trusty version of Perl 5.6.x Cheers rob. -------------- next part -------------- An HTML attachment was scrubbed... URL: From drussell at mpifix.com Fri Oct 14 17:57:09 2005 From: drussell at mpifix.com (Danny Russell) Date: Fri, 14 Oct 2005 09:57:09 -0600 Subject: Service with multiple hosts in different contact groups Message-ID: <028B223BB24E5443A9784BBE1CC5EBE062B521@Lisa.mpifix.com> I have a service that has hosts from multiple contact groups. For example I have service 'check_disk'. I have 3 contact groups, tier1 tier2 and tier3. I have 3 hosts, SaltLakeCity (tier1), Sandy (tier2), and Draper (tier3). All 3 hosts are assigned to 'check_disk'. I have added the 3 contact groups to the contact_groups in the service 'check_disk'. The problem is that all 3 groups are getting notified regardless of which host it is. The hostgroup -> contact_group relationship doesn't stick or something. Can some one give me some ideas of what to check? Thanks! -------------- next part -------------- An HTML attachment was scrubbed... URL: From marc at ena.com Fri Oct 14 18:23:16 2005 From: marc at ena.com (Marc Powell) Date: Fri, 14 Oct 2005 11:23:16 -0500 Subject: Service with multiple hosts in different contact groups Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Danny Russell > Sent: Friday, October 14, 2005 10:57 AM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] Service with multiple hosts in different contact > groups > > I have a service that has hosts from multiple contact groups. > > > > For example I have service 'check_disk'. > > I have 3 contact groups, tier1 tier2 and tier3. > > I have 3 hosts, SaltLakeCity (tier1), Sandy (tier2), and Draper (tier3). > > All 3 hosts are assigned to 'check_disk'. I am presuming that you're using multiple, comma-separated hosts for the host_name directive or hostgroup_name in the service definition. > I have added the 3 contact groups to the contact_groups in the service > 'check_disk'. The problem is that all 3 groups are getting notified You've told Nagios to notify all three groups for problems with this service so this is correct behavior. > regardless of which host it is. The hostgroup -> contact_group > relationship doesn't stick or something. The hostgroup->contact_group relationship only controls who receives _host_ notifications in this context. Host notification and service notifications are completely independent and can have different contacts entirely. There is no implicit link between host contact_groups and service contact_groups. You'll need to make three different service definitions, one per host, with the appropriate contact_groups. -- Marc ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From admin at damian-flynn.info Fri Oct 14 19:12:26 2005 From: admin at damian-flynn.info (Damian Flynn) Date: Fri, 14 Oct 2005 18:12:26 +0100 Subject: Nagios_Grapher RRD Error Message-ID: <342336F9B0BD7F41A67F1A203B69B58E7C7D@anubis.enigma.sys> My Bad, I copied the original log extract - your right i had Security issues but sorted that. I realised that the Log I pasted earlier was incorrect, as it illustrated a rights issue which I had resolved. The correct Log extract looks as follows: 2005-10-14 11:42:07 PIPE: bal-trend1 PING PING OK - Packet loss = 0%, RTA = 122.44 ms 2005-10-14 11:42:07 RRD: rrdtool create /var/spool/nagios/rrd/bal-trend1/205a292aceed56e53abdb8d85a46120a.rrd --step= DS:Loss:GAUGE:600:U:U DS:RTA:GAUGE:600:U:U RRA:AVERAGE:0.5:5:600 RRA:MAX:0.5:5:600 RRA:MIN:0.5:5:600 RRA:AVERAGE:0.5:30:600 RRA:MAX:0.5:30:600 RRA:MIN:0.5:30:600 RRA:AVERAGE:0.5:120:600 RRA:MAX:0.5:120:600 RRA:MIN:0.5:120:600 RRA:AVERAGE:0.5:1440:600 RRA:MAX:0.5:1440:600 RRA:MIN:0.5:1440:600 2005-10-14 11:42:07 RRD: [bal-trend1][PING]:/var/spool/nagios/rrd/bal-trend1/205a292aceed56e53abdb8d85a46120a.rrd - successfully created! 2005-10-14 11:42:07 SERVICEEXT: [bal-trend1][PING]:Entry in serviceextinfo not exists, create NEW! 2005-10-14 11:42:07 VALUES: [bal-trend1][PING]: RTA=122.44 Loss=0 2005-10-14 11:42:07 RRD: rrdtool update /var/spool/nagios/rrd/bal-trend1/205a292aceed56e53abdb8d85a46120a.rrd --template=RTA:Loss N:122.44:0 2005-10-14 11:42:07 PIPE: bei-isp1 Online PING OK - Packet loss = 0%, RTA = 287.90 ms This issue appears in the rrd2-graph.cgi; Where i recive the following message in a PNG. Some Error occured: ~~~~~~~~~~~~~~~~~~~ RRD Error: Cannot parse CF in 'DEF:RTA=/var/spool/nagios/rrd/bal-trend1/205a292aceed56e53abdb8d85a46120a.rrd Options: -aPNG -cBACK#fcfcfc -cFONT#333333 -cARROW#ff0000 -cGRID#330000 -cMGRID#330000 -cCANVAS#ffffff Values: DEF:RTA=/var/spool/nagios/rrd/bal-trend1/205a292aceed56e53abdb8d85a46120a.rrd:RTA:AVERAGE Graphs: AREA:RTA#00a000:Time to answer/J VRULE:1129305600#FF0000 VRULE:1129302000#FF0000 GPrints: [11:58/14.10.2005] As I mentioned before this system is running 1.2.6 thank you Damian ________________________________ From: nagios-users-admin at lists.sourceforge.net on behalf of Carlos Pe?n Costa Sent: Fri 14/10/2005 07:32 To: nagios-users at lists.sourceforge.net Subject: RE: [Nagios-users] Nagios_Grapher RRD Error It seems /var/spool/nagios/rrd does not exists or nagios does not have write permissions on it. >From: "Damian Flynn" >To: >Subject: [Nagios-users] Nagios_Grapher RRD Error >Date: Fri, 14 Oct 2005 00:23:32 +0100 > >Hi, > >I have this tools installed and the collector working, the RRD's appear >to be created ok, and the rights are ok also, but I continue to get the >following errors like "cannot parse CF in >'DEF:RTA=/var/spool/nagios..... > >I have RRD 1.2 installed, but can not figure out what is my root issue. > >I get this creation information in my log: > >2005-10-12 12:02:48 RRD: rrdtool create >/var/spool/nagios/rrd/bal-smtpfw1/356dec7f98ed3cc3ee60ce33288a5119.rrd >--step= DS:Loss:GAUGE:600:U:U DS:RTA:GAUGE:600:U:U RRA:AVERAGE:0.5:5:600 >RRA:MAX:0.5:5:600 RRA:MIN:0.5:5:600 RRA:AVERAGE:0.5:30:600 >RRA:MAX:0.5:30:600 RRA:MIN:0.5:30:600 RRA:AVERAGE:0.5:120:600 >RRA:MAX:0.5:120:600 RRA:MIN:0.5:120:600 RRA:AVERAGE:0.5:1440:600 >RRA:MAX:0.5:1440:600 RRA:MIN:0.5:1440:600 >2005-10-12 12:02:48 RRD: [bal-smtpfw1][PING]:creating >'/var/spool/nagios/rrd/bal-smtpfw1/356dec7f98ed3cc3ee60ce33288a5119.rrd' >: No such file or directory >2005-10-12 12:02:48 SERVICEEXT: [bal-smtpfw1][PING]:Entry in >serviceextinfo not exists, create NEW! >2005-10-12 12:02:48 VALUES: [bal-smtpfw1][PING]: RTA=163.31 Loss=0 >2005-10-12 12:02:48 RRD: rrdtool update >/var/spool/nagios/rrd/bal-smtpfw1/356dec7f98ed3cc3ee60ce33288a5119.rrd >--template=RTA:Loss N:163.31:0 >2005-10-12 12:02:48 RRD: [bal-smtpfw1][PING]:opening >'/var/spool/nagios/rrd/bal-smtpfw1/356dec7f98ed3cc3ee60ce33288a5119.rrd' >: No such file or directory > > >Any help would be great > >Regards >Damian > > >------------------------------------------------------- >This SF.Net email is sponsored by: >Power Architecture Resource Center: Free content, downloads, discussions, >and more. http://solutions.newsforge.com/ibmarch.tmpl >_______________________________________________ >Nagios-users mailing list >Nagios-users at lists.sourceforge.net >https://lists.sourceforge.net/lists/listinfo/nagios-users >::: Please include Nagios version, plugin version (-v) and OS when >reporting any issue. >::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -------------- next part -------------- An HTML attachment was scrubbed... URL: From nagios at natecarlson.com Fri Oct 14 19:15:50 2005 From: nagios at natecarlson.com (Nate Carlson) Date: Fri, 14 Oct 2005 12:15:50 -0500 (CDT) Subject: check_ping vs. check_icmp? In-Reply-To: <434F6442.8020004@op5.se> References: <434F6442.8020004@op5.se> Message-ID: On Fri, 14 Oct 2005, Andreas Ericsson wrote: > check_ping executes the external command ping, while check_icmp does its > own fiddling with the ICMP protocol. As a result, check_icmp is faster, > smarter and requires less resources to run. > > check_icmp can also be used in check_host mode (create a symlink > check_host -> check_icmp and execute check_host) which runs extremely > quickly to determine if a host is up whenever a service check fails. > Ordinary check_ping would take 5 seconds to determine that the host is > up in an ordinary setup, while check_host usually does the same trick in > just about the same amount of time as it takes for a packet to make a > round trip to the destination target (usually between 1 and 10 > milliseconds on a local network). > > Considering the fact that service checks aren't executed while host > checks are running, the check_host mode of check_icmp is a fairly major > improvement in terms of overall Nagios performance. In other words, check_icmp is certainly worth making the change. :) Thanks - I'll grab the newest version of the plugin pack you mention in later messages, and make the cut! ------------------------------------------------------------------------ | nate carlson | natecars at natecarlson.com | http://www.natecarlson.com | | depriving some poor village of its idiot since 1981 | ------------------------------------------------------------------------ ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From carlospeon at hotmail.com Fri Oct 14 19:48:32 2005 From: carlospeon at hotmail.com (=?iso-8859-1?B?Q2FybG9zIFBl824gQ29zdGE=?=) Date: Fri, 14 Oct 2005 19:48:32 +0200 Subject: Nagios_Grapher RRD Error In-Reply-To: <342336F9B0BD7F41A67F1A203B69B58E7C7D@anubis.enigma.sys> References: <342336F9B0BD7F41A67F1A203B69B58E7C7D@anubis.enigma.sys> Message-ID: Rrd seems fine. If the graph config is right # Ping RTA define ngraph{ service_name PING graph_log_regex rta = ([0-9]*\.[0-9]*) graph_value RTA graph_units ms graph_legend Time to answer page RTA rrd_plottype AREA rrd_color 00a000 } you only need to wait or schedule inmediate ping checks to have enought data to plot. >From: "Damian Flynn" >To: Carlos Pe?n Costa >, >Subject: RE: [Nagios-users] Nagios_Grapher RRD Error >Date: Fri, 14 Oct 2005 18:12:26 +0100 > >My Bad, > >I copied the original log extract - your right i had Security issues but >sorted that. > >I realised that the Log I pasted earlier was incorrect, as it illustrated a >rights issue which I had resolved. The correct Log extract looks as >follows: > >2005-10-14 11:42:07 PIPE: bal-trend1 PING PING OK - Packet loss = 0%, >RTA = 122.44 ms >2005-10-14 11:42:07 RRD: rrdtool create >/var/spool/nagios/rrd/bal-trend1/205a292aceed56e53abdb8d85a46120a.rrd >--step= DS:Loss:GAUGE:600:U:U DS:RTA:GAUGE:600:U:U RRA:AVERAGE:0.5:5:600 >RRA:MAX:0.5:5:600 RRA:MIN:0.5:5:600 RRA:AVERAGE:0.5:30:600 >RRA:MAX:0.5:30:600 RRA:MIN:0.5:30:600 RRA:AVERAGE:0.5:120:600 >RRA:MAX:0.5:120:600 RRA:MIN:0.5:120:600 RRA:AVERAGE:0.5:1440:600 >RRA:MAX:0.5:1440:600 RRA:MIN:0.5:1440:600 >2005-10-14 11:42:07 RRD: >[bal-trend1][PING]:/var/spool/nagios/rrd/bal-trend1/205a292aceed56e53abdb8d85a46120a.rrd >- successfully created! >2005-10-14 11:42:07 SERVICEEXT: [bal-trend1][PING]:Entry in serviceextinfo >not exists, create NEW! >2005-10-14 11:42:07 VALUES: [bal-trend1][PING]: RTA=122.44 Loss=0 >2005-10-14 11:42:07 RRD: rrdtool update >/var/spool/nagios/rrd/bal-trend1/205a292aceed56e53abdb8d85a46120a.rrd >--template=RTA:Loss N:122.44:0 >2005-10-14 11:42:07 PIPE: bei-isp1 Online PING OK - Packet loss = 0%, >RTA = 287.90 ms > > >This issue appears in the rrd2-graph.cgi; Where i recive the following >message in a PNG. > > >Some Error occured: >~~~~~~~~~~~~~~~~~~~ >RRD Error: Cannot parse CF in >'DEF:RTA=/var/spool/nagios/rrd/bal-trend1/205a292aceed56e53abdb8d85a46120a.rrd >Options: > -aPNG > -cBACK#fcfcfc > -cFONT#333333 > -cARROW#ff0000 > -cGRID#330000 > -cMGRID#330000 > -cCANVAS#ffffff >Values: > >DEF:RTA=/var/spool/nagios/rrd/bal-trend1/205a292aceed56e53abdb8d85a46120a.rrd:RTA:AVERAGE >Graphs: > AREA:RTA#00a000:Time to answer/J > VRULE:1129305600#FF0000 > VRULE:1129302000#FF0000 >GPrints: >[11:58/14.10.2005] > >As I mentioned before this system is running 1.2.6 >thank you >Damian > >________________________________ > >From: nagios-users-admin at lists.sourceforge.net on behalf of Carlos Pe?n >Costa >Sent: Fri 14/10/2005 07:32 >To: nagios-users at lists.sourceforge.net >Subject: RE: [Nagios-users] Nagios_Grapher RRD Error > > > >It seems /var/spool/nagios/rrd does not exists or nagios does not have >write >permissions on it. > > > >From: "Damian Flynn" > >To: > >Subject: [Nagios-users] Nagios_Grapher RRD Error > >Date: Fri, 14 Oct 2005 00:23:32 +0100 > > > >Hi, > > > >I have this tools installed and the collector working, the RRD's appear > >to be created ok, and the rights are ok also, but I continue to get the > >following errors like "cannot parse CF in > >'DEF:RTA=/var/spool/nagios..... > > > >I have RRD 1.2 installed, but can not figure out what is my root issue. > > > >I get this creation information in my log: > > > >2005-10-12 12:02:48 RRD: rrdtool create > >/var/spool/nagios/rrd/bal-smtpfw1/356dec7f98ed3cc3ee60ce33288a5119.rrd > >--step= DS:Loss:GAUGE:600:U:U DS:RTA:GAUGE:600:U:U RRA:AVERAGE:0.5:5:600 > >RRA:MAX:0.5:5:600 RRA:MIN:0.5:5:600 RRA:AVERAGE:0.5:30:600 > >RRA:MAX:0.5:30:600 RRA:MIN:0.5:30:600 RRA:AVERAGE:0.5:120:600 > >RRA:MAX:0.5:120:600 RRA:MIN:0.5:120:600 RRA:AVERAGE:0.5:1440:600 > >RRA:MAX:0.5:1440:600 RRA:MIN:0.5:1440:600 > >2005-10-12 12:02:48 RRD: [bal-smtpfw1][PING]:creating > >'/var/spool/nagios/rrd/bal-smtpfw1/356dec7f98ed3cc3ee60ce33288a5119.rrd' > >: No such file or directory > >2005-10-12 12:02:48 SERVICEEXT: [bal-smtpfw1][PING]:Entry in > >serviceextinfo not exists, create NEW! > >2005-10-12 12:02:48 VALUES: [bal-smtpfw1][PING]: RTA=163.31 Loss=0 > >2005-10-12 12:02:48 RRD: rrdtool update > >/var/spool/nagios/rrd/bal-smtpfw1/356dec7f98ed3cc3ee60ce33288a5119.rrd > >--template=RTA:Loss N:163.31:0 > >2005-10-12 12:02:48 RRD: [bal-smtpfw1][PING]:opening > >'/var/spool/nagios/rrd/bal-smtpfw1/356dec7f98ed3cc3ee60ce33288a5119.rrd' > >: No such file or directory > > > > > >Any help would be great > > > >Regards > >Damian > > > > > >------------------------------------------------------- > >This SF.Net email is sponsored by: > >Power Architecture Resource Center: Free content, downloads, discussions, > >and more. http://solutions.newsforge.com/ibmarch.tmpl > >_______________________________________________ > >Nagios-users mailing list > >Nagios-users at lists.sourceforge.net > >https://lists.sourceforge.net/lists/listinfo/nagios-users > >::: Please include Nagios version, plugin version (-v) and OS when > >reporting any issue. > >::: Messages without supporting info will risk being sent to /dev/null > > > > >------------------------------------------------------- >This SF.Net email is sponsored by: >Power Architecture Resource Center: Free content, downloads, discussions, >and more. http://solutions.newsforge.com/ibmarch.tmpl >_______________________________________________ >Nagios-users mailing list >Nagios-users at lists.sourceforge.net >https://lists.sourceforge.net/lists/listinfo/nagios-users >::: Please include Nagios version, plugin version (-v) and OS when >reporting any issue. >::: Messages without supporting info will risk being sent to /dev/null > > ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rouilj at cs.umb.edu Fri Oct 14 18:45:03 2005 From: rouilj at cs.umb.edu (John P. Rouillard) Date: Fri, 14 Oct 2005 12:45:03 -0400 Subject: check_ping vs. check_icmp? Unexpected output. In-Reply-To: Your message of "Fri, 14 Oct 2005 16:10:02 +0200." <434FBC3A.8060604@op5.se> References: <434FBC3A.8060604@op5.se> Message-ID: <200510141645.j9EGj3b2028980@mx1.cs.umb.edu> In message <434FBC3A.8060604 at op5.se>, Andreas Ericsson writes: >Andreas Ericsson wrote: >>> >>> I have the check_host link in place and with the 1.4.2 release of the >>> plugins, with cvs id: >>> >>> * $Id: check_icmp.c,v 1.5 2005/02/01 07:33:13 stanleyhopcroft Exp $ >> This isn't anywhere near the latest version and I believe I've added >> quite a few patches to it since then. For the latest version you'll need >> the plugin-pack from http://oss.op5.se/nagios. >> >> Quite a lot of bugs has been fixed since february. >> > >Although this one wasn't, now that I've checked it up (sorry about >that). Not a problem. > >http://oss.op5.se/nagios/op5plugins-2005-10-14.tar.gz holds an updated >version, which also fixes a less annoying bug in check_hpjd along with >some minor stuff in check_users (for instance). Cool. Thanks. BTW why have these changes not filtered back into the mainline code? -- rouilj John Rouillard =========================================================================== My employers don't acknowledge my existence much less my opinions. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Andrew.Laden at tudor.com Fri Oct 14 19:58:09 2005 From: Andrew.Laden at tudor.com (Andrew Laden) Date: Fri, 14 Oct 2005 13:58:09 -0400 Subject: check_ping vs. check_icmp? Message-ID: <56EAA5BC64E6C34F8C9EE6725D4A2DFA01AA1AFF@tudor.com> How does using check_icmp compare to using check_fping? It seems that check_fping will return a down answer much faster. Since host checks are most often run when the host is down, that seems to be the performance that we are concerned with. # time ./check_fping -H em1.dra.tudor.com FPING CRITICAL - em1.dra.tudor.com (loss=100% )|loss=100%;;;0;100 0.52s real 0.00s user 0.01s system # time ./check_icmp -H em1.dra.tudor.com CRITICAL - em1.dra.tudor.com: rta nan, lost 100%|rta=0.000ms;200.000;500.000;0; pl=100%;40;80;; 2.96s real 0.00s user 0.00s system -----Original Message----- From: Nate Carlson [mailto:nagios at natecarlson.com] Sent: Friday, October 14, 2005 1:16 PM To: Andreas Ericsson Cc: 'nagios-users at lists.sourceforge.net' Subject: Re: [Nagios-users] check_ping vs. check_icmp? On Fri, 14 Oct 2005, Andreas Ericsson wrote: > check_ping executes the external command ping, while check_icmp does > its own fiddling with the ICMP protocol. As a result, check_icmp is > faster, smarter and requires less resources to run. > > check_icmp can also be used in check_host mode (create a symlink > check_host -> check_icmp and execute check_host) which runs extremely > quickly to determine if a host is up whenever a service check fails. > Ordinary check_ping would take 5 seconds to determine that the host is > up in an ordinary setup, while check_host usually does the same trick > in just about the same amount of time as it takes for a packet to make > a round trip to the destination target (usually between 1 and 10 > milliseconds on a local network). > > Considering the fact that service checks aren't executed while host > checks are running, the check_host mode of check_icmp is a fairly > major improvement in terms of overall Nagios performance. In other words, check_icmp is certainly worth making the change. :) Thanks - I'll grab the newest version of the plugin pack you mention in later messages, and make the cut! ------------------------------------------------------------------------ | nate carlson | natecars at natecarlson.com | http://www.natecarlson.com | | depriving some poor village of its idiot since 1981 | ------------------------------------------------------------------------ ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rouilj at cs.umb.edu Fri Oct 14 20:16:17 2005 From: rouilj at cs.umb.edu (John P. Rouillard) Date: Fri, 14 Oct 2005 14:16:17 -0400 Subject: check_ping vs. check_icmp? In-Reply-To: Your message of "Fri, 14 Oct 2005 13:58:09 EDT." <56EAA5BC64E6C34F8C9EE6725D4A2DFA01AA1AFF@tudor.com> References: <56EAA5BC64E6C34F8C9EE6725D4A2DFA01AA1AFF@tudor.com> Message-ID: <200510141816.j9EIGHTj014697@mx1.cs.umb.edu> In message <56EAA5BC64E6C34F8C9EE6725D4A2DFA01AA1AFF at tudor.com>, Andrew Laden writes: >How does using check_icmp compare to using check_fping? You missed part of the message. Make a symbolic link to check_icmp called check_host and execute check_host. Now it's very fast. >It seems that check_fping will return a down answer much faster. Since host >checks are most often run when the host is down, that seems to be the >performance that we are concerned with. This still applies. ># time ./check_fping -H em1.dra.tudor.com >FPING CRITICAL - em1.dra.tudor.com (loss=100% )|loss=100%;;;0;100 > 0.52s real 0.00s user 0.01s system ># time ./check_icmp -H em1.dra.tudor.com >CRITICAL - em1.dra.tudor.com: rta nan, lost >100%|rta=0.000ms;200.000;500.000;0; pl=100%;40;80;; > 2.96s real 0.00s user 0.00s system Sorry, DNS can't resolve em1.dra.tudor.com or I'd show you how much faster it is in host mode 8-). >-----Original Message----- >From: Nate Carlson [mailto:nagios at natecarlson.com] >To: Andreas Ericsson >Subject: Re: [Nagios-users] check_ping vs. check_icmp? > >On Fri, 14 Oct 2005, Andreas Ericsson wrote: >> check_ping executes the external command ping, while check_icmp does >> its own fiddling with the ICMP protocol. As a result, check_icmp is >> faster, smarter and requires less resources to run. >> >> check_icmp can also be used in check_host mode (create a symlink >> check_host -> check_icmp and execute check_host) which runs extremely ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ >> quickly to determine if a host is up whenever a service check fails. -- rouilj John Rouillard =========================================================================== My employers don't acknowledge my existence much less my opinions. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Andrew.Laden at tudor.com Fri Oct 14 20:24:37 2005 From: Andrew.Laden at tudor.com (Andrew Laden) Date: Fri, 14 Oct 2005 14:24:37 -0400 Subject: check_ping vs. check_icmp? Message-ID: <56EAA5BC64E6C34F8C9EE6725D4A2DFA01AA1B00@tudor.com> Symlink doesn't help. If I understand the check_host portion of the logic. It will return up on the first packet receieved, However if no packets are coming back it still has to wait the full timeout.... # time ./check_fping -H em1.dra.tudor.com FPING CRITICAL - em1.dra.tudor.com (loss=100% )|loss=100%;;;0;100 0.53s real 0.00s user 0.00s system # time ./check_host -H em1.dra.tudor.com em1.dra.tudor.com is DOWN - rta: nan, lost 100%|pkt=6;5;5;5;5 pl=100%;95;100;; 10.01s real 0.00s user 0.00s system Even with -n 1 # time ./check_host -H em1.dra.tudor.com -n 1 em1.dra.tudor.com is DOWN - rta: nan, lost 100%|pkt=2;1;1;1;1 pl=100%;95;100;; 2.31s real 0.00s user 0.00s system -----Original Message----- From: John P. Rouillard [mailto:rouilj at cs.umb.edu] Sent: Friday, October 14, 2005 2:16 PM To: nagios-users at lists.sourceforge.net Subject: Re: [Nagios-users] check_ping vs. check_icmp? In message <56EAA5BC64E6C34F8C9EE6725D4A2DFA01AA1AFF at tudor.com>, Andrew Laden writes: >How does using check_icmp compare to using check_fping? You missed part of the message. Make a symbolic link to check_icmp called check_host and execute check_host. Now it's very fast. >It seems that check_fping will return a down answer much faster. Since >host checks are most often run when the host is down, that seems to be >the performance that we are concerned with. This still applies. ># time ./check_fping -H em1.dra.tudor.com FPING CRITICAL - >em1.dra.tudor.com (loss=100% )|loss=100%;;;0;100 > 0.52s real 0.00s user 0.01s system ># time ./check_icmp -H em1.dra.tudor.com CRITICAL - em1.dra.tudor.com: >rta nan, lost 100%|rta=0.000ms;200.000;500.000;0; pl=100%;40;80;; > 2.96s real 0.00s user 0.00s system Sorry, DNS can't resolve em1.dra.tudor.com or I'd show you how much faster it is in host mode 8-). >-----Original Message----- >From: Nate Carlson [mailto:nagios at natecarlson.com] >To: Andreas Ericsson >Subject: Re: [Nagios-users] check_ping vs. check_icmp? > >On Fri, 14 Oct 2005, Andreas Ericsson wrote: >> check_ping executes the external command ping, while check_icmp does >> its own fiddling with the ICMP protocol. As a result, check_icmp is >> faster, smarter and requires less resources to run. >> >> check_icmp can also be used in check_host mode (create a symlink >> check_host -> check_icmp and execute check_host) which runs extremely ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ >> quickly to determine if a host is up whenever a service check fails. -- rouilj John Rouillard =========================================================================== My employers don't acknowledge my existence much less my opinions. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Fri Oct 14 20:44:38 2005 From: ae at op5.se (Andreas Ericsson) Date: Fri, 14 Oct 2005 20:44:38 +0200 Subject: check_ping vs. check_icmp? Unexpected output. In-Reply-To: <200510141645.j9EGj3b2028980@mx1.cs.umb.edu> References: <200510141645.j9EGj3b2028980@mx1.cs.umb.edu> Message-ID: <434FFC96.8060503@op5.se> John P. Rouillard wrote: > In message <434FBC3A.8060604 at op5.se>, > Andreas Ericsson writes: > >>Andreas Ericsson wrote: >> >>>>I have the check_host link in place and with the 1.4.2 release of the >>>>plugins, with cvs id: >>>> >>>> * $Id: check_icmp.c,v 1.5 2005/02/01 07:33:13 stanleyhopcroft Exp $ >>> >>>This isn't anywhere near the latest version and I believe I've added >>>quite a few patches to it since then. For the latest version you'll need >>>the plugin-pack from http://oss.op5.se/nagios. >>> >>>Quite a lot of bugs has been fixed since february. >>> >> >>Although this one wasn't, now that I've checked it up (sorry about >>that). > > > Not a problem. > > >>http://oss.op5.se/nagios/op5plugins-2005-10-14.tar.gz holds an updated >>version, which also fixes a less annoying bug in check_hpjd along with >>some minor stuff in check_users (for instance). > > > Cool. Thanks. BTW why have these changes not filtered back into the > mainline code? > For a couple of reasons; 1) I don't like the sourceforge tracker item. It usually takes me a full five minutes to send in a single patch to it, so I don't. I also positively loathe the feedback provided from it (no in-line posting in the messages, making it difficult to follow what's being said and by whom). 2) Official maintainers keep a rock-hard policy that patches should be submitted to the tracker thingie if they're to go into the main distribution. This is good because it makes their life easier, but it's bad because it makes my life harder. 3) The plugins have gone from very portable (1.3.1) to less portable while not, strictly speaking, adding very much in the ways of new functionality (the new ps tool for solaris is neat). What has been added is perfparse output (but poorly implemented), i18n support (but poor use of library functions makes life harder on the translators). 4) Much of the code in the plugin distribution is far from stellar, but the fixes required are, literally, all over the place. Therefore it's very hard to juggle the amount of patches required to fix it all in one go. 4 is what finally turned out to be too much for me. I'm in charge of plugin quality and testing at OP5 (where I work), and the last plugin package where we used the official distro had 63 patches. It took six full days to make them all apply cleanly while it only took five minutes to fork the plugins and set up my own repo where I continually fix things and implement code-reducing library routines and improvements. Sadly, things have gotten to where it would require somewhere near a weeks work to merge the two repositories, so it keeps getting put off and the list of code-conflicts grows. This is mostly my fault, since Sean Finney and Ben Clewett have both offered to assist with the merging. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Fri Oct 14 21:36:56 2005 From: ae at op5.se (Andreas Ericsson) Date: Fri, 14 Oct 2005 21:36:56 +0200 Subject: check_ping vs. check_icmp? In-Reply-To: <56EAA5BC64E6C34F8C9EE6725D4A2DFA01AA1AFF@tudor.com> References: <56EAA5BC64E6C34F8C9EE6725D4A2DFA01AA1AFF@tudor.com> Message-ID: <435008D8.1060907@op5.se> Andrew Laden wrote: > How does using check_icmp compare to using check_fping? > > It seems that check_fping will return a down answer much faster. Since host > checks are most often run when the host is down, that seems to be the > performance that we are concerned with. > This might seem to be the case, but it actually isn't. A hostcheck is run each time a service changes from whatever to any non-OK state. In a (somewhat) healthy network hostchecks are being run when the host is up more often than when they're down. The opposite is ofcourse true if there are hosts being down for a long time or if a whole segment of the network goes to lunch, but check_icmp can sometimes deduce this through other means than by simply not getting any OK responses (it detects routing errors, among other things). > # time ./check_fping -H em1.dra.tudor.com > FPING CRITICAL - em1.dra.tudor.com (loss=100% )|loss=100%;;;0;100 > 0.52s real 0.00s user 0.01s system > # time ./check_icmp -H em1.dra.tudor.com > CRITICAL - em1.dra.tudor.com: rta nan, lost > 100%|rta=0.000ms;200.000;500.000;0; pl=100%;40;80;; > 2.96s real 0.00s user 0.00s system > This is due to a couple of different things. 1) A logical error adding critical.rta to the max_completion_time too many times in check_icmp. This was fixed some time ago. http://oss.op5.se/nagios/op5plugins-2005-10-14.tar.gz for fresh code (fourth time today I post that link....). With the fix in place, check_icmp finishes closer to 0.7 seconds, 2) check_fping sets critical RTA to 100ms, while check_icmp sets it to 500ms. check_icmp can't possibly finish in 0.5 seconds if it has to wait 0.5 seconds to make sure there are no more packets coming in within the maximum threshold. For fair testing, you should use check_icmp -i 100 -w 50.0,20% -c 100.0,40% -p 1 check_fping -w 50.0,20% -c 100ms,40% -p 1 check_icmp -i 100 -w 50.0,20% -c 100.0,40% -p 5 check_fping -w 50.0,20% -c 100.0,40% -p 5 Note that when check_icmp is used in check_host mode it sets thresholds very differently (-w 2s,100% -c 2s,100%) but you can override this with the usual -w and -c switches. If used with a hostname rather than an IP-address it also checks *all* ip-addresses connected to the hostname. This ofcourse also has an impact on timing. For some more benefits of check_icmp, you can try running check_host -H 193.201.96.45 and check_host -H oss.op5.se. It'll work the same with check_icmp in normal mode. Community question here; Would it be sane to treat an ICMP_PORTUNREACH from the intended target host as a valid icmp response? For hostchecks only, perhaps? > -----Original Message----- > From: Nate Carlson [mailto:nagios at natecarlson.com] > Sent: Friday, October 14, 2005 1:16 PM > To: Andreas Ericsson > Cc: 'nagios-users at lists.sourceforge.net' > Subject: Re: [Nagios-users] check_ping vs. check_icmp? > > On Fri, 14 Oct 2005, Andreas Ericsson wrote: > >>check_ping executes the external command ping, while check_icmp does >>its own fiddling with the ICMP protocol. As a result, check_icmp is >>faster, smarter and requires less resources to run. >> >>check_icmp can also be used in check_host mode (create a symlink >>check_host -> check_icmp and execute check_host) which runs extremely >>quickly to determine if a host is up whenever a service check fails. >>Ordinary check_ping would take 5 seconds to determine that the host is >>up in an ordinary setup, while check_host usually does the same trick >>in just about the same amount of time as it takes for a packet to make >>a round trip to the destination target (usually between 1 and 10 >>milliseconds on a local network). >> >>Considering the fact that service checks aren't executed while host >>checks are running, the check_host mode of check_icmp is a fairly >>major improvement in terms of overall Nagios performance. > > > In other words, check_icmp is certainly worth making the change. :) > > Thanks - I'll grab the newest version of the plugin pack you mention in > later messages, and make the cut! > > ------------------------------------------------------------------------ > | nate carlson | natecars at natecarlson.com | http://www.natecarlson.com | > | depriving some poor village of its idiot since 1981 | > ------------------------------------------------------------------------ > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, discussions, > and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, discussions, > and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Andrew.Laden at tudor.com Fri Oct 14 22:37:17 2005 From: Andrew.Laden at tudor.com (Andrew Laden) Date: Fri, 14 Oct 2005 16:37:17 -0400 Subject: check_ping vs. check_icmp? Message-ID: <56EAA5BC64E6C34F8C9EE6725D4A2DFA01AA1B06@tudor.com> Ok, grabbed the latest version of the plugins. (my fault on that, I wanted the check_icmp only, so I just grabbed the latest of that. Didn't realize there was a more recent one in the full package.) If I run with the options you suggest, then I do see the performance difference. It's the "out of the box" test that is the problem then. Given the default parameters check_fping runs faster on a host down then check_icmp. You have to tweak it for better performance. # time ./check_host -H 10.8.10.201 10.8.10.201 is DOWN - rta: nan, lost 100%|pkt=6;5;5;5;5 pl=100%;95;100;; 3.05s real 0.00s user 0.00s system # time ./check_fping -H 10.8.10.201 FPING CRITICAL - 10.8.10.201 (loss=100% )|loss=100%;;;0;100 0.51s real 0.01s user 0.00s system # time ./check_host -i 100 -w 50.0,%20 -c 100.0,40% -p 1 -H 10.8.10.201 10.8.10.201 is DOWN - rta: nan, lost 100%|pkt=2;1;1;1;1 pl=100%;95;100;; 0.26s real 0.00s user 0.00s system I am more worried about the host down state. Yes a host check will run when any service check fails. But on a host that is up, they both return relativly quickly. It's the host down case that will concern me, as that is what will slow nagios down. Guess I have to play with the numbers a bit. -----Original Message----- From: Andreas Ericsson [mailto:ae at op5.se] Sent: Friday, October 14, 2005 3:37 PM To: 'nagios-users at lists.sourceforge.net' Subject: Re: [Nagios-users] check_ping vs. check_icmp? Andrew Laden wrote: > How does using check_icmp compare to using check_fping? > > It seems that check_fping will return a down answer much faster. Since > host checks are most often run when the host is down, that seems to be > the performance that we are concerned with. > This might seem to be the case, but it actually isn't. A hostcheck is run each time a service changes from whatever to any non-OK state. In a (somewhat) healthy network hostchecks are being run when the host is up more often than when they're down. The opposite is ofcourse true if there are hosts being down for a long time or if a whole segment of the network goes to lunch, but check_icmp can sometimes deduce this through other means than by simply not getting any OK responses (it detects routing errors, among other things). > # time ./check_fping -H em1.dra.tudor.com FPING CRITICAL - > em1.dra.tudor.com (loss=100% )|loss=100%;;;0;100 > 0.52s real 0.00s user 0.01s system > # time ./check_icmp -H em1.dra.tudor.com CRITICAL - > em1.dra.tudor.com: rta nan, lost 100%|rta=0.000ms;200.000;500.000;0; > pl=100%;40;80;; > 2.96s real 0.00s user 0.00s system > This is due to a couple of different things. 1) A logical error adding critical.rta to the max_completion_time too many times in check_icmp. This was fixed some time ago. http://oss.op5.se/nagios/op5plugins-2005-10-14.tar.gz for fresh code (fourth time today I post that link....). With the fix in place, check_icmp finishes closer to 0.7 seconds, 2) check_fping sets critical RTA to 100ms, while check_icmp sets it to 500ms. check_icmp can't possibly finish in 0.5 seconds if it has to wait 0.5 seconds to make sure there are no more packets coming in within the maximum threshold. For fair testing, you should use check_icmp -i 100 -w 50.0,20% -c 100.0,40% -p 1 check_fping -w 50.0,20% -c 100ms,40% -p 1 check_icmp -i 100 -w 50.0,20% -c 100.0,40% -p 5 check_fping -w 50.0,20% -c 100.0,40% -p 5 Note that when check_icmp is used in check_host mode it sets thresholds very differently (-w 2s,100% -c 2s,100%) but you can override this with the usual -w and -c switches. If used with a hostname rather than an IP-address it also checks *all* ip-addresses connected to the hostname. This ofcourse also has an impact on timing. For some more benefits of check_icmp, you can try running check_host -H 193.201.96.45 and check_host -H oss.op5.se. It'll work the same with check_icmp in normal mode. Community question here; Would it be sane to treat an ICMP_PORTUNREACH from the intended target host as a valid icmp response? For hostchecks only, perhaps? > -----Original Message----- > From: Nate Carlson [mailto:nagios at natecarlson.com] > Sent: Friday, October 14, 2005 1:16 PM > To: Andreas Ericsson > Cc: 'nagios-users at lists.sourceforge.net' > Subject: Re: [Nagios-users] check_ping vs. check_icmp? > > On Fri, 14 Oct 2005, Andreas Ericsson wrote: > >>check_ping executes the external command ping, while check_icmp does >>its own fiddling with the ICMP protocol. As a result, check_icmp is >>faster, smarter and requires less resources to run. >> >>check_icmp can also be used in check_host mode (create a symlink >>check_host -> check_icmp and execute check_host) which runs extremely >>quickly to determine if a host is up whenever a service check fails. >>Ordinary check_ping would take 5 seconds to determine that the host is >>up in an ordinary setup, while check_host usually does the same trick >>in just about the same amount of time as it takes for a packet to make >>a round trip to the destination target (usually between 1 and 10 >>milliseconds on a local network). >> >>Considering the fact that service checks aren't executed while host >>checks are running, the check_host mode of check_icmp is a fairly >>major improvement in terms of overall Nagios performance. > > > In other words, check_icmp is certainly worth making the change. :) > > Thanks - I'll grab the newest version of the plugin pack you mention > in later messages, and make the cut! > > ---------------------------------------------------------------------- > -- > | nate carlson | natecars at natecarlson.com | http://www.natecarlson.com | > | depriving some poor village of its idiot since 1981 | > ---------------------------------------------------------------------- > -- > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, > discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, > discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From kevin at mica.net Fri Oct 14 22:44:18 2005 From: kevin at mica.net (Kevin Hanser) Date: Fri, 14 Oct 2005 16:44:18 -0400 Subject: nagios doesn't start at boot on Gentoo Message-ID: <435018A2.4090109@mica.net> I recently had to reboot my Gentoo box to install a new kernel, and when I did, I noticed that nagios didn't start automatically on boot. I have it added to the current runlevel (default on Gentoo). It appears to be some sort of syntax issue w/the way Gentoo starts services at boot, because when it should have started nagios, I saw the syntax displayed instead: Usage: nagios {start|stop|restart|reload|force-reload|status} I've also noticed that if I to an "rc-status" on Gentoo, nagios always shows as "off" even if it really is running. I didn't use the ebuild of Nagios to install, I installed it from source manually (v2.0 wasn't available as an ebuild when I last looked... guess it's available now in ~x86). Is there something I can do to the init.d script to make it work better with Gentoo? thx! k ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jun at lifecapturemedia.com Fri Oct 14 22:47:11 2005 From: jun at lifecapturemedia.com (jun at lifecapturemedia.com) Date: Fri, 14 Oct 2005 16:47:11 -0400 Subject: Fwd: reboot Message-ID: <1129322831.4350194fa7358@webmail.lifecapturemedia.com> Anybody can let me know how can I or any plugin/addon reboot the system when we find the remote host is down. Thank you in advance, JUn ----- End forwarded message ----- ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jun at lifecapturemedia.com Fri Oct 14 22:58:31 2005 From: jun at lifecapturemedia.com (jun) Date: Fri, 14 Oct 2005 16:58:31 -0400 Subject: boot system Message-ID: <200510141658.31737.jun@lifecapturemedia.com> Anybody can let me know how can I or any plugin/addon reboot the system when we find the remote host is down. Thank you in advance, JUn ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Fri Oct 14 23:02:16 2005 From: marc at ena.com (Marc Powell) Date: Fri, 14 Oct 2005 16:02:16 -0500 Subject: Fwd: reboot Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of jun at lifecapturemedia.com > Sent: Friday, October 14, 2005 3:47 PM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] Fwd: reboot > > > > > > > Anybody can let me know how can I or any plugin/addon reboot the system > when we > find the remote host is down. If the host is down, how will you reboot it remotely? That's the biggest question you need to answer before you can proceed. Second is what OS is running on the remote machine? If you had it attached to some kind of remote power device you might be able to script an event handler to perform whatever action is required to get it to reset the power to the device but we can't tell you how to do that because we don't know what you'd be using or how it works. If you can still access the remote system, it's unix-like, ssh is still available _and_ it's responsive to commands, you could create an event handler script that ssh's in and issues the 'shutdown -r now' command as a first attempt to reboot the machine. If all that's true however, why would nagios be seeing it as down? You could probably also use an event handler to trigger snmpd running on the remote machine to restart the box as well but again, if that's working, why is nagios showing it as down? I'd also be _very_ cautious of allowing remote restarts via SNMP if this wasn't on a private, secure network (and even then it's questionable...) -- Marc ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sudheer at tgs-solutions.com Fri Oct 14 23:14:41 2005 From: sudheer at tgs-solutions.com (Sudheer Muddappa) Date: Fri, 14 Oct 2005 17:14:41 -0400 Subject: Where to enable service check commands. In-Reply-To: <435008D8.1060907@op5.se> References: <56EAA5BC64E6C34F8C9EE6725D4A2DFA01AA1AFF@tudor.com> <435008D8.1060907@op5.se> Message-ID: <43501FC1.9090008@tgs-solutions.com> Hi all, I am running nagios 1.3 on Debian Linux. As some of you know debian tailors the packages. Which file do I have to edit to add the check_icmp command? for checking the hosts? By default Debian installs all the plugin binaries in /usr/lib/nagios/plugins and their config files in /etc/nagios-plugins/config I have created a icmp.cfg file like below. # 'check_icmp' command definition define command{ command_name check-icmp command_line /usr/lib/nagios/plugins/check_icmp -H $HOSTADDRESS$ -w $ARG1$ -c $ARG2$ } When I run a flight check i get this error Service check command 'check_icmp' specified in service 'ICMP' for host 'navigant_prod' not defined anywhere! Thanks, Sudheer Muddappa ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From kevin at mica.net Fri Oct 14 23:30:45 2005 From: kevin at mica.net (Kevin Hanser) Date: Fri, 14 Oct 2005 17:30:45 -0400 Subject: Where to enable service check commands. In-Reply-To: <43501FC1.9090008@tgs-solutions.com> References: <56EAA5BC64E6C34F8C9EE6725D4A2DFA01AA1AFF@tudor.com> <435008D8.1060907@op5.se> <43501FC1.9090008@tgs-solutions.com> Message-ID: <43502385.5000709@mica.net> Looks like maybe you have a typo? See below.. Sudheer Muddappa wrote: > Hi all, > > I am running nagios 1.3 on Debian Linux. > > I have created a icmp.cfg file like below. > > # 'check_icmp' command definition > define command{ > command_name check-icmp > command_line /usr/lib/nagios/plugins/check_icmp -H > $HOSTADDRESS$ -w $ARG1$ -c $ARG2$ > } > > When I run a flight check i get this error > Service check command 'check_icmp' specified in service 'ICMP' for > host 'navigant_prod' not defined anywhere! > In your command definition above, your "command_name" is "check-icmp". The error message indicates that it's looking for "check_icmp". If you notice, in the command definition you're using a dash ( - ), and in the error message it's using an underscore ( _ ). Change the command_name to check_icmp and it may take care of it.. k ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From BStover at NorcalMutual.com Fri Oct 14 23:35:27 2005 From: BStover at NorcalMutual.com (Stover, Beth) Date: Fri, 14 Oct 2005 14:35:27 -0700 Subject: Problem with Cingular TAP Gateway Message-ID: Hi, I've been using Nagios ver 1 to send SMS text pages to my cell through a Cingular TAP gateway since April. The number I'm using: (408)961-2819 A few weeks ago, my SMS alerts stopped working through the TAP gateway. All other alerts are working fine. Is anyone else having a problem with the Cingular TAP gateway???? I've verified that I cannot send SMS text pages through the TAP gateway from sms_client, ppp on Unix, cu on UNIX, and an SMS client for Windows. In each case, the TAP Gateway server is prompting me for a login id: ID= I never had to enter a login ID before. Cingular insists that there is no login prompt even though I'm getting one. I've been troubleshooting with Cingular for almost a week, and they insist that the problem is on my end. If anyone else is having a similar problem, it would be helpful to explain to Cingular that other people are also affected. Thanks! Beth Stover Systems Administrator 415.395.8768 -------------- next part -------------- An HTML attachment was scrubbed... URL: From sudheer at tgs-solutions.com Fri Oct 14 23:43:29 2005 From: sudheer at tgs-solutions.com (Sudheer Muddappa) Date: Fri, 14 Oct 2005 17:43:29 -0400 Subject: Where to enable service check commands. In-Reply-To: <43502385.5000709@mica.net> References: <56EAA5BC64E6C34F8C9EE6725D4A2DFA01AA1AFF@tudor.com> <435008D8.1060907@op5.se> <43501FC1.9090008@tgs-solutions.com> <43502385.5000709@mica.net> Message-ID: <43502681.4090305@tgs-solutions.com> did that. dose not seam to work :( same error Thanks, Sudheer Muddappa Kevin Hanser wrote: > Looks like maybe you have a typo? See below.. > > Sudheer Muddappa wrote: > >> Hi all, >> >> I am running nagios 1.3 on Debian Linux. >> >> > > > I have created a icmp.cfg file like below. > >> >> # 'check_icmp' command definition >> define command{ >> command_name check-icmp >> command_line /usr/lib/nagios/plugins/check_icmp -H >> $HOSTADDRESS$ -w $ARG1$ -c $ARG2$ >> } >> >> When I run a flight check i get this error >> Service check command 'check_icmp' specified in service 'ICMP' for >> host 'navigant_prod' not defined anywhere! >> > In your command definition above, your "command_name" is > "check-icmp". The error message indicates that it's looking for > "check_icmp". If you notice, in the command definition you're using a > dash ( - ), and in the error message it's using an underscore ( _ ). > > Change the command_name to check_icmp and it may take care of it.. > > k > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, discussions, > and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. ::: Messages without supporting info will risk > being sent to /dev/null > > ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From kevin at mica.net Fri Oct 14 23:56:01 2005 From: kevin at mica.net (Kevin Hanser) Date: Fri, 14 Oct 2005 17:56:01 -0400 Subject: Where to enable service check commands. In-Reply-To: <43502681.4090305@tgs-solutions.com> References: <56EAA5BC64E6C34F8C9EE6725D4A2DFA01AA1AFF@tudor.com> <435008D8.1060907@op5.se> <43501FC1.9090008@tgs-solutions.com> <43502385.5000709@mica.net> <43502681.4090305@tgs-solutions.com> Message-ID: <43502971.9070807@mica.net> Did you include icmp.cfg in your nagios.cfg file? ie, do you have a line something like this in nagios.cfg: cfg_file=/icmp.cfg Also, does /usr/lib/nagios/plugins/check_icmp exist? And is executable? (if you run it from a command prompt, what does it do?) If you can re-post your config files in their current state that may help too (icmp.cfg and the config for the service that's trying to call that command) k Sudheer Muddappa wrote: > did that. dose not seam to work :( > > same error > > Thanks, > > Sudheer Muddappa > > > > > Kevin Hanser wrote: > >> >> >> I have created a icmp.cfg file like below. >> >>> >>> # 'check_icmp' command definition >>> define command{ >>> command_name check-icmp >>> command_line /usr/lib/nagios/plugins/check_icmp -H >>> $HOSTADDRESS$ -w $ARG1$ -c $ARG2$ >>> } >>> >>> When I run a flight check i get this error >>> Service check command 'check_icmp' specified in service 'ICMP' for >>> host 'navigant_prod' not defined anywhere! >>> >> In your command definition above, your "command_name" is >> "check-icmp". The error message indicates that it's looking for >> "check_icmp". If you notice, in the command definition you're using >> a dash ( - ), and in the error message it's using an underscore ( _ ). >> >> Change the command_name to check_icmp and it may take care of it.. >> >> k >> >> > ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From RLAdams at AdamsInfoServ.Com Fri Oct 14 23:56:50 2005 From: RLAdams at AdamsInfoServ.Com (Russell Adams) Date: Fri, 14 Oct 2005 16:56:50 -0500 Subject: Problem with Cingular TAP Gateway In-Reply-To: References: Message-ID: <20051014215650.GA17141@keroberos> ID= is the proper prompt for IXO/TAP. Try entering your phone number there, numeric only. Also, give Sendpage a whirl. Its not as protocol picky as some other TAP paging software I've used. Russell On Fri, Oct 14, 2005 at 02:35:27PM -0700, Stover, Beth wrote: > Hi, > > I've been using Nagios ver 1 to send SMS text pages to my cell through > a Cingular TAP gateway since April. The number I'm using: > > (408)961-2819 > > A few weeks ago, my SMS alerts stopped working through the TAP gateway. > All other alerts are working fine. > > Is anyone else having a problem with the Cingular TAP gateway???? > > I've verified that I cannot send SMS text pages through the TAP gateway > from sms_client, ppp on Unix, cu on UNIX, and an SMS client for Windows. > In each case, the TAP Gateway server is prompting me for a login id: > > ID= > > I never had to enter a login ID before. Cingular insists that there is > no login prompt even though I'm getting one. > > I've been troubleshooting with Cingular for almost a week, and they > insist that the problem is on my end. If anyone else is having a > similar problem, it would be helpful to explain to Cingular that other > people are also affected. > > Thanks! > > Beth Stover > Systems Administrator > 415.395.8768 > > ------------------------------------------------------------------ Russell Adams RLAdams at AdamsInfoServ.com PGP Key ID: 0x1160DCB3 http://www.adamsinfoserv.com/ Fingerprint: 1723 D8CA 4280 1EC9 557F 66E8 1154 E018 1160 DCB3 -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 189 bytes Desc: not available URL: From sjohnson at santarosa.edu Fri Oct 14 23:54:14 2005 From: sjohnson at santarosa.edu (Johnson Steven) Date: Fri, 14 Oct 2005 14:54:14 -0700 Subject: Groups in services... Message-ID: <5DC5085B189C5449A59F658D5F6A9EDF03970081@busxis1.ad.santarosa.edu> I just installed Nagios 2.x (Whatever the current beta is), and I was wondering in stead of adding each host to a service definition, can you add a group name, so that everyone in that group will run that service check? If it is possible, I can not see how to do this. I would like it if all I had to do to add a new server to nagios was to add its host definition, and then add that hostname to the proper groups, based on what services need to be monitored.. But as it is, it seems I have to add that new host name to each service on its host_name line. -Steve ---------------------------------------------------------------------------- Steve Johnson - Internet Services Specialist Santa Rosa Junior College (707)524-1850 sjohnson at santarosa.edu ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From BStover at NorcalMutual.com Sat Oct 15 00:00:15 2005 From: BStover at NorcalMutual.com (Stover, Beth) Date: Fri, 14 Oct 2005 15:00:15 -0700 Subject: Problem with Cingular TAP Gateway Message-ID: I realize that this e-mail might seem off topic. I've tried the sms_client mailing list, and no one responded. Also, I know that there are other Nagios users who are sending SMS text pages FROM Nagios to themselves through the Cingular TAP gateway. I'm at the end of my rope with Cingular. If anyone else has information regarding the Cingular TAP gateway, then feel free to contact me offline. bstover at norcalmutual.com Thanks! -----Original Message----- From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Stover, Beth Sent: Friday, October 14, 2005 2:35 PM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] Problem with Cingular TAP Gateway Hi, I've been using Nagios ver 1 to send SMS text pages to my cell through a Cingular TAP gateway since April. The number I'm using: (408)961-2819 A few weeks ago, my SMS alerts stopped working through the TAP gateway. All other alerts are working fine. Is anyone else having a problem with the Cingular TAP gateway???? I've verified that I cannot send SMS text pages through the TAP gateway from sms_client, ppp on Unix, cu on UNIX, and an SMS client for Windows. In each case, the TAP Gateway server is prompting me for a login id: ID= I never had to enter a login ID before. Cingular insists that there is no login prompt even though I'm getting one. I've been troubleshooting with Cingular for almost a week, and they insist that the problem is on my end. If anyone else is having a similar problem, it would be helpful to explain to Cingular that other people are also affected. Thanks! Beth Stover Systems Administrator 415.395.8768 -------------- next part -------------- An HTML attachment was scrubbed... URL: From rouilj at cs.umb.edu Sat Oct 15 00:05:54 2005 From: rouilj at cs.umb.edu (John P. Rouillard) Date: Fri, 14 Oct 2005 18:05:54 -0400 Subject: boot system In-Reply-To: Your message of "Fri, 14 Oct 2005 16:58:31 EDT." <200510141658.31737.jun@lifecapturemedia.com> References: <200510141658.31737.jun@lifecapturemedia.com> Message-ID: <200510142205.j9EM5sdE023097@mx1.cs.umb.edu> In message <200510141658.31737.jun at lifecapturemedia.com>, jun writes: >Anybody can let me know how can I or any plugin/addon reboot the system when >we find the remote host is down. Well you will need some way to reset the power on the system to force it to reboot if its "down". There are all sorts of remotely controlled power strips. Google for remote power control. Once you have that just write a script to force the reboot. -- rouilj John Rouillard =========================================================================== My employers don't acknowledge my existence much less my opinions. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rouilj at cs.umb.edu Sat Oct 15 00:18:01 2005 From: rouilj at cs.umb.edu (John P. Rouillard) Date: Fri, 14 Oct 2005 18:18:01 -0400 Subject: Groups in services... In-Reply-To: Your message of "Fri, 14 Oct 2005 14:54:14 PDT." <5DC5085B189C5449A59F658D5F6A9EDF03970081@busxis1.ad.santarosa.edu> References: <5DC5085B189C5449A59F658D5F6A9EDF03970081@busxis1.ad.santarosa.edu> Message-ID: <200510142218.j9EMI1cg024986@mx1.cs.umb.edu> In message <5DC5085B189C5449A59F658D5F6A9EDF03970081 at busxis1.ad.santarosa.edu>, "Johnson Steven" writes: >I just installed Nagios 2.x (Whatever the current beta is), and I was >wondering in stead of adding each host to a service definition, can you >add a group name, so that everyone in that group will run that service check? Yes. >If it is possible, I can not see how to do this. It's in the manual just not where you expect it to be. >From the table of contexts/index for the nagios 2.0 doc look at Advanced topics/Time-saving tips for object definitions Synopsis: define service{ hostgroup_name HOSTGROUP1,HOSTGROUP2,...,HOSTGROUPN service_description SOMESERVICE other service directives ... } Definitely got to fix the documentation and add the hostgroup_name to the template explanations. >I would like it if all I had to do to add a new server to nagios was to add >its host definition, and then add that hostname to the proper groups, based >on what services need to be monitored. That's precisely what I am doing. Works well. Rather than changing the hostgroups, use the "hostgroups" keyword in the host definition. See the main template docs xodtemplate.html. Its buried under Index/Object configuration file options/How Is Object Data Defined?/ "Click here". Bookmark it. >But as it is, it seems I have to >add that new host name to each service on its host_name line. Nope. You can do it just like you want to. -- rouilj John Rouillard =========================================================================== My employers don't acknowledge my existence much less my opinions. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From todd_barbera at wgbh.org Sat Oct 15 00:27:45 2005 From: todd_barbera at wgbh.org (Todd Barbera) Date: Fri, 14 Oct 2005 18:27:45 -0400 Subject: check_nrpe issue Message-ID: <002d01c5d10e$802fc9d0$620a010a@GRAPEAPE> Hi, I had tried posting about a problem using NRPE 2.0 and check_log2 yesterday but unfortunately no one responded. I ended up trying the plug-in check_log.pl, but I'm still getting the same error, which is: CHECK_NRPE: Error receiving data from daemon. Other plug-ins are working fine, such as check_oracle. I'm running Nagios 2.0b3 on a Solaris 9 sparc machine. The client is the same. Does anyone have any ideas why the check_log2 and check_log.pl plug-ins do not seem to work with NRPE? BTW, check_log.pl is MUCH faster than check_log2. What exactly does this error message imply? A search of the mailing lists and FAQ didn't shed any light on it. Thanks in advance for any insight you can offer. Todd Barbera ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sjohnson at santarosa.edu Sat Oct 15 00:26:39 2005 From: sjohnson at santarosa.edu (Johnson Steven) Date: Fri, 14 Oct 2005 15:26:39 -0700 Subject: Groups in services... Message-ID: <5DC5085B189C5449A59F658D5F6A9EDF03970083@busxis1.ad.santarosa.edu> Great, that worked perfectly. Also just an FYI, the 2.x tarball of nagios, comes with the V1.0 online documentation =) Had to surf over to the nagios home page to read the 2.x docs. Thanks again! -Steve -----Original Message----- From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net]On Behalf Of John P. Rouillard Sent: Friday, October 14, 2005 3:18 PM To: nagios-users at lists.sourceforge.net Subject: Re: [Nagios-users] Groups in services... In message <5DC5085B189C5449A59F658D5F6A9EDF03970081 at busxis1.ad.santarosa.edu>, "Johnson Steven" writes: >I just installed Nagios 2.x (Whatever the current beta is), and I was >wondering in stead of adding each host to a service definition, can you >add a group name, so that everyone in that group will run that service check? Yes. >If it is possible, I can not see how to do this. It's in the manual just not where you expect it to be. >From the table of contexts/index for the nagios 2.0 doc look at Advanced topics/Time-saving tips for object definitions Synopsis: define service{ hostgroup_name HOSTGROUP1,HOSTGROUP2,...,HOSTGROUPN service_description SOMESERVICE other service directives ... } Definitely got to fix the documentation and add the hostgroup_name to the template explanations. >I would like it if all I had to do to add a new server to nagios was to add >its host definition, and then add that hostname to the proper groups, based >on what services need to be monitored. That's precisely what I am doing. Works well. Rather than changing the hostgroups, use the "hostgroups" keyword in the host definition. See the main template docs xodtemplate.html. Its buried under Index/Object configuration file options/How Is Object Data Defined?/ "Click here". Bookmark it. >But as it is, it seems I have to >add that new host name to each service on its host_name line. Nope. You can do it just like you want to. -- rouilj John Rouillard =========================================================================== My employers don't acknowledge my existence much less my opinions. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sudheer at tgs-solutions.com Sat Oct 15 00:35:39 2005 From: sudheer at tgs-solutions.com (Sudheer Muddappa) Date: Fri, 14 Oct 2005 18:35:39 -0400 Subject: Where to enable service check commands. In-Reply-To: <43502971.9070807@mica.net> References: <56EAA5BC64E6C34F8C9EE6725D4A2DFA01AA1AFF@tudor.com> <435008D8.1060907@op5.se> <43501FC1.9090008@tgs-solutions.com> <43502385.5000709@mica.net> <43502681.4090305@tgs-solutions.com> <43502971.9070807@mica.net> Message-ID: <435032BB.5000906@tgs-solutions.com> Thanks for helping Kevin. In Debian, If I have to add any new or say this check_icmp command then I need to enter it in "checkcommands.cfg " or have to run this command "update-nagios" ( which I did not find). Now its working fine. Thanks, Sudheer Muddappa Kevin Hanser wrote: > Did you include icmp.cfg in your nagios.cfg file? > > ie, do you have a line something like this in nagios.cfg: > > cfg_file=/icmp.cfg > > Also, does /usr/lib/nagios/plugins/check_icmp exist? And is > executable? (if you run it from a command prompt, what does it do?) > > If you can re-post your config files in their current state that may > help too (icmp.cfg and the config for the service that's trying to > call that command) > > k > > Sudheer Muddappa wrote: > >> did that. dose not seam to work :( >> >> same error >> >> Thanks, >> >> Sudheer Muddappa >> >> >> >> >> Kevin Hanser wrote: >> >>> >>> >>> I have created a icmp.cfg file like below. >>> >>>> >>>> # 'check_icmp' command definition >>>> define command{ >>>> command_name check-icmp >>>> command_line /usr/lib/nagios/plugins/check_icmp -H >>>> $HOSTADDRESS$ -w $ARG1$ -c $ARG2$ >>>> } >>>> >>>> When I run a flight check i get this error >>>> Service check command 'check_icmp' specified in service 'ICMP' for >>>> host 'navigant_prod' not defined anywhere! >>>> >>> In your command definition above, your "command_name" is >>> "check-icmp". The error message indicates that it's looking for >>> "check_icmp". If you notice, in the command definition you're using >>> a dash ( - ), and in the error message it's using an underscore ( _ ). >>> >>> Change the command_name to check_icmp and it may take care of it.. >>> >>> k >>> >>> >> > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, discussions, > and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. ::: Messages without supporting info will risk > being sent to /dev/null > > ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From RLAdams at AdamsInfoServ.Com Sat Oct 15 02:23:06 2005 From: RLAdams at AdamsInfoServ.Com (Russell Adams) Date: Fri, 14 Oct 2005 19:23:06 -0500 Subject: [SPAM] Re: Problem with Cingular TAP Gateway In-Reply-To: References: Message-ID: <20051015002306.GA505@keroberos> What happens if your Internet connection is down? ;] Russell On Fri, Oct 14, 2005 at 06:37:56PM -0500, Paul Harrison wrote: > We have sendmail setup on our nagios server to send out email to our cingular phones. > Is there any advantage to sending out SMS messages rather than email? > > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net on behalf of Russell Adams > Sent: Fri 10/14/2005 4:56 PM > To: nagios-users at lists.sourceforge.net > Cc: > Subject: [SPAM] Re: [Nagios-users] Problem with Cingular TAP Gateway > > > ------------------------------------------------------------------ Russell Adams RLAdams at AdamsInfoServ.com PGP Key ID: 0x1160DCB3 http://www.adamsinfoserv.com/ Fingerprint: 1723 D8CA 4280 1EC9 557F 66E8 1154 E018 1160 DCB3 -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 189 bytes Desc: not available URL: From lanning at lanning.cc Sat Oct 15 02:53:47 2005 From: lanning at lanning.cc (Robert Hajime Lanning) Date: Fri, 14 Oct 2005 17:53:47 -0700 (PDT) Subject: OT: Problem with Cingular TAP Gateway In-Reply-To: <20051015002306.GA505@keroberos> References: <20051015002306.GA505@keroberos> Message-ID: <59844.192.168.128.67.1129337627.squirrel@ssl.monsoonwind.com> This is the reason we have out of band paging (TAP to Skytel). I had talked to Cingular about the "Enterprise Paging" feature. This is a grandfathered AT&T feature that costs about $9.95/month. Gives you unlimited SMS from the "Enterprise Paging" sources to the phone that has the feature. SMS still costs for other sources and sending. http://www.enterprisepaging.com/ Cingular plans on migrating this service to the offical Cingular network and billing system, by the end of the calendar year. So, until then, the service is not available to "Offical" Cingular customers, only customers still on the old AT&T billing system. You could see about http://www.mblox.com/. They seemed to have bought the old MobileSys company. They provide the ability to send SMS to anywhere, internationaly. But, $$$. They primarily use Internet connections, but also provide TAP access for last resort, like Cingular. I suspect the Cingular feature will be unreliable until to full migration is complete. > What happens if your Internet connection is down? ;] > > Russell > > On Fri, Oct 14, 2005 at 06:37:56PM -0500, Paul Harrison wrote: >> We have sendmail setup on our nagios server to send out email to our >> cingular phones. >> Is there any advantage to sending out SMS messages rather than >> email? >> >> -----Original Message----- >> From: nagios-users-admin at lists.sourceforge.net on behalf of Russell >> Adams >> Sent: Fri 10/14/2005 4:56 PM >> To: nagios-users at lists.sourceforge.net >> Cc: >> Subject: [SPAM] Re: [Nagios-users] Problem with Cingular TAP >> Gateway >> >> >> > ------------------------------------------------------------------ > Russell Adams RLAdams at AdamsInfoServ.com > > PGP Key ID: 0x1160DCB3 http://www.adamsinfoserv.com/ > > Fingerprint: 1723 D8CA 4280 1EC9 557F 66E8 1154 E018 1160 DCB3 > -- And, did Guloka think the Ulus were too ugly to save? -Centauri ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Sat Oct 15 09:41:17 2005 From: ae at op5.se (Andreas Ericsson) Date: Sat, 15 Oct 2005 09:41:17 +0200 Subject: check_nrpe issue In-Reply-To: <002d01c5d10e$802fc9d0$620a010a@GRAPEAPE> References: <002d01c5d10e$802fc9d0$620a010a@GRAPEAPE> Message-ID: <4350B29D.80401@op5.se> Todd Barbera wrote: > Hi, > > I had tried posting about a problem using NRPE 2.0 and check_log2 yesterday > but unfortunately no one responded. I ended up trying the plug-in > check_log.pl, but I'm still getting the same error, which is: > > CHECK_NRPE: Error receiving data from daemon. > > Other plug-ins are working fine, such as check_oracle. I'm running Nagios > 2.0b3 on a Solaris 9 sparc machine. The client is the same. Does anyone have > any ideas why the check_log2 and check_log.pl plug-ins do not seem to work > with NRPE? BTW, check_log.pl is MUCH faster than check_log2. What exactly > does this error message imply? That you need to do what says in the readme and look in the logs on the remote site. In 75 cases out of 100, the nrpe host isn't configured to allow the client to connect to it. In about 20% of the cases one end supports (and tries to use) SSL while the other doesn't. In the remaining 5, there is a protocol mismatch (check_nrpe 2.0 being used to poll nrpe 1.8). > A search of the mailing lists and FAQ didn't > shed any light on it. Thanks in advance for any insight you can offer. > I shall have to remedy that. Thanks for pointing it out. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jpm at retail-sc.com Sat Oct 15 11:53:57 2005 From: jpm at retail-sc.com (Jan-Piet Mens) Date: Sat, 15 Oct 2005 11:53:57 +0200 Subject: Reducing Load on a Distributed Nagios Installation Message-ID: <20051015095357.GA31218@m1.intdus.retail-sc.com> We've experienced quite a bit of load on a distributed Nagios installation with several thousand passive service checks which are supplied to a central Nagios server via NSCA. Our central Nagios 1.2 server started swapping and subsequently thrashed itself to death. After a bit of debugging, we've come up with a solution which may be interesting to those in a similar position. We've documented the proceedings as well as the solution we implemented at http://wiki.fupps.com/nagios/icmp Regards, -JP ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From BStover at NorcalMutual.com Sat Oct 15 22:33:21 2005 From: BStover at NorcalMutual.com (Stover, Beth) Date: Sat, 15 Oct 2005 13:33:21 -0700 Subject: OT: Problem with Cingular TAP Gateway Message-ID: I'm being billed through Cingular and I'm a Cingular user. Is anyone out there dialing the Cingular TAP gateway? -----Original Message----- From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Robert Hajime Lanning Sent: Friday, October 14, 2005 5:54 PM To: nagios-users at lists.sourceforge.net Subject: Re: [Nagios-users] OT: Problem with Cingular TAP Gateway This is the reason we have out of band paging (TAP to Skytel). I had talked to Cingular about the "Enterprise Paging" feature. This is a grandfathered AT&T feature that costs about $9.95/month. Gives you unlimited SMS from the "Enterprise Paging" sources to the phone that has the feature. SMS still costs for other sources and sending. http://www.enterprisepaging.com/ Cingular plans on migrating this service to the offical Cingular network and billing system, by the end of the calendar year. So, until then, the service is not available to "Offical" Cingular customers, only customers still on the old AT&T billing system. You could see about http://www.mblox.com/. They seemed to have bought the old MobileSys company. They provide the ability to send SMS to anywhere, internationaly. But, $$$. They primarily use Internet connections, but also provide TAP access for last resort, like Cingular. I suspect the Cingular feature will be unreliable until to full migration is complete. > What happens if your Internet connection is down? ;] > > Russell > > On Fri, Oct 14, 2005 at 06:37:56PM -0500, Paul Harrison wrote: >> We have sendmail setup on our nagios server to send out email to our >> cingular phones. >> Is there any advantage to sending out SMS messages rather than email? >> >> -----Original Message----- >> From: nagios-users-admin at lists.sourceforge.net on behalf of Russell >> Adams >> Sent: Fri 10/14/2005 4:56 PM >> To: nagios-users at lists.sourceforge.net >> Cc: >> Subject: [SPAM] Re: [Nagios-users] Problem with Cingular TAP Gateway >> >> >> > ------------------------------------------------------------------ > Russell Adams RLAdams at AdamsInfoServ.com > > PGP Key ID: 0x1160DCB3 http://www.adamsinfoserv.com/ > > Fingerprint: 1723 D8CA 4280 1EC9 557F 66E8 1154 E018 1160 DCB3 > -- And, did Guloka think the Ulus were too ugly to save? -Centauri ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jstefani at yorku.ca Sun Oct 16 17:32:36 2005 From: jstefani at yorku.ca (John Stefani) Date: Sun, 16 Oct 2005 11:32:36 -0400 (EDT) Subject: Possible bug - host notifications Message-ID: Hi, I am monitoring switches and routers, and since there are no services running on these devices I had to enable the check_interval directive in the host configuration. Although I set max_check_attempts to be a value greater than one, I get notifications after only one of the checks fails. I use the check_fping plugin with the -n option set to 4. I have nagios set up in a distributed environment, so the check results are being sent with nsca to a central nagios server that accepts passive hosts and service checks and does not do any active checks. If anyone experienced my problem of being notified after only one check fails for a host and perhaps managed to fix it or has some suggestions I would greatly appreciate some help. Many thanks - John ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From smadim2 at grads.ece.mcmaster.ca Sun Oct 16 20:42:10 2005 From: smadim2 at grads.ece.mcmaster.ca (Mohammed Smadi) Date: Sun, 16 Oct 2005 14:42:10 -0400 (EDT) Subject: email notifications not sent Message-ID: Hi; I check my var/nagios.log file and notification commands are executed just fine in case of errors, but am never recieveing the emails. I am sure that i defined the contacts right. any clue why even though the mail commmands are being executed am never recieving one in my inbox? thanks m smadi ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From b00mer at gmx.net Sun Oct 16 20:43:40 2005 From: b00mer at gmx.net (Hendrik Baecker) Date: Sun, 16 Oct 2005 20:43:40 +0200 Subject: Possible bug - host notifications In-Reply-To: References: Message-ID: <43529F5C.7010200@gmx.net> Hi John, it might be incorrect, that you're getting notifications after only one host_check. I hope, that your central Server has the max_attempt for your hosts even above "1", if your central Server is your notification sending one... But let me tell you, that it is a realy bad idea to just do host checking on your systems. Every host check will be a high priority check in Nagios working queue, every other check will be hold on until the host check is finished. Read here: http://nagios.sourceforge.net/docs/2_0/checkscheduling.html#host_checks Why don't you just create a service like "ICMP" for each of your hosts with check_command = check_icmp? Yes, I know, it sounds like double checking with these pings, but it's the best for your performance. It should be performance rule #1: Do everything to avoid host checks ;) Greets Hendrik John Stefani schrieb: >Hi, > >I am monitoring switches and routers, and since there are no services >running on these devices I had to enable the check_interval directive in >the host configuration. >Although I set max_check_attempts to be a value greater than one, I get >notifications after only one of the checks fails. >I use the check_fping plugin with the -n option set to 4. > >I have nagios set up in a distributed environment, so the check results >are being sent with nsca to a central nagios server that accepts passive >hosts and service checks and does not do any active checks. > >If anyone experienced my problem of being notified after only one check >fails for a host and perhaps managed to fix it or has some suggestions I >would greatly appreciate some help. > >Many thanks > >- John > > > >------------------------------------------------------- >This SF.Net email is sponsored by: >Power Architecture Resource Center: Free content, downloads, discussions, >and more. http://solutions.newsforge.com/ibmarch.tmpl >_______________________________________________ >Nagios-users mailing list >Nagios-users at lists.sourceforge.net >https://lists.sourceforge.net/lists/listinfo/nagios-users >::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. >::: Messages without supporting info will risk being sent to /dev/null > > > > ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From dgullett at symmetrixtech.com Mon Oct 17 01:02:15 2005 From: dgullett at symmetrixtech.com (David Gullett) Date: Sun, 16 Oct 2005 18:02:15 -0500 Subject: nsca / xinetd "Failed to contact identity server" In-Reply-To: <1116972770.4293a6e207b74@webmail.m5computersecurity.com> References: <1116972770.4293a6e207b74@webmail.m5computersecurity.com> Message-ID: <20051016230216.09DC5538E5@hoggle.dreamhost.com> Has anyone else experienced this problem? I am getting "Failed to contact identity server" every few days and have to restart xinetd. Thanks, David Gullett Symmetrix Technologies -----Original Message----- From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Michael J McCafferty Sent: Tuesday, May 24, 2005 5:13 PM To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] nsca / xinetd "Failed to contact identity server" I built a distributed Nagios setup for a customer. It monitors about 200 hosts. The central Nagios server monitors several dozen hosts actively, and the distributed Nagios servers monitor the rest. The distributed Nagios servers are due to network architecture, not for capacity. Almost all checks are done every minute. The distributed Nagios servers send their check results back to the central Nagios host via NSCA, using xinetd to listen for inbound connections from the remote Nagios hosts. With the exception of having to increase the connections per second and the number of instances for xinetd some time ago (upped it to 60 connections), I have had no problems with the setup... until today. Today I updated the kernel and rebooted. When the system came back up the central Nagios server said there where hosts down on one of the remote networks which are checked by the remote Nagios servers. Upon further investigation I realized that they had been "down" (actually it was new FW rule that made them appear to be down cuz they can't be pinged anymore) for some time (6 days), but were just now being reported down by Nagios. The exact moment of the last successful receipt of data from the remote Nagios servers was the exact moment the following message began appearing in /var/log/messages: May 16 10:48:46 nagioshost xinetd[21405]: Failed to contact identity server a t 172.16.0.1: timeout May 16 10:48:48 nagioshost xinetd[21406]: Failed to contact identity server at 192.168.1.2: timeout May 16 10:48:49 nagioshost xinetd[21407]: Failed to contact identity server at 10.0.0.1: timeout These messages appear every few seconds from the time of the last successful receipt of data from the distributed Nagios servers (6days ago) until the reboot today. The IP addresses in the messages are the IP addresses of the remote Nagios servers. No FW rules have changed to cause this. I see no outbound port 113 (identd) traffic at the main Nagios server. I am sure this is a xinetd issue... My questions are: 1) What does this error message mean. 2) What broke ? 3) How do I keep it from breaking again ? In the meantime I set up a check_log check to see if the message appears in the messages file again. Thanks, Mike -- ************************************************************ Michael J. McCafferty Principal, Security Engineer M5 Hosting 858-576-7325 Voice http://www.m5hosting.com ************************************************************ ---------------------------------------------------------------- This message was sent using IMP, the Internet Messaging Program. ------------------------------------------------------- This SF.Net email is sponsored by Yahoo. Introducing Yahoo! Search Developer Network - Create apps using Yahoo! Search APIs Find out how you can build Yahoo! directly into your own Applications - visit http://developer.yahoo.net/?fr=offad-ysdn-ostg-q22005 _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From g.vickers at qut.edu.au Mon Oct 17 01:58:53 2005 From: g.vickers at qut.edu.au (Greg Vickers) Date: Mon, 17 Oct 2005 09:58:53 +1000 Subject: Parent/child behaviour, WAS:Re: check_ping vs. check_icmp? In-Reply-To: <435008D8.1060907@op5.se> References: <56EAA5BC64E6C34F8C9EE6725D4A2DFA01AA1AFF@tudor.com> <435008D8.1060907@op5.se> Message-ID: <4352E93D.5060109@qut.edu.au> Andreas, Andreas Ericsson wrote: > Andrew Laden wrote: > >> How does using check_icmp compare to using check_fping? >> >> It seems that check_fping will return a down answer much faster. Since >> host checks are most often run when the host is down, that seems to be the >> performance that we are concerned with. > > This might seem to be the case, but it actually isn't. A hostcheck is > run each time a service changes from whatever to any non-OK state. In a > (somewhat) healthy network hostchecks are being run when the host is up > more often than when they're down. The opposite is of course true if > there are hosts being down for a long time or if a whole segment of the > network goes to lunch, I thought that if parents were set up correctly that Nagios would not run any service or host checks on hosts that are children of the blocking outage? So there would be a delay while Nagios figures out which is the parent host that is down (i.e. the service checks failing 'up' the parent dependencies and the subsequent delays on the host checks until the 'top' parent host is checked) but once the top-most parent is host checked, no host or service checks will be run on the children until that parent becomes good. Subsequently you would only see a delay in check scheduling/processing when the host check is run on that 'top' parent host. Is this the expected and correct behavior or is it too early on Monday morning for me? Ah-ha - RTFM prior to inserting foot in mouth. The networkoutages.html states: "If all of the immediate child hosts of one of these flagged hosts is DOWN or UNREACHABLE and has no immediate parent host that is up, the flagged host is the cause of a network outage. If even one of the immediate children of a flagged host does not pass this test, then the flagged host is not the cause of a network outage." So from this statement, I understand that all children will be host checked to determine fully which host is the cause of a network outage, and that could cause a large delay if there are a lot of hosts to check. However I don't understand the statement "... has no immediate parent host that is up..." Shouldn't that read "... has a parent host up..." otherwise how would Nagios reach that blocking host to test it??? It really could be too early... Thanks, -- Greg Vickers Project Manager, IT Security Information Technology Services Queensland University of Technology L12, 126 Margaret St, Brisbane Phone: (07) 3864 9536 Email: g.vickers at qut.edu.au IT Security web site: http://www.its.qut.edu.au/itsecurity/ CRICOS No. 00213J ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From dleangen at canada.com Mon Oct 17 06:17:16 2005 From: dleangen at canada.com (David Leangen) Date: Mon, 17 Oct 2005 13:17:16 +0900 Subject: Configuration error (NULL value) Message-ID: <435325CC.3030108@canada.com> Hello! I am trying to configure my first Nagios installation. I based my config on the minimal.cfg-sample file. Unless I have a completely empty file, I keep getting the following error: Error in configuration file '/usr/local/nagios/etc/nagios.cfg' - Line 1 (NULL value) I think it is a bit odd that I get this no matter what I have in the file. Any ideas why this would happen? Thank you! ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From axanet at ms32.hinet.net Mon Oct 17 07:58:21 2005 From: axanet at ms32.hinet.net (axanet at ms32.hinet.net) Date: Mon, 17 Oct 2005 13:58:21 +0800 Subject: nagios notification by skype ??? Message-ID: <20051017135819.89F7.AXANET@ms32.hinet.net> Hello list: Does anyone use skype IM for nagios notification ??? I know a api named JSkype could use for nagios notification api,but i dont use it before. -- axa at HiNet ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Evert.Goor at benp.nl Mon Oct 17 09:32:20 2005 From: Evert.Goor at benp.nl (Evert Goor) Date: Mon, 17 Oct 2005 09:32:20 +0200 Subject: Checking hosts with nsca Message-ID: <19CCC8E70E4FA941848269F0C5D832CC58A4C4@post.systems.nl> Hi All I have passive checking running with nsca. The services are checked. Only the UP status of a host is not showed it stays on pending. How do I resolve this. The main service is only doing passive checks. Did I mis something in the manual ? Some advise would be welcome. Evert -------------- next part -------------- An HTML attachment was scrubbed... URL: From ae at op5.se Mon Oct 17 09:47:15 2005 From: ae at op5.se (Andreas Ericsson) Date: Mon, 17 Oct 2005 09:47:15 +0200 Subject: Parent/child behaviour, WAS:Re: check_ping vs. check_icmp? In-Reply-To: <4352E93D.5060109@qut.edu.au> References: <56EAA5BC64E6C34F8C9EE6725D4A2DFA01AA1AFF@tudor.com> <435008D8.1060907@op5.se> <4352E93D.5060109@qut.edu.au> Message-ID: <43535703.1090600@op5.se> Greg Vickers wrote: > Andreas, > > Andreas Ericsson wrote: > >> Andrew Laden wrote: >> >>> How does using check_icmp compare to using check_fping? >>> >>> It seems that check_fping will return a down answer much faster. >>> Since host checks are most often run when the host is down, that >>> seems to be the >>> performance that we are concerned with. >> >> >> This might seem to be the case, but it actually isn't. A hostcheck is >> run each time a service changes from whatever to any non-OK state. In >> a (somewhat) healthy network hostchecks are being run when the host is >> up more often than when they're down. The opposite is of course true >> if there are hosts being down for a long time or if a whole segment of >> the network goes to lunch, > > > I thought that if parents were set up correctly that Nagios would not > run any service or host checks on hosts that are children of the > blocking outage? So there would be a delay while Nagios figures out > which is the parent host that is down (i.e. the service checks failing > 'up' the parent dependencies and the subsequent delays on the host > checks until the 'top' parent host is checked) but once the top-most > parent is host checked, no host or service checks will be run on the > children until that parent becomes good. Subsequently you would only see > a delay in check scheduling/processing when the host check is run on > that 'top' parent host. > > Is this the expected and correct behavior or is it too early on Monday > morning for me? > > > > Ah-ha - RTFM prior to inserting foot in mouth. The networkoutages.html > states: > > "If all of the immediate child hosts of one of these flagged hosts is > DOWN or UNREACHABLE and has no immediate parent host that is up, the > flagged host is the cause of a network outage. If even one of the > immediate children of a flagged host does not pass this test, then the > flagged host is not the cause of a network outage." > > So from this statement, I understand that all children will be host > checked to determine fully which host is the cause of a network outage, > and that could cause a large delay if there are a lot of hosts to check. > However I don't understand the statement "... has no immediate parent > host that is up..." Shouldn't that read "... has a parent host up..." > otherwise how would Nagios reach that blocking host to test it??? > It probably should read "has a parent host up". Whichever way you look at it, lots of hostchecks are going to be run when a large number of hosts are anything else than OK, but most of the time hostchecks are run against hosts that are up. > It really could be too early... > It always is. :) -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From stein at proact.no Mon Oct 17 13:35:58 2005 From: stein at proact.no (=?iso-8859-1?Q?Stein_Bj=F8rndal?=) Date: Mon, 17 Oct 2005 13:35:58 +0200 Subject: check_hpjd gripes. Message-ID: <93E3002CE426FB4A94905C4EE8DA7A2004205E93@nopro01.internal.proact.no> Hi all, We've been using Nagios for monitoring our network for quite some time and also some time back started using check_hpjd for monitoring printers. That works just fine, but as it stands now the printer may well be out of service without an alarm being raised. That is mostly due to exhausted supplies (toner or fuser kit). The status string reports the condition just fine, it's just that status is still OK so nobody is made aware of the situation. As an added complication, the strings reported here are not in English (national language set on printer). So ideally I'd like a set of strings that will set the status of the check to WARNING (aka "TONER LOW") and another set of strings that will raise an alarm (aka "REPLACE FUSER KIT"). Raising an alarm when supplies is running low would be useful to avoid service interuption while not having to stock supplies since we could order new when a warning is raised. It would be interesting to hear how others are handling this, I'm hazarding a guess that this must have been discussed before, but were unable to find anything relevant in the list archives. Regards, Stein ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rejaine at bhz.jamef.com.br Mon Oct 17 13:48:48 2005 From: rejaine at bhz.jamef.com.br (Rejaine Monteiro) Date: Mon, 17 Oct 2005 09:48:48 -0200 Subject: Relation between freshness_threshold and timeperiods Message-ID: <1129549728.30760.19.camel@acesup2.bhz.jamef> Hi , First all, sorry by english.. So, Is there any relation between freshness_threshold and timeperiods? For example: define timeperiod{ timeperiod_name backup_hours alias Backup hours monday 05:00-24:00 tuesday 00:00-24:00 wednesday 00:00-24:00 thursday 00:00-24:00 friday 00:00-24:00 saturday 00:00-05:00 } define service{ service_description BACKUP1 host_name host1 active_checks_enabled 0 passive_checks_enabled 1 parallelize_check 1 obsess_over_service 1 check_freshness 1 freshness_threshold 93600 notifications_enabled 1 event_handler_enabled 1 flap_detection_enabled 1 is_volatile 0 max_check_attempts 1 check_period backup_hours normal_check_interval 1 retry_check_interval 1 notification_interval 120 notification_period 24x7 notification_options w,c,r contact_groups backup_operators process_perf_data 1 retain_status_information 1 retain_nonstatus_information 1 check_command no_backup_report!26!hours } All is work fine.. But... Let me try to explain my problem: - Last OK Backup status: Saturday 03:00pm - The backup services aren't executed on Sundays (timeperiod=backup_hours) - But in monday, at the morning, I recived alerts from Nagios, because the freshness_threshold limit (26 hours) - But sundays hours don't should be count, right? So, what can I do for Nagios respect de time_period on passive checks?? Rejaine ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From eivind at aminor.no Mon Oct 17 14:14:25 2005 From: eivind at aminor.no (Eivind Olsen) Date: Mon, 17 Oct 2005 14:14:25 +0200 Subject: Problems with check_nrpe+SSL, and I have read the FAQ Message-ID: <435395A1.3020609@aminor.no> Hello. I'm trying to set up check_nrpe 2.0 to use SSL, but I can't get it to work. The nagios-server (192.168.1.4) is running Solaris 10, the other machine 192.168.1.2 is running Solaris 8. The nrpe-daemon is run from the command line in daemon-mode (-d option), not from inetd. I'm currently doing all testing from the command line. Here's what I do on the nagios-server: First, I test with normal check_nrpe and then check_nrpe_ssl (configured with --enable-ssl) against the SSL-enabled nrpe on 192.168.1.2, and finally I do a telnet on the port just to see that I have network connectivity and can get through: bash-3.00# ./check_nrpe -H 192.168.1.2 -p 5666 -c check_load CHECK_NRPE: Received 0 bytes from daemon. Check the remote server logs for error messages. bash-3.00# ./check_nrpe_ssl -H 192.168.1.2 -p 5666 -c check_load CHECK_NRPE: Error - Could not complete SSL handshake. bash-3.00# telnet 192.168.1.2 5666 Trying 192.168.1.2... Connected to 192.168.1.2. Escape character is '^]'. ^] telnet> q Connection to 192.168.1.2 closed. When I look in the syslog on 192.168.1.2 I see the famous "Error: Could not complete SSL handshake."-message. Then, I kill the SSL-enabled nrpe-daemon on 192.168.1.2 and start the SSL-disabled nrpe-daemon and do the same tests, we can now see that check_nrpe works as expected: bash-3.00# ./check_nrpe -H 192.168.1.2 -p 5666 -c check_load WARNING - load average: 5.35, 5.58, 6.14|load1=5.348;15.000;30.000;0; load5=5.582;10.000;25.000;0; load15=6.141;5.000;20.000;0; bash-3.00# ./check_nrpe_ssl -H 192.168.1.2 -p 5666 -c check_load CHECK_NRPE: Socket timeout after 10 seconds. bash-3.00# telnet 192.168.1.2 5666 Trying 192.168.1.2... Connected to 192.168.1.2. Escape character is '^]'. ^] telnet> q Connection to 192.168.1.2 closed. I've looked at the Nagios FAQ and can't find anything wrong there. FAQ-entry 191 mentions: * Different versions: both the SSL and non-SSL versions of both check_nrpe/check_nrpe_ssl and nrpe/nrpe_ssl show: Version: 2.0 Last Modified: 09-08-2003 * SSL is disabled: Both the check_nrpe_ssl and nrpe_ssl commands have the following in their output when I run them: "SSL/TLS Available: Anonymous DH Mode, OpenSSL 0.9.6 or higher required" I don't think I've managed to disable SSL-support on the command lines on any of these? The options for check_nrpe_ssl are given above, nrpe_ssl is started as "./nrpe_ssl -c nrpe.cfg -d" * Incorrect file permissions: The nrpe.cfg is readable, it's the exact same file I'm using both when running in non-SSL mode and in SSL-enabled mode. * Pseudo-random device files are not readable: yes, they're world readable. Here's how they are on the nagios-server: crw-r--r-- 1 root sys 190, 0 Aug 18 07:12 /devices/pseudo/random at 0:random crw-r--r-- 1 root sys 190, 1 Oct 17 09:07 /devices/pseudo/random at 0:urandom And here's how they are on 192.168.1.2: crw-r--r-- 1 root sys 259, 0 May 30 2003 /devices/pseudo/random at 0:random crw-r--r-- 1 root sys 259, 1 May 30 2003 /devices/pseudo/random at 0:urandom * Unallowed address: I'm not running under xinetd. The nrpe.cfg file on 192.168.1.2 lists the server as allowed_hosts: allowed_hosts=192.168.1.4 (192.168.1.4 is the IP-address of the nagios server) I have also seen FAQ entry 261, it mentions also tcp-wrappers, but I think this can be ruled out since: - I'm not aware of anything that should cause me to use these - I can make a telnet connection to the port 5666, both with non-SSL and SSL-enabled nrpe running. Does anyone have any idea what I might be doing wrong? And suggestions on things I might try? -- Regards Eivind Olsen ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From todd_barbera at wgbh.org Mon Oct 17 15:33:53 2005 From: todd_barbera at wgbh.org (Todd Barbera) Date: Mon, 17 Oct 2005 09:33:53 -0400 Subject: check_nrpe issue In-Reply-To: <4350B29D.80401@op5.se> References: <4350B29D.80401@op5.se> Message-ID: <007c01c5d31f$6b985740$620a010a@GRAPEAPE> " That you need to do what says in the readme and look in the logs on the remote site. In 75 cases out of 100, the nrpe host isn't configured to allow the client to connect to it. In about 20% of the cases one end supports (and tries to use) SSL while the other doesn't. In the remaining 5, there is a protocol mismatch (check_nrpe 2.0 being used to poll nrpe 1.8)." I think perhaps I wasn't clear on what this issue was given your response. check_nrpe does work against these two client hosts. I am able to run check_oracle, check_load, and check_disk via check_nrpe all without incident. When I downloaded check_log2 and check_log.pl and tried to call them via check_nrpe, that is when I received the error. Both check_log2 and check_log.pl work fine if I run them locally on the client so I'm not sure why it isn't successfully passing the results using nrpe. If I turn on debugging, all I see in my syslog are successful connection messages. I have read the readme, FAQ's, and mailing lists prior to posting to see if I could find an answer, but I was unable to find any information as to why this may be the case. Both check_nrpe and nrpe are version 2.0. Both are configured for SSL. Here's the relevant section of nrpe.cfg: command[check_users]=/usr/local/nagios/libexec/check_users -w 5 -c 10 command[check_load]=/usr/local/nagios/libexec/check_load -w 15,10,5 -c 30,25,20 command[check_disk]=/usr/local/nagios/libexec/check_disk -w 10% -c 5% -l -e command[check_zombie_procs]=/usr/local/nagios/libexec/check_procs -w 5 -c 10 -s Z command[check_total_procs]=/usr/local/nagios/libexec/check_procs -w 150 -c 200 command[check_oracle]=/usr/local/nagios/libexec/check_oracle --login insight command[check_log]=/usr/local/nagios/libexec/check_logs.pl -c /usr/local/nagios/etc/log.conf command[check_log2]=/usr/local/nagios/libexec/check_log2 -F /oracle/app/oracle/admin/insight/bdump/alert_insight.log - O /usr/local/nagios/tmp/alert_insight.log -q ORA- command[check_disksuite]=/usr/local/nagios/libexec/check_disksuite All of these can be called successfully using check_nrpe except for check_log and check_log2. Any ideas as to why? Thanks in advance. Todd -----Original Message----- From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Andreas Ericsson Sent: Saturday, October 15, 2005 3:41 AM To: nagios-users at lists.sourceforge.net Subject: Re: [Nagios-users] check_nrpe issue Todd Barbera wrote: > Hi, > > I had tried posting about a problem using NRPE 2.0 and check_log2 yesterday > but unfortunately no one responded. I ended up trying the plug-in > check_log.pl, but I'm still getting the same error, which is: > > CHECK_NRPE: Error receiving data from daemon. > > Other plug-ins are working fine, such as check_oracle. I'm running Nagios > 2.0b3 on a Solaris 9 sparc machine. The client is the same. Does anyone have > any ideas why the check_log2 and check_log.pl plug-ins do not seem to work > with NRPE? BTW, check_log.pl is MUCH faster than check_log2. What exactly > does this error message imply? That you need to do what says in the readme and look in the logs on the remote site. In 75 cases out of 100, the nrpe host isn't configured to allow the client to connect to it. In about 20% of the cases one end supports (and tries to use) SSL while the other doesn't. In the remaining 5, there is a protocol mismatch (check_nrpe 2.0 being used to poll nrpe 1.8). > A search of the mailing lists and FAQ didn't > shed any light on it. Thanks in advance for any insight you can offer. > I shall have to remedy that. Thanks for pointing it out. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From p.miquet at hafiba.fr Mon Oct 17 15:48:48 2005 From: p.miquet at hafiba.fr (Pascal MIQUET) Date: Mon, 17 Oct 2005 15:48:48 +0200 Subject: Check_http Message-ID: <2DB6674E64C314478B384C12B8990F501B7ED7@srv2k3sbs.hafiba.local> Using NAGIOS 2 Nagios Plugins 1.4.2 If I issue the command ./check_http -H 192.168.1.4 I get the warning HTTP/1.1 403 Fobidden with the -v option I get the content of the default web server, which is a Centos 3.5 On the web server side, I got a Directory index forbidden by rule : /var/www/html/ So all seems that the server side needs to be tuned to get the right response. Note that if I issue the command check_http -H 192.168.1.4 -u /nagios the response is OK Thanks for your help Regards Pascal -------------- next part -------------- An HTML attachment was scrubbed... URL: From stein at proact.no Mon Oct 17 15:49:04 2005 From: stein at proact.no (=?iso-8859-1?Q?Stein_Bj=F8rndal?=) Date: Mon, 17 Oct 2005 15:49:04 +0200 Subject: Problems with check_nrpe+SSL, and I have read the FAQ Message-ID: <93E3002CE426FB4A94905C4EE8DA7A2004205EAF@nopro01.internal.proact.no> > The nagios-server (192.168.1.4) is running Solaris 10, the other machine > 192.168.1.2 is running Solaris 8. The nrpe-daemon is run from the command > line in daemon-mode (-d option), not from inetd. I also had an issue with that on Solaris 8 clients. Our nagios server is running on Linux, but we have a fair amount of Solaris 8 machines. To use SSL we had to apply this Solaris patch: 112438-02. The minor number seems to have changed to 03 since we did that: http://sunsolve.sun.com/search/document.do?assetkey=1-21-112438-03-1 Patch is for the random number generator function. Hope this helps. Regards, Stein ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From fmagoufis at furniture.com Mon Oct 17 15:56:53 2005 From: fmagoufis at furniture.com (Fotios Magoufis) Date: Mon, 17 Oct 2005 09:56:53 -0400 Subject: Problem with Check_DHCP Message-ID: I've been having such a hard time getting the check_dhcp to work, but I still can't figure it out. When the service runs as the nagios user, I get the following error: Error: Could not bind socket to interface eth0. Check your privileges. I tried setting chmod +s check_dhcp, but it did not have any positive effect. Any ideas on how to fix? Nagios version 2.0b4 Check_DHCP version 1.7 OS is Fedora 4 Fotios Magoufis | Operations Manager Furniture.com | 85 River Street Waltham, MA 02453 | 781-693-2131 -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: smime.p7s Type: application/x-pkcs7-signature Size: 3036 bytes Desc: not available URL: From robmossrm at aol.com Mon Oct 17 16:01:49 2005 From: robmossrm at aol.com (Rob Moss) Date: Mon, 17 Oct 2005 15:01:49 +0100 Subject: Check_http In-Reply-To: <2DB6674E64C314478B384C12B8990F501B7ED7@srv2k3sbs.hafiba.local> References: <2DB6674E64C314478B384C12B8990F501B7ED7@srv2k3sbs.hafiba.local> Message-ID: <4353AECD.7000608@aol.com> Pascal MIQUET wrote: > Using NAGIOS 2 Nagios Plugins 1.4.2 > If I issue the command ./check_http -H 192.168.1.4 I get the warning > HTTP/1.1 403 Fobidden > with the -v option I get the content of the default web server, which > is a Centos 3.5 > > On the web server side, I got a Directory index forbidden by rule : > /var/www/html/ > > So all seems that the server side needs to be tuned to get the right > response. > Note that if I issue the command check_http -H 192.168.1.4 -u /nagios > the response is OK This is the webserver configuration causing the problem. To work around this, drop an index file (usually named 'index.html') into the /var/www/html directory.. If that doesn't work, you need to check your configuration and see why it's giving a 403 forbidden error, usually to do with directory permissions, or missing index file.. rob. -------------- next part -------------- An HTML attachment was scrubbed... URL: From ae at op5.se Mon Oct 17 16:03:53 2005 From: ae at op5.se (Andreas Ericsson) Date: Mon, 17 Oct 2005 16:03:53 +0200 Subject: Problem with Check_DHCP In-Reply-To: References: Message-ID: <4353AF49.9020109@op5.se> Fotios Magoufis wrote: > I've been having such a hard time getting the check_dhcp to work, but I > still can't figure it out. When the service runs as the nagios user, I get > the following error: > > Error: Could not bind socket to interface eth0. Check your privileges. > > I tried setting chmod +s check_dhcp, but it did not have any positive > effect. > > Any ideas on how to fix? > I suppose you did chmod +s check_dhcp as root, right? Does it work when you run it as root from the console? -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Mon Oct 17 16:02:42 2005 From: ae at op5.se (Andreas Ericsson) Date: Mon, 17 Oct 2005 16:02:42 +0200 Subject: Check_http In-Reply-To: <2DB6674E64C314478B384C12B8990F501B7ED7@srv2k3sbs.hafiba.local> References: <2DB6674E64C314478B384C12B8990F501B7ED7@srv2k3sbs.hafiba.local> Message-ID: <4353AF02.9060703@op5.se> Pascal MIQUET wrote: > Using NAGIOS 2 Nagios Plugins 1.4.2 > If I issue the command ./check_http -H 192.168.1.4 I get the warning > HTTP/1.1 403 Fobidden > with the -v option I get the content of the default web server, which is > a Centos 3.5 > > On the web server side, I got a Directory index forbidden by rule : > /var/www/html/ > > So all seems that the server side needs to be tuned to get the right > response. > Note that if I issue the command check_http -H 192.168.1.4 -u /nagios > the response is OK > > Thanks for your help I didn't find a question in there, so answering it will be kind of hard. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From fmagoufis at furniture.com Mon Oct 17 16:06:34 2005 From: fmagoufis at furniture.com (Fotios Magoufis) Date: Mon, 17 Oct 2005 10:06:34 -0400 Subject: Problem with Check_DHCP Message-ID: Yes, I ran it as root and still no success. If I execute the command as root, it works fine. Fotios Magoufis | Operations Manager Furniture.com | 85 River Street Waltham, MA 02453 | 781-693-2131 -----Original Message----- From: Andreas Ericsson [mailto:ae at op5.se] Sent: Monday, October 17, 2005 10:04 AM To: Fotios Magoufis Cc: Nagios-users at lists.sourceforge.net Subject: Re: [Nagios-users] Problem with Check_DHCP Fotios Magoufis wrote: > I've been having such a hard time getting the check_dhcp to work, but > I still can't figure it out. When the service runs as the nagios > user, I get the following error: > > Error: Could not bind socket to interface eth0. Check your privileges. > > I tried setting chmod +s check_dhcp, but it did not have any positive > effect. > > Any ideas on how to fix? > I suppose you did chmod +s check_dhcp as root, right? Does it work when you run it as root from the console? -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 -------------- next part -------------- A non-text attachment was scrubbed... Name: smime.p7s Type: application/x-pkcs7-signature Size: 3036 bytes Desc: not available URL: From p.miquet at hafiba.fr Mon Oct 17 16:11:27 2005 From: p.miquet at hafiba.fr (Pascal MIQUET) Date: Mon, 17 Oct 2005 16:11:27 +0200 Subject: Check_http[Scanned] Message-ID: <2DB6674E64C314478B384C12B8990F501B7EDC@srv2k3sbs.hafiba.local> That's it. This does the job. But, let me ask you this. If I want to check a specific port, do I have to edit the checkcommand.cfg to add a custom http ? Same for accessing a specific path to the web server ? Thanks for your help Regards Pascal Miquet ________________________________ De : Rob Moss [mailto:robmossrm at aol.com] Envoy? : lundi 17 octobre 2005 16:02 ? : Pascal MIQUET Cc : nagios-users at lists.sourceforge.net Objet : Re: [Nagios-users] Check_http[Scanned] Pascal MIQUET wrote: Using NAGIOS 2 Nagios Plugins 1.4.2 If I issue the command ./check_http -H 192.168.1.4 I get the warning HTTP/1.1 403 Fobidden with the -v option I get the content of the default web server, which is a Centos 3.5 On the web server side, I got a Directory index forbidden by rule : /var/www/html/ So all seems that the server side needs to be tuned to get the right response. Note that if I issue the command check_http -H 192.168.1.4 -u /nagios the response is OK This is the webserver configuration causing the problem. To work around this, drop an index file (usually named 'index.html') into the /var/www/html directory.. If that doesn't work, you need to check your configuration and see why it's giving a 403 forbidden error, usually to do with directory permissions, or missing index file.. rob. -------------- next part -------------- An HTML attachment was scrubbed... URL: From ae at op5.se Mon Oct 17 16:11:21 2005 From: ae at op5.se (Andreas Ericsson) Date: Mon, 17 Oct 2005 16:11:21 +0200 Subject: check_nrpe issue In-Reply-To: <007c01c5d31f$6b985740$620a010a@GRAPEAPE> References: <007c01c5d31f$6b985740$620a010a@GRAPEAPE> Message-ID: <4353B109.8050307@op5.se> Todd Barbera wrote: > " That you need to do what says in the readme and look in the logs on the > remote site. In 75 cases out of 100, the nrpe host isn't configured to > allow the client to connect to it. In about 20% of the cases one end > supports (and tries to use) SSL while the other doesn't. In the > remaining 5, there is a protocol mismatch (check_nrpe 2.0 being used to > poll nrpe 1.8)." > > I think perhaps I wasn't clear on what this issue was given your response. > check_nrpe does work against these two client hosts. I am able to run > check_oracle, check_load, and check_disk via check_nrpe all without > incident. When I downloaded check_log2 and check_log.pl and tried to call > them via check_nrpe, that is when I received the error. Both check_log2 and > check_log.pl work fine if I run them locally on the client so I'm not sure > why it isn't successfully passing the results using nrpe. If I turn on > debugging, all I see in my syslog are successful connection messages. I have > read the readme, FAQ's, and mailing lists prior to posting to see if I could > find an answer, but I was unable to find any information as to why this may > be the case. Both check_nrpe and nrpe are version 2.0. Both are configured > for SSL. Here's the relevant section of nrpe.cfg: > Ah. That was missing from the original mail. > command[check_users]=/usr/local/nagios/libexec/check_users -w 5 -c 10 > command[check_load]=/usr/local/nagios/libexec/check_load -w 15,10,5 -c > 30,25,20 > command[check_disk]=/usr/local/nagios/libexec/check_disk -w 10% -c 5% -l -e > command[check_zombie_procs]=/usr/local/nagios/libexec/check_procs -w 5 -c 10 > -s Z > command[check_total_procs]=/usr/local/nagios/libexec/check_procs -w 150 -c > 200 > command[check_oracle]=/usr/local/nagios/libexec/check_oracle --login insight > command[check_log]=/usr/local/nagios/libexec/check_logs.pl -c > /usr/local/nagios/etc/log.conf > command[check_log2]=/usr/local/nagios/libexec/check_log2 -F > /oracle/app/oracle/admin/insight/bdump/alert_insight.log - > O /usr/local/nagios/tmp/alert_insight.log -q ORA- > command[check_disksuite]=/usr/local/nagios/libexec/check_disksuite > > All of these can be called successfully using check_nrpe except for > check_log and check_log2. Any ideas as to why? Not really, no, although I don't recognize the options for check_log and check_log2. That's not exactly odd though since I've seen at least three distinctly different versions of it. You could try running the following command su - `sed -n s/^nrpe_user=//p` followed by this slightly longer incantation which should all be on one line; sed -n 's/^command\[\([^]]*\)\]=/\1 /p' /etc/nrpe.conf | while read name cmd; do $cmd >/dev/null 2>&1 || (echo "Command $name failed."; $cmd; echo); done NRPE users that have enabled command-line arguments shouldn't try the above command, and shouldn't ask me why it isn't working if they do. > Thanks in advance. > > Todd > > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net > [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Andreas > Ericsson > Sent: Saturday, October 15, 2005 3:41 AM > To: nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] check_nrpe issue > > Todd Barbera wrote: > >>Hi, >> >>I had tried posting about a problem using NRPE 2.0 and check_log2 > > yesterday > >>but unfortunately no one responded. I ended up trying the plug-in >>check_log.pl, but I'm still getting the same error, which is: >> >>CHECK_NRPE: Error receiving data from daemon. >> >>Other plug-ins are working fine, such as check_oracle. I'm running Nagios >>2.0b3 on a Solaris 9 sparc machine. The client is the same. Does anyone > > have > >>any ideas why the check_log2 and check_log.pl plug-ins do not seem to work >>with NRPE? BTW, check_log.pl is MUCH faster than check_log2. What exactly >>does this error message imply? > > > > That you need to do what says in the readme and look in the logs on the > remote site. In 75 cases out of 100, the nrpe host isn't configured to > allow the client to connect to it. In about 20% of the cases one end > supports (and tries to use) SSL while the other doesn't. In the > remaining 5, there is a protocol mismatch (check_nrpe 2.0 being used to > poll nrpe 1.8). > > >>A search of the mailing lists and FAQ didn't >>shed any light on it. Thanks in advance for any insight you can offer. >> > > > I shall have to remedy that. Thanks for pointing it out. > -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From AHKAPLAN at PARTNERS.ORG Mon Oct 17 16:16:38 2005 From: AHKAPLAN at PARTNERS.ORG (Kaplan, Andrew H.) Date: Mon, 17 Oct 2005 10:16:38 -0400 Subject: Monitoring Samba Message-ID: <9C63A4713C4E3342B90428CE44806A73EC382F@PHSXMB5.partners.org> Hi there -- What plug-in or script monitors Samba running on a remote host? Thanks. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Mon Oct 17 16:21:29 2005 From: marc at ena.com (Marc Powell) Date: Mon, 17 Oct 2005 09:21:29 -0500 Subject: Problem with Check_DHCP Message-ID: Is check_dhcp owned by root? Just adding the sticky bit isn't sufficient if the owner (nagios presumably) doesn't have sufficient privileges. -- Marc > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Fotios Magoufis > Sent: Monday, October 17, 2005 9:07 AM > To: Andreas Ericsson > Cc: Nagios-users at lists.sourceforge.net > Subject: RE: [Nagios-users] Problem with Check_DHCP > > Yes, I ran it as root and still no success. If I execute the command as > root, it works fine. > > -----Original Message----- > From: Andreas Ericsson [mailto:ae at op5.se] > Sent: Monday, October 17, 2005 10:04 AM > To: Fotios Magoufis > Cc: Nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] Problem with Check_DHCP > > Fotios Magoufis wrote: > > I've been having such a hard time getting the check_dhcp to work, but > > I still can't figure it out. When the service runs as the nagios > > user, I get the following error: > > > > Error: Could not bind socket to interface eth0. Check your privileges. > > > > I tried setting chmod +s check_dhcp, but it did not have any positive > > effect. > > > > Any ideas on how to fix? > > > > I suppose you did chmod +s check_dhcp as root, right? > > Does it work when you run it as root from the console? ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Mon Oct 17 16:24:19 2005 From: ae at op5.se (Andreas Ericsson) Date: Mon, 17 Oct 2005 16:24:19 +0200 Subject: Problem with Check_DHCP In-Reply-To: References: Message-ID: <4353B413.6020905@op5.se> Please don't top-post. It's terribly annoying. Also keep in mind that it helps a lot if you post some command output rather than trying to tell us all how you think things are set up. Fotios Magoufis wrote: > Yes, I ran it as root and still no success. If I execute the command as > root, it works fine. > Nagios doesn't run check_dhcp as root then, no matter what you chmod'ed it too. Does your ls -l output look something like this? # ls -l check_dhcp -rwsr-xr-x 1 root root 19876 Jun 30 19:03 check_dhcp The important part here is the first s and that the owner is root. Is the plugin located on a partition mounted with the nosuid option? Is SELinux enabled and stripping suid binaries of their privileges? Is check_dhcp a symlink somewhere, and were you daft enough to chmod the symlink rather than the actual file? Does the directory where check_dhcp resides have the suid bit set? Try this; chown root:root check_dhcp chmod 4711 check_dhcp Did it work? > > Fotios Magoufis | Operations Manager > Furniture.com | 85 River Street Waltham, MA 02453 | 781-693-2131 > > -----Original Message----- > From: Andreas Ericsson [mailto:ae at op5.se] > Sent: Monday, October 17, 2005 10:04 AM > To: Fotios Magoufis > Cc: Nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] Problem with Check_DHCP > > Fotios Magoufis wrote: > >>I've been having such a hard time getting the check_dhcp to work, but >>I still can't figure it out. When the service runs as the nagios >>user, I get the following error: >> >>Error: Could not bind socket to interface eth0. Check your privileges. >> >>I tried setting chmod +s check_dhcp, but it did not have any positive >>effect. >> >>Any ideas on how to fix? >> > > > I suppose you did chmod +s check_dhcp as root, right? > > Does it work when you run it as root from the console? > -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Mon Oct 17 16:33:26 2005 From: ae at op5.se (Andreas Ericsson) Date: Mon, 17 Oct 2005 16:33:26 +0200 Subject: Problem with Check_DHCP In-Reply-To: References: Message-ID: <4353B636.5050102@op5.se> Marc Powell wrote: > Is check_dhcp owned by root? Just adding the sticky bit isn't sufficient > if the owner (nagios presumably) doesn't have sufficient privileges. > Umm... just to be anal. The sticky bit is t (octal 1000). It has no real use for files anymore, but can be used to override write-permissions on directories (only owner and root can delete files in directories with the sticky bit set). The suid bit (s, or octal 4000) is quite different. > -- > Marc > > >>-----Original Message----- >>From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- >>admin at lists.sourceforge.net] On Behalf Of Fotios Magoufis >>Sent: Monday, October 17, 2005 9:07 AM >>To: Andreas Ericsson >>Cc: Nagios-users at lists.sourceforge.net >>Subject: RE: [Nagios-users] Problem with Check_DHCP >> >>Yes, I ran it as root and still no success. If I execute the command > > as > >>root, it works fine. > > > >>-----Original Message----- >>From: Andreas Ericsson [mailto:ae at op5.se] >>Sent: Monday, October 17, 2005 10:04 AM >>To: Fotios Magoufis >>Cc: Nagios-users at lists.sourceforge.net >>Subject: Re: [Nagios-users] Problem with Check_DHCP >> >>Fotios Magoufis wrote: >> >>>I've been having such a hard time getting the check_dhcp to work, > > but > >>>I still can't figure it out. When the service runs as the nagios >>>user, I get the following error: >>> >>>Error: Could not bind socket to interface eth0. Check your > > privileges. > >>>I tried setting chmod +s check_dhcp, but it did not have any > > positive > >>>effect. >>> >>>Any ideas on how to fix? >>> >> >>I suppose you did chmod +s check_dhcp as root, right? >> >>Does it work when you run it as root from the console? > > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, discussions, > and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From p.miquet at hafiba.fr Mon Oct 17 16:35:37 2005 From: p.miquet at hafiba.fr (Pascal MIQUET) Date: Mon, 17 Oct 2005 16:35:37 +0200 Subject: Editing nagios configuration Message-ID: <2DB6674E64C314478B384C12B8990F501B7EE1@srv2k3sbs.hafiba.local> Hi, Is there some nice tool to edit the nagios configuration files ? Vi or emacs are some poor interfaces to set the right options, and sometimes they are legions Thanks for your help Regards Pascal Miquet -------------- next part -------------- An HTML attachment was scrubbed... URL: From todd_barbera at wgbh.org Mon Oct 17 16:41:06 2005 From: todd_barbera at wgbh.org (Todd Barbera) Date: Mon, 17 Oct 2005 10:41:06 -0400 Subject: check_nrpe issue In-Reply-To: <4353B109.8050307@op5.se> References: <4353B109.8050307@op5.se> Message-ID: <00ad01c5d328$ce291a80$620a010a@GRAPEAPE> Of course! Permissions. I had been running the command as root and the files that required updating were owned by root, not nagios. After changing the ownership to the nagios user, check_nrpe worked like a charm. Thanks for helping me sort that out! Todd -----Original Message----- From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Andreas Ericsson Sent: Monday, October 17, 2005 10:11 AM To: nagios-users at lists.sourceforge.net Subject: Re: [Nagios-users] check_nrpe issue Todd Barbera wrote: > " That you need to do what says in the readme and look in the logs on the > remote site. In 75 cases out of 100, the nrpe host isn't configured to > allow the client to connect to it. In about 20% of the cases one end > supports (and tries to use) SSL while the other doesn't. In the > remaining 5, there is a protocol mismatch (check_nrpe 2.0 being used to > poll nrpe 1.8)." > > I think perhaps I wasn't clear on what this issue was given your response. > check_nrpe does work against these two client hosts. I am able to run > check_oracle, check_load, and check_disk via check_nrpe all without > incident. When I downloaded check_log2 and check_log.pl and tried to call > them via check_nrpe, that is when I received the error. Both check_log2 and > check_log.pl work fine if I run them locally on the client so I'm not sure > why it isn't successfully passing the results using nrpe. If I turn on > debugging, all I see in my syslog are successful connection messages. I have > read the readme, FAQ's, and mailing lists prior to posting to see if I could > find an answer, but I was unable to find any information as to why this may > be the case. Both check_nrpe and nrpe are version 2.0. Both are configured > for SSL. Here's the relevant section of nrpe.cfg: > Ah. That was missing from the original mail. > command[check_users]=/usr/local/nagios/libexec/check_users -w 5 -c 10 > command[check_load]=/usr/local/nagios/libexec/check_load -w 15,10,5 -c > 30,25,20 > command[check_disk]=/usr/local/nagios/libexec/check_disk -w 10% -c 5% -l -e > command[check_zombie_procs]=/usr/local/nagios/libexec/check_procs -w 5 -c 10 > -s Z > command[check_total_procs]=/usr/local/nagios/libexec/check_procs -w 150 -c > 200 > command[check_oracle]=/usr/local/nagios/libexec/check_oracle --login insight > command[check_log]=/usr/local/nagios/libexec/check_logs.pl -c > /usr/local/nagios/etc/log.conf > command[check_log2]=/usr/local/nagios/libexec/check_log2 -F > /oracle/app/oracle/admin/insight/bdump/alert_insight.log - > O /usr/local/nagios/tmp/alert_insight.log -q ORA- > command[check_disksuite]=/usr/local/nagios/libexec/check_disksuite > > All of these can be called successfully using check_nrpe except for > check_log and check_log2. Any ideas as to why? Not really, no, although I don't recognize the options for check_log and check_log2. That's not exactly odd though since I've seen at least three distinctly different versions of it. You could try running the following command su - `sed -n s/^nrpe_user=//p` followed by this slightly longer incantation which should all be on one line; sed -n 's/^command\[\([^]]*\)\]=/\1 /p' /etc/nrpe.conf | while read name cmd; do $cmd >/dev/null 2>&1 || (echo "Command $name failed."; $cmd; echo); done NRPE users that have enabled command-line arguments shouldn't try the above command, and shouldn't ask me why it isn't working if they do. > Thanks in advance. > > Todd > > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net > [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Andreas > Ericsson > Sent: Saturday, October 15, 2005 3:41 AM > To: nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] check_nrpe issue > > Todd Barbera wrote: > >>Hi, >> >>I had tried posting about a problem using NRPE 2.0 and check_log2 > > yesterday > >>but unfortunately no one responded. I ended up trying the plug-in >>check_log.pl, but I'm still getting the same error, which is: >> >>CHECK_NRPE: Error receiving data from daemon. >> >>Other plug-ins are working fine, such as check_oracle. I'm running Nagios >>2.0b3 on a Solaris 9 sparc machine. The client is the same. Does anyone > > have > >>any ideas why the check_log2 and check_log.pl plug-ins do not seem to work >>with NRPE? BTW, check_log.pl is MUCH faster than check_log2. What exactly >>does this error message imply? > > > > That you need to do what says in the readme and look in the logs on the > remote site. In 75 cases out of 100, the nrpe host isn't configured to > allow the client to connect to it. In about 20% of the cases one end > supports (and tries to use) SSL while the other doesn't. In the > remaining 5, there is a protocol mismatch (check_nrpe 2.0 being used to > poll nrpe 1.8). > > >>A search of the mailing lists and FAQ didn't >>shed any light on it. Thanks in advance for any insight you can offer. >> > > > I shall have to remedy that. Thanks for pointing it out. > -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Mon Oct 17 16:38:14 2005 From: marc at ena.com (Marc Powell) Date: Mon, 17 Oct 2005 09:38:14 -0500 Subject: Problem with Check_DHCP Message-ID: You're absolutely correct of course =) -- Marc > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Andreas Ericsson > Sent: Monday, October 17, 2005 9:33 AM > To: Nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] Problem with Check_DHCP > > Marc Powell wrote: > > Is check_dhcp owned by root? Just adding the sticky bit isn't sufficient > > if the owner (nagios presumably) doesn't have sufficient privileges. > > > > Umm... just to be anal. The sticky bit is t (octal 1000). It has no real > use for files anymore, but can be used to override write-permissions on > directories (only owner and root can delete files in directories with > the sticky bit set). The suid bit (s, or octal 4000) is quite different. > > > -- > > Marc > > > > > >>-----Original Message----- > >>From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > >>admin at lists.sourceforge.net] On Behalf Of Fotios Magoufis > >>Sent: Monday, October 17, 2005 9:07 AM > >>To: Andreas Ericsson > >>Cc: Nagios-users at lists.sourceforge.net > >>Subject: RE: [Nagios-users] Problem with Check_DHCP > >> > >>Yes, I ran it as root and still no success. If I execute the command > > > > as > > > >>root, it works fine. > > > > > > > >>-----Original Message----- > >>From: Andreas Ericsson [mailto:ae at op5.se] > >>Sent: Monday, October 17, 2005 10:04 AM > >>To: Fotios Magoufis > >>Cc: Nagios-users at lists.sourceforge.net > >>Subject: Re: [Nagios-users] Problem with Check_DHCP > >> > >>Fotios Magoufis wrote: > >> > >>>I've been having such a hard time getting the check_dhcp to work, > > > > but > > > >>>I still can't figure it out. When the service runs as the nagios > >>>user, I get the following error: > >>> > >>>Error: Could not bind socket to interface eth0. Check your > > > > privileges. > > > >>>I tried setting chmod +s check_dhcp, but it did not have any > > > > positive > > > >>>effect. > >>> > >>>Any ideas on how to fix? > >>> > >> > >>I suppose you did chmod +s check_dhcp as root, right? > >> > >>Does it work when you run it as root from the console? > > > > > > > > ------------------------------------------------------- > > This SF.Net email is sponsored by: > > Power Architecture Resource Center: Free content, downloads, > discussions, > > and more. http://solutions.newsforge.com/ibmarch.tmpl > > _______________________________________________ > > Nagios-users mailing list > > Nagios-users at lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > > ::: Messages without supporting info will risk being sent to /dev/null > > > > -- > Andreas Ericsson andreas.ericsson at op5.se > OP5 AB www.op5.se > Tel: +46 8-230225 Fax: +46 8-230231 > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, discussions, > and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From fmagoufis at furniture.com Mon Oct 17 16:46:33 2005 From: fmagoufis at furniture.com (Fotios Magoufis) Date: Mon, 17 Oct 2005 10:46:33 -0400 Subject: Problem with Check_DHCP Message-ID: -----Original Message----- From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Andreas Ericsson Sent: Monday, October 17, 2005 10:24 AM To: Nagios-users at lists.sourceforge.net Subject: Re: [Nagios-users] Problem with Check_DHCP Please don't top-post. It's terribly annoying. Also keep in mind that it helps a lot if you post some command output rather than trying to tell us all how you think things are set up. Fotios Magoufis wrote: > Yes, I ran it as root and still no success. If I execute the command > as root, it works fine. > Nagios doesn't run check_dhcp as root then, no matter what you chmod'ed it too. Does your ls -l output look something like this? # ls -l check_dhcp -rwsr-xr-x 1 root root 19876 Jun 30 19:03 check_dhcp The important part here is the first s and that the owner is root. Is the plugin located on a partition mounted with the nosuid option? Is SELinux enabled and stripping suid binaries of their privileges? Is check_dhcp a symlink somewhere, and were you daft enough to chmod the symlink rather than the actual file? Does the directory where check_dhcp resides have the suid bit set? Try this; chown root:root check_dhcp chmod 4711 check_dhcp Did it work? Everything worked like a charm, I completely forgot to set the ownership to root. > > Fotios Magoufis | Operations Manager > Furniture.com | 85 River Street Waltham, MA 02453 | 781-693-2131 > > -----Original Message----- > From: Andreas Ericsson [mailto:ae at op5.se] > Sent: Monday, October 17, 2005 10:04 AM > To: Fotios Magoufis > Cc: Nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] Problem with Check_DHCP > > Fotios Magoufis wrote: > >>I've been having such a hard time getting the check_dhcp to work, but >>I still can't figure it out. When the service runs as the nagios >>user, I get the following error: >> >>Error: Could not bind socket to interface eth0. Check your privileges. >> >>I tried setting chmod +s check_dhcp, but it did not have any positive >>effect. >> >>Any ideas on how to fix? >> > > > I suppose you did chmod +s check_dhcp as root, right? > > Does it work when you run it as root from the console? > -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -------------- next part -------------- A non-text attachment was scrubbed... Name: smime.p7s Type: application/x-pkcs7-signature Size: 3036 bytes Desc: not available URL: From Mark.Law at thomson.com Mon Oct 17 16:49:10 2005 From: Mark.Law at thomson.com (Mark.Law at thomson.com) Date: Mon, 17 Oct 2005 10:49:10 -0400 Subject: Nagios URL and accounts Message-ID: OK - the monitors are up on the test system: https://10.226.53.17/nagios/ guest/guest Please take a look at the unix servers - they apparently have not been configured to monitor any important processes - please edit /usr/local/share/snmp/snmpd.conf and put entries in there to monitor services. When Darrin is OK with the test system, I'll move to production. Thanks, Mark -----Original Message----- From: Schat, Curly (TH USA) Sent: Friday, October 14, 2005 2:42 PM To: Law, Mark (TSH Center) Subject: RE: Nagios URL and accounts Its just those 2 for now... we will be setting up Production soon. I would like these 2 Windows Servers grouped with the 3 Solaris servers Dean Hansen requested last week. ccidv2ap1.tshmg.com 10.226.12.107 Solaris ccidv2db1.tshmg.com 10.226.12.108 Solaris ccidv2ws1.tshmg.com 10.226.12.109 Solaris ccidv2fs1.tshmg.com 10.226.12.110 Win2K ccipd2ms1.tshmg.com 10.226.12.111 Win2K Thanks Mark - have a good weekend. Curly -----Original Message----- From: Law, Mark (TSH Center) Sent: Friday, October 14, 2005 10:43 AM To: Schat, Curly (TH USA) Subject: RE: Nagios URL and accounts No problem - please send all machine name, IP addresses and OS type. -Mark -----Original Message----- From: Schat, Curly (TH USA) Sent: Thursday, October 13, 2005 4:18 PM To: Law, Mark (TSH Center) Subject: RE: Nagios URL and accounts Thanks Mark... By spreadsheets, are you referring to appendix 1? These servers should be grouped with the Nagios request Dean Hansen submitted last week... for PeopleSoft ccidv2ap1 - ccidv2db1 - ccidv2ws1 are all solaris servers We would like to add ccidv2fs1 and ccipd2ms1 to this group - these are all non-productions servers. thanks -----Original Message----- From: Law, Mark (TSH Center) Sent: Thursday, October 13, 2005 1:49 PM To: Schat, Curly (TH USA) Cc: Naughton, Darrin (TS Ireland); Claypoole, Douglas (TS USA) Subject: RE: Nagios URL and accounts That works. If you could please review the attached and work through the spreadsheets to get me some information - I'll be able to start working on this. Thanks, Mark -----Original Message----- From: Schat, Curly (TH USA) Sent: Thursday, October 13, 2005 3:08 PM To: Law, Mark (TSH Center) Subject: RE: Nagios URL and accounts Try .111 now. I have set it up like .110 C -----Original Message----- From: Law, Mark (TSH Center) Sent: Thursday, October 13, 2005 11:39 AM To: Schat, Curly (TH USA) Subject: RE: Nagios URL and accounts OK - worked for .110 but not for .111, wrong password. For the sites implemented so far, passwords have not been used - we can use them if you like - I'll just have to create other service definitions when we get to that point. [root at edcnagios2 plugins]# ./check_nt -H 10.226.12.110 -v CPULOAD -l 10,80,95,60,80,95,1440,80,95 -s TSHedcRO CPU Load (10 min. 0%) (60 min. 1%) (1440 min. 1%) [root at edcnagios2 plugins]# ./check_nt -H 10.226.12.111 -v CPULOAD -l 10,80,95,60,80,95,1440,80,95 -s TSHedcRO NSClient - ERROR:Wrong password [root at edcnagios2 plugins]# ./check_nt -H 10.226.12.111 -v CPULOAD -l 10,80,95,60,80,95,1440,80,95 NSClient - ERROR:Wrong password [root at edcnagios2 plugins]# -----Original Message----- From: Schat, Curly (TH USA) Sent: Thursday, October 13, 2005 12:45 PM To: Law, Mark (TSH Center) Subject: RE: Nagios URL and accounts Thanks! ccidv2fs1 - 10.226.12.110 ccipd2ms1 - 10.226.12.111 I have changed the community string on ccidv2fs1 to "TSHedcRO" and the port is: 1248 I have not made the changes on ccipd2ms1. -----Original Message----- From: Law, Mark (TSH Center) Sent: Thursday, October 13, 2005 10:14 AM To: Schat, Curly (TH USA) Subject: RE: Nagios URL and accounts It should - the docs say we are using nc_net, but that is only because it is actively being worked on. Nsclient should work fine. If you have the addresses of a couple of hosts, I can perform a few tests. -Mark -----Original Message----- From: Schat, Curly (TH USA) Sent: Thursday, October 13, 2005 12:13 PM To: Law, Mark (TSH Center) Cc: Naughton, Darrin (TS Ireland); Price, Patrick (TSH Center); Cubr, Timothy (TSH Center) Subject: RE: Nagios URL and accounts nsclient is the agent we used on Windows servers in Denver to talk to Nagios. Will that work with the Nagios implementation in the EDC? -----Original Message----- From: Law, Mark (TSH Center) Sent: Thursday, October 13, 2005 10:09 AM To: Schat, Curly (TH USA) Cc: Naughton, Darrin (TS Ireland); Price, Patrick (TSH Center); Cubr, Timothy (TSH Center) Subject: RE: Nagios URL and accounts My understanding is that you can continue to use that if you want, but that for machines located at the EDC/NADC/Eagan, monitoring needs to be done by the Nagios installation at that Data Center. Tim/Pat/Darrin, is that your understanding? Regards, Mark -----Original Message----- From: Schat, Curly (TH USA) Sent: Thursday, October 13, 2005 10:56 AM To: Law, Mark (TSH Center) Subject: RE: Nagios URL and accounts We are using NSclient with Nagios in Denver. - is there any reason that will not work? -----Original Message----- From: Law, Mark (TSH Center) Sent: Wednesday, October 12, 2005 4:49 PM To: Schat, Curly (TH USA) Subject: RE: Nagios URL and accounts Hi Curly, Please see the attached for the information I need to get started. Regards, Mark -----Original Message----- From: Schat, Curly (TH USA) Sent: Wednesday, October 12, 2005 5:58 PM To: Law, Mark (TSH Center) Subject: RE: Nagios URL and accounts Hi Mark: Thanks for your help... I the PeopleSoft sys admin (Denver). I have another request I am about to submit. I have 3, soon to be 4 Win2k peoplesoft servers in the EDC. They were monitored with nagios here in denver (using nsclient). I can change the community string... Can you monitor Windows servers with you Nagios? If you can monitor these boxes, let me know what you need... I'll open a Remedy ticket. Thanks, Curly -----Original Message----- From: Law, Mark (TSH Center) Sent: Tuesday, October 11, 2005 8:49 AM To: Hansen, Dean (TH USA); McManus, Timothy (TH USA) Cc: Cetera, Anthony (TH USA); June, Leo (TS USA); Schat, Curly (TH USA); Pearce, Angela (TH USA); Naughton, Darrin (TS Ireland) Subject: RE: Nagios URL and accounts They can be added when you like. Please check with Darrin Naughton to make sure he has all he needs prior to moving to production. If Darrin is happy, I'm happy. :-) -Mark -----Original Message----- From: Hansen, Dean (TH USA) Sent: Tuesday, October 11, 2005 10:46 AM To: Law, Mark (TSH Center); McManus, Timothy (TH USA) Cc: Cetera, Anthony (TH USA); June, Leo (TS USA); Schat, Curly (TH USA); Pearce, Angela (TH USA) Subject: RE: Nagios URL and accounts Mark, I was able to access Nagios Test and check out the monitors for the first three PeopleSoft servers. At what point will these servers be added to the Nagios Prod server? Thanks, Dean -----Original Message----- From: Law, Mark (TSH Center) Sent: Friday, October 07, 2005 1:19 PM To: McManus, Timothy (TH USA); Hansen, Dean (TH USA) Cc: Cetera, Anthony (TH USA); June, Leo (TS USA) Subject: RE: Nagios URL and accounts Hi, EDC TEST: https://10.226.53.17/nagios/ EDC PROD: https://edcnagios.edc.tshhosting.com/nagios/ Username/password guest/guest Let me know if problems. -Mark -----Original Message----- From: McManus, Timothy (TH USA) Sent: Thursday, October 06, 2005 3:01 PM To: Hansen, Dean (TH USA); Law, Mark (TSH Center) Cc: Cetera, Anthony (TH USA); June, Leo (TS USA) Subject: Re: Nagios URL and accounts Mark: Can you provide this information to Dean? Thanks! -------------------------- Sent from my BlackBerry Wireless Handheld -----Original Message----- From: Hansen, Dean (TH USA) To: McManus, Timothy (TH USA) CC: Cetera, Anthony (TH USA) ; June, Leo (TS USA) Sent: Thu Oct 06 14:58:19 2005 Subject: Nagios URL and accounts Tim, As we move forward and begin to use Nagios, we will require access to the Nagios monitoring site and the appropriate login ID's. Can you forward the URL and ensure that we obtain any accounts that may be needed to access the site? Thanks, Dean ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From p.miquet at hafiba.fr Mon Oct 17 16:56:33 2005 From: p.miquet at hafiba.fr (Pascal MIQUET) Date: Mon, 17 Oct 2005 16:56:33 +0200 Subject: check cron Message-ID: <2DB6674E64C314478B384C12B8990F501B7EE4@srv2k3sbs.hafiba.local> Hi, Is there a plugin, or a way to check on a remote linux server that the crontab is up an running, and some commands are active into the crontab of a specific user ? Thanks for your help. Regards -------------- next part -------------- An HTML attachment was scrubbed... URL: From deborah.martin at WhiteCross.com Mon Oct 17 17:11:59 2005 From: deborah.martin at WhiteCross.com (Deborah Martin) Date: Mon, 17 Oct 2005 16:11:59 +0100 Subject: Fruity 1.0 Beta 2 Patchlevel 1 Released! In-Reply-To: <3d2eb72a05092808185c2a45f1@mail.gmail.com> References: <3d2eb72a05092808185c2a45f1@mail.gmail.com> Message-ID: Hi, A while ago I sent a post to say I was having problems with getting fruity up and running. It was thought to be caused by the version of PHP. I'm definitely using PHP 5.0 but not being particuarly php literate, i'm still getting the same error. I must be missing something in the php config or something else is amiss. Any input would be appreciated. regards, deborah --- Posting from before ---- Hi, Just tried to setup and use fruity as below and I get the following error from my browser : - Parse error: parse error, unexpected T_OBJECT_OPERATOR in /srv/www/htdocs/fruity/output/output.php on line 314 How do i fix this ? regards, deborah -----Original Message----- From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net]On Behalf Of Taylor Dondich Sent: 28 September 2005 16:18 To: nagios-users at lists.sourceforge.net Subject: [Nagios-users] Fruity 1.0 Beta 2 Patchlevel 1 Released! Fruity, your favorite Nagios configuration editor, has been updated. Fruity 1.0 Beta 2 Patchlevel 1 has been released! CHANGELOG SINCE BETA2: - Lots of bugfixes. - Thanks to Craig Hancock for making this patchlevel release. http://fruity.sf.net Take a look! Taylor ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null __________________________________________________ Virus checked by MessageLabs Virus Control Centre. __________________________________________________ Virus checked by MessageLabs Virus Control Centre. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Mon Oct 17 17:12:34 2005 From: marc at ena.com (Marc Powell) Date: Mon, 17 Oct 2005 10:12:34 -0500 Subject: check cron Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users- > admin at lists.sourceforge.net] On Behalf Of Pascal MIQUET > Sent: Monday, October 17, 2005 9:57 AM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] check cron > > Hi, > > Is there a plugin, or a way to check on a remote linux server that the > crontab is up an running, and some commands are active into the crontab of > a specific user ? Checking that cron is running is easy: check_procs + check_by_ssh or NRPE. There's not a specific plugin that can check the entries in a users crontab but you could use check_log to see if it ran in /var/log/cron or you could script a simple plugin to verify that the line exists as expected in /var/spool/cron/{user}. There are probably other ways but those two come to mind first. -- Marc ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From AHKAPLAN at PARTNERS.ORG Mon Oct 17 17:18:41 2005 From: AHKAPLAN at PARTNERS.ORG (Kaplan, Andrew H.) Date: Mon, 17 Oct 2005 11:18:41 -0400 Subject: Just In Case the First E-mail Didn't Get Through Message-ID: <9C63A4713C4E3342B90428CE44806A73EC3833@PHSXMB5.partners.org> Hi there -- What plug-in or script monitors Samba running on a remote host? Thanks. ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From p.miquet at hafiba.fr Mon Oct 17 17:25:44 2005 From: p.miquet at hafiba.fr (Pascal MIQUET) Date: Mon, 17 Oct 2005 17:25:44 +0200 Subject: check dig Message-ID: <2DB6674E64C314478B384C12B8990F501B7EE7@srv2k3sbs.hafiba.local> I've got some trouble using the check_dig module The command /usr/bin/dig www.free.fr return some results. If I use the ./check_dig -v www.free.fr -a 192.168.1.10 for tests, the command run in time out The -v option show me that the dig command is issued with the options /usr/bin/dig @www.free.fr -p 53 (null) -t A And this command executed into a terminal got a connection time out; no servers could be reached What's wrong with the check_dig command ? Thanks for your help Regards -------------- next part -------------- An HTML attachment was scrubbed... URL: From james at jameswhite.org Mon Oct 17 17:38:23 2005 From: james at jameswhite.org (James S. White) Date: Mon, 17 Oct 2005 10:38:23 -0500 (CDT) Subject: $SERVICEOUTPUT$ wierdness Message-ID: I upgraded from 2.0b2 to 2.0b4 and now some of my servicechecks say "(No output!)" even when running it by hand produces: "Load : 0.00 0.00 0.00 : OK " I can copy the 2.0b2 binary of "nagios" from a working (production) system and overwrite the 2.0b4 binary and My output is then seen by nagios and passed to nagiostat. Further examination showed that $SERVICEOUTPUT$ wan empty. Was this something that was introduced with the 2.0b3 changelog item "Fixed segfaults in CGIs where plugin output was NULL"? Or does nagios now do strict output checking of the servicechecks. This seems to only effect plugins whose output doesn't take the form: TAG STATUS - as in: CPU OK - 0 0 99 246458 43619 40728186 2312 I hate the idea that I will have to re-configure nagiostat's regular expressions. ----------------------------------------------------------------------- James S. White primary/voip: (615) 469-0268 928 Mountain Valley Dr. .O. mobile: (256) 476-2619 Nashville, TN 37209 ..O work: (615) 341-8218 http://www.jameswhite.org OOO work-mobile: (615) 519-5160 james at jameswhite.org fax: (866) 260-5465 ----------------------------------------------------------------------- "Computers are useless. They only give you answers." -- Pablo Picasso ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Ludwig.Pummer at Copart.Com Mon Oct 17 18:09:09 2005 From: Ludwig.Pummer at Copart.Com (Ludwig Pummer) Date: Mon, 17 Oct 2005 09:09:09 -0700 Subject: Problems with check_nrpe+SSL, and I have read the FAQ Message-ID: > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net > [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf > Of Eivind Olsen > Sent: Monday, October 17, 2005 5:14 AM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] Problems with check_nrpe+SSL, and I > have read the FAQ > > Hello. > > I'm trying to set up check_nrpe 2.0 to use SSL, but I can't > get it to work. The nagios-server (192.168.1.4) is running > Solaris 10, the other machine 192.168.1.2 is running Solaris > 8. The nrpe-daemon is run from the command line in > daemon-mode (-d option), not from inetd. > I'm currently doing all testing from the command line. Here's > what I do on the nagios-server: I haven't had to try to make it work in Solaris 8, so I'm afraid I can't help you there. However, I can tell you that I wasn't able to get the OpenSSL that ships with Solaris 10 to work with NRPE. On the Solaris 10 box, run the nrpe daemon and try to check_nrpe on localhost. Don't forget to add localhost to the allowed_hosts line. If you get the "Could not complete SSL handshake" there too, you're running into the same issue I had. I compiled OpenSSL from the source package and linked nrpe against it (./configure --with-ssl-lib=/usr/local/openssl/lib --with-ssl-inc=/usr/local/openssl" and then it worked. One of the OpenSSL packages on one of the free package sites like sunfreeware.com would probably work for you. --Ludwig Pummer ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From robmossrm at aol.com Mon Oct 17 18:26:03 2005 From: robmossrm at aol.com (Rob Moss) Date: Mon, 17 Oct 2005 17:26:03 +0100 Subject: Editing nagios configuration In-Reply-To: <2DB6674E64C314478B384C12B8990F501B7EE1@srv2k3sbs.hafiba.local> References: <2DB6674E64C314478B384C12B8990F501B7EE1@srv2k3sbs.hafiba.local> Message-ID: <4353D09B.70806@aol.com> Pascal MIQUET wrote: > Hi, > > Is there some nice tool to edit the nagios configuration files ? > Vi or emacs are some poor interfaces to set the right options, and > sometimes they are legions Fruity is a nice web based frontend to Nagios http://fruity.sf.net rob. -------------- next part -------------- An HTML attachment was scrubbed... URL: From sghosh at sghosh.org Mon Oct 17 18:29:11 2005 From: sghosh at sghosh.org (Subhendu Ghosh) Date: Mon, 17 Oct 2005 12:29:11 -0400 (EDT) Subject: check dig In-Reply-To: <2DB6674E64C314478B384C12B8990F501B7EE7@srv2k3sbs.hafiba.local> References: <2DB6674E64C314478B384C12B8990F501B7EE7@srv2k3sbs.hafiba.local> Message-ID: On Mon, 17 Oct 2005, Pascal MIQUET wrote: > I've got some trouble using the check_dig module > > The command /usr/bin/dig www.free.fr return some results. > If I use the ./check_dig -v www.free.fr -a 192.168.1.10 for tests, the > command run in time out > The -v option show me that the dig command is issued with the options > /usr/bin/dig @www.free.fr -p 53 (null) -t A > And this command executed into a terminal got a connection time out; no > servers could be reached > > What's wrong with the check_dig command ? > > Thanks for your help > Regards > > -H specifies the nameserver to query -l specifies the names to resolve. Try the following: ./check_dig -v -H nameserver -l www.free.fr -a 192.168.1.10 -- -sg ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From p.miquet at hafiba.fr Mon Oct 17 18:41:35 2005 From: p.miquet at hafiba.fr (Pascal MIQUET) Date: Mon, 17 Oct 2005 18:41:35 +0200 Subject: check dig[Scanned] Message-ID: <2DB6674E64C314478B384C12B8990F501B7EFC@srv2k3sbs.hafiba.local> The serveral tests I've made show me that the -H option is the name server. Thanks for your reply. Pascal -----Message d'origine----- De : Subhendu Ghosh [mailto:sghosh at sghosh.org] Envoy? : lundi 17 octobre 2005 18:29 ? : Pascal MIQUET Cc : nagios-users at lists.sourceforge.net Objet : Re: [Nagios-users] check dig[Scanned] On Mon, 17 Oct 2005, Pascal MIQUET wrote: > I've got some trouble using the check_dig module > > The command /usr/bin/dig www.free.fr return some results. > If I use the ./check_dig -v www.free.fr -a 192.168.1.10 for tests, the > command run in time out The -v option show me that the dig command is > issued with the options /usr/bin/dig @www.free.fr -p 53 (null) -t A > And this command executed into a terminal got a connection time out; > no servers could be reached > > What's wrong with the check_dig command ? > > Thanks for your help > Regards > > -H specifies the nameserver to query -l specifies the names to resolve. Try the following: ./check_dig -v -H nameserver -l www.free.fr -a 192.168.1.10 -- -sg ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From p.miquet at hafiba.fr Mon Oct 17 18:40:14 2005 From: p.miquet at hafiba.fr (Pascal MIQUET) Date: Mon, 17 Oct 2005 18:40:14 +0200 Subject: Editing nagios configuration[Scanned] Message-ID: <2DB6674E64C314478B384C12B8990F501B7EFA@srv2k3sbs.hafiba.local> Thanks for your reply. What I see, is that I didn't get the right tools to let a try (PHP 4 and MySQL 3.23) Best regards Pascal Miquet ________________________________ De : Rob Moss [mailto:robmossrm at aol.com] Envoy? : lundi 17 octobre 2005 18:26 ? : Pascal MIQUET Cc : nagios-users at lists.sourceforge.net Objet : Re: [Nagios-users] Editing nagios configuration[Scanned] Pascal MIQUET wrote: Hi, Is there some nice tool to edit the nagios configuration files ? Vi or emacs are some poor interfaces to set the right options, and sometimes they are legions Fruity is a nice web based frontend to Nagios http://fruity.sf.net rob. -------------- next part -------------- An HTML attachment was scrubbed... URL: From scotjenkins at gmail.com Mon Oct 17 18:45:50 2005 From: scotjenkins at gmail.com (Scot Jenkins) Date: Mon, 17 Oct 2005 11:45:50 -0500 Subject: Problem with Cingular TAP Gateway In-Reply-To: References: Message-ID: On 10/14/05, Stover, Beth wrote: > > Hi, > > I've been using Nagios ver 1 to send SMS text pages to my cell through a > Cingular TAP gateway since April. The number I'm using: > > (408)961-2819 > > A few weeks ago, my SMS alerts stopped working through the TAP gateway. All > other alerts are working fine. > > Is anyone else having a problem with the Cingular TAP gateway???? A few years ago I had been using qpage and an 800# to send messages to my AT&T cell phone (eg, actual paging, not email). The line quit working about 2 years ago. I suspect someone at AT&T started paying attention to those 800# bills. I've been just emailing my phone since then. AT&T wireless is now Cingular in case you missed it. You might try just emailing to cellnumber at mymmode.com to see if that works. Or try some of the paging numbers linked under "resources" on the bottom of this page: http://thinkunix.net/docs/monitoring/qpage.html Scot ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From BStover at NorcalMutual.com Mon Oct 17 20:31:07 2005 From: BStover at NorcalMutual.com (Stover, Beth) Date: Mon, 17 Oct 2005 11:31:07 -0700 Subject: Problem with Cingular TAP Gateway Message-ID: Thanks Scot, My ability to send via SMTP is not impaired. I use the TAP gateway for sending alerts (from modem to cell) in case my Internet connection or E-mail server go down. I see from reading about TAP that there are two modes: Automatic and Manual login. The TAP server can be configured for either. If it's configured for manual mode, then the user will be prompted with "ID=" which is what I'm seeing. Looks like they changed something on the server. Beth -----Original Message----- From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Scot Jenkins Sent: Monday, October 17, 2005 9:46 AM To: Stover, Beth Cc: nagios-users at lists.sourceforge.net Subject: Re: [Nagios-users] Problem with Cingular TAP Gateway On 10/14/05, Stover, Beth wrote: > > Hi, > > I've been using Nagios ver 1 to send SMS text pages to my cell > through a Cingular TAP gateway since April. The number I'm using: > > (408)961-2819 > > A few weeks ago, my SMS alerts stopped working through the TAP > gateway. All other alerts are working fine. > > Is anyone else having a problem with the Cingular TAP gateway???? A few years ago I had been using qpage and an 800# to send messages to my AT&T cell phone (eg, actual paging, not email). The line quit working about 2 years ago. I suspect someone at AT&T started paying attention to those 800# bills. I've been just emailing my phone since then. AT&T wireless is now Cingular in case you missed it. You might try just emailing to cellnumber at mymmode.com to see if that works. Or try some of the paging numbers linked under "resources" on the bottom of this page: http://thinkunix.net/docs/monitoring/qpage.html Scot ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From teng at dataway.com Mon Oct 17 22:23:07 2005 From: teng at dataway.com (Tedman Eng) Date: Mon, 17 Oct 2005 13:23:07 -0700 Subject: nsca / xinetd "Failed to contact identity serv er" Message-ID: <37ED92F9890FAF4BB947613C66FF8B1A08BB2BE9@dw-mail.dataway.com> Try removing the "USERID" from logging options in xinetd.conf (or /etc/xinetd.d/) for your nsca/nrpe. > -----Original Message----- > From: David Gullett [mailto:dgullett at symmetrixtech.com] > Sent: Sunday, October 16, 2005 4:02 PM > To: nagios-users at lists.sourceforge.net > Subject: RE: [Nagios-users] nsca / xinetd "Failed to contact identity > server" > > > Has anyone else experienced this problem? I am getting > "Failed to contact > identity server" every few days and have to restart xinetd. > > Thanks, > > David Gullett > Symmetrix Technologies > > -----Original Message----- > From: nagios-users-admin at lists.sourceforge.net > [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf > Of Michael J > McCafferty > Sent: Tuesday, May 24, 2005 5:13 PM > To: nagios-users at lists.sourceforge.net > Subject: [Nagios-users] nsca / xinetd "Failed to contact > identity server" > > I built a distributed Nagios setup for a customer. It > monitors about 200 > hosts. The central Nagios server monitors several dozen hosts > actively, and > the > distributed Nagios servers monitor the rest. The distributed > Nagios servers > are > due to network architecture, not for capacity. Almost all > checks are done > every > minute. The distributed Nagios servers send their check > results back to the > central Nagios host via NSCA, using xinetd to listen for > inbound connections > from the remote Nagios hosts. > With the exception of having to increase the connections > per second and > the > number of instances for xinetd some time ago (upped it to 60 > connections), I > have had no problems with the setup... until today. > Today I updated the kernel and rebooted. When the system > came back up the > central Nagios server said there where hosts down on one of the remote > networks > which are checked by the remote Nagios servers. Upon further > investigation I > realized that they had been "down" (actually it was new FW > rule that made > them > appear to be down cuz they can't be pinged anymore) for some > time (6 days), > but > were just now being reported down by Nagios. The exact moment > of the last > successful receipt of data from the remote Nagios servers was > the exact > moment > the following message began appearing in /var/log/messages: > > > May 16 10:48:46 nagioshost xinetd[21405]: Failed to contact > identity server > a > t 172.16.0.1: timeout > May 16 10:48:48 nagioshost xinetd[21406]: Failed to contact > identity server > at > 192.168.1.2: timeout > May 16 10:48:49 nagioshost xinetd[21407]: Failed to contact > identity server > at > 10.0.0.1: timeout > > > These messages appear every few seconds from the time of the > last successful > receipt of data from the distributed Nagios servers (6days > ago) until the > reboot > today. The IP addresses in the messages are the IP addresses > of the remote > Nagios servers. No FW rules have changed to cause this. I see > no outbound > port > 113 (identd) traffic at the main Nagios server. I am sure > this is a xinetd > issue... > > My questions are: > > 1) What does this error message mean. > 2) What broke ? > 3) How do I keep it from breaking again ? > > In the meantime I set up a check_log check to see if the > message appears > in > the messages file again. > > > Thanks, > Mike > > > -- > ************************************************************ > Michael J. McCafferty > Principal, Security Engineer > M5 Hosting > 858-576-7325 Voice > http://www.m5hosting.com > ************************************************************ > > ---------------------------------------------------------------- > This message was sent using IMP, the Internet Messaging Program. > > > > ------------------------------------------------------- > This SF.Net email is sponsored by Yahoo. > Introducing Yahoo! Search Developer Network - Create apps using Yahoo! > Search APIs Find out how you can build Yahoo! directly into your own > Applications - visit http://developer.yahoo.net/?fr=offad-ysdn-ostg-q22005 _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jun at lifecapturemedia.com Mon Oct 17 23:02:30 2005 From: jun at lifecapturemedia.com (Jun Li) Date: Mon, 17 Oct 2005 17:02:30 -0400 Subject: nagios snmp plugins problems Message-ID: <200510171702.30909.jun@lifecapturemedia.com> Hello everyone, I have downloaded check_netsnmp plugin and when i I type ./check_netsnmp -h, i got error to say Can't locate Net/SNMP.pm in @INC (@INC contains: /u... I have downloaded and run net-snmp fine in my computer. i also locate SNMP.pm in my computer and copy it to one of the @INC folder, but the problem is still there. anybody can let me know what's the problem and how to fix it. Thank you in advance, Jun ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jun at lifecapturemedia.com Mon Oct 17 23:22:41 2005 From: jun at lifecapturemedia.com (Jun Li) Date: Mon, 17 Oct 2005 17:22:41 -0400 Subject: install Net::SNMP problem Message-ID: <200510171722.42012.jun@lifecapturemedia.com> HI, When i tried to install Net::SNMP via CPAN, i got error to say .... t/usm.t 15 13 86.67% 1-4 6-11 13-15 Failed 1/3 test scripts, 66.67% okay. 13/29 subtests failed, 55.17% okay. make: *** [test_dynamic] Error 255 /usr/bin/make test -- NOT OK Running make install make test had returned bad status, won't install without force Anybody can let me know what's the problem and how can I fix it. Thank you in advance, JUn ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From james at jameswhite.org Mon Oct 17 23:19:08 2005 From: james at jameswhite.org (James S. White) Date: Mon, 17 Oct 2005 16:19:08 -0500 (CDT) Subject: nagios snmp plugins problems In-Reply-To: <200510171702.30909.jun@lifecapturemedia.com> References: <200510171702.30909.jun@lifecapturemedia.com> Message-ID: This is perl looking for a module (Net::SNMP) that is not installed oe not in your @INC path as in: perl -le 'print join("\n", at INC);' You can aquire it here: http://search.cpan.org/~dtown/Net-SNMP-5.1.0/lib/Net/SNMP.pm But your linux distribution undoubtably provides some version of it. Debian: apt-get install libnet-snmp-perl RedHat: up2date net-snmp-perl Other distros: ask someone who uses them. ----------------------------------------------------------------------- James S. White primary/voip: (615) 469-0268 928 Mountain Valley Dr. .O. mobile: (256) 476-2619 Nashville, TN 37209 ..O work: (615) 341-8218 http://www.jameswhite.org OOO work-mobile: (615) 519-5160 james at jameswhite.org fax: (866) 260-5465 ----------------------------------------------------------------------- "Computers are useless. They only give you answers." -- Pablo Picasso On Mon, 17 Oct 2005, Jun Li wrote: > Hello everyone, > I have downloaded check_netsnmp plugin and when i I type ./check_netsnmp -h, i > got error to say Can't locate Net/SNMP.pm in @INC (@INC contains: /u... > > I have downloaded and run net-snmp fine in my computer. i also locate SNMP.pm > in my computer and copy it to one of the @INC folder, but the problem is > still there. > > anybody can let me know what's the problem and how to fix it. > > Thank you in advance, > Jun > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, discussions, > and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Sanjay.Sangavikar at ge.com Mon Oct 17 23:29:28 2005 From: Sanjay.Sangavikar at ge.com (Sangavikar, Sanjay (GE Healthcare, non-ge)) Date: Mon, 17 Oct 2005 16:29:28 -0500 Subject: Connection refused Message-ID: <23583238FE8CF044B2C2484E8306DAEB07634B99@MKEMLVEM05.e2k.ad.ge.com> Hi all, I had configured Nagios 1.2 HOST on solaris 8 and using nrpe 1.9 client daemon on solaris 8 and 9. i am using check_process_remote to check remote process using rsh but its working only with one sun solaris 8 box, on other solaris box its saying "Connection refused" for rsh. Please help me on this issue. Thanks San ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Mon Oct 17 23:42:39 2005 From: ae at op5.se (Andreas Ericsson) Date: Mon, 17 Oct 2005 23:42:39 +0200 Subject: Connection refused In-Reply-To: <23583238FE8CF044B2C2484E8306DAEB07634B99@MKEMLVEM05.e2k.ad.ge.com> References: <23583238FE8CF044B2C2484E8306DAEB07634B99@MKEMLVEM05.e2k.ad.ge.com> Message-ID: <43541ACF.9030101@op5.se> Sangavikar, Sanjay (GE Healthcare, non-ge) wrote: > Hi all, > I had configured Nagios 1.2 HOST on solaris 8 and using nrpe 1.9 client daemon on solaris 8 and 9. > i am using check_process_remote to check remote process using rsh but its working only with > one sun solaris 8 box, on other solaris box its saying "Connection refused" for rsh. > Please help me on this issue. > I think this is an all-time low for uninformative help requests. Have you tried anything at all by yourself? Looking at logs? Logging in via rsh? Looking at logs *while* logging in via rsh? Made sure rshd even exists on the remote server? Made sure rshd is *running* on the remote server? > Thanks You're welcome. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ghenry at suretecsystems.com Tue Oct 18 01:04:45 2005 From: ghenry at suretecsystems.com (Gavin Henry) Date: Tue, 18 Oct 2005 00:04:45 +0100 (BST) Subject: check_procs: Unknown argument - (null) Message-ID: <37943.192.168.100.90.1129590285.squirrel@webmail.suretecsystems.com> Dear List, I can't figure out why I am getting this error: check_procs: Unknown argument - (null) 320 define service{ 321 use generic-service 322 host_name localhost 323 service_description Total Processes 324 is_volatile 0 325 check_period 24x7 326 max_check_attempts 4 327 normal_check_interval 5 328 retry_check_interval 1 329 contact_groups admins 330 notification_interval 960 331 notification_period 24x7 332 check_command check_local_procs!250!400 333 } Aren't I giving it parameters here ^^^^^^^^^^^^ Thanks. -- Kind Regards, Gavin Henry. Open Source. Open Solutions(tm). http://www.suretecsystems.com/ ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From teng at dataway.com Tue Oct 18 01:41:50 2005 From: teng at dataway.com (Tedman Eng) Date: Mon, 17 Oct 2005 16:41:50 -0700 Subject: check_procs: Unknown argument - (null) Message-ID: <37ED92F9890FAF4BB947613C66FF8B1A08BB2BEC@dw-mail.dataway.com> > From: Gavin Henry [mailto:ghenry at suretecsystems.com] > Sent: Monday, October 17, 2005 4:05 PM > I can't figure out why I am getting this error: > > check_procs: Unknown argument - (null) > > 320 define service{ > 321 use generic-service > 322 host_name localhost > 323 service_description Total Processes > 324 is_volatile 0 > 325 check_period 24x7 > 326 max_check_attempts 4 > 327 normal_check_interval 5 > 328 retry_check_interval 1 > 329 contact_groups admins > 330 notification_interval 960 > 331 notification_period 24x7 > 332 check_command > check_local_procs!250!400 > 333 } > > Aren't I giving it parameters here ^^^^^^^^^^^^ > Yes, those are parameters. (about as informative as your question was) Provide some context please. Wheere are you seeing this "Unknown argument error"? What does the pre-flight check say? What is in your checkcommands.cfg section for check_local_procs? ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From webpurchasing at yahoo.com Tue Oct 18 04:51:37 2005 From: webpurchasing at yahoo.com (Zhan Zhaohua) Date: Mon, 17 Oct 2005 19:51:37 -0700 (PDT) Subject: API questions about Nagios Message-ID: <20051018025138.43459.qmail@web52102.mail.yahoo.com> Hi all, I'm a new guy of nagios, I have some API questions about nagios, would you please to help me. 1.I know the Snort is writed in C. Does it have C/C++ API? 2. Because I will use it at many way so we want to know whether Snort support Java API, or the other language? Thank you. yours Zhan __________________________________ Yahoo! Mail - PC Magazine Editors' Choice 2005 http://mail.yahoo.com ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sghosh at sghosh.org Tue Oct 18 04:58:03 2005 From: sghosh at sghosh.org (Subhendu Ghosh) Date: Mon, 17 Oct 2005 22:58:03 -0400 (EDT) Subject: API questions about Nagios In-Reply-To: <20051018025138.43459.qmail@web52102.mail.yahoo.com> References: <20051018025138.43459.qmail@web52102.mail.yahoo.com> Message-ID: Better ask the question on a snort mailing list. On Mon, 17 Oct 2005, Zhan Zhaohua wrote: > Hi all, > I'm a new guy of nagios, I have some API questions > about nagios, would you please to help me. > > 1.I know the Snort is writed in C. Does it > have C/C++ API? > > 2. Because I will use it at many way so we want to > know whether Snort support Java API, or the other > language? > > Thank you. > > yours Zhan > > > > > > __________________________________ > Yahoo! Mail - PC Magazine Editors' Choice 2005 > http://mail.yahoo.com > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, discussions, > and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From webpurchasing at yahoo.com Tue Oct 18 05:01:56 2005 From: webpurchasing at yahoo.com (Zhan Zhaohua) Date: Mon, 17 Oct 2005 20:01:56 -0700 (PDT) Subject: Sorry for mistake : API questions about Nagios In-Reply-To: <20051018025138.43459.qmail@web52102.mail.yahoo.com> References: <20051018025138.43459.qmail@web52102.mail.yahoo.com> Message-ID: <20051018030156.20650.qmail@web52106.mail.yahoo.com> Sorry, here is a mistake, the correct question is: 1.I know the Nagios is writed in C. Does it have C/C++ API? 2. Because I will use it at many way so we want to know whether Nagios support Java API, or the other language? Thank you. yours Zhan --- Zhan Zhaohua wrote: > Hi all, > I'm a new guy of nagios, I have some API > questions > about nagios, would you please to help me. > > 1.I know the Snort is writed in C. Does it > have C/C++ API? > > 2. Because I will use it at many way so we want to > know whether Snort support Java API, or the other > language? > > Thank you. > > yours Zhan > > > > > > __________________________________ > Yahoo! Mail - PC Magazine Editors' Choice 2005 > http://mail.yahoo.com > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, > downloads, discussions, > and more. > http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version > (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being > sent to /dev/null > __________________________________ Yahoo! Mail - PC Magazine Editors' Choice 2005 http://mail.yahoo.com ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jhmartin at toger.us Tue Oct 18 05:04:58 2005 From: jhmartin at toger.us (Jason Martin) Date: Mon, 17 Oct 2005 23:04:58 -0400 Subject: API questions about Nagios In-Reply-To: <20051018025138.43459.qmail@web52102.mail.yahoo.com> References: <20051018025138.43459.qmail@web52102.mail.yahoo.com> Message-ID: <20051018030458.GF24395@mal.members.linode.com> On Mon, Oct 17, 2005 at 07:51:37PM -0700, Zhan Zhaohua wrote: > Hi all, > I'm a new guy of nagios, I have some API questions > about nagios, would you please to help me. > > 1.I know the Snort is writed in C. Does it > have C/C++ API? Well, this is Nagios, not Snort. Assuming you meant Nagios, no, there isn't any sort of C/C++/Java API. Nagios uses the standard UNIX API, which is to say executing commands w/command line arguments as the 'input' and reading STDOUT and the return code as the 'output'. Nagios also reads text commands through a named pipe. The commands it executes can be in C/C++/ObjC/Fortran/Java (may be slow due to instantiating the jvm every time)/Perl (same kind of problem)/Python (not so bad). The reporting aspect does have a C API, but it is not well documented yet. It is called the 'event broker'. However it is primarily a write-only mechanism. -Jason Martin -- This message is PGP/MIME signed. -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: application/pgp-signature Size: 211 bytes Desc: not available URL: From webpurchasing at yahoo.com Tue Oct 18 05:14:14 2005 From: webpurchasing at yahoo.com (Zhan Zhaohua) Date: Mon, 17 Oct 2005 20:14:14 -0700 (PDT) Subject: API questions about Nagios In-Reply-To: <20051018030458.GF24395@mal.members.linode.com> References: <20051018030458.GF24395@mal.members.linode.com> Message-ID: <20051018031414.27415.qmail@web52105.mail.yahoo.com> Thank you for the answer, Martin I got it. --- Jason Martin wrote: > On Mon, Oct 17, 2005 at 07:51:37PM -0700, Zhan > Zhaohua wrote: > > Hi all, > > I'm a new guy of nagios, I have some API > questions > > about nagios, would you please to help me. > > > > 1.I know the Snort is writed in C. Does it > > have C/C++ API? > Well, this is Nagios, not Snort. Assuming you meant > Nagios, no, there > isn't any sort of C/C++/Java API. Nagios uses the > standard UNIX > API, which is to say executing commands w/command > line arguments > as the 'input' and reading STDOUT and the return > code as the > 'output'. Nagios also reads text commands through a > named pipe. > > The commands it executes can be in > C/C++/ObjC/Fortran/Java > (may be slow due to instantiating the jvm every > time)/Perl (same > kind of problem)/Python (not so bad). > > The reporting aspect does have a C API, but it is > not well > documented yet. It is called the 'event broker'. > However it is > primarily a write-only mechanism. > > -Jason Martin > -- > This message is PGP/MIME signed. > __________________________________ Yahoo! Mail - PC Magazine Editors' Choice 2005 http://mail.yahoo.com ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ian.marlier at studentuniverse.com Tue Oct 18 05:16:11 2005 From: ian.marlier at studentuniverse.com (Ian Marlier) Date: Mon, 17 Oct 2005 23:16:11 -0400 Subject: Checking a service on all hosts _except_... Message-ID: Hey, all -- This is one of those questions that I'm sure many people have asked before, but I haven't been able to find the answer if/when it was given...so sorry if I'm making someone repeat him/herself.... Anyway. I've got an environment with about 65 linux servers, and about 5 windows servers. There's a base set of 6 or 7 services that I want to monitor on all of them: connectivity, ssh, cron, and a couple of others. I'm wondering whether it's possible to define a hostgroup (say, "windows-servers") with the non-linux boxes listed, and then define a service check for all hosts except those. Something like: define service { name SSH host_name * except_hostgroup windows-servers check_command check_ssh . . . } So, anyone have any thoughts? C'mon, make my life better ;-) Thanks! - Ian ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From holger at CIS.FU-Berlin.DE Tue Oct 18 05:23:11 2005 From: holger at CIS.FU-Berlin.DE (Holger Weiss) Date: Tue, 18 Oct 2005 05:23:11 +0200 Subject: API questions about Nagios In-Reply-To: <20051018030458.GF24395@mal.members.linode.com> References: <20051018025138.43459.qmail@web52102.mail.yahoo.com> <20051018030458.GF24395@mal.members.linode.com> Message-ID: <20051018032311.GD12590549@CIS.FU-Berlin.DE> * Jason Martin [2005-10-17 23:04]: > The commands it executes can be in C/C++/ObjC/Fortran/Java > (may be slow due to instantiating the jvm every time)/Perl (same > kind of problem)/Python (not so bad). Though for Perl, the embedded interpreter could be used and the compiled scripts can be cached by Nagios, so this might be the most performant alternative if you don't want to write your plugins in C. See: http://nagios.sourceforge.net/docs/2_0/embeddedperl.html Holger -- PGP fingerprint: F1F0 9071 8084 A426 DD59 9839 59D3 F3A1 B8B5 D3DE ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ian.marlier at studentuniverse.com Tue Oct 18 05:31:28 2005 From: ian.marlier at studentuniverse.com (Ian Marlier) Date: Mon, 17 Oct 2005 23:31:28 -0400 Subject: Sorry for mistake : API questions about Nagios In-Reply-To: <20051018030156.20650.qmail@web52106.mail.yahoo.com> References: <20051018030156.20650.qmail@web52106.mail.yahoo.com> Message-ID: > From: Zhan Zhaohua > Date: Mon, 17 Oct 2005 20:01:56 -0700 (PDT) > To: > Subject: Sorry for mistake : [Nagios-users] API questions about Nagios > > Sorry, here is a mistake, the correct question is: > > 1.I know the Nagios is writed in C. Does it > have C/C++ API? > > 2. Because I will use it at many way so we want to > know whether Nagios support Java API, or the other > language? > > Thank you. > > yours Zhan > I find that bash works wonderfully, myself. - Ian ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ray.lapeyre at itng.com.au Tue Oct 18 08:06:38 2005 From: ray.lapeyre at itng.com.au (Ray La Peyre) Date: Tue, 18 Oct 2005 16:06:38 +1000 Subject: Unable to get to status map Message-ID: Hi all I have successfully setup nagios and everything works well. The only trouble I am having is when I click on the status map (NOT the 3-D Status Map that works fine) I get an: Object not found! The requested URL was not found on this server. The link on the referring page seems to be wrong or outdated. Please inform the author of that page about the error. If you think this is a server error, please contact the webmaster Error 404 I also get the same message when I click on trends and on Alert histogram. As far as I can tell everything else seems to be working. Any suggestions would be quite helpful Thanks ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Tue Oct 18 08:57:20 2005 From: ae at op5.se (Andreas Ericsson) Date: Tue, 18 Oct 2005 08:57:20 +0200 Subject: Checking a service on all hosts _except_... In-Reply-To: References: Message-ID: <43549CD0.2090503@op5.se> Ian Marlier wrote: > Hey, all -- > > This is one of those questions that I'm sure many people have asked before, > but I haven't been able to find the answer if/when it was given...so sorry > if I'm making someone repeat him/herself.... > > Anyway. > > I've got an environment with about 65 linux servers, and about 5 windows > servers. There's a base set of 6 or 7 services that I want to monitor on > all of them: connectivity, ssh, cron, and a couple of others. > > I'm wondering whether it's possible to define a hostgroup (say, > "windows-servers") with the non-linux boxes listed, and then define a > service check for all hosts except those. > > Something like: > > define service { > name SSH > host_name * > except_hostgroup windows-servers > check_command check_ssh > . > . > . > } > > So, anyone have any thoughts? C'mon, make my life better ;-) > define service { service_description SSH host_name * hostgroup_name !windows-servers check_command check_ssh } I think that should work. If it doesn't you can always do it the other way around and create a hostgroup with everything but the windows-servers in them. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Tue Oct 18 09:00:44 2005 From: ae at op5.se (Andreas Ericsson) Date: Tue, 18 Oct 2005 09:00:44 +0200 Subject: Unable to get to status map In-Reply-To: References: Message-ID: <43549D9C.9070101@op5.se> Ray La Peyre wrote: > > Hi all > > I have successfully setup nagios and everything works well. The only trouble > I am having is when I click on the status map (NOT the 3-D Status Map that > works fine) I get an: > > > > Object not found! > > The requested URL was not found on this server. The link on the referring > page seems to be wrong or outdated. Please inform the author of that page > about the error. > > If you think this is a server error, please contact the webmaster > > Error 404 > > > > I also get the same message when I click on trends and on Alert histogram. > > As far as I can tell everything else seems to be working. Any suggestions > would be quite helpful > Install boutell's GD library. > Thanks > > > > ------------------------------------------------------- > This SF.Net email is sponsored by: > Power Architecture Resource Center: Free content, downloads, discussions, > and more. http://solutions.newsforge.com/ibmarch.tmpl > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 ------------------------------------------------------- This SF.Net email is sponsored by: Power Architecture Resource Center: Free content, downloads, discussions, and more. http://solutions.newsforge.com/ibmarch.tmpl _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Tue Oct 18 09:06:43 2005 From: ae at op5.se (Andreas Ericsson) Date: Tue, 18 Oct 2005 09:06:43 +0200 Subject: Sorry for mistake : API questions about Nagios In-Reply-To: <20051018030156.20650.qmail@web52106.mail.yahoo.com> References: <20051018030156.20650.qmail@web52106.mail.yahoo.com> Message-ID: <43549F03.9040303@op5.se> Zhan Zhaohua wrote: > Sorry, here is a mistake, the correct question is: > > 1.I know the Nagios is writed in C. Does it > have C/C++ API? > The core has a C API. You can make a C++ API out of it by adding the proper ifdefs. It isn't documented at all. You might want to read the sources of the nagios-db NEB-module, or have a look at http://oss.op5.se/nagios/sampler.c to see what can be done with it. > 2. Because I will use it at many way so we want to > know whether Nagios support Java API, or the other > language? > It doesn't. > Thank you. > > yours Zhan > > > --- Zhan Zhaohua wrote: > > >>Hi all, >> I'm a new guy of nagios, I have some API >>questions >>about nagios, would you please to help me. >> >>1.I know the Snort is writed in C. Does it >>have C/C++ API? >> >>2. Because I will use it at many way so we want to >>know whether Snort support Java API, or the other >>language? >> >> Thank you. >> >>yours Zhan >> >> >> >> >> >>__________________________________ >>Yahoo! Mail - PC Magazine Editors' Choice 2005 >>http://mail.yahoo.com >> >> >> > > ------------------------------------------------------- > >>This SF.Net email is sponsored by: >>Power Architecture Resource Center: Free content, >>downloads, discussions, >>and more. >>http://solutions.newsforge.com/ibmarch.tmpl >>_______________________________________________ >>Nagios-users mailing list >>Nagios-users at lists.